tft.AnalyzeAndTransformDataset
andtft.TransformDataset
can now outputpyarrow.RecordBatch
es. This is controlled by a parameteroutput_record_batches
which is set toFalse
by default.
- Added
tft.make_and_track_object
to load and tracktf.Trackable
objects created inside thepreprocessing_fn
(for example, tf.hub models). This API should only be used whenforce_tf_compat_v1=False
and TF2 behavior is enabled. - The
decode
method of the available coders (tft.coders.CsvCoder
andtft.coders.ExampleProtoCoder
) have been removed. These were deprecated in the 0.25 release. Canned TFXIO implementations should be used to read and decode data instead. - Previously deprecated APIs were removed:
tft.uniques
(replaced bytft.vocabulary
),tft.string_to_int
(replaced bytft.compute_and_apply_vocabulary
),tft.apply_vocab
(replaced bytft.apply_vocabulary
), andtft.apply_function
(identity function). - Removed the
always_return_num_quantiles
arg oftft.quantiles
andtft.bucketize
which was deprecated in version 0.26. - Added support for
count_params
method to theTransformFeaturesLayer
. This will allow to call Keras Model'ssummary()
method if the model is using theTransformFeaturesLayer
. - Depends on
absl-py>=0.9,<0.13
. - Depends on
tensorflow-metadata>=0.29.0,<0.30.0
. - Depends on
tfx-bsl>=0.29.0,<0.30.0
.
- Existing caches (for all analyzers) are automatically invalidated.
- N/A