TFDV 0.22.1 Release

dhruvesh09 released this 24 Jun 23:31

Major Features and Improvements

  • Statistics generation can now handle arbitrarily nested Arrow
    List/LargeList types. Statistics about the presence and valency of list
    elements are computed at each nesting level and stored in a newly added
    field, valency_and_presence_stats, in CommonStatistics.
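
To make "presence and valency at each nesting level" concrete, here is a minimal pure-Python sketch of the idea. It is not TFDV's implementation and does not use Arrow: it assumes a column is a plain Python list in which None marks a missing list, and the names LevelStats and presence_and_valency are hypothetical, chosen only for this illustration.

```python
from dataclasses import dataclass
from typing import Any, List, Optional


@dataclass
class LevelStats:
    """Presence and valency counters for one nesting level (illustrative only)."""
    num_non_missing: int = 0
    num_missing: int = 0
    min_num_values: Optional[int] = None
    max_num_values: int = 0
    tot_num_values: int = 0


def presence_and_valency(column: List[Any]) -> List[LevelStats]:
    """Return one LevelStats per nesting level of a nested-list column.

    Assumes every non-None entry at a given level is a list (well-typed input).
    """
    levels: List[LevelStats] = []
    current = list(column)
    # Descend as long as the current level still contains lists.
    while any(isinstance(v, list) for v in current):
        stats = LevelStats()
        next_level: List[Any] = []
        for v in current:
            if v is None:
                # A missing list counts toward presence stats only.
                stats.num_missing += 1
                continue
            stats.num_non_missing += 1
            n = len(v)
            stats.min_num_values = (
                n if stats.min_num_values is None else min(stats.min_num_values, n)
            )
            stats.max_num_values = max(stats.max_num_values, n)
            stats.tot_num_values += n
            # The elements of this list become entries at the next level.
            next_level.extend(v)
        levels.append(stats)
        current = next_level
    return levels


# Three examples; the second is missing, the third contains an empty inner list.
column = [[[1, 2], [3]], None, [[], [4, 5, 6]]]
for depth, level in enumerate(presence_and_valency(column)):
    print(depth, level)
```

The outer level here sees three entries (one missing), while the inner level sees the four inner lists, which is why the counters are kept separately per level.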

Bug Fixes and Other Changes

  • Trigger DATASET_HIGH_NUM_EXAMPLES when a dataset has more examples than
    the specified limit.
  • Fix bug in display_anomalies that prevented dataset-level anomalies from
    being displayed.
  • Trigger an anomaly when the number of unique values of a feature falls
    outside the specified minimum/maximum.
  • Depends on pandas>=0.24,<2.
  • Depends on tensorflow-metadata>=0.22.2,<0.23.0.
  • Depends on tfx-bsl>=0.22.1,<0.23.0.
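
The two new anomaly triggers listed above amount to simple bound checks. The following pure-Python sketch illustrates that logic only; it does not call TFDV, and the function names, parameters, and message strings are hypothetical. Only the anomaly name DATASET_HIGH_NUM_EXAMPLES comes from these notes.

```python
from typing import Hashable, Iterable, List, Optional


def unique_count_anomalies(
    values: Iterable[Hashable],
    min_unique: Optional[int] = None,
    max_unique: Optional[int] = None,
) -> List[str]:
    """Report when the number of distinct values falls outside [min_unique, max_unique]."""
    num_unique = len(set(values))
    anomalies: List[str] = []
    if min_unique is not None and num_unique < min_unique:
        anomalies.append(f"too few unique values: {num_unique} < {min_unique}")
    if max_unique is not None and num_unique > max_unique:
        anomalies.append(f"too many unique values: {num_unique} > {max_unique}")
    return anomalies


def dataset_anomalies(num_examples: int, max_examples: Optional[int] = None) -> List[str]:
    """Report DATASET_HIGH_NUM_EXAMPLES when the example count exceeds the limit."""
    if max_examples is not None and num_examples > max_examples:
        return ["DATASET_HIGH_NUM_EXAMPLES"]
    return []


print(unique_count_anomalies(["a", "b", "a"], min_unique=3))
print(dataset_anomalies(num_examples=1500, max_examples=1000))
```

In this sketch a limit of None means "no constraint", so a check fires only when a bound has actually been specified.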

Known Issues

Breaking Changes

Deprecations