user8797249
user8797249

Reputation: 11

Reading Dataset statistics generated by StatisticsGen

I looked at the TFX guide (https://www.tensorflow.org/tfx/guide), and I was able to run the pipeline. The StatisticsGen component wrote Dataset statistics to a file on disk. How do I visualize the stats stored in the file?

Thank you

Upvotes: 0

Views: 484

Answers (1)

Paul Suganthan
Paul Suganthan

Reputation: 86

The StatisticsGen component writes the statistics protocol buffer in a TFRecord file. You can visualize the statistics in a Jupyter notebook as follows:

import tensorflow_data_validation as tfdv
STATS_OUTPUT_PATH = ''
stats = tfdv.load_statistics(STATS_OUTPUT_PATH)
tfdv.visualize_statistics(stats)

Upvotes: 0

Related Questions