sample
: add time-series sampling options
#2589
Labels
datapusher+
for Datapusher+
DRUF
for Data Resource Upload First workflow
enhancement
New feature or request. Once marked with this label, its in the backlog.
timeseries
time series related
A lot of data hosted in data catalogs are time-series data.
Since we're focusing on just compiling high-quality, high-resolution metadata in the data catalog, we compile summary stats and frequency tables using the "complete" dataset, but only want to host a representative sample in the catalog while pointing to the source where the "complete" dataset is available.
We don't want the catalog to double as a central datastore with its attendant high capacity reqts, so we only store a sample preview.
However, if we just get the first N rows of a time series dataset, it will most likely always be the same as time-series datasets are often sorted.
Add several time-series sampling options to make the sample more dynamic:
The text was updated successfully, but these errors were encountered: