Skip to content

Enhance datasets metadata to include lastUpdateTime field #837

@swaptr

Description

@swaptr

Currently, the kaggle datasets metadata <dataset> command outputs limited dataset metadata. For example, running:

kaggle datasets metadata mlg-ulb/creditcardfraud

returns JSON that includes fields such as datasetId, title, description, usabilityRating, totalViews, totalDownloads, etc.

However, it does not include the lastUpdateTime field, even though these are available from the web API endpoint:

https://www.kaggle.com/api/i/datasets.DatasetDetailService/GetDatasetBasics

That endpoint returns richer metadata, including:

  • lastUpdateTime
  • creatorDisplayName

It would be extremely useful if the CLI’s datasets metadata command exposed at least the lastUpdateTime field, so that workflows depending on dataset freshness can be automated without having to call the API separately.

This would make it much easier to build workflows around dataset versioning and updates directly from the CLI.

Thanks for considering!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions