Skip to content

Releases: alibaba/TorchEasyRec

v0.7.0

10 Feb 07:54
a5dc775
Compare
Choose a tag to compare

Major Features and Improvements

Train/Eval/Export

Model

  • Optimize TDM gen tree speed #33
  • TDM Support string id #72
  • Rank and Match models support sample weight #50 #57 #63 #65
  • Add zero collision hash embedding #60
  • Add intervention methods for multi-target learning #49
  • Add Autodis and MLP embedding for raw features #73 #75
  • Add task space for multi-target learning loss #82
  • Add dual augmented two-tower match model #83
  • Add HSTU (WIP) #55

Feature

  • pyfg support CPU without avx512 #20
  • ExprFeature support l2_norm|dot|euclid_dist #35
  • Add fg bucketize only mode & refactor fg_encoded to fg_mode #62
  • Make default bucketize value configurable #94
  • Support multi-value sequence #96
  • Support vocab file #97

Dataset

  • Enhance stability for credential of OdpsDataset #45
  • Add complex type and credential support for sampler when use odps dataset #52
  • Support CsvDataset with null columns #56
  • Negative sampler support string id #70

Config

  • Support easyrec config convert to tzrec config #37 #39 #51 #90

Upgrade

  • Release official dlc image #26
  • Upgrade pytorch to v2.6 torchrec to v1.1.0 #99

Note

For TorchEasyRec 0.7.x, you should use Docker image version 0.7.

  • For the GPU version (CUDA 12.4):
    • mybigpai-public-registry.cn-beijing.cr.aliyuncs.com/easyrec/tzrec-devel:0.7-cu124
    • PyTorch: v2.6 CUDA: v12.4 FBGEMM: v1.1.0 TorchRec: v1.1.0 Python: v3.11
  • For the CPU version:
    • mybigpai-public-registry.cn-beijing.cr.aliyuncs.com/easyrec/tzrec-devel:0.7-cpu
    • PyTorch: v2.6 FBGEMM: v1.1.0 TorchRec: v1.1.0 Python: v3.11

Bug Fixes and Other Changes

New Contributors

Full Changelog: v0.6.0...v0.7.0

v0.6.0

30 Oct 03:10
0d34ca6
Compare
Choose a tag to compare

We are excited to announce the release of TorchEasyRec 0.6.0, the first public release for TorchEasyRec.

Major Features and Improvements

  • High-performance training, evaluation, and prediction with GPUs.
  • Supported a variety of input data types, including MaxCompute Table, OSS files, CSV files, Parquet files doc here.
  • Supported a variety of feature types, including IdFeature, RawFeature, ComboFeature, LookupFeature, MatchFeature, ExprFeature, OverlapFeature, TokenizeFeature, SequenceIdFeature, SequenceRawFeature, and SequenceFeature. The feature generation operations is also efficient and robust doc here.
  • Supported a variety of models, including DSSM, TDM, DeepFM, MultiTower, DIN, MMoE, DBMTL, PLE. It is also easy to implement customized models.
  • Supported a variety of loss, including binary_cross_entropy, softmax_cross_entropy, l2_loss, jrc_loss doc here.
  • Supported VariationalDropout feature selection.
  • Easy to deploy a TorchEasyRec model as a high-performance inference service using the TorchEasyRec Processor.

Bug Fixes and Other Changes

  • [bugfix] fix train_eval may hang when use OdpsDataset and set is_orderby_partition=true by @tiankongdeguiji in
  • [bugfix] fix offline predict input tile model with sequence by @tiankongdeguiji in #14

Note

For TorchEasyRec 0.6.x, you should use Docker image version 0.6.

  • For the GPU version (CUDA 12.1):
    • mybigpai-public-registry.cn-beijing.cr.aliyuncs.com/easyrec/tzrec-devel:0.6-cu121
  • For the CPU version:
    • mybigpai-public-registry.cn-beijing.cr.aliyuncs.com/easyrec/tzrec-devel:0.6-cpu

New Contributors

Full Changelog: https://github.com/alibaba/TorchEasyRec/commits/v0.6.0