[P1] Question about the dataset #128

csaiedu · 2025-03-07T10:29:09Z

In the tutorials,
How do you actually go about generating thise model weights?

lsreft = torch.load(f"../results/prod_{model_name}_{layer}_concept16k_lsreft/train/LsReFT_weight.pt")
lsreft_metadata = load_jsonl(f"../results/prod_{model_name}_{layer}_concept16k_lsreft/train/metadata.jsonl")

thanks

The text was updated successfully, but these errors were encountered:

frankaging · 2025-03-08T00:20:48Z

Hi, once you download the dataset from our HuggingFace repo here, you can train them using commands like:

torchrun --nproc_per_node=4 --master_port=30000 axbench/scripts/train.py \
  --config axbench/sweep/wuzhengx/2b/l20/16k_lsreft.yaml \
  --dump_dir axbench/results/prod_2b_l20_concept16k_lsreft \
  --overwrite_data_dir axbench/concept16k/prod_2b_l20_v1/generate \
  --run_name official

the yaml files are all released. see: https://github.com/stanfordnlp/axbench/blob/main/axbench/sweep/wuzhengx/2b/l20/16k_lsreft.yaml.

frankaging changed the title ~~Question about the dataset~~ [P1] Question about the dataset Mar 8, 2025

frankaging self-assigned this Mar 8, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[P1] Question about the dataset #128

[P1] Question about the dataset #128

csaiedu commented Mar 7, 2025

frankaging commented Mar 8, 2025

[P1] Question about the dataset #128

[P1] Question about the dataset #128

Comments

csaiedu commented Mar 7, 2025

frankaging commented Mar 8, 2025