DatPySci
/

DeepSeek-Qwen-1.5B-GRPO

Model card Files Files and versions Community

DeepSeek-Qwen-1.5B-GRPO

Ctrl+K

Ctrl+K

1 contributor

History: 2 commits

DatPySci's picture

Training in progress, step 20

f087c0b verified 13 days ago

.gitattributes

1.57 kB

Training in progress, step 20 13 days ago
config.json

704 Bytes

Training in progress, step 20 13 days ago
model.safetensors

3.55 GB
LFS

Training in progress, step 20 13 days ago
special_tokens_map.json

485 Bytes

Training in progress, step 20 13 days ago
tokenizer.json

11.4 MB
LFS

Training in progress, step 20 13 days ago
tokenizer_config.json

6.77 kB

Training in progress, step 20 13 days ago
training_args.bin
Detected Pickle imports (14)
- "accelerate.state.PartialState",
- "transformers.trainer_utils.HubStrategy",
- "accelerate.utils.dataclasses.DeepSpeedPlugin",
- "transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig",
- "transformers.trainer_utils.SaveStrategy",
- "transformers.trainer_utils.IntervalStrategy",
- "transformers.training_args.OptimizerNames",
- "torch.bfloat16",
- "transformers.integrations.deepspeed.HfDeepSpeedConfig",
- "transformers.trainer_utils.SchedulerType",
- "open_r1.configs.GRPOConfig",
- "accelerate.utils.dataclasses.DistributedType",
- "torch.device",
- "transformers.trainer_pt_utils.AcceleratorConfig"
How to fix it?
8.12 kB
LFS

Training in progress, step 20 13 days ago