Skip to content
This repository was archived by the owner on Oct 16, 2023. It is now read-only.
This repository was archived by the owner on Oct 16, 2023. It is now read-only.

Run error on run_clm_no_trainer_colossalai_new #14

@MikeChenfu

Description

@MikeChenfu

Hello, I tried running run_clm_no_trainer_colossalai_new.sh and get the error about Gemini Manager missing process_group. Apart from that, I saw the script doesn't use colossalai_zero.py to define warmup_non_model_data_ratio and gpu_margin_mem_ratio. Appreciate it if you have any suggestion about it.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions