Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Inference code for 2 models evaluated in the paper #255

Open
bigwatergun opened this issue Dec 3, 2024 · 3 comments
Open

Inference code for 2 models evaluated in the paper #255

bigwatergun opened this issue Dec 3, 2024 · 3 comments

Comments

@bigwatergun
Copy link

Hello, thank you for releasing such great codebase. In tutel/fairseq_moe I can see you used a moe model in fairseq repository for training. However, I didn't find the corresponding inference code. Also, I noticed that SwinV2-MoE was evaluated in the paper. Could you provide the inference code for these 2 models(under Tutel framework)?

@ghostplant
Copy link
Contributor

Hi, SwinMoE has a MoE-pretrained version following this instructions: https://github.com/microsoft/Swin-Transformer/blob/main/get_started.md#evaluating-swin-moe

If you have different number of GPUs to fine-tune / interference based on the pretrained, you can further use this tool to convert the checkpointing files: https://github.com/microsoft/Tutel/blob/main/doc/CHECKPOINT.md#swin-transmformer-maintains-a-special-checkpoint-format-how-to-convert-swin-transformer-checkpoint-files-for-different-distributed-world-sizes

For Fairseq, this is the step to use it with Fairseq: https://github.com/microsoft/Tutel/tree/main/tutel/examples/fairseq_moe

@bigwatergun
Copy link
Author

Thank you for your quick reply. As far as I know, https://github.com/microsoft/Tutel/tree/main/tutel/examples/fairseq_moe only contains the training code for Fairseq. Do you have the inference code for me to test easily?

@ghostplant
Copy link
Contributor

Inference settings need to follow Fairseq's doc, like options for --forward-only

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants