Nemotron 9B × Megatron-Bridge で Mamba-2 含む全層 LoRA を NVIDIA Brev H100 で学習させてみた
https://dev.classmethod.jp/articles/nemotron-9b-megatron-bridge-brev/
If you have a fediverse account, you can quote this note from your own instance. Search https://rss-mstdn.studiofreesia.com/users/dev_classmethod/statuses/116131250094020904 on your instance and quote it. (Note that quoting is not supported in Mastodon.)
DevelopersIO 