transformers
fix Dtensor and tensor mismatch for Col/RowRep
#42924
Merged

Commits
  • begin Moe test tensor parallel
    3outeille committed 44 days ago
  • create tiny moe model + fix test tensor parallel Moe
    3outeille committed 44 days ago
  • create tiny moe model + fix test tensor parallel Moe
    3outeille committed 44 days ago
  • Merge branch 'main' into v4.57.1-test_tensor_parallel
    3outeille committed 30 days ago
  • Merge branch 'main' into v4.57.1-test_tensor_parallel
    3outeille committed 16 days ago
  • fix backward pass test in tensor parallel for Dense model (#42811)
    3outeille committed 16 days ago
  • Merge branch 'main' into v5-test_tensor_parallel_moe
    3outeille committed 12 days ago
  • use mixtral instead for testing
    3outeille committed 11 days ago
  • fix dtensor and tensor mismatch
    3outeille committed 11 days ago
  • linting
    3outeille committed 11 days ago
  • checkout test tensor parallel to be like main
    3outeille committed 11 days ago
  • Merge branch 'main' into fix_dtensor_tensor_moe_mismatch
    3outeille committed 11 days ago
  • Merge branch 'main' into fix_dtensor_tensor_moe_mismatch
    3outeille committed 11 days ago
  • avoid hack and create class instead
    3outeille committed 11 days ago
  • fix loading ep
    3outeille committed 10 days ago
  • add moe test
    3outeille committed 10 days ago
  • now EP inference works again but pass still fails
    3outeille committed 10 days ago
  • Add ColwiseParallelReplicate and RowwiseParallelReplicate classes for replicated layouts
    3outeille committed 10 days ago
  • clean
    3outeille committed 10 days ago
  • Merge branch 'main' into tp_replicate_interface
    3outeille committed 10 days ago
  • eaza
    3outeille committed 10 days ago
  • Merge branch 'tp_replicate_interface' of https://github.com/huggingface/transformers into tp_replicate_interface
    3outeille committed 10 days ago
  • aeaeaea
    3outeille committed 10 days ago
  • eaeaa
    3outeille committed 10 days ago
  • linting
    3outeille committed 10 days ago