Commits
  • initial commit of mlp_speculator support (draft)
    Narsil committed 2 years ago
  • added a bunch of cleanup based on comments in PR; removed conditionals from LayerNormParameterized and renamed to MLPSpeculatorLayerNorm; now using modules for tensor-parallel (this is work in progress - looking into if this is right approach); fixed issue with getting medusa model; fixed for more efficient loading
    Narsil committed 2 years ago
  • Hardcode a few stuff to make it work.
    Narsil committed 2 years ago
  • [REWRITTEN] added a bunch of cleanup based on comments in PR; removed conditionals from LayerNormParameterized and renamed to MLPSpeculatorLayerNorm; now using modules for tensor-parallel (this is work in progress - looking into if this is right approach); fixed issue with getting medusa model; fixed for more efficient loading
    Narsil committed 2 years ago
  • Fixed speculator.
    Narsil committed 2 years ago
  • Cleanup.
    Narsil committed 2 years ago
  • Removed a bunch of hardcodes.
    Narsil committed 2 years ago
  • Rebase after refactor.
    Narsil committed 2 years ago
  • Missing update after rebase
    Narsil committed 2 years ago
  • Remove traces of `use_medusa`.
    Narsil committed 2 years ago
  • Backport changes in medusa.
    Narsil committed 2 years ago
  • Remove old code again.
    Narsil committed 2 years ago
  • Model_type location.
    Narsil committed 2 years ago
  • Reload model_type when speculator is found.
    Narsil committed 2 years ago
Loading