Go
Home
Pricing
FAQ
Install
Home
Pricing
FAQ
Install
Login
via GitHub
intel/auto-round
Pull Requests
Commits
llm-main
AutoAdamRound_bugfix
actvation_quant
add_task_args_for_lmeval
autoround_support_qbits_backend
bf16_scale
change_mxfp8
debug_time_cost
debug-nvfp4
deepseekv3
ds-fp8kv
ds-qwen
enable_llama4_int8_baseline
enable_llama4_quant
enable_mxfp_exporting
fast_config
fix_bug0627
fix_bug_0722
fix_bug_1105
fix_dq
fix_gemma3_issue
fix_gguf_fp8
fix_save_quantized_func_nvfp_checker
fix-attn-mask-b60
fix-ds
fix-gpt-oss
fix-hpu
fixbug_0717
fp4_v2
fp8-cache
fp8-cache-based-export
fp8_export_backup_stable
fp8_export_for_test
hengguo/fix_cuda_ut
hengguo/fix_cuda_ut_1224
hengguo/fix_gguf_ds
hengguo/quantizers
hengguo/smoothquant
hengguo/w4afp8_sim
henguo/refactor_format_step2
henguo/update_so
hpu_only_kg
hpu_only_pkg
hpu/only/v1
kaihui/torch_dtype
leq_opub
lib/pre-4.4.0
llama/new/9-610
llama/new/9
llm-main
llmc
llmc-backup
llmc-test
lm-head-quant
load-kv
load-w8a8-replace-mod
load-w8a8
lyt/numpy_fix
lyt/omni
main
marlin_modify
mengni/bug_fix
mengni/expert
mengni/vlm
mengniwang95-patch-1
mlperf-awq
more-ar-ext
mxfp8
new_teq
patch/for/ao/581/stable
patch-for-ao-2
pre-release/internal-inc/w4a8
quant-attn-hpu
quant-attn-hpu-o-scale
quant-attn-hpu-pr
quant-llama
qwen3-vl
qwen3_vl_moe
qwen-split
refactor-replace
refine-doc-table
replace-lm-head
revert_order
revert-318-fix/hpu/check
save_memory
static_quant
suyue/ci
suyue/fix
suyue/version
test-git
tmp
try_new_optimizer
update_fp_compile
update_0522
update_0819
upstream-ao
use-ep
ut_refactor
ut-time
v0.7.0rc
v0.7.1rc
v0.8.0rc
v0.8.0rc2
v0.9.1rc
v0.9.2-release
v0.9.2rc
v0.9.3rc
w4a4_int_quaro
w4int8dynamic
wfp8-afp8-bk
xinhe/UT
xinhe/avg_bits
xinhe/device_bug
xinhe/eval
xinhe/exp
xinhe/fix_pp
xinhe/fix-release
xinhe/hp_level
xinhe/llama_tmp
xinhe/mix-precision
xinhe/mp
xinhe/new
xinhe/nvfp4
xinhe/release_bug
xinhe/target_loss_ratio
xinhe/tmp
xinhe/whisper
xuehao/fix_install
xuehao/update_version
revert
yiliu30
committed
44 days ago
70f54fe0
clean
yiliu30
committed
44 days ago
da6fe165
fix
yiliu30
committed
44 days ago
329f67de
clean
yiliu30
committed
44 days ago
e566783d
add auto offload back
yiliu30
committed
44 days ago
b9dd50f1
merge main
yiliu30
committed
45 days ago
cb10a466
refine
yiliu30
committed
45 days ago
4e72f6f0
clean
yiliu30
committed
45 days ago
fe53fb3a
add hints
yiliu30
committed
45 days ago
6800cfe0
fix multiple devices issue in Compressor and AutoScheme (#1007)
wenhuach21
committed
46 days ago
Verified
344b40cf
update
yiliu30
committed
46 days ago
b01c91ff
Fix non auto device map (#1005)
WeiweiZhang1
committed
47 days ago
Verified
cd54e705
fix multiple devices map issue in calibration (#1003)
wenhuach21
committed
47 days ago
Verified
ead6f296
Fix diffusion multi-device ut issue (#1002)
mengniwang95
committed
47 days ago
Verified
f6745fd9
Support for immediate saving to reduce ram usage (#965)
Kaihui-intel
committed
47 days ago
Verified
daeb3bb7
Refine exllamav2 ut (#1001)
WeiweiZhang1
committed
48 days ago
Verified
758b2390
refine md tables (#994)
WeiweiZhang1
committed
48 days ago
Verified
c0aa30ab
fix mllm device_map ut (#1000)
Kaihui-intel
committed
48 days ago
Verified
4aeca3ae
fix cuda ut bug (#999)
n1ck-guo
committed
48 days ago
Verified
d1bf7e8d
add batch dim
yiliu30
committed
48 days ago
e2e2d42b
Reduce peak gpu memory usage and support moe estimation (#981)
xin3he
committed
48 days ago
Verified
84e9a776
fix lm head bug and rm clear_mem_reach_threhold (#997)
wenhuach21
committed
48 days ago
Verified
284eecdd
[CI] Update python to 3.12 and torch to 2.8.0 (#741)
XuehaoSun
committed
49 days ago
Verified
268f7dda
update
yiliu30
committed
49 days ago
0354c2ba
remove time
yiliu30
committed
49 days ago
b992c319
fix
root
committed
49 days ago
2bd3c4b1
fix bug of cannot create adam compressor (#992)
n1ck-guo
committed
49 days ago
Verified
05cab090
fix offloaf
root
committed
50 days ago
553ee5c8
support model_dtype and fix bug of scheme contains quotes, mllm eval (#985)
n1ck-guo
committed
50 days ago
Verified
cd308ddb
add support for Magistral-Small (#980)
n1ck-guo
committed
50 days ago
Verified
dc6a6d3f
Older