Go
Home
Pricing
FAQ
Install
Home
Pricing
FAQ
Install
Login
via GitHub
intel/auto-round
Pull Requests
Commits
llmc
AutoAdamRound_bugfix
Chinesization
ZaneMark-patch-1
ZaneMark-patch-3
acp
actvation_quant
add_task_args_for_lmeval
ar_agent
ark_zp
autoround_support_qbits_backend
awq_algorithm
bf16_scale
chore/claude-init
copilot/add-xpu-moe-decode-implementation
copilot/fix-corner-case-in-auto-round
copilot/fix-deprecated-fp-layers-handling
copilot/fix-docstrings-in-python-files
copilot/fix-issue-with-auto-rounding
copilot/fix-llm-type-70b-bits-setting
copilot/fix-typeerror-wrapped-fn
copilot/improve-pr-template-type-of-change
copilot/investigate-quantization-group-and-ffn
copilot/replace-getset-module-torch-api
copilot/sageattention
copilot/speedup-fp8-linear-convert
copilot/speedup-fp8-linear-convert-again
copilot/speedup-fp8-linear-convert-another-one
copilot/sub-pr-1237-again
copilot/sub-pr-1237
copilot/sub-pr-1324
copilot/sub-pr-1522-again
copilot/sub-pr-1532
copilot/update-user-settings-page
copilot/vscode-mo3shmf8-8qa6
ddp
debug_time_cost
debug/usable_rotation
debug-hang
debug-nvfp4
deepseekv3
ds-qwen
ds-v5
ds-v32
dsv4
enable_glm4_moe_lite_quantization
enable_llama4_int8_baseline
enable_llama4_quant
enable_mxfp_exporting
feat/activation-checkpointing
feat/autoround-quarot
fix_bug0627
fix_bug_0722
fix_bug_1105
fix_compile
fix_disable_act_dynamic_usage_in_mxfp.py
fix_dq
fix/fp-layers-deprecation-mapping
fix_gemma3_issue
fix_gguf_fp8
fix_gptqmodel
fix_low_cpu
fix_rotation
fix_save_quantized_func_nvfp_checker
fix_0107
fix_0109
fix_0113
fix-attn-mask-b60
fix-ds
fix-flashinfer
fix-gpt-oss
fix-hpu
fix-to-meta-assertion-error-1499
fixbug_0717
fp4_v2
fp4_v3
fp8-cache
fp8-cache-based-export
fp8-static-quant-patch
fp8_export_backup_stable
fp8_export_for_test
good-flux
hengguo/auto_round_rtn_cli
hengguo/fix_cuda_ut
hengguo/fix_gguf_ds
hengguo/fix_merge_error_0514
hengguo/fix_qwen_bug
hengguo/quantizers
hengguo/refactor_init
hengguo/refactor_quant_step1
hengguo/smoothquant
hengguo/w4afp8_sim
henguo/update_so
hpu_only_kg
hpu_only_pkg
hpu/only/v1
hpu-limit-tran
kaihui/torch_dtype
lazy-model-replace
leq_opub
lib/pre-4.4.0
llama/new/9-610
llama/new/9
llm-main
llmc
llmc-backup
llmc-test
lm-head-quant
load-kv
load-w8a8-replace-mod
load-w8a8
lvl/autoscheme_ram_opt
lvl/cpu_ram_optimization
lvl/fix_no_init_weights
lvl/fix_omni_long_audio
lvl/fix_vlm_large_ram_issue
lvl/fix_vlm_large_vram
lvl/general_moe_replacement
lvl/support_bagel_mot
lvl/support_fp8_with_ark
lvl/support_hunyuan_image
lvl/support_turbo_quant
lvl/support_wan2.2
lyt/numpy_fix
lyt/omni
main
marlin_modify
mengni/bug_fix
mengni/expert
mengni/mengni/block_wise
mengni/new_vllm
mengni/vllm
mengni/vlm
mengniwang95-patch-1
more-ar-ext
mxfp8
origin/block_wise
patch/for/ao/581/stable
patch-for-ao-2
pr1775-followup
pre-release/internal-inc/w4a8
quant-attn-hpu
quant-attn-hpu-o-scale
quant-attn-hpu-pr
quant-llama
quarot-llama
qwen3-vl
qwen3_vl_moe
qwen-split
qwen-v5
refine_device
refine-doc-table
replace-lm-head
revert_order
revert-318-fix/hpu/check
revert-1231-set_disable_opt_rtn_default_2_none
revert-1562-suyue/ut
save_memory
set_disable_opt_rtn_default_2_none
static_quant
support_audio_model_quantization
support_gemma4
suyue/ark-ci
suyue/model
test-git
try_new_optimizer
try_to_fix_hadamard_regression
update_fp_compile
update_0522
update_0819
upstream-ao
use-ep
ut-time
v0.7.0rc
v0.7.1rc
v0.8.0rc
v0.8.0rc2
v0.9.1rc
v0.9.2-release
v0.9.2rc
v0.9.3rc
v0.9.4rc
v0.9.5rc
v0.9.6rc
v0.9.7rc
v0.10.0rc
v0.10.1rc
v0.10.2rc
v0.10.3rc
v0.12.0rc
v0.12.1rc
v0.12.2rc
v0.12.3rc
w4a4_int_quaro
w4int8dynamic
wangchang/fix_oom
wangchang/vllm
wenhuach21-patch-1
wfp8-afp8-bk
xin3he-patch-1
xinhe/3-20c
xinhe/3-27c
xinhe/3-27d
xinhe/4-7
xinhe/4-15
xinhe/5-13
xinhe/5-14
xuehao/cuda-ci
xuehao/test_gptq
zhenzhong/arformat_fp8
zhenzhong/toolkit_release
add hints
yiliu30
committed
187 days ago
6800cfe0
fix multiple devices issue in Compressor and AutoScheme (#1007)
wenhuach21
committed
188 days ago
Verified
344b40cf
update
yiliu30
committed
188 days ago
b01c91ff
Fix non auto device map (#1005)
WeiweiZhang1
committed
189 days ago
Verified
cd54e705
fix multiple devices map issue in calibration (#1003)
wenhuach21
committed
189 days ago
Verified
ead6f296
Fix diffusion multi-device ut issue (#1002)
mengniwang95
committed
189 days ago
Verified
f6745fd9
Support for immediate saving to reduce ram usage (#965)
Kaihui-intel
committed
189 days ago
Verified
daeb3bb7
Refine exllamav2 ut (#1001)
WeiweiZhang1
committed
190 days ago
Verified
758b2390
refine md tables (#994)
WeiweiZhang1
committed
190 days ago
Verified
c0aa30ab
fix mllm device_map ut (#1000)
Kaihui-intel
committed
190 days ago
Verified
4aeca3ae
fix cuda ut bug (#999)
n1ck-guo
committed
190 days ago
Verified
d1bf7e8d
add batch dim
yiliu30
committed
190 days ago
e2e2d42b
Reduce peak gpu memory usage and support moe estimation (#981)
xin3he
committed
190 days ago
Verified
84e9a776
fix lm head bug and rm clear_mem_reach_threhold (#997)
wenhuach21
committed
191 days ago
Verified
284eecdd
[CI] Update python to 3.12 and torch to 2.8.0 (#741)
XuehaoSun
committed
191 days ago
Verified
268f7dda
update
yiliu30
committed
191 days ago
0354c2ba
remove time
yiliu30
committed
191 days ago
b992c319
fix
root
committed
191 days ago
2bd3c4b1
fix bug of cannot create adam compressor (#992)
n1ck-guo
committed
192 days ago
Verified
05cab090
fix offloaf
root
committed
192 days ago
553ee5c8
support model_dtype and fix bug of scheme contains quotes, mllm eval (#985)
n1ck-guo
committed
192 days ago
Verified
cd308ddb
add support for Magistral-Small (#980)
n1ck-guo
committed
192 days ago
Verified
dc6a6d3f
fix guff scheme and device_map bug (#969)
n1ck-guo
committed
192 days ago
Verified
be146c7f
update bits (#986)
xin3he
committed
192 days ago
Verified
4afbe0ab
refactor
root
committed
193 days ago
a20f9df7
cli support for positional arguments model (#979)
n1ck-guo
committed
193 days ago
Verified
b9245b58
refine readme (#978)
wenhuach21
committed
193 days ago
Verified
3f7bdace
refactor
root
committed
194 days ago
7a1716e0
Merge branch 'llmc' of https://github.com/intel/auto-round into llmc
yiliu30
committed
194 days ago
2f96c13f
refine code
yiliu30
committed
194 days ago
60a00232
Newer
Older