intel/auto-round
Commits
fast_config
5632f001  fix one issue (wenhuach21, 1 year ago)
c322bc06  fix conflict (wenhuach21, 1 year ago)
aa0b4d5c  fix conflict (wenhuach21, 1 year ago)
494d9d0e  updated (wenhuach21, 1 year ago)
a9a001d3  fix some issue (wenhuach21, 1 year ago)
b060e12d  add thread limits for packing by following autogptq (wenhuach21, 1 year ago)
a176b829  update config (wenhuach21, 1 year ago)
4d2225eb  tmp change (wenhuach21, 1 year ago)
12c3e5fa  tmp change (wenhuach21, 1 year ago)
2b1448d4  support marlin in auto_round format (#172) (wenhuach21, 1 year ago, Verified)
9bda2430  fix conflict (wenhuach21, 1 year ago)
4142b498  updated (wenhuach21, 1 year ago)
4b105cfd  fix default_value issue of seqlen and nsample (yintong-lu, 1 year ago)
a1d0bbc3  fix some issue (wenhuach21, 1 year ago)
05687b5f  add thread limits for packing by following autogptq (wenhuach21, 1 year ago)
5dc8cb64  update config (wenhuach21, 1 year ago)
1050a14f  tmp change (wenhuach21, 1 year ago)
4c442bcf  tmp change (wenhuach21, 1 year ago)
5947e9c0  revert the gptq format code to fix the regression (#168) (wenhuach21, 1 year ago, Verified)
8d5765ac  fix typos, update overview img (#166) (WeiweiZhang1, 1 year ago, Verified)
f9e7d79e  1 fix a bug in autoround format with the latest transformers 2 rename n_samples n_blocks to nsamples nblocks (#163) (wenhuach21, 1 year ago, Verified)
31c566cc  bugfix (#160) (WeiweiZhang1, 1 year ago, Verified)
77320b0a  fix bug and limit numpy version (#159) (yintong-lu, 1 year ago, Verified)
75e3fde0  support calibration dataset concat (#147) (yintong-lu, 1 year ago, Verified)
77d6a886  remove gpt ppl eval from lm-0.4.2 (#158) (wenhuach21, 1 year ago, Verified)
edcec56e  fix bug at whole block is excluded from quantization (#156) (wenhuach21, 1 year ago, Verified)
9cae103d  auto round quantizer supports gptq kernel (#155) (wenhuach21, 1 year ago, Verified)
c313fa33  fix qbits issue (#153) (wenhuach21, 1 year ago, Verified)
34274fb3  Qbits related log (#151) (zhewang1-intc, 1 year ago, Verified)
dbdc4a39  autoround_support_qbits_backend (#145) (zhewang1-intc, 1 year ago, Verified)