intel/auto-round
Commits
lm-head-quant
Merge branch 'main' of https://github.com/intel/auto-round into main
WeiweiZhang1
committed
1 year ago
41b328c2
fix CI trigger branch (#21)
chensuyue
committed
1 year ago
Verified
19e50da4
detect eval_legacy automatically (#20)
wenhuach21
committed
1 year ago
Verified
4f7878da
Support code scan in CI test (#19)
chensuyue
committed
1 year ago
Verified
528252b9
force to fp32 tuning if amp is disabled and align sym quantization (#18)
wenhuach21
committed
1 year ago
Verified
3938d4ad
Refine quntization config (#17)
WeiweiZhang1
committed
1 year ago
Verified
2b821bca
fix typo
wenhuach21
committed
1 year ago
cf15c12e
add gemma-7b acc and shell info
wenhuach21
committed
1 year ago
1f79e54f
fix autogptq exporting issue at group_size=-1
wenhuach21
committed
1 year ago
eb0bb2d1
add shell and acc data for several models, fixed some issues
wenhuach21
committed
1 year ago
4ae7d455
Merge branch 'main' of https://github.com/intel/auto-round into main
WeiweiZhang1
committed
1 year ago
0557ed27
upgrade lm_eval (#14)
WeiweiZhang1
committed
1 year ago
Verified
440771c8
update accuracy section
wenhuach21
committed
1 year ago
3ff47860
fix typo
wenhuach21
committed
1 year ago
b81a5a55
rename export to save quantized (#15)
wenhuach21
committed
1 year ago
Verified
2fb6fd11
update readme (#13)
wenhuach21
committed
1 year ago
Verified
a868c805
upgrade lm_eval
WeiweiZhang1
committed
1 year ago
6ba09c67
fix mixtral moe autogptq exporting issue (#11)
wenhuach21
committed
1 year ago
Verified
1869cde0
fix autogptq exporting issue
wenhuach21
committed
1 year ago
d413128f
fix autogptq exporting issue and update readme
wenhuach21
committed
1 year ago
b99431b0
add auto-gptq to requirements.txt, rename export dir
wenhuach21
committed
1 year ago
f987a5c9
Merge branch 'main' of https://github.com/intel/auto-round
wenhuach21
committed
1 year ago
4524b268
fix the typo neural-chat-7b-v3 to neural-chat-7b-v3-3
wenhuach21
committed
1 year ago
0047da38
fix binary name
chensuyue
committed
1 year ago
d6aa859a
update readme
wenhuach21
committed
1 year ago
cf5e1d39
update readme
wenhuach21
committed
1 year ago
9f0db7ea
tiny change
wenhuach21
committed
1 year ago
3da3aeac
update readme
wenhuach21
committed
1 year ago
4f6c7c56
update readme
wenhuach21
committed
1 year ago
8399597a
update readme
wenhuach21
committed
1 year ago
d5e94379