2025-12-9: Added the LVLLM_MOE_USE_WEIGHT environment variable, which lets MoE modules run FP8 model inference in one of two modes. With LVLLM_MOE_USE_WEIGHT="KEEP", lk_moe inference uses the original weight format ...
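As a minimal sketch of how such a variable might be set and read from Python: only the variable name LVLLM_MOE_USE_WEIGHT and the value "KEEP" come from the entry above. The helper `moe_weight_mode` is hypothetical and is not part of the project's API, and the second mode is not named here because the entry is truncated.

```python
import os

ENV_NAME = "LVLLM_MOE_USE_WEIGHT"  # variable name taken from the changelog entry


def moe_weight_mode(env=None):
    """Hypothetical helper: return the configured mode string, or None if unset."""
    env = os.environ if env is None else env
    return env.get(ENV_NAME)


if __name__ == "__main__":
    # Set the variable before the inference process starts so that it is
    # visible when the MoE modules are initialised (assumed behaviour).
    os.environ[ENV_NAME] = "KEEP"
    print(moe_weight_mode())
```

In practice the variable would more likely be exported in the shell that launches the server. The sketch only shows that the value is an ordinary string read from the process environment.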