Fix: add .model after language_model in quantization ignore/exclude_modules

#4
by zhiyucheng (NVIDIA org) - opened

This PR fixes the module path prefix in the quantization config files.

In both config.json (quantization_config.ignore) and hf_quant_config.json (quantization.exclude_modules), every entry starting with the language_model. prefix has been updated to use language_model.model. so that it correctly references the submodule path.

For example:

  • language_model.lm_head β†’ language_model.model.lm_head
  • language_model.layers.*.self_attn* β†’ language_model.model.layers.*.self_attn*
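The rename described above can be sketched as a small script. This is a hypothetical illustration, not the actual change in the PR (which edits the JSON files directly); the fix_prefix helper name and the sample entries beyond the two listed above are assumptions.

```python
# Hypothetical sketch of the prefix fix: insert the ".model" segment into any
# entry that starts with "language_model." but does not already have it.
def fix_prefix(entries):
    fixed = []
    for name in entries:
        if name.startswith("language_model.") and not name.startswith("language_model.model."):
            # Replace only the leading prefix, leaving the rest of the path intact.
            fixed.append(name.replace("language_model.", "language_model.model.", 1))
        else:
            # Entries under other submodules (e.g. a vision tower) are untouched.
            fixed.append(name)
    return fixed

ignore = [
    "language_model.lm_head",
    "language_model.layers.*.self_attn*",
]
print(fix_prefix(ignore))
```

Running this prints the corrected paths, matching the examples above.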
zhiyucheng changed pull request status to closed
