MlxLLM¶
Apple MLX LLM implementation.
Attributes¶
-
path_or_hf_repo: the path to the model or the Hugging Face Hub repo id.
-
tokenizer_config: the tokenizer configuration.
-
mlx_model_config: the MLX model configuration.
-
adapter_path: the path to the adapter.
-
use_magpie_template: a flag used to enable/disable applying the Magpie pre-query template. Defaults to
False
. -
magpie_pre_query_template: the pre-query template to be applied to the prompt or sent to the LLM to generate an instruction or a follow up user message. Valid values are "llama3", "qwen2" or another pre-query template provided. Defaults to
None
.