MlxLLM¶
Apple MLX LLM implementation.
Attributes¶
- 
path_or_hf_repo: the path to the model or the Hugging Face Hub repo id. 
- 
tokenizer_config: the tokenizer configuration. 
- 
mlx_model_config: the MLX model configuration. 
- 
adapter_path: the path to the adapter. 
- 
use_magpie_template: a flag used to enable/disable applying the Magpie pre-query template. Defaults to False.
- 
magpie_pre_query_template: the pre-query template to be applied to the prompt or sent to the LLM to generate an instruction or a follow up user message. Valid values are "llama3", "qwen2" or another pre-query template provided. Defaults to None.