Compact Llama 3.2 3B for efficient inference.
llama-3.2-3b-instruct
LLM Gateway routes requests to the best providers that are able to handle your prompt size and parameters.
novita/llama-3.2-3b-instruct