Explore the specifications, capabilities, and API model strings of Cloxo AI's powerful models.
A fine-tuned version of Llama 3.2 90B optimized for text and vision tasks, designed for smooth chat and content generation.
Provider: Cloxo AI
API Slug: cloxoai/cloxogpt
Context Length: 8k
Links: Data Policy / Status
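Each entry's API slug is the model string you pass in a request. As a minimal sketch (the endpoint URL and header conventions below are assumptions for illustration, not taken from this page; only the `cloxoai/cloxogpt` slug comes from the entry above), an OpenAI-style chat-completion request body might be built like this:

```python
import json

# Hypothetical endpoint; check the provider's own docs for the real URL
# and authentication header before sending anything.
API_URL = "https://api.cloxo.ai/v1/chat/completions"

def build_chat_request(model_slug: str, user_message: str) -> str:
    """Build an OpenAI-style chat-completion payload as a JSON string."""
    payload = {
        "model": model_slug,  # e.g. "cloxoai/cloxogpt" from the catalog
        "messages": [
            {"role": "user", "content": user_message},
        ],
    }
    return json.dumps(payload)

body = build_chat_request("cloxoai/cloxogpt", "Hello!")
# This JSON body would then be POSTed to API_URL with an Authorization header.
```

The same payload shape works for every slug in this catalog; only the `model` field changes.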
DeepSeek-V3, pre-trained on nearly 15T tokens, improves on previous versions' instruction following and coding, outperforming open-source models and rivaling leading closed-source models.
Provider: DeepSeek
API Slug: deepseek/deepseek-v3-free
Context Length: 64k
Links: Data Policy / Status
DeepSeek R1 Distill Llama 70B is distilled from Llama-3.3-70B-Instruct using DeepSeek R1 outputs. Its advanced distillation yields strong benchmark results, including AIME 2024 pass@1 of 70.0, MATH-500 pass@1 of 94.5, and a CodeForces rating of 1633, making it competitive with larger frontier models.
Provider: DeepSeek
API Slug: deepseek/deepseek-r1-distill-llama-70b-free
Context Length: 16k
Links: Data Policy / Status
The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction-tuned generative model with 70B parameters (text in/text out). Supported languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.
Provider: Lambda
API Slug: meta/llama-3-3-70b-instruct-free
Context Length: 131k
Links: Data Policy / Status
DeepSeek R1 is here: performance on par with OpenAI o1, but open-sourced and with fully open reasoning tokens. It has 671B total parameters, with 37B active per inference pass.
Provider: DeepSeek
API Slug: deepseek/deepseek-r1-free
Context Length: 16k
Links: Data Policy / Status
QwQ-32B-Preview is an experimental research model from the Qwen Team showcasing promising AI reasoning capabilities, particularly in math and coding, while exhibiting limitations in language mixing, recursive reasoning, safety, and overall performance.
Provider: DeepInfra
API Slug: qwen/qwq-32b-preview-free
Context Length: 33k
Links: Data Policy / Status
OpenAI's cost-efficient multimodal model, offering state-of-the-art intelligence for text and vision inputs.
Provider: OpenAI
API Slug: openai/gpt-4o-mini-free
Context Length: 128k
Links: Data Policy / Status
Gemini 2.0 Flash Thinking Mode is an experimental model that's trained to generate the "thinking process" the model goes through as part of its response. As a result, Thinking Mode is capable of stronger reasoning capabilities in its responses than the base Gemini 2.0 Flash model.
Provider: Google Vertex
API Slug: google/gemini-20-flash-thinking-exp-free
Context Length: 40k
Links: Data Policy / Status
The first image-to-text model from Mistral AI, optimized for converting visual data into text with precision.
Provider: Hyperbolic
API Slug: mistral/pixtral-12b-free
Context Length: 4k
Links: Data Policy / Status
Gemini Flash 2.0 significantly improves time to first token (TTFT) over Gemini Flash 1.5 while maintaining Gemini Pro 1.5-level quality, enhancing multimodal understanding, coding, complex instruction following, and function calling for more seamless and robust agentic experiences.
Provider: Google Vertex
API Slug: google/gemini-2.0-free
Context Length: 1000k
Links: Data Policy / Status
The highly anticipated 405B instruct-tuned version is optimized for high-quality dialogue use cases. It has demonstrated strong performance in evaluations compared to leading closed-source models, including GPT-4o and Claude 3.5 Sonnet.
Provider: DeepInfra
API Slug: meta/llama-3-1-405b-instruct-free
Context Length: 32k
Links: Data Policy / Status
The o1 models are optimized for math, science, programming, and other STEM-related tasks. They consistently exhibit PhD-level accuracy on benchmarks in physics, chemistry, and biology.
Provider: OpenAI
API Slug: openai/o1-free
Context Length: 128k
Links: Data Policy / Status
A 12B parameter model with a 128k token context length built by Mistral in collaboration with NVIDIA. The model is multilingual, supporting English, French, German, Spanish, Italian, Portuguese, Chinese, Japanese, Korean, Arabic, and Hindi.
Provider: DeepInfra
API Slug: mistral/mistral-nemo-free
Context Length: 128k
Links: Data Policy / Status
Qwen2.5-Coder improves upon CodeQwen1.5 in code generation, reasoning, and fixing, providing a stronger foundation for applications like Code Agents. It also maintains strengths in math and general knowledge.
Provider: Lambda
API Slug: qwen/qwen-2-5-coder-32b-instruct-free
Context Length: 33k
Links: Data Policy / Status
Qwen2.5 72B is an advanced LLM with improved knowledge, coding, and multilingual capabilities, supporting long context and generating high-quality outputs.
Provider: DeepInfra
API Slug: qwen/qwen-2-5-72b-free
Context Length: 32k
Links: Data Policy / Status
This 7.3B Mamba model offers linear time inference, a 256k context window, and fast responses for code and reasoning tasks, performing comparably to transformers and available under Apache 2.0.
Provider: Mistral
API Slug: mistral/codestral-mamba-free
Context Length: 256k
Links: Data Policy / Status
GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks.
Provider: OpenAI
API Slug: openai/gpt-3-5-turbo-free
Context Length: 16k
Links: Data Policy / Status
Mistral Large 2 is a powerful AI model from Mistral AI excelling at reasoning, coding, and various languages, boasting a long context window for effective information retrieval.
Provider: Mistral
API Slug: mistral/mistral-large-free
Context Length: 128k
Links: Data Policy / Status
An experimental multimodal model based on Gemini 1.5 Pro, capable of handling both text and vision tasks with high efficiency.
Provider: Google AI Studio
API Slug: google/learnlm-1-5-pro-experimental-free
Context Length: 8k
Links: Data Policy / Status
Grok 2 Vision improves image AI with enhanced visual understanding, instruction following, and multilingual support, enabling intuitive, visually aware apps and paving the way for future image solutions.
Provider: xAI
API Slug: x-ai/grok-2-vision-free
Context Length: 33k
Links: Data Policy / Status
Liquid's 40.3B MoE model, a powerful LFM built on dynamic systems, excels at modeling diverse sequential data, including video, audio, and text.
Provider: Lambda
API Slug: liquid/lfm-40b-free
Context Length: 66k
Links: Data Policy / Status
A 90-billion-parameter multimodal model excelling in advanced visual reasoning and language tasks like image captioning and analysis.
Provider: SambaNova
API Slug: meta/llama-3-2-90b-vision-instruct-free
Context Length: 131k
Links: Data Policy / Status
An 11B parameter multimodal model designed for tasks that integrate visual and textual reasoning with high accuracy.
Provider: Together
API Slug: meta/llama-3-2-11b-vision-instruct-free
Context Length: 131k
Links: Data Policy / Status
An experimental model optimized for STEM tasks, providing PhD-level accuracy in physics, chemistry, and biology.
Provider: OpenAI
API Slug: gpt-o1-mini-free
Context Length: 128k
Links: Data Policy / Status
An updated model with faster performance and lower latencies, designed for multilingual tasks and reasoning.
Provider: Cohere
API Slug: cohere/command-r-plus-free
Context Length: 128k
Links: Data Policy / Status
Command-R is a 35B parameter model that performs conversational language tasks at a higher quality, more reliably, and with a longer context than previous models. It can be used for complex workflows like code generation, retrieval augmented generation (RAG), tool use, and agents.
Provider: Cohere
API Slug: cohere/command-r-free
Context Length: 128k
Links: Data Policy / Status
OpenAI's advanced GPT-4o model, offering faster processing, enhanced multilingual capabilities, and better file analysis.
Provider: OpenAI
API Slug: openai/gpt-4o-free
Context Length: 128k
Links: Data Policy / Status
A lightweight, experimental 8B parameter model offering efficient multimodal capabilities for text and vision tasks.
Provider: Google AI Studio
API Slug: google/gemini-flash-1-5-8b-exp
Context Length: 1000k
Links: Data Policy / Status
An experimental multimodal model by Google, heavily rate-limited but designed for high-end vision and text tasks.
Provider: Google
API Slug: google/gemini-pro-1-5-free
Context Length: 1000k
Links: Data Policy / Status
NVIDIA's Llama 3.1 Nemotron 70B is a language model optimized for generating precise and useful responses across various domains, leveraging RLHF and excelling in automatic alignment benchmarks.
Provider: Lambda
API Slug: nvidia/llama-3-1-nemotron-70b-instruct-free
Context Length: 131k
Links: Data Policy / Status
A transformer-based model with strengths in multilingual tasks, reasoning, and coding capabilities.
Provider: Novita
API Slug: qwen-2-7b-instruct-free
Context Length: 8k
Links: Data Policy / Status
Gemma 2 27B is an open-source model from Google, based on Gemini research, that excels in text generation tasks like question answering, summarization, and reasoning.
Provider: DeepInfra
API Slug: google/gemma-2-27b-it-free
Context Length: 8k
Links: Data Policy / Status
A versatile 9B parameter model by Google, designed for efficient and cost-effective language tasks across domains.
Provider: DeepInfra
API Slug: google/gemma-2-9b-it-free
Context Length: 4k
Links: Data Policy / Status
A fine-tuned Llama 2 13B model excelling in descriptive roleplay and creative narratives.
Provider: Together 2
API Slug: gryphe/mythomax-l2-13b-free
Context Length: 4k
Links: Data Policy / Status
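Context lengths in this catalog range from 4k to 1000k tokens, so a client may want to pick the smallest model window that still fits its prompt. A minimal sketch, assuming a rough token count is already known (the dictionary below copies a few slugs and context lengths from the entries above; the selection rule itself is an illustration, not part of any provider's API):

```python
# A few (slug -> context length in tokens) pairs copied from the catalog above.
CATALOG = {
    "cloxoai/cloxogpt": 8_000,
    "deepseek/deepseek-v3-free": 64_000,
    "mistral/codestral-mamba-free": 256_000,
    "google/gemini-2.0-free": 1_000_000,
}

def pick_model(prompt_tokens: int):
    """Return the slug with the smallest context window that fits the
    prompt, or None if no catalog model is large enough."""
    fitting = [(length, slug) for slug, length in CATALOG.items()
               if length >= prompt_tokens]
    return min(fitting)[1] if fitting else None

# e.g. a 50k-token prompt needs at least a 64k window:
# pick_model(50_000) -> "deepseek/deepseek-v3-free"
```

Preferring the smallest sufficient window is just one possible policy; a real client might also weigh cost, latency, or modality support.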