Provider | LLM | Text Embedding | Rerank | Speech to text | TTS |
---|---|---|---|---|---|
OpenAI | âïž(ð ïž)(ð) | âïž | âïž | âïž | |
Anthropic | âïž(ð ïž) | ||||
Azure OpenAI | âïž(ð ïž)(ð) | âïž | âïž | âïž | |
Gemini | âïž | ||||
Google Cloud | âïž(ð) | âïž | |||
Nvidia API Catalog | âïž | âïž | âïž | ||
Nvidia NIM | âïž | ||||
Nvidia Triton Inference Server | âïž | ||||
AWS Bedrock | âïž | âïž | |||
OpenRouter | âïž | ||||
Cohere | âïž | âïž | âïž | ||
together.ai | âïž | ||||
Ollama | âïž | âïž | |||
Mistral AI | âïž | ||||
groqcloud | âïž | ||||
Replicate | âïž | âïž | |||
Hugging Face | âïž | âïž | |||
Xorbits inference | âïž | âïž | âïž | âïž | âïž |
Zhipu AI | âïž(ð ïž)(ð) | âïž | |||
Baichuan | âïž | âïž | |||
Spark | âïž | ||||
Minimax | âïž(ð ïž) | âïž | |||
Tongyi | âïž | âïž | âïž | ||
Wenxin | âïž | âïž | |||
Moonshot AI | âïž(ð ïž) | ||||
Tencent Cloud | âïž | ||||
Stepfun | âïž(ð ïž)(ð) | ||||
VolcanoEngine | âïž | âïž | |||
01.AI | âïž | ||||
360 Zhinao | âïž | ||||
Azure AI Studio | âïž | âïž | |||
deepseek | âïž(ð ïž) | ||||
Tencent Hunyuan | âïž | ||||
SILICONFLOW | âïž | âïž | |||
Jina AI | âïž | âïž | |||
ChatGLM | âïž | ||||
Xinference | âïž(ð ïž)(ð) | âïž | âïž | ||
OpenLLM | âïž | âïž | |||
LocalAI | âïž | âïž | âïž | âïž | |
OpenAI API-Compatible | âïž | âïž | âïž | ||
PerfXCloud | âïž | âïž | |||
Lepton AI | âïž | ||||
novita.ai | âïž | ||||
Amazon Sagemaker | âïž | âïž | âïž | ||
Text Embedding Inference | âïž | âïž | |||
GPUStack | âïž(ð ïž)(ð) | âïž | âïž | ||
GPUStack | âïž(ð§ïž)(ð) | âïž | âïž | âïž | âïž |