Unleash your creativity
Check out our currently supported models below. We support the best, highest-rated models available, and we update the selection frequently. Sign up to receive a monthly update about our latest models and other news, and join our Discord server to ask questions or request additional models.
TOP LLMS
Fallen Llama 3.3 R1 70B v1
L3.3 70B Magnum v4 SE
DeepSeek R1 Distill Llama 70B
70B L3.3 Cirrus x1
- Sao10K-70B-L3.3-Cirrus-x1
Magnum-72b-v4
- RP, Storywriting
- FP8 Dynamic
- anthracite-org-magnum-v4-72b-FP8-Dynamic
- Context: 32K
- Preset: https://files.catbox.moe/rqei05.json
- RP Instruct: https://files.catbox.moe/btnhau.json
- RP Context: https://files.catbox.moe/7kct3f.json
Settings provided by: GERGE
72B Qwen2.5 Kunou v1
- Sao10K-72B-Qwen2.5-Kunou-v1-FP8-Dynamic
- FP8
- Context: 32K
SorcererLM 8x22b bf16
- RP
- BF16
- rAIfle-SorcererLM-8x22b-bf16
- Context: 16K
- Recommended Settings: https://files.catbox.moe/9tj7m0.json
L3.3-70B-Euryale-v2.3
- Sao10K-L3.3-70B-Euryale-v2.3-FP8-Dynamic
- FP8
- Context: 32K
L3.1 70B Hanami x1
- GP, RP
- FP16
- Sao10K-L3.1-70B-Hanami-x1
- Context: 32K
PLUS PLAN MODELS
All of Free + Essential + Standard and the following:
SorcererLM 8x22b bf16
- RP
- BF16
- rAIfle-SorcererLM-8x22b-bf16
- Context: 16K
- Recommended Settings: https://files.catbox.moe/9tj7m0.json
STANDARD PLAN MODELS
All of Free + Essential and the following:
Fallen Llama 3.3 R1 70B v1
L3.3 70B Magnum v4 SE
DeepSeek R1 Distill Llama 70B
Magnum-72b-v4
- RP, Storywriting
- FP8 Dynamic
- anthracite-org-magnum-v4-72b-FP8-Dynamic
- Context: 32K
- Preset: https://files.catbox.moe/rqei05.json
- RP Instruct: https://files.catbox.moe/btnhau.json
- RP Context: https://files.catbox.moe/7kct3f.json
Settings provided by: GERGE
ESSENTIAL PLAN MODELS
All of Free and the following:
multilingual-e5-base
- intfloat-multilingual-e5-base
DeepHermes 3 Mistral 24B Preview
- NousResearch-DeepHermes-3-Mistral-24B-Preview
70B L3.3 Cirrus x1
- Sao10K-70B-L3.3-Cirrus-x1
72B Qwen2.5 Kunou v1
- Sao10K-72B-Qwen2.5-Kunou-v1-FP8-Dynamic
- FP8
- Context: 32K
L3.3-70B-Euryale-v2.3
- Sao10K-L3.3-70B-Euryale-v2.3-FP8-Dynamic
- FP8
- Context: 32K
L3.1 70B Hanami x1
- GP, RP
- FP16
- Sao10K-L3.1-70B-Hanami-x1
- Context: 32K
Wayfarer 12B
- LatitudeGames-Wayfarer-12B
Meta Llama Guard 2 8B
Anubis 70B v1
- RP, Storywriting
- TheDrummer-Anubis-70B-v1-FP8-Dynamic
- Context: 32K
L3.1-70B-Euryale-v2.2
- RP, Storywriting
- FP8 Dynamic
- L3.1-70B-Euryale-v2.2-FP8-Dynamic
- Context: 16K
- RP Instruct: https://files.catbox.moe/1c9sp0.json
- RP Context: https://files.catbox.moe/5wwpin.json
Settings provided by: ShotMisser64
magnum v2 72b
- anthracite-org-magnum-v2-72b-FP8-Dynamic
- FP8
- Context: 32K
UnslopNemo 12B v4.1
- RP
- BF16
- TheDrummer-UnslopNemo-12B-v4.1
- Context: 32K
- Recommended Settings: https://files.catbox.moe/7e6zjo.json
Qwen2.5-72B-Instruct
- GP
- FP8
- Qwen2.5-72B-Instruct-Turbo
- Context: 32K
Llama 3.1 Nemotron 70B Instruct HF
- GP
- BF16
- nvidia-Llama-3.1-Nemotron-70B-Instruct-HF
- Context: 32K
- Recommended Settings: https://files.catbox.moe/7e6zjo.json
Midnight Miqu 70B v1.5
- RP
- FP16
- Midnight-Miqu-70B-v1.5
- Context: 18K
- Preset: https://files.catbox.moe/l8e5zt.json
- RP Instruct: https://files.catbox.moe/eaj6gy.json
- RP Context: https://files.catbox.moe/mvn3jo.json
Settings provided by: ShadingCrawler
Qwen2-72B-Instruct
- BF16
- GP
- Qwen2.5-72B-Instruct
- Context: 32K
Llama-3.2-11B-Vision-Instruct
- BF16
- GP
- Llama-3.2-11B-Vision-Instruct-Turbo
- Context: 128K
FREE MODELS
Mixtral 8x7B Instruct v0.1
- BF16
- GP
- Mixtral-8x7B-Instruct-v0.1
- Context: 32K
Rocinante-12B-v1.1
- RP
- BF16
- TheDrummer-Rocinante-12B-v1.1
- Context: 32K
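
The plan headings above are cumulative: each plan includes everything in the plans below it plus its own additions. As a rough illustration only, the rule can be sketched in Python as follows. The function name `models_for_plan` and the dictionary are hypothetical, and the mapping lists just a few identifiers per tier taken from the lists above, not the full catalogue.

```python
# Plan tiers in ascending order, following the plan headings above.
PLAN_ORDER = ["Free", "Essential", "Standard", "Plus"]

# Hypothetical, abbreviated mapping: only a few identifiers per tier are shown.
MODELS_ADDED_BY_PLAN = {
    "Free": ["Mixtral-8x7B-Instruct-v0.1", "TheDrummer-Rocinante-12B-v1.1"],
    "Essential": ["Sao10K-70B-L3.3-Cirrus-x1", "LatitudeGames-Wayfarer-12B"],
    "Standard": ["anthracite-org-magnum-v4-72b-FP8-Dynamic"],
    "Plus": ["rAIfle-SorcererLM-8x22b-bf16"],
}


def models_for_plan(plan: str) -> list[str]:
    """Return every model a plan can use: its own additions plus all lower tiers."""
    tier = PLAN_ORDER.index(plan)  # raises ValueError for an unknown plan name
    available = []
    for name in PLAN_ORDER[: tier + 1]:
        available.extend(MODELS_ADDED_BY_PLAN[name])
    return available


if __name__ == "__main__":
    for plan in PLAN_ORDER:
        print(plan, "->", len(models_for_plan(plan)), "models in this sample")
```

Under this sketch, asking for the Standard plan returns the Free and Essential entries as well as the Standard ones, while the Plus-only model stays excluded.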
GUIDES
Models
L3-70B-Euryale-v2.1
Meet L3 70B Euryale v2.1: Your New Creative Companion What is L3 70B Euryale v2.1 [...]
Aug
Guides
Using Infermatic.ai API with SillyTavern
SillyTavern is one of the most popular interfaces to interact with LLMs. We have been [...]
Jun
Models
nvidia/Llama-3.1-Nemotron-70B-Instruct
Llama 3.1 Nemotron 70B Instruct: Follow and assert Llama 3.1 Nemotron 70B Instruct is NVIDIA’s [...]
Dec
Models
Infermatic/MN 12B Inferor v0.0
MN 12B Inferor v0.0: Dynamic and Creative MN 12B Inferor, also known as Mistral Nemo [...]
Dec
Frequently Asked Questions from Geek to Geek
- What is prompt engineering, and why is it critical in working with LLMs?
- How can I design effective prompts for LLMs?
- What are some standard techniques used in prompt engineering?
- How does prompt length impact the output of an LLM?
- How do LLMs understand and generate human-like text?
- What is the difference between Llama, Mixtral, and Qwen?
- What are some examples of advanced use cases of prompt engineering with LLMs?
- How do I choose the best LLM model for my project?
- What are large language models, and how do they differ from traditional NLP models?
- Can LLMs write code well?