Unleash your creativity

Check our our current supported models below. We support the best, highest-rated models available, and we update the selection of models we support frequently. Sign up to receive a monthly update about our latest models and other news, and participate in our Discord Server to ask questions or request additional models.

TOP LLMS

TheDrummer/

Fallen Llama 3.3 R1 70B v1

RP, Storywriting
  • TheDrummer-Fallen-Llama-3.3-R1-70B-v1
New
Doctor-Shotgun/

L3.3 70B Magnum v4 SE

"SE" for Special Edition, the objective, as with the other Magnum models, is to emulate the prose style and quality of the Claude 3 Sonnet/Opus series of models on a local scale.
  • Doctor-Shotgun-L3.3-70B-Magnum-v4-SE
Sao10K/

70B L3.3 Cirrus x1

RP, Story Writing, Creative
  • Sao10K-70B-L3.3-Cirrus-x1
Anthracite-org/

Magnum-72b-v4

This model has spatial awareness, memory and detailed descriptions to keep the generation entertaining. Very good creativity and NSFW.

Settings provided by: GERGE

Sao10K/

72B Qwen2.5 Kunou v1

Another version of Euryale with with a Qwen base model.
rAIfle/

SorcererLM 8x22b bf16

Sao10K/

L3.3-70B-Euryale-v2.3

A direct replacement / successor to Euryale v2.2.
Sao10K/

L3.1 70B Hanami x1

  • GP, RP
  • FP16
  • Sao10K-L3.1-70B-Hanami-x1
  • Context: 32K

FREE MODELS

Mistralai/

Mixtral 8x7B Instruct v0.1

  • BF16
  • GP
  • Mixtral-8x7B-Instruct-v0.1
  • Context: 32K
TheDrummer/

Rocinante-12B-v1.1

  • RP
  • BF16
  • TheDrummer-Rocinante-12B-v1.1
  • Context: 32K

PLUS PLAN MODELS

All of Free + Essential and the following:

New
Top
Doctor-Shotgun/

L3.3 70B Magnum v4 SE

"SE" for Special Edition, the objective, as with the other Magnum models, is to emulate the prose style and quality of the Claude 3 Sonnet/Opus series of models on a local scale.
  • Doctor-Shotgun-L3.3-70B-Magnum-v4-SE
Top
rAIfle/

SorcererLM 8x22b bf16

ESSENTIAL PLAN MODELS

All of Free and the following:

intfloat/

multilingual-e5-base

Embeddings model. This model is initialized from xlm-roberta-base and continually trained on a mixture of multilingual datasets. It supports 100 languages from xlm-roberta, but low-resource languages may see performance degradation. 512 Max context length.
  • intfloat-multilingual-e5-base
NousResearch/

DeepHermes 3 Mistral 24B Preview

Latest version of the flagship Hermes series. One of the first models to unify Reasoning and normal LLM response modes into one model. Also has also improved LLM annotation, judgement, and function calling.
  • NousResearch-DeepHermes-3-Mistral-24B-Preview
Top
TheDrummer/

Fallen Llama 3.3 R1 70B v1

RP, Storywriting
  • TheDrummer-Fallen-Llama-3.3-R1-70B-v1
Deepseek-ai/

DeepSeek R1 Distill Llama 70B

GP, reasoning
  • deepseek-ai-DeepSeek-R1-Distill-Llama-70B
Top
Sao10K/

70B L3.3 Cirrus x1

RP, Story Writing, Creative
  • Sao10K-70B-L3.3-Cirrus-x1
Top
Anthracite-org/

Magnum-72b-v4

This model has spatial awareness, memory and detailed descriptions to keep the generation entertaining. Very good creativity and NSFW.

Settings provided by: GERGE

Top
Sao10K/

72B Qwen2.5 Kunou v1

Another version of Euryale with with a Qwen base model.
Top
Sao10K/

L3.3-70B-Euryale-v2.3

A direct replacement / successor to Euryale v2.2.
Top
Sao10K/

L3.1 70B Hanami x1

  • GP, RP
  • FP16
  • Sao10K-L3.1-70B-Hanami-x1
  • Context: 32K
LatitudeGames/

Wayfarer 12B

Wayfarer is an adventure role-play model specifically trained to give players a challenging and dangerous experience.
  • LatitudeGames-Wayfarer-12B
meta-llama/

Meta Llama Guard 2 8B

Meta Llama Guard 2 is a safeguard model. It can be used for classifying content in both LLM inputs (prompt classification) and in LLM responses (response classification).
TheDrummer/

Anubis 70B v1

Finetune of llama 3.3.
Sao10K/

L3-70B-Euryale-v2.2

Coherent, emotional and very creative.

Settings provided by: ShotMisser64

Anthracite-org/

magnum v2 72b

This model is fine-tuned on top of Qwen-2 72B Instruct.
TheDrummer/

UnslopNemo 12B v4.1

Qwen/

Qwen2.5-72B-Instruct

  • GP
  • FP8
  • Qwen2.5-72B-Instruct-Turbo
  • Context: 32K
nvidia/

Llama 3.1 Nemotron 70B Instruct HF

Sophosympatheia/

Midnight Miqu 70B v1.5

Settings provided by: ShadingCrawler

Qwen/

Qwen2-72B-Instruct

  • BF16
  • GP
  • Qwen2.5-72B-Instruct
  • Context: 32K
meta-llama/

Llama-3.2-11B-Vision-Instruct

  • BF16
  • GP
  • Llama-3.2-11B-Vision-Instruct-Turbo
  • Context: 128K

GUIDES

Models

L3-70B-Euryale-v2.1

Meet L3 70B Euryale v2.1: Your New Creative Companion What is L3 70B Euryale v2.1 [...]

Guides

Using Infermatic.ai API with SillyTavern

SillyTavern is one of the most popular interfaces to interact with LLMs. We have been [...]

Models

nvidia/Llama-3.1-Nemotron-70B-Instruct

Llama 3.1 Nemotron 70B Instruct: Follow and assert Llama 3.1 Nemotron 70B Instruct is NVIDIA’s [...]

Models

Infermatic/MN 12B Inferor v0.0

MN 12B Inferor v0.0: Dynamic and Creative MN 12B Inferor, also known as Mistral Nemo [...]