Unleash your creativity

Check our our current supported models below. We support the best, highest-rated models available, and we update the selection of models we support frequently. Sign up to receive a monthly update about our latest models and other news, and participate in our Discord Server to ask questions or request additional models.

Top LLMs

Settings provided by: GERGE

Settings provided by: ShotMisser64

All Models

  • A RP/storywriting specialist model, full-parameter finetune of Qwen2.5-72B on mixture of synthetic and natural data.
  • Format: ChatML
  • Context: 32K
  • Settings:
  • Temperature: 1
  • Min-P: 0.05
  • Top-A: 0.2
  • Repetition Penalty: 1.03
  • A RP/storywriting specialist model, full-parameter finetune of Qwen2.5-32B on mixture of synthetic and natural data.
  • Context: 32K
  • Settings:
  • Temp: 1
  • Min-P: 0.05
  • Typical-P: 0.9
  • Top-A: 0.2
  • Repetition Penalty: 1.03
  • GP
  • FP8
  • Qwen2.5-72B-Instruct-Turbo
  • Context: 32K

 Settings provided by: ShadingCrawler

Settings provided by: GERGE

Settings provided by: ShotMisser64

  • GP, RP
  • FP16
  • Sao10K-3.1-70B-Hanami-x1
  • Context: 32K
  • Mixtral-8x7B-Instruct-v0.1
  • General purpose
  • Context: 32K
  • RP
  • BF16
  • TheDrummer-Rocinante-12B-v1.1
  • Context: 32K
  • FP8 Dynamic
  • RP
  • llama-3-lumimaid-8b-v0.1
  • Context: 8K
  • BF16
  • GP
  • Qwen2-72B-Instruct
  • Context: 32K
  • BF16
  • GP
  • Mixtral-8x7B-Instruct-v0.1
  • Context: 32K
  • BF16
  • GP
  • Llama-3.2-11B-Vision-Instruct-Turbo
  • Context: 128K

Guides

Guides

Guide to quant FP8

Simple Guide to Convert an FP16 Model to FP8 Overview This simple guide to quant [...]

Models

L3-70B-Euryale-v2.1

Meet L3 70B Euryale v2.1: Your New Creative Companion What is L3 70B Euryale v2.1 [...]

Guides

Using Infermatic.ai API with SillyTavern

SillyTavern is one of the most popular interfaces to interact with LLMs. We have been [...]

Docs