infermatic, Author at Infermatic.ai

Guides

How to use Kokoro 82M – TTS Model

Posted on July 29, 2025July 29, 2025 by infermatic

What is TTS Generator? TTS Generator converts written text into natural-sounding speech using advanced AI voice synthesis. Simply type your text, select a voice, and generate high-quality audio in seconds. Key Features: 67 Voices: Pre-trained in multiple languages with customization options. Voice Combination: Blend voices using different weights for unique audio results. How to […]

Continue reading →

Models

Intro to Embedding Models

Posted on April 15, 2025April 15, 2025 by infermatic

What Are Embedding Models? Embedding models convert text into numbers—specifically, high-dimensional vectors (arrays of numbers). These vectors capture the semantic meaning of the text so that similar texts have similar vector representations. You can store and compare these vectors using methods like cosine similarity to find similar content. API Usage Endpoint POST https://api.totalgpt.ai/v1/embeddings Required Headers […]

Continue reading →

AI

Wyvern Chat Guide

Posted on December 27, 2024February 20, 2025 by infermatic

Unlocking the Full Potential with WyvernChat Integrations: Lorebooks: Enhance your narratives by importing or selecting from existing lorebooks. Character Cards & Scenarios: Deepen engagement with detailed character profiles and predefined scenarios.

Continue reading →

Models

microsoft/WizardLM 2 8x22B

Posted on December 17, 2024February 20, 2025 by infermatic

microsoft/WizardLM-2-8x22B: Bewitched WizardLM 2 8x22b stands out as the best model for applications. It delivers precise and comprehensive answers to knowledge-based questions and excels in inferential reasoning and mathematical problem-solving, outperforming all other models I’ve tested.This state-of-the-art large language model, developed by Microsoft AI, showcases enhanced capabilities in complex chat, multilingual tasks, reasoning, and agent-based […]

Continue reading →

Models

Sao10K/L3.3 70B Euryale v2.3

Posted on December 10, 2024February 20, 2025 by infermatic

Sao10K/L3.3 70B Euryale v2.3: Get delighted The Euryale series of models, originally known as “Stheno’s sister” (starting with the 8b creative model), has evolved over time into one of the most popular creative roleplay (RP) and storywriting models available today. Across its versions, this model series has maintained key standout features: Strong prompt adherence (while […]

Continue reading →

Models

nvidia/Llama-3.1-Nemotron-70B-Instruct

Posted on December 6, 2024February 21, 2025 by infermatic

Llama 3.1 Nemotron 70B Instruct: Follow and assert Llama 3.1 Nemotron 70B Instruct is NVIDIA’s state-of-the-art LLM designed for helpful and precise responses using RLHF (REINFORCE).It ranks #1 on key benchmarks like Arena Hard (85.0), AlpacaEval 2 LC (57.6), and MT-Bench (8.98). Compatible with HuggingFace Transformers, it supports inputs up to 128k tokens and outputs […]

Continue reading →

Models

Infermatic/MN 12B Inferor v0.0

Posted on December 5, 2024December 6, 2024 by infermatic

MN 12B Inferor v0.0: Dynamic and Creative MN 12B Inferor, also known as Mistral Nemo Inferor, is an impressive model merge created from 4 MN fine-tunes. It uses the following models: anthracite-org/magnum-v4-12b nbeerbower/Mistral-Nemo-Gutenberg-Doppel-12B-v2 nothingiisreal/MN-12B-Starcannon-v3 Fizzarolli/MN-12b-Sunrose Review This model stands out for its remarkable creativity, often generating long, contextually adaptive paragraphs. However, it has a notable […]

Continue reading →

Guides

Guide to quant FP8

Posted on August 27, 2024January 13, 2025 by infermatic

Simple Guide to Convert an FP16 Model to FP8 Overview This simple guide to quant models walks you through converting a model from FP16 to FP8, an 8-bit data format that significantly improves model inference efficiency without sacrificing output quality. FP8 is ideal for quantizing large language models (LLMs), ensuring faster and more cost-effective deployments. […]

Continue reading →

Models

L3-70B-Euryale-v2.1

Posted on August 20, 2024December 6, 2024 by infermatic

Meet L3 70B Euryale v2.1: Your New Creative Companion What is L3 70B Euryale v2.1 ? L3 70B Euryale v2.1 is a text generation model, ranked as the moment as one of the best RP/Story Writing models. As described by its creator Sao10K, like the big sister of L3 Stheno v3/3 8B. Think of her […]

Continue reading →

Guides

Using Infermatic.ai API with SillyTavern

Posted on June 21, 2024December 6, 2024 by infermatic

SillyTavern is one of the most popular interfaces to interact with LLMs. We have been working on developing an API and one of the first interfaces we wanted to integrate with was SillyTavern. We have done just that. Requirements: Infermatic.ai Plus Tier subscription ($15/month) Steps to integrate: After you subscribe to Infermatic.ai you can generate […]

Continue reading →