🧠 Models

Last Updated: 2025-01-19

Overview of prominent AI language models and their capabilities.

LLMs

ChatGPT

  • Provider: OpenAI
  • Access: chat.openai.com
  • Key Features:
    • General-purpose conversation
    • Code generation
    • Text analysis
    • Multiple model versions (GPT-3.5, GPT-4)
    • API access
    • Code interpreter
    • Memory
    • Image input (for example, screenshots)
    • Image generation

Claude

  • Provider: Anthropic
  • Access: claude.ai
  • Key Features:
    • Long context windows
    • Accurate reasoning
    • Code generation
    • Multiple versions (Claude 2, Claude 3)

DeepSeek

  • Website: deepseek.com
  • Access: DeepSeek Chat
  • GitHub: deepseek-ai/DeepSeek-Coder
  • Description: Open-source language model
  • Key Features:
    • Code generation (a focus of the DeepSeek-Coder models)
    • DeepSeek R1 model with 128k context window
    • Enhanced mathematical reasoning capabilities
    • Available via API and chat interface

Llama 3

  • Provider: Meta
  • Description: Open-source large language model
  • Key Features:
    • Multiple model sizes
    • Fine-tuning capabilities
    • Research and commercial use
    • API access
    • Can run locally with Ollama

Gemini

  • Provider: Google
  • Access: gemini.google.com
  • Key Features:
    • Multimodal capabilities
    • Multiple versions (Pro, Ultra)
    • API access

NotebookLM

  • Provider: Google
  • Description: AI-powered note-taking and research assistant
  • Key Features:
    • Document analysis
    • Smart summarization
    • Research assistance
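    • Very large source capacity (reportedly around 25 million words across all sources in a notebook)
    • Low hallucination rate, since answers are grounded in the uploaded sources
    • Less creative output than general-purpose chatbots
    • Can generate audio summaries (podcast-style overviews) from the sources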

Grok

  • Provider: xAI
  • Website: grok.x.ai
  • Key Features:
    • Real-time X (Twitter) data access
    • Code generation and analysis
    • Mathematical reasoning
    • Conversational AI
    • Unique personality
  • Access: Limited beta
    • X Premium+ subscribers

Image Generation

DALL-E 3

Stable Diffusion

  • Provider: Stability AI
  • Website: stability.ai
  • GitHub: CompVis/stable-diffusion
  • API: platform.stability.ai
  • Deployment: Cloud & Self-hosted
  • Key Features:
    • Text-to-image generation
    • Image-to-image editing
    • Inpainting and outpainting
    • Multiple model versions (XL, 2.1, 3)
    • ControlNet support
    • Custom model training
    • Open source
  • Pricing:
    • Self-hosted: Free
    • API Credits: Starting at $10/month
    • Enterprise: Custom pricing
  • Popular UIs:
    • ComfyUI
    • Automatic1111
    • InvokeAI
    • RunwayML
    • DreamStudio
  • Use Cases:
    • Art creation
    • Design prototyping
    • Content generation
    • Game asset creation
    • Product visualization

Midjourney

  • Provider: Midjourney Inc.
  • Website: midjourney.com
  • Access: Discord bot interface (no official public API)
  • Key Features:
    • Artistic style focus
    • High-quality outputs
    • Style customization
    • Community features
  • Pricing: midjourney.com/pricing
    • Basic: $10/month
    • Standard: $30/month
    • Pro: $60/month

Leonardo.ai

Recraft.ai

  • Provider: Recraft
  • Website: recraft.ai
  • API: docs.recraft.ai/reference
  • Key Features:
    • Vector graphics generation
    • Icon creation
    • Brand asset generation
    • SVG output format
  • Pricing:
    • Free tier available
    • Pro: $20/month
    • Team: $49/month
  • Unique Features:
    • SVG-first approach
    • Design system integration
    • Scalable graphics
    • Brand consistency tools

Video Generation

Runway Gen-2

  • Provider: Runway
  • Website: runway.ml
  • API: docs.runway.ml/reference
  • Key Features:
    • Text-to-video generation
    • Video editing
    • Motion tracking
    • Green screen
  • Pricing:
    • Pro: $15/month
    • Unlimited: $35/month

D-ID

  • Provider: D-ID
  • Website: d-id.com
  • API: docs.d-id.com
  • Key Features:
    • Digital human creation
    • Text-to-video
    • Voice synthesis
    • Avatar customization
  • Pricing:
    • Creator: $5.99/month
    • Pro: $24.99/month
    • Enterprise: Custom

Synthesia

  • Provider: Synthesia
  • Website: synthesia.io
  • API: docs.synthesia.io/reference
  • Key Features:
    • AI avatar videos
    • Multi-language support
    • Custom avatar creation
    • Template library
  • Pricing:
    • Personal: $29/month
    • Business: Custom
    • Enterprise: Custom

Voice Generation

Bland.ai

  • Provider: Bland
  • Website: bland.ai
  • Key Features:
    • Real-time voice AI calls
    • Natural conversation flow
    • Custom voice cloning
    • API integration
    • Call analytics
  • Pricing: bland.ai/pricing
    • Pay-per-minute model
    • Volume discounts available
    • Custom enterprise plans

ElevenLabs

  • Provider: ElevenLabs
  • Website: elevenlabs.io
  • Key Features:
    • Text-to-speech synthesis
    • Voice cloning
    • Custom voice creation
    • API access
  • Pricing: elevenlabs.io/pricing
    • Pay-per-minute model
    • Volume discounts available
    • Custom enterprise plans

OpenAI Real-Time GPT-4o

  • Provider: OpenAI
  • Website: openai.com
  • Key Features:
    • Voice-to-voice
    • API access
    • Real-time voice chat
  • Pricing: openai.com/pricing
    • Pay-per-token model (text and audio tokens)
    • Volume discounts available
    • Custom enterprise plans

VAPI

  • Provider: VAPI
  • Website: vapi.ai
  • Key Features:
    • Voice-to-voice
    • API access
    • Real-time voice chat
  • Pricing: vapi.ai/pricing
    • Pay-per-minute model
    • Volume discounts available
    • Custom enterprise plans

Tools

Ollama

  • Provider: Ollama
  • Website: ollama.ai
  • GitHub: github.com/ollama/ollama
  • Deployment: Self-hosted
  • Key Features:
    • Run LLMs locally
    • Easy model management
    • Multiple model support
    • API access
    • Cross-platform
    • Docker support
  • Supported Models:
    • Llama 3
    • Mistral
    • CodeLlama
    • Gemma
    • And many more
  • Pricing: Free and open source
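
As a rough illustration of the "API access" feature above, the sketch below builds a request for Ollama's local REST endpoint (`/api/generate` on the default port 11434). The helper names `build_generate_request` and `generate` are made up for this example, and the model name `llama3` assumes that model has already been pulled. Treat it as a minimal sketch, not a full client.

```python
import json
import urllib.request

# Default address of a locally running Ollama server.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_generate_request(prompt: str, model: str = "llama3") -> urllib.request.Request:
    """Build (but do not send) a non-streaming generate request."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, model: str = "llama3") -> str:
    """Send the request to a running Ollama server and return the reply text."""
    with urllib.request.urlopen(build_generate_request(prompt, model)) as resp:
        return json.loads(resp.read())["response"]
```

Calling `generate("Explain GGUF in one sentence.")` should return the model's reply as a string, provided the Ollama server is running and the model is available locally.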

HuggingFace

  • Provider: Hugging Face Inc.
  • Website: huggingface.co
  • GitHub: github.com/huggingface
  • Deployment: Cloud & Self-hosted
  • Key Features:
    • Model Hub with 300k+ models
    • Datasets repository
    • Spaces for demos
    • AutoTrain for fine-tuning
    • Inference API
    • Enterprise deployment
    • Transformers library
  • Popular Models:
    • BERT
    • T5
    • GPT-2
    • Stable Diffusion
    • Whisper
    • CodeLlama
  • Pricing:
    • Open Source: Free
    • Pro: $9/month
    • Enterprise: Custom pricing
  • Use Cases:
    • Model discovery and sharing
    • Model training and fine-tuning
    • Dataset management
    • MLOps and deployment
    • Research and experimentation
    • Production inference

OpenRouter

  • Provider: OpenRouter
  • Website: openrouter.ai
  • GitHub: github.com/OpenRouterTeam/openrouter
  • Key Features:
    • Single API for multiple LLMs
    • Pay-as-you-go pricing
    • Access to 50+ models
    • Unified prompt format
    • Load balancing
    • Fallback routing
  • Supported Models:
    • Claude 3 (Anthropic)
    • GPT-4 (OpenAI)
    • Gemini Pro (Google)
    • Mistral
    • And many more
  • Pricing:
    • Pay per token
    • No subscription required
    • Volume discounts available
  • Use Cases:
    • Model comparison
    • Production deployment
    • Cost optimization
    • Multi-model applications
    • API standardization
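
The idea behind "fallback routing" can be sketched on the client side as trying a list of models in order until one succeeds. This is only an illustration of the concept, not OpenRouter's own server-side routing. The function names, the `flaky_backend` stand-in, and the model IDs are all hypothetical.

```python
from typing import Callable, Sequence


def complete_with_fallback(
    prompt: str,
    models: Sequence[str],
    call_model: Callable[[str, str], str],
) -> tuple[str, str]:
    """Try each model in order; return (model_used, reply) from the first success."""
    errors: list[str] = []
    for model in models:
        try:
            return model, call_model(model, prompt)
        except Exception as exc:  # a real client would catch narrower error types
            errors.append(f"{model}: {exc}")
    raise RuntimeError("all models failed: " + "; ".join(errors))


def flaky_backend(model: str, prompt: str) -> str:
    """Hypothetical stand-in for an HTTP call to a unified chat endpoint."""
    if model == "provider-a/model-x":
        raise TimeoutError("upstream timed out")
    return f"[{model}] echo: {prompt}"


used, reply = complete_with_fallback(
    "hi", ["provider-a/model-x", "provider-b/model-y"], flaky_backend
)
print(used)   # provider-b/model-y
print(reply)  # [provider-b/model-y] echo: hi
```

Passing the backend in as a callable keeps the retry logic independent of any particular HTTP client, so the same loop could wrap a real request function.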

Uncensored Models

Dolphin 2.9.2

  • Provider: Open Source
  • Access: HuggingFace
  • Base Model: Mistral 7B
  • Key Features:
    • Uncensored responses
    • High performance/quality ratio
    • Low hardware requirements
    • Supports multiple contexts
    • Good coding capabilities
  • Deployment:
    • Self-hosted via Ollama
    • Run through various LLM interfaces
  • Technical Details:
    • 7B parameters
    • Context window: 32k tokens
    • GGUF format available
    • Apache 2.0 license
  • Use Cases:
    • Research
    • Development
    • Creative writing
    • Coding assistance

Content Curation

Jina.ai

  • Provider: Jina AI
  • Website: jina.ai
  • Key Features:
    • URL to Markdown conversion
    • World-class embeddings
    • Neural search and reranking
    • Zero-shot classification
    • Text segmentation
    • Multimodal capabilities
  • Products:
    • Reader: Clean URL content for LLMs
    • Embeddings: Multilingual & multimodal
    • Reranker: Search relevancy optimization
    • Classifier: Image and text classification
    • Segmenter: Text chunking & tokenization
  • Use Cases:
    • Enterprise search
    • RAG systems
    • Content processing
    • Document analysis
    • Search optimization
  • Pricing:
    • Free tier available
    • API-based pricing
  • Notable Features:
    • SOC 2 Type 1 & 2 compliant
    • No registration required
    • Simple API integration
    • High-quality content extraction
    • Multiple content formats
    • Streaming support
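
The Reader product's "URL to Markdown conversion" works by prefixing the target address with `https://r.jina.ai/`. The sketch below shows that pattern. The helper names `reader_url` and `fetch_markdown` are made up for this example, and the timeout is an arbitrary assumption. Rate limits and optional API-key headers are not covered here.

```python
import urllib.request

READER_PREFIX = "https://r.jina.ai/"


def reader_url(target_url: str) -> str:
    """Return the Reader address that serves target_url as LLM-friendly text."""
    if not target_url.startswith(("http://", "https://")):
        raise ValueError("target_url must start with http:// or https://")
    return READER_PREFIX + target_url


def fetch_markdown(target_url: str, timeout: float = 30.0) -> str:
    """Fetch the converted page over the network (requires internet access)."""
    with urllib.request.urlopen(reader_url(target_url), timeout=timeout) as resp:
        return resp.read().decode("utf-8")
```

For instance, `reader_url("https://example.com/post")` yields `https://r.jina.ai/https://example.com/post`, which can be opened in a browser or passed to `fetch_markdown`.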