🧠 Models
Last Updated: 2025-01-19
Overview of prominent AI models and tools (language, image, video, and voice) and their capabilities.
LLMs
ChatGPT
- Provider: OpenAI
- Access: chat.openai.com
- Key Features:
- General-purpose conversation
- Code generation
- Text analysis
- Multiple model versions (GPT-3.5, GPT-4)
- API access
- Code interpreter
- Memory
- Image input (e.g., screenshots)
- Image generation
Claude
- Provider: Anthropic
- Access: claude.ai
- Key Features:
- Long context windows
- Accurate reasoning
- Code generation
- Multiple versions (Claude 2, Claude 3)
DeepSeek
- Website: deepseek.com
- Access: DeepSeek Chat
- GitHub: deepseek-ai/DeepSeek-Coder
- Description: Open-source language model
- Key Features:
- Code generation
- DeepSeek R1 model with 128k context window
- Enhanced mathematical reasoning capabilities
- Strong code generation performance
- Available via API and chat interface
Llama 3
- Provider: Meta
- Description: Open-source large language model
- Key Features:
- Multiple model sizes
- Fine-tuning capabilities
- Research and commercial use
- API access
- Can run locally with Ollama
Gemini
- Provider: Google
- Access: gemini.google.com
- Key Features:
- Multimodal capabilities
- Multiple versions (Pro, Ultra)
- API access
NotebookLM
- Provider: Google
- Description: AI-powered note-taking and research assistant
- Key Features:
- Document analysis
- Smart summarization
- Research assistance
- Very large context capacity (up to roughly 25 million words across sources)
- Low hallucination rate
- Less creative output
- Accepts large sets of documents
- Generates audio summaries and podcast-style overviews from source content
Grok
- Provider: xAI
- Website: grok.x.ai
- Key Features:
- Real-time X (Twitter) data access
- Code generation and analysis
- Mathematical reasoning
- Conversational AI
- Unique personality
- Access: Limited beta for X Premium+ subscribers
Image Generation
DALL-E 3
- Provider: OpenAI
- Website: openai.com/dall-e-3
- API: platform.openai.com/docs/guides/images
- Key Features:
- High-resolution outputs
- Text-to-image generation
- Image editing capabilities
- Photorealistic results
- Pricing: openai.com/pricing
- Pay-per-generation
- 1024×1024 (standard quality): $0.040/image
- 1024×1792 (standard quality): $0.080/image
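The pay-per-generation pricing above lends itself to a quick budget estimate. The sketch below is only an illustration: the price table is copied from this list and may be out of date, and `estimate_cost` is a hypothetical helper name, not part of any OpenAI library.

```python
from decimal import Decimal

# Per-image prices in USD, copied from the list above (assumed current; check openai.com/pricing).
PRICE_PER_IMAGE = {
    "1024x1024": Decimal("0.040"),
    "1024x1792": Decimal("0.080"),
}


def estimate_cost(size: str, count: int) -> Decimal:
    """Return the estimated cost in USD for `count` images at the given size."""
    if count < 0:
        raise ValueError("count must be non-negative")
    if size not in PRICE_PER_IMAGE:
        raise KeyError(f"no price listed for size {size!r}")
    return PRICE_PER_IMAGE[size] * count


if __name__ == "__main__":
    # 250 square images plus 100 tall images.
    total = estimate_cost("1024x1024", 250) + estimate_cost("1024x1792", 100)
    print(f"Estimated total: ${total}")
```

`Decimal` is used instead of floats so that per-image prices add up without rounding drift.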
Stable Diffusion
- Provider: Stability AI
- Website: stability.ai
- GitHub: CompVis/stable-diffusion
- API: platform.stability.ai
- Deployment: Cloud & Self-hosted
- Key Features:
- Text-to-image generation
- Image-to-image editing
- Inpainting and outpainting
- Multiple model versions (XL, 2.1, 3)
- ControlNet support
- Custom model training
- Open source
- Pricing:
- Self-hosted: Free
- API Credits: Starting at $10/month
- Enterprise: Custom pricing
- Popular UIs:
- ComfyUI
- Automatic1111
- InvokeAI
- RunwayML
- DreamStudio
- Use Cases:
- Art creation
- Design prototyping
- Content generation
- Game asset creation
- Product visualization
Midjourney
- Provider: Midjourney Inc.
- Website: midjourney.com
- Access: Discord bot interface (no official public API)
- Key Features:
- Artistic style focus
- High-quality outputs
- Style customization
- Community features
- Pricing: midjourney.com/pricing
- Basic: $10/month
- Standard: $30/month
- Pro: $60/month
Leonardo.ai
- Provider: Leonardo AI
- Website: leonardo.ai
- API: docs.leonardo.ai/reference
- Key Features:
- Custom model training
- Batch generation
- Asset library
- Commercial rights
- Pricing: leonardo.ai/pricing
- Free: Limited generations
- Pro: $10/month
- Business: Custom
Recraft.ai
- Provider: Recraft
- Website: recraft.ai
- API: docs.recraft.ai/reference
- Key Features:
- Vector graphics generation
- Icon creation
- Brand asset generation
- SVG output format
- Pricing:
- Free tier available
- Pro: $20/month
- Team: $49/month
- Unique Features:
- SVG-first approach
- Design system integration
- Scalable graphics
- Brand consistency tools
Video Generation
Runway Gen-2
- Provider: Runway
- Website: runway.ml
- API: docs.runway.ml/reference
- Key Features:
- Text-to-video generation
- Video editing
- Motion tracking
- Green screen
- Pricing:
- Pro: $15/month
- Unlimited: $35/month
D-ID
- Provider: D-ID
- Website: d-id.com
- API: docs.d-id.com
- Key Features:
- Digital human creation
- Text-to-video
- Voice synthesis
- Avatar customization
- Pricing:
- Creator: $5.99/month
- Pro: $24.99/month
- Enterprise: Custom
Synthesia
- Provider: Synthesia
- Website: synthesia.io
- API: docs.synthesia.io/reference
- Key Features:
- AI avatar videos
- Multi-language support
- Custom avatar creation
- Template library
- Pricing:
- Personal: $29/month
- Business: Custom
- Enterprise: Custom
Voice Generation
Bland.ai
- Provider: Bland
- Website: bland.ai
- Key Features:
- Real-time voice AI calls
- Natural conversation flow
- Custom voice cloning
- API integration
- Call analytics
- Pricing: bland.ai/pricing
- Pay-per-minute model
- Volume discounts available
- Custom enterprise plans
ElevenLabs
- Provider: ElevenLabs
- Website: elevenlabs.io
- Key Features:
- Text-to-speech synthesis
- Voice cloning
- Custom voice creation
- API access
- Pricing: elevenlabs.io/pricing
- Pay-per-minute model
- Volume discounts available
- Custom enterprise plans
OpenAI Realtime API (GPT-4o)
- Provider: OpenAI
- Website: openai.com
- Key Features:
- Voice-to-voice
- API access
- Real-time voice chat
- Pricing: openai.com/pricing
- Usage-based pricing (billed per token, including audio tokens)
- Volume discounts available
- Custom enterprise plans
VAPI
- Provider: VAPI
- Website: vapi.ai
- Key Features:
- Voice-to-voice
- API access
- Real-time voice chat
- Pricing: vapi.ai/pricing
- Pay-per-minute model
- Volume discounts available
- Custom enterprise plans
Tools
Ollama
- Provider: Ollama
- Website: ollama.ai
- GitHub: github.com/ollama/ollama
- Deployment: Self-hosted
- Key Features:
- Run LLMs locally
- Easy model management
- Multiple model support
- API access
- Cross-platform
- Docker support
- Supported Models:
- Llama 3
- Mistral
- CodeLlama
- Gemma
- And many more
- Pricing: Free and open source
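As a rough illustration of the "API access" feature, the sketch below builds a request for Ollama's local REST endpoint using only the standard library. It assumes Ollama is running on its default port (11434) and that a model tagged `llama3` has already been pulled; the helper names `build_generate_request` and `generate` are made up for this example.

```python
import json
import urllib.request

# Default address of a locally running Ollama server (assumption: default port, no auth).
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_generate_request(prompt: str, model: str = "llama3") -> urllib.request.Request:
    """Build (but do not send) a non-streaming generate request."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, model: str = "llama3") -> str:
    """Send the request and return the generated text (requires a running server)."""
    with urllib.request.urlopen(build_generate_request(prompt, model)) as resp:
        return json.loads(resp.read())["response"]


if __name__ == "__main__":
    print(generate("Explain what a context window is in one sentence."))
```

Keeping request construction separate from sending makes the payload easy to inspect or test without a server running.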
HuggingFace
- Provider: Hugging Face Inc.
- Website: huggingface.co
- GitHub: github.com/huggingface
- Deployment: Cloud & Self-hosted
- Key Features:
- Model Hub with 300k+ models
- Datasets repository
- Spaces for demos
- AutoTrain for fine-tuning
- Inference API
- Enterprise deployment
- Transformers library
- Popular Models:
- BERT
- T5
- GPT-2
- Stable Diffusion
- Whisper
- CodeLlama
- Pricing:
- Open Source: Free
- Pro: $9/month
- Enterprise: Custom pricing
- Use Cases:
- Model discovery and sharing
- Model training and fine-tuning
- Dataset management
- MLOps and deployment
- Research and experimentation
- Production inference
OpenRouter
- Provider: OpenRouter
- Website: openrouter.ai
- GitHub: github.com/OpenRouterTeam/openrouter
- Key Features:
- Single API for multiple LLMs
- Pay-as-you-go pricing
- Access to 50+ models
- Unified prompt format
- Load balancing
- Fallback routing
- Supported Models:
- Claude 3 (Anthropic)
- GPT-4
- Gemini Pro
- Mistral
- And many more
- Pricing:
- Pay per token
- No subscription required
- Volume discounts available
- Use Cases:
- Model comparison
- Production deployment
- Cost optimization
- Multi-model applications
- API standardization
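The fallback routing idea listed above can be sketched on the client side as follows. This is not OpenRouter's implementation (the service handles routing itself); it is a minimal illustration of the pattern. `call_model` is a hypothetical callable standing in for whatever client sends a prompt to a given model, and the model names are placeholders.

```python
from typing import Callable, Sequence, Tuple


def complete_with_fallback(
    prompt: str,
    models: Sequence[str],
    call_model: Callable[[str, str], str],
) -> Tuple[str, str]:
    """Try each model in order; return (model_used, reply) from the first that succeeds."""
    errors = []
    for model in models:
        try:
            return model, call_model(model, prompt)
        except Exception as exc:  # any failure moves on to the next model
            errors.append(f"{model}: {exc}")
    raise RuntimeError("all models failed: " + "; ".join(errors))


if __name__ == "__main__":
    # Stub client for demonstration only: the first model always fails.
    def stub_client(model: str, prompt: str) -> str:
        if model == "primary-model":
            raise TimeoutError("simulated timeout")
        return f"[{model}] reply to: {prompt}"

    used, reply = complete_with_fallback(
        "Hello", ["primary-model", "backup-model"], stub_client
    )
    print(used, reply)
```

Passing the client in as a callable keeps the routing logic independent of any particular provider SDK.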
Uncensored Models
Dolphin 2.9.2
- Provider: Open Source
- Access: HuggingFace
- Base Model: Mistral 7B
- Key Features:
- Uncensored responses
- High performance/quality ratio
- Low hardware requirements
- Supports multiple contexts
- Good coding capabilities
- Deployment:
- Self-hosted via Ollama
- Run through various LLM interfaces
- Technical Details:
- 7B parameters
- Context window: 32k tokens
- GGUF format available
- Apache 2.0 license
- Use Cases:
- Research
- Development
- Creative writing
- Coding assistance
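The "low hardware requirements" point can be made concrete with back-of-the-envelope arithmetic: weight memory is roughly the parameter count times the bytes per weight. The sketch below is an approximation only. It ignores the KV cache, activations, and runtime overhead, and real GGUF quantization formats do not use exactly 4 or 8 bits per weight. `weight_memory_gb` is a hypothetical helper name.

```python
def weight_memory_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in gigabytes (1 GB = 1e9 bytes)."""
    if num_params < 0 or bits_per_weight <= 0:
        raise ValueError("num_params must be >= 0 and bits_per_weight must be > 0")
    return num_params * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    params = 7e9  # 7B parameters, as listed above
    for bits in (16, 8, 4):
        print(f"{bits}-bit weights: ~{weight_memory_gb(params, bits):.1f} GB")
```

By this estimate a 7B model needs about 14 GB for 16-bit weights and about 3.5 GB at 4 bits, which is why quantized builds fit on consumer hardware.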
Content Curation
Jina.ai
- Provider: Jina AI
- Website: jina.ai
- Key Features:
- URL to Markdown conversion
- Multilingual and multimodal embeddings
- Neural search and reranking
- Zero-shot classification
- Text segmentation
- Multimodal capabilities
- Products:
- Reader: Clean URL content for LLMs
- Embeddings: Multilingual & multimodal
- Reranker: Search relevancy optimization
- Classifier: Image and text classification
- Segmenter: Text chunking & tokenization
- Use Cases:
- Enterprise search
- RAG systems
- Content processing
- Document analysis
- Search optimization
- Pricing:
- Free tier available
- API-based pricing
- Notable Features:
- SOC 2 Type 1 & 2 compliant
- No registration required
- Simple API integration
- High-quality content extraction
- Multiple content formats
- Streaming support
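The URL-to-Markdown conversion offered by Reader works by prefixing the target URL with the Reader endpoint. The sketch below assumes the public `https://r.jina.ai/` prefix and unauthenticated access; rate limits, headers, and response details may differ from what is shown, and the helper names are invented for this example.

```python
import urllib.request

# Reader endpoint prefix (assumption: public, keyless access is sufficient for light use).
READER_PREFIX = "https://r.jina.ai/"


def reader_url(target_url: str) -> str:
    """Return the Reader URL that serves `target_url` as LLM-friendly text."""
    if not target_url.startswith(("http://", "https://")):
        raise ValueError("target_url must be an absolute http(s) URL")
    return READER_PREFIX + target_url


def fetch_markdown(target_url: str, timeout: float = 30.0) -> str:
    """Fetch the converted content (performs a network request)."""
    with urllib.request.urlopen(reader_url(target_url), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


if __name__ == "__main__":
    print(fetch_markdown("https://example.com")[:500])
```

The returned text can then be passed to an LLM or chunked for a RAG pipeline.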