AI Hub

Chat with AI

Model

Start a conversation...

XTTS-v2 — Multi-language voice synthesis

Text to speak

Language

Generating speech...

Faster-Whisper Large-v3 — Accurate speech recognition

Record Audio

Click to start recording

Or Upload Audio File

🎤

Drop audio file here or click to browse

Transcribing audio...

YOLOv11x — Real-time object detection

📷

Drop an image here or click to browse

Confidence Threshold 0.25

Detecting objects...

Florence-2 — Image captioning, OCR, and more

👁

Drop an image here or click to browse

Task

Caption

Detailed Caption

Very Detailed

OCR

Object Detection

Region Captions

Analyzing image...

FLUX.1 Dev — via ComfyUI (must be running on port 8188)

Prompt

Width

Height

Steps

Generating image... (this may take 30-60s)

🎨