83 Resources

MinerU: 70.6K Stars — Convert Any Document to LLM-Ready Markdown

MinerU (70,600+ GitHub stars) transforms PDF, DOCX, PPTX, XLSX, images and web pages into structured Markdown and JSON for LLM, RAG and Agent workflows. Supports 109-language OCR, formula-to-LaTeX, table-to-HTML, and runs on CPU or GPU.

Details

WorldMonitor: Real-Time Global Intelligence Dashboard for

A real-time AI-powered global intelligence dashboard aggregating news, geopolitical events, and infrastructure tracking. 59K stars. Open-source alternative to Palantir Gotham.

Details

VoiceBox: The Open-Source AI Voice Studio for Cloning, Dictation

A full-stack open-source AI voice studio that lets you clone any voice, generate speech, and dictate into any app. 33K stars. Runs locally on your machine with CUDA or Apple Silicon support.

Details

OpenMontage Review: The World's First Open-Source Agentic Video

OpenMontage (8.3K+ GitHub stars) is the world's first open-source, agentic video production system. 12 production pipelines, 52 tools, 500+ agent skills. Turn any AI coding assistant into a full video studio — from animated explainers to cinematic trailers to real-footage documentaries. Zero API keys needed for basic output.

  • Python 3.10+
  • AGPL-3.0
Details

Understand-Anything: Interactive Knowledge Graphs for Codebases

Understand-Anything turns any codebase into an interactive knowledge graph you can explore, search, and query. Works with Claude Code, Codex, Cursor, Copilot, Gemini CLI. 60,339 GitHub stars.

Details

Taste Skill: Stop AI From Generating Generic Slop

Taste Skill is a portable agent skill framework that upgrades AI-built interfaces with stronger layout, typography, motion, and spacing. Works with Codex, Cursor, Claude Code, and ChatGPT Images.

Details

Oh My Pi: Turn Any Raspberry Pi Into a Smart Device

Oh My Pi (12,554 stars) transforms Raspberry Pi devices into smart home hubs, media centers, and development workstations with one-click setup and automated configuration.

Details

MarkItDown: Universal File-to-Markdown Converter

MarkItDown by Microsoft AutoGen team converts 20+ file types to Markdown for LLM consumption. pip install markitdown[all], Python API, LangChain integration, RAG pipelines, and batch processing.

Details

Knowledge Work Plugins

Knowledge Work Plugins (20,728 stars) by Anthropic extends Claude with powerful tools for document editing, code analysis, web browsing, and file operations. Build custom plugins for your workflow.

Details

Apple's Container: Docker-Like Experience on Mac with 37K Stars

Apple released container, a Swift-based tool for running Linux containers on Mac using lightweight VMs. 37K stars, OCI-compatible, macOS 26 required.

Details

AI Engineering From Scratch: Build Production LLM Systems

AI Engineering From Scratch (32,771 stars) is a comprehensive curriculum covering LLM fine-tuning, RAG, agent frameworks, and production deployment. Learn to build, ship, and scale AI systems.

Details

Academic Research Skills: Automate Literature Reviews with AI

Academic Research Skills (31,628 stars) automates the research pipeline: search papers, extract insights, synthesize findings, and write literature reviews. Built for Claude Code with modular skill architecture.

Details

NVIDIA Cosmos: Open-Source World Models for Physical AI (10K

NVIDIA Cosmos is an open platform of world models, datasets, and tools for building Physical AI — robots, autonomous vehicles, smart infrastructure. Cosmos 3 uses Mixture-of-Transformers for unified language, image, video, audio, and action generation. 16B and 64B models available.

Details

Impeccable: The Design Language That Makes AI-Generated UIs

Impeccable (37K stars) is a design language for AI coding agents with 23 commands, 41 detector rules, and live browser iteration. Fixes AI-generated UI slop with deterministic design quality checks. Compatible with Claude Code, Cursor, and Codex.

Details

RuView: WiFi Spatial Intelligence for Smart Buildings

Learn how to use RuView, the Python-based WiFi spatial intelligence platform that tracks real-time positions, maps building layouts, and optimizes WiFi mesh networks. Step-by-step pip install guide, real-time tracking, and mesh network configuration.

Details

PaddleOCR: The 81K-Star Open-Source OCR Engine That Outperforms

PaddleOCR is a multi-language open-source OCR toolkit with 96.3%+ accuracy for text detection and recognition. Supports 80+ languages, document AI, table recognition, and layout analysis. 81K+ GitHub stars. Includes setup guide, benchmarks, and production deployment.

Details

Open-LLM-VTuber: Voice-Powered LLM Chat with Live2D Characters

Open-LLM-VTuber is an open-source AI avatar platform with voice interaction, Live2D characters, and hands-free voice interruption. Works with any LLM — local or cloud. Zero setup, cross-platform. Includes quick start guide, full integration list, and production deployment options.

Details

ByteDance UI-TARS Desktop

Learn how to deploy ByteDance's UI-TARS Desktop, a vision-language AI agent that sees your screen and controls applications through natural language. Step-by-step installation, real-world benchmarks, and comparisons with alternatives.

Details

TurboVec: Rust-Powered Vector Index

TurboVec (RyanCodrai/turbovec) is a vector index built on Google Research's TurboQuant algorithm, written in Rust with Python bindings. Drop-in replacements for LangChain, LlamaIndex, Haystack, and Agno. Delivers high-performance vector search with 4-bit quantization and SIMD acceleration. Covers Python integration, benchmarks, and production deployment.

Details

Odysseus: Self-Hosted AI Workspace with 10+ Built-in Tools

Odysseus (69,110 GitHub stars) is a self-hosted AI workspace combining chat, agent automation, deep research, document editing, email triage, calendar, and more. Supports vLLM, llama.cpp, Ollama, OpenRouter, OpenAI, and GitHub Copilot. Docker and native Linux/macOS installs available.

Details

Odysseus: The Self-Hosted AI Workspace That Hit 63

Odysseus is an open-source, privacy-first AI workspace (63 k GitHub stars in 9 days, MIT). One Docker command gives you chat, agents, deep research, email triage, calendar, notes, and a model cookbook — all on your own hardware. This guide covers installation, key features, and how it stacks up against ChatGPT Plus and Claude.ai.

  • MIT
Details

nanochat: Karpathy's $100 ChatGPT — Build Your Own AI Chat App on a Single GPU

nanochat (54,800 GitHub stars) is Andrej Karpathy's open-source ChatGPT clone that runs on a single $100 GPU. Train from scratch using SGLang or run pre-trained via vLLM. Includes setup guide, training benchmarks, and deployment examples.

Details

MoneyPrinterTurbo: Generate HD Short Videos with AI in One

MoneyPrinterTurbo (83,031 GitHub stars) generates HD short videos with one click using AI LLM. Script, voice, subtitles, background music — all automated. Includes setup tutorial, pipeline breakdown, and real video benchmarks.

Details

ComfyUI Workflow 2026

ComfyUI hit 106K GitHub stars in 2026. Beginner-friendly setup guide, model recommendations for 2026, and 5 production-ready workflow templates (text-to-image, inpaint, upscale, video, character consistency).

  • ComfyUI
  • GPL-3.0
Details

ViMax Review: Agentic Multi-Scene Video Generation from HKUDS

ViMax (7.1K+ GitHub stars) from Hong Kong University Data Science Lab is the first widely-adopted open-source agentic video generation framework. Instead of one-shot prompt-to-video like Sora or Runway, it orchestrates four AI roles — Director, Screenwriter, Producer, Video Generator — to produce long-form multi-scene videos from a single idea. Full breakdown of the agentic pipeline, supported backends (Gemini Flash, MiniMax, Google Veo), install steps, idea-to-video and script-to-video workflows, and honest comparison with Sora, OpenSora, Runway.

  • MIT
Details

Supertonic Review: 99M-Parameter On-Device TTS in 31 Languages

Supertonic (9.9K+ GitHub stars) by Supertone Inc. is a lightning-fast multilingual text-to-speech model that runs locally on CPU via ONNX Runtime — no cloud, no API, no GPU required. 99M parameters, 31 languages including Korean/Japanese/Vietnamese/Chinese, 44.1kHz studio audio, 10 expression tags, runtimes for Python, Node.js, browser (WebGPU/WASM), iOS, Android, Rust, Flutter. Full feature breakdown, install, code example, and 2026 on-device TTS landscape comparison.

  • MIT (code) / OpenRAIL-M (model)
Details

Stable Diffusion WebUI 2026 (AUTOMATIC1111)

AUTOMATIC1111 stable-diffusion-webui is the 163k-star de-facto standard self-hosted UI for SD/SDXL image generation. Complete 2026 install + production guide covering txt2img / img2img / inpainting / outpainting / LoRA / ControlNet, hardware requirements, alternatives (Forge, SD.Next).

  • Python
  • AGPL-3.0
Details

ComfyUI 2026: 114k-Star Node-Based AI Image/Video/Audio Workflow

ComfyUI is the 114k-star node-based visual workflow engine for SD/SDXL/Flux/Wan/Hunyuan and more. Supports image, video, audio, and 3D generation. Complete 2026 install guide covering node basics, workflow JSON import, ComfyUI Manager, and when ComfyUI beats AUTOMATIC1111.

  • Python
  • GPL-3.0
Details

ChatTTS 2026: 39.3k-Star Open-Source Dialogue TTS with Laughter

ChatTTS is the open-source TTS purpose-built for dialogue (not narration). 39.3k GitHub stars, 4 GB VRAM minimum, RTF 0.3 on RTX 4090, fine-grained prosodic control including laughter and pauses. Complete 2026 install + production setup guide.

  • Python
  • AGPL-3.0
Details

Mistral AI 2026: Deploy Production-Grade Local LLMs with 8x7B

Deploy production-grade Mistral LLMs locally — 8x7B MoE architecture, mistral-inference, vLLM serving, GGUF quantization, function calling, and fine-tuning. Complete guide.

  • Apache-2.0
Details

WhisperX: 22K+ Stars — Production ASR Setup Guide 2026

WhisperX is an open-source ASR toolkit with word-level timestamps and speaker diarization. Compatible with faster-whisper, pyannote.audio, and OpenAI Whisper models. Covers Docker deployment, Python API, benchmarks, and production hardening.

  • BSD-2-Clause
Details

Wan 2.1: 16.1K+ Stars

Wan 2.1 is an open suite of video foundation models by Alibaba with SOTA performance. Supports ComfyUI, Diffusers, and Gradio. Covers T2V, I2V, video editing, and text generation with 1.3B and 14B parameter variants.

  • Apache-2.0
Details

VoiceCraft: 8.5K+ Stars

VoiceCraft is a token infilling neural codec language model for zero-shot speech editing and TTS. Compatible with GPT-SoVITS, Coqui TTS, and RVC. Covers setup, benchmarks, Docker deployment, and comparison tables.

  • MIT
Details

VideoReTalking: 7.2K+ Stars

VideoReTalking (VRT) is an audio-based lip synchronization system for talking head video editing. Compatible with RVC, GPT-SoVITS, and Coqui TTS. Covers installation, inference, Gradio WebUI, production deployment, and benchmarks vs Wav2Lip and SadTalker.

  • Apache-2.0
Details

Ultimate Vocal Remover: 24.7K+ Stars — Complete Setup Guide 2026

Ultimate Vocal Remover (UVR) is a GUI application for vocal removal using deep neural networks. Compatible with demucs, RVC, GPT-SoVITS. Covers Windows, macOS, Linux installation, model selection, batch processing, and production hardening.

  • MIT
Details

RVC: Deploy AI Voice Conversion with 35K+ Stars

RVC (Retrieval-based Voice Conversion) is a VITS-based voice conversion framework compatible with GPT-SoVITS, Coqui TTS, and demucs. This tutorial covers Docker deployment, training pipelines, API integration, and production hardening.

  • MIT
Details

OpenAI Whisper: 99.8K+ Stars

OpenAI Whisper (ASR) robust speech recognition via large-scale weak supervision. Compatible with WhisperX, faster-whisper, LibreTranslate. Covers whisper tutorial, whisper vs whisperx, speech recognition setup, whisper python, whisper docker.

  • MIT
Details

Open-Sora: 29K+ Stars

Open-Sora is an open-source video generation framework with 29K+ GitHub stars. Covers Docker setup, ComfyUI integration, Stable Diffusion compatibility, production deployment, benchmarks vs HunyuanVideo, CogVideo, and Wan.

  • Apache-2.0
Details

MeloTTS: 7.4K+ Stars — Multi-Lingual TTS Benchmark vs Coqui TTS

MeloTTS is a high-quality multi-lingual text-to-speech library with 7.4K+ stars. Compare benchmarks with Coqui TTS, ChatTTS, and Bark. Covers Python setup, Docker deployment, real-time inference, and production hardening.

  • MIT
Details

Lobe Chat: The Open-Source ChatGPT UI Alternative with 20+ LLM

Deploy Lobe Chat as your self-hosted ChatGPT alternative. Supports 20+ LLM providers, plugin system, PWA, multi-language UI. Complete Docker setup guide with benchmarks and comparisons.

  • MIT
Details

LibreTranslate: Self-Hosted Translation API with 14.4K+ Stars

LibreTranslate (LT) is a free, open-source machine translation API powered by Argos Translate. Supports Docker, CUDA GPU, 30+ languages, and offline deployment. Covers setup, benchmarks, monitoring, and integration with OpenAI Whisper, Coqui TTS, and Argos Translate.

  • AGPL-3.0
Details

InvokeAI: 27.2K+ Stars — Complete Setup Guide for 2026

InvokeAI (Invoke) is the leading creative engine for Stable Diffusion models with an industry-leading WebUI. Compatible with SD 1.5, SDXL, FLUX, and ControlNet. Covers Docker install, workflow setup, benchmarks vs AUTOMATIC1111 and ComfyUI, and production hardening.

  • Apache-2.0
Details

HunyuanVideo: 12.1K+ Stars — Production Deployment Guide 2026

HunyuanVideo (HYV) is an open-source video generation framework by Tencent with 13B parameters. Supports ComfyUI, Diffusers, Gradio API. Covers Docker setup, FP8 quantization, multi-GPU inference, and production hardening.

  • Apache-2.0
Details

GPT-SoVITS: 57.5K+ Stars

GPT-SoVITS (GSV) is a few-shot voice cloning and TTS tool with zero-shot capabilities. Supports ComfyUI, RVC, and MeloTTS integration. Covers Docker deployment, voice training, API setup, and production hardening.

  • MIT
Details

faster-whisper: 4x Faster Speech-to-Text with 23K+ Stars

faster-whisper (SYSTRAN) reimplements OpenAI Whisper via CTranslate2 for 4x speedup. Covers faster whisper tutorial, benchmark data, Docker setup, Python API, VAD filter, batch processing, and production hardening with WhisperX and whisper.cpp integration.

  • MIT
Details

Demucs: Music Source Separation with 10K+ Stars

Demucs is a hybrid spectrogram and waveform source separation model by Meta AI. Compatible with Ultimate Vocal Remover, RVC, GPT-SoVITS. Covers demucs tutorial, demucs vs uvr, demucs docker setup, and production benchmarks.

  • MIT
Details

Coqui TTS: 45.3K+ Stars

Coqui TTS is an open-source deep learning toolkit for Text-to-Speech. Supports 1100+ languages, XTTS v2 voice cloning, VITS end-to-end synthesis. Benchmarks against ChatTTS, MeloTTS, Bark with real RTF numbers, Docker deployment, and production configs.

  • MPL-2.0
Details

CogVideo: 12.7K Stars — Complete Text-to-Video Setup Guide 2026

CogVideo (CogVideoX) is a text and image-to-video generation model from Zhipu AI. Supports ComfyUI, Diffusers, SAT, and Wan/HunyuanVideo/Open-Sora integration. Covers installation, Docker, inference, fine-tuning, and benchmarks.

  • Apache-2.0
Details

Baetyl: The Cloud-Native Edge AI Computing Platform Deploying

Deploy Baetyl v2.4 to bring Kubernetes-native edge computing to IoT devices. AI model inference, MQTT/BACnet support, OTA updates, K3s runtime, and cloud-edge synchronization.

  • Apache-2.0
Details

Best AI Writing Assistants 2025: Jasper, Copy.ai

Compare the best AI writing assistants of 2025: Jasper, Copy.ai, Writesonic, ChatGPT, Claude, and Notion AI. Find the right tool for your content needs with pricing and features.

  • MIT
Details

Best AI Voice Tools 2025

Compare the best AI voice tools of 2025 for text-to-speech and transcription. ElevenLabs, Murf.ai, Whisper, Otter.ai, and more with pricing, accuracy, and use cases.

  • MIT
Details

Best AI Video Generation Tools 2025: Sora, Runway

Compare the best AI video generation tools of 2025: OpenAI Sora, Runway Gen-3, Pika 2.0, Kling AI, HeyGen, and Luma Dream Machine. Features, pricing, and quality compared.

  • MIT
Details

Best AI Translation Tools 2025

Compare the best AI translation tools of 2025 — Google Translate, DeepL, ChatGPT, Microsoft Translator, Smartcat, and Reverso. See quality, pricing, and language coverage side by side.

  • MIT
Details

Best AI Presentation Tools 2025: Gamma, Beautiful.ai

Compare the best AI presentation tools of 2025. In-depth reviews of Gamma, Beautiful.ai, Tome, SlidesAI, Canva Magic Design, and Microsoft Copilot for PowerPoint with features, pricing, and use cases.

  • MIT
Details

Best AI Meeting Assistant Tools 2025: Otter.ai, Fireflies

Compare the best AI meeting assistant tools of 2025. In-depth reviews of Otter.ai, Fireflies.ai, Fathom, Notion AI, Microsoft Copilot for Teams, and Avoma with transcription accuracy, integrations, and pricing.

  • MIT
Details

Best AI Developer Tools & IDE Plugins 2025

Discover the best AI developer tools and IDE plugins of 2025 — GitHub Copilot, Cursor, Sourcegraph Cody, Tabnine, Codeium, and more. Compare features, pricing, and IDE support.

  • MIT
Details

Best AI Data Analysis Tools 2025: ChatGPT, Julius

Discover the best AI data analysis tools of 2025 — ChatGPT Advanced Data Analysis, Julius AI, Tableau Einstein, Copilot in Excel, and more. Compare features, pricing, and use cases.

  • MIT
Details

Best AI Customer Service Chatbot Tools 2025: Intercom

Compare the top AI customer service chatbot platforms in 2025 — Intercom Fin, Zendesk AI, Freshworks Freddy, ChatGPT Enterprise, Drift, and Tidio Lyro. See pricing, features, and ROI data.

  • MIT
Details

Best AI Content Detector Tools 2025: GPTZero, Turnitin AI

Compare the best AI content detector tools of 2025. In-depth analysis of GPTZero, Turnitin AI, Copyleaks, Originality.ai, and more with accuracy tests, pricing, and use case recommendations.

  • MIT
Details

Best AI Code Generators 2025

Compare the best AI code generators of 2025: GitHub Copilot, Cursor, Tabnine, Amazon CodeWhisperer, and more. Features, pricing, and use cases explained.

  • MIT
Details

AI Search Tools Compared

Compare the top AI search engines of 2025 — Perplexity, Google Gemini, ChatGPT Search, Copilot, and more. See accuracy, speed, and source coverage side by side.

  • MIT
Details

AI Image Generation Tools: Complete Guide to Midjourney, DALL-E

Complete guide to AI image generation tools in 2025. Compare Midjourney v7, DALL-E 3, Stable Diffusion 3.5, Adobe Firefly, FLUX, and Leonardo.ai with features and pricing.

  • MIT
Details

WiFi-Forge — A Safe, Legal Sandbox for Learning WiFi Hacking

WiFi Forge: safe WiFi hacking lab for security research. Learn penetration testing, wireless security and ethical hacking in a controlled environment.

  • Python
  • MIT
Details

Why Did the Classic 'Roop' Die?

FaceFusion replaces Roop: modular ONNX pipeline, multi-threaded rendering, cross-platform GPU acceleration — the open-source AI video face-swap engine.

  • C++
  • MIT
Details

Toprank: Open-Source Claude Code Skills That Automate SEO, GEO

Toprank is a trending open-source Claude Code skills suite for SEO audits, GEO optimization, Google Ads management, and Meta Ads automation. Install once, get automatic updates, and let AI handle your marketing stack.

  • Go
  • MIT
Details

Top 15 Product Hunt Alternatives to Launch Your Startup in 2026

Discover the best Product Hunt alternatives for launching your startup in 2026. Compare 15 platforms by audience, cost, SEO value, and launch strategy. Find the perfect platform for developers, founders, and indie hackers.

  • Go
  • MIT
Details

Terax AI: The Lightweight AI Terminal Emulator That Understands

Discover Terax AI, a 7 MB AI-native terminal emulator built on Tauri 2 + Rust. Features natural language commands, inline AI assistance, smart autocomplete, and cross-shell support for bash, zsh, fish, and PowerShell.

  • Docker
  • MIT
Details

TabPFN: Foundation Model for Tabular Data — AI

Discover TabPFN, the foundation model for tabular data that outperforms traditional ML methods. No hyperparameter tuning needed, works in seconds.

  • Go
  • Prior Labs License (Apache 2.0 based)
Details

Reading EXPLAIN ANALYZE in Postgres Without Getting Lost

Reading EXPLAIN ANALYZE in PostgreSQL without getting lost. Learn to interpret query plans, identify bottlenecks and optimize database performance.

  • Python
  • MIT
Details

Python Context Managers: The Three Cases You Actually Need

Python context managers: the three cases you actually need. Master with statements, contextlib and custom context managers for better resource management.

  • Go
  • MIT
Details

Pixelle-Video Review: AI Auto Short Video Generator

Pixelle-Video is an AI-powered automatic short video engine. Input a topic and get a complete video with script, AI images, voiceover, and BGM.

  • Go
  • MIT
Details

ML Systems Book: MIT Press Textbook on Machine Learning Systems

The ML Systems Book is an MIT Press textbook covering distributed training, model serving, hardware acceleration, and ML infrastructure. Essential reading for ML engineers.

  • Go
  • MIT
Details

Midjourney Alternative (2026): Why ComfyUI is the Free

Midjourney Alternative (2026): Why ComfyUI is the Free, Open-Source Standard

  • Python
  • MIT
Details

MemPalace vs Mem0: 96.6% Recall Benchmark & Best AI Memory

MemPalace is the best-benchmarked open-source AI memory system. Learn how to install it, mine your project history, and retrieve context with semantic search so your AI assistant never forgets again.

  • Go
  • MIT
Details

Ladybird: Truly Independent Web Browser

Discover Ladybird, the truly independent web browser built from scratch. No Chrome dependencies, no corporate influence, pure open source.

  • C++
  • MIT
Details

JustHireMe: AI That Automates Your Job Hunt

A review of JustHireMe, the open-source AI job-search workbench. A local-first job-intelligence system that auto-scrapes listings, scores AI match quality, and generates tailored resumes and cover letters.

  • JavaScript
  • MIT
Details

HowToCook: 297 Recipes for Programmers - The Open Source Cookbook

Discover HowToCook - an open-source cookbook with 297 recipes designed for programmers. Clear, precise cooking instructions like code.

  • Docker
  • MIT
Details

DocuSeal Review: Cut Document Signing Costs by 90%

DocuSeal is a 15.7k-star open-source platform that replaces DocuSign with self-hosted digital document signing, PDF form building, and white-label eSignature workflows.

  • Docker
  • MIT
Details

Cut Claude Code Token Usage by 65% With Caveman

Learn how Caveman, a Claude Code skill with 57K GitHub stars, reduces token usage by 65% without losing quality. Includes installation, usage, real benchmarks, and code examples.

  • Docker
  • MIT
Details

Claude Code Session Memory

Claude Code Session Memory: How to Integrate MemPalace for 96.6% Recall (2026 Guide)

  • Go
  • MIT
Details

Bitcoin-Classic (BTCC)

Bitcoin-Classic (BTCC) is a decentralized digital currency rebuilt from Bitcoin Core v28.1. It supports CPU mining with a built-in graphical miner, letting ordinary users experience early Bitcoin mining.

  • C++
  • MIT
Details

Best Open Source Alternative to Buffer (2026)

Best Open Source Alternative to Buffer (2026): AiToEarn vs Hootsuite Comparison

  • Docker
  • MIT
Details

AiWind: A 1000+ AI Art Prompt Library for Stunning Output from

AiWind is a free AI prompt library with 1000+ professional prompts tuned for GPT-Image 2, Nanobanana, Stable Diffusion, Midjourney, and other mainstream models, covering realistic portraits, cyberpunk, 3D rendering, and many more styles.

  • Go
  • MIT
Details