korchasa
MCP Serverkorchasapublic

awesome ai agents

This curated list focuses on tools and frameworks for building AI agents

Repository Info

9
Stars
3
Forks
9
Watchers
2
Issues
HTML
Language
Creative Commons Zero v1.0 Universal
License

About This Server

This curated list focuses on tools and frameworks for building AI agents

Model Context Protocol (MCP) - This server can be integrated with AI applications to provide additional context and capabilities, enabling enhanced AI interactions and functionality.

Documentation

Awesome AI Agents

Note: This repository contains an automatically compiled list of frameworks, tools, and resources related to AI agents, LLMs, and autonomous systems.

This curated list focuses on tools and frameworks for building AI agents, including:

  • Multi-agent systems and autonomous frameworks
  • Development tools and SDKs
  • Specialized agents for various domains
  • Research papers and educational resources

The projects are automatically categorized and regularly updated. Each entry includes a brief summary, tags, and GitHub stars to help you find the most relevant tools for your needs.

Pull requests with additional tools and projects are welcome!

Table of Contents:

  • AI Agent Frameworks & SDKs

    • Multi-Agent Collaboration Systems

      • Multi-Agent Frameworks (8)

      • Knowledge Graph Orchestration (10)

      • AI Agent Ecosystems (9)

      • LLM-Powered Platforms (11)

      • Collaborative Swarm Systems (9)

      • Agent Monitoring Tools (3)

    • Orchestration Frameworks

      • Generative Software Frameworks (13)

      • AI Workflow Orchestrators (11)

      • Language Agent Frameworks (9)

    • Cognitive Architecture Frameworks

      • End-to-End Optimizers (8)

      • Continual Learning Agents (2)

  • Autonomous Research & Content Generation

    • Prompt Libraries (15)

    • Reasoning Frameworks (8)

    • Programming Languages (9)

    • AI Assistants (8)

  • Agent Integration & Deployment Tools

    • Stateful Serverless Frameworks (8)

    • AI Agent Development (17)

    • AI Agent Orchestration (10)

    • LLM Framework Tools (9)

    • AI Agent Deployment (11)

    • AI Application Development (10)

    • AI Developer Toolkit (11)

    • AI Agent Operating System (12)

    • AI Agent Gateway (11)

    • MCP Server Framework (8)

  • Multimodal AI & Vision Agents

    • Image Processing & Analysis Agents (5)

    • Video Processing Agents (3)

  • Task Automation & Workflow Orchestration

    • Job Automation Tools (12)

    • Inventory Automation Agents (7)

    • Workflow Automation Assistants (7)

    • AI Platform Configurators (4)

  • Developer & Coding Assistants

    • IDE Integrations (7)

    • Code Generation & Refactoring

      • Code Search Engines (8)

      • AI Developer Tools (13)

      • Smart Contract Platforms (8)

    • Documentation & Testing Assistants (5)

  • GUI & Computer Control AI Agents

    • Desktop Automation (11)

    • Browser & Web Automation

      • Browser Automation Tools (5)

      • Web Agents (3)

  • Data Processing & ETL Agents

    • NL AI Frameworks (12)

    • Document Analytics Platforms (2)

  • Domain-Specific AI Agents

    • Financial & Trading Systems (1)

    • Educational & Learning Agents (4)

  • Gaming & Simulation Agents

    • Generative Agents Simulation (7)

    • Virtual Town AI (1)

  • Low-Code/No-Code Platforms (5)

  • Agent Communication & Protocols

    • Multi-Agent Platforms (6)

    • Communication Protocols (1)

  • Simulation & Benchmarking Environments

    • Multimodal Model Benchmarks (2)

    • Sensor Fusion Agents (3)

  • Web Automation and UI Interaction

    • Browser Automation

      • Web Automation Systems (8)

      • AI-Powered Agents (2)

    • UI Interaction

      • GUI Action Mapping (5)

      • Visual GUI Agents (5)

  • Personal Assistants & Conversational Agents

    • Chatbots

      • Chatbot Platforms (9)

      • AI Agents (15)

    • Virtual Assistants (8)

    • Human-in-the-Loop Agents (2)

  • Audio & Voice Assistants (7)

  • Security & Privacy Agents

    • Runtime Analysis Tools (6)

    • Post-Exploitation Agents (8)

    • Red-Teaming Platforms (1)

  • Corporate and Analytical Applications

    • RAG and Business Analytics (3)

    • Data Integration and Specialized Solutions (9)

  • Developer Tools and Automation

    • Code Automation and Testing (6)

    • Integration and Extensions (1)

  • Educational and Research Resources

    • Courses and Tutorials (10)

    • Research Lists and Survey Projects

      • LLM Research Repositories (9)

      • Multi-Agent Surveys (4)

AI Agent Frameworks & SDKs

Multi-Agent Collaboration Systems

Multi-Agent Frameworks

  • HITsz-TMG/FilmAgent (★1007 Python) - FilmAgent is a multi-agent framework that automates the entire film production process in virtual 3D spaces by simulating key crew roles and employing collaborative strategies for scriptwriting and cinematography.
  • awslabs/multi-agent-orchestrator (★4775 Python) - Multi-Agent Orchestrator is a flexible open-source framework for managing multiple AI agents to handle complex conversations with intelligent routing and context management.
  • i-am-bee/beeai-framework (★2676 TypeScript) - BeeAI Framework is a production-ready framework for building scalable and flexible multi-agent AI systems in Python and TypeScript, supporting integration with various AI models and tools.
  • jgravelle/AutoGroq (★1459 Python) - AutoGroq is a revolutionary tool that dynamically generates tailored teams of AI agents to efficiently address user projects and inquiries without manual configuration.
  • microsoft/JARVIS (★24246 Python) - JARVIS is a collaborative AI system that connects large language models with expert AI models from HuggingFace to autonomously plan, execute, and integrate complex AI tasks.
  • modelscope/ms-agent (★3305 Python) - MS-Agent is a lightweight and extensible framework that empowers autonomous agents with multi-agent capabilities, code generation, tool calling, and advanced exploration for complex task execution.
  • starpig1129/DATAGEN (★1388 Python) - DATAGEN is an AI-powered multi-agent research assistant that automates hypothesis generation, data analysis, visualization, and report writing, with expanding capabilities in crypto market intelligence through a strategic partnership.
  • zchoi/Awesome-Embodied-Robotics-and-Agent (★1464 ) - A curated and actively maintained repository compiling the latest research and advancements in Embodied AI and robotics integrated with Large Language Models and Vision-Language Models.

Knowledge Graph Orchestration

  • Paper2Poster/Paper2Poster (★2391 Python) - Paper2Poster is an open-source multi-agent system that automates the generation of editable scientific posters from research papers in PDF format, featuring a comprehensive evaluation suite to ensure visual and textual quality.
  • camel-ai/oasis (★1618 Python) - OASIS is an open-source, scalable social media simulation platform that uses large language models and rule-based agents to realistically simulate the behavior of up to one million users on platforms like Twitter and Reddit for studying complex social phenomena.
  • geekan/MetaGPT (★54762 Python) - MetaGPT is a multi-agent AI framework that simulates a software company by assigning different roles to GPT-based agents to collaboratively automate complex software development tasks from natural language requirements.
  • hyp1231/awesome-llm-powered-agent (★2097 ) - A comprehensive curated repository of papers, projects, and resources focused on autonomous and collaborative agents powered by Large Language Models (LLMs) for complex task solving and human interaction simulation.
  • kingjulio8238/Memary (★2289 Jupyter Notebook) - Memary is an open-source memory layer that emulates human memory to enhance autonomous agents' reasoning and learning capabilities, supporting local LLMs and vision models with multi-agent and multi-graph database integration.
  • liveblocks/liveblocks (★4092 TypeScript) - Liveblocks is a platform providing customizable pre-built features to easily add multiplayer, engaging, and AI-ready collaborative experiences to applications without disrupting development roadmaps.
  • metaskills/experts (★1058 JavaScript) - Experts.js is a JavaScript library that simplifies creating and deploying OpenAI Assistants and linking them as Tools to build advanced Multi AI Agent Systems with expanded memory and attention to detail.
  • microsoft/autogen (★48044 Python) - AutoGen is a Microsoft-developed framework for building multi-agent AI applications that operate autonomously or with human collaboration, featuring layered APIs, developer tools, and community support.
  • plexe-ai/plexe (★2100 Python) - Plexe is a machine learning framework that enables users to build and train models using natural language prompts and an AI-powered multi-agent system.
  • satellitecomponent/Neurite (★1843 JavaScript) - Neurite is an open-source fractal graph-of-thought platform that enables rhizomatic mind-mapping for AI agents, multimedia content, and notes, integrating fractal mathematics and multi-agent collaboration for an infinite digital garden of ideas.

AI Agent Ecosystems

  • FoundationAgents/MetaGPT (★57546 Python) - MetaGPT is a multi-agent AI framework that simulates a software company by assigning GPT-based roles to collaboratively automate software development from natural language requirements.
  • OpenBMB/ChatDev (★27221 Python) - ChatDev is a multi-agent LLM-powered virtual software company framework that enables collaborative, customizable, and automated software development through intelligent agents with specialized roles.
  • VRSEN/agency-swarm (★3728 Python) - Agency Swarm is a reliable and production-ready agent framework that enables the creation and management of collaborative multi-agent AI systems using the latest OpenAI Assistants API, designed to automate workflows through customizable agent roles and efficient communication.
  • agi-inc/agent-protocol (★1425 Python) - Agent Protocol is a tech stack agnostic API specification providing a common interface for seamless communication and integration with AI agents, simplifying development and benchmarking across diverse agent implementations.
  • camel-ai/owl (★17673 Python) - OWL is an advanced open-source framework for multi-agent collaboration that automates real-world tasks using dynamic AI agent interactions and a comprehensive set of toolkits.
  • crewAIInc/crewAI (★35021 Python) - CrewAI is a fast, flexible Python framework for orchestrating autonomous, role-playing AI agents that collaborate to solve complex tasks, offering both high-level simplicity and low-level control for enterprise-grade AI automation.
  • daveshap/OpenAI_Agent_Swarm (★3070 Python) - HAAS is a hierarchical autonomous agent swarm leveraging OpenAI's agent APIs to create a self-organizing, ethically governed ecosystem of AI agents that operate fully autonomously under a structured hierarchy for complex task execution.
  • frdel/agent-zero (★10613 Python) - Agent Zero is a customizable, dynamic AI framework that acts as a personal assistant using multi-agent cooperation, tool creation, and autonomous web browsing to accomplish complex tasks interactively.
  • openai/openai-agents-python (★13062 Python) - The OpenAI Agents SDK is a lightweight and powerful framework for building, orchestrating, and tracing multi-agent workflows using large language models with configurable instructions, tools, and safety features.

LLM-Powered Platforms

  • InternLM/MindSearch (★6499 JavaScript) - MindSearch is a multi-agent AI framework that enhances web search engines by using large language models to mimic human cognitive processes for deeper and more efficient search results.
  • SamurAIGPT/GPT-Agent (★1183 JavaScript) - Camel-AutoGPT is a platform that enables two autonomous AI agents to collaborate on shared goals, enhancing problem-solving capabilities and expanding applications in conversational AI, gaming, and education.
  • X-PLUG/MobileAgent (★4504 Python) - Mobile-Agent is a powerful and versatile mobile device operation assistant family featuring multi-agent collaboration, visual perception, and hierarchical frameworks for automating complex tasks on mobile devices and PCs.
  • agno-agi/agno (★30886 Python) - Agno is a lightweight, model-agnostic library for building advanced AI agents with memory, knowledge, tools, and reasoning capabilities, supporting multi-modal inputs and outputs and featuring an advanced multi-agent architecture.
  • asinghcsu/AgenticRAG-Survey (★955 ) - AgenticRAG-Survey is a comprehensive repository and survey exploring advanced Retrieval-Augmented Generation systems enhanced with autonomous AI agents, detailing their patterns, taxonomy, applications, and future challenges.
  • awslabs/agent-squad (★6328 Python) - Agent Squad is a flexible open-source framework for orchestrating multiple AI agents to handle complex conversations with intelligent intent classification and team coordination.
  • camel-ai/camel (★13547 Python) - CAMEL is an open-source multi-agent framework designed for large-scale simulations to study the scaling laws and emergent behaviors of agents in complex environments.
  • eosphoros-ai/DB-GPT (★17047 Python) - DB-GPT is an open-source AI native data app development framework featuring AWEL and multi-agent orchestration to simplify building data-driven applications with large language models and databases.
  • google-a2a/A2A (★17248 TypeScript) - Agent2Agent (A2A) is an open protocol by Google enabling secure communication and interoperability between diverse AI agents to foster collaboration and innovation in the AI ecosystem.
  • modelscope/agentscope (★7668 Python) - AgentScope is a multi-agent platform that enables developers to easily build robust and scalable LLM-powered multi-agent applications with comprehensive tools and a drag-and-drop interface.
  • victordibia/autogen-ui (★952 TypeScript) - A web UI example for the AutoGen framework demonstrating multi-agent LLM applications using the AgentChat API with a Next.js frontend and FastAPI backend.

Collaborative Swarm Systems

  • 101dotxyz/GPTeam (★1696 Python) - GPTeam is an open-source multi-agent simulation platform that uses GPT-4 to enable AI agents to collaborate and communicate effectively to achieve predefined goals.
  • EvoAgentX/EvoAgentX (★1031 Python) - EvoAgentX is an automated framework for building a self-evolving ecosystem of AI agents that generate, evaluate, and execute multi-agent workflows based on natural language goals.
  • Farama-Foundation/chatarena (★1488 Python) - ChatArena is a multi-agent language game environment framework designed to develop and research communication and collaboration capabilities of large language model agents like GPT-3 and GPT-4.
  • OpenBMB/AgentVerse (★4680 JavaScript) - AgentVerse is a versatile open-source framework that enables the deployment and collaboration of multiple LLM-based agents for task-solving and simulation applications, supporting research and development in multi-agent AI systems.
  • StreetLamb/tribe (★1024 TypeScript) - Tribe AI is a low-code platform for rapidly building and coordinating multi-agent teams to solve complex tasks efficiently through customizable workflows and skill-equipped agents.
  • agentuniverse-ai/agentUniverse (★1564 Python) - agentUniverse is a multi-agent framework based on large language models that enables developers to build collaborative intelligent agents integrating domain expertise for solving complex problems.
  • fetchai/uAgents (★1468 Python) - uAgents is a lightweight Python framework by Fetch.ai for creating secure, decentralized autonomous AI agents that operate within a blockchain-enabled network.
  • kaushikb11/awesome-llm-agents (★1024 Python) - A curated list of awesome frameworks and tools for developing Large Language Model (LLM) agents, showcasing a wide range of capabilities and features for AI agent development.
  • langroid/langroid (★3509 Python) - Langroid is a Python framework that enables building LLM-powered applications using a multi-agent programming approach for collaborative problem solving and enhanced developer experience.

Agent Monitoring Tools

  • MervinPraison/PraisonAI (★5234 Jupyter Notebook) - PraisonAI is a production-ready Multi AI Agents framework that enables the creation and management of intelligent AI agents to automate and solve complex problems through collaboration and customization.
  • eumemic/ai-legion (★1415 TypeScript) - AI Legion is a platform for autonomous agents powered by large language models that collaborate to accomplish tasks with minimal human intervention.
  • raga-ai-hub/RagaAI-Catalyst (★16031 Python) - RagaAI Catalyst is a Python SDK platform for observability, monitoring, and evaluation of AI agents and LLM projects, featuring tracing, debugging, advanced analytics, and project management tools.

Orchestration Frameworks

Generative Software Frameworks

  • FedML-AI/FedML (★3911 Python) - FEDML is a unified and scalable machine learning library integrated with TensorOpera AI, enabling large-scale distributed training, model serving, and federated learning across diverse GPU cloud and edge environments.
  • Netflix/metaflow (★9317 Python) - Metaflow is a human-centric framework by Netflix that streamlines building, managing, and deploying real-life AI and machine learning systems from prototyping to production at scale.
  • SqueezeAILab/LLMCompiler (★1721 Python) - LLMCompiler is a framework that optimizes parallel function calling in large language models to improve latency, cost, and accuracy by automatically orchestrating task execution.
  • The-Pocket/PocketFlow (★7466 Python) - Pocket Flow is a minimalist 100-line LLM framework that uses a graph-based abstraction to enable expressive multi-agent systems, workflows, and retrieval-augmented generation, designed for high productivity and minimal bloat.
  • deepset-ai/haystack (★21692 Python) - Haystack is an AI orchestration framework for building customizable, production-ready NLP applications using large language models, vector search, and advanced retrieval techniques.
  • dynamiq-ai/dynamiq (★914 Python) - Dynamiq is an orchestration framework for building and managing agentic AI and large language model applications, enabling complex multi-agent workflows and AI-powered solutions.
  • embabel/embabel-agent (★2305 Kotlin) - Embabel Agent Framework is a JVM-based framework for authoring dynamic agentic flows that combine LLM interactions with code and domain models, featuring sophisticated AI planning and extensibility.
  • google-gemini/genai-processors (★1830 Python) - GenAI Processors is a lightweight Python library for building modular, asynchronous, and composable AI pipelines that enable efficient and parallel content processing for generative AI applications.
  • julep-ai/julep (★5626 Jupyter Notebook) - Julep is a serverless platform that enables data and machine learning teams to build and orchestrate sophisticated AI workflows with smart memory, parallel processing, and seamless integration capabilities, all without managing infrastructure.
  • microsoft/TaskWeaver (★5837 Python) - TaskWeaver is a code-first agent framework by Microsoft designed for planning and executing complex data analytics tasks through coordinated plugins and stateful code execution.
  • octotools/octotools (★1320 Python) - OctoTools is an open-source agentic framework with extensible tools designed for complex reasoning across diverse domains, supporting multiple LLM engines and achieving significant accuracy improvements over existing solutions.
  • om-ai-lab/OmAgent (★2534 Python) - OmAgent is a Python library that enables easy building of multimodal language agents with support for complex workflows, multimodal inputs, advanced reasoning algorithms, and local model deployment for rapid prototyping and production.
  • presenton/presenton (★999 TypeScript) - Presenton is an open-source AI-powered presentation generator and API that runs locally, offering customizable layouts, multiple AI model integrations, versatile image generation, and privacy-focused presentation creation and export.

AI Workflow Orchestrators

  • EmbeddedLLM/JamAIBase (★1027 Python) - JamAI Base is an open-source RAG backend platform with an intuitive spreadsheet-like UI and REST API that integrates embedded databases and LLM orchestration to enable collaborative AI application development and real-time evaluation.
  • PrefectHQ/marvin (★5828 Python) - Marvin is a Python framework for building structured, agentic AI workflows that enable task management, specialized AI agents, and multi-agent orchestration with type-safe results and developer-focused control.
  • dagger/dagger (★14347 Go) - Dagger is an open-source runtime for building composable, repeatable, and observable workflows, ideal for AI agents and CI/CD systems with cross-platform support.
  • evilsocket/nerve (★1120 Python) - Nerve is a simple and powerful Agent Development Kit that enables building, running, evaluating, and orchestrating LLM-based agents using YAML and CLI for programmable and reproducible automation.
  • humanlayer/12-factor-agents (★10810 ) - 12 Factor Agents is a comprehensive framework outlining principles for building reliable, scalable, and production-ready LLM-powered AI software agents.
  • langflow-ai/langflow (★92525 Python) - Langflow is a powerful open-source platform for building, testing, and deploying AI-powered agents and workflows with visual authoring, multi-agent orchestration, and API integration capabilities.
  • lastmile-ai/mcp-agent (★6861 Python) - mcp-agent is a lightweight, composable framework for building AI agents using the Model Context Protocol and simple workflow patterns, enabling robust multi-agent orchestration and integration with AI models.
  • pezzolabs/pezzo (★2993 TypeScript) - Pezzo is an open-source, cloud-native LLMOps platform that streamlines prompt design, version management, collaboration, observability, and instant delivery to optimize AI operations efficiently.
  • stanfordnlp/dspy (★26772 Python) - DSPy is a framework that enables programming language models with compositional Python code to build modular AI systems and optimize their performance beyond traditional prompting methods.
  • von-development/awesome-LangGraph (★951 ) - A curated repository of projects, resources, and tools for building stateful, multi-actor AI applications using the LangGraph framework, extending the LangChain ecosystem.
  • yoheinakajima/babyagi (★21681 Python) - BabyAGI is an experimental framework for building self-building autonomous agents using a function-centric approach with comprehensive management, logging, and a dashboard interface.

Language Agent Frameworks

  • MotiaDev/motia (★4237 TypeScript) - Motia is a modern unified backend framework that integrates APIs, events, background jobs, and AI agents into a single cohesive system with built-in state management, observability, and multi-language support.
  • ThousandBirdsInc/chidori (★1323 Rust) - Chidori is an open-source reactive runtime and IDE for building durable AI agents with advanced debugging, state management, and observability features.
  • VoltAgent/voltagent (★2553 TypeScript) - VoltAgent is an open-source TypeScript framework for building and orchestrating AI agents powered by Large Language Models, enabling developers to create scalable, customizable, and maintainable AI applications with modular components and visual monitoring.
  • dataelement/bisheng (★9234 TypeScript) - BISHENG is an open-source LLM DevOps platform designed for enterprise AI applications, offering powerful workflow orchestration, security features, and high-precision document parsing to support complex intelligent business solutions.
  • kyegomez/swarms (★5057 Python) - Swarms is an enterprise-grade, production-ready multi-agent orchestration framework designed to enable complex AI workflows with high scalability, flexibility, and robust developer and security features.
  • langchain-ai/langchain (★112518 Jupyter Notebook) - LangChain is a versatile framework for building and deploying applications powered by large language models, offering interoperability, real-time data integration, and advanced agent orchestration capabilities.
  • langchain-ai/langgraph (★16509 Python) - LangGraph is a low-level orchestration framework for building resilient, controllable AI language agents with customizable architectures, long-term memory, and real-time streaming support, used by major companies and integrated with the LangChain ecosystem.
  • langchain-ai/langgraphjs (★1837 TypeScript) - LangGraph.js is a low-level orchestration framework for building customizable, reliable, and controllable language agents as graphs, enabling complex AI workflows with features like long-term memory, human-in-the-loop, and real-time streaming support.
  • microsoft/semantic-kernel (★25584 C#) - Semantic Kernel is an enterprise-ready orchestration framework by Microsoft that enables developers to build and deploy intelligent AI agents and multi-agent systems using cutting-edge large language model technology.

Cognitive Architecture Frameworks

End-to-End Optimizers

  • KhoomeiK/LlamaGym (★1208 Python) - LlamaGym is a framework that simplifies fine-tuning large language model agents using online reinforcement learning in Gym-style environments.
  • aiwaves-cn/agents (★5665 Python) - Agents 2.0 is an open-source framework that enables the training and self-evolution of autonomous language agents through symbolic learning inspired by neural network methodologies.
  • getzep/zep (★3444 Go) - Zep is an AI memory foundation that uses a temporal Knowledge Graph to enable continuous learning and personalized experiences for AI agents by efficiently managing and retrieving contextual chat and data artifacts.
  • langchain-ai/opengpts (★6673 Rich Text Format) - OpenGPTs is an open-source platform that replicates and extends OpenAI's GPTs and Assistants API, offering customizable cognitive architectures, extensive tooling, and support for multiple language models and vector databases.
  • letta-ai/letta (★17552 Python) - Letta is an open-source framework for building stateful AI agents with advanced reasoning, transparent long-term memory, and context management, supporting multiple LLM backends and providing a graphical development environment.
  • seanpixel/Teenage-AGI (★914 Python) - Teenage-AGI is a Python project that creates an AI agent with persistent memory and internal thought processes using OpenAI and Pinecone, inspired by generative agents research.
  • topoteretes/cognee (★6507 Jupyter Notebook) - Cognee is an open-source project that provides scalable and modular memory solutions for AI agents using ECL pipelines, enabling efficient data ingestion, knowledge graph generation, and query capabilities to enhance AI performance and reduce hallucinations.
  • ysymyth/awesome-language-agents (★990 TeX) - A curated list and comprehensive resource of language agents based on the Cognitive Architectures for Language Agents (CoALA) framework, including extensive research papers and references.

Continual Learning Agents

  • AgentDock/AgentDock (★1471 TypeScript) - AgentDock is an open-source framework for building sophisticated AI agents with configurable determinism, enabling reliable and creative AI applications through a node-based architecture and multi-stage workflows.
  • langchain-ai/langchainjs (★15282 TypeScript) - LangChain.js is a TypeScript framework for building context-aware, reasoning applications powered by large language models, offering modular components, chains, agents, and tools for production and deployment.

Autonomous Research & Content Generation

Prompt Libraries

  • HKUDS/RAG-Anything (★1928 Python) - RAG-Anything is a next-generation all-in-one multimodal Retrieval-Augmented Generation system that processes and queries diverse document content including text, images, tables, and equations within a unified framework.
  • abilzerian/LLM-Prompt-Library (★1386 Python) - The LLM Prompt Library is a comprehensive collection of experimental prompt templates and tools designed for multiple major large language models, supporting advanced prompt engineering and optimization across various domains.
  • agentcoinorg/evo.ninja (★1075 TypeScript) - Evo.ninja is a dynamic AI generalist agent that adapts to tasks by selecting specialized agent personas to efficiently achieve user goals through a continuous execution loop.
  • arcee-ai/mergekit (★6116 Python) - mergekit is a toolkit for efficiently merging pre-trained large language models using various algorithms, supporting CPU and GPU execution, and enabling creation of versatile merged models with maintained inference costs.
  • dottxt-ai/outlines (★12181 Python) - Outlines is an open-source tool that enables structured text generation with large language models, ensuring predictable and schema-compliant outputs for various applications.
  • dzhng/deep-research (★17177 TypeScript) - Open Deep Research is an AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models to generate comprehensive markdown reports.
  • guy-hartstein/company-research-agent (★1370 Python) - Agentic Company Researcher is a multi-agent AI-powered tool that generates comprehensive, real-time company research reports by synthesizing data from multiple sources using Gemini 2.0 Flash and GPT-4.1 models.
  • icip-cas/PPTAgent (★1806 Python) - PPTAgent is an innovative system that automatically generates and evaluates high-quality presentations from documents using a two-phase approach and a comprehensive multi-dimensional evaluation framework.
  • johnbean393/Sidekick (★2996 Swift) - Sidekick is a native macOS app that enables offline interaction with a local large language model to retrieve and analyze information from files, folders, and websites on the user's Mac, offering advanced features like function calling, memory, and image generation.
  • memfreeme/memfree (★1415 TypeScript) - MemFree is a hybrid AI search engine and AI page generator that provides accurate answers from personal knowledge bases and the internet, while enabling rapid creation of production-ready web pages using advanced AI models and popular front-end frameworks.
  • microsoft/RD-Agent (★6780 Python) - RD-Agent is an open-source R&D automation tool by Microsoft designed to automate and enhance industrial research and development processes focused on data and models using AI.
  • mshumer/ai-researcher (★932 Jupyter Notebook) - AI Researcher is an AI-powered tool that automates comprehensive topic research by breaking down subjects into subtopics, generating detailed reports, and combining them into a final document using Claude 3 and SERPAPI.
  • muellerberndt/mini-agi (★2885 Python) - MiniAGI is a simple autonomous agent leveraging OpenAI's GPT models to perform complex tasks autonomously using chain-of-thought reasoning, short-term memory, and self-criticism.
  • parthsarthi03/raptor (★1335 Python) - RAPTOR is a retrieval-augmented language model framework that uses recursive tree-structured processing to enable efficient and context-aware information retrieval from large documents.
  • snap-stanford/Biomni (★1894 Jupyter Notebook) - Biomni is a general-purpose biomedical AI agent that autonomously executes diverse research tasks across multiple biomedical subfields by integrating advanced LLM reasoning, retrieval-augmented planning, and code-based execution.

Reasoning Frameworks

  • HKUDS/Auto-Deep-Research (★1072 Python) - Auto-Deep-Research is an open-source, fully automated, and cost-effective personal AI assistant that supports multiple large language models and offers a one-click launch experience as an alternative to OpenAI's Deep Research.
  • NirDiamant/Controllable-RAG-Agent (★1370 Jupyter Notebook) - Controllable-RAG-Agent is an advanced Retrieval-Augmented Generation solution featuring a deterministic graph-based autonomous agent designed for complex question answering with multi-step reasoning and hallucination prevention.
  • anysphere/priompt (★2650 TypeScript) - Priompt is a JSX-based prompting library that uses a priority system to efficiently construct token-limited prompts for language models, inspired by React's component design.
  • bytedance/deer-flow (★15792 TypeScript) - DeerFlow is a community-driven deep research framework that integrates language models with web search, crawling, and Python execution tools to facilitate comprehensive AI-powered research workflows.
  • friuns2/BlackFriday-GPTs-Prompts (★8340 ) - A curated repository of free GPT prompts and jailbreaks across various domains, enabling users to utilize GPT models without a subscription to GPT Plus.
  • guidance-ai/guidance (★20534 Jupyter Notebook) - Guidance is a programming paradigm and library that enables efficient and fine-grained control over large language models to produce structured, high-quality outputs with reduced latency and cost.
  • mongodb-developer/GenAI-Showcase (★3888 Jupyter Notebook) - MongoDB's GenAI Showcase is a comprehensive repository offering examples and applications that integrate MongoDB with advanced Generative AI techniques like Retrieval-Augmented Generation and AI Agents to support AI development and innovation.
  • refly-ai/refly (★4399 TypeScript) - Refly is an open-source AI-native creation engine that combines multi-threaded dialogues, multimodal inputs, knowledge base integration, and AI-powered tools to transform ideas into production-ready interactive content and web applications.

Programming Languages

  • AnotiaWang/deep-research-web-ui (★1981 Vue) - Deep Research Web UI is a secure, AI-powered web interface that enables iterative, deep research by combining search engines, web scraping, and large language models with real-time feedback and exportable research reports.
  • EgoAlpha/prompt-in-context-learning (★1621 Jupyter Notebook) - An open-source comprehensive resource and engineering guide for mastering in-context learning and prompt engineering with large language models like ChatGPT, GPT-3, and FlanT5.
  • Fosowl/agenticSeek (★20457 Python) - AgenticSeek is a fully local, autonomous AI assistant powered by Deepseek R1 agents that can code, browse the web, and manage files without relying on cloud services, ensuring user data privacy and eliminating ongoing costs.
  • HKUDS/AI-Researcher (★1721 Python) - AI-Researcher is a fully automated scientific discovery platform powered by large language models that autonomously conducts research from idea generation to manuscript creation.
  • InternLM/xtuner (★4666 Python) - XTuner is a comprehensive and efficient toolkit for fine-tuning large language and vision-language models, supporting a wide range of models and training algorithms with scalability and flexibility.
  • MODSetter/SurfSense (★6188 TypeScript) - SurfSense is a highly customizable AI research agent that integrates with personal knowledge bases and external sources to provide advanced, privacy-conscious research and knowledge management capabilities.
  • developersdigest/llm-answer-engine (★4911 TypeScript) - A Perplexity-inspired answer engine built with Next.js and multiple AI and search technologies to provide rich, context-aware responses including sources, images, videos, and follow-up questions.
  • rotemweiss57/gpt-newspaper (★1344 Python) - GPT Newspaper is an AI-driven autonomous agent system that creates personalized newspapers tailored to individual user preferences by automating news curation, writing, design, and publishing.
  • run-llama/LlamaIndexTS (★2780 TypeScript) - LlamaIndexTS is a lightweight, extensible data framework designed to integrate large language models into server-side JavaScript applications, supporting multiple runtimes and a wide range of LLM providers.

AI Assistants

  • Alibaba-NLP/WebAgent (★5310 Python) - Alibaba-NLP/WebAgent is a research project by Alibaba's Tongyi Lab featuring WebWalker and WebDancer models for autonomous information seeking and web traversal using large language models with advanced training paradigms and benchmarks.
  • KwaiKEG/KwaiAgents (★1173 Python) - KwaiAgents is an open-source generalized information-seeking agent system leveraging Large Language Models with enhanced agent capabilities, datasets, benchmarks, and deployment tools.
  • MLSysOps/MLE-agent (★1328 Python) - MLE-Agent is an intelligent AI assistant designed to autonomously support machine learning engineers and researchers by building baselines, participating in Kaggle competitions, integrating research resources, and providing smart debugging and personalized project assistance.
  • SakanaAI/evolutionary-model-merge (★1353 Python) - A repository by SakanaAI for evolutionary optimization of model merging recipes, providing models, code, and resources to improve and evaluate merged language and vision-language models, particularly for Japanese tasks.
  • agent0ai/agent-zero (★11332 Python) - Agent Zero is a dynamic, customizable AI framework that acts as a personal assistant capable of learning, creating tools, and cooperating with multiple agents to accomplish diverse tasks.
  • assafelovic/gpt-researcher (★22586 Python) - GPT Researcher is an autonomous AI agent that conducts deep web and local research to generate detailed, unbiased research reports with citations.
  • microsoft/ToRA (★1083 Python) - ToRA is a series of tool-integrated reasoning agents developed by Microsoft that combine large language models with external computational tools to solve challenging mathematical reasoning problems with high accuracy.
  • superagent-ai/grok-cli (★1288 TypeScript) - Grok CLI is an open-source conversational AI command-line tool that integrates Grok AI to provide intelligent file operations, shell command execution, and extensible tool usage directly within the terminal.

Agent Integration & Deployment Tools

Stateful Serverless Frameworks

  • BerriAI/litellm (★26531 Python) - LiteLLM is a Python SDK and proxy server that provides a unified OpenAI-compatible interface to call over 100 large language model APIs from multiple providers, featuring consistent output, retry logic, budget controls, and extensive observability integrations.
  • FlowiseAI/Flowise (★42066 TypeScript) - Flowise is an open-source drag-and-drop platform that enables users to easily build and deploy customized Large Language Model (LLM) application workflows with a user-friendly interface and flexible deployment options.
  • Mirascope/mirascope (★1236 Python) - Mirascope is a flexible and user-friendly library that provides a unified interface to work with multiple large language model providers, simplifying AI-driven text generation and information extraction tasks.
  • OpenBMB/ToolBench (★5186 Python) - ToolBench is an open platform for training, serving, and evaluating large language models with advanced tool-use capabilities using a large-scale, richly annotated dataset of real-world APIs.
  • deepsense-ai/ragbits (★1549 Python) - Ragbits is a modular framework providing building blocks for rapid development, deployment, and monitoring of scalable and reliable Generative AI applications with flexible LLM integration and advanced document processing capabilities.
  • emcie-co/parlant (★3415 Python) - Parlant is a Conversation Modeling engine that enables precise, consistent, and reliable control over GenAI-driven conversational agents by enforcing structured behavioral guidelines and adapting dynamically to user interactions.
  • rivet-gg/rivetkit (★1048 TypeScript) - RivetKit is a stateful serverless framework that enables building scalable, real-time, and collaborative applications deployable across multiple platforms including Rivet, Cloudflare Workers, Bun, and Node.js.
  • run-llama/llama_deploy (★2046 Python) - LlamaDeploy is an async-first framework that enables seamless deployment, scaling, and productionization of agentic multi-service workflows built with llama_index, facilitating easy transition from development to cloud-based production environments.

AI Agent Development

  • DataDog/dd-agent (★1316 Python) - Datadog/dd-agent is the legacy version 5 of the Datadog monitoring agent that collects and reports system and application metrics to the Datadog platform, supporting custom checks and configurations.
  • Intelligent-Internet/ii-agent (★2772 Python) - II-Agent is an open-source framework that builds and deploys intelligent agents leveraging leading large language models to automate complex workflows and enhance productivity across multiple domains.
  • JoshuaC215/agent-service-toolkit (★3376 Python) - AI Agent Service Toolkit is a comprehensive framework for building, deploying, and interacting with AI agents using LangGraph, FastAPI, and Streamlit, featuring multi-agent support, streaming, content moderation, and a user-friendly chat interface.
  • Meirtz/Awesome-Context-Engineering (★1325 ) - Awesome Context Engineering is a comprehensive survey and resource repository focused on the evolution from prompt engineering to dynamic, context-aware AI systems, providing theoretical foundations, methodologies, and practical guides for large language models and AI agents.
  • OpenPipe/ART (★3713 Python) - OpenPipe/ART is an open-source reinforcement learning framework that trains multi-step agents for real-world tasks using GRPO and innovative zero-shot reward scoring with LLMs.
  • TaskingAI/TaskingAI (★5274 Python) - TaskingAI is an open-source BaaS platform for developing, managing, and deploying AI-native applications using large language models with unified APIs, customizable tools, and multi-tenant support.
  • bytedance/trae-agent (★8492 Python) - Trae Agent is an LLM-based agent with a modular, research-friendly architecture designed to execute complex software engineering tasks via a natural language CLI interface using multiple LLM providers.
  • codelion/optillm (★2687 Python) - OptiLLM is an OpenAI API compatible optimizing inference proxy that enhances the accuracy and performance of large language models by applying advanced reasoning and decoding techniques during inference.
  • coze-dev/coze-loop (★4104 Go) - Coze Loop is an open-source AI agent optimization platform providing full-lifecycle management from prompt development and debugging to evaluation and monitoring, enabling efficient AI agent development and operation.
  • coze-dev/coze-studio (★12517 TypeScript) - Coze Studio is an all-in-one AI agent development platform that simplifies the creation, debugging, and deployment of AI agents through visual tools and supports no-code and low-code development approaches.
  • hiyouga/LLaMA-Factory (★55189 Python) - LLaMA Factory is a unified and efficient platform for fine-tuning over 100 large language and vision-language models, offering scalable training methods, advanced optimization algorithms, and versatile deployment options.
  • huggingface/smolagents (★21664 Python) - Smolagents is a minimalistic Python library by Hugging Face for creating intelligent agents that think and act by generating and executing Python code, supporting multiple LLMs, modalities, and tool integrations.
  • instructor-ai/instructor (★10312 Python) - Instructor is a popular Python library that enables structured output handling and validation for large language models, supporting multiple LLM providers and offering features like retry management and streaming support.
  • jackmpcollins/magentic (★2348 Python) - Magentic is a Python library that enables seamless integration of Large Language Models into Python code by using decorators to create functions with structured LLM outputs, supporting advanced features like streaming, function calling, and observability.
  • mpaepper/llm_agents (★1003 Python) - A small library to build intelligent agents controlled by large language models, enabling task-solving through tool integration in a Thought-Action-Observation loop.
  • pinpoint-apm/pinpoint (★13682 Java) - Pinpoint is an APM tool for large-scale distributed systems that provides real-time monitoring, transaction tracing, and detailed performance analysis with minimal impact on application performance.
  • transitive-bullshit/agentic (★17847 TypeScript) - Agentic is a versatile AI agent standard library that integrates with any large language model and TypeScript AI SDK, providing a comprehensive set of AI functions and tools for building intelligent applications.

AI Agent Orchestration

  • Agent-RL/ReCall (★1103 Python) - ReCall is a reinforcement learning framework that trains large language models to reason and use arbitrary tools agentically, enabling advanced tool-based reasoning and general-purpose AI agents.
  • AgentEra/Agently (★1397 Python) - Agently is a GenAI application development framework that enables fast and easy creation of AI agent-native applications using structured data, chained-calls syntax, and workflow management, with seamless model switching capabilities.
  • BloopAI/vibe-kanban (★3476 Rust) - Vibe Kanban is a Kanban board tool designed to manage and orchestrate AI coding agents, enabling efficient task tracking and workflow optimization in AI-assisted software development.
  • LazyAGI/LazyLLM (★2311 Python) - LazyLLM is a low-code development tool that simplifies building, deploying, and optimizing multi-agent large language model applications through an iterative development workflow.
  • NirDiamant/agents-towards-production (★9133 Jupyter Notebook) - Agents Towards Production is an open-source repository offering end-to-end tutorials and blueprints for building scalable, production-grade Generative AI agents from concept to deployment.
  • crewAIInc/crewAI-tools (★1177 Python) - CrewAI Tools is a versatile toolkit that extends AI agents with powerful, customizable tools for complex real-world tasks, supporting rapid integration and community-driven Model Context Protocol (MCP) server connections.
  • elizaOS/eliza (★16497 TypeScript) - Eliza is a versatile AI agent operating system that enables the creation and deployment of autonomous agents across multiple platforms with extensible features and document interaction capabilities.
  • strands-agents/sdk-python (★2544 Python) - Strands Agents is a Python SDK that enables building and deploying AI agents using a model-driven approach, supporting multiple AI models and advanced features like multi-agent systems and MCP integration.
  • strnad/CrewAI-Studio (★946 Python) - CrewAI Studio is a multi-platform, user-friendly GUI application for managing and running CrewAI agents and tasks without coding, supporting various LLM providers and offering features like results history, custom tools, and Docker deployment.
  • uber-common/jvm-profiler (★1791 Java) - Uber JVM Profiler is a Java Agent for collecting and reporting JVM metrics and stack traces in distributed Hadoop and Spark environments, supporting multiple output formats and advanced profiling features.

LLM Framework Tools

  • 567-labs/instructor (★11084 Python) - Instructor is a popular Python library that simplifies working with structured outputs from large language models by providing validation, retries, and streaming support across multiple LLM providers.
  • TEN-framework/ten-framework (★5844 C) - TEN Framework is a real-time, distributed, cloud-edge collaborative multimodal AI agent framework supporting multiple programming languages for building high-performance AI applications.
  • farizrahman4u/loopgpt (★1458 Python) - LoopGPT is a modular and extensible Auto-GPT framework that enables users to create customizable AI agents with efficient token usage, human feedback integration, and full state serialization.
  • google/adk-samples (★3653 Python) - A collection of sample agents built with the Agent Development Kit (ADK) to accelerate development of conversational and multi-agent workflows, designed for customization and integration with Google Cloud services.
  • langfuse/langfuse (★14388 TypeScript) - Langfuse is an open source LLM engineering platform that enables collaborative development, monitoring, evaluation, and debugging of AI applications with comprehensive observability, prompt management, and integration features.
  • modelscope/modelscope-agent (★3207 Python) - ModelScope-Agent is a customizable and scalable agent framework that connects large language models with real-world applications, enabling role-playing, tool usage, planning, and memory functionalities through a unified and extensible interface.
  • patterns-ai-core/langchainrb (★1877 Ruby) - Langchain.rb is a Ruby gem that provides a unified interface to build LLM-powered applications, supporting multiple AI providers and features like embeddings, prompt completions, chat completions, and Retrieval Augmented Generation.
  • shroominic/codeinterpreter-api (★3852 Python) - An open-source API that implements the ChatGPT Code Interpreter for sandboxed Python code execution, enabling data analysis, visualization, and interactive AI-driven computational tasks with local execution and scalable production deployment.
  • truefoundry/cognita (★4170 Python) - Cognita is an open-source, modular, and production-ready RAG framework by TrueFoundry that simplifies building, customizing, and deploying scalable Retrieval Augmented Generation applications with a no-code UI and extensive integration features.

AI Agent Deployment

  • Forethought-Technologies/AutoChain (★1857 Python) - AutoChain is a lightweight and extensible framework that simplifies building and automatically evaluating custom generative agents powered by large language models using custom tools and simulated multi-turn conversations.
  • GoogleCloudPlatform/agent-starter-pack (★2280 Python) - A collection of production-ready Generative AI Agent templates for Google Cloud that accelerates development by addressing deployment, evaluation, customization, and observability challenges.
  • MeetKai/functionary (★1574 Python) - Functionary is a chat language model that dynamically interprets and executes functions or plugins, enabling advanced tool usage and result interpretation for interactive AI applications.
  • e2b-dev/E2B (★9107 MDX) - E2B is an open-source secure cloud runtime infrastructure that enables safe execution of AI-generated code in isolated sandboxes for AI applications and agents.
  • e2b-dev/awesome-ai-sdks (★990 ) - A curated database of SDKs, frameworks, libraries, and tools for creating, monitoring, debugging, and deploying autonomous AI agents, maintained by the e2b team building an operating system for AI agents.
  • firebase/genkit (★2643 TypeScript) - Genkit is an open-source framework that enables developers to build, integrate, and test AI-powered applications with a unified API, plugin architecture, and tools for observability and evaluation across various AI models and platforms.
  • nasa-jpl/rosa (★1245 Python) - ROSA is an AI-powered assistant that enables natural language interaction with ROS1 and ROS2 robotics systems to help developers inspect, diagnose, and operate robots efficiently.
  • nndeploy/nndeploy (★1103 C++) - nndeploy is a workflow-based multi-platform AI deployment tool that enables efficient, high-performance, and customizable deployment of AI models across cloud, desktop, mobile, and edge platforms using a visual drag-and-drop interface and extensive framework support.
  • ollama/ollama (★147948 Go) - Ollama is a versatile platform that enables easy deployment, customization, and management of large language models locally across multiple operating systems with support for various model formats and a REST API.
  • xingyaoww/code-act (★1312 Python) - CodeAct is a framework that improves large language model agents by using executable code as a unified action space, enabling dynamic multi-turn interactions and outperforming traditional text and JSON action representations.
  • zebbern/claude-code-guide (★1444 ) - Claude Code Guide is a comprehensive and authoritative resource providing detailed instructions, tips, and references for using the Claude Code CLI tool to interact with the Claude AI model, including advanced features and MCP integration.

AI Application Development

  • 0xPlaygrounds/rig (★4023 Rust) - Rig is a Rust library for building scalable, modular, and ergonomic fullstack agents powered by large language models, supporting multiple LLM providers and vector stores for AI integration.
  • crestalnetwork/intentkit (★6429 Python) - IntentKit is an open-source autonomous agent framework that enables the creation and management of AI agents with blockchain and social media integrations, featuring an extensible skill and plugin system.
  • daytonaio/daytona (★21035 TypeScript) - Daytona is a secure and elastic infrastructure platform designed for safely running AI-generated code with high performance and scalability.
  • e2b-dev/awesome-ai-agents (★20067 ) - A curated and comprehensive list of AI autonomous agents categorized into open-source and closed-source projects, serving as a valuable resource for exploring, building, and contributing to AI agent development.
  • fiatrete/OpenDAN-Personal-AI-OS (★1892 Python) - OpenDAN is an open source Personal AI Operating System that integrates various AI modules and agents into a unified platform for personal use, enabling users to manage AI assistants, workflows, and smart device control with strong privacy protections.
  • langgenius/dify (★109009 TypeScript) - Dify is an open-source platform for developing and deploying AI applications using large language models, featuring workflow building, model management, agent capabilities, and observability tools.
  • microsandbox/microsandbox (★3455 Rust) - Microsandbox is a self-hosted platform that enables secure, fast, and isolated execution of untrusted user or AI-generated code using hardware-level microVMs and supports multiple programming languages through SDKs.
  • mlflow/mlflow (★21381 Python) - MLflow is an open-source platform that manages the full machine learning lifecycle, providing tools for experiment tracking, model packaging, deployment, evaluation, and observability in one integrated solution.
  • openinterpreter/open-interpreter (★60088 Python) - Open Interpreter is an open-source natural language interface that enables users to run code locally and interact with their computers through conversational commands, combining the power of large language models with local computing flexibility.
  • pydantic/pydantic-ai (★11326 Python) - PydanticAI is a Python agent framework that simplifies building production-grade Generative AI applications by providing type-safe, structured, and model-agnostic integration with multiple LLMs, along with features like dependency injection and real-time debugging.

AI Developer Toolkit

  • GaiaNet-AI/gaianet-node (★4986 Shell) - GaiaNet Node is a decentralized AI agent platform that enables users to install, run, and deploy their own AI nodes with customizable models and configurations.
  • OpenAgentPlatform/Dive (★1455 TypeScript) - Dive is an open-source MCP Host Desktop Application that integrates with various LLMs supporting function calling, offering cross-platform compatibility, multi-language support, advanced API management, and seamless AI agent integration via the Model Context Protocol.
  • agiresearch/AIOS (★4437 Python) - AIOS is an AI Agent Operating System that integrates large language models into an OS environment to facilitate the development, deployment, and management of AI agents with robust resource and tool management.
  • agiresearch/OpenAGI (★2176 Python) - OpenAGI is a research-driven package for creating AI agents that integrate large language models with domain experts, supporting collaborative development and extensibility within the AIOS ecosystem.
  • cheshire-cat-ai/core (★2828 Python) - Cheshire Cat AI is a flexible and extensible framework for building custom AI agents as microservices, featuring API-first design, plugin support, multiuser capabilities, and integration with various language models via Langchain.
  • google/adk-python (★11340 Python) - An open-source Python toolkit by Google for building, evaluating, and deploying sophisticated AI agents with flexibility, modularity, and integration within the Google ecosystem.
  • kyrolabs/awesome-langchain (★8628 ) - A curated and comprehensive list of tools, projects, and resources centered around the LangChain framework for developing and deploying large language model applications.
  • pietrozullo/mcp-use (★1847 Python) - MCP-Use is an open-source client library that enables seamless integration of any large language model with MCP servers to build custom agents with tool access and dynamic server management.
  • ragapp/ragapp (★4291 TypeScript) - RAGapp is an enterprise-ready platform that simplifies deploying and managing Agentic Retrieval-Augmented Generation systems using Docker and LlamaIndex, enabling secure and customizable AI model integration on private cloud infrastructure.
  • reworkd/AgentGPT (★34605 TypeScript) - AgentGPT is an open-source platform that allows users to assemble, configure, and deploy autonomous AI agents in a web browser to pursue custom goals by generating and executing tasks independently.
  • unslothai/unsloth (★42895 Python) - Unsloth is a toolkit for fine-tuning and reinforcement learning of large language models, enabling faster training with significantly reduced VRAM usage across various advanced models and tasks.

AI Agent Operating System

  • Arindam200/awesome-ai-apps (★1777 Jupyter Notebook) - A comprehensive repository showcasing practical examples, tutorials, and tools for building powerful AI applications using large language models and various AI agent frameworks.
  • HKUDS/AutoAgent (★5679 Python) - AutoAgent is a fully-automated, zero-code framework for creating and deploying LLM agents using natural language, featuring top GAIA benchmark performance and broad LLM support.
  • MemTensor/MemOS (★2062 Python) - MemOS is an operating system for Large Language Models that enhances their capabilities with long-term memory, enabling more context-aware, consistent, and personalized AI interactions through a modular and extensible memory architecture.
  • PySpur-Dev/pyspur (★5316 TypeScript) - PySpur is a visual playground that enables AI engineers to rapidly build, debug, and deploy AI agents with enhanced visibility, human-in-the-loop workflows, and multimodal support.
  • TrafficGuard/typedai (★1115 TypeScript) - TypedAI is a TypeScript-first AI platform for developers to build and run autonomous AI agents, LLM-based workflows, chatbots, and software developer agents with advanced features and flexible deployment options.
  • e2b-dev/e2b-cookbook (★1056 TypeScript) - The e2b-cookbook repository provides example code and guides for building AI-powered applications using the E2B SDK, showcasing integrations with multiple LLM providers and AI frameworks in Python and TypeScript.
  • grafana/agent (★1637 Go) - Grafana Agent is a vendor-neutral, programmable observability pipeline for collecting, processing, and delivering telemetry data across Prometheus, OpenTelemetry, and Grafana ecosystems, now in Long-Term Support with a recommended migration to Grafana Alloy.
  • heshengtao/comfyui_LLM_party (★1814 Python) - ComfyUI LLM Party is a versatile framework that enables users to build, customize, and integrate large language model workflows within ComfyUI, supporting a wide range of LLMs, local models, and social platform integrations for diverse AI applications.
  • kyrolabs/awesome-agents (★950 ) - Awesome Agents is a curated list of open source AI agents, frameworks, and tools for developing, testing, and deploying intelligent autonomous systems.
  • langchain-ai/langserve (★2135 JavaScript) - LangServe is a library that enables developers to deploy LangChain runnables and chains as validated, scalable REST APIs with rich features including streaming, tracing, and client SDKs.
  • mcp-use/mcp-use (★4972 Python) - mcp-use is an open-source unified client library that enables seamless integration of any LangChain-supported LLM with MCP servers to build custom agents with tool access.
  • wassim249/fastapi-langgraph-agent-production-ready-template (★1031 Python) - A production-ready FastAPI template for building scalable, secure AI agent applications integrated with LangGraph, featuring robust monitoring, security, and an automated model evaluation framework.

AI Agent Gateway

  • Arize-ai/phoenix (★6489 Jupyter Notebook) - Phoenix is an open-source AI observability platform that enables tracing, evaluation, and optimization of large language model applications across various deployment environments.
  • TransformerOptimus/SuperAGI (★16582 Python) - SuperAGI is an open-source autonomous AI agent framework that enables developers to build, manage, and deploy scalable and efficient autonomous AI agents with extensible capabilities through toolkits and a user-friendly interface.
  • Upsonic/Upsonic (★7625 Python) - Upsonic is a reliability-focused AI agent framework that supports Model Context Protocol (MCP) and provides advanced features for trustworthy, production-ready AI agent workflows with integrated tool use and secure runtime environments.
  • cloudflare/agents (★2248 TypeScript) - Cloudflare Agents is a framework for building and deploying intelligent, stateful AI agents that operate autonomously at the edge of the network with real-time communication and persistent memory.
  • e2b-dev/fragments (★5675 TypeScript) - Fragments by E2B is an open-source Next.js template for building AI-generated applications with secure code execution, multi-stack support, and integration with various large language model providers.
  • jina-ai/dev-gpt (★1868 Python) - Dev-GPT is an AI-powered automated virtual development team that generates, runs, and deploys custom microservices based on user descriptions using advanced OpenAI models.
  • mastra-ai/mastra (★15368 TypeScript) - Mastra is a TypeScript AI agent framework that enables rapid development of AI applications with support for multiple LLMs, advanced agents, workflows, RAG, integrations, and automated evaluations.
  • metorial/mcp-containers (★2531 TypeScript) - MCP Containers offers containerized versions of hundreds of Model Context Protocol (MCP) servers, enabling easy, secure, and up-to-date deployment of these servers via Docker containers.
  • simstudioai/sim (★6251 TypeScript) - Sim Studio is an open-source platform for building, testing, and optimizing AI agent workflows with flexible deployment options including cloud and self-hosting.
  • superagent-ai/superagent (★6061 TypeScript) - Superagent is an open-source AI assistant framework and API that enables developers to build and integrate powerful AI agents into applications using large language models and generative AI technologies.
  • xlang-ai/OpenAgents (★4404 Python) - OpenAgents is an open platform providing versatile language agents for data analysis, plugin integration, and autonomous web browsing, designed for real-world applications and easy deployment.

MCP Server Framework

  • AgentOps-AI/agentops (★4710 Python) - AgentOps is a Python SDK providing observability, cost tracking, benchmarking, and security tools for AI agents and large language models, integrating with major agent frameworks to support developers from prototype to production.
  • Helicone/helicone (★4251 TypeScript) - Helicone is an open-source LLM observability platform that enables developers to monitor, evaluate, and experiment with large language model applications through easy integration and comprehensive tools.
  • Klavis-AI/klavis (★3572 TypeScript) - Klavis AI is an open-source project offering production-ready MCP integration and scalable infrastructure for AI applications across multiple platforms with built-in authentication and extensive tool support.
  • aipotheosis-labs/aci (★4454 Python) - ACI.dev is an open-source platform that connects AI agents to over 600 tool integrations with multi-tenant authentication, granular permissions, and access via direct function calls or a unified MCP server, enabling production-ready AI agent development without infrastructure complexity.
  • casibase/casibase (★3893 Go) - Casibase is an open-source enterprise-level AI Cloud OS that provides a knowledge base and agent management platform with admin UI, user management, Single-Sign-On, and support for multiple AI models including ChatGPT and Claude.
  • jenkinsci/kubernetes-plugin (★2290 Java) - Jenkins Kubernetes plugin enables dynamic provisioning and management of Jenkins build agents as Kubernetes pods, automating scaling and integration for efficient CI/CD workflows.
  • kagent-dev/kagent (★1237 Python) - kagent is a Kubernetes native framework for building, deploying, and managing AI agents with extensibility, observability, and declarative configuration features.
  • trycua/cua (★9082 Python) - c/ua is a high-performance framework that enables AI agents to control full macOS and Linux operating systems within secure, lightweight virtual containers on Apple Silicon Macs, supporting complex autonomous workflows and LLM integration.

Multimodal AI & Vision Agents

Image Processing & Analysis Agents

  • 11cafe/jaaz (★1682 TypeScript) - Jaaz is a local and free AI design agent that enables users to design, edit, and generate images, posters, and storyboards with advanced AI-powered tools and a creative canvas for fast iterations.
  • QIN2DIM/hcaptcha-challenger (★1849 Python) - hCaptcha Challenger is an open-source project that uses multimodal large language models and advanced machine learning techniques to automate solving hCaptcha challenges without relying on third-party services or scripts.
  • THUDM/CogVLM (★6604 Python) - CogVLM and CogAgent are state-of-the-art open-source visual language models designed for advanced image understanding, multi-turn dialogue, and GUI agent capabilities, achieving top performance on multiple cross-modal benchmarks.
  • apple/ml-ferret (★8645 Python) - Ferret is an end-to-end multimodal large language model developed by Apple that excels in fine-grained referring and grounding tasks with open vocabulary, supported by a large-scale dataset and evaluation benchmark for research purposes.
  • roboflow/inference (★1822 Python) - Roboflow Inference is a platform that turns any computer or edge device into a command center for deploying and managing computer vision models and workflows, enabling advanced AI-powered visual applications.

Video Processing Agents

  • RayVentura/ShortGPT (★6677 Python) - ShortGPT is an AI-powered framework that automates video content creation and editing for platforms like YouTube Shorts and TikTok, supporting multi-language voiceovers and automated footage sourcing.
  • showlab/Awesome-Video-Diffusion (★4748 ) - A curated list of recent diffusion models and resources for video generation, editing, restoration, and various AI-driven video applications.
  • video-db/Director (★1018 Python) - Director is an AI video agents framework that enables complex video tasks like search, editing, and generation through natural language commands and real-time streaming, built on VideoDB's video-as-data infrastructure.

Task Automation & Workflow Orchestration

Job Automation Tools

  • Josh-XT/AGiXT (★3051 Python) - AGiXT is a comprehensive AI automation platform that orchestrates instruction management and complex task execution across multiple AI providers using adaptive memory, smart features, and a versatile plugin system.
  • OpenAdaptAI/OpenAdapt (★1338 Python) - OpenAdapt is an open-source AI-first process automation tool that connects Large Multimodal Models with traditional desktop and web GUIs to automate repetitive workflows using advanced AI techniques.
  • PrefectHQ/ControlFlow (★1347 Python) - ControlFlow is a Python framework for building structured, observable, and controllable agentic AI workflows that enable developers to orchestrate complex AI tasks with specialized agents and multi-agent coordination.
  • activepieces/activepieces (★16037 TypeScript) - Activepieces is an open-source AI automation platform that provides an extensible, no-code builder framework with over 280 MCP servers for AI agents, enabling seamless AI workflows and integrations.
  • agentheroes/agentheroes (★3376 TypeScript) - AgentHeroes is a platform to generate, animate, and schedule AI characters and videos with automation and social media integration.
  • dust-tt/dust (★1170 TypeScript) - Dust is a custom AI assistant platform designed to speed up work by enabling personalized AI agents for task automation and workflow optimization.
  • feder-cr/Jobs_Applier_AI_Agent_AIHawk (★28493 Python) - AIHawk is an AI-powered tool that automates and streamlines the job application process to help job seekers efficiently apply to multiple jobs with tailored applications.
  • huginn/huginn (★46906 Ruby) - Huginn is an open-source platform that enables users to create customizable agents to automate online tasks, monitor events, and perform actions securely on their own server.
  • kaiban-ai/KaibanJS (★1194 JavaScript) - KaibanJS is a JavaScript-native framework that uses a Kanban-inspired approach to build, manage, and visualize multi-agent AI systems and workflows in real-time.
  • microsoft/genaiscript (★2739 TypeScript) - GenAIScript is a JavaScript framework for programmatically creating and managing prompts for large language models, integrating tools, data, and workflows for advanced AI scripting.
  • microsoft/magentic-ui (★6650 Python) - Magentic-UI is a human-centered research prototype featuring a multi-agent system that automates complex web tasks with user collaboration, transparency, and control.
  • qax-os/excelize (★19435 Go) - Excelize is a Go library for reading, writing, and manipulating Microsoft Excel spreadsheet files with support for charts, images, and large datasets.

Inventory Automation Agents

  • FellouAI/eko (★4278 TypeScript) - Eko is a production-ready JavaScript framework that enables developers to build reliable, multi-step agentic workflows using natural language commands across computer and browser environments.
  • OpenInterpreter/open-interpreter (★59845 Python) - Open Interpreter is an open-source natural language interface that enables users to run code locally on their computers through a chat-based system, facilitating tasks like multimedia editing, browser control, and data analysis with full user approval and flexibility beyond hosted solutions.
  • cjo4m06/mcp-shrimp-task-manager (★1424 TypeScript) - Shrimp Task Manager is an intelligent task management system for AI Agents that converts natural language into structured development tasks with dependency tracking, iterative refinement, and research capabilities to enhance programming workflows.
  • firerpa/lamda (★7168 Python) - FIRERPA/lamda is a robust and versatile Android RPA framework designed for next-generation mobile automation, supporting a wide range of Android versions and environments with extensive APIs and Python SDK for seamless AI integration and device control.
  • irgolic/AutoPR (★1342 Python) - AutoPR is a tool that runs AI-powered workflows over codebases to automate tasks such as documentation, issue tracking, and pull request management.
  • microsoft/UFO (★7511 Python) - UFO is a UI-Focused multi-agent framework that automates and navigates Windows OS applications using AI agents powered by GPT-4V to fulfill complex user requests across multiple applications.
  • sigoden/aichat (★7529 Rust) - AIChat is an all-in-one command-line interface tool that integrates multiple large language models and provides features like Shell Assistant, Chat-REPL, RAG, AI Tools, and Agents for enhanced AI-driven command-line productivity.

Workflow Automation Assistants

  • AIDC-AI/ComfyUI-Copilot (★2303 TypeScript) - ComfyUI-Copilot is an AI-powered intelligent assistant for the Comfy-UI framework that enhances workflow automation, provides natural language node recommendations, and offers real-time development support to simplify AI algorithm debugging and deployment.
  • Significant-Gravitas/AutoGPT (★177333 Python) - AutoGPT is a powerful AI platform that enables users to create, deploy, and manage continuous AI agents to automate complex workflows with options for self-hosting and cloud deployment.
  • Yifan-Song793/RestGPT (★1380 Python) - RestGPT is an autonomous agent that leverages large language models to control real-world applications by interacting with RESTful APIs through a structured planning and execution framework.
  • dtyq/magic (★1129 PHP) - Magic is the first open-source all-in-one AI productivity platform offering a comprehensive product matrix to help enterprises build and deploy intelligent AI applications for enhanced productivity.
  • kreneskyp/ix (★1032 Python) - iX is a platform for designing, deploying, and managing autonomous and semi-autonomous AI agents powered by GPT-4 and other large language models to automate complex workflows and tasks.
  • lmnr-ai/lmnr (★2196 TypeScript) - Laminar is an open-source all-in-one platform for engineering, tracing, evaluating, and managing AI products with scalable infrastructure and rich telemetry features.
  • n8n-io/self-hosted-ai-starter-kit (★10975 ) - The Self-hosted AI Starter Kit by n8n is an open-source Docker Compose template that quickly sets up a secure, local AI and low-code development environment integrating n8n, Ollama, Qdrant, and PostgreSQL for building self-hosted AI workflows.

AI Platform Configurators

  • OpenBMB/XAgent (★8404 Python) - XAgent is an open-source autonomous LLM-driven agent designed to automatically solve complex tasks with safety, extensibility, and human collaboration features.
  • ashishpatel26/500-AI-Agents-Projects (★3730 ) - A curated collection of over 500 AI agent use cases across multiple industries, showcasing practical applications and linking to open-source projects for implementation.
  • calcom/cal.com (★37370 TypeScript) - Cal.com is an open-source, customizable scheduling platform that offers full control over scheduling data and workflows, serving as a versatile alternative to Calendly.
  • semanser/codel (★2371 TypeScript) - Codel is a fully autonomous AI agent that performs complex tasks and projects using terminal, browser, and editor environments within a secure, self-hosted Docker setup.

Developer & Coding Assistants

IDE Integrations

  • bmadcode/cursor-custom-agents-rules-generator (★2178 Batchfile) - A project that automates the generation and management of Cursor IDE rules and custom agents to enhance AI-driven development workflows and enforce best practices.
  • coder/coder (★10535 Go) - Coder is a platform for creating secure, self-hosted cloud development environments that streamline developer onboarding and optimize resource usage through automation and integrations.
  • continuedev/continue (★27968 TypeScript) - Continue is an open-source platform that enables developers to create, share, and use custom AI code assistants within popular IDEs, enhancing coding productivity and collaboration through features like Chat, Autocomplete, Edit, and Agent.
  • eth-sri/lmql (★4014 Python) - LMQL is a Python-superset programming language designed for constraint-guided and efficient interaction with large language models, enabling advanced control flow, decoding, and integration with multiple LLM providers.
  • langchain-ai/langgraph-studio (★3025 ) - LangGraph Studio is a macOS desktop application for prototyping, visualizing, and debugging LangGraph LLM agent applications with collaborative features via LangSmith integration.
  • nicepkg/aide (★2638 TypeScript) - Aide is a Visual Studio Code extension that leverages AI to provide one-click code conversion, commenting, optimization, and batch processing features to enhance developer productivity.
  • sweepai/sweep (★7577 Jupyter Notebook) - Sweep is an AI-powered coding assistant plugin designed to enhance productivity and coding efficiency within JetBrains IDEs.

Code Generation & Refactoring

Code Search Engines

  • AbanteAI/archive-old-cli-mentat (★2575 Python) - This repository contains an archived and unsupported version of an old command-line tool named Mentat, now replaced by an AI-powered GitHub bot available at mentat.ai.
  • Aider-AI/aider (★36156 Python) - Aider is an AI-powered pair programming tool that integrates with your terminal and codebase to assist developers by leveraging large language models for coding, testing, and project management.
  • All-Hands-AI/OpenHands (★61386 Python) - OpenHands is an AI-powered software development platform that automates coding, command execution, web browsing, and API interactions to assist developers in building software more efficiently.
  • BloopAI/bloop (★9479 Rust) - Bloop is a fast, AI-powered code search engine that enables natural language queries, code understanding, and generation to boost developer productivity using existing codebases as context.
  • SkalskiP/awesome-chatgpt-code-interpreter-experiments (★1011 ) - A curated collection of experiments and demonstrations showcasing the creative and advanced capabilities of the ChatGPT and Code Interpreter combination for AI-assisted coding and data processing.
  • dot-agent/nextpy (★2302 Python) - Nextpy is a cutting-edge framework for building self-modifying AI software with advanced prompt engineering, modularity, and significant performance optimizations.
  • melih-unsal/DemoGPT (★1851 Python) - DemoGPT is an open-source all-in-one library that automates the creation of interactive LLM agent applications by generating LangChain code and transforming it into Streamlit apps, leveraging GPT-3.5-turbo and supporting future autonomous API integration.
  • zilliztech/code-context (★988 TypeScript) - Code Context is an MCP plugin that adds semantic code search capabilities to AI coding assistants like Claude Code and Gemini CLI, enabling deep, context-aware search across large codebases.

AI Developer Tools

  • Col-E/Recaf (★6552 Java) - Recaf is a modern, user-friendly Java bytecode editor that simplifies bytecode manipulation, supports decompilation, recompilation, deobfuscation, and extends functionality through scripting and plugins.
  • TabbyML/tabby (★31842 Rust) - Tabby is a self-hosted, open-source AI coding assistant providing a private and customizable alternative to GitHub Copilot with rich integration and deployment options.
  • ai-christianson/RA.Aid (★2100 Python) - RA.Aid is an autonomous software development agent that uses a three-stage AI-driven process to research, plan, and implement complex multi-step programming tasks with optional human interaction and advanced AI integrations.
  • cursor/cursor (★30915 ) - Cursor is an AI-powered code editor designed to enhance programming productivity by integrating intelligent code suggestions and AI-driven features directly into the coding workflow.
  • entropy-research/Devon (★3443 Python) - Devon is an open-source AI-powered pair programming tool that assists developers with multi-file editing, code exploration, bug fixing, and test writing through integration with advanced AI models and a user-friendly interface.
  • eylonmiz/react-agent (★1619 TypeScript) - ReactAgent is an open-source autonomous agent that uses GPT-4 to generate and compose React components from user stories, facilitating rapid UI prototyping and development.
  • getcursor/cursor (★30344 ) - Cursor is an AI-powered code editor designed to enhance programming productivity by integrating intelligent code suggestions and automation features.
  • oraios/serena (★5562 Python) - Serena is a free and open-source coding agent toolkit that enhances large language models with semantic code retrieval and editing capabilities, enabling autonomous coding workflows directly on your codebase.
  • plandex-ai/plandex (★14214 Go) - Plandex is an open-source AI coding agent designed to plan, execute, and manage large-scale coding tasks and real-world projects with high efficiency and flexibility.
  • smallcloudai/refact (★3114 Rust) - Refact is an open-source AI software development agent that automates end-to-end engineering tasks by integrating with developer tools, supporting multiple programming languages, and providing intelligent code assistance and automation.
  • smol-ai/developer (★12108 Python) - Smol Developer is a library and toolset that embeds a junior developer AI agent into applications to scaffold and generate codebases interactively through human-in-the-loop prompt engineering.
  • sourcegraph/cody (★3792 TypeScript) - Cody is an AI-powered coding assistant that uses advanced search and codebase context to help developers understand, write, and fix code faster within their IDEs.
  • unit-mesh/auto-dev (★4039 Kotlin) - AutoDev is an AI-powered coding assistant plugin for IntelliJ IDEA that offers multilingual support, automatic code generation, bug detection, customizable prompts, and a suite of interactive development tools to enhance the software development lifecycle.

Smart Contract Platforms

  • AntonOsika/gpt-engineer (★54609 Python) - gpt-engineer is an open-source CLI platform that enables users to generate and improve software code through natural language instructions using AI models, supporting various AI backends and community collaboration.
  • Pythagora-io/gpt-pilot (★33231 Python) - GPT Pilot is an AI-powered developer companion that automates the creation of fully working, production-ready applications by simulating a multi-agent software development team and providing comprehensive coding, debugging, and review capabilities.
  • TransformerOptimus/SuperCoder (★950 Go) - SuperCoder is an open-source autonomous software development system that automates coding, testing, and deployment tasks using advanced AI tools and supports multiple frameworks like Flask, Django, and NextJS.
  • aws/amazon-q-developer-cli (★1327 Rust) - Amazon Q Developer CLI enhances terminal productivity on macOS and Linux by providing IDE-style autocomplete, natural language chat, contextual awareness, and AI-driven automation for popular command-line tools.
  • joshpxyne/gpt-migrate (★6973 Python) - GPT-Migrate is a tool that automates the migration of codebases from one programming language or framework to another using GPT-4 and Docker environments, with support for testing and iterative debugging.
  • kuafuai/DevOpsGPT (★5949 HTML) - DevOpsGPT is an AI-driven software development automation system that uses large language models combined with DevOps tools to convert natural language requirements into working software, improving efficiency and accelerating delivery.
  • potpie-ai/potpie (★4813 Python) - Potpie is an open-source platform that creates AI agents specialized in your codebase to automate code analysis, testing, debugging, and development tasks, enhancing software engineering productivity.
  • stitionai/devika (★19429 Python) - Devika is an open-source agentic AI software engineer that understands high-level instructions, researches information, and writes code to autonomously achieve software development objectives.

Documentation & Testing Assistants

  • eli64s/readme-ai (★2302 Python) - ReadmeAI is a developer tool that automatically generates customizable README files using advanced language models and supports multiple AI providers and offline mode.
  • idosal/git-mcp (★3658 TypeScript) - GitMCP is a free, open-source remote MCP server that transforms any GitHub project into an AI-accessible documentation hub to eliminate code hallucinations and provide accurate, up-to-date project information.
  • samchon/nestia (★2047 TypeScript) - Nestia is a powerful helper library suite for NestJS that enhances API development, testing, and AI chatbot integration with high performance and developer-friendly tools.
  • steipete/agent-rules (★3417 Shell) - Agent Rules is a repository of reusable rules and knowledge documents designed to improve the effectiveness of AI coding assistants like Claude Code and Cursor in software development workflows.
  • truera/trulens (★2675 Python) - TruLens is a tool for systematic evaluation and tracking of large language model experiments, enabling developers to identify failure modes and iteratively improve their applications through fine-grained instrumentation and comprehensive feedback.

GUI & Computer Control AI Agents

Desktop Automation

  • BAAI-Agents/Cradle (★2222 Python) - Cradle is a framework that enables foundation AI models to perform complex computer tasks through human-like interaction using screenshots and keyboard/mouse controls, supporting various games and software applications.
  • OthersideAI/self-operating-computer (★9807 Python) - The Self-Operating Computer Framework enables multimodal AI models to autonomously operate a computer by interpreting screen visuals and executing mouse and keyboard actions to achieve user-defined objectives.
  • asweigart/pyautogui (★11686 Python) - PyAutoGUI is a cross-platform Python module that enables programmatic control of the mouse and keyboard for GUI automation, including features like screen capture and image recognition.
  • bytebot-ai/bytebot (★925 TypeScript) - Bytebot is a containerized Linux desktop environment that enables programmatic control and remote access for automation, scraping, CI tasks, and remote work through a unified REST API and VNC interfaces.
  • bytedance/UI-TARS-desktop (★15399 TypeScript) - UI-TARS Desktop is a GUI Agent application that uses a Vision-Language Model to enable natural language control of computers across multiple platforms with real-time visual recognition and automation capabilities.
  • e2b-dev/open-computer-use (★1400 Python) - Open Computer Use is a secure cloud Linux computer environment controlled by open-source large language models, enabling AI-driven operation via keyboard, mouse, and shell commands with live display streaming.
  • mediar-ai/screenpipe (★15364 TypeScript) - Screenpipe is an open-source AI app store that continuously records and indexes 24/7 desktop and microphone activity locally, enabling developers to build highly contextual AI-powered desktop applications called pipes, which can be shared, monetized, and run securely in a sandboxed environment.
  • nut-tree/nut.js (★2593 TypeScript) - nut.js is a cross-platform native UI automation and testing tool enabling keyboard, mouse, and image-based screen interactions for automating user interfaces.
  • showlab/computer_use_ootb (★1643 Python) - Computer Use OOTB is an out-of-the-box GUI agent solution for Windows and macOS that enables desktop automation and remote control using API-based and local AI models with a user-friendly Gradio interface.
  • wonderwhy-er/DesktopCommanderMCP (★4045 TypeScript) - Desktop Commander MCP is a server that integrates with the Claude desktop app to provide terminal control, file system search, and diff-based file editing capabilities, enabling advanced AI-assisted desktop management.
  • yuruotong1/autoMate (★3627 Python) - autoMate is an AI-driven local automation assistant that uses natural language commands to enable users to automate complex computer tasks without programming, supporting local deployment and full interface control for enhanced privacy and flexibility.

Browser & Web Automation

Browser Automation Tools

  • McGill-NLP/webllama (★1410 Python) - WebLlama is a project that builds advanced AI agents using Meta's Llama 3 model to browse the web by following instructions and engaging in dialogue, surpassing GPT-4V in web navigation benchmarks.
  • browser-use/browser-use (★66548 Python) - Browser-use is a project that enables AI agents to control and automate web browsers, allowing users to perform complex tasks through natural language commands and AI-driven browser interactions.
  • browserbase/stagehand (★14842 TypeScript) - Stagehand is a production-ready AI web browsing framework that combines code and natural language to enable reliable and extensible browser automation.
  • nat/natbot (★1929 Python) - Natbot is a project that enables controlling a web browser using GPT-3 to perform intelligent and flexible browser automation tasks.
  • zabirauf/AutoGPT.js (★1029 TypeScript) - AutoGPT.js is an open-source project that brings the powerful capabilities of AutoGPT to run directly in web browsers, enabling local GPT agent execution with enhanced accessibility and privacy.

Web Agents

  • lavague-ai/LaVague (★6101 Python) - LaVague is an open-source Large Action Model framework for developing AI Web Agents that automate web-based tasks using components like a World Model and Action Engine with support for Selenium, Playwright, and Chrome extension drivers.
  • lmnr-ai/index (★2288 Python) - Index is an advanced open-source browser agent that autonomously performs complex web tasks using AI models, offering hosted and self-hosted options with strong observability features.
  • reworkd/tarsier (★1713 Jupyter Notebook) - Tarsier is a vision utility that enhances web interaction agents by tagging webpage elements and converting screenshots into text representations to enable effective LLM-based web automation.

Data Processing & ETL Agents

NL AI Frameworks

  • HumanSignal/Adala (★1228 Python) - Adala is an autonomous data labeling agent framework that enables intelligent agents to learn and perform diverse data processing and labeling tasks through iterative learning using large language models.
  • Mintplex-Labs/vector-admin (★1979 TypeScript) - VectorAdmin is a universal and user-friendly tool suite for managing multiple vector databases, providing full control over vector data with multi-user support, embedding management, and cloud deployment capabilities.
  • ScrapeGraphAI/Scrapegraph-ai (★20799 Python) - ScrapeGraphAI is an AI-powered Python library that creates intelligent web scraping pipelines using large language models to extract structured data from websites and local documents efficiently.
  • apache/doris (★14031 Java) - Apache Doris is a high-performance, real-time analytical database with a storage-compute integrated architecture, designed for sub-second query response and high concurrency in diverse data analysis scenarios.
  • axflow/axflow (★1122 TypeScript) - Axflow is a modular TypeScript framework designed to simplify the development of robust natural language AI applications by providing scalable components for data connection, model evaluation, and more.
  • bespokelabsai/curator (★1464 Python) - Bespoke Curator is a Python-based library for scalable synthetic data curation and structured data extraction, supporting bulk inference, asynchronous operations, and integration with various inference APIs.
  • business-science/ai-data-science-team (★2331 Python) - AI Data Science Team is a Python library featuring AI-powered agents that automate and accelerate common data science tasks, including data cleaning, feature engineering, machine learning, and exploratory data analysis, to improve productivity and efficiency.
  • hitsz-ids/synthetic-data-generator (★2373 Python) - Synthetic Data Generator (SDG) is a comprehensive framework for generating high-quality, privacy-compliant synthetic structured tabular data using advanced statistical and LLM-based models, optimized for big data applications.
  • infiniflow/ragflow (★61247 TypeScript) - RAGFlow is an open-source Retrieval-Augmented Generation engine that leverages deep document understanding and Large Language Models to provide accurate, citation-backed question-answering from complex and diverse data sources.
  • pydantic/pydantic (★24693 Python) - Pydantic is a Python library for fast and extensible data validation using Python type hints, enabling developers to define and validate data models efficiently.
  • scouter-project/scouter (★2140 Java) - Scouter is an open source Application Performance Management (APM) tool that monitors and manages the performance of software applications and system resources across various platforms and services.
  • ucbepic/docetl (★2440 Python) - DocETL is a system for creating and executing complex document processing and data transformation pipelines powered by large language models, featuring an interactive UI playground and a Python package for production use.

Document Analytics Platforms

  • georgia-tech-db/evadb (★2684 Python) - EvaDB is an open-source database system that empowers developers to build AI-powered applications by integrating advanced AI models and optimizations within a SQL-based querying framework for both structured and unstructured data.
  • loggie-io/loggie (★1305 Go) - Loggie is a lightweight, cloud-native data transfer agent and aggregator designed for scalable, high-performance log collection, processing, and monitoring in Kubernetes environments.

Domain-Specific AI Agents

Financial & Trading Systems

  • pskrunner14/trading-bot (★1102 Jupyter Notebook) - A Stock Trading Bot that uses Deep Q-Learning, a form of Deep Reinforcement Learning, to learn and execute trading strategies based on historical stock market data.

Educational & Learning Agents

  • FareedKhan-dev/all-rl-algorithms (★1007 Jupyter Notebook) - A comprehensive educational repository providing clear, step-by-step Python implementations and explanations of a wide range of reinforcement learning algorithms for interactive learning and deep understanding.
  • TIGER-AI-Lab/TheoremExplainAgent (★1348 Python) - TheoremExplainAgent is an AI system that generates long-form Manim videos to visually explain mathematical theorems, enhancing understanding through multimodal explanations combining text, visuals, and audio.
  • ashishps1/learn-ai-engineering (★1810 ) - A comprehensive collection of free resources to learn AI, machine learning, large language models, and AI agents from scratch.
  • microsoft/ai-agents-for-beginners (★33109 Jupyter Notebook) - AI Agents for Beginners is a Microsoft educational course with 10 lessons teaching the fundamentals and practical skills to build AI agents using Azure AI and GitHub models, supported by multi-language resources and community engagement.

Gaming & Simulation Agents

Generative Agents Simulation

  • GDQuest/godot-steering-ai-framework (★1326 GDScript) - A comprehensive Godot framework for creating smooth and complex AI motion using steering behaviors in both 2D and 3D games.
  • MineDojo/Voyager (★6250 JavaScript) - Voyager is an open-ended embodied agent powered by large language models that autonomously explores and learns diverse skills in Minecraft through lifelong learning and an evolving skill library.
  • dzoba/gptrpg (★990 JavaScript) - GPTRPG is a proof-of-concept project featuring a GPT-based AI agent that autonomously interacts within a simple RPG-like environment using the OpenAI API.
  • joonspk-research/generative_agents (★19379 ) - This project provides a simulation framework for generative agents that mimic believable human behaviors within an interactive game environment, enabling running, saving, replaying, and demonstrating agent-based simulations.
  • linyiLYi/street-fighter-ai (★6489 Python) - SFighterAI is a deep reinforcement learning-based AI agent designed to master and beat the final boss in Street Fighter II: Special Champion Edition using only game screen pixel data.
  • mkturkcan/generative-agents (★979 Jupyter Notebook) - A locally runnable, low-cost implementation of generative agents simulating human-like behavior in an interactive Dungeons & Dragons setting using large language models.
  • ntasfi/PyGame-Learning-Environment (★1048 Python) - PyGame Learning Environment (PLE) is a Python-based reinforcement learning platform that provides a variety of games and tools to facilitate the development and testing of reinforcement learning algorithms.

Virtual Town AI

  • a16z-infra/ai-town (★8617 TypeScript) - AI Town is a deployable starter kit for building and customizing a virtual town where AI characters live, chat, and socialize, featuring a robust backend and support for multiple LLMs and deployment options.

Low-Code/No-Code Platforms

  • Mintplex-Labs/anything-llm (★47188 JavaScript) - AnythingLLM is an all-in-one AI application that enables intelligent chat interactions with documents using various large language models, supporting multi-user management, no-code AI agent building, and multi-modal capabilities in both desktop and Docker environments.
  • Onelevenvy/flock (★933 TypeScript) - Flock is a workflow-based low-code platform for rapidly building chatbots, RAG applications, and coordinating multi-agent teams, featuring modular nodes, human-in-the-loop support, and integration with advanced AI tools.
  • getmaxun/maxun (★13313 TypeScript) - Maxun is an open-source no-code platform that enables users to create custom robots for automated web data extraction, turning websites into APIs and spreadsheets without coding.
  • gmpetrov/databerry (★2935 ) - Databerry is a no-code platform that enables users to build custom Large Language Model agents without programming skills, making AI development accessible and user-friendly.
  • trypromptly/LLMStack (★1994 Python) - LLMStack is a no-code multi-agent framework for building generative AI agents, workflows, and applications that integrate with your data and business processes, deployable on cloud or on-premise with multi-tenant support and API access.

Agent Communication & Protocols

Multi-Agent Platforms

  • Div99/agent-protocol (★1234 Python) - Agent Protocol is a tech-agnostic API specification providing a standardized interface for seamless communication and integration with AI agents across different frameworks.
  • IBM/mcp-context-forge (★960 Python) - IBM MCP Context Forge is a production-grade Model Context Protocol Gateway that unifies REST and MCP services into a single secure, scalable endpoint for AI clients with federation, virtualization, multi-protocol support, and an optional Admin UI.
  • TooTallNate/proxy-agents (★1073 TypeScript) - TooTallNate/proxy-agents is a Node.js monorepo containing various HTTP Proxy Agent implementations that enable routing HTTP requests through different proxy protocols.
  • a2aproject/A2A (★18704 TypeScript) - Agent2Agent (A2A) is an open protocol enabling secure communication and interoperability between diverse generative AI agents to foster collaboration and innovation in the AI ecosystem.
  • jlowin/fastmcp (★15413 Python) - FastMCP is a Python framework that simplifies building MCP servers and clients, enabling standardized and efficient interactions with large language models through tools, resources, and prompts.
  • sonnylazuardi/cursor-talk-to-figma-mcp (★4898 JavaScript) - Cursor Talk to Figma MCP is a project that integrates Cursor AI with Figma using a Model Context Protocol to enable programmatic reading and modification of Figma designs for enhanced design automation and workflow efficiency.

Communication Protocols

  • agent-network-protocol/AgentNetworkProtocol (★940 HTML) - AgentNetworkProtocol (ANP) is an open-source protocol designed to enable secure, decentralized, and efficient communication and collaboration among intelligent agents, aiming to become the HTTP of the Agentic Web era.

Simulation & Benchmarking Environments

Multimodal Model Benchmarks

  • THUDM/AgentBench (★2710 Python) - AgentBench is a comprehensive benchmark platform designed to evaluate large language models as autonomous agents across diverse environments and tasks, facilitating research and development in LLM-based agent capabilities.
  • google-deepmind/acme (★3756 Python) - Acme is a flexible and scalable research framework providing modular reinforcement learning components and agents for developing and benchmarking RL algorithms.

Sensor Fusion Agents

  • google-deepmind/android_env (★1114 Python) - AndroidEnv is a Python library by DeepMind that provides a Reinforcement Learning platform by simulating Android devices, enabling agents to interact with real-world Android applications through touchscreen gestures for diverse RL tasks.
  • web-arena-x/webarena (★1076 Python) - WebArena is a self-hostable web environment designed for building and evaluating autonomous agents capable of realistic web navigation and interaction tasks.
  • xlang-ai/OSWorld (★1994 Python) - OSWorld is a benchmarking platform for evaluating multimodal AI agents performing open-ended tasks in real computer environments using virtual machines and various virtualization technologies.

Web Automation and UI Interaction

Browser Automation

Web Automation Systems

  • EmergenceAI/Agent-E (★1153 Python) - Agent-E is an agent-based automation system that enables natural language-driven web browser automation and multi-agent orchestration for personal and enterprise use.
  • TaxyAI/browser-extension (★1225 TypeScript) - Taxy AI is an open-source Chrome extension that uses GPT-4 to automate repetitive browser tasks by interpreting natural language instructions and interacting with web pages through an action cycle.
  • brightdata/brightdata-mcp (★1009 JavaScript) - Bright Data MCP is a powerful Model Context Protocol server that enables AI agents and applications to access and extract real-time web data seamlessly, bypassing geo-restrictions and bot protections for enhanced web scraping and navigation.
  • fake-useragent/fake-useragent (★3946 Python) - fake-useragent is a Python package that provides an up-to-date and customizable user-agent faker using a real-world database for realistic user-agent strings.
  • ishan0102/vimGPT (★2669 Python) - vimGPT is a project that combines GPT-4V's vision capabilities with the Vimium keyboard navigation extension to enable AI-assisted web browsing through visual and keyboard interactions.
  • nanobrowser/nanobrowser (★8097 TypeScript) - Nanobrowser is an open-source Chrome extension that enables AI-powered web automation through a multi-agent system using user-configured LLM API keys, offering a privacy-focused and cost-effective alternative to commercial tools like OpenAI Operator.
  • nottelabs/notte (★1116 Python) - Notte is an open-source full stack framework that creates intelligent web browsing agents using a perception layer to enable fast, reliable, and cost-effective interactions with websites through large language models.
  • steel-dev/steel-browser (★4872 TypeScript) - Steel Browser is an open-source browser API that enables developers to build AI-powered web agents and automation tools with full browser control, session management, proxy support, and debugging features, simplifying web automation without infrastructure overhead.

AI-Powered Agents

  • Integuru-AI/Integuru (★4406 Python) - Integuru is an AI agent that generates integration code by reverse-engineering platforms' internal APIs using browser network requests and OpenAI models to automate platform interactions without requiring permission.
  • Planetary-Computers/autotab-starter (★1007 Python) - Autotab-starter is an AI-powered tool that enables users to create auditable browser automation scripts by recording interactions in Chrome and converting them into Python code for automating real-world web tasks.

UI Interaction

GUI Action Mapping

  • THUDM/CogAgent (★985 Python) - CogAgent is an open-source, bilingual Vision-Language Model-based GUI agent that enables advanced interaction with graphical user interfaces through visual and natural language inputs, achieving state-of-the-art performance in GUI tasks and benchmarks.
  • ddupont808/GPT-4V-Act (★1045 JavaScript) - GPT-4V-Act is an AI agent that uses GPT-4V(ision) to interact with web user interfaces through mouse and keyboard inputs, enabling enhanced accessibility and automation.
  • mobile-next/mobile-mcp (★1551 TypeScript) - Mobile Next MCP is a platform-agnostic Model Context Protocol server enabling scalable mobile automation and interaction with iOS and Android devices, simulators, and emulators through accessibility snapshots and coordinate-based controls.
  • naver/egjs (★941 JavaScript) - egjs is a modular collection of JavaScript components designed to simplify and accelerate the development of customizable web applications with a focus on ease of use and performance.
  • wandb/openui (★21618 TypeScript) - OpenUI is an open-source tool that enables users to describe user interfaces using natural language and see them rendered live, leveraging large language models to generate UI components across multiple frontend frameworks.

Visual GUI Agents

  • TencentQQGYLab/AppAgent (★6045 Python) - AppAgent is an LLM-based multimodal agent framework that enables AI agents to operate smartphone applications through human-like interactions such as tapping and swiping without backend access.
  • agno-agi/agent-ui (★940 TypeScript) - Agent UI is a modern, customizable chat interface for AI agents featuring real-time streaming, tool call visualization, reasoning steps, and multi-modal content support, built with Next.js and Tailwind CSS.
  • showlab/ShowUI (★1398 Python) - ShowUI is an open-source, end-to-end vision-language-action model designed for intelligent GUI agents and computer interaction, featuring advanced training, inference, and deployment capabilities.
  • simular-ai/Agent-S (★5907 Python) - Agent S is an open-source framework that enables autonomous agents to interact with computers through GUIs, leveraging large language models and web retrieval to perform complex tasks with state-of-the-art performance.
  • zai-org/CogAgent (★1007 Python) - CogAgent is an advanced open-source vision-language model-based GUI agent supporting bilingual interaction and excelling in GUI perception, reasoning, and task generalization.

Personal Assistants & Conversational Agents

Chatbots

Chatbot Platforms

  • CopilotKit/CopilotKit (★22188 TypeScript) - CopilotKit is a React-based framework for building deeply integrated AI assistants, chatbots, and in-app AI agents that enhance user interactions through natural language and contextual understanding.
  • LLPhant/LLPhant (★1298 PHP) - LLPhant is a comprehensive PHP generative AI framework inspired by Langchain, supporting multiple LLMs and designed to simplify building powerful AI applications.
  • SylphAI-Inc/AdalFlow (★3482 Python) - AdalFlow is a PyTorch-like library that enables building and auto-optimizing large language model applications, including chatbots, RAG, and agents, with a unified framework for prompt optimization and model-agnostic flexibility.
  • langchain-ai/streamlit-agent (★1520 Python) - Reference implementations of various LangChain agents as interactive Streamlit applications demonstrating conversational AI capabilities and integrations.
  • lm-sys/FastChat (★38917 Python) - FastChat is an open platform for training, serving, and evaluating large language model based chatbots, featuring multi-model serving, extensive model support, and large-scale evaluation datasets.
  • microsoft/TypeChat (★8536 TypeScript) - TypeChat is a library that simplifies building natural language interfaces by using types to define intents and schemas, replacing traditional prompt engineering with schema engineering for improved validation and reliability.
  • plurai-ai/intellagent (★1102 Python) - IntellAgent is a multi-agent framework that simulates realistic user interactions to diagnose, evaluate, and optimize conversational AI agents for improved performance and reliability.
  • run-llama/rags (★6489 Python) - RAGs is a Streamlit app that enables users to build and interact with personalized Retrieval-Augmented Generation pipelines over their own data using natural language instructions.
  • yym68686/ChatGPT-Telegram-Bot (★1085 Python) - TeleChat is a versatile AI-powered Telegram bot that integrates multiple advanced language models and APIs to provide multimodal conversational AI, web search, and document processing capabilities within Telegram chats.

AI Agents

  • Anil-matcha/Awesome-GPT-Store (★1487 ) - Awesome GPT Store is a curated collection of specialized Custom GPT chatbots designed to assist users across various domains and tasks.
  • JetBrains/koog (★1395 Kotlin) - Koog is a Kotlin-based framework by JetBrains for building scalable, production-ready AI agents that operate across multiple platforms and integrate with major LLM providers.
  • Kav-K/GPTDiscord (★1845 Python) - GPTDiscord is an all-in-one, robust OpenAI-powered Discord bot offering ChatGPT-style conversations, image generation, AI moderation, custom knowledge bases, internet-connected chat, and advanced code interpretation features.
  • MrLesk/Backlog.md (★2819 TypeScript) - Backlog.md is a Markdown-native task manager and Kanban visualizer that transforms any Git repository into a self-contained project board with AI integration and a modern web interface.
  • NVIDIA/NeMo-Guardrails (★4934 Python) - NeMo Guardrails is an open-source toolkit by NVIDIA that enables developers to add programmable guardrails to large language model-based conversational systems to ensure safer, more controlled, and trustworthy interactions.
  • SciSharp/BotSharp (★2817 C#) - BotSharp is an open-source AI multi-agent framework on .NET Core that enables enterprise developers to build and integrate intelligent conversational AI assistants and platforms with modular plugins and multi-agent support.
  • ThinkInAIXYZ/deepchat (★3749 TypeScript) - DeepChat is an open-source, cross-platform AI chat platform that unifies multiple large language models with advanced tool calling, search enhancement, and local model integration for a powerful and privacy-focused AI assistant experience.
  • botpress/botpress (★13986 TypeScript) - Botpress is an open-source platform for building and deploying advanced chatbots and AI assistants powered by GPT and large language models, featuring integrations, development tools, and community support.
  • darrenburns/elia (★2238 Python) - Elia is a keyboard-centric terminal user interface for interacting efficiently with various large language models, supporting both cloud and local AI models with customizable themes and configurations.
  • langchain-ai/agent-chat-ui (★1158 TypeScript) - Agent Chat UI is a Next.js web application that enables users to interact with LangGraph agents via a chat interface, supporting both local development and production deployment with configurable authentication and message visibility features.
  • langchain-ai/langgraph-swarm-py (★1062 Python) - LangGraph Multi-Agent Swarm is a Python library that enables the creation of dynamic swarm-style multi-agent systems with specialized agents collaborating and handing off control based on their expertise.
  • lobehub/lobe-chat (★63904 TypeScript) - Lobe Chat is an open-source, modern AI chat framework supporting multiple AI providers, multi-modal interactions, knowledge base integration, and easy deployment for private AI chat applications.
  • mayooear/ai-pdf-chatbot-langchain (★15729 TypeScript) - AI PDF Chatbot & Agent that ingests PDFs, stores embeddings in Supabase, and answers queries using OpenAI and LangChain/LangGraph orchestration frameworks.
  • vercel/ai (★16119 TypeScript) - The AI SDK by Vercel is a TypeScript toolkit that enables developers to build AI-powered applications and chatbots across popular frameworks and runtimes using multiple AI model providers.
  • wshobson/agents (★977 ) - A comprehensive collection of 44 specialized AI subagents for Claude Code that enhance development workflows with domain-specific expertise across software development, infrastructure, security, data, and business domains.

Virtual Assistants

  • CherryHQ/cherry-studio (★30770 TypeScript) - Cherry Studio is a cross-platform desktop client that supports multiple large language model providers, offering extensive AI assistant features, document processing, and practical productivity tools in an open-source environment.
  • OS-Copilot/OS-Copilot (★1674 Python) - OS-Copilot is an open-source project that develops a self-improving embodied conversational AI agent integrated into operating systems to automate diverse daily computer tasks.
  • gptme/gptme (★3938 Python) - gptme is a versatile personal AI assistant that operates in your terminal, equipped with tools for coding, file editing, web browsing, and vision, providing a powerful local alternative to popular AI agents for diverse knowledge work.
  • khoj-ai/khoj (★30623 Python) - Khoj is a self-hostable personal AI platform that transforms any local or online large language model into a customizable, autonomous AI assistant for research, automation, and document retrieval.
  • kortix-ai/suna (★17075 TypeScript) - Suna is an open-source generalist AI assistant that automates real-world tasks through natural conversation, combining browser automation, file management, web crawling, and API integrations in a secure, modular architecture.
  • mem0ai/mem0 (★37551 Python) - Mem0 is an open-source memory layer that enhances AI agents and assistants with personalized, adaptive memory capabilities for improved context-aware interactions across various applications.
  • menloresearch/jan (★35201 TypeScript) - Jan is an open-source offline AI assistant that serves as a ChatGPT alternative, running locally on various hardware platforms to provide privacy and full user control.
  • myshell-ai/AIlice (★1224 Python) - AIlice is a fully autonomous, general-purpose AI agent designed as a standalone AI assistant that leverages large language models to perform complex tasks, support voice interaction, and enable self-evolution of AI capabilities.

Human-in-the-Loop Agents

  • humanlayer/humanlayer (★1014 Python) - HumanLayer is an API and SDK that enables AI agents to safely interact with humans for feedback and approvals, ensuring human oversight in high-stakes function calls across various communication platforms.
  • langchain-ai/agents-from-scratch (★1074 Jupyter Notebook) - Agents From Scratch is a modular guide and implementation for building an intelligent email assistant with human-in-the-loop and memory capabilities using LangGraph and Gmail API integration.

Audio & Voice Assistants

  • 2noise/ChatTTS (★37311 Python) - ChatTTS is a generative text-to-speech model optimized for natural and expressive dialogue-based speech synthesis, supporting multi-speaker and fine-grained prosody control for conversational AI applications.
  • Cloud-Code-AI/BrowserAI (★1077 TypeScript) - BrowserAI is an open-source project that enables running local large language models directly in the browser with WebGPU acceleration, providing private, offline-capable, and production-ready AI functionalities including text generation, speech recognition, and text-to-speech.
  • OpenInterpreter/01 (★5072 Python) - 01 is an open-source voice interface platform enabling natural language interaction with intelligent devices across desktop, mobile, and ESP32 hardware.
  • TEN-framework/TEN-Agent (★5634 Python) - TEN Agent is a real-time conversational voice AI agent integrating multiple AI technologies and hardware to enable seeing, hearing, and speaking capabilities, compatible with platforms like Dify and Coze.
  • livekit/agents (★6907 Python) - LiveKit Agents is an open-source framework for building real-time voice AI agents with integrated speech-to-text, large language models, text-to-speech, and telephony capabilities.
  • neonbjb/tortoise-tts (★14462 Jupyter Notebook) - Tortoise is a high-quality multi-voice text-to-speech system focused on realistic prosody and intonation, offering various usage modes and advanced performance optimizations.
  • openinterpreter/01 (★5080 Python) - 01 is an open-source voice interface platform that enables natural language voice control across desktop, mobile, and ESP32 devices, offering extensive customization and hardware support.

Security & Privacy Agents

Runtime Analysis Tools

  • FunnyWolf/Viper (★4441 Batchfile) - VIPER is a versatile and powerful red team platform that supports adversary simulation and cybersecurity assessments across multiple operating systems, enhanced with AI-driven automation and extensive post-exploitation capabilities.
  • amonapp/amon (★1327 Python) - Amon is a modern, open-source server monitoring platform designed to help users track and manage server performance and uptime effectively.
  • fortra/impacket (★14621 Python) - Impacket is a Python library offering low-level programmatic access and implementations for various network protocols, focusing on security research and educational use.
  • invariantlabs-ai/mcp-scan (★933 Python) - MCP-Scan is a security tool that statically and dynamically scans MCP connections to detect and prevent vulnerabilities like prompt injections, tool poisoning, and cross-origin escalations, offering real-time monitoring and customizable guardrails for enhanced protection.
  • openrecall/openrecall (★2274 Python) - OpenRecall is a fully open-source, privacy-focused digital memory tool that captures and makes searchable your digital history through screenshots, supporting Windows, macOS, and Linux with local data storage and advanced semantic search capabilities.
  • zylon-ai/private-gpt (★56372 Python) - PrivateGPT is a fully private, offline-capable AI platform that enables users to interact with their documents using Large Language Models through a comprehensive API and user-friendly interface, ensuring data never leaves the user's environment.

Post-Exploitation Agents

  • AprilNEA/AChat (★3270 TypeScript) - AChat is an open-source, self-hosted AI platform designed for enterprises and teams, combining local processing with remote synchronization for centralized AI conversation management and collaboration.
  • CyberAlbSecOP/Awesome_GPT_Super_Prompting (★2784 ) - A comprehensive curated repository focused on GPT prompt engineering, security vulnerabilities, jailbreak techniques, prompt leaks, injections, and adversarial machine learning for large language models.
  • Ne0nd0g/merlin (★5320 Go) - Merlin is a cross-platform post-exploitation Command & Control server and agent supporting multiple protocols and advanced encryption for secure and flexible red teaming operations.
  • PromtEngineer/localGPT (★21743 Python) - LocalGPT enables secure, private conversations with your documents on your local device using various open-source GPT models and embeddings without any data leaving your computer.
  • fr0gger/Awesome-GPT-Agents (★6166 ) - A curated community-driven repository compiling diverse GPT agents specialized in offensive and defensive cybersecurity tasks, providing AI-powered tools for malware analysis, threat intelligence, vulnerability assessment, and more.
  • guardrails-ai/guardrails (★5345 Python) - Guardrails is a Python framework that enhances AI application reliability by providing input/output guards to detect and mitigate risks and generate structured data from large language models.
  • quark-engine/quark-engine (★1498 Python) - Quark Agent is an AI-powered tool that enables natural language-based detection and analysis of vulnerabilities and malware in Android APKs through intuitive workflows and Quark Script integration.
  • sauravpanda/BrowserAI (★1176 TypeScript) - BrowserAI is an open-source platform that enables running production-ready large language models and AI functionalities directly in the browser with WebGPU acceleration, ensuring privacy, offline capability, and no server infrastructure requirements.

Red-Teaming Platforms

  • beclab/Olares (★2240 Shell) - Olares is an open-source sovereign cloud operating system that enables users to run local AI models, manage personal data, and self-host various applications securely and privately on their own hardware.

Corporate and Analytical Applications

RAG and Business Analytics

  • Canner/WrenAI (★8847 TypeScript) - Wren AI is an open-source Generative Business Intelligence AI agent that enables data-driven teams to interact with their data using natural language to generate SQL queries, visualizations, and reports for smarter business insights.
  • newrelic/newrelic-ruby-agent (★1204 Ruby) - The New Relic Ruby agent is a comprehensive performance monitoring tool for Ruby applications that helps identify issues, analyze business data, and improve customer experience.
  • vanna-ai/vanna (★19652 Python) - Vanna is an open-source Python framework that uses Retrieval-Augmented Generation and large language models to convert natural language questions into accurate SQL queries for various databases, simplifying data access and querying.

Data Integration and Specialized Solutions

  • Scale3-Labs/langtrace (★993 TypeScript) - Langtrace is an open-source observability tool providing end-to-end tracing, real-time monitoring, and analytics for large language model applications, supporting multiple LLM providers, frameworks, and vector databases with SDKs for TypeScript and Python.
  • ZeroQI/Hama.bundle (★1272 Python) - Hama.bundle is a Plex metadata agent that enriches anime series and movies with detailed metadata from AniDB, TVDB, and TMDB, supporting local media assets and community-driven updates.
  • airweave-ai/airweave (★2813 Python) - Airweave is a platform that turns any app, database, URL, or API into a semantically searchable server for AI agents, enabling easy retrieval of structured and unstructured data through REST and MCP endpoints.
  • getzep/graphiti (★15425 Python) - Graphiti is a framework for building and querying real-time, temporally-aware knowledge graphs designed to support AI agents in dynamic environments with efficient incremental updates and hybrid retrieval methods.
  • llmware-ai/llmware (★14249 Python) - llmware is a unified framework for building enterprise Retrieval-Augmented Generation (RAG) pipelines using small, specialized language models integrated with secure knowledge sources for efficient AI applications.
  • mindsdb/mindsdb (★34927 Python) - MindsDB is an AI-powered query engine platform that connects, unifies, and enables conversational querying over large-scale federated data sources including databases, data warehouses, and SaaS applications.
  • robusta-dev/holmesgpt (★1136 Python) - HolmesGPT is an AI-powered on-call agent that automates alert investigations by correlating observability data from multiple sources to quickly identify root causes and streamline incident response.
  • run-llama/llama_index (★43384 Python) - LlamaIndex is a leading data framework that enables building LLM-powered applications by providing tools for data ingestion, structuring, and advanced querying to augment large language models with private and external data.
  • zilliztech/deep-searcher (★6698 Python) - DeepSearcher is an open-source platform that combines large language models and vector databases to enable secure and intelligent search, reasoning, and reporting on private enterprise data.

Developer Tools and Automation

Code Automation and Testing

  • BuilderIO/micro-agent (★4148 TypeScript) - BuilderIO/micro-agent is an AI-powered tool that writes and iteratively refines code by generating tests and ensuring all test cases pass, focusing on reliable test-driven development.
  • OpenAutoCoder/Agentless (★1841 Python) - Agentless is an open-source project that automatically solves software development problems through a three-phase process of fault localization, repair, and patch validation, achieving competitive results on SWE-bench benchmarks.
  • SWE-agent/SWE-agent (★16838 Python) - SWE-agent is an autonomous software engineering tool that uses large language models to automatically fix GitHub issues, perform web tasks, and solve cybersecurity challenges, achieving state-of-the-art results in automated software repair and offensive cybersecurity.
  • alibaba/arthas (★36550 Java) - Arthas is an open-source Java diagnostic tool by Alibaba that enables real-time production troubleshooting and performance monitoring without code changes or JVM restarts.
  • mattzcarey/shippie (★2204 TypeScript) - Shippie is an AI-powered extensible code review agent that integrates into CI/CD pipelines to automatically detect code issues and improve software delivery speed.
  • qodo-ai/qodo-cover (★5124 Python) - Qodo-Cover is an AI-powered tool that automates the generation of unit tests to enhance code coverage, supporting integration with GitHub CI workflows and local CLI usage.

Integration and Extensions

  • ComposioHQ/composio (★25587 Python) - Composio is a production-ready toolset that equips AI agents and LLMs with over 250 high-quality integrations across software, OS operations, and search capabilities, featuring managed authentication and a pluggable architecture for custom extensions.

Educational and Research Resources

Courses and Tutorials

  • NirDiamant/GenAI_Agents (★15229 Jupyter Notebook) - GenAI_Agents is a comprehensive repository offering tutorials, implementations, and community resources for building and advancing generative AI agents from basic to advanced levels.
  • The-Pocket/PocketFlow-Tutorial-Codebase-Knowledge (★11082 Python) - An AI-powered tool that analyzes GitHub repositories and generates beginner-friendly tutorials with visualizations to simplify understanding complex codebases.
  • alirezadir/Machine-Learning-Interviews (★6631 Jupyter Notebook) - A comprehensive guide to prepare for machine learning and AI technical interviews at major tech companies, covering coding, ML fundamentals, system design, and behavioral aspects.
  • benman1/generative_ai_with_langchain (★953 Jupyter Notebook) - Companion code repository for the book "Generative AI with LangChain, 2024 Edition," providing practical examples and resources to build large language model applications using LangChain and Python.
  • daveebbelaar/ai-cookbook (★2343 Python) - The AI Cookbook by Dave Ebbelaar offers practical examples and tutorials with copy/paste code snippets to help developers build real-world AI systems efficiently.
  • decodingml/second-brain-ai-assistant-course (★1401 Jupyter Notebook) - An open-source course teaching how to build a production-ready Second Brain AI assistant using LLMs, RAG, agents, fine-tuning, and LLMOps techniques.
  • huggingface/agents-course (★21264 Jupyter Notebook) - The Hugging Face Agents Course is a comprehensive educational resource that teaches the fundamentals and advanced concepts of AI agents and large language models through structured units, practical frameworks, and community collaboration.
  • neural-maze/ava-whatsapp-agent-course (★1448 Python) - Ava is a comprehensive course and project for building a production-ready AI WhatsApp agent capable of realistic conversations, voice and image recognition, and deployment using advanced AI technologies and cloud services.
  • neural-maze/philoagents-course (★1194 Python) - PhiloAgents Course is an open-source educational project that teaches how to build an AI-powered game simulation engine impersonating historical philosophers, combining philosophy with modern AI technology through hands-on modules covering agent development, RAG systems, system architecture, and LLMOps best practices.
  • patchy631/ai-engineering-hub (★14982 Jupyter Notebook) - AI Engineering Hub is a comprehensive repository offering in-depth tutorials and practical resources on Large Language Models, Retrieval-Augmented Generation, and real-world AI agent applications for learners and practitioners.

Research Lists and Survey Projects

LLM Research Repositories

  • DSXiangLi/DecryptPrompt (★3146 ) - DecryptPrompt is a comprehensive resource hub that summarizes prompt and LLM research papers, open-source datasets and models, and AIGC applications, providing in-depth tutorials and analyses for AI researchers and practitioners.
  • HCPLab-SYSU/Embodied_AI_Paper_List (★1608 ) - A comprehensive and regularly updated repository of papers, projects, and surveys focused on Embodied Artificial Intelligence, highlighting the latest research and advancements in the field.
  • WooooDyy/LLM-Agent-Paper-List (★7795 ) - A comprehensive repository compiling essential academic papers and a detailed survey on the rise, framework, applications, and societal aspects of Large Language Model based AI agents.
  • Xnhyacinth/Awesome-LLM-Long-Context-Modeling (★1634 ) - A curated repository of must-read papers and blogs focused on Long Context Modeling techniques and research in Large Language Models, providing comprehensive resources for advancing long sequence understanding in AI.
  • elder-plinius/CL4R1T4S (★8326 ) - CL4R1T4S is a project that provides transparency by collecting and sharing the hidden system prompts and guidelines used by major AI models and agents to promote trust and understanding of AI behavior.
  • filipecalegario/awesome-generative-ai (★2977 ) - A curated and comprehensive list of Generative AI tools, projects, models, artworks, and educational resources, regularly updated to reflect the latest advancements in the field.
  • luo-junyu/Awesome-Agent-Papers (★1309 ) - A comprehensive and up-to-date research collection of papers on Large Language Model (LLM) agents, covering methodologies, applications, challenges, and key categories such as construction, collaboration, evolution, tools, security, benchmarks, and real-world applications.
  • mahseema/awesome-ai-tools (★2849 ) - A curated repository listing top Artificial Intelligence tools across various categories including generative AI, chatbots, search engines, and writing assistants.
  • zjunlp/LLMAgentPapers (★2585 ) - LLMAgentPapers is a curated repository of essential academic papers on Large Language Model agents, covering topics from agent personality to multi-agent systems and applications.

Multi-Agent Surveys

  • AGI-Edgerunners/LLM-Agents-Papers (★1885 Python) - A comprehensive repository listing academic papers related to large language model (LLM) based agents, covering surveys, enhancement techniques, interactions, applications, automation, training, scaling, stability, and infrastructure.
  • Paitesanshi/LLM-Agent-Survey (★2820 ) - A comprehensive survey on the construction, application, and evaluation of autonomous agents powered by large language models, providing a foundational resource for researchers and practitioners in the field.
  • sindresorhus/awesome (★386881 ) - A curated collection of awesome lists covering a wide range of technology topics and development resources.
  • taichengguo/LLM_MultiAgents_Survey_Papers (★1057 ) - A comprehensive repository and survey of research papers on Large Language Model based Multi-Agent systems, covering frameworks, orchestration, problem solving, world simulation, and datasets.

License

Quick Start

1

Clone the repository

git clone https://github.com/korchasa/awesome-ai-agents
2

Install dependencies

cd awesome-ai-agents
npm install
3

Follow the documentation

Check the repository's README.md file for specific installation and usage instructions.

Repository Details

Ownerkorchasa
Repoawesome-ai-agents
LanguageHTML
LicenseCreative Commons Zero v1.0 Universal
Last fetched8/10/2025

Recommended MCP Servers

💬

Discord MCP

Enable AI assistants to seamlessly interact with Discord servers, channels, and messages.

integrationsdiscordchat
🔗

Knit MCP

Connect AI agents to 200+ SaaS applications and automate workflows.

integrationsautomationsaas
🕷️

Apify MCP Server

Deploy and interact with Apify actors for web scraping and data extraction.

apifycrawlerdata
🌐

BrowserStack MCP

BrowserStack MCP Server for automated testing across multiple browsers.

testingqabrowsers

Zapier MCP

A Zapier server that provides automation capabilities for various apps.

zapierautomation