AI open source project


Gemini CLI: Google's open source command-line AI programming tool
Gemini CLI is an open source command-line tool developed by Google, based on the Gemini 2.5 Pro model, that lets users work with AI functionality directly in the terminal. It supports tasks such as working with large codebases, generating applications, automating workflows, and managing files. Users can access AI functionality through their personal Google...
06-29 26 0 kudos
GitHub Copilot Chat: Microsoft open-sources AI programming assistant for VS Code
GitHub Copilot Chat is a Visual Studio Code (VS Code) extension developed by Microsoft. It provides developers with code-related help through artificial intelligence. Users can ask questions in natural language to get code suggestions, explanations and optimizations. The tool is powered by GitHub ...
06-29 28 0 kudos
PartCrafter: Generating Editable 3D Part Models from a Single Image
PartCrafter is an open source project focused on generating editable 3D part models from a single RGB image. It uses structured 3D generation to produce multiple semantically meaningful 3D parts simultaneously from a single image, with applications in game development, product design and other fields. The project is based on pre-training...
06-27 93 0 kudos
Quarkdown: Markdown-based dynamic typesetting tool
Quarkdown is a modern Markdown-based typesetting tool that extends the functionality of CommonMark and GitHub Flavored Markdown (GFM). It enables users to create dynamic content by introducing functions, variables and standard libraries to easily generate interactive presentations...
06-10 249 0 kudos
BAGEL
BAGEL is an open source multimodal base model developed by the ByteDance Seed team and hosted on GitHub. It integrates text comprehension, image generation, and editing capabilities to support cross-modal tasks. The model has 7B active parameters (14B parameters in total) and uses Mixture-of-Tra...
05-22 913 0 kudos
DeepResearchAgent
DeepResearchAgent is an open source AI tool developed by SkyworkAI that focuses on automating deep research. It helps users quickly generate detailed research reports by combining search engines, web crawling and large language models (LLMs). Users simply enter a research topic or question, and the tool automatically searches...
05-22 561 0 kudos
Muscle-Mem
Muscle-Mem is an open source Python tool hosted on GitHub and developed by pig-dot-dev. It is designed to provide behavioral caching capabilities for AI agents to help reduce large language model (LLM) calls in repetitive tasks, thereby increasing runtime speed, reducing variability, and saving costs....
05-16 558 0 kudos
Simple Subtitling: an open source tool for automatic video subtitling and speaker identification
Simple Subtitling is an open source subtitle generation tool that automatically generates subtitles and labels speakers for video or audio files. The project, developed by Jaesung Huh and hosted on GitHub, aims to provide a simple and efficient subtitle generation solution. The tool uses audio processing technology ...
05-16 599 0 kudos
ArXiv Paper Summarizer: automatic summary tool for arXiv papers
arXiv Summarizer is an open source Python script, hosted on GitHub, designed to help users quickly fetch academic papers from the arXiv platform and generate summaries of them. It uses the free Gemini API for text summarization and is suitable for researchers, students and academic...
05-16 572 0 kudos
Sim Studio: open source workflow builder for AI agents
Sim Studio is an open source platform for building AI agent workflows, focused on helping users quickly design, test, and deploy large language model (LLM) workflows through a lightweight, intuitive visual interface. Users can create complex multi-agent applications with drag-and-drop, without deep programming knowledge. It supports this ...
05-07 1.1 K0kudos
Mad Professor
Mad Professor (暴躁的教授读论文) is an open source AI academic tool designed for researchers and students to simplify the reading and analysis of academic papers. It integrates PDF processing, AI translation, RAG search, AI Q&A and voice interaction. Users can import PDF papers...
05-06 1.1 K0kudos
AIstudioProxyAPI: Unlimited use of the Gemini 2.5 Pro Model API
AIstudioProxyAPI is an open source project that uses Node.js and Playwright to expose the Gemini chat functionality of the Google AI Studio web app as a standard API endpoint by emulating the OpenAI API ...
05-06 1.0 K0kudos
Step1X-Edit: An Open Source Tool for Editing Images with Natural Language Instructions
Step1X-Edit is an open source image editing framework developed by the Stepfun AI team and hosted on GitHub. It combines a multimodal large language model (Qwen-VL) and a diffusion transformer (DiT) to allow users to edit an image with simple natural language commands, such as changing the background, removing an object, or transforming the wind ....
05-06 1.0 K0kudos
Klavis AI: Model Context Protocol (MCP) Integration Tool for AI Applications
Klavis AI is an open source platform focused on simplifying the use and integration of the Model Context Protocol (MCP), an open standard that allows AI applications to dynamically connect with external tools and data sources. Klavis AI offers Slack and Discord clients, hosted MCP servers, and simplicity...
05-06 984 0 kudos
RealtimeVoiceChat
RealtimeVoiceChat is an open source project focused on real-time, natural voice conversations with artificial intelligence. Users speak into a microphone; the system captures the audio through the browser, quickly converts it to text, generates a reply from a large language model (LLM), and then converts the text to speech output, the whole...
05-06 1.1 K0kudos
MiMo: A Small Open Source Model for Efficient Mathematical Reasoning and Code Generation
MiMo is an open source large language model project developed by Xiaomi, focusing on mathematical reasoning and code generation. The core product is the MiMo-7B family of models, which consists of a base model (Base), a supervised fine-tuning model (SFT), a reinforcement learning model trained from the base model (RL-Zero), and an SFT model trained from...
05-06 988 0 kudos
Muyan-TTS: Personalized Podcast Speech Training and Synthesis
Muyan-TTS is an open source text-to-speech (TTS) model designed for podcasting scenarios. It is pre-trained on over 100,000 hours of podcast audio data and supports zero-shot speech synthesis to generate high-quality natural speech. The model is built on Llama-3.2-3B, and combined with the SoVITS decoder, it provides high...
05-06 913 0 kudos
CAD-MCP: MCP services for controlling CAD software through natural language commands
CAD-MCP is an open source project that allows users to control CAD software drawing operations through natural language commands. It combines natural language processing and CAD automation technology, so users do not need to operate the CAD interface manually; they just enter simple text commands to create and modify drawings. The project supports a variety of ...
05-06 922 0 kudos
GraphGen: Generating Synthetic Data with Knowledge Graphs for Fine-tuning Language Models
GraphGen is an open-source framework developed by OpenScienceLab, an AI lab in Shanghai, hosted on GitHub, focused on optimizing supervised fine-tuning of Large Language Models (LLMs) by guiding synthetic data generation through knowledge graphs. It constructs fine-grained knowledge graphs from source text, utilizing the expected calibration error...
05-05 657 0 kudos