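小镜故事板 (Xiaojing Storyboard)
小镜故事板 (xjstoryboard.com) is an online tool focused on creating storyboard scripts. It helps users produce storyboards quickly and is suited to film and television production, advertising planning, and animation design. No professional drawing skills are needed: users complete a script design by dragging and dropping templates and elements. The site offers multiple...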
MagicArena
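MagicArena is an online platform focused on head-to-head battles between visual generation models. Users can pick different AI models, give them the same text description, and compare and evaluate the content the models generate. The platform is suited to developers, researchers, and anyone interested in AI technology. MagicArena offers simple...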
SuperMaker
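SuperMaker AI is a free online creation platform that helps users quickly generate high-quality video, music, image, and voice content. Users can try the core features without logging in; it is easy to use and suited to individual creators and small teams. Using artificial intelligence, the platform turns text, images, or ideas...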
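Quarkdown: a Markdown-based dynamic typesetting tool
Quarkdown is a modern typesetting tool based on Markdown that extends CommonMark and GitHub Flavored Markdown (GFM). By introducing functions, variables, and a standard library, it lets users create dynamic content and easily generate interactive presentation...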
Simple Subtitling: an open source tool for automatically generating video subtitles and speaker identification
Simple Subtitling is an open source subtitle generation tool that automatically generates subtitles and labels speakers for video or audio files. The project, developed by Jaesung Huh and hosted on GitHub, aims to provide a simple and efficient subtitle generation solution. Through audio processing technology, the tool...
ArXiv Paper Summarizer: automatic summary tool for arXiv papers
arXiv Summarizer is an open source Python scripting tool, hosted on GitHub, designed to help users quickly access and generate summaries of academic papers from the arXiv platform. It utilizes the free Gemini API for efficient text summarization and is suitable for researchers, students and academic...
Sim Studio: open source workflow builder for AI agents
Sim Studio is an open source platform for building AI agent workflows. It helps users quickly design, test, and deploy large language model (LLM) workflows through a lightweight, intuitive visual interface. Users can create complex multi-agent applications with drag-and-drop, without deep programming knowledge. It supports this ...
Hula: turn selfies into short viral videos and personalized stickers in one click
Hula is an AI-powered creative tool that transforms user selfies into viral videos, multi-style images, and personalized sticker packs in one click. Developed by Prequel Inc., the app supports iOS and Android and is aimed at avid social...
AIstudioProxyAPI: Unlimited use of the Gemini 2.5 Pro Model API
AIstudioProxyAPI is an open source project that uses Node.js and Playwright to expose the Gemini chat functionality of the Google AI Studio web version as a standard API by emulating the OpenAI API ...
Step1X-Edit: An Open Source Tool for Editing Images with Natural Language Instructions
Step1X-Edit is an open source image editing framework developed by the Stepfun AI team and hosted on GitHub. It combines a multimodal large language model (Qwen-VL) and a diffusion transformer (DiT) so that users can edit an image with simple natural language commands, such as changing the background, removing an object, or transforming the style ...
Klavis AI: Model Context Protocol (MCP) Integration Tool for AI Applications
Klavis AI is an open source platform focused on simplifying the use and integration of the Model Context Protocol (MCP), an open standard that allows AI applications to dynamically connect with external tools and data sources. Klavis AI offers Slack, Discord clients, hosted MCP servers, and simplicity...
MiMo: A Small Open Source Model for Efficient Mathematical Reasoning and Code Generation
MiMo is an open source large language model project developed by Xiaomi, focusing on mathematical reasoning and code generation. The core product is the MiMo-7B family of models, which consists of a base model (Base), a supervised fine-tuning model (SFT), a reinforcement learning model trained from the base model (RL-Zero), and a SFT model trained from...
Muyan-TTS: Personalized Podcast Speech Training and Synthesis
Muyan-TTS is an open source text-to-speech (TTS) model designed for podcasting scenarios. It is pre-trained on over 100,000 hours of podcast audio data and supports zero-shot speech synthesis to generate high-quality natural speech. The model is built on Llama-3.2-3B, and combined with the SoVITS decoder, it provides high...
CAD-MCP: MCP services for controlling CAD software through natural language commands
CAD-MCP is an open source project that lets users control drawing operations in CAD software through natural language commands. It combines natural language processing with CAD automation, so users do not need to operate the CAD interface manually and can create and modify drawings by entering simple text commands. The project supports a variety of ...
Cotrans
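manga-image-translator (the open source version of the Cotrans translator) translates text in manga or images. It provides a command-line interface and an online demo, with usage options including a batch conversion mode and a web server mode. Multiple target languages and recognition parameters can be configured, and it comes with detailed...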
GraphGen: Fine-tuning Language Models Using Knowledge Graphs to Generate Synthetic Data
GraphGen is an open-source framework developed by OpenScienceLab, an AI lab in Shanghai, hosted on GitHub, focused on optimizing supervised fine-tuning of Large Language Models (LLMs) by guiding synthetic data generation through knowledge graphs. It constructs fine-grained knowledge graphs from source text, utilizing the expected calibration error...
ACI.DEV: Integration of 600+ tools for AI agents via MCP server
ACI.dev is an open source infrastructure platform designed to give AI agents rapid integration with over 600 tools. It ensures that agents have secure access to tools such as Google Calendar, Slack, and Brave Search through multi-tenant authentication and fine-grained permissions management. Developers can...
llm.pdf: experimental project to run a large language model in a PDF file
llm.pdf is an open source project that allows users to run Large Language Models (LLMs) directly in PDF files. This project, developed by EvanZhouDev and hosted on GitHub, demonstrates an innovative approach: compiling llama.cpp via Emscripten as ...
Abogen: a tool for converting multiple text formats to audiobooks
Abogen is an open source tool designed to quickly convert ePub, PDF or plain text files to high quality audio. It uses the Kokoro-82M model to generate natural and smooth speech, and also supports synchronized subtitle generation, making it suitable for audiobooks, video dubbing or learning aids. Users can choose...