MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone 中文 | English WeChat | Discord MiniCPM-o 2.6 🤗 🤖 | MiniCPM-V 2.6 🤗 🤖 | Technical Blog Coming Soon MiniCPM-o is the latest series of end-side multimodal LLMs (MLLMs) ungraded from MiniCPM-V. The models can now take image, video, text, and audio as inputs and provide high-quality text and s…
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.RealtimeSTT Easy-to-use, low-latency speech-to-text library for realtime applications New AudioToTextRecorderClient class, which automatically starts a server if none is running and connects to it. The class shares the same interface as AudioToTextRecorder, making it easy to upgrade or switch between the two. (Work in progress, most parameters and…
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer ⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer 💡 Introduction We introduce Sana, a text-to-image framework that can efficiently generate images up to 4096 × 4096 resolution. Sana can synthesize high-resolution, high-quality images with strong text-image alignment at a remarkably fast speed, deployable on laptop GPU. Core designs include:…
A high-performance algorithmic trading platform and event-driven backtester Branch Version Status master nightly develop Platform Rust Python Linux (x86_64) 1.84.0+ 3.11, 3.12 macOS (arm64) 1.84.0+ 3.11, 3.12 Windows (x86_64) 1.84.0+ 3.11, 3.12 Docs: https://nautilustrader.io/docs/ Website: https://nautilustrader.io Support: support@nautilustrader.io Introduction NautilusTrader is an open-source, high-performance, production-grade algorithmic trading p…
The ML4W Dotfiles for Hyprland - An advanced and full-featured configuration for the dynamic tiling window manager Hyprland including an easy to use installation script for Arch and Fedora based Linux distributions.ML4W Dotfiles for Hyprland An advanced configuration of Hyprland for Arch Linux based distributions. This package includes an installation script to install and setup the required components. About the screenshot: The dock can be enabled in the Dotfiles Settings app. The waybar the…
A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting.code2prompt code2prompt is a command-line tool (CLI) that converts your codebase into a single LLM prompt with a source tree, prompt templating, and token counting. Table of Contents Features Installation Usage Templates User Defined Variables Tokenizers Contribution License Support The Author Features You can run this tool on the entire directory and it would generate a…
SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.SCUDA: GPU-over-IP SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines. Demo CUBLAS Matrix Multiplication using Unified Memory The below demo displays a NVIDIA GeForce RTX 4090 running on a remote machine (right pane). Left pane is a Mac running a docker container with nvidia utils installed. The docker container runs this matrixMulCUBLAS example. Thi…
Full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit🧰 AI Agent Service Toolkit A full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit. It includes a LangGraph agent, a FastAPI service to serve it, a client to interact with the service, and a Streamlit app that uses the client to provide a chat interface. Data structures and settings are built with Pydantic. This project offers a template for you to easily build and run…
为键盘工作者设计的单词记忆与英语肌肉记忆锻炼软件 / Words learning and English muscle memory training software designed for keyboard workers Qwerty Learner English 日本語 为键盘工作者设计的单词记忆与英语肌肉记忆锻炼软件 📸 在线访问 首选部署: https://qwerty.kaiyi.cool/ GitHub Pages: https://realkai42.github.io/qwerty-learner/ 镜像仓库: GitCode: RealKai42/qwerty-learner Gitee: KaiyiWing/qwerty-learner) 项目已发布 VSCode 插件版,一键启动、随时开始练习 VSCode Plugin Market GitHub 快速部署 Vercel 部署步骤 更新 Vercel Build & Development Sett…