An open-source AI agent that brings the power of Gemini directly into your terminal.Gemini CLI Gemini CLI is an open-source AI agent that brings the power of Gemini directly into your terminal. It provides lightweight access to Gemini, giving you the most direct path from your prompt to our model. 🚀 Why Gemini CLI? 🎯 Free tier: 60 requests/min and 1,000 requests/day with personal Google account 🧠 Powerful Gemini 2.5 Pro: Access to 1M token context window 🔧 Built-in tools: Google Search gr…
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025. Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting Dolphin (Document Image Parsing via Heterogeneous Anchor Prompting) is a novel multimodal document image parsing model following an analyze-then-parse paradigm. This repository contains the demo code and pre-trained models for Dolphin. 📑 Overview D…