Ferret: Refer and Ground Anything Anywhere at Any Granularity An End-to-End MLLM that Accept Any-Form Referring and Ground Anything in Response. [Paper] Haoxuan You*, Haotian Zhang*, Zhe Gan, Xianzhi Du, Bowen Zhang, Zirui Wang, Liangliang Cao, Shih-Fu Chang, Yinfei Yang [*: equal contribution] Overview Diagram of Ferret Model. Key Contributions: Ferret Model - Hybrid Region Representation + Spatial-aware Visual Sampler enable fine-grained and open-vocabulary referring and ground…
A dedicated scratchpad for developersHeynote Heynote is a dedicated scratchpad for developers. It functions as a large persistent text buffer where you can write down anything you like. Works great for that Slack message you don't want to accidentally send, a JSON response from an API you're working with, notes from a meeting, your daily to-do list, etc. The Heynote buffer is divided into blocks, and each block can have its own Language set (e.g. JavaScript, JSON, Markdown, etc.). This…
A novel LLM-based multimodal agent framework designed to operate smartphone applicationsAppAgent Chi Zhang*, Zhao Yang*, Jiaxuan Liu*, Yucheng Han, Xin Chen, Zebiao Huang, Bin Fu, Gang Yu (Corresponding Author) (* equal contributions) 🔆 Introduction We introduce a novel LLM-based multimodal agent framework designed to operate smartphone applications. Our framework enables the agent to operate smartphone applications through a simplified action space, mimicking human…
🏆 Welcome to TON Smart Challenge #5 by TON Foundation & Codeforces 📝 Tasks Inheritance Smart Contract Splitter Smart Contract Universal Versioning Template Quantum Maze The goal of the contestants is to provide a code that matches the description. Each task except Task 4 may give the contestant either 0 or 5 to 6 score points: 5 for all tests passed plus "gas-score" from 0 to 1 (0 for "infinite" gas consumption, 1 for 0 gas consumption, dependence is inverse exponent). Tas…
The AI-native database built for LLM applications, providing incredibly fast vector and full-text search The AI-native database built for LLM applications, providing incredibly fast vector and full-text search Roadmap 2024 | Twitter | Discord | YouTube | Infinity is a cutting-edge AI-native database that provides a wide range of search capabilities for rich data types such as vectors, full-text, and structured data. It provides robust support for various LLM applications, in…
对mybatis-plus-generator进行封装,通过Web UI快速生成兼容Spring boot,mybatis-plus框架的各类业务代码mybatis-plus-generator-ui 提供交互式的Web UI用于生成兼容mybatis-plus框架的相关功能代码,包括Entity,Mapper,Mapper.xml,Service,Controller等 ,可以自定义模板以及各类输出参数,也可通过SQL查询语句直接生成代码。 使用方法 引入maven的相关依赖,注意scope只需要写test就可以了 <dependency> <groupId>com.github.davidfantasy</groupId> <artifactId>mybatis-plus-generator-ui</artifactId> <version>2.0.5</version> <scope>test</scope> </dependen…
Official implementation of `Splatter Image: Ultra-Fast Single-View 3D Reconstruction'splatter-image Official implementation of `Splatter Image: Ultra-Fast Single-View 3D Reconstruction' Using this repository Installation Create a conda environment and install requirements: conda create --name splatter-image conda activate splatter-image pip install -r requirements.txt Install Gaussian Splatting renderer, i.e. the library for rendering a Gaussian Point cloud to an image. To do so, pull …
A taxonomy for open source cryptocurrency, blockchain, and decentralized ecosystemsV1.1 UPDATE [12/20/23]: Read about the Crypto Ecosystems taxonomy's update to Version 1.1 here. Crypto Ecosystems MIT license with attribution 🌲 Crypto Ecosystems is a taxonomy for sharing data around open source blockchain, Web3, cryptocurrency, and decentralized ecosystems and tying them to GitHub organizations and code repositories. All of the ecosystems are specified in TOML configuration files. This repo…