A high-throughput and memory-efficient inference and serving engine for LLMs Easy, fast, and cheap LLM serving for everyone | Documentation | Blog | Discussions | Latest News 🔥 [2023/06] We officially released vLLM! vLLM has powered LMSYS Vicuna and Chatbot Arena since mid April. Check out our blog post. vLLM is a fast and easy-to-use library for LLM inference and serving. vLLM is fast with: State-of-the-art serving throughput Efficient management of attention key and …
A Javascript AI getting started stack for weekend projects, including image/text models, vector stores, auth, and deployment configsAI Getting Started Live Demo (deployed on fly.io) Stack Auth: Clerk App logic: Next.js VectorDB: Pinecone / Supabase pgvector LLM Orchestration: Langchain.js Image Model: Replicate Text Model: OpenAI Text streaming: ai sdk Deployment: Fly Overview 🚀 Quickstart 💻 Contribute to this repo Quickstart The simplest way to try out this stack is to test it out locally…
X-GRAM SOCIAL MOBILE APPLICATION 📺 Demo Video 📃 Don't forget to give a star if you like it. Thank you so much! Hello React Native developers, this is my latest project what I did with React Native. I'm very happy to share it with you. I hope it will help you to learn more about React Native. 📗 Library used in this project: React Native (v0.69.3) Reanimated v2 React-native-navigation v6 Mobx v6 Wanna support me? You can buy me a coffee here ☕ 🔨 Installation Firstly, install dependenci…
[PREVIEW] Sample code for a simple web chat experience targeting chatGPT through AOAI.[Preview] Sample Chat App with AOAI This repo contains sample code for a simple chat webapp that integrates with Azure OpenAI. Note: some portions of the app use preview APIs. Prerequisites An existing Azure OpenAI resource and model deployment of a chat model (e.g. gpt-35-turbo, gpt-4) To use Azure OpenAI on your data: an existing Azure Cognitive Search resource and index. Deploy the app Deploy with Azure D…
Label, clean and enrich text datasets with LLMs Clean, labeled data at the speed of thought. Getting started | Docs | Discord | Twitter | Website ⚡ Quick Install pip install refuel-autolabel 🏷 What is Autolabel Access to large, clean and diverse labeled datasets is a critical component for any machine learning effort to be successful. State-of-the-art LLMs like GPT-4 are able to automatically label data with high accuracy, and at a fraction of the cost and time compared…
Desktop implementation of Steam's mobile authenticator app Steam Desktop Authenticator A desktop implementation of Steam's mobile authenticator app. We are not affiliated with Steam or Scrap.TF in any way! This project is run by community volunteers. WARNING: Recently there have been fake versions of SDA floating around that will steal your Steam account. Never download SDA from any place other than this github repo! Download here Supports Windows 7 and up. Clicking …
Lightweight Google-searchable Slack alternative for Communities Linen is a Google-searchable community chat tool. Linen was built as an alternative to closed tools like Slack and Discord. Philosophy Modern communities are built on chat, Slack and Discord is great to get fast realtime answers from teams but it can become a chaotic blackhole of information. Historically these communities existed in forums where they had better structure and search-engine friendliness. We believe the f…
A ChatGPT plugin that allows you to load and edit your local files in a controlled way, as well as run any Python, JavaScript, and bash script.Kaguya Kaguya is a ChatGPT plugin that allows you to load and edit your local files in a controlled way, as well as run any Python, JavaScript, and bash script. This makes it a powerful tool for developers, enabling them to interact with their file system and run scripts directly from ChatGPT. API Endpoints The project provides several API endpoints that…