AI4Finance-Foundation / FinGPT
- четверг, 15 июня 2023 г. в 00:00:01
Data-Centric FinGPT. Open-source for open finance! Revolutionize 🔥 We'll soon release the trained model.
Let us DO NOT expect Wall Street to open-source LLMs nor open APIs.
We democratize Internet-scale data for financial large language models (FinLLMs) at FinNLP and FinNLP Website
Disclaimer: We are sharing codes for academic purpose under the MIT education license. Nothing herein is financial advice, and NOT a recommendation to trade real money. Please use common sense and always first consult a professional before trading or investing.
1). Finance is highly dynamic. BloombergGPT retrains an LLM using a mixed dataset of finance and general data sources, which is too expensive (1.3M GPU hours, a cost of around $5M). It is costly to retrain an LLM model every month or every week, so lightweight adaptation is highly favorable in finance. Instead of undertaking a costly and time-consuming process of retraining a model from scratch with every significant change in the financial landscape, FinGPT can be fine-tuned swiftly to align with new data (the cost of adaptation falls significantly, estimated at less than $300 per training).
2). Democratizing Internet-scale financial data is critical, which should allow timely updates (monthly or weekly updates) using an automatic data curation pipeline. But, BloombergGPT has privileged data access and APIs. FinGPT presents a more accessible alternative. It prioritizes lightweight adaptation, leveraging the strengths of some of the best available open-source LLMs, which are then fed with financial data and fine-tuned for financial language modeling.
3). The key technology is "RLHF (Reinforcement learning from human feedback)", which is missing in BloombergGPT. RLHF enables an LLM model to learn individual preferences (risk-aversion level, investing habits, personalized robo-advisor, etc.), which is the ``secret" ingredient of ChatGPT and GPT4.
The Journey of Open AI GPT models. GPT models explained. Open AI's GPT-1, GPT-2, GPT-3.
[BloombergGPT] BloombergGPT: A Large Language Model for Finance
WHAT’S IN MY AI? A Comprehensive Analysis of Datasets Used to Train GPT-1, GPT-2, GPT-3, GPT-NeoX-20B, Megatron-11B, MT-NLG, and Gopher
FinRL-Meta Repo and paper FinRL-Meta: Market Environments and Benchmarks for Data-Driven Financial Reinforcement Learning. Advances in Neural Information Processing Systems, 2022.
[AI4Finance] FinNLP Democratizing Internet-scale financial data.
ChatGPT Trading Bot
(Fast and accurate) Sentiment Analysis
GPT-3 can help study customer surveys, social media tweets from customers/users.
Tweets
PromptNet Analogy to ImageNet and WordNet, it is critical to build a PromptNet.
Robo-advisor
Coding-tutor
Blogs about ChatGPT for FinTech
Prompting as a new programming paradigm!
[Towards Data Science] GPT-3: Creative Potential of NLP
[YouTube video] OpenAI GPT-3 - Prompt Engineering For Financial NLP
[YouTube video] Advanced ChatGPT Prompt Engineering
GPT-3 Sandbox (Github) Enable users to create cool web demos using OpenAI GPT-3 API.
Exploring the Capabilities of the ChatGPT API: A Beginner’s Guide
Prompting programming
A Release Timeline of many LLMs.
Interesting evaluations:
[YouTube] Physics Solution: ChatGPT vs. Google