Skip to content

Latest commit

 

History

History
68 lines (47 loc) · 3.11 KB

README.md

File metadata and controls

68 lines (47 loc) · 3.11 KB

LLMRoboFund Empowered with Retrieval-Augmented Generation

Conducting investment research requires obtaining access to various platforms to become well-informed about investment strategies, financial risks involved, or even financial data about a fund or ETFs. Traditional chatbots, also called 'Robo-Funds,' offer mere hard-coded pre-defined choices based on your answers, being unable to generate or answer questions with real-time information or data. LLM-backed chatbots can easily eliminate the need for standalone research with dedicated hours.

LLMRoboFund presents a novel approach for chatting with an LLM to get informed about funds/ETFs and obtain financial data with up-to-date documents. To enable the LLM with real-time information, Retrieval-Augmented Generation (Lewis et al., 2020) method is deployed to update the knowledge base using dense vector and SQL databases.

The data used to update the knowledge base of the LLM include Turkey Electronic Fund Trading Platform and Public Disclosure Platform. TEFT platform provide diverse financial information, such as management fee, outstanding number of shares, initial and current price, return data up-to 5 year, percentage distribution of the invested instruments, and so on. PDP share documents related to the funds/ETFs available at TEFT platform, such as investor information documents, which are the foundation of LLMRoboFund.

The list below include the available QA:

  • Investment strategies
  • Investment purpose and policy
  • Managing company of the fund
  • Buy/Sell policy of the fund
  • Financial data, such as applied/bylaw management fees, return data up-to 5 year, number of initial and current shares. Basically all available financial information at Turkey Electronic Fund Trading Platform

Disclosure

Neither the content or LLM responses should be used to obtain investment idea or strategy, as the LLM responses are refined by diverse prompt engineering techniques and chain methods in accordance with my preferences.

Requirements

 streamlit==1.28.0
 tefas-crawler==0.3.4
 openai==0.28.1
 streamlit-dynamic-filters==0.1.3
 faiss-cpu==1.7.4
 streamlit==1.28.0
 cohere==4.36
 langchain==0.0.348
 pypdf==3.17.1
 chromadb==0.4.18
 pinecone-client==2.2.4
 psutil==5.9.6
 gputil==1.4.0
 tiktoken==0.5.2
 selenium==4.16.0
 chardet==5.2.0
 streamlit_option_menu==0.3.6
 streamlit_chat==0.1.1
 plotly==5.18.0
 tiktoken==0.5.2
   $Streamlit run Application/app.py

Pinecone Schema

image

LangChain Schema

image

LLMRoboFund

Investment Chatbot

streamlit-app-2023-12-08-16-12-37.webm

Interactive Dashboard

streamlit-app-2023-12-04-12-12-38.webm