A group of researchers at Meta AI research has finetuned a large language model (GPT-J which is based on GPT-3) to be able to select tools.
Large Language Models has limitation such as mathematical reasoning or being unaware of current events. One way to solve this is to use APIs to gather the required information. The problem is, this solution can get very manual-heavy — making it unscalable.
Toolformer, the new model from Meta AI research is trained on a special dataset which is generated by itself to choose the necessary API automatically.
Let's see how it is trained in this video.
▬▬▬▬▬▬▬▬▬▬▬▬ CONNECT ▬▬▬▬▬▬▬▬▬▬▬▬
🖥️ Website: https://www.assemblyai.com/?utm_source=youtube&utm_medium=referral&utm_campaign=yt_mis_37
🐦 Twitter: https://twitter.com/AssemblyAI
🦾 Discord: https://discord.gg/Cd8MyVJAXd
▶️ Subscribe: null
🔥 We're hiring! Check our open roles: https://www.assemblyai.com/careers
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
#MachineLearning #DeepLearning
9 Comments