Llama 2 - More like Llama1 went to the gym, trained hard and got lean muscle

๐ช๐ต๐ ๐ถ๐ ๐๐น๐ฎ๐บ๐ฎ๐ฎ ๐ฎ ๐ฏ๐ถ๐ด ๐ฑ๐ฒ๐ฎ๐น?
Meta just released a bigger and commerical (conditionally) open-source Large Language Model - Llama 2. 4 Months after it released Llama 1
Netflix takes a whole year to release a new season, while AI research is on a rip in the last year
โ ๐ช๐ฎ๐ถ๐, ๐ช๐ต๐ฎ๐โ๐ ๐๐น๐ฎ๐บ๐ฎ?
Llama is a large language model like OpenAIโs GPT, except much smaller and open-source.
GPT3 has 175B Parameters, while Llama comes in Starbucks sizes like Tall(7B) , Grande (13B), Venti (65B).
Why do parameters matter? More the parameters, better it can learn from data and produce more nuanced results. Visualize a color palette for a painter; more colors at hand, more vibrant the picture. However, โmoreโ always comes with cost.
Denser models need more computational resources, so more parameters isnโt always good. Universally facepalmed response to any question applies here as well โit dependsโ. Remember the times when we used Dial-Up internet based on number of minutes used like calling cards? We're in that age of LLMs now. Lean models with reasonably good performance reduces cost thats needed to move to the broadband, 3G equivalent.
Llamaโs source code is available for anyone to peek into. Open Source invokes transparency and trust, thatโs foundational for something like AI that scares the life out of most people.
Open-Source model available for download, also means anyone can download the model and run it on any machine.GPT is synonymous to a Cloud Software - always runs on someone elseโs machine (for now)
๐คท ๐ข๐ธ, ๐๐ผ๐โ๐ ๐๐น๐ฎ๐บ๐ฎ๐ฎ ๐ฏ๐ฒ๐๐๐ฒ๐ฟ?
1๏ธโฃ Fine-tuned Llama2 used for chat - Llama-2-Chat - outperforms open-source chat models and on par with some popular closed-source models like ChatGPT and PaLM
2๏ธโฃ Llama2 is available for use commercially (special license for monthly active exceeding 700M users)
3๏ธโฃ Llama2 is also available as a service offered through Microsoft Azureโs compute.LLaMA will also be available through AWS, Hugging Face, and other providers
4๏ธโฃ Llama2 can run locally on devices without cloud dependency.
Qualcomm announced it is working with Meta to bring LLaMa to laptops, phones, and headsets starting from 2024 onward for AI-powered apps that work without relying on cloud services.
Google also announced PaLM2 to have many sizes, smallest capable for running on phones. Judging by the pace of innovation thus far, donโt think we need to hold our breath for too long
๐ ๐ ๐ฒ๐๐ฎ ๐ฟ๐ฎ๐ป ๐๐ต๐ฒ ๐ฟ๐ฎ๐ฐ๐ฒ, ๐ฏ๐๐ ๐ ๐ถ๐ฐ๐ฟ๐ผ๐๐ผ๐ณ๐ ๐ฝ๐ถ๐ฐ๐ธ๐ ๐๐ฝ ๐๐ต๐ฒ ๐บ๐ฒ๐ฑ๐ฎ๐น ๐
In all this AI innovation, there one silent winner - Microsoft. Infrastructure providers laid the roads and are happily collecting tolls
AWS will soon join the party for sure, but Microsoft is clearly on full-volume with rolled down windows playing โI Ainโt Worriedโ for everyone to dance