Naveen Pitchandi

Llama 2 - More like Llama1 went to the gym, trained hard and got lean muscle

๐—ช๐—ต๐˜† ๐—ถ๐˜€ ๐—Ÿ๐—น๐—ฎ๐—บ๐—ฎ๐Ÿฎ ๐—ฎ ๐—ฏ๐—ถ๐—ด ๐—ฑ๐—ฒ๐—ฎ๐—น?

Meta just released a bigger and commerical (conditionally) open-source Large Language Model - Llama 2. 4 Months after it released Llama 1

Netflix takes a whole year to release a new season, while AI research is on a rip in the last year

โ“ ๐—ช๐—ฎ๐—ถ๐˜, ๐—ช๐—ต๐—ฎ๐˜โ€™๐˜€ ๐—Ÿ๐—น๐—ฎ๐—บ๐—ฎ?

Llama is a large language model like OpenAIโ€™s GPT, except much smaller and open-source.

GPT3 has 175B Parameters, while Llama comes in Starbucks sizes like Tall(7B) , Grande (13B), Venti (65B).

Why do parameters matter? More the parameters, better it can learn from data and produce more nuanced results. Visualize a color palette for a painter; more colors at hand, more vibrant the picture. However, โ€œmoreโ€ always comes with cost.

Denser models need more computational resources, so more parameters isnโ€™t always good. Universally facepalmed response to any question applies here as well โ€œit dependsโ€. Remember the times when we used Dial-Up internet based on number of minutes used like calling cards? We're in that age of LLMs now. Lean models with reasonably good performance reduces cost thats needed to move to the broadband, 3G equivalent.

Llamaโ€™s source code is available for anyone to peek into. Open Source invokes transparency and trust, thatโ€™s foundational for something like AI that scares the life out of most people.

Open-Source model available for download, also means anyone can download the model and run it on any machine.GPT is synonymous to a Cloud Software - always runs on someone elseโ€™s machine (for now)

๐Ÿคท ๐—ข๐—ธ, ๐—›๐—ผ๐˜„โ€™๐˜€ ๐—Ÿ๐—น๐—ฎ๐—บ๐—ฎ๐Ÿฎ ๐—ฏ๐—ฒ๐˜๐˜๐—ฒ๐—ฟ?

1๏ธโƒฃ Fine-tuned Llama2 used for chat - Llama-2-Chat - outperforms open-source chat models and on par with some popular closed-source models like ChatGPT and PaLM

2๏ธโƒฃ Llama2 is available for use commercially (special license for monthly active exceeding 700M users)

3๏ธโƒฃ Llama2 is also available as a service offered through Microsoft Azureโ€™s compute.LLaMA will also be available through AWS, Hugging Face, and other providers

4๏ธโƒฃ Llama2 can run locally on devices without cloud dependency.

Qualcomm announced it is working with Meta to bring LLaMa to laptops, phones, and headsets starting from 2024 onward for AI-powered apps that work without relying on cloud services.

Google also announced PaLM2 to have many sizes, smallest capable for running on phones. Judging by the pace of innovation thus far, donโ€™t think we need to hold our breath for too long

๐Ÿƒ ๐— ๐—ฒ๐˜๐—ฎ ๐—ฟ๐—ฎ๐—ป ๐˜๐—ต๐—ฒ ๐—ฟ๐—ฎ๐—ฐ๐—ฒ, ๐—ฏ๐˜‚๐˜ ๐— ๐—ถ๐—ฐ๐—ฟ๐—ผ๐˜€๐—ผ๐—ณ๐˜ ๐—ฝ๐—ถ๐—ฐ๐—ธ๐˜€ ๐˜‚๐—ฝ ๐˜๐—ต๐—ฒ ๐—บ๐—ฒ๐—ฑ๐—ฎ๐—น ๐Ÿ†

In all this AI innovation, there one silent winner - Microsoft. Infrastructure providers laid the roads and are happily collecting tolls

AWS will soon join the party for sure, but Microsoft is clearly on full-volume with rolled down windows playing โ€œI Ainโ€™t Worriedโ€ for everyone to dance

built with btw btw logo