Meta posted “Expanding our open source large language models responsibly”. From the Meta blog:
Takeaways:
- Meta is committed to openly accessible AI. Read Mark Zuckerberg’s letter detailing why open source is good for developers, good for Meta, and good for the world.
- Open source has multiple benefits: It helps ensure that more people around the world can access the opportunities that AI provides, guards against concentrating power in the hands of a small few, and deploys technology more equitably. And we believe it will lead to more safe AI outcomes across society. That’s why we continue to advocate for making open access to the AI industry standard.
- We’re bringing open intelligence to all by introducing Llama 3.1 collection of models, which expand context length to 128K, add support across eight languages, and include Llama 3.1 405B — the first frontier-level open source AI model.
- As we improve the capabilities of our models, we’re also scaling our evaluations, red teaming, and mitigations, including for catastrophic risks.
- We’re bolstering our system-level safety approach with new security and safety tools, which include Llama Guard 3 (an input and output multilingual moderation tool), Prompt Guard (a tool to protect against prompt injections), and CyberSecEval 3 (evaluations that help AI model and product developers understand and reduce generative AI cybersecurity risk). We’re also continuing to work with a global set of partners to create industry-wide standards that benefit the open source community.
- We prioritize responsible AI development, and want to empower others to do the same. As part of our responsible release efforts, we’re giving developers new tools and resources to implement the best practices in our Responsible Use Guide.
ArsTechnica reported: In the AI world, there’s a buzz in the air about a new AI language model released Tuesday by Meta: Llama 3.1 405B. The reason? It’s potentially the first time anyone can download a GPT-4-class large language model (LLM) for free and run it on their own hardware.
You’ll still need some beefy hardware: Meta says it can run on a “single sever node,” which isn’t desktop PC-grade equipment. But it’s a provocative shot across the bow of “closed” AI model vendors such as OpenAI and Anthropic.
“Llama 3.1 405B is the first openly available model that rivals the top AI models when it comes to state-of-the-art capabilities in general knowledge, steerability, math, tool use, and multilingual translation,” says Meta. Company CEO Mark Zuckerberg calls 405B “the first frontier-level open source AI model.”
The Register reported: First teased alongside the launch of its smaller eight- and 70-billion parameter siblings earlier this spring, Meta’s Llama 3.1 405B was trained on more than 15 trillion tokens — think of these a fragments of words, phrases, figures and punctuation — using 16,000 Nvidia H100 GPUs.
According to The Register, in total, the Facebook giant says training the 405-billion-parameter model required the equivalent of 30.84 million GPU hours and produced the equivalent of 11,390 tons of CO2 emissions.
In my opinion, I don’t think that large corporations should be using resources that humans need in order to feed an AI. This includes a huge amount of water, and also adds CO2 emissions into the air.