RedPajama, which creates fully open-source large language models, has released a 1.2 trillion token dataset following the LLaMA recipe.
View Article on VentureBeat
AI,Business,AI, ML and Deep Learning,category-/Science/Computer Science,Conversational AI,Databricks,Github,Hugging Face,large language models,LLaMA,Meta,NLP,Open source,open source software
Conversational AI