How to Build LLMs on Your Company’s Data While on a Budget
HTML-код
- Опубликовано: 5 фев 2025
- Large Language Models (LLMs) are taking AI mainstream across companies and individuals. However, public LLMs are trained on general-purpose data. They do not include your own corporate data and they are black boxes on how they are trained. Because terminology is different for healthcare, financial, retail, digital-native and other industries, companies today are looking for industry-specific LLMs to better understand the terminology, context and knowledge that better suits their needs. In contrast to closed LLMs, open source-based models can be used for commercial usage or customized to suit an enterprise’s needs on their own data. Learn how Databricks makes it easy for you to build, tune and use custom models, including a deep dive into Dolly, the first open source, instruction-following LLM, fine-tuned on a human-generated instruction dataset licensed for research and commercial use.
In this session, you will:
See a real-life demo of creating your own LLMs specific to your industry
Learn how to securely train on your own documents if needed
Learn how Databricks makes it quick, scalable and inexpensive
Deep dive into Dolly and its applications
Talk by: Sean Owen
Connect with us: Website: databricks.com
Twitter: / databricks
LinkedIn: / databricks
Instagram: / databricksinc
Facebook: / databricksinc
Brilliant presentation. Thank you.
Just amazing, thanks a lot! Information density is incredibly high, very practical
Is there a way to use Data Dictionary to train LLM model to generate SQL queries later on?
what is the legality of using such a model in the product? can one legally sell the product in case of an acquisition? also what about underlying model support and deprecation?
great
If you are on buget don't build llm
It's a company's budget not a personal one lmao