databricks/databricks-ml-examples
is a repository to show machine learning examples on Databricks platforms.
Currently this repository contains:
llm-models/
: Example notebooks to use different State of the art (SOTA) models on Databricks.
Databricks works with thousands of customers to build generative AI applications. While you can use Databricks to work with any generative AI model, including commercial and research, the table below lists our current model recommendations* for popular use cases. Note: The table only lists open source models that are for free commercial use.
Use case | Quality-optimized | Balanced | Speed-optimized |
---|---|---|---|
Text generation following instructions | MPT-30B-Instruct Llama-2-70b-chat-hf Falcon-40B-Instruct |
MPT-7B-Instruct Llama-2-7b-chat-hf Llama-2-13b-chat-hf Falcon-7B-Instruct |
|
Text embeddings (English only) | bge-large-en(1.3B) instructor-xl (1.3B) |
bge-base-en (0.4B) e5-large-v2 (0.3B) |
bge-small-en (0.1B) e5-base-v2 (0.1B) |
Transcription (speech to text) | whisper-large-v2 (1.6B) whisper-medium (0.8B, English only) |
||
Image generation | stable-diffusion-2-1 | ||
Code generation | StarCoderBase (16B) StarCoder (16B, Python optimized) |
replit-code-v1-3b |