Support OLMo models. #2832

Isotr0py · 2024-02-10T12:38:59Z

Related issue:

Support for new OLMo models #2763

TODO:

Add model config OLMoConfig
Test on OLMo-1B
Test on OLMo-7B/7B-Twin-2T
Format code

~~This is still in progress before all developments and tests finish.~~
Done.

Isotr0py · 2024-02-15T12:49:55Z

vllm/model_executor/models/olmo.py

+class SwiGLU(nn.Module):
+
+    def forward(self, x: torch.Tensor) -> torch.Tensor:
+        x, gate = x.chunk(2, dim=-1)
+        return F.silu(gate) * x
+
+    @property
+    def output_multiplier(self) -> float:
+        return 0.5


It seems that SwiGLU activation used in olmo is different from the SiluAndMul in vllm:

class SiluAndMul(nn.Module): """An activation function for SwiGLU. The function computes x -> silu(x[:d]) * x[d:] where d = x.shape[-1] // 2. Shapes: x: (batch_size, seq_len, 2 * d) or (num_tokens, 2 * d) return: (batch_size, seq_len, d) or (num_tokens, d) """ def _forward(self, x: torch.Tensor) -> torch.Tensor: """PyTorch-native implementation equivalent to forward().""" d = x.shape[-1] // 2 return F.silu(x[..., :d]) * x[..., d:] def forward(self, x: torch.Tensor) -> torch.Tensor: d = x.shape[-1] // 2 output_shape = (x.shape[:-1] + (d, )) out = torch.empty(output_shape, dtype=x.dtype, device=x.device) ops.silu_and_mul(out, x) return out

zhuohan123

LGTM! Thank you for your contribution!

This version is for more model support. Add support for Gemma models (#2964) and OLMo models (#2832).

This version is for more model support. Add support for Gemma models (vllm-project#2964) and OLMo models (vllm-project#2832).

Isotr0py added 6 commits February 9, 2024 16:31

initialize olmo model

985c87d

add olmo loader

ed801bc

fix model running

b470160

Fix activation error

342394e

add olmo config

e471d57

fix a mistake

5bd459e

saattrupdan mentioned this pull request Feb 12, 2024

[MODEL EVALUATION REQUEST] OLMo models ScandEval/ScandEval#206

Closed

1 task

Isotr0py and others added 4 commits February 15, 2024 10:49

fix weight tying

070597c

fix config import

e6292eb

Merge branch 'vllm-project:main' into olmo

ddc5b25

format olmo code

8b4a899

Isotr0py commented Feb 15, 2024

View reviewed changes

Isotr0py marked this pull request as ready for review February 15, 2024 12:50

Isotr0py and others added 3 commits February 17, 2024 23:43

Merge branch 'vllm-project:main' into olmo

b56ff03

Merge branch 'main' into olmo

55d1251

change docs and readme

4ee4c16

zhuohan123 approved these changes Feb 19, 2024

View reviewed changes

zhuohan123 merged commit ab3a5a8 into vllm-project:main Feb 19, 2024
7 of 10 checks passed

Isotr0py deleted the olmo branch February 19, 2024 09:08

zhuohan123 added a commit that referenced this pull request Feb 21, 2024

Bump up version to v0.3.2

9e38ef6

This version is for more model support. Add support for Gemma models (#2964) and OLMo models (#2832).

zhuohan123 mentioned this pull request Feb 21, 2024

Bump up version to v0.3.2 #2968

Merged

simon-mo pushed a commit that referenced this pull request Feb 21, 2024

Bump up version to v0.3.2 (#2968)

8fbd84b

This version is for more model support. Add support for Gemma models (#2964) and OLMo models (#2832).

xjpang pushed a commit to xjpang/vllm that referenced this pull request Feb 22, 2024

Support OLMo models. (vllm-project#2832)

c26cf98

xjpang pushed a commit to xjpang/vllm that referenced this pull request Feb 22, 2024

Bump up version to v0.3.2 (vllm-project#2968)

70d19ef

This version is for more model support. Add support for Gemma models (vllm-project#2964) and OLMo models (vllm-project#2832).

andy-neuma mentioned this pull request Feb 23, 2024

andy/bump main to v0.3.2 neuralmagic/nm-vllm#49

Closed

xjpang pushed a commit to xjpang/vllm that referenced this pull request Mar 4, 2024

Support OLMo models. (vllm-project#2832)

6fa8f05

xjpang pushed a commit to xjpang/vllm that referenced this pull request Mar 4, 2024

Bump up version to v0.3.2 (vllm-project#2968)

1176f2f

This version is for more model support. Add support for Gemma models (vllm-project#2964) and OLMo models (vllm-project#2832).

Temirulan pushed a commit to Temirulan/vllm-whisper that referenced this pull request Sep 6, 2024

Support OLMo models. (vllm-project#2832)

c5114be

Temirulan pushed a commit to Temirulan/vllm-whisper that referenced this pull request Sep 6, 2024

Bump up version to v0.3.2 (vllm-project#2968)

1f20669

This version is for more model support. Add support for Gemma models (vllm-project#2964) and OLMo models (vllm-project#2832).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support OLMo models. #2832

Support OLMo models. #2832

Isotr0py commented Feb 10, 2024 •

edited

Loading

Isotr0py Feb 15, 2024

zhuohan123 left a comment

Support OLMo models. #2832

Support OLMo models. #2832

Conversation

Isotr0py commented Feb 10, 2024 • edited Loading

Isotr0py Feb 15, 2024

Choose a reason for hiding this comment

zhuohan123 left a comment

Choose a reason for hiding this comment

Isotr0py commented Feb 10, 2024 •

edited

Loading