require -l in run.cpp #594
Conversation
Does it actually matter what it is? Or is it just the tokenizer anyway? Maybe don't call it model if it's really the tokenizer?
@mikekgfb it's the model in the sense that it's used to select the tokenizer class, prompt structure, and default vocab size, all of which differ between llama2 and llama3.
@larryliu0820 and I were discussing whether the tokenizer should get the vocab size from the tokenizer.model file, but that's not how the tokenizers work today.
After #626 lands, the tokenizer will get the vocab size from the *.model file. So we should be able to automatically deduce llama 2 vs. 3 from a tokenizer load failure, and clean up some of the edge cases in the runner.
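A rough sketch of what that deduction could look like, assuming the loader functions below throw on a format mismatch (these are hypothetical stand-ins, not the actual torchchat tokenizer API):

```cpp
// Hypothetical sketch: deduce the llama version from the tokenizer file instead
// of relying on a -l default. The loader functions are stand-ins, not the real
// torchchat tokenizer API; assume each throws if the file is not in its format
// and otherwise returns the vocab size it read.
#include <stdexcept>
#include <string>

int load_llama3_tokenizer(const std::string& path);  // tiktoken-style *.model
int load_llama2_tokenizer(const std::string& path);  // sentencepiece-style *.model

struct TokenizerInfo {
  int llama_version;
  int vocab_size;
};

TokenizerInfo detect_tokenizer(const std::string& path) {
  try {
    // Try the llama3 tokenizer first; a parse failure suggests a llama2 file.
    return {3, load_llama3_tokenizer(path)};
  } catch (const std::runtime_error&) {
    return {2, load_llama2_tokenizer(path)};
  }
}
```

If something like this works out, the runner could drop the -l default entirely and only keep the flag as an override.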
Agreed. Let's make sure changing this to remove the default doesn't break documented workflows. |
Thank you!
@metascroy do we still need to land this, or did you already land a version of this?
Dr. CI reported 5 new failures as of commit 84d8e5c with merge base a80ae4d. Test results: hud.pytorch.org/pr/pytorch/torchchat/594
Closing stale PR.
We pass the llama version to run.cpp with the -l arg. Today it has a default value of 2 (for llama2), but I don't think it should have a default value at all.
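For illustration, here is a minimal sketch of what requiring -l could look like in a run.cpp-style argument loop. The variable names and the error_usage() helper are hypothetical, not the actual torchchat code:

```cpp
// Hypothetical sketch: require -l instead of silently defaulting to llama2.
#include <cstdio>
#include <cstdlib>
#include <cstring>

static void error_usage() {
  fprintf(stderr, "Usage: run <model.pte> -l <llama_version: 2|3> [options]\n");
  exit(EXIT_FAILURE);
}

int main(int argc, char* argv[]) {
  int llama_version = 0;  // 0 = unset; no implicit default of 2
  // Assume argv[1] is the model path and the remaining args come in flag/value pairs.
  for (int i = 2; i + 1 < argc; i += 2) {
    if (strcmp(argv[i], "-l") == 0) {
      llama_version = atoi(argv[i + 1]);
    }
  }
  if (llama_version != 2 && llama_version != 3) {
    // Fail loudly instead of assuming llama2.
    error_usage();
  }
  printf("llama version: %d\n", llama_version);
  return 0;
}
```

With no implicit default, running a llama3 tokenizer without passing -l 3 fails at the usage check instead of silently tokenizing with the wrong vocab.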