-
I am trying to use openbmb/MiniCPM3-4B-GGUF, but the model doesn't stop generating output. After checking the documentation, I learned that I need to customize the chat wrapper. But how should I define the format? Can you provide some help?
-
I just tested this model and it worked as expected.
You can run this command to test it yourself:
npx -y node-llama-cpp chat --prompt 'Hi there!' https://huggingface.co/openbmb/MiniCPM3-4B-GGUF/blob/main/minicpm3-4b-q4_k_m.gguf
If you could share the code you've used and the result of running the
npx -y node-llama-cpp inspect gpu
command, I can try to help you find what the issue is.
I also recommend scaffolding a new project and running it with this model.
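As for customizing the chat wrapper: MiniCPM3's prompt template appears to be ChatML-style (it uses <|im_start|>/<|im_end|> markers), so if generation doesn't stop in your own code, explicitly passing the built-in ChatMLChatWrapper may be enough. Here's a minimal sketch, assuming node-llama-cpp v3 and a local copy of the GGUF file (the model path below is a placeholder):

```typescript
import {fileURLToPath} from "url";
import path from "path";
import {getLlama, LlamaChatSession, ChatMLChatWrapper} from "node-llama-cpp";

const __dirname = path.dirname(fileURLToPath(import.meta.url));

const llama = await getLlama();
const model = await llama.loadModel({
    // placeholder path - point this at your local copy of the GGUF file
    modelPath: path.join(__dirname, "models", "minicpm3-4b-q4_k_m.gguf")
});
const context = await model.createContext();

// MiniCPM3's template appears to be ChatML-style, so forcing the ChatML
// wrapper should make generation stop at the <|im_end|> marker
const session = new LlamaChatSession({
    contextSequence: context.getSequence(),
    chatWrapper: new ChatMLChatWrapper()
});

const answer = await session.prompt("Hi there!");
console.log(answer);
```

If that wrapper still doesn't match what the model expects, node-llama-cpp also ships a TemplateChatWrapper that lets you define the prompt format yourself; the chat wrapper section of the documentation covers the exact options.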