Use “/api/infill” instead of “/api/generate” #61

zhengxs2018 · 2024-06-26T15:57:23Z

Validations

I'm not able to find an open issue that requests the same enhancement

Problem

The endpoints "/chat/completions" or "/api/generate" are well-suited for writing test cases or generating complete code snippets.

Writing tests:

write a unit test for this function: $(cat example.py)

Code completions:

# A simple python function to remove whitespace from a string:

However, these endpoints are not very effective when dealing with non-standard or incomplete code.

FIM (Fill-in-the-Middle) is a specialized prompting format supported by code completion models, allowing completion of code between two pre-written code segments.

<PRE> def compute_gcd(x, y): <SUF>return result <MID>

<PRE>, <SUF> and <MID> are special tokens that guide the model.

The challenge lies in the fact that different models use different special tokens for this purpose. Developers from llama.cpp and ollama have already identified this issue.

Links:

Solution

No response

The text was updated successfully, but these errors were encountered:

zhengxs2018 added the enhancement New feature or request label Jun 26, 2024

zhengxs2018 assigned zhengxs2018 and CGQAQ Jun 26, 2024

JohnSmithToYou mentioned this issue Jun 27, 2024

Request: Deepseek Coder V2 model TabbyML/tabby#2451

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use “/api/infill” instead of “/api/generate” #61

Use “/api/infill” instead of “/api/generate” #61

zhengxs2018 commented Jun 26, 2024

Use “/api/infill” instead of “/api/generate” #61

Use “/api/infill” instead of “/api/generate” #61

Comments

zhengxs2018 commented Jun 26, 2024

Validations

Problem

Solution