Add PVC as storage option for LM-Eval results #325

ruivieira · 2024-10-17T07:50:55Z

For large, full-results (not just summaries) storing the evaluation results in the CR's status is not practical.
An option should be available to persist full evaluation results to PVC.

yhwang · 2024-10-22T22:19:39Z

Here is my thought on this feature, suggestions/comments are welcome

provide an option to save the outputs, including stdout, stderr, results, and sampling log, to external storage
for external storage, we can support different types of storage, i.e. PVC, COS, etc. Let's add the PVC for now.

For the argument's sake, the new option is named Outputs and users can specify a full data struct of the PersistentVolumeClaim . LM-Eval controller would create the PVC and mount it on the /opt/app-root/src/output and all the outputs would store to the PVC. The data struct for the Outputs could be:

type Outputs struct {
	// +optional
	PersistentVolumeClaim *corev1.PersistentVolumeClaim `json:"pvc,omitempty"`
}

Only support PVC for now.

One thing worth mentioning is If users want to create a PVC by themselves and then use it in an LM-Eval job, they can do that already. Here is an example and the name of the PVC that has been created is mypvc:

apiVersion: trustyai.opendatahub.io/v1alpha1
kind: LMEvalJob
metadata:
  name: evaljob-sample
spec:
  model: hf
  modelArgs:
  - name: pretrained
    value: google/flan-t5-base
  taskList:
    taskRecipes:
    - card:
        name: "cards.wnli"
      template: "templates.classification.multi_class.relation.default"
  logSamples: true
  pod:
    volumes:
      - name: pvc
        persistentVolumeClaim:
          claimName: mypvc
    container:
      volumeMounts:
      - mountPath: "/opt/app-root/src/output"
        name: PVC

yhwang · 2024-10-23T04:06:36Z

If we want to make the case of using an existing PVC easier, the Outputs data struct could be:

type Outputs struct {
	// Use an existing PVC to store the outputs
	// +optional
	PersistentVolumeClaimName *string `json:"pvcName,omitempty"`
	// Create a PVC and use it to store the outputs
	// +optional
	PersistentVolumeClaim *corev1.PersistentVolumeClaim `json:"pvc,omitempty"`
}

When the pvcName is specified, the LM-Eval controller will create the corresponding volumes and volumeMOunts for the job pod and store the outputs in the PVC. Basically, it makes the LMEvalJob example in previous comment become this:

apiVersion: trustyai.opendatahub.io/v1alpha1
kind: LMEvalJob
metadata:
  name: evaljob-sample
spec:
  model: hf
  modelArgs:
  - name: pretrained
    value: google/flan-t5-base
  taskList:
    taskRecipes:
    - card:
        name: "cards.wnli"
      template: "templates.classification.multi_class.relation.default"
  logSamples: true
  outputs:
    pvcName: mypvc

ruivieira · 2024-10-27T18:35:51Z

Hi @yhwang, thanks for the suggestions. I think that's a good idea.
However, I've added a proposal with #347 that's slightly different, wanted to know your opinion.

IMO the two use cases (for the PVC scope, at the moment) are:

PVC managed by the operator with minimal configuration
Give user the ability to provide more specific PVC configurations

I think that adding the ability to specify a full PVC spec under the LMEvalJob CR might be avoided with your pvcName suggestion. This adds the freedom to create any custom PVC and just pass the name reference to the LMEvalJob.
For the scenario where we need a PVC to created from the LMEvalJob, we can simply use pvcManaged. wdyt?

yhwang · 2024-10-28T16:40:36Z

Your implementation looks great to me. Left a minor comment and I think it's good to go!

ruivieira added kind/enhancement New feature or request lm-eval Issues related to LM-Eval labels Oct 17, 2024

ruivieira mentioned this issue Oct 17, 2024

LM-Eval (language model evaluation) #302

Open

ruivieira added this to TrustyAI planning Oct 17, 2024

github-project-automation bot moved this to Todo in TrustyAI planning Oct 17, 2024

ruivieira added this to the LM-Eval milestone Oct 17, 2024

ruivieira moved this from Todo to Backlog in TrustyAI planning Oct 17, 2024

ruivieira moved this from Backlog to Todo in TrustyAI planning Oct 17, 2024

yhwang self-assigned this Oct 23, 2024

ruivieira moved this from Todo to In Progress in TrustyAI planning Oct 23, 2024

ruivieira mentioned this issue Oct 27, 2024

Add PVC management #347

Merged

ruivieira linked a pull request Oct 27, 2024 that will close this issue

Add PVC management #347

Merged

ruivieira moved this from In Progress to In Review in TrustyAI planning Oct 28, 2024

ruivieira closed this as completed in #347 Oct 29, 2024

github-project-automation bot moved this from In Review to Done in TrustyAI planning Oct 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add PVC as storage option for LM-Eval results #325

Add PVC as storage option for LM-Eval results #325

ruivieira commented Oct 17, 2024

yhwang commented Oct 22, 2024 •

edited

Loading

yhwang commented Oct 23, 2024 •

edited

Loading

ruivieira commented Oct 27, 2024

yhwang commented Oct 28, 2024

Add PVC as storage option for LM-Eval results #325

Add PVC as storage option for LM-Eval results #325

Comments

ruivieira commented Oct 17, 2024

yhwang commented Oct 22, 2024 • edited Loading

yhwang commented Oct 23, 2024 • edited Loading

ruivieira commented Oct 27, 2024

yhwang commented Oct 28, 2024

yhwang commented Oct 22, 2024 •

edited

Loading

yhwang commented Oct 23, 2024 •

edited

Loading