multi-step-training #12

Open
WeihaoGe1009 opened this issue Aug 19, 2024 · 1 comment
@WeihaoGe1009
Collaborator

  1. relevance - binary (+NA?)
  2. highlight and note - unstructured text, but the highlight must be an exact substring of selftext
    • condition 1: highlight and note == None and relevance == False
    • condition 2: highlight and note != None and relevance == True
    • check that the highlight is an exact substring of selftext (would BWT facilitate this or not?)
  3. field - categorical
    • train based on original text ( + masked_author_id?)
    • train based on highlight and notes ( + background?)
  4. background - categorical or unstructured text?
  5. train conclusion
    • has a misunderstanding, has vague knowledge, or is correct
      • binary: (misunderstanding + vague knowledge) vs. correct
      • double binary: 10, 01, 00 (is 11 possible?)
    • which misunderstanding/vague knowledge - categorical or unstructured text
      • add a layer/prompt: try to put the conclusion into an existing category; if none fits, create a new category
      • in this case, training a binary prediction is not suitable. What score and what threshold can tell us that the categorical prediction is not working and that a new category needs to be created?
      • a small piece of code to create a new category
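
A minimal sketch of the consistency check from item 2, assuming each record carries `selftext`, `relevance`, `highlight`, and `note`. The function name `validate_annotation` is hypothetical, and reading "highlight and note != None" as "both are set" is an assumption. The possible NA value for relevance is not handled here.

```python
from typing import List, Optional


def validate_annotation(
    selftext: str,
    relevance: bool,
    highlight: Optional[str],
    note: Optional[str],
) -> List[str]:
    """Return the problems found in one record (empty list if it is consistent)."""
    problems = []
    has_highlight = highlight is not None
    has_note = note is not None

    if relevance:
        # condition 2 (assumed reading): a relevant record needs both fields
        if not (has_highlight and has_note):
            problems.append("relevance is True but highlight or note is missing")
    else:
        # condition 1: an irrelevant record must have neither field
        if has_highlight or has_note:
            problems.append("relevance is False but highlight or note is set")

    # exact-substring check; Python's `in` performs a plain substring search
    if has_highlight and highlight not in selftext:
        problems.append("highlight is not an exact substring of selftext")

    return problems
```

For a single highlight per post, the built-in `in` check is probably enough. A prebuilt index would only start to matter if many highlights had to be looked up against the same long text.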
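
One possible shape for the "small piece of code to create a new category" in item 5, as a sketch only. It assumes the classifier (or prompt) returns a per-category confidence in [0, 1] and that a candidate label for the new category is supplied from outside. `assign_category`, `scores`, `new_label`, and the default threshold of 0.5 are all hypothetical placeholders; the open question of which score and threshold to use is not answered here and would have to be tuned on held-out data.

```python
from typing import Dict, List, Tuple


def assign_category(
    scores: Dict[str, float],
    categories: List[str],
    new_label: str,
    threshold: float = 0.5,  # placeholder value, not a recommendation
) -> Tuple[str, bool]:
    """Pick the best-scoring existing category, or fall back to new_label.

    Returns (chosen_category, created_new). `categories` is extended in place
    when a new category is created.
    """
    best, best_score = None, float("-inf")
    for cat in categories:
        score = scores.get(cat, 0.0)  # a missing score counts as 0
        if score > best_score:
            best, best_score = cat, score

    # keep the existing category only if the model is confident enough
    if best is not None and best_score >= threshold:
        return best, False

    # otherwise register the proposed label as a new category
    created = new_label not in categories
    if created:
        categories.append(new_label)
    return new_label, created
```

Usage: `assign_category({"a": 0.3, "b": 0.2}, ["a", "b"], "c")` falls below the threshold for every existing category, so it appends `"c"` and returns `("c", True)`.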
@WeihaoGe1009 WeihaoGe1009 self-assigned this Aug 19, 2024
@WeihaoGe1009 WeihaoGe1009 added the enhancement New feature or request label Aug 19, 2024
@WeihaoGe1009
Collaborator Author

Simplifying: get the simple sentence training done first.
Then we can take the sentence prediction results as annotations to train on the highlighted sentences.
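
A rough sketch of how the second step could look, assuming the first-stage model is available as a callable that scores one sentence at a time. `split_sentences`, `pseudo_annotate`, `predict`, and the 0.5 threshold are hypothetical, and the regex splitter is deliberately naive.

```python
import re
from typing import Callable, List, Tuple


def split_sentences(text: str) -> List[str]:
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def pseudo_annotate(
    selftext: str,
    predict: Callable[[str], float],  # hypothetical stage-1 sentence scorer
    threshold: float = 0.5,           # placeholder value
) -> List[Tuple[str, int]]:
    """Label each sentence 1/0 from the stage-1 score, for use as stage-2 annotation."""
    return [(s, int(predict(s) >= threshold)) for s in split_sentences(selftext)]
```

Because the splitter only cuts the text and never rewrites it, every returned sentence is still an exact substring of `selftext`, which matches the constraint on highlights.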
