Chaiverse
  • Welcome to Chaiverse!
  • Quick Start
  • Prompting Guide
  • Rewards model
  • Blending Models
  • Getting Feedback
  • Talking to your model
  • Deactivating your Model
  • Leaderboard
  • 🤑Model Competition
  • How everything Works
    • 🤑Model Competition Explained
    • Rewards Model Explained
    • Prompt Guiding Explained
    • What we do with your model
Powered by GitBook
On this page
  • How it works
  • Great You're Ready to Submit your first rewards model

Rewards model

Reward Modelling is an essential part of making high-quality models.

PreviousPrompting GuideNextBlending Models

Last updated 1 year ago

Now, you will have to install the chaiverse package to submit your first model

import chaiverse as chai
from chaiverse.formatters import PromptFormatter

How it works

Great You're Ready to Submit your first rewards model

model_url = "ChaiML/phase2_winner_13b2"
reward_url = "ChaiML/reward_models_100_170000000_cp_498032"

submission_parameters = {
	"model_repo": model_url,
  "reward_repo": reward_url,
	"generation_params": {
		"temperature": 0.99,
    "top_p": 0.2,
    "top_k": 40,
    "stopping_words": ['\n'],
    "presence_penalty": 0.,
    "frequency_penalty": 0.,
    "max_input_tokens": 1024,
    "best_of": 4
	},
  'model_name': 'My-First-RwardModel',
}

Simply submit your model using the chai package

submitter = chai.ModelSubmitter(verbose=True)
submission_id = submitter.submit(submission_parameters)

Currently, only 2 models are allowed to be deployed at a time.

Rewards Model Explained
Deactivating your Model
Google Colaboratory
Notebook to submit rewards model
Logo