Prompt Order Experiment

Steps

Dataset Selection

We begin with the layoric/labeled-multiple-choice-explained dataset, which includes reasoning explanations generated by GPT-3.5-turbo. These explanations serve as a starting point, but they may differ from Mistral's reasoning style.
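To get oriented, the source dataset can be loaded and inspected with the `datasets` library. A minimal sketch, assuming a `train` split exists; the column names are printed rather than assumed:

```python
from datasets import load_dataset

# Load the source dataset of multiple-choice questions with
# GPT-3.5-turbo explanations (assumes a "train" split).
ds = load_dataset("layoric/labeled-multiple-choice-explained", split="train")

print(ds.column_names)  # see which fields are available
print(ds[0])            # inspect one example row
```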

  1. 00-poe-generate-mistral-reasoning.ipynb: To align the explanations with Mistral's own reasoning style, we create a refined dataset, derek-thomas/labeled-multiple-choice-explained-mistral-reasoning (sketched below).
  2. 01-poe-dataset-creation.ipynb: We then create the datasets for our prompt-order experiments (sketched below).
  3. 02-autotrain.ipynb: We launch AutoTrain jobs on Hugging Face Spaces to train our models.
  4. 03-poe-token-count-exploration.ipynb: We run a quick token-count analysis so we can optimize our TGI settings (sketched below).
  5. 04-poe-eval.ipynb: Finally, we evaluate our trained models (a scoring sketch follows the flowchart section).
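As a rough illustration of step 1, reasoning can be regenerated with Mistral via `huggingface_hub`'s `InferenceClient`. This is a minimal sketch, not the notebook's actual code: the prompt wording and the `question`/`answer` parameters are hypothetical.

```python
from huggingface_hub import InferenceClient

client = InferenceClient("mistralai/Mistral-7B-Instruct-v0.3")

def mistral_reasoning(question: str, answer: str) -> str:
    # Ask Mistral to justify the known-correct answer so the fine-tuning
    # data matches its own reasoning style (prompt wording is hypothetical).
    messages = [{
        "role": "user",
        "content": (
            f"Question: {question}\n"
            f"Correct answer: {answer}\n"
            "Explain step by step why this answer is correct."
        ),
    }]
    response = client.chat_completion(messages, max_tokens=512)
    return response.choices[0].message.content
```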
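The heart of step 2 is varying where the reasoning sits relative to the final answer. Below is a minimal sketch of two such orderings; the variant names and the `REASONING:`/`ANSWER:` tags are illustrative, not the notebook's exact labels.

```python
def build_targets(reasoning: str, answer: str) -> dict[str, str]:
    # Two completion formats for the same example, identical in content
    # and differing only in the order of reasoning and answer.
    return {
        "reasoning_then_answer": f"REASONING: {reasoning}\nANSWER: {answer}",
        "answer_then_reasoning": f"ANSWER: {answer}\nREASONING: {reasoning}",
    }
```

Fine-tuning one model per ordering on otherwise identical data is what isolates the effect of prompt order.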
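For step 4, knowing the longest tokenized prompt lets you size TGI's maximum input and total token limits without over-allocating. A sketch under stated assumptions: the split name and the `prompt` column are guesses, not the dataset's documented schema.

```python
from datasets import load_dataset
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-Instruct-v0.3")
ds = load_dataset(
    "derek-thomas/labeled-multiple-choice-explained-mistral-reasoning",
    split="train",  # assumed split name
)

# "prompt" is an assumed column name; use whichever field holds the full prompt.
lengths = [len(tokenizer(row["prompt"]).input_ids) for row in ds]
print(f"max prompt tokens: {max(lengths)}")
# max(lengths) plus the generation budget informs TGI's
# max-input / max-total token settings.
```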

The flowchart is clickable; its nodes are grouped here by type.

Datasets

- layoric/labeled-multiple-choice-explained (source, with GPT-3.5-turbo reasoning)
- derek-thomas/labeled-multiple-choice-explained-mistral-reasoning
- derek-thomas/labeled-multiple-choice-explained-mistral-tokenized
- derek-thomas/labeled-multiple-choice-explained-mistral-results

Models

- Base model: mistralai/Mistral-7B-Instruct-v0.3
- Fine-tuned models
- Deployment config (TGI settings)

Notebooks

- 00-poe-generate-mistral-reasoning.ipynb
- 01-poe-dataset-creation.ipynb
- 02-autotrain.ipynb
- 03-poe-token-count-exploration.ipynb
- 04-poe-eval.ipynb
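Step 5 ultimately reduces to comparing each fine-tuned model's predicted answer against the gold label, with the outcomes collected in derek-thomas/labeled-multiple-choice-explained-mistral-results. A minimal scoring sketch; the normalization is an assumption about how answers are formatted, not the notebook's exact logic.

```python
def accuracy(predictions: list[str], references: list[str]) -> float:
    # Compare normalized answer strings, e.g. bare answer letters like "A".
    correct = sum(
        pred.strip().upper() == ref.strip().upper()
        for pred, ref in zip(predictions, references)
    )
    return correct / len(references)

print(accuracy(["A", "c", "B"], ["A", "C", "D"]))  # 0.666...
```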