Bentune

Bentune is an instruction-tuned large language model that is fine-tuned to excel at complex mathematics and reasoning tasks.

Fine-Tuning Process

Base Model: LLaMA 3.2 3B
Instruction Tuning & Chain-of-Thought: We applied instruction tuning with chain-of-thought prompts to improve reasoning capabilities.
Compute Resources: Training conducted on ASU’s SOL supercomputing cluster powered with a couple NVIDIA A100 GPUs.
Training Framework: Built with Hugging Face Transformers, Accelerate, and PyTorch.

Datasets

gsm8k (main split)
hotpot_qa (distractor split)
mandarjoshi/trivia_qa (rc split)
sentence-transformers/natural-questions (pair split)
domenicrosati/TruthfulQA (default split)
Anthropic/hh-rlhf (default split)

BIG-Bench Subsets

boolean_expressions, causal_judgement, date_understanding, disambiguation_qa, dyck_languages, formal_fallacies, geometric_shapes, hyperbaton, logical_deduction_five_objects, logical_deduction_seven_objects, logical_deduction_three_objects, movie_recommendation, multistep_arithmetic_two, navigate, object_counting, penguins_in_a_table, reasoning_about_colored_objects, ruin_names, salient_translation_error_detection, snarks, sports_understanding, temporal_sequences, tracking_shuffled_objects_five_objects, tracking_shuffled_objects_seven_objects, tracking_shuffled_objects_three_objects, web_of_lies, word_sorting

Repositories

Team

Namita Shah · Jay Pavuluri · Evan Zhu · Navni Athale