AYA Prompt-Based Classification and Evaluation

This repository provides a framework for prompt-based classification using pre-trained language models, with a focus on Persian text classification tasks. It includes scripts and notebooks for generating prompts, fine-tuning prompts for classification, evaluating results, and analyzing model performance metrics such as F1 score, precision, and recall. The repository also supports K-shot learning to enhance model adaptability by incorporating relevant examples.

Project Structure

Codes: Contains the core code and notebooks for model training, prompt generation, and evaluation.
- AYA-Colab.ipynb: Main notebook for training and fine-tuning prompts with AYA models on Colab.
- Classification_report.ipynb: Generates classification metrics, including F1 score, precision, and recall for different prompt setups.
- Creating_dataset.ipynb: Data preparation and dataset creation for prompt-based learning.
- f1-calculation.py: Python script to calculate and visualize F1 scores.
- news-aya-symbol-tuning.ipynb: Notebook for symbol-based tuning with AYA models for text classification.
- news-aya-system-user-prompt.ipynb: Script for generating system and user prompts using a pre-trained language model.
- Symbol_tuning_aya.ipynb: Symbol tuning notebook for optimizing prompt effectiveness.
Datasets: Contains datasets used for training and evaluation.
Prompts: Contains prompt templates used for various classification tasks.
Slides: Documentation and presentation files explaining in-context learning, prompt design, K-shot learning, and symbol tuning.
- In-Context Learning.pptx & In-Context Learning.pdf: Details on using in-context learning for model tuning.
- System-User Prompt Design.pptx & System-User Prompt Design.pdf: Guide for designing system and user prompts.
- Symbol Tuning.pptx & Symbol Tuning.pdf: Instructions on using symbol tuning to improve prompt performance.

Key Features

Prompt-Based Classification: Framework to classify text using prompts with a language model. The system allows dynamic generation of prompts, integrating user-defined inputs and system prompts for flexible text classification.
K-Shot Learning: Supports K-shot learning where the model is provided with K relevant examples to improve performance on specific tasks.
Evaluation Metrics: Provides tools for comprehensive evaluation, including accuracy, F1 score, precision, and recall. Results are saved and can be visualized through confusion matrices and classification reports.
Symbol Tuning: Techniques to adjust and refine prompts by using symbols and other prompt-based modifications, enhancing model responsiveness to specific queries.
In-Context Learning: Documentation and support for in-context learning to improve prompt-based model adaptability with examples in the prompt context.

Setup Instructions

Clone the repository:

git clone https://github.com/ShayanSalehi81/BachelorProject
cd BachelorProject

Install the required packages:
```
pip install -r requirements.txt
```
Authenticate with Hugging Face (if necessary) and install additional libraries:
```
huggingface-cli login --token YOUR_HUGGINGFACE_TOKEN
```
Run any of the notebooks or Python scripts in the Codes directory to perform tasks such as dataset creation, prompt tuning, or evaluation.

Usage

Generating Prompts and Running Classification

news-aya-system-user-prompt.ipynb: This notebook provides an end-to-end pipeline for generating system and user prompts and performing classification on news datasets. The Generator class loads a pre-trained language model, formats prompts, and generates predictions. The script supports 4-bit quantization for efficient memory usage and leverages user-provided prompts to classify Persian news data as "important" or "not important."

Evaluation and Metrics

Classification_report.ipynb: Evaluates model performance with metrics such as accuracy, precision, recall, and F1 score. It includes K-fold cross-validation and produces detailed classification reports.
f1-calculation.py: Calculates and visualizes F1 scores for classification results, with category-wise breakdowns. Confusion matrices and summary tables can be generated to understand model performance across categories.

K-Shot Learning

The prompt generation pipeline supports K-shot learning, where K most similar examples are retrieved from the training set using TF-IDF similarity. This enhances prompt-based classification by providing the model with contextually relevant examples.

Symbol Tuning

Notebooks like news-aya-symbol-tuning.ipynb and Symbol_tuning_aya.ipynb are designed to fine-tune prompt symbols, which can improve model interpretability and response consistency. Symbol tuning introduces minor adjustments to the prompts, enhancing the model's comprehension of nuanced queries.

Example Workflow

Data Preparation: Use Creating_dataset.ipynb to preprocess and format your dataset.
Prompt Generation: Load news-aya-system-user-prompt.ipynb to define system and user prompts, and run classification on the dataset.
Evaluation: Use Classification_report.ipynb to calculate metrics like accuracy and F1 score and f1-calculation.py to visualize performance.
Symbol Tuning: Run news-aya-symbol-tuning.ipynb to refine prompt design with symbol tuning.

Future Enhancements

Prompt Optimization: Further refine prompt generation methods to support more complex classification tasks.
Fine-Tuning: Incorporate model fine-tuning on custom datasets to improve model adaptability.
Extended K-Shot Learning: Experiment with variable K values to optimize in-context learning.
Symbol Tuning Enhancements: Extend symbol tuning techniques to handle a broader range of tasks and user contexts.

License

This project is licensed under the MIT License.

Contributing

Contributions are welcome! Feel free to submit issues, feature requests, or pull requests to enhance this project.

Name		Name	Last commit message	Last commit date
Latest commit History 89 Commits
Codes		Codes
Datasets		Datasets
Prompts		Prompts
Report/BachelorThesis		Report/BachelorThesis
Slides		Slides
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AYA Prompt-Based Classification and Evaluation

Project Structure

Key Features

Setup Instructions

Usage

Generating Prompts and Running Classification

Evaluation and Metrics

K-Shot Learning

Symbol Tuning

Example Workflow

Future Enhancements

License

Contributing

About

Releases

Packages

Languages

License

ShayanSalehi81/BachelorProject

Folders and files

Latest commit

History

Repository files navigation

AYA Prompt-Based Classification and Evaluation

Project Structure

Key Features

Setup Instructions

Usage

Generating Prompts and Running Classification

Evaluation and Metrics

K-Shot Learning

Symbol Tuning

Example Workflow

Future Enhancements

License

Contributing

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages