This repository contains the code and data for the paper *Distilling Fine-grained Sentiment Understanding from Large Language Models*, a method for distilling fine-grained sentiment understanding from large language models (LLMs) into small language models (SLMs).
Fine-grained sentiment analysis (FSA) aims to extract and summarize user opinions from vast opinionated text. Recent studies demonstrate that large language models (LLMs) possess exceptional sentiment understanding capabilities. However, directly deploying LLMs for FSA applications incurs high inference costs. Therefore, this paper investigates the distillation of fine-grained sentiment understanding from LLMs into small language models (SLMs). We prompt LLMs to examine and interpret the sentiments of given reviews and then utilize the generated content to pretrain SLMs. Additionally, we develop a comprehensive FSA benchmark to evaluate both SLMs and LLMs. Extensive experiments on this benchmark reveal that: (1) distillation significantly enhances the performance of SLMs in FSA tasks, achieving a 6.00% improvement in F1-score, and the distilled model can outperform Llama-2-7b with only 220M parameters; (2) distillation equips SLMs with excellent zero-shot sentiment classification capabilities, enabling them to match or even exceed their teacher models. These results suggest that distillation from LLMs is a highly promising direction for FSA.
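The core recipe is: prompt a teacher LLM to examine and interpret the sentiments in raw reviews, then pretrain a T5 student on the resulting (review, interpretation) pairs. Below is a minimal sketch of that idea; the prompt wording and example data are illustrative assumptions, not the repository's actual templates (those live under `./prompting` and `./pre-training`).

```python
# Minimal sketch of the distillation recipe (illustrative only; see ./prompting and
# ./pre-training for the actual prompts and training scripts).
from transformers import T5ForConditionalGeneration, T5Tokenizer

def interpretation_prompt(review: str) -> str:
    # Ask the teacher LLM to examine and interpret the review's fine-grained sentiments.
    # The wording here is an assumption, not the paper's exact template.
    return (
        "Identify the aspects mentioned in the review below and explain the "
        f"sentiment expressed toward each aspect.\n\nReview: {review}"
    )

def pretraining_step(model, tokenizer, review: str, interpretation: str):
    # Student objective: generate the teacher's interpretation from the raw review
    # (a standard seq2seq loss; the real training script is pre-training/seq2seq.py).
    inputs = tokenizer(review, return_tensors="pt", truncation=True)
    labels = tokenizer(interpretation, return_tensors="pt", truncation=True).input_ids
    return model(**inputs, labels=labels).loss

tokenizer = T5Tokenizer.from_pretrained("google-t5/t5-base")
model = T5ForConditionalGeneration.from_pretrained("google-t5/t5-base")

review = "The battery lasts all day, but the screen is too dim."
# In practice this text would come from the teacher LLM via interpretation_prompt(review).
interpretation = "The reviewer is positive about the battery and negative about the screen."

loss = pretraining_step(model, tokenizer, review, interpretation)
loss.backward()  # plug into an optimizer / Trainer loop for real pretraining
```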
```
.
├── README.md
├── evaluation              # evaluation code
│   ├── acsa.py
│   ├── atsa.py
│   ├── bash
│   ├── output
│   └── utils
├── fsa_datasets            # fsa datasets
│   └── w_hard
├── parse_performance.ipynb # parse result
├── pre-training            # distillation code
│   ├── bash
│   ├── output_model
│   ├── seq2seq.py
│   └── utils
├── pretrained_models       # base model
├── prompting               # sentiment understanding corpus
│   ├── data
│   └── test
└── requirements.txt
```
Download Gporrt/sentiment-understanding-corpus from HuggingFace and place it in the `./prompting/data` directory.

Download the google-t5/t5-base model and place it in the `./pretrained_models` directory.
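If you prefer to script the downloads, here is a sketch using `huggingface_hub`; the repo type of the corpus and the target subfolder for the model are assumptions, so adjust them to match your setup.

```python
# Hypothetical download helper; adjust the local_dir layout if needed.
from huggingface_hub import snapshot_download

# Sentiment-understanding corpus -> ./prompting/data (assumed to be a dataset repo)
snapshot_download(
    repo_id="Gporrt/sentiment-understanding-corpus",
    repo_type="dataset",
    local_dir="./prompting/data",
)

# Base model -> ./pretrained_models/t5-base (subfolder name is an assumption)
snapshot_download(
    repo_id="google-t5/t5-base",
    local_dir="./pretrained_models/t5-base",
)
```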
Navigate to the `evaluation` directory and run the following commands:

```bash
cd ./evaluation
# Run pretraining with the distillation corpus
bash/pt_eval/v7.9.sh -c ${CUDA_IDS}
```
**Important:** Before proceeding, update the model version and path mappings in `evaluation/bash/model_name.json`.
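The exact schema of `model_name.json` is defined by the evaluation scripts; purely as an illustration, the mapping might look something like the following (the version keys and paths below are hypothetical):

```json
{
  "v7.9": "../pre-training/output_model/v7.9",
  "t5-base": "../pretrained_models/t5-base"
}
```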
Run the following commands for parallel multi-seed fine-tuning:

```bash
cd ./evaluation
# Fine-tune T5 with parallel processing (n = number of parallel processes)
bash/atsa_batch_parallel.sh -c ${CUDA_IDS} -b ${model_version} -n 3
bash/acsa_batch_parallel.sh -c ${CUDA_IDS} -b ${model_version} -n 2
```
Use the `./parse_performance.ipynb` notebook to analyze the results. Make sure to:

- Replace the `path` variable with your actual path.
- Replace the `version` variable with your model version.
We release the pretrained distilled models below:

| Model | Base Model | Download Link |
|---|---|---|
| t5-sentiment-base | t5-base | HuggingFace |
| t5-sentiment-large | t5-large | HuggingFace |
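The distilled checkpoints load like any other T5 model. A hedged usage sketch follows; the checkpoint path is a placeholder (use the actual path or Hub ID from the links above), and the exact task-specific input/output formats are defined by the evaluation scripts, not by this snippet.

```python
# Illustrative inference with a distilled checkpoint; replace the placeholder path
# with the downloaded t5-sentiment-base or t5-sentiment-large checkpoint.
from transformers import T5ForConditionalGeneration, T5Tokenizer

checkpoint = "path/to/t5-sentiment-base"  # placeholder
tokenizer = T5Tokenizer.from_pretrained(checkpoint)
model = T5ForConditionalGeneration.from_pretrained(checkpoint)

review = "The food was delicious but the service was slow."
inputs = tokenizer(review, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```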
If you use our framework or data, please consider citing us:
```bibtex
@misc{zhang2024distillingfinegrainedsentimentunderstanding,
      title={Distilling Fine-grained Sentiment Understanding from Large Language Models},
      author={Yice Zhang and Guangyu Xie and Hongling Xu and Kaiheng Hou and Jianzhu Bao and Qianlong Wang and Shiwei Chen and Ruifeng Xu},
      year={2024},
      eprint={2412.18552},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2412.18552},
}
```