This repository explores knowledge distillation in Vision Transformers (ViTs) with a focus on attention maps.
The student model is trained to mimic the teacher model's attention maps in addition to its outputs, encouraging it to attend to the same image regions as the teacher.
Attention maps are generated using attention rollout, providing insights into where the model focuses its attention during inference.
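
The sketch below shows one common way to compute attention rollout: average the heads in each layer, add the residual (identity) connection, re-normalize, and multiply the maps across layers. The `attentions` input (a list of per-layer `(batch, heads, tokens, tokens)` tensors) and the choice to keep only the CLS-to-patch row are assumptions for illustration, not necessarily this repository's exact implementation.

```python
import torch

def attention_rollout(attentions):
    """Fuse per-layer attention tensors into a single token-to-token map."""
    batch, _, tokens, _ = attentions[0].shape
    rollout = torch.eye(tokens, device=attentions[0].device).expand(batch, tokens, tokens)
    for attn in attentions:
        attn = attn.mean(dim=1)                              # average over heads
        attn = attn + torch.eye(tokens, device=attn.device)  # account for residual connection
        attn = attn / attn.sum(dim=-1, keepdim=True)         # re-normalize rows
        rollout = attn @ rollout                             # propagate through the layer
    # Attention of the CLS token to the image patches (drop the CLS-to-CLS entry).
    return rollout[:, 0, 1:]
```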
A distillation loss is defined over these attention maps, penalizing the student when its attention diverges from the teacher's.
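
As a rough illustration, such a loss could combine the usual cross-entropy term with an attention-matching term over the rollout maps. The MSE term, the weight `alpha`, and the function signature below are illustrative assumptions, not the repository's exact formulation.

```python
import torch.nn.functional as F

def distillation_loss(student_logits, labels, student_rollout, teacher_rollout, alpha=0.5):
    # Standard supervised loss on the ground-truth labels.
    ce = F.cross_entropy(student_logits, labels)
    # Penalize the student for attending to different patches than the teacher.
    attn = F.mse_loss(student_rollout, teacher_rollout.detach())
    return ce + alpha * attn
```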

A Deeplake/Activeloop login is required to access the ImageNet dataset.
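
A minimal sketch of streaming the dataset after logging in (e.g. via `activeloop login` or an API token) might look like the following; the dataset path `hub://activeloop/imagenet-train` and the loader settings are assumptions rather than this repository's exact configuration.

```python
import deeplake

# Requires prior Activeloop authentication for gated datasets such as ImageNet.
ds = deeplake.load("hub://activeloop/imagenet-train")
dataloader = ds.pytorch(num_workers=2, batch_size=32, shuffle=True)
```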