Implemented baseline models to compare fine-tuning strategies (last-layer vs. full-network) for transfer learning.
Formalized mathematical definitions for transfer learning, clustering algorithms, and attention masking in the Methods section.
Assisted with LaTeX typesetting and document formatting.