I am a first-year PhD student in the Department of Computer Science at the University of North Carolina at Chapel Hill. My research interest lies in computer vision and machine learning. Particularly, I am interested in video understanding, multi-modal deep learning, and transfer learning. Currently, I am working on developing efficient deep learning architectures for long-range video modeling. I work with professor Gedas Bertasius.

Before joining UNC, I worked at Samsung Research, Bangladesh. I also worked as a lecturer at the University of Asia Pacific.

Download my resumé.

  • Machine Learning
  • Computer Vision
  • Video Understanding
  • Multi-modal deep learning
  • PhD student, Computer Science, 2021-present

    University of North Caroliona at Chapel Hill

  • BSc in Computer Science, 2018

    Bangladesh University of Engineering and technology


Video Longformer
A novel transformer-based video recognition framework that can model videos up to several minutes long and capture long-range global properties of the video.
Towards Visual Content Generation from Audio for Scene Understanding.


May 2019 – Dec 2020 Bangladesh
Software Engineer
Nov 2018 – Apr 2019 Bangladesh