Keeping imagination
is hard.
Find my publications, projects, essays, and personal information.
What’s new on my publication
All papers are accepted by international conferences or journals.
What’s new on my Collection
Projects, datasets, and related resources are open-sourced on GitHub and Hugging Face.
Dataset NBA Games
A full-game NBA video metadata dataset with 189 verified YouTube games linked to official NBA.com box scores and play-by-play annotations for long-video and multimodal sports research.
Dataset ICCV Papers
A comprehensive ICCV paper dataset from 2013 to present, including metadata, abstracts, BibTeX records, download links, and full-paper PDFs for computer vision research.
Dataset Dog100K
A large-scale, high-quality dog image-text alignment dataset with 103,508 image-text pairs for multimodal learning, retrieval, captioning, and conditional generation.
What’s new on my Blog
Notes on computer vision, natural language processing, multimodal learning, and research practice.