Computer Science: COMS W 4995 006
COMS W 4995 006 (3 pts)
Peter Belhumeur pb2019 C002442097
Location: 413 Kent Hall
Recent advances in Deep Learning have propelled Computer Vision forward. Applications such as image recognition and search, unconstrained face recognition, and image and video captioning which only recently seemed decades off, are now being realized and deployed at scale. This course will look at the advances in computer vision and machine learning that have made this possible. In particular we will look at Convolutional Neural Nets (CNNs), Recurrent Neural Nets (RNNs), Transformers, Vision Transformers and their application to computer vision. We will also look at the datasets needed to feed these data hungry approaches--both how to create them and how to leverage them to address a wider range of applications. The course will have homework assignments and a final project; there will be no exams. Total enrollment capped at 60.