Название: A Guide to Convolutional Neural Networks for Computer Vision
Автор: Salman Khan
Издательство: Ingram
Жанр: Программы
Серия: Synthesis Lectures on Computer Vision
isbn: 9781681732824
isbn:
The University of Western Australia, Crawley, WA
Syed Afaq Ali Shah
The University of Western Australia, Crawley, WA
Mohammed Bennamoun
The University of Western Australia, Crawley, WA
SYNTHESIS LECTURES ON COMPUTER VISION #15
ABSTRACT
Computer vision has become increasingly important and effective in recent years due to its wide-ranging applications in areas as diverse as smart surveillance and monitoring, health and medicine, sports and recreation, robotics, drones, and self-driving cars. Visual recognition tasks, such as image classification, localization, and detection, are the core building blocks of many of these applications, and recent developments in Convolutional Neural Networks (CNNs) have led to outstanding performance in these state-of-the-art visual recognition tasks and systems. As a result, CNNs now form the crux of deep learning algorithms in computer vision.
This self-contained guide will benefit those who seek to both understand the theory behind CNNs and to gain hands-on experience on the application of CNNs in computer vision. It provides a comprehensive introduction to CNNs starting with the essential concepts behind neural networks: training, regularization, and optimization of CNNs. The book also discusses a wide range of loss functions, network layers, and popular CNN architectures, reviews the different techniques for the evaluation of CNNs, and presents some popular CNN tools and libraries that are commonly used in computer vision. Further, this text describes and discusses case studies that are related to the application of CNN in computer vision, including image classification, object detection, semantic segmentation, scene understanding, and image generation.
This book is ideal for undergraduate and graduate students, as no prior background knowledge in the field is required to follow the material, as well as new researchers, developers, engineers, and practitioners who are interested in gaining a quick understanding of CNN models.
KEYWORDS
deep learning, computer vision, convolution neural networks, perception, back-propagation, feed-forward networks, image classification, action recognition, object detection, object tracking, video processing, semantic segmentation, scene understanding, 3D processing
SK: | To my parents and my wife Nusrat |
HR: | To my father Shirzad, my mother Rahimeh, and my wife Shahla |
AS: | To my parents, my wife Maleeha, and our children Abiya, Maryam, and Muhammad. Thanks for always being there for me. |
MB: | To my parents: Mostefa and Rabia Bennamoun and to my nuclear family: Leila, Miriam, Basheer, and Rayaane Bennamoun |
Contents
1.1.2 Image Processing vs. Computer Vision
2.1 Importance of Features and Classifiers
2.2 Traditional Feature Descriptors
2.2.1 Histogram of Oriented Gradients (HOG)
2.2.2 Scale-invariant Feature Transform (SIFT)
2.2.3 Speeded-up Robust Features (SURF)
2.2.4 Limitations of Traditional Hand-engineered Features
2.3 Machine Learning Classifiers
2.3.1 Support Vector Machine (SVM)
СКАЧАТЬ