Bookmark and Share

Deep Learning for Vision Using CNNs and Caffe: A Hands-on Tutorial

A full-day tutorial focused on convolutional neural networks for vision and the Caffe framework for deep learning, presented by the primary Caffe developers from the Berkeley Vision and Learning Center, and organized by the Embedded Vision Alliance and Berkeley Design Technology, Inc. (BDTI). This tutorial takes participants from an introduction to the theory behind convolutional neural networks to their actual implementation. It includes hands-on labs using Caffe.

Convolutional neural networks (CNNs) have emerged as a very powerful technique for a range of computer vision tasks. This live tutorial begins with an introduction to the structure, operation, and training of CNNs and how they are used for computer vision. It explores the strengths and weaknesses of CNNs, and how to design and train them. The tutorial then introduces the popular Caffe open source framework for CNNs, and provides hands-on labs in creating, training, and deploying CNNs using Caffe.

  • Date/Time:  September 22, 2016 — 9:00 am to 5:00 pm (check-in begins at 8:00 am)
  • Location:  Hyatt Regency Cambridge, Cambridge, Massachusetts U.S.A.
  • Cost:  $795 ($720 Early Bird registration available until August 5)
  • Read details of the tutorial below.

 

Tutorial Details
This full-day tutorial is designed to equip product creators, application developers, and engineering managers with the tools and practical know-how needed to incorporate deep-learning-based computer vision capabilities into products. The morning focuses on an introduction to convolutional neural networks (CNNs)—how they work, what they can be used for, how they're trained, and their strengths and weaknesses. The afternoon provides a hands-on introduction to the popular open source Caffe framework. Attendees will learn how to use Caffe to create, train, evaluate, and deploy CNNs.

Who should attend: Application and product developers and managers interested in using deep learning for computer vision.

Attendees should bring a laptop with WiFi and a web browser. Lunch will be provided, along with morning and afternoon snacks.

What You Will Learn

  • Structure, operation, and training of convolutional neural networks (CNNs)
  • How CNNs are used for computer vision
  • Strengths and weaknesses of CNNs for computer vision
  • How to design and train CNNs
  • How to use Caffe to create, train, and deploy CNNs

Got a question? Contact the organizers.

Presenters and Organizers
This tutorial will be presented by Evan Shelhamer, Jeff Donahue, and Jonathan Long—the primary Caffe developers from the Berkeley Vision and Learning Center at U.C. Berkeley. It is organized by the Embedded Vision Alliance and BDTI.

About the presenters:

Evan Shelhamer is a Ph.D. student at U.C. Berkeley who is advised by Trevor Darrell as a member of the Berkeley Vision and Learning Center. Evan's research focuses on deep learning and end-to-end optimization for vision. He is the lead developer of the Caffe deep learning framework and takes his coffee black.

Jeff Donahue is a Ph.D. student at U.C. Berkeley who is advised by Trevor Darrell as a member of the Berkeley Vision and Learning Center. Jeff's research focuses on deep learning for visual localization and sequence learning tasks. He is an active developer and maintainer of Caffe.

Jonathan Long is a Ph.D. student at U.C. Berkeley who is advised by Trevor Darrell as a member of the Berkeley Vision and Learning Center. Jon delights in building beautiful recognition systems. His research focuses on segmentation and detection with deep learning.

Detailed Tutorial Outline

  1. Designing, Implementing, and Training CNNs: A Theoretical and Practical Discussion
    1. Introduction to deep learning and neural networks
    2. Introduction to and theory of CNNs with a focus on:
      1. How and where can CNNs be used for computer vision?
      2. How do CNNs work?
      3. CNN layers and components and their operation
      4. What are the advantages and disadvantages of CNNs?
      5. What computer vision tasks do CNNs do better than other methods?
        1. Example applications where CNNs improve on previous solutions
    3. The process of designing and training a CNN
    4. CNN computation and data bandwidth requirements
    5. Advanced topics:
      1. Training tricks to improve accuracy and speed
      2. Current research directions in CNNs
      3. Extensions to CNNs:
        1. FCN: pixel-wise prediction
        2. Sequence learning
  2. Introduction to Caffe for Designing and Training CNNs: A Hands-on Tutorial
    1. Introduction to Caffe
    2. Hands-on Caffe tutorial:
      1. How does Caffe work?
      2. Describing a CNN in Caffe
      3. Training a CNN in Caffe
      4. Fine-tuning a CNN in Caffe
      5. Requirements for installing and running Caffe
      6. Deploying a CNN with Caffe

Contact the organizers.