Bookmark and Share

"Machine Learning- based Image Compression: Ready for Prime Time?," a Presentation from Clarifai

Register or sign in to access the Embedded Vision Academy's free technical training content.

The training materials provided by the Embedded Vision Academy are offered free of charge to everyone. All we ask in return is that you register, and tell us a little about yourself so that we can understand a bit about our audience. As detailed in our Privacy Policy, we will not share your registration information, nor contact you, except with your consent.

Registration is free and takes less than one minute. Click here to register, and get full access to the Embedded Vision Academy's unique technical training content.

If you've already registered, click here to sign in.

See a sample of this page's content below:


Michael Gormish, Research Manager at Clarifai, presents the "Machine Learning- based Image Compression: Ready for Prime Time?" tutorial at the May 2019 Embedded Vision Summit.

Computer vision is undergoing dramatic changes because deep learning techniques are now able to solve complex non-linear problems. Computer vision pipelines used to consist of hand engineered stages mathematically optimized for some carefully chosen objective function. These pipelines are being replaced with machine- learned stages or end-to-end learning techniques where enough ground truth data is available.

Similarly, for decades image compression has relied on hand crafted algorithm pipelines, but recent efforts using deep learning are reporting higher image quality than that provided by conventional techniques. Is it time to replaced discrete cosine transforms with machine-learned compression techniques?

This talk examines practical aspects of deep learned image compression systems as compared with traditional approaches. Gormish considers memory, computation and other aspects, in addition to rate-distortion, to see when ML-based compression should be considered or avoided. He also discusses approaches using a combination of machine learned and traditional techniques.