CVPR 2016 Summary Reading

Last Update:2018-07-26 Source: Internet

Author: User

Tags compact

Developer on Alibaba Coud: Build your first app with APIs, SDKs, and tutorials on the Alibaba Cloud. Read more ＞

to illustrate the summary of all the articles in CVPR2016, summarize, the summary only retains the innovation point part. ORAL SESSION Image captioning and Question answering Monday, June 27th, 9:00am-10:05am.

These papers'll also be presented at the following poster session

1 Deep compositional captioning:describing novel Object Categories without paired Training Data.

Lisa Anne Hendricks, Subhashini Venugopalan, Marcus Rohrbach, Raymond Mooney, Kate Saenko, Trevor Darrell

The general job is that they can also describe objects that are not present in the picture-statement library. In this work, we propose the deep compositional captioner (DCC) to address the task of generating descriptions of novel OB Jects which is not present in paired image-sentence datasets.

2 Generation and comprehension of unambiguous Object descriptions.

Junhua Mao, Jonathan Huang, Alexander Toshev, Oana Camburu, Alan L. Yuille, Kevin Murphy

is also a form of image description, the author points out that the image description can be an objective evaluation index. We propose a method that can generate an unambiguous description (known as a referring expression) of a specific object or Region in an image, and which can also comprehend or interpret such a expression to infer which object is being describe D.

3 Stacked Attention Networks for Image Question answering.

Zichao Yang, Xiaodong He, Jianfeng Gao, Li Deng, Alex Smola

The article is used for the image question and answer, for example the question picture has several people, makes the corresponding reply, feels more difficult. The innovation here is a stack-based network. This paper presents stacked attention networks (SANs) is learn to answer natural language questions from images.

4 Image Question answering Using convolutional neural Network with Dynamic Parameter prediction.

Hyeonwoo Noh, Paul hongsuck Seo, Bohyung Han

Image question and answer, the innovation here is to add an adaptive parameter layer, which uses GRU learning from the applicable parameters. We tackle Image question answering (IMAGEQA) problem by learning a convolutional neural network (CNN) with a dynamic param Eter layer whose weights is determined adaptively based on questions.

5 Neural Module Networks.

Jacob Andreas, Marcus Rohrbach, Trevor Darrell, Dan Klein

The goal is also the image question and answer, the innovation is simultaneously considers two aspects content: expresses the question and the language model (embarrassed, not all simultaneously considers these two aspects). Not carefully. Our approach decomposes questions into their linguistic substructures, and uses these structures to dynamically instantiat E Modular networks (with reusable components for recognizing dogs, classifying colors, etc.)

Go back to Calendar >

SPOTLIGHT SESSION Language and Vision Monday, June 27th, 10:05am-10:30am.

These papers'll also be presented at the following poster session

6 learning deep representations of fine-grained Visual descriptions.

Scott Reed, Zeynep Akata, Honglak Lee, Bernt Schiele

Deal with zero-shot problem, not carefully understand the innovation point. is basically divided into two parts, from the topic, in fact, the use of deep learning to obtain fine-grained characteristics of the expression. Our proposed models train end-to-end to align with the fine-grained and category-specific content of images. Natural language provides a flexible and compact by the encoding only the salient visual aspects for distinguishing Catego Ries.

7 multi-cue Zero-shot Learning with strong supervision.

Zeynep Akata, Mateusz Malinowski, Mario Fritz, Bernt Schiele

Although it is zero-shot, it is irrelevant. It is not a way to deal with deep learning.

8 latent embeddings for zero-shot classification.

Yongqin Xian, Zeynep Akata, Gaurav Sharma, Quynh Nguyen, Matthias Hein, Bernt Schiele

Although it is zero-shot, it is irrelevant. It is not a way to deal with deep learning.

9 One-Shot learning of Scene Locations via Feature trajectory Transfer.

Roland Kwitt, Sebastian hegenbart, Marc Niethammer

Although it is one-shot, it is irrelevant. It is not a way to deal with deep learning.

Ten learning Attributes Equals multi-source Domain generalization.

Chuang Gan, Tianbao Yang, boqing Gong

It is not a way to deal with deep learning.

anticipating Visual representations from unlabeled Video.

Carl Vondrick, Hamed pirsiavash, Antonio Torralba

The core content is the use of deep learning to predict the next time or time period of behavioral activities. We present a framework that capitalizes on temporal structure in unlabeled video to learn to anticipate human actions and Objects. The key idea behind our approach is so we can train deep networks to predict the visual representation of images in the Future.

Go back to Calendar >

ORAL SESSION Matching and Alignment Monday, June 27th, 9:00am-10:05am.

These papers'll also be presented at the following poster session

Learning to Assign orientations to Feature Points.

Kwang Moo Yi, Yannick Verdie, Pascal Fua, Vincent Lepetit

The content of this paper is to use deep learning to calibrate the feature points, to match the algorithm, and to propose a new activation function. We show how to train a convolutional neural Network to assign a canonical orientation to feature points given an image pat Ch centered on the feature point.

Learning dense correspondence via 3d-guided Cycle consistency.

Tinghui Zhou, Philipp Krähenbuhl, Mathieu Aubry, qixing Huang, Alexei A. Efros

It is also the use of deep learning to explore cross-instance similarity. We exploit this consistency as a supervisory signal to train a convolutional neural network to predict Cross-instance Corr Espondences between pairs of images depicting objects of the same category.

The Global Patch Collider.

Shenlong Wang, Sean Ryan Fanello, Christoph Rhemann, Shahram Izadi, Pushmeet Kohli

Not deep learning.

Joint Probabilistic Matching Using m-best Solutions.

Seyed Hamid Rezatofighi, Anton Milan, Zhen Zhang, Qinfeng Shi, Anthony Dick, Ian Reid

Not deep learning.

Alignment Across Large poses:a 3D solution.

Xiangyu Zhu, Zhen Lei, xiaoming Liu, Hailin Shi, Stan Z. Li

This paper presents a three-dimensional face correction technique using deep learning. We propose a solution to the three problems on an new alignment framework, called 3D dense face Alignment (3DDFA), in whic H a dense 3D face model was fitted to the image via convolutional neutral network (CNN)

Go back to Calendar >

SPOTLIGHT SESSION Segmentation and Contour Detection Monday, June 27th, 10:05am-10:30am.

These papers'll also be presented at the following poster session

Interactive segmentation on RGBD Images via Cue Selection.

Jie Feng, Brian Price, Scott Cohen, Shih-fu Chang

Not deep learning.

layered Scene decomposition via the OCCLUSION-CRF.

Chen Liu, Pushmeet Kohli, Yasutaka Furukawa

Not deep learning.

Affinity cnn:learning pixel-centric pairwise relations for Figure/ground embedding.

Michael Maire, Takuya Narihira, Stella x. Yu

Get a affinity matrix through deep learning. Interest is not enough. We train a convolutional neural network (CNN) to directly predict the pairwise relationships it define this affinity mat Rix

weakly supervised Object boundaries.

Anna Khoreva, Rodrigo Benenson, Mohamed Omran, Matthias Hein, Bernt Schiele

Not deep learning.

Object Contour Detection with a Fully convolutional encoder-decoder Network.

Jimei Yang, Brian price, Scott Cohen, Honglak Lee, Ming-hsuan Yang

A contour recognition algorithm based on full convolutional coding and decoding network is proposed. We develop a deep learning algorithm for contour detection with a fully convolutional encoder-decoder network.

Go back to Calendar >

POSTER SESSION Poster Session 1-1. Monday, June 27th, 10:30am-12:30pm. Images and Language

What Value does Explicit high level concepts has in the Vision to Language problems?.

Qi Wu, Chunhua Shen, Lingqiao Liu, Anthony Dick, Anton van den Hengel

The article considers the image description, but because the existing methods are directly mapping objects in the image into textual information, and do not consider the high-level semantic information, the article puts forward the innovation point of considering the high-level information. We propose a method of incorporating high-level concepts into the successful CNN-RNN approach, and show that it achieves a Significant improvement on the State-of-the-art in both image captioning and visual question answering. Edge Contour Detection

Detection of curved Edges at low SNR.

Nati Ofir, Meirav Galun, Boaz Nadler, Ronen Basri

Not deep learning. Solving the problem is equivalent to the edge detection while eliminating noise.

The Object Skeleton Extraction in Natural Images is fusing scale-associated deep Side Outputs.

Wei Shen, Kai Zhao, Yuan Jiang, Yan Wang, Zhijiang Zhang, Xiang Bai

A skeleton extraction algorithm is proposed using the full convolutional network, which seems to be the article in the previous blog. In this paper, we present a fully convolutional network with multiple scale-associated side outputs to address this proble M. by observing the relationship between the receptive field sizes of the sequential stages in the network and the Skeleto N Scales They can capture, we introduce a scale-associated side output to each stage

Learning Relaxed Deep supervision for Better Edge Detection.

Yu Liu, Michael S. Lew

An edge detection algorithm based on deep learning. We propose using relaxed deep supervision (RDS) within convolutional neural networks for edge detection.

occlusion boundary Detection via Deep Exploration of the Context.

Huan Fu, Chaohui Wang, Dacheng Tao, Michael J. Black

Occlusion Edge detection. Based on deep learning in this paper, we improve occlusion boundary detection via enhanced exploration of contextual information (e.g., lo Cal Structural boundary patterns, observations from surrounding regions, and temporal context), and in doing so develop a Novel approach based on convolutional neural Networks (CNNs) and conditional random fields (CRFs)

semicontour:a semi-supervised Learning approach for Contour Detection.

Zizhao Zhang, Fuyong Xing, Xiaoshuang Shi, Lin Yang

Not deep learning. Contour detection. Feature Extraction and Description

Learning to Localize Little landmarks.

Saurabh Singh, Derek hoiem, David Forsyth

Not deep learning, feature extraction.

interactive:inter-layer activeness propagation.

Lingxi Xie, Liang Zheng, Jingdong Wang, Alan L. Yuille, Qi Tian

Equivalent to an improvement of the activation function. In this paper, we present InterActive, a novel algorithm which computes the activeness of neurons and network connections.

Exploit bounding Box Annotations for multi-label Object recognition.

Hao Yang, Joey Tianyi Zhou, Yu Zhang, Bin-bin Gao, Jianxin Wu, Jianfei Cai

A multi-target recognition algorithm is used to identify deep learning in multi-label target recognition. In this paper, we incorporate local information to enhance the feature discriminative power.

ti-pooling:transformation-invariant POOLING for Feature learning in convolutional neural Networks.

Dmitry Laptev, Nikolay Savinov, Joachim M. Buhmann, Marc Pollefeys

A new pooling operator. In this paper we present a deep neural network topology, incorporates a simple to implement Transformation-invariant p Ooling operator (ti-pooling)

Fashion Style in Floats:joint Ranking and classification Using Weak Data for Feature Extraction.

Edgar Simo-serra, Hiroshi Ishikawa

Not deep learning.

equiangular Kernel Dictionary Learning with applications to Dynamic Texture analysis.

Yuhui Quan, Chenglong Bao, Hui Ji

Not deep learning.

The Compact Bilinear Pooling.

Yang Gao, Oscar Beijbom, Ning Zhang, Trevor Darrell

Not deep learning. Feature Extraction and Matching

accumulated stability voting:a Robust descriptor from descriptors of multiple Scales.

Tsun-yi Yang, Yen-yu Lin, Yung-yu Chuang

Not deep learning.

Comal:good Features to Match on Object boundaries.

Swarna K. Ravindran, Anurag Mittal

Not deep learning.

PNS Progressive Feature Matching with Alternate descriptor Selection and correspondence enrichment.

Yuan-ting Hu, Yen-yu Lin

Not deep learning. Image Segmentation

A New Finsler Minimal Path Model with curvature penalization for Image segmentation and Closed Contour Detectio N.

Da Chen, Jean-marie Mirebeau, Laurent d. Cohen

Not deep learning.

scale-aware Alignment of hierarchical Image segmentation.

Yuhua Chen, Dengxin Dai, Jordi pont-tuset, Luc Van Gool

Not deep learning.

The deep Interactive Object Selection.

Ning Xu, Brian Price, Scott Cohen, Jimei Yang, Thomas S. Huang

Target recognition for deep learning interactions. Choose. In this paper, we present a novel deep-learning-based algorithm which have much better understanding of objectness and can Reduce user interactions to just a few clicks.

The Plug? Predicting If Computers or humans should Segment Images.

Danna Gurari, Suyog Jain, Margrit Betke, Kristen Grauman

Not deep learning.

the Shadows, Shape Priors shine:using occlusion to Improve multi-region segmentation.

Yuka Kihara, Matvey Soloviev, Tsuhan Chen

Deep learning, but not much interest.

convexity Shape Constraints for Image segmentation.

Loic A. Royer, David L. Richmond, Carsten Rother, Bjoern Andres, Dagmar Kainmueller

Not deep learning.

MCMC Shape sampling for Image segmentation with nonparametric shape Priors.

Ertunc Erdil, Sinan Yildirim, Müjdat Cetin, Tolga Tasdizen

Not deep learning. Low-level Vision

Noise Modeling to Blind Image denoising.

Fengyuan Zhu, Guangyong Chen, Pheng-ann Heng

Not deep learning.

efficient and robust Color consistency for Community Photo collections.

Jaesik Park, yu-wing Tai, Sudipta N. Sinha, in so Kweon

Not deep learning.

needle-match:reliable Patch Matching under high uncertainty.

Or Lotan, Michal Irani

Not deep learning.

reconnet:non-iterative reconstruction of Images from compressively sensed measurements.

Kuldeep Kulkarni, Suhas lohit, Pavan Turaga, Ronan Kerviche, Amit Ashok

The recovery and reconstruction of compressed-sensing image is made by convolution neuron network. We propose a novel convolutional neural Network (CNN) architecture which takes in CS measurements of an image as input and Outputs an intermediate reconstruction

soft-segmentation Guided Object Motion Deblurri

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More