Video-to-Video Synthesis | NIPS | code | 4749 |
Deep Image Prior | CVPR | code | 3451 |
StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation | CVPR | code | 3104 |
Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network | ECCV | code | 2109 |
Learning to See in the Dark | CVPR | code | 2033 |
Glow: Generative Flow with Invertible 1x1 Convolutions | NIPS | code | 1862 |
Squeeze-and-Excitation Networks | CVPR | code | 1263 |
Efficient Neural Architecture Search via Parameters Sharing | ICML | code | 1189 |
Multimodal Unsupervised Image-to-image Translation | ECCV | code | 1183 |
Non-Local Neural Networks | CVPR | code | 859 |
Image Generation From Scene Graphs | CVPR | code | 772 |
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? | CVPR | code | 690 |
Single-Shot Refinement Neural Network for Object Detection | CVPR | code | 668 |
GANimation: Anatomically-aware Facial Animation from a Single Image | ECCV | code | 628 |
Detect-and-Track: Efficient Pose Estimation in Videos | CVPR | code | 549 |
Relation Networks for Object Detection | CVPR | code | 532 |
PointCNN | NIPS | code | 506 |
Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples | ICML | code | 491 |
Simple Baselines for Human Pose Estimation and Tracking | ECCV | code | 488 |
Taskonomy: Disentangling Task Transfer Learning | CVPR | code | 453 |
Which Training Methods for GANs do actually Converge? | ICML | code | 453 |
Cascaded Pyramid Network for Multi-Person Pose Estimation | CVPR | code | 447 |
Pelee: A Real-Time Object Detection System on Mobile Devices | NIPS | code | 441 |
Generative Image Inpainting With Contextual Attention | CVPR | code | 441 |
Neural 3D Mesh Renderer | CVPR | code | 436 |
Look at Boundary: A Boundary-Aware Face Alignment Algorithm | CVPR | code | 416 |
Zero-Shot Recognition via Semantic Embeddings and Knowledge Graphs | CVPR | code | 412 |
End-to-End Recovery of Human Shape and Pose | CVPR | code | 388 |
In-Place Activated BatchNorm for Memory-Optimized Training of DNNs | CVPR | code | 388 |
ICNet for Real-Time Semantic Segmentation on High-Resolution Images | ECCV | code | 372 |
The Unreasonable Effectiveness of Deep Features as a Perceptual Metric | CVPR | code | 360 |
Distractor-aware Siamese Networks for Visual Object Tracking | ECCV | code | 350 |
Frustum PointNets for 3D Object Detection From RGB-D Data | CVPR | code | 346 |
Efficient Interactive Annotation of Segmentation Datasets With Polygon-RNN++ | CVPR | code | 339 |
Gibson Env: Real-World Perception for Embodied Agents | CVPR | code | 332 |
Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning | CVPR | code | 309 |
Soccer on Your Tabletop | CVPR | code | 308 |
Noise2Noise: Learning Image Restoration without Clean Data | ICML | code | 304 |
GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose | CVPR | code | 301 |
GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation | CVPR | code | 301 |
Neural Baby Talk | CVPR | code | 292 |
Acquisition of Localization Confidence for Accurate Object Detection | ECCV | code | 285 |
The Lovász-Softmax Loss: A Tractable Surrogate for the Optimization of the Intersection-Over-Union Measure in Neural Networks | CVPR | code | 283 |
PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume | CVPR | code | 283 |
Fast End-to-End Trainable Guided Filter | CVPR | code | 274 |
Adversarially Regularized Autoencoders | ICML | code | 261 |
License Plate Detection and Recognition in Unconstrained Scenarios | ECCV | code | 258 |
Supervision-by-Registration: An Unsupervised Approach to Improve the Precision of Facial Landmark Detectors | CVPR | code | 257 |
Supervising Unsupervised Learning | NIPS | code | 255 |
Pyramid Stereo Matching Network | CVPR | code | 250 |
Convolutional Neural Networks With Alternately Updated Clique | CVPR | code | 250 |
Deep Photo Enhancer: Unpaired Learning for Image Enhancement From Photographs With GANs | CVPR | code | 241 |
Neural Relational Inference for Interacting Systems | ICML | code | 240 |
Learning to Adapt Structured Output Space for Semantic Segmentation | CVPR | code | 239 |
An intriguing failing of convolutional neural networks and the CoordConv solution | NIPS | code | 230 |
Learning to Segment Every Thing | CVPR | code | 227 |
LiteFlowNet: A Lightweight Convolutional Neural Network for Optical Flow Estimation | CVPR | code | 223 |
End-to-End Learning of Motion Representation for Video Understanding | CVPR | code | 222 |
Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images | ECCV | code | 219 |
Bilinear Attention Networks | NIPS | code | 216 |
Iterative Visual Reasoning Beyond Convolutions | CVPR | code | 213 |
Semi-Parametric Image Synthesis | CVPR | code | 213 |
A Style-Aware Content Loss for Real-time HD Style Transfer | ECCV | code | 201 |
Style Aggregated Network for Facial Landmark Detection | CVPR | code | 192 |
Pose-Robust Face Recognition via Deep Residual Equivariant Mapping | CVPR | code | 189 |
GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models | ICML | code | 186 |
Referring Relationships | CVPR | code | 185 |
MoCoGAN: Decomposing Motion and Content for Video Generation | CVPR | code | 184 |
Compressed Video Action Recognition | CVPR | code | 180 |
LayoutNet: Reconstructing the 3D Room Layout From a Single RGB Image | CVPR | code | 178 |
ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation | ECCV | code | 176 |
Latent Alignment and Variational Attention | NIPS | code | 172 |
Multi-Content GAN for Few-Shot Font Style Transfer | CVPR | code | 170 |
SPLATNet: Sparse Lattice Networks for Point Cloud Processing | CVPR | code | 166 |
Attentive Generative Adversarial Network for Raindrop Removal From a Single Image | CVPR | code | 158 |
Single View Stereo Matching | CVPR | code | 158 |
Unsupervised Feature Learning via Non-Parametric Instance Discrimination | CVPR | code | 156 |
An End-to-End TextSpotter With Explicit Alignment and Attention | CVPR | code | 156 |
Social GAN: Socially Acceptable Trajectories With Generative Adversarial Networks | CVPR | code | 154 |
ST-GAN: Spatial Transformer Generative Adversarial Networks for Image Compositing | CVPR | code | 153 |
Evolved Policy Gradients | NIPS | code | 151 |
Optimizing Video Object Detection via a Scale-Time Lattice | CVPR | code | 150 |
Large-Scale Point Cloud Semantic Segmentation With Superpoint Graphs | CVPR | code | 150 |
Learning Category-Specific Mesh Reconstruction from Image Collections | ECCV | code | 146 |
Group Normalization | ECCV | code | 145 |
DeblurGAN: Blind Motion Deblurring Using Conditional Adversarial Networks | CVPR | code | 142 |
MegaDepth: Learning Single-View Depth Prediction From Internet Photos | CVPR | code | 142 |
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices | CVPR | code | 142 |
Deep Clustering for Unsupervised Learning of Visual Features | ECCV | code | 139 |
BSN: Boundary Sensitive Network for Temporal Action Proposal Generation | ECCV | code | 139 |
Learning a Single Convolutional Super-Resolution Network for Multiple Degradations | CVPR | code | 139 |
Facelet-Bank for Fast Portrait Manipulation | CVPR | code | 138 |
Image Super-Resolution Using Very Deep Residual Channel Attention Networks | ECCV | code | 137 |
ECO: Efficient Convolutional Network for Online Video Understanding | ECCV | code | 137 |
PlaneNet: Piece-Wise Planar Reconstruction From a Single RGB Image | CVPR | code | 137 |
Self-Imitation Learning | ICML | code | 136 |
Residual Dense Network for Image Super-Resolution | CVPR | code | 134 |
Embodied Question Answering | CVPR | code | 132 |
Unsupervised Cross-Dataset Person Re-Identification by Transfer Learning of Spatial-Temporal Patterns | CVPR | code | 131 |
Two-Stream Convolutional Networks for Dynamic Texture Synthesis | CVPR | code | 131 |
Densely Connected Pyramid Dehazing Network | CVPR | code | 130 |
Camera Style Adaptation for Person Re-Identification | CVPR | code | 128 |
Neural Motifs: Scene Graph Parsing With Global Context | CVPR | code | 127 |
Weakly and Semi Supervised Human Body Part Parsing via Pose-Guided Knowledge Transfer | CVPR | code | 125 |
Relational recurrent neural networks | NIPS | code | 124 |
LSTM Pose Machines | CVPR | code | 124 |
SO-Net: Self-Organizing Network for Point Cloud Analysis | CVPR | code | 123 |
Image-Image Domain Adaptation With Preserved Self-Similarity and Domain-Dissimilarity for Person Re-Identification | CVPR | code | 121 |
Context Embedding Networks | CVPR | code | 120 |
Fast and Accurate Online Video Object Segmentation via Tracking Parts | CVPR | code | 119 |
Cross-Domain Weakly-Supervised Object Detection Through Progressive Domain Adaptation | CVPR | code | 119 |
Learning to Compare: Relation Network for Few-Shot Learning | CVPR | code | 118 |
Recurrent Squeeze-and-Excitation Context Aggregation Net for Single Image Deraining | ECCV | code | 116 |
Structure Inference Net: Object Detection Using Scene-Level Context and Instance-Level Relationships | CVPR | code | 116 |
MVSNet: Depth Inference for Unstructured Multi-view Stereo | ECCV | code | 116 |
Weakly Supervised Instance Segmentation Using Class Peak Response | CVPR | code | 116 |
L4: Practical loss-based stepsize adaptation for deep learning | NIPS | code | 116 |
A Closer Look at Spatiotemporal Convolutions for Action Recognition | CVPR | code | 115 |
Unsupervised Learning of Monocular Depth Estimation and Visual Odometry With Deep Feature Reconstruction | CVPR | code | 114 |
Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling | CVPR | code | 114 |
MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual Network | ECCV | code | 113 |
Gated Path Planning Networks | ICML | code | 113 |
PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning | CVPR | code | 110 |
Decoupled Networks | CVPR | code | 109 |
Video Based Reconstruction of 3D People Models | CVPR | code | 109 |
CosFace: Large Margin Cosine Loss for Deep Face Recognition | CVPR | code | 109 |
DeepMVS: Learning Multi-View Stereopsis | CVPR | code | 108 |
Hierarchical Imitation and Reinforcement Learning | ICML | code | 107 |
Real-Time Seamless Single Shot 6D Object Pose Prediction | CVPR | code | 107 |
Adaptive Affinity Fields for Semantic Segmentation | ECCV | code | 107 |
Long-term Tracking in the Wild: a Benchmark | ECCV | code | 106 |
Realistic Evaluation of Deep Semi-Supervised Learning Algorithms | NIPS | code | 106 |
Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics | CVPR | code | 104 |
Deep Back-Projection Networks for Super-Resolution | CVPR | code | 104 |
3D-CODED: 3D Correspondences by Deep Deformation | ECCV | code | 102 |
Recovering Realistic Texture in Image Super-Resolution by Deep Spatial Feature Transform | CVPR | code | 102 |
Scale-Recurrent Network for Deep Image Deblurring | CVPR | code | 101 |
PU-Net: Point Cloud Upsampling Network | CVPR | code | 101 |
Noisy Natural Gradient as Variational Inference | ICML | code | 100 |
Domain Adaptive Faster R-CNN for Object Detection in the Wild | CVPR | code | 99 |
Rethinking Feature Distribution for Loss Functions in Image Classification | CVPR | code | 97 |
DenseASPP for Semantic Segmentation in Street Scenes | CVPR | code | 97 |
Quantized Densely Connected U-Nets for Efficient Landmark Localization | ECCV | code | 97 |
Graph R-CNN for Scene Graph Generation | ECCV | code | 96 |
Factoring Shape, Pose, and Layout From the 2D Image of a 3D Scene | CVPR | code | 94 |
Density-Aware Single Image De-Raining Using a Multi-Stream Dense Network | CVPR | code | 93 |
Deep Depth Completion of a Single RGB-D Image | CVPR | code | 93 |
MAttNet: Modular Attention Network for Referring Expression Comprehension | CVPR | code | 92 |
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis | ICML | code | 91 |
ELEGANT: Exchanging Latent Encodings with GAN for Transferring Multiple Face Attributes | ECCV | code | 89 |
Neural Arithmetic Logic Units | NIPS | code | 87 |
Perturbative Neural Networks | CVPR | code | 86 |
Knowledge Aided Consistency for Weakly Supervised Phrase Grounding | CVPR | code | 86 |
Repulsion Loss: Detecting Pedestrians in a Crowd | CVPR | code | 86 |
End-to-End Weakly-Supervised Semantic Alignment | CVPR | code | 86 |
Learning Blind Video Temporal Consistency | ECCV | code | 84 |
PSANet: Point-wise Spatial Attention Network for Scene Parsing | ECCV | code | 84 |
Piggyback: Adapting a Single Network to Multiple Tasks by Learning to Mask Weights | ECCV | code | 83 |
Nonlinear 3D Face Morphable Model | CVPR | code | 81 |
Deep Mutual Learning | CVPR | code | 80 |
Image Inpainting for Irregular Holes Using Partial Convolutions | ECCV | code | 79 |
BodyNet: Volumetric Inference of 3D Human Body Shapes | ECCV | code | 78 |
Integral Human Pose Regression | ECCV | code | 77 |
FSRNet: End-to-End Learning Face Super-Resolution With Facial Priors | CVPR | code | 77 |
Attention-based Deep Multiple Instance Learning | ICML | code | 77 |
LiDAR-Video Driving Dataset: Learning Driving Policies Effectively | CVPR | code | 77 |
Multi-View Consistency as Supervisory Signal for Learning Shape and Pose Prediction | CVPR | code | 76 |
Macro-Micro Adversarial Network for Human Parsing | ECCV | code | 76 |
Multi-view to Novel view: Synthesizing novel views with Self-Learned Confidence | ECCV | code | 75 |
LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks | ECCV | code | 75 |
Neural Kinematic Networks for Unsupervised Motion Retargetting | CVPR | code | 75 |
Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking | CVPR | code | 75 |
Synthesizing Images of Humans in Unseen Poses | CVPR | code | 74 |
A PID Controller Approach for Stochastic Optimization of Deep Networks | CVPR | code | 74 |
Tell Me Where to Look: Guided Attention Inference Network | CVPR | code | 74 |
Multi-Scale Location-Aware Kernel Representation for Object Detection | CVPR | code | 73 |
Recurrent Relational Networks | NIPS | code | 73 |
VITON: An Image-Based Virtual Try-On Network | CVPR | code | 73 |
VITAL: VIsual Tracking via Adversarial Learning | CVPR | code | 73 |
Future Frame Prediction for Anomaly Detection – A New Baseline | CVPR | code | 72 |
Recurrent Pixel Embedding for Instance Grouping | CVPR | code | 71 |
Learning Human-Object Interactions by Graph Parsing Neural Networks | ECCV | code | 69 |
Repeatability Is Not Enough: Learning Affine Regions via Discriminability | ECCV | code | 67 |
Visual Feature Attribution Using Wasserstein GANs | CVPR | code | 67 |
Avatar-Net: Multi-Scale Zero-Shot Style Transfer by Feature Decoration | CVPR | code | 66 |
Learning SO(3) Equivariant Representations with Spherical CNNs | ECCV | code | 64 |
Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation | ECCV | code | 64 |
SGPN: Similarity Group Proposal Network for 3D Point Cloud Instance Segmentation | CVPR | code | 64 |
ScanComplete: Large-Scale Scene Completion and Semantic Segmentation for 3D Scans | CVPR | code | 64 |
One-Shot Unsupervised Cross Domain Translation | NIPS | code | 62 |
Pairwise Confusion for Fine-Grained Visual Classification | ECCV | code | 62 |
Multi-Shot Pedestrian Re-Identification via Sequential Decision Making | CVPR | code | 62 |
Generalizing A Person Retrieval Model Hetero- and Homogeneously | ECCV | code | 61 |
Learning Depth From Monocular Videos Using Direct Methods | CVPR | code | 61 |
Optimizing the Latent Space of Generative Networks | ICML | code | 60 |
CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes | CVPR | code | 59 |
“Zero-Shot” Super-Resolution Using Deep Internal Learning | CVPR | code | 59 |
Learning Attentions: Residual Attentional Siamese Network for High Performance Online Visual Tracking | CVPR | code | 59 |
PointNetVLAD: Deep Point Cloud Based Retrieval for Large-Scale Place Recognition | CVPR | code | 58 |
Progressive Neural Architecture Search | ECCV | code | 58 |
Generative Neural Machine Translation | NIPS | code | 58 |
Learning to Reweight Examples for Robust Deep Learning | ICML | code | 58 |
Object Level Visual Reasoning in Videos | ECCV | code | 57 |
Generate to Adapt: Aligning Domains Using Generative Adversarial Networks | CVPR | code | 57 |
Improving Generalization via Scalable Neighborhood Component Analysis | ECCV | code | 57 |
Geometry-Aware Learning of Maps for Camera Localization | CVPR | code | 57 |
Path-Level Network Transformation for Efficient Architecture Search | ICML | code | 57 |
Decorrelated Batch Normalization | CVPR | code | 57 |
Ordinal Depth Supervision for 3D Human Pose Estimation | CVPR | code | 57 |
Disentangled Person Image Generation | CVPR | code | 57 |
Regularizing RNNs for Caption Generation by Reconstructing the Past With the Present | CVPR | code | 57 |
Diverse Image-to-Image Translation via Disentangled Representations | ECCV | code | 56 |
Pointwise Convolutional Neural Networks | CVPR | code | 56 |
Neural Program Synthesis from Diverse Demonstration Videos | ICML | code | 56 |
Learning Less Is More - 6D Camera Localization via 3D Surface Regression | CVPR | code | 55 |
Unsupervised Domain Adaptation for 3D Keypoint Estimation via View Consistency | ECCV | code | 55 |
Learning Latent Super-Events to Detect Multiple Activities in Videos | CVPR | code | 55 |
Depth-aware CNN for RGB-D Segmentation | ECCV | code | 55 |
Crafting a Toolchain for Image Restoration by Deep Reinforcement Learning | CVPR | code | 54 |
Unsupervised Discovery of Object Landmarks as Structural Representations | CVPR | code | 54 |