Lin's Notes Garden
Search
Search
Dark mode
Light mode
Explorer
Home
❯
Researches
❯
Papers
Folder: Researches/Papers
61 items under this folder.
Dec 05, 2024
End-to-End Object Detection with Transformers
Nov 13, 2024
Asymmetric Non-local Neural Networks for Semantic Segmentation
Nov 13, 2024
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Nov 13, 2024
BEiT: BERT Pre-Training of Image Transformers
Nov 13, 2024
BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation
Nov 13, 2024
BiSeNet V2: Bilateral Network with Guided Aggregation for Real-time Semantic Segmentation
Nov 13, 2024
CCNet: Criss-Cross Attention for Semantic Segmentation
Nov 13, 2024
Learning Transferable Visual Models From Natural Language Supervision
Nov 13, 2024
Diffusion Models Beat GANs on Image Synthesis
Nov 13, 2024
Classifier-Free Diffusion Guidance
Nov 13, 2024
A ConvNet for the 2020s
Nov 13, 2024
Hierarchical Text-Conditional Image Generation with CLIP Latents
Nov 13, 2024
Dual Attention Network for Scene Segmentation
Nov 13, 2024
Vision Transformers for Dense Prediction
Nov 13, 2024
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
Nov 13, 2024
Rethinking Atrous Convolution for Semantic Image Segmentation
Nov 13, 2024
Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
Nov 13, 2024
Denoising Diffusion Implicit Models
Nov 13, 2024
Scalable Diffusion Models with Transformers
Nov 13, 2024
Context Encoding for Semantic Segmentation
Nov 13, 2024
Fully Convolutional Networks for Semantic Segmentation
Nov 13, 2024
Feature Pyramid Networks for Object Detection
Nov 13, 2024
Fast R-CNN
Nov 13, 2024
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Nov 13, 2024
Generative Adversarial Networks
Nov 13, 2024
GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond
Nov 13, 2024
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Nov 13, 2024
Deep High-Resolution Representation Learning for Visual Recognition
Nov 13, 2024
ICNet for Real-Time Semantic Segmentation on High-Resolution Images
Nov 13, 2024
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Nov 13, 2024
High-Resolution Image Synthesis with Latent Diffusion Models
Nov 13, 2024
Masked Autoencoders Are Scalable Vision Learners
Nov 13, 2024
Mask R-CNN
Nov 13, 2024
Masked-attention Mask Transformer for Universal Image Segmentation
Nov 13, 2024
Per-Pixel Classification is Not All You Need for Semantic Segmentation
Nov 13, 2024
Non-local Neural Networks
Nov 13, 2024
Segmentation Transformer: Object-Contextual Representations for Semantic Segmentation
Nov 13, 2024
Training Region-based Object Detectors with Online Hard Example Mining
Nov 13, 2024
OneFormer: One Transformer to Rule Universal Image Segmentation
Nov 13, 2024
PSANet: Point-wise Spatial Attention Network for Scene Parsing
Nov 13, 2024
Pyramid Scene Parsing Network
Nov 13, 2024
PointRend: Image Segmentation as Rendering
Nov 13, 2024
Progressive Distillation for Fast Sampling of Diffusion Models
Nov 13, 2024
Rich feature hierarchies for accurate object detection and semantic segmentation
Nov 13, 2024
R-FCN: Object Detection via Region-based Fully Convolutional Networks
Nov 13, 2024
RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation
Nov 13, 2024
Focal Loss for Dense Object Detection
Nov 13, 2024
SAM 2: Segment Anything in Images and Videos
Nov 13, 2024
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
Nov 13, 2024
SSD: Single Shot MultiBox Detector
Nov 13, 2024
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
Nov 13, 2024
Segment Anything
Nov 13, 2024
Segmenter: Transformer for Semantic Segmentation
Nov 13, 2024
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Nov 13, 2024
Attention Is All You Need
Nov 13, 2024
U-Net: Convolutional Networks for Biomedical Image Segmentation
Nov 13, 2024
Neural Discrete Representation Learning
Nov 13, 2024
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Nov 13, 2024
You Only Look Once: Unified, Real-Time Object Detection
Nov 13, 2024
YOLO9000: Better, Faster, Stronger
Nov 13, 2024
YOLOv3: An Incremental Improvement