Skip to the content.

Contributors Forks Stargazers Issues

GitPages on https://theneao.github.io/CV-SAR-Seg-arxiv-daily

Updated on 2025.07.02

Usage instructions: here

self-supervised

Publish Date Title Authors PDF Code
2021-09-09 Fast, Effective, and Self-Supervised: Transforming Masked Language Models into Universal Lexical and Sentence Encoders Fangyu Liu et.al. 2104.08027 link
2022-07-11 Learned Camera Gain and Exposure Control for Improved Visual Feature Detection and Matching Justin Tomasi et.al. 2102.04341 null

edge detection

Publish Date Title Authors PDF Code
2025-06-25 U-R-VEDA: Integrating UNET, Residual Links, Edge and Dual Attention, and Vision Transformer for Accurate Semantic Segmentation of CMRs Racheal Mukisa et.al. 2506.20689 null
2025-06-23 Programmable electro-optic frequency comb empowers integrated parallel convolution processing Jinze He et.al. 2506.18310 null
2025-06-22 Mobile Image Analysis Application for Mantoux Skin Test Liong Gele et.al. 2506.17954 null
2025-06-04 Mechanistic Interpretability of Diffusion Models: Circuit-Level Analysis and Causal Validation Dip Roy et.al. 2506.17237 null
2025-06-20 Self-supervised Feature Extraction for Enhanced Ball Detection on Soccer Robots Can Lin et.al. 2506.16821 null
2025-06-14 Binarization-Aware Adjuster: Bridging Continuous Optimization and Binary Inference in Edge Detection Hao Shu et.al. 2506.12460 null
2025-06-13 Exploring the Effectiveness of Deep Features from Domain-Specific Foundation Models in Retinal Image Synthesis Zuzanna Skorniewska et.al. 2506.11753 null
2025-06-11 A new approach for image segmentation based on diffeomorphic registration and gradient fields Junchao Zhou et.al. 2506.09357 null
2025-06-10 Machine Learning for the Cluster Reconstruction in the CALIFA Calorimeter at R3B Tobias Jenegger et.al. 2506.09088 null
2025-06-06 Elementary Cellular Automata as Non-Cryptographic Hash Functions Daniel McKinley et.al. 2506.06551 null
2025-06-18 Statistical microlocal analysis in two-dimensional X-ray CT Anuj Abhishek et.al. 2506.05113 null
2025-06-03 Heliostat Optical Error Inspection with Polarimetric Imaging Drone Mo Tian et.al. 2506.02333 null
2025-06-01 Hybridizing Expressive Rendering: Stroke-Based Rendering with Classic and Neural Methods Kapil Dev et.al. 2506.00870 null
2025-05-28 Depth to magnetic source estimation using TDX contour Hammed Oyekan et.al. 2505.22780 null
2025-05-24 Tropical Geometry Based Edge Detection Using Min-Plus and Max-Plus Algebra Shivam Kumar Jha S et.al. 2505.18625 null
2025-05-07 Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer Sainath Dey et.al. 2505.04740 null
2025-05-06 Rethinking Boundary Detection in Deep Learning-Based Medical Image Segmentation Yi Lin et.al. 2505.04652 link
2025-05-03 Seeing Heat with Color – RGB-Only Wildfire Temperature Inference from SAM-Guided Multimodal Distillation using Radiometric Ground Truth Michael Marinaccio et.al. 2505.01638 null
2025-05-02 Edge Detection based on Channel Attention and Inter-region Independence Test Ru-yu Yan et.al. 2505.01040 null
2025-05-02 Edge-preserving Image Denoising via Multi-scale Adaptive Statistical Independence Testing Ruyu Yan et.al. 2505.01032 null
2025-04-22 DeepCS-TRD, a Deep Learning-based Cross-Section Tree Ring Detector Henry Marichal et.al. 2504.16242 null
2025-04-22 Multi-Scale Tensorial Summation and Dimensional Reduction Guided Neural Network for Edge Detection Lei Xu et.al. 2504.15770 null
2025-04-21 Novel Concept-Oriented Synthetic Data approach for Training Generative AI-Driven Crystal Grain Analysis Using Diffusion Model Ahmed Sobhi Saleh et.al. 2504.14782 null
2025-04-18 DADU: Dual Attention-based Deep Supervised UNet for Automated Semantic Segmentation of Cardiac Images Racheal Mukisa et.al. 2504.13415 null
2025-04-07 Advanced Knife-Edge free Self-Aligned Colour Schlieren Imaging with Extended Measuring Range Shubham Saxena et.al. 2504.05433 null
2025-04-06 Evaluation framework for Image Segmentation Algorithms Tatiana Merkulova et.al. 2504.04435 null
2025-03-26 Hybrid Multi-Stage Learning Framework for Edge Detection: A Survey Mark Phil Pacot et.al. 2503.21827 null
2025-03-21 Model reduction of convection-dominated viscous conservation laws using implicit feature tracking and landmark image registration Victor Zucatti et.al. 2503.17463 null
2025-03-21 Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection Gensheng Pei et.al. 2503.17080 null
2025-03-19 Benchmarking Brain Connectivity Graph Inference: A Novel Validation Approach Alice Chevaux et.al. 2503.15012 null
2025-03-04 Robust Detection of Extremely Thin Lines Using 0.2mm Piano Wire Jisoo Hong et.al. 2503.13473 null
2025-03-14 Refining Image Edge Detection via Linear Canonical Riesz Transforms Shuhui Yang et.al. 2503.11148 null
2025-03-12 Polygonizing Roof Segments from High-Resolution Aerial Images Using Yolov8-Based Edge Detection Qipeng Mei et.al. 2503.09187 null
2025-03-02 STAR-Edge: Structure-aware Local Spherical Curve Representation for Thin-walled Edge Extraction from Unstructured Point Clouds Zikuan Li et.al. 2503.00801 link
2025-02-24 Theory-guided Pseudo-spectral Full Waveform Inversion via Deep Neural Networks Christopher Zerafa et.al. 2502.17624 null
2025-02-23 Subpixel Edge Localization Based on Converted Intensity Summation under Stable Edge Region Yingyuan Yang et.al. 2502.16502 null
2025-02-17 Enhancing Transparent Object Pose Estimation: A Fusion of GDR-Net and Edge Detection Tessa Pulli et.al. 2502.12027 null
2025-02-14 Edge detection with polynomial frames on the sphere Frederic Schoppert et.al. 2502.09979 null
2025-02-08 Multifunctional meta-optic azimuthal shear interferometer Linzhi Yu et.al. 2502.05569 null
2025-02-06 Agricultural Field Boundary Detection through Integration of “Simple Non-Iterative Clustering (SNIC) Super Pixels” and “Canny Edge Detection Method” Artughrul Gayibov et.al. 2502.04529 link
2025-01-31 Training-free Quantum-Inspired Image Edge Extraction Method Arti Jain et.al. 2501.18929 null
2025-01-27 Autonomous Horizon-based Asteroid Navigation With Observability-constrained Maneuvers Aditya Arjun Anibha et.al. 2501.15806 null
2025-01-25 Snapshot Compressed Imaging Based Single-Measurement Computer Vision for Videos Fengpu Pan et.al. 2501.15122 null
2025-01-29 Stroke classification using Virtual Hybrid Edge Detection from in silico electrical impedance tomography data Juan Pablo Agnelli et.al. 2501.14704 null
2025-01-23 Enhanced Extractor-Selector Framework and Symmetrization Weighted Binary Cross-Entropy for Edge Detections Hao Shu et.al. 2501.13365 null
2025-01-20 Wafer-scale waveguide sidewall roughness scattering loss characterization by image processing Mohit Khurana et.al. 2501.11590 null
2025-01-08 EDMB: Edge Detector with Mamba Yachuan Li et.al. 2501.04846 link
2025-01-06 Gaussian Masked Autoencoders Jathushan Rajasegaran et.al. 2501.03229 null
2025-01-05 Pixel-Wise Feature Selection for Perceptual Edge Detection without post-processing Hao Shu et.al. 2501.02534 null
2025-01-03 Structural and Statistical Audio Texture Knowledge Distillation (SSATKD) for Passive Sonar Classification Jarin Ritu et.al. 2501.01921 link
2024-12-24 Efficient Detection Framework Adaptation for Edge Computing: A Plug-and-play Neural Network Toolbox Enabling Edge Deployment Jiaqi Wu et.al. 2412.18230 link
2024-12-22 Phase-change metasurfaces for reconfigurable image processing Tingting Liu et.al. 2412.16856 null
2024-12-17 Synthetic Data Generation for Anomaly Detection on Table Grapes Ionut Marian Motoi et.al. 2412.12949 link
2024-12-17 SAUGE: Taming SAM for Uncertainty-Aligned Multi-Granularity Edge Detection Xing Liufu et.al. 2412.12892 link
2025-02-03 Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining Zhiqi Ge et.al. 2412.10342 null
2024-12-13 Deep Gaussian Process Priors for Bayesian Image Reconstruction Jonas Latz et.al. 2412.10248 link
2024-12-06 Spinal ligaments detection on vertebrae meshes using registration and 3D edge detection Ivanna Kramer et.al. 2412.05081 null
2024-11-29 Simultaneous two-dimensional velocity and distance measurements based on laser triangulation Hao Zhang et.al. 2411.19669 null
2024-11-27 Fall Leaf Adversarial Attack on Traffic Sign Classification Anthony Etim et.al. 2411.18776 null
2024-11-22 Deep Learning-Based Automatic Delineation of Liver Domes in kV Triggered Images for Online Breath-hold Reproducibility Verification of Liver Stereotactic Body Radiation Therapy Sugandima Weragoda et.al. 2411.15322 null
2024-12-24 Defective Edge Detection Using Cascaded Ensemble Canny Operator Anjali Nambiyar Rajkumar Kannan et.al. 2411.14868 null
2024-11-21 Transforming Engineering Diagrams: A Novel Approach for P&ID Digitization using Transformers Jan Marius Stürmer et.al. 2411.13929 null
2024-11-20 Edge-Detected 4DSTEM – effective low-dose diffraction data acquisition method for nanopowder samples in a SEM instrument Nikita Denisov et.al. 2411.13265 null
2024-11-12 Well-posedness of a Variable-Exponent Telegraph Equation Applied to Image Despeckling Sudeb Majee et.al. 2411.08175 null
2024-11-12 WavShadow: Wavelet Based Shadow Segmentation and Removal Shreyans Jain et.al. 2411.05747 null
2024-11-06 Mapping reionization bubbles in the JWST era I: empirical edge detection with Lyman alpha emission from galaxies Ting-Yi Lu et.al. 2411.04176 null
2024-11-04 Deep Learning for Leopard Individual Identification: An Adaptive Angular Margin Approach David Colomer Matachana et.al. 2411.01962 link
2024-10-29 Assessment of Abrupt Shifts in CMIP6 Models using Edge Detection Sjoerd Terpstra et.al. 2410.19498 null
2024-10-19 Cutting-Edge Detection of Fatigue in Drivers: A Comparative Study of Object Detection Models Amelia Jones et.al. 2410.15030 null
2024-10-17 Co-Segmentation without any Pixel-level Supervision with Application to Large-Scale Sketch Classification Nikolaos-Antonios Ypsilantis et.al. 2410.13582 null
2024-10-16 Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization Nanda Febri Istighfarin et.al. 2410.12240 null
2024-10-13 Energy-Efficient and Fast Memristor-based Serial Multipliers Applicable in Image Processing Seyed Erfan Fatemieh et.al. 2410.09953 null
2024-10-04 Generative Edge Detection with Stable Diffusion Caixia Zhou et.al. 2410.03080 null
2024-11-07 Learning from Pattern Completion: Self-supervised Controllable Generation Zhiqiang Chen et.al. 2409.18694 link
2024-09-26 Photon Inhibition for Energy-Efficient Single-Photon Imaging Lucas J. Koerner et.al. 2409.18337 null
2024-09-26 EfficientCrackNet: A Lightweight Model for Crack Segmentation Abid Hasan Zim et.al. 2409.18099 null
2024-09-24 Nonlinear Analog Processing with Anisotropic Nonlinear Films Michele Cotrufo et.al. 2409.16448 null
2024-11-24 A new baseline for edge detection: Make Encoder-Decoder great again Yachuan Li et.al. 2409.14976 link
2024-09-17 OmniGen: Unified Image Generation Shitao Xiao et.al. 2409.11340 link
2024-09-17 Nonlocal phase-change metaoptics for reconfigurable nonvolatile image processing Guoce Yang et.al. 2409.10976 null
2024-08-26 Automated Quantification of White Blood Cells in Light Microscopic Images of Injured Skeletal Muscle Yang Jiao et.al. 2409.06722 null
2024-09-11 A Machine Learning Based Approach for Statistical Analysis of Detonation Cells from Soot Foils Vansh Sharma et.al. 2409.06466 null
2024-09-10 Contour Analysis Tool: an interactive tool for background and morphology analysis Mark A. Hutchison et.al. 2409.06421 null
2024-09-06 Cycle Pixel Difference Network for Crisp Edge Detection Changsong Liu et.al. 2409.04272 null
2024-09-04 Image Registration with Averaging Network and Edge-Based Loss for Low-SNR Cardiac MRI Xuan Lei et.al. 2409.02348 null
2024-09-03 EDCSSM: Edge Detection with Convolutional State Space Model Qinghui Hong et.al. 2409.01609 null
2024-08-29 Android Malware Detection Based on RGB Images and Multi-feature Fusion Zhiqiang Wang et.al. 2408.16555 null
2024-09-15 Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks Sierra Bonilla et.al. 2408.16445 link
2024-08-28 Image Triangulation Using the Sobel Operator for Vertex Selection Olivia Laske et.al. 2408.16112 null
2024-08-27 Optimizing Lung Cancer Detection in CT Imaging: A Wavelet Multi-Layer Perceptron (WMLP) Approach Enhanced by Dragonfly Algorithm (DA) Bitasadat Jamshidi et.al. 2408.15355 null
2024-09-03 A Multiscale Gradient Fusion Method for Edge Detection in Color Images Utilizing the CBM3D Filter Zhuoyue Wang et.al. 2408.14013 null
2024-08-20 EdgeNAT: Transformer for Efficient Edge Detection Jinghuai Jie et.al. 2408.10527 link
2024-08-19 Edge detection imaging by quasi-bound states in the continuum Tingting Liu et.al. 2408.10106 null
2024-08-08 UHNet: An Ultra-Lightweight and High-Speed Edge Detection Network Fuzhang Li et.al. 2408.04258 null
2024-08-07 GUI Element Detection Using SOTA YOLO Deep Learning Models Seyed Shayan Daneshvar et.al. 2408.03507 null
2024-07-19 How Homogenizing the Channel-wise Magnitude Can Enhance EEG Classification Model? Huyen Ngo et.al. 2407.20247 null
2024-07-29 More precise edge detections Hao Shu et.al. 2407.19992 link
2024-06-28 DCSM 2.0: Deep Conditional Shape Models for Data Efficient Segmentation Athira J Jacob et.al. 2407.00186 null
2024-06-19 Advancements in Orthopaedic Arm Segmentation: A Comprehensive Review Abhishek Swami et.al. 2406.13266 null
2024-06-14 Research on Edge Detection of LiDAR Images Based on Artificial Intelligence Technology Haowei Yang et.al. 2406.09773 null
2024-06-14 An alternate approach for estimating grain-growth kinetics Manoj Prabakar et.al. 2406.09653 link
2024-06-12 A New Class Biorthogonal Spline Wavelet for Image Edge Detection Dujuan Zhou et.al. 2406.08285 null
2024-06-28 Learning to utilize image second-order derivative information for crisp edge detection Changsong Liu et.al. 2406.05779 null
2024-06-04 RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting Qi Wang et.al. 2406.02461 null
2024-06-02 An Optimized Toolbox for Advanced Image Processing with Tsetlin Machine Composites Ylva Grønningsæter et.al. 2406.00704 link
2024-06-01 A Review of Pulse-Coupled Neural Network Applications in Computer Vision and Image Processing Nurul Rafi et.al. 2406.00239 null
2024-05-28 Enhanced infrared vision by nonlinear up-conversion in nonlocal metasurfaces Laura Valencia Molina et.al. 2405.17726 null
2024-04-02 Improving and Evaluating Machine Learning Methods for Forensic Shoeprint Matching Divij Jain et.al. 2405.14878 null
2024-05-21 Automating Attendance Management in Human Resources: A Design Science Approach Using Computer Vision and Facial Recognition Bao-Thien Nguyen-Tat et.al. 2405.12633 null
2024-05-19 The Effectiveness of Edge Detection Evaluation Metrics for Automated Coastline Detection Conor O’Sullivan et.al. 2405.11498 link
2024-05-19 Automated Coastline Extraction Using Edge Detection Algorithms Conor O’Sullivan et.al. 2405.11494 link
2024-05-18 Quantum Edge Detection Santiago Llorens et.al. 2405.11373 null
2024-05-14 NAFRSSR: a Lightweight Recursive Network for Efficient Stereo Image Super-Resolution Yihong Chen et.al. 2405.08423 link
2024-05-13 AnomalyLLM: Few-shot Anomaly Edge Detection for Dynamic Graphs using Large Language Models Shuo Liu et.al. 2405.07626 link
2024-05-07 Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map Yuxuan Xia et.al. 2405.04290 null
2024-05-06 Statistical Edge Detection And UDF Learning For Shape Representation Virgile Foy et.al. 2405.03381 null
2024-04-14 Change Guiding Network: Incorporating Change Prior to Guide Change Detection in Remote Sensing Imagery Chengxi Han et.al. 2404.09179 link
2024-04-10 Edge Detection Quantumized: A Novel Quantum Algorithm For Image Processing Syed Emad Uddin Shubha et.al. 2404.06889 null
2024-06-01 Leveraging edge detection and neural networks for better UAV localization Theo Di Piazza et.al. 2404.06207 link
2024-04-07 Msmsfnet: a multi-stream and multi-scale fusion net for edge detection Chenguang Liu et.al. 2404.04856 null
2024-03-30 The Devil is in the Edges: Monocular Depth Estimation with Edge-aware Consistency Fusion Pengzhi Li et.al. 2404.00373 null
2024-03-30 Radio Frequency Interference Detection Using Efficient Multi-Scale Convolutional Attention UNet Fei Gu et.al. 2404.00277 null
2024-03-28 Learning Multiple Representations with Inconsistency-Guided Detail Regularization for Mask-Guided Matting Weihao Jiang et.al. 2403.19213 null
2024-03-27 Colour and Brush Stroke Pattern Recognition in Abstract Art using Modified Deep Convolutional Generative Adversarial Networks Srinitish Srinivasan et.al. 2403.18397 link
2024-03-23 An edge detection-based deep learning approach for tear meniscus height measurement Kesheng Wang et.al. 2403.15853 null
2024-03-18 Logistic regression to boost exoplanet detection performances Hadrien Cambazard et.al. 2403.11571 null
2024-03-17 Advanced Knowledge Extraction of Physical Design Drawings, Translation and conversion to CAD formats using Deep Learning Jesher Joshua M et.al. 2403.11291 null
2024-03-16 Texture Edge detection by Patch consensus (TEP) Guangyu Cui et.al. 2403.11038 null
2024-03-14 Temporal Signal Processing with Nonlocal Optical Metasurfaces Michele Cotrufo et.al. 2403.09087 null
2024-03-13 RAF-GI: Towards Robust, Accurate and Fast-Convergent Gradient Inversion Attack in Federated Learning Can Liu et.al. 2403.08383 link
2024-03-13 MGIC: A Multi-Label Gradient Inversion Attack based on Canny Edge Detection on Federated Learning Can Liu et.al. 2403.08284 null
2024-03-07 RankED: Addressing Imbalance and Uncertainty in Edge Detection Using Ranking-based Losses Bedrettin Cetinkaya et.al. 2403.01795 link
2024-03-03 CDSE-UNet: Enhancing COVID-19 CT Image Segmentation with Canny Edge Detection and Dual-Path SENet Feature Fusion Jiao Ding et.al. 2403.01513 null
2024-02-28 On the Accuracy of Edge Detectors in Number Plate Extraction Bashir Olaniyi Sadiq et.al. 2402.18251 null
2024-03-20 Lightweight, error-tolerant edge detection using memristor-enabled stochastic logics Lekai Song et.al. 2402.16908 null
2024-02-22 SHM-Traffic: DRL and Transfer learning based UAV Control for Structural Health Monitoring of Bridges with Traffic Divija Swetha Gadiraju et.al. 2402.14757 null
2024-02-18 Near-infrared metalens empowered dual-mode high resolution and large FOV microscope Chuang Sun et.al. 2402.11554 null
2024-02-07 Color Recognition in Challenging Lighting Environments: CNN Approach Nizamuddin Maitlo et.al. 2402.04762 null
2024-02-01 Lightweight Pixel Difference Networks for Efficient Visual Representation Learning Zhuo Su et.al. 2402.00422 link
2024-01-27 Applications of Tao General Difference in Discrete Domain Linmi Tao et.al. 2401.15287 null
2024-01-18 False Discovery Rate Control for Gaussian Graphical Models via Neighborhood Screening Taulant Koka et.al. 2401.09979 null
2024-01-14 Photonic real time video image signal processor at 17Tb/s based on a Kerr microcomb Mengxi Tan et.al. 2401.07197 null
2024-01-12 Space-Time Nonlocal Metasurfaces for Event-Based Image Processing Sedigheh Esfahani et.al. 2401.06586 null
2024-01-07 Real-Time Asphalt Pavement Layer Thickness Prediction Using Ground-Penetrating Radar Based on a Modified Extended Common Mid-Point (XCMP) Approach Siqi Wang et.al. 2401.03375 null
2024-01-05 Systematic review of image segmentation using complex networks Amin Rezaei et.al. 2401.02758 null
2024-01-04 SuperEdge: Towards a Generalization Model for Self-Supervised Edge Detection Leng Kai et.al. 2401.02313 link
2024-01-09 DiffusionEdge: Diffusion Probabilistic Model for Crisp Edge Detection Yunfan Ye et.al. 2401.02032 link
2023-12-21 Dual Attention U-Net with Feature Infusion: Pushing the Boundaries of Multiclass Defect Segmentation Rasha Alshawi et.al. 2312.14053 link
2023-12-14 Automated Grain Boundary Detection for Bright-Field Transmission Electron Microscopy Images via U-Net Matthew J. Patrick et.al. 2312.09392 null
2023-12-10 Polar Linear Canonical Wavelet Transform: Theory and Its Application Hui Zhao et.al. 2312.06702 null
2023-12-09 A fast numerical algorithm for finding all real solutions to a system of N nonlinear equations in a finite domain Fernando Chueca-Diez et.al. 2312.03927 null
2023-12-04 Cable Slack Detection for Arresting Gear Application using Machine Vision Ari Goodman et.al. 2312.02320 null
2023-12-03 Meta ControlNet: Enhancing Task Adaptation via Meta Learning Junjie Yang et.al. 2312.01255 link
2023-10-28 Vision-Based Incoming Traffic Estimator Using Deep Neural Network on General Purpose Embedded Hardware K. G. Zoysa et.al. 2311.16125 null
2023-11-27 DiffSLVA: Harnessing Diffusion Models for Sign Language Video Anonymization Zhaoyang Xia et.al. 2311.16060 link
2023-11-22 Reconfigurable Image Processing Metasurfaces with Phase-Change Materials Michele Cotrufo et.al. 2311.13109 null
2023-11-21 Unveiling the cosmic dawn and epoch of reionization using cosmic 21-cm signal Ankita Bera et.al. 2311.13019 null
2023-11-16 Depth Insight – Contribution of Different Features to Indoor Single-image Depth Estimation Yihong Wu et.al. 2311.10042 null
2023-11-14 RoboSense At Edge: Detecting Slip, Crumple and Shape of the Object in Robotic Hand for Teleoprations Sudev Kumar Padhi et.al. 2311.07888 null
2023-10-28 Tracking and fast imaging of a translational object via Fourier modulation Shijian Li et.al. 2310.18732 null
2024-01-09 FaultSeg Swin-UNETR: Transformer-Based Self-Supervised Pretraining Model for Fault Recognition Zeren Zhang et.al. 2310.17974 null
2023-11-08 Constraining exotic dark matter models with the dark ages 21-cm signal Rajesh Mondal et.al. 2310.15530 null
2023-10-22 Research on Key Technologies of Infrastructure Digitalization based on Multimodal Spatial Data Zhanyuan Tian et.al. 2310.14296 null
2023-10-01 Quantum image edge detection based on eight-direction Sobel operator for NEQR Wenjie Liu et.al. 2310.03037 null
2023-09-26 3D Density-Gradient based Edge Detection on Neural Radiance Fields (NeRFs) for Geometric Reconstruction Miriam Jäger et.al. 2309.14800 null
2023-09-13 Temporal compressive edge imaging enabled by a lensless diffuser camera Ze Zheng et.al. 2309.07198 null
2023-11-05 MEGANet: Multi-Scale Edge-Guided Attention Network for Weak Boundary Polyp Segmentation Nhat-Tan Bui et.al. 2309.03329 link
2023-09-05 DeNISE: Deep Networks for Improved Segmentation Edges Sander Riisøen Jyhne et.al. 2309.02091 null
2023-08-29 A Pseudo-Boolean Polynomials Approach for Image Edge Detection Tendai Mapungwana Chikake et.al. 2308.15557 link
2023-08-29 Pseudo-Boolean Polynomials Approach To Edge Detection And Image Segmentation Tendai Mapungwana Chikake et.al. 2308.15453 null
2023-08-27 Practical Edge Detection via Robust Collaborative Learning Yuanbin Fu et.al. 2308.14084 link
2023-11-18 Zero-Shot Edge Detection with SCESAME: Spectral Clustering-based Ensemble for Segment Anything Model Estimation Hiroaki Yamagiwa et.al. 2308.13779 link
2023-08-19 R-C-P Method: An Autonomous Volume Calculation Method Using Image Processing and Machine Vision MA Muktadir et.al. 2308.10058 null
2023-08-19 TSAR-MVS: Textureless-aware Segmentation and Correlative Refinement Guided Multi-View Stereo Zhenlong Yuan et.al. 2308.09990 null
2023-08-12 The Color Clifford Hardy Signal: Application to Color Edge Detection and Optical Flow Xiaoxiao Hu et.al. 2308.06485 null
2023-08-12 Tiny and Efficient Model for the Edge Detection Generalization Xavier Soria et.al. 2308.06468 link
2023-08-05 Electromagnetic Spatiotemporal Differentiators Yi Zhou et.al. 2308.03797 null
2023-08-06 ECT: Fine-grained Edge Detection with Learned Cause Tokens Shaocong Xu et.al. 2308.03092 link
2023-08-08 Generation of Realistic Synthetic Raw Radar Data for Automated Driving Applications using Generative Adversarial Networks Eduardo C. Fidelis et.al. 2308.02632 link
2023-08-23 MSECNet: Accurate and Robust Normal Estimation for 3D Point Clouds by Multi-Scale Edge Conditioning Haoyi Xiu et.al. 2308.02237 link
2023-07-31 Multispectral Image Segmentation in Agriculture: A Comprehensive Study on Fusion Approaches Nuno Cunha et.al. 2308.00159 link
2023-07-31 Hybrid quantum transfer learning for crack image classification on NISQ hardware Alexander Geng et.al. 2307.16723 null
2023-10-16 PNT-Edge: Towards Robust Edge Detection with Noisy Labels by Learning Pixel-level Noise Transitions Wenjie Xuan et.al. 2307.14070 link
2023-07-20 Integrated Photonic Fractional Convolution Accelerator Kevin Zelaya et.al. 2307.10976 null
2023-07-11 Compact Twice Fusion Network for Edge Detection Yachuan Li et.al. 2307.04952 link
2023-07-08 Edge-Aware Mirror Network for Camouflaged Object Detection Dongyue Sun et.al. 2307.03932 link
2023-07-08 On a cylindrical scanning modality in three-dimensional Compton scatter tomography James W. Webber et.al. 2307.03896 null
2023-07-07 Polarization Imaging and Edge Detection with Image-Processing Metasurfaces Michele Cotrufo et.al. 2307.03548 null
2023-07-07 A Deep Active Contour Model for Delineating Glacier Calving Fronts Konrad Heidler et.al. 2307.03461 null
2023-06-29 Pupil-driven quantitative differential phase contrast imaging Shuhe Zhang et.al. 2306.17088 null
2023-06-27 Delving into Crispness: Guided Label Refinement for Crisp Edge Detection Yunfan Ye et.al. 2306.15172 link
2023-06-26 Integrated lithium niobate microwave photonic processing engine Hanke Feng et.al. 2306.14415 null
2023-06-22 XAI-TRIS: Non-linear benchmarks to quantify ML explanation performance Benedict Clark et.al. 2306.12816 link
2023-07-03 A Survey on Segment Anything Model (SAM): Vision Foundation Model Meets Prompt Engineering Chaoning Zhang et.al. 2306.06211 null
2023-06-03 Hierarchical Multiresolution Feature- and Prior-based Graphs for Classification Faezeh Fallah et.al. 2306.02143 null
2023-05-31 SPAC-Net: Synthetic Pose-aware Animal ControlNet for Enhanced Pose Estimation Le Jiang et.al. 2305.17845 link
2023-05-16 A Geometric Calibration of the Tip of the Red Giant Branch in the Milky Way using Gaia DR3 M. Dixon et.al. 2305.09215 null
2023-05-12 Vision and Control for Grasping Clear Plastic Bags Joohwan Seo et.al. 2305.07631 link
2023-07-28 Edge-Enhanced Microscopy of Comlplex Object using Scalar and Vectorial Vortex Filtering Jigme Zangpo et.al. 2305.07225 null
2023-05-10 Novel Quantum Information Processing Methods and Investigation Zhang Ze Yu et.al. 2305.05953 null
2023-05-10 Low-Light Image Enhancement via Structure Modeling and Guidance Xiaogang Xu et.al. 2305.05839 link
2023-04-30 Multi-directional Sobel operator kernel on GPUs Qiong Chang et.al. 2305.00515 null
2023-04-30 Continuous motion of an electrically actuated water droplet over a PDMS-coated surface Supriya Upadhyay et.al. 2305.00420 null
2023-04-13 CATS: The Hubble Constant from Standardized TRGB and Type Ia Supernova Measurements D. Scolnic et.al. 2304.06693 null
2023-04-10 Reconstruction-driven Dynamic Refinement based Unsupervised Domain Adaptation for Joint Optic Disc and Cup Segmentation Ziyang Chen et.al. 2304.04581 null
2023-03-28 Vision based UAV Navigation through Narrow Passages Jayakant Kumar et.al. 2303.15803 null
2023-03-21 The Treasure Beneath Multiple Annotations: An Uncertainty-aware Edge Detector Caixia Zhou et.al. 2303.11828 link
2023-03-15 PENet: A Joint Panoptic Edge Detection Network Yang Zhou et.al. 2303.08848 link
2023-05-08 SILOP: An Automated Framework for Semantic Segmentation Using Image Labels Based on Object Perimeters Erik Ostrowski et.al. 2303.07892 link
2023-03-16 NEF: Neural Edge Fields for 3D Parametric Curve Reconstruction from Multi-view Images Yunfan Ye et.al. 2303.07653 link
2023-03-10 Automatic Detection and Rectification of Paper Receipts on Smartphones Edward Whittaker et.al. 2303.05763 null
2023-03-09 When Optical Microscopy Meets All-Optical Analog Computing: A Brief Review Yichang Shou et.al. 2303.04988 null
2023-03-06 Optimal Periodic Control of Unmanned Aerial Vehicles Based on Fourier Integral Pseudospectral and Edge-Detection Methods Kareem T. Elgindy et.al. 2303.02969 null
2023-03-02 Scalable optical neural networks based on temporal computing Shuang Zheng et.al. 2303.01287 null
2023-03-26 Attention-based Point Cloud Edge Sampling Chengzhi Wu et.al. 2302.14673 link

transfer learning

Publish Date Title Authors PDF Code
2025-06-30 CoMMiT: Co-informed inference of microbiome-metabolome interactions via transfer learning Leiyue Li et.al. 2506.24013 null
2025-06-30 Pruning by Block Benefit: Exploring the Properties of Vision Transformer Blocks during Domain Adaptation Patrick Glandorf et.al. 2506.23675 null
2025-06-30 AI-Generated Lecture Slides for Improving Slide Element Detection and Retrieval Suyash Maniyar et.al. 2506.23605 null
2025-06-29 FedRef: Communication-Efficient Bayesian Fine Tuning with Reference Model Taehwan Yoon et.al. 2506.23210 null
2025-06-29 Self-Supervised Contrastive Learning for Multi-Label Images Jiale Chen et.al. 2506.23156 null
2025-06-28 Towards Time Series Generation Conditioned on Unstructured Natural Language Jaeyun Woo et.al. 2506.22927 null
2025-06-28 ReasonBridge: Efficient Reasoning Transfer from Closed to Open-Source Language Models Ziqi Zhong et.al. 2506.22865 null
2025-06-27 Are Fast Methods Stable in Adversarially Robust Transfer Learning? Joshua C. Zhao et.al. 2506.22602 null
2025-06-25 How Can Multimodal Remote Sensing Datasets Transform Classification via SpatialNet-ViT? Gautam Siddharth Kashyap et.al. 2506.22501 null
2025-06-27 Multi-View Contrastive Learning for Robust Domain Adaptation in Medical Time Series Analysis YongKyung Oh et.al. 2506.22393 null
2025-06-27 Transfer Learning for Assessing Heavy Metal Pollution in Seaports Sediments Tin Lai et.al. 2506.22096 null
2025-06-27 Visual Content Detection in Educational Videos with Transfer Learning and Dataset Enrichment Dipayan Biswas et.al. 2506.21903 null
2025-06-26 Offensive Language Detection on Social Media Using XLNet Reem Alothman et.al. 2506.21795 null
2025-06-26 Benchmarking Deep Learning and Vision Foundation Models for Atypical vs. Normal Mitosis Classification with Cross-Dataset Evaluation Sweta Banerjee et.al. 2506.21444 null
2025-06-25 Brain2Model Transfer: Training sensory and decision models with human neural activity as a teacher Tomas Gallo Aquino et.al. 2506.20834 null
2025-06-25 Physics-Informed Machine Learning Regulated by Finite Element Analysis for Simulation Acceleration of Laser Powder Bed Fusion R. Sharma et.al. 2506.20537 null
2025-06-25 Comparative Analysis of Deep Learning Models for Crop Disease Detection: A Transfer Learning Approach Saundarya Subramaniam et.al. 2506.20323 null
2025-06-25 FundaQ-8: A Clinically-Inspired Scoring Framework for Automated Fundus Image Quality Assessment Lee Qi Zun et.al. 2506.20303 null
2025-06-24 General Methods Make Great Domain-specific Foundation Models: A Case-study on Fetal Ultrasound Jakob Ambsdorf et.al. 2506.19552 null
2025-06-24 From High-SNR Radar Signal to ECG: A Transfer Learning Model with Cardio-Focusing Algorithm for Scenarios with Limited Data Yuanyuan Zhang et.al. 2506.19358 null
2025-06-23 Focus Your Attention: Towards Data-Intuitive Lightweight Vision Transformers Suyash Gaurav et.al. 2506.18791 null
2025-06-23 Leveraging Transfer Learning to Overcome Data Limitations in Czochralski Crystal Growth Milena Petkovic et.al. 2506.18774 null
2025-06-23 Benchmarking histopathology foundation models in a multi-center dataset for skin cancer subtyping Pablo Meseguer et.al. 2506.18668 null
2025-06-23 When Fine-Tuning Fails: Lessons from MS MARCO Passage Ranking Manu Pande et.al. 2506.18535 null
2025-06-23 Generalizing Vision-Language Models to Novel Domains: A Comprehensive Survey Xinyao Li et.al. 2506.18504 null
2025-06-23 Leveraging neural network interatomic potentials for a foundation model of chemistry So Yeon Kim et.al. 2506.18497 null
2025-06-26 These Are Not All the Features You Are Looking For: A Fundamental Bottleneck in Supervised Pretraining Xingyu Alice Yang et.al. 2506.18221 null
2025-06-22 Deep Supervised LSTM for 3D morphology estimation from Multi-View RGB Images of Wheat Spikes Olivia Zumsteg et.al. 2506.18060 null
2025-06-22 Classification of Tents in Street Bazaars Using CNN Azamat Ibragimov et.al. 2506.17946 null
2025-06-21 Rethinking the Role of Operating Conditions for Learning-based Multi-condition Fault Diagnosis Pengyu Han et.al. 2506.17740 null
2025-06-21 Numerical simulation of transient heat conduction with moving heat source using Physics Informed Neural Networks Anirudh Kalyan et.al. 2506.17726 null
2025-06-21 Unveiling Factors for Enhanced POS Tagging: A Study of Low-Resource Medieval Romance Languages Matthias Schöffel et.al. 2506.17715 null
2025-06-20 Trustworthy Few-Shot Transfer of Medical VLMs through Split Conformal Prediction Julio Silva-Rodríguez et.al. 2506.17503 null
2025-06-19 Energy-Based Transfer for Reinforcement Learning Zeyun Deng et.al. 2506.16590 null
2025-06-17 Large Language Models – the Future of Fundamental Physics? Caroline Heneka et.al. 2506.14757 null
2025-06-17 DiFuse-Net: RGB and Dual-Pixel Depth Estimation using Window Bi-directional Parallax Attention and Cross-modal Transfer Learning Kunal Swami et.al. 2506.14709 null
2025-06-17 Bayesian Knowledge Transfer for a Kalman Fixed-Lag Interval Smoother Ondřej Skalský et.al. 2506.14572 null
2025-06-17 Adjustment for Confounding using Pre-Trained Representations Rickmer Schulte et.al. 2506.14329 link
2025-06-17 Less is More: Undertraining Experts Improves Model Upcycling Stefan Horoi et.al. 2506.14126 null
2025-06-17 Leveraging Transfer Learning and User-Specific Updates for Rapid Training of BCI Decoders Ziheng Chen et.al. 2506.14120 null
2025-06-16 Understand the Implication: Learning to Think for Pragmatic Understanding Settaluri Lakshmi Sravanthi et.al. 2506.13559 null
2025-06-16 Advancing Image-Based Grapevine Variety Classification with a New Benchmark and Evaluation of Masked Autoencoders Gabriel A. Carneiro et.al. 2506.13335 null
2025-06-16 Evolution of ReID: From Early Methods to LLM Integration Amran Bhuiyan et.al. 2506.13039 null
2025-06-16 Geometric Embedding Alignment via Curvature Matching in Transfer Learning Sung Moon Ko et.al. 2506.13015 null
2025-06-14 Konooz: Multi-domain Multi-dialect Corpus for Named Entity Recognition Nagham Hamad et.al. 2506.12615 null
2025-06-14 A Transfer Learning Framework for Multilayer Networks via Model Averaging Yongqin Qiu et.al. 2506.12455 null
2025-06-14 Hierarchical Deep Feature Fusion and Ensemble Learning for Enhanced Brain Tumor MRI Classification Zahid Ullah et.al. 2506.12363 null
2025-06-13 Interpretable Classification of Levantine Ceramic Thin Sections via Neural Networks Sara Capriotti et.al. 2506.12250 null
2025-06-13 Coefficient Shape Transfer Learning for Functional Linear Regression Shuhao Jiao et.al. 2506.11367 null
2025-06-12 Many-Body Neural Network Wavefunction for a Non-Hermitian Ising Chain Lavoisier Wah et.al. 2506.11222 null
2025-06-12 PromptTSS: A Prompting-Based Approach for Interactive Multi-Granularity Time Series Segmentation Ching Chang et.al. 2506.11170 null
2025-06-12 Instance-Based Transfer Learning with Similarity-Aware Subject Selection for Cross-Subject SSVEP-Based BCIs Ziwen Wang et.al. 2506.10933 null
2025-06-12 Efficient nanophotonic devices optimization using deep neural network trained with physics-based transfer learning (PBTL) methodology Gibaek Kim et.al. 2506.10418 null
2025-06-12 Uncertainty-Aware Deep Learning for Automated Skin Cancer Classification: A Comprehensive Evaluation Hamzeh Asgharnezhad et.al. 2506.10302 null
2025-06-11 Going beyond density functional theory accuracy: Leveraging experimental data to refine pre-trained machine learning interatomic potentials Shriya Gumber et.al. 2506.10211 null
2025-06-11 Attention on flow control: transformer-based reinforcement learning for lift regulation in highly disturbed flows Zhecheng Liu et.al. 2506.10153 null
2025-06-11 Auto-Compressing Networks Vaggelis Dorovatas et.al. 2506.09714 null
2025-06-11 An Effective End-to-End Solution for Multimodal Action Recognition Songping Wang et.al. 2506.09345 null
2025-06-10 An Explainable Deep Learning Framework for Brain Stroke and Tumor Progression via MRI Interpretation Rajan Das Gupta et.al. 2506.09161 null
2025-06-07 Exploring Image Transforms derived from Eye Gaze Variables for Progressive Autism Diagnosis Abigail Copiaco et.al. 2506.09065 null
2025-06-11 Do Multiple Instance Learning Models Transfer? Daniel Shao et.al. 2506.09022 link
2025-06-10 Data-Efficient Challenges in Visual Inductive Priors: A Retrospective Robert-Jan Bruintjes et.al. 2506.08612 null
2025-06-10 Robust Evolutionary Multi-Objective Network Architecture Search for Reinforcement Learning (EMNAS-RL) Nihal Acharya Adde et.al. 2506.08533 null
2025-06-10 Discovery of Odd Radio Circles and Other Peculiars in the First Year of the EMU Survey using Object Detection Nikhel Gupta et.al. 2506.08439 null
2025-06-09 CrosswalkNet: An Optimized Deep Learning Framework for Pedestrian Crosswalk Detection in Aerial Images with High-Performance Computing Zubin Bhuyan et.al. 2506.07885 null
2025-06-09 The Catechol Benchmark: Time-series Solvent Selection Data for Few-shot Machine Learning Toby Boyne et.al. 2506.07619 link
2025-06-09 Flowing Datasets with Wasserstein over Wasserstein Gradient Flows Clément Bonet et.al. 2506.07534 link
2025-06-09 Variational Supervised Contrastive Learning Ziwen Wang et.al. 2506.07413 null
2025-06-08 Transfer Learning and Explainable AI for Brain Tumor Classification: A Study Using MRI Data from Bangladesh Shuvashis Sarker et.al. 2506.07228 null
2025-06-08 State Entropy Regularization for Robust Reinforcement Learning Uri Koren et.al. 2506.07085 null
2025-06-07 Exploring Visual Prompting: Robustness Inheritance and Beyond Qi Li et.al. 2506.06823 null
2025-06-06 Textile Analysis for Recycling Automation using Transfer Learning and Zero-Shot Foundation Models Yannis Spyridis et.al. 2506.06569 null
2025-06-03 CR-BLEA: Contrastive Ranking for Adaptive Resource Allocation in Bilevel Evolutionary Algorithms Dejun Xu et.al. 2506.06362 null
2025-06-06 Full Conformal Adaptation of Medical Vision-Language Models Julio Silva-Rodríguez et.al. 2506.06076 null
2025-06-05 DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning Tanmay Parekh et.al. 2506.05128 null
2025-06-05 GEX: Democratizing Dexterity with Fully-Actuated Dexterous Hand and Exoskeleton Glove Yunlong Dong et.al. 2506.04982 link
2025-06-05 Scaling Laws for Robust Comparison of Open Foundation Language-Vision Models and Datasets Marianna Nezhurina et.al. 2506.04598 link
2025-06-05 OpenAg: Democratizing Agricultural Intelligence Srikanth Thudumu et.al. 2506.04571 null
2025-06-04 Neurosymbolic Artificial Intelligence for Robust Network Intrusion Detection: From Scratch to Transfer Learning Huynh T. T. Tran et.al. 2506.04454 null
2025-06-08 Beamforming and Resource Allocation for Delay Optimization in RIS-Assisted OFDM Systems Yu Ma et.al. 2506.03586 null
2025-06-03 Culture Matters in Toxic Language Detection in Persian Zahra Bokaei et.al. 2506.03458 null
2025-06-06 StARS DCM: A Sleep Stage-Decoding Forehead EEG Patch for Real-time Modulation of Sleep Physiology William G. Coon et.al. 2506.03442 null
2025-06-03 Semiconductor SEM Image Defect Classification Using Supervised and Semi-Supervised Learning with Vision Transformers Chien-Fu et.al. 2506.03345 null
2025-06-03 Extremely large oblate deformation of the first excited state in $^{12}$ C: a new challenge to modern nuclear theory C. Ngwetsheni et.al. 2506.03236 null
2025-05-31 Human Fall Detection using Transfer Learning-based 3D CNN Ekram Alam et.al. 2506.03193 null
2025-06-04 MMM4Rec: A Transfer-Efficient Framework for Multi-modal Sequential Recommendation Hao Fan et.al. 2506.02916 null
2025-06-03 MVTD: A Benchmark Dataset for Maritime Visual Object Tracking Ahsan Baidar Bakht et.al. 2506.02866 null
2025-06-03 Self-attention U-Net decoder for toric codes Wei-Wei Zhang et.al. 2506.02734 link
2025-06-03 MLaGA: Multimodal Large Language and Graph Assistant Dongzhe Fan et.al. 2506.02568 null
2025-06-02 Benchmarking Large Language Models for Polymer Property Predictions Sonakshi Gupta et.al. 2506.02129 null
2025-06-02 Principled data augmentation for learning to solve quadratic programming problems Chendi Qian et.al. 2506.01728 null
2025-06-02 Computing Diverse and Nice Triangulations Waldo Gálvez et.al. 2506.01323 null
2025-06-01 Advancing from Automated to Autonomous Beamline by Leveraging Computer Vision Baolu Li et.al. 2506.00836 null
2025-05-31 Getting More from Less: Transfer Learning Improves Sleep Stage Decoding Accuracy in Peripheral Wearable Devices William G Coon et.al. 2506.00730 null
2025-05-31 Temporal Chunking Enhances Recognition of Implicit Sequential Patterns Jayanta Dey et.al. 2506.00588 null
2025-05-31 COGNATE: Acceleration of Sparse Tensor Programs on Emerging Hardware using Transfer Learning Chamika Sudusinghe et.al. 2506.00424 null
2025-05-31 Neuro2Semantic: A Transfer Learning Framework for Semantic Reconstruction of Continuous Language from Human Intracranial EEG Siavash Shams et.al. 2506.00381 link
2025-05-30 Conformal Prediction for Zero-Shot Models Julio Silva-Rodríguez et.al. 2505.24693 link
2025-05-30 Density Ratio Permutation Tests with connections to distributional shifts and conditional two-sample testing Alberto Bordino et.al. 2505.24529 null
2025-05-30 Attractor learning for spatiotemporally chaotic dynamical systems using echo state networks with transfer learning Mohammad Shah Alam et.al. 2505.24099 null
2025-05-29 BIRD: Behavior Induction via Representation-structure Distillation Galen Pogoncheff et.al. 2505.23933 null
2025-05-29 To Trust Or Not To Trust Your Vision-Language Model’s Prediction Hao Dong et.al. 2505.23745 link
2025-05-29 Epistemic Errors of Imperfect Multitask Learners When Distributions Shift Sabina J. Sloman et.al. 2505.23496 null
2025-05-29 Graph Positional Autoencoders as Self-supervised Learners Yang Liu et.al. 2505.23345 null
2025-05-29 FreRA: A Frequency-Refined Augmentation for Contrastive Learning on Time Series Classification Tian Tian et.al. 2505.23181 link
2025-05-28 When Does Neuroevolution Outcompete Reinforcement Learning in Transfer Learning Tasks? Eleni Nisioti et.al. 2505.22696 link
2025-05-28 Chest Disease Detection In X-Ray Images Using Deep Learning Classification Method Alanna Hazlett et.al. 2505.22609 null
2025-05-28 GLAMP: An Approximate Message Passing Framework for Transfer Learning with Applications to Lasso-based Estimators Longlin Wang et.al. 2505.22594 null
2025-05-27 A Joint Reconstruction-Triplet Loss Autoencoder Approach Towards Unseen Attack Detection in IoV Networks Julia Boone et.al. 2505.21703 null
2025-05-27 LLMPR: A Novel LLM-Driven Transfer Learning based Petition Ranking Model Avijit Gayen et.al. 2505.21689 null
2025-05-27 Optimizing Deep Learning for Skin Cancer Classification: A Computationally Efficient CNN with Minimal Accuracy Trade-Off Abdullah Al Mamun et.al. 2505.21597 null
2025-05-26 Streamlining Resilient Kubernetes Autoscaling with Multi-Agent Systems via an Automated Online Design Framework Julien Soulé et.al. 2505.21559 null
2025-05-27 Data-Driven Cellular Mobility Management via Bayesian Optimization and Reinforcement Learning Mohamed Benzaghta et.al. 2505.21249 null
2025-05-27 Transfer learning for multifidelity simulation-based inference in cosmology Alex A. Saoulis et.al. 2505.21215 null
2025-05-27 Intelligent Incident Hypertension Prediction in Obstructive Sleep Apnea Omid Halimi Milani et.al. 2505.20615 null
2025-05-26 Solving Euler equations with Multiple Discontinuities via Separation-Transfer Physics-Informed Neural Networks Chuanxing Wang et.al. 2505.20361 null
2025-05-26 ViTaPEs: Visuotactile Position Encodings for Cross-Modal Alignment in Multimodal Transformers Fotios Lygerakis et.al. 2505.20032 null
2025-05-26 Advancements in Medical Image Classification through Fine-Tuning Natural Domain Foundation Models Mobina Mansoori et.al. 2505.19779 link
2025-05-25 Omni-Perception: Omnidirectional Collision Avoidance for Legged Locomotion in Dynamic Environments Zifan Wang et.al. 2505.19214 null
2025-05-25 A Smart Healthcare System for Monkeypox Skin Lesion Detection and Tracking Huda Alghoraibi et.al. 2505.19023 null
2025-05-29 Pessimism Principle Can Be Effective: Towards a Framework for Zero-Shot Transfer Reinforcement Learning Chi Zhang et.al. 2505.18447 null
2025-05-23 X-MethaneWet: A Cross-scale Global Wetland Methane Emission Benchmark Dataset for Advancing Science Discovery with AI Yiming Sun et.al. 2505.18355 null
2025-05-21 Reinforcement Twinning for Hybrid Control of Flapping-Wing Drones Romain Poletti et.al. 2505.18201 null
2025-05-23 TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations Alan Arazi et.al. 2505.18125 null
2025-05-23 Wasserstein Transfer Learning Kaicheng Zhang et.al. 2505.17404 null
2025-05-22 Transfer Faster, Price Smarter: Minimax Dynamic Pricing under Cross-Market Preference Shift Yi Zhang et.al. 2505.17203 null
2025-05-22 Mitigating Overfitting in Medical Imaging: Self-Supervised Pretraining vs. ImageNet Transfer Learning for Dermatological Diagnosis Iván Matas et.al. 2505.16773 null
2025-05-24 End-to-End Framework for Predicting the Remaining Useful Life of Lithium-Ion Batteries Khoa Tran et.al. 2505.16664 null
2025-05-22 WikiDBGraph: Large-Scale Database Graph of Wikidata for Collaborative Learning Zhaomin Wu et.al. 2505.16635 null
2025-05-22 Reward-Aware Proto-Representations in Reinforcement Learning Hon Tik Tse et.al. 2505.16217 null
2025-05-22 Scalable Graph Generative Modeling via Substructure Sequences Zehong Wang et.al. 2505.16130 link
2025-05-21 An Exploratory Approach Towards Investigating and Explaining Vision Transformer and Transfer Learning for Brain Disease Detection Shuvashis Sarker et.al. 2505.16039 null
2025-05-21 An Approach Towards Identifying Bangladeshi Leaf Diseases through Transfer Learning and XAI Faika Fairuj Preotee et.al. 2505.16033 null
2025-05-21 Comprehensive Lung Disease Detection Using Deep Learning Models and Hybrid Chest X-ray Data with Explainable AI Shuvashis Sarker et.al. 2505.16028 null
2025-05-21 Transfer of Structural Knowledge from Synthetic Languages Mikhail Budnikov et.al. 2505.15769 link
2025-05-21 Inter-Subject Variance Transfer Learning for EMG Pattern Classification Based on Bayesian Inference Seitaro Yoneda et.al. 2505.15381 null
2025-05-21 Scaling Diffusion Transformers Efficiently via $μ$ P Chenyu Zheng et.al. 2505.15270 link
2025-05-21 GAMA++: Disentangled Geometric Alignment with Adaptive Contrastive Perturbation for Reliable Domain Transfer Kim Yun et.al. 2505.15241 null
2025-05-21 Geometrically Regularized Transfer Learning with On-Manifold and Off-Manifold Perturbation Hana Satou et.al. 2505.15191 null
2025-05-21 AnyBody: A Benchmark Suite for Cross-Embodiment Manipulation Meenal Parakh et.al. 2505.14986 null
2025-05-20 MultiMAE Meets Earth Observation: Pre-training Multi-modal Multi-task Masked Autoencoders for Earth Observation Tasks Jose Sosa et.al. 2505.14951 link
2025-05-20 LOD1 3D City Model from LiDAR: The Impact of Segmentation Accuracy on Quality of Urban 3D Modeling and Morphology Extraction Fatemeh Chajaei et.al. 2505.14747 link
2025-05-20 Vulnerability of Transfer-Learned Neural Networks to Data Reconstruction Attacks in Small-Data Regime Tomasz Maciążek et.al. 2505.14323 link
2025-05-20 Data-Efficient Hate Speech Detection via Cross-Lingual Nearest Neighbor Retrieval with Limited Labeled Data Faeze Ghorbanpour et.al. 2505.14272 null
2025-05-20 Contrastive Consolidation of Top-Down Modulations Achieves Sparsely Supervised Continual Learning Viet Anh Khoa Tran et.al. 2505.14125 null
2025-05-20 Domain Adaptation of VLM for Soccer Video Understanding Tiancheng Jiang et.al. 2505.13860 null
2025-05-19 Adaptive Image Restoration for Video Surveillance: A Real-Time Approach Muhammad Awais Amin et.al. 2505.13130 null
2025-05-19 Cross-modal Knowledge Transfer Learning as Graph Matching Based on Optimal Transport for ASR Xugang Lu et.al. 2505.13079 null
2025-05-19 Mamba-Adaptor: State Space Model Adaptor for Visual Recognition Fei Xie et.al. 2505.12685 null
2025-05-19 On the Mechanisms of Adversarial Data Augmentation for Robust and Adaptive Transfer Learning Hana Satou et.al. 2505.12681 null
2025-05-18 InnateCoder: Learning Programmatic Options with Foundation Models Rubens O. Moraes et.al. 2505.12508 link
2025-05-18 Depth Transfer: Learning to See Like a Simulator for Real-World Drone Navigation Hang Yu et.al. 2505.12428 null
2025-05-17 Relation-Aware Graph Foundation Model Jianxiang Yu et.al. 2505.12027 null
2025-05-17 Residual Feature Integration is Sufficient to Prevent Negative Transfer Yichen Xu et.al. 2505.11771 link
2025-05-16 Evaluation and optimization of deep learning models for enhanced detection of brain cancer using transmission optical microscopy of thin brain tissue samples Mohnish Sao et.al. 2505.11735 null
2025-05-16 Humble your Overconfident Networks: Unlearning Overfitting via Sequential Monte Carlo Tempered Deep Ensembles Andrew Millard et.al. 2505.11671 null
2025-05-16 Programmable metasurfaces for future photonic artificial intelligence Loubnan Abou-Hamdan et.al. 2505.11659 null
2025-05-16 Towards Cultural Bridge by Bahnaric-Vietnamese Translation Using Transfer Learning of Sequence-To-Sequence Pre-training Language Model Phan Tran Minh Dat et.al. 2505.11421 null
2025-05-16 Assessing the Performance of Analog Training for Transfer Learning Omobayode Fagbohungbe et.al. 2505.11067 null
2025-05-19 Bias and Generalizability of Foundation Models across Datasets in Breast Mammography Elodie Germani et.al. 2505.10579 null
2025-05-15 An AI-driven framework for the prediction of personalised health response to air pollution Nazanin Zounemat Kermani et.al. 2505.10556 null
2025-05-15 Logos as a Well-Tempered Pre-train for Sign Language Recognition Ilya Ovodov et.al. 2505.10481 null
2025-05-15 MMRL++: Parameter-Efficient and Interaction-Aware Representation Learning for Vision-Language Models Yuncheng Guo et.al. 2505.10088 link
2025-05-15 Automated grading and staging of ovarian cancer using deep learning on the transmission optical microscopy bright-field images of thin biopsy tissue samples Ashmit K Mishra et.al. 2505.09993 null
2025-05-14 Community-based Multi-Agent Reinforcement Learning with Transfer and Active Exploration Zhaoyang Shi et.al. 2505.09756 null
2025-05-14 Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis Bingxin Ke et.al. 2505.09358 link
2025-05-13 GNN-based Precoder Design and Fine-tuning for Cell-free Massive MIMO with Real-world CSI Tianzheng Miao et.al. 2505.08788 null
2025-05-13 Revealing economic facts: LLMs know more than they say Marcus Buckmann et.al. 2505.08662 null
2025-05-13 A computer vision-based model for occupancy detection using low-resolution thermal images Xue Cui et.al. 2505.08336 null
2025-05-13 Knowledge-Informed Deep Learning for Irrigation Type Mapping from Remote Sensing Oishee Bintey Hoque et.al. 2505.08302 null
2025-05-12 Sleep Position Classification using Transfer Learning for Bed-based Pressure Sensors Olivier Papillon et.al. 2505.08111 null
2025-05-12 Multi-modal wound classification using wound image and location by Xception and Gaussian Mixture Recurrent Neural Network (GMRNN) Ramin Mousa et.al. 2505.08086 null
2025-05-10 Development of a WAZOBIA-Named Entity Recognition System S. E Emedem et.al. 2505.07884 null
2025-05-12 Gameplay Highlights Generation Vignesh Edithal et.al. 2505.07721 null
2025-05-12 Transfer Learning Across Fixed-Income Product Classes Nicolas Camenzind et.al. 2505.07676 null
2025-05-12 Automated Visual Attention Detection using Mobile Eye Tracking in Behavioral Classroom Studies Efe Bozkir et.al. 2505.07552 null
2025-05-12 Linux Kernel Configurations at Scale: A Dataset for Performance and Evolution Analysis Heraldo Borges et.al. 2505.07487 link
2025-05-11 Enhancing Inference for Small Cohorts via Transfer Learning and Weighted Integration of Multiple Datasets Subharup Guha et.al. 2505.07153 null
2025-05-15 A systematic review of challenges and proposed solutions in modeling multimodal data Maryam Farhadizadeh et.al. 2505.06945 null
2025-05-11 A Split-then-Join Approach to Abstractive Summarization for Very Long Documents in a Low Resource Setting Lhuqita Fazry et.al. 2505.06862 link
2025-05-10 Deep Neural Networks for Cross-Energy Particle Identification at RHIC and LHC Omar M. Khalaf et.al. 2505.06732 null
2025-05-10 Mixer-Informer-Based Two-Stage Transfer Learning for Long-Sequence Load Forecasting in Newly Constructed Electric Vehicle Charging Stations Zhenhua Zhou et.al. 2505.06657 null
2025-05-09 The 76Cu conundrum remains unsolved B. Olaizola et.al. 2505.06400 null
2025-05-09 NSF-MAP: Neurosymbolic Multimodal Fusion for Robust and Interpretable Anomaly Prediction in Assembly Pipelines Chathurangi Shyalika et.al. 2505.06333 link
2025-05-09 The Application of Deep Learning for Lymph Node Segmentation: A Systematic Review Jingguo Qu et.al. 2505.06118 null
2025-05-09 Discovery of the Polar Ring Galaxies with deep learning D. V. Dobrycheva et.al. 2505.05890 null
2025-05-09 Automated Knot Detection and Pairing for Wood Analysis in the Timber Industry Guohao Lin et.al. 2505.05845 null
2025-05-09 HyperspectralMAE: The Hyperspectral Imagery Classification Model using Fourier-Encoded Dual-Branch Masked Autoencoder Wooyoung Jeong et.al. 2505.05710 null
2025-05-08 Fast and Fourier Features for Transfer Learning of Interatomic Potentials Pietro Novelli et.al. 2505.05652 null
2025-05-08 Improved Brain Tumor Detection in MRI: Fuzzy Sigmoid Convolution in Deep Learning Muhammad Irfan et.al. 2505.05208 null
2025-05-08 Structural Alignment in Link Prediction Jeffrey Seathrún Sardina et.al. 2505.04939 link
2025-05-08 VaCDA: Variational Contrastive Alignment-based Scalable Human Activity Recognition Soham Khisa et.al. 2505.04907 null
2025-05-05 Advanced Clustering Framework for Semiconductor Image Analytics Integrating Deep TDA with Self-Supervised and Transfer Learning Techniques Janhavi Giri et.al. 2505.03848 null
2025-05-06 Sustainable Smart Farm Networks: Enhancing Resilience and Efficiency with Decision Theory-Guided Deep Reinforcement Learning Dian Chen et.al. 2505.03721 null
2025-05-07 Multi-modal cascade feature transfer for polymer property prediction Kiichi Obuchi et.al. 2505.03704 null
2025-05-06 Comparative Analysis of Lightweight Deep Learning Models for Memory-Constrained Devices Tasnim Shahriar et.al. 2505.03303 null
2025-05-06 HMAE: Self-Supervised Few-Shot Learning for Quantum Spin Systems Ibne Farabi Shihab et.al. 2505.03140 null
2025-05-05 Early Prediction of Sepsis: Feature-Aligned Transfer Learning Oyindolapo O. Komolafe et.al. 2505.02889 null
2025-05-05 Aerodynamic and structural airfoil shape optimisation via Transfer Learning-enhanced Deep Reinforcement Learning David Ramos et.al. 2505.02634 null
2025-05-04 Local Herb Identification Using Transfer Learning: A CNN-Powered Mobile Application for Nepalese Flora Prajwal Thapa et.al. 2505.02147 null
2025-05-03 Low-Complexity Acoustic Scene Classification with Device Information in the DCASE 2025 Challenge Florian Schmid et.al. 2505.01747 link
2025-05-02 Transfer Learning-Based Deep Residual Learning for Speech Recognition in Clean and Noisy Environments Noussaiba Djeffal et.al. 2505.01632 null
2025-05-02 A Physics-preserved Transfer Learning Method for Differential Equations Hao-Ran Yang et.al. 2505.01281 null
2025-05-01 A Robust Deep Networks based Multi-Object MultiCamera Tracking System for City Scale Traffic Muhammad Imran Zaman et.al. 2505.00534 null
2025-05-01 AI-Assisted Decision-Making for Clinical Assessment of Auto-Segmented Contour Quality Biling Wang et.al. 2505.00308 null
2025-05-01 Explorative Curriculum Learning for Strongly Correlated Electron Systems Kimihiro Yamazaki et.al. 2505.00233 null
2025-04-30 Convergence rate for Nearest Neighbour matching: geometry of the domain and higher-order regularity Simon Viel et.al. 2504.21633 null
2025-04-30 Multi-level datasets training method in Physics-Informed Neural Networks Yao-Hsuan Tsai et.al. 2504.21328 null
2025-04-30 Multi-modal Transfer Learning for Dynamic Facial Emotion Recognition in the Wild Ezra Engel et.al. 2504.21248 null
2025-04-29 A Brief Review for Compression and Transfer Learning Techniques in DeepFake Detection Andreas Karathanasis et.al. 2504.21066 null
2025-04-29 SVD Based Least Squares for X-Ray Pneumonia Classification Using Deep Features Mete Erdogan et.al. 2504.20970 null
2025-04-29 Transfer Learning Under High-Dimensional Network Convolutional Regression Model Liyuan Wang et.al. 2504.19979 null
2025-04-28 Comments on the minimal training set for CNN: a case study of the frustrated $J_1$-$J_2$ Ising model on the square lattice Shang-Wei Li et.al. 2504.19795 null
2025-04-26 Improving Pretrained YAMNet for Enhanced Speech Command Detection via Transfer Learning Sidahmed Lachenani et.al. 2504.19030 null
2025-04-26 Predicting Stress in Two-phase Random Materials and Super-Resolution Method for Stress Images by Embedding Physical Information Tengfei Xing et.al. 2504.18854 null
2025-04-26 FiberKAN: Kolmogorov-Arnold Networks for Nonlinear Fiber Optics Xiaotian Jiang et.al. 2504.18833 null
2025-04-23 Speaker Diarization for Low-Resource Languages Through Wav2vec Fine-Tuning Abdulhady Abas Abdullah et.al. 2504.18582 null
2025-04-25 Unifying Direct and Indirect Learning for Safe Control of Linear Systems Amir Modares et.al. 2504.18331 null
2025-04-25 Post-Transfer Learning Statistical Inference in High-Dimensional Regression Nguyen Vu Khai Tam et.al. 2504.18212 null
2025-04-25 A Model Zoo on Phase Transitions in Neural Networks Konstantin Schürholt et.al. 2504.18072 null
2025-04-24 FlexPINN: Modeling Fluid Dynamics and Mass Transfer in 3D Micromixer Geometries Using a Flexible Physics-Informed Neural Network Meraj Hassanzadeh et.al. 2504.17896 null
2025-04-22 Research on Cloud Platform Network Traffic Monitoring and Anomaly Detection System based on Large Language Models Ze Yang et.al. 2504.17807 null
2025-04-24 An Explainable Nature-Inspired Framework for Monkeypox Diagnosis: Xception Features Combined with NGBoost and African Vultures Optimization Algorithm Ahmadreza Shateri et.al. 2504.17540 null
2025-04-25 On the workflow, opportunities and challenges of developing foundation model in geophysics Hanlin Sheng et.al. 2504.17384 null
2025-04-24 The Riemannian Means Field Classifier for EEG-Based BCI Data Anton Andreev et.al. 2504.17352 null
2025-04-24 Low-Resource Neural Machine Translation Using Recurrent Neural Networks and Transfer Learning: A Case Study on English-to-Igbo Ocheme Anthony Ekle et.al. 2504.17252 null
2025-04-23 A Systematic Approach to Design Real-World Human-in-the-Loop Deep Reinforcement Learning: Salient Features, Challenges and Trade-offs Jalal Arabneydi et.al. 2504.17006 null
2025-04-23 An Adaptive ML Framework for Power Converter Monitoring via Federated Transfer Learning Panagiotis Kakosimos et.al. 2504.16866 null
2025-04-22 SparseJEPA: Sparse Representation Learning of Joint Embedding Predictive Architectures Max Hartman et.al. 2504.16140 null
2025-04-21 Active Learning Methods for Efficient Data Utilization and Model Performance Enhancement Chiung-Yi Tseng et.al. 2504.16136 null
2025-04-22 Efficient Adaptation of Deep Neural Networks for Semantic Segmentation in Space Applications Leonardo Olivi et.al. 2504.15991 null
2025-04-23 MedNNS: Supernet-based Medical Task-Adaptive Neural Network Search Lotfi Abdelkrim Mecharbat et.al. 2504.15865 null
2025-04-22 Transfer Learning for High-dimensional Reduced Rank Time Series Models Mingliang Ma Abolfazl Safikhani et.al. 2504.15691 null
2025-04-21 Fourier analysis of the physics of transfer learning for data-driven subgrid-scale models of ocean turbulence Moein Darman et.al. 2504.15487 null
2025-04-21 Transferable Learning of Reaction Pathways from Geometric Priors Juno Nam et.al. 2504.15370 link
2025-04-22 Histogram-based Parameter-efficient Tuning for Passive Sonar Classification Amirmohammad Mohammadi et.al. 2504.15214 link
2025-04-21 Is Intelligence the Right Direction in New OS Scheduling for Multiple Resources in Cloud Environments? Xinglei Dou et.al. 2504.15021 null
2025-04-21 PIV-FlowDiffuser:Transfer-learning-based denoising diffusion models for PIV Qianyu Zhu et.al. 2504.14952 link
2025-04-18 CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning Yang Yue et.al. 2504.13820 link
2025-04-18 Enhancing Pothole Detection and Characterization: Integrated Segmentation and Depth Estimation in Road Anomaly Systems Uthman Baroudi et.al. 2504.13648 null
2025-04-18 MetaDSE: A Few-shot Meta-learning Framework for Cross-workload CPU Design Space Exploration Runzhen Xue et.al. 2504.13568 null
2025-04-18 A Deep Learning-Based Supervised Transfer Learning Framework for DOA Estimation with Array Imperfections Bo Zhou et.al. 2504.13394 link
2025-04-17 Non-Uniform Class-Wise Coreset Selection: Characterizing Category Difficulty for Data-Efficient Transfer Learning Hanyu Zhang et.al. 2504.13234 null
2025-04-17 Scaling Laws for Data-Efficient Visual Transfer Learning Wenxuan Yang et.al. 2504.13219 null
2025-04-17 Transfer Learning via Auxiliary Labels with Application to Cold-Hardiness Prediction Kristen Goebel et.al. 2504.13142 null
2025-04-17 All-in-One Transferring Image Compression from Human Perception to Multi-Machine Perception Jiancheng Zhao et.al. 2504.12997 null
2025-04-17 Enhancing Cocoa Pod Disease Classification via Transfer Learning and Ensemble Methods: Toward Robust Predictive Modeling Devina Anduyan et.al. 2504.12992 null
2025-04-17 Quantum Computing Supported Adversarial Attack-Resilient Autonomous Vehicle Perception Module for Traffic Sign Classification Reek Majumder et.al. 2504.12644 link
2025-04-17 Privacy-Preserving CNN Training with Transfer Learning: Two Hidden Layers John Chiang et.al. 2504.12623 null
2025-04-15 TransST: Transfer Learning Embedded Spatial Factor Modeling of Spatial Transcriptomics Data Shuo Shuo Liu et.al. 2504.12353 link
2025-04-16 Secure Transfer Learning: Training Clean Models Against Backdoor in (Both) Pre-trained Encoders and Downstream Datasets Yechao Zhang et.al. 2504.11990 null
2025-04-15 Towards a Universal Vibration Analysis Dataset: A Framework for Transfer Learning in Predictive Maintenance and Structural Health Monitoring Mert Sehri et.al. 2504.11581 null
2025-04-15 Rank-based transfer learning for high-dimensional survival data with application to sepsis data Nan Qiao et.al. 2504.11270 null
2025-04-15 Meta-learning For Few-Shot Time Series Crop Type Classification: A Benchmark On The EuroCropsML Dataset Joana Reuss et.al. 2504.11022 null
2025-04-17 Transfer Learning for Temporal Link Prediction Ayan Chatterjee et.al. 2504.10925 link
2025-04-14 Transfer Learning Assisted XgBoost For Adaptable Cyberattack Detection In Battery Packs Sanchita Ghosh et.al. 2504.10658 null
2025-04-14 Inferring genotype-phenotype maps using attention models Krishna Rijal et.al. 2504.10388 link
2025-04-14 UP-Person: Unified Parameter-Efficient Transfer Learning for Text-based Person Retrieval Yating Liu et.al. 2504.10084 link
2025-04-14 Learning to Harmonize Cross-vendor X-ray Images by Non-linear Image Dynamics Correction Yucheng Lu et.al. 2504.10080 null
2025-04-14 Progressive Transfer Learning for Multi-Pass Fundus Image Restoration Uyen Phan et.al. 2504.10025 null
2025-04-14 Masked Autoencoder Self Pre-Training for Defect Detection in Microelectronics Nikolai Röhrich et.al. 2504.10021 null
2025-04-13 Comorbidity-Informed Transfer Learning for Neuro-developmental Disorder Diagnosis Xin Wen et.al. 2504.09463 null
2025-04-12 Beyond Glucose-Only Assessment: Advancing Nocturnal Hypoglycemia Prediction in Children with Type 1 Diabetes Marco Voegeli et.al. 2504.09299 null
2025-04-12 Query-based Knowledge Transfer for Heterogeneous Learning Environments Norah Alballa et.al. 2504.09205 null
2025-04-12 Towards On-Device Learning and Reconfigurable Hardware Implementation for Encoded Single-Photon Signal Processing Zhenya Zang et.al. 2504.09028 null
2025-04-11 Distilling and exploiting quantitative insights from Large Language Models for enhanced Bayesian optimization of chemical reactions Roshan Patel et.al. 2504.08874 null
2025-04-11 Boosting multi-demographic federated learning for chest x-ray analysis using general-purpose self-supervised representations Mahshad Lotfinia et.al. 2504.08584 null
2025-04-11 Banana Ripeness Level Classification using a Simple CNN Model Trained with Real and Synthetic Datasets Luis Chuquimarca et.al. 2504.08568 null
2025-04-10 Deep Reinforcement Learning for Day-to-day Dynamic Tolling in Tradable Credit Schemes Xiaoyi Wu et.al. 2504.08074 null
2025-04-14 Pushing the Accuracy Limit of Foundation Neural Network Models with Quantum Monte Carlo Forces and Path Integrals Anouar Benali et.al. 2504.07948 null
2025-04-10 Focal Cortical Dysplasia Type II Detection Using Cross Modality Transfer Learning and Grad-CAM in 3D-CNNs for MRI Analysis Lorenzo Lasagni et.al. 2504.07775 null
2025-04-10 Benchmarking Image Embeddings for E-Commerce: Evaluating Off-the Shelf Foundation Models, Fine-Tuning Strategies and Practical Trade-offs Urszula Czerwinska et.al. 2504.07567 null
2025-04-10 Conditional Data Synthesis Augmentation Xinyu Tian et.al. 2504.07426 null
2025-04-09 Identifying regions of interest in whole slide images of renal cell carcinoma Mohammed Lamine Benomar et.al. 2504.07313 null
2025-04-09 Data Fusion of Deep Learned Molecular Embeddings for Property Prediction Robert J Appleton et.al. 2504.07297 null
2025-04-09 EDIT: Enhancing Vision Transformers by Mitigating Attention Sink through an Encoder-Decoder Architecture Wenfeng Feng et.al. 2504.06738 null
2025-04-09 TabKAN: Advancing Tabular Data Analysis using Kolmograv-Arnold Network Ali Eslamian et.al. 2504.06559 null
2025-04-08 High-Resource Translation:Turning Abundance into Accessibility Abhiram Reddy Yanampally et.al. 2504.05914 null
2025-04-07 Cross-functional transferability in universal machine learning interatomic potentials Xu Huang et.al. 2504.05565 null
2025-04-07 Cellular Network Design for UAV Corridors via Data-driven High-dimensional Bayesian Optimization Mohamed Benzaghta et.al. 2504.05176 null
2025-04-07 Sparse Optimization for Transfer Learning: A L0-Regularized Framework for Multi-Source Domain Adaptation Chenqi Gong et.al. 2504.04812 null
2025-04-05 ADA-Net: Attention-Guided Domain Adaptation Network with Contrastive Learning for Standing Dead Tree Segmentation Using Aerial Imagery Mete Ahishali et.al. 2504.04271 link
2025-04-05 Quantum parallel information exchange (QPIE) hybrid network with transfer learning Ziqing Guo et.al. 2504.04235 null
2025-04-05 PIORF: Physics-Informed Ollivier-Ricci Flow for Long-Range Interactions in Mesh Graph Neural Networks Youn-Yeol Yu et.al. 2504.04052 null
2025-04-04 Optimizing Specific and Shared Parameters for Efficient Parameter Tuning Van-Anh Nguyen et.al. 2504.03450 null
2025-04-04 Early detection of diabetes through transfer learning-based eye (vision) screening and improvement of machine learning model performance and advanced parameter setting algorithms Mohammad Reza Yousefi et.al. 2504.03439 null
2025-04-04 Block Toeplitz Sparse Precision Matrix Estimation for Large-Scale Interval-Valued Time Series Forecasting Wan Tian et.al. 2504.03322 null
2025-04-04 A model-free feature extraction procedure for interval-valued time series prediction Wan Tian et.al. 2504.03310 null
2025-04-04 Mitigating the Impact of Electrode Shift on Classification Performance in Electromyography-Based Motion Prediction Using Sliding-Window Normalization Taichi Tanaka et.al. 2504.03196 null
2025-04-03 Data-Driven Design of 3GPP Handover Parameters with Bayesian Optimization and Transfer Learning Mohamed Benzaghta et.al. 2504.02633 null
2025-04-02 Instruction-Guided Autoregressive Neural Network Parameter Generation Soro Bedionita et.al. 2504.02012 null
2025-04-02 Q-Adapt: Adapting LMM for Visual Quality Assessment with Progressive Instruction Tuning Yiting Lu et.al. 2504.01655 link
2025-04-01 Privacy-Preserving Transfer Learning for Community Detection using Locally Distributed Multiple Networks Xiao Guo et.al. 2504.00890 null
2025-04-01 Data-driven Optimization and Transfer Learning for Cellular Network Antenna Configurations Mohamed Benzaghta et.al. 2504.00825 null
2025-04-01 Transfer Learning in Financial Time Series with Gramian Angular Field Hou-Wan Long et.al. 2504.00378 null
2025-04-01 Spatiotemporal Attention Learning Framework for Event-Driven Object Recognition Tiantian Xie et.al. 2504.00370 null
2025-04-01 CopyQNN: Quantum Neural Network Extraction Attack under Varying Quantum Noise Zhenxiao Fu et.al. 2504.00366 null
2025-03-31 Detecting Glioma, Meningioma, and Pituitary Tumors, and Normal Brain Tissues based on Yolov11 and Yolov8 Deep Learning Models Ahmed M. Taha et.al. 2504.00189 null
2025-03-31 From Colors to Classes: Emergence of Concepts in Vision Transformers Teresa Dorszewski et.al. 2503.24071 link
2025-03-29 A QUBO Framework for Team Formation Karan Vombatkere et.al. 2503.23209 null
2025-03-29 Multi-label classification for multi-temporal, multi-spatial coral reef condition monitoring using vision foundation model with adapter learning Xinlei Shao et.al. 2503.23012 link
2025-04-01 Nonhuman Primate Brain Tissue Segmentation Using a Transfer Learning Approach Zhen Lin et.al. 2503.22829 null
2025-03-28 Accelerated VQE: Parameter Recycling for Similar Recurring Problem Instances Tobias Rohe et.al. 2503.22590 null
2025-03-28 Beyond Vanilla Fine-Tuning: Leveraging Multistage, Multilingual, and Domain-Specific Methods for Low-Resource Machine Translation Sarubi Thillainathan et.al. 2503.22582 null
2025-03-28 Masked Self-Supervised Pre-Training for Text Recognition Transformers on Large-Scale Datasets Martin Kišš et.al. 2503.22513 null
2025-03-28 On-site estimation of battery electrochemical parameters via transfer learning based physics-informed neural network approach Josu Yeregui et.al. 2503.22396 null
2025-03-28 A Survey on Remote Sensing Foundation Models: From Vision to Multimodality Ziyue Huang et.al. 2503.22081 link
2025-04-04 Low-Resource Transliteration for Roman-Urdu and Urdu Using Transformer-Based Models Umer Butt et.al. 2503.21530 null
2025-03-27 Exploring the flavor structure of leptons via diffusion models Satsuki Nishimura et.al. 2503.21432 null
2025-03-27 AugWard: Augmentation-Aware Representation Learning for Accurate Graph Classification Minjun Kim et.al. 2503.21105 link
2025-03-27 Integrate Meta-analysis into Specific Study (InMASS) for Estimating Conditional Average Treatment Effect Keisuke Hanada et.al. 2503.21091 link
2025-03-26 World Model Agents with Change-Based Intrinsic Motivation Jeremias Ferrao et.al. 2503.21047 link
2025-03-26 A Deep Learning Pipeline for Large Earthquake Analysis using High-Rate Global Navigation Satellite System Data Claudia Quinteros-Cartaya et.al. 2503.20584 null
2025-03-26 Low-resource Information Extraction with the European Clinical Case Corpus Soumitra Ghosh et.al. 2503.20568 null
2025-03-26 Small Object Detection: A Comprehensive Survey on Challenges, Techniques and Real-World Applications Mahya Nikouei et.al. 2503.20516 null
2025-03-26 Multi-dataset and Transfer Learning Using Gene Expression Knowledge Graphs Rita T. Sousa et.al. 2503.20400 link
2025-03-25 The Coralscapes Dataset: Semantic Scene Understanding in Coral Reefs Jonathan Sauder et.al. 2503.20000 link
2025-03-25 Untangling the Influence of Typology, Data and Model Architecture on Ranking Transfer Languages for Cross-Lingual POS Tagging Enora Rice et.al. 2503.19979 null
2025-03-25 Optimizing Breast Cancer Detection in Mammograms: A Comprehensive Study of Transfer Learning, Resolution Reduction, and Multi-View Classification Daniel G. P. Petrini et.al. 2503.19945 null
2025-03-25 Exploring Cultural Nuances in Emotion Perception Across 15 African Languages Ibrahim Said Ahmad et.al. 2503.19642 null
2025-03-24 Continual Reinforcement Learning for HVAC Systems Control: Integrating Hypernetworks and Transfer Learning Gautham Udayakumar Bekal et.al. 2503.19212 null
2025-03-24 Pitch Contour Exploration Across Audio Domains: A Vision-Based Transfer Learning Approach Jakob Abeßer et.al. 2503.19161 null
2025-03-24 Out-of-distribution evaluations of channel agnostic masked autoencoders in fluorescence microscopy Christian John Hurry et.al. 2503.19149 null
2025-03-24 Anomaly Detection Using Computer Vision: A Comparative Analysis of Class Distinction and Performance Metrics Md. Barkat Ullah Tusher et.al. 2503.19100 null
2025-03-24 Convolutional neural network approach to ion Coulomb crystal image analysis James Allsopp et.al. 2503.18846 null
2025-03-24 Natural Language Processing for Electronic Health Records in Scandinavian Languages: Norwegian, Swedish, and Danish Ashenafi Zebene Woldaregay et.al. 2503.18539 null
2025-03-24 k-NN as a Simple and Effective Estimator of Transferability Moein Sorkhei et.al. 2503.18528 null
2025-03-24 Similarity-Informed Transfer Learning for Multivariate Functional Censored Quantile Regression Hua Liu et.al. 2503.18437 null
2025-03-24 PNN: A Novel Progressive Neural Network for Fault Classification in Rotating Machinery under Small Dataset Constraint Praveen Chopra et.al. 2503.18263 null
2025-03-25 PAD: Towards Efficient Data Generation for Transfer Learning Using Phrase Alignment Jong Myoung Kim et.al. 2503.18250 null
2025-03-23 Adaptive Multi-Fidelity Reinforcement Learning for Variance Reduction in Engineering Design Optimization Akash Agrawal et.al. 2503.18229 null
2025-03-23 Adaptive Physics-informed Neural Networks: A Survey Edgar Torres et.al. 2503.18181 null
2025-03-23 Training A Neural Network For Partially Occluded Road Sign Identification In The Context Of Autonomous Vehicles Gulnaz Gimaletdinova et.al. 2503.18177 null
2025-03-23 Cost-effective multi-fidelity strategy for the optimization of high-Reynolds number turbine flows guided by LES Camille Matar et.al. 2503.17977 null
2025-03-23 Physics-Guided Multi-Fidelity DeepONet for Data-Efficient Flow Field Prediction Sunwoong Yang et.al. 2503.17941 null
2025-03-23 Cross-Domain Underwater Image Enhancement Guided by No-Reference Image Quality Assessment: A Transfer Learning Approach Zhi Zhang et.al. 2503.17937 null
2025-03-22 Causal Inference based Transfer Learning with LLMs: An Efficient Framework for Industrial RUL Prediction Yan Chen et.al. 2503.17686 null
2025-03-21 Shear-based Grasp Control for Multi-fingered Underactuated Tactile Robotic Hands Christopher J. Ford et.al. 2503.17501 null
2025-03-21 Stream Automatic Detection with Convolutional Neural Network (SAD-CNN) Alex Vera-Casanova. et.al. 2503.17202 null
2025-03-21 Jailbreaking the Non-Transferable Barrier via Test-Time Data Disguising Yongli Xiang et.al. 2503.17198 null
2025-03-21 Transfer Learning for EDFA Gain Modeling: A Semi-Supervised Approach Using Internal Amplifier Features Agastya Raj et.al. 2503.17094 null
2025-03-21 PRIOT: Pruning-Based Integer-Only Transfer Learning for Embedded Systems Honoka Anada et.al. 2503.16860 null
2025-03-21 Multi-property directed generative design of inorganic materials through Wyckoff-augmented transfer learning Shuya Yamazaki et.al. 2503.16784 null
2025-03-20 UniCrossAdapter: Multimodal Adaptation of CLIP for Radiology Report Generation Yaxiong Chen et.al. 2503.15940 link
2025-03-21 Sample-Efficient Bayesian Transfer Learning for Online Machine Parameter Optimization Philipp Wagner et.al. 2503.15928 null
2025-03-20 Repurposing 2D Diffusion Models with Gaussian Atlas for 3D Generation Tiange Xiang et.al. 2503.15877 null
2025-03-19 Sequential learning based PINNs to overcome temporal domain complexities in unsteady flow past flapping wings Rahul Sundar et.al. 2503.15679 null
2025-03-20 Conjuring Positive Pairs for Efficient Unification of Representation Learning and Image Synthesis Imanol G. Estepa et.al. 2503.15060 null
2025-03-19 Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene Shengqiong Wu et.al. 2503.15019 null
2025-03-19 A Novel Channel Boosted Residual CNN-Transformer with Regional-Boundary Learning for Breast Cancer Detection Aamir Mehmood et.al. 2503.15008 null
2025-03-18 Cross-Environment Transfer Learning for Location-Aided Beam Prediction in 5G and Beyond Millimeter-Wave Networks Enrico Tosi et.al. 2503.14287 null
2025-03-18 Multi-task Learning for Identification of Porcelain in Song and Yuan Dynasties Ziyao Ling et.al. 2503.14231 null
2025-03-17 MSWAL: 3D Multi-class Segmentation of Whole Abdominal Lesions Dataset Zhaodong Wu et.al. 2503.13560 link
2025-03-17 Edit Transfer: Learning Image Editing via Vision In-Context Relations Lan Chen et.al. 2503.13327 null
2025-03-17 Robot Policy Transfer with Online Demonstrations: An Active Reinforcement Learning Approach Muhan Hou et.al. 2503.12993 null
2025-03-17 An Optimization Framework for Differentially Private Sparse Fine-Tuning Mehdi Makni et.al. 2503.12822 null
2025-03-16 TuneNSearch: a hybrid transfer learning and local search approach for solving vehicle routing problems Arthur Corrêa et.al. 2503.12662 null
2025-03-16 Realized Volatility Forecasting for New Issues and Spin-Offs using Multi-Source Transfer Learning Andreas Teller et.al. 2503.12648 null
2025-03-16 COVID 19 Diagnosis Analysis using Transfer Learning Anjali Dharmik et.al. 2503.12642 null
2025-03-16 Learning Privacy from Visual Entities Alessio Xompero et.al. 2503.12464 null
2025-03-16 A Transformer-based survival model for prediction of all-cause mortality in heart failure patients: a multi-cohort study Shishir Rao et.al. 2503.12317 null
2025-03-15 Automatic Characterization of Fluxonium Superconducting Qubits Parameters with Deep Transfer Learning Huan-Hsuan Kung et.al. 2503.12099 null
2025-03-15 Effective and Efficient Cross-City Traffic Knowledge Transfer A Privacy-Preserving Perspective Zhihao Zeng et.al. 2503.11963 null
2025-03-14 Transfer Learning for Automated Feedback Generation on Small Datasets Oscar Morris et.al. 2503.11836 null
2025-03-14 Deepfake Detection of Face Images based on a Convolutional Neural Network Lukas Kroiß et.al. 2503.11389 null
2025-03-14 TransiT: Transient Transformer for Non-line-of-sight Videography Ruiqian Li et.al. 2503.11328 null
2025-03-13 Automated Tomato Maturity Estimation Using an Optimized Residual Model with Pruning and Quantization Techniques Muhammad Waseem et.al. 2503.10940 null
2025-03-13 SOLA-GCL: Subgraph-Oriented Learnable Augmentation Method for Graph Contrastive Learning Tianhao Peng et.al. 2503.10100 null
2025-03-11 Are ECGs enough? Deep learning classification of cardiac anomalies using only electrocardiograms Joao D. S. Marques et.al. 2503.08960 link
2025-03-11 Beam Selection in ISAC using Contextual Bandit with Multi-modal Transformer and Transfer Learning Mohammad Farzanullah et.al. 2503.08937 null
2025-03-11 Towards species’ classification of the \textit{Anastrepha pseudoparallela} group Gabriel R. Palma et.al. 2503.08598 null
2025-03-11 MMRL: Multi-Modal Representation Learning for Vision-Language Models Yuncheng Guo et.al. 2503.08497 link
2025-03-17 Structure-Activation Synergy: A Dual Efficiency Framework for Parameter-Memory Optimized Transfer Learning Tian Jin et.al. 2503.08154 null
2025-03-11 Pre-trained Models Succeed in Medical Imaging with Representation Similarity Degradation Wenqiang Zu et.al. 2503.07958 null
2025-03-11 A Study to Evaluate the Impact of LoRA Fine-tuning on the Performance of Non-functional Requirements Classification Xia Li et.al. 2503.07927 null
2025-03-10 Elderly Activity Recognition in the Wild: Results from the EAR Challenge Anh-Kiet Duong et.al. 2503.07821 null
2025-03-10 Real-Time Load Estimation for Load-lifting Exoskeletons Using Insole Pressure Sensors and Machine Learning Kaida Wu et.al. 2503.07527 null
2025-03-10 Linguistic Knowledge Transfer Learning for Speech Enhancement Kuo-Hsuan Hung et.al. 2503.07078 null
2025-03-10 Are We Truly Forgetting? A Critical Re-examination of Machine Unlearning Evaluation Protocols Yongwoo Kim et.al. 2503.06991 null
2025-03-09 Transfer Learning for LQR Control Taosha Guo et.al. 2503.06755 null
2025-03-09 MetaXCR: Reinforcement-Based Meta-Transfer Learning for Cross-Lingual Commonsense Reasoning Jie He et.al. 2503.06531 null
2025-03-09 R+R: Security Vulnerability Dataset Quality Is Critical Anurag Swarnim Yadav et.al. 2503.06387 link
2025-03-08 Adversarial Robustness of Discriminative Self-Supervised Learning in Vision Ömer Veysel Çağatan et.al. 2503.06361 null
2025-03-08 NeuroADDA: Active Discriminative Domain Adaptation in Connectomic Shashata Sawmya et.al. 2503.06196 null
2025-03-07 CACTUS: An Open Dataset and Framework for Automated Cardiac Assessment and Classification of Ultrasound Images Using Deep Transfer Learning Hanae Elmekki et.al. 2503.05604 null
2025-03-10 opXRD: Open Experimental Powder X-ray Diffraction Database Daniel Hollarek et.al. 2503.05577 null
2025-03-13 Statistical Deficiency for Task Inclusion Estimation Loïc Fosse et.al. 2503.05491 null
2025-03-07 Quantum-PEFT: Ultra parameter-efficient fine-tuning Toshiaki Koike-Akino et.al. 2503.05431 null
2025-03-07 Spatial Distillation based Distribution Alignment (SDDA) for Cross-Headset EEG Classification Dingkun Liu et.al. 2503.05349 link
2025-03-06 TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation Lin Sun et.al. 2503.04872 null
2025-03-06 DM-Adapter: Domain-Aware Mixture-of-Adapters for Text-Based Person Retrieval Yating Liu et.al. 2503.04144 null
2025-03-05 On the Acquisition of Shared Grammatical Representations in Bilingual Language Models Catherine Arnett et.al. 2503.03962 null
2025-03-05 Hierarchical quantum embedding by machine learning for large molecular assemblies Moritz Bensberg et.al. 2503.03928 null
2025-03-05 Sarcasm Detection as a Catalyst: Improving Stance Detection with Cross-Target Capabilities Gibson Nkhata Shi Yin Hong et.al. 2503.03787 null
2025-03-04 A Phylogenetic Approach to Genomic Language Modeling Carlos Albors et.al. 2503.03773 link
2025-03-10 MA-LoT: Multi-Agent Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving Ruida Wang et.al. 2503.03205 link
2025-03-05 Intermediate-Task Transfer Learning: Leveraging Sarcasm Detection for Stance Detection Gibson Nkhata et.al. 2503.03172 null
2025-03-04 Teaching AI to Handle Exceptions: Supervised Fine-Tuning with Human-Aligned Judgment Matthew DosSantos DiSorbo et.al. 2503.02976 null
2025-03-03 Hyperspectral Image Restoration and Super-resolution with Physics-Aware Deep Learning for Biomedical Applications Yuchen Xiang et.al. 2503.02908 null
2025-03-03 Diagnosis of Patients with Viral, Bacterial, and Non-Pneumonia Based on Chest X-Ray Images Using Convolutional Neural Networks Carlos Arizmendi et.al. 2503.02906 null
2025-03-04 Remote Sensing Image Classification Using Convolutional Neural Network (CNN) and Transfer Learning Techniques Mustafa Majeed Abd Zaid et.al. 2503.02510 null
2025-03-04 X2CT-CLIP: Enable Multi-Abnormality Detection in Computed Tomography from Chest Radiography via Tri-Modal Contrastive Learning Jianzhong You et.al. 2503.02162 null
2025-03-03 A General Neural Network Potential for Energetic Materials with C, H, N, and O elements Mingjie Wen et.al. 2503.01932 link
2025-03-03 Do GFlowNets Transfer? Case Study on the Game of 24/42 Adesh Gupta et.al. 2503.01819 null
2025-03-03 An Efficient Approach to Detecting Lung Nodules Using Swin Transformer Saeed Shakuri et.al. 2503.01592 null
2025-03-03 A Zero-Shot Learning Approach for Ephemeral Gully Detection from Remote Sensing using Vision Language Models Seyed Mohamad Ali Tousi et.al. 2503.01169 null
2025-03-01 Rapid morphology characterization of two-dimensional TMDs and lateral heterostructures based on deep learning Junqi He et.al. 2503.00470 link
2025-03-01 Towards Understanding the Benefit of Multitask Representation Learning in Decision Process Rui Lu et.al. 2503.00345 null
2025-02-28 Optimal Transfer Learning for Missing Not-at-Random Matrix Completion Akhil Jalan et.al. 2503.00174 null
2025-02-28 Fine-tuning machine-learned particle-flow reconstruction for new detector geometries in future colliders Farouk Mokhtar et.al. 2503.00131 null
2025-02-28 RuCCoD: Towards Automated ICD Coding in Russian Aleksandr Nesterov et.al. 2502.21263 link
2025-02-28 Incorporating Long-Range Interactions via the Multipole Expansion into Ground and Excited-State Molecular Simulations Rhyan Barrett et.al. 2502.21045 null
2025-02-27 On the Role of Individual Differences in Current Approaches to Computational Image Aesthetics Li-Wei Chen et.al. 2502.20518 null
2025-02-27 Deep Convolutional Neural Networks for Palm Fruit Maturity Classification Mingqiang Han et.al. 2502.20223 link
2025-02-27 An Amplitude-Encoding-Based Classical-Quantum Transfer Learning framework: Outperforming Classical Methods in Image Recognition Shouwei Hu et.al. 2502.20184 null
2025-02-27 Transfer Learning in Latent Contextual Bandits with Covariate Shift Through Causal Transportability Mingwei Deng et.al. 2502.20153 link
2025-02-27 Energy-carbon comprehensive efficiency evaluation of hydrogen metallurgy system considering low-temperature waste heat recovery Qiang Ji et.al. 2502.20131 null
2025-02-27 Efficient Machine Learning Approach for Yield Prediction in Chemical Reactions Supratim Ghosh et.al. 2502.19976 null
2025-02-27 A Principled Approach to Bayesian Transfer Learning Adam Bretherton et.al. 2502.19796 null
2025-02-26 Deep Learning-Based Transfer Learning for Classification of Cassava Disease Ademir G. Costa Junior et.al. 2502.19351 null
2025-02-26 Can RLHF be More Efficient with Imperfect Reward Models? A Policy Coverage Perspective Jiawei Huang et.al. 2502.19255 link
2025-03-01 GraphBridge: Towards Arbitrary Transfer Learning in GNNs Li Ju et.al. 2502.19252 link
2025-02-26 A Sample-Level Evaluation and Generative Framework for Model Inversion Attacks Haoyang Li et.al. 2502.19070 link
2025-02-26 KAN-powered large-target detection for automotive radar Vinay Kulkarni et.al. 2502.19000 null
2025-02-25 Transfer Learning Assisted Fast Design Migration Over Technology Nodes: A Study on Transformer Matching Network Chenhao Chu et.al. 2502.18636 link
2025-02-25 Transfer Learning for Transient Classification: From Simulations to Real Data and ZTF to LSST Rithwik Gupta et.al. 2502.18558 null
2025-02-23 Rewards-based image analysis in microscopy Kamyar Barakati et.al. 2502.18522 null
2025-02-25 Conformal Prediction Under Generalized Covariate Shift with Posterior Drift Baozhen Wang et.al. 2502.17744 null
2025-02-23 Multimodal Bearing Fault Classification Under Variable Conditions: A 1D CNN with Transfer Learning Tasfiq E. Alam et.al. 2502.17524 null
2025-02-24 Leveraging recurrence in neural network wavefunctions for large-scale simulations of Heisenberg antiferromagnets: the square lattice M. Schuyler Moss et.al. 2502.17144 link
2025-02-24 Provable Benefits of Unsupervised Pre-training and Transfer Learning via Single-Index Models Taj Jones-McCormick et.al. 2502.16849 null
2025-02-23 Automated Keypoint Estimation for Self-Piercing Rivet Joints Using micro-CT Imaging and Transfer Learning Wei Qin Chuah et.al. 2502.16752 null
2025-02-27 Diagnosing COVID-19 Severity from Chest X-Ray Images Using ViT and CNN Architectures Luis Lara et.al. 2502.16622 link
2025-02-23 SDA-DDA Semi-supervised Domain Adaptation with Dynamic Distribution Alignment Network For Emotion Recognition Using EEG Signals Jiahao Tang et.al. 2502.16485 link
2025-02-22 Iterative Auto-Annotation for Scientific Named Entity Recognition Using BERT-Based Models Kartik Gupta et.al. 2502.16312 null
2025-02-21 Graph Attention Convolutional U-NET: A Semantic Segmentation Model for Identifying Flooded Areas Muhammad Umair Danish et.al. 2502.15907 null
2025-02-21 Improving variable selection properties by using external data Paul Rognon-Vael et.al. 2502.15584 null
2025-02-21 Fine-tuning foundation models of materials interatomic potentials with frozen transfer learning Mariia Radova et.al. 2502.15582 null
2025-02-20 P2W: From Power Traces to Weights Matrix – An Unconventional Transfer Learning Approach Roozbeh Siyadatzadeh et.al. 2502.14968 null
2025-02-20 Multi-dataset synergistic in supervised learning to pre-label structural components in point clouds from shell construction scenes Lukas Rauch et.al. 2502.14721 null
2025-02-20 Distribution Matching for Self-Supervised Transfer Learning Yuling Jiao et.al. 2502.14424 link
2025-02-20 A Macro- and Micro-Hierarchical Transfer Learning Framework for Cross-Domain Fake News Detection Xuankai Yang et.al. 2502.14403 null
2025-02-20 Asymmetric Co-Training for Source-Free Few-Shot Domain Adaptation Gengxu Li et.al. 2502.14214 link
2025-02-19 Appeal prediction for AI up-scaled Images Steve Göring et.al. 2502.14013 link
2025-02-19 Toward Robust Non-Transferable Learning: A Survey and Benchmark Ziming Hong et.al. 2502.13593 link
2025-02-19 Enhancing Machine Learning Potentials through Transfer Learning across Chemical Elements Sebastien Röcken et.al. 2502.13522 null
2025-02-18 Performance Evaluation of Sentiment Analysis on Text and Emoji Data Using End-to-End, Transfer Learning, Distributed and Explainable AI Models Sirisha Velampalli et.al. 2502.13278 null
2025-02-18 Pre-training Auto-regressive Robotic Models with 4D Representations Dantong Niu et.al. 2502.13142 null
2025-02-18 Detection and Geographic Localization of Natural Objects in the Wild: A Case Study on Palms Kangning Cui et.al. 2502.13023 null
2025-02-18 Universal Embedding Function for Traffic Classification via QUIC Domain Recognition Pretraining: A Transfer Learning Success Jan Luxemburk et.al. 2502.12930 link
2025-02-18 Unsupervised optimal deep transfer learning for classification under general conditional shift Junjun Lang et.al. 2502.12729 null
2025-02-18 NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation Zhiyuan Liu et.al. 2502.12638 link
2025-02-17 PreAdaptFWI: Pretrained-Based Adaptive Residual Learning for Full-Waveform Inversion Without Dataset Dependency Xintong Dong et.al. 2502.11913 null
2025-02-17 M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis Chengyan Wu et.al. 2502.11824 link
2025-02-17 Transfer Learning of CATE with Kernel Ridge Regression Seok-Jin Kim et.al. 2502.11331 link
2025-02-16 Detecting Cadastral Boundary from Satellite Images Using U-Net model Neda Rahimpour Anaraki et.al. 2502.11044 null
2025-02-15 Controlling Neural Collapse Enhances Out-of-Distribution Detection and Transfer Learning Md Yousuf Harun et.al. 2502.10691 null
2025-02-14 SPIRIT: Short-term Prediction of solar IRradIance for zero-shot Transfer learning using Foundation Models Aditya Mishra et.al. 2502.10307 null
2025-02-19 ExoMiner++ on TESS with Transfer Learning from Kepler: Transit Classification and Vetting Catalog for 2-min Data Hamed Valizadegan et.al. 2502.09790 null
2025-02-13 NeuralCFD: Deep Learning on High-Fidelity Automotive Aerodynamics Simulations Maurits Bleeker et.al. 2502.09692 null
2025-02-13 A Survey of Reinforcement Learning for Optimization in Automation Ahmad Farooq et.al. 2502.09417 null
2025-02-13 Revisiting Euclidean Alignment for Transfer Learning in EEG-Based Brain-Computer Interfaces Dongrui Wu et.al. 2502.09203 null
2025-02-13 A Hybrid Model for Few-Shot Text Classification Using Transfer and Meta-Learning Jia Gao et.al. 2502.09086 null
2025-02-12 $\mathsf{CSMAE~}$ :~Cataract Surgical Masked Autoencoder (MAE) based Pre-training Nisarg A. Shah et.al. 2502.08822 null
2025-02-12 Advancing machine fault diagnosis: A detailed examination of convolutional neural networks Govind Vashishtha et.al. 2502.08689 null
2025-02-14 Multifidelity Simulation-based Inference for Computationally Expensive Simulators Anastasia N. Krouglova et.al. 2502.08416 null
2025-02-12 Hi-End-MAE: Hierarchical encoder-driven masked autoencoders are stronger vision learners for medical image segmentation Fenghe Tang et.al. 2502.08347 link
2025-02-12 Knowledge-Guided Wasserstein Distributionally Robust Optimization Zitao Wang et.al. 2502.08146 null
2025-02-11 Instance-dependent Early Stopping Suqin Yuan et.al. 2502.07547 link
2025-02-12 Music for All: Exploring Multicultural Representations in Music Generation Models Atharva Mehta et.al. 2502.07328 link
2025-02-11 Long-term simulation of physical and mechanical behaviors using curriculum-transfer-learning based physics-informed neural networks Yuan Guo et.al. 2502.07325 null
2025-02-11 Robust Indoor Localization in Dynamic Environments: A Multi-source Unsupervised Domain Adaptation Framework Jiyu Jiao et.al. 2502.07246 null
2025-02-11 Tab2Visual: Overcoming Limited Data in Tabular Data Classification Using Deep Learning with Visual Representations Ahmed Mamdouh et.al. 2502.07181 null
2025-02-10 Cross-platform Learning-based Fault Tolerant Surfacing Controller for Underwater Robots Yuya Hamamatsu et.al. 2502.07133 null
2025-02-10 Generative Distribution Prediction: A Unified Approach to Multimodal Learning Xinyu Tian et.al. 2502.07090 null
2025-02-10 Model Diffusion for Certifiable Few-shot Transfer Learning Fady Rezk et.al. 2502.06970 null
2025-02-08 Topological derivative approach for deep neural network architecture adaptation C G Krishnanunni et.al. 2502.06885 null
2025-02-10 Institutional Preferences in the Laboratory Qiankun Zhong et.al. 2502.06748 null
2025-02-10 Hyperparameters in Score-Based Membership Inference Attacks Gauri Pradhan et.al. 2502.06374 link
2025-02-10 A Data-Efficient Pan-Tumor Foundation Model for Oncology CT Interpretation Wenhui Lei et.al. 2502.06171 null
2025-02-10 Low Tensor-Rank Adaptation of Kolmogorov–Arnold Networks Yihang Gao et.al. 2502.06153 null
2025-02-09 Estimation with missing not at random binary outcomes via exponential tilts Subha Maity et.al. 2502.06046 link
2025-02-09 Protecting Intellectual Property of EEG-based Neural Networks with Watermarking Ahmed Abdelaziz et.al. 2502.05931 link
2025-02-09 Target Speaker Lipreading by Audio-Visual Self-Distillation Pretraining and Speaker Adaptation Jing-Xuan Zhang et.al. 2502.05758 null
2025-02-08 Coalition Formation for Heterogeneous Federated Learning Enabled Channel Estimation in RIS-assisted Cell-free MIMO Nan Qi et.al. 2502.05538 null
2025-02-07 Evaluating Standard and Dialectal Frisian ASR: Multilingual Fine-tuning and Language Identification for Improved Low-resource Performance Reihaneh Amooie et.al. 2502.04883 null
2025-02-07 Self-Supervised Learning for Pre-training Capsule Networks: Overcoming Medical Imaging Dataset Challenges Heba El-Shimy et.al. 2502.04748 null
2025-02-07 Performance Evaluation of Image Enhancement Techniques on Transfer Learning for Touchless Fingerprint Recognition S Sreehari et.al. 2502.04680 null
2025-02-06 Provable Sample-Efficient Transfer Learning Conditional Diffusion Models via Representation Learning Ziheng Cheng et.al. 2502.04491 null
2025-02-06 Multi-fidelity emulator for large-scale 21 cm lightcone images: a few-shot transfer learning approach with generative adversarial network Kangning Diao et.al. 2502.04246 null
2025-02-06 A Theoretical Framework for Data Efficient Multi-Source Transfer Learning Based on Cramér-Rao Bound Qingyue Zhang et.al. 2502.04242 null
2025-02-06 Transfer Learning for Covert Speech Classification Using EEG Hilbert Envelope and Temporal Fine Structure Saravanakumar Duraisamy et.al. 2502.04132 null
2025-02-06 Exploring Group Convolutional Networks for Sign Problem Mitigation via Contour Deformation Christoph Gäntgen et.al. 2502.04104 null
2025-02-06 Generalize Drug Response Prediction by Latent Independent Projection for Asymmetric Constrained Domain Generalization Ran Song et.al. 2502.04034 null
2025-02-06 ICGNN: Graph Neural Network Enabled Scalable Beamforming for MISO Interference Channels Changpeng He et.al. 2502.03936 null
2025-02-06 SWIPTNet: A Unified Deep Learning Framework for SWIPT based on GNN and Transfer Learning Hong Han et.al. 2502.03928 null
2025-02-06 Self-Supervised Learning for Solar Radio Spectrum Classification Siqi Li et.al. 2502.03778 null
2025-02-05 Prediction of the Most Fire-Sensitive Point in Building Structures with Differentiable Agents for Thermal Simulators Yuan Xinjie et.al. 2502.03424 null
2025-02-05 DES to HSC: Detecting low surface brightness galaxies in the Abell 194 cluster using transfer learning H. Thuruthipilly et.al. 2502.03142 null
2025-02-05 TopoCL: Topological Contrastive Learning for Time Series Namwoo Kim et.al. 2502.02924 null
2025-02-04 Cross-Lingual Transfer for Low-Resource Natural Language Processing Iker García-Ferrero et.al. 2502.02722 null
2025-02-05 Beyond English: Evaluating Automated Measurement of Moral Foundations in Non-English Discourse with a Chinese Case Study Calvin Yixiang Cheng et.al. 2502.02451 link
2025-02-04 Self-Supervised Convolutional Audio Models are Flexible Acoustic Feature Learners: A Domain Specificity and Transfer-Learning Study Mattson Ogg et.al. 2502.02366 link
2025-02-04 Transfer Risk Map: Mitigating Pixel-level Negative Transfer in Medical Segmentation Shutong Duan et.al. 2502.02340 null
2025-02-03 Geometric Framework for 3D Cell Segmentation Correction Peter Chen et.al. 2502.01890 null
2025-02-03 Learning Hyperparameters via a Data-Emphasized Variational Objective Ethan Harvey et.al. 2502.01861 link
2025-02-03 Grokking Explained: A Statistical Phenomenon Breno W. Carvalho et.al. 2502.01774 null
2025-02-03 Towards Robust and Generalizable Lensless Imaging with Modular Learned Reconstruction Eric Bezzam et.al. 2502.01102 null
2025-02-02 Fruit Fly Classification (Diptera: Tephritidae) in Images, Applying Transfer Learning Erick Andrew Bustamante Flores et.al. 2502.00939 null
2025-02-02 UniGraph2: Learning a Unified Embedding Space to Bind Multimodal Graphs Yufei He et.al. 2502.00806 link
2025-02-02 Transfer Learning in Physics-Informed Neural Networks: Full Fine-Tuning, Lightweight Fine-Tuning, and Low-Rank Adaptation Yizheng Wang et.al. 2502.00782 null
2025-02-02 Role of Mixup in Topological Persistence Based Knowledge Distillation for Wearable Sensor Data Eun Som Jeon et.al. 2502.00779 null
2025-02-01 SSRepL-ADHD: Adaptive Complex Representation Learning Framework for ADHD Detection from Visual Attention Tasks Abdul Rehman et.al. 2502.00376 null
2025-02-01 Machine Learning Models for Reinforced Concrete Pipes Condition Prediction: The State-of-the-Art Using Artificial Neural Networks and Multiple Linear Regression in a Wisconsin Case Study Mohsen Mohammadagha et.al. 2502.00363 null
2025-02-01 MonoDINO-DETR: Depth-Enhanced Monocular 3D Object Detection Using a Vision Foundation Model Jihyeok Kim et.al. 2502.00315 null
2025-01-31 Improving Quality Control Of MRI Images Using Synthetic Motion Data Charles Bricout et.al. 2502.00160 null
2025-01-31 Exploring Transfer Learning for Deep Learning Polyp Detection in Colonoscopy Images Using YOLOv8 Fabian Vazquez et.al. 2502.00133 null
2025-01-31 SynthmanticLiDAR: A Synthetic Dataset for Semantic Segmentation on LiDAR Imaging Javier Montalvo et.al. 2501.19035 link
2025-01-31 Lightspeed Geometric Dataset Distance via Sliced Optimal Transport Khai Nguyen et.al. 2501.18901 link
2025-01-31 Transfer Learning for Nonparametric Contextual Dynamic Pricing Fan Wang et.al. 2501.18836 link
2025-01-31 Early Diagnosis and Severity Assessment of Weligama Coconut Leaf Wilt Disease and Coconut Caterpillar Infestation using Deep Learning-based Image Processing Techniques Samitha Vidhanaarachchi et.al. 2501.18835 null
2025-01-30 Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images Wei-Lun Chen et.al. 2501.18453 null
2025-01-30 Function Encoders: A Principled Approach to Transfer Learning in Hilbert Spaces Tyler Ingebrand et.al. 2501.18373 null
2025-01-30 Transfer Learning of Surrogate Models: Integrating Domain Warping and Affine Transformations Shuaiqun Pan et.al. 2501.18344 null
2025-01-30 Advancing Personalized Federated Learning: Integrative Approaches with AI for Enhanced Privacy and Customization Kevin Cooper et.al. 2501.18174 null
2025-01-29 Digital Twin-Enabled Real-Time Control in Robotic Additive Manufacturing via Soft Actor-Critic Reinforcement Learning Matsive Ali et.al. 2501.18016 null
2025-01-29 LEKA:LLM-Enhanced Knowledge Augmentation Xinhao Zhang et.al. 2501.17802 null
2025-01-29 Action Recognition Using Temporal Shift Module and Ensemble Learning Anh-Kiet Duong et.al. 2501.17550 link
2025-01-29 EMD-Fuzzy: An Empirical Mode Decomposition Based Fuzzy Model for Cross-Stimulus Transfer Learning of SSVEP Beining Cao et.al. 2501.17475 null
2025-01-29 Fundamental Computational Limits in Pursuing Invariant Causal Prediction and Invariance-Guided Regularization Yihong Gu et.al. 2501.17354 null
2025-01-28 Stiff Transfer Learning for Physics-Informed Neural Networks Emilien Seiler et.al. 2501.17281 null
2025-01-28 CoRe-Net: Co-Operational Regressor Network with Progressive Transfer Learning for Blind Radar Signal Restoration Muhammad Uzair Zahid et.al. 2501.17125 null
2025-01-31 Multimodal Magic Elevating Depression Detection with a Fusion of Text and Audio Intelligence Lindy Gan et.al. 2501.16813 null
2025-01-28 Molecular-driven Foundation Model for Oncologic Pathology Anurag Vaidya et.al. 2501.16652 link
2025-01-27 Automatic Machine Learning Framework to Study Morphological Parameters of AGN Host Galaxies within $z < 1.4$ in the Hyper Supreme-Cam Wide Survey Chuan Tian et.al. 2501.15739 link
2025-01-26 Building Efficient Lightweight CNN Models Nathan Isong et.al. 2501.15547 null
2025-01-26 Universal Image Restoration Pre-training via Degradation Classification JiaKui Hu et.al. 2501.15510 link
2025-01-26 Expert-Free Online Transfer Learning in Multi-Agent Reinforcement Learning Alberto Castagna et.al. 2501.15495 link
2025-01-26 Cross-Modal Transfer from Memes to Videos: Addressing Data Scarcity in Hateful Video Detection Han Wang et.al. 2501.15438 link
2025-01-26 A Transfer Learning Framework for Anomaly Detection in Multivariate IoT Traffic Data Mahshid Rezakhani et.al. 2501.15365 null
2025-01-25 Explainable YOLO-Based Dyslexia Detection in Synthetic Handwriting Data Nora Fink et.al. 2501.15263 null
2025-01-25 In-Context Operator Learning for Linear Propagator Models Tingwei Meng et.al. 2501.15106 null
2025-01-24 A Recurrent Spiking Network with Hierarchical Intrinsic Excitability Modulation for Schema Learning Yingchao Yu et.al. 2501.14539 null
2025-01-24 Quantum Neural Networks: A Comparative Analysis and Noise Robustness Evaluation Tasnim Ahmed et.al. 2501.14412 null
2025-01-24 Deep Learning-Powered Classification of Thoracic Diseases in Chest X-Rays Yiming Lei et.al. 2501.14279 null
2025-01-24 Detection and Classification of Acute Lymphoblastic Leukemia Utilizing Deep Transfer Learning Md. Abu Ahnaf Mollick et.al. 2501.14228 null
2025-01-23 On the Transfer of Knowledge in Quantum Algorithms Esther Villar-Rodriguez et.al. 2501.14120 null
2025-01-23 Transfer Learning of Surrogate Models via Domain Affine Transformation Across Synthetic and Real-World Benchmarks Shuaiqun Pan et.al. 2501.14012 null
2025-01-23 2-Tier SimCSE: Elevating BERT for Robust Sentence Embeddings Yumeng Wang et.al. 2501.13758 null
2025-01-23 Skin Disease Detection and Classification of Actinic Keratosis and Psoriasis Utilizing Deep Transfer Learning Fahud Ahmmed et.al. 2501.13713 null
2025-01-23 GenTL: A General Transfer Learning Model for Building Thermal Dynamics Fabian Raisch et.al. 2501.13703 link
2025-01-23 WFCRL: A Multi-Agent Reinforcement Learning Benchmark for Wind Farm Control Claire Bizon Monroc et.al. 2501.13592 link
2025-01-23 NUDT4MSTAR: A New Dataset and Benchmark Towards SAR Target Recognition in the Wild Yongxiang Liu et.al. 2501.13354 link
2025-01-22 Multimodal AI on Wound Images and Clinical Notes for Home Patient Referral Reza Saadati Fard et.al. 2501.13247 null
2025-01-22 LLM4WM: Adapting LLM for Wireless Multi-Tasking Xuanyu Liu et.al. 2501.12983 null
2025-01-21 Bidirectional Brain Image Translation using Transfer Learning from Generic Pre-trained Models Fatima Haimour et.al. 2501.12488 null
2025-01-21 Transfer learning electronic structure: millielectron volt accuracy for sub-million-atom moiré semiconductor Ting Bao et.al. 2501.12452 null
2025-01-21 Tackling Small Sample Survival Analysis via Transfer Learning: A Study of Colorectal Cancer Prognosis Yonghao Zhao et.al. 2501.12421 link
2025-01-21 Efficient PINNs: Multi-Head Unimodular Regularization of the Solutions Space Pedro Tarancón-Álvarez et.al. 2501.12116 null
2025-01-21 Multi-Modal Variable-Rate CSI Reconstruction for FDD Massive MIMO Systems Yunseo Nam et.al. 2501.11926 null
2025-01-20 Rethinking Membership Inference Attacks Against Transfer Learning Cong Wu et.al. 2501.11577 null
2025-01-20 On the Adversarial Vulnerabilities of Transfer Learning in Remote Sensing Tao Bai et.al. 2501.11462 null
2025-01-20 How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks? Wenxuan Li et.al. 2501.11253 link
2025-01-20 Energy Consumption Reduction for UAV Trajectory Training : A Transfer Learning Approach Chenrui Sun et.al. 2501.11243 null
2025-01-19 Enhancing Brain Tumor Segmentation Using Channel Attention and Transfer learning Majid Behzadpour et.al. 2501.11196 link
2025-01-19 Transfer Learning Strategies for Pathological Foundation Models: A Systematic Evaluation in Brain Tumor Classification Ken Enda et.al. 2501.11014 null
2025-01-19 BeST – A Novel Source Selection Metric for Transfer Learning Ashutosh Soni et.al. 2501.10933 null
2025-01-19 Adaptive Target Localization under Uncertainty using Multi-Agent Deep Reinforcement Learning with Knowledge Transfer Ahmed Alagha et.al. 2501.10924 null
2025-01-18 Model-Robust and Adaptive-Optimal Transfer Learning for Tackling Concept Shifts in Nonparametric Regression Haotian Lin et.al. 2501.10870 null
2025-01-18 A Resource-Efficient Training Framework for Remote Sensing Text–Image Retrieval Weihang Zhang et.al. 2501.10638 null
2025-01-17 Surrogate-based multiscale analysis of experiments on thermoplastic composites under off-axis loading M. A. Maia et.al. 2501.10193 link
2025-01-17 Automatic Speech Recognition for Sanskrit with Transfer Learning Bidit Sadhukhan et.al. 2501.10024 null
2025-01-16 Sequential PatchCore: Anomaly Detection for Surface Inspection using Synthetic Impurities Runzhou Mao et.al. 2501.09579 null
2025-01-16 Transfer learning of many-body electronic correlation entropy from local measurements Faluke Aikebaier et.al. 2501.09505 null
2025-01-15 An analysis of data variation and bias in image-based dermatological datasets for machine learning classification Francisco Mauro et.al. 2501.08962 null
2025-01-15 Empowering Agricultural Insights: RiceLeafBD – A Novel Dataset and Optimal Model Selection for Rice Leaf Disease Diagnosis through Transfer Learning Technique Sadia Afrin Rimi et.al. 2501.08912 null
2025-01-15 A Bayesian Hierarchical Model for Generating Synthetic Unbalanced Power Distribution Grids Henrique O. Caetano et.al. 2501.08808 null
2025-01-15 Detecting Wildfire Flame and Smoke through Edge Computing using Transfer Learning Enhanced Deep Learning Models Giovanny Vazquez et.al. 2501.08639 null
2025-01-15 Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation Jiaqi Huang et.al. 2501.08580 link
2025-01-14 Mechanics Informatics: A paradigm for efficiently learning constitutive models Royal C. Ihuaenyi et.al. 2501.08314 null
2025-01-14 Continual Deep Active Learning for Medical Imaging: Replay-Base Architecture for Context Adaptation Rui Daniel et.al. 2501.08245 link
2025-01-14 Optimal Policy Adaptation under Covariate Shift Xueqing Liu et.al. 2501.08067 null
2025-01-16 Mining Intraday Risk Factor Collections via Hierarchical Reinforcement Learning based on Transferred Options Wenyan Xu et.al. 2501.07274 link
2025-01-13 Exploring the Use of Contrastive Language-Image Pre-Training for Human Posture Classification: Insights from Yoga Pose Analysis Andrzej D. Dobrzycki et.al. 2501.07221 null
2025-01-13 **AlgoRxplorers Precision in Mutation – Enhancing Drug Design with Advanced Protein Stability Prediction Tools** Karishma Thakrar et.al. 2501.07014
2025-01-12 Towards Fair and Privacy-Aware Transfer Learning for Educational Predictive Modeling: A Case Study on Retention Prediction in Community Colleges Chengyuan Yao et.al. 2501.06913 link
2025-01-12 Transfer Learning of Tabular Data by Finetuning Large Language Models Shourav B. Rabbani et.al. 2501.06863 null
2025-01-12 Rice Leaf Disease Detection: A Comparative Study Between CNN, Transformer and Non-neural Network Architectures Samia Mehnaz et.al. 2501.06740 null
2025-01-12 Hold On! Is My Feedback Useful? Evaluating the Usefulness of Code Review Comments Sharif Ahmed et.al. 2501.06738 null
2025-01-11 Transforming Social Science Research with Transfer Learning: Social Science Survey Data Integration with AI Ali Amini et.al. 2501.06577 null
2025-01-11 Mathematics of Digital Twins and Transfer Learning for PDE Models Yifei Zong et.al. 2501.06400 null
2025-01-10 IoT Firmware Version Identification Using Transfer Learning with Twin Neural Networks Ashley Andrews et.al. 2501.06033 null
2025-01-09 Patch-GAN Transfer Learning with Reconstructive Models for Cloud Removal Wanli Ma et.al. 2501.05265 null
2025-01-09 Load Forecasting for Households and Energy Communities: Are Deep Learning Models Worth the Effort? Lukas Moosbrugger et.al. 2501.05000 link
2025-01-09 A CT Image Classification Network Framework for Lung Tumors Based on Pre-trained MobileNetV2 Model and Transfer learning, And Its Application and Market Analysis in the Medical field Ziyang Gao et.al. 2501.04996 null
2025-01-09 AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data Haoran Zhu et.al. 2501.04969 link
2025-01-08 Deep Transfer $Q$ -Learning for Offline Non-Stationary Reinforcement Learning Jinhang Chai et.al. 2501.04870 null
2025-01-08 Cued Speech Generation Leveraging a Pre-trained Audiovisual Text-to-Speech Model Sanjana Sankar et.al. 2501.04799 null
2025-01-08 Rapid Automated Mapping of Clouds on Titan With Instance Segmentation Zachary Yahn et.al. 2501.04459 link
2025-01-08 A novel Facial Recognition technique with Focusing on Masked Faces Dana A Abdullah et.al. 2501.04444 null
2025-01-08 TADFormer : Task-Adaptive Dynamic Transformer for Efficient Multi-Task Learning Seungmin Baek et.al. 2501.04293 null
2025-01-08 Comparison of Neural Models for X-ray Image Classification in COVID-19 Detection Jimi Togni et.al. 2501.04196 null
2025-01-07 DeepVIVONet: Using deep neural operators to optimize sensor locations with application to vortex-induced vibrations Ruyin Wan et.al. 2501.04105 null
2025-01-07 Improving Dialectal Slot and Intent Detection with Auxiliary Tasks: A Multi-Dialectal Bavarian Case Study Xaver Maria Krückl et.al. 2501.03863 link
2025-01-07 SelectiveFinetuning: Enhancing Transfer Learning in Sleep Staging through Selective Domain Alignment Siyuan Zhao et.al. 2501.03764 null
2025-01-07 A Multimodal Lightweight Approach to Fault Diagnosis of Induction Motors in High-Dimensional Dataset Usman Ali et.al. 2501.03746 null
2025-01-07 Transfer Learning for Deep-Unfolded Combinatorial Optimization Solver with Quantum Annealer Ryo Hagiwara et.al. 2501.03518 null
2025-01-06 FTA-FTL: A Fine-Tuned Aggregation Federated Transfer Learning Scheme for Lithology Microscopic Image Classification Keyvan RahimiZadeh et.al. 2501.03349 link
2025-01-06 CM3T: Framework for Efficient Multimodal Learning for Inhomogeneous Interaction Datasets Tanay Agrawal et.al. 2501.03332 null
2025-01-06 Scalable Forward-Forward Algorithm Andrii Krutsylo et.al. 2501.03176 null
2025-01-06 Offline-to-online hyperparameter transfer for stochastic bandits Dravyansh Sharma et.al. 2501.02926 null
2025-01-06 Hybrid deep convolution model for lung cancer detection with transfer learning Sugandha Saxena et.al. 2501.02785 null
2025-01-08 Transfer learning via Regularized Linear Discriminant Analysis Hongzhe Zhang et.al. 2501.02411 null
2025-01-04 tCURLoRA: Tensor CUR Decomposition Based Low-Rank Parameter Adaptation for Medical Image Segmentation Guanghua He et.al. 2501.02227 null
2025-01-03 Transfer Learning for Individualized Treatment Rules: Application to Sepsis Patients Data from eICU-CRD and MIMIC-III Databases Andong Wang et.al. 2501.02128 null
2025-01-03 Google is all you need: Semi-Supervised Transfer Learning Strategy For Light Multimodal Multi-Task Classification Model Haixu Liu et.al. 2501.01611 null
2025-01-02 Transfer Neyman-Pearson Algorithm for Outlier Detection Mohammadreza M. Kalan et.al. 2501.01525 null
2025-01-02 Transfer Learning Analysis of Variational Quantum Circuits Huan-Hsin Tseng et.al. 2501.01507 null
2025-01-02 Robust COVID-19 Detection from Cough Sounds using Deep Neural Decision Tree and Forest: A Comprehensive Cross-Datasets Evaluation Rofiqul Islam et.al. 2501.01117 null
2025-01-02 SpecPT (Spectroscopy Pre-trained Transformer) Model for Extragalactic Spectroscopy: I. Architecture and Automated Redshift Measurement Rohan Pattnaik et.al. 2501.01070 null
2025-01-02 Prediction of Geoeffective CMEs Using SOHO Images and Deep Learning Khalid A. Alobaid et.al. 2501.01011 null
2025-01-02 Is It Still Fair? Investigating Gender Fairness in Cross-Corpus Speech Emotion Recognition Shreya G. Upadhyay et.al. 2501.00995 null
2025-01-01 Active and transfer learning with partially Bayesian neural networks for materials and chemicals Sarah I. Allec et.al. 2501.00952 link
2025-01-01 Intent-based Radio Scheduler for RAN Slicing: Learning to deal with different network scenarios Cleverson Nahum et.al. 2501.00950 link
2025-01-01 Navigating Nuance: In Quest for Political Truth Soumyadeep Sar et.al. 2501.00782 link
2024-12-31 Advanced Lung Nodule Segmentation and Classification for Early Detection of Lung Cancer using SAM and Transfer Learning Asha V et.al. 2501.00586 null
2024-12-31 Addressing Challenges in Data Quality and Model Generalization for Malaria Detection Kiswendsida Kisito Kabore et.al. 2501.00464 null
2024-12-30 Class-based Subset Selection for Transfer Learning under Extreme Label Shift Akul Goyal et.al. 2501.00162 null
2024-12-29 On Adversarial Robustness of Language Models in Transfer Learning Bohdan Turbal et.al. 2501.00066 null
2024-12-28 VisTabNet: Adapting Vision Transformers for Tabular Data Witold Wydmański et.al. 2501.00057 null
2024-12-28 LLM-Virus: Evolutionary Jailbreak Attack on Large Language Models Miao Yu et.al. 2501.00055 link
2024-12-30 Investigating layer-selective transfer learning of QAOA parameters for Max-Cut problem Francesco Aldo Venturelli et.al. 2412.21071 null
2024-12-30 Improving Location-based Thermal Emission Side-Channel Analysis Using Iterative Transfer Learning Tun-Chieh Lou et.al. 2412.21030 null
2024-12-30 Attention Is All You Need For Mixture-of-Depths Routing Advait Gadhikar et.al. 2412.20875 null
2024-12-30 Sample Correlation for Fingerprinting Deep Face Recognition Jiyang Guan et.al. 2412.20768 link
2024-12-30 Depression and Anxiety Prediction Using Deep Language Models and Transfer Learning Tomasz Rutowski et.al. 2412.20741 null
2024-12-29 LEARNER: A Transfer Learning Method for Low-Rank Matrix Estimation Sean McGrath et.al. 2412.20605 link
2024-12-28 Enhancing Transfer Learning for Medical Image Classification with SMOTE: A Comparative Study Md. Zehan Alam et.al. 2412.20235 null
2024-12-28 SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object Detection Phi Vu Tran et.al. 2412.20047 link
2024-12-28 Uncertainty Quantified Deep Learning and Regression Analysis Framework for Image Segmentation of Skin Cancer Lesions Elhoucine Elfatimi et.al. 2412.20007 link
2024-12-27 Data-driven tool wear prediction in milling, based on a process-integrated single-sensor approach Eric Hirsch et.al. 2412.19950 null
2024-12-27 Mouth Articulation-Based Anchoring for Improved Cross-Corpus Speech Emotion Recognition Shreya G. Upadhyay et.al. 2412.19909 null
2024-12-27 EEG-Reptile: An Automatized Reptile-Based Meta-Learning Library for BCIs Daniil A. Berdyshev et.al. 2412.19725 link
2024-12-27 Feature Alignment-Based Knowledge Distillation for Efficient Compression of Large Language Models Shuo Wang et.al. 2412.19449 null
2024-12-26 Large Language Models for Market Research: A Data-augmentation Approach Mengxin Wang et.al. 2412.19363 null
2024-12-26 Assessing Pre-trained Models for Transfer Learning through Distribution of Spectral Components Tengxue Zhang et.al. 2412.19085 null
2024-12-26 Robust Speech and Natural Language Processing Models for Depression Screening Y. Lu et.al. 2412.19072 null
2024-12-24 On the Applicability of Zero-Shot Cross-Lingual Transfer Learning for Sentiment Classification in Distant Language Pairs Andre Rusli et.al. 2412.18188 link
2024-12-24 Text-Aware Adapter for Few-Shot Keyword Spotting Youngmoon Jung et.al. 2412.18142 null
2024-12-24 Heterogeneous transfer learning for high dimensional regression with feature mismatch Jae Ho Chang et.al. 2412.18081 null
2024-12-24 SMAC-Hard: Enabling Mixed Opponent Strategy Script and Self-play on SMAC Yue Deng et.al. 2412.17707 link
2024-12-23 Improved Cotton Leaf Disease Classification Using Parameter-Efficient Deep Learning Framework Aswini Kumar Patra et.al. 2412.17587 null
2024-12-23 CALLIC: Content Adaptive Learning for Lossless Image Compression Daxin Li et.al. 2412.17464 null
2024-12-23 Feature Based Methods Domain Adaptation for Object Detection: A Review Paper Helia Mohamadi et.al. 2412.17325 null
2024-12-23 On the Feasibility of Vision-Language Models for Time-Series Classification Vinay Prithyani et.al. 2412.17304 link
2024-12-23 Trainingless Adaptation of Pretrained Models for Environmental Sound Classification Noriyuki Tonami et.al. 2412.17212 null
2024-12-24 Semantic Hierarchical Prompt Tuning for Parameter-Efficient Fine-Tuning Haowei Zhu et.al. 2412.16956 link
2024-12-22 Speech-Based Depression Prediction Using Encoder-Weight-Only Transfer Learning and a Large Corpus Amir Harati et.al. 2412.16900 null
2024-12-21 The Master Key Filters Hypothesis: Deep Filters Are General in DS-CNNs Zahra Babaiee et.al. 2412.16751 null
2024-12-21 Optoelectronic generative adversarial networks Jumin Qiu et.al. 2412.16672 link
2024-12-21 IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks Yaming Zhang et.al. 2412.16654 link
2024-12-21 Learning for Cross-Layer Resource Allocation in MEC-Aided Cell-Free Networks Chong Zheng et.al. 2412.16565 null
2024-12-20 SeagrassFinder: Deep Learning for Eelgrass Detection and Coverage Estimation in the Wild Jannik Elsäßer et.al. 2412.16147 null
2024-12-20 Monkey Transfer Learning Can Improve Human Pose Estimation Bradley Scott et.al. 2412.15966 null
2024-12-20 Polaris: Multi-Fidelity Design Space Exploration of Deep Learning Accelerators Chirag Sakhuja et.al. 2412.15548 null
2024-12-20 The First Multilingual Model For The Detection of Suicide Texts Rodolfo Zevallos et.al. 2412.15498 null
2024-12-19 A Multi-Fidelity Graph U-Net Model for Accelerated Physics Simulations Rini Jasmine Gladstone et.al. 2412.15372 null
2024-12-19 Transfer Learning Meets Functional Linear Regression: No Negative Transfer under Posterior Drift Xiaoyu Hu et.al. 2412.14563 null
2024-12-19 Color Enhancement for V-PCC Compressed Point Cloud via 2D Attribute Map Optimization Jingwei Bao et.al. 2412.14449 null
2024-12-18 Super-Resolution Generative Adversarial Network for Data Compression of Direct Numerical Simulations Ludovico Nista et.al. 2412.14150 null
2024-12-18 Trustworthy Transfer Learning: A Survey Jun Wu et.al. 2412.14116 null
2024-12-18 Language verY Rare for All Ibrahim Merad et.al. 2412.13924 null
2024-12-18 Understanding and Analyzing Model Robustness and Knowledge-Transfer in Multilingual Neural Machine Translation using TX-Ray Vageesh Saxena et.al. 2412.13881 null
2024-12-18 FlexPose: Pose Distribution Adaptation with Limited Guidance Zixiao Wang et.al. 2412.13463 null
2024-12-17 Deep Speech Synthesis from Multimodal Articulatory Representations Peter Wu et.al. 2412.13387 null
2024-12-16 A Digital twin for Diesel Engines: Operator-infused PINNs with Transfer Learning for Engine Health Monitoring Kamaljyoti Nath et.al. 2412.11967 null
2024-12-16 Prediction of social dilemmas in networked populations via graph neural networks Huaiyu Tan et.al. 2412.11775 null
2024-12-16 Classification of Spiral Galaxies by Spiral Arm Number using Convolutional Neural Network Ming Wei Lee et.al. 2412.11696 null
2024-12-18 CiTrus: Squeezing Extra Performance out of Low-data Bio-signal Transfer Learning Eloy Geenjaar et.al. 2412.11695 null
2024-12-16 Fast-staged CNN Model for Accurate pulmonary diseases and Lung cancer detection Abdelbaki Souid et.al. 2412.11681 null
2024-12-16 Multilabel Classification for Lung Disease Detection: Integrating Deep Learning and Natural Language Processing Maria Efimovich et.al. 2412.11452 null
2024-12-16 Accurate, Robust and Privacy-Preserving Brain-Computer Interface Decoding Xiaoqing Chen et.al. 2412.11390 null
2024-12-14 Global Estimation of Subsurface Eddy Kinetic Energy of Mesoscale Eddies Using a Multiple-input Residual Neural Network Chenyue Xie et.al. 2412.10656 null
2024-12-13 Active Poisoning: Efficient Backdoor Attacks on Transfer Learning-Based Brain-Computer Interfaces X. Jiang et.al. 2412.09933 null
2024-12-13 Data-Driven Transfer Learning Framework for Estimating Turning Movement Counts Xiaobo Ma et.al. 2412.09861 null
2024-12-12 BayesAdapter: enhanced uncertainty estimation in CLIP few-shot adaptation Pablo Morales-Álvarez et.al. 2412.09718 null
2024-12-12 A Novel Ensemble-Based Deep Learning Model with Explainable AI for Accurate Kidney Disease Diagnosis Md. Arifuzzaman et.al. 2412.09472 null
2024-12-12 Text Generation Models for Luxembourgish with Limited Data: A Balanced Multilingual Strategy Alistair Plum et.al. 2412.09415 null
2024-12-12 Prediction Aided by Surrogate Training Eric Xia et.al. 2412.09364 null
2024-12-12 Stop Relearning: Model Reuse via Feature Distribution Analysis for Incremental Entity Resolution Victor Christen et.al. 2412.09355 link
2024-12-12 Computer-Aided Osteoporosis Diagnosis Using Transfer Learning with Enhanced Features from Stacked Deep Learning Modules Ayesha Siddiqua et.al. 2412.09330 null
2024-12-12 Transfer Learning of RSSI to Improve Indoor Localisation Performance Thanaphon Suwannaphong et.al. 2412.09292 link
2024-12-12 Evaluating Pixel Language Models on Non-Standardized Languages Alberto Muñoz-Ortiz et.al. 2412.09084 null
2024-12-16 Improvement in Sign Language Translation Using Text CTC Alignment Sihan Tan et.al. 2412.09014 link
2024-12-12 A Wander Through the Multimodal Landscape: Efficient Transfer Learning via Low-rank Sequence Multimodal Adapter Zirun Guo et.al. 2412.08979 link
2024-12-11 Improving Satellite Imagery Masking using Multi-task and Transfer Learning Rangel Daroya et.al. 2412.08545 null
2024-12-11 ALoRE: Efficient Visual Adaptation via Aggregating Low Rank Experts Sinan Du et.al. 2412.08341 null
2024-12-11 Unified HT-CNNs Architecture: Transfer Learning for Segmenting Diverse Brain Tumors in MRI from Gliomas to Pediatric Tumors Ramy A. Zeineldin et.al. 2412.08240 null
2024-12-10 PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition Kartik Narayan et.al. 2412.07771 null
2024-12-10 Real-time Sign Language Recognition Using MobileNetV2 and Transfer Learning Smruti Jagtap et.al. 2412.07486 null
2024-12-10 T-TIME: Test-Time Information Maximization Ensemble for Plug-and-Play BCIs Siyang Li et.al. 2412.07228 link
2024-12-10 Monte Carlo Tree Search based Space Transfer for Black-box Optimization Shukuan Wang et.al. 2412.07186 link
2024-12-10 An Enhancement of CNN Algorithm for Rice Leaf Disease Image Classification in Mobile Applications Kayne Uriel K. Rodrigo et.al. 2412.07182 null
2024-12-10 Annotation Techniques for Judo Combat Phase Classification from Tournament Footage Anthony Miyaguchi et.al. 2412.07155 null
2024-12-10 Enhancing radioisotope identification in gamma spectra with transfer learning Peter Lalor et.al. 2412.07069 null
2024-12-09 Using optimal control to guide neural-network interpolation of continuously-parameterized gates Bikrant Bhattacharyya et.al. 2412.06623 link
2024-12-09 Representational Transfer Learning for Matrix Completion Yong He et.al. 2412.06233 null
2024-12-09 SGIA: Enhancing Fine-Grained Visual Classification with Sequence Generative Image Augmentation Qiyu Liao et.al. 2412.06138 null
2024-12-08 Self-Supervised Learning with Probabilistic Density Labeling for Rainfall Probability Estimation Junha Lee et.al. 2412.05825 link
2024-12-07 Finite Element Neural Network Interpolation. Part I: Interpretable and Adaptive Discretization for Solving PDEs Kateřina Škardová et.al. 2412.05719 link
2024-12-07 Finite Element Neural Network Interpolation. Part II: Hybridisation with the Proper Generalised Decomposition for non-linear surrogate modelling Alexandre Daby-Seesaram et.al. 2412.05714 link
2024-12-05 Assessing and Learning Alignment of Unimodal Vision and Language Models Le Zhang et.al. 2412.04616 null
2024-12-05 Moto: Latent Motion Token as the Bridging Language for Robot Manipulation Yi Chen et.al. 2412.04445 link
2024-12-05 Adult Glioma Segmentation in Sub-Saharan Africa using Transfer Learning on Stratified Finetuning Data Abhijeet Parida et.al. 2412.04111 null
2024-12-04 Automated galaxy sizes in Euclid images using the Segment Anything Model J. Vega-Ferrero et.al. 2412.03642 link
2024-12-04 Streaming Detection of Queried Event Start Cristobal Eyzaguirre et.al. 2412.03567 link
2024-12-04 Hybrid deep learning-based strategy for the hepatocellular carcinoma cancer grade classification of H&E stained liver histopathology images Ajinkya Deshpande et.al. 2412.03084 null
2024-12-04 Bayesian Transfer Learning for Enhanced Estimation and Inference Daoyuan Lai et.al. 2412.02986 null
2024-12-02 Pooling Solvent Mixtures for Solvation Free Energy Predictions Roel J. Leenhouts et.al. 2412.01982 null
2024-12-02 The Evolution and Future Perspectives of Artificial Intelligence Generated Content Chengzhang Zhu et.al. 2412.01948 null
2024-12-01 Pairwise Discernment of AffectNet Expressions with ArcFace Dylan Waldner et.al. 2412.01860 null
2024-12-02 Transfer Learning for Control Systems via Neural Simulation Relations Alireza Nadali et.al. 2412.01783 null
2024-12-02 FathomVerse: A community science dataset for ocean animal discovery Genevieve Patterson et.al. 2412.01701 null
2024-12-02 Command-line Risk Classification using Transformer-based Neural Architectures Paolo Notaro et.al. 2412.01655 null
2024-12-02 Task Adaptation of Reinforcement Learning-based NAS Agents through Transfer Learning Amber Cassimon et.al. 2412.01420 null
2024-12-02 A Bottom-Up Approach to Optimizing the Solar Organic Rankine Cycle for Transactive Energy Trading Silvia Anna Cordieri et.al. 2412.01359 null
2024-12-02 SiTSE: Sinhala Text Simplification Dataset and Evaluation Surangika Ranathunga et.al. 2412.01293 link
2024-11-30 Pruned Convolutional Attention Network Based Wideband Spectrum Sensing with Sub-Nyquist Sampling Peihao Dong et.al. 2412.00562 link
2024-11-29 Transfer Learning for High-dimensional Quantile Regression with Distribution Shift Ruiqi Bai et.al. 2411.19933 null
2024-11-29 Towards Santali Linguistic Inclusion: Building the First Santali-to-English Translation Model using mT5 Transformer and Data Augmentation Syed Mohammed Mostaque Billah et.al. 2411.19726 null
2024-11-28 Parameter-Efficient Transfer Learning for Music Foundation Models Yiwei Ding et.al. 2411.19371 link
2024-11-28 Pre-Training Graph Contrastive Masked Autoencoders are Strong Distillers for EEG Xinxu Wei et.al. 2411.19230 null
2024-11-28 TAMT: Temporal-Aware Model Tuning for Cross-Domain Few-Shot Action Recognition Yilong Wang et.al. 2411.19041 link
2024-11-28 Data Augmentation with Diffusion Models for Colon Polyp Localization on the Low Data Regime: How much real data is enough? Adrian Tormos et.al. 2411.18926 null
2024-11-27 Exponential Moving Average of Weights in Deep Learning: Dynamics and Benefits Daniel Morales-Brotons et.al. 2411.18704 null
2024-11-27 What do physics-informed DeepONets learn? Understanding and improving training for scientific computing applications Emily Williams et.al. 2411.18459 null
2024-11-27 Synthetic ECG Generation for Data Augmentation and Transfer Learning in Arrhythmia Classification José Fernando Núñez et.al. 2411.18456 null
2024-11-27 Deep learning-based spatio-temporal fusion for high-fidelity ultra-high-speed x-ray radiography Songyuan Tang et.al. 2411.18441 link
2024-11-27 Transfer Learning for Deep Learning-based Prediction of Lattice Thermal Conductivity L. Klochko et.al. 2411.18259 link
2024-11-27 Leveraging Transfer Learning for Astronomical Image Analysis Stefano Cavuoti et.al. 2411.18206 null
2024-11-27 Spectral-Spatial Transformer with Active Transfer Learning for Hyperspectral Image Classification Muhammad Ahmad et.al. 2411.18115 link
2024-11-27 Using different sources of ground truths and transfer learning to improve the generalization of photometric redshift estimation Jonathan Soriano et.al. 2411.18054 null
2024-11-27 Can bidirectional encoder become the ultimate winner for downstream applications of foundation models? Lewen Yang et.al. 2411.18021 null
2024-11-26 Breast Tumor Classification Using EfficientNet Deep Learning Model Majid Behzadpour et.al. 2411.17870 link
2024-11-26 “Nuclear thermometers” reveal the origin of the universal r-process nucleosynthesis José Nicolás Orce et.al. 2411.17852 null
2024-11-26 On the Generalization of Handwritten Text Recognition Models Carlos Garrido-Munoz et.al. 2411.17332 null
2024-11-26 MeerKAT discovery of a MIGHTEE Odd Radio Circle Ray P. Norris et.al. 2411.17311 null
2024-11-26 Learning Hierarchical Polynomials of Multiple Nonlinear Features with Three-Layer Networks Hengyu Fu et.al. 2411.17201 null
2024-11-26 Crack Detection in Infrastructure Using Transfer Learning, Spatial Attention, and Genetic Algorithm Optimization Feng Ding et.al. 2411.17140 null
2024-11-25 Glo-In-One-v2: Holistic Identification of Glomerular Cells, Tissues, and Lesions in Human and Mouse Histopathology Lining Yu et.al. 2411.16961 link
2024-11-25 SHuBERT: Self-Supervised Sign Language Representation Learning via Multi-Stream Cluster Prediction Shester Gueuwou et.al. 2411.16765 null
2024-11-25 Towards Foundation Models for Critical Care Time Series Manuel Burger et.al. 2411.16346 null
2024-11-25 Deep Learning for Motion Classification in Ankle Exoskeletons Using Surface EMG and IMU Signals Silas Ruhrberg Estévez et.al. 2411.16273 null
2024-11-24 Deep Learning for automated multi-scale functional field boundaries extraction using multi-date Sentinel-2 and PlanetScope imagery: Case Study of Netherlands and Pakistan Saba Zahid et.al. 2411.15923 null
2024-11-23 Trans-Glasso: A Transfer Learning Approach to Precision Matrix Estimation Boxin Zhao et.al. 2411.15624 null
2024-11-23 MulModSeg: Enhancing Unpaired Multi-Modal Medical Image Segmentation with Modality-Conditioned Text Embedding and Alternating Training Chengyin Li et.al. 2411.15576 link
2024-11-22 Personalization of Wearable Sensor-Based Joint Kinematic Estimation Using Computer Vision for Hip Exoskeleton Applications Changseob Song et.al. 2411.15366 null
2024-11-21 Parameter Efficient Mamba Tuning via Projector-targeted Diagonal-centric Linear Transformation Seokil Ham et.al. 2411.15224 null
2024-11-22 Towards Speaker Identification with Minimal Dataset and Constrained Resources using 1D-Convolution Neural Network Irfan Nafiz Shahan et.al. 2411.15082 link
2024-11-22 Implementation of Real-Time Lane Detection on Autonomous Mobile Robot Midriem Mirdanies et.al. 2411.14873 null
2024-11-22 Self-Supervised Learning for Ordered Three-Dimensional Structures Matthew Spellings et.al. 2411.14680 null
2024-11-21 Variable Extraction for Model Recovery in Scientific Literature Chunwei Liu et.al. 2411.14569 null
2024-11-21 SegBook: A Simple Baseline and Cookbook for Volumetric Medical Image Segmentation Jin Ye et.al. 2411.14525 null
2024-11-21 POS-tagging to highlight the skeletal structure of sentences Grigorii Churakov et.al. 2411.14393 link
2024-11-21 Data Formats in Analytical DBMSs: Performance Trade-offs and Future Directions Chunwei Liu et.al. 2411.14331 null
2024-11-21 BERT-Based Approach for Automating Course Articulation Matrix Construction with Explainable AI Natenaile Asmamaw Shiferaw et.al. 2411.14254 link
2024-11-21 Uncertainty-Aware Regression for Socio-Economic Estimation via Multi-View Remote Sensing Fan Yang et.al. 2411.14119 link
2024-11-20 Machine Learning Domain Adaptation in Spin Models with Continuous Phase Transitions Vladislav Chertenkov et.al. 2411.13027 null
2024-11-15 FedCL-Ensemble Learning: A Framework of Federated Continual Learning with Ensemble Transfer Learning Enhanced for Alzheimer’s MRI Classifications while Preserving Privacy Rishit Kapoor et.al. 2411.12756 null
2024-11-19 Multivariate and Online Transfer Learning with Uncertainty Quantification Jimmy Hickey et.al. 2411.12555 null
2024-11-19 Probe-Me-Not: Protecting Pre-trained Encoders from Malicious Probing Ruyi Ding et.al. 2411.12508 null
2024-11-19 Classification of Geographical Land Structure Using Convolution Neural Network and Transfer Learning Mustafa M. Abd Zaid et.al. 2411.12415 null
2024-11-19 Adversarial Multi-Agent Reinforcement Learning for Proactive False Data Injection Detection Kejun Chen et.al. 2411.12130 null
2024-11-18 In-Situ Melt Pool Characterization via Thermal Imaging for Defect Detection in Directed Energy Deposition Using Vision Transformers Israt Zarin Era et.al. 2411.12028 null
2024-11-18 Compression of Higher Order Ambisonics with Multichannel RVQGAN Toni Hirvonen et.al. 2411.12008 null
2024-11-18 TL-CLIP: A Power-specific Multimodal Pre-trained Visual Foundation Model for Transmission Line Defect Recognition Ke Zhang et.al. 2411.11370 null
2024-11-18 Efficient Transfer Learning for Video-language Foundation Models Haoxing Chen et.al. 2411.11223 link
2024-11-16 Adaptive Learning of Design Strategies over Non-Hierarchical Multi-Fidelity Models via Policy Alignment Akash Agrawal et.al. 2411.10841 null
2024-11-15 Large quadrupole deformation in $^{20}$Ne challenges rotor model and modern theory: urging for $α$ clusters in nuclei C. V. Mehl et.al. 2411.10598 null
2024-11-15 Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review Hossein Hassani et.al. 2411.10268 null
2024-11-15 Causal Time-Series Synchronization for Multi-Dimensional Forecasting Michael Mayr et.al. 2411.10152 null
2024-11-15 Unlocking Transfer Learning for Open-World Few-Shot Recognition Byeonggeun Kim et.al. 2411.09986 null
2024-11-15 mmSpyVR: Exploiting mmWave Radar for Penetrating Obstacles to Uncover Privacy Vulnerability of Virtual Reality Luoyu Mei et.al. 2411.09914 link
2024-11-14 Edge Caching Optimization with PPO and Transfer Learning for Dynamic Environments Farnaz Niknia et.al. 2411.09812 null
2024-11-14 Assessing the Performance of the DINOv2 Self-supervised Learning Vision Transformer Model for the Segmentation of the Left Atrium from MRI Images Bipasha Kundu et.al. 2411.09598 null
2024-11-14 A Practical Guide to Fine-tuning Language Models with Limited Data Márton Szép et.al. 2411.09539 null
2024-11-14 A Centralized-Distributed Transfer Model for Cross-Domain Recommendation Based on Multi-Source Heterogeneous Transfer Learning Ke Xu et.al. 2411.09286 null
2024-11-14 Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery Ashim Dahal et.al. 2411.09101 link
2024-11-13 Zero-shot Cross-lingual Transfer Learning with Multiple Source and Target Languages for Information Extraction: Language Selection and Adversarial Training Nghia Trung Ngo et.al. 2411.08785 null
2024-11-13 MVKTrans: Multi-View Knowledge Transfer for Robust Multiomics Classification Shan Cong et.al. 2411.08703 null
2024-11-13 Transfer Learning Guided Noise Reduction for Automatic Modulation Classification Zelin Ji et.al. 2411.08376 null
2024-11-13 DEEGITS: Deep Learning based Framework for Measuring Heterogenous Traffic State in Challenging Traffic Scenarios Muttahirul Islam et.al. 2411.08335 null
2024-11-12 Comprehensive and Comparative Analysis between Transfer Learning and Custom Built VGG and CNN-SVM Models for Wildfire Detection Aditya V. Jonnalagadda et.al. 2411.08171 null
2024-11-12 Triaxial nuclear shapes from simple ratios of electric-quadrupole matrix elements Elena Atanassova Lawrie et.al. 2411.08130 null
2024-11-11 High-Fidelity Cellular Network Control-Plane Traffic Generation without Domain Knowledge Z. Jonny Kong et.al. 2411.07345 null
2024-11-11 DeepONet as a Multi-Operator Extrapolation Model: Distributed Pretraining with Physics-Informed Fine-Tuning Zecheng Zhang et.al. 2411.07239 null
2024-11-10 Foundation Model for Composite Materials and Microstructural Analysis Ting-Ju Wei et.al. 2411.06565 link
2024-11-10 MBL-CPDP: A Multi-objective Bilevel Method for Cross-Project Defect Prediction via Automated Machine Learning Jiaxin Chen et.al. 2411.06491 null
2024-11-10 Do you want to play a game? Learning to play Tic-Tac-Toe in Hypermedia Environments Katharine Beaumont et.al. 2411.06398 null
2024-11-10 A Hybrid Approach for COVID-19 Detection: Combining Wasserstein GAN with Transfer Learning Sumera Rounaq et.al. 2411.06397 null
2024-11-09 Deep Nonparametric Conditional Independence Tests for Images Marco Simnacher et.al. 2411.06140 link
2024-11-12 Cross-Domain Transfer Learning using Attention Latent Features for Multi-Agent Trajectory Prediction Jia Quan Loh et.al. 2411.06087 null
2024-11-09 Predicting band structures for 2D Photonic Crystals via Deep Learning Yueqi Wang et.al. 2411.06063 null
2024-11-08 Towards Equitable ASD Diagnostics: A Comparative Study of Machine and Deep Learning Models Using Behavioral and Facial Data Mohammed Aledhari et.al. 2411.05880 null
2024-11-08 Predicting Stroke through Retinal Graphs and Multimodal Self-supervised Learning Yuqing Huang et.al. 2411.05597 link
2024-11-07 AGE2HIE: Transfer Learning from Brain Age to Predicting Neurocognitive Outcome for Infant Brain Injury Rina Bao et.al. 2411.05188 null
2024-11-07 High Entropy Alloy property predictions using Transformer-based language model Spyros Kamnis et.al. 2411.04861 null
2024-11-07 SpectraFM: Tuning into Stellar Foundation Models Nolan Koblischke et.al. 2411.04750 link
2024-11-07 wav2sleep: A Unified Multi-Modal Approach to Sleep Stage Classification from Physiological Signals Jonathan F. Carter et.al. 2411.04644 link
2024-11-07 Enhancing Bronchoscopy Depth Estimation through Synthetic-to-Real Domain Adaptation Qingyao Tian et.al. 2411.04404 null
2024-11-06 Fine-tuning – a Transfer Learning approach Joseph Arul Raj et.al. 2411.03941 null
2024-11-06 Cross Feature Fusion of Fundus Image and Generated Lesion Map for Referable Diabetic Retinopathy Classification Dahyun Mok et.al. 2411.03618 null
2024-11-05 Energy Price Modelling: A Comparative Evaluation of four Generations of Forecasting Methods Alexandru-Victor Andrei et.al. 2411.03372 null
2024-11-05 Proxy-informed Bayesian transfer learning with unknown sources Sabina J. Sloman et.al. 2411.03263 null
2024-11-05 Exploiting the Segment Anything Model (SAM) for Lung Segmentation in Chest X-ray Images Gabriel Bellon de Carvalho et.al. 2411.03064 null
2024-11-05 A Mamba Foundation Model for Time Series Forecasting Haoyu Ma et.al. 2411.02941 null
2024-11-04 Supervised Transfer Learning Framework for Fault Diagnosis in Wind Turbines Kenan Weber et.al. 2411.02127 null
2024-11-04 AM Flow: Adapters for Temporal Processing in Action Recognition Tanay Agrawal et.al. 2411.02065 null
2024-11-04 V-CAS: A Realtime Vehicle Anti Collision System Using Vision Transformer on Multi-Camera Streams Muhammad Waqas Ashraf et.al. 2411.01963 null
2024-11-03 Interaction-Aware Trajectory Prediction for Safe Motion Planning in Autonomous Driving: A Transformer-Transfer Learning Approach Jinhao Liang et.al. 2411.01475 null
2024-11-02 Transfer Learning for Finetuning Large Language Models Tobias Strangmann et.al. 2411.01195 null
2024-11-02 Transfer Learning Between U.S. Presidential Elections: How Should We Learn From A 2020 Ad Campaign To Inform 2024 Ad Campaigns? Xinran Miao et.al. 2411.01100 null
2024-11-01 Magnitude Pruning of Large Pretrained Transformer Models with a Mixture Gaussian Prior Mingxuan Zhang et.al. 2411.00969 null
2024-10-31 Denoising study of Fluoroscopic Images in real time tumor tracking System based on Statistical model of noise Yongxuan Yan et.al. 2411.00199 null
2024-10-31 Attention is All You Need to Optimize Wind Farm Operations and Maintenance Iman Kazemian et.al. 2410.24052 null
2024-10-31 Rethinking Inverse Reinforcement Learning: from Data Alignment to Task Alignment Weichao Zhou et.al. 2410.23680 link
2024-10-31 BioNCERE: Non-Contrastive Enhancement For Relation Extraction In Biomedical Texts Farshad Noravesh et.al. 2410.23583 null
2024-10-30 Mind the Gap: A Generalized Approach for Cross-Modal Embedding Alignment Arihan Yadav et.al. 2410.23437 null
2024-10-30 Domain-decomposed image classification algorithms using linear discriminant analysis and convolutional neural networks Axel Klawonn et.al. 2410.23359 null
2024-10-30 Sequential Order-Robust Mamba for Time Series Forecasting Seunghan Lee et.al. 2410.23356 null
2024-10-30 Transfer Learning in Vocal Education: Technical Evaluation of Limited Samples Describing Mezzo-soprano Zhenyi Hou et.al. 2410.23325 null
2024-10-30 Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-in Gamma Probe Songyu Xu et.al. 2410.23154 null
2024-10-30 Don’t Just Pay Attention, PLANT It: Transfer L2R Models to Fine-tune Attention in Extreme Multi-Label Text Classification Debjyoti Saharoy et.al. 2410.23066 null
2024-10-30 MutaPLM: Protein Language Modeling for Mutation Explanation and Engineering Yizhen Luo et.al. 2410.22949 link
2024-10-30 Self-Driving Car Racing: Application of Deep Reinforcement Learning Florentiana Yuwono et.al. 2410.22766 null
2024-10-29 Towards Neural-Network-based optical temperature sensing of Semiconductor Membrane External Cavity Laser Jakob Mannstadt et.al. 2410.22528 null
2024-10-29 The PV-ALE Dataset: Enhancing Apple Leaf Disease Classification Through Transfer Learning with Convolutional Neural Networks Joseph Damilola Akinyemi et.al. 2410.22490 null
2024-10-30 Feature distribution Adaptation Network for Speech Emotion Recognition Shaokai Li et.al. 2410.22023 link
2024-10-29 Advancing Efficient Brain Tumor Multi-Class Classification – New Insights from the Vision Mamba Model in Transfer Learning Yinyi Lai et.al. 2410.21872 null
2024-10-29 Cross-Domain Transfer Learning Method for Thermal Adaptive Behavior Recognition with WiFi Zhaohe Lv et.al. 2410.21827 null
2024-10-30 Adaptive Transfer Clustering: A Unified Framework Yuqi Gu et.al. 2410.21263 link
2024-10-28 Breccia and basalt classification of thin sections of Apollo rocks with deep learning Freja Thoresen et.al. 2410.21024 null
2024-10-28 KANsformer for Scalable Beamforming Xinke Xie et.al. 2410.20690 null
2024-10-27 Causal Modeling in Multi-Context Systems: Distinguishing Multiple Context-Specific Causal Graphs which Account for Observational Support Martin Rabel et.al. 2410.20405 null
2024-10-27 Uncovering Capabilities of Model Pruning in Graph Contrastive Learning Wu Junran et.al. 2410.20356 null
2024-10-26 Detection-Guided Deep Learning-Based Model with Spatial Regularization for Lung Nodule Segmentation Jiasen Zhang et.al. 2410.20154 null
2024-10-26 Sensor2Text: Enabling Natural Language Interactions for Daily Activity Tracking Using Wearable Sensors Wenqiang Chen et.al. 2410.20034 null
2024-10-25 Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language Models Zheng Zhao et.al. 2410.20008 null
2024-10-25 The Galaxy Zoo Catalogs for the Galaxy And Mass Assembly (GAMA) Survey Benne W. Holwerda et.al. 2410.19985 null
2024-10-25 A Review of Deep Learning Approaches for Non-Invasive Cognitive Impairment Detection Muath Alsuhaibani et.al. 2410.19898 null
2024-10-25 Learning the Regularization Strength for Deep Fine-Tuning via a Data-Emphasized Variational Objective Ethan Harvey et.al. 2410.19675 link
2024-10-25 Transferring Knowledge from High-Quality to Low-Quality MRI for Adult Glioma Diagnosis Yanguang Zhao et.al. 2410.18698 null
2024-10-23 Deep learning for model correction of dynamical systems with data scarcity Caroline Tatsuoka et.al. 2410.17913 null
2024-10-23 New Insight in Cervical Cancer Diagnosis Using Convolution Neural Network Architecture Ach. Khozaimi et.al. 2410.17735 null
2024-10-22 Subshell gaps and onsets of collectivity from proton and neutron pairing gap correlations José Nicolás Orce et.al. 2410.17436 null
2024-10-23 Understanding Transfer Learning via Mean-field Analysis Gholamali Aminian et.al. 2410.17128 null
2024-10-22 Development of CNN Architectures using Transfer Learning Methods for Medical Image Classification Ganga Prasad Basyal et.al. 2410.16711 null
2024-10-22 Enhancing Two-Player Performance Through Single-Player Knowledge Transfer: An Empirical Study on Atari 2600 Games Kimiya Saadat et.al. 2410.16653 link
2024-10-21 Towards Optimal Adapter Placement for Efficient Transfer Learning Aleksandra I. Nowak et.al. 2410.15858 null
2024-10-21 SSMT: Few-Shot Traffic Forecasting with Single Source Meta-Transfer Kishor Kumar Bhaumik et.al. 2410.15589 null
2024-10-20 Improving 3D Medical Image Segmentation at Boundary Regions using Local Self-attention and Global Volume Mixing Daniya Najiha Abdul Kareem et.al. 2410.15360 link
2024-10-20 FoMo: A Foundation Model for Mobile Traffic Forecasting with Diffusion Model Haoye Chai et.al. 2410.15322 null
2024-10-19 Less is More: Parameter-Efficient Selection of Intermediate Tasks for Transfer Learning David Schulte et.al. 2410.15148 link
2024-10-19 Generalizable Prediction Model of Molten Salt Mixture Density with Chemistry-Informed Transfer Learning Julian Barra et.al. 2410.15120 null
2024-10-19 Water quality polluted by total suspended solids classified within an Artificial Neural Network approach I. Luviano Soto et.al. 2410.14929 null
2024-10-18 A novel approach towards the classification of Bone Fracture from Musculoskeletal Radiography images using Attention Based Transfer Learning Sayeda Sanzida Ferdous Ruhi et.al. 2410.14833 null
2024-10-18 Effects of Soft-Domain Transfer and Named Entity Information on Deception Detection Steven Triplett et.al. 2410.14814 null
2024-10-18 How Does Data Diversity Shape the Weight Landscape of Neural Networks? Yang Ba et.al. 2410.14602 null
2024-10-18 Transfer Reinforcement Learning in Heterogeneous Action Spaces using Subgoal Mapping Kavinayan P. Sivakumar et.al. 2410.14484 null
2024-10-18 Predicting the trajectory of intracranial pressure in patients with traumatic brain injury: evaluation of a foundation model for time series Florian D. van Leeuwen et.al. 2410.14333 null
2024-10-18 Transfer Learning on Transformers for Building Energy Consumption Forecasting – A Comparative Study Robert Spencer et.al. 2410.14107 null
2024-10-18 ST-MoE-BERT: A Spatial-Temporal Mixture-of-Experts Framework for Long-Term Cross-City Mobility Prediction Haoyu He et.al. 2410.14099 link
2024-10-16 FedGTST: Boosting Global Transferability of Federated Models via Statistics Tuning Evelyn Ma et.al. 2410.13045 null
2024-10-15 Exploring transfer learning for Deep NLP systems on rarely annotated languages Dipendra Yadav et.al. 2410.12879 null
2024-10-17 Local transfer learning Gaussian process modeling, with applications to surrogate modeling of expensive computer simulators Xinming Wang et.al. 2410.12690 null
2024-10-16 Tracking Universal Features Through Fine-Tuning and Model Merging Niels Horn et.al. 2410.12391 null
2024-10-16 iFuzzyTL: Interpretable Fuzzy Transfer Learning for SSVEP BCI System Xiaowei Jiang et.al. 2410.12267 null
2024-10-16 Transfer Learning on Multi-Dimensional Data: A Novel Approach to Neural Network-Based Surrogate Modeling Adrienne M. Propp et.al. 2410.12241 null
2024-10-16 TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration Yiwei Guo et.al. 2410.12183 link
2024-10-15 Learning to rumble: Automated elephant call classification, detection and endpointing using deep architectures Christiaan M. Geldenhuys et.al. 2410.12082 null
2024-10-15 A Survey on Deep Tabular Learning Shriyank Somvanshi et.al. 2410.12034 null
2024-10-15 Transfer Learning Adapts to Changing PSD in Gravitational Wave Data Beka Modrekiladze et.al. 2410.11911 null
2024-10-15 YOLO-ELA: Efficient Local Attention Modeling for High-Performance Real-Time Insulator Defect Detection Olalekan Akindele et.al. 2410.11727 null
2024-10-15 Transfer Learning with Foundational Models for Time Series Forecasting using Low-Rank Adaptations M. Germán-Morales et.al. 2410.11539 null
2024-10-15 Improving Bias in Facial Attribute Classification: A Combined Impact of KL Divergence induced Loss Function and Dual Attention Shweta Patel et.al. 2410.11176 null
2024-10-14 TL-PCA: Transfer Learning of Principal Component Analysis Sharon Hendy et.al. 2410.10805 null
2024-10-14 Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework Zhengwei Yang et.al. 2410.10663 null
2024-10-14 SpeGCL: Self-supervised Graph Spectrum Contrastive Learning without Positive Samples Yuntao Shou et.al. 2410.10365 null
2024-10-12 Bayesian Transfer Learning for Artificially Intelligent Geospatial Systems: A Predictive Stacking Approach Luca Presicce et.al. 2410.09504 link
2024-10-12 Deep Transfer Learning: Model Framework and Error Analysis Yuling Jiao et.al. 2410.09383 null
2024-10-12 Hey AI Can You Grade My Essay?: Automatic Essay Grading Maisha Maliha et.al. 2410.09319 null
2024-10-11 Meta-Transfer Learning Empowered Temporal Graph Networks for Cross-City Real Estate Appraisal Weijia Zhang et.al. 2410.08947 null
2024-10-10 Features are fate: a theory of transfer learning in high-dimensional regression Javan Tahir et.al. 2410.08194 null
2024-10-10 Non-transferable Pruning Ruyi Ding et.al. 2410.08015 null
2024-10-10 CL3: A Collaborative Learning Framework for the Medical Data Ensuring Data Privacy in the Hyperconnected Environment Mohamamd Zavid Parvez et.al. 2410.07900 link
2024-10-10 Unsupervised Data Validation Methods for Efficient Model Training Yurii Paniv et.al. 2410.07880 null
2024-10-10 Robustness and Security Enhancement of Radio Frequency Fingerprint Identification in Time-Varying Channels Lu Yang et.al. 2410.07591 null
2024-10-10 Physics-informed neural networks for multi-field visualization with single-color laser induced fluorescence Nagahiro Ohashi et.al. 2410.07568 null
2024-10-09 Collusion Detection with Graph Neural Networks Lucas Gomes et.al. 2410.07091 null
2024-10-09 Z-upscaling: Optical Flow Guided Frame Interpolation for Isotropic Reconstruction of 3D EM Volumes Fisseha A. Ferede et.al. 2410.07043 link
2024-10-09 Selecting the Best Sequential Transfer Path for Medical Image Segmentation with Limited Labeled Data Jingyun Yang et.al. 2410.06892 link
2024-10-09 Transfer Learning for a Class of Cascade Dynamical Systems Shima Rabiei et.al. 2410.06828 null
2024-10-09 Seg2Act: Global Context-aware Action Generation for Document Logical Structuring Zichao Li et.al. 2410.06802 link
2024-10-09 Utilizing Transfer Learning and pre-trained Models for Effective Forest Fire Detection: A Case Study of Uttarakhand Hari Prabhat Gupta et.al. 2410.06743 null
2024-10-09 On The Relationship between Visual Anomaly-free and Anomalous Representations Riya Sadrani et.al. 2410.06576 null
2024-10-09 Model-assisted and Knowledge-guided Transfer Regression for the Underrepresented Population Doudou Zhou et.al. 2410.06484 null
2024-10-08 Advancements in Road Lane Mapping: Comparative Fine-Tuning Analysis of Deep Learning-based Semantic Segmentation Methods Using Aerial Imagery Xuanchen et.al. 2410.05717 null
2024-10-08 Robust Transfer Learning for Active Level Set Estimation with Locally Adaptive Gaussian Process Prior Giang Ngo et.al. 2410.05660 null
2024-10-08 Deep Transfer Learning-based Detection for Flash Memory Channels Zhen Mei et.al. 2410.05618 null
2024-10-07 Pre-Ictal Seizure Prediction Using Personalized Deep Learning Shriya Jaddu et.al. 2410.05491 null
2024-10-07 Deep learning-based Visual Measurement Extraction within an Adaptive Digital Twin Framework from Limited Data Using Transfer Learning Mehrdad Shafiei Dizaji et.al. 2410.05403 null
2024-10-07 Hyper-Representations: Learning from Populations of Neural Networks Konstantin Schürholt et.al. 2410.05107 link
2024-10-07 Learning Interpretable Hierarchical Dynamical Systems Models from Time Series Data Manuel Brenner et.al. 2410.04814 null
2024-10-06 Learning De-Biased Representations for Remote-Sensing Imagery Zichen Tian et.al. 2410.04546 link
2024-10-06 Transfer Learning with General Estimating Equations Han Yan et.al. 2410.04398 null
2024-10-05 Deep Transfer Learning Based Peer Review Aggregation and Meta-review Generation for Scientific Articles Md. Tarek Hasan et.al. 2410.04202 null
2024-10-04 Interpolation-Free Deep Learning for Meteorological Downscaling on Unaligned Grids Across Multiple Domains with Application to Wind Power Jean-Sébastien Giroux et.al. 2410.03945 null
2024-10-03 Reconstructing Human Mobility Pattern: A Semi-Supervised Approach for Cross-Dataset Transfer Learning Xishun Liao et.al. 2410.03788 null
2024-10-04 SAG: Style-Aligned Article Generation via Model Collaboration Chenning Xu et.al. 2410.03137 null
2024-10-04 Remaining Useful Life Prediction: A Study on Multidimensional Industrial Signal Processing and Efficient Transfer Learning Based on Large Language Models Yan Chen et.al. 2410.03134 null
2024-10-03 Ethio-Fake: Cutting-Edge Approaches to Combat Fake News in Under-Resourced Languages Using Explainable AI Mesay Gemeda Yigezu et.al. 2410.02609 null
2024-10-03 Source Data Selection for Brain-Computer Interfaces based on Simple Features Frida Heskebeck et.al. 2410.02360 null
2024-10-03 QDGset: A Large Scale Grasping Dataset Generated with Quality-Diversity Johann Huber et.al. 2410.02319 null
2024-10-03 The Comparison of Individual Cat Recognition Using Neural Networks Mingxuan Li et.al. 2410.02305 null
2024-10-03 A Novel Method for Accurate & Real-time Food Classification: The Synergistic Integration of EfficientNetB7, CBAM, Transfer Learning, and Data Augmentation Shayan Rokhva et.al. 2410.02304 null
2024-10-03 Universality in Transfer Learning for Linear Models Reza Ghane et.al. 2410.02164 null
2024-10-02 In-Context Transfer Learning: Demonstration Synthesis by Transferring Similar Tasks Dingzirui Wang et.al. 2410.01548 link
2024-10-02 RS-FME-SwinT: A Novel Feature Map Enhancement Framework Integrating Customized SwinT with Residual and Spatial CNN for Monkeypox Diagnosis Saddam Hussain Khan et.al. 2410.01216 null
2024-10-02 Recovering Manifold Structure Using Ollivier-Ricci Curvature Tristan Luca Saidi et.al. 2410.01149 link
2024-09-30 On the topology and geometry of population-based SHM Keith Worden et.al. 2410.00923 null
2024-10-01 Advanced Arabic Alphabet Sign Language Recognition Using Transfer Learning and Transformer Models Mazen Balat et.al. 2410.00681 null
2024-10-01 EMGTTL: Transformers-Based Transfer Learning for Classification of ADL using Raw Surface EMG Signals Ashraf Ali Kareemulla et.al. 2410.00586 null
2024-10-01 Scalable Multi-Task Transfer Learning for Molecular Property Prediction Chanhui Lee et.al. 2410.00432 null
2024-09-30 FireLite: Leveraging Transfer Learning for Efficient Fire Detection in Resource-Constrained Environments Mahamudul Hasan et.al. 2409.20384 null
2024-09-30 UIR-LoRA: Achieving Universal Image Restoration through Multiple Low-Rank Adaptation Cheng Zhang et.al. 2409.20197 link
2024-09-30 SurgPETL: Parameter-Efficient Image-to-Surgical-Video Transfer Learning for Surgical Phase Recognition Shu Yang et.al. 2409.20083 null
2024-09-30 Model Selection with a Shapelet-based Distance Measure for Multi-source Transfer Learning in Time Series Classification Jiseok Lee et.al. 2409.20005 link
2024-09-29 MedViLaM: A multimodal large language model with advanced generalizability and explainability for medical data understanding and generation Lijian Xu et.al. 2409.19684 link
2024-09-29 Brain Tumor Classification on MRI in Light of Molecular Markers Jun Liu et.al. 2409.19583 null
2024-09-29 A Universal Deep Learning Framework for Materials X-ray Absorption Spectra Shubha R. Kharel et.al. 2409.19552 link
2024-09-28 Accelerating Malware Classification: A Vision Transformer Solution Shrey Bavishi et.al. 2409.19461 link
2024-09-28 On the universality of neural encodings in CNNs Florentin Guth et.al. 2409.19460 null
2024-09-27 Meta-RTL: Reinforcement-Based Meta-Transfer Learning for Low-Resource Commonsense Reasoning Yu Fu et.al. 2409.19075 null
2024-09-27 Audio-Based Linguistic Feature Extraction for Enhancing Multi-lingual and Low-Resource Text-to-Speech Youngjae Kim et.al. 2409.18622 null
2024-09-27 How Effective is Pre-training of Large Masked Autoencoders for Downstream Earth Observation Tasks? Jose Sosa et.al. 2409.18536 null
2024-10-01 Automated Segmentation and Analysis of Microscopy Images of Laser Powder Bed Fusion Melt Tracks Aagam Shah et.al. 2409.18326 null
2024-09-26 Jump Diffusion-Informed Neural Networks with Transfer Learning for Accurate American Option Pricing under Data Scarcity Qiguo Sun et.al. 2409.18168 null
2024-09-26 Transfer Learning in $\ell_1$ Regularized Regression: Hyperparameter Selection Strategy based on Sharp Asymptotic Analysis Koki Okajima et.al. 2409.17704 null
2024-09-26 T3: A Novel Zero-shot Transfer Learning Framework Iteratively Training on an Assistant Task for a Target Task Xindi Tong et.al. 2409.17640 null
2024-09-26 MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models Gongfan Fang et.al. 2409.17481 link
2024-09-24 Transfer learning for financial data predictions: a systematic review V. Lanzetta et.al. 2409.17183 null
2024-09-25 Cross-lingual Speech Emotion Recognition: Humans vs. Self-Supervised Models Zhichen Han et.al. 2409.16920 link
2024-09-25 GraphLoRA: Structure-Aware Contrastive Low-Rank Adaptation for Cross-Graph Transfer Learning Zhe-Rui Yang et.al. 2409.16670 link
2024-09-25 Graph Pruning Based Spatial and Temporal Graph Convolutional Network with Transfer Learning for Traffic Prediction Zihao Jing et.al. 2409.16532 link
2024-09-24 Lessons Learned from a Unifying Empirical Study of Parameter-Efficient Transfer Learning (PETL) in Visual Recognition Zheda Mai et.al. 2409.16434 link
2024-09-24 Stable Survival Extrapolation via Transfer Learning Anastasios Apsemidis et.al. 2409.16044 null
2024-09-24 Unleashing the Potential of Synthetic Images: A Study on Histopathology Image Classification Leire Benito-Del-Valle et.al. 2409.16002 link
2024-09-24 Machine Translation Advancements of Low-Resource Indian Languages by Transfer Learning Bin Wei et.al. 2409.15879 null
2024-09-21 Generalization in birdsong classification: impact of transfer learning methods and dataset characteristics Burooj Ghani et.al. 2409.15383 null
2024-09-22 From Lazy to Rich: Exact Learning Dynamics in Deep Linear Networks Clémentine C. J. Dominé et.al. 2409.14623 null
2024-09-21 Multiple-Exit Tuning: Towards Inference-Efficient Adaptation for Vision Transformer Zheng Liu et.al. 2409.13999 null
2024-09-20 Transfer Learning with Clinical Concept Embeddings from Large Language Models Yuhe Gao et.al. 2409.13893 null
2024-09-20 Transfer Learning for Passive Sonar Classification using Pre-trained Audio and ImageNet Models Amirmohammad Mohammadi et.al. 2409.13878 null
2024-09-20 Transfer Learning and Double U-Net Empowered Wave Propagation Model in Complex Indoor Environment Ziheng Fu et.al. 2409.13833 null
2024-09-20 MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension Ting Liu et.al. 2409.13609 link
2024-09-20 Deep Learning and Machine Learning, Advancing Big Data Analytics and Management: Tensorflow Pretrained Models Keyu Chen et.al. 2409.13566 null
2024-09-20 Overcoming Data Limitations in Internet Traffic Forecasting: LSTM Models with Transfer Learning and Wavelet Augmentation Sajal Saha et.al. 2409.13181 null
2024-09-20 Bilateral Sharpness-Aware Minimization for Flatter Minima Jiaxin Deng et.al. 2409.13173 null
2024-09-19 Recognition of Harmful Phytoplankton from Microscopic Images using Deep Learning Aymane Khaldi et.al. 2409.12900 null
2024-09-19 Rapid aerodynamic prediction of swept wings via physics-embedded transfer learning Yunjia Yang et.al. 2409.12711 null
2024-09-19 Exploring bat song syllable representations in self-supervised audio encoders Marianne de Heer Kloots et.al. 2409.12634 null
2024-09-19 Using Large Language Models to Generate Clinical Trial Tables and Figures Yumeng Yang et.al. 2409.12046 null
2024-09-18 All-in-one foundational models learning across quantum chemical levels Yuxinxin Chen et.al. 2409.12015 link
2024-09-18 Location based Probabilistic Load Forecasting of EV Charging Sites: Deep Transfer Learning with Multi-Quantile Temporal Convolutional Network Mohammad Wazed Ali et.al. 2409.11862 null
2024-09-18 Bridging Domain Gap for Flight-Ready Spaceborne Vision Tae Ha Park et.al. 2409.11661 null
2024-09-17 Leveraging Reviewer Experience in Code Review Comment Generation Hong Yi Lin et.al. 2409.10959 null
2024-09-16 Can Transfer Learning be Used to Identify Tropical State-Dependent Bias Relevant to Midlatitude Subseasonal Predictability? Kirsten J. Mayer et.al. 2409.10755 null
2024-09-16 RF-GML: Reference-Free Generative Machine Listener Arijit Biswas et.al. 2409.10210 null
2024-09-16 A Comparative Study of Open Source Computer Vision Models for Application on Small Data: The Case of CFRP Tape Laying Thomas Fraunholz et.al. 2409.10104 null
2024-09-14 Target Speaker ASR with Whisper Alexander Polok et.al. 2409.09543 link
2024-09-14 On the Generalizability of Foundation Models for Crop Type Mapping Yi-Chia Chang et.al. 2409.09451 link
2024-09-14 The T05 System for The VoiceMOS Challenge 2024: Transfer Learning from Deep Image Classifier to Naturalness MOS Prediction of High-Quality Synthetic Speech Kaito Baba et.al. 2409.09305 link
2024-09-22 Train-On-Request: An On-Device Continual Learning Workflow for Adaptive Real-World Brain Machine Interfaces Lan Mei et.al. 2409.09161 link
2024-09-11 Distributed Convolutional Neural Network Training on Mobile and Edge Clusters Pranav Rama et.al. 2409.09083 null
2024-09-13 Comparative Analysis of Pretrained Audio Representations in Music Recommender Systems Yan-Martin Tamm et.al. 2409.08987 link
2024-09-13 Data Efficient Child-Adult Speaker Diarization with Simulated Conversations Anfeng Xu et.al. 2409.08881 link
2024-09-13 Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages Yao-Fei Cheng et.al. 2409.08872 null
2024-09-12 Identification of head impact locations, speeds, and force based on head kinematics Xianghao Zhan et.al. 2409.08177 link
2024-09-12 SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality Chenyang Lei et.al. 2409.08083 link
2024-09-12 SPARK: Self-supervised Personalized Real-time Monocular Face Capture Kelian Baert et.al. 2409.07984 null
2024-09-12 Data-efficient multi-fidelity training for high-fidelity machine learning interatomic potentials Jaesun Kim et.al. 2409.07947 null
2024-09-12 Reimagining Linear Probing: Kolmogorov-Arnold Networks in Transfer Learning Sheng Shen et.al. 2409.07763 null
2024-09-12 Transfer Learning Applied to Computer Vision Problems: Survey on Current Progress, Limitations, and Opportunities Aaryan Panda et.al. 2409.07736 null
2024-09-17 Music auto-tagging in the long tail: A few-shot approach T. Aleksandra Ma et.al. 2409.07730 null
2024-09-11 Deep Neural Network-Based Sign Language Recognition: A Comprehensive Approach Using Transfer Learning with Explainability A. E. M Ridwan et.al. 2409.07426 null
2024-09-11 Deep Learning Techniques for Hand Vein Biometrics: A Comprehensive Review Mustapha Hemis et.al. 2409.07128 null
2024-09-13 A Bayesian framework for active object recognition, pose estimation and shape transfer learning through touch Haodong Zheng et.al. 2409.06912 null
2024-09-10 Adaptive Meta-Domain Transfer Learning (AMDTL): A Novel Approach for Knowledge Transfer in AI Michele Laurelli et.al. 2409.06800 link
2024-09-10 A study on Deep Convolutional Neural Networks, Transfer Learning and Ensemble Model for Breast Cancer Detection Md Taimur Ahad et.al. 2409.06699 null
2024-09-10 A comprehensive study on Blood Cancer detection and classification using Convolutional Neural Network Md Taimur Ahad et.al. 2409.06689 null
2024-09-10 Advancements in Gesture Recognition Techniques and Machine Learning for Enhanced Human-Robot Interaction: A Comprehensive Review Sajjad Hussain et.al. 2409.06503 null
2024-09-10 Inference is All You Need: Self Example Retriever for Cross-domain Dialogue State Tracking with ChatGPT Jihyun Lee et.al. 2409.06243 null
2024-09-09 Robust Real-time Segmentation of Bio-Morphological Features in Human Cherenkov Imaging during Radiotherapy via Deep Learning Shiru Wang et.al. 2409.05666 null
2024-09-09 Preparing Schrödinger cat states in a microwave cavity using a neural network Hector Hutin et.al. 2409.05557 null
2024-09-13 Federated Transfer Learning Based Cooperative Wideband Spectrum Sensing with Model Pruning Jibin Jia et.al. 2409.05462 null
2024-09-09 Sample-Efficient Bayesian Optimization with Transfer Learning for Heterogeneous Search Spaces Aryan Deshwal et.al. 2409.05325 link
2024-09-07 Collaborative Learning with Shared Linear Representations: Statistical Rates and Optimal Algorithms Xiaochun Niu et.al. 2409.04919 null
2024-09-07 Urban traffic analysis and forecasting through shared Koopman eigenmodes Chuhan Yang et.al. 2409.04728 null
2024-09-06 A Unified Framework for Cross-Domain Recommendation Jiangxia Cao et.al. 2409.04540 null
2024-09-06 Incorporating external data for analyzing randomized clinical trials: A transfer learning approach Yujia Gu et.al. 2409.04126 null
2024-09-09 AnyMatch – Efficient Zero-Shot Entity Matching with a Small Language Model Zeyu Zhang et.al. 2409.04073 link
2024-09-05 Deep Clustering of Remote Sensing Scenes through Heterogeneous Transfer Learning Isaac Ray et.al. 2409.03938 null
2024-09-05 The Role of Generative Systems in Historical Photography Management: A Case Study on Catalan Archives Èric Śanchez et.al. 2409.03911 link
2024-09-05 Threat Classification on Deployed Optical Networks Using MIMO Digital Fiber Sensing, Wavelets, and Machine Learning Khouloud Abdelli et.al. 2409.03667 null
2024-09-05 Shuffle Vision Transformer: Lightweight, Fast and Efficient Recognition of Driver Facial Expression Ibtissam Saadi et.al. 2409.03438 null
2024-09-05 Non-stationary and Sparsely-correlated Multi-output Gaussian Process with Spike-and-Slab Prior Wang Xinming et.al. 2409.03149 null
2024-09-04 Knowledge Transfer for Collaborative Misbehavior Detection in Untrusted Vehicular Environments Roshan Sedar et.al. 2409.02844 null
2024-09-04 iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation Hayeon Jo et.al. 2409.02838 null
2024-09-04 Regularized Multi-output Gaussian Convolution Process with Domain Adaptation Wang Xinming et.al. 2409.02778 null
2024-09-04 A design of magnetic tunnel junctions for the deployment of neuromorphic hardware for edge computing Davi Rodrigues et.al. 2409.02528 null
2024-09-05 Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR Xugang Lu et.al. 2409.02239 null
2024-09-04 When Does Visual Prompting Outperform Linear Probing for Vision-Language Models? A Likelihood Perspective Hsi-Ai Tsao et.al. 2409.01821 link
2024-09-03 METcross: A framework for short-term forecasting of cross-city metro passenger flow Wenbo Lu et.al. 2409.01515 null
2024-09-02 A multilingual training strategy for low resource Text to Speech Asma Amalas et.al. 2409.01217 null
2024-09-02 Beyond Efficiency: Molecular Data Pruning for Enhanced Generalization Dingshuo Chen et.al. 2409.01081 null
2024-09-01 Equitable Skin Disease Prediction Using Transfer Learning and Domain Adaptation Sajib Acharjee Dip et.al. 2409.00873 null
2024-09-01 Multiscale Color Guided Attention Ensemble Classifier for Age-Related Macular Degeneration using Concurrent Fundus and Optical Coherence Tomography Images Pragya Gupta et.al. 2409.00718 null
2024-08-31 Comparative Analysis of Modality Fusion Approaches for Audio-Visual Person Identification and Verification Aref Farhadipour et.al. 2409.00562 null
2024-08-31 Foundations of Multivariate Distributional Reinforcement Learning Harley Wiltzer et.al. 2409.00328 null
2024-08-30 Self-Supervised Learning for Building Robust Pediatric Chest X-ray Classification Models Sheng Cheng et.al. 2409.00231 null
2024-08-30 Transfer Learning Based Hybrid Quantum Neural Network Model for Surface Anomaly Detection Sounak Bhowmik et.al. 2409.00228 null
2024-09-02 Disease Classification and Impact of Pretrained Deep Convolution Neural Networks on Diverse Medical Imaging Datasets across Imaging Modalities Jutika Borah et.al. 2408.17011 null
2024-08-30 Contrastive Learning with Synthetic Positives Dewen Zeng et.al. 2408.16965 link
2024-08-30 An Empirical Study of Scaling Laws for Transfer Matthew Barnett et.al. 2408.16947 null
2024-08-29 Comparative Analysis of Transfer Learning Models for Breast Cancer Classification Sania Eskandari et.al. 2408.16859 link
2024-08-29 CNN Based Detection of Cardiovascular Diseases from ECG Images Irem Sayin et.al. 2408.16800 null
2024-08-29 Data Quality Monitoring through Transfer Learning on Anomaly Detection for the Hadron Calorimeters Mulugeta Weldezgina Asres et.al. 2408.16612 null
2024-08-29 On Transfer Learning for a Fully Convolutional Deep Neural SIMO Receiver Uyoata E. Uyoata et.al. 2408.16401 null
2024-08-29 Efficient Transfer Learning Framework for Cross-Domain Click-Through Rate Prediction Qi Liu et.al. 2408.16238 null
2024-08-29 A More Unified Theory of Transfer Learning Steve Hanneke et.al. 2408.16189 null
2024-08-28 Q-MRS: A Deep Learning Framework for Quantitative Magnetic Resonance Spectra Analysis Christopher J. Wu et.al. 2408.15999 null
2024-08-28 Auxiliary Input in Training: Incorporating Catheter Features into Deep Learning Models for ECG-Free Dynamic Coronary Roadmapping Yikang Liu et.al. 2408.15947 null
2024-08-28 Emulating Brain-like Rapid Learning in Neuromorphic Edge Computing Kenneth Stewart et.al. 2408.15800 link
2024-08-28 Transfer Learning from Simulated to Real Scenes for Monocular 3D Object Detection Sondos Mohamed et.al. 2408.15637 null
2024-08-27 Advancing Adversarial Suffix Transfer Learning on Aligned Large Language Models Hongfu Liu et.al. 2408.14866 link
2024-08-27 GeoTransfer : Generalizable Few-Shot Multi-View Reconstruction via Transfer Learning Shubhendu Jena et.al. 2408.14724 null
2024-08-26 Comparative Analysis: Violence Recognition from Videos using Transfer Learning Dursun Dashdamirov et.al. 2408.14659 link
2024-08-23 Knowledge Graph Modeling-Driven Large Language Model Operating System (LLM OS) for Task Automation in Process Engineering Problem-Solving Sakhinana Sagar Srinivas et.al. 2408.14494 null
2024-08-26 Model Parallel Training and Transfer Learning for Convolutional Neural Networks by Domain Decomposition Axel Klawonn et.al. 2408.14442 null
2024-08-26 Application of Neural Ordinary Differential Equations for ITER Burning Plasma Dynamics Zefang Liu et.al. 2408.14404 link
2024-08-26 Histology Virtual Staining with Mask-Guided Adversarial Transfer Learning for Tertiary Lymphoid Structure Detection Qiuli Wang et.al. 2408.13978 null
2024-08-24 Advancing Gamma-Ray Burst Identification through Transfer Learning with Convolutional Neural Networks Peng Zhang et.al. 2408.13598 null
2024-08-24 Optimal Layer Selection for Latent Data Augmentation Tomoumi Takase et.al. 2408.13426 null
2024-08-23 Enhancing Few-Shot Transfer Learning with Optimized Multi-Task Prompt Tuning through Modular Prompt Composition Ahmad Pouramini et.al. 2408.13227 null
2024-08-23 Deep Learning for Lung Disease Classification Using Transfer Learning and a Customized CNN Architecture with Attention Xiaoyi Liu et.al. 2408.13180 null
2024-08-22 Time series forecasting of multiphase microstructure evolution using deep learning Saurabh Tiwari et.al. 2408.13111 null
2024-08-23 A cost-effective strategy of enhancing machine learning potentials by transfer learning from a multicomponent dataset on ænet-PyTorch An Niza El Aisnadaa et.al. 2408.12939 null
2024-08-23 Efficient Training Approaches for Performance Anomaly Detection Models in Edge Computing Environments Duneesha Fernando et.al. 2408.12855 null
2024-08-23 Underwater SONAR Image Classification and Analysis using LIME-based Explainable Artificial Intelligence Purushothaman Natarajan et.al. 2408.12837 link
2024-08-22 Enhanced Infield Agriculture with Interpretable Machine Learning Approaches for Crop Classification Sudi Murindanyi et.al. 2408.12426 null
2024-08-22 Modularized data-driven approximation of the Koopman operator and generator Yang Guo et.al. 2408.12277 null
2024-08-22 Accounts of using the Tustin-Net architecture on a rotary inverted pendulum Stijn van Esch et.al. 2408.12266 link
2024-08-23 Enhanced Fine-Tuning of Lightweight Domain-Specific Q&A Model Based on Large Language Models Shenglin Zhang et.al. 2408.12247 link
2024-08-21 Defining Boundaries: The Impact of Domain Specification on Cross-Language and Cross-Domain Transfer in Machine Translation Lia Shahnazaryan et.al. 2408.11926 null
2024-08-19 Parameter-Efficient Transfer Learning under Federated Learning for Automatic Speech Recognition Xuan Kan et.al. 2408.11873 null
2024-08-21 Embedding Ordinality to Binary Loss Function for Improving Solar Flare Forecasting Chetraj Pandey et.al. 2408.11768 link
2024-08-21 Transfer Learning and the Early Estimation of Single-Photon Source Quality using Machine Learning Methods David Jacob Kedziora et.al. 2408.11322 link
2024-08-21 RedWhale: An Adapted Korean LLM Through Efficient Continual Pretraining Anh-Dung Vo et.al. 2408.11294 null
2024-08-20 Multichannel Attention Networks with Ensembled Transfer Learning to Recognize Bangla Handwritten Charecter Farhanul Haque et.al. 2408.10955 null
2024-08-20 The Evolution of Reinforcement Learning in Quantitative Finance Nikolaos Pippas et.al. 2408.10932 null
2024-08-20 ViLReF: A Chinese Vision-Language Retinal Foundation Model Shengzhu Yang et.al. 2408.10894 link
2024-08-20 TDS-CLIP: Temporal Difference Side Network for Image-to-Video Transfer Learning Bin Wang et.al. 2408.10688 link
2024-08-20 Multi-Attribute Preferences: A Transfer Learning Approach Sjoerd Hermes et.al. 2408.10558 null
2024-08-20 Transfer Operator Learning with Fusion Frame Haoyang Jiang et.al. 2408.10458 null
2024-08-23 Advancing Voice Cloning for Nepali: Leveraging Transfer Learning in a Low-Resource Language Manjil Karki et.al. 2408.10128 null
2024-08-19 Weakly Supervised Pretraining and Multi-Annotator Supervised Finetuning for Facial Wrinkle Detection Ik Jun Moon et.al. 2408.09952 null
2024-08-19 Electron-nucleus cross sections from transfer learning Krzysztof M. Graczyk et.al. 2408.09936 null
2024-08-19 Meta-Learning on Augmented Gene Expression Profiles for Enhanced Lung Cancer Detection Arya Hadizadeh Moghaddam et.al. 2408.09635 link
2024-08-18 CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination Kaicheng Yang et.al. 2408.09441 null
2024-08-16 GLANCE: Graph-based Learnable Digital Twin for Communication Networks Boning Li et.al. 2408.09040 null
2024-08-16 AdaRank: Disagreement Based Module Rank Prediction for Low-rank Adaptation Yihe Dong et.al. 2408.09015 link
2024-08-16 A Multi-Task and Multi-Label Classification Model for Implicit Discourse Relation Recognition Nelson Filipe Costa et.al. 2408.08971 null
2024-08-16 CAT: Caution Aware Transfer in Reinforcement Learning via Distributional Risk Mohamad Fares El Hajj Chehade et.al. 2408.08812 null
2024-08-16 Tuning a SAM-Based Model with Multi-Cognitive Visual Adapter to Remote Sensing Instance Segmentation Linghao Zheng et.al. 2408.08576 null
2024-08-16 Unsupervised Transfer Learning via Adversarial Contrastive Training Chenguang Duan et.al. 2408.08533 link
2024-08-16 Inverse design with conditional cascaded diffusion models Milad Habibi et.al. 2408.08526 null
2024-08-16 Enhancement of price trend trading strategies via image-induced importance weights Zhoufan Zhu et.al. 2408.08483 link
2024-08-15 Training Spatial-Frequency Visual Prompts and Probabilistic Clusters for Accurate Black-Box Transfer Learning Wonwoo Cho et.al. 2408.07944 null
2024-08-14 MeerKAT reveals a ghostly thermal radio ring towards the Galactic Centre C. Bordiu et.al. 2408.07727 null
2024-08-14 PolyCL: Contrastive Learning for Polymer Representation Learning via Explicit and Implicit Augmentations Jiajun Zhou et.al. 2408.07556 link
2024-08-20 Surrogate-Assisted Search with Competitive Knowledge Transfer for Expensive Optimization Xiaoming Xue et.al. 2408.07176 link
2024-08-13 Object Tracking Incorporating Transfer Learning into Unscented and Cubature Kalman Filters Omar Alotaibi et.al. 2408.07157 null
2024-08-12 A Unified Manifold Similarity Measure Enhancing Few-Shot, Transfer, and Reinforcement Learning in Manifold-Distributed Datasets Sayed W Qayyumi et.al. 2408.07095 null
2024-08-07 Anatomical Foundation Models for Brain MRIs Carlo Alberto Barbano et.al. 2408.07079 link
2024-08-13 Approaches for enhancing extrapolability in process-based and data-driven models in hydrology Haiyang Shi et.al. 2408.07071 null
2024-08-20 Spectrum Prediction With Deep 3D Pyramid Vision Transformer Learning Guangliang Pan et.al. 2408.06870 link
2024-08-12 InfLocNet: Enhanced Lung Infection Localization and Disease Detection from Chest X-Ray Images Using Lightweight Deep Learning Md. Asiful Islam Miah et.al. 2408.06459 null
2024-08-12 Wireless Channel Aware Data Augmentation Methods for Deep Leaning-Based Indoor Localization Omer Gokalp Serbetci et.al. 2408.06452 null
2024-08-12 Transfer learning of state-based potential games for process optimization in decentralized manufacturing systems Steve Yuwono et.al. 2408.05992 null
2024-08-09 ECG-FM: An Open Electrocardiogram Foundation Model Kaden McKeen et.al. 2408.05178 link
2024-08-08 Segmentation of Mental Foramen in Orthopantomographs: A Deep Learning Approach Haider Raza et.al. 2408.04763 null
2024-08-08 Hybrid Quantum-Classical Neural Networks for Downlink Beamforming Optimization Juping Zhang et.al. 2408.04747 null
2024-08-08 Modelling parametric uncertainty in PDEs models via Physics-Informed Neural Networks Milad Panahi et.al. 2408.04690 null
2024-08-08 Model-Based Transfer Learning for Contextual Reinforcement Learning Jung-Hoon Cho et.al. 2408.04498 link
2024-08-08 Deep Transfer Learning for Kidney Cancer Diagnosis Yassine Habchi et.al. 2408.04318 null
2024-08-07 Scaling Law of Sim2Real Transfer Learning in Expanding Computational Materials Databases for Real-World Predictions Shunya Minami et.al. 2408.04042 null
2024-08-06 An Interactive Augmented Reality Interface for Personalized Proxemics Modeling Massimiliano Nigro et.al. 2408.03453 null
2024-08-05 Quantum Transfer Learning for MNIST Classification Using a Hybrid Quantum-Classical Approach Soumyadip Sarkar et.al. 2408.03351 null
2024-08-06 LLaVA-OneVision: Easy Visual Task Transfer Bo Li et.al. 2408.03326 link
2024-08-06 Segment Anything in Medical Images and Videos: Benchmark and Deployment Jun Ma et.al. 2408.03322 link
2024-08-06 Fast Whole-Brain MR Multi-Parametric Mapping with Scan-Specific Self-Supervised Networks Amir Heydari et.al. 2408.02988 null
2024-08-05 FPT+: A Parameter and Memory Efficient Transfer Learning Method for High-resolution Medical Image Classification Yijin Huang et.al. 2408.02426 link
2024-08-05 FE-Adapter: Adapting Image-based Emotion Classifiers to Videos Shreyank N Gowda et.al. 2408.02421 null
2024-08-05 Dialogue Ontology Relation Extraction via Constrained Chain-of-Thought Decoding Renato Vukovic et.al. 2408.02361 null
2024-08-05 Machine Learning Applications in Medical Prognostics: A Comprehensive Review Michael Fascia et.al. 2408.02344 null
2024-08-05 Synergistic Learning with Multi-Task DeepONet for Efficient PDE Problem Solving Varun Kumar et.al. 2408.02198 link
2024-08-04 Graph-Enabled Fast MCMC Sampling with an Unknown High-Dimensional Prior Distribution Chenyang Zhong et.al. 2408.02122 link
2024-08-04 DNA-SE: Towards Deep Neural-Nets Assisted Semiparametric Estimation Qinshuo Liu et.al. 2408.02045 link
2024-08-04 Unsupervised Representation Learning by Balanced Self Attention Matching Daniel Shalam et.al. 2408.02014 link
2024-08-04 AdaCBM: An Adaptive Concept Bottleneck Model for Explainable and Accurate Diagnosis Townim F. Chowdhury et.al. 2408.02001 link
2024-08-06 Sharpness-Aware Cross-Domain Recommendation to Cold-Start Users Guohang Zeng et.al. 2408.01931 null
2024-08-02 PiCoGen2: Piano cover generation with transfer learning approach and weakly aligned data Chih-Pin Tan et.al. 2408.01551 null
2024-08-02 Analyzing LLMs’ Capabilities to Establish Implicit User Sentiment of Software Desirability Sherri Weitl-Harms et.al. 2408.01527 null
2024-08-02 IAI Group at CheckThat! 2024: Transformer Models and Data Augmentation for Checkworthy Claim Detection Peter Røysland Aarnes et.al. 2408.01118 link
2024-08-08 Cross-domain Named Entity Recognition via Graph Matching Junhao Zheng et.al. 2408.00981 null
2024-08-01 A deep learning-enabled smart garment for versatile sleep behaviour monitoring Chenyu Tang et.al. 2408.00753 null
2024-08-01 Accelerating Full Waveform Inversion By Transfer Learning Divya Shyam Singh et.al. 2408.00695 null
2024-08-03 Scaling Backwards: Minimal Synthetic Pre-training? Ryo Nakamura et.al. 2408.00677 link
2024-08-01 Efficient Patient Fine-Tuned Seizure Detection with a Tensor Kernel Machine Seline J. S. de Rooij et.al. 2408.00437 null
2024-08-01 Provably Efficient Adiabatic Learning for Quantum-Classical Dynamics Changnan Peng et.al. 2408.00276 null
2024-07-31 Leveraging Self-Supervised Learning for Fetal Cardiac Planes Classification using Ultrasound Scan Videos Joseph Geo Benjamin et.al. 2407.21738 null
2024-07-31 Shape-restricted transfer learning analysis for generalized linear regression model Pengfei Li et.al. 2407.21682 null
2024-07-31 An Explainable Vision Transformer with Transfer Learning Combined with Support Vector Machine Based Efficient Drought Stress Identification Aswini Kumar Patra et.al. 2407.21666 null
2024-07-31 Accurate Tunneling Splittings for Ever-Larger Molecules from Transfer-Learned, CCSD(T) Quality Energy Functions Silvan Käser et.al. 2407.21366 null
2024-07-30 Domain Shift Analysis in Chest Radiographs Classification in a Veterans Healthcare Administration Population Mayanka Chandrashekar et.al. 2407.21149 null
2024-07-30 Transfer Learning for Multi-material Classification of Transition Metal Dichalcogenides with Atomic Force Microscopy Isaiah A. Moses et.al. 2407.20975 null
2024-07-30 Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning Norman Di Palo et.al. 2407.20798 null
2024-07-30 Image-based Detection of Segment Misalignment in Multi-mirror Satellites using Transfer Learning C. Tanner Fredieu et.al. 2407.20582 null
2024-07-30 DuA: Dual Attentive Transformer in Long-Term Continuous EEG Emotion Analysis Yue Pan et.al. 2407.20519 null
2024-07-26 Robust and Efficient Transfer Learning via Supernet Transfer in Warm-started Neural Architecture Search Prabhant Singh et.al. 2407.20279 null
2024-07-29 Enhancing Anti-spoofing Countermeasures Robustness through Joint Optimization and Transfer Learning Yikang Wang et.al. 2407.20111 null
2024-07-29 Transfer Learning Targeting Mixed Population: A Distributional Robust Perspective Keyao Zhan et.al. 2407.20073 null
2024-07-29 ProRuka: A highly efficient HMI algorithm for controlling a novel prosthetic hand with 6-DOF using sonomyography Vaheh Nazari et.al. 2407.19859 null
2024-07-29 Online Multi-Source Domain Adaptation through Gaussian Mixtures and Dataset Dictionary Learning Eduardo Fernandes Montesuma et.al. 2407.19853 null
2024-07-29 Unmasking unlearnable models: a classification challenge for biomedical images without visible cues Shivam Kumar et.al. 2407.19773 null
2024-07-28 Deep Generative Models-Assisted Automated Labeling for Electron Microscopy Images Segmentation Wenhao Yuan et.al. 2407.19544 link
2024-07-25 Adapting Mouse Pathological Model to Human Glomerular Lesion Segmentation Lining Yu et.al. 2407.18390 null
2024-07-25 Detection of manatee vocalisations using the Audio Spectrogram Transformer Stefano Schiappacasse et.al. 2407.18083 link
2024-07-25 Difficulty Estimation and Simplification of French Text Using LLMs Henri Jamet et.al. 2407.18061 null
2024-07-26 Exploring the Effect of Dataset Diversity in Self-Supervised Learning for Surgical Computer Vision Tim J. M. Jaspers et.al. 2407.17904 link
2024-07-25 Advancing 3D Point Cloud Understanding through Deep Transfer Learning: A Comprehensive Survey Shahab Saquib Sohail et.al. 2407.17877 null
2024-07-25 Innovative Speech-Based Deep Learning Approaches for Parkinson’s Disease Classification: A Systematic Review Lisanne van Gelderen et.al. 2407.17844 null
2024-07-25 How Lightweight Can A Vision Transformer Be Jen Hong Tan et.al. 2407.17783 null
2024-07-24 Traditional Methods Outperform Generative LLMs at Forecasting Credit Ratings Felix Drinkall et.al. 2407.17624 link
2024-07-24 Wavelet-based Autoencoder and EfficientNet for Schizophrenia Detection from EEG Signals Umesh Kumar Naik M et.al. 2407.17540 null
2024-07-24 Federated Automatic Latent Variable Selection in Multi-output Gaussian Processes Jingyi Gao et.al. 2407.16935 null
2024-07-24 Cross-Domain Policy Transfer by Representation Alignment via Multi-Domain Behavioral Cloning Hayato Watahiki et.al. 2407.16912 link
2024-07-23 AbdomenAtlas: A Large-Scale, Detailed-Annotated, & Multi-Center Dataset for Efficient Transfer Learning and Open Algorithmic Benchmarking Wenxuan Li et.al. 2407.16697 link
2024-07-23 Towards scalable efficient on-device ASR with transfer learning Laxmi Pandey et.al. 2407.16664 null
2024-07-23 EffiSegNet: Gastrointestinal Polyp Segmentation through a Pre-Trained EfficientNet-based Network with a Simplified Decoder Ioannis A. Vezakis et.al. 2407.16298 link
2024-07-23 Exploring the Effectiveness and Consistency of Task Selection in Intermediate-Task Transfer Learning Pin-Jie Lin et.al. 2407.16245 link
2024-07-23 ODGR: Online Dynamic Goal Recognition Matan Shamir et.al. 2407.16220 null
2024-07-20 Enhancing Wildfire Forecasting Through Multisource Spatio-Temporal Data, Deep Learning, Ensemble Models and Transfer Learning Ayoub Jadouli et.al. 2407.15878 null
2024-07-22 Reconstructing Training Data From Real World Models Trained with Transfer Learning Yakir Oz et.al. 2407.15845 null
2024-07-22 TreeSBA: Tree-Transformer for Self-Supervised Sequential Brick Assembly Mengqi Guo et.al. 2407.15648 link
2024-07-22 Affordance Labeling and Exploration: A Manifold-Based Approach İsmail Özçil et.al. 2407.15479 null
2024-07-21 Practical multi-fidelity machine learning: fusion of deterministic and Bayesian models Jiaxiang Yi et.al. 2407.15110 link
2024-07-20 Enhancing Skin Disease Classification Leveraging Transformer-based Deep Learning Architectures and Explainable AI Jayanth Mohan et.al. 2407.14757 null
2024-07-19 A Comparative Study of Transfer Learning for Emotion Recognition using CNN and Modified VGG16 Models Samay Nathani et.al. 2407.14576 null
2024-07-22 Vision-Based Power Line Cables and Pylons Detection for Low Flying Aircrafts Jakub Gwizdała et.al. 2407.14352 null
2024-07-19 Quantifying the value of positive transfer: An experimental case study Aidan J. Hughes et.al. 2407.14342 null
2024-07-19 Straightforward Layer-wise Pruning for More Efficient Visual Adaptation Ruizi Han et.al. 2407.14330 null
2024-07-23 Dyn-Adapter: Towards Disentangled Representation for Efficient Visual Recognition Yurong Zhang et.al. 2407.14302 null
2024-07-19 Enhancing Data-Limited Graph Neural Networks by Actively Distilling Knowledge from Large Language Models Quan Li et.al. 2407.13989 null
2024-07-18 PowerTrain: Fast, Generalizable Time and Power Prediction Models to Optimize DNN Training on Accelerated Edges Prashanthi S. K. et.al. 2407.13944 null
2024-07-18 Semi-Supervised Contrastive Learning of Musical Representations Julien Guinot et.al. 2407.13840 link
2024-07-18 AROhI: An Interactive Tool for Estimating ROI of Data Analytics Noopur Zambar et.al. 2407.13839 null
2024-07-18 Are We Ready for Out-of-Distribution Detection in Digital Pathology? Ji-Hun Oh et.al. 2407.13708 null
2024-07-17 On Initializing Transformers with Pre-trained Embeddings Ha Young Kim et.al. 2407.12514 null
2024-07-16 Novel Artistic Scene-Centric Datasets for Effective Transfer Learning in Fragrant Spaces Shumei Liu et.al. 2407.11701 null
2024-07-16 Green Resource Allocation in Cloud-Native O-RAN Enabled Small Cell Networks Rana M. Sohaib et.al. 2407.11563 null
2024-07-16 Genomic Language Models: Opportunities and Challenges Gonzalo Benegas et.al. 2407.11435 null
2024-07-16 MRIo3DS-Net: A Mutually Reinforcing Images to 3D Surface RNN-like framework for model-adaptation indoor 3D reconstruction Chang Li et.al. 2407.11431 null
2024-07-16 Exploring connections of spectral analysis and transfer learning in medical imaging Yucheng Lu et.al. 2407.11379 null
2024-07-19 LoRA-PT: Low-Rank Adapting UNETR for Hippocampus Segmentation Using Principal Tensor Singular Values and Vectors Guanghua He et.al. 2407.11292 link
2024-07-15 Exploration in Knowledge Transfer Utilizing Reinforcement Learning Adam Jedlička et.al. 2407.10835 null
2024-07-15 Detecting Omissions in Geographic Maps through Computer Vision Phuc D. A. Nguyen et.al. 2407.10709 link
2024-07-15 Deep-Learning-Based Markerless Pose Estimation Systems in Gait Analysis: DeepLabCut Custom Training and the Refinement Function Giulia Panconi et.al. 2407.10590 null
2024-07-13 Automated detection of gibbon calls from passive acoustic monitoring data using convolutional neural networks in the “torch for R” ecosystem Dena J. Clink et.al. 2407.09976 null
2024-07-11 Improve Load Forecasting in Energy Communities through Transfer Learning using Open-Access Synthetic Profiles Lukas Moosbrugger et.al. 2407.08434 null
2024-07-11 A Cantor-Kantorovich Metric Between Markov Decision Processes with Application to Transfer Learning Adrien Banse et.al. 2407.08324 null
2024-07-11 AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization Shixiong Xu et.al. 2407.08156 link
2024-07-10 Prediction of Frequency-Dependent Optical Spectrum for Solid Materials: A Multi-Output & Multi-Fidelity Machine Learning Approach Akram Ibrahim et.al. 2407.07736 null
2024-07-10 SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning Haiwen Diao et.al. 2407.07523 link
2024-07-10 Fine-Grained Classification for Poisonous Fungi Identification with Transfer Learning Christopher Chiu et.al. 2407.07492 link
2024-07-10 Towards a text-based quantitative and explainable histopathology image analysis Anh Tien Nguyen et.al. 2407.07360 link
2024-07-09 Estimating centrality in heavy-ion collisions using Transfer Learning technique Dipankar Basak et.al. 2407.07210 null
2024-07-09 Statistical mechanics of transfer learning in fully-connected networks in the proportional limit Alessandro Ingrosso et.al. 2407.07168 null
2024-07-14 Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer: A Disentangled Approach Taolin Zhang et.al. 2407.06964 null
2024-07-09 Spanish TrOCR: Leveraging Transfer Learning for Language Adaptation Filipe Lauar et.al. 2407.06950 link
2024-07-09 Rethinking Image-to-Video Adaptation: An Object-centric Perspective Rui Qian et.al. 2407.06871 null
2024-07-09 Robust and Explainable Framework to Address Data Scarcity in Diagnostic Imaging Zehui Zhao et.al. 2407.06566 null
2024-07-09 Using Graph Neural Networks and Frequency Domain Data for Automated Operational Modal Analysis of Populations of Structures Xudong Jian et.al. 2407.06492 link
2024-07-09 CrowdTransfer: Enabling Crowd Knowledge Transfer in AIoT Community Yan Liu et.al. 2407.06485 null
2024-07-08 Multi-Label Plant Species Classification with Self-Supervised Vision Transformers Murilo Gustineli et.al. 2407.06298 link
2024-07-08 Transfer Learning with Pseudo Multi-Label Birdcall Classification for DS@GT BirdCLEF 2024 Anthony Miyaguchi et.al. 2407.06291 link
2024-07-08 Transfer Learning with Self-Supervised Vision Transformers for Snake Identification Anthony Miyaguchi et.al. 2407.06178 link
2024-07-08 Multi-Fidelity Bayesian Neural Network for Uncertainty Quantification in Transonic Aerodynamic Loads Andrea Vaiuso et.al. 2407.05684 null
2024-07-08 An Experimental Comparison of Transfer Learning against Self-supervised Learning Zehui Zhao et.al. 2407.05592 null
2024-07-09 CBM: Curriculum by Masking Andrei Jarca et.al. 2407.05193 link
2024-07-06 Recent Advancements and Challenges of Turkic Central Asian Language Processing Yana Veitsman et.al. 2407.05006 null
2024-07-05 Improving Knowledge Distillation in Transfer Learning with Layer-wise Learning Rates Shirley Kokane et.al. 2407.04871 null
2024-07-05 TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR Shashi Kumar et.al. 2407.04444 null
2024-07-05 Understanding the Role of Invariance in Transfer Learning Till Speicher et.al. 2407.04325 link
2024-07-05 Graph Pooling via Ricci Flow Amy Feng et.al. 2407.04236 null
2024-07-08 A Computer Vision Approach to Estimate the Localized Sea State Aleksandar Vorkapic et.al. 2407.03755 null
2024-07-04 On-Device Training Empowered Transfer Learning For Human Activity Recognition Pixi Kang et.al. 2407.03644 null
2024-07-03 Iris and Palmprint Multimodal Biometric Recognition using Novel Preactivated Inverted ResNet and Hybrid Metaheuristic Optimized DenseNet Indu Singh et.al. 2407.03498 null
2024-07-03 DACB-Net: Dual Attention Guided Compact Bilinear Convolution Neural Network for Skin Disease Classification Belal Ahmad et.al. 2407.03439 null
2024-07-03 Artificial Inductive Bias for Synthetic Tabular Data Generation in Data-Scarce Scenarios Patricia A. Apellániz et.al. 2407.03080 link
2024-07-02 MomentsNeRF: Leveraging Orthogonal Moments for Few-Shot Neural Rendering Ahmad AlMughrabi et.al. 2407.02668 null
2024-07-02 ECAT: A Entire space Continual and Adaptive Transfer Learning Framework for Cross-Domain Recommendation Chaoqun Hou et.al. 2407.02542 null
2024-07-02 AXIAL: Attention-based eXplainability for Interpretable Alzheimer’s Localized Diagnosis using 2D CNNs on 3D MRI brain scans Gabriele Lozupone et.al. 2407.02418 link
2024-07-03 MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing Shangda Wu et.al. 2407.02277 link
2024-07-02 MIREncoder: Multi-modal IR-based Pretrained Embeddings for Performance Optimizations Akash Dutta et.al. 2407.02238 null
2024-07-02 Towards Training Music Taggers on Synthetic Data Nadine Kroher et.al. 2407.02156 link
2024-07-01 Deepfake Audio Detection Using Spectrogram-based Feature and Ensemble of Deep Learning Models Lam Pham et.al. 2407.01777 null
2024-06-30 A Deep Generative Framework for Joint Households and Individuals Population Synthesis Xiao Qian et.al. 2407.01643 null
2024-07-01 Bridging the Gap: Transfer Learning from English PLMs to Malaysian English Mohan Raj Chanthran et.al. 2407.01374 null
2024-07-01 M $^2$ IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension Xuyang Liu et.al. 2407.01131 null
2024-07-01 Cross-Lingual Transfer Learning for Speech Translation Rao Ma et.al. 2407.01130 null
2024-07-01 Deep Image-to-Recipe Translation Jiangqin Ma et.al. 2407.00911 link
2024-06-30 Image Classification for Snow Detection to Improve Pedestrian Safety Ricardo de Deijn et.al. 2407.00818 null
2024-06-30 Establishing Deep InfoMax as an effective self-supervised learning methodology in materials informatics Michael Moran et.al. 2407.00671 link
2024-06-30 LegalTurk Optimized BERT for Multi-Label Text Classification and NER Farnaz Zeidi et.al. 2407.00648 null
2024-06-29 Resource Allocation and Secure Wireless Communication in the Large Model-based Mobile Edge Computing System Zefan Wang et.al. 2407.00347 null
2024-06-28 Minimax And Adaptive Transfer Learning for Nonparametric Classification under Distributed Differential Privacy Constraints Arnab Auddy et.al. 2406.20088 null
2024-06-28 Malaria Cell Detection Using Deep Neural Networks Saurabh Sawant et.al. 2406.20005 null
2024-06-28 Fine-tuning of Geospatial Foundation Models for Aboveground Biomass Estimation Michal Muszynski et.al. 2406.19888 null
2024-06-27 T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings Björn Deiseroth et.al. 2406.19223 link
2024-06-27 Towards Learning Abductive Reasoning using VSA Distributed Representations Giacomo Camposampiero et.al. 2406.19121 link
2024-07-01 RouteLLM: Learning to Route LLMs with Preference Data Isaac Ong et.al. 2406.18665 link
2024-07-01 VIPriors 4: Visual Inductive Priors for Data-Efficient Deep Learning Challenges Robert-Jan Bruintjes et.al. 2406.18176 null
2024-06-25 LABOR-LLM: Language-Based Occupational Representations with Large Language Models Tianyu Du et.al. 2406.17972 null
2024-06-25 Transfer Learning for High Dimensional Robust Regression Xiaohui Yuan et.al. 2406.17567 null
2024-06-25 Leveraging Parameter-Efficient Transfer Learning for Multi-Lingual Text-to-Speech Adaptation Yingting Li et.al. 2406.17257 null
2024-06-24 Convolutional neural network for Lyman break galaxies classification and redshift regression in DESI (Dark Energy Spectroscopic Instrument) Julien Taran et.al. 2406.16730 null
2024-06-24 Robust NLoS Localization in 5G mmWave Networks: Data-based Methods and Performance Roman Klus et.al. 2406.16519 null
2024-06-23 Accelerating Matrix Diagonalization through Decision Transformers with Epsilon-Greedy Optimization Kshitij Bhatta et.al. 2406.16191 null
2024-06-23 Evaluation and Comparison of Emotionally Evocative Image Augmentation Methods Jan Ignatowicz et.al. 2406.16187 null
2024-06-23 Federated Transfer Learning Aided Interference Classification in GNSS Signals Min Jiang et.al. 2406.16102 null
2024-06-22 Bone Fracture Classification using Transfer Learning Shyam Gupta et.al. 2406.15958 link
2024-06-21 Flat Posterior Does Matter For Bayesian Transfer Learning Sungjun Lim et.al. 2406.15664 link
2024-06-21 GOAL: A Generalist Combinatorial Optimization Agent Learner Darko Drakulic et.al. 2406.15079 link
2024-06-20 Depth $F_1$ : Improving Evaluation of Cross-Domain Text Classification by Measuring Semantic Generalizability Parker Seegmiller et.al. 2406.14695 link
2024-06-19 Modeling & Evaluating the Performance of Convolutional Neural Networks for Classifying Steel Surface Defects Nadeem Jabbar Chaudhry et.al. 2406.14583 null
2024-06-20 Robust Few-shot Transfer Learning for Knowledge Base Question Answering with Unanswerable Questions Riya Sawhney et.al. 2406.14313 null
2024-06-20 Multi-modal Transfer Learning between Biological Foundation Models Juan Jose Garau-Luis et.al. 2406.14150 null
2024-06-21 Information Guided Regularization for Fine-tuning Language Models Mandar Sharma et.al. 2406.14005 link
2024-06-20 Generalization error of min-norm interpolators in transfer learning Yanke Song et.al. 2406.13944 null
2024-06-20 Semi-supervised Regression Analysis with Model Misspecification and High-dimensional Data Ye Tian et.al. 2406.13906 null
2024-06-19 Neuro-symbolic Training for Reasoning over Spatial Language Tanawan Premsri et.al. 2406.13828 link
2024-06-19 CNN Based Flank Predictor for Quadruped Animal Species Vanessa Suessle et.al. 2406.13588 null
2024-06-19 Robust Melanoma Thickness Prediction via Deep Transfer Learning enhanced by XAI Techniques Miguel Nogales et.al. 2406.13441 null
2024-06-19 Representation Transfer Learning for Semiparametric Regression Baihua He et.al. 2406.13197 null
2024-06-19 Optimal pre-train/fine-tune strategies for accurate material property predictions Reshma Devi et.al. 2406.13142 link
2024-06-18 Skin Cancer Images Classification using Transfer Learning Techniques Md Sirajul Islam et.al. 2406.12954 null
2024-06-18 Latent Intuitive Physics: Learning to Transfer Hidden Physics from A 3D Video Xiangming Zhu et.al. 2406.12769 null
2024-06-18 BIOSCAN-5M: A Multimodal Dataset for Insect Biodiversity Zahra Gharaee et.al. 2406.12723 link
2024-06-18 Online-Adaptive Anomaly Detection for Defect Identification in Aircraft Assembly Siddhant Shete et.al. 2406.12698 null
2024-06-18 Spatial Sequence Attention Network for Schizophrenia Classification from Structural Brain MR Images Nagur Shareef Shaik et.al. 2406.12683 null
2024-06-18 Automated MRI Quality Assessment of Brain T1-weighted MRI in Clinical Data Warehouses: A Transfer Learning Approach Relying on Artefact Simulation Sophie Loizillon et.al. 2406.12448 link
2024-06-18 The Wisdom of a Crowd of Brains: A Universal Brain Encoder Roman Beliy et.al. 2406.12179 null
2024-06-17 UniGLM: Training One Unified Language Model for Text-Attributed Graphs Yi Fang et.al. 2406.12052 link
2024-06-17 Large Scale Transfer Learning for Tabular Data via Language Modeling Josh Gardner et.al. 2406.12031 link
2024-06-15 A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges Yuqi Nie et.al. 2406.11903 null
2024-06-17 Faces of Experimental Pain: Transferability of Deep Learned Heat Pain Features to Electrical Pain Pooja Prajod et.al. 2406.11808 null
2024-06-16 A Unified View of Abstract Visual Reasoning Problems Mikołaj Małkiński et.al. 2406.11068 null
2024-06-16 Generalization and Knowledge Transfer in Abstract Visual Reasoning Models Mikołaj Małkiński et.al. 2406.11061 null
2024-06-16 Physics-Informed Deep Learning and Partial Transfer Learning for Bearing Fault Diagnosis in the Presence of Highly Missing Data Mohammadreza Kavianpour et.al. 2406.11023 null
2024-06-16 ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers under Domain Shifts Samar Khanna et.al. 2406.10973 null
2024-06-16 On the Effectiveness of Supervision in Asymmetric Non-Contrastive Learning Jeongheon Oh et.al. 2406.10815 link
2024-06-16 ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation Yurun Song et.al. 2406.10785 link
2024-06-18 Augmenting Biomedical Named Entity Recognition with General-domain Resources Yu Yin et.al. 2406.10671 link
2024-06-15 ADSNet: Cross-Domain LTV Prediction with an Adaptive Siamese Network in Advertising Ruize Wang et.al. 2406.10517 null
2024-06-14 Comparison of fine-tuning strategies for transfer learning in medical image classification Ana Davila et.al. 2406.10050 null
2024-06-14 Deep Learning Models to Automate the Scoring of Hand Radiographs for Rheumatoid Arthritis Zhiyan Bo et.al. 2406.09980 null
2024-06-17 UniBridge: A Unified Approach to Cross-Lingual Transfer Learning for Low-Resource Languages Trinh Pham et.al. 2406.09717 link
2024-06-14 RASPNet: A Benchmark Dataset for Radar Adaptive Signal Processing Applications Shyam Venkatasubramanian et.al. 2406.09638 null
2024-06-14 Industrial Language-Image Dataset (ILID): Adapting Vision Foundation Models for Industrial Settings Keno Moenck et.al. 2406.09637 link
2024-06-13 Q-Mamba: On First Exploration of Vision Mamba for Image Quality Assessment Fengbin Guan et.al. 2406.09546 link
2024-06-12 Quantum Hardware-Enabled Molecular Dynamics via Transfer Learning Abid Khan et.al. 2406.08554 null
2024-06-12 Strategies for Pretraining Neural Operators Anthony Zhou et.al. 2406.08473 link
2024-06-12 PRIBOOT: A New Data-Driven Expert for Improved Driving Simulations Daniel Coelho et.al. 2406.08421 link
2024-06-12 Measuring model variability using robust non-parametric testing Sinjini Banerjee et.al. 2406.08307 null
2024-06-12 Beyond the Mean: Differentially Private Prototypes for Private Transfer Learning Dariush Wahdany et.al. 2406.08039 null
2024-06-11 Unleashing the Power of Transfer Learning Model for Sophisticated Insect Detection: Revolutionizing Insect Classification Md. Mahmudul Hasan et.al. 2406.07716 null
2024-06-11 Transferring Knowledge from Large Foundation Models to Small Downstream Models Shikai Qiu et.al. 2406.07337 null
2024-06-10 SecureNet: A Comparative Study of DeBERTa and Large Language Models for Phishing Detection Sakshi Mahendru et.al. 2406.06663 null
2024-06-10 Network-Based Transfer Learning Helps Improve Short-Term Crime Prediction Accuracy Jiahui Wu et.al. 2406.06645 null
2024-06-10 Contrastive learning of T cell receptor representations Yuta Nagano et.al. 2406.06397 link
2024-06-09 Few-Shot Load Forecasting Under Data Scarcity in Smart Grids: A Meta-Learning Approach Georgios Tsoumplekas et.al. 2406.05887 null
2024-06-09 Utilizing Grounded SAM for self-supervised frugal camouflaged human detection Matthias Pijarowski et.al. 2406.05776 null
2024-06-11 MSAGPT: Neural Prompting Protein Structure Prediction via MSA Generative Pre-Training Bo Chen et.al. 2406.05347 link
2024-06-08 Hidden Question Representations Tell Non-Factuality Within and Across Large Language Models Yanling Wang et.al. 2406.05328 null
2024-06-08 DeviceBERT: Applied Transfer Learning With Targeted Annotations and Vocabulary Enrichment to Identify Medical Device and Component Terminology in FDA Recall Summaries Miriam Farrington et.al. 2406.05307 null
2024-06-07 Accelerating evolutionary exploration through language model-based transfer learning Maximilian Reissmann et.al. 2406.05166 null
2024-06-07 Labeled Data Selection for Category Discovery Bingchen Zhao et.al. 2406.04898 null
2024-06-07 FunBO: Discovering Acquisition Functions for Bayesian Optimization with FunSearch Virginia Aglietti et.al. 2406.04824 null
2024-06-07 Low-Resource Cross-Lingual Summarization through Few-Shot Learning with Large Language Models Gyutae Park et.al. 2406.04630 null
2024-06-06 InaGVAD : a Challenging French TV and Radio Corpus Annotated for Speech Activity Detection and Speaker Gender Segmentation David Doukhan et.al. 2406.04429 link
2024-06-06 UrbanSARFloods: Sentinel-1 SLC-Based Benchmark Dataset for Urban and Open-Area Flood Mapping Jie Zhao et.al. 2406.04111 null
2024-06-06 Optimizing Multi-User Semantic Communication via Transfer Learning and Knowledge Distillation Loc X. Nguyen et.al. 2406.03773 null
2024-06-06 LLMEmbed: Rethinking Lightweight LLM’s Genuine Function in Text Classification Chun Liu et.al. 2406.03725 link
2024-06-06 Transfer Learning for Latent Variable Network Models Akhil Jalan et.al. 2406.03437 null
2024-06-08 Randomized Geometric Algebra Methods for Convex Neural Networks Yifei Wang et.al. 2406.02806 link
2024-06-04 CADE: Cosine Annealing Differential Evolution for Spiking Neural Network Runhua Jiang et.al. 2406.02349 link
2024-06-04 Towards Neural Architecture Search for Transfer Learning in 6G Networks Adam Orucu et.al. 2406.02333 null
2024-06-04 M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation Daisuke Niizumi et.al. 2406.02032 link
2024-06-04 Enhancing Trust in LLMs: Algorithms for Comparing and Interpreting LLMs Nik Bear Brown et.al. 2406.01943 null
2024-06-03 Multi-Agent Transfer Learning via Temporal Contrastive Learning Weihao Zeng et.al. 2406.01377 null
2024-06-04 Towards Practical Single-shot Motion Synthesis Konstantinos Roditakis et.al. 2406.01136 null
2024-06-03 Understanding the Cross-Domain Capabilities of Video-Based Few-Shot Action Recognition Models Georgia Markham et.al. 2406.01073 null
2024-06-03 Satellites swarm cooperation for pursuit-attachment tasks with transformer-based reinforcement learning yonghao Li et.al. 2406.01061 null
2024-06-02 Phonetic Error Analysis of Raw Waveform Acoustic Models with Parametric and Non-Parametric CNNs Erfan Loweimi et.al. 2406.00898 null
2024-06-02 Using 3-D LiDAR Data for Safe Physical Human-Robot Interaction Sarthak Arora et.al. 2406.00869 null
2024-06-06 Diffusion Tuning: Transferring Diffusion Models via Chain of Forgetting Jincheng Zhong et.al. 2406.00773 null
2024-06-05 Profiled Transfer Learning for High Dimensional Linear Model Ziqian Lin et.al. 2406.00701 null
2024-05-29 On the Condition Monitoring of Bolted Joints through Acoustic Emission and Deep Transfer Learning: Generalization, Ordinal Loss and Super-Convergence Emmanuel Ramasso et.al. 2405.20887 null
2024-05-30 Learning 3D Robotics Perception using Inductive Priors Muhammad Zubair Irshad et.al. 2405.20364 null
2024-05-30 Who Writes the Review, Human or AI? Panagiotis C. Theocharopoulos et.al. 2405.20285 null
2024-05-30 Image-to-Joint Inverse Kinematic of a Supportive Continuum Arm Using Deep Learning Shayan Sepahvand et.al. 2405.20248 null
2024-05-30 Federated and Transfer Learning for Cancer Detection Based on Image Analysis Amine Bechar et.al. 2405.20126 null
2024-05-30 Chemical Space-Informed Machine Learning Models for Rapid Predictions of X-ray Photoelectron Spectra of Organic Molecules Susmita Tripathy et.al. 2405.20033 link
2024-05-30 Breaking Indistinguishability with Transfer Learning: A First Look at SPECK32/64 Lightweight Block Ciphers Jimmy Dani et.al. 2405.19683 null
2024-05-30 Few-shot fault diagnosis based on multi-scale graph convolution filtering for industry Mengjie Gan et.al. 2405.19642 null
2024-05-30 Source Code Foundation Models are Transferable Binary Analysis Knowledge Bases Zian Su et.al. 2405.19581 link
2024-05-29 MDS-ViTNet: Improving saliency prediction for Eye-Tracking with Vision Transformer Polezhaev Ignat et.al. 2405.19501 link
2024-05-29 RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter Meng Cao et.al. 2405.19465 null
2024-05-29 Domain adaptation in small-scale and heterogeneous biological datasets Seyedmehdi Orouji et.al. 2405.19221 null
2024-05-28 Recent Advances of Foundation Language Models-based Continual Learning: A Survey Yutao Yang et.al. 2405.18653 null
2024-05-28 Transfer Learning for Emulating Ocean Climate Variability across $CO_2$ forcing Surya Dheeshjith et.al. 2405.18585 null
2024-05-28 Deep Learning-based Epicenter Localization using Single-Station Strong Motion Records Melek Türkmen et.al. 2405.18451 null
2024-05-28 Adaptive Multiscale Retinal Diagnosis: A Hybrid Trio-Model Approach for Comprehensive Fundus Multi-Disease Detection Leveraging Transfer Learning and Siamese Networks Yavuz Selim Inan et.al. 2405.18449 null
2024-05-28 A Review and Implementation of Object Detection Models and Optimizations for Real-time Medical Mask Detection during the COVID-19 Pandemic Ioanna Gogou et.al. 2405.18387 link
2024-05-28 An adaptive transfer learning perspective on classification in non-stationary environments Henry W J Reeve et.al. 2405.18091 null
2024-05-28 A Survey of Latent Factor Models in Recommender Systems Hind I. Alshbanat et.al. 2405.18068 null
2024-05-28 MultiADE: A Multi-domain Benchmark for Adverse Drug Event Extraction Xiang Dai et.al. 2405.18015 null
2024-05-28 Self-supervised Pre-training for Transferable Multi-modal Perception Xiaohao Xu et.al. 2405.17942 link
2024-05-28 Cost-Sensitive Multi-Fidelity Bayesian Optimization with Transfer of Learning Curve Extrapolation Dong Bok Lee et.al. 2405.17918 null
2024-05-28 Gradually Vanishing Gap in Prototypical Network for Unsupervised Domain Adaptation Shanshan Wang et.al. 2405.17774 null
2024-05-27 Flow control of three-dimensional cylinders transitioning to turbulence via multi-agent reinforcement learning P. Suárez et.al. 2405.17210 null
2024-05-27 Harnessing the Power of Vicinity-Informed Analysis for Classification under Covariate Shift Mitsuhiro Fujikawa et.al. 2405.16906 null
2024-05-28 Transfer Learning for Diffusion Models Yidong Ouyang et.al. 2405.16876 null
2024-05-27 Enhancing Accuracy in Generative Models via Knowledge Transfer Xinyu Tian et.al. 2405.16837 null
2024-05-27 Dual-State Personalized Knowledge Tracing with Emotional Incorporation Shanshan Wang et.al. 2405.16799 null
2024-05-26 Transfer Learning Under High-Dimensional Graph Convolutional Regression Model for Node Classification Jiachen Chen et.al. 2405.16672 null
2024-05-26 Mixture of Experts Using Tensor Products Zhan Su et.al. 2405.16671 link
2024-05-26 Acceleration of Grokking in Learning Arithmetic Operations via Kolmogorov-Arnold Representation Yeachan Park et.al. 2405.16658 null
2024-05-26 From Macro to Micro: Boosting micro-expression recognition via pre-training on macro-expression videos Hanting Li et.al. 2405.16451 null
2024-05-26 Daily Physical Activity Monitoring – Adaptive Learning from Multi-source Motion Sensor Data Haoting Zhang et.al. 2405.16395 null
2024-05-25 LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters Xinyu Zhou et.al. 2405.16287 link
2024-05-25 Generation of synthetic data using breast cancer dataset and classification with resnet18 Dilsat Berin Aytar et.al. 2405.16286 null
2024-05-25 Transfer learning in predicting quantum many-body dynamics: from physical observables to entanglement entropy Philipp Schmidt et.al. 2405.16254 null
2024-05-25 A statistical framework for weak-to-strong generalization Seamus Somerstep et.al. 2405.16236 null
2024-05-24 Disease-informed Adaptation of Vision-Language Models Jiajin Zhang et.al. 2405.15728 link
2024-05-28 The Impact of Geometric Complexity on Neural Collapse in Transfer Learning Michael Munn et.al. 2405.15706 null
2024-05-24 Transfer Learning with Informative Priors: Simple Baselines Better than Previously Reported Ethan Harvey et.al. 2405.15583 link
2024-05-24 Unsteady aerodynamic prediction using limited samples based on transfer learning Wen Ji et.al. 2405.15470 null
2024-05-24 Environment Sensing-aided Beam Prediction with Transfer Learning for Smart Factory Yuan Feng et.al. 2405.15339 null
2024-05-24 Detection and Positive Reconstruction of Cognitive Distortion sentences: Mandarin Dataset and Evaluation Shuya Lin et.al. 2405.15334 link
2024-05-23 Deep learning lattice gauge theories Anuj Apte et.al. 2405.14830 null
2024-05-23 Implicit In-context Learning Zhuowei Li et.al. 2405.14660 link
2024-05-23 SolNet: Open-source deep learning models for photovoltaic power forecasting across the globe Joris Depoortere et.al. 2405.14472 null
2024-05-23 Combining Denoising Autoencoders with Contrastive Learning to fine-tune Transformer Models Alejo Lopez-Avila et.al. 2405.14437 link
2024-05-22 Just rotate it! Uncertainty estimation in closed-source models via multiple queries Konstantinos Pitas et.al. 2405.13864 null
2024-05-22 Multi-Dataset Multi-Task Learning for COVID-19 Prognosis Filippo Ruffini et.al. 2405.13771 null
2024-05-22 Transfer of Safety Controllers Through Learning Deep Inverse Dynamics Model Alireza Nadali et.al. 2405.13735 null
2024-05-22 Identifying type II quasars at intermediate redshift with few-shot learning photometric classification P. A. C. Cunha et.al. 2405.13650 link
2024-05-22 Dynamically enhanced static handwriting representation for Parkinson’s disease detection Moises Diaz et.al. 2405.13438 null
2024-05-22 Boosted Neural Decoders: Achieving Extreme Reliability of LDPC Codes for 6G Networks Hee-Youl Kwak et.al. 2405.13413 link
2024-05-22 Accelerated Evaluation of Ollivier-Ricci Curvature Lower Bounds: Bridging Theory and Computation Wonwoo Kang et.al. 2405.13302 null
2024-05-22 Traffic control using intelligent timing of traffic lights with reinforcement learning technique and real-time processing of surveillance camera images Mahdi Jamebozorg et.al. 2405.13256 null
2024-05-21 Transfer Learning Approach for Railway Technical Map (RTM) Component Identification Obadage Rochana Rumalshan et.al. 2405.13229 null
2024-05-21 Accelerating Resonance Searches via Signature-Oriented Pre-training Congqiao Li et.al. 2405.12972 null
2024-05-21 Prompt-Enhanced Spatio-Temporal Graph Transfer Learning Junfeng Hu et.al. 2405.12452 link
2024-05-15 Fully Distributed Fog Load Balancing with Multi-Agent Reinforcement Learning Maad Ebrahim et.al. 2405.12236 null
2024-05-20 Modeling citation worthiness by using attention-based bidirectional long short-term memory networks and interpretable models Tong Zeng et.al. 2405.12206 link
2024-05-20 Towards Graph Contrastive Learning: A Survey and Beyond Wei Ju et.al. 2405.11868 null
2024-05-20 Transfer Learning for CSI-based Positioning with Multi-environment Meta-learning Anastasios Foliadis et.al. 2405.11816 null
2024-05-20 Foundation Model for Chemical Process Modeling: Meta-Learning with Physics-Informed Adaptation Zihao Wang et.al. 2405.11752 link
2024-05-19 Computer Vision in the Food Industry: Accurate, Real-time, and Automatic Food Recognition with Pretrained MobileNetV2 Shayan Rokhva et.al. 2405.11621 null
2024-05-19 Learning More Generalized Experts by Merging Experts in Mixture-of-Experts Sejik Park et.al. 2405.11530 null
2024-05-17 Probabilistic transfer learning methodology to expedite high fidelity simulation of reactive flows Bruno S. Soriano et.al. 2405.10944 null
2024-05-17 Multicenter Privacy-Preserving Model Training for Deep Learning Brain Metastases Autosegmentation Yixing Huang et.al. 2405.10870 link
2024-05-17 DeepPavlov at SemEval-2024 Task 8: Leveraging Transfer Learning for Detecting Boundaries of Machine-Generated Texts Anastasia Voznyuk et.al. 2405.10629 link
2024-05-17 Dynamic data sampler for cross-language transfer learning in large language models Yudong Li et.al. 2405.10626 link
2024-05-16 Continuous Transfer Learning for UAV Communication-aware Trajectory Design Chenrui Sun et.al. 2405.10087 null
2024-05-16 Monaural speech enhancement on drone via Adapter based transfer learning Xingyu Chen et.al. 2405.10022 null
2024-05-16 A Unified Deep Transfer Learning Model for Accurate IoT Localization in Diverse Environments Abdullahi Isa Ahmed et.al. 2405.09960 null
2024-05-16 Confidence Estimation in Unsupervised Deep Change Vector Analysis Sudipan Saha et.al. 2405.09896 null
2024-05-15 SA-FedLora: Adaptive Parameter Allocation for Efficient Federated Learning with LoRA Tuning Yuning Yang et.al. 2405.09394 null
2024-05-15 Transfer Learning in Pre-Trained Large Language Models for Malware Detection Based on System Calls Pedro Miguel Sánchez Sánchez et.al. 2405.09318 null
2024-05-15 Deep Learning in Earthquake Engineering: A Comprehensive Review Yazhou Xie et.al. 2405.09021 null
2024-05-15 Feature-based Federated Transfer Learning: Communication Efficiency, Robustness and Privacy Feng Wang et.al. 2405.09014 link
2024-05-16 Neural Collapse Meets Differential Privacy: Curious Behaviors of NoisyGD with Near-perfect Representation Learning Chendi Wang et.al. 2405.08920 null
2024-05-14 FLEXIBLE: Forecasting Cellular Traffic by Leveraging Explicit Inductive Graph-Based Learning Duc Thinh Ngo et.al. 2405.08843 null
2024-05-14 Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs P. Mas-Buitrago et.al. 2405.08703 link
2024-05-13 Modeling of Time-varying Wireless Communication Channel with Fading and Shadowing Lee Youngmin et.al. 2405.08199 link
2024-05-13 Enhancing Clinically Significant Prostate Cancer Prediction in T2-weighted Images through Transfer Learning from Breast Cancer Chi-en Amy Tai et.al. 2405.07869 null
2024-05-13 Automatic Recognition of Food Ingestion Environment from the AIM-2 Wearable Sensor Yuning Huang et.al. 2405.07827 null
2024-05-11 Fractals as Pre-training Datasets for Anomaly Detection and Localization C. I. Ugwu et.al. 2405.06980 null
2024-05-13 MRSegmentator: Robust Multi-Modality Segmentation of 40 Classes in MRI and CT Sequences Hartmut Häntze et.al. 2405.06463 link
2024-05-10 DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding Ting Liu et.al. 2405.06217 link
2024-05-09 Scalable Learning of Segment-Level Traffic Congestion Functions Shushman Choudhury et.al. 2405.06080 null
2024-05-09 Robust and Explainable Fine-Grained Visual Classification with Transfer Learning: A Dual-Carriageway Framework Zheming Zuo et.al. 2405.05853 null
2024-05-17 Identification of problematic epochs in astronomical time series through transfer learning Stefano Cavuoti et.al. 2405.05591 link
2024-05-09 Model Inversion Robustness: Can Transfer Learning Help? Sy-Tuyen Ho et.al. 2405.05588 null
2024-05-08 Large Language Model Enhanced Machine Learning Estimators for Classification Yuhang Wu et.al. 2405.05445 link
2024-05-08 Deep Learning Method to Predict Wound Healing Progress Based on Collagen Fibers in Wound Tissue Juan He et.al. 2405.05297 null
2024-05-08 Deep learning-based variational autoencoder for classification of quantum and classical states of light Mahesh Bhupati et.al. 2405.05243 null
2024-05-08 Encoder-Decoder Framework for Interactive Free Verses with Generation with Controllable High-Quality Rhyming Tommaso Pasini et.al. 2405.05176 null
2024-05-08 Exploring Vision Transformers for 3D Human Motion-Language Models with Motion Patches Qing Yu et.al. 2405.04771 null
2024-05-09 Large Language Models for Cyber Security: A Systematic Literature Review HanXiang Xu et.al. 2405.04760 link
2024-05-07 SingIt! Singer Voice Transformation Amit Eliav et.al. 2405.04627 null
2024-05-07 Neural network based approach for solving problems in plane wave duct acoustics D. Veerababu et.al. 2405.04603 null
2024-05-07 Cross-Platform Autonomous Control of Minimal Kitaev Chains David van Driel et.al. 2405.04596 null
2024-05-07 Enriched BERT Embeddings for Scholarly Publication Classification Benjamin Wolff et.al. 2405.04136 link
2024-05-07 A Stealthy Wrongdoer: Feature-Oriented Reconstruction Attack against Split Learning Xiaoyang Xu et.al. 2405.04115 link
2024-05-07 Predicting Lung Disease Severity via Image-Based AQI Analysis using Deep Learning Techniques Anvita Mahajan et.al. 2405.03981 null
2024-05-05 Spatial Transfer Learning with Simple MLP Hongjian Yang et.al. 2405.03720 null
2024-05-06 Mind the Gap Between Synthetic and Real: Utilizing Transfer Learning to Probe the Boundaries of Stable Diffusion Generated Data Leonhard Hennicke et.al. 2405.03243 null
2024-05-04 Stable Diffusion Dataset Generation for Downstream Classification Tasks Eugenio Lomurno et.al. 2405.02698 null
2024-05-04 Few-Shot Fruit Segmentation via Transfer Learning Jordan A. James et.al. 2405.02556 link
2024-05-04 CNN-LSTM and Transfer Learning Models for Malware Classification based on Opcodes and API Calls Ahmed Bensaoud et.al. 2405.02548 null
2024-05-03 Spatio-Temporal SwinMAE: A Swin Transformer based Multiscale Representation Learner for Temporal Satellite Imagery Yohei Nakayama et.al. 2405.02512 null
2024-05-03 Deep Learning and Transfer Learning Architectures for English Premier League Player Performance Forecasting Daniel Frees et.al. 2405.02412 link
2024-05-03 GMP-ATL: Gender-augmented Multi-scale Pseudo-label Enhanced Adaptive Transfer Learning for Speech Emotion Recognition via HuBERT Yu Pan et.al. 2405.02151 null
2024-05-03 Creation of Novel Soft Robot Designs using Generative AI Wee Kiat Chan et.al. 2405.01824 null
2024-05-02 Diabetic Retinopathy Detection Using Quantum Transfer Learning Ankush Jain et.al. 2405.01734 null
2024-05-02 Individual Fairness Through Reweighting and Tuning Abdoul Jalil Djiberou Mahamadou et.al. 2405.01711 null
2024-05-01 KITE: A Kernel-based Improved Transferability Estimation Method Yunhui Guo et.al. 2405.01603 null
2024-05-02 CromSS: Cross-modal pre-training with noisy labels for remote sensing image segmentation Chenying Liu et.al. 2405.01217 null
2024-05-01 Transformer-Based Self-Supervised Learning for Histopathological Classification of Ischemic Stroke Clot Origin K. Yeh et.al. 2405.00908 null
2024-05-01 Koopman-based Deep Learning for Nonlinear System Estimation Zexin Sun et.al. 2405.00627 null
2024-05-01 Self-supervised Pre-training of Text Recognizers Martin Kišš et.al. 2405.00420 link
2024-05-01 Employing Federated Learning for Training Autonomous HVAC Systems Fredrik Hagström et.al. 2405.00389 null
2024-04-30 Expanding the Horizon: Enabling Hybrid Quantum Transfer Learning for Long-Tailed Chest X-Ray Classification Skylar Chan et.al. 2405.00156 link
2024-04-30 ThangDLU at #SMM4H 2024: Encoder-decoder models for classifying text data on social disorders in children and adolescents Hoang-Thang Ta et.al. 2404.19714 null
2024-04-30 Let’s Focus: Focused Backdoor Attack against Federated Transfer Learning Marco Arazzi et.al. 2404.19420 null
2024-04-29 What Drives Performance in Multilingual Language Models? Sina Bagheri Nezhad et.al. 2404.19159 link
2024-04-27 Remote Sensing Image Enhancement through Spatiotemporal Filtering Hessah Albanwan et.al. 2404.18950 null
2024-04-29 Adaptive Reinforcement Learning for Robot Control Yu Tang Liu et.al. 2404.18713 link
2024-04-29 Generation of Uncorrelated Residual Variables for Chemical Process Fault Diagnosis via Transfer Learning-based Input-Output Decoupled Network Zhuofu Pan et.al. 2404.18528 null
2024-05-02 Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment Tengjun Huang et.al. 2404.18253 link
2024-04-28 EkoHate: Abusive Language and Hate Speech Detection for Code-switched Political Discussions on Nigerian Twitter Comfort Eseohen Ilevbare et.al. 2404.18180 link
2024-04-27 Transfer Learning Enhanced Single-choice Decision for Multi-choice Question Answering Chenhao Cui et.al. 2404.17949 null
2024-04-26 Causally Abstracted Multi-armed Bandits Fabio Massimo Zennaro et.al. 2404.17493 link
2024-04-26 FTL: Transfer Learning Nonlinear Plasma Dynamic Transitions in Low Dimensional Embeddings via Deep Neural Networks Zhe Bai et.al. 2404.17466 link
2024-04-26 Comparison of self-supervised in-domain and supervised out-domain transfer learning for bird species recognition Houtan Ghaffari et.al. 2404.17252 null
2024-04-26 Self-supervised visual learning in the low-data regime: a comparative evaluation Sotirios Konstantakos et.al. 2404.17202 null
2024-04-26 2M-NER: Contrastive Learning for Multilingual and Multimodal NER with Language and Modal Fusion Dongsheng Wang et.al. 2404.17122 null
2024-04-26 Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection Daisuke Niizumi et.al. 2404.17107 link
2024-04-29 On TinyML and Cybersecurity: Electric Vehicle Charging Infrastructure Use Case Fatemeh Dehrouyeh et.al. 2404.16894 link
2024-04-25 Meta-Transfer Derm-Diagnosis: Exploring Few-Shot Learning and Transfer Learning for Skin Disease Classification in Long-Tail Distribution Zeynep Özdemir et.al. 2404.16814 null
2024-04-25 Probabilistic Multi-Layer Perceptrons for Wind Farm Condition Monitoring Filippo Fiocchi et.al. 2404.16496 null
2024-04-25 Leveraging tropical reef, bird and unrelated sounds for superior transfer learning in marine bioacoustics Ben Williams et.al. 2404.16436 link
2024-04-25 Asking and Answering Questions to Extract Event-Argument Structures Md Nayem Uddin et.al. 2404.16413 link
2024-04-24 Employing Two-Dimensional Word Embedding for Difficult Tabular Data Stream Classification Paweł Zyblewski et.al. 2404.15836 link
2024-04-24 Where to Mask: Structure-Guided Masking for Graph Masked Autoencoders Chuang Liu et.al. 2404.15806 link
2024-04-24 No Train but Gain: Language Arithmetic for training-free Language Adapters enhancement Mateusz Klimaszewski et.al. 2404.15737 link
2024-04-24 MDDD: Manifold-based Domain Adaptation with Dynamic Distribution for Non-Deep Transfer Learning in Cross-subject and Cross-session EEG-based Emotion Recognition Ting Luo et.al. 2404.15615 null
2024-04-19 KATO: Knowledge Alignment and Transfer for Transistor Sizing of Different Design and Technology Wei W. Xing et.al. 2404.14433 null
2024-04-22 Machine Learning Techniques for MRI Data Processing at Expanding Scale Taro Langner et.al. 2404.14326 null
2024-04-22 Automated Long Answer Grading with RiceChem Dataset Shashank Sonkar et.al. 2404.14316 link
2024-04-26 ArtNeRF: A Stylized Neural Field for 3D-Aware Cartoonized Face Synthesis Zichen Tang et.al. 2404.13711 link
2024-04-20 MultiConfederated Learning: Inclusive Non-IID Data handling with Decentralized Federated Learning Michael Duchesne et.al. 2404.13421 null
2024-04-20 Transfer Learning for Molecular Property Predictions from Small Data Sets Thorren Kirschbaum et.al. 2404.13393 link
2024-04-20 Federated Transfer Learning with Task Personalization for Condition Monitoring in Ultrasonic Metal Welding Ahmadreza Eslaminia et.al. 2404.13278 null
2024-04-19 Explainable AI for Fair Sepsis Mortality Predictive Model Chia-Hsuan Chang et.al. 2404.13139 null
2024-04-19 Cross-Modal Adapter: Parameter-Efficient Transfer Learning Approach for Vision-Language Models Juncheng Yang et.al. 2404.12588 null
2024-04-18 Understanding Optimal Feature Transfer via a Fine-Grained Bias-Variance Analysis Yufan Li et.al. 2404.12481 null
2024-04-18 sEMG-based Fine-grained Gesture Recognition via Improved LightGBM Model Xiupeng Qiao et.al. 2404.11861 null
2024-04-17 GenFighter: A Generative and Evolutive Textual Attack Removal Md Athikul Islam et.al. 2404.11538 null
2024-04-17 Explainable Lung Disease Classification from Chest X-Ray Images Utilizing Deep Learning and XAI Tanzina Taher Ifty et.al. 2404.11428 null
2024-04-19 Feature Corrective Transfer Learning: End-to-End Solutions to Object Detection in Non-Ideal Visual Conditions Chuheng Wei et.al. 2404.11214 null
2024-04-18 Supervised Contrastive Vision Transformer for Breast Histopathological Image Classification Mohammad Shiri et.al. 2404.11052 null
2024-04-17 Control Theoretic Approach to Fine-Tuning and Transfer Learning Erkan Bayram et.al. 2404.11013 null
2024-04-16 Tao: Re-Thinking DL-based Microarchitecture Simulation Santosh Pandey et.al. 2404.10921 null
2024-04-21 Lighter, Better, Faster Multi-Source Domain Adaptation with Gaussian Mixture Models and Optimal Transport Eduardo Fernandes Montesuma et.al. 2404.10261 link
2024-04-16 Privacy-Preserving Training-as-a-Service for On-Device Intelligence: Concept, Architectural Scheme, and Open Problems Zhiyuan Wu et.al. 2404.10255 null
2024-04-15 High-Resolution Detection of Earth Structural Heterogeneities from Seismic Amplitudes using Convolutional Neural Networks with Attention layers Luiz Schirmer et.al. 2404.10170 null
2024-04-15 Self-Supervised Learning Featuring Small-Scale Image Dataset for Treatable Retinal Diseases Classification Luffina C. Huang et.al. 2404.10166 null
2024-04-15 Multiple-Input Fourier Neural Operator (MIFNO) for source-dependent 3D elastodynamics Fanny Lehmann et.al. 2404.10115 link
2024-04-15 Conditional Prototype Rectification Prompt Learning Haoxing Chen et.al. 2404.09872 link
2024-04-15 The Physalis system: Discovery of ORC-like radio shells around a massive pair of interacting early-type galaxies with offset X-ray emission Bärbel S. Koribalski et.al. 2404.09522 null
2024-04-14 Low-Resource Named Entity Recognition with Cross-Lingual, Character-Level Neural Conditional Random Fields Ryan Cotterell et.al. 2404.09383 null
2024-04-14 Breast Cancer Image Classification Method Based on Deep Transfer Learning Weimin Wang et.al. 2404.09226 null
2024-04-14 Intelligent Chemical Purification Technique Based on Machine Learning Wenchao Wu et.al. 2404.09114 null
2024-04-13 HEAT: Head-level Parameter Efficient Adaptation of Vision Transformers with Taylor-expansion Importance Scores Yibo Zhong et.al. 2404.08894 null
2024-04-16 E3: Ensemble of Expert Embedders for Adapting Synthetic Image Detectors to New Generators Using Limited Data Aref Azizpour et.al. 2404.08814 link
2024-04-12 Using Explainable AI and Transfer Learning to understand and predict the maintenance of Atlantic blocking with limited observational data Huan Zhang et.al. 2404.08613 link
2024-04-12 Advanced wood species identification based on multiple anatomical sections and using deep feature transfer and fusion Kallil M. Zielinski et.al. 2404.08585 null
2024-04-12 Convolutional neural network classification of cancer cytopathology images: taking breast cancer as an example MingXuan Xiao et.al. 2404.08279 null
2024-04-12 Transfer Learning Study of Motion Transformer-based Trajectory Predictions Lars Ullrich et.al. 2404.08271 null
2024-04-12 Investigating Neural Machine Translation for Low-Resource Languages: Using Bavarian as a Case Study Wan-Hua Her et.al. 2404.08259 link
2024-04-11 Predictive Handover Strategy in 6G and Beyond: A Deep and Transfer Learning Approach Ioannis Panitsas et.al. 2404.08113 null
2024-04-11 MSciNLI: A Diverse Benchmark for Scientific Natural Language Inference Mobashir Sadat et.al. 2404.08066 link
2024-04-11 OpenTrench3D: A Photogrammetric 3D Point Cloud Dataset for Semantic Segmentation of Underground Utilities Lasse H. Hansen et.al. 2404.07711 link
2024-04-11 Depth Estimation using Weighted-loss and Transfer Learning Muhammad Adeel Hafeez et.al. 2404.07686 null
2024-04-11 PINNACLE: PINN Adaptive ColLocation and Experimental points selection Gregory Kang Ruey Lau et.al. 2404.07662 link
2024-04-11 GLID: Pre-training a Generalist Encoder-Decoder Vision Model Jihao Liu et.al. 2404.07603 null
2024-04-10 Transfer Learning via Latent Dependency Factor for Estimating PM 2.5 Shrey Gupta et.al. 2404.07308 link
2024-04-10 XNLIeu: a dataset for cross-lingual NLI in Basque Maite Heredia et.al. 2404.06996 link
2024-04-10 The ‘Sandwich’ meta-framework for architecture agnostic deep privacy-preserving transfer learning for non-invasive brainwave decoding Xiaoxi Wei et.al. 2404.06868 null
2024-04-10 Adapting LLaMA Decoder to Vision Transformer Jiahao Wang et.al. 2404.06773 link
2024-04-09 Event Extraction in Basque: Typologically motivated Cross-Lingual Transfer-Learning Analysis Mikel Zubillaga et.al. 2404.06392 null
2024-04-09 The impact of data set similarity and diversity on transfer learning success in time series forecasting Claudia Ehrig et.al. 2404.06198 null
2024-04-10 Using Few-Shot Learning to Classify Primary Lung Cancer and Other Malignancy with Lung Metastasis in Cytological Imaging via Endobronchial Ultrasound Procedures Ching-Kai Lin et.al. 2404.06080 null
2024-04-08 BatSort: Enhanced Battery Classification with Transfer Learning for Battery Sorting and Recycling Yunyi Zhao et.al. 2404.05802 link
2024-04-08 MULTIFLOW: Shifting Towards Task-Agnostic Vision-Language Pruning Matteo Farina et.al. 2404.05621 link
2024-04-07 DinoBloom: A Foundation Model for Generalizable Cell Embeddings in Hematology Valentin Koch et.al. 2404.05022 link
2024-04-06 Latent-based Diffusion Model for Long-tailed Recognition Pengxiao Han et.al. 2404.04517 link
2024-04-05 Open vocabulary keyword spotting through transfer learning from speech synthesis Kesavaraj V et.al. 2404.03914 null
2024-04-05 VoltaVision: A Transfer Learning model for electronic component classification Anas Mohammad Ishfaqul Muktadir Osmani et.al. 2404.03898 link
2024-04-09 Enhancing Breast Cancer Diagnosis in Mammography: Evaluation and Integration of Convolutional Neural Networks and Explainable AI Maryam Ahmed et.al. 2404.03892 null
2024-04-04 Free Energy Calculations using Smooth Basin Classification Sander Vandenhaute et.al. 2404.03777 null
2024-04-04 How does Multi-Task Training Affect Transformer In-Context Capabilities? Investigations with Function Classes Harmon Bhasin et.al. 2404.03558 link
2024-04-03 Transfer learning applications for anomaly detection in wind turbines Cyriana M. A. Roelofs et.al. 2404.03011 null
2024-04-03 Fast Diffusion Model For Seismic Data Noise Attenuation Junheng Peng et.al. 2404.02767 null
2024-04-03 Cross-Architecture Transfer Learning for Linear-Cost Inference Transformers Sehyun Choi et.al. 2404.02684 null
2024-04-03 What Are We Measuring When We Evaluate Large Vision-Language Models? An Analysis of Latent Factors and Biases Anthony Meng Huat Tiong et.al. 2404.02415 link
2024-04-02 Is Exploration All You Need? Effective Exploration Characteristics for Transfer in Reinforcement Learning Jonathan C. Balloch et.al. 2404.02235 null
2024-04-03 ResNet with Integrated Convolutional Block Attention Module for Ship Classification Using Transfer Learning on Optical Satellite Imagery Ryan Donghan Kwon et.al. 2404.02135 null
2024-04-02 ImageNot: A contrast with ImageNet preserves model rankings Olawale Salaudeen et.al. 2404.02112 link
2024-04-02 Active Exploration in Bayesian Model-based Reinforcement Learning for Robot Manipulation Carlos Plou et.al. 2404.01867 null
2024-04-02 Transfer Learning from Whisper for Microscopic Intelligibility Prediction Paul Best et.al. 2404.01737 null
2024-04-01 NeRF-MAE : Masked AutoEncoders for Self Supervised 3D representation Learning for Neural Radiance Fields Muhammad Zubair Irshad et.al. 2404.01300 link
2024-04-01 LoSA: Long-Short-range Adapter for Scaling End-to-End Temporal Action Localization Akshita Gupta et.al. 2404.01282 null
2024-04-01 Diagnosis of Skin Cancer Using VGG16 and VGG19 Based Transfer Learning Models Amir Faghihi et.al. 2404.01160 null
2024-04-01 TransFusion: Covariate-Shift Robust Transfer Learning for High-Dimensional Regression Zelin He et.al. 2404.01153 null
2024-04-01 Machine Learning Robustness: A Primer Houssem Ben Braiek et.al. 2404.00897 null
2024-04-01 Bailong: Bilingual Transfer Learning based on QLoRA and Zip-tie Embedding Lung-Chuan Chen et.al. 2404.00862 null
2024-04-01 Transfer Learning with Point Transformers Kartik Gupta et.al. 2404.00846 null
2024-03-31 $R^2$ -Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding Ye Liu et.al. 2404.00801 link
2024-03-31 Minimum-Norm Interpolation Under Covariate Shift Neil Mallinar et.al. 2404.00522 null
2024-03-31 Transfer Learning with Reconstruction Loss Wei Cui et.al. 2404.00505 link
2024-03-30 Noise-Aware Training of Layout-Aware Language Models Ritesh Sarkhel et.al. 2404.00488 null
2024-03-30 From attention to profit: quantitative trading strategy based on transformer Zhaofeng Zhang et.al. 2404.00424 link
2024-03-28 Is Synthetic Image Useful for Transfer Learning? An Investigation into Data Generation, Volume, and Utilization Yuhang Li et.al. 2403.19866 null
2024-03-28 A Tulu Resource for Machine Translation Manu Narayanan et.al. 2403.19142 link
2024-04-01 Quantum to Classical Neural Network Transfer Learning Applied to Drug Toxicity Prediction Anthony M. Smaldone et.al. 2403.18997 link
2024-03-27 Direct mineral content prediction from drill core images via transfer learning Romana Boiger et.al. 2403.18495 null
2024-03-27 Deep Learning Segmentation and Classification of Red Blood Cells Using a Large Multi-Scanner Dataset Mohamed Elmanna et.al. 2403.18468 null
2024-03-26 Spectral Convolutional Transformer: Harmonizing Real vs. Complex Multi-View Spectral Operators for Vision Transformer Badri N. Patro et.al. 2403.18063 link
2024-03-26 The Need for Speed: Pruning Transformers with One Recipe Samir Khaki et.al. 2403.17921 link
2024-03-26 Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos Akshay Paruchuri et.al. 2403.17915 null
2024-03-26 To Supervise or Not to Supervise: Understanding and Addressing the Key Challenges of 3D Transfer Learning Souhail Hadgi et.al. 2403.17869 null
2024-03-26 A Bayesian shrinkage estimator for transfer learning Mohamed A. Abba et.al. 2403.17321 null
2024-03-25 A Hybrid Approach To Aspect Based Sentiment Analysis Using Transfer Learning Gaurav Negi et.al. 2403.17254 null
2024-03-25 Engagement Measurement Based on Facial Landmarks and Spatial-Temporal Graph Convolutional Networks Ali Abedi et.al. 2403.17175 null
2024-03-29 Task2Box: Box Embeddings for Modeling Asymmetric Task Relationships Rangel Daroya et.al. 2403.17173 link
2024-03-25 Can Machine Translation Bridge Multilingual Pretraining and Cross-lingual Transfer Learning? Shaoxiong Ji et.al. 2403.16777 null
2024-03-25 Grammatical vs Spelling Error Correction: An Investigation into the Responsiveness of Transformer-based Language Models using BART and MarianMT Rohit Raju et.al. 2403.16655 null
2024-03-25 Enhancing Industrial Transfer Learning with Style Filter: Cost Reduction and Defect-Focus Chen Li et.al. 2403.16607 null
2024-03-25 Exploit High-Dimensional RIS Information to Localization: What Is the Impact of Faulty Element? Tuo Wu et.al. 2403.16529 null
2024-03-25 Employing High-Dimensional RIS Information for RIS-aided Localization Systems Tuo Wu et.al. 2403.16521 null
2024-03-25 Self-Supervised Learning for Medical Image Data with Anatomy-Oriented Imaging Planes Tianwei Zhang et.al. 2403.16499 null
2024-03-25 Data-Driven Extrusion Force Control Tuning for 3D Printing Xavier Guidetti et.al. 2403.16470 null
2024-03-23 A Deep Learning Architectures for Kidney Disease Classification Muhammad Shoaib Farooq et.al. 2403.15895 null
2024-03-23 VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding Phong Nguyen-Thuan Do et.al. 2403.15882 null
2024-03-22 SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series Badri N. Patro et.al. 2403.15360 link
2024-03-22 Not All Attention is Needed: Parameter and Computation Efficient Transfer Learning for Multi-modal Large Language Models Qiong Wu et.al. 2403.15226 link
2024-03-22 Vehicle Detection Performance in Nordic Region Hamam Mokayed et.al. 2403.15017 null
2024-03-21 A Transfer Learning Causal Approach to Evaluate Racial/Ethnic and Geographic Variation in Outcomes Following Congenital Heart Surgery Larry Han et.al. 2403.14573 null
2024-03-21 Transfer Learning for Cross-dataset Isolated Sign Language Recognition in Under-Resourced Datasets Ahmet Alp Kindiroglu et.al. 2403.14534 link
2024-03-21 Exploring Task Unification in Graph Representation Learning via Generative Approach Yulan Hu et.al. 2403.14340 null
2024-03-21 Stitching for Neuroevolution: Recombining Deep Neural Networks without Breaking Them Arthur Guijt et.al. 2403.14224 null
2024-03-21 HETAL: Efficient Privacy-preserving Transfer Learning with Homomorphic Encryption Seewoo Lee et.al. 2403.14111 link
2024-03-20 Bayesian Physics-informed Neural Networks for System Identification of Inverter-dominated Power Systems Simon Stock et.al. 2403.13602 null
2024-03-20 AdaTrans: Feature-wise and Sample-wise Adaptive Transfer Learning for High-dimensional Regression Zelin He et.al. 2403.13565 null
2024-03-20 Have You Poisoned My Data? Defending Neural Networks against Data Poisoning Fabio De Gaspari et.al. 2403.13523 null
2024-03-20 FissionFusion: Fast Geometric Generation and Hierarchical Souping for Medical Image Analysis Santosh Sanjeev et.al. 2403.13341 link
2024-03-21 Arcee’s MergeKit: A Toolkit for Merging Large Language Models Charles Goddard et.al. 2403.13257 link
2024-03-19 Wildfire danger prediction optimization with transfer learning Spiros Maggioros et.al. 2403.12871 link
2024-03-19 TransformMix: Learning Transformation and Mixing Strategies from Data Tsz-Him Cheung et.al. 2403.12429 null
2024-03-19 Improving Generalizability of Extracting Social Determinants of Health Using Large Language Models through Prompt-tuning Cheng Peng et.al. 2403.12374 null
2024-03-18 Transfer Learning for T-Cell Response Prediction Josua Stadelmaier et.al. 2403.12117 link
2024-03-18 Sub-photon accuracy noise reduction of single shot coherent diffraction pattern with atomic model trained autoencoder Takuto Ishikawa et.al. 2403.11992 null
2024-03-18 Transfer Learning Beyond Bounded Density Ratios Alkis Kalavasis et.al. 2403.11963 null
2024-03-18 SuperLoRA: Parameter-Efficient Unified Adaptation of Multi-Layer Attention Modules Xiangyu Chen et.al. 2403.11887 null
2024-03-18 S-JEPA: towards seamless cross-dataset transfer through dynamic spatial attention Pierre Guetschel et.al. 2403.11772 null
2024-03-18 Revisiting Tensor Basis Neural Networks for Reynolds stress modeling: application to plane channel and square duct flows Jiayi Cai et.al. 2403.11746 null
2024-03-18 MedMerge: Merging Models for Effective Transfer Learning to Medical Imaging Tasks Ibrahim Almakky et.al. 2403.11646 null
2024-03-18 Augment Before Copy-Paste: Data and Memory Efficiency-Oriented Instance Segmentation Framework for Sport-scenes Chih-Chung Hsu et.al. 2403.11572 null
2024-03-17 Federated Transfer Learning with Differential Privacy Mengchu Li et.al. 2403.11343 null
2024-03-16 Automatic location detection based on deep learning Anjali Karangiya et.al. 2403.10912 link
2024-03-15 On the low-shot transferability of [V]-Mamba Diganta Misra et.al. 2403.10696 null
2024-03-15 Latent Object Characteristics Recognition with Visual to Haptic-Audio Cross-modal Transfer Learning Namiko Saito et.al. 2403.10689 null
2024-03-14 Achieving Pareto Optimality using Efficient Parameter Reduction for DNNs in Resource-Constrained Edge Environment Atah Nuh Mih et.al. 2403.10569 null
2024-03-15 FeatUp: A Model-Agnostic Framework for Features at Any Resolution Stephanie Fu et.al. 2403.10516 link
2024-03-15 TransLandSeg: A Transfer Learning Approach for Landslide Semantic Segmentation Based on Vision Foundation Model Changhong Hou et.al. 2403.10127 null
2024-03-14 The galaxy group merger origin of the Cloverleaf odd radio circle system E. Bulbul et.al. 2403.09808 null
2024-03-14 GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding Chengyao Wang et.al. 2403.09639 link
2024-03-14 The Neural-SRP method for positional sound source localization Eric Grinstein et.al. 2403.09455 link
2024-03-13 A Physics-driven GraphSAGE Method for Physical Process Simulations Described by Partial Differential Equations Hang Hu et.al. 2403.08569 null
2024-03-13 HOLMES: HOLonym-MEronym based Semantic inspection for Convolutional Image Classifiers Francesco Dibitonto et.al. 2403.08536 link
2024-03-13 Unleashing the Power of Meta-tuning for Few-shot Generalization Through Sparse Interpolated Experts Shengzhuang Chen et.al. 2403.08477 link
2024-03-12 Authorship Style Transfer with Policy Optimization Shuai Liu et.al. 2403.08043 link
2024-03-12 Conditional computation in neural networks: principles and research trends Simone Scardapane et.al. 2403.07965 null
2024-03-12 Physics-Transfer Learning for Material Strength Screening Yingjie Zhao et.al. 2403.07526 null
2024-03-12 DALSA: Domain Adaptation for Supervised Learning From Sparsely Annotated MR Images Michael Götz et.al. 2403.07434 null
2024-03-12 Knowledge Transfer across Multiple Principal Component Analysis Studies Zeyu Li et.al. 2403.07431 null
2024-03-12 Enhancing Transfer Learning with Flexible Nonparametric Posterior Sampling Hyungi Lee et.al. 2403.07282 null
2024-03-11 Exploring Large Language Models and Hierarchical Frameworks for Classification of Large Unstructured Legal Documents Nishchal Prasad et.al. 2403.06872 link
2024-03-11 LeOCLR: Leveraging Original Images for Contrastive Learning of Visual Representations Mohammad Alkhalefi et.al. 2403.06813 null
2024-03-11 Forest Inspection Dataset for Aerial Semantic Segmentation and Depth Estimation Bianca-Cerasela-Zelia Blaga et.al. 2403.06621 null
2024-03-11 Cross-domain and Cross-dimension Learning for Image-to-Graph Transformers Alexander H. Berger et.al. 2403.06601 link
2024-03-11 When Crypto Economics Meet Graph Analytics and Learning Bingqiao Luo et.al. 2403.06454 null
2024-03-11 Can LLMs’ Tuning Methods Work in Medical Multimodal Domain? Jiawei Chen et.al. 2403.06407 link
2024-03-11 A Segmentation Foundation Model for Diverse-type Tumors Jianhao Xie et.al. 2403.06396 null
2024-03-11 Pre-Trained Model Recommendation for Downstream Fine-tuning Jiameng Bai et.al. 2403.06382 null
2024-03-11 See Through Their Minds: Learning Transferable Neural Representation from Cross-Subject fMRI Yulong Liu et.al. 2403.06361 link
2024-03-10 Active Learning for Rapid Targeted Synthesis of Compositionally Complex Alloys Nathan Johnson et.al. 2403.06329 null
2024-03-10 Large Language Models on Fine-grained Emotion Detection Dataset with Data Augmentation and Transfer Learning Kaipeng Wang et.al. 2403.06108 null
2024-03-10 Towards In-Vehicle Multi-Task Facial Attribute Recognition: Investigating Synthetic Data and Vision Foundation Models Esmaeil Seraj et.al. 2403.06088 null
2024-03-09 Multimodal deep learning approach to predicting neurological recovery from coma after cardiac arrest Felix H. Krones et.al. 2403.06027 null
2024-03-08 OmniJet- $α$ : The first cross-task foundation model for particle physics Joschka Birk et.al. 2403.05618 link
2024-03-08 Authorship Attribution in Bangla Literature (AABL) via Transfer Learning using ULMFiT Aisha Khatun et.al. 2403.05519 null
2024-03-08 JointMotion: Joint Self-supervision for Joint Motion Prediction Royden Wagner et.al. 2403.05489 link
2024-03-08 HistGen: Histopathology Report Generation via Local-Global Feature Encoding and Cross-modal Context Interaction Zhengrui Guo et.al. 2403.05396 link
2024-03-08 Hybridized Convolutional Neural Networks and Long Short-Term Memory for Improved Alzheimer’s Disease Diagnosis from MRI Scans Maleka Khatun et.al. 2403.05353 null
2024-03-07 Cell reprogramming design by transfer learning of functional transcriptional networks Thomas P. Wytock et.al. 2403.04837 link
2024-03-07 AUFormer: Vision Transformers are Parameter-Efficient Facial Action Unit Detectors Kaishen Yuan et.al. 2403.04697 link
2024-03-07 Source Matters: Source Dataset Impact on Model Robustness in Medical Imaging Dovile Juodelyte et.al. 2403.04484 link
2024-03-07 DA-Net: A Disentangled and Adaptive Network for Multi-Source Cross-Lingual Transfer Learning Ling Ge et.al. 2403.04158 null
2024-03-06 Self and Mixed Supervision to Improve Training Labels for Multi-Class Medical Image Segmentation Jianfei Liu et.al. 2403.03882 null
2024-03-06 Neural Architecture Search using Particle Swarm and Ant Colony Optimization Séamus Lankford et.al. 2403.03781 null
2024-03-06 On Transfer in Classification: How Well do Subsets of Classes Generalize? Raphael Baena et.al. 2403.03569 null
2024-03-06 A comparative study of cosmological constraints from weak lensing using Convolutional Neural Networks Divij Sharma et.al. 2403.03490 null
2024-03-06 Multi-modal Deep Learning Chen Yuhua et.al. 2403.03385 null
2024-03-05 PalmProbNet: A Probabilistic Approach to Understanding Palm Distributions in Ecuadorian Tropical Forest via Transfer Learning Kangning Cui et.al. 2403.03161 null
2024-03-05 Zero-Shot Cross-Lingual Document-Level Event Causality Identification with Heterogeneous Graph Contrastive Transfer Learning Zhitao He et.al. 2403.02893 null
2024-03-05 Generative Software Engineering Yuan Huang et.al. 2403.02583 null
2024-03-04 Encodings for Prediction-based Neural Architecture Search Yash Akhauri et.al. 2403.02484 link
2024-03-04 On Latency Predictors for Neural Architecture Search Yash Akhauri et.al. 2403.02446 link
2024-03-04 How does Architecture Influence the Base Capabilities of Pre-trained Language Models? A Case Study Based on FFN-Wider Transformer Models Xin Lu et.al. 2403.02436 null
2024-03-04 On the impact of measure pre-conditionings on general parametric ML models and transfer learning via domain adaptation Joaquín Sánchez García et.al. 2403.02432 null
2024-03-04 Distilled ChatGPT Topic & Sentiment Modeling with Applications in Finance Olivier Gandouet et.al. 2403.02185 null
2024-03-04 Self-Supervised Facial Representation Learning with Facial Region Awareness Zheng Gao et.al. 2403.02138 null
2024-03-04 Leveraging Weakly Annotated Data for Hate Speech Detection in Code-Mixed Hinglish: A Feasibility-Driven Transfer Learning Approach with Large Language Models Sargam Yadav et.al. 2403.02121 null
2024-03-04 A New Perspective on Smiling and Laughter Detection: Intensity Levels Matter Hugo Bohy et.al. 2403.02112 null
2024-03-03 Is in-domain data beneficial in transfer learning for landmarks detection in x-ray images? Roberto Di Via et.al. 2403.01470 null
2024-03-03 Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis Xin Zhou et.al. 2403.01439 link
2024-03-03 A Comprehensive Survey of Federated Transfer Learning: Challenges, Methods and Applications Wei Guo et.al. 2403.01387 null
2024-03-02 Fast Low-parameter Video Activity Localization in Collaborative Learning Environments Venkatesh Jatla et.al. 2403.01281 null
2024-03-02 Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey Hamza Kheddar et.al. 2403.01255 null
2024-03-02 Balancing Exploration and Exploitation in LLM using Soft RLLF for Enhanced Negation Understanding Ha-Thanh Nguyen et.al. 2403.01185 null
2024-03-02 Transfer Learning-Enhanced Instantaneous Multi-Person Indoor Localization by CSI Zhiyuan He et.al. 2403.01153 null
2024-03-01 Transfer Learning for Security: Challenges and Future Directions Adrian Shuai Li et.al. 2403.00935 null
2024-03-01 A Regularization-based Transfer Learning Method for Information Extraction via Instructed Graph Decoder Kedi Chen et.al. 2403.00891 link
2024-03-01 Bias Mitigation in Fine-tuning Pre-trained Models for Enhanced Fairness and Efficiency Yixuan Zhang et.al. 2403.00625 null
2024-03-01 Generalized User Representations for Transfer Learning Ghazal Fazelnia et.al. 2403.00584 null
2024-03-01 Cross-Lingual Learning vs. Low-Resource Fine-Tuning: A Case Study with Fact-Checking in Turkish Recep Firat Cekinel et.al. 2403.00411 link
2024-03-01 Efficient Adapter Tuning of Pre-trained Speech Models for Automatic Speaker Verification Mufan Sang et.al. 2403.00293 null
2024-02-29 Analysis of the Two-Step Heterogeneous Transfer Learning for Laryngeal Blood Vessel Classification: Issue and Improvement Xinyi Fang et.al. 2402.19001 null
2024-02-28 Deep Neural Network Models Trained With A Fixed Random Classifier Transfer Better Across Domains Hafiz Tiomoko Ali et.al. 2402.18614 null
2024-02-28 TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding Zhihao Zhang et.al. 2402.18490 null
2024-02-28 Universal neural network potentials as descriptors: Towards scalable chemical property prediction using quantum and classical computers Tomoya Shiota et.al. 2402.18433 null
2024-02-28 Emotion Classification in Low and Moderate Resource Languages Shabnam Tafreshi et.al. 2402.18424 null
2024-02-29 Investigation of Adapter for Automatic Speech Recognition in Noisy Environment Hao Shi et.al. 2402.18275 null
2024-02-28 Challenges in Pre-Training Graph Neural Networks for Context-Based Fake News Detection: An Evaluation of Current Strategies and Resource Limitations Gregor Donabauer et.al. 2402.18179 link
2024-02-28 Diffusion-based Neural Network Weights Generation Bedionita Soro et.al. 2402.18153 link
2024-03-03 Automated Testing of Spatially-Dependent Environmental Hypotheses through Active Transfer Learning Nicholas Harrison et.al. 2402.18064 null
2024-03-04 OpenMEDLab: An Open-source Platform for Multi-modality Foundation Models in Medicine Xiaosong Wang et.al. 2402.18028 null
2024-02-27 Quantum Circuit Discovery for Fault-Tolerant Logical State Preparation with Reinforcement Learning Remmy Zen et.al. 2402.17761 link
2024-02-27 MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation Hanan Gani et.al. 2402.17725 link
2024-02-27 Transfer Learning Bayesian Optimization to Design Competitor DNA Molecules for Use in Diagnostic Assays Ruby Sedgwick et.al. 2402.17704 link
2024-02-27 Intensive Care as One Big Sequence Modeling Problem Vadim Liventsev et.al. 2402.17501 link
2024-02-26 CLAP: Learning Transferable Binary Code Representations with Natural Language Supervision Hao Wang et.al. 2402.16928 link
2024-02-26 Enhancing Continuous Domain Adaptation with Multi-Path Transfer Curriculum Hanbing Liu et.al. 2402.16681 null
2024-02-28 Few-Shot Learning for Annotation-Efficient Nucleus Instance Segmentation Yu Ming et.al. 2402.16280 null
2024-02-25 StochCA: A Novel Approach for Exploiting Pretrained Models with Cross-Attention Seungwon Seo et.al. 2402.16092 link
2024-02-25 Emotion Classification in Short English Texts using Deep Learning Techniques Siddhanth Bhat et.al. 2402.16034 null
2024-02-25 Adversarial-Robust Transfer Learning for Medical Imaging via Domain Assimilation Xiaohui Chen et.al. 2402.16005 null
2024-02-25 Exploring the Power of Pure Attention Mechanisms in Blind Room Parameter Estimation Chunxi Wang et.al. 2402.16003 null
2024-02-25 VOLoc: Visual Place Recognition by Querying Compressed Lidar Map Xudong Cai et.al. 2402.15961 link
2024-02-23 Artificial Bee Colony optimization of Deep Convolutional Neural Networks in the context of Biomedical Imaging Adri Gomez Martin et.al. 2402.15246 null
2024-02-23 Which Model to Transfer? A Survey on Transferability Estimation Yuhe Ding et.al. 2402.15231 null
2024-02-23 Substrate Prediction for RiPP Biosynthetic Enzymes via Masked Language Modeling and Transfer Learning Joseph D. Clark et.al. 2402.15181 link
2024-02-23 PEMT: Multi-Task Correlation Guided Mixture-of-Experts Enables Parameter-Efficient Transfer Learning Zhisheng Lin et.al. 2402.15082 null
2024-02-22 Smoothness Adaptive Hypothesis Transfer Learning Haotian Lin et.al. 2402.14966 null
2024-02-22 An image-based transfer learning approach for using in situ processing data to predict laser powder bed fusion additively manufactured Ti-6Al-4V mechanical properties Qixiang Luo et.al. 2402.14945 null
2024-02-22 SHM-Traffic: DRL and Transfer learning based UAV Control for Structural Health Monitoring of Bridges with Traffic Divija Swetha Gadiraju et.al. 2402.14757 null
2024-02-22 CLCE: An Approach to Refining Cross-Entropy and Contrastive Learning for Optimized Learning Fusion Zijun Long et.al. 2402.14551 null
2024-02-21 Simple and Effective Transfer Learning for Neuro-Symbolic Integration Alessandro Daniele et.al. 2402.14047 null
2024-02-21 UniGraph: Learning a Cross-Domain Graph Foundation Model From Natural Language Yufei He et.al. 2402.13630 link
2024-02-21 ARL2: Aligning Retrievers for Black-box Large Language Models via Self-guided Adaptive Relevance Labeling Lingxi Zhang et.al. 2402.13542 null
2024-02-20 LinkSAGE: Optimizing Job Matching Using Graph Neural Networks Ping Liu et.al. 2402.13430 null
2024-02-20 Cross-Domain Transfer Learning with CoRTe: Consistent and Reliable Transfer from Black-Box to Lightweight Segmentation Model Claudia Cuttano et.al. 2402.13122 null
2024-02-20 CST: Calibration Side-Tuning for Parameter and Memory Efficient Transfer Learning Feng Chen et.al. 2402.12736 null
2024-02-20 Scalable and reliable deep transfer learning for intelligent fault detection via multi-scale neural processes embedded with knowledge Zhongzhi Li et.al. 2402.12729 null
2024-02-20 Iterated learning and multiscale modeling of history-dependent architectured metamaterials Yupeng Zhang et.al. 2402.12674 null
2024-02-20 Indiscriminate Data Poisoning Attacks on Pre-trained Feature Extractors Yiwei Lu et.al. 2402.12626 null
2024-02-19 Predicting trucking accidents with truck drivers ‘safety climate perception across companies: A transfer learning approach Kailai Sun et.al. 2402.12417 null
2024-02-19 A synthetic data approach for domain generalization of NLI models Mohammad Javad Hosseini et.al. 2402.12368 null
2024-02-19 Molecule Generation and Optimization for Efficient Fragrance Creation Bruno C. L. Rodrigues et.al. 2402.12134 link
2024-02-19 Stealing the Invisible: Unveiling Pre-Trained CNN Models through Adversarial Examples and Timing Side-Channels Shubhi Shukla et.al. 2402.11953 null
2024-02-20 A Generative Pre-Training Framework for Spatio-Temporal Graph Transfer Learning Yuan Yuan et.al. 2402.11922 link
2024-02-18 Autocorrect for Estonian texts: final report from project EKTB25 Agnes Luhtaru et.al. 2402.11671 null
2024-02-17 ZeroG: Investigating Cross-dataset Zero-shot Transferability in Graphs Yuhan Li et.al. 2402.11235 link
2024-02-17 A Question Answering Based Pipeline for Comprehensive Chinese EHR Information Extraction Huaiyuan Ying et.al. 2402.11177 null
2024-02-16 Robust agents learn causal world models Jonathan Richens et.al. 2402.10877 null
2024-02-16 Differential Private Federated Transfer Learning for Mental Health Monitoring in Everyday Settings: A Case Study on Stress Detection Ziyu Wang et.al. 2402.10862 null
2024-02-16 Masked Attention is All You Need for Graphs David Buterez et.al. 2402.10793 null
2024-02-16 Personalised Drug Identifier for Cancer Treatment with Transformers using Auxiliary Information Aishwarya Jayagopal et.al. 2402.10551 link
2024-02-15 Data Augmentation and Transfer Learning Approaches Applied to Facial Expressions Recognition Enrico Randellini et.al. 2402.09982 null
2024-02-15 Are Odd Radio Circles phoenixes of powerful radio galaxies? Stanislav Shabala et.al. 2402.09708 null
2024-02-15 Towards Precision Cardiovascular Analysis in Zebrafish: The ZACAF Paradigm Amir Mohammad Naderi et.al. 2402.09658 null
2024-02-14 Prediction of Activated Sludge Settling Characteristics from Microscopy Images with Deep Convolutional Neural Networks and Transfer Learning Sina Borzooei et.al. 2402.09367 link
2024-02-14 Few-Shot Object Detection with Sparse Context Transformers Jie Mei et.al. 2402.09315 null
2024-02-15 Multi-Hierarchical Surrogate Learning for Structural Dynamical Crash Simulations Using Graph Convolutional Neural Networks Jonas Kneifl et.al. 2402.09234 null
2024-02-14 Tackling Negative Transfer on Graphs Zehong Wang et.al. 2402.08907 link
2024-02-14 Multiscale graph neural networks with adaptive mesh refinement for accelerating mesh-based simulations Roberto Perera et.al. 2402.08863 null
2024-02-13 Bayesian Multi-Task Transfer Learning for Soft Prompt Tuning Haeju Lee et.al. 2402.08594 link
2024-02-13 Convolutional Neural Networks Towards Facial Skin Lesions Detection Reza Sarshar et.al. 2402.08592 null
2024-02-13 FedLPS: Heterogeneous Federated Learning for Multiple Tasks with Local Parameter Sharing Yongzhe Jia et.al. 2402.08578 link
2024-02-13 Enabling Multi-Agent Transfer Reinforcement Learning via Scenario Independent Representation Ayesha Siddika Nipu et.al. 2402.08184 null
2024-02-12 A Competition Winning Deep Reinforcement Learning Agent in microRTS Scott Goodfriend et.al. 2402.08112 link
2024-02-12 MODIPHY: Multimodal Obscured Detection for IoT using PHantom Convolution-Enabled Faster YOLO Shubhabrata Mukherjee et.al. 2402.07894 link
2024-02-13 Comparative Analysis of ImageNet Pre-Trained Deep Learning Models and DINOv2 in Medical Imaging Classification Yuning Huang et.al. 2402.07595 link
2024-02-11 Multi-Modal Emotion Recognition by Text, Speech and Video Using Pretrained Transformers Minoo Shayaninasab et.al. 2402.07327 null
2024-02-10 An Optimization Framework for Processing and Transfer Learning for the Brain Tumor Segmentation Tianyi Ren et.al. 2402.07008 null
2024-02-10 Should I try multiple optimizers when fine-tuning pre-trained Transformers for NLP tasks? Should I tune their hyperparameters? Nefeli Gkouti et.al. 2402.06948 null
2024-02-09 Transfer learning with generative models for object detection on limited datasets Matteo Paiano et.al. 2402.06784 null
2024-02-09 Transferring facade labels between point clouds with semantic octrees while considering change detection Sophia Schwarz et.al. 2402.06531 link
2024-02-09 BarlowTwins-CXR : Enhancing Chest X-Ray abnormality localization in heterogeneous data with cross-domain self-supervised learning Haoyue Sheng et.al. 2402.06499 null
2024-02-12 Text-to-Code Generation with Modality-relative Pre-training Fenia Christopoulou et.al. 2402.05783 null
2024-02-08 Transfer learning of optimal QAOA parameters in combinatorial optimization J. A. Montanez-Barrera et.al. 2402.05549 null
2024-02-05 Enhancing Textbook Question Answering Task with Large Language Models and Retrieval Augmented Generation Hessa Abdulrahman Alawwad et.al. 2402.05128 link
2024-02-07 Group Distributionally Robust Dataset Distillation with Risk Minimization Saeed Vahidian et.al. 2402.04676 link
2024-02-07 Triplet Interaction Improves Graph Transformers: Accurate Molecular Graph Learning with Triplet Graph Transformers Md Shamim Hussain et.al. 2402.04538 link
2024-02-06 Scaling Laws for Downstream Task Performance of Large Language Models Berivan Isik et.al. 2402.04177 null
2024-02-06 Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models Jianyuan Guo et.al. 2402.03749 link
2024-02-06 Symbol Correctness in Deep Neural Networks Containing Symbolic Layers Aaron Bembenek et.al. 2402.03663 null
2024-02-04 Survival and grade of the glioma prediction using transfer learning Santiago Valbuena Rubio et.al. 2402.03384 null
2024-02-05 Constrained Decoding for Cross-lingual Label Projection Duong Minh Le et.al. 2402.03131 link
2024-02-04 Pruner: An Efficient Cross-Platform Tensor Compiler with Dual Awareness Liang Qiao et.al. 2402.02361 link
2024-02-03 InceptionCapsule: Inception-Resnet and CapsuleNet with self-attention for medical image Classification Elham Sadeghnezhad et.al. 2402.02274 null
2024-02-08 Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey Yi Xin et.al. 2402.02242 link
2024-02-03 Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties Ekaterina Artemova et.al. 2402.02078 link
2024-02-03 Transfer Learning in ECG Diagnosis: Is It Effective? Cuong V. Nguyen et.al. 2402.02021 link
2024-02-03 Enhancing the efficiency of protein language models with minimal wet-lab data through few-shot learning Ziyi Zhou et.al. 2402.02004 null
2024-02-03 Online Transfer Learning for RSV Case Detection Yiming Sun et.al. 2402.01987 null
2024-02-02 Exploring transfer learning for pathological speech feature prediction: Impact of layer selection Daniela A. Wiepert et.al. 2402.01796 link
2024-02-02 cmaes : A Simple yet Practical Python Library for CMA-ES Masahiro Nomura et.al. 2402.01373 link
2024-02-05 Cascaded Scaling Classifier: class incremental learning with probability scaling Jary Pomponi et.al. 2402.01262 link
2024-02-02 Double-Dip: Thwarting Label-Only Membership Inference Attacks with Transfer Learning and Randomization Arezoo Rajabi et.al. 2402.01114 null
2024-02-01 Graph Domain Adaptation: Challenges, Progress and Prospects Boshen Shi et.al. 2402.00904 link
2024-02-01 Efficient Fine-tuning of Audio Spectrogram Transformers via Soft Mixture of Adapters Umberto Cappellazzo et.al. 2402.00828 link
2024-02-01 Control-Theoretic Techniques for Online Adaptation of Deep Neural Networks in Dynamical Systems Jacob G. Elkins et.al. 2402.00761 null
2024-02-01 HAYATE: Photometric redshift estimation by hybridising machine learning with template fitting Shingo Tanigawa et.al. 2402.00323 null
2024-01-31 MelNet: A Real-Time Deep Learning Algorithm for Object Detection Yashar Azadvatan et.al. 2401.17972 null
2024-01-30 Fine-tuning Transformer-based Encoder for Turkish Language Understanding Tasks Savas Yildirim et.al. 2401.17396 null
2024-01-30 Transfer Learning for Text Diffusion Models Kehang Han et.al. 2401.17181 null
2024-01-30 Finetuning Large Language Models for Vulnerability Detection Alexey Shestov et.al. 2401.17010 link
2024-01-30 Quantum Transfer Learning with Adversarial Robustness for Classification of High-Resolution Image Datasets Amena Khatun et.al. 2401.17009 null
2024-01-30 A Framework of Data Assimilation for Wind Flow Fields by Physics-informed Neural Networks Chang Yan et.al. 2401.17001 link
2024-01-30 Multiple Yield Curve Modeling and Forecasting using Deep Learning Ronald Richman et.al. 2401.16985 null
2024-01-29 Credit Risk Meets Large Language Models: Building a Risk Indicator from Loan Descriptions in P2P Lending Mario Sanz-Guerrero et.al. 2401.16458 null
2024-01-29 Capturing Pertinent Symbolic Features for Enhanced Content-Based Misinformation Detection Flavio Merenda et.al. 2401.16285 link
2024-01-29 Domain adaptation strategies for 3D reconstruction of the lumbar spine using real fluoroscopy data Sascha Jecklin et.al. 2401.16027 null
2024-01-29 GPS: Graph Contrastive Learning via Multi-scale Augmented Views from Adversarial Pooling Wei Ju et.al. 2401.16011 null
2024-01-29 MV2MAE: Multi-View Video Masked Autoencoders Ketul Shah et.al. 2401.15900 null
2024-01-27 Exploring the Transferability of a Foundation Model for Fundus Images: Application to Hypertensive Retinopathy Julio Silva-Rodriguez et.al. 2401.15526 null
2024-01-27 A New Method for Vehicle Logo Recognition Based on Swin Transformer Yang Li et.al. 2401.15458 null
2024-01-27 GEM: Boost Simple Network for Glass Surface Segmentation via Segment Anything Model and Data Synthesis Jing Hao et.al. 2401.15282 link
2024-01-26 Transfer Learning for the Prediction of Entity Modifiers in Clinical Text: Application to Opioid Use Disorder Case Detection Abdullateef I. Almudaifer et.al. 2401.15222 null
2024-01-26 Additional Look into GAN-based Augmentation for Deep Learning COVID-19 Image Classification Oleksandr Fedoruk et.al. 2401.14705 null
2024-01-26 Asymptotic Midpoint Mixup for Margin Balancing and Moderate Broadening Hoyong Kim et.al. 2401.14696 null
2024-01-23 Multi-Agent Based Transfer Learning for Data-Driven Air Traffic Applications Chuhao Deng et.al. 2401.14421 null
2024-01-25 Assessing the Portability of Parameter Matrices Trained by Parameter-Efficient Finetuning Methods Mohammed Sabry et.al. 2401.14228 null
2024-01-25 Deep Learning Innovations in Diagnosing Diabetic Retinopathy: The Potential of Transfer Learning and the DiaCNN Model Mohamed R. Shoaib et.al. 2401.13990 null
2024-01-25 StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models Yalong Bai et.al. 2401.13942 null
2024-01-25 A comparative study of zero-shot inference with large language models and supervised modeling in breast cancer pathology classification Madhumita Sushil et.al. 2401.13887 null
2024-01-24 Don’t Push the Button! Exploring Data Leakage Risks in Machine Learning and Transfer Learning Andrea Apicella et.al. 2401.13796 null
2024-01-24 SEDNet: Shallow Encoder-Decoder Network for Brain Tumor Segmentation Chollette C. Olisah et.al. 2401.13403 link
2024-01-23 TCE at Qur’an QA 2023 Shared Task: Low Resource Enhanced Transformer-based Ensemble Approach for Qur’anic QA Mohammed Alaa Elkomy et.al. 2401.13060 link
2024-01-23 Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning? Cheng Han et.al. 2401.12902 link
2024-01-23 Deep reinforcement transfer learning for active flow control of a 3D square cylinder under state dimension mismatch Lei Yan et.al. 2401.12543 null
2024-01-22 Contrastive Learning and Cycle Consistency-based Transductive Transfer Learning for Target Annotation Shoaib Meraj Sami et.al. 2401.12340 null
2024-01-22 Transfer Learning for Functional Mean Estimation: Phase Transition and Adaptive Algorithms T. Tony Cai et.al. 2401.12331 null
2024-01-22 Cheap Learning: Maximising Performance of Language Models for Social Data Science Using Minimal Data Leonardo Castro-Gonzalez et.al. 2401.12295 link
2024-01-22 Transfer Learning for Nonparametric Regression: Non-asymptotic Minimax Analysis and Adaptive Procedure T. Tony Cai et.al. 2401.12272 null
2024-01-21 Transfer learning-assisted inverse modeling in nanophotonics based on mixture density networks Liang Cheng et.al. 2401.12254 null
2024-01-22 Less Could Be Better: Parameter-efficient Fine-tuning Advances Medical Vision Foundation Models Chenyu Lian et.al. 2401.12215 link
2024-01-22 Cross-lingual Transfer Learning for Javanese Dependency Parsing Fadli Aulawi Al Ghiffari et.al. 2401.12072 null
2024-01-22 Feature Denoising Diffusion Model for Blind Image Quality Assessment Xudong Li et.al. 2401.11949 null
2024-01-21 Transfer Learning under Covariate Shift: Local $k$ -Nearest Neighbours Regression with Heavy-Tailed Design Petr Zamolodtchikov et.al. 2401.11554 null
2024-01-20 A Hybrid Approach of Transfer Learning and Physics-Informed Modeling: Improving Dissolved Oxygen Concentration Prediction in an Industrial Wastewater Treatment Plant Ece S. Koksal et.al. 2401.11217 null
2024-01-19 A Systematic Evaluation of Euclidean Alignment with Deep Learning for EEG Decoding Bruna Junqueira et.al. 2401.10746 null
2024-01-19 Name Tagging Under Domain Shift via Metric Learning for Life Sciences Hongyi Liu et.al. 2401.10472 link
2024-01-18 Transfer Learning in Human Activity Recognition: A Survey Sourish Gunesh Dhekane et.al. 2401.10185 null
2024-01-18 Few-shot learning for COVID-19 Chest X-Ray Classification with Imbalanced Data: An Inter vs. Intra Domain Study Alejandro Galán-Cuenca et.al. 2401.10129 link
2024-01-18 Material-Response-Informed DeepONet and its Application to Polycrystal Stress-strain Prediction in Crystal Plasticity Junyan He et.al. 2401.09977 null
2024-01-12 Transcending Controlled Environments Assessing the Transferability of ASRRobust NLU Models to Real-World Applications Hania Khan et.al. 2401.09354 null
2024-01-17 Material Informatics through Neural Networks on Ab-Initio Electron Charge Densities: the Role of Transfer Learning Dario Massa et.al. 2401.09301 null
2024-01-17 Cross-lingual Offensive Language Detection: A Systematic Review of Datasets, Transfer Approaches and Challenges Aiqi Jiang et.al. 2401.09244 link
2024-01-17 Toward Diverse Polymer Property Prediction Using Transfer Learning Elaheh Kazemi-Khasragh et.al. 2401.09139 null
2024-01-16 Using i-vectors for subject-independent cross-session EEG transfer learning Jonathan Lasko et.al. 2401.08851 null
2024-01-16 Surface-Enhanced Raman Spectroscopy and Transfer Learning Toward Accurate Reconstruction of the Surgical Zone Ashutosh Raman et.al. 2401.08821 null
2024-01-16 Selecting Subsets of Source Data for Transfer Learning with Applications in Metal Additive Manufacturing Yifan Tang et.al. 2401.08715 null
2024-01-16 N-Adaptive Ritz Method: A Neural Network Enriched Partition of Unity for Boundary Value Problems Jonghyuk Baek et.al. 2401.08544 null
2024-01-16 AGN jet-inflated bubbles as possible origin of odd radio circles Yen-Hsing Lin et.al. 2401.08207 null
2024-01-16 Transferring Core Knowledge via Learngenes Fu Feng et.al. 2401.08139 null
2024-01-15 6-DoF Grasp Pose Evaluation and Optimization via Transfer Learning from NeRFs Gergely Sóti et.al. 2401.07935 null
2024-01-15 Quantum Transfer Learning for Acceptability Judgements Giuseppe Buonaiuto et.al. 2401.07777 null
2024-01-14 Harnessing Machine Learning for Discerning AI-Generated Synthetic Images Yuyang Wang et.al. 2401.07358 null
2024-01-13 Concrete Surface Crack Detection with Convolutional-based Deep Learning Models Sara Shomal Zadeh et.al. 2401.07124 null
2024-01-13 Bayesian Signal Matching for Transfer Learning in ERP-Based Brain Computer Interface Tianwen Ma et.al. 2401.07111 null
2024-01-12 PyTy: Repairing Static Type Errors in Python Yiu Wai Chow et.al. 2401.06619 link
2024-01-12 PersianMind: A Cross-Lingual Persian-English Large Language Model Pedram Rostami et.al. 2401.06466 null
2024-01-11 Zero Resource Cross-Lingual Part Of Speech Tagging Sahil Chopra et.al. 2401.05727 null
2024-01-16 POMP: Probability-driven Meta-graph Prompter for LLMs in Low-resource Unsupervised Neural Machine Translation Shilong Pan et.al. 2401.05596 null
2024-01-10 Enhancing Blood Flow Assessment in Diffuse Correlation Spectroscopy: A Transfer Learning Approach with Noise Robustness Analysis Xi Chen et.al. 2401.05580 null
2024-01-10 VI-PANN: Harnessing Transfer Learning and Uncertainty-Aware Variational Inference for Improved Generalization in Audio Pattern Recognition John Fischer et.al. 2401.05531 link
2024-01-10 Consensus Focus for Object Detection and minority classes Erik Isai Valle Salgado et.al. 2401.05530 link
2024-01-10 Taming “data-hungry” reinforcement learning? Stability in continuous state-action spaces Yaqi Duan et.al. 2401.05233 null
2024-01-10 Neural Population Learning beyond Symmetric Zero-sum Games Siqi Liu et.al. 2401.05133 null
2024-01-09 Arabic Text Diacritization In The Age Of Transfer Learning: Token Classification Is All You Need Abderrahman Skiredj et.al. 2401.04848 null
2024-01-10 Low-Resource Vision Challenges for Foundation Models Yunhua Zhang et.al. 2401.04716 null
2024-01-09 Transfer-Learning-Based Autotuning Using Gaussian Copula Thomas Randall et.al. 2401.04669 link
2024-01-11 Tiny Time Mixers (TTMs): Fast Pretrained Models for Enhanced Zero/Few-Shot Forecasting of Multivariate Time Series Vijay Ekambaram et.al. 2401.03955 link
2024-01-08 Attention-Guided Erasing: A Novel Augmentation Method for Enhancing Downstream Breast Density Classification Adarsh Bhandary Panambur et.al. 2401.03912 null
2024-01-08 Anatomy of Neural Language Models Majd Saleh et.al. 2401.03797 link
2024-01-07 Improving Transferability of Network Intrusion Detection in a Federated Learning Setup Shreya Ghosh et.al. 2401.03560 link
2024-01-06 Efficient Bitrate Ladder Construction using Transfer Learning and Spatio-Temporal Features Ali Falahati et.al. 2401.03195 link
2024-01-06 Transferable Learned Image Compression-Resistant Adversarial Perturbations Yang Sui et.al. 2401.03115 null
2024-01-05 Physics-Informed Neural Networks for High-Frequency and Multi-Scale Problems using Transfer Learning Abdul Hannan Mustajab et.al. 2401.02810 null
2024-01-05 Detection and Classification of Diabetic Retinopathy using Deep Learning Algorithms for Segmentation to Facilitate Referral Recommendation for Test and Treatment Prediction Manoj S H et.al. 2401.02759 link
2024-01-05 Nurse-in-the-Loop Artificial Intelligence for Precision Management of Type 2 Diabetes in a Clinical Trial Utilizing Transfer-Learned Predictive Digital Twin Syed Hasib Akhter Faruqui et.al. 2401.02661 null
2024-01-05 GTA: Guided Transfer of Spatial Attention from Object-Centric Representations SeokHyun Seo et.al. 2401.02656 null
2024-01-04 Multi-Source Domain Adaptation with Transformer-based Feature Generation for Subject-Independent EEG-based Emotion Recognition Shadi Sartipi et.al. 2401.02344 null
2024-01-03 A Comparative Study with Traditional and Transfer Learning-enhanced Machine Learning Algorithms for Geotechnical Characterisation of Coal Spoil Sureka Thiruchittampalam et.al. 2401.01969 null
2024-01-03 Graph Neural Networks for Surfactant Multi-Property Prediction Christoforos Brozos et.al. 2401.01874 link
2023-12-21 Discovery of a circular symmetry extended diffuse radio emission around an elliptical galaxy with the VLA FIRST survey Shobha Kumari et.al. 2401.01278 null
2024-01-02 GBSS:a global building semantic segmentation dataset for large-scale remote sensing building extraction Yuping Hu et.al. 2401.01178 null
2024-01-01 Self-supervised learning for skin cancer diagnosis with limited training data Hamish Haggerty et.al. 2401.00692 link
2023-12-30 AClassiHonk: A System Framework to Annotate and Classify Vehicular Honk from Road Traffic Biswajit Maitya et.al. 2401.00154 null
2023-12-29 FedLED: Label-Free Equipment Fault Diagnosis with Vertical Federated Transfer Learning Jie Shen et.al. 2312.17451 null
2023-12-28 OmniDialog: An Omnipotent Pre-training Model for Task-Oriented Dialogue System Mingtao Yang et.al. 2312.16864 null
2023-12-29 GRSDet: Learning to Generate Local Reverse Samples for Few-shot Object Detection Hefei Mei et.al. 2312.16571 null
2023-12-27 Soft Contrastive Learning for Time Series Seunghan Lee et.al. 2312.16424 link
2023-12-26 EnchantDance: Unveiling the Potential of Music-Driven Dance Movement Bo Han et.al. 2312.15946 link
2023-12-25 TimesURL: Self-supervised Contrastive Learning for Universal Time Series Representation Learning Jiexi Liu et.al. 2312.15709 link
2023-12-25 APTv2: Benchmarking Animal Pose Estimation and Tracking with a Large-scale Dataset and Beyond Yuxiang Yang et.al. 2312.15612 link
2023-12-24 Leveraging Public Representations for Private Transfer Learning Pratiksha Thaker et.al. 2312.15551 link
2023-12-24 Agent based modelling for continuously varying supply chains Wan Wang et.al. 2312.15502 null
2023-12-22 Efficient Discrete Physics-informed Neural Networks for Addressing Evolutionary Partial Differential Equations Siqi Chen et.al. 2312.14608 null
2023-12-21 Crystal Growth Characterization of WSe $_2$ Thin Film Using Machine Learning Isaiah A. Moses et.al. 2312.14311 null
2023-12-25 Hierarchical Topology Isomorphism Expertise Embedded Graph Contrastive Learning Jiangmeng Li et.al. 2312.14222 link
2023-12-21 BrainTalker: Low-Resource Brain-to-Speech Synthesis with Transfer Learning using Wav2Vec 2.0 Miseul Kim et.al. 2312.13600 null
2023-12-21 Fine-tuning Graph Neural Networks by Preserving Graph Generative Patterns Yifei Sun et.al. 2312.13583 link
2023-12-20 Bayesian Transfer Learning Piotr M. Suder et.al. 2312.13484 null
2023-12-20 1D-CNN Optimization for Non-contact Respiration Pattern Classification Md Zobaer Islam et.al. 2312.13035 null
2023-12-20 Heterogeneous Transfer Learning for Building High-Dimensional Generalized Linear Models with Disparate Datasets Ruzhang Zhao et.al. 2312.12786 link
2023-12-20 A Closer Look at the Few-Shot Adaptation of Large Vision-Language Models Julio Silva-Rodriguez et.al. 2312.12730 link
2023-12-19 H-ensemble: An Information Theoretic Approach to Reliable Few-Shot Multi-Source-Free Transfer Yanru Wu et.al. 2312.12489 null
2023-12-19 Value Explicit Pretraining for Goal-Based Transfer Learning Kiran Lekkala et.al. 2312.12339 null
2023-12-19 Empowering Dual-Level Graph Self-Supervised Pretraining with Motif Discovery Pengwei Yan et.al. 2312.11927 link
2023-12-19 Point Cloud Segmentation Using Transfer Learning with RandLA-Net: A Case Study on Urban Areas Alperen Enes Bayar et.al. 2312.11880 null
2023-12-18 AI-Based Energy Transportation Safety: Pipeline Radial Threat Estimation Using Intelligent Sensing System Chengyuan Zhu et.al. 2312.11583 null
2023-12-18 Ensuring Cross-Device Portability of Electromagnetic Side-Channel Analysis Lojenaa Navanesana et.al. 2312.11301 null
2023-12-18 LaViP:Language-Grounded Visual Prompts Nilakshan Kunananthaseelan et.al. 2312.10945 null
2023-12-18 Domain adaption and physical constrains transfer learning for shale gas production Zhaozhong Yang et.al. 2312.10920 null
2023-12-17 Cross-Domain Robustness of Transformer-based Keyphrase Generation Anna Glazkova et.al. 2312.10700 null
2023-12-17 p-Laplacian Adaptation for Generative Pre-trained Vision-Language Models Haoyuan Wu et.al. 2312.10613 link
2023-12-16 Optimizing Dense Feed-Forward Neural Networks Luis Balderas et.al. 2312.10560 null
2023-12-15 One Self-Configurable Model to Solve Many Abstract Visual Reasoning Problems Mikołaj Małkiński et.al. 2312.09997 link
2023-12-18 Multi-Modality is All You Need for Transferable Recommender Systems Youhua Li et.al. 2312.09602 link
2023-12-21 Enhancing Data Lakes with GraphAr: Efficient Graph Data Management with a Specialized Storage Scheme Xue Li et.al. 2312.09577 link
2023-12-14 Weight subcloning: direct initialization of transformers using larger pretrained ones Mohammad Samragh et.al. 2312.09299 null
2023-12-14 Bayesian Optimization for Robust State Preparation in Quantum Many-Body Systems Tizian Blatz et.al. 2312.09253 null
2023-12-14 Applying Pre-Trained Deep-Learning Model on Wrist Angel Data – An Analysis Plan Harald Vilhelm Skat-Rørdam et.al. 2312.09052 null
2023-12-14 Context-PEFT: Efficient Multi-Modal, Multi-Task Fine-Tuning Avelina Asada Hadji-Kyriacou et.al. 2312.08900 null
2023-12-12 AdaptIR: Parameter Efficient Multi-task Adaptation for Pre-trained Image Restoration Models Hang Guo et.al. 2312.08881 link
2023-12-15 VMT-Adapter: Parameter-Efficient Transfer Learning for Multi-Task Dense Scene Understanding Yi Xin et.al. 2312.08733 null
2023-12-14 MmAP : Multi-modal Alignment Prompt for Cross-domain Multi-task Learning Yi Xin et.al. 2312.08636 null
2023-12-13 Distributional Robustness and Transfer Learning Through Empirical Bayes Michael Law et.al. 2312.08485 null
2023-12-13 Explainable AI in Grassland Monitoring: Enhancing Model Performance and Domain Adaptability Shanghua Liu et.al. 2312.08408 null
2023-12-12 Taking it further: leveraging pseudo labels for field delineation across label-scarce smallholder regions Philippe Rufin et.al. 2312.08384 null
2023-12-13 Robust Few-Shot Named Entity Recognition with Boundary Discrimination and Correlation Purification Xiaojun Xue et.al. 2312.07961 link
2023-12-13 DTL: Disentangled Transfer Learning for Visual Recognition Minghao Fu et.al. 2312.07856 link
2023-12-12 Automated Behavioral Analysis Using Instance Segmentation Chen Yang et.al. 2312.07723 link
2023-12-12 Reacting like Humans: Incorporating Intrinsic Human Behaviors into NAO through Sound-Based Reactions for Enhanced Sociability Ali Ghadami et.al. 2312.07671 null
2023-12-10 COVID-19 Detection Using Slices Processing Techniques and a Modified Xception Classifier from Computed Tomography Images Kenan Morani et.al. 2312.07580 link
2023-12-12 Medical Image Classification Using Transfer Learning and Chaos Game Optimization on the Internet of Medical Things Alhassan Mabrouk et.al. 2312.07437 null
2023-12-12 NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image Yoonwoo Jeong et.al. 2312.07315 link
2023-12-12 Neural Machine Translation of Clinical Text: An Empirical Investigation into Multilingual Pre-Trained Language Models and Transfer-Learning Lifeng Han et.al. 2312.07250 link
2023-12-12 Dynamic Corrective Self-Distillation for Better Fine-Tuning of Pretrained Models Ibtihel Amara et.al. 2312.07028 null
2023-12-12 READ-PVLA: Recurrent Adapter with Partial Video-Language Alignment for Parameter-Efficient Transfer Learning in Low-Resource Video-Language Modeling Thong Nguyen et.al. 2312.06950 link
2023-12-12 Enhanced Q-Learning Approach to Finite-Time Reachability with Maximum Probability for Probabilistic Boolean Control Networks Hongyue Fan et.al. 2312.06904 null
2023-12-14 Understanding and Leveraging the Learning Phases of Neural Networks Johannes Schneider et.al. 2312.06887 null
2023-12-11 The improved backward compatible physics-informed neural networks for reducing error accumulation and applications in data-driven higher-order rogue waves Shuning Lin et.al. 2312.06715 null
2023-12-11 Stoch BiRo: Design and Control of a low cost bipedal robot GVS Mothish et.al. 2312.06512 null
2023-12-11 Towards Domain-Specific Cross-Corpus Speech Emotion Recognition Approach Yan Zhao et.al. 2312.06466 null
2023-12-11 The Intrinsic Sizes of Odd Radio Circles David Rupke et.al. 2312.06387 null
2023-12-11 MMDesign: Multi-Modality Transfer Learning for Generative Protein Design Jiangbin Zheng et.al. 2312.06297 null
2023-12-10 Natural Interaction Modalities for Human-CPS Interaction in Construction Progress Monitoring Srijeet Halder et.al. 2312.05988 null
2023-12-10 Jumpstarting Surgical Computer Vision Deepak Alapatt et.al. 2312.05968 null
2023-12-10 Initialization Matters for Adversarial Transfer Learning Andong Hua et.al. 2312.05716 link
2023-12-09 Teamwork Dimensions Classification Using BERT Junyoung Lee et.al. 2312.05483 null
2023-12-09 Model Evaluation for Domain Identification of Unknown Classes in Open-World Recognition: A Proposal Gusti Ahmad Fanshuri Alfarisy et.al. 2312.05454 null
2023-12-07 Enhancing Polynomial Chaos Expansion Based Surrogate Modeling using a Novel Probabilistic Transfer Learning Strategy Wyatt Bridgman et.al. 2312.04648 null
2023-12-07 TLCE: Transfer-Learning Based Classifier Ensembles for Few-Shot Class-Incremental Learning Shuangmei Wang et.al. 2312.04225 null
2023-12-07 Small Area Estimation of Case Growths for Timely COVID-19 Outbreak Detection Zhaowei She et.al. 2312.04110 link
2023-12-07 A Review and Taxonomy of Methods for Quantifying Dataset Similarity Marieke Stolte et.al. 2312.04078 null
2023-12-06 A Scalable and Generalizable Pathloss Map Prediction Ju-Hyung Lee et.al. 2312.03950 link
2023-12-07 Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers Umberto Cappellazzo et.al. 2312.03694 link
2023-12-06 Transfer learning for galaxy feature detection: Finding Giant Star-forming Clumps in low redshift galaxies using Faster R-CNN Jürgen Popp et.al. 2312.03503 link
2023-12-07 SVQ: Sparse Vector Quantization for Spatiotemporal Forecasting Chao Chen et.al. 2312.03406 link
2023-12-06 Optimizing Two-Pass Cross-Lingual Transfer Learning: Phoneme Recognition and Phoneme to Grapheme Translation Wonjun Lee et.al. 2312.03312 null
2023-12-06 Customizable Combination of Parameter-Efficient Modules for Multi-Task Learning Haowen Wang et.al. 2312.03248 null
2023-12-05 Enhanced Breast Cancer Tumor Classification using MobileNetV2: A Detailed Exploration on Image Intensity, Error Mitigation, and Streamlit-driven Real-time Deployment Aaditya Surya et.al. 2312.03020 null
2023-12-05 Applications of Domain Adversarial Neural Network in phase transition of 3D Potts model Xiangna Chen et.al. 2312.02479 null
2023-12-02 Disentangling the Effects of Data Augmentation and Format Transform in Self-Supervised Learning of Image Representations Neha Kalibhat et.al. 2312.02205 null
2023-12-04 VLTSeg: Simple Transfer of CLIP-Based Vision-Language Representations for Domain Generalized Semantic Segmentation Christoph Hümmer et.al. 2312.02021 null
2023-12-03 Robust Computer Vision in an Ever-Changing World: A Survey of Techniques for Tackling Distribution Shifts Eashan Adhikarla et.al. 2312.01540 null
2023-12-03 Facial Emotion Recognition Under Mask Coverage Using a Data Augmentation Technique Aref Farhadipour et.al. 2312.01335 link
2023-12-02 A Comparative Analysis Towards Melanoma Classification Using Transfer Learning by Analyzing Dermoscopic Images Md. Fahim Uddin et.al. 2312.01212 null
2023-12-02 Efficient Expansion and Gradient Based Task Inference for Replay Free Incremental Learning Soumya Roy et.al. 2312.01188 null
2023-12-02 SASSL: Enhancing Self-Supervised Learning via Neural Style Transfer Renan A. Rojas-Gomez et.al. 2312.01187 null
2023-12-02 Rapid Speaker Adaptation in Low Resource Text to Speech Systems using Synthetic Data and Transfer learning Raviraj Joshi et.al. 2312.01107 null
2023-12-02 Code-Mixed Text to Speech Synthesis under Low-Resource Constraints Raviraj Joshi et.al. 2312.01103 null
2023-12-02 On the Effects of Randomness on Stability of Learning with Limited Labelled Data: A Systematic Literature Review Branislav Pecher et.al. 2312.01082 null
2023-12-02 Acoustic Signal Analysis with Deep Neural Network for Detecting Fault Diagnosis in Industrial Machines Mustafa Yurdakul et.al. 2312.01062 null
2023-12-02 Scaling Whole-Chip QAOA for Higher-Order Ising Spin Glass Models on Heavy-Hex Graphs Elijah Pelofske et.al. 2312.00997 link
2023-12-04 Simple Transferability Estimation for Regression Tasks Cuong N. Nguyen et.al. 2312.00656 link
2023-12-01 Pathway to a fully data-driven geotechnics: lessons from materials informatics Stephen Wu et.al. 2312.00581 null
2023-12-01 Explainable AI in Diagnosing and Anticipating Leukemia Using Transfer Learning Method Wahidul Hasan Abir et.al. 2312.00487 null
2023-12-01 Transfer learning for predicting source terms of principal component transport in chemically reactive flow Ki Sung Jung et.al. 2312.00356 null
2023-12-01 Student Activity Recognition in Classroom Environments using Transfer Learning Anagha Deshpande et.al. 2312.00348 null
2023-11-30 Stochastic Vision Transformers with Wasserstein Distance-Aware Attention Franciskus Xaverius Erick et.al. 2311.18645 null
2023-11-30 Calibration-free online test-time adaptation for electroencephalography motor imagery decoding Martin Wimpff et.al. 2311.18520 link
2023-11-30 Transfer Learning across Different Chemical Domains: Virtual Screening of Organic Materials with Deep Learning Models Pretrained on Small Molecule and Chemical Reaction Data Chengwei Zhang et.al. 2311.18377 null
2023-12-01 Learning Robust Precipitation Forecaster by Temporal Frame Interpolation Lu Han et.al. 2311.18341 link
2023-11-29 Transfer Learning in Robotics: An Upcoming Breakthrough? A Review of Promises and Challenges Noémie Jaquier et.al. 2311.18044 null
2023-11-29 Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings Andrea W Wen-Yi et.al. 2311.18034 link
2023-11-29 Latent Alignment with Deep Set EEG Decoders Stylianos Bakas et.al. 2311.17968 null
2023-11-29 Skilful Precipitation Nowcasting Using NowcastNet Ajitabh Kumar et.al. 2311.17961 null
2023-11-30 Grounding Foundation Models through Federated Transfer Learning: A General Framework Yan Kang et.al. 2311.17431 null
2023-11-27 Data Imbalance, Uncertainty Quantification, and Generalization via Transfer Learning in Data-driven Parameterizations: Lessons from the Emulation of Gravity Wave Momentum Transport in WACCM Y. Qiang Sun et.al. 2311.17078 link
2023-11-28 Natural Language Processing Through Transfer Learning: A Case Study on Sentiment Analysis Aman Yadav et.al. 2311.16965 null
2023-11-29 ROSO: Improving Robotic Policy Inference via Synthetic Observations Yusuke Miyashita et.al. 2311.16680 link
2023-11-28 Empowering COVID-19 Detection: Optimizing Performance Through Fine-Tuned EfficientNet Deep Learning Architecture Md. Alamin Talukder et.al. 2311.16593 null
2023-11-28 FedAL: Black-Box Federated Knowledge Distillation Enabled by Adversarial Learning Pengchao Han et.al. 2311.16584 null
2023-11-29 Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos Takehiko Ohkawa et.al. 2311.16444 null
2023-11-27 Transformer-QEC: Quantum Error Correction Code Decoding with Transferable Transformers Hanrui Wang et.al. 2311.16082 null
2023-11-27 Towards Transfer Learning for Large-Scale Image Classification Using Annealing-based Quantum Boltzmann Machines Daniëlle Schuman et.al. 2311.15966 null
2023-11-27 Side4Video: Spatial-Temporal Side Network for Memory-Efficient Image-to-Video Transfer Learning Huanjin Yao et.al. 2311.15769 link
2023-11-27 Machine Learning-Based Jamun Leaf Disease Detection: A Comprehensive Review Auvick Chandra Bhowmik et.al. 2311.15741 null
2023-11-27 Adinkra Symbol Recognition using Classical Machine Learning and Deep Learning Michael Adjeisah et.al. 2311.15728 null
2023-11-27 Improving Adaptability and Generalizability of Efficient Transfer Learning for Vision-Language Models Yongjin Yang et.al. 2311.15569 link
2023-11-26 Untargeted Code Authorship Evasion with Seq2Seq Transformation Soohyeon Choi et.al. 2311.15366 null
2023-11-26 How much data do I need? A case study on medical data Ayse Betul Cengiz et.al. 2311.15331 null
2023-11-25 nlpBDpatriots at BLP-2023 Task 2: A Transfer Learning Approach to Bangla Sentiment Analysis Dhiman Goswami et.al. 2311.15032 null
2023-11-25 One-Shot Transfer Learning for Nonlinear ODEs Wanzhou Lei et.al. 2311.14931 null
2023-11-24 A Reusable AI-Enabled Defect Detection System for Railway Using Ensembled CNN Rahatara Ferdousi et.al. 2311.14824 null
2023-11-24 Data-driven Prior Learning for Bayesian Optimisation Sigrid Passano Hellan et.al. 2311.14653 link
2023-11-24 Machine Translation for Ge’ez Language Aman Kassahun Wassie et.al. 2311.14530 null
2023-11-23 Video Anomaly Detection using GAN Anikeit Sethi et.al. 2311.14095 null
2023-11-23 On the Hyperparameter Landscapes of Machine Learning Algorithms Mingyu Huang et.al. 2311.14014 null
2023-11-23 Bridging Classical and Quantum Machine Learning: Knowledge Transfer From Classical to Quantum Neural Networks Using Knowledge Distillation Mohammad Junayed Hasan et.al. 2311.13810 null
2023-11-22 End-to-end Transfer Learning for Speaker-independent Cross-language Speech Emotion Recognition Duowei Tang et.al. 2311.13678 null
2023-11-23 Transfer Learning-based Real-time Handgun Detection Youssef Elmir et.al. 2311.13559 null
2023-11-22 Recurrent neural networks and transfer learning for elasto-plasticity in woven composites Ehsan Ghane et.al. 2311.13434 link
2023-11-21 InteRACT: Transformer Models for Human Intent Prediction Conditioned on Robot Actions Kushal Kedia et.al. 2311.12943 null
2023-11-21 Digital Twin Framework for Optimal and Autonomous Decision-Making in Cyber-Physical Systems: Enhancing Reliability and Adaptability in the Oil and Gas Industry Carine Menezes Rebello et.al. 2311.12755 null
2023-11-21 Resilient Control of Networked Microgrids using Vertical Federated Reinforcement Learning: Designs and Real-Time Test-Bed Validations Sayak Mukherjee et.al. 2311.12264 null
2023-11-20 Broadband non-thermal emission of odd radio circles induced by galactic outflow remnants and their evolution Yutaka Fujita et.al. 2311.12099 null
2023-11-17 Using Guided Transfer Learning to Predispose AI Agent to Learn Efficiently from Small RNA-sequencing Datasets Kevin Li et.al. 2311.12045 null
2023-11-17 TransCDR: a deep learning model for enhancing the generalizability of cancer drug response prediction through transfer learning and multimodal data fusion for drug representation Xiaoqiong Xia et.al. 2311.12040 link
2023-11-20 High-performance cVEP-BCI under minimal calibration Yining Miao et.al. 2311.11596 null
2023-11-20 Event Camera Data Dense Pre-training Yan Yang et.al. 2311.11533 null
2023-11-19 Towards interpretable-by-design deep learning algorithms Plamen Angelov et.al. 2311.11396 null
2023-11-19 RflyMAD: A Dataset for Multicopter Fault Detection and Health Assessment Xiangli Le et.al. 2311.11340 null
2023-11-18 Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning Clifton Poth et.al. 2311.11077 link
2023-11-18 Bit Cipher – A Simple yet Powerful Word Representation System that Integrates Efficiently with Language Models Haoran Zhao et.al. 2311.11012 null
2023-11-18 Gendec: A Machine Learning-based Framework for Gender Detection from Japanese Names Duong Tien Pham et.al. 2311.11001 null
2023-11-18 Towards Robust and Accurate Visual Prompting Qi Li et.al. 2311.10992 null
2023-11-17 SpACNN-LDVAE: Spatial Attention Convolutional Latent Dirichlet Variational Autoencoder for Hyperspectral Pixel Unmixing Soham Chitnis et.al. 2311.10701 null
2023-11-17 Physics-Enhanced Multi-fidelity Learning for Optical Surface Imprint Yongchao Chen et.al. 2311.10278 null
2023-11-16 Harnessing Transformers: A Leap Forward in Lung Cancer Image Detection Amine Bechar et.al. 2311.09942 null
2023-11-16 Network Wide Evacuation Traffic Prediction in a Rapidly Intensifying Hurricane from Traffic Detectors and Facebook Movement Data: A Deep Learning Approach Md Mobasshir Rashid et.al. 2311.09498 null
2023-11-15 Combining Transfer Learning with In-context Learning using Blackbox LLMs for Zero-shot Knowledge Base Question Answering Mayur Patidar et.al. 2311.08894 link
2023-11-15 Language Semantic Graph Guided Data-Efficient Learning Wenxuan Ma et.al. 2311.08782 link
2023-11-15 Discovery of Diffuse Radio Source in Abell 1060 Kohei Kurahara et.al. 2311.08693 null
2023-11-14 Peer is Your Pillar: A Data-unbalanced Conditional GANs for Few-shot Image Generation Ziqiang Li et.al. 2311.08217 null
2023-11-14 Residual Importance Weighted Transfer Learning For High-dimensional Linear Regression Junlong Zhao et.al. 2311.07972 link
2023-11-14 Cross-subject dual-domain fusion network with task-related and task-discriminant component analysis enhancing one-shot SSVEP classification Yang Deng et.al. 2311.07932 link
2023-11-13 FedOpenHAR: Federated Multi-Task Transfer Learning for Sensor-Based Human Activity Recognition Egemen İşgüder et.al. 2311.07765 null
2023-11-13 Histopathologic Cancer Detection Varan Singh Rohila et.al. 2311.07711 link
2023-11-16 Lattice relaxation, electronic structure and continuum model for twisted bilayer MoTe $_2$ Ning Mao et.al. 2311.07533 null
2023-11-13 Fine-Tuning the Retrieval Mechanism for Tabular Deep Learning Felix den Breejen et.al. 2311.07343 null
2023-11-13 C-Procgen: Empowering Procgen with Controllable Contexts Zhenxiong Tan et.al. 2311.07312 null
2023-11-13 TIAGo RL: Simulated Reinforcement Learning Environments with Tactile Data for Mobile Robots Luca Lach et.al. 2311.07260 null
2023-11-13 Developing a Named Entity Recognition Dataset for Tagalog Lester James V. Miranda et.al. 2311.07161 link
2023-11-13 PICS in Pics: Physics Informed Contour Selection for Rapid Image Segmentation Vikas Dwivedi et.al. 2311.07002 null
2023-11-12 Sharing, Teaching and Aligning: Knowledgeable Transfer Learning for Cross-Lingual Machine Reading Comprehension Tingfeng Cao et.al. 2311.06758 null
2023-11-12 Transfer Learning to Detect COVID-19 Coughs with Incremental Addition of Patient Coughs to Healthy People’s Cough Detection Models Sudip Vhaduri et.al. 2311.06707 null
2023-11-10 Transfer Learning for Structured Pruning under Limited Task Data Lucio Dery et.al. 2311.06382 null
2023-11-10 Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Bin Xiao et.al. 2311.06242 link
2023-11-10 Deep learning segmentation of fibrous cap in intravascular optical coherence tomography images Juhwan Lee et.al. 2311.06202 null
2023-11-15 Cluster Expansion by Transfer Learning from Empirical Potentials A. Dana et.al. 2311.06179 link
2023-11-10 Deep Fast Vision: A Python Library for Accelerated Deep Transfer Learning Vision Prototyping Fabi Prezja et.al. 2311.06169 link
2023-11-10 Comparing Male Nyala and Male Kudu Classification using Transfer Learning with ResNet-50 and VGG-16 T. T Lemani et.al. 2311.05981 null
2023-11-10 Adaptive Variance Thresholding: A Novel Approach to Improve Existing Deep Transfer Vision Models and Advance Automatic Knee-Joint Osteoarthritis Classification Fabi Prezja et.al. 2311.05799 null
2023-11-09 Deep Learning Architecture for Network-Efficiency at the Edge Akrit Mudvari et.al. 2311.05739 null
2023-11-09 Enhancing Instance-Level Image Classification with Set-Level Labels Renyu Zhang et.al. 2311.05659 null
2023-11-09 Disentangling Quantum and Classical Contributions in Hybrid Quantum Machine Learning Architectures Michael Kölle et.al. 2311.05559 null
2023-11-09 Generalization in medical AI: a perspective on developing scalable models Joachim A. Behar et.al. 2311.05418 null
2023-11-09 Weakly-supervised Deep Cognate Detection Framework for Low-Resourced Languages Using Morphological Knowledge of Closely-Related Languages Koustava Goswami et.al. 2311.05155 link
2023-11-08 Active Transfer Learning for Efficient Video-Specific Human Pose Estimation Hiromu Taketsugu et.al. 2311.05041 link
2023-11-08 Transfer learning from a sparsely annotated dataset of 3D medical images Gabriel Efrain Humpire-Mamani et.al. 2311.05032 link
2023-11-09 On Characterizing the Evolution of Embedding Space of Neural Networks using Algebraic Topology Suryaka Suresh et.al. 2311.04592 link
2023-11-07 Improved Child Text-to-Speech Synthesis through Fastpitch-based Transfer Learning Rishabh Jain et.al. 2311.04313 link
2023-11-07 Elastic Information Bottleneck Yuyan Ni et.al. 2311.03955 null
2023-11-07 Sparse Contrastive Learning of Sentence Embeddings Ruize An et.al. 2311.03881 null
2023-11-07 Mini but Mighty: Finetuning ViTs with Mini Adapters Imad Eddine Marouf et.al. 2311.03873 link
2023-11-03 Determination of droplet size from wide-angle light scattering image data using convolutional neural networks Tom Kirstein et.al. 2311.03387 null
2023-11-06 Risk of Transfer Learning and its Applications in Finance Haoyang Cao et.al. 2311.03283 null
2023-11-06 Machine Learning-Based Tea Leaf Disease Detection: A Comprehensive Review Faruk Ahmed et.al. 2311.03240 null
2023-11-06 Quantifying the value of information transfer in population-based SHM Aidan J. Hughes et.al. 2311.03083 null
2023-11-06 TabRepo: A Large Scale Repository of Tabular Model Evaluations and its AutoML Applications David Salinas et.al. 2311.02971 link
2023-11-06 Understanding Deep Representation Learning via Layerwise Feature Compression and Discrimination Peng Wang et.al. 2311.02960 link
2023-11-06 AttentioNet: Monitoring Student Attention Type in Learning with EEG-Based Measurement System Dhruv Verma et.al. 2311.02924 null
2023-11-05 AI Techniques for Uncovering Resolved Planetary Nebula Candidates from Wide-field VPHAS+ Survey Data Ruiqi Sun et.al. 2311.02607 null
2023-11-03 Robust Fine-Tuning of Vision-Language Models for Domain Generalization Kevin Vogt-Lowell et.al. 2311.02236 link
2023-11-03 Active Learning-Based Species Range Estimation Christian Lange et.al. 2311.02061 link
2023-11-03 A Data-Driven Approach to Coarse-Graining Simple Liquids in Confinement Ishan Nadkarni et.al. 2311.02042 null
2023-11-03 Vicinal Risk Minimization for Few-Shot Cross-lingual Transfer in Abusive Language Detection Gretel Liz De la Peña Sarracén et.al. 2311.02025 null
2023-11-03 CheX-Nomaly: Segmenting Lung Abnormalities from Chest Radiographs using Machine Learning Sanskriti Singh et.al. 2311.01777 null
2023-11-03 Capturing Local and Global Features in Medical Images by Using Ensemble CNN-Transformer Javad Mirzapour Kaleybar et.al. 2311.01731 null
2023-11-02 Adversary ML Resilience in Autonomous Driving Through Human Centered Perception Mechanisms Aakriti Shah et.al. 2311.01478 null
2023-11-02 Scattering Vision Transformer: Spectral Mixing Matters Badri N. Patro et.al. 2311.01310 null
2023-11-02 M&M3D: Multi-Dataset Training and Efficient Network for Multi-view 3D Object Detection Hang Zhang et.al. 2311.00986 link
2023-11-02 IndoToD: A Multi-Domain Indonesian Benchmark For End-to-End Task-Oriented Dialogue Systems Muhammad Dehan Al Kautsar et.al. 2311.00958 link
2023-11-01 The Quantum Cartpole: A benchmark environment for non-linear reinforcement learning Kai Meinerz et.al. 2311.00756 null
2023-10-31 Investigating Relative Performance of Transfer and Meta Learning Benji Alwis et.al. 2311.00727 null
2023-11-01 Transfer learning for improved generalizability in causal physics-informed neural networks for beam simulations Taniya Kapoor et.al. 2311.00578 null
2023-11-01 TLMCM Network for Medical Image Hierarchical Multi-Label Classification Meng Wu et.al. 2311.00282 null
2023-10-31 Graph Neural Networks for Road Safety Modeling: Datasets and Evaluations for Accident Analysis Abhinav Nippani et.al. 2311.00164 link
2023-10-31 Dynamically Updating Event Representations for Temporal Relation Classification with Multi-category Learning Fei Cheng et.al. 2310.20236 null
2023-10-31 Self-supervised Pre-training for Precipitation Post-processor Sojung An et.al. 2310.20187 null
2023-10-30 Topological Learning for Motion Data via Mixed Coordinates Hengrui Luo et.al. 2310.19960 link
2023-10-31 Promise:Prompt-driven 3D Medical Image Segmentation Using Pretrained Image Foundation Models Hao Li et.al. 2310.19721 link
2023-10-30 CreoleVal: Multilingual Multitask Benchmarks for Creoles Heather Lent et.al. 2310.19567 link
2023-10-30 On consequences of finetuning on data with highly discriminative features Wojciech Masarczyk et.al. 2310.19537 null
2023-10-30 AdapINT: A Flexible and Adaptive In-Band Network Telemetry System Based on Deep Reinforcement Learning Penghui Zhang et.al. 2310.19331 null
2023-10-30 Adapter Pruning using Tropical Characterization Rishabh Bhardwaj et.al. 2310.19232 null
2023-10-29 BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping Srikumar Sastry et.al. 2310.19168 link
2023-10-29 Transfer Learning in Transformer-Based Demand Forecasting For Home Energy Management System Gargya Gokhale et.al. 2310.19159 null
2023-10-29 Automaton Distillation: Neuro-Symbolic Transfer Learning for Deep Reinforcement Learning Suraj Singireddy et.al. 2310.19137 null
2023-10-29 A transfer learning approach with convolutional neural network for Face Mask Detection Abolfazl Younesi et.al. 2310.18928 null
2023-10-29 QWID: Quantized Weed Identification Deep neural network Parikshit Singh Rathore et.al. 2310.18921 link
2023-10-27 Parameter-Efficient Methods for Metastases Detection from Clinical Notes Maede Ashofteh Barabadi et.al. 2310.18472 null
2023-10-27 Large-scale Foundation Models and Generative AI for BigData Neuroscience Ran Wang et.al. 2310.18377 null
2023-10-26 Can LLMs Grade Short-answer Reading Comprehension Questions : Foundational Literacy Assessment in LMICs Owen Henkel et.al. 2310.18373 null
2023-10-27 Transductive conformal inference with adaptive scores Ulysse Gazin et.al. 2310.18108 link
2023-10-27 CPIA Dataset: A Comprehensive Pathological Image Analysis Dataset for Self-supervised Learning Pre-training Nan Ying et.al. 2310.17902 link
2023-10-26 Feature Extraction and Classification from Planetary Science Datasets enabled by Machine Learning Conor Nixon et.al. 2310.17681 null
2023-10-26 PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word Tokenization on Downstream Applications Yang Tan et.al. 2310.17415 link
2023-10-27 De-novo Chemical Reaction Generation by Means of Temporarily Convolutional Neural Networks Andrei Buin et.al. 2310.17341 null
2023-10-26 Deep Learning on SAR Imagery: Transfer Learning Versus Randomly Initialized Weights Morteza Karimzadeh et.al. 2310.17126 link
2023-10-25 An Efficient Deep Learning-based approach for Recognizing Agricultural Pests in the Wild Mohtasim Hadi Rafi et.al. 2310.16991 null
2023-10-25 Transferring a molecular foundation model for polymer property predictions Pei Zhang et.al. 2310.16958 null
2023-10-25 Learning Transfers over Several Programming Languages Razan Baltaji et.al. 2310.16937 null
2023-10-24 Deep Learning Models for Classification of COVID-19 Cases by Medical Images Amir Ali et.al. 2310.16851 null
2023-10-26 Deep machine learning for meteor monitoring: advances with transfer learning and gradient-weighted class activation mapping Eloy Peña-Asensio et.al. 2310.16826 null
2023-10-25 CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images Aaron Gokaslan et.al. 2310.16825 link
2023-10-25 From Pointwise to Powerhouse: Initialising Neural Networks with Generative Models Christian Harder et.al. 2310.16695 null
2023-10-24 Combining Behaviors with the Successor Features Keyboard Wilka Carvalho et.al. 2310.15940 null
2023-10-24 Ensemble of Task-Specific Language Models for Brain Encoding Sanjai Kumaran et.al. 2310.15720 link
2023-10-24 Transfer learning for day-ahead load forecasting: a case study on European national electricity demand time series Alexandros-Menelaos Tzortzis et.al. 2310.15555 link
2023-10-23 Burgers’ pinns with implicit euler transfer learning Vitória Biesek et.al. 2310.15343 null
2023-10-23 Ionized Gas Extended Over 40 kpc in an Odd Radio Circle Host Galaxy Alison L. Coil et.al. 2310.15162 null
2023-10-23 Quantum Federated Learning With Quantum Networks Tyler Wang et.al. 2310.15084 null
2023-10-20 A Novel Transfer Learning Method Utilizing Acoustic and Vibration Signals for Rotating Machinery Fault Diagnosis Zhongliang Chen et.al. 2310.14796 null
2023-10-22 Mobile Traffic Prediction at the Edge through Distributed and Transfer Learning Alfredo Petrella et.al. 2310.14456 null
2023-10-22 Cross-Domain HAR: Few Shot Transfer Learning for Human Activity Recognition Megha Thukral et.al. 2310.14390 null
2023-10-21 On the Transferability of Visually Grounded PCFGs Yanpeng Zhao et.al. 2310.14107 link
2023-10-21 Convolutional Bidirectional Variational Autoencoder for Image Domain Translation of Dotted Arabic Expiration Ahmed Zidane et.al. 2310.14069 null
2023-10-21 Minimax Optimal Transfer Learning for Kernel-based Nonparametric Regression Chao Wang et.al. 2310.13966 null
2023-10-20 Foundation Model’s Embedded Representations May Detect Distribution Shift Adam Tsou et.al. 2310.13836 null
2023-10-20 Using Human-like Mechanism to Weaken Effect of Pre-training Weight Bias in Face-Recognition Convolutional Neural Network Haojiang Ying et.al. 2310.13674 null
2023-10-20 Diagnosis-oriented Medical Image Compression with Efficient Transfer Learning Guangqi Xie et.al. 2310.13250 null
2023-10-20 The Less the Merrier? Investigating Language Representation in Multilingual Models Hellina Hailu Nigatu et.al. 2310.13228 null
2023-10-19 Streamlining Brain Tumor Classification with Custom Transfer Learning in MRI Images Javed Hossain et.al. 2310.13108 null
2023-10-19 Unsupervised Representation Learning to Aid Semi-Supervised Meta Learning Atik Faysal et.al. 2310.13085 link
2023-10-19 Representation Learning via Consistent Assignment of Views over Random Partitions Thalles Silva et.al. 2310.12692 link
2023-10-18 Adaptive Fine-tuning based Transfer Learning for the Identification of MGMT Promoter Methylation Status Erich Schmitz et.al. 2310.12373 link
2023-10-18 New Environment Adaptation with Few Shots for OFDM Receiver and mmWave Beamforming Ouya Wang et.al. 2310.12343 null
2023-10-17 Precise influence evaluation in complex networks Bingyu Zhu et.al. 2310.12181 link
2023-10-19 Prototype-based HyperAdapter for Sample-Efficient Multi-task Tuning Hao Zhao et.al. 2310.11670 link
2023-10-17 Predicting polymerization reactions via transfer learning using chemical language models Brenda S. Ferrari et.al. 2310.11423 link
2023-10-17 Relearning Forgotten Knowledge: on Forgetting, Overfit and Training-Free Ensembles of DNNs Uri Stern et.al. 2310.11094 null
2023-10-16 Electric dipole polarizability of low-lying excited states in atomic nuclei José Nicolás Orce et.al. 2310.10775 null
2023-10-16 UNO-DST: Leveraging Unlabelled Data in Zero-Shot Dialogue State Tracking Chuang Li et.al. 2310.10492 link
2023-10-16 Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning Chong Li et.al. 2310.10318 link
2023-10-16 Structural transfer learning of non-Gaussian DAG Mingyang Ren et.al. 2310.10239 null
2023-10-15 Class-Specific Data Augmentation: Bridging the Imbalance in Multiclass Breast Cancer Classification Kanan Mahammadli et.al. 2310.09981 null
2023-10-18 BanglaNLP at BLP-2023 Task 2: Benchmarking different Transformer Models for Sentiment Analysis of Bangla Social Media Posts Saumajit Saha et.al. 2310.09238 link
2023-10-13 A Hybrid Transfer Learning Assisted Decision Support System for Accurate Prediction of Alzheimer Disease Mahin Khan Mahadi et.al. 2310.08888 null
2023-10-13 A Framework for Few-Shot Policy Transfer through Observation Mapping and Behavior Cloning Yash Shukla et.al. 2310.08836 link
2023-10-16 Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning Yihua Zhang et.al. 2310.08782 link
2023-10-12 Defect Analysis of 3D Printed Cylinder Object Using Transfer Learning Approaches Md Manjurul Ahsan et.al. 2310.08645 null
2023-10-15 A Survey of Heterogeneous Transfer Learning Runxue Bao et.al. 2310.08459 link
2023-10-12 Reset It and Forget It: Relearning Last-Layer Weights Improves Continual and Transfer Learning Lapo Frati et.al. 2310.07996 null
2023-10-12 Self-supervised visual learning for analyzing firearms trafficking activities on the Web Sotirios Konstantakos et.al. 2310.07975 null
2023-10-12 CleftGAN: Adapting A Style-Based Generative Adversarial Network To Create Images Depicting Cleft Lip Deformity Abdullah Hayajneh et.al. 2310.07969 link
2023-10-11 DeePref: Deep Reinforcement Learning For Video Prefetching In Content Delivery Networks Nawras Alkassab et.al. 2310.07881 null
2023-10-11 Quantitative Analysis of MoS $_2$ Thin Film Micrographs with Machine Learning Isaiah A. Moses et.al. 2310.07816 null
2023-10-11 A Transfer-Learning-Based Prognosis Prediction Paradigm that Bridges Data Distribution Shift across EMR Datasets Zhongji Zhang et.al. 2310.07799 null
2023-10-11 Automatic Control of Reactive Brain Computer Interfaces Pex Tufvesson et.al. 2310.07408 null
2023-10-12 GraphControl: Adding Conditional Control to Universal Graph Pre-trained Models for Graph Domain Transfer Learning Yun Zhu et.al. 2310.07365 null
2023-10-11 Give and Take: Federated Transfer Learning for Industrial IoT Network Intrusion Detection Lochana Telugu Rajesh et.al. 2310.07354 null
2023-10-10 Distributed Transfer Learning with 4th Gen Intel Xeon Processors Lakshmi Arunachalam et.al. 2310.06916 null
2023-10-10 EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention Yulong Shi et.al. 2310.06629 link
2023-10-10 Self-Supervised Set Representation Learning for Unsupervised Meta-Learning Dong Bok Lee et.al. 2310.06511 link
2023-10-10 Cultural Compass: Predicting Transfer Learning Success in Offensive Language Detection with Cultural Features Li Zhou et.al. 2310.06458 link
2023-10-10 Geometrically Aligned Transfer Encoder for Inductive Transfer in Regression Tasks Sung Moon Ko et.al. 2310.06369 null
2023-10-10 HoloFed: Environment-Adaptive Positioning via Multi-band Reconfigurable Holographic Surfaces and Federated Learning Jingzhi Hu et.al. 2310.06336 null
2023-10-10 Transfer learning-based physics-informed convolutional neural network for simulating flow in porous media with time-varying controls Jungang Chen et.al. 2310.06319 link
2023-10-10 Model Tuning or Prompt Tuning? A Study of Large Language Models for Clinical Concept and Relation Extraction Cheng Peng et.al. 2310.06239 null
2023-10-10 Efficient Adaptation of Large Vision Transformer via Adapter Re-Composing Wei Dong et.al. 2310.06234 link
2023-10-09 Empirical Evaluation of the Segment Anything Model (SAM) for Brain Tumor Segmentation Mohammad Peivandi et.al. 2310.06162 null
2023-10-09 Understanding Transfer Learning and Gradient-Based Meta-Learning Techniques Mike Huisman et.al. 2310.06148 link
2023-10-09 Advancing Diagnostic Precision: Leveraging Machine Learning Techniques for Accurate Detection of Covid-19, Pneumonia, and Tuberculosis in Chest X-Ray Images Aditya Kulkarni et.al. 2310.06080 null
2023-10-09 Transfer learning for piecewise-constant mean estimation: Optimality, $\ell_1$- and $\ell_0$ -penalisation Fan Wang et.al. 2310.05646 link
2023-10-09 A Simple and Robust Framework for Cross-Modality Medical Image Segmentation applied to Vision Transformers Matteo Bastico et.al. 2310.05572 link
2023-10-10 Hierarchical Side-Tuning for Vision Transformers Weifeng Lin et.al. 2310.05393 link
2023-10-09 Investigating Continuous Learning in Spiking Neural Networks C. Tanner Fredieu et.al. 2310.05343 null
2023-10-10 Enhancing Cross-Dataset Performance of Distracted Driving Detection With Score-Softmax Classifier Cong Duan et.al. 2310.05202 link
2023-10-08 Lifelong Learning for Fog Load Balancing: A Transfer Learning Approach Maad Ebrahim et.al. 2310.05187 null
2023-10-10 Pushing the Limits of Pre-training for Time Series Forecasting in the CloudOps Domain Gerald Woo et.al. 2310.05063 link
2023-10-08 Comparative Analysis of Transfer Learning in Deep Learning Text-to-Speech Models on a Few-Shot, Low-Resource, Customized Dataset Ze Liu et.al. 2310.04982 null
2023-10-07 Transferable Deep Clustering Model Zheng Zhang et.al. 2310.04946 null
2023-10-07 CAD Models to Real-World Images: A Practical Approach to Unsupervised Domain Adaptation in Industrial Object Classification Dennis Ritter et.al. 2310.04757 link
2023-10-07 EdgeFD: An Edge-Friendly Drift-Aware Fault Diagnosis System for Industrial IoT Chen Jiao et.al. 2310.04704 null
2023-10-07 Tight Rates in Supervised Outlier Transfer Learning Mohammadreza M. Kalan et.al. 2310.04686 null
2023-10-07 Neural2Speech: A Transfer Learning Framework for Neural-Driven Speech Reconstruction Jiawei Li et.al. 2310.04644 link
2023-10-07 X-Transfer: A Transfer Learning-Based Framework for Robust GAN-Generated Fake Image Detection Lei Zhang et.al. 2310.04639 null
2023-10-06 Robust Transfer Learning with Unreliable Source Data Jianqing Fan et.al. 2310.04606 null
2023-10-06 Acoustic and linguistic representations for speech continuous emotion recognition in call center conversations Manon Macary et.al. 2310.04481 null
2023-10-06 Enhancing the Authenticity of Rendered Portraits with Identity-Consistent Transfer Learning Luyuan Wang et.al. 2310.04194 null
2023-10-05 ECAvg: An Edge-Cloud Collaborative Learning Approach using Averaged Weights Atah Nuh Mih et.al. 2310.03823 null
2023-10-05 LumiNet: The Bright Side of Perceptual Knowledge Distillation Md. Ismail Hossain et.al. 2310.03669 link
2023-10-05 Network Alignment with Transferable Graph Autoencoders Jiashu He et.al. 2310.03272 link
2023-10-05 Detecting Electricity Service Equity Issues with Transfer Counterfactual Learning on Large-Scale Outage Datasets Song Wei et.al. 2310.03258 null
2023-10-04 Crossed-IoT device portability of Electromagnetic Side Channel Analysis: Challenges and Dataset Tharindu Lakshan Yasarathna et.al. 2310.03119 null
2023-10-04 Hybrid Quantum Machine Learning Assisted Classification of COVID-19 from Computed Tomography Scans Leo Sünkel et.al. 2310.02748 null
2023-10-04 Comparative Analysis of Imbalanced Malware Byteplot Image Classification using Transfer Learning Jayasudha M et.al. 2310.02742 null
2023-10-05 Hybrid Inception Architecture with Residual Connection: Fine-tuned Inception-ResNet Deep Learning Model for Lung Inflammation Diagnosis from Chest Radiographs Mehdi Neshat et.al. 2310.02591 null
2023-10-03 Reducing Intraspecies and Interspecies Covariate Shift in Traumatic Brain Injury EEG of Humans and Mice Using Transfer Euclidean Alignment Manoj Vishwanath et.al. 2310.02398 null
2023-10-03 Graph Neural Network-based EEG Classification: A Survey Dominik Klepl et.al. 2310.02152 null
2023-10-03 PAD-Phys: Exploiting Physiology for Presentation Attack Detection in Face Biometrics Luis F. Gomez et.al. 2310.02140 null
2023-10-03 An evaluation of pre-trained models for feature extraction in image classification Erick da Silva Puls et.al. 2310.02037 null
2023-10-02 Toward Scalable Visual Servoing Using Deep Reinforcement Learning and Optimal Control Salar Asayesh et.al. 2310.01360 null
2023-10-02 ScaLearn: Simple and Highly Parameter-Efficient Task Transfer by Learning to Scale Markus Frohmann et.al. 2310.01217 link
2023-10-03 A Theoretical Analysis of the Test Error of Finite-Rank Kernel Ridge Regression Tin Sum Cheng et.al. 2310.00987 null
2023-10-06 Data-Efficient Power Flow Learning for Network Contingencies Parikshit Pareek et.al. 2310.00763 null
2023-09-30 An easy zero-shot learning combination: Texture Sensitive Semantic Segmentation IceHrNet and Advanced Style Transfer Learning Strategy Zhiyong Yang et.al. 2310.00310 link
2023-09-29 Fusing simulation and monitoring data for real-time settlement prediction during tunnel construction: A multi-fidelity deep operator network (DeepONet) Chen Xu et.al. 2310.00057 null
2023-09-29 AI ensemble for signal detection of higher order gravitational wave modes of quasi-circular, spinning, non-precessing binary black hole mergers Minyang Tian et.al. 2310.00052 link
2023-10-03 Pretrain, Prompt, and Transfer: Evolving Digital Twins for Time-to-Event Analysis in Cyber-physical Systems Qinghua Xu et.al. 2310.00032 link
2023-09-29 Are Odd Radio Circles virial shocks around massive galaxies? Implications for cosmic-ray diffusion in the circumgalactic medium Shotaro Yamasaki et.al. 2309.17451 null
2023-09-29 Glioma subtype classification from histopathological images using in-domain and out-of-domain transfer learning: An experimental study Vladimir Despotovic et.al. 2309.17223 null
2023-09-29 A Survey of Incremental Transfer Learning: Combining Peer-to-Peer Federated Learning and Domain Incremental Learning for Multicenter Collaboration Yixing Huang et.al. 2309.17192 link
2023-09-29 Mixup Your Own Pairs Yilei Wu et.al. 2309.16633 link
2023-09-28 Transfer Learning for Bayesian Optimization on Heterogeneous Search Spaces Zhou Fan et.al. 2309.16597 null
2023-09-28 Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization Thilo von Neumann et.al. 2309.16482 null
2023-09-28 Nondestructive chicken egg fertility detection using CNN-transfer learning algorithms Shoffan Saifullah et.al. 2309.16257 null
2023-09-27 Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing Brian Yan et.al. 2309.15826 null
2023-09-27 Question answering using deep learning in low resource Indian language Marathi Dhiraj Amin et.al. 2309.15779 null
2023-09-27 Classification of skyrmionic textures and extraction of Hamiltonian parameters via machine learning Dushuo Feng et.al. 2309.15679 null
2023-09-27 OceanBench: The Sea Surface Height Edition J. Emmanuel Johnson et.al. 2309.15599 link
2023-09-29 Confidence-based Visual Dispersal for Few-shot Unsupervised Domain Adaptation Yizhe Xiong et.al. 2309.15575 link
2023-09-27 Robust Internal Representations for Domain Generalization Mohammad Rostami et.al. 2309.15522 null
2023-09-27 VideoAdviser: Video Knowledge Distillation for Multimodal Transfer Learning Yanan Wang et.al. 2309.15494 null
2023-09-27 Cross-Dataset Experimental Study of Radar-Camera Fusion in Bird’s-Eye View Lukas Stäcker et.al. 2309.15465 null
2023-09-27 Detecting quantum phase transitions in a frustrated spin chain via transfer learning of a quantum classifier algorithm André J. Ferreira-Martins et.al. 2309.15339 link
2023-09-26 Boosting High Resolution Image Classification with Scaling-up Transformers Yi Wang et.al. 2309.15277 link
2023-09-26 Zero-Shot Constrained Motion Planning Transformers Using Learned Sampling Dictionaries Jacob J. Johnson et.al. 2309.15272 null
2023-09-26 An Ensemble Model for Distorted Images in Real Scenarios Boyuan Ji et.al. 2309.14998 null
2023-09-26 Transferring climate change knowledge Francesco Immorlano et.al. 2309.14780 link
2023-09-26 BLIP-Adapter: Parameter-Efficient Transfer Learning for Mobile Screenshot Captioning Ching-Yu Chiang et.al. 2309.14774 link
2023-09-26 XGV-BERT: Leveraging Contextualized Language Model and Graph Neural Network for Efficient Software Vulnerability Detection Vu Le Anh Quan et.al. 2309.14677 null
2023-09-26 ALEX: Towards Effective Graph Transfer Learning with Noisy Labels Jingyang Yuan et.al. 2309.14673 null
2023-09-25 Unveiling the Potential of Deep Learning Models for Solar Flare Prediction in Near-Limb Regions Chetraj Pandey et.al. 2309.14483 null
2023-09-25 Incorporating Ensemble and Transfer Learning For An End-To-End Auto-Colorized Image Detection Model Ahmed Samir Ragab et.al. 2309.14478 null
2023-09-25 Chop & Learn: Recognizing and Generating Object-State Compositions Nirat Saini et.al. 2309.14339 null
2023-09-24 Policy Stitching: Learning Transferable Robot Policies Pingcheng Jian et.al. 2309.13753 null
2023-09-24 Crack-Net: Prediction of Crack Propagation in Composites Hao Xu et.al. 2309.13626 null
2023-09-24 GraphAdapter: Tuning Vision-Language Models With Dual Knowledge Graph Xin Li et.al. 2309.13625 link
2023-09-23 Attention Is All You Need For Blind Room Volume Estimation Chunxi Wang et.al. 2309.13504 null
2023-09-23 Randomize to Generalize: Domain Randomization for Runway FOD Detection Javaria Farooq et.al. 2309.13264 null
2023-09-22 Understanding Calibration of Deep Neural Networks for Medical Image Classification Abhishek Singh Sambyal et.al. 2309.13132 null
2023-09-22 Domain Adaptation for Arabic Machine Translation: The Case of Financial Texts Emad A. Alghamdi et.al. 2309.12863 null
2023-09-22 Unsupervised Representations Improve Supervised Learning in Speech Emotion Recognition Amirali Soltani Tehrani et.al. 2309.12714 null
2023-09-22 Multiply Robust Federated Estimation of Targeted Average Treatment Effects Larry Han et.al. 2309.12600 null
2023-09-21 Brain Tumor Detection Using Deep Learning Approaches Razia Sultana Misu et.al. 2309.12193 null
2023-09-21 Identification of pneumonia on chest x-ray images through machine learning Eduardo Augusto Roeder et.al. 2309.11995 null
2023-09-21 Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition Shuai Wang et.al. 2309.11730 link
2023-09-20 Hand Gesture Recognition with Two Stage Approach Using Transfer Learning and Deep Ensemble Learning Serkan Savaş et.al. 2309.11610 null
2023-09-20 SkeleTR: Towrads Skeleton-based Action Recognition in the Wild Haodong Duan et.al. 2309.11445 null
2023-09-20 Using Artificial Intelligence for the Automation of Knitting Patterns Uduak Uboh et.al. 2309.11202 null
2023-09-19 Amplifying Pathological Detection in EEG Signaling Pathways through Cross-Dataset Transfer Learning Mohammad-Javad Darvishi-Bayazi et.al. 2309.10910 null
2023-09-19 Semi-supervised Domain Adaptation in Graph Transfer Learning Ziyue Qiao et.al. 2309.10773 null
2023-09-19 Exploring the Influence of Information Entropy Change in Learning Systems Xiaowei Yu et.al. 2309.10625 link
2023-09-20 PDRL: Multi-Agent based Reinforcement Learning for Predictive Monitoring Thanveer Shaik et.al. 2309.10576 null
2023-09-19 A Hierarchical Neural Framework for Classification and its Explanation in Large Unstructured Legal Documents Nishchal Prasad et.al. 2309.10563 null
2023-09-19 Toward efficient resource utilization at edge nodes in federated learning Sadi Alawadi et.al. 2309.10367 null
2023-09-19 Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition Ziyang Ma et.al. 2309.10294 null
2023-09-17 A Swin-Transformer-based Model for Efficient Compression of Turbulent Flow Data Meng Zhang et.al. 2309.09192 null
2023-09-16 Universal Metric Learning with Parameter-Efficient Transfer Learning Sungyeon Kim et.al. 2309.08944 null
2023-09-16 An Unified Search and Recommendation Foundation Model for Cold-Start Scenario Yuqi Gong et.al. 2309.08939 null
2023-09-15 Global trends of the electric dipole polarizability from shell-model calculations José Nicolás Orce et.al. 2309.08810 null
2023-09-15 Improved Breast Cancer Diagnosis through Transfer Learning on Hematoxylin and Eosin Stained Histology Images Fahad Ahmed et.al. 2309.08745 null
2023-09-15 MIML: Multiplex Image Machine Learning for High Precision Cell Classification via Mechanical Traits within Microfluidic Systems Khayrul Islam et.al. 2309.08421 null
2023-09-14 Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning Zhiwu Qing et.al. 2309.07911 link
2023-09-14 Enhancing Performance, Calibration Time and Efficiency in Brain-Machine Interfaces through Transfer Learning and Wearable EEG Technology Xiaying Wang et.al. 2309.07798 null
2023-09-20 NineRec: A Benchmark Dataset Suite for Evaluating Transferable Recommendation Jiaqi Zhang et.al. 2309.07705 link
2023-09-14 Goal Space Abstraction in Hierarchical Reinforcement Learning via Set-Based Reachability Analysis Mehdi Zadem et.al. 2309.07675 null
2023-09-14 Efficiently Robustify Pre-trained Models Nishant Jain et.al. 2309.07499 null
2023-09-14 Nucleus-aware Self-supervised Pretraining Using Unpaired Image-to-image Translation for Histopathology Images Zhiyun Song et.al. 2309.07394 link
2023-09-13 Learning from Auxiliary Sources in Argumentative Revision Classification Tazin Afrin et.al. 2309.07334 null
2023-09-18 Safe and Accelerated Deep Reinforcement Learning-based O-RAN Slicing: A Hybrid Transfer Learning Approach Ahmad M. Nagib et.al. 2309.07265 link
2023-09-12 Goal Space Abstraction in Hierarchical Reinforcement Learning via Reachability Analysis Mehdi Zadem et.al. 2309.07168 null
2023-09-13 TransNet: A Transfer Learning-Based Network for Human Action Recognition K. Alomar et.al. 2309.06951 null
2023-09-12 Distributionally Robust Transfer Learning Xin Xiong et.al. 2309.06534 null
2023-09-12 Exploring the Benefits of Differentially Private Pre-training and Parameter-Efficient Fine-tuning for Table Transformers Xilong Wang et.al. 2309.06526 link
2023-09-08 Adversarial attacks on hybrid classical-quantum Deep Learning models for Histopathological Cancer Detection Biswaraj Baral et.al. 2309.06377 null
2023-09-12 Transfer learning from Hermitian to non-Hermitian quantum many-body physics Sharareh Sayyad et.al. 2309.06303 null
2023-09-12 Transferability analysis of data-driven additive manufacturing knowledge: a case study between powder bed fusion and directed energy deposition Mutahar Safdar et.al. 2309.06286 null
2023-09-12 A 3M-Hybrid Model for the Restoration of Unique Giant Murals: A Case Study on the Murals of Yongle Palace Jing Yang et.al. 2309.06194 null
2023-09-12 Dynamic Visual Prompt Tuning for Parameter Efficient Transfer Learning Chunqing Ruan et.al. 2309.06123 null
2023-09-12 Systemization of Knowledge (SoK)- Cross Impact of Transfer Learning in Cybersecurity: Offensive, Defensive and Threat Intelligence Perspectives Sofiya Makar et.al. 2309.05889 null
2023-09-11 SCD-Net: Spatiotemporal Clues Disentanglement Network for Self-supervised Skeleton-based Action Recognition Cong Wu et.al. 2309.05834 null
2023-09-11 MultIOD: Rehearsal-free Multihead Incremental Object Detector Eden Belouadah et.al. 2309.05334 null
2023-09-11 Analysing Cross-Lingual Transfer in Low-Resourced African Named Entity Recognition Michael Beukman et.al. 2309.05311 link
2023-09-11 Generalized Graphon Process: Convergence of Graph Frequencies in Stretched Cut Distance Xingchao Jian et.al. 2309.05260 null
2023-09-11 DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning Zhengxiang Shi et.al. 2309.05173 link
2023-09-10 Seismic Data Strong Noise Attenuation Based on Diffusion Model and Principal Component Analysis Junheng Peng et.al. 2309.04944 link
2023-09-09 Towards Real-time Training of Physics-informed Neural Networks: Applications in Ultrafast Ultrasound Blood Flow Imaging Haotian Guan et.al. 2309.04755 null
2023-09-09 Video and Synthetic MRI Pre-training of 3D Vision Architectures for Neuroimage Analysis Nikhil J. Dhinagar et.al. 2309.04651 null
2023-09-08 Regret-Optimal Federated Transfer Learning for Kernel Regression with Applications in American Option Pricing Xuwei Yang et.al. 2309.04557 link
2023-09-08 Generalized Cross-domain Multi-label Few-shot Learning for Chest X-rays Aroof Aimen et.al. 2309.04462 null
2023-09-07 S-Adapter: Generalizing Vision Transformer for Face Anti-Spoofing with Statistical Tokens Rizhao Cai et.al. 2309.04038 null
2023-09-06 Active shooter detection and robust tracking utilizing supplemental synthetic data Joshua R. Waite et.al. 2309.03381 null
2023-09-06 EvoCLINICAL: Evolving Cyber-Cyber Digital Twin with Active Transfer Learning for Automated Cancer Registry System Chengjie Lu et.al. 2309.03246 link
2023-09-06 Adaptive Growth: Real-time CNN Layer Expansion Yunjie Zhu et.al. 2309.03049 link
2023-09-06 Leveraging ASR Pretrained Conformers for Speaker Verification through Transfer Learning and Knowledge Distillation Danwei Cai et.al. 2309.03019 null
2023-09-06 Roulette: A Semantic Privacy-Preserving Device-Edge Collaborative Inference Framework for Deep Learning Classification Tasks Jingyi Li et.al. 2309.02820 null
2023-09-05 A Survey of the Impact of Self-Supervised Pretraining for Diagnostic Tasks with Radiological Images Blake VanBerlo et.al. 2309.02555 null
2023-09-04 Active flow control for three-dimensional cylinders through deep reinforcement learning Pol Suárez et.al. 2309.02462 null
2023-09-05 Building a Winning Team: Selecting Source Model Ensembles using a Submodular Transferability Estimation Approach Vimal K B et.al. 2309.02429 null
2023-09-05 Graph Self-Contrast Representation Learning Minjie Chen et.al. 2309.02304 null
2023-09-05 DeepVol: A Deep Transfer Learning Approach for Universal Asset Volatility Modeling Chen Liu et.al. 2309.02072 link
2023-09-05 Probabilistic Self-supervised Learning via Scoring Rules Minimization Amirhossein Vahidi et.al. 2309.02048 null
2023-09-06 Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models Qiong Wu et.al. 2309.01479 link
2023-09-04 Deep Learning Approach for Large-Scale, Real-Time Quantification of Green Fluorescent Protein-Labeled Biological Samples in Microreactors Yuanyuan Wei et.al. 2309.01384 null
2023-09-02 Big-model Driven Few-shot Continual Learning Ziqi Gu et.al. 2309.00862 null
2023-09-01 Zero-Shot Video Moment Retrieval from Frozen Vision-Language Models Dezhao Luo et.al. 2309.00661 null
2023-08-31 QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning Haohan Guo et.al. 2309.00126 null
2023-08-31 CReHate: Cross-cultural Re-annotation of English Hate Speech Dataset Nayeon Lee et.al. 2308.16705 link
2023-08-31 Towards Optimal Patch Size in Vision Transformers for Tumor Segmentation Ramtin Mojtahedi et.al. 2308.16598 link
2023-08-29 Multi-Transfer Learning Techniques for Detecting Auditory Brainstem Response Fatih Ozyurt et.al. 2308.16203 null
2023-08-30 Hybrid Quantum Neural Network Structures for Image Multi-classification Mingrui Shi et.al. 2308.16005 null
2023-08-30 Towards Earlier Detection of Oral Diseases On Smartphones Using Oral and Dental RGB Images Ayush Garg et.al. 2308.15705 link
2023-08-29 Target PCA: Transfer Learning Large Dimensional Panel Data Junting Duan et.al. 2308.15627 null
2023-08-29 On the Steganographic Capacity of Selected Learning Models Rishit Agrawal et.al. 2308.15502 null
2023-08-29 A General-Purpose Self-Supervised Model for Computational Pathology Richard J. Chen et.al. 2308.15474 null
2023-08-29 Exploring Model Transferability through the Lens of Potential Energy Xiaotong Li et.al. 2308.15074 link
2023-08-28 Robust Activity Recognition for Adaptive Worker-Robot Interaction using Transfer Learning Farid Shahnavaz et.al. 2308.14843 null
2023-08-31 LAC: Latent Action Composition for Skeleton-based Action Segmentation Di Yang et.al. 2308.14500 null
2023-08-28 UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory Haiwen Diao et.al. 2308.14316 link
2023-08-28 Parameter-Efficient Transfer Learning for Audio-Visual-Language Tasks Hongye Liu et.al. 2308.14274 null
2023-08-27 Exploring the Transfer Learning Capabilities of CLIP in Domain Generalization for Diabetic Retinopathy Sanoojan Baliah et.al. 2308.14212 link
2023-08-27 Revolutionizing Disease Diagnosis: A Microservices-Based Architecture for Privacy-Preserving and Efficient IoT Data Analytics Using Federated Learning Safa Ben Atitallah et.al. 2308.14017 null
2023-08-26 Transfer Learning for Microstructure Segmentation with CS-UNet: A Hybrid Algorithm with Transformer and CNN Encoders Khaled Alrfou et.al. 2308.13917 link
2023-08-25 An Ensemble Approach to Personalized Real Time Predictive Writing for Experts Sourav Prosad et.al. 2308.13576 null
2023-08-25 Ultrafast-and-Ultralight ConvNet-Based Intelligent Monitoring System for Diagnosing Early-Stage Mpox Anytime and Anywhere Yubiao Yue et.al. 2308.13492 null
2023-08-25 Mesh-Wise Prediction of Demographic Composition from Satellite Images Using Multi-Head Convolutional Neural Network Yuta Sato et.al. 2308.13441 null
2023-08-25 Enhanced Mortality Prediction In Patients With Subarachnoid Haemorrhage Using A Deep Learning Model Based On The Initial CT Scan Sergio Garcia-Garcia et.al. 2308.13373 null
2023-08-25 CEIMVEN: An Approach of Cutting Edge Implementation of Modified Versions of EfficientNet (V1-V2) Architecture for Breast Cancer Detection and Classification from Ultrasound Images Sheekar Banerjee et.al. 2308.13356 link
2023-08-24 Electronic Structure Prediction of Multi-million Atom Systems Through Uncertainty Quantification Enabled Transfer Learning Shashank Pathrudkar et.al. 2308.13096 null
2023-08-24 Motion-Guided Masking for Spatiotemporal Representation Learning David Fan et.al. 2308.12962 null
2023-08-25 Pre-trained Model-based Automated Software Vulnerability Repair: How Far are We? Quanjun Zhang et.al. 2308.12533 link
2023-08-24 Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval Yuan Yuan et.al. 2308.12509 link
2023-08-23 Layer-wise Feedback Propagation Leander Weber et.al. 2308.12053 link
2023-08-23 Efficient Transfer Learning in Diffusion Models via Adversarial Noise Xiyu Wang et.al. 2308.11948 null
2023-08-25 Exploring the Optimization Objective of One-Class Classification for Anomaly Detection Han Gao et.al. 2308.11898 null
2023-08-23 ${\rm E}(3)$ -Equivariant Actor-Critic Methods for Cooperative Multi-Agent Reinforcement Learning Dingyang Chen et.al. 2308.11842 link
2023-08-22 Addressing Dynamic and Sparse Qualitative Data: A Hilbert Space Embedding of Categorical Variables Anirban Mukherjee et.al. 2308.11781 null
2023-08-22 Masked Momentum Contrastive Learning for Zero-shot Semantic Understanding Jiantao Wu et.al. 2308.11448 null
2023-08-22 Knowledge-Aware Prompt Tuning for Generalizable Vision-Language Models Baoshuo Kan et.al. 2308.11186 null
2023-08-22 MISSRec: Pre-training and Transferring Multi-modal Interest-aware Sequence Representation for Recommendation Jinpeng Wang et.al. 2308.11175 link
2023-08-21 Ultrafast and Ultralight Network-Based Intelligent System for Real-time Diagnosis of Ear diseases in Any Devices Yubiao Yue et.al. 2308.10610 null
2023-08-20 VLN-PETL: Parameter-Efficient Transfer Learning for Vision-and-Language Navigation Yanyuan Qiao et.al. 2308.10172 link
2023-08-20 ExpeL: LLM Agents Are Experiential Learners Andrew Zhao et.al. 2308.10144 link
2023-08-19 Disposable Transfer Learning for Selective Source Task Unlearning Seunghee Koh et.al. 2308.09971 null
2023-08-19 Bamboo: Boosting Training Efficiency for Real-Time Video Streaming via Online Grouped Federated Transfer Learning Qianyuan Zheng et.al. 2308.09948 null
2023-08-19 Dual Branch Deep Learning Network for Detection and Stage Grading of Diabetic Retinopathy Hossein Shakibania et.al. 2308.09945 null
2023-08-19 Evaluating Transfer Learning for Simplifying GitHub READMEs Haoyu Gao et.al. 2308.09940 null
2023-08-19 Towards a High-Performance Object Detector: Insights from Drone Detection Using ViT and CNN-based Deep Learning Models Junyang Zhang et.al. 2308.09899 null
2023-08-18 Deformable-Detection Transformer for Microbubble Localization in Ultrasound Localization Microscopy Sepideh K. Gharamaleki et.al. 2308.09845 null
2023-08-18 Time Series Predictions in Unmonitored Sites: A Survey of Machine Learning Techniques in Water Resources Jared D. Willard et.al. 2308.09766 null
2023-08-18 SimDA: Simple Diffusion Adapter for Efficient Video Generation Zhen Xing et.al. 2308.09710 null
2023-08-18 On the Effectiveness of LayerNorm Tuning for Continual Learning in Vision Transformers Thomas De Min et.al. 2308.09610 link
2023-08-18 Bridged-GNN: Knowledge Bridge Learning for Effective Knowledge Transfer Wendong Bi et.al. 2308.09499 null
2023-08-18 Improving Buoy Detection with Deep Transfer Learning for Mussel Farm Automation Carl McMillan et.al. 2308.09238 null
2023-08-18 A review of technical factors to consider when designing neural networks for semantic segmentation of Earth Observation imagery Sam Khallaghi et.al. 2308.09221 null
2023-08-17 Multi-fidelity Fourier Neural Operator for Fast Modeling of Large-Scale Geological Carbon Storage Hewei Tang1 et.al. 2308.09113 link
2023-08-16 PEvoLM: Protein Sequence Evolutionary Information Language Model Issar Arab et.al. 2308.08578 link
2023-08-16 Sarcasm Detection in a Disaster Context Tiberiu Sosea et.al. 2308.08156 null
2023-08-16 S2R: Exploring a Double-Win Transformer-Based Framework for Ideal and Blind Super-Resolution Minghao She et.al. 2308.08142 link
2023-08-15 Synthesizing Political Zero-Shot Relation Classification via Codebook Knowledge, NLI, and ChatGPT Yibo Hu et.al. 2308.07876 link
2023-08-15 Exploring Transfer Learning in Medical Image Segmentation using Vision-Language Models Kanchan Poudel et.al. 2308.07706 link
2023-08-14 The Performance of Transferability Metrics does not Translate to Medical Tasks Levy Chaves et.al. 2308.07444 link
2023-08-16 Interaction-Aware Personalized Vehicle Trajectory Prediction Using Temporal Graph Neural Networks Amr Abdelraouf et.al. 2308.07439 null
2023-08-15 SEMI-CenterNet: A Machine Learning Facilitated Approach for Semiconductor Defect Inspection Vic De Ridder et.al. 2308.07180 null
2023-08-13 Optimizing Brain Tumor Classification: A Comprehensive Study on Transfer Learning and Imbalance Handling in Deep Learning Models Raza Imam et.al. 2308.06821 link
2023-08-12 SLoRA: Federated Parameter Efficient Fine-Tuning of Language Models Sara Babakniya et.al. 2308.06522 null
2023-08-12 A Sequential Meta-Transfer (SMT) Learning to Combat Complexities of Physics-Informed Neural Networks: Application to Composites Autoclave Processing Milad Ramezankhani et.al. 2308.06447 link
2023-08-11 Classification of Blood Cells Using Deep Learning Models Rabia Asghar et.al. 2308.06300 null
2023-08-11 Hybrid-Supervised Deep Learning for Domain Transfer 3D Protoacoustic Image Reconstruction Yankun Lang et.al. 2308.06194 null
2023-08-11 Fast and Accurate Transferability Measurement by Evaluating Intra-class Feature Variance Huiwen Xu et.al. 2308.05986 null
2023-08-11 Tweet Sentiment Extraction using Viterbi Algorithm with Transfer Learning Zied Baklouti et.al. 2308.05973 link
2023-08-09 Deep Learning Model Transfer in Forest Mapping using Multi-source Satellite SAR and Optical Images Shaojia Ge et.al. 2308.05005 null
2023-08-08 Sparse Array Design for Direction Finding using Deep Learning Kumar Vijay Mishra et.al. 2308.04615 null
2023-08-11 Deep Learning for Diverse Data Types Steganalysis: A Review Hamza Kheddar et.al. 2308.04522 null
2023-08-08 Vascular Ageing and Smoking Habit Prediction via a Low-Cost Single-Lead ECG Module S. Anas Ali et.al. 2308.04355 null
2023-08-07 PMU measurements based short-term voltage stability assessment of power systems via deep transfer learning Yang Li et.al. 2308.03953 null
2023-08-07 Segmentation Framework for Heat Loss Identification in Thermal Images: Empowering Scottish Retrofitting and Thermographic Survey Companies Md Junayed Hasan et.al. 2308.03631 null
2023-08-07 Provably Efficient Learning in Partially Observable Contextual Bandit Xueping Gong et.al. 2308.03572 null
2023-08-07 A Transfer Learning Framework for Proactive Ramp Metering Performance Assessment Xiaobo Ma et.al. 2308.03542 null
2023-08-07 On-ramp and Off-ramp Traffic Flows Estimation Based on A Data-driven Transfer Learning Framework Xiaobo Ma et.al. 2308.03538 null
2023-08-07 RoadScan: A Novel and Robust Transfer Learning Framework for Autonomous Pothole Detection in Roads Guruprasad Parasnis et.al. 2308.03467 null
2023-08-05 Surrogate Empowered Sim2Real Transfer of Deep Reinforcement Learning for ORC Superheat Control Runze Lin et.al. 2308.02765 null
2023-08-04 Self-Normalizing Neural Network, Enabling One Shot Transfer Learning for Modeling EDFA Wavelength Dependent Gain Agastya Raj et.al. 2308.02233 null
2023-08-07 Deep Maxout Network-based Feature Fusion and Political Tangent Search Optimizer enabled Transfer Learning for Thalassemia Detection Hemn Barzan Abdalla et.al. 2308.02029 null
2023-08-03 Curricular Transfer Learning for Sentence Encoded Tasks Jader Martins Camboim de Sá et.al. 2308.01849 null
2023-08-03 Deep Learning-based Prediction of Stress and Strain Maps in Arterial Walls for Improved Cardiovascular Risk Assessment Yasin Shokrollahi1 et.al. 2308.01771 null
2023-08-03 IndoHerb: Indonesia Medicinal Plants Recognition using Transfer Learning and Deep Learning Muhammad Salman Ikrar Musyaffa et.al. 2308.01604 link
2023-08-02 Grasp Stability Assessment Through Attention-Guided Cross-Modality Fusion and Transfer Learning Zhuangzhuang Zhang et.al. 2308.00980 null
2023-08-01 Understanding Activation Patterns in Artificial Neural Networks by Exploring Stochastic Processes Stephan Johann Lehmler et.al. 2308.00858 null
2023-07-31 Cardiac MRI Orientation Recognition and Standardization using Deep Neural Networks Ruoxuan Zhen et.al. 2308.00615 link
2023-08-01 Scalable quantum measurement error mitigation via conditional independence and transfer learning ChangWon Lee et.al. 2308.00320 null
2023-08-01 Pixel to policy: DQN Encoders for within & cross-game reinforcement learning Ashrya Agrawal et.al. 2308.00318 null
2023-08-01 EEG-based Cognitive Load Classification using Feature Masked Autoencoding and Emotion Transfer Learning Dustin Pulver et.al. 2308.00246 null
2023-07-31 Structural Transfer Learning in NL-to-Bash Semantic Parsers Kyle Duffy et.al. 2307.16795 null
2023-07-31 Hybrid quantum transfer learning for crack image classification on NISQ hardware Alexander Geng et.al. 2307.16723 null
2023-07-31 UDAMA: Unsupervised Domain Adaptation through Multi-discriminator Adversarial Training with Noisy Labels Improves Cardio-fitness Prediction Yu Wu et.al. 2307.16651 link
2023-07-31 LP-MusicCaps: LLM-Based Pseudo Music Captioning SeungHeon Doh et.al. 2307.16372 link
2023-07-30 Stylized Projected GAN: A Novel Architecture for Fast and Realistic Image Generation Md Nurul Muttakin et.al. 2307.16275 null
2023-07-30 Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction Pengfei Hu et.al. 2307.16253 null
2023-07-30 Gastrointestinal Mucosal Problems Classification with Deep Learning Mohammadhasan Goharian et.al. 2307.16198 null
2023-07-29 Cross-dimensional transfer learning in medical image segmentation with deep learning Hicham Messaoudi et.al. 2307.15872 link
2023-07-28 A deep transfer learning network for structural condition identification with limited real-world training data Nengxin Bao et.al. 2307.15249 null
2023-07-27 Star Cluster Classification using Deep Transfer Learning with PHANGS-HST Stephen Hannon et.al. 2307.15133 null
2023-07-26 Towards Generalist Biomedical AI Tao Tu et.al. 2307.14334 null
2023-07-26 Reinforcement Learning by Guided Safe Exploration Qisong Yang et.al. 2307.14316 null
2023-07-26 Fluorescent Neuronal Cells v2: Multi-Task, Multi-Format Annotations for Deep Learning in Microscopy Luca Clissa et.al. 2307.14243 null
2023-07-25 ChildGAN: Large Scale Synthetic Child Facial Data Using Domain Adaptation in StyleGAN Muhammad Ali Farooq et.al. 2307.13746 null
2023-07-25 Transfer Learning for Portfolio Optimization Haoyang Cao et.al. 2307.13546 null
2023-07-25 Spectral-DP: Differentially Private Deep Learning through Spectral Perturbation and Filtering Ce Feng et.al. 2307.13231 null
2023-07-24 End-to-End Deep Transfer Learning for Calibration-free Motor Imagery Brain Computer Interfaces Maryam Alimardani et.al. 2307.12827 null
2023-07-24 Sparse annotation strategies for segmentation of short axis cardiac MRI Josh Stein et.al. 2307.12619 null
2023-07-23 NCART: Neural Classification and Regression Tree for Tabular Data Jiaqi Luo et.al. 2307.12198 null
2023-07-22 An X3D Neural Network Analysis for Runner’s Performance Assessment in a Wild Sporting Environment David Freire-Obregón et.al. 2307.12183 null
2023-07-22 Identifying Misinformation on YouTube through Transcript Contextual Analysis with Transformer Models Christos Christodoulou et.al. 2307.12155 link
2023-07-22 Flight Contrail Segmentation via Augmented Transfer Learning with Novel SR Loss Function in Hough Space Junzi Sun et.al. 2307.12032 link
2023-07-22 Pick the Best Pre-trained Model: Towards Transferability Estimation for Medical Image Segmentation Yuncheng Yang et.al. 2307.11958 link
2023-07-21 MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems Thilo von Neumann et.al. 2307.11394 link
2023-07-20 Transfer Learning and Bias Correction with Pre-trained Audio Embeddings Changhong Wang et.al. 2307.10834 link
2023-07-20 Predicting human motion intention for pHRI assistive control Paolo Franceschi et.al. 2307.10743 null
2023-07-20 Transfer Learning for Inverse Design of Tunable Graphene-Based Metasurfaces Mehdi Kiani et.al. 2307.10641 null
2023-07-20 Pluvio: Assembly Clone Search for Out-of-domain Architectures and Libraries through Transfer Learning and Conditional Variational Information Bottleneck Zhiwei Fu et.al. 2307.10631 null
2023-07-19 Eye Disease Classification Using Deep Learning Techniques Tareq Babaqi et.al. 2307.10501 null
2023-07-19 Novel Batch Active Learning Approach and Its Application to Synthetic Aperture Radar Datasets James Chapman et.al. 2307.10495 link
2023-07-19 Determination of the critical points for systems of directed percolation class using machine learning M. Ali Saif et.al. 2307.10456 null
2023-07-19 Gradient Sparsification For Masked Fine-Tuning of Transformers James O’ Neill et.al. 2307.10098 null
2023-07-19 Revisiting invariances and introducing priors in Gromov-Wasserstein distances Pinar Demetci et.al. 2307.10093 link
2023-07-19 From West to East: Who can understand the music of the others better? Charilaos Papaioannou et.al. 2307.09795 link
2023-07-17 Study of Vision Transformers for Covid-19 Detection from Chest X-rays Sandeep Angara et.al. 2307.09402 null
2023-07-18 Augmenting CLIP with Improved Visio-Linguistic Reasoning Samyadeep Basu et.al. 2307.09233 null
2023-07-18 Detecting Throat Cancer from Speech Signals Using Machine Learning: A Reproducible Literature Review Mary Paterson et.al. 2307.09230 null
2023-07-18 A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future Chaoyang Zhu et.al. 2307.09220 link
2023-07-18 Evaluate Fine-tuning Strategies for Fetal Head Ultrasound Image Segmentation with U-Net Fangyijie Wang et.al. 2307.09067 link
2023-07-18 Face-PAST: Facial Pose Awareness and Style Transfer Networks Sunder Ali Khowaja et.al. 2307.09020 null
2023-07-18 Alioth: A Machine Learning Based Interference-Aware Performance Monitor for Multi-Tenancy Applications in Public Cloud Tianyao Shi et.al. 2307.08949 link
2023-07-17 Diffusion Models Beat GANs on Image Classification Soumik Mukhopadhyay et.al. 2307.08702 null
2023-07-18 Revisiting the Robustness of the Minimum Error Entropy Criterion: A Transfer Learning Case Study Luis Pedro Silvestrin et.al. 2307.08572 link
2023-07-17 Domain Adaptation using Silver Standard Masks for Lateral Ventricle Segmentation in FLAIR MRI Owen Crystal et.al. 2307.08456 null
2023-07-17 Soft Prompt Tuning for Augmenting Dense Retrieval with Large Language Models Zhiyuan Peng et.al. 2307.08303 link
2023-07-16 SHAMSUL: Simultaneous Heatmap-Analysis to investigate Medical Significance Utilizing Local interpretability methods Mahbub Ul Alam et.al. 2307.08003 link
2023-07-18 S2R-ViT for Multi-Agent Cooperative Perception: Bridging the Gap from Simulation to Reality Jinlong Li et.al. 2307.07935 null
2023-07-15 SoccerKDNet: A Knowledge Distillation Framework for Action Recognition in Soccer Videos Sarosij Bose et.al. 2307.07768 null
2023-07-14 MGit: A Model Versioning and Management System Wei Hao et.al. 2307.07507 null
2023-07-14 Improving Zero-Shot Generalization for CLIP with Synthesized Prompts Zhengbo Wang et.al. 2307.07397 link
2023-07-14 Replay to Remember: Continual Layer-Specific Fine-tuning for German Speech Recognition Theresa Pekarek Rosin et.al. 2307.07280 null
2023-07-14 Improving BERT with Hybrid Pooling Network and Drop Mask Qian Chen et.al. 2307.07258 null
2023-07-13 A Scenario-Based Functional Testing Approach to Improving DNN Performance Hong Zhu et.al. 2307.07083 null
2023-07-13 AnyStar: Domain randomized universal star-convex 3D instance segmentation Neel Dey et.al. 2307.07044 link
2023-07-13 A decision framework for selecting information-transfer strategies in population-based SHM Aidan J. Hughes et.al. 2307.06978 null
2023-07-13 Agreement Tracking for Multi-Issue Negotiation Dialogues Amogh Mannekote et.al. 2307.06524 null
2023-07-12 Feature Embeddings from Large-Scale Acoustic Bird Classifiers Enable Few-Shot Transfer Learning Burooj Ghani et.al. 2307.06292 link
2023-07-12 Prototypical Contrastive Transfer Learning for Multimodal Language Understanding Seitaro Otsuki et.al. 2307.05942 null
2023-07-06 LogitMat : Zeroshot Learning Algorithm for Recommender Systems without Transfer Learning or Pretrained Models Hao Wang et.al. 2307.05680 null
2023-07-11 A Comprehensive Survey of Deep Transfer Learning for Anomaly Detection in Industrial Time Series: Methods, Applications, and Directions Peng Yan et.al. 2307.05638 null
2023-07-11 Channel Selection for Wi-Fi 7 Multi-Link Operation via Optimistic-Weighted VDN and Parallel Transfer Reinforcement Learning Pedro Enrique Iturria-Rivera et.al. 2307.05419 null
2023-07-11 Multi-fidelity Emulator for Cosmological Large Scale 21 cm Lightcone Images: a Few-shot Transfer Learning Approach with GAN Kangning Diao et.al. 2307.04976 link
2023-07-10 SimpleMTOD: A Simple Language Model for Multimodal Task-Oriented Dialogue with Symbolic Scene Representation Bhathiya Hemanthage et.al. 2307.04907 null
2023-07-10 Advances and Challenges in Meta-Learning: A Technical Review Anna Vettoruzzo et.al. 2307.04722 null
2023-07-11 Generalization Error of First-Order Methods for Statistical Learning with Generic Oracles Kevin Scaman et.al. 2307.04679 null
2023-07-10 Enhancing Biomedical Text Summarization and Question-Answering: On the Utility of Domain-Specific Pre-Training Dima Galat et.al. 2307.04412 null
2023-07-08 Building and Road Segmentation Using EffUNet and Transfer Learning Approach Sahil Gangurde et.al. 2307.03980 null
2023-07-07 Tranfer Learning of Semantic Segmentation Methods for Identifying Buried Archaeological Structures on LiDAR Data Paolo Soleni et.al. 2307.03512 null
2023-07-06 Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment Aref Farhadipour et.al. 2307.03296 link
2023-07-06 To pretrain or not to pretrain? A case study of domain-specific pretraining for semantic segmentation in histopathology Tushar Kataria et.al. 2307.03275 link
2023-07-06 Vision Language Transformers: A Survey Clayton Fields et.al. 2307.03254 null
2023-07-06 A Hybrid End-to-End Spatio-Temporal Attention Neural Network with Graph-Smooth Signals for EEG Emotion Recognition Shadi Sartipi et.al. 2307.03068 null
2023-07-13 Self-supervised learning via inter-modal reconstruction and feature projection networks for label-efficient 3D-to-2D segmentation José Morano et.al. 2307.03008 link
2023-07-06 Molecular Simulation for Atmospheric Reaction Exploration and Discovery: Non-Equilibrium Dynamics, Roaming and Glycolaldehyde Formation Following Photo-Induced Decomposition of syn-Acetaldehyde Oxide Meenu Upadhyay et.al. 2307.02994 null
2023-07-06 Transfer Learning for the Efficient Detection of COVID-19 from Smartphone Audio Data Mattia Giovanni Campana et.al. 2307.02975 link
2023-07-08 PUFFIN: A Path-Unifying Feed-Forward Interfaced Network for Vapor Pressure Prediction Vinicius Viena Santana et.al. 2307.02903 null
2023-07-04 Pretraining Conformer with ASR or ASV for Anti-Spoofing Countermeasure Yikang Wang et.al. 2307.01546 null
2023-07-04 On Conditional and Compositional Language Model Differentiable Prompting Jonathan Pilault et.al. 2307.01446 null
2023-07-03 Exploring Spoken Named Entity Recognition: A Cross-Lingual Perspective Moncef Benaicha et.al. 2307.01310 link
2023-07-03 SAM-DA: UAV Tracks Anything at Night with SAM-Powered Domain Adaptation Liangliang Yao et.al. 2307.01024 link
2023-07-03 Autism Spectrum Disorder Classification in Children based on Structural MRI Features Extracted using Contrastive Variational Autoencoder Ruimin Ma et.al. 2307.00976 null
2023-07-03 Analysis of Task Transferability in Large Pre-trained Classifiers Akshay Mehra et.al. 2307.00823 link
2023-07-02 Variational Autoencoding Molecular Graphs with Denoising Diffusion Probabilistic Model Daiki Koge et.al. 2307.00623 null
2023-07-01 Unified Transfer Learning Models for High-Dimensional Linear Regression Shuo Shuo Liu et.al. 2307.00238 null
2023-06-30 BuildingsBench: A Large-Scale Dataset of 900K Buildings and Benchmark for Short-Term Load Forecasting Patrick Emami et.al. 2307.00142 link
2023-06-30 Scalable method for Bayesian experimental design without integrating over posterior distribution Vinh Hoang et.al. 2306.17615 link
2023-06-30 Towards the extraction of robust sign embeddings for low resource sign language recognition Mathieu De Coster et.al. 2306.17558 null
2023-06-30 Why does my medical AI look at pictures of birds? Exploring the efficacy of transfer learning across domain boundaries Frederic Jonske et.al. 2306.17555 link
2023-06-30 Audio Embeddings as Teachers for Music Classification Yiwei Ding et.al. 2306.17424 link
2023-06-29 Prediction of COVID-19 Patients’ Emergency Room Revisit using Multi-Source Transfer Learning Yuelyu Ji et.al. 2306.17257 null
2023-06-29 Noise-Aware Quantum Software Testing Asmar Muqeet et.al. 2306.16992 link
2023-06-29 Obeying the Order: Introducing Ordered Transfer Hyperparameter Optimisation Sigrid Passano Hellan et.al. 2306.16916 link
2023-06-29 Sampling weights of deep neural networks Erik Lien Bolager et.al. 2306.16830 link
2023-06-29 Transfer Learning with Semi-Supervised Dataset Annotation for Birdcall Classification Anthony Miyaguchi et.al. 2306.16760 link
2023-06-29 Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train Zhao Wang et.al. 2306.16741 link
2023-06-29 Multi-Scenario Ranking with Adaptive Feature Learning Yu Tian et.al. 2306.16732 null
2023-06-26 A Collaborative Transfer Learning Framework for Cross-domain Recommendation Wei Zhang et.al. 2306.16425 null
2023-06-28 Theater Aid System for the Visually Impaired Through Transfer Learning of Spatio-Temporal Graph Convolution Networks Leyla Benhamida et.al. 2306.16357 null
2023-06-28 Relevant Entity Selection: Knowledge Graph Bootstrapping via Zero-Shot Analogical Pruning Lucas Jarnac et.al. 2306.16296 link
2023-06-28 Recent Advances in Optimal Transport for Machine Learning Eduardo Fernandes Montesuma et.al. 2306.16156 null
2023-06-28 A serial dual-channel library occupancy detection system based on Faster RCNN Guoqiang Yang et.al. 2306.16080 null
2023-06-30 DUET: 2D Structured and Approximately Equivariant Representations Xavier Suau et.al. 2306.16058 link
2023-06-28 Transfer Learning with Random Coefficient Ridge Regression Hongzhe Zhang et.al. 2306.15915 null
2023-06-27 Differentially Private Video Activity Recognition Zelun Luo et.al. 2306.15742 null
2023-06-27 Semi-supervised Multimodal Representation Learning through a Global Workspace Benjamin Devillers et.al. 2306.15711 link
2023-06-27 Approximated Prompt Tuning for Vision-Language Pre-trained Models Qiong Wu et.al. 2306.15706 null
2023-06-27 CamemBERT-bio: a Tasty French Language Model Better for your Health Rian Touchent et.al. 2306.15550 null
2023-06-27 Transferability Metrics for Object Detection Louis Fouquet et.al. 2306.15306 link
2023-06-26 Deep Transfer Learning for Intelligent Vehicle Perception: a Survey Xinyu Liu et.al. 2306.15110 null
2023-06-26 Transfer Learning across Several Centuries: Machine and Historian Integrated Method to Decipher Royal Secretary’s Diary Sojung Lucia Kim et.al. 2306.14592 null
2023-06-25 GPT-assisted learning of structure-property relationships by graph neural networks: Application to rare-earth doped phosphors Xiang Zhang et.al. 2306.14238 link
2023-06-25 A Web-based Mpox Skin Lesion Detection System Using State-of-the-art Deep Learning Models Considering Racial Diversity Shams Nafisa Ali et.al. 2306.14169 link
2023-06-25 Semi-supervised Object Detection: A Survey on Recent Research and Progress Yanyang Wang et.al. 2306.14106 null
2023-06-24 Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks Maxime Chevalier-Boisvert et.al. 2306.13831 link
2023-06-23 Curvature-enhanced Graph Convolutional Network for Biomolecular Interaction Prediction Cong Shen et.al. 2306.13699 link
2023-06-23 Variance-Covariance Regularization Improves Representation Learning Jiachen Zhu et.al. 2306.13292 null
2023-06-20 EEG Decoding for Datasets with Heterogenous Electrode Configurations using Transfer Learning Graph Neural Networks Jinpei Han et.al. 2306.13109 null
2023-06-22 Natural Language Processing in Electronic Health Records in Relation to Healthcare Decision-making: A Systematic Review Elias Hossain et.al. 2306.12834 null
2023-06-22 TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter Binjie Zhang et.al. 2306.12642 null
2023-06-21 Introspective Action Advising for Interpretable Transfer Learning Joseph Campbell et.al. 2306.12314 null
2023-06-21 Wildfire Detection Via Transfer Learning: A Survey Ziliang Hong et.al. 2306.12276 null
2023-06-21 Benchmark data to study the influence of pre-training on explanation performance in MR image classification Marta Oliveira et.al. 2306.12150 link
2023-06-21 Strategies in Transfer Learning for Low-Resource Speech Synthesis: Phone Mapping, Features Input, and Source Language Selection Phat Do et.al. 2306.12040 null
2023-06-20 DynaQuant: Compressing Deep Learning Training Checkpoints via Dynamic Quantization Amey Agrawal et.al. 2306.11800 null
2023-06-20 Meta-Analysis of Transfer Learning for Segmentation of Brain Lesions Sovesh Mohapatra et.al. 2306.11714 null
2023-06-20 Inter-Cell Network Slicing With Transfer Learning Empowered Multi-Agent Deep Reinforcement Learning Tianlun Hu et.al. 2306.11552 null
2023-06-20 MuDPT: Multi-modal Deep-symphysis Prompt Tuning for Large Pre-trained Vision-Language Models Yongzhu Miao et.al. 2306.11400 link
2023-06-20 MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian Willy Fitra Hendria et.al. 2306.11341 link
2023-06-20 Progressive Neural Representation for Sequential Video Compilation Haeyong Kang et.al. 2306.11305 link
2023-06-19 BioREx: Improving Biomedical Relation Extraction by Leveraging Heterogeneous Datasets Po-Ting Lai et.al. 2306.11189 link
2023-06-19 Knowledge Transfer-Driven Few-Shot Class-Incremental Learning Ye Wang et.al. 2306.10942 link
2023-06-19 Detailed retinal vessel segmentation without human annotations using simulated optical coherence tomography angiographs Linus Kreitner et.al. 2306.10941 link
2023-06-19 Transformer Training Strategies for Forecasting Multiple Load Time Series Matthias Hertel et.al. 2306.10891 link
2023-06-23 Text-Driven Foley Sound Generation With Latent Diffusion Model Yi Yuan et.al. 2306.10359 link
2023-06-17 Persian Semantic Role Labeling Using Transfer Learning and BERT-Based Models Saeideh Niksirat Aghdam et.al. 2306.10339 null
2023-06-16 Neural Priming for Sample-Efficient Adaptation Matthew Wallingford et.al. 2306.10191 link
2023-06-16 LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning Jifan Zhang et.al. 2306.09910 link
2023-06-16 Can robots mold soft plastic materials by shaping depth images? Ege Gursoy et.al. 2306.09848 null
2023-06-16 Parameter-efficient is not sufficient: Exploring Parameter, Memory, and Time Efficient Adapter Tuning for Dense Predictions Dongshuo Yin et.al. 2306.09729 null
2023-06-16 Cross-corpus Readability Compatibility Assessment for English Texts Zhenzhen Li et.al. 2306.09704 link
2023-06-16 Early-times Yang-Mills dynamics and the characterization of strongly interacting matter with statistical learning Matthew R. Heffernan et.al. 2306.09619 null
2023-06-15 Understanding and Mitigating Extrapolation Failures in Physics-Informed Neural Networks Lukas Fesser et.al. 2306.09478 link
2023-06-15 A Comparison of Self-Supervised Pretraining Approaches for Predicting Disease Risk from Chest Radiograph Images Yanru Chen et.al. 2306.08955 null
2023-06-14 Iterative self-transfer learning: A general methodology for response time-history prediction based on small dataset Yongjia Xu et.al. 2306.08700 null
2023-06-14 SMC-UDA: Structure-Modal Constraint for Unsupervised Cross-Domain Renal Segmentation Zhusi Zhong et.al. 2306.08213 null
2023-06-14 Solving Large-scale Spatial Problems with Convolutional Neural Networks Damian Owerko et.al. 2306.08191 null
2023-06-13 PersonaPKT: Building Personalized Dialogue Agents via Parameter-efficient Knowledge Transfer Xu Han et.al. 2306.08126 null
2023-06-13 One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning Arnav Chavan et.al. 2306.07967 link
2023-06-13 CAMEO: A Causal Transfer Learning Approach for Performance Optimization of Configurable Computer Systems Md Shahriar Iqbal et.al. 2306.07888 null
2023-06-13 Robustness and Generalization Performance of Deep Learning Models on Cyber-Physical Systems: A Comparative Study Alexander Windmann et.al. 2306.07737 null
2023-06-14 Few-shot Multi-domain Knowledge Rearming for Context-aware Defence against Advanced Persistent Threats Gaolei Li et.al. 2306.07685 null
2023-06-12 EriBERTa: A Bilingual Pre-Trained Language Model for Clinical Natural Language Processing Iker de la Iglesia et.al. 2306.07373 null
2023-06-12 A Brief Review of Hypernetworks in Deep Learning Vinod Kumar Chauhan et.al. 2306.06955 link
2023-06-12 Differentiable Multi-Fidelity Fusion: Efficient Learning of Physics Simulations with Neural Architecture Search and Transfer Learning Yuwen Deng et.al. 2306.06904 null
2023-06-12 Generating Synthetic Datasets by Interpolating along Generalized Geodesics Jiaojiao Fan et.al. 2306.06866 null
2023-06-11 VBSF-TLD: Validation-Based Approach for Soft Computing-Inspired Transfer Learning in Drone Detection Jaskaran Singh et.al. 2306.06797 null
2023-06-11 An information-Theoretic Approach to Semi-supervised Transfer Learning Daniel Jakubovitz et.al. 2306.06731 null
2023-06-10 Enhancing Low Resource NER Using Assisting Language And Transfer Learning Maithili Sabane et.al. 2306.06477 null
2023-06-10 Augmentations of Forman’s Ricci Curvature and their Applications in Community Detection Lukas Fesser et.al. 2306.06474 null
2023-06-09 Understanding the Benefits of Image Augmentations Matthew Iceland et.al. 2306.06254 null
2023-06-09 PoET: A generative model of protein families as sequences-of-sequences Timothy F. Truong Jr et.al. 2306.06156 link
2023-06-13 End-to-End Neural Network Compression via $\frac{\ell_1}{\ell_2}$ Regularized Latency Surrogates Anshul Nasery et.al. 2306.05785 null
2023-06-09 Data-Link: High Fidelity Manufacturing Datasets for Model2Real Transfer under Industrial Settings Sunny Katyara et.al. 2306.05766 null
2023-06-09 Emotion Detection from EEG using Transfer Learning Sidharth Sidharth et.al. 2306.05680 null
2023-06-09 Customizing General-Purpose Foundation Models for Medical Report Generation Bang Yang et.al. 2306.05642 null
2023-06-08 PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Speech Models Tiantian Feng et.al. 2306.05350 link
2023-06-08 T3L: Translate-and-Test Transfer Learning for Cross-Lingual Text Classification Inigo Jauregi Unanue et.al. 2306.04996 link
2023-06-09 Generalization Performance of Transfer Learning: Overparameterized and Underparameterized Regimes Peizhong Ju et.al. 2306.04901 null
2023-06-08 ExtPerFC: An Efficient 2D and 3D Perception Hardware-Software Framework for Mobile Cobot Tuan Dang et.al. 2306.04853 link
2023-06-07 OBSTransformer: A Deep-Learning Seismic Phase Picker for OBS Data Using Automated Labelling and Transfer Learning Alireza Niksejel et.al. 2306.04753 link
2023-06-07 AutoML Systems For Medical Imaging Tasmia Tahmida Jidney et.al. 2306.04750 null
2023-06-07 Prompter: Zero-shot Adaptive Prefixes for Dialogue State Tracking Domain Adaptation Taha Aksu et.al. 2306.04724 link
2023-06-07 Zambezi Voice: A Multilingual Speech Corpus for Zambian Languages Claytone Sikasote et.al. 2306.04428 link
2023-06-07 Transfer Learning of Transformer-based Speech Recognition Models from Czech to Slovak Jan Lehečka et.al. 2306.04399 null
2023-06-07 Transfer Learning from Pre-trained Language Models Improves End-to-End Speech Summarization Kohei Matsuura et.al. 2306.04233 null
2023-06-07 Transfer Learning for General M-estimators with Decomposable Regularizers in High-dimensions Zeyu Li et.al. 2306.04182 null
2023-06-07 Physics-informed reinforcement learning for sample-efficient optimization of freeform nanophotonic devices Chaejin Park et.al. 2306.04108 link
2023-06-07 XSemPLR: Cross-Lingual Semantic Parsing in Multiple Natural Languages and Meaning Representations Yusen Zhang et.al. 2306.04085 link
2023-06-06 Guiding The Last Layer in Federated Learning with Pre-Trained Models Gwen Legate et.al. 2306.03937 link
2023-06-01 On the Robustness of Arabic Speech Dialect Identification Peter Sullivan et.al. 2306.03789 null
2023-06-06 Deep Learning-Enabled Sleep Staging From Vital Signs and Activity Measured Using a Near-Infrared Video Camera Jonathan Carter et.al. 2306.03711 null
2023-06-06 The Creative Frontier of Generative AI: Managing the Novelty-Usefulness Tradeoff Anirban Mukherjee et.al. 2306.03601 null
2023-06-06 “A Little is Enough”: Few-Shot Quality Estimation based Corpus Filtering improves Machine Translation Akshay Batheja et.al. 2306.03507 null
2023-06-06 Subgraph Networks Based Contrastive Learning Jinhuan Wang et.al. 2306.03506 null
2023-06-05 Cross-Lingual Transfer Learning for Phrase Break Prediction with Multilingual Language Model Hoyeon Lee et.al. 2306.02579 null
2023-06-06 Training Like a Medical Resident: Universal Medical Image Segmentation via Context Prior Learning Yunhe Gao et.al. 2306.02416 link
2023-06-02 Distilling Efficient Language-Specific Models for Cross-Lingual Transfer Alan Ansell et.al. 2306.01709 link
2023-06-02 Resolving Interference When Merging Models Prateek Yadav et.al. 2306.01708 link
2023-06-02 Transfer learning for atomistic simulations using GNNs and kernel mean embeddings John Falk et.al. 2306.01589 link
2023-06-02 Speech Translation with Foundation Models and Optimal Transport: UPC at IWSLT23 Ioannis Tsiamas et.al. 2306.01327 null
2023-06-02 A new method using deep transfer learning on ECG to predict the response to cardiac resynchronization therapy Zhuo He et.al. 2306.01210 null
2023-06-01 TMI! Finetuned Models Leak Private Information from their Pretraining Data John Abascal et.al. 2306.01181 link
2023-06-01 Improved Cross-Lingual Transfer Learning For Automatic Speech Translation Sameer Khurana et.al. 2306.00789 null
2023-06-01 Improving Polish to English Neural Machine Translation with Transfer Learning: Effects of Data Volume and Language Similarity Juuso Eronen et.al. 2306.00660 null
2023-06-01 The Effects of Input Type and Pronunciation Dictionary Usage in Transfer Learning for Low-Resource Text-to-Speech Phat Do et.al. 2306.00535 null
2023-06-01 Divide, Conquer, and Combine: Mixture of Semantic-Independent Experts for Zero-Shot Dialogue State Tracking Qingyue Wang et.al. 2306.00434 null
2023-06-01 Adapting Pre-trained Language Models to Vision-Language Tasks via Dynamic Visual Prompting Shubin Huang et.al. 2306.00409 link
2023-06-01 Autism Disease Detection Using Transfer Learning Techniques: Performance Comparison Between Central Processing Unit vs Graphics Processing Unit Functions for Neural Networks Mst Shapna Akter et.al. 2306.00283 null
2023-06-01 Transfer Learning for Underrepresented Music Generation Anahita Doosti et.al. 2306.00281 null
2023-06-01 Maximal Domain Independent Representations Improve Transfer Learning Adrian Shuai Li et.al. 2306.00262 null
2023-06-01 Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior Shashank Subramanian et.al. 2306.00258 null
2023-05-31 Pre-Trained Language-Meaning Models for Multilingual Parsing and Generation Chunliu Wang et.al. 2306.00124 link
2023-05-31 Additional Positive Enables Better Representation Learning for Medical Images Dewen Zeng et.al. 2306.00112 null
2023-05-31 MetaXLR – Mixed Language Meta Representation Transformation for Low-resource Cross-lingual Learning based on Multi-Armed Bandit Liat Bezalel et.al. 2306.00100 link
2023-05-31 A Survey of Label-Efficient Deep Learning for 3D Point Clouds Aoran Xiao et.al. 2305.19812 link
2023-05-31 Simple yet Effective Code-Switching Language Identification with Multitask Pre-Training and Transfer Learning Shuyue Stella Li et.al. 2305.19759 null
2023-05-31 Hypothesis Transfer Learning with Surrogate Classification Losses Anass Aghbalou et.al. 2305.19694 null
2023-05-31 VIPriors 3: Visual Inductive Priors for Data-Efficient Deep Learning Challenges Robert-Jan Bruintjes et.al. 2305.19688 null
2023-06-01 Point-GCC: Universal Self-supervised 3D Scene Pre-training via Geometry-Color Contrast Guofan Fan et.al. 2305.19623 link
2023-05-31 SLABERT Talk Pretty One Day: Modeling Second Language Acquisition with BERT Aditya Yadavalli et.al. 2305.19589 null
2023-05-31 Deep into The Domain Shift: Transfer Learning through Dependence Regularization Shumin Ma et.al. 2305.19499 link
2023-05-30 Transfer Learning With Efficient Estimators to Optimally Leverage Historical Data in Analysis of Randomized Trials Lauren D. Liao et.al. 2305.19180 link

diffusion model

Publish Date Title Authors PDF Code
2025-06-30 Epona: Autoregressive Diffusion World Model for Autonomous Driving Kaiwen Zhang et.al. 2506.24113 null
2025-06-30 Navigating with Annealing Guidance Scale in Diffusion Space Shai Yehezkel et.al. 2506.24108 null
2025-06-30 Imagine for Me: Creative Conceptual Blending of Real Images and Text via Blended Attention Wonwoong Cho et.al. 2506.24085 null
2025-06-30 Faster Diffusion Models via Higher-Order Approximation Gen Li et.al. 2506.24042 null
2025-06-30 Supervised Diffusion-Model-Based PET Image Reconstruction George Webber et.al. 2506.24034 null
2025-06-30 VMoBA: Mixture-of-Block Attention for Video Diffusion Models Jianzong Wu et.al. 2506.23858 null
2025-06-30 Controllable Reference-Based Real-World Remote Sensing Image Super-Resolution with Generative Diffusion Priors Ce Wang et.al. 2506.23801 null
2025-06-30 Radioactive Watermarks in Diffusion and Autoregressive Image Generative Models Michel Meintz et.al. 2506.23731 null
2025-06-30 Proteus-ID: ID-Consistent and Motion-Coherent Video Customization Guiyu Zhang et.al. 2506.23729 null
2025-06-30 MDPG: Multi-domain Diffusion Prior Guidance for MRI Reconstruction Lingtong Zhang et.al. 2506.23701 null
2025-06-30 A Unified Framework for Stealthy Adversarial Generation via Latent Optimization and Transferability Enhancement Gaozheng Pei et.al. 2506.23676 null
2025-06-30 Diffusion Model-based Data Augmentation Method for Fetal Head Ultrasound Segmentation Fangyijie Wang et.al. 2506.23664 null
2025-06-30 Blending Concepts with Text-to-Image Diffusion Models Lorenzo Olearo et.al. 2506.23630 null
2025-06-30 TurboVSR: Fantastic Video Upscalers and Where to Find Them Zhongdao Wang et.al. 2506.23618 null
2025-06-30 SG-LDM: Semantic-Guided LiDAR Generation via Latent-Aligned Diffusion Zhengkang Xiang et.al. 2506.23606 null
2025-06-30 Metadata, Wavelet, and Time Aware Diffusion Models for Satellite Image Super Resolution Luigi Sigillo et.al. 2506.23566 null
2025-06-30 Uncertainty-aware Diffusion and Reinforcement Learning for Joint Plane Localization and Anomaly Diagnosis in 3D Ultrasound Yuhao Huang et.al. 2506.23538 null
2025-06-30 WAVE: Warp-Based View Guidance for Consistent Novel View Synthesis Using a Single Image Jiwoo Park et.al. 2506.23518 null
2025-06-30 ViewPoint: Panoramic Video Generation with Pretrained Diffusion Models Zixun Fang et.al. 2506.23513 null
2025-06-30 MTADiffusion: Mask Text Alignment Diffusion Model for Object Inpainting Jun Huang et.al. 2506.23482 null
2025-06-26 SmoothSinger: A Conditional Diffusion Model for Singing Voice Synthesis with Multi-Resolution Architecture Kehan Sui et.al. 2506.21478 null
2025-06-26 Rethinking Oversaturation in Classifier-Free Guidance via Low Frequency Kaiyu Song et.al. 2506.21452 null
2025-06-26 Controllable 3D Placement of Objects with Scene-Aware Diffusion Models Mohamed Omran et.al. 2506.21446 null
2025-06-26 HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation Diego Biagini et.al. 2506.21287 null
2025-06-27 FairyGen: Storied Cartoon Video from a Single Child-Drawn Character Jiayi Zheng et.al. 2506.21272 null
2025-06-27 Alternating Spintronics: Capacitive Behavior of Spin Valves and Resonator Applications Yunwen Liu et.al. 2506.21176 null
2025-06-26 Compressed and Smooth Latent Space for Text Diffusion Modeling Viacheslav Meshchaninov et.al. 2506.21170 null
2025-06-26 Geometry and Perception Guided Gaussians for Multiview-consistent 3D Generation from a Single Image Pufan Li et.al. 2506.21152 null
2025-06-26 Learning to See in the Extremely Dark Hai Jiang et.al. 2506.21132 null
2025-06-26 Unlasting: Unpaired Single-Cell Multi-Perturbation Estimation by Dual Conditional Diffusion Implicit Bridges Changxi Chi et.al. 2506.21107 null
2025-06-26 Improving Diffusion-Based Image Editing Faithfulness via Guidance and Scheduling Hansam Cho et.al. 2506.21045 null
2025-06-26 Boosting Domain Generalized and Adaptive Detection with Diffusion Models: Fitness, Generalization, and Transferability Boyong He et.al. 2506.21042 null
2025-06-27 DidSee: Diffusion-Based Depth Completion for Material-Agnostic Robotic Perception and Manipulation Wenzhou Lyu et.al. 2506.21034 null
2025-06-26 From Cradle to Cane: A Two-Pass Framework for High-Fidelity Lifespan Face Aging Tao Liu et.al. 2506.20977 null
2025-06-26 ThermalDiffusion: Visual-to-Thermal Image-to-Image Translation for Autonomous Navigation Shruti Bansal et.al. 2506.20969 null
2025-06-26 Antibody Design and Optimization with Multi-scale Equivariant Graph Diffusion Models for Accurate Complex Antigen Binding Jiameng Chen et.al. 2506.20957 null
2025-06-25 Leveraging Vision-Language Models to Select Trustworthy Super-Resolution Samples Generated by Diffusion Models Cansu Korkmaz et.al. 2506.20832 null
2025-06-25 Stochastic and Non-local Closure Modeling for Nonlinear Dynamical Systems via Latent Score-based Generative Models Xinghao Dong et.al. 2506.20771 null
2025-06-25 StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation Haodong Li et.al. 2506.20756 null
2025-06-25 On Convolutions, Intrinsic Dimension, and Diffusion Models Kin Kwan Leung et.al. 2506.20705 null
2025-06-25 EditP23: 3D Editing via Propagation of Image Prompts to Multi-View Roi Bar-On et.al. 2506.20652 null
2025-06-25 Telegrapher’s Generative Model via Kac Flows Richard Duong et.al. 2506.20641 null
2025-06-26 DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation Shansan Gong et.al. 2506.20639 null
2025-06-25 MC for Agriculture: A Framework for Nature-inspired Sustainable Pest Control Fardad Vakilipoor et.al. 2506.20637 null
2025-06-25 Shape2Animal: Creative Animal Generation from Natural Silhouettes Quoc-Duy Tran et.al. 2506.20616 null
2025-06-25 Pay Less Attention to Deceptive Artifacts: Robust Detection of Compressed Deepfakes on Online Social Networks Manyi Li et.al. 2506.20548 null
2025-06-25 HiWave: Training-Free High-Resolution Image Generation via Wavelet-Based Diffusion Sampling Tobias Vontobel et.al. 2506.20452 null
2025-06-25 TDiR: Transformer based Diffusion for Image Restoration Tasks Abbas Anwar et.al. 2506.20302 null
2025-06-25 Ctrl-Z Sampling: Diffusion Sampling with Controlled Random Zigzag Explorations Shunqi Mao et.al. 2506.20294 null
2025-06-25 Recognizing Surgical Phases Anywhere: Few-Shot Test-time Adaptation and Task-graph Guided Refinement Kun Yuan et.al. 2506.20254 null
2025-06-25 Towards Efficient Exemplar Based Image Editing with Multimodal VLMs Avadhoot Jadhav et.al. 2506.20155 null
2025-06-24 Robust Robotic Exploration and Mapping Using Generative Occupancy Map Synthesis Lorin Achey et.al. 2506.20049 null
2025-06-24 Elucidated Rolling Diffusion Models for Probabilistic Weather Forecasting Salva Rühling Cachay et.al. 2506.20024 null
2025-06-24 Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture Shuchen Xue et.al. 2506.19935 null
2025-06-24 Radial Attention: $O(n\log n)$ Sparse Attention with Energy Decay for Long Video Generation Xingyang Li et.al. 2506.19852 null
2025-06-24 AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models Zehuan Huang et.al. 2506.19851 null
2025-06-24 GenHSI: Controllable Generation of Human-Scene Interaction Videos Zekun Li et.al. 2506.19840 null
2025-06-24 Improving Progressive Generation with Decomposable Flow Matching Moayed Haji-Ali et.al. 2506.19839 null
2025-06-24 SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution Liangbin Xie et.al. 2506.19838 null
2025-06-24 Machine Learning with Privacy for Protected Attributes Saeed Mahloujifar et.al. 2506.19836 null
2025-06-23 Audit & Repair: An Agentic Framework for Consistent Story Visualization in Text-to-Image Diffusion Models Kiymet Akdemir et.al. 2506.18900 null
2025-06-23 MinD: Unified Visual Imagination and Control via Hierarchical World Models Xiaowei Chi et.al. 2506.18897 null
2025-06-23 Let Your Video Listen to Your Music! Xinyu Zhang et.al. 2506.18881 null
2025-06-23 ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs Michal Nazarczuk et.al. 2506.18792 null
2025-06-23 TCDiff++: An End-to-end Trajectory-Controllable Diffusion Model for Harmonious Music-Driven Group Choreography Yuqin Dai et.al. 2506.18671 null
2025-06-23 GANs vs. Diffusion Models for virtual staining with the HER2match dataset Pascal Klöckner et.al. 2506.18484 null
2025-06-23 DIP: Unsupervised Dense In-Context Post-training of Visual Representations Sophia Sirko-Galouchenko et.al. 2506.18463 null
2025-06-23 CPAM: Context-Preserving Adaptive Manipulation for Zero-Shot Real Image Editing Dinh-Khoi Vo et.al. 2506.18438 null
2025-06-23 How Robust is Model Editing after Fine-Tuning? An Empirical Study on Text-to-Image Diffusion Models Feng He et.al. 2506.18428 null
2025-06-23 Generative Diffusion Receivers: Achieving Pilot-Efficient MIMO-OFDM Communications Yuzhi Yang et.al. 2506.18419 null
2025-06-23 Large-Scale Training Data Attribution for Music Generative Models via Unlearning Woosung Choi et.al. 2506.18312 null
2025-06-23 Instability in Diffusion ODEs: An Explanation for Inaccurate Image Reconstruction Han Zhang et.al. 2506.18290 null
2025-06-23 Adaptive Mask-guided K-space Diffusion for Accelerated MRI Reconstruction Qinrong Cai et.al. 2506.18270 null
2025-06-23 Morse: Dual-Sampling for Lossless Acceleration of Diffusion Models Chao Li et.al. 2506.18251 null
2025-06-23 Exact Conditional Score-Guided Generative Modeling for Amortized Inference in Uncertainty Quantification Zezhong Zhang et.al. 2506.18227 null
2025-06-23 American options valuation in time-dependent jump-diffusion models via integral equations and characteristic functions Andrey Itkin et.al. 2506.18210 null
2025-06-22 CDG-MAE: Learning Correspondences from Diffusion Generated Views Varun Belagali et.al. 2506.18164 null
2025-06-22 Targeted False Positive Synthesis via Detector-guided Adversarial Diffusion Attacker for Robust Polyp Detection Quan Zhou et.al. 2506.18134 null
2025-06-22 Enabling PSO-Secure Synthetic Data Sharing Using Diversity-Aware Diffusion Models Mischa Dombrowski et.al. 2506.17975 null
2025-06-24 GD-Retriever: Controllable Generative Text-Music Retrieval with Diffusion Models Julien Guinot et.al. 2506.17886 null
2025-06-18 Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards Qingming Liu et.al. 2506.15684 null
2025-06-18 Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model Anirud Aggarwal et.al. 2506.15682 link
2025-06-18 UniRelight: Learning Joint Decomposition and Synthesis for Video Relighting Kai He et.al. 2506.15673 null
2025-06-18 HOIDiNi: Human-Object Interaction through Diffusion Noise Optimization Roey Ron et.al. 2506.15625 null
2025-06-18 One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution Yujing Sun et.al. 2506.15591 link
2025-06-18 Control and Realism: Best of Both Worlds in Layout-to-Image without Training Bonan Li et.al. 2506.15563 null
2025-06-18 Diff-TONE: Timestep Optimization for iNstrument Editing in Text-to-Music Diffusion Models Teysir Baoueb et.al. 2506.15530 null
2025-06-18 GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects Shujia Li et.al. 2506.15483 null
2025-06-18 Provable Maximum Entropy Manifold Exploration via Diffusion Models Riccardo De Santi et.al. 2506.15385 null
2025-06-18 When Model Knowledge meets Diffusion Model: Diffusion-assisted Data-free Image Synthesis with Alignment of Domain and Class Yujin Kim et.al. 2506.15381 null
2025-06-18 Acoustic Waveform Inversion with Image-to-Image Schrödinger Bridges A. S. Stankevich et.al. 2506.15346 link
2025-06-19 Naive parton picture for color transparency of kaon in the electronuclear reaction $A(e,e’K^+)$ Kook-Jin Kong et.al. 2506.15331 null
2025-06-18 One-shot Face Sketch Synthesis in the Wild via Generative Diffusion Prior and Instruction Tuning Han Wu et.al. 2506.15312 link
2025-06-18 Human Motion Capture from Loose and Sparse Inertial Sensors with Garment-aware Diffusion Models Andela Ilic et.al. 2506.15290 null
2025-06-18 DM-FNet: Unified multimodal medical image fusion via diffusion process-trained encoder-decoder Dan He et.al. 2506.15218 link
2025-06-18 Echo-DND: A dual noise diffusion model for robust and precise left ventricle segmentation in echocardiography Abdur Rahman et.al. 2506.15166 null
2025-06-18 Fundamentals of the metal contact to p-type GaN: new multilayer design Konrad Sakowski et.al. 2506.15163 null
2025-06-18 Generative thermodynamic computing Stephen Whitelam et.al. 2506.15121 null
2025-06-17 Frequency-Calibrated Membership Inference Attacks on Medical Image Diffusion Models Xinkai Zhao et.al. 2506.14919 null
2025-06-17 CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion Jiahua Ma et.al. 2506.14769 null
2025-06-16 Diagnosing and Improving Diffusion Models by Estimating the Optimal Loss Value Yixian Xu et.al. 2506.13763 null
2025-06-17 VideoPDE: Unified Generative PDE Solving via Video Inpainting Diffusion Models Edward Li et.al. 2506.13754 null
2025-06-16 MultiViT2: A Data-augmented Multimodal Neuroimaging Prediction Framework via Latent Diffusion Model Bi Yuda et.al. 2506.13667 null
2025-06-16 Exploiting the Exact Denoising Posterior Score in Training-Free Guidance of Diffusion Models Gregory Bellchambers et.al. 2506.13614 null
2025-06-16 Dive3D: Diverse Distillation-based Text-to-3D Generation via Score Implicit Matching Weimin Bai et.al. 2506.13594 null
2025-06-16 Flexible-length Text Infilling for Discrete Diffusion Models Andrew Zhang et.al. 2506.13579 null
2025-06-16 X-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability Yu Yang et.al. 2506.13558 null
2025-06-16 Seismic Acoustic Impedance Inversion Framework Based on Conditional Latent Generative Diffusion Model Jie Chen et.al. 2506.13529 null
2025-06-16 Deep Diffusion Models and Unsupervised Hyperspectral Unmixing for Realistic Abundance Map Synthesis Martina Pastorino et.al. 2506.13484 null
2025-06-16 PRO: Projection Domain Synthesis for CT Imaging Kang Chen et.al. 2506.13443 null
2025-06-16 Zero-Shot Solving of Imaging Inverse Problems via Noise-Refined Likelihood Guided Diffusion Models Zhen Wang et.al. 2506.13391 null
2025-06-16 LapDDPM: A Conditional Graph Diffusion Model for scRNA-seq Generation with Spectral Adversarial Perturbations Lorenzo Bini et.al. 2506.13344 null
2025-06-16 Quantitative Comparison of Fine-Tuning Techniques for Pretrained Latent Diffusion Models in the Generation of Unseen SAR Image Concepts Solène Debuysère et.al. 2506.13307 null
2025-06-16 AttentionDrag: Exploiting Latent Correlation Knowledge in Pre-trained Diffusion Models for Image Editing Biao Yang et.al. 2506.13301 null
2025-06-16 Overcoming Overfitting in Reinforcement Learning via Gaussian Process Diffusion Policy Amornyos Horprasert et.al. 2506.13111 link
2025-06-16 DualFast: Dual-Speedup Framework for Fast Sampling of Diffusion Models Hu Yu et.al. 2506.13058 null
2025-06-16 A Comprehensive Survey on Continual Learning in Generative Models Haiyang Guo et.al. 2506.13045 link
2025-06-15 Generative modeling of seismic data using diffusion models and its application to multi-purpose posterior sampling for noisy inverse problems Chuangji Meng et.al. 2506.12897 null
2025-06-15 EraserDiT: Fast Video Inpainting with Diffusion Transformer Model Jie Liu et.al. 2506.12853 null
2025-06-15 DiffS-NOCS: 3D Point Cloud Reconstruction through Coloring Sketches to NOCS Maps Using Diffusion Models Di Kong et.al. 2506.12835 null
2025-06-12 SceneCompleter: Dense 3D Scene Completion for Generative Novel View Synthesis Weiliang Chen et.al. 2506.10981 null
2025-06-12 Fine-Grained Perturbation Guidance via Attention Head Selection Donghoon Ahn et.al. 2506.10978 null
2025-06-12 What Exactly Does Guidance Do in Masked Discrete Diffusion Models He Ye et.al. 2506.10971 null
2025-06-13 MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning Yuxuan Luo et.al. 2506.10963 null
2025-06-12 SpectralAR: Spectral Autoregressive Visual Generation Yuanhui Huang et.al. 2506.10962 null
2025-06-12 ReGuidance: A Simple Diffusion Wrapper for Boosting Sample Quality on Hard Inverse Problems Aayush Karan et.al. 2506.10955 null
2025-06-12 The Diffusion Duality Subham Sekhar Sahoo et.al. 2506.10892 link
2025-06-12 ME: Trigger Element Combination Backdoor Attack on Copyright Infringement Feiyu Yang et.al. 2506.10776 null
2025-06-13 PDESpectralRefiner: Achieving More Accurate Long Rollouts with Spectral Adjustment Li Luo et.al. 2506.10711 null
2025-06-12 Unsourced Adversarial CAPTCHA: A Bi-Phase Adversarial CAPTCHA Framework Xia Du et.al. 2506.10685 null
2025-06-12 GigaVideo-1: Advancing Video Generation via Automatic Feedback with 4 GPU-Hours Fine-Tuning Xiaoyi Bao et.al. 2506.10639 null
2025-06-12 Anatomy-Grounded Weakly Supervised Prompt Tuning for Chest X-ray Latent Diffusion Models Konstantinos Vilouras et.al. 2506.10633 null
2025-06-12 Hessian Geometry of Latent Space in Generative Models Alexander Lobashev et.al. 2506.10632 link
2025-06-12 TexTailor: Customized Text-aligned Texturing via Effective Resampling Suin Lee et.al. 2506.10612 link
2025-06-12 High-resolution efficient image generation from WiFi CSI using a pretrained latent diffusion model Eshan Ramesh et.al. 2506.10605 null
2025-06-12 Harmonizing Geometry and Uncertainty: Diffusion with Hyperspheres Muskan Dosi et.al. 2506.10576 null
2025-06-12 Equivariant Neural Diffusion for Molecule Generation François Cornet et.al. 2506.10532 link
2025-06-12 Edit360: 2D Image Edits to 3D Assets from Any Angle Junchao Huang et.al. 2506.10507 null
2025-06-12 A Crack in the Bark: Leveraging Public Knowledge to Remove Tree-Ring Watermarks Junhua Lin et.al. 2506.10502 null
2025-06-12 Measuring Semantic Information Production in Generative Diffusion Models Florian Handke et.al. 2506.10433 null
2025-06-09 StableMTL: Repurposing Latent Diffusion Models for Multi-Task Learning from Partially Annotated Synthetic Datasets Anh-Quan Cao et.al. 2506.08013 link
2025-06-09 Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion Xun Huang et.al. 2506.08009 null
2025-06-09 Dynamic View Synthesis as an Inverse Problem Hidir Yesiltepe et.al. 2506.08004 null
2025-06-09 MADFormer: Mixed Autoregressive and Diffusion Transformers for Continuous Image Generation Junhao Chen et.al. 2506.07999 null
2025-06-09 Generative Modeling of Weights: Generalization or Memorization? Boya Zeng et.al. 2506.07998 link
2025-06-09 Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers Zhengyao Lv et.al. 2506.07986 link
2025-06-09 Gradients: When Markets Meet Fine-tuning – A Distributed Approach to Model Optimisation Christopher Subia-Waud et.al. 2506.07940 null
2025-06-09 Efficient Seismic Data Interpolation via Sparse Attention Transformer and Diffusion Model Xiaoli Wei et.al. 2506.07923 null
2025-06-09 Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces Kevin Rojas et.al. 2506.07903 link
2025-06-09 FunDiff: Diffusion Models over Function Spaces for Physics-Informed Generative Modeling Sifan Wang et.al. 2506.07902 link
2025-06-09 Video Unlearning via Low-Rank Refusal Vector Simone Facchiano et.al. 2506.07891 null
2025-06-09 Diffusion Counterfactual Generation with Semantic Abduction Rajat Rasal et.al. 2506.07883 link
2025-06-09 Jarzynski Reweighting and Sampling Dynamics for Training Energy-Based Models: Theoretical Analysis of Different Transition Kernels Davide Carbone et.al. 2506.07843 null
2025-06-09 Diffusion models under low-noise regime Elizabeth Pavlova et.al. 2506.07841 link
2025-06-09 R3D2: Realistic 3D Asset Insertion via Diffusion for Autonomous Driving Simulation William Ljungbergh et.al. 2506.07826 null
2025-06-09 Accelerating Diffusion Models in Offline RL via Reward-Aware Consistency Trajectory Distillation Xintong Duan et.al. 2506.07822 null
2025-06-09 Self-Cascaded Diffusion Models for Arbitrary-Scale Image Super-Resolution Junseo Bang et.al. 2506.07813 null
2025-06-09 Diffusion Models-Aided Uplink Channel Estimation for RIS-Assisted Systems Yang Wang et.al. 2506.07770 null
2025-06-09 Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy Generation Hyunsoo Kim et.al. 2506.07750 null
2025-06-09 Consistent Video Editing as Flow-Driven Image-to-Video Generation Ge Wang et.al. 2506.07713 null
2025-06-05 Contrastive Flow Matching George Stoica et.al. 2506.05350 link
2025-06-06 Exploring Diffusion Transformer Designs via Grafting Keshigeyan Chandrasegaran et.al. 2506.05340 link
2025-06-05 Progressive Tempering Sampler with Diffusion Severi Rissanen et.al. 2506.05231 link
2025-06-05 OGGSplat: Open Gaussian Growing for Generalizable Reconstruction with Expanded Field-of-View Yanbo Wang et.al. 2506.05204 link
2025-06-05 Quantifying Cross-Modality Memorization in Vision-Language Models Yuxin Wen et.al. 2506.05198 null
2025-06-05 Associative Memory and Generative Diffusion in the Zero-noise Limit Joshua Hess et.al. 2506.05178 null
2025-06-05 Neural Jumps for Option Pricing Duosi Zheng et.al. 2506.05137 null
2025-06-06 SeedEdit 3.0: Fast and High-Quality Generative Image Editing Peng Wang et.al. 2506.05083 null
2025-06-05 FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing Guangzhao Li et.al. 2506.05046 null
2025-06-05 Invisible Backdoor Triggers in Image Editing Model via Deep Watermarking Yu-Feng Chen et.al. 2506.04879 link
2025-06-06 Sparse Autoencoders, Again? Yin Lu et.al. 2506.04859 null
2025-06-05 Learning dissection trajectories from expert surgical videos via imitation learning with equivariant diffusion Hongyu Wang et.al. 2506.04716 null
2025-06-05 Text-Aware Real-World Image Super-Resolution via Diffusion Model with Joint Segmentation Decoders Qiming Hu et.al. 2506.04641 null
2025-06-05 Perfecting Depth: Uncertainty-Aware Enhancement of Metric Depth Jinyoung Jun et.al. 2506.04612 null
2025-06-05 SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents Alexander Huang-Menders et.al. 2506.04606 null
2025-06-04 HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation Hermann Kumbong et.al. 2506.04421 null
2025-06-04 Is Perturbation-Based Image Protection Disruptive to Image Editing? Qiuyu Tang et.al. 2506.04394 null
2025-06-04 HuGeDiff: 3D Human Generation via Diffusion with Gaussian Splatting Maksym Ivashechkin et.al. 2506.04351 null
2025-06-04 Sounding that Object: Interactive Object-Aware Image to Audio Generation Tingle Li et.al. 2506.04214 null
2025-06-04 Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector Boyong He et.al. 2506.04211 link
2025-06-04 Image Editing As Programs with Diffusion Models Yujia Hu et.al. 2506.04158 null
2025-06-04 Global convergence rates in the relaxation limits for the compressible Euler and Euler-Maxwell systems in Sobolev spaces Timothée Crin-Barat et.al. 2506.04103 null
2025-06-04 A Generative Adaptive Replay Continual Learning Model for Temporal Knowledge Graph Reasoning Zhiyu Zhang et.al. 2506.04083 null
2025-06-04 Beyond water limitation in vegetation-autotoxicity patterning: a cross-diffusion model Francesco Giannino et.al. 2506.03981 null
2025-06-05 Solving Inverse Problems via Diffusion-Based Priors: An Approximation-Free Ensemble Sampling Approach Haoxuan Chen et.al. 2506.03979 null
2025-06-04 DiffCAP: Diffusion-based Cumulative Adversarial Purification for Vision Language Models Jia Fu et.al. 2506.03933 null
2025-06-04 Personalized MR-Informed Diffusion Models for 3D PET Image Reconstruction George Webber et.al. 2506.03804 null
2025-06-04 EmoArt: A Multidimensional Dataset for Emotion-Aware Artistic Generation Cheng Zhang et.al. 2506.03652 null
2025-06-04 DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models Ziyi Wu et.al. 2506.03517 null
2025-06-04 CHIME: Conditional Hallucination and Integrated Multi-scale Enhancement for Time Series Diffusion Model Yuxuan Chen et.al. 2506.03502 null
2025-06-04 Facial Appearance Capture at Home with Patch-Level Reflectance Prior Yuxuan Han et.al. 2506.03478 link
2025-06-03 A Data-Driven Diffusion-based Approach for Audio Deepfake Explanations Petr Grinberg et.al. 2506.03425 null
2025-06-03 Robustness in Both Domains: CLIP Needs a Robust Text Encoder Elias Abad Rocamora et.al. 2506.03355 null
2025-06-03 AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation Lu Qiu et.al. 2506.03126 null
2025-06-03 DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation Zhengyao Lv et.al. 2506.03123 null
2025-06-03 Rectified Flows for Fast Multiscale Fluid Flow Modeling Victor Armegioiu et.al. 2506.03111 null
2025-06-03 TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models Chetwin Low et.al. 2506.03099 null
2025-06-03 EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models Mingzhe Li et.al. 2506.03067 null
2025-05-30 AdaHuman: Animatable Detailed 3D Human Generation with Compositional Multiview Diffusion Yangyi Huang et.al. 2505.24877 null
2025-05-30 MiniMax-Remover: Taming Bad Noise Helps Video Object Removal Bojia Zi et.al. 2505.24873 null
2025-05-30 Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking Heli Ben-Hamu et.al. 2505.24857 null
2025-05-30 RealDrive: Retrieval-Augmented Driving with Diffusion Models Wenhao Ding et.al. 2505.24808 null
2025-05-30 Generalization Dynamics of Linear Diffusion Models Claudia Merger et.al. 2505.24769 null
2025-05-30 A Composite Predictive-Generative Approach to Monaural Universal Speech Enhancement Jie Zhang et.al. 2505.24576 null
2025-05-30 UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation Yang-Tian Sun et.al. 2505.24521 null
2025-05-30 EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering Runnan Lu et.al. 2505.24417 link
2025-05-30 IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models Hanting Wang et.al. 2505.24406 link
2025-06-03 Interpreting Large Text-to-Image Diffusion Models with Dictionary Learning Stepan Shabalin et.al. 2505.24360 link
2025-05-30 InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing Jinlu Zhang et.al. 2505.24315 null
2025-05-30 Category-aware EEG image generation based on wavelet transform and contrast semantic loss Enshang Zhang et.al. 2505.24301 link
2025-05-30 Large Language Models are Locally Linear Mappings James R. Golden et.al. 2505.24293 link
2025-05-30 MUSE: Model-Agnostic Tabular Watermarking via Multi-Sample Selection Liancheng Fang et.al. 2505.24267 null
2025-05-30 Generative AI for Urban Design: A Stepwise Approach Integrating Human Expertise with Multimodal Diffusion Models Mingyi He et.al. 2505.24260 null
2025-05-30 Interactive Video Generation via Domain Adaptation Ishaan Rawal et.al. 2505.24253 null
2025-05-30 LTM3D: Bridging Token Spaces for Conditional 3D Generation with Auto-Regressive Diffusion Framework Xin Kang et.al. 2505.24245 null
2025-05-30 Unleashing High-Quality Image Generation in Diffusion Sampling Using Second-Order Levenberg-Marquardt-Langevin Fangyikang Wang et.al. 2505.24222 link
2025-05-30 STORK: Improving the Fidelity of Mid-NFE Sampling for Diffusion and Flow Matching Models Zheng Tan et.al. 2505.24210 link
2025-05-30 Aligning Protein Conformation Ensemble Generation with Physical Feedback Jiarui Lu et.al. 2505.24203 null
2025-05-29 LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers Yusuf Dalva et.al. 2505.23758 null
2025-05-29 DarkDiff: Advancing Low-Light Raw Enhancement by Retasking Diffusion Models for Camera ISP Amber Yijia Zheng et.al. 2505.23743 null
2025-05-29 LayerPeeler: Autoregressive Peeling for Layer-wise Image Vectorization Ronghuan Wu et.al. 2505.23740 null
2025-05-29 How Animals Dance (When You’re Not Looking) Xiaojuan Wang et.al. 2505.23738 null
2025-05-29 DiffER: Categorical Diffusion for Chemical Retrosynthesis Sean Current et.al. 2505.23721 link
2025-05-29 ImmunoDiff: A Diffusion Model for Immunotherapy Response Prediction in Lung Cancer Moinak Bhattacharya et.al. 2505.23675 null
2025-05-30 OpenUni: A Simple Baseline for Unified Multimodal Understanding and Generation Size Wu et.al. 2505.23661 link
2025-05-29 VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models Xiangdong Zhang et.al. 2505.23656 link
2025-05-29 Optimization-Free Diffusion Model – A Perturbation Theory Approach Yuehaw Khoo et.al. 2505.23652 null
2025-05-29 ZeroSep: Separate Anything in Audio with Zero Training Chao Huang et.al. 2505.23625 null
2025-05-29 Inference-time Scaling of Diffusion Models through Classical Search Xiangcheng Zhang et.al. 2505.23614 null
2025-05-29 Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model Qingyu Shi et.al. 2505.23606 link
2025-05-29 Normalizing Flows are Capable Models for RL Raj Ghugare et.al. 2505.23527 link
2025-05-29 LAFR: Efficient Diffusion-based Blind Face Restoration via Latent Codebook Alignment Adapter Runyi Li et.al. 2505.23462 null
2025-05-29 Diffusion Guidance Is a Controllable Policy Improvement Operator Kevin Frans et.al. 2505.23458 link
2025-05-29 CryoCCD: Conditional Cycle-consistent Diffusion with Biophysical Modeling for Cryo-EM Synthesis Runmin Jiang et.al. 2505.23444 null
2025-05-29 Enhanced DACER Algorithm with High Diffusion Efficiency Yinuo Wang et.al. 2505.23426 null
2025-05-29 Diffusion Sampling Path Tells More: An Efficient Plug-and-Play Strategy for Sample Filtering Sixian Wang et.al. 2505.23343 link
2025-05-29 TRACE: Trajectory-Constrained Concept Erasure in Diffusion Models Finn Carter et.al. 2505.23312 null
2025-05-29 MGE-LDM: Joint Latent Diffusion for Simultaneous Music Generation and Source Extraction Yunkee Chae et.al. 2505.23305 null
2025-05-28 SPIRAL: Semantic-Aware Progressive LiDAR Scene Generation Dekai Zhu et.al. 2505.22643 null
2025-05-28 Principled Out-of-Distribution Generalization via Simplicity Jiawei Ge et.al. 2505.22622 null
2025-05-28 Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding Chengyue Wu et.al. 2505.22618 null
2025-05-28 ImageReFL: Balancing Quality and Diversity in Human-Aligned Diffusion Models Dmitrii Sorokin et.al. 2505.22569 null
2025-05-28 Test-Time Alignment of Discrete Diffusion Models with Sequential Monte Carlo Chinmay Pani et.al. 2505.22524 null
2025-05-28 PrismLayers: Open Data for High-Quality Multi-Layer Transparent Image Generative Models Junwen Chen et.al. 2505.22523 null
2025-05-28 Cascaded 3D Diffusion Models for Whole-body 3D 18-F FDG PET/CT synthesis from Demographics Siyeop Yoon et.al. 2505.22489 null
2025-05-28 Self-Reflective Reinforcement Learning for Diffusion-based Image Reasoning Generation Jiadong Pan et.al. 2505.22407 null
2025-05-28 Physics-Informed Distillation of Diffusion Models for PDE-Constrained Generation Yi Zhang et.al. 2505.22391 null
2025-05-28 A Closer Look on Memorization in Tabular Diffusion Model: A Data-Centric Perspective Zhengyu Fang et.al. 2505.22322 null
2025-05-28 StateSpaceDiffuser: Bringing Long Context to Diffusion World Models Nedko Savov et.al. 2505.22246 null
2025-05-28 Physics-inspired Generative AI models via real hardware-based noisy quantum diffusion Marco Parigi et.al. 2505.22193 null
2025-05-28 Unifying Continuous and Discrete Text Diffusion with Non-simultaneous Diffusion Processes Bocheng Li et.al. 2505.22165 null
2025-05-28 What Makes for Text to 360-degree Panorama Generation with Stable Diffusion? Jinhong Ni et.al. 2505.22129 null
2025-05-28 SridBench: Benchmark of Scientific Research Illustration Drawing of Image Generation Model Yifan Chang et.al. 2505.22126 null
2025-05-28 Autoregression-free video prediction using diffusion model for mitigating error propagation Woonho Ko et.al. 2505.22111 link
2025-05-28 AudioTurbo: Fast Text-to-Audio Generation with Rectified Diffusion Junqi Zhao et.al. 2505.22106 null
2025-05-28 High Volume Rate 3D Ultrasound Reconstruction with Diffusion Models Tristan S. W. Stevens et.al. 2505.22090 null
2025-05-28 Align-DA: Align Score-based Atmospheric Data Assimilation with Multiple Preferences Jing-An Sun et.al. 2505.22008 null
2025-05-28 D-Fusion: Direct Preference Optimization for Aligning Diffusion Models with Visually Consistent Samples Zijing Hu et.al. 2505.22002 null
2025-05-26 MolEditRL: Structure-Preserving Molecular Editing via Discrete Diffusion and Reinforcement Learning Yuanxin Zhuang et.al. 2505.20131 null
2025-05-26 Understanding Generalization in Diffusion Models via Probability Flow Distance Huijie Zhang et.al. 2505.20123 null
2025-05-26 Refining Few-Step Text-to-Multiview Diffusion via Reinforcement Learning Ziyi Zhang et.al. 2505.20107 link
2025-05-26 PAMD: Plausibility-Aware Motion Diffusion Model for Long Dance Generation Hongsong Wang et.al. 2505.20056 null
2025-05-26 Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion Zheqi Lv et.al. 2505.20053 link
2025-05-26 ICDM: Interference Cancellation Diffusion Models for Wireless Semantic Communications Tong Wu et.al. 2505.19983 null
2025-05-26 UltraVSR: Achieving Ultra-Realistic Video Super-Resolution with Efficient One-Step Diffusion Space Yong Liu et.al. 2505.19958 null
2025-05-26 Harnessing the Power of Training-Free Techniques in Text-to-2D Generation for Text-to-3D Generation via Score Distillation Sampling Junhong Lee et.al. 2505.19868 null
2025-05-26 On a retarded stochastic system with discrete diffusion modeling life tables Tomás Caraballo et.al. 2505.19835 null
2025-05-26 TeViR: Text-to-Video Reward with Diffusion Models for Efficient Reinforcement Learning Yuhui Chen et.al. 2505.19769 null
2025-05-26 On some coupled local and nonlocal diffusion models Juan Pablo Borthagaray et.al. 2505.19765 null
2025-05-27 SAIL: Self-supervised Albedo Estimation from Real Images with a Latent Diffusion Model Hala Djeghim et.al. 2505.19751 null
2025-05-26 Extremum Flow Matching for Offline Goal Conditioned Reinforcement Learning Quentin Rouxel et.al. 2505.19717 null
2025-05-26 Knowledge-Aligned Counterfactual-Enhancement Diffusion Perception for Unsupervised Cross-Domain Visual Emotion Recognition Wen Yin et.al. 2505.19694 null
2025-05-26 Graph Guided Diffusion: Unified Guidance for Conditional Graph Generation Victor M. Tenorio et.al. 2505.19685 null
2025-05-26 Calibrating Pre-trained Language Classifiers on LLM-generated Noisy Labels via Iterative Refinement Liqin Ye et.al. 2505.19675 link
2025-05-26 ReDDiT: Rehashing Noise for Discrete Visual Generation Tianren Ma et.al. 2505.19656 null
2025-05-26 Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment Jeongsoo Choi et.al. 2505.19595 link
2025-05-26 On scalable and efficient training of diffusion samplers Minkyu Kim et.al. 2505.19552 null
2025-05-26 Unlocking the Power of Diffusion Models in Sequential Recommendation: A Simple and Effective Approach Jialei Chen et.al. 2505.19544 link
2025-05-22 When Are Concepts Erased From Diffusion Models? Kevin Lu et.al. 2505.17013 link
2025-05-22 Guided Diffusion Sampling on Function Spaces with Applications to PDEs Jiachen Yao et.al. 2505.17004 link
2025-05-22 Pursuing Temporal-Consistent Video Virtual Try-On via Dynamic Pose Interaction Dong Li et.al. 2505.16980 null
2025-05-22 Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On Siqi Wan et.al. 2505.16977 link
2025-05-22 Creatively Upscaling Images with Global-Regional Priors Yurui Qian et.al. 2505.16976 null
2025-05-22 Bigger Isn’t Always Memorizing: Early Stopping Overparameterized Diffusion Models Alessandro Favero et.al. 2505.16959 null
2025-05-22 LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning Zebin You et.al. 2505.16933 null
2025-05-22 T2I-ConBench: Text-to-Image Benchmark for Continual Post-training Zhehao Huang et.al. 2505.16875 null
2025-05-22 Training-Free Efficient Video Generation via Dynamic Token Carving Yuechen Zhang et.al. 2505.16864 link
2025-05-22 Conditional Panoramic Image Generation via Masked Autoregressive Modeling Chaoyang Wang et.al. 2505.16862 null
2025-05-23 LaViDa: A Large Diffusion Language Model for Multimodal Understanding Shufan Li et.al. 2505.16839 link
2025-05-22 From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization Haonian Ji et.al. 2505.16832 link
2025-05-22 SEED: Speaker Embedding Enhancement Diffusion Model KiHyun Nam et.al. 2505.16798 link
2025-05-22 Learning Flexible Forward Trajectories for Masked Molecular Diffusion Hyunjin Seo et.al. 2505.16790 null
2025-05-22 Forward-only Diffusion Probabilistic Models Ziwei Luo et.al. 2505.16733 link
2025-05-22 Masked Conditioning for Deep Generative Models Phillip Mueller et.al. 2505.16725 null
2025-05-22 Towards Coordinate- and Dimension-Agnostic Machine Learning for Partial Differential Equations Trung V. Phan et.al. 2505.16549 null
2025-05-22 Joint Relational Database Generation via Graph-Conditional Diffusion Models Mohamed Amine Ketata et.al. 2505.16527 null
2025-05-22 Beyond Face Swapping: A Diffusion-Based Digital Human Benchmark for Multimodal Deepfake Detection Jiaxin Liu et.al. 2505.16512 null
2025-05-22 Consistent World Models via Foresight Diffusion Yu Zhang et.al. 2505.16474 null
2025-05-19 Faster Video Diffusion with Trainable Sparse Attention Peiyuan Zhang et.al. 2505.13389 null
2025-05-19 Restoration Score Distillation: From Corrupted Diffusion Pretraining to One-Step High-Quality Generation Yasi Zhang et.al. 2505.13377 null
2025-05-20 Minimum-Excess-Work Guidance Christopher Kolloff et.al. 2505.13375 null
2025-05-20 One-Step Offline Distillation of Diffusion-based Models via Koopman Modeling Nimrod Berman et.al. 2505.13358 link
2025-05-19 FlowPure: Continuous Normalizing Flows for Adversarial Purification Elias Collaert et.al. 2505.13280 link
2025-05-19 Seeing the Unseen: How EMoE Unveils Bias in Text-to-Image Diffusion Models Lucas Berry et.al. 2505.13273 null
2025-05-19 Diffusion Models with Double Guidance: Generate with aggregated datasets Yanfeng Yang et.al. 2505.13213 null
2025-05-19 Higher fidelity perceptual image and video compression with a latent conditioned residual denoising diffusion model Jonas Brenig et.al. 2505.13152 link
2025-05-19 Neurosymbolic Diffusion Models Emile van Krieken et.al. 2505.13138 link
2025-05-19 Constraint-Aware Diffusion Guidance for Robotics: Real-Time Obstacle Avoidance for Autonomous Racing Hao Ma et.al. 2505.13131 null
2025-05-19 Touch2Shape: Touch-Conditioned 3D Diffusion for Shape Exploration and Reconstruction Yuanbo Wang et.al. 2505.13091 null
2025-05-19 Anti-Inpainting: A Proactive Defense against Malicious Diffusion-based Inpainters under Unknown Conditions Yimao Guo et.al. 2505.13023 null
2025-05-19 LatentINDIGO: An INN-Guided Latent Diffusion Algorithm for Image Restoration Di You et.al. 2505.12935 null
2025-05-19 PhyDA: Physics-Guided Diffusion Models for Data Assimilation in Atmospheric Systems Hao Wang et.al. 2505.12882 null
2025-05-19 Confidence-Regulated Generative Diffusion Models for Reliable AI Agent Migration in Vehicular Metaverses Yingkai Kang et.al. 2505.12710 null
2025-05-19 CURE: Concept Unlearning via Orthogonal Representation Editing in Diffusion Models Shristi Das Biswas et.al. 2505.12677 null
2025-05-19 Few-Step Diffusion via Score identity Distillation Mingyuan Zhou et.al. 2505.12674 link
2025-05-19 Multi-View Wireless Sensing via Conditional Generative Learning: Framework and Model Design Ziqing Xing et.al. 2505.12664 null
2025-05-19 MVPainter: Accurate and Detailed 3D Texture Generation via Multi-View Diffusion with Geometric Control Mingqi Shao et.al. 2505.12635 null
2025-05-18 FreqSelect: Frequency-Aware fMRI-to-Image Reconstruction Junliang Ye et.al. 2505.12552 null
2025-05-15 3D-Fixup: Advancing Photo Editing with 3D Priors Yen-Chi Cheng et.al. 2505.10566 null
2025-05-15 Style Customization of Text-to-Vector Generation with Image Diffusion Priors Peiying Zhang et.al. 2505.10558 null
2025-05-15 Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data Yiwen Liu et.al. 2505.10551 link
2025-05-15 Pharmacophore-Conditioned Diffusion Model for Ligand-Based De Novo Drug Design Amira Alakhdar et.al. 2505.10545 null
2025-05-15 Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps Ningyuan Yang et.al. 2505.10482 null
2025-05-15 Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models Zemin Huang et.al. 2505.10446 null
2025-05-15 Score-based diffusion nowcasting of GOES imagery Randy J. Chase et.al. 2505.10432 null
2025-05-16 Whitened Score Diffusion: A Structured Prior for Imaging Inverse Problems Jeffrey Alido et.al. 2505.10311 link
2025-05-15 FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation Jun Guo et.al. 2505.10075 null
2025-05-15 ORL-LDM: Offline Reinforcement Learning Guided Latent Diffusion Model Super-Resolution Reconstruction Shijie Lyu et.al. 2505.10027 null
2025-05-15 From Air to Wear: Personalized 3D Digital Fashion with AR/VR Immersive 3D Sketching Ying Zang et.al. 2505.09998 null
2025-05-15 Ordered-subsets Multi-diffusion Model for Sparse-view CT Reconstruction Pengfei Yu et.al. 2505.09985 null
2025-05-15 Improving the Euclidean Diffusion Generation of Manifold Data by Mitigating Score Function Singularity Zichen Liu et.al. 2505.09922 null
2025-05-15 Diffusion-SAFE: Shared Autonomy Framework with Diffusion for Safe Human-to-Robot Driving Handover Yunxin Fan et.al. 2505.09889 null
2025-05-15 Unsupervised Radar Point Cloud Enhancement via Arbitrary LiDAR Guided Diffusion Prior Yanlong Yang et.al. 2505.09887 null
2025-05-14 Mission Balance: Generating Under-represented Class Samples using Video Diffusion Models Danush Kumar Venkatesh et.al. 2505.09858 link
2025-05-14 On the Well-Posedness of Green’s Function Reconstruction via the Kirchhoff-Helmholtz Equation for One-Speed Neutron Diffusion Roberto Ponciroli et.al. 2505.09766 null
2025-05-14 EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models Hu Yue et.al. 2505.09694 link
2025-05-14 LightLab: Controlling Light Sources in Images with Diffusion Models Nadav Magar et.al. 2505.09608 null
2025-05-14 Don’t Forget your Inverse DDIM for Image Editing Guillermo Gomez-Trenado et.al. 2505.09571 null
2025-05-14 BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset Jiuhai Chen et.al. 2505.09568 link
2025-05-14 Diffusion Recommender Models and the Illusion of Progress: A Concerning Study of Reproducibility and a Conceptual Mismatch Michael Benigni et.al. 2505.09364 null
2025-05-14 Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis Bingxin Ke et.al. 2505.09358 link
2025-05-14 TransDiffuser: End-to-end Trajectory Generation with Decorrelated Multi-modal Representation for Autonomous Driving Xuefeng Jiang et.al. 2505.09315 null
2025-05-14 Generating Full-field Evolution of Physical Dynamics from Irregular Sparse Observations Panqi Chen et.al. 2505.09284 null
2025-05-14 A Note on Semantic Diffusion Alexander P. Ryjov et.al. 2505.09283 null
2025-05-14 Few-Shot Anomaly-Driven Generation for Anomaly Classification and Segmentation Guan Gui et.al. 2505.09263 link
2025-05-15 Generating time-consistent dynamics with discriminator-guided image diffusion models Philipp Hess et.al. 2505.09089 null
2025-05-13 Predictive Digital Twins with Quantified Uncertainty for Patient-Specific Decision Making in Oncology Graham Pash et.al. 2505.08927 link
2025-05-15 IntrinsicEdit: Precise generative image manipulation in intrinsic space Linjie Lyu et.al. 2505.08889 null
2025-05-13 Generative AI for Autonomous Driving: Frontiers and Opportunities Yuping Wang et.al. 2505.08854 link
2025-05-13 Controllable Image Colorization with Instance-aware Texts and Masks Yanru An et.al. 2505.08705 null
2025-05-13 Boosting Zero-shot Stereo Matching using Large-scale Mixed Images Sources in the Real World Yuran Wang et.al. 2505.08607 null
2025-05-15 Diffusion-assisted Model Predictive Control Optimization for Power System Real-Time Operation Linna Xu et.al. 2505.08535 null
2025-05-13 Building-Block Aware Generative Modeling for 3D Crystals of Metal Organic Frameworks Chenru Duan et.al. 2505.08531 link
2025-05-14 Improving Data Fidelity via Diffusion Model-based Correction and Super-Resolution Wuzhe Xu et.al. 2505.08526 null
2025-05-13 ConDiSim: Conditional Diffusion Models for Simulation Based Inference Mayank Nautiyal et.al. 2505.08403 null
2025-05-13 Adaptive Diffusion Policy Optimization for Robotic Manipulation Huiyun Jiang et.al. 2505.08376 null
2025-05-12 DanceGRPO: Unleashing GRPO on Visual Generation Zeyue Xue et.al. 2505.07818 null
2025-05-12 Pixel Motion as Universal Representation for Robot Control Kanchana Ranasinghe et.al. 2505.07817 null
2025-05-12 LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention Jiangling Zhang et.al. 2505.07734 null
2025-05-12 ShotAdapter: Text-to-Multi-Shot Video Generation with Diffusion Models Ozgur Kara et.al. 2505.07652 null
2025-05-12 Diffused Responsibility: Analyzing the Energy Consumption of Generative Text-to-Audio Diffusion Models Riccardo Passoni et.al. 2505.07615 null
2025-05-12 Noise Optimized Conditional Diffusion for Domain Adaptation Lingkun Luo et.al. 2505.07548 null
2025-05-12 Addressing degeneracies in latent interpolation for diffusion models Erik Landolsi et.al. 2505.07481 null
2025-05-12 You Only Look One Step: Accelerating Backpropagation in Diffusion Sampling with Gradient Shortcuts Hongkun Dou et.al. 2505.07477 link
2025-05-12 DiffCrysGen: A Score-Based Diffusion Model for Design of Diverse Inorganic Crystalline Materials Sourav Mal et.al. 2505.07442 null
2025-05-12 Diffusion-driven SpatioTemporal Graph KANsformer for Medical Examination Recommendation Jianan Li et.al. 2505.07431 null
2025-05-12 GAN-based synthetic FDG PET images from T1 brain MRI can serve to improve performance of deep unsupervised anomaly detection models Daria Zotova et.al. 2505.07364 null
2025-05-11 Semantic-Guided Diffusion Model for Single-Step Image Super-Resolution Zihang Liu et.al. 2505.07071 link
2025-05-11 DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models Junhao Xia et.al. 2505.07057 null
2025-05-11 CMD: Controllable Multiview Diffusion for 3D Editing and Progressive Generation Peng Li et.al. 2505.07003 null
2025-05-11 Replay-Based Continual Learning with Dual-Layered Distillation and a Streamlined U-Net for Efficient Text-to-Image Generation Md. Naimur Asif Borno et.al. 2505.06995 null
2025-05-11 Unsupervised Learning for Class Distribution Mismatch Pan Du et.al. 2505.06948 link
2025-05-11 Near-Field Channel Estimation for XL-MIMO: A Deep Generative Model Guided by Side Information Zhenzhou Jin et.al. 2505.06900 null
2025-05-11 Image Classification Using a Diffusion Model as a Pre-Training Model Kosuke Ukita et.al. 2505.06890 null
2025-05-11 Topology Guidance: Controlling the Outputs of Generative Models via Vector Field Topology Xiaohan Wang et.al. 2505.06804 null
2025-05-11 HistDiST: Histopathological Diffusion-based Stain Transfer Erik Großkopf et.al. 2505.06793 null
2025-05-08 SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation Yonwoo Choi et.al. 2505.05475 link
2025-05-08 3D Scene Generation: A Survey Beichen Wen et.al. 2505.05474 link
2025-05-08 DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion Qitao Zhao et.al. 2505.05473 null
2025-05-08 Mogao: An Omni Foundation Model for Interleaved Multi-Modal Generation Chao Liao et.al. 2505.05472 null
2025-05-08 Denoising Diffusion Probabilistic Models for Coastal Inundation Forecasting Kazi Ashik Islam et.al. 2505.05381 null
2025-05-08 Diffusion Model Quantization: A Review Qian Zeng et.al. 2505.05215 link
2025-05-08 EAM: Enhancing Anything with Diffusion Transformers for Blind Super-Resolution Haizhen Xie et.al. 2505.05209 null
2025-05-08 Overcoming Dimensional Factorization Limits in Discrete Diffusion Models through Quantum Joint Distribution Learning Chuangtao Chen et.al. 2505.05151 link
2025-05-08 Research on Anomaly Detection Methods Based on Diffusion Models Yi Chen et.al. 2505.05137 null
2025-05-08 MDAA-Diff: CT-Guided Multi-Dose Adaptive Attention Diffusion Model for PET Denoising Xiaolong Niu et.al. 2505.05112 null
2025-05-08 MDE-Edit: Masked Dual-Editing for Multi-Object Image Editing via Diffusion Models Hongyang Zhu et.al. 2505.05101 null
2025-05-08 ItDPDM: Information-Theoretic Discrete Poisson Diffusion Model Sagnik Bhattacharya et.al. 2505.05082 null
2025-05-08 PIDiff: Image Customization for Personalized Identities with Diffusion Models Jinyu Gu et.al. 2505.05081 null
2025-05-08 Divide-and-Conquer: Cold-Start Bundle Recommendation via Mixture of Diffusion Experts Ming Li et.al. 2505.05035 null
2025-05-08 SOAP: Style-Omniscient Animatable Portraits Tingting Liao et.al. 2505.05022 link
2025-05-08 Inter-Diffusion Generation Model of Speakers and Listeners for Effective Communication Jinhe Huang et.al. 2505.04996 null
2025-05-08 ReAlign: Bilingual Text-to-Motion Generation via Step-Aware Reward-Guided Alignment Wanjiang Weng et.al. 2505.04974 null
2025-05-08 Graffe: Graph Representation Learning via Diffusion Probabilistic Models Dingshuo Chen et.al. 2505.04956 null
2025-05-08 Accurate and Fast Channel Estimation for Fluid Antenna Systems with Diffusion Models Erqiang Tang et.al. 2505.04930 null
2025-05-08 GlyphMastero: A Glyph Encoder for High-Fidelity Scene Text Editing Tong Wang et.al. 2505.04915 null
2025-05-07 Score Distillation Sampling for Audio: Source Separation, Synthesis, and Beyond Jessie Richter-Powell et.al. 2505.04621 null
2025-05-07 Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model Pengfei Guo et.al. 2505.04522 null
2025-05-07 Efficient Flow Matching using Latent Variables Anirban Samaddar et.al. 2505.04486 null
2025-05-07 Localized Diffusion Models for High Dimensional Distributions Generation Georg A. Gottwald et.al. 2505.04417 null
2025-05-07 CountDiffusion: Text-to-Image Synthesis with Training-Free Counting-Guidance Diffusion Yanyu Li et.al. 2505.04347 null
2025-05-07 MoDE: Mixture of Diffusion Experts for Any Occluded Face Recognition Qiannan Fan et.al. 2505.04306 null
2025-05-07 TS-Diff: Two-Stage Diffusion Model for Low-Light RAW Image Enhancement Yi Li et.al. 2505.04281 link
2025-05-07 HDiffTG: A Lightweight Hybrid Diffusion-Transformer-GCN Architecture for 3D Human Pose Estimation Yajie Fu et.al. 2505.04276 link
2025-05-07 Bridging Geometry-Coherent Text-to-3D Generation with Multi-View Diffusion Priors and Gaussian Splatting Feng Yang et.al. 2505.04262 null
2025-05-07 DiffPattern-Flex: Efficient Layout Pattern Generation via Discrete Diffusion Zixiao Wang et.al. 2505.04173 null
2025-05-07 Person-In-Situ: Scene-Consistent Human Image Insertion with Occlusion-Aware Pose Control Shun Masuda et.al. 2505.04052 null
2025-05-07 BuildingBlock: A Hybrid Approach for Structured Building Generation Junming Huang et.al. 2505.04051 null
2025-05-07 TerraFusion: Joint Generation of Terrain Geometry and Texture Using Latent Diffusion Models Kazuki Higo et.al. 2505.04050 null
2025-05-06 Diffusion Models are Secretly Exchangeable: Parallelizing DDPMs via Autospeculation Hengyuan Hu et.al. 2505.03983 null
2025-05-06 nuGAN: Generative Adversarial Emulator for Cosmic Web with Neutrinos Neerav Kaushal et.al. 2505.03936 null
2025-05-06 CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting Huawei Sun et.al. 2505.03679 null
2025-05-06 Distribution-Conditional Generation: From Class Distribution to Creative Generation Fu Feng et.al. 2505.03667 null
2025-05-06 Bounding Box-Guided Diffusion for Synthesizing Industrial Images and Segmentation Map Alessandro Simoni et.al. 2505.03623 link
2025-05-07 PAHA: Parts-Aware Audio-Driven Human Animation with Diffusion Model Y. B. Wang et.al. 2505.03603 null
2025-05-06 A Comprehensive Survey of Large AI Models for Future Communications: Foundations, Applications and Challenges Feibo Jiang et.al. 2505.03556 link
2025-05-05 Towards Dataset Copyright Evasion Attack against Personalized Text-to-Image Diffusion Models Kuofeng Gao et.al. 2505.02824 link
2025-05-05 Advancing Generalizable Tumor Segmentation with Anomaly-Aware Open-Vocabulary Attention Maps and Frozen Foundation Diffusion Models Yankai Jiang et.al. 2505.02753 link
2025-05-06 MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation Mingcheng Li et.al. 2505.02648 null
2025-05-06 Resolving Memorization in Empirical Diffusion Model for Manifold Data in High-Dimensional Spaces Yang Lyu et.al. 2505.02508 null
2025-05-05 Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction Biao Gong et.al. 2505.02471 link
2025-05-05 Predicting the Dynamics of Complex System via Multiscale Diffusion Autoencoder Ruikun Li et.al. 2505.02450 null
2025-05-05 T2S: High-resolution Time Series Generation with Text-to-Series Diffusion Models Yunfeng Ge et.al. 2505.02417 link
2025-05-04 Enhancing AI Face Realism: Cost-Efficient Quality Improvement in Distilled Diffusion Models with a Fully Synthetic Dataset Jakub Wąsala et.al. 2505.02255 null
2025-05-04 Quantizing Diffusion Models from a Sampling-Aware Perspective Qian Zeng et.al. 2505.02242 null
2025-05-06 Regression is all you need for medical image translation Sebastian Rassmann et.al. 2505.02048 link
2025-05-03 Discrete Spatial Diffusion: Intensity-Preserving Diffusion Modeling Javier E. Santos et.al. 2505.01917 null
2025-05-03 Rethinking Score Distilling Sampling for 3D Editing and Generation Xingyu Miao et.al. 2505.01888 null
2025-05-03 DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion Haoteng Li et.al. 2505.01857 null
2025-05-03 Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning Jifeng Hu et.al. 2505.01822 null
2025-05-02 The DCR Delusion: Measuring the Privacy Risk of Synthetic Data Zexi Yao et.al. 2505.01524 null
2025-05-02 WorldGenBench: A World-Knowledge-Integrated Benchmark for Reasoning-Driven Text-to-Image Generation Daoan Zhang et.al. 2505.01490 null
2025-05-02 VIDSTAMP: A Temporally-Aware Watermark for Ownership and Integrity in Video Diffusion Models Mohammadreza Teymoorianfard et.al. 2505.01406 link
2025-05-02 Provable Efficiency of Guidance in Diffusion Models for General Data Distribution Gen Li et.al. 2505.01382 null
2025-05-02 FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors Chenxi Li et.al. 2505.01322 null
2025-05-02 Model See Model Do: Speech-Driven Facial Animation with Style Control Yifang Pan et.al. 2505.01319 null
2025-05-01 Controllable Weather Synthesis and Removal with Video Diffusion Models Chih-Hao Lin et.al. 2505.00704 null
2025-05-01 GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution Aditya Arora et.al. 2505.00687 null
2025-05-01 ParkDiffusion: Heterogeneous Multi-Agent Multi-Modal Trajectory Prediction for Automated Parking using Diffusion Models Jiarong Wei et.al. 2505.00586 null
2025-05-01 Safety-Critical Traffic Simulation with Guided Latent Diffusion Model Mingxing Peng et.al. 2505.00515 null
2025-05-01 Leveraging Pretrained Diffusion Models for Zero-Shot Part Assembly Ruiyuan Zhang et.al. 2505.00426 null
2025-05-01 Denoising weak lensing mass maps with diffusion model: systematic comparison with generative adversarial network Shohei D. Aoyama et.al. 2505.00345 null
2025-05-01 Quaternion Wavelet-Conditioned Diffusion Models for Image Super-Resolution Luigi Sigillo et.al. 2505.00334 null
2025-04-30 Generative Multimodal Multiscale Data Fusion for Digital Twins in Aerosol Jet Electronics Printing Fatemeh Elhambakhsh et.al. 2505.00176 null
2025-04-30 Materials discovery acceleration by using condition generative methodology Caiyuan Ye et.al. 2505.00076 link
2025-04-30 ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction Qihao Liu et.al. 2504.21855 null
2025-04-30 HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation Haiyang Zhou et.al. 2504.21650 link
2025-04-30 Diffusion-based Adversarial Identity Manipulation for Facial Privacy Protection Liqin Wang et.al. 2504.21646 null
2025-04-30 ODE and PDE models for COVID-19, with reinfection and vaccination process for Cameroon and Germany Hamadjam Abboubakar et.al. 2504.21613 null
2025-04-30 Latent Feature-Guided Conditional Diffusion for High-Fidelity Generative Image Semantic Communication Zehao Chen et.al. 2504.21577 null
2025-04-30 MagicPortrait: Temporally Consistent Face Reenactment with 3D Geometric Guidance Mengting Wei et.al. 2504.21497 link
2025-04-30 DGSolver: Diffusion Generalist Solver with Universal Posterior Sampling for Image Restoration Hebaixu Wang et.al. 2504.21487 link
2025-04-30 Diff-Prompt: Diffusion-Driven Prompt Generator with Mask Supervision Weicai Yan et.al. 2504.21423 null
2025-04-30 IDDM: Bridging Synthetic-to-Real Domain Gap from Physics-Guided Diffusion for Real-world Image Dehazing Shijun Zhou et.al. 2504.21385 null
2025-04-30 Sparse-to-Sparse Training of Diffusion Models Inês Cardoso Oliveira et.al. 2504.21380 null
2025-04-30 Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing Hong Zhang et.al. 2504.21356 link
2025-04-30 Text-Conditioned Diffusion Model for High-Fidelity Korean Font Generation Abdul Sami et.al. 2504.21325 null
2025-04-30 Capturing Conditional Dependence via Auto-regressive Diffusion Models Xunpeng Huang et.al. 2504.21314 null
2025-04-30 The Dual Power of Interpretable Token Embeddings: Jailbreaking Attacks and Defenses for Diffusion Model Unlearning Siyi Chen et.al. 2504.21307 null
2025-04-30 Can We Achieve Efficient Diffusion without Self-Attention? Distilling Self-Attention into Convolutions ZiYi Dong et.al. 2504.21292 null
2025-04-30 CoCoDiff: Diversifying Skeleton Action Features via Coarse-Fine Text-Co-Guided Latent Diffusion Zhifu Zhao et.al. 2504.21266 null
2025-04-29 T2ID-CAS: Diffusion Model and Class Aware Sampling to Mitigate Class Imbalance in Neck Ultrasound Anatomical Landmark Detection Manikanta Varaganti et.al. 2504.21231 null
2025-04-29 ProT-GFDM: A Generative Fractional Diffusion Model for Protein Generation Xiao Liang et.al. 2504.21092 null
2025-04-29 Erased but Not Forgotten: How Backdoors Compromise Concept Erasure Jonas Henry Grebe et.al. 2504.21072 null
2025-04-29 AI-GenBench: A New Ongoing Benchmark for AI-Generated Image Detection Lorenzo Pellegrini et.al. 2504.20865 null
2025-04-28 DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images Mamadou Keita et.al. 2504.19876 link
2025-04-28 CoherenDream: Boosting Holistic Text Coherence in 3D Generation via Multimodal Large Language Models Feedback Chenhan Jiang et.al. 2504.19860 null
2025-04-28 Multimodal Conditioned Diffusive Time Series Forecasting Chen Su et.al. 2504.19669 null
2025-04-28 Robot Motion Planning using One-Step Diffusion with Noise-Optimized Approximate Motions Tomoharu Aizu et.al. 2504.19652 null
2025-04-28 AI Alignment in Medical Imaging: Unveiling Hidden Biases Through Counterfactual Analysis Haroui Ma et.al. 2504.19621 link
2025-04-28 Image Generation Method Based on Heat Diffusion Models Pengfei Zhang et.al. 2504.19600 null
2025-04-28 GenPTW: In-Generation Image Watermarking for Provenance Tracing and Tamper Localization Zhenliang Gan et.al. 2504.19567 null
2025-04-28 SynergyAmodal: Deocclude Anything with Text Control Xinyang Li et.al. 2504.19506 null
2025-04-28 Simultaneous Pick and Place Detection by Combining SE(3) Diffusion Models with Differential Kinematics Tianyi Ko et.al. 2504.19502 null
2025-04-28 GTSD: Generative Text Steganography Based on Diffusion Model Zhengxian Wu et.al. 2504.19433 null
2025-04-28 Boosting 3D Liver Shape Datasets with Diffusion Models and Implicit Neural Representations Khoa Tuan Nguyen et.al. 2504.19402 null
2025-04-27 Sketch2Anim: Towards Transferring Sketch Storyboards into 3D Animation Lei Zhong et.al. 2504.19189 null
2025-04-27 Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions Mohammad Mahdi Abootorabi et.al. 2504.19056 link
2025-04-26 Learning Stochastic Thermodynamics Directly from Correlation and Trajectory-Fluctuation Currents Jinghao Lyu et.al. 2504.19007 null
2025-04-26 REED-VAE: RE-Encode Decode Training for Iterative Image Editing with Diffusion Models Gal Almog et.al. 2504.18989 link
2025-04-25 Dream-Box: Object-wise Outlier Generation for Out-of-Distribution Detection Brian K. S. Isaac-Medina et.al. 2504.18746 null
2025-04-25 Appa: Bending Weather Dynamics with Latent Diffusion Models for Global Data Assimilation Gérôme Andry et.al. 2504.18720 null
2025-04-25 SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations Shuting Zhao et.al. 2504.18332 null
2025-04-25 STP4D: Spatio-Temporal-Prompt Consistent Modeling for Text-to-4D Gaussian Splatting Yunze Deng et.al. 2504.18318 null
2025-04-25 Optimizing Multi-Round Enhanced Training in Diffusion Models for Improved Preference Understanding Kun Li et.al. 2504.18204 null
2025-04-24 LiDPM: Rethinking Point Diffusion for Lidar Scene Completion Tetiana Martyniuk et.al. 2504.17791 null
2025-04-24 Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models Xu Ma et.al. 2504.17789 null
2025-04-24 polyGen: A Learning Framework for Atomic-level Polymer Structure Generation Ayush Jain et.al. 2504.17656 null
2025-04-24 Beyond Labels: Zero-Shot Diabetic Foot Ulcer Wound Segmentation with Self-attention Diffusion Models and the Potential for Text-Guided Customization Abderrachid Hamrani et.al. 2504.17628 null
2025-04-24 ESDiff: Encoding Strategy-inspired Diffusion Model with Few-shot Learning for Color Image Inpainting Junyan Zhang et.al. 2504.17524 null
2025-04-24 3DV-TON: Textured 3D-Guided Consistent Video Try-on via Diffusion Models Min Wei et.al. 2504.17414 null
2025-04-24 DRC: Enhancing Personalized Image Generation via Disentangled Representation Composition Yiyan Xu et.al. 2504.17349 null
2025-04-24 CKMDiff: A Generative Diffusion Model for CKM Construction via Inverse Problems with Learned Priors Shen Fu et.al. 2504.17323 null
2025-04-24 Towards Generalized and Training-Free Text-Guided Semantic Manipulation Yu Hong et.al. 2504.17269 null
2025-04-24 DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks Yinqi Li et.al. 2504.17253 link
2025-04-24 AUTHENTICATION: Identifying Rare Failure Modes in Autonomous Vehicle Perception Systems using Adversarially Guided Diffusion Models Mohammad Zarei et.al. 2504.17179 null
2025-04-23 Physics-guided and fabrication-aware inverse design of photonic devices using diffusion models Dongjin Seo et.al. 2504.17077 link
2025-04-23 Diffusion Probabilistic Models for Compressive SAR Imaging Odysseas Pappas et.al. 2504.17053 null
2025-04-23 Practical approaches for crystal structure predictions with inpainting generation and universal interatomic potentials Peichen Zhong et.al. 2504.16893 null
2025-04-23 Planning with Diffusion Models for Target-Oriented Dialogue Systems Hanwen Du et.al. 2504.16858 null
2025-04-23 Physically Consistent Humanoid Loco-Manipulation using Latent Diffusion Models Ilyass Taouil et.al. 2504.16843 null
2025-04-24 Simple Graph Contrastive Learning via Fractional-order Neural Diffusion Networks Yanan Zhao et.al. 2504.16748 null
2025-04-23 MOSAIC: A Skill-Centric Algorithmic Framework for Long-Horizon Manipulation Planning Itamar Mishani et.al. 2504.16738 null
2025-04-24 Hyper-Transforming Latent Diffusion Models Ignacio Peis et.al. 2504.16580 null
2025-04-23 A Comprehensive Survey of Synthetic Tabular Data Generation Ruxue Shi et.al. 2504.16506 link
2025-04-23 The Dance of Atoms-De Novo Protein Design with Diffusion Model Yujie Qin et.al. 2504.16479 null
2025-04-23 Target Concrete Score Matching: A Holistic Framework for Discrete Diffusion Ruixiang Zhang et.al. 2504.16431 null
2025-04-23 VideoMark: A Distortion-Free Robust Watermarking Framework for Video Diffusion Models Xuming Hu et.al. 2504.16359 null
2025-04-22 SignX: The Foundation Model for Sign Recognition Sen Fang et.al. 2504.16315 null
2025-04-22 Aerial Active STAR-RIS-assisted Satellite-Terrestrial Covert Communications Chuang Zhang et.al. 2504.16146 null
2025-04-22 Survey of Video Diffusion Models: Foundations, Implementations, and Applications Yimu Wang et.al. 2504.16081 link
2025-04-22 From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning Le Zhuo et.al. 2504.16080 null
2025-04-22 Intent-aware Diffusion with Contrastive Learning for Sequential Recommendation Yuanpeng Qu et.al. 2504.16077 link
2025-04-22 Boosting Generative Image Modeling via Joint Image-Feature Synthesis Theodoros Kouzelis et.al. 2504.16064 null
2025-04-22 Efficient Temporal Consistency in Diffusion-Based Video Editing with Adaptor Modules: A Theoretical Framework Xinyuan Song et.al. 2504.16016 null
2025-04-22 Adversarial Observations in Weather Forecasting Erik Imgrund et.al. 2504.15942 link
2025-04-22 Text-based Animatable 3D Avatars with Morphable Model Alignment Yiqian Wu et.al. 2504.15835 link
2025-04-22 Satellite to GroundScape – Large-scale Consistent Ground View Generation from Satellite Views Ningli Xu et.al. 2504.15786 null
2025-04-21 Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction Vaishnavh Nagarajan et.al. 2504.15266 link
2025-04-21 Bringing Diversity from Diffusion Models to Semantic-Guided Face Asset Generation Yunxuan Cai et.al. 2504.15259 null
2025-04-21 DRAGON: Distributional Rewards Optimize Diffusion Generative Models Yatong Bai et.al. 2504.15217 null
2025-04-21 FaceCraft4D: Animated 3D Facial Avatar Generation from a Single Image Fei Yin et.al. 2504.15179 null
2025-04-21 DSPO: Direct Semantic Preference Optimization for Real-World Image Super-Resolution Miaomiao Cai et.al. 2504.15176 null
2025-04-21 Automatic Generation of Aerobatic Flight in Complex Environments via Diffusion Models Yuhang Zhong et.al. 2504.15138 null
2025-04-22 VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation Mingxia Zhan et.al. 2504.15095 null
2025-04-21 Generative Artificial Intelligence for Beamforming in Low-Altitude Economy Geng Sun et.al. 2504.15079 null
2025-04-21 SOLIDO: A Robust Watermarking Method for Speech Synthesis via Low-Rank Adaptation Yue Li et.al. 2504.15035 null
2025-04-21 Gaussian Shading++: Rethinking the Realistic Deployment Challenge of Performance-Lossless Image Watermark for Diffusion Models Zijin Yang et.al. 2504.15026 null
2025-04-21 PIV-FlowDiffuser:Transfer-learning-based denoising diffusion models for PIV Qianyu Zhu et.al. 2504.14952 link
2025-04-21 TWIG: Two-Step Image Generation using Segmentation Masks in Diffusion Models Mazharul Islam Rakib et.al. 2504.14933 null
2025-04-21 What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale Xiaoyong Yuan et.al. 2504.14815 null
2025-04-21 When Cloud Removal Meets Diffusion Model in Remote Sensing Zhenyu Yu et.al. 2504.14785 null
2025-04-21 Novel Concept-Oriented Synthetic Data approach for Training Generative AI-Driven Crystal Grain Analysis Using Diffusion Model Ahmed Sobhi Saleh et.al. 2504.14782 null
2025-04-20 Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens Kaihang Pan et.al. 2504.14666 null
2025-04-20 REDEditing: Relationship-Driven Precise Backdoor Poisoning on Text-to-Image Diffusion Models Chongye Guo et.al. 2504.14554 null
2025-04-20 FlowLoss: Dynamic Flow-Conditioned Loss Strategy for Video Diffusion Models Kuanting Wu et.al. 2504.14535 null
2025-04-20 SUDO: Enhancing Text-to-Image Diffusion Models with Self-Supervised Direct Preference Optimization Liang Peng et.al. 2504.14534 link
2025-04-20 DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning Fulong Ye et.al. 2504.14509 link
2025-04-17 Personalized Text-to-Image Generation with Auto-Regressive Models Kaiyue Sun et.al. 2504.13162 link
2025-04-17 UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models Guanlong Jiao et.al. 2504.13109 null
2025-04-18 SkyReels-V2: Infinite-length Film Generative Model Guibin Chen et.al. 2504.13074 link
2025-04-17 TTRD3: Texture Transfer Residual Denoising Dual Diffusion Model for Remote Sensing Image Super-Resolution Yide Liu et.al. 2504.13026 link
2025-04-17 Image-Editing Specialists: An RLAIF Approach for Diffusion Models Elior Benarous et.al. 2504.12833 link
2025-04-17 Privacy Protection Against Personalized Text-to-Image Synthesis via Cross-image Consistency Constraints Guanyu Wang et.al. 2504.12747 null
2025-04-17 A0: An Affordance-Aware Hierarchical Model for General Robotic Manipulation Rongtao Xu et.al. 2504.12636 null
2025-04-17 Packing Input Frame Context in Next-Frame Prediction Models for Video Generation Lvmin Zhang et.al. 2504.12626 link
2025-04-17 Prompt-Driven and Training-Free Forgetting Approach and Dataset for Large Language Models Zhenyu Yu et.al. 2504.12574 null
2025-04-16 Generalization through variance: how noise shapes inductive biases in diffusion models John J. Vastola et.al. 2504.12532 link
2025-04-16 Diffusion Based Robust LiDAR Place Recognition Benjamin Krummenacher et.al. 2504.12412 null
2025-04-16 Cobra: Efficient Line Art COlorization with BRoAder References Junhao Zhuang et.al. 2504.12240 null
2025-04-16 Coding-Prior Guided Diffusion Network for Video Deblurring Yike Liu et.al. 2504.12222 null
2025-04-16 Anti-Aesthetics: Protecting Facial Privacy against Customized Text-to-Image Synthesis Songping Wang et.al. 2504.12129 null
2025-04-16 A Diffusion-Based Framework for Terrain-Aware Remote Sensing Image Reconstruction Zhenyu Yu et.al. 2504.12112 null
2025-04-16 Generalized Visual Relation Detection with Diffusion Models Kaifeng Gao et.al. 2504.12100 null
2025-04-16 Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM Zirui Pan et.al. 2504.12048 null
2025-04-17 Understanding Attention Mechanism in Video Diffusion Models Bingyan Liu et.al. 2504.12027 null
2025-04-17 Dual-Energy Cone-Beam CT Using Two Orthogonal Projection Views: A Phantom Study Junbo Peng et.al. 2504.12010 null
2025-04-16 Generative Recommendation with Continuous-Token Diffusion Haohao Qu et.al. 2504.12007 null
2025-04-16 R-Meshfusion: Reinforcement Learning Powered Sparse-View Mesh Reconstruction with Diffusion Priors Haoyang Wang et.al. 2504.11946 null
2025-04-16 SemDiff: Generating Natural Unrestricted Adversarial Examples via Semantic Attributes Optimization in Diffusion Models Zeyu Dai et.al. 2504.11923 null
2025-04-16 A Bidirectional DeepParticle Method for Efficiently Solving Low-dimensional Transport Map Problems Tan Zhang et.al. 2504.11851 null
2025-04-16 ACE: Attentional Concept Erasure in Diffusion Models Finn Carter et.al. 2504.11850 null
2025-04-16 TextDiffSeg: Text-guided Latent Diffusion Model for 3d Medical Images Segmentation Kangbo Ma et.al. 2504.11825 null
2025-04-16 PCDiff: Proactive Control for Ownership Protection in Diffusion Models with Watermark Compatibility Keke Gai et.al. 2504.11774 null
2025-04-16 EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos Jilan Xu et.al. 2504.11732 null
2025-04-16 Towards Safe Synthetic Image Generation On the Web: A Multimodal Robust NSFW Defense and Million Scale Dataset Muhammad Shahid Muneer et.al. 2504.11707 link
2025-04-16 DM-OSVP++: One-Shot View Planning Using 3D Diffusion Models for Active RGB-Based Object Reconstruction Sicong Pan et.al. 2504.11674 link
2025-04-15 Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception Ziqi Pang et.al. 2504.11457 link
2025-04-16 Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion An Zhao et.al. 2504.11447 link
2025-04-14 REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers Xingjian Leng et.al. 2504.10483 null
2025-04-14 Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing Taihang Hu et.al. 2504.10434 link
2025-04-14 MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model Jian Liu et.al. 2504.10433 link
2025-04-14 Improving diffusion modeling in all-solid-state lithium batteries: a novel approach for grain boundary effects Lena Scholz et.al. 2504.10348 null
2025-04-14 DiffMOD: Progressive Diffusion Point Denoising for Moving Object Detection in Remote Sensing Jinyue Zhang et.al. 2504.10278 null
2025-04-14 Efficient Generative Model Training via Embedded Representation Warmup Deyuan Liu et.al. 2504.10188 link
2025-04-14 NaviDiffusor: Cost-Guided Diffusion Model for Visual Navigation Yiming Zeng et.al. 2504.10003 null
2025-04-15 OctGPT: Octree-based Multiscale Autoregressive Models for 3D Shape Generation Si-Tong Wei et.al. 2504.09975 link
2025-04-14 Semi-implicit-explicit Runge-Kutta method for nonlinear differential equations Lingyun Ding et.al. 2504.09969 link
2025-04-14 Efficient Task-specific Conditional Diffusion Policies: Shortcut Model Acceleration and SO(3) Optimization Haiyong Yu et.al. 2504.09927 null
2025-04-14 Separate to Collaborate: Dual-Stream Diffusion Model for Coordinated Piano Hand Motion Synthesis Zihao Liu et.al. 2504.09885 null
2025-04-14 EquiVDM: Equivariant Video Diffusion Models with Temporally Consistent Noise Chao Liu et.al. 2504.09789 null
2025-04-13 Stochastic generative methods for stable and accurate closure modeling of chaotic dynamical systems Emily Williams et.al. 2504.09750 null
2025-04-13 SPICE: A Synergistic, Precise, Iterative, and Customizable Image Editing Workflow Kenan Tang et.al. 2504.09697 link
2025-04-13 Early-Bird Diffusion: Investigating and Leveraging Timestep-Aware Early-Bird Tickets in Diffusion Models for Efficient Training Lexington Whalen et.al. 2504.09606 null
2025-04-13 Mitigating Long-tail Distribution in Oracle Bone Inscriptions: Dataset, Model, and Benchmark Jinhao Li et.al. 2504.09555 null
2025-04-13 DiffuMural: Restoring Dunhuang Murals with Multi-scale Diffusion Puyu Han et.al. 2504.09513 null
2025-04-13 CamMimic: Zero-Shot Image To Camera Motion Personalized Video Generation Using Diffusion Models Pooja Guhan et.al. 2504.09472 null
2025-04-13 D $^2$ iT: Dynamic Diffusion Transformer for Accurate Image Generation Weinan Jia et.al. 2504.09454 null
2025-04-13 Structure-Accurate Medical Image Translation based on Dynamic Frequency Balance and Knowledge Guidance Jiahua Xu et.al. 2504.09441 null
2025-04-10 Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction Zeren Jiang et.al. 2504.07961 link
2025-04-10 VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning Zhong-Yu Li et.al. 2504.07960 null
2025-04-10 GenEAva: Generating Cartoon Avatars with Fine-Grained Facial Expressions from Realistic Diffusion-based Faces Hao Yu et.al. 2504.07945 null
2025-04-10 Optimal Control For Anti-Abeta Treatment in Alzheimer’s Disease using a Reaction-Diffusion Model Wenrui Hao et.al. 2504.07913 null
2025-04-10 Revisiting Likelihood-Based Out-of-Distribution Detection by Modeling Representations Yifan Ding et.al. 2504.07793 link
2025-04-10 Virtual-mask Informed Prior for Sparse-view Dual-Energy CT Reconstruction Zini Chen et.al. 2504.07753 null
2025-04-10 PhaseGen: A Diffusion-Based Approach for Complex-Valued MRI Data Generation Moritz Rempe et.al. 2504.07560 link
2025-04-10 STeP: A General and Scalable Framework for Solving Video Inverse Problems with Spatiotemporal Diffusion Priors Bingliang Zhang et.al. 2504.07549 link
2025-04-10 A mass conserved reaction-diffusion system reveals switching between coexisting polar and oscillatory cell motility states Jack M. Hughes et.al. 2504.07446 null
2025-04-10 Unifying and extending Diffusion Models through PDEs for solving Inverse Problems Agnimitra Dasgupta et.al. 2504.07437 null
2025-04-10 Conditional Data Synthesis Augmentation Xinyu Tian et.al. 2504.07426 null
2025-04-10 Routing to the Right Expertise: A Trustworthy Judge for Instruction-based Image Editing Chenxi Sun et.al. 2504.07424 null
2025-04-10 ID-Booth: Identity-consistent Face Generation with Diffusion Models Darian Tomašević et.al. 2504.07392 link
2025-04-10 Novel Diffusion Models for Multimodal 3D Hand Trajectory Prediction Junyi Ma et.al. 2504.07375 link
2025-04-09 MoEDiff-SR: Mixture of Experts-Guided Diffusion Model for Region-Adaptive MRI Super-Resolution Zhe Wang et.al. 2504.07308 link
2025-04-09 MESA: Text-Driven Terrain Generation Using Latent Diffusion and Global Copernicus Data Paul Borne–Pons et.al. 2504.07210 link
2025-04-09 Latent Diffusion U-Net Representations Contain Positional Embeddings and Anomalies Jonas Loos et.al. 2504.07008 link
2025-04-09 PathSegDiff: Pathology Segmentation using Diffusion model representations Sachin Kumar Danisetty et.al. 2504.06950 null
2025-04-09 MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs Jiawei Mao et.al. 2504.06897 null
2025-04-09 EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation Diljeet Jagpal et.al. 2504.06861 null
2025-04-09 CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading Mishan Aliev et.al. 2504.06856 null
2025-04-09 DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation Wangbo Zhao et.al. 2504.06803 link
2025-04-09 DIMA: DIffusing Motion Artifacts for unsupervised correction in brain MRI images Paolo Angella et.al. 2504.06767 null
2025-04-10 Compass Control: Multi Object Orientation Control for Text-to-Image Generation Rishubh Parihar et.al. 2504.06752 null
2025-04-09 Probability Density Geodesics in Image Diffusion Latent Space Qingtao Yu et.al. 2504.06675 null
2025-04-09 RAGME: Retrieval Augmented Video Generation for Enhanced Motion Realism Elia Peruzzo et.al. 2504.06672 null
2025-04-09 Diffusion Factor Models: Generating High-Dimensional Returns with Factor Structure Minshuo Chen et.al. 2504.06566 link
2025-04-09 DiffusionCom: Structure-Aware Multimodal Diffusion Model for Multimodal Knowledge Graph Completion Wei Huang et.al. 2504.06543 null
2025-04-08 D-Feat Occlusions: Diffusion Features for Robustness to Partial Visual Occlusions in Object Recognition Rupayan Mallick et.al. 2504.06432 null
2025-04-08 Unifying Autoregressive and Diffusion-Based Sequence Generation Nima Fathi et.al. 2504.06416 null
2025-04-08 Transfer between Modalities with MetaQueries Xichen Pan et.al. 2504.06256 null
2025-04-08 OSDM-MReg: Multimodal Image Registration based One Step Diffusion Model Xiaochen Wei et.al. 2504.06027 null
2025-04-08 CamContextI2V: Context-aware Controllable Video Generation Luis Denninger et.al. 2504.06022 link
2025-04-08 An Empirical Study of GPT-4o Image Generation Capabilities Sixiang Chen et.al. 2504.05979 link
2025-04-08 Diffusion Based Ambiguous Image Segmentation Jakob Lønborg Christensen et.al. 2504.05977 null
2025-04-08 Physics-aware generative models for turbulent fluid flows through energy-consistent stochastic interpolants Nikolaj T. Mücke et.al. 2504.05852 link
2025-04-07 CREA: A Collaborative Multi-Agent Framework for Creative Content Generation with Diffusion Models Kavana Venkatesh et.al. 2504.05306 null
2025-04-07 Gaussian Mixture Flow Matching Models Hansheng Chen et.al. 2504.05304 link
2025-04-07 Dimension-Free Convergence of Diffusion Models for Approximate Gaussian Mixtures Gen Li et.al. 2504.05300 null
2025-04-07 DA2Diff: Exploring Degradation-aware Adaptive Diffusion Priors for All-in-One Weather Restoration Jiamei Xiong et.al. 2504.05135 null
2025-04-07 Graph-based Diffusion Model for Collaborative Filtering Xuan Zhang et.al. 2504.05029 null
2025-04-08 REWIND: Real-Time Egocentric Whole-Body Motion Diffusion with Exemplar-Based Identity Conditioning Jihyun Lee et.al. 2504.04956 null
2025-04-08 TabRep: a Simple and Effective Continuous Representation for Training Tabular Diffusion Models Jacob Si et.al. 2504.04798 link
2025-04-07 Disentangling Instruction Influence in Diffusion Transformers for Parallel Multi-Instruction-Guided Image Editing Hui Liu et.al. 2504.04784 null
2025-04-07 Continuous Locomotive Crowd Behavior Generation Inhwan Bae et.al. 2504.04756 link
2025-04-07 Unsupervised Estimation of Nonlinear Audio Effects: Comparing Diffusion-Based and Adversarial approaches Eloi Moliner et.al. 2504.04751 null
2025-04-06 Diffusion-Based Approximate MPC: Fast and Consistent Imitation of Multi-Modal Action Distributions Pau Marquez Julbe et.al. 2504.04603 null
2025-04-08 Your Image Generator Is Your New Private Dataset Nicolo Resmini et.al. 2504.04582 null
2025-04-06 Cramer-Rao Bounds for Laplacian Matrix Estimation Morad Halihal et.al. 2504.04576 null
2025-04-06 BrainMRDiff: A Diffusion Model for Anatomically Consistent Brain MRI Synthesis Moinak Bhattacharya et.al. 2504.04532 null
2025-04-06 PRISM: Probabilistic Representation for Integrated Shape Modeling and Generation Lei Cheng et.al. 2504.04454 null
2025-04-06 From Coarse to Fine: A Physics-Informed Self-Guided Flow Diffusion Model Ruoyan Li et.al. 2504.04375 null
2025-04-06 DDPT: Diffusion-Driven Prompt Tuning for Large Language Model Code Generation Jinyang Li et.al. 2504.04351 null
2025-04-05 Multi-resolution Score-Based Variational Graphical Diffusion for Causal Disaster System Modeling and Inference Xuechun Li et.al. 2504.04015 link
2025-04-05 DiTaiListener: Controllable High Fidelity Listener Video Generation with Diffusion Maksim Siniukov et.al. 2504.04010 null
2025-04-04 Enhancing Causal Effect Estimation with Diffusion-Generated Data Li Chen et.al. 2504.03630 null
2025-04-03 Concept Lancet: Image Editing with Compositional Representation Transplant Jinqi Luo et.al. 2504.02828 null
2025-04-03 F-ViTA: Foundation Model Guided Visible to Thermal Translation Jay N. Paranjape et.al. 2504.02801 link
2025-04-03 Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model Shengjun Zhang et.al. 2504.02764 null
2025-04-03 MD-ProjTex: Texturing 3D Shapes with Multi-Diffusion Projection Ahmet Burak Yildirim et.al. 2504.02762 null
2025-04-04 RBT4DNN: Requirements-based Testing of Neural Networks Nusrat Jahan Mozumder et.al. 2504.02737 link
2025-04-03 RoSMM: A Robust and Secure Multi-Modal Watermarking Framework for Diffusion Models ZhongLi Fang et.al. 2504.02640 null
2025-04-03 Bridging the Gap between Gaussian Diffusion Models and Universal Quantization for Image Compression Lucas Relic et.al. 2504.02579 null
2025-04-03 MAD: Makeup All-in-One with Cross-Domain Diffusion Model Bo-Kai Ruan et.al. 2504.02545 null
2025-04-03 Translation of Fetal Brain Ultrasound Images into Pseudo-MRI Images using Artificial Intelligence Naomi Silverstein et.al. 2504.02408 null
2025-04-03 Marine Saliency Segmenter: Object-Focused Conditional Diffusion with Region-Level Semantic Knowledge Distillation Laibin Chang et.al. 2504.02391 null
2025-04-03 OmniCam: Unified Multimodal Video Generation via Camera Control Xiaoda Yang et.al. 2504.02312 null
2025-04-03 WonderTurbo: Generating Interactive 3D World in 0.72 Seconds Chaojun Ni et.al. 2504.02261 null
2025-04-02 FreSca: Unveiling the Scaling Space in Diffusion Models Chao Huang et.al. 2504.02154 null
2025-04-02 Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis Niluthpol Chowdhury Mithun et.al. 2504.01960 null
2025-04-03 VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step Hanyang Wang et.al. 2504.01956 null
2025-04-02 A Unified Approach to Analysis and Design of Denoising Markov Models Yinuo Ren et.al. 2504.01938 null
2025-04-03 ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement Runhui Huang et.al. 2504.01934 null
2025-04-02 Multi-fidelity Parameter Estimation Using Conditional Diffusion Models Caroline Tatsuoka et.al. 2504.01894 null
2025-04-02 A Diffusion-Based Framework for Occluded Object Movement Zheng-Peng Duan et.al. 2504.01873 null
2025-04-02 Implicit Bias Injection Attacks against Text-to-Image Diffusion Models Huayang Huang et.al. 2504.01819 link
2025-04-02 The protein escape process at the ribosomal exit tunnel has conserved mechanisms across the domains of life Phuong Thuy Bui et.al. 2504.01731 null
2025-04-02 InvFussion: Bridging Supervised and Zero-shot Diffusion for Inverse Problems Noam Elata et.al. 2504.01689 link
2025-04-02 Instance Migration Diffusion for Nuclear Instance Segmentation in Pathology Lirui Qi et.al. 2504.01577 null
2025-04-02 Semi-Supervised Biomedical Image Segmentation via Diffusion Models and Teacher-Student Co-Training Luca Ciampi et.al. 2504.01547 link
2025-04-02 Hyperbolic Diffusion Recommender Model Meng Yuan et.al. 2504.01541 null
2025-04-02 Domain Guidance: A Simple Transfer Approach for a Pre-trained Diffusion Model Jincheng Zhong et.al. 2504.01521 link
2025-04-02 From Easy to Hard: Building a Shortcut for Differentially Private Image Synthesis Kecen Li et.al. 2504.01395 link
2025-04-02 Safeguarding Vision-Language Models: Mitigating Vulnerabilities to Gaussian Noise in Perturbation-based Attacks Jiawei Wang et.al. 2504.01308 link
2025-04-01 Prompting Forgetting: Unlearning in GANs via Textual Guidance Piyush Nagasubramaniam et.al. 2504.01218 null
2025-04-01 Articulated Kinematics Distillation from Video Diffusion Models Xuan Li et.al. 2504.01204 null
2025-04-01 Towards Sign Distance Function based Metamaterial Design: Neural Operator Transformer for Forward Prediction and Diffusion Models for Inverse Design Qibang Liu et.al. 2504.01195 link
2025-04-01 Neural Approaches to SAT Solving: Design Choices and Interpretability David Mojžíšek et.al. 2504.01173 null
2025-04-01 MixerMDM: Learnable Composition of Human Motion Diffusion Models Pablo Ruiz-Ponce et.al. 2504.01019 null
2025-03-31 Enhancing Image Resolution of Solar Magnetograms: A Latent Diffusion Model Approach Francesco Pio Ramunno et.al. 2503.24271 link
2025-04-01 Visual Acoustic Fields Yuelei Li et.al. 2503.24270 null
2025-03-31 Controlled Latent Diffusion Models for 3D Porous Media Reconstruction Danilo Naiff et.al. 2503.24083 link
2025-03-31 DenseFormer: Learning Dense Depth Map from Sparse Depth and Image via Conditional Diffusion Model Ming Yuan et.al. 2503.23993 null
2025-03-31 JointTuner: Appearance-Motion Adaptive Joint Training for Customized Video Generation Fangda Chen et.al. 2503.23951 null
2025-03-31 DiffuSE: Cross-Layer Design Space Exploration of DNN Accelerator via Diffusion-Driven Optimization Yi Ren et.al. 2503.23945 null
2025-03-31 Training-Free Text-Guided Image Editing with Visual Autoregressive Model Yufei Wang et.al. 2503.23897 link
2025-03-31 DiffScale: Continuous Downscaling and Bias Correction of Subseasonal Wind Speed Forecasts using Diffusion Models Maximilian Springenberg et.al. 2503.23893 null
2025-03-31 MuseFace: Text-driven Face Editing via Diffusion-based Mask Generation Approach Xin Zhang et.al. 2503.23888 null
2025-03-31 ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image Tianyi Gong et.al. 2503.23881 null
2025-03-31 Biologically Inspired Spiking Diffusion Model with Adaptive Lateral Selection Mechanism Linghao Feng et.al. 2503.23767 null
2025-03-31 StrokeFusion: Vector Sketch Generation via Joint Stroke-UDF Encoding and Latent Sequence Diffusion Jin Zhou et.al. 2503.23752 null
2025-03-31 Effective Cloud Removal for Remote Sensing Images by an Improved Mean-Reverting Denoising Model with Elucidated Design Space Yi Liu et.al. 2503.23717 link
2025-03-31 Expanding-and-Shrinking Binary Neural Networks Xulong Shi et.al. 2503.23709 link
2025-03-31 Bayesian Inference for a Time-Fractional HIV Model with Nonlinear Diffusion Mohamed BenSalah et.al. 2503.23638 null
2025-03-30 Language-Guided Trajectory Traversal in Disentangled Stable Diffusion Latent Space for Factorized Medical Image Generation Zahra TehraniNasab et.al. 2503.23623 null
2025-03-30 Make Autoregressive Great Again: Diffusion-Free Graph Generation with Next-Scale Prediction Samuel Belkadi et.al. 2503.23612 null
2025-03-30 DiT4SR: Taming Diffusion Transformer for Real-World Image Super-Resolution Zheng-Peng Duan et.al. 2503.23580 null
2025-03-30 Enhancing Creative Generation on Stable Diffusion-based Models Jiyeon Han et.al. 2503.23538 link
2025-03-30 Diffusion Meets Few-shot Class Incremental Learning Junsu Kim et.al. 2503.23402 null
2025-03-27 VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models Chi-Pin Huang et.al. 2503.21781 null
2025-03-27 StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion Ziyu Guo et.al. 2503.21775 null
2025-03-27 Optimal Stepsize for Diffusion Sampling Jianning Pei et.al. 2503.21774 link
2025-03-27 Exploring the Evolution of Physics Cognition in Video Generation: A Survey Minghui Lin et.al. 2503.21765 link
2025-03-27 Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data Zhiyuan Ma et.al. 2503.21694 link
2025-03-27 Audio-driven Gesture Generation via Deviation Feature in the Latent Space Jiahui Chen et.al. 2503.21616 null
2025-03-27 Critical Iterative Denoising: A Discrete Generative Model Applied to Graphs Yoann Boget et.al. 2503.21592 null
2025-03-27 AlignDiff: Learning Physically-Grounded Camera Alignment via Diffusion Liuyue Xie et.al. 2503.21581 null
2025-03-27 SyncSDE: A Probabilistic Framework for Diffusion Synchronization Hyunjun Lee et.al. 2503.21555 null
2025-03-28 LOCATEdit: Graph Laplacian Optimized Cross Attention for Localized Text-Guided Image Editing Achint Soni et.al. 2503.21541 link
2025-03-27 Nonlinear Stability of Large-Period Traveling Waves Bifurcating from the Heteroclinic Loop in the FitzHugh-Nagumo Equation Ji Li et.al. 2503.21509 null
2025-03-27 Invert2Restore: Zero-Shot Degradation-Blind Image Restoration Hamadi Chihaoui et.al. 2503.21486 null
2025-03-27 Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving Lucas Nunes et.al. 2503.21449 link
2025-03-27 Exploring the flavor structure of leptons via diffusion models Satsuki Nishimura et.al. 2503.21432 null
2025-03-27 Diffusion Image Prior Hamadi Chihaoui et.al. 2503.21410 null
2025-03-27 HORT: Monocular Hand-held Objects Reconstruction with Transformers Zerui Chen et.al. 2503.21313 null
2025-03-27 GenFusion: Closing the Loop between Reconstruction and Generation via Videos Sibo Wu et.al. 2503.21219 null
2025-03-27 ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model Jinwei Qi et.al. 2503.21144 null
2025-03-27 Can Video Diffusion Model Reconstruct 4D Geometry? Jinjie Mai et.al. 2503.21082 null
2025-03-27 Efficient Multi-Instance Generation with Janus-Pro-Dirven Prompt Parsing Fan Qi et.al. 2503.21069 null
2025-03-26 Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency Tianqi Liu et.al. 2503.20785 link
2025-03-26 FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks Jinwei Li et.al. 2503.20784 link
2025-03-26 RecTable: Fast Modeling Tabular Data with Rectified Flow Masane Fuchi et.al. 2503.20731 link
2025-03-26 Dynamic Motion Blending for Versatile Motion Editing Nan Jiang et.al. 2503.20724 null
2025-03-26 ARMO: Autoregressive Rigging for Multi-Category Objects Mingze Sun et.al. 2503.20663 null
2025-03-26 MMGen: Unified Multi-modal Image Generation and Understanding in One Go Jiepeng Wang et.al. 2503.20644 null
2025-03-26 Stochastic Transport Maps in Diffusion Models and Sampling Xicheng Zhang et.al. 2503.20573 null
2025-03-26 Exploring Robustness of Cortical Morphometry in the presence of white matter lesions, using Diffusion Models for Lesion Filling Vinzenz Uhr et.al. 2503.20571 null
2025-03-26 TD-BFR: Truncated Diffusion Model for Efficient Blind Face Restoration Ziying Zhang et.al. 2503.20537 null
2025-03-26 Contrastive Learning Guided Latent Diffusion Model for Image-to-Image Translation Qi Si et.al. 2503.20484 null
2025-03-26 Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability Yingdong Shi et.al. 2503.20483 null
2025-03-26 Latent Beam Diffusion Models for Decoding Image Sequences Guilherme Fernandes et.al. 2503.20429 null
2025-03-26 ITA-MDT: Image-Timestep-Adaptive Masked Diffusion Transformer Framework for Image-Based Virtual Try-On Ji Woo Hong et.al. 2503.20418 null
2025-03-27 Consistency Trajectory Matching for One-Step Generative Super-Resolution Weiyi You et.al. 2503.20349 null
2025-03-26 EGVD: Event-Guided Video Diffusion Model for Physically Realistic Large-Motion Frame Interpolation Ziran Zhang et.al. 2503.20268 link
2025-03-26 Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models Prin Phunyaphibarn et.al. 2503.20240 null
2025-03-26 Automated UI Interface Generation via Diffusion Models: Enhancing Personalization and Efficiency Yifei Duan et.al. 2503.20229 null
2025-03-26 Video Motion Graphs Haiyang Liu et.al. 2503.20218 null
2025-03-26 Beyond Words: Advancing Long-Text Image Generation via Multimodal Autoregressive Models Alex Jinpeng Wang et.al. 2503.20198 null
2025-03-26 AIGC-assisted Federated Learning for Edge Intelligence: Architecture Design, Research Challenges and Future Directions Xianke Qiang et.al. 2503.20166 link
2025-03-24 Target-Aware Video Diffusion Models Taeksoo Kim et.al. 2503.18950 null
2025-03-24 Training-free Diffusion Acceleration with Bottleneck Sampling Ye Tian et.al. 2503.18940 null
2025-03-24 SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction Enrico Pallotta et.al. 2503.18933 link
2025-03-24 Dual-domain Multi-path Self-supervised Diffusion Model for Accelerated MRI Reconstruction Yuxuan Zhang et.al. 2503.18836 null
2025-03-24 Thermalizer: Stable autoregressive neural emulation of spatiotemporal chaos Chris Pedersen et.al. 2503.18731 null
2025-03-24 Human Motion Unlearning Edoardo De Matteis et.al. 2503.18674 null
2025-03-24 Dig2DIG: Dig into Diffusion Information Gains for Image Fusion Bing Cao et.al. 2503.18627 null
2025-03-24 Generative Dataset Distillation using Min-Max Diffusion Model Junqiao Fan et.al. 2503.18626 null
2025-03-24 Unified Uncertainty-Aware Diffusion for Multi-Agent Trajectory Modeling Guillem Capellera et.al. 2503.18589 null
2025-03-24 Adapting Video Diffusion Models for Time-Lapse Microscopy Alexander Holmberg et.al. 2503.18583 link
2025-03-25 AMD-Hummingbird: Towards an Efficient Text-to-Video Model Takashi Isobe et.al. 2503.18559 link
2025-03-24 EvAnimate: Event-conditioned Image-to-Video Generation for Human Animation Qiang Qu et.al. 2503.18552 null
2025-03-24 Discriminative protein sequence modelling with Latent Space Diffusion Eoin Quinn et.al. 2503.18551 null
2025-03-24 DiN: Diffusion Model for Robust Medical VQA with Semantic Noisy Labels Erjian Guo et.al. 2503.18536 null
2025-03-25 AIM2PC: Aerial Image to 3D Building Point Cloud Reconstruction Soulaimene Turki et.al. 2503.18527 null
2025-03-24 Uncertainty-guided Perturbation for Image Super-Resolution Diffusion Model Leheng Zhang et.al. 2503.18512 null
2025-03-24 Hiding Images in Diffusion Models by Editing Learned Score Functions Haoyu Chen et.al. 2503.18459 null
2025-03-24 InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model Alignment Yunhong Lu et.al. 2503.18454 link
2025-03-25 Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models Jinho Jeong et.al. 2503.18446 link
2025-03-24 Panorama Generation From NFoV Image Done Right Dian Zheng et.al. 2503.18420 link
2025-03-20 DreamTexture: Shape from Virtual Texture with Analysis by Augmentation Ananta R. Bhattarai et.al. 2503.16412 null
2025-03-20 VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness SeungJu Cha et.al. 2503.16406 link
2025-03-20 ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos Haolin Yang et.al. 2503.16400 null
2025-03-20 Scale-wise Distillation of Diffusion Models Nikita Starodubcev et.al. 2503.16397 null
2025-03-21 SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation Chun-Han Yao et.al. 2503.16396 null
2025-03-20 Do Visual Imaginations Improve Vision-and-Language Navigation Agents? Akhil Perincherry et.al. 2503.16394 null
2025-03-20 LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images Leyang Wang et.al. 2503.16376 null
2025-03-20 Heat transfer and mixing in initiated Chemical Vapor Deposition analyzed by in-situ gas composition sensing Simon Shindler et.al. 2503.16373 null
2025-03-20 Ultra-Resolution Adaptation with Ease Ruonan Yu et.al. 2503.16322 link
2025-03-20 Unleashing Vecset Diffusion Model for Fast Shape Generation Zeqiang Lai et.al. 2503.16302 link
2025-03-20 Diffusion-augmented Graph Contrastive Learning for Collaborative Filter Fan Huang et.al. 2503.16290 null
2025-03-20 SceneMI: Motion In-betweening for Modeling Human-Scene Interactions Inwoo Hwang et.al. 2503.16289 null
2025-03-21 Uni-3DAR: Unified 3D Generation and Understanding via Autoregression on Compressed Spatial Tokens Shuqi Lu et.al. 2503.16278 link
2025-03-20 Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts Yu Cao et.al. 2503.16218 null
2025-03-20 Improving Discriminator Guidance in Diffusion Models Alexandre Verine et.al. 2503.16117 null
2025-03-20 Universal class of exactly solvable diffusions from space-time transformations Costantino Di Bello et.al. 2503.16090 null
2025-03-20 Shining Yourself: High-Fidelity Ornaments Virtual Try-on with Diffusion Model Yingmao Miao et.al. 2503.16065 null
2025-03-20 Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts Yike Yuan et.al. 2503.16057 null
2025-03-20 Animating the Uncaptured: Humanoid Mesh Animation with Video Diffusion Models Marc Benedí San Millán et.al. 2503.15996 null
2025-03-20 A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli Pengyu Liu et.al. 2503.15978 null
2025-03-19 FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers Ruichen Chen et.al. 2503.15465 link
2025-03-19 Di $\mathtt{[M]}$ O: Distilling Masked Diffusion Models into One-step Generator Yuanzhi Zhu et.al. 2503.15457 null
2025-03-19 MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space Lixing Xiao et.al. 2503.15451 null
2025-03-19 Visual Persona: Foundation Model for Full-Body Human Customization Jisu Nam et.al. 2503.15406 null
2025-03-19 CCDP: Composition of Conditional Diffusion Policies with Guided Sampling Amirreza Razmjoo et.al. 2503.15386 null
2025-03-19 Material Decomposition in Photon-Counting Computed Tomography with Diffusion Models: Comparative Study and Hybridization with Variational Regularizers Corentin Vazia et.al. 2503.15383 null
2025-03-19 Euclid Quick Data Release (Q1). Active galactic nuclei identification using diffusion-based inpainting of Euclid VIS images Euclid Collaboration et.al. 2503.15321 null
2025-03-19 Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization Feifei Li et.al. 2503.15197 null
2025-03-19 Single-Step Bidirectional Unpaired Image Translation Using Implicit Bridge Consistency Distillation Suhyeon Lee et.al. 2503.15056 null
2025-03-19 Exploiting Diffusion Prior for Real-World Image Dehazing with Unpaired Training Yunwei Lan et.al. 2503.15017 link
2025-03-19 Taming Flow Matching with Unbalanced Optimal Transport into Fast Pansharpening Zihan Cao et.al. 2503.14975 null
2025-03-19 Language-based Image Colorization: A Benchmark and Beyond Yifan Li et.al. 2503.14974 link
2025-03-19 Ultrasound Image-to-Video Synthesis via Latent Dynamic Diffusion Models Tingxiu Chen et.al. 2503.14966 link
2025-03-19 POSTA: A Go-to Framework for Customized Artistic Poster Generation Haoyu Chen et.al. 2503.14908 null
2025-03-19 FetalFlex: Anatomy-Guided Diffusion Model for Flexible Control on Fetal Ultrasound Image Synthesis Yaofei Duan et.al. 2503.14906 null
2025-03-19 Efficient Personalization of Quantized Diffusion Model without Backpropagation Hoigi Seo et.al. 2503.14868 null
2025-03-19 Temporal-Consistent Video Restoration with Pre-trained Diffusion Models Hengkang Wang et.al. 2503.14863 null
2025-03-19 Curiosity-Diffuser: Curiosity Guide Diffusion Models for Reliability Zihao Liu et.al. 2503.14833 link
2025-03-18 ShapeShift: Towards Text-to-Shape Arrangement Synthesis with Content-Aware Geometric Constraints Vihaan Misra et.al. 2503.14720 null
2025-03-18 A Simple Combination of Diffusion Models for Better Quality Trade-Offs in Image Denoising Jonas Dornbusch et.al. 2503.14654 null
2025-03-17 One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation Daniil Selikhanovych et.al. 2503.13358 null
2025-03-17 Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors Katja Schwarz et.al. 2503.13272 null
2025-03-17 FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis Luxi Chen et.al. 2503.13265 null
2025-03-17 MedLoRD: A Medical Low-Resource Diffusion Model for High-Resolution 3D CT Image Synthesis Marvin Seyfarth et.al. 2503.13211 null
2025-03-17 Patient-specific radiomic feature selection with reconstructed healthy persona of knee MR images Yaxi Chen et.al. 2503.13131 null
2025-03-17 DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and Geometry Jing Li et.al. 2503.13110 link
2025-03-17 Beyond Classical Diffusion: Fractional Derivatives in Transport and Stochastic Systems Cypres Verbeeck et.al. 2503.13096 null
2025-03-17 TFDM: Time-Variant Frequency-Based Point Cloud Diffusion with Mamba Jiaxu Liu et.al. 2503.13004 null
2025-03-17 Training Video Foundation Models with NVIDIA NeMo Zeeshan Patel et.al. 2503.12964 null
2025-03-17 Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait Chaolong Yang et.al. 2503.12963 link
2025-03-17 Frame-wise Conditioning Adaptation for Fine-Tuning Diffusion Models in Text-to-Video Prediction Zheyuan Liu et.al. 2503.12953 null
2025-03-17 FNSE-SBGAN: Far-field Speech Enhancement with Schrodinger Bridge and Generative Adversarial Networks Tong Lei et.al. 2503.12936 link
2025-03-17 AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction Xuying Zhang et.al. 2503.12929 null
2025-03-17 DreamLayer: Simultaneous Multi-Layer Generation via Diffusion Mode Junjia Huang et.al. 2503.12838 null
2025-03-17 VasTSD: Learning 3D Vascular Tree-state Space Diffusion Model for Angiography Synthesis Zhifeng Wang et.al. 2503.12758 null
2025-03-16 UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing Tsu-Jui Fu et.al. 2503.12652 null
2025-03-16 Understanding Driver Cognition and Decision-Making Behaviors in High-Risk Scenarios: A Drift Diffusion Perspective Heye Huang et.al. 2503.12637 null
2025-03-16 LATINO-PRO: LAtent consisTency INverse sOlver with PRompt Optimization Alessio Spagnoletti et.al. 2503.12615 null
2025-03-16 BalancedDPO: Adaptive Multi-Metric Alignment Dipesh Tamboli et.al. 2503.12575 null
2025-03-16 Diffusion on Graph: Augmentation of Graph Structure for Node Classification Yancheng Wang et.al. 2503.12563 null
2025-03-13 GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing Rongyao Fang et.al. 2503.10639 link
2025-03-13 Studying Classifier(-Free) Guidance From a Classifier-Centric Perspective Xiaoming Zhao et.al. 2503.10638 null
2025-03-14 Distilling Diversity and Control in Diffusion Models Rohit Gandikota et.al. 2503.10637 null
2025-03-13 HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model Jiaming Liu et.al. 2503.10631 null
2025-03-13 NIL: No-data Imitation Learning by Leveraging Pre-trained Video Diffusion Models Mert Albaba et.al. 2503.10626 null
2025-03-13 DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation Chen Chen et.al. 2503.10618 null
2025-03-13 MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction Yingshuang Zou et.al. 2503.10604 null
2025-03-13 CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models Hao He et.al. 2503.10592 null
2025-03-13 Long Context Tuning for Video Generation Yuwei Guo et.al. 2503.10589 null
2025-03-13 Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion Evgeniia Vu et.al. 2503.10488 null
2025-03-13 CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance Yufan Deng et.al. 2503.10391 null
2025-03-13 Enhancing Facial Privacy Protection via Weakening Diffusion Purification Ali Salar et.al. 2503.10350 link
2025-03-13 DreamInsert: Zero-Shot Image-to-Video Object Insertion from A Single Image Qi Zhao et.al. 2503.10342 null
2025-03-13 CoDiPhy: A General Framework for Applying Denoising Diffusion Models to the Physical Layer of Wireless Communication Systems Peyman Neshaastegaran et.al. 2503.10297 null
2025-03-13 Efficient Diffusion Posterior Sampling for Noisy Inverse Problems Ji Li et.al. 2503.10237 null
2025-03-13 Probability-Flow ODE in Infinite-Dimensional Function Spaces Kunwoo Na et.al. 2503.10219 null
2025-03-13 Data augmentation using diffusion models to enhance inverse Ising inference Yechan Lim et.al. 2503.10154 null
2025-03-13 Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation Yi Wu et.al. 2503.10125 null
2025-03-13 Improving Diffusion-based Inverse Algorithms under Few-Step Constraint via Learnable Linear Extrapolation Jiawei Zhang et.al. 2503.10103 link
2025-03-13 Light-weighted foundation model for seismic data processing based on representative and non-redundant pre-training dataset Xintong Dong et.al. 2503.10092 null
2025-03-12 PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop Chenyu Li et.al. 2503.09595 link
2025-03-12 Minimax Optimality of the Probability Flow ODE for Diffusion Models Changxiao Cai et.al. 2503.09583 null
2025-03-12 Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Marianne Arriola et.al. 2503.09573 link
2025-03-12 TPDiff: Temporal Pyramid Video Diffusion Model Lingmin Ran et.al. 2503.09566 null
2025-03-12 FCaS: Fine-grained Cardiac Image Synthesis based on 3D Template Conditional Diffusion Model Jiahao Xia et.al. 2503.09560 null
2025-03-12 CM-Diff: A Single Generative Network for Bidirectional Cross-Modality Translation Diffusion Model Between Infrared and Visible Images Bin Hu et.al. 2503.09514 null
2025-03-12 DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction Junjie Zhou et.al. 2503.09491 link
2025-03-12 Sparse Autoencoder as a Zero-Shot Classifier for Concept Erasing in Text-to-Image Diffusion Models Zhihua Tian et.al. 2503.09446 link
2025-03-12 SuperCarver: Texture-Consistent 3D Geometry Super-Resolution for High-Fidelity Surface Detail Generation Qijian Zhang et.al. 2503.09439 null
2025-03-12 Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space Yifan Zhou et.al. 2503.09419 link
2025-03-12 Diff-CL: A Novel Cross Pseudo-Supervision Method for Semi-supervised Medical Image Segmentation Xiuzhen Guo et.al. 2503.09408 null
2025-03-12 UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer Haoxuan Wang et.al. 2503.09277 null
2025-03-12 Active Learning Inspired ControlNet Guidance for Augmenting Semantic Segmentation Datasets Hannah Kniesel et.al. 2503.09221 null
2025-03-12 Reangle-A-Video: 4D Video Generation as Video-to-Video Translation Hyeonho Jeong et.al. 2503.09151 null
2025-03-12 Spiritus: An AI-Assisted Tool for Creating 2D Characters and Animations Qirui Sun et.al. 2503.09127 null
2025-03-12 AdvAD: Exploring Non-Parametric Diffusion for Imperceptible Adversarial Attacks Jin Li et.al. 2503.09124 null
2025-03-12 Sequential Multi-Object Grasping with One Dexterous Hand Sicheng He et.al. 2503.09078 null
2025-03-12 Theoretical Guarantees for High Order Trajectory Refinement in Generative Flows Chengyue Gong et.al. 2503.09069 null
2025-03-11 SICNav-Diffusion: Safe and Interactive Crowd Navigation with Diffusion Trajectory Predictions Sepehr Samavi et.al. 2503.08858 null
2025-03-11 GarmentCrafter: Progressive Novel View Synthesis for Single-View 3D Garment Reconstruction and Editing Yuanhao Wang et.al. 2503.08678 null
2025-03-10 Denoising Score Distillation: From Noisy Diffusion Pretraining to One-Step High-Quality Generation Tianyu Chen et.al. 2503.07578 null
2025-03-11 Inductive Moment Matching Linqi Zhou et.al. 2503.07565 null
2025-03-10 DRESS: Diffusion Reasoning-based Reward Shaping Scheme For Intelligent Networks Feiran You et.al. 2503.07433 link
2025-03-10 AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion Mingzhen Sun et.al. 2503.07418 null
2025-03-10 TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision Shaobin Zhuang et.al. 2503.07416 null
2025-03-10 SPEED: Scalable, Precise, and Efficient Concept Erasure for Diffusion Models Ouxiang Li et.al. 2503.07392 link
2025-03-10 PersonaBooth: Personalized Text-to-Motion Generation Boeun Kim et.al. 2503.07390 null
2025-03-10 TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models Ruidong Chen et.al. 2503.07389 link
2025-03-10 AttenST: A Training-Free Attention-Driven Style Transfer Framework with Pre-Trained Diffusion Models Bo Huang et.al. 2503.07307 link
2025-03-10 Efficient Distillation of Classifier-Free Guidance using Adapters Cristian Perez Jensen et.al. 2503.07274 link
2025-03-11 AnomalyPainter: Vision-Language-Diffusion Synergy for Zero-Shot Realistic and Diverse Industrial Anomaly Synthesis Zhangyu Lai et.al. 2503.07253 null
2025-03-11 Boosting Diffusion-Based Text Image Super-Resolution Model Towards Generalized Real-World Scenarios Chenglu Pan et.al. 2503.07232 null
2025-03-10 Synthetic Lung X-ray Generation through Cross-Attention and Affinity Transformation Ruochen Pi et.al. 2503.07209 null
2025-03-10 Effective and Efficient Masked Image Generation Models Zebin You et.al. 2503.07197 link
2025-03-10 Ideas in Inference-time Scaling can Benefit Generative Pre-training Algorithms Jiaming Song et.al. 2503.07154 null
2025-03-10 Controllable 3D Outdoor Scene Generation via Scene Graphs Yuheng Liu et.al. 2503.07152 link
2025-03-10 VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation Hanzhi Chen et.al. 2503.07135 null
2025-03-10 TIDE : Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation Victor Shea-Jay Huang et.al. 2503.07050 null
2025-03-10 Recovering Partially Corrupted Major Objects through Tri-modality Based Image Completion Yongle Zhang et.al. 2503.07047 null
2025-03-10 EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer Yuxuan Zhang et.al. 2503.07027 null
2025-03-06 Compositional World Knowledge leads to High Utility Synthetic data Sachit Gaudi et.al. 2503.04687 null
2025-03-06 The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation Aoxiong Yin et.al. 2503.04606 link
2025-03-06 How to Move Your Dragon: Text-to-Motion Synthesis for Large-Vocabulary Objects Wonkwang Lee et.al. 2503.04257 null
2025-03-06 Synthetic Data is an Elegant GIFT for Continual Vision-Language Models Bin Wu et.al. 2503.04229 null
2025-03-06 Energy-Guided Optimization for Personalized Image Editing with Pretrained Text-to-Image Diffusion Models Rui Jiang et.al. 2503.04215 null
2025-03-06 CoFinDiff: Controllable Financial Diffusion Model for Time Series Generation Yuki Tanaka et.al. 2503.04164 null
2025-03-07 Diff-Reg v2: Diffusion-Based Matching Matrix Estimation for Image Matching and 3D Registration Qianliang Wu et.al. 2503.04127 null
2025-03-06 FREAK: Frequency-modulated High-fidelity and Real-time Audio-driven Talking Portrait Synthesis Ziqi Ni et.al. 2503.04067 null
2025-03-06 RA-DP: Rapid Adaptive Diffusion Policy for Training-Free High-frequency Robotics Replanning Xi Ye et.al. 2503.04051 null
2025-03-06 Underlying Semantic Diffusion for Effective and Efficient In-Context Learning Zhong Ji et.al. 2503.04050 null
2025-03-06 Beyond Existance: Fulfill 3D Reconstructed Scenes with Pseudo Details Yifei Gao et.al. 2503.04037 null
2025-03-06 TextDoctor: Unified Document Image Inpainting via Patch Pyramid Diffusion Models Wanglong Lu et.al. 2503.04021 null
2025-03-05 All-atom Diffusion Transformers: Unified generative modelling of molecules and materials Chaitanya K. Joshi et.al. 2503.03965 link
2025-03-05 Generative Learning of Densities on Manifolds Dimitris G. Giovanis et.al. 2503.03963 null
2025-03-05 GuardDoor: Safeguarding Against Malicious Diffusion Editing via Protective Backdoors Yaopei Zeng et.al. 2503.03944 null
2025-03-05 A non-homogeneous, non-stationary and path-dependent Markov anomalous diffusion model Nestor Barraza et.al. 2503.03896 null
2025-03-05 Metallicity Gradients in Modern Cosmological Simulations I: Tension Between Smooth Stellar Feedback Models and Observations Alex M. Garcia et.al. 2503.03804 null
2025-03-05 Rethinking Video Tokenization: A Conditioned Diffusion-based Approach Nianzu Yang et.al. 2503.03708 link
2025-03-05 DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance Zhao Yang et.al. 2503.03689 link
2025-03-05 Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias Rui Lu et.al. 2503.03595 null
2025-03-05 Generative Artificial Intelligence in Robotic Manipulation: A Survey Kun Zhang et.al. 2503.03464 null
2025-03-05 Top-K Maximum Intensity Projection Priors for 3D Liver Vessel Segmentation Xiaotong Zhang et.al. 2503.03367 null
2025-03-05 Video Super-Resolution: All You Need is a Video Diffusion Model Zhihao Zhan et.al. 2503.03355 null
2025-03-05 Optimizing for the Shortest Path in Denoising Diffusion Model Ping Chen et.al. 2503.03265 link
2025-03-05 GenColor: Generative Color-Concept Association in Visual Design Yihan Hou et.al. 2503.03236 null
2025-03-05 Mocap-2-to-3: Lifting 2D Diffusion-Based Pretrained Models for 3D Motion Capture Zhumei Wang et.al. 2503.03222 null
2025-03-05 An Analytical Theory of Power Law Spectral Bias in the Learning Dynamics of Diffusion Models Binxu Wang et.al. 2503.03206 null
2025-03-05 WarmFed: Federated Learning with Warm-Start for Globalization and Personalization Via Personalized Diffusion Models Tao Feng et.al. 2503.03110 null
2025-03-05 From Architectural Sketch to Conceptual Representation: Using Structure-Aware Diffusion Model to Generate Renderings of School Buildings Zhengyang Wang et.al. 2503.03090 null
2025-03-05 Multi-View Depth Consistent Image Generation Using Generative AI Models: Application on Architectural Design of University Buildings Xusheng Du et.al. 2503.03068 null
2025-03-04 Can Diffusion Models Provide Rigorous Uncertainty Quantification for Bayesian Inverse Problems? Evan Scope Crafts et.al. 2503.03007 link
2025-03-04 Diverse Controllable Diffusion Policy with Signal Temporal Logic Yue Meng et.al. 2503.02924 link
2025-03-04 Straight-Line Diffusion Model for Efficient 3D Molecular Generation Yuyan Ni et.al. 2503.02918 link
2025-03-04 Generating Reliable Initial Velocity Models for Full-waveform Inversion with Well and Structural Constraints Qingchen Zhang et.al. 2503.02815 null
2025-03-04 StageDesigner: Artistic Stage Generation for Scenography via Theater Scripts Zhaoxing Gan et.al. 2503.02595 null
2025-03-04 TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping Xinying Hong et.al. 2503.02578 link
2025-03-04 SPG: Improving Motion Diffusion by Smooth Perturbation Guidance Boseong Jeon et.al. 2503.02577 null
2025-02-28 Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos Zhiyu Tan et.al. 2502.21314 null
2025-02-28 Does Generation Require Memorization? Creative Diffusion Models using Ambient Diffusion Kulin Shah et.al. 2502.21278 null
2025-02-28 A Review on Generative AI For Text-To-Image and Image-To-Image Generation and Implications To Scientific Images Zineb Sordo et.al. 2502.21151 null
2025-02-28 Generative Uncertainty in Diffusion Models Metod Jazbec et.al. 2502.20946 null
2025-02-28 DiffBrush:Just Painting the Art by Your Hands Jiaming Chu et.al. 2502.20904 null
2025-02-28 CADDreamer: CAD object Generation from Single-view Images Yuan Li et.al. 2502.20732 null
2025-02-28 Diffusion Restoration Adapter for Real-World Image Restoration Hanbang Liang et.al. 2502.20679 null
2025-02-28 Wavelet-based density sketching with functional hierarchical tensor Xun Tang et.al. 2502.20655 null
2025-02-28 Gungnir: Exploiting Stylistic Features in Images for Backdoor Attacks on Diffusion Models Yu Pan et.al. 2502.20650 link
2025-02-28 T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting Yifei Qian et.al. 2502.20625 null
2025-02-27 Unifying Model Predictive Path Integral Control, Reinforcement Learning, and Diffusion Models for Optimal Control and Planning Yankai Li et.al. 2502.20476 null
2025-02-27 Tight Inversion: Image-Conditioned Inversion for Real Image Editing Edo Kadosh et.al. 2502.20376 null
2025-02-27 Constrained Generative Modeling with Manually Bridged Diffusion Models Saeid Naderiparizi et.al. 2502.20371 null
2025-02-27 FlexVAR: Flexible Visual Autoregressive Modeling without Residual Prediction Siyu Jiao et.al. 2502.20313 link
2025-02-27 Mobius: Text to Seamless Looping Video Generation via Latent Shift Xiuli Bi et.al. 2502.20307 link
2025-02-27 Explainable, Multi-modal Wound Infection Classification from Images Augmented with Generated Captions Palawat Busaranuvong et.al. 2502.20277 null
2025-02-27 Attention Distillation: A Unified Approach to Visual Characteristics Transfer Yang Zhou et.al. 2502.20235 link
2025-02-27 Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think Liang Chen et.al. 2502.20172 link
2025-02-27 Scalability of the second-order reliability method for stochastic differential equations with multiplicative noise Timo Schorlepp et.al. 2502.20114 null
2025-02-27 Generative augmentations for improved cardiac ultrasound segmentation using diffusion models Gilles Van De Vyver et.al. 2502.20100 link
2025-02-27 Image Referenced Sketch Colorization Based on Animation Creation Workflow Dingkun Yan et.al. 2502.19937 link
2025-02-27 DiffCSS: Diverse and Expressive Conversational Speech Synthesis with Diffusion Models Weihao wu et.al. 2502.19924 null
2025-02-27 High-Fidelity Relightable Monocular Portrait Animation with Lighting-Controllable Video Diffusion Model Mingtao Guo et.al. 2502.19894 link
2025-02-27 C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation Yuhao Li et.al. 2502.19868 link
2025-02-27 One-for-More: Continual Diffusion Model for Anomaly Detection Xiaofan Li et.al. 2502.19848 link
2025-02-27 Analyzing CLIP’s Performance Limitations in Multi-Object Scenarios: A Controlled High-Resolution Study Reza Abbasi et.al. 2502.19828 null
2025-02-27 Implicit Search via Discrete Diffusion: A Study on Chess Jiacheng Ye et.al. 2502.19805 link
2025-02-27 UIFace: Unleashing Inherent Model Capabilities to Enhance Intra-Class Diversity in Synthetic Face Recognition Xiao Lin et.al. 2502.19803 link
2025-02-27 MFSR: Multi-fractal Feature for Super-resolution Reconstruction with Fine Details Recovery Lianping Yang et.al. 2502.19797 null
2025-02-27 Finding Local Diffusion Schrödinger Bridge using Kolmogorov-Arnold Network Xingyu Qiu et.al. 2502.19754 link
2025-02-27 Recent Advances on Generalizable Diffusion-generated Image Detection Qijie Xu et.al. 2502.19716 link
2025-02-26 HDM: Hybrid Diffusion Model for Unified Image Anomaly Detection Zekang Weng et.al. 2502.19200 null
2025-02-26 RetinaRegen: A Hybrid Model for Readability and Detail Restoration in Fundus Images Yuhan Tang et.al. 2502.19153 null
2025-02-26 Modulation of the galactic cosmic ray spectrum in an anisotropic diffusion approach V. D. Borisov et.al. 2502.19062 null
2025-02-26 A Dual-Purpose Framework for Backdoor Defense and Backdoor Amplification in Diffusion Models Vu Tuan Truong Long et.al. 2502.19047 null
2025-02-26 DualSpec: Text-to-spatial-audio Generation via Dual-Spectrogram Guided Diffusion Model Lei Zhao et.al. 2502.18952 null
2025-02-26 Physics-Aware Inverse Design for Nanowire Single-Photon Avalanche Detectors via Deep Learning Boyang Zhang et.al. 2502.18857 null
2025-02-26 Optimal Stochastic Trace Estimation in Generative Modeling Xinyang Liu et.al. 2502.18808 null
2025-02-26 Ptychographic Image Reconstruction from Limited Data via Score-Based Diffusion Models with Physics-Guidance Refik Mert Cam et.al. 2502.18767 null
2025-02-25 Adaptive conditional latent diffusion maps beam loss to 2D phase space projections Alexander Scheinker et.al. 2502.18684 null
2025-02-25 Diffusion Models for conditional MRI generation Miguel Herencia García del Castillo et.al. 2502.18620 null
2025-02-25 K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs Ziheng Ouyang et.al. 2502.18461 null
2025-02-25 ToMCAT: Theory-of-Mind for Cooperative Agents in Teams via Multiagent Diffusion Policies Pedro Sequeira et.al. 2502.18438 null
2025-02-25 LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation Pengzhi Li et.al. 2502.18302 null
2025-02-25 Synthesizing Consistent Novel Views via 3D Epipolar Attention without Re-Training Botao Ye et.al. 2502.18219 null
2025-02-25 Training Consistency Models with Variational Noise Coupling Gianluigi Silvestri et.al. 2502.18197 link
2025-02-25 Multi-Perspective Data Augmentation for Few-shot Object Detection Anh-Khoa Nguyen Vu et.al. 2502.18195 link
2025-02-25 Joint Reconstruction of Spatially-Coherent and Realistic Clothed Humans and Objects from a Single Image Ayushi Dutta et.al. 2502.18150 null
2025-02-25 PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching Han Nie et.al. 2502.18104 link
2025-02-25 Robust Polyp Detection and Diagnosis through Compositional Prompt-Guided Diffusion Models Jia Yu et.al. 2502.17951 link
2025-02-25 3D Anatomical Structure-guided Deep Learning for Accurate Diffusion Microstructure Imaging Xinrui Ma et.al. 2502.17933 null
2025-02-24 GCC: Generative Color Constancy via Diffusing a Color Checker Chen-Wei Chang et.al. 2502.17435 null
2025-02-24 S4S: Solving for a Diffusion Model Solver Eric Frankel et.al. 2502.17423 null
2025-02-24 X-Dancer: Expressive Music to Human Dance Video Generation Zeyuan Chen et.al. 2502.17414 null
2025-02-24 AnyTop: Character Animation Diffusion with Any Topology Inbar Gat et.al. 2502.17327 link
2025-02-24 VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing Xiangpeng Yang et.al. 2502.17258 null
2025-02-24 Dimitra: Audio-driven Diffusion model for Expressive Talking Head Generation Baptiste Chopin et.al. 2502.17198 null
2025-02-24 DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks Canyu Zhao et.al. 2502.17157 link
2025-02-24 Diffusion Models for Tabular Data: Challenges, Current Progress, and Future Directions Zhong Li et.al. 2502.17119 link
2025-02-24 SFLD: Reducing the content bias for AI-generated Image Detection Seoyeon Gye et.al. 2502.17105 null
2025-02-24 Generative Models in Decision Making: A Survey Yinchuan Li et.al. 2502.17100 null
2025-02-24 Conditional Diffusion-Flow models for generating 3D cosmic density fields: applications to f(R) cosmologies Julieth Katherine Riveros et.al. 2502.17087 link
2025-02-24 SpecDM: Hyperspectral Dataset Synthesis with Pixel-level Semantic Annotations Wendi Liu et.al. 2502.17056 null
2025-02-24 TraFlow: Trajectory Distillation on Pre-Trained Rectified Flow Zhangkai Wu et.al. 2502.16972 null
2025-02-24 Autoregressive Image Generation Guided by Chains of Thought Miaomiao Cai et.al. 2502.16965 null
2025-02-24 MAD-AD: Masked Diffusion for Unsupervised Brain Anomaly Detection Farzad Beizaee et.al. 2502.16943 link
2025-02-24 Multi-Dimensional Quality Assessment for Text-to-3D Assets: Dataset and Model Kang Fu et.al. 2502.16915 link
2025-02-24 Mitigating Hallucinations in Diffusion Models through Adaptive Attention Modulation Trevine Oorloff et.al. 2502.16872 null
2025-02-24 Posterior Inference with Diffusion Models for High-dimensional Black-box Optimization Taeyoung Yun et.al. 2502.16824 link
2025-02-24 Fast, Accurate Manifold Denoising by Tunneling Riemannian Optimization Shiyu Wang et.al. 2502.16819 null
2025-02-24 DiffKAN-Inpainting: KAN-based Diffusion model for brain tumor inpainting Tianli Tao et.al. 2502.16771 null
2025-02-20 Improving the Diffusability of Autoencoders Ivan Skorokhodov et.al. 2502.14831 null
2025-02-20 A Survey on Text-Driven 360-Degree Panorama Generation Hai Wang et.al. 2502.14799 null
2025-02-20 DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models Hongji Yang et.al. 2502.14779 null
2025-02-20 Textured 3D Regenerative Morphing with 3D Diffusion Prior Songlin Yang et.al. 2502.14316 null
2025-02-19 DiffExp: Efficient Exploration in Reward Fine-tuning for Text-to-Image Diffusion Models Daewon Chae et.al. 2502.14070 null
2025-02-19 d-Sketch: Improving Visual Fidelity of Sketch-to-Image Translation with Pretrained Latent Diffusion Models without Retraining Prasun Roy et.al. 2502.14007 link
2025-02-19 Im2SurfTex: Surface Texture Generation via Neural Backprojection of Multi-View Images Yiangos Georgiou et.al. 2502.14006 null
2025-02-19 SigStyle: Signature Style Transfer via Personalized Text-to-Image Models Ye Wang et.al. 2502.13997 null
2025-02-19 FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation Yunpeng Zhang et.al. 2502.13995 link
2025-02-19 Generative Detail Enhancement for Physically Based Materials Saeed Hadadan et.al. 2502.13994 null
2025-02-19 SelfAge: Personalized Facial Age Transformation Using Self-reference Images Taishi Ito et.al. 2502.13987 link
2025-02-19 IP-Composer: Semantic Composition of Visual Concepts Sara Dorfman et.al. 2502.13951 null
2025-02-19 TESS 2: A Large-Scale Generalist Diffusion Language Model Jaesung Tae et.al. 2502.13917 link
2025-02-19 Reverse Markov Learning: Multi-Step Generative Models for Complex Distributions Xinwei Shen et.al. 2502.13747 null
2025-02-19 RestoreGrad: Signal Restoration Using Conditional Denoising Diffusion Models with Jointly Learned Prior Ching-Hua Lee et.al. 2502.13574 null
2025-02-19 Diffusion Model Agnostic Social Influence Maximization in Hyperbolic Space Hongliang Qiao et.al. 2502.13571 null
2025-02-19 Interleaved Gibbs Diffusion for Constrained Generation Gautham Govind Anil et.al. 2502.13450 null
2025-02-18 Secure and Efficient Watermarking for Latent Diffusion Models in Model Distribution Scenarios Liangqi Lei et.al. 2502.13345 null
2025-02-18 Geometry-Aware Diffusion Models for Multiview Scene Inpainting Ahmad Salimi et.al. 2502.13335 null
2025-02-18 MotionMatcher: Motion Customization of Text-to-Video Diffusion Models via Motion Feature Matching Yen-Siang Wu et.al. 2502.13234 null
2025-02-18 Fundus2Globe: Generative AI-Driven 3D Digital Twins for Personalized Myopia Management Danli Shi et.al. 2502.13182 null
2025-02-18 Is Noise Conditioning Necessary for Denoising Generative Models? Qiao Sun et.al. 2502.13129 null
2025-02-18 Score Matching Riemannian Diffusion Means Frederik Möbius Rygaard et.al. 2502.13106 null
2025-02-18 Personalized Image Generation with Deep Generative Models: A Decade Survey Yuxiang Wei et.al. 2502.13081 link
2025-02-18 Does Training with Synthetic Data Truly Protect Privacy? Yunpeng Zhao et.al. 2502.12976 link
2025-02-18 Guaranteed Conditional Diffusion: 3D Block-based Models for Scientific Data Compression Jaemoon Lee et.al. 2502.12951 null
2025-02-18 RAPID: Retrieval Augmented Training of Differentially Private Diffusion Models Tanqiu Jiang et.al. 2502.12794 link
2025-02-18 Composition and Control with Distilled Energy Diffusion Models and Sequential Monte Carlo James Thornton et.al. 2502.12786 null
2025-02-18 High-Fidelity Novel View Synthesis via Splatting-Guided Diffusion Xiang Zhang et.al. 2502.12752 null
2025-02-18 3D Shape-to-Image Brownian Bridge Diffusion for Brain MRI Synthesis from Cortical Surfaces Fabian Bongratz et.al. 2502.12742 null
2025-02-18 NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation Zhiyuan Liu et.al. 2502.12638 link
2025-02-17 Diffusion Models without Classifier-free Guidance Zhicong Tang et.al. 2502.12154 link
2025-02-17 Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening Ye Tian et.al. 2502.12146 link
2025-02-17 How compositional generalization and creativity improve as diffusion models are trained Alessandro Favero et.al. 2502.12089 null
2025-02-17 HumanGif: Single-View Human Diffusion with Generative Prior Shoukang Hu et.al. 2502.12080 link
2025-02-17 A Survey on Bridging EEG Signals and Generative AI: From Image and Text to Beyond Shreya Shukla et.al. 2502.12048 null
2025-02-17 Characterizing Photorealism and Artifacts in Diffusion Model-Generated Images Negar Kamali et.al. 2502.11989 link
2025-02-17 Image Inversion: A Survey from GANs to Diffusion and Beyond Yinan Chen et.al. 2502.11974 link
2025-02-17 Approximating a spatially-heterogeneously mass-emitting object by multiple point sources in a diffusion model Qiyao Peng et.al. 2502.11908 null
2025-02-17 BackdoorDM: A Comprehensive Benchmark for Backdoor Learning in Diffusion Model Weilin Lin et.al. 2502.11798 link
2025-02-17 MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow Hanzhuo Huang et.al. 2502.11697 null
2025-02-17 GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text Gyumin Shim et.al. 2502.11642 null
2025-02-17 Membership Inference Attacks for Face Images Against Fine-Tuned Latent Diffusion Models Lauritz Christian Holme et.al. 2502.11619 null
2025-02-17 Maximum Entropy Reinforcement Learning with Diffusion Policy Xiaoyi Dong et.al. 2502.11612 link
2025-02-17 Continuous Diffusion Model for Language Modeling Jaehyeong Jo et.al. 2502.11564 link
2025-02-17 Control-CLIP: Decoupling Category and Style Guidance in CLIP for Specific-Domain Generation Zexi Jia et.al. 2502.11532 null
2025-02-17 SayAnything: Audio-Driven Lip Synchronization with Conditional Video Diffusion Junxian Ma et.al. 2502.11515 null
2025-02-17 Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation Taeyoung Yun et.al. 2502.11477 link
2025-02-17 Inverse Flow and Consistency Models Yuchen Zhang et.al. 2502.11333 null
2025-02-17 Deep Learning of Proteins with Local and Global Regions of Disorder Oufan Zhang et.al. 2502.11326 link
2025-02-16 Collaborative Deterministic-Diffusion Model for Probabilistic Urban Spatiotemporal Prediction Zhi Sheng et.al. 2502.11013 null
2025-02-13 Theoretical Benefit and Limitation of Diffusion Language Model Guhao Feng et.al. 2502.09622 null
2025-02-13 RigAnything: Template-Free Autoregressive Rigging for Diverse 3D Assets Isabella Liu et.al. 2502.09615 null
2025-02-13 Score-of-Mixture Training: Training One-Step Generative Models Made Simple Tejas Jayashankar et.al. 2502.09609 null
2025-02-13 Rolling Ahead Diffusion for Traffic Scene Simulation Yunpeng Liu et.al. 2502.09587 null
2025-02-13 Memorization and Generalization in Generative Diffusion under the Manifold Hypothesis Beatrice Achilli et.al. 2502.09578 null
2025-02-13 DiffMS: Diffusion Generation of Molecules Conditioned on Mass Spectra Montgomery Bohde et.al. 2502.09571 link
2025-02-13 Diffusing DeBias: a Recipe for Turning a Bug into a Feature Massimiliano Ciranni et.al. 2502.09564 null
2025-02-13 Long-Term TalkingFace Generation via Motion-Prior Conditional Diffusion Model Fei Shen et.al. 2502.09533 null
2025-02-13 Diffusion Models for Molecules: A Survey of Methods and Tasks Liang Wang et.al. 2502.09511 link
2025-02-13 Redistribute Ensemble Training for Mitigating Memorization in Diffusion Models Xiaoliu Guan et.al. 2502.09434 link
2025-02-13 ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation Rotem Shalev-Arkushin et.al. 2502.09411 null
2025-02-13 Non-asymptotic Analysis of Diffusion Annealed Langevin Monte Carlo for Generative Modelling Paula Cordero-Encinar et.al. 2502.09306 null
2025-02-13 ConsistentDreamer: View-Consistent Meshes Through Balanced Multi-View Gaussian Optimization Onat Şahin et.al. 2502.09278 null
2025-02-13 From large language models to multimodal AI: A scoping review on the potential of generative AI in medicine Lukas Buess et.al. 2502.09242 null
2025-02-13 E-MD3C: Taming Masked Diffusion Transformers for Efficient Zero-Shot Object Customization Trung X. Pham et.al. 2502.09164 null
2025-02-13 Regularization can make diffusion models more efficient Mahsa Taheri et.al. 2502.09151 null
2025-02-13 Exact Bayesian inference for Markov switching diffusions Timothée Stumpf-Fétizon et.al. 2502.09126 null
2025-02-13 StyleBlend: Enhancing Style-Specific Content Creation in Text-to-Image Diffusion Models Zichong Chen et.al. 2502.09064 link
2025-02-13 MTDP: Modulated Transformer Diffusion Policy Model Qianhao Wang et.al. 2502.09029 null
2025-02-13 Dynamic watermarks in images generated by diffusion models Yunzhuo Chen et.al. 2502.08927 null
2025-02-12 SwiftSketch: A Diffusion Model for Image-to-Vector Sketch Generation Ellie Arar et.al. 2502.08642 null
2025-02-12 CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation Qinghe Wang et.al. 2502.08639 null
2025-02-12 Chasing Charge Carriers: Diffusion Dynamics in Mixed-n Quasi-Two-Dimensional Colloidal MAPbBr3 Perovskites Ronja Maria Piehler et.al. 2502.08601 null
2025-02-12 Enhancing Diffusion Models Efficiency by Disentangling Total-Variance and Signal-to-Noise Ratio Khaled Kahouli et.al. 2502.08598 link
2025-02-12 Light-A-Video: Training-free Video Relighting via Progressive Light Fusion Yujie Zhou et.al. 2502.08590 link
2025-02-12 Ultrasound Image Generation using Latent Diffusion Models Benoit Freiche et.al. 2502.08580 null
2025-02-12 Mapping the Landscape of Generative AI in Network Monitoring and Management Giampaolo Bovenzi et.al. 2502.08576 null
2025-02-12 BCDDM: Branch-Corrected Denoising Diffusion Model for Black Hole Image Generation Ao liu et.al. 2502.08528 null
2025-02-12 One-Shot Federated Learning with Classifier-Free Diffusion Models Obaidullah Zaland et.al. 2502.08488 null
2025-02-12 A Survey on Pre-Trained Diffusion Model Distillations Xuhui Fan et.al. 2502.08364 null
2025-02-12 A posteriori error control for a finite volume scheme for a cross-diffusion model of ion transport Arne Berrens et.al. 2502.08306 null
2025-02-12 BEAM: Bridging Physically-based Rendering and Gaussian Modeling for Relightable Volumetric Video Yu Hong et.al. 2502.08297 null
2025-02-12 FloVD: Optical Flow Meets Video Diffusion Model for Enhanced Camera-Controlled Video Synthesis Wonjoon Jin et.al. 2502.08244 null
2025-02-12 DNNs May Determine Major Properties of Their Outputs Early, with Timing Possibly Driven by Bias Song Park et.al. 2502.08167 null
2025-02-12 PoGDiff: Product-of-Gaussians Diffusion Models for Imbalanced Text-to-Image Generation Ziyan Wang et.al. 2502.08106 null
2025-02-12 End-to-End Predictive Planner for Autonomous Driving with Consistency Models Anjian Li et.al. 2502.08033 null
2025-02-11 Training-Free Safe Denoisers for Safe Use of Diffusion Models Mingyu Kim et.al. 2502.08011 null
2025-02-11 Greed is Good: Guided Generation from a Greedy Perspective Zander W. Blasingame et.al. 2502.08006 null
2025-02-11 Towards Training One-Step Diffusion Models Without Distillation Mingtian Zhang et.al. 2502.08005 null
2025-02-11 SurGrID: Controllable Surgical Simulation via Scene Graph to Image Diffusion Yannik Frisch et.al. 2502.07945 null
2025-02-10 Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions Jaeyeon Kim et.al. 2502.06768 null
2025-02-10 History-Guided Video Diffusion Kiwhan Song et.al. 2502.06764 null
2025-02-10 Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene Tai-Yu Pan et.al. 2502.06682 null
2025-02-10 Unleashing the Potential of Pre-Trained Diffusion Models for Generalizable Person Re-Identification Jiachen Li et.al. 2502.06619 link
2025-02-10 MaterialFusion: High-Quality, Zero-Shot, and Controllable Material Transfer with Diffusion Models Kamil Garifullin et.al. 2502.06606 null
2025-02-10 A Large-scale AI-generated Image Inpainting Benchmark Paschalis Giakoumoglou et.al. 2502.06593 null
2025-02-10 Diffusion Models for Computational Neuroimaging: A Survey Haokai Zhao et.al. 2502.06552 link
2025-02-10 Boost-and-Skip: A Simple Guidance-Free Diffusion for Minority Generation Soobin Um et.al. 2502.06516 link
2025-02-10 WyckoffDiff - A Generative Diffusion Model for Crystal Symmetry Filip Ekström Kelvinius et.al. 2502.06485 link
2025-02-10 Habitizing Diffusion Planning for Efficient and Effective Decision Making Haofei Lu et.al. 2502.06401 link
2025-02-10 TANGLED: Generating 3D Hair Strands from Images with Arbitrary Styles and Viewpoints Pengyu Long et.al. 2502.06392 null
2025-02-10 Solving Linear-Gaussian Bayesian Inverse Problems with Decoupled Diffusion Sequential Monte Carlo Filip Ekström Kelvinius et.al. 2502.06379 null
2025-02-10 Guidance-base Diffusion Models for Improving Photoacoustic Image Quality Tatsuhiro Eguchi et.al. 2502.06354 null
2025-02-10 Zero-shot Depth Completion via Test-time Alignment with Affine-invariant Depth Prior Lee Hyoseok et.al. 2502.06338 null
2025-02-10 Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance Li Hu et.al. 2502.06145 null
2025-02-10 CDM: Contact Diffusion Model for Multi-Contact Point Localization Seo Wook Han et.al. 2502.06109 null
2025-02-10 Debiasing Guidance for Discrete Diffusion with Sequential Monte Carlo Cheuk Kit Lee et.al. 2502.06079 null
2025-02-09 Generating 3D Binding Molecules Using Shape-Conditioned Diffusion Models with Guidance Ziqi Chen et.al. 2502.06027 null
2025-02-09 Dual Caption Preference Optimization for Diffusion Models Amir Saeidi et.al. 2502.06023 link
2025-02-09 Diffusion Models for Inverse Problems in the Exponential Family Alessandro Micheli et.al. 2502.05994 null
2025-02-06 HOG-Diff: Higher-Order Guided Diffusion for Graph Generation Yiming Huang et.al. 2502.04308 link
2025-02-06 MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation Jinbo Xing et.al. 2502.04299 null
2025-02-06 Diffusion-based mass map reconstruction from weak lensing data Supranta S. Boruah et.al. 2502.04158 null
2025-02-06 Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis Zhen Ye et.al. 2502.04128 link
2025-02-06 Generative Adversarial Networks Bridging Art and Machine Intelligence Junhao Song et.al. 2502.04116 null
2025-02-06 TQ-DiT: Efficient Time-Aware Quantization for Diffusion Transformers Younghye Hwang et.al. 2502.04056 null
2025-02-06 PartEdit: Fine-Grained Image Editing using Pre-Trained Diffusion Models Aleksandar Cvejic et.al. 2502.04050 null
2025-02-06 Hierarchical Entropic Diffusion for Ransomware Detection: A Probabilistic Approach to Behavioral Anomaly Isolation Vasili Iskorohodov et.al. 2502.03882 null
2025-02-06 DeblurDiff: Real-World Image Deblurring with Generative Diffusion Models Lingshun Kong et.al. 2502.03810 null
2025-02-06 DICE: Distilling Classifier-Free Guidance into Text Embeddings Zhenyu Zhou et.al. 2502.03726 null
2025-02-06 Conditional Diffusion Models are Medical Image Classifiers that Provide Explainability and Uncertainty for Free Gian Mario Favero et.al. 2502.03687 null
2025-02-06 Variational Control for Guidance in Diffusion Models Kushagra Pandey et.al. 2502.03686 link
2025-02-05 Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach Yunuo Chen et.al. 2502.03639 null
2025-02-05 SymmCD: Symmetry-Preserving Crystal Generation with Diffusion Models Daniel Levy et.al. 2502.03638 link
2025-02-05 Simultaneous Multi-Robot Motion Planning with Projected Diffusion Models Jinhao Liang et.al. 2502.03607 null
2025-02-05 Path Planning for Masked Diffusion Model Sampling Fred Zhangzhi Peng et.al. 2502.03540 null
2025-02-05 Dress-1-to-3: Single Image to Simulation-Ready 3D Outfit with Diffusion Prior and Differentiable Physics Xuan Li et.al. 2502.03449 null
2025-02-05 Masked Autoencoders Are Effective Tokenizers for Diffusion Models Hao Chen et.al. 2502.03444 null
2025-02-05 TruePose: Human-Parsing-guided Attention Diffusion for Full-ID Preserving Pose Transfer Zhihong Xu et.al. 2502.03426 null
2025-02-05 A Mixture-Based Framework for Guiding Diffusion Models Yazid Janati et.al. 2502.03332 link
2025-02-05 An efficient end-to-end computational framework for the generation of ECG calibrated volumetric models of human atrial electrophysiology Elena Zappon et.al. 2502.03322 null
2025-02-05 MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent Xinyao Liao et.al. 2502.03207 null
2025-02-05 Poisson Flow Joint Model for Multiphase contrast-enhanced CT Rongjun Ge et.al. 2502.03079 null
2025-02-05 Direct Distributional Optimization for Provable Alignment of Diffusion Models Ryotaro Kawata et.al. 2502.02954 null
2025-02-05 Fast T2T: Optimization Consistency Speeds Up Diffusion-Based Training-to-Testing Solving for Combinatorial Optimization Yang Li et.al. 2502.02941 null
2025-02-05 Elucidating the Preconditioning in Consistency Distillation Kaiwen Zheng et.al. 2502.02922 null
2025-02-04 When are Diffusion Priors Helpful in Sparse Reconstruction? A Study with Sparse-view CT Matt Y. Cheung et.al. 2502.02771 null
2025-02-04 Calibrated Multi-Preference Optimization for Aligning Diffusion Models Kyungmin Lee et.al. 2502.02588 null
2025-02-04 Open Materials Generation with Stochastic Interpolants Philipp Hoellmer et.al. 2502.02582 null
2025-02-04 Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation Jian Liu et.al. 2502.02525 link
2025-02-04 Privacy Attacks on Image AutoRegressive Models Antoni Kowalczuk et.al. 2502.02514 link
2025-02-04 Do Graph Diffusion Models Accurately Capture and Generate Substructure Distributions? Xiyuan Wang et.al. 2502.02488 null
2025-02-04 Distributional Diffusion Models with Scoring Rules Valentin De Bortoli et.al. 2502.02483 null
2025-02-04 Towards Consistent and Controllable Image Synthesis for Face Editing Mengting Wei et.al. 2502.02465 null
2025-02-04 Sparse Data Generation Using Diffusion Models Phil Ostheimer et.al. 2502.02448 null
2025-02-04 Towards Fast Graph Generation via Autoregressive Noisy Filtration Modeling Markus Krimmel et.al. 2502.02415 link
2025-01-31 Beyond Fixed Horizons: A Theoretical Framework for Adaptive Denoising Diffusions Sören Christensen et.al. 2501.19373 null
2025-01-31 Pathological MRI Segmentation by Synthetic Pathological Data Generation in Fetuses and Neonates Misha P. T Kaandorp et.al. 2501.19338 null
2025-01-31 Medical Semantic Segmentation with Diffusion Pretrain David Li et.al. 2501.19265 null
2025-01-31 Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search Yuta Oshima et.al. 2501.19252 null
2025-01-31 PSyDUCK: Training-Free Steganography for Latent Diffusion Georgia Channing et.al. 2501.19172 null
2025-01-31 RMDM: Radio Map Diffusion Model with Physics Informed Haozhe Jia et.al. 2501.19160 link
2025-01-31 Ambient Denoising Diffusion Generative Adversarial Networks for Establishing Stochastic Object Models from Noisy Image Data Xichen Xu et.al. 2501.19094 null
2025-01-31 MotionPCM: Real-Time Motion Synthesis with Phased Consistency Model Lei Jiang et.al. 2501.19083 null
2025-01-31 Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations Dahye Kim et.al. 2501.19066 link
2025-01-31 Collaborative Diffusion Model for Recommender System Gyuseok Lee et.al. 2501.18997 null
2025-01-31 OmniPhysGS: 3D Constitutive Gaussians for General Physics-Based Dynamics Generation Yuchen Lin et.al. 2501.18982 null
2025-01-31 Fantastic Targets for Concept Erasure in Diffusion Models and Where To Find Them Anh Bui et.al. 2501.18950 link
2025-01-31 Rethinking Diffusion Posterior Sampling: From Conditional Score Estimator to Maximizing a Posterior Tongda Xu et.al. 2501.18913 link
2025-01-31 Trustworthy Evaluation of Generative AI Models Zijun Gao et.al. 2501.18897 null
2025-01-31 Distorting Embedding Space for Safety: A Defense Mechanism for Adversarially Robust Diffusion Models Jaesin Ahn et.al. 2501.18877 link
2025-01-31 REG: Rectified Gradient Guidance for Conditional Diffusion Models Zhengqi Gao et.al. 2501.18865 null
2025-01-31 Equivariant Hypergraph Diffusion for Crystal Structure Prediction Yang Liu et.al. 2501.18850 null
2025-01-31 Pitfalls of defacing whole-head MRI: re-identification risk with diffusion models and compromised research potential Chenyu Gao et.al. 2501.18834 null
2025-01-30 Distillation-Driven Diffusion Model for Multi-Scale MRI Super-Resolution: Make 1.5T MRI Great Again Zhe Wang et.al. 2501.18736 link
2025-01-30 Strong and Controllable 3D Motion Generation Canxuan Gang et.al. 2501.18726 null
2025-01-30 DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models Ruofan Liang et.al. 2501.18590 null
2025-01-30 Free-T2M: Frequency Enhanced Text-to-Motion Diffusion Model With Consistency Loss Wenshuo Chen et.al. 2501.18232 link
2025-01-30 Inverse source problem of sub-diffusion of variable exponent Zhiyuan Li et.al. 2501.18228 null
2025-01-29 SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders Bartosz Cywiński et.al. 2501.18052 link
2025-01-28 ProcTex: Consistent and Interactive Text-to-texture Synthesis for Procedural Models Ruiqi Xu et.al. 2501.17895 null
2025-01-29 VICCA: Visual Interpretation and Comprehension of Chest X-ray Anomalies in Generated Report Without Human Feedback Sayeh Gholipour Picha et.al. 2501.17726 link
2025-01-29 Distinguished Quantized Guidance for Diffusion-based Sequence Recommendation Wenyu Mao et.al. 2501.17670 null
2025-01-29 Solving Inverse Problems using Diffusion with Fast Iterative Renoising Matt C. Bendel et.al. 2501.17468 null
2025-01-28 MDDM: A Molecular Dynamics Diffusion Model to Predict Particle Self-Assembly Kevin Ferguson et.al. 2501.17319 null
2025-01-28 CubeDiff: Repurposing Diffusion-Based Image Models for Panorama Generation Nikolai Kalischek et.al. 2501.17162 null
2025-01-28 IC-Portrait: In-Context Matching for View-Consistent Personalized Portrait Han Yang et.al. 2501.17159 null
2025-01-28 Generative diffusion models from a PDE perspective Fei Cao et.al. 2501.17054 null
2025-01-28 Adversarial Masked Autoencoder Purifier with Defense Transferability Yuan-Chih Chen et.al. 2501.16904 null
2025-01-28 DIRIGENt: End-To-End Robotic Imitation of Human Demonstrations Based on a Diffusion Model Josua Spisak et.al. 2501.16800 null
2025-01-28 FlexMotion: Lightweight, Physics-Aware, and Controllable Human Motion Generation Arvin Tashakori et.al. 2501.16778 null
2025-01-28 DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation Chenguo Lin et.al. 2501.16764 null
2025-01-28 ITVTON:Virtual Try-On Diffusion Transformer Model Based on Integrated Image and Text Haifeng Ni et.al. 2501.16757 null
2025-01-28 Consistency Diffusion Models for Single-Image 3D Reconstruction with Priors Chenru Jiang et.al. 2501.16737 null
2025-01-28 Separate Motion from Appearance: Customizing Motion via Customizing Text-to-Video Diffusion Models Huijie Liu et.al. 2501.16714 null
2025-01-28 CascadeV: An Implementation of Wurstchen Architecture for Video Generation Wenfeng Lin et.al. 2501.16612 link
2025-01-27 PackDiT: Joint Human Motion and Text Generation via Mutual Prompting Zhongyu Jiang et.al. 2501.16551 null
2025-01-27 PhysAnimator: Physics-Guided Generative Cartoon Animation Tianyi Xie et.al. 2501.16550 null
2025-01-27 Decrypting the temperature field in flow boiling with latent diffusion models UngJin Na et.al. 2501.16510 null
2025-01-27 RelightVid: Temporal-Consistent Diffusion Model for Video Relighting Ye Fang et.al. 2501.16330 null
2025-01-27 Congested Crossing Pedestrian Traffic Flow : Dispersion vs. Transport in Crowded Areas Mariam Al Khatib et.al. 2501.16275 null
2025-01-27 UDBE: Unsupervised Diffusion-based Brightness Enhancement in Underwater Images Tatiana Taís Schein et.al. 2501.16211 link
2025-01-27 Multi-front dynamics in spatially inhomogeneous Allen-Cahn equations Robbin Bastiaansen et.al. 2501.16195 null
2025-01-27 BAG: Body-Aligned 3D Wearable Asset Generation Zhongjin Luo et.al. 2501.16177 null
2025-01-27 Efficient Portrait Matte Creation With Layer Diffusion and Connectivity Priors Zhiyuan Lu et.al. 2501.16147 null
2025-01-27 Using Generative Models to Produce Realistic Populations of UK Windstorms Yee Chun Tsoi et.al. 2501.16110 null
2025-01-27 Improving Tropical Cyclone Forecasting With Video Diffusion Models Zhibo Ren et.al. 2501.16003 link
2025-01-27 MatCLIP: Light- and Shape-Insensitive Assignment of PBR Material Models Michael Birsak et.al. 2501.15981 null
2025-01-27 Generative AI for Lyapunov Optimization Theory in UAV-based Low-Altitude Economy Networking Zhang Liu et.al. 2501.15928 null
2025-01-27 Slot-Guided Adaptation of Pre-trained Diffusion Models for Object-Centric Learning and Compositional Generation Adil Kaan Akan et.al. 2501.15878 null
2025-01-27 Can Location Embeddings Enhance Super-Resolution of Satellite Imagery? Daniel Panangian et.al. 2501.15847 null
2025-01-27 Memorization and Regularization in Generative Diffusion Models Ricardo Baptista et.al. 2501.15785 link
2025-01-26 BoKDiff: Best-of-K Diffusion Alignment for Target-Specific 3D Molecule Generation Ali Khodabandeh Yalabadi et.al. 2501.15631 link
2025-01-26 Cross-Cultural Fashion Design via Interactive Large Language Models and Diffusion Models Spencer Ramsey et.al. 2501.15571 null
2025-01-26 CE-SDWV: Effective and Efficient Concept Erasure for Text-to-Image Diffusion Models via a Semantic-Driven Word Vocabulary Jiahang Tu et.al. 2501.15562 null
2025-01-26 Distributionally Robust Graph Out-of-Distribution Recommendation via Diffusion Model Chu Zhao et.al. 2501.15555 link
2025-01-26 LoRAGuard: An Effective Black-box Watermarking Approach for LoRAs Peizhuo Lv et.al. 2501.15478 null
2025-01-26 SQ-DM: Accelerating Diffusion Models with Aggressive Quantization and Temporal Sparsity Zichen Fan et.al. 2501.15448 null
2025-01-26 StochSync: Stochastic Diffusion Synchronization for Image Generation in Arbitrary Spaces Kyeongmin Yeo et.al. 2501.15445 null
2025-01-23 IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models Jiayi Lei et.al. 2501.13920 null
2025-01-23 Improving Video Generation with Human Feedback Jie Liu et.al. 2501.13918 null
2025-01-23 Unveiling the Power of Noise Priors: Enhancing Diffusion Models for Mobile Traffic Prediction Zhi Sheng et.al. 2501.13794 null
2025-01-23 An Efficient Diffusion-based Non-Autoregressive Solver for Traveling Salesman Problem Mingzhao Wang et.al. 2501.13767 link
2025-01-23 Training-Free Consistency Pipeline for Fashion Repose Potito Aghilar et.al. 2501.13692 null
2025-01-23 One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt Tao Liu et.al. 2501.13554 link
2025-01-23 Diffusion-based Perceptual Neural Video Compression with Temporal Diffusion Information Reuse Wenzhuo Ma et.al. 2501.13528 null
2025-01-23 LDR-Net: A Novel Framework for AI-generated Image Detection via Localized Discrepancy Representation JiaXin Chen et.al. 2501.13475 null
2025-01-23 Zero-Shot Trajectory Planning for Signal Temporal Logic Tasks Ruijia Liu et.al. 2501.13457 null
2025-01-23 Bridging The Multi-Modality Gaps of Audio, Visual and Linguistic for Speech Enhancement Meng-Ping Lin et.al. 2501.13375 null
2025-01-23 MSF: Efficient Diffusion Model Via Multi-Scale Latent Factorize Haohang Xu et.al. 2501.13349 null
2025-01-23 One Fits All: General Mobility Trajectory Modeling via Masked Conditional Diffusion Qingyue Long et.al. 2501.13347 null
2025-01-23 Retrievals Can Be Detrimental: A Contrastive Backdoor Attack Paradigm on Retrieval-Augmented Diffusion Models Hao Fang et.al. 2501.13340 null
2025-01-23 Gradient-Free Adversarial Purification with Diffusion Models Xuelong Dai et.al. 2501.13336 null
2025-01-22 State Combinatorial Generalization In Decision Making With Conditional Diffusion Models Xintong Duan et.al. 2501.13241 null
2025-01-23 Accelerate High-Quality Diffusion Models with Inner Loop Feedback Matthew Gwilliam et.al. 2501.13107 null
2025-01-22 Robust Representation Consistency Model via Contrastive Denoising Jiachen Lei et.al. 2501.13094 link
2025-01-22 Orchid: Image Latent Diffusion for Joint Appearance and Geometry Generation Akshay Krishnan et.al. 2501.13087 null
2025-01-22 Robust Body Composition Analysis by Generating 3D CT Volumes from Limited 2D Slices Lianrui Zuo et.al. 2501.13071 null
2025-01-22 Beyond the Lungs: Extending the Field of View in Chest CT with Latent Diffusion Models Lianrui Zuo et.al. 2501.13068 null
2025-01-22 Low-dimensional adaptation of diffusion models: Convergence in total variation Jiadong Liang et.al. 2501.12982 null
2025-01-22 3D Object Manipulation in a Single Image using Generative Models Ruisi Zhao et.al. 2501.12935 null
2025-01-22 CrossDiff: Diffusion Probabilistic Model With Cross-conditional Encoder-Decoder for Crack Segmentation Xianglong Shi et.al. 2501.12860 null
2025-01-22 AMM-Diff: Adaptive Multi-Modality Diffusion Network for Missing Modality Imputation Aghiles Kebaili et.al. 2501.12840 null
2025-01-22 Certified Guidance for Planning with Deep Generative Models Francesco Giacomarra et.al. 2501.12815 null
2025-01-22 T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation Lijun Li et.al. 2501.12612 link
2025-01-22 Image Motion Blur Removal in the Temporal Dimension with Video Diffusion Models Wang Pang et.al. 2501.12604 null
2025-01-21 Federated Discrete Denoising Diffusion Model for Molecular Generation with OpenFL Kevin Ta et.al. 2501.12523 link
2025-01-21 Towards Affordance-Aware Articulation Synthesis for Rigged Objects Yu-Chu Yu et.al. 2501.12393 null
2025-01-22 GPS as a Control Signal for Image Generation Chao Feng et.al. 2501.12390 null
2025-01-21 Audio Texture Manipulation by Exemplar-Based Analogy Kan Jen Cheng et.al. 2501.12385 null
2025-01-21 DiffDoctor: Diagnosing Image Diffusion Models Before Treating Yiyang Wang et.al. 2501.12382 null
2025-01-21 VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models Chaohao Xie et.al. 2501.12267 null
2025-01-21 Joint Reconstruction and Motion Estimation in Sparse-View 4DCT Using Diffusion Models within a Blind Inverse Problem Framework Antoine De Paepe et.al. 2501.12249 null
2025-01-21 TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space Daniel Garibi et.al. 2501.12224 null
2025-01-17 DiffStereo: High-Frequency Aware Diffusion Model for Stereo Image Restoration Huiyun Cao et.al. 2501.10325 null
2025-01-17 DiffVSR: Enhancing Real-World Video Super-Resolution with Diffusion Models for Advanced Visual Quality and Temporal Consistency Xiaohui Li et.al. 2501.10110 null
2025-01-17 Conditional Latent Diffusion-Based Speech Enhancement Via Dual Context Learning Shengkui Zhao et.al. 2501.10052 link
2025-01-17 DiffuEraser: A Diffusion Model for Video Inpainting Xiaowen Li et.al. 2501.10018 link
2025-01-17 Enhancing Crash Frequency Modeling Based on Augmented Multi-Type Data by Hybrid VAE-Diffusion-Based Generative Neural Networks Junlan Chen et.al. 2501.10017 null
2025-01-17 Physics-informed DeepCT: Sinogram Wavelet Decomposition Meets Masked Diffusion Zekun Zhou et.al. 2501.09935 link
2025-01-16 Geometry-Preserving Encoder/Decoder in Latent Generative Models Wonjun Lee et.al. 2501.09876 null
2025-01-16 CrossModalityDiffusion: Multi-Modal Novel View Synthesis with Unified Intermediate Representation Alex Berian et.al. 2501.09838 link
2025-01-16 PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery Shristi Das Biswas et.al. 2501.09826 link
2025-01-16 Lossy Compression with Pretrained Diffusion Models Jeremy Vonderfecht et.al. 2501.09815 link
2025-01-16 SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces Sumit Chaturvedi et.al. 2501.09756 null
2025-01-16 Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Nanye Ma et.al. 2501.09732 null
2025-01-16 Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review Masatoshi Uehara et.al. 2501.09685 null
2025-01-16 Pruning for Sparse Diffusion Models based on Gradient Flow Ben Wan et.al. 2501.09464 null
2025-01-16 CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation Hwan Heo et.al. 2501.09433 link
2025-01-16 Contract-Inspired Contest Theory for Controllable Image Generation in Mobile Edge Metaverse Guangyuan Liu et.al. 2501.09391 null
2025-01-16 UVRM: A Scalable 3D Reconstruction Model from Unposed Videos Shiu-hong Kao et.al. 2501.09347 null
2025-01-16 Domain-conditioned and Temporal-guided Diffusion Modeling for Accelerated Dynamic MRI Reconstruction Liping Zhang et.al. 2501.09305 null
2025-01-16 Text Semantics to Flexible Design: A Residential Layout Generation Method Based on Stable Diffusion Model Zijin Qiu et.al. 2501.09279 null
2025-01-16 PATCHEDSERVE: A Patch Management Framework for SLO-Optimized Hybrid Resolution Diffusion Serving Desen Sun et.al. 2501.09253 null
2025-01-15 Grounding Text-To-Image Diffusion Models For Controlled High-Quality Image Generation Ahmad Süleyman et.al. 2501.09194 null
2025-01-15 Generative diffusion model with inverse renormalization group flows Kanta Masuki et.al. 2501.09064 link
2025-01-15 NeurOp-Diff:Continuous Remote Sensing Image Super-Resolution via Neural Operator Diffusion Zihao Xu et.al. 2501.09054 link
2025-01-15 SimGen: A Diffusion-Based Framework for Simultaneous Surgical Image and Segmentation Mask Generation Aditya Bhat et.al. 2501.09008 null
2025-01-15 RepVideo: Rethinking Cross-Layer Representation for Video Generation Chenyang Si et.al. 2501.08994 null
2025-01-15 Boosting Diffusion Guidance via Learning Degradation-Aware Models for Blind Super Resolution Shao-Hao Lu et.al. 2501.08819 link
2025-01-15 Transformed Low-rank Adaptation via Tensor Decomposition and Its Applications to Text-to-image Models Zerui Tao et.al. 2501.08727 null
2025-01-15 FlexiClip: Locality-Preserving Free-Form Character Animation Anant Khandelwal et.al. 2501.08676 null
2025-01-15 TimeFlow: Longitudinal Brain Image Registration and Aging Progression Analysis Bailiang Jian et.al. 2501.08667 null
2025-01-15 Product of Gaussian Mixture Diffusion Model for non-linear MRI Inversion Laurenz Nagler et.al. 2501.08662 null
2025-01-15 Joint Learning of Depth and Appearance for Portrait Image Animation Xinya Ji et.al. 2501.08649 null
2025-01-15 Watermarking in Diffusion Model: Gaussian Shading with Exact Diffusion Inversion via Coupled Transformations (EDICT) Krishna Panthi et.al. 2501.08604 null
2025-01-15 DynamicFace: High-Quality and Consistent Video Face Swapping using Composable 3D Facial Priors Runqi Wang et.al. 2501.08553 null
2025-01-14 Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models Weichen Fan et.al. 2501.08453 null
2025-01-14 DAViD: Modeling Dynamic Affordance of 3D Objects using Pre-trained Video Diffusion Models Hyeonwoo Kim et.al. 2501.08333 null
2025-01-14 MangaNinja: Line Art Colorization with Precise Reference Following Zhiheng Liu et.al. 2501.08332 null
2025-01-14 Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise Ryan Burgert et.al. 2501.08331 link
2025-01-14 GameFactory: Creating New Games with Generative Interactive Videos Jiwen Yu et.al. 2501.08325 null
2025-01-14 Diffusion Adversarial Post-Training for One-Step Video Generation Shanchuan Lin et.al. 2501.08316 null
2025-01-14 LayerAnimate: Layer-specific Control for Animation Yuxue Yang et.al. 2501.08295 null
2025-01-14 Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints Jonathan Nöther et.al. 2501.08246 null
2025-01-14 FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors Yabo Zhang et.al. 2501.08225 link
2025-01-14 D $^2$ -DPM: Dual Denoising for Quantized Diffusion Probabilistic Models Qian Zeng et.al. 2501.08180 link
2025-01-13 Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss Xinyu Zhang et.al. 2501.07563 null
2025-01-13 Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection Shiman Zhang et.al. 2501.07533 link
2025-01-13 IP-FaceDiff: Identity-Preserving Facial Video Editing with Diffusion Tharun Anand et.al. 2501.07530 null
2025-01-13 PrecipDiff: Leveraging image diffusion models to enhance satellite-based precipitation observations Ting-Yu Dai et.al. 2501.07447 null
2025-01-13 Diff-Ensembler: Learning to Ensemble 2D Diffusion Models for Volume-to-Volume Medical Image Translation Xiyue Zhu et.al. 2501.07430 null
2025-01-13 OCORD: Open-Campus Object Removal Dataset Shuo Zhang et.al. 2501.07397 null
2025-01-13 Bigger Isn’t Always Better: Towards a General Prior for Medical Image Reconstruction Lukas Glaszner et.al. 2501.07376 link
2025-01-13 Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion Li Liang et.al. 2501.07260 link
2025-01-13 D3MES: Diffusion Transformer with multihead equivariant self-attention for 3D molecule generation Zhejun Zhang et.al. 2501.07077 link
2025-01-13 Erasing Noise in Signal Detection with Diffusion Model: From Theory to Application Xiucheng Wang et.al. 2501.07030 null
2025-01-13 Global Search for Optimal Low Thrust Spacecraft Trajectories using Diffusion Models and the Indirect Method Jannik Graebner et.al. 2501.07005 null
2025-01-13 Likelihood Training of Cascaded Diffusion Models via Hierarchical Volume-preserving Maps Henry Li et.al. 2501.06999 link
2025-01-12 A General Framework for Inference-time Scaling and Steering of Diffusion Models Raghav Singhal et.al. 2501.06848 link
2025-01-12 ODPG: Outfitting Diffusion with Pose Guided Condition Seohyun Lee et.al. 2501.06769 null
2025-01-12 Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models Michael Toker et.al. 2501.06751 null
2025-01-12 DRDT3: Diffusion-Refined Decision Test-Time Training Model Xingshuai Huang et.al. 2501.06718 null
2025-01-11 Personalized Preference Fine-tuning of Diffusion Models Meihua Dang et.al. 2501.06655 null
2025-01-11 Boundary-enhanced time series data imputation with long-term dependency diffusion models Chunjing Xiao et.al. 2501.06585 null
2025-01-11 A Diffusive Data Augmentation Framework for Reconstruction of Complex Network Evolutionary History En Xu et.al. 2501.06485 null
2025-01-10 MEt3R: Measuring Multi-View Consistency in Generated Images Mohammad Asim et.al. 2501.06336 null
2025-01-09 Decentralized Diffusion Models David McAllister et.al. 2501.05450 null
2025-01-09 Progressive Growing of Video Tokenizers for Highly Compressed Latent Spaces Aniruddha Mahapatra et.al. 2501.05442 null
2025-01-09 The GAN is dead; long live the GAN! A Modern GAN Baseline Yiwen Huang et.al. 2501.05441 link
2025-01-09 Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation Xuyi Meng et.al. 2501.05427 null
2025-01-09 TimeDP: Learning to Generate Multi-Domain Time Series with Domain Prompts Yu-Hao Huang et.al. 2501.05403 link
2025-01-09 Accelerated Diffusion Models via Speculative Sampling Valentin De Bortoli et.al. 2501.05370 null
2025-01-09 CROPS: Model-Agnostic Training-Free Framework for Safe Image Synthesis with Latent Diffusion Models Junha Park et.al. 2501.05359 null
2025-01-09 Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes Ludwic Leonard et.al. 2501.05226 link
2025-01-09 FaceMe: Robust Blind Face Restoration with Personal Identification Siyu Liu et.al. 2501.05177 null
2025-01-09 EquiBoost: An Equivariant Boosting Approach to Molecular Conformation Generation Yixuan Yang et.al. 2501.05109 link
2025-01-09 Recovery of activation propagation and self-sustained oscillation abilities in stroke brain networks Yingpeng Liu et.al. 2501.05099 null
2025-01-09 ResPanDiff: Diffusion Model with Disentangled Modulations for Image Fusion Shiqi Cao et.al. 2501.05091 null
2025-01-09 D3RM: A Discrete Denoising Diffusion Refinement Model for Piano Transcription Hounsu Kim et.al. 2501.05068 link
2025-01-09 On a reaction-diffusion virus model with general boundary conditions in heterogeneous environments Mingxin Wang et.al. 2501.04992 null
2025-01-09 FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching Jun-Hak Yun et.al. 2501.04926 link
2025-01-08 Geophysical inverse problems with measurement-guided diffusion models Matteo Ravasi et.al. 2501.04881 null
2025-01-08 Using Diffusion Models for Reducing Spatiotemporal Errors of Deep Learning Based Urban Microclimate Predictions at Post-Processing Stage Sepehrdad Tahmasebi et.al. 2501.04847 null
2025-01-08 EditAR: Unified Conditional Generation with Autoregressive Models Jiteng Mu et.al. 2501.04699 null
2025-01-08 ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning Yuzhou Huang et.al. 2501.04698 null
2025-01-08 SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images Zixuan Huang et.al. 2501.04689 null
2025-01-08 A Statistical Theory of Contrastive Pre-training and Multimodal Generative AI Kazusato Oko et.al. 2501.04641 link
2025-01-08 Disentangled Clothed Avatar Generation with Layered Representation Weitian Zhang et.al. 2501.04631 null
2025-01-08 MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation Daniele Molino et.al. 2501.04614 null
2025-01-08 Enhancing Low-Cost Video Editing with Lightweight Adaptors and Temporal-Aware Inversion Yangfan He et.al. 2501.04606 link
2025-01-08 ZSVC: Zero-shot Style Voice Conversion with Disentangled Latent Diffusion Models and Adversarial Training Xinfa Zhu et.al. 2501.04416 null
2025-01-08 Edit as You See: Image-guided Video Editing via Masked Motion Modeling Zhi-Lin Huang et.al. 2501.04325 null
2025-01-08 DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models Hyogon Ryu et.al. 2501.04304 link
2025-01-08 ContextMRI: Enhancing Compressed Sensing MRI through Metadata Conditioning Hyungjin Chung et.al. 2501.04284 link
2025-01-08 DrawSpeech: Expressive Speech Synthesis Using Prosodic Sketches as Control Conditions Weidong Chen et.al. 2501.04256 null
2025-01-07 NeuralSVG: An Implicit Representation for Text-to-Vector Generation Sagi Polaczek et.al. 2501.03992 null
2025-01-07 Stabilising effect of generic anomalous diffusion independent of the Rayleigh number Antonio Barletta et.al. 2501.03990 null
2025-01-07 A precise asymptotic analysis of learning diffusion models: theory and insights Hugo Cui et.al. 2501.03937 link
2025-01-07 Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers Yuechen Zhang et.al. 2501.03931 link
2025-01-07 Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control Zekai Gu et.al. 2501.03847 link
2025-01-07 Impact of diffusion mechanisms on persistence and spreading Nathanaël Boutillon et.al. 2501.03816 null
2025-01-07 Mixing by Internal Gravity Waves in Stars: Assessing Numerical Simulations Against Theory Jack Morton et.al. 2501.03796 null
2025-01-07 Exploring Molecule Generation Using Latent Space Graph Diffusion Prashanth Pombala et.al. 2501.03696 link
2025-01-06 MObI: Multimodal Object Inpainting Using Diffusion Models Alexandru Buburuzan et.al. 2501.03173 null
2025-01-06 Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches Alhassan Mumuni et.al. 2501.03151 null
2025-01-06 DDRM-PR: Fourier Phase Retrieval using Denoising Diffusion Restoration Models Mehmet Onurcan Kaya et.al. 2501.03030 link
2025-01-06 STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution Rui Xie et.al. 2501.02976 null
2025-01-06 SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild Jiawei Liu et.al. 2501.02962 null
2025-01-06 Deep Generative Model-Aided Power System Dynamic State Estimation and Reconstruction with Unknown Control Inputs or Data Distributions Jianhua Pei et.al. 2501.02928 null
2025-01-06 Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis Thang-Anh-Quan Nguyen et.al. 2501.02913 null
2025-01-06 Conditional Mutual Information Based Diffusion Posterior Sampling for Solving Inverse Problems Shayan Mohajer Hamidi et.al. 2501.02880 null
2025-01-06 Towards HRTF Personalization using Denoising Diffusion Models Juan Camilo Albarracín Sánchez et.al. 2501.02871 null
2025-01-06 Diff-Lung: Diffusion-Based Texture Synthesis for Enhanced Pathological Tissue Segmentation in Lung CT Scans Rezkellah Noureddine Khiati et.al. 2501.02867 null
2025-01-06 InpDiffusion: Image Inpainting Localization via Conditional Diffusion Models Kai Wang et.al. 2501.02816 null
2025-01-06 Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising Yunlong Yuan et.al. 2501.02741 null
2025-01-06 Multilevel Semantic-Aware Model for AI-Generated Video Quality Assessment Jiaze Li et.al. 2501.02706 null
2025-01-05 From thermodynamics to protein design: Diffusion models for biomolecule generation towards autonomous protein engineering Wen-ran Li et.al. 2501.02680 null
2025-01-05 DepthMaster: Taming Diffusion Models for Monocular Depth Estimation Ziyang Song et.al. 2501.02576 link
2025-01-05 Decoding fMRI Data into Captions using Prefix Language Modeling Vyacheslav Shen et.al. 2501.02570 link
2025-01-05 Unified Guidance for Geometry-Conditioned Molecular Generation Sirine Ayadi et.al. 2501.02526 null
2025-01-05 Face-MakeUp: Multimodal Facial Prompts for Text-to-Image Generation Dawei Dai et.al. 2501.02523 link
2025-01-05 Layout2Scene: 3D Semantic Layout Guided Scene Generation via Geometry and Appearance Diffusion Priors Minglin Chen et.al. 2501.02519 null
2025-01-05 ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling Chaojie Mao et.al. 2501.02487 null
2025-01-02 Object-level Visual Prompts for Compositional Image Generation Gaurav Parmar et.al. 2501.01424 null
2025-01-02 Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models Jingfeng Yao et.al. 2501.01423 link
2025-01-02 Test-time Controllable Image Generation by Explicit Spatial Constraint Enforcement Z. Zhang et.al. 2501.01368 null
2025-01-02 Conditional Consistency Guided Image Translation and Enhancement A. V. Subramanyam et.al. 2501.01223 link
2025-01-02 Semantics-Guided Diffusion for Deep Joint Source-Channel Coding in Wireless Image Transmission Maojun Zhang et.al. 2501.01138 link
2025-01-02 EliGen: Entity-Level Controlled Image Generation with Regional Attention Hong Zhang et.al. 2501.01097 link
2025-01-02 DiffCL: A Diffusion-Based Contrastive Learning Framework with Semantic Alignment for Multimodal Recommendations Qiya Song et.al. 2501.01066 null
2025-01-02 Optimizing Noise Schedules of Generative Models in High Dimensionss Santiago Aranguri et.al. 2501.00988 null
2025-01-01 Cached Adaptive Token Merging: Dynamic Token Reduction and Redundant Computation Elimination in Diffusion Model Omid Saghatchian et.al. 2501.00946 link
2025-01-01 Diffusion Prism: Enhancing Diversity and Morphology Consistency in Mask-to-Image Diffusion Hao Wang et.al. 2501.00944 null
2025-01-01 A Novel Diffusion Model for Pairwise Geoscience Data Generation with Unbalanced Training Dataset Junhuan Yang et.al. 2501.00941 null
2025-01-01 Hierarchical Vision-Language Alignment for Text-to-Image Generation via Diffusion Models Emily Johnson et.al. 2501.00917 null
2025-01-01 Diffusion Policies for Generative Modeling of Spacecraft Trajectories Julia Briden et.al. 2501.00915 null
2025-01-01 Population Aware Diffusion for Time Series Generation Yang Li et.al. 2501.00910 link
2025-01-01 RORem: Training a Robust Object Remover with Human-in-the-Loop Ruibin Li et.al. 2501.00740 link
2024-12-31 SoundBrush: Sound as a Brush for Visual Scene Editing Kim Sung-Bin et.al. 2501.00645 null
2024-12-31 Flash-Split: 2D Reflection Removal with Flash Cues and Latent Diffusion Separation Tianfu Wang et.al. 2501.00637 null
2024-12-31 DiC: Rethinking Conv3x3 Designs in Diffusion Models Yuchuan Tian et.al. 2501.00603 link
2024-12-31 DreamDrive: Generative 4D Scene Modeling from Street View Images Jiageng Mao et.al. 2501.00601 null
2024-12-31 Polynomial time sampling from log-smooth distributions in fixed dimension under semi-log-concavity of the forward diffusion with application to strongly dissipative distributions Adrien Vacher et.al. 2501.00565 null
2024-12-30 Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation Yuanbo Yang et.al. 2412.21117 null
2024-12-30 Quantum Diffusion Model for Quark and Gluon Jet Generation Mariia Baidachna et.al. 2412.21082 link
2024-12-30 Edicho: Consistent Image Editing in the Wild Qingyan Bai et.al. 2412.21079 link
2024-12-30 Varformer: Adapting VAR’s Generative Prior for Image Restoration Siyang Wang et.al. 2412.21063 link
2024-12-30 E2EDiff: Direct Mapping from Noise to Data for Enhanced Diffusion Models Zhiyu Tan et.al. 2412.21044 null
2024-12-30 Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration Wanglong Lu et.al. 2412.21042 link
2024-12-30 AlignAb: Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies Yibo Wen et.al. 2412.20984 null
2024-12-30 Influence Maximization in Temporal Networks with Persistent and Reactive Behaviors Aaqib Zahoor et.al. 2412.20936 null
2024-12-30 DDIM sampling for Generative AIBIM, a faster intelligent structural design framework Zhili He et.al. 2412.20899 null
2024-12-30 VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control Shaojin Wu et.al. 2412.20800 link
2024-12-30 M $^3$ oralBench: A MultiModal Moral Benchmark for LVLMs Bei Yan et.al. 2412.20718 link
2024-12-30 HFI: A unified framework for training-free detection and implicit watermarking of latent diffusion model generated images Sungik Choi et.al. 2412.20704 null
2024-12-30 Diffgrasp: Whole-Body Grasping Synthesis Guided by Object Motion Using a Diffusion Model Yonghao Zhang et.al. 2412.20657 null
2024-12-30 Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis Yousef Yeganeh et.al. 2412.20651 null
2024-12-29 Zero-Shot Image Restoration Using Few-Step Guidance of Consistency Models (and Beyond) Tomer Garber et.al. 2412.20596 link
2024-12-29 Testing and Improving the Robustness of Amortized Bayesian Inference for Cognitive Models Yufei Wu et.al. 2412.20586 link
2024-12-29 Derivations of Animal Movement Models with Explicit Memory Tianxu Wang et.al. 2412.20568 null
2024-12-29 DPBridge: Latent Diffusion Bridge for Dense Prediction Haorui Ji et.al. 2412.20506 null
2024-12-29 Single-image reflection removal via self-supervised diffusion models Zhengyang Lu et.al. 2412.20466 null
2024-12-29 Image Augmentation Agent for Weakly Supervised Semantic Segmentation Wangyu Wu et.al. 2412.20439 null
2024-12-24 PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models Minghao Chen et.al. 2412.18608 null
2024-12-24 DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers Yuntao Chen et.al. 2412.18607 null
2024-12-24 Explaining in Diffusion: Explaining a Classifier Through Hierarchical Semantics with Text-to-Image Diffusion Models Tahira Kazimi et.al. 2412.18604 null
2024-12-24 DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation Minghong Cai et.al. 2412.18597 link
2024-12-24 LatentCRF: Continuous CRF for Efficient Latent Diffusion Kanchana Ranasinghe et.al. 2412.18596 null
2024-12-24 Resolution-Robust 3D MRI Reconstruction with 2D Diffusion Priors: Diverse-Resolution Training Outperforms Interpolation Anselm Krainovic et.al. 2412.18584 null
2024-12-24 3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement Yihang Luo et.al. 2412.18565 null
2024-12-24 Fashionability-Enhancing Outfit Image Editing with Conditional Diffusion Models Qice Qin et.al. 2412.18421 null
2024-12-24 Discovery of 2D Materials via Symmetry-Constrained Diffusion Model Shihang Xu et.al. 2412.18414 null
2024-12-24 FameBias: Embedding Manipulation Bias Attack in Text-to-Image Models Jaechul Roh et.al. 2412.18302 null
2024-12-24 GDM4MMIMO: Generative Diffusion Models for Massive MIMO Communications Zhenzhou Jin et.al. 2412.18281 null
2024-12-24 Schödinger Bridge Type Diffusion Models as an Extension of Variational Autoencoders Kentaro Kaba et.al. 2412.18237 null
2024-12-24 Expand VSR Benchmark for VLLM to Expertize in Spatial Rules Peijin Xie et.al. 2412.18224 link
2024-12-24 Accelerating AIGC Services with Latent Action Diffusion Scheduling in Edge Networks Changfu Xu et.al. 2412.18212 link
2024-12-24 Stochastic Control for Fine-tuning Diffusion Models: Optimality, Regularity, and Convergence Yinbin Han et.al. 2412.18164 null
2024-12-24 Dense-Face: Personalized Face Generation Model via Dense Annotation Prediction Xiao Guo et.al. 2412.18149 null
2024-12-24 Ensuring Consistency for In-Image Translation Chengpeng Fu et.al. 2412.18139 null
2024-12-23 Multi-Agent Path Finding in Continuous Spaces with Projected Diffusion Models Jinhao Liang et.al. 2412.17993 null
2024-12-23 Causal Composition Diffusion Model for Closed-loop Traffic Generation Haohong Lin et.al. 2412.17920 null
2024-12-23 FaceLift: Single Image to 3D Head with View Generation and GS-LRM Weijie Lyu et.al. 2412.17812 null
2024-12-23 PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion Sophia Tang et.al. 2412.17780 null
2024-12-23 The Superposition of Diffusion Models Using the Itô Density Estimator Marta Skreta et.al. 2412.17762 null
2024-12-23 A Bias-Free Training Paradigm for More General AI-generated Image Detection Fabrizio Guillaro et.al. 2412.17671 null
2024-12-23 Benchmarking Generative AI Models for Deep Learning Test Input Generation Maryam et.al. 2412.17652 link
2024-12-23 DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder Ente Lin et.al. 2412.17644 null
2024-12-23 Retention Score: Quantifying Jailbreak Risks for Vision Language Models Zaitang Li et.al. 2412.17544 null
2024-12-23 DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak Hao Wang et.al. 2412.17522 null
2024-12-23 Heterogeneous carrying capacities and global extinction in metapopulations Jakub Hesoun et.al. 2412.17461 null
2024-12-23 AeroDiT: Diffusion Transformers for Reynolds-Averaged Navier-Stokes Simulations of Airfoil Flows Hui Xiang et.al. 2412.17394 null
2024-12-23 Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement Hyeonjin Kim et.al. 2412.17387 link
2024-12-23 Broadband Ground Motion Synthesis by Diffusion Model with Minimal Condition Jaeheun Jung et.al. 2412.17333 null
2024-12-23 Free-viewpoint Human Animation with Pose-correlated Reference Selection Fa-Ting Hong et.al. 2412.17290 null
2024-12-23 Enhancing Multi-Text Long Video Generation Consistency without Tuning: Time-Frequency Analysis, Prompt Alignment, and Theory Xingyao Li et.al. 2412.17254 null
2024-12-23 OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving Tianyi Yan et.al. 2412.17226 null
2024-12-23 CharGen: High Accurate Character-Level Visual Text Generation Model with MultiModal Encoder Lichen Ma et.al. 2412.17225 null
2024-12-23 Discriminative Image Generation with Diffusion Models for Zero-Shot Learning Dingjie Fu et.al. 2412.17219 null
2024-12-22 Generative Diffusion Modeling: A Practical Handbook Zihan Ding et.al. 2412.17162 null
2024-12-22 Similarity Trajectories: Linking Sampling Process to Artifacts in Diffusion-Generated Images Dennis Menn et.al. 2412.17109 null
2024-12-22 Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation Luoxu Jin et.al. 2412.17042 null
2024-12-19 LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis Hanlin Wang et.al. 2412.15214 link
2024-12-19 Flowing from Words to Pixels: A Framework for Cross-Modality Evolution Qihao Liu et.al. 2412.15213 null
2024-12-19 Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation Hadi Alzayer et.al. 2412.15211 null
2024-12-19 AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation Moayed Haji-Ali et.al. 2412.15191 null
2024-12-19 Tiled Diffusion Or Madar et.al. 2412.15185 null
2024-12-19 OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization Jiacheng Zhang et.al. 2412.15159 null
2024-12-19 Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM Yatai Ji et.al. 2412.15156 link
2024-12-19 Jet: A Modern Transformer-Based Normalizing Flow Alexander Kolesnikov et.al. 2412.15129 null
2024-12-19 Uni-Renderer: Unifying Rendering and Inverse Rendering Via Dual Stream Diffusion Zhifei Chen et.al. 2412.15050 null
2024-12-19 DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space Mang Ning et.al. 2412.15032 link
2024-12-19 Stable-V2A: Synthesis of Synchronized Sound Effects with Temporal and Semantic Controls Riccardo Fosco Gramaccioni et.al. 2412.15023 null
2024-12-19 MagicNaming: Consistent Identity Generation by Finding a “Name Space” in T2I Diffusion Models Jing Zhao et.al. 2412.14902 null
2024-12-19 Diffusion priors for Bayesian 3D reconstruction from incomplete measurements Julian L. Möbius et.al. 2412.14897 null
2024-12-19 Generative CKM Construction using Partially Observed Data with Diffusion Model Shen Fu et.al. 2412.14812 null
2024-12-19 Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations Yucheng Hu et.al. 2412.14803 null
2024-12-19 EnergyMoGen: Compositional Human Motion Generation with Energy-Based Diffusion Model in Latent Space Jianrong Zhang et.al. 2412.14706 null
2024-12-19 Event-assisted 12-stop HDR Imaging of Dynamic Scene Shi Guo et.al. 2412.14705 null
2024-12-19 Unified Image Restoration and Enhancement: Degradation Calibrated Cycle Reconstruction Diffusion Model Minglong Xue et.al. 2412.14630 link
2024-12-19 Qua $^2$ SeDiMo: Quantifiable Quantization Sensitivity of Diffusion Models Keith G. Mills et.al. 2412.14628 null
2024-12-19 LDP: Generalizing to Multilingual Visual Information Extraction by Language Decoupled Pretraining Huawen Shen et.al. 2412.14596 null
2024-12-18 AniDoc: Animation Creation Made Easier Yihao Meng et.al. 2412.14173 null
2024-12-18 E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling Zhihang Yuan et.al. 2412.14170 null
2024-12-18 Autoregressive Video Generation without Vector Quantization Haoge Deng et.al. 2412.14169 link
2024-12-18 VideoDPO: Omni-Preference Alignment for Video Diffusion Generation Runtao Liu et.al. 2412.14167 null
2024-12-18 MCMat: Multiview-Consistent and Physically Accurate PBR Material Generation Shenhao Zhu et.al. 2412.14148 null
2024-12-18 SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation Tong Chen et.al. 2412.14018 null
2024-12-18 Comparative Analysis of Machine Learning-Based Imputation Techniques for Air Quality Datasets with High Missing Data Rates Sen Yan et.al. 2412.13966 null
2024-12-18 IDEQ: an improved diffusion model for the TSP Mickael Basson et.al. 2412.13858 null
2024-12-18 Object Style Diffusion for Generalized Object Detection in Urban Scene Hao Li et.al. 2412.13815 null
2024-12-18 Text2Relight: Creative Portrait Relighting with Text Guidance Junuk Cha et.al. 2412.13734 null
2024-12-18 Diffusion models and stochastic quantisation in lattice field theory Gert Aarts et.al. 2412.13704 null
2024-12-18 MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing Chuang Yang et.al. 2412.13684 null
2024-12-18 VIIS: Visible and Infrared Information Synthesis for Severe Low-light Image Enhancement Chen Zhao et.al. 2412.13655 link
2024-12-18 TAUDiff: Improving statistical downscaling for extreme weather events using generative diffusion models Rahul Sundar et.al. 2412.13627 null
2024-12-18 SemiDFL: A Semi-Supervised Paradigm for Decentralized Federated Learning Xinyang Liu et.al. 2412.13589 link
2024-12-18 Urban Air Temperature Prediction using Conditional Diffusion Models Siyang Dai et.al. 2412.13504 null
2024-12-18 VaeDiff-DocRE: End-to-end Data Augmentation Framework for Document-level Relation Extraction Khai Phan Tran et.al. 2412.13503 link
2024-12-18 Real-time One-Step Diffusion-based Expressive Portrait Videos Generation Hanzhong Guo et.al. 2412.13479 link
2024-12-18 SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation Kazuki Shimada et.al. 2412.13462 null
2024-12-18 Zero-Shot Low Light Image Enhancement with Diffusion Prior Joshua Cho et.al. 2412.13401 link
2024-12-16 Causal Diffusion Transformers for Generative Modeling Chaorui Deng et.al. 2412.12095 link
2024-12-16 CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models Felix Taubner et.al. 2412.12093 null
2024-12-16 Wonderland: Navigating 3D Scenes from a Single Image Hanwen Liang et.al. 2412.12091 null
2024-12-16 A LoRA is Worth a Thousand Pictures Chenxi Liu et.al. 2412.12048 null
2024-12-16 The entropic optimal (self-)transport problem: Limit distributions for decreasing regularization with application to score function estimation Gilles Mordant et.al. 2412.12007 null
2024-12-16 Controllable Shadow Generation with Single-Step Diffusion Models from Synthetic Data Onur Tasar et.al. 2412.11972 null
2024-12-16 ColorFlow: Retrieval-Augmented Image Sequence Colorization Junhao Zhuang et.al. 2412.11815 null
2024-12-16 InterDyn: Controllable Interactive Dynamics with Video Diffusion Models Rick Akkerman et.al. 2412.11785 null
2024-12-16 Joint Reconstruction of the Activity and the Attenuation in PET by Diffusion Posterior Sampling: a Feasibility Study Clémentine Phung-Ngoc et.al. 2412.11776 null
2024-12-16 No More Adam: Learning Rate Scaling at Initialization is All You Need Minghao Xu et.al. 2412.11768 link
2024-12-16 Conditional Diffusion Models Based Conditional Independence Testing Yanfeng Yang et.al. 2412.11744 link
2024-12-16 Re-Attentional Controllable Video Diffusion Editing Yuanzhi Wang et.al. 2412.11710 link
2024-12-16 VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting Muhammet Furkan Ilaslan et.al. 2412.11621 link
2024-12-16 3D $^2$ -Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling Zichen Tang et.al. 2412.11599 link
2024-12-16 StrandHead: Text to Strand-Disentangled 3D Head Avatars Using Hair Geometric Priors Xiaokun Sun et.al. 2412.11586 link
2024-12-16 MPQ-DM: Mixed Precision Quantization for Extremely Low Bit Diffusion Models Weilun Feng et.al. 2412.11549 link
2024-12-16 EditSplat: Multi-View Fusion and Attention-Guided Optimization for View-Consistent 3D Scene Editing with 3D Gaussian Splatting Dong In Lee et.al. 2412.11520 null
2024-12-16 LineArt: A Knowledge-guided Training-free High-quality Appearance Transfer for Design Drawing with Diffusion Model Xi Wang et.al. 2412.11519 null
2024-12-16 IGR: Improving Diffusion Model for Garment Restoration from Person Image Le Shen et.al. 2412.11513 null
2024-12-16 MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes Ruijie Lu et.al. 2412.11457 null
2024-12-12 FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion Haonan Qiu et.al. 2412.09626 null
2024-12-12 Illusion3D: 3D Multiview Illusion with 2D Diffusion Priors Yue Feng et.al. 2412.09625 null
2024-12-12 OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation Weiqi Li et.al. 2412.09623 null
2024-12-12 LoRACLR: Contrastive Adaptation for Customization of Diffusion Models Enis Simsar et.al. 2412.09622 null
2024-12-12 SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training Dongting Hu et.al. 2412.09619 null
2024-12-12 EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM Zhuofan Zong et.al. 2412.09618 null
2024-12-12 Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG Kavana Venkatesh et.al. 2412.09614 null
2024-12-12 LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors Yabo Chen et.al. 2412.09597 null
2024-12-12 Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion Zexin He et.al. 2412.09593 null
2024-12-12 SimAvatar: Simulation-Ready Avatars with Layered Hair and Clothing Xueting Li et.al. 2412.09545 null
2024-12-12 Learned Compression for Compressed Learning Dan Jacobellis et.al. 2412.09405 link
2024-12-12 Diffusion Model with Representation Alignment for Protein Inverse Folding Chenglin Wang et.al. 2412.09380 null
2024-12-12 Diffusion Predictive Control with Constraints Ralf Römer et.al. 2412.09342 link
2024-12-12 Auto-Regressive Moving Diffusion Models for Time Series Forecasting Jiaxin Gao et.al. 2412.09328 link
2024-12-12 Are Conditional Latent Diffusion Models Effective for Image Restoration? Yunchen Yuan et.al. 2412.09324 null
2024-12-12 GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expression Ziqi Zhou et.al. 2412.09296 link
2024-12-12 LatentSync: Audio Conditioned Latent Diffusion Models for Lip Sync Chunyu Li et.al. 2412.09262 link
2024-12-12 ExpRDiff: Short-exposure Guided Diffusion Model for Realistic Local Motion Deblurring Zhongbao Yang et.al. 2412.09193 null
2024-12-12 RAD: Region-Aware Diffusion Models for Image Inpainting Sora Kim et.al. 2412.09191 null
2024-12-12 DECOR:Decomposition and Projection of Text Embeddings for Text-to-Image Customization Geonhui Jang et.al. 2412.09169 null
2024-12-11 Generative Semantic Communication: Architectures, Technologies, and Applications Jinke Ren et.al. 2412.08642 null
2024-12-11 DMin: Scalable Training Data Influence Estimation for Diffusion Models Huawei Lin et.al. 2412.08637 link
2024-12-11 TryOffAnyone: Tiled Cloth Generation from a Dressed Person Ioannis Xarchakos et.al. 2412.08573 link
2024-12-11 Learning Flow Fields in Attention for Controllable Person Image Generation Zijian Zhou et.al. 2412.08486 link
2024-12-11 InvDiff: Invariant Guidance for Bias Mitigation in Diffusion Models Min Hou et.al. 2412.08480 link
2024-12-11 CC-Diff: Enhancing Contextual Coherence in Remote Sensing Image Synthesis Mu Zhang et.al. 2412.08464 null
2024-12-11 Reliable Uncertainty Quantification for Fiber Orientation in Composite Molding Processes using Multilevel Polynomial Surrogates Stjepan Salatovic et.al. 2412.08459 null
2024-12-11 Pragmatist: Multiview Conditional Diffusion Models for High-Fidelity 3D Reconstruction from Unposed Sparse Views Songchun Zhang et.al. 2412.08412 null
2024-12-11 Grasp Diffusion Network: Learning Grasp Generators from Partial Point Clouds with Diffusion Models in SO(3)xR3 Joao Carvalho et.al. 2412.08398 null
2024-12-11 Digging into Intrinsic Contextual Information for High-fidelity 3D Point Cloud Completion Jisheng Chu et.al. 2412.08326 link
2024-12-11 GDSG: Graph Diffusion-based Solution Generation for Optimization Problems in MEC Networks Ruihuai Liang et.al. 2412.08296 link
2024-12-11 Self-Refining Diffusion Samplers: Enabling Parallelization via Parareal Iterations Nikil Roashan Selvam et.al. 2412.08292 link
2024-12-11 Toward Near-Globally Optimal Nonlinear Model Predictive Control via Diffusion Models Tzu-Yuan Huang et.al. 2412.08278 null
2024-12-11 Unicorn: Unified Neural Image Compression with One Number Reconstruction Qi Zheng et.al. 2412.08210 null
2024-12-11 LatentSpeech: Latent Diffusion for Text-To-Speech Generation Haowei Lou et.al. 2412.08117 null
2024-12-11 DAKD: Data Augmentation and Knowledge Distillation using Diffusion Models for SAR Oil Spill Segmentation Jaeho Moon et.al. 2412.08116 null
2024-12-10 Diffusion-Based Attention Warping for Consistent 3D Scene Editing Eyal Gomel et.al. 2412.07984 null
2024-12-10 Non-Normal Diffusion Models Henry Li et.al. 2412.07935 null
2024-12-10 Score Change of Variables Stephen Robbins et.al. 2412.07904 null
2024-12-10 Score-Optimal Diffusion Schedules Christopher Williams et.al. 2412.07877 null
2024-12-09 [MASK] is All You Need Vincent Tao Hu et.al. 2412.06787 link
2024-12-09 Tactile DreamFusion: Exploiting Tactile Sensing for 3D Generation Ruihan Gao et.al. 2412.06785 link
2024-12-09 Diverse Score Distillation Yanbo Xu et.al. 2412.06780 null
2024-12-09 Visual Lexicon: Rich Image Features in Language Space XuDong Wang et.al. 2412.06774 null
2024-12-09 InstantRestore: Single-Step Personalized Face Restoration with Shared-Image Attention Howard Zhang et.al. 2412.06753 null
2024-12-09 ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet Andrei-Robert Alexandrescu et.al. 2412.06742 null
2024-12-09 Take Fake as Real: Realistic-like Robust Black-box Adversarial Attack to Evade AIGC Detection Caiyun Xie et.al. 2412.06727 link
2024-12-09 You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale Baorui Ma et.al. 2412.06699 link
2024-12-09 Gen-3Diffusion: Realistic Image-to-3D Generation via 2D & 3D Diffusion Synergy Yuxuan Xue et.al. 2412.06698 null
2024-12-09 Diff5T: Benchmarking Human Brain Diffusion MRI with an Extensive 5.0 Tesla K-Space and Spatial Dataset Shanshan Wang et.al. 2412.06666 null
2024-12-09 Efficiency Meets Fidelity: A Novel Quantization Framework for Stable Diffusion Shuaiting Li et.al. 2412.06661 null
2024-12-09 MVReward: Better Aligning and Evaluating Multi-View Diffusion Models with Human Preferences Weitao Wang et.al. 2412.06614 null
2024-12-09 Diffusion on the circle and a stochastic correlation model Sourav Majumdar et.al. 2412.06343 null
2024-12-09 Normalizing Flows are Capable Generative Models Shuangfei Zhai et.al. 2412.06329 link
2024-12-09 See Further When Clear: Curriculum Consistency Model Yunpeng Liu et.al. 2412.06295 null
2024-12-09 No Annotations for Object Detection in Art through Stable Diffusion Patrick Ramos et.al. 2412.06286 link
2024-12-09 Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction Dongxu Wei et.al. 2412.06273 null
2024-12-09 Rendering-Refined Stable Diffusion for Privacy Compliant Synthetic Data Kartik Patwari et.al. 2412.06248 null
2024-12-09 ASGDiffusion: Parallel High-Resolution Generation with Asynchronous Structure Guidance Yuming Li et.al. 2412.06163 null
2024-12-09 Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters Yuan Wang et.al. 2412.06143 link
2024-12-05 PaintScene4D: Consistent 4D Scene Generation from Text Prompts Vinayak Gupta et.al. 2412.04471 null
2024-12-05 LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors Yusuf Dalva et.al. 2412.04460 null
2024-12-05 Four-Plane Factorized Video Autoencoders Mohammed Suhail et.al. 2412.04452 null
2024-12-05 MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation Longtao Zheng et.al. 2412.04448 null
2024-12-05 DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models Yizhuo Li et.al. 2412.04446 null
2024-12-05 Learning Artistic Signatures: Symmetry Discovery and Style Transfer Emma Finn et.al. 2412.04441 null
2024-12-05 Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation Yuying Ge et.al. 2412.04432 link
2024-12-05 Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis Jian Han et.al. 2412.04431 link
2024-12-05 Reversible molecular simulation for training classical and machine learning force fields Joe G Greener et.al. 2412.04374 link
2024-12-05 ActFusion: a Unified Diffusion Model for Action Segmentation and Anticipation Dayoung Gong et.al. 2412.04353 null
2024-12-05 RMD: A Simple Baseline for More General Human Motion Generation via Training-free Retrieval-Augmented Motion Diffuse Zhouyingcheng Liao et.al. 2412.04343 null
2024-12-05 Multi-Subject Image Synthesis as a Generative Prior for Single-Subject PET Image Reconstruction George Webber et.al. 2412.04324 null
2024-12-05 Structure-Aware Stylized Image Synthesis for Robust Medical Image Segmentation Jie Bao et.al. 2412.04296 link
2024-12-05 LMDM:Latent Molecular Diffusion Model For 3D Molecule Generation Xiang Chen et.al. 2412.04242 null
2024-12-05 CALMM-Drive: Confidence-Aware Autonomous Driving with Large Multimodal Model Ruoyu Yao et.al. 2412.04209 null
2024-12-05 Instructional Video Generation Yayuan Li et.al. 2412.04189 null
2024-12-05 AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models Xinghui Li et.al. 2412.04146 null
2024-12-05 Understanding Memorization in Generative Models via Sharpness in Probability Landscapes Dongjae Jeon et.al. 2412.04140 null
2024-12-05 Compositional Generative Multiphysics and Multi-component Simulation Tao Zhang et.al. 2412.04134 link
2024-12-05 IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation Sejong Yang et.al. 2412.04000 null
2024-12-04 MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation Zehuan Huang et.al. 2412.03558 null
2024-12-04 NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images Lingen Li et.al. 2412.03517 null
2024-12-04 Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion Shengyuan Zhang et.al. 2412.03515 link
2024-12-04 CleanDIFT: Diffusion Features without Noise Nick Stracke et.al. 2412.03439 link
2024-12-04 SINGER: Vivid Audio-driven Singing Video Generation with Multi-scale Spectral Diffusion Model Yan Li et.al. 2412.03430 null
2024-12-04 Skel3D: Skeleton Guided Novel View Synthesis Aron Fóthi et.al. 2412.03407 null
2024-12-04 Identifiability implies consistency of MLE in partially observed diffusions on a torus Ibrahim Ekren et.al. 2412.03380 null
2024-12-04 TASR: Timestep-Aware Diffusion Model for Image Super-Resolution Qinwei Lin et.al. 2412.03355 link
2024-12-04 DIVE: Taming DINO for Subject-Driven Video Editing Yi Huang et.al. 2412.03347 null
2024-12-04 Geometry-guided Cross-view Diffusion for One-to-many Cross-view Image Synthesis Tao Jun Lin et.al. 2412.03315 null
2024-12-04 Diffusion-VLA: Scaling Robot Foundation Models via Unified Diffusion and Autoregression Junjie Wen et.al. 2412.03293 null
2024-12-04 Black-Box Forgery Attacks on Semantic Watermarks for Diffusion Models Andreas Müller et.al. 2412.03283 null
2024-12-04 Generating Synthetic Genotypes using Diffusion Models Philip Kenneweg et.al. 2412.03278 link
2024-12-04 RFSR: Improving ISR Diffusion Models via Reward Feedback Learning Xiaopeng Sun et.al. 2412.03268 link
2024-12-04 DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation Qingdong He et.al. 2412.03255 null
2024-12-04 A seamless local-nonlocal coupling diffusion model with $H^1$ vanishing nonlocality convergence Yanzun Meng et.al. 2412.03153 null
2024-12-04 Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis Siyoon Jin et.al. 2412.03150 null
2024-12-04 Generalized Diffusion Model with Adjusted Offset Noise Takuro Kutsuna et.al. 2412.03134 null
2024-12-04 MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction Gangjian Zhang et.al. 2412.03103 null
2024-12-04 Mimir: Improving Video Diffusion Models for Precise Text Understanding Shuai Tan et.al. 2412.03085 null
2024-11-29 MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks Yiming Wu et.al. 2411.19786 null
2024-11-29 Riemannian Denoising Score Matching for Molecular Structure Optimization with Accurate Energy Jeheon Woo et.al. 2411.19769 null
2024-11-29 TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting Bojun Xiong et.al. 2411.19654 link
2024-11-29 Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing Wenyi Mo et.al. 2411.19652 link
2024-11-29 Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook Florinel-Alin Croitoru et.al. 2411.19537 link
2024-11-29 Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis Tianqi Li et.al. 2411.19509 link
2024-11-29 Diffusion Models Meet Network Management: Improving Traffic Matrix Analysis with Diffusion-based Approach Xinyu Yuan et.al. 2411.19493 link
2024-11-28 DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models Shwetha Ram et.al. 2411.19390 null
2024-11-28 Enhancing Sketch Animation: Text-to-Video Diffusion Models with Temporal Consistency and Rigidity Constraints Gaurav Rai et.al. 2411.19381 null
2024-11-28 Towards a Mechanistic Explanation of Diffusion Model Generalization Matthew Niedoba et.al. 2411.19339 null
2024-11-28 Trajectory Attention for Fine-grained Video Motion Control Zeqi Xiao et.al. 2411.19324 null
2024-11-28 Improving Multi-Subject Consistency in Open-Domain Image Generation with Isolation and Reposition Attention Huiguo He et.al. 2411.19261 null
2024-11-28 Gaussians-to-Life: Text-Driven Animation of 3D Gaussian Splatting Scenes Thomas Wimmer et.al. 2411.19233 link
2024-11-28 Z-STAR+: A Zero-shot Style Transfer Method via Adjusting Style Distribution Yingying Deng et.al. 2411.19231 null
2024-11-28 Video Depth without Video Models Bingxin Ke et.al. 2411.19189 null
2024-11-28 SOWing Information: Cultivating Contextual Coherence with MLLMs in Image Generation Yuhan Pei et.al. 2411.19182 null
2024-11-28 Bayesian Deconvolution of Astronomical Images with Diffusion Models: Quantifying Prior-Driven Features in Reconstructions Alessio Spagnoletti et.al. 2411.19158 link
2024-11-28 Timestep Embedding Tells: It’s Time to Cache for Video Diffusion Model Feng Liu et.al. 2411.19108 null
2024-11-28 I Dream My Painting: Connecting MLLMs and Diffusion Models via Prompt Generation for Text-Guided Multi-Mask Inpainting Nicola Fanelli et.al. 2411.19050 link
2024-11-28 3D-WAG: Hierarchical Wavelet-Guided Autoregressive Generation for High-Fidelity 3D Shapes Tejaswini Medi et.al. 2411.19037 null
2024-11-27 GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data Wentao Wang et.al. 2411.18624 null
2024-11-27 Diffusion Self-Distillation for Zero-Shot Customized Image Generation Shengqu Cai et.al. 2411.18616 null
2024-11-27 CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models Rundi Wu et.al. 2411.18613 null
2024-11-27 Evaluating and Improving the Effectiveness of Synthetic Chest X-Rays for Medical Image Analysis Eva Prakash et.al. 2411.18602 null
2024-11-27 FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion Haosen Yang et.al. 2411.18552 null
2024-11-27 Enhancing weed detection performance by means of GenAI-based image augmentation Sourav Modak et.al. 2411.18513 null
2024-11-27 Learning the Evolution of Physical Structure of Galaxies via Diffusion Models Andrew Lizarraga et.al. 2411.18440 link
2024-11-27 Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models Yiming Wu et.al. 2411.18375 null
2024-11-27 TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models Riza Velioglu et.al. 2411.18350 link
2024-11-27 HiFiVFS: High Fidelity Video Face Swapping Xu Chen et.al. 2411.18293 null
2024-11-27 TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution Linwei Dong et.al. 2411.18263 link
2024-11-27 Dependency-Aware CAV Task Scheduling via Diffusion-Based Reinforcement Learning Xiang Cheng et.al. 2411.18230 null
2024-11-27 Uniqueness and regularity of weak solutions of a drift-diffusion system for perovskite solar cells Annegret Glitzky et.al. 2411.18223 null
2024-11-27 Prediction with Action: Visual Policy Learning via Joint Denoising Process Yanjiang Guo et.al. 2411.18179 null
2024-11-27 ModeDreamer: Mode Guiding Score Distillation for Text-to-3D Generation using Reference Image Prompts Uy Dieu Tran et.al. 2411.18135 null
2024-11-27 Training Data Synthesis with Difficulty Controlled Diffusion Model Zerun Wang et.al. 2411.18109 null
2024-11-27 PersonaCraft: Personalized Full-Body Image Synthesis for Multiple Identities from Single References Using 3D-Model-Conditioned Diffusion Gwanghyun Kim et.al. 2411.18068 null
2024-11-27 Generative Semantic Communication for Joint Image Transmission and Segmentation Weiwen Yuan et.al. 2411.18005 null
2024-11-27 Improved implicit diffusion model with knowledge distillation to estimate the spatial distribution density of carbon stock in remote sensing imagery Zhenyu Yu et.al. 2411.17973 null
2024-11-27 ROICtrl: Boosting Instance Control for Visual Generation Yuchao Gu et.al. 2411.17949 null
2024-11-25 Generative Omnimatte: Learning to Decompose Video into Layers Yao-Chih Lee et.al. 2411.16683 null
2024-11-25 Diffusion Features for Zero-Shot 6DoF Object Pose Estimation Bernd Von Gimborn et.al. 2411.16668 null
2024-11-25 LegoPET: Hierarchical Feature Guided Conditional Diffusion for PET Image Reconstruction Yiran Sun et.al. 2411.16629 link
2024-11-25 Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models Ronghuan Wu et.al. 2411.16602 null
2024-11-25 Unlocking The Potential of Adaptive Attacks on Diffusion-Based Purification Andre Kassis et.al. 2411.16598 link
2024-11-25 Rethinking Diffusion for Text-Driven Human Motion Generation Zichong Meng et.al. 2411.16575 null
2024-11-25 Representation Collapsing Problems in Vector Quantization Wenhao Zhao et.al. 2411.16550 null
2024-11-25 ADOBI: Adaptive Diffusion Bridge For Blind Inverse Problems with Application to MRI Reconstruction Yuyang Hu et.al. 2411.16535 null
2024-11-25 Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis Boming Miao et.al. 2411.16503 null
2024-11-25 Model-based reinforcement corrosion prediction: Continuous calibration with Bayesian optimization and corrosion wire sensor data A. Potnis et.al. 2411.16447 null
2024-11-25 Privacy Protection in Personalized Diffusion Models via Targeted Cross-Attention Adversarial Attack Xide Xu et.al. 2411.16437 null
2024-11-25 Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing Kaifeng Gao et.al. 2411.16375 link
2024-11-25 One Diffusion to Generate Them All Duong H. Le et.al. 2411.16318 link
2024-11-25 An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models Wentao Qu et.al. 2411.16308 link
2024-11-25 DiffDesign: Controllable Diffusion with Meta Prior for Efficient Interior Design Generation Yuxuan Yang et.al. 2411.16301 null
2024-11-25 SMGDiff: Soccer Motion Generation using diffusion probabilistic models Hongdi Yang et.al. 2411.16216 null
2024-11-25 Fancy123: One Image to High-Quality 3D Mesh Generation via Plug-and-Play Deformation Qiao Yu et.al. 2411.16185 link
2024-11-25 Image Generation Diversity Issues and How to Tame Them Mischa Dombrowski et.al. 2411.16171 link
2024-11-25 Text-to-Image Synthesis: A Decade Survey Nonghai Zhang et.al. 2411.16164 null
2024-11-25 MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model Chenjie Cao et.al. 2411.16157 link
2024-11-21 Stable Flow: Vital Layers for Training-Free Image Editing Omri Avrahami et.al. 2411.14430 link
2024-11-21 Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation Yuanhao Cai et.al. 2411.14384 null
2024-11-21 CoNFiLD-inlet: Synthetic Turbulence Inflow Using Generative Latent Diffusion Models with Neural Fields Xin-Yang Liu et.al. 2411.14378 null
2024-11-21 Enhancing Medical Image Segmentation with Deep Learning and Diffusion Models Houze Liu et.al. 2411.14353 null
2024-11-21 StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart Jian Shi et.al. 2411.14295 link
2024-11-21 Guided MRI Reconstruction via Schrödinger Bridge Yue Wang et.al. 2411.14269 null
2024-11-21 TaQ-DiT: Time-aware Quantization for Diffusion Transformers Xinyan Liu et.al. 2411.14172 null
2024-11-21 RestorerID: Towards Tuning-Free Face Restoration with ID Preservation Jiacheng Ying et.al. 2411.14125 link
2024-11-21 Point Cloud Resampling with Learnable Heat Diffusion Wenqiang Xu et.al. 2411.14120 null
2024-11-21 Transforming Static Images Using Generative Models for Video Salient Object Detection Suhwan Cho et.al. 2411.13975 link
2024-11-21 Decoupled Sparse Priors Guided Diffusion Compression Model for Point Clouds Xiaoge Zhang et.al. 2411.13860 null
2024-11-21 Detecting Human Artifacts from Text-to-Image Models Kaihong Wang et.al. 2411.13842 link
2024-11-21 CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation Lin Sun et.al. 2411.13836 link
2024-11-21 MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control Ruiyuan Gao et.al. 2411.13807 null
2024-11-20 Non-Linear Outlier Synthesis for Out-of-Distribution Detection Lars Doorenbos et.al. 2411.13619 link
2024-11-20 REDUCIO! Generating 1024 $\times$ 1024 Video within 16 Seconds using Extremely Compressed Motion Latents Rui Tian et.al. 2411.13552 link
2024-11-20 Identity Preserving 3D Head Stylization with Multiview Score Distillation Bahri Batuhan Bilecen et.al. 2411.13536 null
2024-11-20 Heuristically Adaptive Diffusion-Model Evolutionary Strategy Benedikt Hartl et.al. 2411.13420 null
2024-11-20 XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation Ziyi Wang et.al. 2411.13243 link
2024-11-20 A computational framework for integrating Predictive processes with evidence Accumulation Models (PAM) Antonino Visalli et.al. 2411.13203 link
2024-11-20 RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation Christoph Reinders et.al. 2411.13150 link
2024-11-20 CopyrightMeter: Revisiting Copyright Protection in Text-to-image Models Naen Xu et.al. 2411.13144 null
2024-11-20 Virtual Staining of Label-Free Tissue in Imaging Mass Spectrometry Yijie Zhang et.al. 2411.13120 null
2024-11-19 Breaking the wire: the impact of critical length on melting pathways in silver nanowires Kannan M Ridings et.al. 2411.12891 null
2024-11-19 From Text to Pose to Image: Improving Diffusion Model Control and Quality Clément Bonnett et.al. 2411.12872 link
2024-11-19 CDI: Copyrighted Data Identification in Diffusion Models Jan Dubiński et.al. 2411.12858 link
2024-11-19 Towards motion from video diffusion models Paul Janson et.al. 2411.12831 null
2024-11-19 Stylecodes: Encoding Stylistic Information For Image Generation Ciara Rowles et.al. 2411.12811 link
2024-11-19 PoM: Efficient Image and Video Generation with the Polynomial Mixer David Picard et.al. 2411.12663 link
2024-11-19 Improving Controllability and Editability for Pretrained Text-to-Music Generation Models Yixiao Zhang et.al. 2411.12641 null
2024-11-19 Data Pruning in Generative Diffusion Models Rania Briq et.al. 2411.12523 link
2024-11-19 Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models Jun Xiao et.al. 2411.12450 null
2024-11-19 Combinational Backdoor Attack against Customized Text-to-Image Models Wenbo Jiang et.al. 2411.12389 null
2024-11-19 Scalable and Effective Negative Sample Generation for Hyperedge Prediction Shilin Qu et.al. 2411.12354 null
2024-11-19 Diffusion Product Quantization Jie Shao et.al. 2411.12306 null
2024-11-18 Aligning Few-Step Diffusion Models with Dense Reward Difference Learning Ziyi Zhang et.al. 2411.11727 link
2024-11-18 Robust Reinforcement Learning under Diffusion Models for Data with Jumps Chenyang Jiang et.al. 2411.11697 null
2024-11-18 Conceptwm: A Diffusion Model Watermark for Concept Protection Liangqi Lei et.al. 2411.11688 null
2024-11-18 Cascaded Diffusion Models for 2D and 3D Microscopy Image Synthesis to Enhance Cell Segmentation Rüveyda Yilmaz et.al. 2411.11515 link
2024-11-18 MVLight: Relightable Text-to-3D Generation via Light-conditioned Multi-View Diffusion Dongseok Shim et.al. 2411.11475 null
2024-11-18 CLUE-MARK: Watermarking Diffusion Models using CLWE Kareem Shehata et.al. 2411.11434 null
2024-11-18 Teaching Video Diffusion Model with Latent Physical Phenomenon Knowledge Qinglong Cao et.al. 2411.11343 null
2024-11-18 Stochastic quantization and diffusion models Kenji Fukushima et.al. 2411.11297 null
2024-11-17 Stealing Training Graphs from Graph Neural Networks Minhua Lin et.al. 2411.11197 null
2024-11-17 DeepSPV: An Interpretable Deep Learning Pipeline for 3D Spleen Volume Estimation from 2D Ultrasound Images Zhen Yuan et.al. 2411.11190 null
2024-11-17 Integrated Ising Model with global inhibition for decision making Olga Tapinova et.al. 2411.11143 null
2024-11-17 Oscillation Inversion: Understand the structure of Large Flow Model through the Lens of Inversion Method Yan Zheng et.al. 2411.11135 null
2024-11-17 Dynamic Dimensioning of Frequency Containment Reserves: The Case of the Nordic Grid Jöbke Janssen et.al. 2411.11093 null
2024-11-17 D-Cube: Exploiting Hyper-Features of Diffusion Model for Robust Medical Classification Minhee Jang et.al. 2411.11087 link
2024-11-17 Time Step Generating: A Universal Synthesized Deepfake Image Detector Ziyue Zeng et.al. 2411.11016 link
2024-11-17 Direct and Explicit 3D Generation from a Single Image Haoyu Wu et.al. 2411.10947 null
2024-11-17 Iterative Camera-LiDAR Extrinsic Optimization via Surrogate Diffusion Ni Ou et.al. 2411.10936 null
2024-11-17 Constrained Diffusion with Trust Sampling William Huang et.al. 2411.10932 link
2024-11-16 Generating Compositional Scenes via Text-to-image RGBA Instance Generation Alessandro Fontanella et.al. 2411.10913 null
2024-11-16 MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation Ansh Shah et.al. 2411.10886 link
2024-11-14 Golden Noise for Diffusion Models: A Learning Framework Zikai Zhou et.al. 2411.09502 link
2024-11-14 DiffRoad: Realistic and Diverse Road Scenario Generation for Autonomous Vehicle Testing Junjie Zhou et.al. 2411.09451 null
2024-11-14 Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models Chutian Meng et.al. 2411.09449 null
2024-11-12 Mediffusion: Joint Diffusion for Self-Explainable Semi-Supervised Classification and Medical Image Generation Joanna Kaleta et.al. 2411.09434 null
2024-11-14 A survey of probabilistic generative frameworks for molecular simulations Richard John et.al. 2411.09388 link
2024-11-14 EEG-Based Speech Decoding: A Novel Approach Using Multi-Kernel Ensemble Diffusion Models Soowon Kim et.al. 2411.09302 null
2024-11-14 Advancing Diffusion Models: Alias-Free Resampling and Enhanced Rotational Equivariance Md Fahim Anjum et.al. 2411.09174 null
2024-11-14 VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation Youpeng Wen et.al. 2411.09153 null
2024-11-14 General linear threshold models with application to influence maximization Alexander Kagan et.al. 2411.09100 link
2024-11-13 Inconsistencies In Consistency Models: Better ODE Solving Does Not Imply Better Samples Noël Vouitsis et.al. 2411.08954 link
2024-11-13 4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization Mijeong Kim et.al. 2411.08879 null
2024-11-13 Offline Adaptation of Quadruped Locomotion using Diffusion Models Reece O’Mahoney et.al. 2411.08832 link
2024-11-13 Towards More Accurate Fake Detection on Images Generated from Advanced Generative and Neural Rendering Models Chengdong Dong et.al. 2411.08642 null
2024-11-13 V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion Xun Huang et.al. 2411.08402 link
2024-11-13 Physics Informed Distillation for Diffusion Models Joshua Tian Jin Tee et.al. 2411.08378 link
2024-11-13 Generative AI for Data Augmentation in Wireless Networks: Analysis, Applications, and Case Study Jinbo Wen et.al. 2411.08341 null
2024-11-13 Motion Control for Enhanced Complex Action Video Generation Qiang Zhou et.al. 2411.08328 null
2024-11-13 DNN Task Assignment in UAV Networks: A Generative AI Enhanced Multi-Agent Reinforcement Learning Approach Xin Tang et.al. 2411.08299 null
2024-11-12 Joint Diffusion models in Continual Learning Paweł Skierś et.al. 2411.08224 null
2024-11-12 Latent Space Disentanglement in Diffusion Transformers Enables Precise Zero-shot Semantic Editing Zitao Shuai et.al. 2411.08196 null
2024-11-12 Well-posedness of a Variable-Exponent Telegraph Equation Applied to Image Despeckling Sudeb Majee et.al. 2411.08175 null
2024-11-12 An age-structured diffusive model for epidemic modelling: Lie symmetries and exact solutions Roman Cherniha et.al. 2411.08083 null
2024-11-13 Scaling Properties of Diffusion Models for Perceptual Tasks Rahul Ravishankar et.al. 2411.08034 null
2024-11-12 GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation Yushi Lan et.al. 2411.08033 null
2024-11-12 Diverse capability and scaling of diffusion and auto-regressive models when learning abstract rules Binxu Wang et.al. 2411.07873 null
2024-11-12 Novel View Synthesis with Pixel-Space Diffusion Models Noam Elata et.al. 2411.07765 null
2024-11-12 Nanosecond nanothermometry in an electron microscope Florian Castioni et.al. 2411.07764 null
2024-11-12 Leveraging Previous Steps: A Training-free Fast Solver for Flow Diffusion Kaiyu Song et.al. 2411.07627 null
2024-11-12 Unraveling the Connections between Flow Matching and Diffusion Probabilistic Models in Training-free Conditional Generation Kaiyu Song et.al. 2411.07625 null
2024-11-12 Harmonizing Pixels and Melodies: Maestro-Guided Film Score Generation and Composition Style Transfer F. Qi et.al. 2411.07539 null
2024-11-11 Score-based generative diffusion with “active” correlated noise sources Alexandra Lamtyugina et.al. 2411.07233 null
2024-11-11 Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models Yoad Tewel et.al. 2411.07232 null
2024-11-11 DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID Nyle Siddiqui et.al. 2411.07205 link
2024-11-11 Crossover from inhomogeneous to homogeneous response of a resonantly driven hBN quantum emitter Domitille Gérard et.al. 2411.07202 null
2024-11-11 OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision Cong Wei et.al. 2411.07199 null
2024-11-11 More Expressive Attention with Negative Weights Ang Lv et.al. 2411.07176 link
2024-11-11 Edify 3D: Scalable High-Quality 3D Asset Generation NVIDIA et.al. 2411.07135 null
2024-11-11 Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models NVIDIA et.al. 2411.07126 null
2024-11-11 White-Box Diffusion Transformer for single-cell RNA-seq generation Zhuorui Cui et.al. 2411.06785 link
2024-11-11 DiffSR: Learning Radar Reflectivity Synthesis via Diffusion Model from Satellite Observations Xuming He et.al. 2411.06714 null
2024-11-11 Layout Control and Semantic Guidance with Attention Loss Backward for T2I Diffusion Model Guandong Li et.al. 2411.06692 null
2024-11-11 SeedEdit: Align Image Re-Generation to Image Editing Yichun Shi et.al. 2411.06686 null
2024-11-10 Using Diffusion Models as Generative Replay in Continual Federated Learning – What will Happen? Yongsheng Mei et.al. 2411.06618 null
2024-11-10 CASC: Condition-Aware Semantic Communication with Latent Diffusion Models Weixuan Chen et.al. 2411.06552 null
2024-11-10 Numerical analysis of the cross-diffusion Cahn-Hilliard model in lymphangiogenesis Boyi Wang et.al. 2411.06488 null
2024-11-10 Improved Video VAE for Latent Video Diffusion Model Pingyu Wu et.al. 2411.06449 null
2024-11-10 Detecting AutoEncoder is Enough to Catch LDM Generated Images Dmitry Vesnin et.al. 2411.06441 link
2024-11-10 PLM-Based Discrete Diffusion Language Models with Entropy-Adaptive Gibbs Sampling Hyukhun Koh et.al. 2411.06438 null
2024-11-09 Exploring Out-of-distribution Detection for Sparse-view Computed Tomography with Diffusion Models Ezgi Demircan-Tureyen et.al. 2411.06308 null
2024-11-09 Text2CAD: Text to 3D CAD Generation via Technical Drawings Mohsen Yavartanoo et.al. 2411.06206 null
2024-11-07 SVDQunat: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models Muyang Li et.al. 2411.05007 link
2024-11-07 ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing Jun-Kun Chen et.al. 2411.05006 null
2024-11-07 Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models Shuhong Zheng et.al. 2411.05005 null
2024-11-07 ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning David Junhao Zhang et.al. 2411.05003 null
2024-11-07 SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation Koichi Namekata et.al. 2411.04989 null
2024-11-07 Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification Mischa Dombrowski et.al. 2411.04956 null
2024-11-07 DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion Wenqiang Sun et.al. 2411.04928 null
2024-11-07 Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion Inversion Kaizhe Hu et.al. 2411.04919 link
2024-11-06 Boosting Latent Diffusion with Perceptual Objectives Tariq Berrada et.al. 2411.04873 null
2024-11-07 Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation Benito Buchheim et.al. 2411.04724 null
2024-11-07 DanceFusion: A Spatio-Temporal Skeleton Diffusion Transformer for Audio-Driven Dance Motion Reconstruction Li Zhao et.al. 2411.04646 null
2024-11-07 Brain Tumour Removing and Missing Modality Generation using 3D WDM André Ferreira et.al. 2411.04630 link
2024-11-07 Social EgoMesh Estimation Luca Scofano et.al. 2411.04598 link
2024-11-07 Series-to-Series Diffusion Bridge Model Hao Yang et.al. 2411.04491 null
2024-11-07 HandCraft: Anatomically Correct Restoration of Malformed Hands in Diffusion Generated Images Zhenyue Qin et.al. 2411.04332 null
2024-11-06 PocoLoco: A Point Cloud Diffusion Model of Human Shape in Loose Clothing Siddharth Seth et.al. 2411.04249 link
2024-11-06 Quantum Diffusion Models for Few-Shot Learning Ruhan Wang et.al. 2411.04217 null
2024-11-06 DiMSUM: Diffusion Mamba – A Scalable and Unified Spatial-Frequency Method for Image Generation Hao Phung et.al. 2411.04168 link
2024-11-06 Community Forensics: Using Thousands of Generators to Train Fake Image Detectors Jeongsoo Park et.al. 2411.04125 link
2024-11-06 Synomaly Noise and Multi-Stage Diffusion: A Novel Approach for Unsupervised Anomaly Detection in Ultrasound Imaging Yuan Bi et.al. 2411.04004 link
2024-11-06 ET-SEED: Efficient Trajectory-Level SE(3) Equivariant Diffusion Policy Chenrui Tie et.al. 2411.03990 null
2024-11-06 ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models Ashutosh Srivastava et.al. 2411.03982 null
2024-11-06 ROBIN: Robust and Invisible Watermarks for Diffusion Models with Adversarial Optimization Huayang Huang et.al. 2411.03862 link
2024-11-06 Sub-DM:Subspace Diffusion Model with Orthogonal Decomposition for MRI Reconstruction Yu Guan et.al. 2411.03758 link
2024-11-06 Zero-shot Dynamic MRI Reconstruction with Global-to-local Diffusion Model Yu Guan et.al. 2411.03723 link
2024-11-06 Investigating Conceptual Blending of a Diffusion Model for Improving Nonword-to-Image Generation Chihaya Matsuhira et.al. 2411.03595 null
2024-11-05 Estimating Ego-Body Pose from Doubly Sparse Egocentric Video Data Seunggeun Chi et.al. 2411.03561 null
2024-11-05 SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture Andrew Heschl et.al. 2411.03505 link
2024-11-05 DM4Steal: Diffusion Model For Link Stealing Attack On Graph Neural Networks Jinyin Chen et.al. 2411.03364 null
2024-11-05 DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models Ying Zhou et.al. 2411.03250 null
2024-11-05 On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models Tariq Berrada Ifriqi et.al. 2411.03177 null
2024-11-05 Unleashing the power of novel conditional generative approaches for new materials discovery Lev Novitskiy et.al. 2411.03156 link
2024-11-05 Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising Tao Huang et.al. 2411.03053 null
2024-11-05 GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details Zhongjin Luo et.al. 2411.03047 null
2024-11-05 IMUDiffusion: A Diffusion Model for Multivariate Time Series Synthetisation for Inertial Motion Capturing Systems Heiko Oppel et.al. 2411.02954 null
2024-11-05 LDPM: Towards undersampled MRI reconstruction with MR-VAE and Latent Diffusion Prior Xingjian Tang et.al. 2411.02951 null
2024-11-05 How much is a noisy image worth? Data Scaling Laws for Ambient Diffusion Giannis Daras et.al. 2411.02780 link
2024-11-04 Modelling Alzheimer’s Protein Dynamics: A Data-Driven Integration of Stochastic Methods, Machine Learning and Connectome Insights Alec MacIver et.al. 2411.02644 null
2024-11-04 Training-free Regional Prompting for Diffusion Transformers Anthony Chen et.al. 2411.02395 link
2024-11-04 Diffusion-based Generative Multicasting with Intent-aware Semantic Decomposition Xinkai Liu et.al. 2411.02334 null
2024-11-04 LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation Mufei Li et.al. 2411.02322 link
2024-11-04 Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation Xianghui Yang et.al. 2411.02293 null
2024-11-04 FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training Ruihong Yin et.al. 2411.02229 null
2024-11-04 CleAR: Robust Context-Guided Generative Lighting Estimation for Mobile Augmented Reality Yiqin Zhao et.al. 2411.02179 null
2024-11-04 Model Integrity when Unlearning with T2I Diffusion Models Andrea Schioppa et.al. 2411.02068 null
2024-11-04 DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability Bo Gao et.al. 2411.01819 null
2024-11-04 MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence Fuming You et.al. 2411.01805 null
2024-11-04 A Regressor-Guided Graph Diffusion Model for Predicting Enzyme Mutations to Enhance Turnover Number Xiaozhu Yu et.al. 2411.01745 link
2024-11-04 xDiT: an Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism Jiarui Fang et.al. 2411.01738 link
2024-11-04 LaGDif: Latent Graph Diffusion Model for Efficient Protein Inverse Folding with Self-Ensemble Taoyu Wu et.al. 2411.01737 link
2024-11-03 Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation Zhenbin Wang et.al. 2411.01647 null
2024-11-03 HC $^3$ L-Diff: Hybrid conditional latent diffusion with high frequency enhancement for CBCT-to-CT synthesis Shi Yin et.al. 2411.01575 null
2024-11-03 Conditional Controllable Image Fusion Bing Cao et.al. 2411.01573 link
2024-11-03 Statistical guarantees for denoising reflected diffusion models Asbjørn Holk et.al. 2411.01563 null
2024-11-03 Towards Small Object Editing: A Benchmark Dataset and A Training-Free Approach Qihe Pan et.al. 2411.01545 link
2024-11-03 Digressions on Irreversibility and Stochastic Systems Giorgio Picci et.al. 2411.01516 null
2024-11-03 DPCL-Diff: The Temporal Knowledge Graph Reasoning based on Graph Node Diffusion Model with Dual-Domain Periodic Contrastive Learning Yukun Cao et.al. 2411.01477 null
2024-11-03 Two-Timescale Model Caching and Resource Allocation for Edge-Enabled AI-Generated Content Services Zhang Liu et.al. 2411.01458 null
2024-10-31 DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion Weicai Ye et.al. 2410.24203 link
2024-10-31 **Redefining in Dictionary: Towards a Enhanced Semantic Understanding of Creative Generation** Fu Feng et.al. 2410.24160 null
2024-10-31 Scaling Concept With Text-Guided Diffusion Models Chao Huang et.al. 2410.24151 null
2024-10-31 Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian Structure Xiang Li et.al. 2410.24060 link
2024-10-31 TPC: Test-time Procrustes Calibration for Diffusion-based Human Image Animation Sunjae Yoon et.al. 2410.24037 null
2024-10-31 DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination Jia Fu et.al. 2410.24006 link
2024-10-31 Breaking Determinism: Fuzzy Modeling of Sequential Recommendation Using Discrete State Space Diffusion Model Wenjia Xie et.al. 2410.23994 null
2024-10-31 Stochastic Reconstruction of Gappy Lagrangian Turbulent Signals by Conditional Diffusion Models Tianyi Li et.al. 2410.23971 link
2024-10-31 Image Synthesis with Class-Aware Semantic Diffusion Models for Surgical Scene Segmentation Yihang Zhou et.al. 2410.23962 null
2024-10-31 Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model Hao Zhang et.al. 2410.23905 link
2024-10-31 DiffBatt: A Diffusion Model for Battery Degradation Prediction and Synthesis Hamidreza Eivazi et.al. 2410.23893 link
2024-10-31 Denoising Diffusion Models for Anomaly Localization in Medical Images Cosmin I. Bercea et.al. 2410.23834 null
2024-10-31 Disentangling Disentangled Representations: Towards Improved Latent Units via Diffusion Models Youngjun Jun et.al. 2410.23820 null
2024-10-31 EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like Sketching Xinwang Chen et.al. 2410.23788 link
2024-10-31 On Learning Multi-Modal Forgery Representation for Diffusion Generated Video Detection Xiufeng Song et.al. 2410.23623 link
2024-10-31 There and Back Again: On the relation between noises, images, and their inversions in diffusion models Łukasz Staniszewski et.al. 2410.23530 null
2024-10-30 MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts Jie Zhu et.al. 2410.23332 null
2024-10-30 ReferEverything: Towards Segmenting Everything We Can Speak of in Videos Anurag Bagchi et.al. 2410.23287 null
2024-10-30 Provable acceleration for diffusion models under minimal assumptions Gen Li et.al. 2410.23285 null
2024-10-30 RelationBooth: Towards Relation-Aware Customized Object Generation Qingyu Shi et.al. 2410.23280 null
2024-10-30 SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation Yining Hong et.al. 2410.23277 null
2024-10-30 Multi-student Diffusion Distillation for Better One-step Generators Yanke Song et.al. 2410.23274 null
2024-10-30 CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for Adversarial Defense Mingkun Zhang et.al. 2410.23091 link
2024-10-30 Controlling Language and Diffusion Models by Transporting Activations Pau Rodriguez et.al. 2410.23054 link
2024-10-30 Improving Musical Accompaniment Co-creation via Diffusion Transformers Javier Nistal et.al. 2410.23005 null
2024-10-30 DexGraspNet 2.0: Learning Generative Dexterous Grasping in Large-scale Synthetic Cluttered Scenes Jialiang Zhang et.al. 2410.23004 null
2024-10-30 LumiSculpt: A Consistency Lighting Control Network for Video Generation Yuxin Zhang et.al. 2410.22979 null
2024-10-30 Private Synthetic Text Generation with Diffusion Models Sebastian Ochs et.al. 2410.22971 link
2024-10-31 DiffLight: A Partial Rewards Conditioned Diffusion Model for Traffic Signal Control with Missing Data Hanyang Chen et.al. 2410.22938 link
2024-10-30 HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models Shengkai Zhang et.al. 2410.22901 link
2024-10-30 Latent Diffusion, Implicit Amplification: Efficient Continuous-Scale Super-Resolution for Remote Sensing Images Hanlin Wu et.al. 2410.22830 link
2024-10-30 Diffusion Beats Autoregressive: An Evaluation of Compositional Generation in Text-to-Image Models Arash Marioriyad et.al. 2410.22775 null
2024-10-30 FuseAnyPart: Diffusion-Driven Facial Parts Swapping via Multiple Reference Images Zheng Yu et.al. 2410.22771 link
2024-10-31 Consistency Diffusion Bridge Models Guande He et.al. 2410.22637 null
2024-10-29 Stochastic Trajectories and Spectral Boundary Conditions for Enhanced Diffusion in Immersed Boundary Problems Rômulo Damasclin Chaves dos Santos et.al. 2410.22579 null
2024-10-29 Unpicking Data at the Seams: VAEs, Disentanglement and Independent Components Carl Allen et.al. 2410.22559 null
2024-10-31 FairSkin: Fair Diffusion for Skin Disease Image Generation Ruichen Zhang et.al. 2410.22551 null
2024-10-28 On Inductive Biases That Enable Generalization of Diffusion Transformers Jie An et.al. 2410.21273 link
2024-10-28 One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation Zhendong Wang et.al. 2410.21257 null
2024-10-28 On learning higher-order cumulants in diffusion models Gert Aarts et.al. 2410.21212 null
2024-10-28 Extrapolating Prospective Glaucoma Fundus Images through Diffusion Model in Irregular Longitudinal Sequences Zhihao Zhao et.al. 2410.21130 null
2024-10-28 Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models Wenda Li et.al. 2410.21088 link
2024-10-28 Federated Time Series Generation on Feature and Temporally Misaligned Data Chenrui Fan et.al. 2410.21072 null
2024-10-28 Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework Vladimir Arkhipkin et.al. 2410.21061 link
2024-10-28 Beyond Autoregression: Fast LLMs via Self-Distillation Through Time Justin Deschenaux et.al. 2410.21035 link
2024-10-29 EEG-Driven 3D Object Reconstruction with Color Consistency and Diffusion Prior Xin Xiang et.al. 2410.20981 null
2024-10-28 Attention Overlap Is Responsible for The Entity Missing Problem in Text-to-image Diffusion Models! Arash Marioriyad et.al. 2410.20972 null
2024-10-28 Diff-Instruct*: Towards Human-Preferred One-step Text-to-image Generative Models Weijian Luo et.al. 2410.20898 link
2024-10-28 Novel Object Synthesis via Adaptive Text-Image Harmony Zeren Xiong et.al. 2410.20823 null
2024-10-28 Development of a conditional diffusion model to predict process parameters and microstructures of dendrite crystals of matrix resin based on mechanical properties Arisa Ikeda et.al. 2410.20822 null
2024-10-28 Reprogramming Pretrained Target-Specific Diffusion Models for Dual-Target Drug Design Xiangxin Zhou et.al. 2410.20688 link
2024-10-27 TabDiff: a Multi-Modal Diffusion Model for Tabular Data Generation Juntong Shi et.al. 2410.20626 link
2024-10-27 Generator Matching: Generative modeling with arbitrary Markov processes Peter Holderrieth et.al. 2410.20587 null
2024-10-27 Hamiltonian Score Matching and Generative Flows Peter Holderrieth et.al. 2410.20470 null
2024-10-27 Lodge++: High-quality and Long Dance Generation with Vivid Choreography Patterns Ronghui Li et.al. 2410.20389 null
2024-10-27 Conditional GAN for Enhancing Diffusion Models in Efficient and Authentic Global Gesture Generation from Audios Yongkang Cheng et.al. 2410.20359 null
2024-10-26 MarDini: Masked Autoregressive Diffusion for Video Generation at Scale Haozhe Liu et.al. 2410.20280 null
2024-10-24 MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms Ling-Hao Chen et.al. 2410.18977 null
2024-10-24 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation Hansheng Chen et.al. 2410.18974 link
2024-10-24 On the Crucial Role of Initialization for Matrix Factorization Bingcong Li et.al. 2410.18965 null
2024-10-24 Stable Consistency Tuning: Understanding and Improving Consistency Models Fu-Yun Wang et.al. 2410.18958 link
2024-10-24 Generation of synthetic financial time series by diffusion models Tomonori Takahashi et.al. 2410.18897 null
2024-10-24 The Cat and Mouse Game: The Ongoing Arms Race Between Diffusion Models and Detection Methods Linda Laurier et.al. 2410.18866 null
2024-10-24 Multi-Scale Diffusion: Enhancing Spatial Layout in High-Resolution Panoramic Image Generation Xiaoyu Zhang et.al. 2410.18830 null
2024-10-24 Fast constrained sampling in pre-trained diffusion models Alexandros Graikos et.al. 2410.18804 null
2024-10-24 Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances Shilin Lu et.al. 2410.18775 link
2024-10-25 Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing Haonan Lin et.al. 2410.18756 null
2024-10-24 Rectified Diffusion Guidance for Conditional Generation Mengfei Xia et.al. 2410.18737 null
2024-10-24 Retrieval-Augmented Diffusion Models for Time Series Forecasting Jingwei Liu et.al. 2410.18712 link
2024-10-24 Ali-AUG: Innovative Approaches to Labeled Data Augmentation using One-Step Diffusion Model Ali Hamza et.al. 2410.18678 null
2024-10-24 DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation Yuang Ai et.al. 2410.18666 link
2024-10-25 Diffusion Attribution Score: Evaluating Training Data Influence in Diffusion Model Jinxu Lin et.al. 2410.18639 null
2024-10-24 SMITE: Segment Me In TimE Amirhossein Alimohammadi et.al. 2410.18538 link
2024-10-24 Beyond Color and Lines: Zero-Shot Style-Specific Image Variations with Coordinated Semantics Jinghao Hu et.al. 2410.18537 null
2024-10-24 Scaling up Masked Diffusion Models on Text Shen Nie et.al. 2410.18514 link
2024-10-24 FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling Zhengqiang Zhang et.al. 2410.18410 link
2024-10-23 DMTG: A Human-Like Mouse Trajectory Generation Bot Based on Entropy-Controlled Diffusion Networks Jiahua Liu et.al. 2410.18233 null
2024-10-23 DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes Hengwei Bian et.al. 2410.18084 null
2024-10-23 Prioritized Generative Replay Renhao Wang et.al. 2410.18082 null
2024-10-23 Optical Generative Models Shiqi Chen et.al. 2410.17970 null
2024-10-23 A Wavelet Diffusion GAN for Image Super-Resolution Lorenzo Aloisi et.al. 2410.17966 null
2024-10-23 Addressing Asynchronicity in Clinical Multimodal Fusion via Individualized Chest X-ray Generation Wenfang Yao et.al. 2410.17918 link
2024-10-23 Scaling Diffusion Language Models via Adaptation from Autoregressive Models Shansan Gong et.al. 2410.17891 link
2024-10-23 Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech Danilo de Oliveira et.al. 2410.17834 null
2024-10-23 PGDiffSeg: Prior-Guided Denoising Diffusion Model with Parameter-Shared Attention for Breast Cancer Segmentation Feiyan Feng et.al. 2410.17812 null
2024-10-23 AdaDiffSR: Adaptive Region-aware Dynamic Acceleration Diffusion Model for Real-World Image Super-Resolution Yuanting Fan et.al. 2410.17752 null
2024-10-23 VISAGE: Video Synthesis using Action Graphs for Surgery Yousef Yeganeh et.al. 2410.17751 null
2024-10-23 Deep Generative Models for 3D Medical Image Synthesis Paul Friedrich et.al. 2410.17664 null
2024-10-23 Towards Effective Data-Free Knowledge Distillation via Diverse Diffusion Augmentation Muquan Li et.al. 2410.17606 link
2024-10-23 How to Continually Adapt Text-to-Image Diffusion Models for Flexible Customization? Jiahua Dong et.al. 2410.17594 link
2024-10-23 GDDA: Semantic OOD Detection on Graphs under Covariate Shift via Score-Based Diffusion Models Zhixia He et.al. 2410.17526 null
2024-10-23 Physics-driven AI for Channel Estimation in Cellular Network Xiaoqian Qi et.al. 2410.17525 null
2024-10-23 Diffusion Priors for Variational Likelihood Estimation and Image Denoising Jun Cheng et.al. 2410.17521 link
2024-10-23 Univariate Conditional Variational Autoencoder for Morphogenic Patterns Design in Frontal Polymerization-Based Manufacturing Qibang Liu et.al. 2410.17518 link
2024-10-22 EEG-DIF: Early Warning of Epileptic Seizures through Generative Diffusion Model-based Multi-channel EEG Signals Forecasting Zekun Jiang et.al. 2410.17343 link
2024-10-22 Reinforcement learning on structure-conditioned categorical diffusion for protein inverse folding Yasha Ektefaie et.al. 2410.17173 link
2024-10-22 DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization Haowei Zhu et.al. 2410.16942 null
2024-10-21 MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors Honghua Chen et.al. 2410.16272 null
2024-10-21 A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data Simon Deltadahl et.al. 2410.16177 null
2024-10-22 Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models Giannis Daras et.al. 2410.16152 null
2024-10-21 SeaDAG: Semi-autoregressive Diffusion for Conditional Directed Acyclic Graph Generation Xinyi Zhou et.al. 2410.16119 null
2024-10-21 Continuous Speech Synthesis using per-token Latent Diffusion Arnon Turetzky et.al. 2410.16048 null
2024-10-22 CamI2V: Camera-Controlled Image-to-Video Diffusion Model Guangcong Zheng et.al. 2410.15957 link
2024-10-21 Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces Jifeng Hu et.al. 2410.15698 null
2024-10-21 Erasing Undesirable Concepts in Diffusion Models with Adversarial Preservation Anh Bui et.al. 2410.15618 link
2024-10-20 Data Augmentation via Diffusion Model to Enhance AI Fairness Christina Hastings Blow et.al. 2410.15470 null
2024-10-20 MedDiff-FM: A Diffusion-based Foundation Model for Versatile Medical Image Applications Yongrui Yu et.al. 2410.15432 null
2024-10-20 ConSinger: Efficient High-Fidelity Singing Voice Generation with Minimal Steps Yulin Song et.al. 2410.15342 null
2024-10-20 Diffusion-PINN Sampler Zhekun Shi et.al. 2410.15336 null
2024-10-20 FoMo: A Foundation Model for Mobile Traffic Forecasting with Diffusion Model Haoye Chai et.al. 2410.15322 null
2024-10-20 FastSTI: A Fast Conditional Pseudo Numerical Diffusion Model for Spatio-temporal Traffic Data Imputation Shaokang Cheng et.al. 2410.15248 null
2024-10-19 Retrieval Augmented Diffusion Model for Structure-informed Antibody Design and Optimization Zichen Wang et.al. 2410.15040 null
2024-10-19 DiffuseST: Unleashing the Capability of the Diffusion Model for Style Transfer Ying Hu et.al. 2410.15007 link
2024-10-19 Attack as Defense: Run-time Backdoor Implantation for Image Content Protection Haichuan Zhang et.al. 2410.14966 link
2024-10-19 Straightness of Rectified Flow: A Theoretical Insight into Wasserstein Convergence Vansh Bansal et.al. 2410.14949 link
2024-10-19 ImmerseDiffusion: A Generative Spatial Audio Latent Diffusion Model Mojtaba Heydari et.al. 2410.14945 null
2024-10-19 Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step Mingyuan Zhou et.al. 2410.14919 link
2024-10-17 Diffusing States and Matching Scores: A New Framework for Imitation Learning Runzhe Wu et.al. 2410.13855 link
2024-10-17 Influence Functions for Scalable Data Attribution in Diffusion Models Bruno Mlodozeniec et.al. 2410.13850 null
2024-10-17 Deep Generative Models Unveil Patterns in Medical Images Through Vision-Language Conditioning Xiaodan Xing et.al. 2410.13823 link
2024-10-17 ConsisSR: Delving Deep into Consistency in Diffusion-based Image Super-Resolution Junhao Gu et.al. 2410.13807 null
2024-10-17 Probing the Latent Hierarchical Structure of Data via Diffusion Models Antonio Sclocchi et.al. 2410.13770 null
2024-10-17 Theory on Score-Mismatched Diffusion Models and Zero-Shot Conditional Samplers Yuchen Liang et.al. 2410.13746 null
2024-10-17 Improved Convergence Rate for Diffusion Probabilistic Models Gen Li et.al. 2410.13738 null
2024-10-18 DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation Hanbo Cheng et.al. 2410.13726 link
2024-10-18 Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion Yijun Liang et.al. 2410.13674 link
2024-10-17 Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design Chenyu Wang et.al. 2410.13643 link
2024-10-17 Preference Aligned Diffusion Planner for Quadrupedal Locomotion Control Xinyi Yuan et.al. 2410.13586 null
2024-10-17 Can Medical Vision-Language Pre-training Succeed with Purely Synthetic Data? Che Liu et.al. 2410.13523 null
2024-10-17 Solving Prior Distribution Mismatch in Diffusion Models via Optimal Transport Zhanpeng Wang et.al. 2410.13431 null
2024-10-17 MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models Donghao Zhou et.al. 2410.13370 null
2024-10-17 DiffImp: Efficient Diffusion Model for Probabilistic Time Series Imputation with Bidirectional Mamba Backbone Hongfan Gao et.al. 2410.13338 null
2024-10-17 FDF: Flexible Decoupled Framework for Time Series Forecasting with Conditional Denoising and Polynomial Modeling Jintao Zhang et.al. 2410.13253 link
2024-10-17 Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration Yun-Yen Chuang et.al. 2410.13201 link
2024-10-17 TCP-Diffusion: A Multi-modal Diffusion Model for Global Tropical Cyclone Precipitation Forecasting with Change Awareness Cheng Huang et.al. 2410.13175 link
2024-10-17 Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance Jiwan Hur et.al. 2410.13136 link
2024-10-17 Boosting Imperceptibility of Stable Diffusion-based Adversarial Examples Generation with Momentum Nashrah Haque et.al. 2410.13122 link
2024-10-16 Meta-Unlearning on Diffusion Models: Preventing Relearning Unlearned Concepts Hongcheng Gao et.al. 2410.12777 link
2024-10-16 SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation Jaehong Yoon et.al. 2410.12761 null
2024-10-16 Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization Xingqi Wang et.al. 2410.12700 link
2024-10-16 AdaptiveDrag: Semantic-Driven Dragging on Diffusion-Based Image Editing DuoSheng Chen et.al. 2410.12696 link
2024-10-16 One Step Diffusion via Shortcut Models Kevin Frans et.al. 2410.12557 link
2024-10-16 Disentangling data distribution for Federated Learning Xinyuan Zhao et.al. 2410.12530 null
2024-10-16 Shaping a Stabilized Video by Mitigating Unintended Changes for Concept-Augmented Video Editing Mingce Guo et.al. 2410.12526 null
2024-10-16 Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective Yongxin Zhu et.al. 2410.12490 link
2024-10-16 DaDiff: Domain-aware Diffusion Model for Nighttime UAV Tracking Haobo Zuo et.al. 2410.12270 link
2024-10-16 FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio Generation Huadai Liu et.al. 2410.12266 null
2024-10-16 Preference Optimization with Multi-Sample Comparisons Chaoqi Wang et.al. 2410.12138 null
2024-10-15 DDIL: Improved Diffusion Distillation With Imitation Learning Risheek Garrepalli et.al. 2410.11971 null
2024-10-15 CtrlSynth: Controllable Image Text Synthesis for Data-Efficient Multimodal Learning Qingqing Cao et.al. 2410.11963 null
2024-10-15 High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion Junhwa Hur et.al. 2410.11838 null
2024-10-15 On the Effectiveness of Dataset Alignment for Fake Image Detection Anirudh Sundara Rajan et.al. 2410.11835 null
2024-10-15 Bayesian Experimental Design via Contrastive Diffusions Jacopo Iollo et.al. 2410.11826 link
2024-10-15 Improving Long-Text Alignment for Text-to-Image Diffusion Models Luping Liu et.al. 2410.11817 link
2024-10-15 SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing Zhiyuan Zhang et.al. 2410.11815 null
2024-10-16 Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices Zhiyuan Ma et.al. 2410.11795 null
2024-10-15 Patch-Based Diffusion Models Beat Whole-Image Models for Mismatched Distribution Inverse Problems Jason Hu et.al. 2410.11730 null
2024-10-14 Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models Jingzhi Bao et.al. 2410.10821 link
2024-10-14 Depth Any Video with Scalable Synthetic Data Honghui Yang et.al. 2410.10815 link
2024-10-14 HART: Efficient Visual Generation with Hybrid Autoregressive Transformer Haotian Tang et.al. 2410.10812 link
2024-10-14 TrajDiffuse: A Conditional Diffusion Model for Environment-Aware Trajectory Prediction Qingze et.al. 2410.10804 link
2024-10-14 Boosting Camera Motion Control for Video Diffusion Transformers Soon Yau Cheong et.al. 2410.10802 null
2024-10-14 Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations Litu Rout et.al. 2410.10792 null
2024-10-14 ControlMM: Controllable Masked Motion Generation Ekkasit Pinyoanuntapong et.al. 2410.10780 null
2024-10-14 Adaptive Diffusion Terrain Generator for Autonomous Uneven Terrain Navigation Youwei Yu et.al. 2410.10766 link
2024-10-14 DragEntity: Trajectory Guided Video Generation using Entity and Positional Relationships Zhang Wan et.al. 2410.10751 null
2024-10-14 FlexGen: Flexible Multi-View Generation from Text and Image Inputs Xinli Xu et.al. 2410.10745 null
2024-10-14 Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models Junyu Chen et.al. 2410.10733 link
2024-10-14 TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model Jiazhi Guan et.al. 2410.10696 null
2024-10-14 Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation Peiwen Sun et.al. 2410.10676 null
2024-10-14 Generating Model Parameters for Controlling: Parameter Diffusion for Controllable Multi-Task Recommendation Chenglei Shen et.al. 2410.10639 null
2024-10-15 SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers Enze Xie et.al. 2410.10629 null
2024-10-14 UniGEM: A Unified Approach to Generation and Property Prediction for Molecules Shikun Feng et.al. 2410.10516 null
2024-10-14 Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing Kejie Wang et.al. 2410.10496 link
2024-10-14 An efficient numerical method for American options and their Greeks under the two-asset Kou jump-diffusion model Karel J. in ‘t Hout et.al. 2410.10444 null
2024-10-14 Towards Reliable Verification of Unauthorized Data Usage in Personalized Text-to-Image Diffusion Models Boheng Li et.al. 2410.10437 link
2024-10-14 DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model Songen Gu et.al. 2410.10429 null
2024-10-10 DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models Xiaoxiao He et.al. 2410.08207 null
2024-10-10 HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation Shanyan Guan et.al. 2410.08192 null
2024-10-10 DifFRelight: Diffusion-Based Facial Performance Relighting Mingming He et.al. 2410.08188 null
2024-10-10 ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion Zitian Zhang et.al. 2410.08168 link
2024-10-10 DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation Jiatao Gu et.al. 2410.08159 null
2024-10-10 Progressive Autoregressive Video Diffusion Models Desai Xie et.al. 2410.08151 link
2024-10-10 Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction Jarrid Rector-Brooks et.al. 2410.08134 null
2024-10-10 Unstable Unlearning: The Hidden Risk of Concept Resurgence in Diffusion Models Vinith M. Suriyakumar et.al. 2410.08074 null
2024-10-10 LADIMO: Face Morph Generation through Biometric Template Inversion with Latent Diffusion Marcel Grimmer et.al. 2410.07988 link
2024-10-10 AI Surrogate Model for Distributed Computing Workloads David K. Park et.al. 2410.07940 null
2024-10-10 Generated Bias: Auditing Internal Bias Dynamics of Text-To-Image Generative Models Abhishek Mandal et.al. 2410.07884 null
2024-10-10 FDDM: Frequency-Decomposed Diffusion Model for Rectum Cancer Dose Prediction in Radiotherapy Xin Liao et.al. 2410.07876 null
2024-10-10 RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation Songming Liu et.al. 2410.07864 link
2024-10-10 MinorityPrompt: Text to Minority Image Generation via Prompt Optimization Soobin Um et.al. 2410.07838 link
2024-10-10 Simulating images of radio galaxies with diffusion models Tobias Vičánek Martínez et.al. 2410.07794 link
2024-10-10 $\textit{Jump Your Steps}$ : Optimizing Sampling Schedule of Discrete Diffusion Models Yong-Hyun Park et.al. 2410.07761 null
2024-10-10 Synthesizing Multi-Class Surgical Datasets with Anatomy-Aware Diffusion Models Danush Kumar Venkatesh et.al. 2410.07753 link
2024-10-10 Flow control-oriented coherent mode prediction via Grassmann-kNN manifold learning Hongfu Zhang et.al. 2410.07683 null
2024-10-10 Relational Diffusion Distillation for Efficient Image Generation Weilun Feng et.al. 2410.07679 link
2024-10-10 MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion Onkar Susladkar et.al. 2410.07659 link
2024-10-09 IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation Xinchen Zhang et.al. 2410.07171 link
2024-10-09 AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation Yukang Cao et.al. 2410.07164 null
2024-10-09 InstructG2I: Synthesizing Images from Multimodal Attributed Graphs Bowen Jin et.al. 2410.07157 link
2024-10-09 Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis Bohan Zeng et.al. 2410.07155 link
2024-10-09 Diffusion Density Estimators Akhil Premkumar et.al. 2410.06986 null
2024-10-09 Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control Shimon Vainer et.al. 2410.06985 null
2024-10-09 Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think Sihyun Yu et.al. 2410.06940 link
2024-10-09 Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis Ahmed Abdullah et.al. 2410.06841 null
2024-10-09 Diffuse or Confuse: A Diffusion Deepfake Speech Dataset Anton Firc et.al. 2410.06796 link
2024-10-09 Diff-FMT: Diffusion Models for Fluorescence Molecular Tomography Qianqian Xue et.al. 2410.06757 null
2024-10-10 Suppress Content Shift: Better Diffusion Features via Off-the-Shelf Generation Techniques Benyuan Meng et.al. 2410.06719 link
2024-10-09 Decouple-Then-Merge: Towards Better Training for Diffusion Models Qianli Ma et.al. 2410.06664 null
2024-10-09 Chemistry-Inspired Diffusion with Non-Differentiable Guidance Yuchen Shen et.al. 2410.06502 null
2024-10-09 HFH-Font: Few-shot Chinese Font Synthesis with Higher Quality, Faster Speed, and Higher Resolution Hua Li et.al. 2410.06488 link
2024-10-08 Generative Artificial Intelligence (GAI) for Mobile Communications: A Diffusion Model Perspective Xiaoxia Xu et.al. 2410.06389 link
2024-10-08 SymDiff: Equivariant Diffusion via Stochastic Symmetrisation Leo Zhang et.al. 2410.06262 null
2024-10-08 Story-Adapter: A Training-free Iterative Framework for Long Story Visualization Jiawei Mao et.al. 2410.06244 null
2024-10-08 Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach Sha Guo et.al. 2410.06149 null
2024-10-08 AP-LDM: Attentive and Progressive Latent Diffusion Model for Training-Free High-Resolution Image Generation Boyuan Cao et.al. 2410.06055 link
2024-10-08 Sparse Repellency for Shielded Generation in Text-to-image Diffusion Models Michael Kirchhof et.al. 2410.06025 null
2024-10-07 DART: A Diffusion-Based Autoregressive Motion Model for Real-Time Text-Driven Motion Control Kaifeng Zhao et.al. 2410.05260 null
2024-10-07 GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting Yukang Cao et.al. 2410.05259 null
2024-10-07 SePPO: Semi-Policy Preference Optimization for Diffusion Alignment Daoan Zhang et.al. 2410.05255 link
2024-10-07 DiffuseReg: Denoising Diffusion Model for Obtaining Deformation Fields in Unsupervised Deformable Image Registration Yongtai Zhuo et.al. 2410.05234 link
2024-10-07 Presto! Distilling Steps and Layers for Accelerating Music Generation Zachary Novack et.al. 2410.05167 null
2024-10-07 A Simulation-Free Deep Learning Approach to Stochastic Optimal Control Mengjian Hua et.al. 2410.05163 null
2024-10-07 Leveraging Multimodal Diffusion Models to Accelerate Imaging with Side Information Timofey Efimov et.al. 2410.05143 null
2024-10-07 Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning Ayano Hiranaka et.al. 2410.05116 null
2024-10-07 DreamSat: Towards a General 3D Model for Novel View Synthesis of Space Objects Nidhi Mathihalli et.al. 2410.05097 link
2024-10-07 A nodally bound-preserving discontinuous Galerkin method for the drift-diffusion equation Gabriel R. Barrenechea et.al. 2410.05040 null
2024-10-07 Revealing Directions for Text-guided 3D Face Editing Zhuo Chen et.al. 2410.04965 null
2024-10-07 Low-Rank Continual Personalization of Diffusion Models Łukasz Staniszewski et.al. 2410.04891 link
2024-10-07 Patch is Enough: Naturalistic Adversarial Patch against Vision-Language Pre-training Models Dehong Kong et.al. 2410.04884 null
2024-10-07 Real-time cardiac cine MRI – A comparison of a diffusion probabilistic model with alternative state-of-the-art image reconstruction techniques for undersampled spiral acquisitions Oliver Schad et.al. 2410.04843 link
2024-10-07 Learning Efficient and Effective Trajectories for Differential Equation-based Image Restoration Zhiyu Zhu et.al. 2410.04811 link
2024-10-07 FedBiP: Heterogeneous One-Shot Federated Learning with Personalized Latent Diffusion Models Haokun Chen et.al. 2410.04810 null
2024-10-07 Data-driven Diffusion Models for Enhancing Safety in Autonomous Vehicle Traffic Simulations Jinxiong Lu et.al. 2410.04809 null
2024-10-07 Stochastic Runge-Kutta Methods: Provable Acceleration of Diffusion Models Yuchen Wu et.al. 2410.04760 null
2024-10-07 Numerical analysis of American option pricing in a two-asset jump-diffusion model Hao Zhou et.al. 2410.04745 null
2024-10-07 Diffusion Models in 3D Vision: A Survey Zhen Wang et.al. 2410.04738 null
2024-10-03 Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models Zhengfeng Lai et.al. 2410.02740 null
2024-10-03 SteerDiff: Steering towards Safe Text-to-Image Diffusion Models Hongxiang Zhang et.al. 2410.02710 null
2024-10-03 ControlAR: Controllable Image Generation with Autoregressive Models Zongming Li et.al. 2410.02705 link
2024-10-03 GUD: Generation with Unified Diffusion Mathis Gerdes et.al. 2410.02667 null
2024-10-03 Efficient calibration of the shifted square-root diffusion model to credit default swap spreads using asymptotic approximations Ankush Agarwal et.al. 2410.02645 null
2024-10-04 Diffusion Models are Evolutionary Algorithms Yanbo Zhang et.al. 2410.02543 link
2024-10-03 Lightweight Diffusion Models for Resource-Constrained Semantic Communication Giovanni Pignata et.al. 2410.02491 link
2024-10-03 Towards a Theoretical Understanding of Memorization in Diffusion Models Yunhao Chen et.al. 2410.02467 null
2024-10-03 Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models Seyedmorteza Sadat et.al. 2410.02416 null
2024-10-03 Diffusion Meets Options: Hierarchical Generative Skill Composition for Temporally-Extended Tasks Zeyu Feng et.al. 2410.02389 null
2024-10-04 Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation Muzhi Zhu et.al. 2410.02369 link
2024-10-03 Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis Zikun Zhang et.al. 2410.02321 null
2024-10-03 Channel-aware Contrastive Conditional Diffusion for Multivariate Probabilistic Time Series Forecasting Siyang Li et.al. 2410.02168 link
2024-10-03 SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model Xinlei Niu et.al. 2410.02144 null
2024-10-03 MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation Trung X. Pham et.al. 2410.02130 null
2024-10-03 SC-CDM: Enhancing Quality of Image Semantic Communication with a Compact Diffusion Model Kexin Zhang et.al. 2410.02121 null
2024-10-02 Stochastic Deep Restoration Priors for Imaging Inverse Problems Yuyang Hu et.al. 2410.02057 null
2024-10-02 Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data Sreyan Ghosh et.al. 2410.02056 link
2024-10-02 Using Style Ambiguity Loss to Improve Aesthetics of Diffusion Models James Baker et.al. 2410.02055 link
2024-10-02 Discrete Copula Diffusion Anji Liu et.al. 2410.01949 null
2024-10-02 FabricDiffusion: High-Fidelity Texture Transfer for 3D Garments Generation from In-The-Wild Clothing Images Cheng Zhang et.al. 2410.01801 null
2024-10-02 Dynamical-generative downscaling of climate model ensembles Ignacio Lopez-Gomez et.al. 2410.01776 null
2024-10-02 ImageFolder: Autoregressive Image Generation with Folded Tokens Xiang Li et.al. 2410.01756 link
2024-10-02 VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models Kailai Feng et.al. 2410.01738 link
2024-10-02 HarmoniCa: Harmonizing Training and Inference for Better Feature Cache in Diffusion Transformer Acceleration Yushi Huang et.al. 2410.01723 link
2024-10-02 KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models Pouyan Navard et.al. 2410.01595 link
2024-10-02 MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video Generation Mingzhen Sun et.al. 2410.01594 link
2024-10-02 HRTF Estimation using a Score-based Prior Etienne Thuillier et.al. 2410.01562 null
2024-10-02 Edge-preserving noise for diffusion models Jente Vandersanden et.al. 2410.01540 null
2024-10-02 Information-Theoretical Principled Trade-off between Jailbreakability and Stealthiness on Vision Language Models Ching-Chia Kao et.al. 2410.01438 null
2024-10-02 Harnessing the Latent Diffusion Model for Training-Free Image Style Transfer Kento Masui et.al. 2410.01366 null
2024-10-02 Aggregation of Multi Diffusion Models for Enhancing Learned Representations Conghan Yue et.al. 2410.01262 link
2024-10-02 Generative Diffusion-based Contract Design for Efficient AI Twins Migration in Vehicular Embodied AI Networks Yue Zhong et.al. 2410.01176 null
2024-10-02 Text2PDE: Latent Diffusion Models for Accessible Physics Simulation Anthony Zhou et.al. 2410.01153 link
2024-10-02 Flex3D: Feed-Forward 3D Generation With Flexible Reconstruction Model And Input View Curation Junlin Han et.al. 2410.00890 null
2024-10-01 Diffusion-Informed Probabilistic Contact Search for Multi-Finger Manipulation Abhinav Kumar et.al. 2410.00841 null
2024-10-01 Absorbing State Phase Transitions and Stability of Long-Range Coherence in Dissipative Quantum State Preparation Matthew Wampler et.al. 2410.00819 null
2024-10-01 Modeling Neural Switching via Drift-Diffusion Models Nicholas Marco et.al. 2410.00781 link
2024-10-01 Improved Generation of Synthetic Imaging Data Using Feature-Aligned Diffusion Lakshmi Nair et.al. 2410.00731 link
2024-10-01 NECOMIMI: Neural-Cognitive Multimodal EEG-informed Image Generation with Diffusion Models Chi-Sheng Chen et.al. 2410.00712 null
2024-09-30 COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models Divyanshu Daiya et.al. 2409.20502 null
2024-09-30 FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing Lingling Cai et.al. 2409.20500 null
2024-09-30 Ensemble Kalman Diffusion Guidance: A Derivative-free Method for Inverse Problems Hongkai Zheng et.al. 2409.20175 null
2024-09-30 Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model Fulong Ma et.al. 2409.20164 null
2024-09-30 Conditional Diffusion Models are Minimax-Optimal and Manifold-Adaptive for Conditional Distribution Estimation Rong Tang et.al. 2409.20124 null
2024-09-30 Reaction-diffusion model for a population structured in phenotype and space I – Criterion for persistence Nathanaël Boutillon et.al. 2409.20118 null
2024-09-30 RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion Models Jangyeong Kim et.al. 2409.19989 null
2024-09-30 Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function Chenyi Zhuang et.al. 2409.19967 link
2024-09-30 Image Copy Detection for Diffusion Models Wenhao Wang et.al. 2409.19952 null
2024-09-30 Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner Chenyou Fan et.al. 2409.19949 null
2024-09-30 Replace Anyone in Videos Xiang Wang et.al. 2409.19911 link
2024-09-30 GameLabel-10K: Collecting Image Preference Data Through Mobile Game Crowdsourcing Jonathan Zhou et.al. 2409.19830 null
2024-09-29 Text-driven Human Motion Generation with Motion Masked Diffusion Model Xingyu Chen et.al. 2409.19686 null
2024-09-29 Simple and Fast Distillation of Diffusion Models Zhenyu Zhou et.al. 2409.19681 link
2024-09-29 SemiDDM-Weather: A Semi-supervised Learning Framework for All-in-one Adverse Weather Removal Fang Long et.al. 2409.19679 link
2024-09-29 Storynizor: Consistent Story Generation via Inter-Frame Synchronized and Shuffled ID Injection Yuhang Ma et.al. 2409.19624 null
2024-09-29 MCDDPM: Multichannel Conditional Denoising Diffusion Model for Unsupervised Anomaly Detection in Brain MRI Vivek Kumar Trivedi et.al. 2409.19623 link
2024-09-29 Causal Deciphering and Inpainting in Spatio-Temporal Dynamics via Diffusion Model Yifan Duan et.al. 2409.19608 null
2024-09-29 DiffCP: Ultra-Low Bit Collaborative Perception via Diffusion Model Ruiqing Mao et.al. 2409.19592 null
2024-09-29 Effective Diffusion Transformer Architecture for Image Super-Resolution Kun Cheng et.al. 2409.19589 link
2024-09-26 FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner Wenliang Zhao et.al. 2409.18128 link
2024-09-26 Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction Jing He et.al. 2409.18124 null
2024-09-26 EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation Jiaxiang Tang et.al. 2409.18114 null
2024-09-26 StackGen: Generating Stable Structures from Silhouettes via Diffusion Luzhe Sun et.al. 2409.18098 null
2024-09-26 DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models Helin Cao et.al. 2409.18092 null
2024-09-26 Stable Video Portraits Mirela Ostrek et.al. 2409.18083 null
2024-09-26 PhoCoLens: Photorealistic and Consistent Reconstruction in Lensless Imaging Xin Cai et.al. 2409.17996 null
2024-09-26 Joint Localization and Planning using Diffusion L. Lao Beyer et.al. 2409.17995 null
2024-09-26 CNCA: Toward Customizable and Natural Generation of Adversarial Camouflage for Vehicle Detectors Linye Lyu et.al. 2409.17963 link
2024-09-26 Relativistic diffusion model for hadron production in p-Pb collisions at the LHC Philipp Schulz et.al. 2409.17960 null
2024-09-26 Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative Criterion Hengrui Gu et.al. 2409.17928 link
2024-09-26 Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation Qihan Huang et.al. 2409.17920 link
2024-09-26 Continual learning with task specialist Indu Solomon et.al. 2409.17806 null
2024-09-26 Taming Diffusion Prior for Image Super-Resolution with Domain Shift SDEs Qinpeng Cui et.al. 2409.17778 link
2024-09-26 Text Image Generation for Low-Resource Languages with Dual Translation Learning Chihiro Noguchi et.al. 2409.17747 null
2024-09-26 AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status Jinghao Zhang et.al. 2409.17740 null
2024-09-26 Dark Miner: Defend against unsafe generation for text-to-image diffusion models Zheling Meng et.al. 2409.17682 null
2024-09-26 Self-Supervised Learning of Deviation in Latent Representation for Co-speech Gesture Video Generation Huan Yang et.al. 2409.17674 null
2024-09-26 ID $^3$ : Identity-Preserving-yet-Diversified Diffusion Models for Synthetic Face Recognition Shen Li et.al. 2409.17576 null
2024-09-26 Flexiffusion: Segment-wise Neural Architecture Search for Flexible Denoising Schedule Hongtao Huang et.al. 2409.17566 null
2024-09-25 DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion Yukun Huang et.al. 2409.17145 link
2024-09-25 Language-oriented Semantic Communication for Image Transmission with Fine-Tuned Diffusion Model Xinfeng Wei et.al. 2409.17104 null
2024-09-25 Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors Aiping Zhang et.al. 2409.17058 link
2024-09-25 ControlCity: A Multimodal Diffusion Model Based Approach for Accurate Geospatial Data Generation and Urban Morphology Analysis Fangshuo Zhou et.al. 2409.17049 link
2024-09-25 Dynamic Obstacle Avoidance through Uncertainty-Based Adaptive Planning with Diffusion Vineet Punyamoorty et.al. 2409.16950 null
2024-09-25 DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling Kyuheon Jung et.al. 2409.16949 link
2024-09-25 Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model Hongliang Zhong et.al. 2409.16938 link
2024-09-25 A Versatile and Differentiable Hand-Object Interaction Representation Théo Morales et.al. 2409.16855 null
2024-09-25 Analytical assessment of workers’ safety concerning direct and indirect ways of getting infected by dangerous pathogen Krzysztof Domino et.al. 2409.16809 null
2024-09-25 Layout-Corrector: Alleviating Layout Sticking Phenomenon in Discrete Diffusion Model Shoma Iwai et.al. 2409.16689 null
2024-09-25 CasFT: Future Trend Modeling for Information Popularity Prediction with Dynamic Cues-Driven Diffusion Models Xin Jing et.al. 2409.16619 null
2024-09-25 Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models Deepak Sridhar et.al. 2409.16535 link
2024-09-24 Diffusion Models to Enhance the Resolution of Microscopy Images: A Tutorial Harshith Bachimanchi et.al. 2409.16488 null
2024-09-24 Generative Factor Chaining: Coordinated Manipulation with Diffusion-based Factor Graph Utkarsh A. Mishra et.al. 2409.16275 null
2024-09-24 MaskBit: Embedding-free Image Generation via Bit Tokens Mark Weber et.al. 2409.16211 link
2024-09-24 MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling Yifang Men et.al. 2409.16160 null
2024-09-24 Spreading dynamics of a Fisher-KPP nonlocal diffusion model with a free boundary Lei Li et.al. 2409.16101 null
2024-09-24 PRESTO: Fast motion planning using diffusion models based on key-configuration environment representation Mingyo Seo et.al. 2409.16012 null
2024-09-24 Unleashing the Potential of Synthetic Images: A Study on Histopathology Image Classification Leire Benito-Del-Valle et.al. 2409.16002 link
2024-09-24 ASD-Diffusion: Anomalous Sound Detection with Diffusion Models Fengrun Zhang et.al. 2409.15957 null
2024-09-18 Massively Multi-Person 3D Human Motion Forecasting with Scene Context Felix B Mueller et.al. 2409.12189 link
2024-09-18 MoRAG – Multi-Fusion Retrieval Augmented Generation for Human Motion Kalakonda Sai Shashank et.al. 2409.12140 link
2024-09-18 Brain-Streams: fMRI-to-Image Reconstruction with Multi-modal Guidance Jaehoon Joo et.al. 2409.12099 null
2024-09-18 Denoising diffusion models for high-resolution microscopy image restoration Pamela Osuna-Vargas et.al. 2409.12078 null
2024-09-18 LEMON: Localized Editing with Mesh Optimization and Neural Shaders Furkan Mert Algan et.al. 2409.12024 null
2024-09-18 Generation of Complex 3D Human Motion by Temporal and Spatial Composition of Diffusion Models Lorenzo Mandelli et.al. 2409.11920 null
2024-09-18 DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech Xin Qi et.al. 2409.11835 null
2024-09-18 RaggeDi: Diffusion-based State Estimation of Disordered Rags, Sheets, Towels and Blankets Jikai Ye et.al. 2409.11831 null
2024-09-18 InverseMeetInsert: Robust Real Image Editing via Geometric Accumulation Inversion in Guided Diffusion Models Yan Zheng et.al. 2409.11734 null
2024-09-18 GUNet: A Graph Convolutional Network United Diffusion Model for Stable and Diversity Pose Generation Shuowen Liang et.al. 2409.11689 link
2024-09-18 Recurrent Interpolants for Probabilistic Time Series Prediction Yu Chen et.al. 2409.11684 null
2024-09-18 SRIF: Semantic Shape Registration Empowered by Diffusion-based Image Morphing and Flow Estimation Mingze Sun et.al. 2409.11682 link
2024-09-18 PainDiffusion: Can robot express pain? Quang Tien Dam et.al. 2409.11635 null
2024-09-17 Context-Generative Default Policy for Bounded Rational Agent Durgakant Pushp et.al. 2409.11604 null
2024-09-17 DiffESM: Conditional Emulation of Temperature and Precipitation in Earth System Models with 3D Diffusion Models Seth Bassetti et.al. 2409.11601 null
2024-09-17 Ultrasound Image Enhancement with the Variance of Diffusion Models Yuxin Zhang et.al. 2409.11380 link
2024-09-17 OSV: One Step is Enough for High-Quality Image to Video Generation Xiaofeng Mao et.al. 2409.11367 null
2024-09-17 Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think Gonzalo Martin Garcia et.al. 2409.11355 link
2024-09-17 OmniGen: Unified Image Generation Shitao Xiao et.al. 2409.11340 link
2024-09-17 fMRI-3D: A Comprehensive Dataset for Enhancing fMRI-based 3D Reconstruction Jianxiong Gao et.al. 2409.11315 null
2024-09-16 Incorporating Classifier-Free Guidance in Diffusion Model-Based Recommendation Noah Buchanan et.al. 2409.10494 null
2024-09-16 SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing Qi Qian et.al. 2409.10476 null
2024-09-16 MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion Lehong Wu et.al. 2409.10473 null
2024-09-16 Mamba-ST: State Space Model for Efficient Style Transfer Filippo Botti et.al. 2409.10385 link
2024-09-16 Taming Diffusion Models for Image Restoration: A Review Ziwei Luo et.al. 2409.10353 null
2024-09-16 Fairness, not Emotion, Drives Socioeconomic Decision Making Rudra Mukhopadhyay et.al. 2409.10322 null
2024-09-16 DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis Fa-Ting Hong et.al. 2409.10281 null
2024-09-16 RealDiff: Real-world 3D Shape Completion using Self-Supervised Diffusion Models Başak Melis Öcal et.al. 2409.10180 null
2024-09-16 PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion Peng Li et.al. 2409.10141 null
2024-09-16 DDoS: Diffusion Distribution Similarity for Out-of-Distribution Detection Kun Fang et.al. 2409.10094 null
2024-09-16 MotionCom: Automatic and Motion-Aware Image Composition with LLM and Video Diffusion Prior Weijing Tao et.al. 2409.10090 link
2024-09-16 Cross-modality image synthesis from TOF-MRA to CTA using diffusion-based models Alexander Koch et.al. 2409.10089 null
2024-09-16 StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion Yinghao Aaron Li et.al. 2409.10058 null
2024-09-16 AttnMod: Attention-Based New Art Styles Shih-Chieh Su et.al. 2409.10028 null
2024-09-15 GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion Vitor Guizilini et.al. 2409.09896 null
2024-09-15 Latent Diffusion Models for Controllable RNA Sequence Generation Kaixuan Huang et.al. 2409.09828 null
2024-09-15 E-Commerce Inpainting with Mask Guidance in Controlnet for Reducing Overcompletion Guandong Li et.al. 2409.09681 null
2024-09-15 EditBoard: Towards A Comprehensive Evaluation Benchmark for Text-based Video Editing Models Yupeng Chen et.al. 2409.09668 link
2024-09-15 Conditional sampling within generative diffusion models Zheng Zhao et.al. 2409.09650 link
2024-09-15 Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement Yudong Yang et.al. 2409.09642 null
2024-09-12 DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors Thomas Hanwen Zhu et.al. 2409.08278 null
2024-09-12 DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer Runjia Li et.al. 2409.08271 null
2024-09-12 Touch2Touch: Cross-Modal Tactile Generation for Object Manipulation Samanta Rodriguez et.al. 2409.08269 null
2024-09-12 Improving Text-guided Object Inpainting with Semantic Pre-inpainting Yifu Chen et.al. 2409.08260 link
2024-09-12 Improving Virtual Try-On with Garment-focused Diffusion Models Siqi Wan et.al. 2409.08258 link
2024-09-12 LoRID: Low-Rank Iterative Diffusion for Adversarial Purification Geigh Zollicoffer et.al. 2409.08255 null
2024-09-12 Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding Hongyu Li et.al. 2409.08251 null
2024-09-12 IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation Yinwei Wu et.al. 2409.08240 null
2024-09-12 LT3SD: Latent Trees for 3D Scene Diffusion Quan Meng et.al. 2409.08215 null
2024-09-12 VI3DRM:Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis Hao Chen et.al. 2409.08207 null
2024-09-12 MagicStyle: Portrait Stylization Based on Reference Image Zhaoli Deng et.al. 2409.08156 null
2024-09-12 EZIGen: Enhancing zero-shot subject-driven image generation with precise subject encoding and decoupled guidance Zicheng Duan et.al. 2409.08091 link
2024-09-12 Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation Junsung Lee et.al. 2409.08077 null
2024-09-12 AI-accelerated discovery of high critical temperature superconductors Xiao-Qi Han et.al. 2409.08065 link
2024-09-12 Scribble-Guided Diffusion for Training-free Text-to-Image Generation Seonho Lee et.al. 2409.08026 link
2024-09-13 Estimating Atmospheric Variables from Digital Typhoon Satellite Images via Conditional Denoising Diffusion Models Zhangyue Ling et.al. 2409.07961 link
2024-09-12 Detecting and Defending Against Adversarial Attacks on Automatic Speech Recognition via Diffusion Models Nikolai L. Kühne et.al. 2409.07936 link
2024-09-12 UGAD: Universal Generative AI Detector utilizing Frequency Fingerprints Inzamamul Alam et.al. 2409.07913 null
2024-09-12 XMOL: Explainable Multi-property Optimization of Molecules Aye Phyu Phyu Aung et.al. 2409.07786 null
2024-09-12 DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing Zhenyuan Dong et.al. 2409.07756 link
2024-09-11 DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation Haibo Yang et.al. 2409.07454 null
2024-09-11 Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models Haibo Yang et.al. 2409.07452 link
2024-09-11 FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process Yang Luo et.al. 2409.07451 null
2024-09-11 Efficient One-Step Diffusion Refinement for Snapshot Compressive Imaging Yunzhen Wang et.al. 2409.07417 null
2024-09-11 Training-Free Guidance for Discrete Diffusion Models for Molecular Generation Thomas J. Kerby et.al. 2409.07359 null
2024-09-11 Learning Robotic Manipulation Policies from Point Clouds with Conditional Flow Matching Eugenio Chisari et.al. 2409.07343 null
2024-09-11 Efficient and Unbiased Sampling of Boltzmann Distributions via Consistency Models Fengzhe Zhang et.al. 2409.07323 null
2024-09-11 Exploring User-level Gradient Inversion with a Diffusion Prior Zhuohang Li et.al. 2409.07291 null
2024-09-11 CCFExp: Facial Image Synthesis with Cycle Cross-Fusion Diffusion Model for Facial Paralysis Individuals Weixiang Gao et.al. 2409.07271 link
2024-09-11 Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models Sanoojan Baliah et.al. 2409.07269 link
2024-09-11 EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion Jian Zhang et.al. 2409.07255 link
2024-09-12 Alignment of Diffusion Models: Fundamentals, Challenges, and Future Buhua Liu et.al. 2409.07253 link
2024-09-11 Diff-VPS: Video Polyp Segmentation via a Multi-task Diffusion Network with Adversarial Temporal Reasoning Yingling Lu et.al. 2409.07238 link
2024-09-11 Phy124: Fast Physics-Driven 4D Content Generation from a Single Image Jiajing Lin et.al. 2409.07179 null
2024-09-11 Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models Jiahang Cao et.al. 2409.07163 null
2024-09-11 MVLLaVA: An Intelligent Agent for Unified and Flexible Novel View Synthesis Hanyu Jiang et.al. 2409.07129 null
2024-09-11 Bio-Eng-LMM AI Assist chatbot: A Comprehensive Tool for Research and Education Ali Forootani et.al. 2409.07110 link
2024-09-11 From optimal score matching to optimal sampling Zehao Dou et.al. 2409.07032 null
2024-09-11 CPSample: Classifier Protected Sampling for Guarding Training Data During Diffusion Joshua Kazdan et.al. 2409.07025 null
2024-09-11 Towards Predicting Temporal Changes in a Patient’s Chest X-ray Images based on Electronic Health Records Daeun Kyung et.al. 2409.07012 link
2024-09-05 Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding Yunze Man et.al. 2409.03757 link
2024-09-05 ArtiFade: Learning to Generate High-quality Subject from Blemished Images Shuya Yang et.al. 2409.03745 null
2024-09-05 RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images Benzhi Wang et.al. 2409.03644 link
2024-09-05 DiffEVC: Any-to-Any Emotion Voice Conversion with Expressive Guidance Hsing-Hang Chou et.al. 2409.03636 null
2024-09-05 TCDiff: Triple Condition Diffusion Model with 3D Constraints for Stylizing Synthetic Faces Bernardo Biesseck et.al. 2409.03600 link
2024-09-05 DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture Qianlong Xiang et.al. 2409.03550 link
2024-09-05 Blended Latent Diffusion under Attention Control for Real-World Video Editing Deyin Liu et.al. 2409.03514 null
2024-09-05 Data-free Distillation with Degradation-prompt Diffusion for Multi-weather Image Restoration Pei Wang et.al. 2409.03455 null
2024-09-05 Enhancing User-Centric Privacy Protection: An Interactive Framework through Diffusion Models and Machine Unlearning Huaxi Huang et.al. 2409.03326 null
2024-09-05 SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model Weipeng Tan et.al. 2409.03270 null
2024-09-05 RoomDiffusion: A Specialized Diffusion Model in the Interior Design Industry Zhaowei Wang et.al. 2409.03198 null
2024-09-04 Spatial Diffusion for Cell Layout Generation Chen Li et.al. 2409.03106 link
2024-09-04 How DREAMS are made: Emulating Satellite Galaxy and Subhalo Populations with Diffusion Models and Point Clouds Tri Nguyen et.al. 2409.02980 link
2024-09-06 HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts Xinyu Liu et.al. 2409.02919 link
2024-09-04 Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling Kaiwen Zheng et.al. 2409.02908 null
2024-09-04 Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models Zhibin Liu et.al. 2409.02851 link
2024-09-04 Multi-Track MusicLDM: Towards Versatile Music Generation with Latent Diffusion Model Tornike Karchkhadze et.al. 2409.02845 null
2024-09-04 Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects Kyungmin Jo et.al. 2409.02653 null
2024-09-04 MADiff: Motion-Aware Mamba Diffusion Models for Hand Trajectory Prediction on Egocentric Videos Junyi Ma et.al. 2409.02638 null
2024-09-05 Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency Jianwen Jiang et.al. 2409.02634 null
2024-09-04 Rate-Adaptive Generative Semantic Communication Using Conditional Diffusion Models Pujing Yang et.al. 2409.02597 null
2024-09-04 Solving Video Inverse Problems Using Image Diffusion Models Taesung Kwon et.al. 2409.02574 null
2024-09-04 StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models Wen Li et.al. 2409.02543 link
2024-09-04 Sample what you cant compress Vighnesh Birodkar et.al. 2409.02529 null
2024-09-04 Continual Diffuser (CoD): Mastering Continual Offline Reinforcement Learning with Experience Rehearsal Jifeng Hu et.al. 2409.02512 link
2024-09-04 Training-free Color-Style Disentanglement for Constrained Text-to-Image Synthesis Aishwarya Agarwal et.al. 2409.02429 null
2024-09-04 Diffusion Models Learn Low-Dimensional Distributions via Subspace Clustering Peng Wang et.al. 2409.02426 link
2024-09-04 Exploring Low-Dimensional Subspaces in Diffusion Models for Controllable Image Editing Siyi Chen et.al. 2409.02374 link
2024-09-03 QID $^2$ : An Image-Conditioned Diffusion Model for Q-space Up-sampling of DWI Data Zijian Chen et.al. 2409.02309 null
2024-09-03 FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation Takuhiro Kaneko et.al. 2409.02245 null
2024-09-05 LinFusion: 1 GPU, 1 Minute, 16K Image Songhua Liu et.al. 2409.02097 link
2024-09-03 DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos Wenbo Hu et.al. 2409.02095 link
2024-09-03 ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis Wangbo Yu et.al. 2409.02048 null
2024-08-30 Subspace Diffusion Posterior Sampling for Travel-Time Tomography Xiang Cao et.al. 2408.17333 null
2024-08-30 RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance Avideep Mukherjee et.al. 2408.17095 null
2024-08-30 Instant Adversarial Purification with Adversarial Consistency Distillation Chun Tong Lei et.al. 2408.17064 null
2024-08-30 Text-to-Image Generation Via Energy-Based CLIP Roy Ganz et.al. 2408.17046 null
2024-08-30 Contrastive Learning with Synthetic Positives Dewen Zeng et.al. 2408.16965 link
2024-08-29 Enabling Local Editing in Diffusion Models by Joint and Individual Component Analysis Theodoros Kouzelis et.al. 2408.16845 null
2024-08-29 ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model Fangfu Liu et.al. 2408.16767 null
2024-08-29 CSGO: Content-Style Composition in Text-to-Image Generation Peng Xing et.al. 2408.16766 null
2024-08-29 DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving Yongjie Fu et.al. 2408.16647 null
2024-08-29 RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model Zhuan Shi et.al. 2408.16634 null
2024-08-29 A Score-based Generative Solver for PDE-constrained Inverse Problems with Complex Priors Yankun Hong et.al. 2408.16626 null
2024-08-29 GRPose: Learning Graph Relations for Human Image Generation with Pose Priors Xiangchen Yin et.al. 2408.16540 link
2024-08-29 Spiking Diffusion Models Jiahang Cao et.al. 2408.16467 link
2024-08-29 What to Preserve and What to Transfer: Faithful, Identity-Preserving Diffusion-based Hairstyle Transfer Chaeyeon Chung et.al. 2408.16450 link
2024-08-29 COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation Jiefeng Li et.al. 2408.16426 null
2024-08-29 Self-Improving Diffusion Models with Synthetic Data Sina Alemohammad et.al. 2408.16333 null
2024-08-29 Enhanced Control for Diffusion Bridge in Image Restoration Conghan Yue et.al. 2408.16303 link
2024-08-29 Advancing Architectural Floorplan Design with Geometry-enhanced Graph Diffusion Sizhe Hu et.al. 2408.16258 link
2024-08-29 Error analysis of conformal finite element method for nonlocal diffusion model Zuoqiang Shi et.al. 2408.16243 null
2024-08-29 Enhancing Conditional Image Generation with Explainable Latent Space Manipulation Kshitij Pathania et.al. 2408.16232 link
2024-08-28 TEDRA: Text-based Editing of Dynamic and Photoreal Actors Basavaraj Sunagad et.al. 2408.15995 null
2024-08-28 Distribution Backtracking Builds A Faster Convergence Trajectory for One-step Diffusion Distillation Shengyuan Zhang et.al. 2408.15991 link
2024-08-28 Gen-Swarms: Adapting Deep Generative Models to Swarms of Drones Carlos Plou et.al. 2408.15899 null
2024-08-28 Airfoil Diffusion: Denoising Diffusion Model For Conditional Airfoil Generation Reid Graves et.al. 2408.15898 link
2024-08-28 Disentangled Diffusion Autoencoder for Harmonization of Multi-site Neuroimaging Data Ayodeji Ijishakin et.al. 2408.15890 null
2024-08-28 GenDDS: Generating Diverse Driving Video Scenarios with Prompt-to-Video Generative Model Yongjie Fu et.al. 2408.15868 null
2024-08-28 Defending Text-to-image Diffusion Models: Surprising Efficacy of Textual Perturbations Against Backdoor Attacks Oscar Chew et.al. 2408.15721 null
2024-08-28 Synthetic Forehead-creases Biometric Generation for Reliable User Verification Abhishek Tandon et.al. 2408.15693 link
2024-08-28 Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas Fabio Quattrini et.al. 2408.15660 link
2024-08-28 Grand canonical generative diffusion model for crystalline phases and grain boundaries Bo Lei et.al. 2408.15601 null
2024-08-28 MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning Yifu Yuan et.al. 2408.15501 null
2024-08-28 On the implementation of linear finite element method for nonlocal diffusion model over 2D domain Zuoqiang Shi et.al. 2408.15472 null
2024-08-28 Hand1000: Generating Realistic Hands from Text with Only 1,000 Images Haozhuo Zhang et.al. 2408.15461 null
2024-08-27 Multi-Feature Aggregation in Diffusion Models for Enhanced Face Super-Resolution Marcelo dos Santos et.al. 2408.15386 link
2024-08-27 GenRec: Unifying Video Generation and Recognition with Diffusion Models Zejia Weng et.al. 2408.15241 link
2024-08-27 Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation Xiaojuan Wang et.al. 2408.15239 null
2024-08-27 Simulation of Stochastic Discrete Dislocation Dynamics in Ductile Vs Brittle Materials Santosh Chhetri et.al. 2408.15157 null
2024-08-27 DIFR3CT: Latent Diffusion for Probabilistic 3D CT Reconstruction from Few Planar X-Rays Yiran Sun et.al. 2408.15118 link
2024-08-27 Constrained Diffusion Models via Dual Training Shervin Khalafi et.al. 2408.15094 null
2024-08-27 LN-Gen: Rectal Lymph Nodes Generation via Anatomical Features Weidong Guo et.al. 2408.14977 null
2024-08-27 Foundation Models for Music: A Survey Yinghao Ma et.al. 2408.14340 link
2024-08-26 TC-PDM: Temporally Consistent Patch Diffusion Models for Infrared-to-Visible Video Translation Anh-Dzung Doan et.al. 2408.14227 link
2024-08-26 MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement Xu He et.al. 2408.14211 null
2024-08-27 SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher Trung Dao et.al. 2408.14176 link
2024-08-26 Foodfusion: A Novel Approach for Food Image Composition via Diffusion Models Chaohua Shi et.al. 2408.14135 null
2024-08-26 SurGen: Text-Guided Diffusion Model for Surgical Video Generation Joseph Cho et.al. 2408.14028 null
2024-08-26 Pixel-Aligned Multi-View Generation with Depth Guided Decoder Zhenggang Tang et.al. 2408.14016 null
2024-08-25 SimpleSpeech 2: Towards Simple and Efficient Text-to-Speech with Flow-based Scalar Latent Transformer Diffusion Models Dongchao Yang et.al. 2408.13893 null
2024-08-25 Particle-Filtering-based Latent Diffusion for Inverse Problems Amir Nazemi et.al. 2408.13868 null
2024-08-25 Draw Like an Artist: Complex Scene Generation with Diffusion Model via Composition, Painting, and Retouching Minghao Liu et.al. 2408.13858 null
2024-08-25 Bring the Power of Diffusion Model to Defect Detection Xuyi Yu et.al. 2408.13845 null
2024-08-25 3D-VirtFusion: Synthetic 3D Data Augmentation through Generative Diffusion Models and Controllable Editing Shichao Dong et.al. 2408.13788 null
2024-08-25 Guided and Fused: Efficient Frozen CLIP-ViT with Feature Guidance and Multi-Stage Feature Fusion for Generalizable Deepfake Detection Yingjian Chen et.al. 2408.13697 null
2024-08-24 GenCA: A Text-conditioned Generative Model for Realistic and Drivable Codec Avatars Keqiang Sun et.al. 2408.13674 null
2024-08-27 Prompt-Softbox-Prompt: A free-text Embedding Control for Image Editing Yitong Yang et.al. 2408.13623 null
2024-08-24 DualAnoDiff: Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation Ying Jin et.al. 2408.13509 link
2024-08-24 Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model Chen Rao et.al. 2408.13459 link
2024-08-27 Training-free Long Video Generation with Chain of Diffusion Model Experts Wenhao Li et.al. 2408.13423 null
2024-08-24 TVG: A Training-free Transition Video Generation Method with Diffusion Models Rui Zhang et.al. 2408.13413 null
2024-08-23 Task-Oriented Diffusion Inversion for High-Fidelity Text-based Editing Yangyang Xu et.al. 2408.13395 null
2024-08-22 xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations Can Qin et.al. 2408.12590 null
2024-08-22 ssProp: Energy-Efficient Training for Convolutional Neural Networks with Scheduled Sparse Back Propagation Lujia Zhong et.al. 2408.12561 link
2024-08-22 Show-o: One Single Transformer to Unify Multimodal Understanding and Generation Jinheng Xie et.al. 2408.12528 null
2024-08-22 FlexEdit: Marrying Free-Shape Masks to VLLM for Flexible Image Editing Jue Wang et.al. 2408.12429 link
2024-08-22 4D Diffusion for Dynamic Protein Structure Prediction with Reference Guided Motion Alignment Kaihui Cheng et.al. 2408.12419 null
2024-08-22 CODE: Confident Ordinary Differential Editing Bastien van Delft et.al. 2408.12418 link
2024-08-22 Dynamic PDB: A New Dataset and a SE(3) Model Extension by Integrating Dynamic Behaviors and Physical Properties in Protein Structures Ce Liu et.al. 2408.12413 null
2024-08-22 LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency Distillation Shihao Chen et.al. 2408.12354 null
2024-08-23 GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections Shiyue Zhang et.al. 2408.12352 null
2024-08-22 Variance reduction of diffusion model’s gradients with Taylor approximation-based control variate Paul Jeha et.al. 2408.12270 null
2024-08-22 Scalable Autoregressive Image Generation with Mamba Haopeng Li et.al. 2408.12245 link
2024-08-22 DimeRec: A Unified Framework for Enhanced Sequential Recommendation via Generative Diffusion Models Wuchao Li et.al. 2408.12153 null
2024-08-22 An evidence-accumulating drift-diffusion model of competing information spread on networks Julien Corsin et.al. 2408.12127 null
2024-08-22 ZipGait: Bridging Skeleton and Silhouette with Diffusion Model for Advancing Gait Recognition Fanxu Min et.al. 2408.12111 null
2024-08-22 Pareto Inverse Reinforcement Learning for Diverse Expert Policy Generation Woo Kyung Kim et.al. 2408.12110 null
2024-08-22 Spin relaxation in graphite due to spin-orbital-phonon interaction from first-principles density-matrix approach Junqing Xu et.al. 2408.12054 null
2024-08-21 CaRDiff: Video Salient Object Ranking Chain of Thought Reasoning for Saliency Prediction with Diffusion Yunlong Tang et.al. 2408.12009 null
2024-08-21 Pixel Is Not A Barrier: An Effective Evasion Attack for Pixel-Domain Diffusion Models Chun-Yen Shih et.al. 2408.11810 null
2024-08-21 Timeline and Boundary Guided Diffusion Network for Video Shadow Detection Haipeng Zhou et.al. 2408.11785 link
2024-08-21 JieHua Paintings Style Feature Extracting Model using Stable Diffusion with ControlNet Yujia Gu et.al. 2408.11744 null
2024-08-21 Iterative Object Count Optimization for Text-to-image Diffusion Models Oz Zafar et.al. 2408.11721 null
2024-08-21 FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting Liyao Jiang et.al. 2408.11706 null
2024-08-21 Moderate deviation principles for a reaction diffusion model in non-equilibrium Linjie Zhao et.al. 2408.11633 null
2024-08-21 Bayesian inversion for the identification of the doping profile in unipolar semiconductor devices Leila Taghizadeh et.al. 2408.11485 null
2024-08-21 Latent Feature and Attention Dual Erasure Attack against Multi-View Diffusion Models for 3D Assets Protection Jingwei Sun et.al. 2408.11408 link
2024-08-21 Video Diffusion Models are Strong Video Inpainter Minhyeok Lee et.al. 2408.11402 null
2024-08-21 Generative AI based Secure Wireless Sensing for ISAC Networks Jiacheng Wang et.al. 2408.11398 null
2024-08-21 Gender Bias Evaluation in Text-to-image Generation: A Survey Yankun Wu et.al. 2408.11358 null
2024-08-21 HumanCoser: Layered 3D Human Generation via Semantic-Aware Diffusion Model Yi Wang et.al. 2408.11357 null
2024-08-21 UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation Xiangyu Zhao et.al. 2408.11305 link
2024-08-21 Taming Generative Diffusion for Universal Blind Image Restoration Siwei Tu et.al. 2408.11287 null
2024-08-20 Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model Chunting Zhou et.al. 2408.11039 null
2024-08-20 MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning Haoning Wu et.al. 2408.11001 link
2024-08-20 GreediRIS: Scalable Influence Maximization using Distributed Streaming Maximum Cover Reet Barik et.al. 2408.10982 null
2024-08-20 Kilometer-Scale Convection Allowing Model Emulation using Generative Diffusion Modeling Jaideep Pathak et.al. 2408.10958 null
2024-08-20 Large Point-to-Gaussian Model for Image-to-3D Generation Longfei Lu et.al. 2408.10935 null
2024-08-20 A Grey-box Attack against Latent Diffusion Model-based Image Editing by Posterior Collapse Zhongliang Guo et.al. 2408.10901 null
2024-08-19 MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model Minghua Liu et.al. 2408.10198 null
2024-08-19 SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views Chao Xu et.al. 2408.10195 null
2024-08-19 Multi-layer diffusion model of photovoltaic installations Tomasz Weron et.al. 2408.09904 null
2024-08-19 Instruction-Based Molecular Graph Generation with Unified Text-Graph Diffusion Model Yuran Xiang et.al. 2408.09896 link
2024-08-19 SurgicaL-CD: Generating Surgical Images via Unpaired Image Translation with Latent Consistency Diffusion Models Danush Kumar Venkatesh et.al. 2408.09822 link
2024-08-19 Latent Diffusion for Guided Document Table Generation Syed Jawwad Haider Hamdani et.al. 2408.09800 null
2024-08-19 Unsupervised Composable Representations for Audio Giovanni Bindi et.al. 2408.09792 link
2024-08-19 Propagating the prior from shallow to deep with a pre-trained velocity-model Generative Transformer network Randy Harsuko et.al. 2408.09767 null
2024-08-19 Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering Ruofan Liang et.al. 2408.09702 null
2024-08-19 ExpoMamba: Exploiting Frequency SSM Blocks for Efficient and Effective Image Enhancement Eashan Adhikarla et.al. 2408.09650 link
2024-08-18 Moonshine: Distilling Game Content Generators into Steerable Generative Models Yuhe Nie et.al. 2408.09594 null
2024-08-18 Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning Zhiwei Xu et.al. 2408.09501 null
2024-08-18 FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model Ziyu Yao et.al. 2408.09384 null
2024-08-18 Unpaired Volumetric Harmonization of Brain MRI with Conditional Latent Diffusion Mengqi Wu et.al. 2408.09315 null
2024-08-17 RepControlNet: ControlNet Reparameterization Zhaoli Deng et.al. 2408.09240 null
2024-08-17 Are CLIP features all you need for Universal Synthetic Image Origin Attribution? Dario Cioni et.al. 2408.09153 link
2024-08-17 Realistic Extreme Image Rescaling via Generative Latent Space Learning Ce Wang et.al. 2408.09151 link
2024-08-17 Barbie: Text to Barbie-Style 3D Avatars Xiaokun Sun et.al. 2408.09126 link
2024-08-17 Fragment-Masked Molecular Optimization Kun Li et.al. 2408.09106 null
2024-08-16 Efficient Autoregressive Audio Modeling via Next-Scale Prediction Kai Qiu et.al. 2408.09027 link
2024-08-15 Accelerated Image-Aware Generative Diffusion Modeling Tanmay Asthana et.al. 2408.08306 null
2024-08-15 Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding Xiner Li et.al. 2408.08252 link
2024-08-15 Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion Adi Haviv et.al. 2408.08184 null
2024-08-15 Conditional Brownian Bridge Diffusion Model for VHR SAR to Optical Image Translation Seon-Hoon Kim et.al. 2408.07947 link
2024-08-14 Moderator: Moderating Text-to-Image Diffusion Models through Fine-grained Context-based Policies Peiran Wang et.al. 2408.07728 link
2024-08-14 Drug Discovery SMILES-to-Pharmacokinetics Diffusion Models with Deep Molecular Understanding Bing Hu et.al. 2408.07636 null
2024-08-14 Anisotropic Diffusion Model of Communication in 2D Biofilm Yanahan Paramalingam et.al. 2408.07626 null
2024-08-14 DifuzCam: Replacing Camera Lens with a Mask and a Diffusion Model Erez Yosef et.al. 2408.07541 null
2024-08-14 DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency Xiaojing Zhong et.al. 2408.07481 null
2024-08-14 One Step Diffusion-based Super-Resolution with Time-Aware Distillation Xiao He et.al. 2408.07476 link
2024-08-14 Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models Jean-Marie Lemercier et.al. 2408.07472 null
2024-08-14 KIND: Knowledge Integration and Diversion in Diffusion Models Yucheng Xie et.al. 2408.07337 link
2024-08-14 GRIF-DM: Generation of Rich Impression Fonts using Diffusion Models Lei Kang et.al. 2408.07259 link
2024-08-13 Representation-space diffusion models for generating periodic materials Anshuman Sinha et.al. 2408.07213 null
2024-08-13 SeLoRA: Self-Expanding Low-Rank Adaptation of Latent Diffusion Model for Medical Image Synthesis Yuchen Mao et.al. 2408.07196 null
2024-08-13 Imagen 3 Imagen-Team-Google et.al. 2408.07009 null
2024-08-13 Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models Cheng Chen et.al. 2408.06995 null
2024-08-13 DCMSA: Multi-Head Self-Attention Mechanism Based on Deformable Convolution For Seismic Data Denoising Wang Mingwei et.al. 2408.06963 null
2024-08-13 Diffusion Model for Slate Recommendation Federico Tomasi et.al. 2408.06883 null
2024-08-13 DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion Yujia Wu et.al. 2408.06740 null
2024-08-13 DiffSG: A Generative Solver for Network Optimization with Diffusion Model Ruihuai Liang et.al. 2408.06701 link
2024-08-13 DC3DO: Diffusion Classifier for 3D Objects Nursena Koprucu et.al. 2408.06693 link
2024-08-13 Leveraging Priors via Diffusion Bridge for Time Series Generation Jinseong Park et.al. 2408.06672 null
2024-08-13 Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models Chenqian Yan et.al. 2408.06646 null
2024-08-13 ViMo: Generating Motions from Casual Videos Liangdong Qiu et.al. 2408.06614 null
2024-08-12 The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Chris Lu et.al. 2408.06292 link
2024-08-12 3D Reconstruction of Protein Structures from Multi-view AFM Images using Neural Radiance Fields (NeRFs) Jaydeep Rade et.al. 2408.06244 null
2024-08-12 Novel View Synthesis from a Single Image with Pretrained Diffusion Guidance Taewon Kang et.al. 2408.06157 null
2024-08-12 Efficient and Scalable Point Cloud Generation with Sparse Point-Voxel Diffusion Models Ioannis Romanelis et.al. 2408.06145 link
2024-08-12 CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer Zhuoyi Yang et.al. 2408.06072 link
2024-08-12 ControlNeXt: Powerful and Efficient Control for Image and Video Generation Bohao Peng et.al. 2408.06070 link
2024-08-12 BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training Xuanpu Zhang et.al. 2408.06047 link
2024-08-12 Diffuse-UDA: Addressing Unsupervised Domain Adaptation in Medical Image Segmentation with Appearance and Structure Aligned Diffusion Models Haifan Gong et.al. 2408.05985 null
2024-08-12 UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization Junjie He et.al. 2408.05939 link
2024-08-12 Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation Utkarsh Nath et.al. 2408.05938 null
2024-08-12 A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models Taehong Moon et.al. 2408.05927 link
2024-08-12 Classifier Guidance Enhances Diffusion-based Adversarial Purification by Preserving Predictive Information Mingkun Zhang et.al. 2408.05900 null
2024-08-11 LaWa: Using Latent Space for In-Generation Image Watermarking Ahmad Rezaei et.al. 2408.05868 link
2024-08-11 Egocentric Vision Language Planning Zhirui Fang et.al. 2408.05802 null
2024-08-11 MTSCI: A Conditional Diffusion Model for Multivariate Time Series Consistent Imputation Jianping Zhou et.al. 2408.05740 link
2024-08-11 SSL: A Self-similarity Loss for Improving Generative Image Super-resolution Du Chen et.al. 2408.05713 link
2024-08-11 TC-KANRecon: High-Quality and Accelerated MRI Reconstruction via Adaptive KAN Mechanisms and Intelligent Feature Scaling Ruiquan Ge et.al. 2408.05705 link
2024-08-11 StealthDiffusion: Towards Evading Diffusion Forensic Detection through Diffusion Model Ziyin Zhou et.al. 2408.05669 link
2024-08-10 Speculative Diffusion Decoding: Accelerating Language Generation through Diffusion Jacob K Christopher et.al. 2408.05636 null
2024-08-10 Diffusion Model-based Contrastive Learning for Human Activity Recognition Chunjing Xiao et.al. 2408.05567 null
2024-08-08 Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics Ruining Li et.al. 2408.04631 null
2024-08-08 Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User’s Casual Sketches Yongzhi Xu et.al. 2408.04567 null
2024-08-08 Deep Generative Models in Robotics: A Survey on Learning from Multimodal Demonstrations Julen Urain et.al. 2408.04380 null
2024-08-08 InstantStyleGaussian: Efficient Art Style Transfer with 3D Gaussian Splatting Xin-Yi Yu et.al. 2408.04249 null
2024-08-08 LLDif: Diffusion Models for Low-light Emotion Recognition Zhifeng Wang et.al. 2408.04235 null
2024-08-08 Connective Viewpoints of Signal-to-Noise Diffusion Models Khanh Doan et.al. 2408.04221 null
2024-08-08 Diffusion Guided Language Modeling Justin Lovelace et.al. 2408.04220 link
2024-08-07 Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model Guoqing Zhu et.al. 2408.03748 link
2024-08-07 Unsupervised Detection of Fetal Brain Anomalies using Denoising Diffusion Models Markus Ditlev Sjøgren Olsen et.al. 2408.03654 null
2024-08-07 TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization Kien T. Pham et.al. 2408.03637 null
2024-08-07 Dirichlet forms of diffusion processes on Thoma simplex Sergei Korotkikh et.al. 2408.03553 null
2024-08-06 Hybrid diffusion models: combining supervised and generative pretraining for label-efficient fine-tuning of segmentation models Bruno Sauvalle et.al. 2408.03433 null
2024-08-06 Attacks and Defenses for Generative Diffusion Models: A Comprehensive Survey Vu Tuan Truong et.al. 2408.03400 null
2024-08-06 Adversarial Domain Adaptation for Cross-user Activity Recognition Using Diffusion-based Noise-centred Learning Xiaozhou Ye et.al. 2408.03353 link
2024-08-06 MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation Xiaofeng Mao et.al. 2408.03312 null
2024-08-06 IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts Ciara Rowles et.al. 2408.03209 null
2024-08-06 Iterative CT Reconstruction via Latent Variable Optimization of Shallow Diffusion Models Sho Ozaki et.al. 2408.03156 null
2024-08-06 Training-Free Condition Video Diffusion Models for single frame Spatial-Semantic Echocardiogram Synthesis Van Phi Nguyen et.al. 2408.03035 link
2024-08-06 Diffusion Model Meets Non-Exemplar Class-Incremental Learning and Beyond Jichuan Zhang et.al. 2408.02983 null
2024-08-06 Data-Driven Stochastic Closure Modeling via Conditional Diffusion Model and Neural Operator Xinghao Dong et.al. 2408.02965 null
2024-08-06 Diverse Generation while Maintaining Semantic Coordination: A Diffusion-Based Data Augmentation Method for Object Detection Sen Nie et.al. 2408.02891 null
2024-08-05 Back-Projection Diffusion: Solving the Wideband Inverse Scattering Problem with Diffusion Models Borong Zhang et.al. 2408.02866 link
2024-08-05 Text Conditioned Symbolic Drumbeat Generation using Latent Diffusion Models Pushkar Jajoria et.al. 2408.02711 null
2024-08-05 RCDM: Enabling Robustness for Conditional Diffusion Model Weifeng Xu et.al. 2408.02710 null
2024-08-05 LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba Yunxiang Fu et.al. 2408.02615 link
2024-08-05 Multi-weather Cross-view Geo-localization Using Denoising Diffusion Models Tongtong Feng et.al. 2408.02408 null
2024-08-05 A Sharp Convergence Theory for The Probability Flow ODEs of Diffusion Models Gen Li et.al. 2408.02320 null
2024-08-05 Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders Muhammad Abdullah Jamal et.al. 2408.02245 null
2024-08-04 LDFaceNet: Latent Diffusion-based Network for High-Fidelity Deepfake Generation Dwij Mehta et.al. 2408.02078 null
2024-08-04 Step Saver: Predicting Minimum Denoising Steps for Diffusion Model Image Generation Jean Yu et.al. 2408.02054 null
2024-08-04 Robustness of Watermarking on Text-to-Image Diffusion Models Xiaodong Wu et.al. 2408.02035 null
2024-08-04 Faster Diffusion Action Segmentation Shuaibing Wang et.al. 2408.02024 null
2024-08-04 AnomalySD: Few-Shot Multi-Class Anomaly Detection with Stable Diffusion Model Zhenyu Yan et.al. 2408.01960 null
2024-08-04 Dataset Scale and Societal Consistency Mediate Facial Impression Bias in Vision-Language AI Robert Wolfe et.al. 2408.01959 null
2024-08-04 Why Perturbing Symbolic Music is Necessary: Fitting the Distribution of Never-used Notes through a Joint Probabilistic Diffusion Model Shipei Liu et.al. 2408.01950 null
2024-08-03 SkyDiffusion: Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm Junyan Ye et.al. 2408.01812 null
2024-08-03 Landmark-guided Diffusion Model for High-fidelity and Temporally Coherent Talking Head Generation Jintao Tan et.al. 2408.01732 null
2024-08-02 Conformal Diffusion Models for Individual Treatment Effect Estimation and Inference Hengrui Cai et.al. 2408.01582 null
2024-08-02 Conditional LoRA Parameter Generation Xiaolong Jin et.al. 2408.01415 null
2024-08-02 TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling Dong Huo et.al. 2408.01291 null
2024-08-02 A General Framework to Boost 3D GS Initialization for Text-to-3D Generation by Lexical Richness Lutao Jiang et.al. 2408.01269 null
2024-08-02 CLIP4Sketch: Enhancing Sketch to Mugshot Matching through Dataset Augmentation using Diffusion Models Kushal Kumar Jain et.al. 2408.01233 null
2024-08-02 EIUP: A Training-Free Approach to Erase Non-Compliant Concepts Conditioned on Implicit Unsafe Prompts Die Chen et.al. 2408.01014 null
2024-08-06 FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features for Highly Controllable Text-Driven Image Translation Xiang Gao et.al. 2408.00998 link
2024-08-01 Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation Yixiao Wang et.al. 2408.00766 null
2024-08-01 Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention Susung Hong et.al. 2408.00760 link
2024-08-01 TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models Gilad Deutch et.al. 2408.00735 null
2024-08-01 MotionFix: Text-Driven 3D Human Motion Editing Nikos Athanasiou et.al. 2408.00712 null
2024-08-01 Evaluation Metrics and Methods for Generative Models in the Wireless PHY Layer Michael Baur et.al. 2408.00634 null
2024-08-01 Illustrating Classic Brazilian Books using a Text-To-Image Diffusion Model Felipe Mahlow et.al. 2408.00544 null
2024-08-01 Towards Reliable Advertising Image Generation Using Human Feedback Zhenbang Du et.al. 2408.00418 link
2024-08-01 Deepfake Media Forensics: State of the Art and Challenges Ahead Irene Amerini et.al. 2408.00388 null
2024-08-01 On the Limitations and Prospects of Machine Unlearning for Generative AI Shiji Zhou et.al. 2408.00376 null
2024-08-01 DiM-Gesture: Co-Speech Gesture Generation with Adaptive Layer Normalization Mamba-2 framework Fan Zhang et.al. 2408.00370 null
2024-08-01 A Simple Background Augmentation Method for Object Detection with Diffusion Model Yuhang Li et.al. 2408.00350 null
2024-08-01 ADBM: Adversarial diffusion bridge model for reliable adversarial purification Xiao Li et.al. 2408.00315 null
2024-08-01 Diff3DETR:Agent-based Diffusion Model for Semi-supervised 3D Object Detection Jiacheng Deng et.al. 2408.00286 null
2024-08-01 Navigating Text-to-Image Generative Bias across Indic Languages Surbhi Mittal et.al. 2408.00283 null
2024-08-01 Lost in Translation: Latent Concept Misalignment in Text-to-Image Diffusion Models Juntu Zhao et.al. 2408.00230 link
2024-07-31 Hierarchical Conditioning of Diffusion Models Using Tree-of-Life for Studying Species Evolution Mridul Khurana et.al. 2408.00160 null
2024-07-31 Generative Learning of the Solution of Parametric Partial Differential Equations Using Guided Diffusion Models and Virtual Observations Han Gao et.al. 2408.00157 null
2024-07-31 WAS: Dataset and Methods for Artistic Text Segmentation Xudong Xie et.al. 2408.00106 link
2024-07-31 Localized Gaussian Splatting Editing with Contextual Awareness Hanyuan Xiao et.al. 2408.00083 null
2024-07-31 Detecting, Explaining, and Mitigating Memorization in Diffusion Models Yuxin Wen et.al. 2407.21720 link
2024-07-31 Tora: Trajectory-oriented Diffusion Transformer for Video Generation Zhenghao Zhang et.al. 2407.21705 link
2024-07-31 Generative Diffusion Model for Seismic Imaging Improvement of Sparsely Acquired Data and Uncertainty Quantification Xingchen Shi et.al. 2407.21683 null
2024-07-31 Explainable and Controllable Motion Curve Guided Cardiac Ultrasound Video Generation Junxuan Yu et.al. 2407.21490 null
2024-07-31 Fine-gained Zero-shot Video Sampling Dengsheng Chen et.al. 2407.21475 null
2024-07-31 Deformable 3D Shape Diffusion Model Dengsheng Chen et.al. 2407.21428 null
2024-07-31 Diff-Cleanse: Identifying and Mitigating Backdoor Attacks in Diffusion Models Jiang Hao et.al. 2407.21316 link
2024-07-31 State-observation augmented diffusion model for nonlinear assimilation Zhuoyuan Li et.al. 2407.21314 link
2024-07-31 DEF-oriCORN: efficient 3D scene understanding for robust language-directed manipulation without demonstrations Dongwon Son et.al. 2407.21267 null
2024-07-30 Informed Correctors for Discrete Diffusion Models Yixiu Zhao et.al. 2407.21243 null
2024-07-30 Diffusion-Based Generation of Neural Activity from Disentangled Latent Codes Jonathan D. McCart et.al. 2407.21195 null
2024-07-30 Embedding Space Selection for Detecting Memorization and Fingerprinting in Generative Models Jack He et.al. 2407.21159 null
2024-07-30 On the optimal design of a new class of proportional portfolio insurance strategies in a jump-diffusion framework Katia Colaneri et.al. 2407.21148 null
2024-07-30 Matting by Generation Zhixiang Wang et.al. 2407.21017 null
2024-07-30 Add-SD: Rational Generation without Manual Reference Lingfeng Yang et.al. 2407.21016 link
2024-07-30 Vulnerabilities in AI-generated Image Detection: The Challenge of Adversarial Attacks Yunfeng Diao et.al. 2407.20836 null
2024-07-30 Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning Norman Di Palo et.al. 2407.20798 null
2024-08-01 SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models Zheng Liu et.al. 2407.20756 link
2024-07-30 EgoSonics: Generating Synchronized Audio for Silent Egocentric Videos Aashish Rai et.al. 2407.20592 null
2024-07-30 DiffusionCounterfactuals: Inferring High-dimensional Counterfactuals with Guidance of Causal Representations Jiageng Zhu et.al. 2407.20553 null
2024-07-29 Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing Ekaterina Iakovleva et.al. 2407.20232 null
2024-07-29 LatentArtiFusion: An Effective and Efficient Histological Artifacts Restoration Framework Zhenqi He et.al. 2407.20172 link
2024-07-29 Diffusion Feedback Helps CLIP See Better Wenxuan Wang et.al. 2407.20171 link
2024-07-29 DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion Models Jing Yang et.al. 2407.20141 null
2024-07-29 Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning Liyuan Mao et.al. 2407.20109 null
2024-07-29 Generative Diffusion Model Bootstraps Zero-shot Classification of Fetal Ultrasound Images In Underrepresented African Populations Fangyijie Wang et.al. 2407.20072 link
2024-07-29 ImagiNet: A Multi-Content Dataset for Generalizable Synthetic Image Detection via Contrastive Learning Delyan Boychev et.al. 2407.20020 link
2024-07-29 MambaGesture: Enhancing Co-Speech Gesture Generation with Mamba and Disentangled Multi-Modality Fusion Chencan Fu et.al. 2407.19976 null
2024-07-29 FedDEO: Description-Enhanced One-Shot Federated Learning with Diffusion Models Mingzhao Yang et.al. 2407.19953 null
2024-07-29 FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention Yu Lu et.al. 2407.19918 null
2024-07-29 Map2Traj: Street Map Piloted Zero-shot Trajectory Generation with Diffusion Model Zhenyu Tao et.al. 2407.19765 null
2024-07-30 Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture ShahRukh Athar et.al. 2407.19593 null
2024-07-28 Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle Zhenyu Tang et.al. 2407.19548 null
2024-07-28 Temporal Feature Matters: A Framework for Diffusion Model Quantization Yushi Huang et.al. 2407.19547 null
2024-07-28 MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllability and Generalizability Buyu Liu et.al. 2407.19468 link
2024-07-28 White Matter Geometry-Guided Score-Based Diffusion Model for Tissue Microstructure Imputation in Tractography Imaging Yui Lo et.al. 2407.19460 null
2024-07-28 FIND: Fine-tuning Initial Noise Distribution with Policy Optimization for Diffusion Models Changgu Chen et.al. 2407.19453 link
2024-07-28 ClickDiff: Click to Induce Semantic Contact Map for Controllable Grasp Generation with Diffusion Models Peiming Li et.al. 2407.19370 link
2024-07-27 Radio Frequency Signal based Human Silhouette Segmentation: A Sequential Diffusion Approach Penghui Wen et.al. 2407.19244 link
2024-07-27 Data Processing Techniques for Modern Multimodal Models Yinheng Li et.al. 2407.19180 null
2024-07-25 RegionDrag: Fast Region-Based Image Editing with Diffusion Models Jingyi Lu et.al. 2407.18247 null
2024-07-25 VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads Orest Kupyn et.al. 2407.18245 link
2024-07-25 Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images Roberto Di Via et.al. 2407.18125 null
2024-07-25 Segmentation-guided MRI reconstruction for meaningfully diverse reconstructions Jan Nikolas Morshuis et.al. 2407.18026 link
2024-07-25 Self-Supervision Improves Diffusion Models for Tabular Data Imputation Yixin Liu et.al. 2407.18013 link
2024-07-25 Lightweight Language-driven Grasp Detection using Conditional Consistency Model Nghia Nguyen et.al. 2407.17967 null
2024-07-25 ReCorD: Reasoning and Correcting Diffusion for HOI Generation Jian-Yu Jiang-Lin et.al. 2407.17911 link
2024-07-25 Amortized Posterior Sampling with Diffusion Prior Distillation Abbas Mammadov et.al. 2407.17907 null
2024-07-25 Artificial Immunofluorescence in a Flash: Rapid Synthetic Imaging from Brightfield Through Residual Diffusion Xiaodan Xing et.al. 2407.17882 null
2024-07-25 DragText: Rethinking Text Embedding in Point-based Image Editing Gayoon Choi et.al. 2407.17843 link
2024-07-25 Mpox Detection Advanced: Rapid Epidemic Response Through Synthetic Data Yudara Kularathne et.al. 2407.17762 null
2024-07-25 Multi-physics Simulation Guided Generative Diffusion Models with Applications in Fluid and Heat Dynamics Naichen Shi et.al. 2407.17720 link
2024-07-24 Diffusion Models for Multi-Task Generative Modeling Changyou Chen et.al. 2407.17571 null
2024-07-24 SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency Yiming Xie et.al. 2407.17470 null
2024-07-24 CDDIP: Constrained Diffusion-Driven Deep Image Prior for Seismic Image Reconstruction Paul Goyes-Peñafiel et.al. 2407.17402 link
2024-07-25 LPGen: Enhancing High-Fidelity Landscape Painting Generation through Diffusion Model Wanggong Yang et.al. 2407.17229 null
2024-07-24 Unpaired Photo-realistic Image Deraining with Energy-informed Diffusion Model Yuanbo Wen et.al. 2407.17193 null
2024-07-24 MemBench: Memorized Image Trigger Prompt Dataset for Diffusion Models Chunsan Hong et.al. 2407.17095 link
2024-07-24 Sparse Inducing Points in Deep Gaussian Processes: Enhancing Modeling with Denoising Diffusion Variational Inference Jian Xu et.al. 2407.17033 null
2024-07-24 Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model Lirui Zhao et.al. 2407.16982 link
2024-07-24 SAR to Optical Image Translation with Color Supervised Diffusion Model Xinyu Bai et.al. 2407.16921 null
2024-07-23 VisMin: Visual Minimal-Change Understanding Rabiul Awal et.al. 2407.16772 null
2024-07-23 Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions Fabio Tosi et.al. 2407.16698 link
2024-07-23 From Imitation to Refinement – Residual RL for Precise Visual Assembly Lars Ankile et.al. 2407.16677 null
2024-07-23 MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence Canyu Zhao et.al. 2407.16655 null
2024-07-23 DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models Zhenyu Xie et.al. 2407.16511 null
2024-07-23 MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection Youngmin Oh et.al. 2407.16448 link
2024-07-23 On Differentially Private 3D Medical Image Synthesis with Controllable Latent Diffusion Models Deniz Daum et.al. 2407.16405 link
2024-07-23 DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors Zizheng Yan et.al. 2407.16260 null
2024-07-23 OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any Person Ke Sun et.al. 2407.16224 null
2024-07-23 Diff-Shadow: Global-guided Diffusion Model for Shadow Removal Jinting Luo et.al. 2407.16214 link
2024-07-23 CloudFixer: Test-Time Adaptation for 3D Point Clouds via Diffusion-Guided Geometric Transformation Hajin Shim et.al. 2407.16193 null
2024-07-23 No Re-Train, More Gain: Upgrading Backbones with Diffusion Model for Few-Shot Segmentation Shuai Chen et.al. 2407.16182 null
2024-07-22 Artist: Aesthetically Controllable Text-Driven Stylization without Training Ruixiang Jiang et.al. 2407.15842 link
2024-07-22 Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget Vikash Sehwag et.al. 2407.15811 link
2024-07-22 Diffusion Model Based Resource Allocation Strategy in Ultra-Reliable Wireless Networked Control Systems Amirhassan Babazadeh Darabi et.al. 2407.15784 null
2024-07-22 A Hamilton-Jacobi approach to road-field reaction-diffusion models Christopher Henderson et.al. 2407.15760 null
2024-07-22 Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond Silvio Galesso et.al. 2407.15739 link
2024-07-22 Estimating Probability Densities with Transformer and Denoising Diffusion Henry W. Leung et.al. 2407.15703 link
2024-07-22 Voltage mapping in subcellular nanodomains using electro-diffusion modeling Frédéric Paquin-Lefebvre et.al. 2407.15697 null
2024-07-23 Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models Xin Ma et.al. 2407.15642 link
2024-07-23 A Diffusion Model for Simulation Ready Coronary Anatomy with Morpho-skeletal Control Karim Kadry et.al. 2407.15631 null
2024-07-22 StylusAI: Stylistic Adaptation for Robust German Handwritten Text Generation Nauman Riaz et.al. 2407.15608 null
2024-07-22 Discrete Flow Matching Itai Gat et.al. 2407.15595 null
2024-07-22 SpotDiffusion: A Fast Approach For Seamless Panorama Generation Over Time Stanislav Frolov et.al. 2407.15507 link
2024-07-22 DiffX: Guide Your Layout to Cross-Modal Generative Modeling Zeyu Wang et.al. 2407.15488 link
2024-07-22 A New Perspective on the Diffuse Gamma-Ray Emission Excess Ensheng Chen et.al. 2407.15474 null
2024-07-22 A vector-host epidemic model with spatial structure and seasonality Mingxin Wang et.al. 2407.15361 null
2024-07-22 Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion Models Xiao Liu et.al. 2407.15328 link
2024-07-21 MedEdit: Counterfactual Diffusion-based Image Editing on Brain MRI Malek Ben Alaya et.al. 2407.15270 null
2024-07-23 CGB-DM: Content and Graphic Balance Layout Generation with Transformer-based Diffusion Model Yu Li et.al. 2407.15233 null
2024-07-21 Thermodynamics inconsistencies in cosmological unimodular gravity models Miguel Cruz et.al. 2407.15207 null
2024-07-21 HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions Haiyang Zhou et.al. 2407.15187 null
2024-07-18 LogoSticker: Inserting Logos into Diffusion Models for Customized Generation Mingkang Zhu et.al. 2407.13752 null
2024-07-18 Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review Masatoshi Uehara et.al. 2407.13734 link
2024-07-18 MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis Ziming Zhong et.al. 2407.13675 link
2024-07-18 Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models Xiaoyu Zhu et.al. 2407.13642 null
2024-07-18 Training-free Composite Scene Generation for Layout-to-Image Synthesis Jiaqi Liu et.al. 2407.13609 link
2024-07-18 EnergyDiff: Universal Time-Series Energy Data Generation using Diffusion Models Nan Lin et.al. 2407.13538 link
2024-07-18 All Roads Lead to Rome? Exploring Representational Similarities Between Latent Spaces of Generative Image Models Charumathi Badrinath et.al. 2407.13449 link
2024-07-18 Movement-based models for abundance data Ricardo Carrizo Vergara et.al. 2407.13384 null
2024-07-18 URCDM: Ultra-Resolution Image Synthesis in Histopathology Sarah Cechnicka et.al. 2407.13277 link
2024-07-18 Unveiling Structural Memorization: Structural Membership Inference Attack for Text-to-Image Diffusion Models Qiao Li et.al. 2407.13252 null
2024-07-18 MEDIC: Zero-shot Music Editing with Disentangled Inversion Control Huadai Liu et.al. 2407.13220 null
2024-07-18 SpaDiT: Diffusion Transformer for Spatial Gene Expression Prediction using scRNA-seq Xiaoyu Li et.al. 2407.13182 link
2024-07-18 Training-Free Large Model Priors for Multiple-in-One Image Restoration Xuanhua He et.al. 2407.13181 null
2024-07-18 Image Inpainting Models are Effective Tools for Instruction-guided Image Editing Xuan Ju et.al. 2407.13139 null
2024-07-18 FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection Jianwei Zhao et.al. 2407.13133 null
2024-07-17 Denoising Diffusions in Latent Space for Medical Image Segmentation Fahim Ahmed Zaman et.al. 2407.12952 link
2024-07-17 DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion Huiguo He et.al. 2407.12899 null
2024-07-17 SMooDi: Stylized Motion Diffusion Model Lei Zhong et.al. 2407.12783 null
2024-07-17 VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control Sherwin Bahmani et.al. 2407.12781 null
2024-07-17 Hallucination Index: An Image Quality Metric for Generative Reconstruction Models Matthew Tivnan et.al. 2407.12780 null
2024-07-17 GroundUp: Rapid Sketch-Based 3D City Massing Gizem Esra Unlu et.al. 2407.12739 null
2024-07-17 NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model Zhongqun Zhang et.al. 2407.12727 null
2024-07-18 SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow Yuanzhi Zhu et.al. 2407.12718 link
2024-07-17 IMAGDressing-v1: Customizable Virtual Dressing Fei Shen et.al. 2407.12705 link
2024-07-17 4Dynamic: Text-to-4D Generation with Hybrid Priors Yu-Jie Yuan et.al. 2407.12684 null
2024-07-17 Promptable Counterfactual Diffusion Model for Unified Brain Tumor Segmentation and Generation with MRIs Yiqing Shen et.al. 2407.12678 link
2024-07-17 CoSIGN: Few-Step Guidance of ConSIstency Model to Solve General INverse Problems Jiankun Zhao et.al. 2407.12676 link
2024-07-17 Zero-shot Text-guided Infinite Image Synthesis with LLM guidance Soyeong Kwon et.al. 2407.12642 null
2024-07-17 VegeDiff: Latent Diffusion Model for Geospatial Vegetation Forecasting Sijie Zhao et.al. 2407.12592 null
2024-07-17 The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation Yi Yao et.al. 2407.12579 null
2024-07-17 High Frequency Matters: Uncertainty Guided Image Compression with Wavelet Diffusion Juan Song et.al. 2407.12538 link
2024-07-17 Leveraging the Mahalanobis Distance to enhance Unsupervised Brain MRI Anomaly Detection Finn Behrendt et.al. 2407.12474 link
2024-07-17 Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning Xu-Hui Liu et.al. 2407.12448 link
2024-07-17 Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models Chao Gong et.al. 2407.12383 link
2024-07-17 HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects Xintao Lv et.al. 2407.12371 null
2024-07-17 I2AM: Interpreting Image-to-Image Latent Diffusion Models via Attribution Maps Junseo Park et.al. 2407.12331 null
2024-07-17 Label-Efficient 3D Brain Segmentation via Complementary 2D Diffusion Models with Orthogonal Views Jihoon Cho et.al. 2407.12329 null
2024-07-15 Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion Yongyuan Liang et.al. 2407.10973 null
2024-07-15 InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models Nirat Saini et.al. 2407.10958 null
2024-07-16 DataDream: Few-shot Guided Dataset Generation Jae Myung Kim et.al. 2407.10910 link
2024-07-15 Optical Diffusion Models for Image Generation Ilker Oguz et.al. 2407.10897 null
2024-07-15 R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection Zheyuan Zhou et.al. 2407.10862 null
2024-07-15 Physics-Inspired Generative Models in Medical Imaging: A Review Dennis Hein et.al. 2407.10856 null
2024-07-15 Conditional Guided Generative Diffusion for Particle Accelerator Beam Diagnostics Alexander Scheinker et.al. 2407.10693 null
2024-07-15 Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval Youngsun Lim et.al. 2407.10683 null
2024-07-15 Temporal Residual Guided Diffusion Framework for Event-Driven Video Reconstruction Lin Zhu et.al. 2407.10636 null
2024-07-15 WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models Zijian He et.al. 2407.10625 null
2024-07-15 InsertDiffusion: Identity Preserving Visualization of Objects through a Training-Free Diffusion Architecture Phillip Mueller et.al. 2407.10592 link
2024-07-15 Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation Peng Jin et.al. 2407.10528 null
2024-07-15 Kinetic Typography Diffusion Model Seonmi Park et.al. 2407.10476 null
2024-07-15 GROOT: Generating Robust Watermark for Diffusion-Model-Based Audio Synthesis Weizhi Liu et.al. 2407.10471 null
2024-07-15 LiteFocus: Accelerated Diffusion Inference for Long Audio Synthesis Zhenxiong Tan et.al. 2407.10468 link
2024-07-15 DiffStega: Towards Universal Training-Free Coverless Image Steganography with Diffusion Models Yiwei Yang et.al. 2407.10459 link
2024-07-15 Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion Jian Ma et.al. 2407.10373 null
2024-07-14 On an age-structured model in moving boundaries: The effects of nonlocal diffusion and harvesting pulse Haiyan Xu et.al. 2407.10363 null
2024-07-14 Addressing Class Imbalance and Data Limitations in Advanced Node Semiconductor Defect Inspection: A Generative Approach for SEM Images Bappaditya Dey et.al. 2407.10348 null
2024-07-14 Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors Jae Joong Lee et.al. 2407.10330 null
2024-07-11 Video Diffusion Alignment via Reward Gradients Mihir Prabhudesai et.al. 2407.08737 link
2024-07-11 Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models Zhening Xing et.al. 2407.08701 null
2024-07-11 Controlling the Fidelity and Diversity of Deep Generative Models via Pseudo Density Shuangqi Li et.al. 2407.08659 null
2024-07-11 Latent Conditional Diffusion-based Data Augmentation for Continuous-Time Dynamic Graph Mode Yuxing Tian et.al. 2407.08500 null
2024-07-11 Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers Zhengbo Zhang et.al. 2407.08394 null
2024-07-11 Wind Power Assessment based on Super-Resolution and Downscaling – A Comparison of Deep Learning Methods Luca Schmidt et.al. 2407.08259 null
2024-07-11 Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling Noam Elata et.al. 2407.08256 null
2024-07-11 E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors Jinxiu Liang et.al. 2407.08231 null
2024-07-11 Survey on Fundamental Deep Learning 3D Reconstruction Techniques Yonge Bai et.al. 2407.08137 null
2024-07-10 Geospecific View Generation – Geometry-Context Aware High-resolution Ground View Inference from Satellite Views Ningli Xu et.al. 2407.08061 null
2024-07-10 Coherent and Multi-modality Image Inpainting via Latent Space Optimization Lingzhi Pan et.al. 2407.08019 link
2024-07-10 Generative Image as Action Models Mohit Shridhar et.al. 2407.07875 link
2024-07-10 Dynamical Measure Transport and Neural PDE Solvers for Sampling Jingtong Sun et.al. 2407.07873 null
2024-07-10 Controlling Space and Time with Diffusion Models Daniel Watson et.al. 2407.07860 null
2024-07-10 Generic Numerical Analysis of Stochastic Reaction Diffusion Model with applications in excitable media Yahya Alnashri et.al. 2407.07834 null
2024-07-10 Universal and non-universal signatures in the scaling functions of critical variables Gianluca Teza et.al. 2407.07782 null
2024-07-10 VEnhancer: Generative Space-Time Enhancement for Video Generation Jingwen He et.al. 2407.07667 null
2024-07-11 MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis Wanggui He et.al. 2407.07614 link
2024-07-10 Drantal-NeRF: Diffusion-Based Restoration for Anti-aliasing Neural Radiance Field Ganlin Yang et.al. 2407.07461 null
2024-07-10 Secondary Structure-Guided Novel Protein Sequence Generation with Latent Graph Diffusion Yutong Hu et.al. 2407.07443 link
2024-07-10 Deformation-Recovery Diffusion Model (DRDM): Instance Deformation for Image Manipulation and Synthesis Jian-Qing Zheng et.al. 2407.07295 link
2024-07-09 A Very Effective and Simple Diffusion Reconstruction for the Diluted Ising Model Stefano Bae et.al. 2407.07266 null
2024-07-09 Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion Yu Cao et.al. 2407.07249 null
2024-07-09 Accelerating Mobile Edge Generation (MEG) by Constrained Learning Xiaoxia Xu et.al. 2407.07245 null
2024-07-09 ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement Muhammad Atif Butt et.al. 2407.07197 link
2024-07-09 CamFreeDiff: Camera-free Image to Panorama Generation with Diffusion Model Xiaoding Yuan et.al. 2407.07174 null
2024-07-09 ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction Shaozhe Hao et.al. 2407.07077 link
2024-07-11 RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models Bowen Zhang et.al. 2407.06938 null
2024-07-09 HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance Guian Fang et.al. 2407.06937 link
2024-07-09 A reaction-diffusion model for relapsing-remitting multiple sclerosis with a treatment term Romina Travaglini et.al. 2407.06802 null
2024-07-09 Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning Fanyue Wei et.al. 2407.06642 link
2024-07-08 JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation Yu Zeng et.al. 2407.06187 null
2024-07-08 The Tug-of-War Between Deepfake Generation and Detection Hannah Lee et.al. 2407.06174 null
2024-07-08 ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation Ethan Chern et.al. 2407.06135 link
2024-07-08 Structured Generations: Using Hierarchical Clusters to guide Diffusion Models Jorge da Silva Goncalves et.al. 2407.06124 link
2024-07-08 PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models Jinhua Zhang et.al. 2407.06109 link
2024-07-08 Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation Xinyu Bai et.al. 2407.06095 null
2024-07-08 Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis Emaad Khwaja et.al. 2407.06079 null
2024-07-08 Analysis and finite element approximation of a diffuse interface approach to the Stokes–Biot coupling Francis R. A. Aznaran et.al. 2407.05949 null
2024-07-08 Minutes to Seconds: Speeded-up DDPM-based Image Inpainting with Coarse-to-Fine Sampling Lintao Zhang et.al. 2407.05875 link
2024-07-08 RadiomicsFill-Mammo: Synthetic Mammogram Mass Manipulation with Radiomics Features Inye Na et.al. 2407.05683 link
2024-07-08 BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space Yumeng Zhang et.al. 2407.05679 link
2024-07-08 Ada-adapter:Fast Few-shot Style Personlization of Diffusion Model with Pre-trained Image Encoder Jia Liu et.al. 2407.05552 null
2024-07-08 Read, Watch and Scream! Sound Generation from Text and Video Yujin Jeong et.al. 2407.05551 link
2024-07-08 LaSe-E2V: Towards Language-guided Semantic-Aware Event-to-Video Reconstruction Kanghao Chen et.al. 2407.05547 null
2024-07-07 Diffusion as Sound Propagation: Physics-inspired Model for Ultrasound Image Generation Marina Domínguez et.al. 2407.05428 link
2024-07-07 BiRoDiff: Diffusion policies for bipedal robot locomotion on unseen terrains GVS Mothish et.al. 2407.05424 null
2024-07-07 Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model Danni Yang et.al. 2407.05352 link
2024-07-07 Enhancing Label-efficient Medical Image Segmentation with Text-guided Diffusion Models Chun-Mei Feng et.al. 2407.05323 null
2024-07-07 An Improved Method for Personalizing Diffusion Models Yan Zeng et.al. 2407.05312 null
2024-07-07 DM-MIMO: Diffusion Models for Robust Semantic Communications over MIMO Channels Yiheng Duan et.al. 2407.05289 null
2024-07-03 DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents Yilun Xu et.al. 2407.03300 link
2024-07-03 Improved Noise Schedule for Diffusion Training Tiankai Hang et.al. 2407.03297 null
2024-07-04 Spatio-Temporal Adaptive Diffusion Models for EEG Super-Resolution in Epilepsy Diagnosis Tong Zhou et.al. 2407.03089 null
2024-07-03 Electromagnetic Property Sensing Based on Diffusion Model in ISAC System Yuhua Jiang et.al. 2407.03075 null
2024-07-03 Semantic-Aware Power Allocation for Generative Semantic Communications with Foundation Models Chunmei Xu et.al. 2407.03050 null
2024-07-03 SlerpFace: Face Template Protection via Spherical Linear Interpolation Zhizhou Zhong et.al. 2407.03043 null
2024-07-03 Frequency-Controlled Diffusion Model for Versatile Text-Guided Image-to-Image Translation Xiang Gao et.al. 2407.03006 link
2024-07-04 VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned Priors Sungwon Hwang et.al. 2407.02945 link
2024-07-03 Single Image Rolling Shutter Removal with Diffusion Models Zhanglei Yang et.al. 2407.02906 null
2024-07-03 Robot Shape and Location Retention in Video Generation Using Diffusion Models Peng Wang et.al. 2407.02873 link
2024-07-03 Mirage Sources and Large TeV Halo-Pulsar Offsets: Exploring the Parameter Space Yiwei Bao et.al. 2407.02829 null
2024-07-03 Highly Accelerated MRI via Implicit Neural Representation Guided Posterior Sampling of Diffusion Models Jiayue Chu et.al. 2407.02744 null
2024-07-02 No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models Seyedmorteza Sadat et.al. 2407.02687 null
2024-07-02 Diffusion Models for Tabular Data Imputation and Synthetic Data Generation Mario Villaizán-Vallelado et.al. 2407.02549 null
2024-07-02 Magic Insert: Style-Aware Drag-and-Drop Nataniel Ruiz et.al. 2407.02489 null
2024-07-03 Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models Fei Shen et.al. 2407.02482 link
2024-07-02 GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models Jian Ma et.al. 2407.02252 link
2024-07-02 LaMoD: Latent Motion Diffusion Model For Myocardial Strain Generation Jiarui Xing et.al. 2407.02229 link
2024-07-04 UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks Jingjing Ren et.al. 2407.02158 null
2024-07-02 Counterfactual Data Augmentation with Denoising Diffusion for Graph Anomaly Detection Chunjing Xiao et.al. 2407.02143 link
2024-06-28 HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Model Hieu T. Nguyen et.al. 2406.20077 null
2024-06-28 Neural Differentiable Modeling with Diffusion-Based Super-resolution for Two-Dimensional Spatiotemporal Turbulence Xiantao Fan et.al. 2406.20047 null
2024-06-28 HAITCH: A Framework for Distortion and Motion Correction in Fetal Multi-Shell Diffusion-Weighted MRI Haykel Snoussi et.al. 2406.20042 null
2024-06-28 Deceptive Diffusion: Generating Synthetic Adversarial Examples Lucas Beerens et.al. 2406.19807 null
2024-06-28 Comprehensive Generative Replay for Task-Incremental Segmentation with Concurrent Appearance and Semantic Forgetting Wei Li et.al. 2406.19796 link
2024-06-28 Decision Transformer for IRS-Assisted Systems with Diffusion-Driven Generative Channels Jie Zhang et.al. 2406.19769 null
2024-06-28 DISCO: Efficient Diffusion Solver for Large-Scale Combinatorial Optimization Problems Kexiong Yu et.al. 2406.19705 null
2024-06-28 Network Bending of Diffusion Models for Audio-Visual Generation Luke Dzwonczyk et.al. 2406.19589 link
2024-06-27 A Thermal Study of Terahertz Induced Protein Interactions Hadeel Elayan et.al. 2406.19521 null
2024-06-27 pop-cosmos: Scaleable inference of galaxy properties and redshifts with a data-driven population model Stephen Thorp et.al. 2406.19437 null
2024-06-27 Accelerating Multiphase Flow Simulations with Denoising Diffusion Model Driven Initializations Jaehong Chung et.al. 2406.19333 null
2024-06-27 Subtractive Training for Music Stem Insertion using Latent Diffusion Models Ivan Villa-Renteria et.al. 2406.19328 null
2024-06-27 Compositional Image Decomposition with Diffusion Models Jocelin Su et.al. 2406.19298 null
2024-06-27 Using diffusion model as constraint: Empower Image Restoration Network Training with Diffusion Model Jiangtong Tan et.al. 2406.19030 link
2024-06-28 AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation Yanan Sun et.al. 2406.18958 link
2024-06-27 Investigating and Defending Shortcut Learning in Personalized Diffusion Models Yixin Liu et.al. 2406.18944 link
2024-06-28 AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models Aishwarya Agarwal et.al. 2406.18893 null
2024-06-27 Chemical Continuous Time Random Walks under Anomalous Diffusion Hong Zhang et.al. 2406.18869 null
2024-06-26 MultiDiff: Consistent Novel View Synthesis from a Single Image Norman Müller et.al. 2406.18524 null
2024-06-26 Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration Kang Liao et.al. 2406.18516 link
2024-06-26 DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance Younghyun Kim et.al. 2406.18459 link
2024-06-26 Towards diffusion models for large-scale sea-ice modelling Tobias Sebastian Finn et.al. 2406.18417 null
2024-06-27 Stable Diffusion Segmentation for Biomedical Images with Single-step Reverse Process Tianyu Lin et.al. 2406.18361 link
2024-06-26 Molecular Diffusion Models with Virtual Receptors Matan Halfon et.al. 2406.18330 null
2024-06-26 Galaxy spectroscopy without spectra: Galaxy properties from photometric images with conditional diffusion models Lars Doorenbos et.al. 2406.18175 link
2024-06-26 Human-Aware 3D Scene Generation with Spatially-constrained Diffusion Models Xiaolin Hong et.al. 2406.18159 null
2024-06-26 Leveraging Pre-trained Models for FF-to-FFPE Histopathological Image Translation Qilai Zhang et.al. 2406.18054 link
2024-06-25 DiffusionPDE: Generative PDE-Solving Under Partial Observation Jiahe Huang et.al. 2406.17763 link
2024-06-25 Unified Auto-Encoding with Masked Diffusion Philippe Hansen-Estruch et.al. 2406.17688 link
2024-06-25 LaTable: Towards Large Tabular Models Boris van Breugel et.al. 2406.17673 null
2024-06-25 Aligning Diffusion Models with Noise-Conditioned Perception Alexander Gambashidze et.al. 2406.17636 null
2024-06-25 Diffusion-based Adversarial Purification for Intrusion Detection Mohamed Amine Merzouk et.al. 2406.17606 link
2024-06-25 Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text Xinyang Li et.al. 2406.17601 link
2024-06-25 Detection of Synthetic Face Images: Accuracy, Robustness, Generalization Nela Petrzelkova et.al. 2406.17547 null
2024-06-25 Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation Felix Stillger et.al. 2406.17541 null
2024-06-25 The Tree of Diffusion Life: Evolutionary Embeddings to Understand the Generation Process of Diffusion Models Vidya Prasad et.al. 2406.17462 null
2024-06-25 SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing Ruihuang Li et.al. 2406.17396 null
2024-06-25 Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers Lei Chen et.al. 2406.17343 link
2024-06-24 FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models Haonan Qiu et.al. 2406.16863 link
2024-06-24 Dreamitate: Real-World Visuomotor Policy Learning via Video Generation Junbang Liang et.al. 2406.16862 null
2024-06-24 General Binding Affinity Guidance for Diffusion Models in Structure-Based Drug Design Yue Jian et.al. 2406.16821 link
2024-06-24 Portrait3D: 3D Head Generation from Single In-the-wild Portrait Image Jinkun Hao et.al. 2406.16710 null
2024-06-24 Geometry-Aware Score Distillation via 3D Consistent Noising and Gradient Consistency Modeling Min-Seop Kwak et.al. 2406.16695 null
2024-06-24 Repulsive Score Distillation for Diverse Sampling of Diffusion Models Nicolas Zilberstein et.al. 2406.16683 link
2024-06-24 OAML: Outlier Aware Metric Learning for OOD Detection Enhancement Heng Gao et.al. 2406.16525 link
2024-06-24 DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution Aiwen Jiang et.al. 2406.16477 link
2024-06-24 ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance Shuwei Shi et.al. 2406.16476 null
2024-06-24 Prompt-Consistency Image Generation (PCIG): A Unified Framework Integrating LLMs, Knowledge Graphs, and Controllable Diffusion Models Yichen Sun et.al. 2406.16333 null
2024-06-24 YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals Sandeep Mishra et.al. 2406.16273 null
2024-06-24 Repairing Catastrophic-Neglect in Text-to-Image Diffusion Models via Attention-Guided Feature Enhancement Zhiyuan Chang et.al. 2406.16272 link
2024-06-24 Video-Infinity: Distributed Long Video Generation Zhenxiong Tan et.al. 2406.16260 null
2024-06-23 Provable Statistical Rates for Consistency Diffusion Models Zehao Dou et.al. 2406.16213 null
2024-06-23 UDHF2-Net: An Uncertainty-diffusion-model-based High-Frequency TransFormer Network for High-accuracy Interpretation of Remotely Sensed Imagery Pengfei Zhang et.al. 2406.16129 null
2024-06-23 Diffusion Spectral Representation for Reinforcement Learning Dmitry Shribak et.al. 2406.16121 null
2024-06-23 Pose-Diversified Augmentation with Diffusion Model for Person Re-Identification Inès Hyeonsu Kim et.al. 2406.16042 null
2024-06-23 TimeAutoDiff: Combining Autoencoder and Diffusion model for time series tabular data synthesizing Namjoon Suh et.al. 2406.16028 link
2024-06-22 PUDD: Towards Robust Multi-modal Prototype-based Deepfake Detection Alvaro Lopez Pellcier et.al. 2406.15921 null
2024-06-22 Soft Masked Mamba Diffusion Model for CT to MRI Conversion Zhenbin Wang et.al. 2406.15910 link
2024-06-20 A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models Xincheng Shuai et.al. 2406.14555 link
2024-06-21 Advancing Fine-Grained Classification by Structure and Subject Preserving Augmentation Eyal Michaeli et.al. 2406.14551 link
2024-06-20 Consistency Models Made Easy Zhengyang Geng et.al. 2406.14548 link
2024-06-20 Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps Nikita Starodubcev et.al. 2406.14539 null
2024-06-20 V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data Rotem Shalev-Arkushin et.al. 2406.14510 null
2024-06-20 SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset Josef Dai et.al. 2406.14477 link
2024-06-20 CollaFuse: Collaborative Diffusion Models Simeon Allmendinger et.al. 2406.14429 link
2024-06-20 Active Diffusion Subsampling Oisin Nolan et.al. 2406.14388 link
2024-06-20 In Tree Structure Should Sentence Be Generated Yaguang Li et.al. 2406.14189 link
2024-06-20 CriDiff: Criss-cross Injection Diffusion Framework via Generative Pre-train for Prostate Segmentation Tingwei Liu et.al. 2406.14186 link
2024-06-20 ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning Zhongjie Duan et.al. 2406.14130 link
2024-06-20 HeartBeat: Towards Controllable Echocardiography Video Synthesis with Multimodal Conditions-Guided Diffusion Models Xinrui Zhou et.al. 2406.14098 null
2024-06-20 Bridging bulk and surface: An interacting particle system towards the field-road diffusion model Matthieu Alfaro et.al. 2406.14093 null
2024-06-20 A Practical Diffusion Path for Sampling Omar Chehab et.al. 2406.14040 null
2024-06-20 Similarity-aware Syncretic Latent Diffusion Model for Medical Image Translation with Representation Learning Tingyi Lin et.al. 2406.13977 null
2024-06-20 Synthesizing Multimodal Electronic Health Records via Predictive Diffusion Models Yuan Zhong et.al. 2406.13942 null
2024-06-20 EnTruth: Enhancing the Traceability of Unauthorized Dataset Usage in Text-to-image Diffusion Models with Minimal and Robust Alterations Jie Ren et.al. 2406.13933 null
2024-06-19 INFusion: Diffusion Regularized Implicit Neural Representations for 2D and 3D accelerated MRI reconstruction Yamin Arefeen et.al. 2406.13895 null
2024-06-19 Stability and Generalizability in SDE Diffusion Models with Measure-Preserving Dynamics Weitong Zhang et.al. 2406.13652 null
2024-06-19 On AI-Inspired UI-Design Jialiang Wei et.al. 2406.13631 null
2024-06-18 Evaluating the design space of diffusion-based generative models Yuqing Wang et.al. 2406.12839 null
2024-06-18 Neural Approximate Mirror Maps for Constrained Diffusion Models Berthy T. Feng et.al. 2406.12816 null
2024-06-18 Extracting Training Data from Unconditional Diffusion Models Yunhao Chen et.al. 2406.12752 null
2024-06-18 Speak in the Scene: Diffusion-based Acoustic Scene Transfer toward Immersive Speech Generation Miseul Kim et.al. 2406.12688 null
2024-06-18 GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models Yongtao Ge et.al. 2406.12671 link
2024-06-18 Unmasking the Veil: An Investigation into Concept Ablation for Privacy and Copyright Protection in Images Shivank Garg et.al. 2406.12592 link
2024-06-18 Training Diffusion Models with Federated Learning Matthijs de Goede et.al. 2406.12575 null
2024-06-18 Variational Distillation of Diffusion Policies into Mixture of Experts Hongyi Zhou et.al. 2406.12538 null
2024-06-18 HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors Panwang Pan et.al. 2406.12459 link
2024-06-18 Planning Using Schrödinger Bridge Diffusion Models Adarsh Srivastava et.al. 2406.12458 link
2024-06-18 Deep Temporal Deaggregation: Large-Scale Spatio-Temporal Generative Models David Bergström et.al. 2406.12423 null
2024-06-18 TADM: Temporally-Aware Diffusion Model for Neurodegenerative Progression on Brain MRI Mattia Litrico et.al. 2406.12411 null
2024-06-18 Effective Generation of Feasible Solutions for Integer Programming via Guided Diffusion Hao Zeng et.al. 2406.12349 link
2024-06-18 Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment Yiheng Li et.al. 2406.12303 null
2024-06-17 COT Flow: Learning Optimal-Transport Image Sampling and Editing by Contrastive Pairs Xinrui Zu et.al. 2406.12140 null
2024-06-17 Adding Conditional Control to Diffusion Models with Reinforcement Learning Yulai Zhao et.al. 2406.12120 null
2024-06-17 Optimal withdrawals in a general diffusion model with control rates subject to a state-dependent upper bound Hélène Guérin et.al. 2406.12067 null
2024-06-17 ARTIST: Improving the Generation of Text-rich Images by Disentanglement Jianyi Zhang et.al. 2406.12044 null
2024-06-17 Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models Alireza Ganjdanesh et.al. 2406.12042 link
2024-06-17 Decomposed evaluations of geographic disparities in text-to-image models Abhishek Sureddy et.al. 2406.11988 null
2024-06-17 Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models Bingqi Ma et.al. 2406.11831 null
2024-06-17 MegaScenes: Scene-Level View Synthesis at Scale Joseph Tung et.al. 2406.11819 link
2024-06-17 DiffMM: Multi-Modal Diffusion Model for Recommendation Yangqin Jiang et.al. 2406.11781 link
2024-06-17 Latent Denoising Diffusion GAN: Faster sampling, Higher image quality Luan Thanh Trinh et.al. 2406.11713 link
2024-06-17 MusicScore: A Dataset for Music Score Modeling and Generation Yuheng Lin et.al. 2406.11462 link
2024-06-17 AnyTrans: Translate AnyText in the Image with Large Scale Models Zhipeng Qian et.al. 2406.11432 null
2024-06-17 DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer Keon Lee et.al. 2406.11427 null
2024-06-17 Unfolding Time: Generative Modeling for Turbulent Flows in 4D Abdullah Saydemir et.al. 2406.11390 null
2024-06-17 Diffusion Models in Low-Level Vision: A Survey Chunming He et.al. 2406.11138 link
2024-06-16 Exploiting Diffusion Prior for Out-of-Distribution Detection Armando Zhu et.al. 2406.11105 null
2024-06-16 An Analysis on Quantizing Diffusion Transformers Yuewei Yang et.al. 2406.11100 null
2024-06-16 A Bayesian Drift-Diffusion Model of Schachter-Singer’s Two Factor Theory of Emotion Lance Ying et.al. 2406.11086 null
2024-06-16 ViD-GPT: Introducing GPT-style Autoregressive Generation in Video Diffusion Models Kaifeng Gao et.al. 2406.10981 link
2024-06-16 Graph Neural Reaction Diffusion Models Moshe Eliasof et.al. 2406.10871 null
2024-06-16 Diffusion Model With Optimal Covariance Matching Zijing Ou et.al. 2406.10808 null
2024-06-16 Diffusion Models Are Promising for Ab Initio Structure Solutions from Nanocrystalline Powder Diffraction Data Gabe Guo et.al. 2406.10796 link
2024-06-15 Beyond the Visible: Jointly Attending to Spectral and Spatial Dimensions with HSI-Diffusion for the FINCH Spacecraft Ian Vyse et.al. 2406.10724 link
2024-06-18 A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing Ming Meng et.al. 2406.10553 null
2024-06-15 Self-Supervised Vision Transformer for Enhanced Virtual Clothes Try-On Lingxiao Lu et.al. 2406.10539 null
2024-06-15 Lift Your Molecules: Molecular Graph Generation in Latent Euclidean Space Mohamed Amine Ketata et.al. 2406.10513 null
2024-06-12 Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation Raphael Tang et.al. 2406.08482 null
2024-06-12 Human 3Diffusion: Realistic Avatar Creation via Explicit 3D Consistent Diffusion Models Yuxuan Xue et.al. 2406.08475 null
2024-06-12 $\texttt{DiffLense}$ : A Conditional Diffusion Model for Super-Resolution of Gravitational Lensing Data Pranath Reddy et.al. 2406.08442 null
2024-06-12 Diffusion Soup: Model Merging for Text-to-Image Diffusion Models Benjamin Biggs et.al. 2406.08431 null
2024-06-12 FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation Xinzhi Mu et.al. 2406.08392 null
2024-06-12 Diff-A-Riff: Musical Accompaniment Co-creation via Latent Diffusion Models Javier Nistal et.al. 2406.08384 null
2024-06-12 2.5D Multi-view Averaging Diffusion Model for 3D Medical Image Translation: Application to Low-count PET Reconstruction with CT-less Attenuation Correction Tianqi Chen et.al. 2406.08374 null
2024-06-12 WMAdapter: Adding WaterMark Control to Latent Diffusion Models Hai Ci et.al. 2406.08337 null
2024-06-12 Dataset Enhancement with Instance-Level Augmentations Orest Kupyn et.al. 2406.08249 link
2024-06-12 Diffusion-Promoted HDR Video Reconstruction Yuanshen Guan et.al. 2406.08204 null
2024-06-12 LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation Wenhao Guan et.al. 2406.08203 link
2024-06-12 One-Step Effective Diffusion Network for Real-World Image Super-Resolution Rongyuan Wu et.al. 2406.08177 link
2024-06-12 Defect-related Anomalous Mobility of Small polarons in Oxides: the Case of Congruent Lithium Niobate Anton Pfannstiel et.al. 2406.08123 null
2024-06-12 Make Your Actor Talk: Generalizable and High-Fidelity Lip Sync with Motion and Appearance Disentanglement Runyi Yu et.al. 2406.08096 null
2024-06-12 CFG++: Manifold-constrained Classifier Free Guidance for Diffusion Models Hyungjin Chung et.al. 2406.08070 null
2024-06-12 Ablation Based Counterfactuals Zheng Dai et.al. 2406.07908 null
2024-06-12 DiffPop: Plausibility-Guided Object Placement Diffusion for Image Composition Jiacheng Liu et.al. 2406.07852 null
2024-06-12 Hierarchical Patch Diffusion Models for High-Resolution Video Generation Ivan Skorokhodov et.al. 2406.07792 null
2024-06-11 HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction Awareness Zihui Xue et.al. 2406.07754 null
2024-06-11 CUPID: Contextual Understanding of Prompt-conditioned Image Distributions Yayan Zhao et.al. 2406.07699 null
2024-06-10 IllumiNeRF: 3D Relighting without Inverse Rendering Xiaoming Zhao et.al. 2406.06527 null
2024-06-10 Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation Peize Sun et.al. 2406.06525 link
2024-06-10 Monkey See, Monkey Do: Harnessing Self-attention in Motion Diffusion for Zero-shot Motion Transfer Sigal Raab et.al. 2406.06508 link
2024-06-10 AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction Zhen Xing et.al. 2406.06465 null
2024-06-10 Cometh: A continuous-time discrete-state graph diffusion model Antoine Siraudin et.al. 2406.06449 null
2024-06-10 Margin-aware Preference Optimization for Aligning Diffusion Models without Reference Jiwoo Hong et.al. 2406.06424 null
2024-06-10 Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization Yi Gu et.al. 2406.06382 link
2024-06-10 Improving Deep Learning-based Automatic Cranial Defect Reconstruction by Heavy Data Augmentation: From Image Registration to Latent Diffusion Models Marek Wodzinski et.al. 2406.06372 null
2024-06-10 MVGamba: Unify 3D Content Generation as State Space Sequence Modeling Xuanyu Yi et.al. 2406.06367 link
2024-06-11 Tuning-Free Visual Customization via View Iterative Self-Attention Control Xiaojie Li et.al. 2406.06258 link
2024-06-10 Data Augmentation in Earth Observation: A Diffusion Model Approach Tiago Sousa et.al. 2406.06218 null
2024-06-10 The Effect of Training Dataset Size on Discriminative and Diffusion-Based Speech Enhancement Systems Philippe Gonzalez et.al. 2406.06160 null
2024-06-10 Thunder : Unified Regression-Diffusion Speech Enhancement with a Single Reverse Step using Brownian Bridge Thanapat Trachu et.al. 2406.06139 null
2024-06-10 DiffInject: Revisiting Debias via Synthetic Data Generation using Diffusion-based Style Injection Donggeun Ko et.al. 2406.06134 null
2024-06-10 ExtraNeRF: Visibility-Aware View Extrapolation of Neural Radiance Fields with Diffusion Models Meng-Li Shih et.al. 2406.06133 null
2024-06-10 Latent Representation Matters: Human-like Sketches in One-shot Drawing Tasks Victor Boutin et.al. 2406.06079 null
2024-06-10 Generalizable Human Gaussians from Single-View Image Jinnan Chen et.al. 2406.06050 link
2024-06-10 Synthesizing Efficient Data with Diffusion Models for Person Re-Identification Pre-Training Ke Niu et.al. 2406.06045 link
2024-06-10 FRAG: Frequency Adapting Group for Diffusion Video Editing Sunjae Yoon et.al. 2406.06044 link
2024-06-09 Improving Antibody Design with Force-Guided Sampling in Diffusion Models Paulina Kulytė et.al. 2406.05832 null
2024-06-07 Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion Fangfu Liu et.al. 2406.04338 null
2024-06-06 Coherent Zero-Shot Visual Instruction Generation Quynh Phung et.al. 2406.04337 null
2024-06-06 BitsFusion: 1.99 bits Weight Quantization of Diffusion Model Yang Sui et.al. 2406.04333 link
2024-06-06 Simplified and Generalized Masked Diffusion for Discrete Data Jiaxin Shi et.al. 2406.04329 link
2024-06-06 SF-V: Single Forward Video Generation Model Zhixing Zhang et.al. 2406.04324 link
2024-06-06 ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories Qianlan Yang et.al. 2406.04323 null
2024-06-07 DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data Qihao Liu et.al. 2406.04322 link
2024-06-06 Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step Zhanhao Liang et.al. 2406.04314 link
2024-06-06 Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment Jiayi Guo et.al. 2406.04295 link
2024-06-06 VideoTetris: Towards Compositional Text-to-Video Generation Ye Tian et.al. 2406.04277 link
2024-06-06 A Survey on 3D Human Avatar Modeling – From Reconstruction to Generation Ruihe Wang et.al. 2406.04253 null
2024-06-06 Diffusion-based image inpainting with internal learning Nicolas Cherel et.al. 2406.04206 link
2024-06-06 Multistep Distillation of Diffusion Models via Moment Matching Tim Salimans et.al. 2406.04103 null
2024-06-06 Enhancing Weather Predictions: Super-Resolution via Deep Diffusion Models Jan Martinů et.al. 2406.04099 null
2024-06-06 LDM-RSIC: Exploring Distortion Prior with Latent Diffusion Models for Remote Sensing Image Compression Junhui Li et.al. 2406.03961 link
2024-06-06 LLplace: The 3D Indoor Scene Layout Generation and Editing via Large Language Model Yixuan Yang et.al. 2406.03866 null
2024-06-06 Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data Jingyang Ou et.al. 2406.03736 link
2024-06-06 JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against Diffusion Model Edits Minzhou Pan et.al. 2406.03720 link
2024-06-06 Pi-fusion: Physics-informed diffusion model for learning fluid dynamics Jing Qiu et.al. 2406.03711 null
2024-06-06 Mean-variance portfolio selection in jump-diffusion model under no-shorting constraint: A viscosity solution approach Xiaomin Shi et.al. 2406.03709 null
2024-06-05 Text-to-Events: Synthetic Event Camera Streams from Conditional Text Input Joachim Ott et.al. 2406.03439 null
2024-06-05 Text-to-Image Rectified Flow as Plug-and-Play Priors Xiaofeng Yang et.al. 2406.03293 link
2024-06-05 Generative Diffusion Models for Fast Simulations of Particle Collisions at CERN Mikołaj Kita et.al. 2406.03233 null
2024-06-05 Searching Priors Makes Text-to-Video Synthesis Better Haoran Cheng et.al. 2406.03215 null
2024-06-05 Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion Hao Wen et.al. 2406.03184 link
2024-06-05 Tiny models from tiny data: Textual and null-text inversion for few-shot distillation Erik Landolsi et.al. 2406.03146 link
2024-06-05 Floating Anchor Diffusion Model for Multi-motif Scaffolding Ke Liu et.al. 2406.03141 link
2024-06-05 Phy-Diff: Physics-guided Hourglass Diffusion Model for Diffusion MRI Synthesis Juanhua Zhang et.al. 2406.03002 null
2024-06-05 Exploring Data Efficiency in Zero-Shot Learning with Diffusion Models Zihan Ye et.al. 2406.02929 null
2024-06-06 U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation Chenxin Li et.al. 2406.02918 null
2024-06-05 TSPDiffuser: Diffusion Models as Learned Samplers for Traveling Salesperson Path Planning Problems Ryo Yonetani et.al. 2406.02858 null
2024-06-04 ORACLE: Leveraging Mutual Information for Consistent Character Generation with LoRAs in Diffusion Models Kiymet Akdemir et.al. 2406.02820 null
2024-06-04 Diffusion-Refined VQA Annotations for Semi-Supervised Gaze Following Qiaomu Miao et.al. 2406.02774 null
2024-06-04 Neural Representations of Dynamic Visual Stimuli Jacob Yeung et.al. 2406.02659 null
2024-06-04 Dreamguider: Improved Training free Diffusion-based Conditional Generation Nithin Gopalakrishnan Nair et.al. 2406.02549 null
2024-06-06 Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting Inkyu Shin et.al. 2406.02541 null
2024-06-04 CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation Dejia Xu et.al. 2406.02509 null
2024-06-04 Guiding a Diffusion Model with a Bad Version of Itself Tero Karras et.al. 2406.02507 link
2024-06-04 Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation Jiajun Wang et.al. 2406.02485 link
2024-06-04 Inpainting Pathology in Lumbar Spine MRI with Latent Diffusion Colin Hansen et.al. 2406.02477 null
2024-05-31 Mixed Diffusion for 3D Indoor Scene Synthesis Siyi Hu et.al. 2405.21066 link
2024-05-31 Unified Directly Denoising for Both Variance Preserving and Variance Exploding Diffusion Models Jingjing Wang et.al. 2405.21059 null
2024-05-31 Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models Xinxi Zhang et.al. 2405.21050 null
2024-05-31 Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling Jiatao Gu et.al. 2405.21048 null
2024-05-31 Amortizing intractable inference in diffusion models for vision, language, and control Siddarth Venkatraman et.al. 2405.20971 link
2024-05-31 Flow matching achieves minimax optimal convergence Kenji Fukumizu et.al. 2405.20879 null
2024-05-31 MegActor: Harness the Power of Raw Video for Vivid Portrait Animation Shurong Yang et.al. 2405.20851 link
2024-05-31 Share Your Secrets for Privacy! Confidential Forecasting with Vertical Federated Learning Aditya Shankar et.al. 2405.20761 link
2024-05-31 Information Theoretic Text-to-Image Alignment Chao Wang et.al. 2405.20759 null
2024-05-31 Diffusion Models Are Innate One-Step Generators Bowen Zheng et.al. 2405.20750 link
2024-05-31 Unleashing the Potential of Diffusion Models for Incomplete Data Imputation Hengrui Zhang et.al. 2405.20690 link
2024-05-31 Adv-KD: Adversarial Knowledge Distillation for Faster Diffusion Sampling Kidist Amde Mekonnen et.al. 2405.20675 link
2024-05-31 4Diffusion: Multi-view Video Diffusion Model for 4D Generation Haiyu Zhang et.al. 2405.20674 null
2024-05-31 Fourier123: One Image to High-Quality 3D Object Generation with Hybrid Fourier Score Distillation Shuzhou Yang et.al. 2405.20669 link
2024-05-31 GenMix: Combining Generative and Mixture Data Augmentation for Medical Image Classification Hansang Lee et.al. 2405.20650 null
2024-06-03 Stochastic Optimal Control for Diffusion Bridges in Function Spaces Byoungwoo Park et.al. 2405.20630 link
2024-05-31 Disrupting Diffusion: Token-Level Attention Erasure Attack against Diffusion-based Customization Yisu Liu et.al. 2405.20584 link
2024-05-31 Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning Linjiajie Fang et.al. 2405.20555 link
2024-05-30 Diffusion On Syntax Trees For Program Synthesis Shreyas Kapur et.al. 2405.20519 null
2024-05-30 Slight Corruption in Pre-training Data Makes Better Diffusion Models Hao Chen et.al. 2405.20494 null
2024-05-30 Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image Kailu Wu et.al. 2405.20343 link
2024-05-30 VividDream: Generating 3D Scene with Ambient Dynamics Yao-Chih Lee et.al. 2405.20334 null
2024-05-30 MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion Shuyuan Tu et.al. 2405.20325 link
2024-05-30 Don’t drop your samples! Coherence-aware training benefits Conditional diffusion Nicolas Dufour et.al. 2405.20324 null
2024-05-30 Improving the Training of Rectified Flows Sangyun Lee et.al. 2405.20320 link
2024-05-30 DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music Generation Zachary Novack et.al. 2405.20289 null
2024-05-30 MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model Muyao Niu et.al. 2405.20222 link
2024-05-30 Boost Your Own Human Image Generation Model via Direct Preference Optimization with AI Feedback Sanghyeon Na et.al. 2405.20216 null
2024-05-30 MotionDreamer: Zero-Shot 3D Mesh Animation from Video Diffusion Models Lukas Uzolas et.al. 2405.20155 null
2024-05-31 DP-IQA: Utilizing Diffusion Prior for Blind Image Quality Assessment in the Wild Honghao Fu et.al. 2405.19996 link
2024-05-30 DiffPhysBA: Diffusion-based Physical Backdoor Attack against Person Re-Identification in Real-World Wenli Sun et.al. 2405.19990 null
2024-05-30 PLA4D: Pixel-Level Alignments for Text-to-4D Gaussian Splatting Qiaowei Miao et.al. 2405.19957 link
2024-05-30 Exploring Diffusion Models’ Corruption Stage in Few-Shot Fine-tuning and Mitigating with Bayesian Neural Networks Xiaoyu Wu et.al. 2405.19931 null
2024-05-30 Learning from Random Demonstrations: Offline Reinforcement Learning with Importance-Sampled Diffusion Models Zeyu Fang et.al. 2405.19878 null
2024-05-31 HQ-DiT: Efficient Diffusion Transformer with FP4 Hybrid Quantization Wenxuan Liu et.al. 2405.19751 null
2024-05-30 Streaming Video Diffusion: Online Video Editing with Diffusion Models Feng Chen et.al. 2405.19726 link
2024-05-30 Text Guided Image Editing with Automatic Concept Locating and Forgetting Jia Li et.al. 2405.19708 null
2024-05-30 Diffusion Policies creating a Trust Region for Offline Reinforcement Learning Tianyu Chen et.al. 2405.19690 link
2024-05-30 Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models Masatoshi Uehara et.al. 2405.19673 null
2024-05-29 Blind Image Restoration via Fast Diffusion Inversion Hamadi Chihaoui et.al. 2405.19572 link
2024-05-29 ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning Ruchika Chavhan et.al. 2405.19237 link
2024-05-30 $E^{3}$ Gen: Efficient, Expressive and Editable Avatars Generation Weitian Zhang et.al. 2405.19203 null
2024-05-29 Diffusion-based Dynamics Models for Long-Horizon Rollout in Offline Reinforcement Learning Hanye Zhao et.al. 2405.19189 link
2024-05-29 Tuning-Free Alignment of Diffusion Models with Direct Noise Optimization Zhiwei Tang et.al. 2405.18881 link
2024-05-29 Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play Priors Zihui Wu et.al. 2405.18782 link
2024-05-29 RNAFlow: RNA Structure & Sequence Design via Inverse Folding-Based Flow Matching Divya Nori et.al. 2405.18768 link
2024-05-29 Stationary distribution approximations of Two-island Wright-Fisher and seed-bank models using Stein’s method Han L. Gan et.al. 2405.18763 null
2024-05-29 Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning Tianle Zhang et.al. 2405.18729 null
2024-05-29 Reverse the auditory processing pathway: Coarse-to-fine audio reconstruction from fMRI Che Liu et.al. 2405.18726 null
2024-05-29 Learning Diffeomorphism for Image Registration with Time-Continuous Networks using Semigroup Regularization Mohammadjavad Matinkia et.al. 2405.18684 link
2024-05-29 Zero-to-Hero: Enhancing Zero-Shot Novel View Synthesis via Attention Map Filtering Ido Sobol et.al. 2405.18677 null
2024-05-28 DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention Lianghui Zhu et.al. 2405.18428 link
2024-05-28 Phased Consistency Model Fu-Yun Wang et.al. 2405.18407 link
2024-05-28 RACCooN: Remove, Add, and Change Video Content with Auto-Generated Narratives Jaehong Yoon et.al. 2405.18406 link
2024-05-28 Multi-modal Generation via Cross-Modal In-Context Learning Amandeep Kumar et.al. 2405.18304 link
2024-05-28 CT-based brain ventricle segmentation via diffusion Schrödinger Bridge without target domain ground truths Reihaneh Teimouri et.al. 2405.18267 link
2024-05-28 EG4D: Explicit Generation of 4D Object without Score Distillation Qi Sun et.al. 2405.18132 link
2024-05-28 Are Image Distributions Indistinguishable to Humans Indistinguishable to Classifiers? Zebin You et.al. 2405.18029 null
2024-05-28 Unveiling the Power of Diffusion Features For Personalized Segmentation and Retrieval Dvir Samuel et.al. 2405.18025 link
2024-05-28 MAVIN: Multi-Action Video Generation with Diffusion Models via Transition Video Infilling Bowen Zhang et.al. 2405.18003 link
2024-05-27 Human4DiT: Free-view Human Video Generation with 4D Diffusion Transformer Ruizhi Shao et.al. 2405.17405 null
2024-05-27 A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training Kai Wang et.al. 2405.17403 link
2024-05-27 RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control Litu Rout et.al. 2405.17401 null
2024-05-27 EASI-Tex: Edge-Aware Mesh Texturing from Single Image Sai Raj Kishore Perla et.al. 2405.17393 null
2024-05-28 Controllable Longer Image Animation with Diffusion Models Qiang Wang et.al. 2405.17306 null
2024-05-27 Does Diffusion Beat GAN in Image Super Resolution? Denis Kuznedelev et.al. 2405.17261 link
2024-05-27 DreamMat: High-quality PBR Material Generation with Geometry- and Light-aware Diffusion Models Yuqing Zhang et.al. 2405.17176 null
2024-05-27 Partitioned Hankel-based Diffusion Models for Few-shot Low-dose CT Reconstruction Wenhao Zhang et.al. 2405.17167 null
2024-05-27 PatchScaler: An Efficient Patch-independent Diffusion Model for Super-Resolution Yong Liu et.al. 2405.17158 link
2024-05-27 Ensembling Diffusion Models via Adaptive Feature Aggregation Cong Wang et.al. 2405.17082 link
2024-05-27 The Poisson Midpoint Method for Langevin Dynamics: Provably Efficient Discretization for Diffusion Models Saravanan Kandasamy et.al. 2405.17068 null
2024-05-27 Glauber Generative Model: Discrete Diffusion Models via Binary Classification Harshit Varma et.al. 2405.17035 null
2024-05-27 $\text{Di}^2\text{Pose}$ : Discrete Diffusion Model for Occluded 3D Human Pose Estimation Weiquan Wang et.al. 2405.17016 null
2024-05-28 MotionLLM: Multimodal Motion-Language Learning with Large Language Models Qi Wu et.al. 2405.17013 link
2024-05-27 A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition Zilu Guo et.al. 2405.16952 link
2024-05-27 Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models Qian Wang et.al. 2405.16947 link
2024-05-27 PASTA: Pathology-Aware MRI to PET Cross-Modal Translation with Diffusion Models Yitong Li et.al. 2405.16942 link
2024-05-28 GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning Jaewoo Lee et.al. 2405.16907 link
2024-05-27 Anonymization Prompt Learning for Facial Privacy-Preserving Text-to-Image Generation Liang Shi et.al. 2405.16895 null
2024-05-27 Part123: Part-aware 3D Reconstruction from a Single-view Image Anran Liu et.al. 2405.16888 null
2024-05-23 Improved Distribution Matching Distillation for Fast Image Synthesis Tianwei Yin et.al. 2405.14867 link
2024-05-23 Video Diffusion Models are Training-free Motion Interpreter and Controller Zeqi Xiao et.al. 2405.14864 null
2024-05-23 Adapting to Unknown Low-Dimensional Structures in Score-Based Diffusion Models Gen Li et.al. 2405.14861 null
2024-05-23 Semantica: An Adaptable Image-Conditioned Diffusion Model Manoj Kumar et.al. 2405.14857 null
2024-05-23 TerDiT: Ternary Diffusion Models with Transformers Xudong Lu et.al. 2405.14854 link
2024-05-23 Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer Shuang Wu et.al. 2405.14832 null
2024-05-23 Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models Katherine Xu et.al. 2405.14828 null
2024-05-23 PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher Dongjun Kim et.al. 2405.14822 link
2024-05-24 Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation Hongxu Jiang et.al. 2405.14802 link
2024-05-23 Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy Shengfang Zhai et.al. 2405.14800 link
2024-05-23 EditWorld: Simulating World Dynamics for Instruction-Following Image Editing Ling Yang et.al. 2405.14785 link
2024-05-23 Physics-informed Score-based Diffusion Model for Limited-angle Reconstruction of Cardiac Computed Tomography Shuo Han et.al. 2405.14770 link
2024-05-23 RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance Zhicheng Sun et.al. 2405.14677 link
2024-05-23 Reinforcement Learning for Fine-tuning Text-to-speech Diffusion Models Jingyi Chen et.al. 2405.14632 null
2024-05-23 Neuroexplicit Diffusion Models for Inpainting of Optical Flow Fields Tom Fischer et.al. 2405.14599 null
2024-05-23 Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation Shiqi Yang et.al. 2405.14598 null
2024-05-23 LDM: Large Tensorial SDF Model for Textured Mesh Generation Rengan Xie et.al. 2405.14580 link
2024-05-23 Regressor-free Molecule Generation to Support Drug Response Prediction Kun Li et.al. 2405.14536 null
2024-05-23 LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models Seyedmorteza Sadat et.al. 2405.14477 null
2024-05-23 TIGER: Text-Instructed 3D Gaussian Retrieval and Coherent Editing Teng Xu et.al. 2405.14455 null
2024-05-21 Personalized Residuals for Concept-Driven Text-to-Image Generation Cusuh Ham et.al. 2405.12978 null
2024-05-21 Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control Yue Han et.al. 2405.12970 null
2024-05-21 Impact of inhomogeneous diffusion on secondary cosmic ray and antiproton local spectra Álvaro Tovar-Pardo et.al. 2405.12918 null
2024-05-21 Diffusion-RSCC: Diffusion Probabilistic Model for Change Captioning in Remote Sensing Images Xiaofei Yu et.al. 2405.12875 link
2024-05-21 Model Free Prediction with Uncertainty Assessment Yuling Jiao et.al. 2405.12684 null
2024-05-21 CustomText: Customized Textual Image Generation using Diffusion Models Shubham Paliwal et.al. 2405.12531 null
2024-05-21 Customize Your Own Paired Data via Few-shot Way Jinshu Chen et.al. 2405.12490 null
2024-05-21 One-step data-driven generative model via Schrödinger Bridge Hanwen Huang et.al. 2405.12453 null
2024-05-20 Diffusion for World Modeling: Visual Details Matter in Atari Eloi Alonso et.al. 2405.12399 link
2024-05-20 Images that Sound: Composing Images and Sounds on a Single Canvas Ziyang Chen et.al. 2405.12221 null
2024-05-20 Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices Nathaniel Cohen et.al. 2405.12211 link
2024-05-20 Nonequilbrium physics of generative diffusion models Zhendong Yu et.al. 2405.11932 null
2024-05-20 “Set It Up!”: Functional Object Arrangement with Compositional Generative Models Yiqing Xu et.al. 2405.11928 null
2024-05-20 Diff-BGM: A Diffusion Model for Video Background Music Generation Sizhe Li et.al. 2405.11913 link
2024-05-20 Out-of-Distribution Detection with a Single Unconditional Diffusion Model Alvin Heng et.al. 2405.11881 link
2024-05-20 Evolving Storytelling: Benchmarks and Methods for New Character Customization with Diffusion Models Xiyu Wang et.al. 2405.11852 null
2024-05-20 Alternators For Sequence Modeling Mohammad Reza Rezaei et.al. 2405.11848 null
2024-05-20 ViViD: Video Virtual Try-on using Diffusion Models Zixun Fang et.al. 2405.11794 null
2024-05-20 Guided Multi-objective Generative AI to Enhance Structure-based Drug Design Amit Kadan et.al. 2405.11785 link
2024-05-20 Diffusion Models for Generating Ballistic Spacecraft Trajectories Tyler Presser et.al. 2405.11738 link
2024-05-19 InterAct: Capture and Modelling of Realistic, Expressive and Interactive Activities between Two Persons in Daily Scenarios Yinghao Huang et.al. 2405.11690 null
2024-05-19 Uncertainty-Aware PPG-2-ECG for Enhanced Cardiovascular Diagnosis using Diffusion Models Omer Belhasin et.al. 2405.11566 null
2024-05-19 Diffusion-Based Hierarchical Image Steganography Youmin Xu et.al. 2405.11523 null
2024-05-19 FIFO-Diffusion: Generating Infinite Videos from Text without Training Jihwan Kim et.al. 2405.11473 link
2024-05-19 Discrete-state Continuous-time Diffusion for Graph Generation Zhe Xu et.al. 2405.11416 link
2024-05-18 On the Trajectory Regularity of ODE-based Diffusion Sampling Defang Chen et.al. 2405.11326 link
2024-05-18 Diffusion Model Driven Test-Time Image Adaptation for Robust Skin Lesion Classification Ming Hu et.al. 2405.11289 null
2024-05-18 HR Human: Modeling Human Avatars with Triangular Mesh and High-Resolution Textures from Videos Qifeng Chen et.al. 2405.11270 null
2024-05-18 AquaLoRA: Toward White-box Protection for Customized Stable Diffusion Models via Watermark LoRA Weitao Feng et.al. 2405.11135 link
2024-05-16 Text-to-Vector Generation with Neural Path Representation Peiying Zhang et.al. 2405.10317 null
2024-05-16 Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model Zheng Gu et.al. 2405.10316 null
2024-05-16 CAT3D: Create Anything in 3D with Multi-View Diffusion Models Ruiqi Gao et.al. 2405.10314 null
2024-05-16 Generating Coherent Sequences of Visual Illustrations for Real-World Manual Tasks João Bordalo et.al. 2405.10122 null
2024-05-16 Spurious reconstruction from brain activity Ken Shirakawa et.al. 2405.10078 link
2024-05-16 Frequency-Domain Refinement with Multiscale Diffusion for Super Resolution Xingjian Wang et.al. 2405.10014 null
2024-05-16 VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce Marketing Binghui Chen et.al. 2405.09985 null
2024-05-16 Language-Oriented Semantic Latent Representation for Image Transmission Giordano Cicchetti et.al. 2405.09976 link
2024-05-16 Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models Ziyu Wang et.al. 2405.09901 link
2024-05-16 DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy Protection Yuhao Sun et.al. 2405.09882 link
2024-05-16 Dual3D: Efficient and Consistent Text-to-3D Generation with Dual-mode Multi-view Latent Diffusion Xinyang Li et.al. 2405.09874 null
2024-05-16 Rethinking Multi-User Semantic Communications with Deep Generative Models Eleonora Grassucci et.al. 2405.09866 null
2024-05-16 MediSyn: Text-Guided Diffusion Models for Broad Medical 2D and 3D Image Synthesis Joseph Cho et.al. 2405.09806 null
2024-05-15 A Survey of Generative Techniques for Spatial-Temporal Data Mining Qianru Zhang et.al. 2405.09592 null
2024-05-16 MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal Cancer Chengyu Wu et.al. 2405.09539 link
2024-05-15 Diffusion-based Contrastive Learning for Sequential Recommendation Ziqiang Cui et.al. 2405.09369 link
2024-05-15 Dance Any Beat: Blending Beats with Visuals in Dance Video Generation Xuanchen Wang et.al. 2405.09266 null
2024-05-15 SOEDiff: Efficient Distillation for Small Object Editing Qihe Pan et.al. 2405.09114 null
2024-05-15 RSHazeDiff: A Unified Fourier-aware Diffusion Model for Remote Sensing Image Dehazing Jiamei Xiong et.al. 2405.09083 link
2024-05-17 Naturalistic Music Decoding from EEG Data via Latent Diffusion Models Emilian Postolache et.al. 2405.09062 null
2024-05-15 Response Matching for generating materials and molecules Bingqing Cheng et.al. 2405.09057 null
2024-05-15 CTS: A Consistency-Based Medical Image Segmentation Model Kejia Zhang et.al. 2405.09056 link
2024-05-14 Expensive Multi-Objective Bayesian Optimization Based on Diffusion Models Bingdong Li et.al. 2405.08674 null
2024-05-14 Towards Multi-Task Generative-AI Edge Services with an Attention-based Diffusion DRL Approach Yaju Liu et.al. 2405.08328 null
2024-05-14 Compositional Text-to-Image Generation with Dense Blob Representations Weili Nie et.al. 2405.08246 null
2024-05-13 Infinite Texture: Text-guided High Resolution Diffusion Texture Synthesis Yifan Wang et.al. 2405.08210 null
2024-05-13 Do Bayesian imaging methods report trustworthy probabilities? David Y. W. Thong et.al. 2405.08179 null
2024-05-13 DiffTF++: 3D-aware Diffusion Transformer for Large-Vocabulary 3D Generation Ziang Cao et.al. 2405.08055 link
2024-05-13 Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning Wenqi Dong et.al. 2405.08054 null
2024-05-13 Stable Diffusion-based Data Augmentation for Federated Learning with Non-IID Data Mahdi Morafah et.al. 2405.07925 null
2024-05-13 CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models Nick Stracke et.al. 2405.07913 null
2024-05-13 SAR Image Synthesis with Diffusion Models Denisa Qosja et.al. 2405.07776 null
2024-05-13 CDFormer:When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution Qingguo Liu et.al. 2405.07648 link
2024-05-13 De novo antibody design with SE(3) diffusion Daniel Cutting et.al. 2405.07622 null
2024-05-13 Reducing Risk for Assistive Reinforcement Learning Policies with Diffusion Models Andrii Tytarenko et.al. 2405.07603 null
2024-05-13 PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator Hanshu Yan et.al. 2405.07510 link
2024-05-13 GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting Haodong Chen et.al. 2405.07472 null
2024-05-12 Erasing Concepts from Text-to-Image Diffusion Models with Few-shot Unlearning Masane Fuchi et.al. 2405.07288 link
2024-05-12 Modeling Pedestrian Intrinsic Uncertainty for Multimodal Stochastic Trajectory Prediction via Energy Plan Denoising Yao Liu et.al. 2405.07164 null
2024-05-12 Stable Signature is Unstable: Removing Image Watermark from Diffusion Models Yuepeng Hu et.al. 2405.07145 null
2024-05-11 Diffusion models as probabilistic neural operators for recovering unobserved states of dynamical systems Katsiaryna Haitsiukevich et.al. 2405.07097 null
2024-05-11 Semantic Guided Large Scale Factor Remote Sensing Image Super-resolution with Generative Diffusion Prior Ce Wang et.al. 2405.07044 link
2024-05-11 Non-confusing Generation of Customized Concepts in Diffusion Models Wang Lin et.al. 2405.06914 null
2024-05-10 Self-Consistent Recursive Diffusion Bridge for Medical Image Translation Fuat Arslan et.al. 2405.06789 link
2024-05-10 Shape Conditioned Human Motion Generation with Diffusion Model Kebing Xue et.al. 2405.06778 null
2024-05-10 OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation Jinwei Lin et.al. 2405.06547 link
2024-05-14 SketchDream: Sketch-based Text-to-3D Generation and Editing Feng-Lin Liu et.al. 2405.06461 null
2024-05-10 PUMA: margin-based data pruning Javier Maroto et.al. 2405.06298 null
2024-05-10 Prior-guided Diffusion Model for Cell Segmentation in Quantitative Phase Imaging Zhuchen Shao et.al. 2405.06175 null
2024-05-09 Distilling Diffusion Models into Conditional GANs Minguk Kang et.al. 2405.05967 null
2024-05-09 Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting Mask Zineb Senane et.al. 2405.05959 link
2024-05-09 Frame Interpolation with Consecutive Brownian Bridge Diffusion Zonglin Lyu et.al. 2405.05953 link
2024-05-09 Composable Part-Based Manipulation Weiyu Liu et.al. 2405.05876 null
2024-05-09 Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control Gunshi Gupta et.al. 2405.05852 link
2024-05-09 Could It Be Generated? Towards Practical Analysis of Memorization in Text-To-Image Diffusion Models Zhe Ma et.al. 2405.05846 link
2024-05-09 MSDiff: Multi-Scale Diffusion Model for Ultra-Sparse View CT Reconstruction Pinhuang Tan et.al. 2405.05814 null
2024-05-10 MasterWeaver: Taming Editability and Identity for Personalized Text-to-Image Generation Yuxiang Wei et.al. 2405.05806 link
2024-05-09 DragGaussian: Enabling Drag-style Manipulation on 3D Gaussian Representation Sitian Shen et.al. 2405.05800 null
2024-05-09 Sequential Amodal Segmentation via Cumulative Occlusion Learning Jiayang Ao et.al. 2405.05791 null
2024-05-09 DP-MDM: Detail-Preserving MR Reconstruction via Multiple Diffusion Models Mengxiao Geng et.al. 2405.05763 link
2024-05-09 LatentColorization: Latent Diffusion-Based Speaker Video Colorization Rory Ward et.al. 2405.05707 null
2024-05-09 StableMoFusion: Towards Robust and Efficient Diffusion-based Motion Generation Framework Yiheng Huang et.al. 2405.05691 null
2024-05-09 SubGDiff: A Subgraph Diffusion Model to Improve Molecular Representation Learning Jiying Zhang et.al. 2405.05665 link
2024-05-09 AI in Your Toolbox: A Plugin for Generating Renderings from 3D Models Mingming Wang et.al. 2405.05627 null
2024-05-09 Denoising Diffusion Delensing Delight: Reconstructing the Non-Gaussian CMB Lensing Potential with Diffusion Models Thomas Flöss et.al. 2405.05598 link
2024-05-09 Vision-Language Modeling with Regularized Spatial Transformer Networks for All Weather Crosswind Landing of Aircraft Debabrata Pal et.al. 2405.05574 null
2024-05-09 A Survey on Personalized Content Synthesis with Diffusion Models Xulu Zhang et.al. 2405.05538 null
2024-05-08 Diffusion-HMC: Parameter Inference with Diffusion Model driven Hamiltonian Monte Carlo Nayantara Mudur et.al. 2405.05255 link
2024-05-08 Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models Hongjie Wang et.al. 2405.05252 null
2024-05-08 Imagine Flash: Accelerating Emu Diffusion Models with Backward Distillation Jonas Kohler et.al. 2405.05224 null
2024-05-08 FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models Jinglin Xu et.al. 2405.05216 link
2024-05-08 An anti-noise seismic inversion method based on diffusion model Yingtian Liu et.al. 2405.05026 link
2024-05-08 Discrepancy-based Diffusion Models for Lesion Detection in Brain MRI Keqiang Fan et.al. 2405.04974 null
2024-05-08 Empowering Wireless Networks with Artificial Intelligence Generated Graph Jiacheng Wang et.al. 2405.04907 null
2024-05-08 Fast LiDAR Upsampling using Conditional Diffusion Models Sander Elias Magnussen Helgesen et.al. 2405.04889 link
2024-05-08 FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation Xuehai He et.al. 2405.04834 null
2024-05-08 Variational Schrödinger Diffusion Models Wei Deng et.al. 2405.04795 null
2024-05-07 Remote Diffusion Kunal Sunil Kasodekar et.al. 2405.04717 null
2024-05-07 TexControl: Sketch-Based Two-Stage Fashion Image Generation Using Diffusion Model Yongming Zhang et.al. 2405.04675 null
2024-05-07 Tactile-Augmented Radiance Fields Yiming Dou et.al. 2405.04534 link
2024-05-07 Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video Motion Editing Yi Zuo et.al. 2405.04496 null
2024-05-07 CloudDiff: Super-resolution ensemble retrieval of cloud properties for all day using the generative diffusion model Haixia Xiao et.al. 2405.04483 null
2024-05-07 Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos Junyi Ma et.al. 2405.04370 link
2024-05-07 Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation Jihyun Kim et.al. 2405.04356 link
2024-05-08 Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer Zhuoyi Yang et.al. 2405.04312 link
2024-05-07 BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models Eloi Moliner et.al. 2405.04272 null
2024-05-07 Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models Fan Bao et.al. 2405.04233 null
2024-05-06 Bridging discrete and continuous state spaces: Exploring the Ehrenfest process in time-continuous diffusion models Ludwig Winkler et.al. 2405.03549 null
2024-05-06 CCDM: Continuous Conditional Diffusion Models for Image Generation Xin Ding et.al. 2405.03546 link
2024-05-06 LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model Haowen Sun et.al. 2405.03485 link
2024-05-06 Exploring the Frontiers of Softmax: Provable Optimization, Applications in Diffusion Model, and Beyond Jiuxiang Gu et.al. 2405.03251 null
2024-05-06 Hyperbolic Geometric Latent Diffusion Model for Graph Generation Xingcheng Fu et.al. 2405.03188 link
2024-05-06 DeepMpMRI: Tensor-decomposition Regularized Learning for Fast and High-Fidelity Multi-Parametric Microstructural MR Imaging Wenxin Fan et.al. 2405.03159 null
2024-05-06 Video Diffusion Models: A Survey Andrew Melnik et.al. 2405.03150 link
2024-05-06 AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding Tao Liu et.al. 2405.03121 link
2024-05-05 Matten: Video Generation with Mamba-Attention Yu Gao et.al. 2405.03025 null
2024-05-05 Exploring Text-based Realistic Building Facades Editing Applicaiton Jing Wang et.al. 2405.02967 null
2024-05-05 Efficient Text-driven Motion Generation via Latent Consistency Training Mengxian Hu et.al. 2405.02791 link
2024-05-04 DiffuseTrace: A Transparent and Flexible Watermarking Scheme for Latent Diffusion Model Liangqi Lei et.al. 2405.02696 null
2024-05-03 Functional Imaging Constrained Diffusion for Brain PET Synthesis from Structural MRI Minhui Yu et.al. 2405.02504 link
2024-05-03 Continuous Learned Primal Dual Christina Runkel et.al. 2405.02478 null
2024-05-03 CogDPM: Diffusion Probabilistic Models via Cognitive Predictive Coding Kaiyuan Chen et.al. 2405.02384 null
2024-05-03 DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos Wen-Hsuan Chu et.al. 2405.02280 link
2024-05-03 Multi-grid reaction-diffusion master equation: applications to morphogen gradient modelling Radek Erban et.al. 2405.02117 null
2024-05-03 DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model Peijin Jia et.al. 2405.02008 null
2024-05-03 Defect Image Sample Generation With Diffusion Prior for Steel Surface Defect Recognition Yichun Tai et.al. 2405.01872 null
2024-05-03 Creation of Novel Soft Robot Designs using Generative AI Wee Kiat Chan et.al. 2405.01824 null
2024-05-02 LocInv: Localization-aware Inversion for Text-Guided Image Editing Chuanming Tang et.al. 2405.01496 link
2024-05-02 Navigating Heterogeneity and Privacy in One-Shot Federated Learning with Diffusion Models Matias Mendieta et.al. 2405.01494 null
2024-05-02 Statistical algorithms for low-frequency diffusion data: A PDE approach Matteo Giordano et.al. 2405.01372 link
2024-05-02 DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines Ye Tian et.al. 2405.01248 null
2024-05-02 Automated Virtual Product Placement and Assessment in Images using Diffusion Models Mohammad Mahmudul Alam et.al. 2405.01130 null
2024-05-02 Part-aware Shape Generation with Latent 3D Diffusion of Neural Voxel Fields Yuhang Huang et.al. 2405.00998 null
2024-05-02 Generative manufacturing systems using diffusion models and ChatGPT Xingyu Li et.al. 2405.00958 null
2024-05-02 EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion Guangyao Zhai et.al. 2405.00915 null
2024-05-01 SonicDiffusion: Audio-Driven Image Generation and Editing with Pretrained Diffusion Models Burak Can Biner et.al. 2405.00878 null
2024-05-01 Guided Conditional Diffusion Classifier (ConDiff) for Enhanced Prediction of Infection in Diabetic Foot Ulcers Palawat Busaranuvong et.al. 2405.00858 null
2024-05-01 ADM: Accelerated Diffusion Model via Estimated Priors for Robust Motion Prediction under Uncertainties Jiahui Li et.al. 2405.00797 link
2024-05-01 Obtaining Favorable Layouts for Multiple Object Generation Barak Battash et.al. 2405.00791 null
2024-05-01 Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models Xiaoshi Wu et.al. 2405.00760 null
2024-05-01 TexSliders: Diffusion-Based Texture Editing in CLIP Space Julia Guerrero-Viu et.al. 2405.00672 null
2024-05-01 RGB $\leftrightarrow$ X: Image decomposition and synthesis using material- and lighting-aware diffusion models Zheng Zeng et.al. 2405.00666 null
2024-05-01 Deep Metric Learning-Based Out-of-Distribution Detection with Synthetic Outlier Exposure Assefa Seyoum Wahd et.al. 2405.00631 null
2024-05-01 Lane Segmentation Refinement with Diffusion Models Antonio Ruiz et.al. 2405.00620 null
2024-05-01 Pricing and delta computation in jump-diffusion models with stochastic intensity by Malliavin calculus Ayub Ahmadi et.al. 2405.00473 null
2024-05-01 Lazy Layers to Make Fine-Tuned Diffusion Models More Traceable Haozhe Liu et.al. 2405.00466 null
2024-05-01 Detail-Enhancing Framework for Reference-Based Image Super-Resolution Zihan Wang et.al. 2405.00431 null
2024-05-01 Streamlining Image Editing with Layered Diffusion Brushes Peyman Gholami et.al. 2405.00313 null
2024-05-02 An Unstructured Mesh Reaction-Drift-Diffusion Master Equation with Reversible Reactions Samuel A. Isaacson et.al. 2405.00283 null
2024-05-01 ASAM: Boosting Segment Anything Model with Adversarial Tuning Bo Li et.al. 2405.00256 link
2024-04-30 Semantically Consistent Video Inpainting with Conditional Diffusion Models Dylan Green et.al. 2405.00251 null
2024-04-30 IgCONDA-PET: Implicitly-Guided Counterfactual Diffusion for Detecting Anomalies in PET Images Shadab Ahamed et.al. 2405.00239 link
2024-04-30 SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound Haohe Liu et.al. 2405.00233 null
2024-04-30 Target-Specific De Novo Peptide Binder Design with DiffPepBuilder Fanhao Wang et.al. 2405.00128 null
2024-04-30 MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model Wenxun Dai et.al. 2404.19759 link
2024-04-30 Invisible Stitch: Generating Smooth 3D Scenes with Depth Inpainting Paul Engstler et.al. 2404.19758 null
2024-04-30 Mixed Continuous and Categorical Flow Matching for 3D De Novo Molecule Generation Ian Dunn et.al. 2404.19739 link
2024-04-30 X-Diffusion: Generating Detailed 3D MRI Volumes From a Single Image Using Cross-Sectional Diffusion Models Emmanuelle Bourigault et.al. 2404.19604 null
2024-04-30 MicroDreamer: Zero-shot 3D Generation in $\sim$ 20 Seconds by Score-based Iterative Reconstruction Luxi Chen et.al. 2404.19525 link
2024-04-30 TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion Models Teng Zhou et.al. 2404.19475 link
2024-04-29 Stylus: Automatic Adapter Selection for Diffusion Models Michael Luo et.al. 2404.18928 null
2024-04-29 TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation Junhao Cheng et.al. 2404.18919 link
2024-04-29 Learning general Gaussian mixtures with efficient score matching Sitan Chen et.al. 2404.18893 null
2024-04-29 A Survey on Diffusion Models for Time Series and Spatio-Temporal Data Yiyuan Yang et.al. 2404.18886 link
2024-04-29 Learning Mixtures of Gaussians Using Diffusion Models Khashayar Gatmiry et.al. 2404.18869 null
2024-04-29 Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior Zhiyuan Li et.al. 2404.18820 link
2024-04-29 Bootstrap 3D Reconstructed Scenes from 3D Gaussian Splatting Yifei Gao et.al. 2404.18669 null
2024-04-29 FlexiFilm: Long Video Generation with Flexible Conditions Yichen Ouyang et.al. 2404.18620 link
2024-04-29 Anywhere: A Multi-Agent Framework for Reliable and Diverse Foreground-Conditioned Image Inpainting Tianyidan Xie et.al. 2404.18598 null
2024-04-29 U-Nets as Belief Propagation: Efficient Classification, Denoising, and Diffusion in Generative Hierarchical Models Song Mei et.al. 2404.18444 null
2024-04-28 Fisher Information Improved Training-Free Conditional Diffusion Model Kaiyu Song et.al. 2404.18252 null
2024-04-28 Paint by Inpaint: Learning to Add Image Objects by Removing Them First Navve Wasserman et.al. 2404.18212 link
2024-04-28 Generative AI for Visualization: State of the Art and Future Directions Yilin Ye et.al. 2404.18144 null
2024-04-28 Generative AI for Low-Carbon Artificial Intelligence of Things Jinbo Wen et.al. 2404.18077 null
2024-04-28 Grounded Compositional and Diverse Text-to-3D with Pretrained Multi-View Diffusion Model Xiaolong Li et.al. 2404.18065 null
2024-04-28 Exposing Text-Image Inconsistency Using Diffusion Models Mingzhen Huang et.al. 2404.18033 link
2024-04-30 Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching Robert Denkert et.al. 2404.17939 null
2024-04-27 Unsupervised Anomaly Detection via Masked Diffusion Posterior Sampling Di Wu et.al. 2404.17900 null
2024-04-27 DPER: Diffusion Prior Driven Neural Representation for Limited Angle and Sparse View CT Reconstruction Chenhe Du et.al. 2404.17890 null
2024-04-27 Diffusion-Aided Joint Source Channel Coding For High Realism Wireless Image Transmission Mingyu Yang et.al. 2404.17736 link
2024-04-25 Inferring solid-state diffusivity in lithium-ion battery active materials: improving upon the classical GITT method A. Emir Gumrukcuoglu et.al. 2404.16658 null
2024-04-25 MuseumMaker: Continual Style Customization without Catastrophic Forgetting Chenxi Liu et.al. 2404.16612 null
2024-04-25 Conditional Distribution Modelling for Few-Shot Image Synthesis with Diffusion Models Parul Gupta et.al. 2404.16556 null
2024-04-25 DiffSeg: A Segmentation Model for Skin Lesions Based on Diffusion Difference Zhihao Shuai et.al. 2404.16474 null
2024-04-25 TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models Haomiao Ni et.al. 2404.16306 link
2024-04-25 CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions Haoyuan Li et.al. 2404.16302 link
2024-04-25 One Noise to Rule Them All: Learning a Unified Model of Spatially-Varying Noise Patterns Arman Maesumi et.al. 2404.16292 null
2024-04-24 Editable Image Elements for Controllable Synthesis Jiteng Mu et.al. 2404.16029 null
2024-04-24 RetinaRegNet: A Versatile Approach for Retinal Image Registration Vishal Balaji Sivaraman et.al. 2404.16017 link
2024-04-24 MYCloth: Towards Intelligent and Interactive Online T-Shirt Customization based on User’s Preference Yexin Liu et.al. 2404.15801 null
2024-04-24 MotionMaster: Training-free Camera Motion Transfer For Video Generation Teng Hu et.al. 2404.15789 null
2024-04-24 Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations Kaiwen Xue et.al. 2404.15766 link
2024-04-24 DeepFeatureX Net: Deep Features eXtractors based Network for discriminating synthetic from real images Orazio Pontorno et.al. 2404.15697 link
2024-04-24 Generative Diffusion Model (GDM) for Optimization of Wi-Fi Networks Tie Liu et.al. 2404.15684 null
2024-04-24 AnoFPDM: Anomaly Segmentation with Forward Process of Diffusion Models for Brain MRI Yiming Che et.al. 2404.15683 link
2024-04-24 CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models Qinghe Wang et.al. 2404.15677 link
2024-04-24 Optimizing OOD Detection in Molecular Graphs: A Novel Approach with Diffusion Models Xu Shen et.al. 2404.15625 null
2024-04-26 A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-Resolution Zhixiong Yang et.al. 2404.15620 link
2024-04-23 ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning Weifeng Chen et.al. 2404.15449 null
2024-04-23 GLoD: Composing Global Contexts and Local Details in Image Generation Moyuru Yamada et.al. 2404.15447 null
2024-04-23 ControlTraj: Controllable Trajectory Generation with Topology-Constrained Diffusion Model Yuanshao Zhu et.al. 2404.15380 null
2024-04-23 Heat flow, log-concavity, and Lipschitz transport maps Giovanni Brigati et.al. 2404.15205 null
2024-04-23 CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method Mingbao Lin et.al. 2404.15141 link
2024-04-23 Taming Diffusion Probabilistic Models for Character Control Rui Chen et.al. 2404.15121 null
2024-04-23 Perturbing Attention Gives You More Bang for the Buck: Subtle Imaging Perturbations That Efficiently Fool Customized Diffusion Models Jingyao Xu et.al. 2404.15081 link
2024-04-23 Music Style Transfer With Diffusion Model Hong Huang et.al. 2404.14771 null
2024-04-23 Gradient Guidance for Diffusion Models: An Optimization Perspective Yingqing Guo et.al. 2404.14743 link
2024-04-25 FlashSpeech: Efficient Zero-Shot Speech Synthesis Zhen Ye et.al. 2404.14700 null
2024-04-23 DreamPBR: Text-driven Generation of High-resolution SVBRDF with Multi-modal Guidance Linxuan Xin et.al. 2404.14676 null
2024-04-22 UVMap-ID: A Controllable and Personalized UV Map Generative Model Weijie Wang et.al. 2404.14568 link
2024-04-22 Align Your Steps: Optimizing Sampling Schedules in Diffusion Models Amirmojtaba Sabour et.al. 2404.14507 null
2024-04-22 Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses Inhee Lee et.al. 2404.14410 null
2024-04-22 GeoDiffuser: Geometry-Based Image Editing with Diffusion Models Rahul Sajnani et.al. 2404.14403 null
2024-04-22 TAVGBench: Benchmarking Text to Audible-Video Generation Yuxin Mao et.al. 2404.14381 link
2024-04-22 Full Event Particle-Level Unfolding with Variable-Length Latent Variational Diffusion Alexander Shmakov et.al. 2404.14332 null
2024-04-22 X-Ray: A Sequential 3D Representation for Generation Tao Hu et.al. 2404.14329 link
2024-04-22 Collaborative Filtering Based on Diffusion Models: Unveiling the Potential of High-Order Connectivity Yu Hou et.al. 2404.14240 link
2024-04-22 MultiBooth: Towards Generating All Your Concepts in an Image from Text Chenyang Zhu et.al. 2404.14239 link
2024-04-22 Face2Face: Label-driven Facial Retouching Restoration Guanhua Zhao et.al. 2404.14177 null
2024-04-22 FLDM-VTON: Faithful Latent Diffusion Model for Virtual Try-on Chenhui Wang et.al. 2404.14162 null
2024-04-22 Generative Artificial Intelligence Assisted Wireless Sensing: Human Flow Detection in Practical Communication Environments Jiacheng Wang et.al. 2404.14140 null
2024-04-23 RingID: Rethinking Tree-Ring Watermarking for Enhanced Multi-Key Identification Hai Ci et.al. 2404.14055 link
2024-04-22 RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance Chengrui Wang et.al. 2404.13984 null
2024-04-22 MaterialSeg3D: Segmenting Dense Materials from 2D Priors for 3D Assets Zeyu Li et.al. 2404.13923 null
2024-04-23 Accelerating Image Generation with Sub-path Linear Approximation Model Chen Xu et.al. 2404.13903 null
2024-04-22 Towards Better Text-to-Image Generation Alignment via Attention Modulation Yihang Wu et.al. 2404.13899 null
2024-04-23 Decoherence of a charged Brownian particle in a magnetic field : an analysis of the roles of coupling via position and momentum variables Suraka Bhattacharjee et.al. 2404.13883 null
2024-04-21 Universal Fingerprint Generation: Controllable Diffusion Model with Multimodal Conditions Steven A. Grosz et.al. 2404.13791 null
2024-04-21 Object-Attribute Binding in Text-to-Image Generation: Evaluation and Control Maria Mihaela Trusca et.al. 2404.13766 null
2024-04-21 A Splice Method for Local-to-Nonlocal Coupling of Weak Forms Shuai Jiang et.al. 2404.13744 null
2024-04-21 Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models Vitali Petsiuk et.al. 2404.13706 null
2024-04-18 G-HOP: Generative Hand-Object Prior for Interaction Reconstruction and Grasp Synthesis Yufei Ye et.al. 2404.12383 null
2024-04-18 Learning the Domain Specific Inverse NUFFT for Accelerated Spiral MRI using Diffusion Models Trevor J. Chan et.al. 2404.12361 null
2024-04-18 AniClipart: Clipart Animation with Text-to-Video Priors Ronghuan Wu et.al. 2404.12347 null
2024-04-18 Guided Discrete Diffusion for Electronic Health Record Generation Zixiang Chen et.al. 2404.12314 null
2024-04-18 StyleBooth: Image Style Editing with Multimodal Instruction Zhen Han et.al. 2404.12154 link
2024-04-18 LD-Pruner: Efficient Pruning of Latent Diffusion Models using Task-Agnostic Insights Thibault Castells et.al. 2404.11936 null
2024-04-18 FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models Wei Wu et.al. 2404.11895 link
2024-04-17 Prompt-Driven Feature Diffusion for Open-World Semi-Supervised Learning Marzi Heidari et.al. 2404.11795 null
2024-04-17 Diffusion Schrödinger Bridge Models for High-Quality MR-to-CT Synthesis for Head and Neck Proton Treatment Planning Muheng Li et.al. 2404.11741 null
2024-04-17 Factorized Diffusion: Perceptual Illusions by Noise Decomposition Daniel Geng et.al. 2404.11615 null
2024-04-17 IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination Xi Chen et.al. 2404.11593 null
2024-04-17 Prompt Optimizer of Text-to-Image Diffusion Models for Abstract Concept Understanding Zezhong Fan et.al. 2404.11589 null
2024-04-17 MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation Kuan-Chieh et.al. 2404.11565 null
2024-04-17 Predicting Long-horizon Futures by Conditioning on Geometry and Time Tarasha Khurana et.al. 2404.11554 null
2024-04-17 SSDiff: Spatial-spectral Integrated Diffusion Model for Remote Sensing Pansharpening Yu Zhong et.al. 2404.11537 null
2024-04-17 Towards Highly Realistic Artistic Style Transfer via Stable Diffusion with Step-aware and Layer-aware Prompt Zhanjie Zhang et.al. 2404.11474 link
2024-04-17 Closely Interactive Human Reconstruction with Proxemics and Physics-Guided Adaption Buzhen Huang et.al. 2404.11291 link
2024-04-17 Optical Image-to-Image Translation Using Denoising Diffusion Models: Heterogeneous Change Detection as a Use Case João Gabriel Vinholi et.al. 2404.11243 null
2024-04-17 RiboDiffusion: Tertiary Structure-based RNA Inverse Folding with Generative Diffusion Models Han Huang et.al. 2404.11199 link
2024-04-19 LAPTOP-Diff: Layer Pruning and Normalized Distillation for Compressing Diffusion Models Dingkun Zhang et.al. 2404.11098 null
2024-04-16 Molecular relaxation by reverse diffusion with time step prediction Khaled Kahouli et.al. 2404.10935 link
2024-04-16 RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting Ashkan Mirzaei et.al. 2404.10765 null
2024-04-16 LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation? Yuchi Wang et.al. 2404.10763 link
2024-04-16 GazeHTA: End-to-end Gaze Target Detection with Head-Target Association Zhi-Yi Lin et.al. 2404.10718 null
2024-04-16 Efficient Conditional Diffusion Model with Probability Flow Sampling for Image Super-resolution Yutao Yuan et.al. 2404.10688 link
2024-04-16 Generating Human Interaction Motions in Scenes with Text Control Hongwei Yi et.al. 2404.10685 null
2024-04-16 StyleCity: Large-Scale 3D Urban Scenes Stylization with Vision-and-Text Reference via Progressive Optimization Yingshu Chen et.al. 2404.10681 null
2024-04-18 Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay Jinmei Liu et.al. 2404.10662 link
2024-04-16 Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences Seungwook Kim et.al. 2404.10603 null
2024-04-15 Equipping Diffusion Models with Differentiable Spatial Entropy for Low-Light Image Enhancement Wenyi Lian et.al. 2404.09735 link
2024-04-15 Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models Ziwei Luo et.al. 2404.09732 link
2024-04-15 All-in-one simulation-based inference Manuel Gloeckler et.al. 2404.09636 link
2024-04-15 TMPQ-DM: Joint Timestep Reduction and Quantization Precision Selection for Efficient Diffusion Models Haojun Sun et.al. 2404.09532 null
2024-04-15 Magic Clothing: Controllable Garment-Driven Image Synthesis Weifeng Chen et.al. 2404.09512 link
2024-04-15 PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI Yandan Yang et.al. 2404.09465 null
2024-04-15 Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models Peifei Zhu et.al. 2404.09401 null
2024-04-14 Fault Detection in Mobile Networks Using Diffusion Models Mohamad Nabeel et.al. 2404.09240 null
2024-04-14 DreamScape: 3D Scene Creation via Gaussian Splatting joint Correlation Modeling Xuening Yuan et.al. 2404.09227 null
2024-04-14 LoopAnimate: Loopable Salient Object Animation Fanyi Wang et.al. 2404.09172 null
2024-04-14 RF-Diffusion: Radio Signal Generation via Time-Frequency Diffusion Guoxuan Chi et.al. 2404.09140 link
2024-04-13 Rethinking Iterative Stereo Matching from Diffusion Bridge Model Perspective Yuguang Shi et.al. 2404.09051 null
2024-04-13 Theoretical research on generative diffusion models: an overview Melike Nur Yeğin et.al. 2404.09016 null
2024-04-13 Multimodal Cross-Document Event Coreference Resolution Using Linear Semantic Transfer and Mixed-Modality Ensembles Abhijnan Nath et.al. 2404.08949 link
2024-04-13 Enforcing Paraphrase Generation via Controllable Latent Diffusion Wei Zou et.al. 2404.08938 link
2024-04-13 Diffusion Models Meet Remote Sensing: Principles, Methods, and Perspectives Yidan Liu et.al. 2404.08926 null
2024-04-13 ChangeAnywhere: Sample Generation for Remote Sensing Change Detection via Semantic Latent Diffusion Model Kai Tang et.al. 2404.08892 link
2024-04-12 Semantic Approach to Quantifying the Consistency of Diffusion Model Image Generation Brinnae Bent et.al. 2404.08799 link
2024-04-12 Diffusion-Based Joint Temperature and Precipitation Emulation of Earth System Models Katie Christensen et.al. 2404.08797 null
2024-04-12 Lossy Image Compression with Foundation Diffusion Models Lucas Relic et.al. 2404.08580 null
2024-04-12 PiRD: Physics-informed Residual Diffusion for Flow Field Reconstruction Siming Shan et.al. 2404.08412 null
2024-04-12 Struggle with Adversarial Defense? Try Diffusion Yujie Li et.al. 2404.08273 link
2024-04-12 Balanced Mixed-Type Tabular Data Synthesis with Diffusion Models Zeyu Yang et.al. 2404.08254 link
2024-04-12 Interest Maximization in Social Networks Rahul Kumar Gautam et.al. 2404.08236 null
2024-04-11 ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback Ming Li et.al. 2404.07987 link
2024-04-11 Taming Stable Diffusion for Text to 360° Panorama Image Generation Cheng Zhang et.al. 2404.07949 link
2024-04-11 Adaptive Hyperbolic-cross-space Mapped Jacobi Method on Unbounded Domains with Applications to Solving Multidimensional Spatiotemporal Integrodifferential Equations Yunhong Deng et.al. 2404.07844 null
2024-04-11 ConsistencyDet: Robust Object Detector with Denoising Paradigm of Consistency Model Lifan Jiang et.al. 2404.07773 link
2024-04-11 An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization Minshuo Chen et.al. 2404.07771 null
2024-04-11 Joint Conditional Diffusion Model for Image Restoration with Mixed Degradations Yufeng Yue et.al. 2404.07770 null
2024-04-11 Diffusing in Someone Else’s Shoes: Robotic Perspective Taking with Diffusion Josua Spisak et.al. 2404.07735 null
2024-04-11 Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models Tuomas Kynkäänniemi et.al. 2404.07724 link
2024-04-11 Implicit and Explicit Language Guidance for Diffusion-based Visual Perception Hefeng Wang et.al. 2404.07600 null
2024-04-11 ObjBlur: A Curriculum Learning Approach With Progressive Object-Level Blurring for Improved Layout-to-Image Generation Stanislav Frolov et.al. 2404.07564 null
2024-04-11 Effects of phase separation on extinction times in population models Janik Schüttler et.al. 2404.07563 null
2024-04-11 CAT: Contrastive Adapter Training for Personalized Image Generation Jae Wan Park et.al. 2404.07554 link
2024-04-10 Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models Yasi Zhang et.al. 2404.07389 null
2024-04-10 GoodDrag: Towards Good Practices for Drag Editing with Diffusion Models Zewei Zhang et.al. 2404.07206 null
2024-04-10 RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion Jaidev Shriram et.al. 2404.07199 null
2024-04-10 InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models Jiale Xu et.al. 2404.07191 link
2024-04-10 Move Anything with Layered Scene Diffusion Jiawei Ren et.al. 2404.07178 null
2024-04-10 Diffusion-based inpainting of incomplete Euclidean distance matrices of trajectories generated by a fractional Brownian motion Alexander Lobashev et.al. 2404.07029 link
2024-04-10 DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting Shijie Zhou et.al. 2404.06903 null
2024-04-10 Fine color guidance in diffusion models and its application to image compression at extremely low bitrates Tom Bordin et.al. 2404.06865 null
2024-04-10 UDiFF: Generating Conditional Unsigned Distance Fields with Optimal Wavelet Diffusion Junsheng Zhou et.al. 2404.06851 null
2024-04-10 Tuning-Free Adaptive Style Incorporation for Structure-Consistent Text-Driven Style Transfer Yanqi Ge et.al. 2404.06835 null
2024-04-10 Zero-shot Point Cloud Completion Via 2D Priors Tianxin Huang et.al. 2404.06814 link
2024-04-10 Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior Fan Lu et.al. 2404.06780 null
2024-04-10 DiffusionDialog: A Diffusion Model for Diverse Dialog Generation with Latent Space Jianxiang Xiang et.al. 2404.06760 null
2024-04-10 Disguised Copyright Infringement of Latent Diffusion Model Yiwei Lu et.al. 2404.06737 link
2024-04-10 Efficient Denoising using Score Embedding in Score-based Diffusion Models Andrew S. Na et.al. 2404.06661 null
2024-04-09 Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation Luca Barsellotti et.al. 2404.06542 null
2024-04-09 GeoDirDock: Guiding Docking Along Geodesic Paths Raúl Miñán et.al. 2404.06481 null
2024-04-09 Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion Fan Yang et.al. 2404.06429 link
2024-04-09 ZeST: Zero-Shot Material Transfer from a Single Image Ta-Ying Cheng et.al. 2404.06425 null
2024-04-09 Policy-Guided Diffusion Matthew Thomas Jackson et.al. 2404.06356 link
2024-04-09 Quantum State Generation with Structure-Preserving Diffusion Model Yuchen Zhu et.al. 2404.06336 null
2024-04-08 MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation Kunpeng Song et.al. 2404.05674 link
2024-04-08 YaART: Yet Another ART Rendering Technology Sergey Kastryulin et.al. 2404.05666 null
2024-04-08 BinaryDM: Towards Accurate Binarization of Diffusion Model Xingyu Zheng et.al. 2404.05662 link
2024-04-08 Resistive Memory-based Neural Differential Equation Solver for Score-based Diffusion Model Jichang Yang et.al. 2404.05648 link
2024-04-08 Learning a Category-level Object Pose Estimator without Pose Annotations Fengrui Tian et.al. 2404.05626 null
2024-04-08 UniFL: Improve Stable Diffusion via Unified Feedback Learning Jiacheng Zhang et.al. 2404.05595 null
2024-04-08 Investigating the Effectiveness of Cross-Attention to Unlock Zero-Shot Editing of Text-to-Video Diffusion Models Saman Motamed et.al. 2404.05519 null
2024-04-08 Taming Transformers for Realistic Lidar Point Cloud Generation Hamed Haghighi et.al. 2404.05505 link
2024-04-08 Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance Dazhong Shen et.al. 2404.05384 link
2024-04-08 Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt Zhiqi Huang et.al. 2404.05331 null
2024-04-08 Text-to-Image Synthesis for Any Artistic Styles: Advancements in Personalized Artistic Image Generation via Subdivision and Dual Binding Junseo Park et.al. 2404.05256 null
2024-04-08 DiffCJK: Conditional Diffusion Model for High-Quality and Wide-coverage CJK Character Generation Yingtao Tian et.al. 2404.05212 null
2024-04-07 Context-dependent Causality (the Non-Nonotonic Case) Nir Billfeld et.al. 2404.05021 null
2024-04-07 Generative downscaling of PDE solvers with physics-guided diffusion models Yulong Lu et.al. 2404.05009 link
2024-04-07 Gaussian Shading: Provable Performance-Lossless Image Watermarking for Diffusion Models Zijin Yang et.al. 2404.04956 link
2024-04-07 Regularized Conditional Diffusion Model for Multi-Task Preference Alignment Xudong Yu et.al. 2404.04920 null
2024-04-07 Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder Yiyang Ma et.al. 2404.04916 null
2024-04-07 ShoeModel: Learning to Wear on the User-specified Shoes via Diffusion Model Binghui Chen et.al. 2404.04833 null
2024-04-07 Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving Jinlong Li et.al. 2404.04804 null
2024-04-07 Rethinking Diffusion Model for Multi-Contrast MRI Super-Resolution Guangyuan Li et.al. 2404.04785 link
2024-04-04 MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation Hanzhe Hu et.al. 2404.03656 null
2024-04-04 CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching Dongzhi Jiang et.al. 2404.03653 link
2024-04-04 The More You See in 2D, the More You Perceive in 3D Xinyang Han et.al. 2404.03652 null
2024-04-04 DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior Yiming Zhang et.al. 2404.03642 null
2024-04-04 LCM-Lookahead for Encoder-based Text-to-Image Personalization Rinon Gal et.al. 2404.03620 null
2024-04-04 DiffDet4SAR: Diffusion-based Aircraft Target Detection Network for SAR Images Zhou Jie et.al. 2404.03595 link
2024-04-04 PointInfinity: Resolution-Invariant Point Diffusion Models Zixuan Huang et.al. 2404.03566 null
2024-04-04 Segmentation-Guided Knee Radiograph Generation using Conditional Diffusion Models Siyuan Mei et.al. 2404.03541 null
2024-04-04 A Directional Diffusion Graph Transformer for Recommendation Zixuan Yi et.al. 2404.03326 null
2024-04-04 SiloFuse: Cross-silo Synthetic Data Generation with Latent Tabular Diffusion Models Aditya Shankar et.al. 2404.03299 null
2024-04-04 Future-Proofing Class Incremental Learning Quentin Jodelet et.al. 2404.03200 null
2024-04-04 HandDiff: 3D Hand Pose Estimation with Diffusion on Image-Point Cloud Wencan Cheng et.al. 2404.03159 link
2024-04-04 DreamWalk: Style Space Exploration using Diffusion Guidance Michelle Shu et.al. 2404.03145 null
2024-04-04 Diverse and Tailored Image Generation for Zero-shot Multi-label Classification Kaixin Zhang et.al. 2404.03144 null
2024-04-04 The Diffusive Ultrasound Modulated Bioluminescence Tomography with Partial Data and Uncertain Optical Parameters Tianyu Yang et.al. 2404.03124 null
2024-04-03 Many-to-many Image Generation with Auto-regressive Diffusion Models Ying Shen et.al. 2404.03109 null
2024-04-03 Computing macroscopic reaction rates in reaction-diffusion systems using Monte Carlo simulations Mohamed Swailem et.al. 2404.03089 null
2024-04-03 ASAP: Interpretable Analysis and Summarization of AI-generated Image Patterns at Scale Jinbin Huang et.al. 2404.02990 null
2024-04-03 Deep Generative Models through the Lens of the Manifold Hypothesis: A Survey and New Connections Gabriel Loaiza-Ganem et.al. 2404.02954 link
2024-04-03 LidarDM: Generative LiDAR Simulation in a Generated World Vlas Zyrianov et.al. 2404.02903 link
2024-04-03 Fast Diffusion Model For Seismic Data Noise Attenuation Junheng Peng et.al. 2404.02767 null
2024-04-03 Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models Wentian Zhang et.al. 2404.02747 link
2024-04-03 Deep Privacy Funnel Model: From a Discriminative to a Generative Approach with an Application to Face Recognition Behrooz Razeghi et.al. 2404.02696 null
2024-04-03 Diffexplainer: Towards Cross-modal Global Explanations with Diffusion Models Matteo Pennisi et.al. 2404.02618 null
2024-04-03 A Unified Editing Method for Co-Speech Gesture Generation via Diffusion Inversion Zeyu Zhao et.al. 2404.02411 null
2024-04-03 Enhancing Diffusion-based Point Cloud Generation with Smoothness Constraint Yukun Li et.al. 2404.02396 null
2024-04-02 Semantic Augmentation in Images using Language Sahiti Yerramilli et.al. 2404.02353 null
2024-04-02 Heat Death of Generative Models in Closed-Loop Learning Matteo Marchi et.al. 2404.02325 null
2024-04-02 APEX: Ambidextrous Dual-Arm Robotic Manipulation Using Collision-Free Generative Diffusion Models Apan Dastider et.al. 2404.02284 null
2024-04-02 Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better Enshu Liu et.al. 2404.02241 link
2024-04-02 Diffusion $^2$ : Dynamic 3D Content Generation via Score Composition of Orthogonal Diffusion Models Zeyu Yang et.al. 2404.02148 link
2024-04-02 WcDT: World-centric Diffusion Transformer for Traffic Scene Generation Chen Yang et.al. 2404.02082 link
2024-04-03 AUTODIFF: Autoregressive Diffusion Modeling for Structure-based Drug Design Xinze Li et.al. 2404.02003 null
2024-04-02 Bi-LORA: A Vision-Language Approach for Synthetic Image Detection Mamadou Keita et.al. 2404.01959 link
2024-04-02 Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model Xu He et.al. 2404.01862 link
2024-04-02 Upsample Guidance: Scale Up Diffusion Models without Training Juno Hwang et.al. 2404.01709 null
2024-04-02 FashionEngine: Interactive Generation and Editing of 3D Clothed Humans Tao Hu et.al. 2404.01655 null
2024-04-02 Diffusion Deepfake Chaitali Bhattacharyya et.al. 2404.01579 link
2024-04-01 Prior Frequency Guided Diffusion Model for Limited Angle (LA)-CBCT Reconstruction Jiacheng Xie et.al. 2404.01448 null
2024-03-29 Relation Rectification in Diffusion Model Yinwei Wu et.al. 2403.20249 null
2024-03-29 Motion Inversion for Video Customization Luozhou Wang et.al. 2403.20193 null
2024-03-29 FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models Barbara Toniella Corradini et.al. 2403.20105 null
2024-03-29 SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior Zhongrui Yu et.al. 2403.20079 null
2024-03-29 Probing solar modulation analytic models with cosmic ray periodic spectra Wei-Cheng Long et.al. 2403.20038 null
2024-04-01 Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting Haipeng Liu et.al. 2403.19898 link
2024-03-28 Vision-Language Synthetic Data Enhances Echocardiography Downstream Tasks Pooria Ashrafian et.al. 2403.19880 link
2024-03-28 ShapeFusion: A 3D diffusion model for localized shape editing Rolandos Alexandros Potamias et.al. 2403.19773 null
2024-03-28 MIST: Mitigating Intersectional Bias with Disentangled Cross-Attention Editing in Text-to-Image Diffusion Models Hidir Yesiltepe et.al. 2403.19738 null
2024-03-28 Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond Katherine Xu et.al. 2403.19653 link
2024-03-28 InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction Sirui Xu et.al. 2403.19652 null
2024-03-28 GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models Yusuf Dalva et.al. 2403.19645 null
2024-03-28 In the driver’s mind: modeling the dynamics of human overtaking decisions in interactions with oncoming automated vehicles Samir H. A. Mohammad et.al. 2403.19637 null
2024-03-28 Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model Zhicai Wang et.al. 2403.19600 link
2024-03-28 Frame by Familiar Frame: Understanding Replication in Video Diffusion Models Aimon Rahman et.al. 2403.19593 null
2024-03-28 Impact of Resin Molecular Weight on Drying Kinetics and Sag of Coatings Marola W. Issa et.al. 2403.19544 null
2024-03-28 Debiasing Cardiac Imaging with Controlled Latent Diffusion Models Grzegorz Skorupko et.al. 2403.19508 link
2024-03-28 Burst Super-Resolution with Diffusion Models for Improving Perceptual Quality Kyotaro Tokoro et.al. 2403.19428 link
2024-03-28 Imperceptible Protection against Style Imitation from Diffusion Models Namhyuk Ahn et.al. 2403.19254 null
2024-03-28 RecDiffusion: Rectangling for Image Stitching with Diffusion Models Tianhao Zhou et.al. 2403.19164 link
2024-03-28 MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation Seyeon Kim et.al. 2403.19144 link
2024-03-28 QNCD: Quantization Noise Correction for Diffusion Models Huanpeng Chu et.al. 2403.19140 link
2024-03-27 Egocentric Scene-aware Human Trajectory Prediction Weizhuo Wang et.al. 2403.19026 null
2024-03-27 TextCraftor: Your Text Encoder Can be Image Quality Controller Yanyu Li et.al. 2403.18978 null
2024-03-27 CPR: Retrieval Augmented Generation for Copyright Protection Aditya Golatkar et.al. 2403.18920 null
2024-03-27 A Geometric Explanation of the Likelihood OOD Detection Paradox Hamidreza Kamkari et.al. 2403.18910 link
2024-03-27 ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion Daniel Winter et.al. 2403.18818 null
2024-03-28 ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation Suraj Patni et.al. 2403.18807 link
2024-03-27 Object Pose Estimation via the Aggregation of Diffusion Features Tianfu Wang et.al. 2403.18791 link
2024-03-27 ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object Chenshuang Zhang et.al. 2403.18775 link
2024-03-27 A Diffusion-Based Generative Equalizer for Music Restoration Eloi Moliner et.al. 2403.18636 link
2024-03-27 HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions Hao Xu et.al. 2403.18575 link
2024-03-27 Artifact Reduction in 3D and 4D Cone-beam Computed Tomography Images with Deep Learning – A Review Mohammadreza Amirian et.al. 2403.18565 null
2024-03-27 CosalPure: Learning Concept from Group Images for Robust Co-Saliency Detection Jiayi Zhu et.al. 2403.18554 null
2024-03-27 CT-3DFlow : Leveraging 3D Normalizing Flows for Unsupervised Detection of Pathological Pulmonary CT scans Aissam Djahnine et.al. 2403.18514 null
2024-03-27 Synthesizing EEG Signals from Event-Related Potential Paradigms with Conditional Diffusion Models Guido Klein et.al. 2403.18486 link
2024-03-27 DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis Zhongxi Chen et.al. 2403.18471 link
2024-03-27 DiffStyler: Diffusion-based Localized Image Style Transfer Shaoxu Li et.al. 2403.18461 link
2024-03-27 SingularTrajectory: Universal Trajectory Predictor Using Diffusion Model Inhwan Bae et.al. 2403.18452 link
2024-03-27 U-Sketch: An Efficient Approach for Sketch to Image Diffusion Models Ilias Mitsouras et.al. 2403.18425 null
2024-03-27 ECNet: Effective Controllable Text-to-Image Diffusion Models Sicheng Li et.al. 2403.18417 null
2024-03-27 Ship in Sight: Diffusion Models for Ship-Image Super Resolution Luigi Sigillo et.al. 2403.18370 link
2024-03-27 DODA: Diffusion for Object-detection Domain Adaptation in Agriculture Shuai Xiang et.al. 2403.18334 link
2024-03-27 RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation Yang Tian et.al. 2403.18259 null
2024-03-27 NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual Pretraining and Multi-level Modulation Jingyang Huo et.al. 2403.18211 null
2024-03-28 Oh! We Freeze: Improving Quantized Knowledge Distillation via Signal Propagation Analysis for Large Language Models Kartikeya Bhardwaj et.al. 2403.18159 null
2024-03-25 Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning Sicong Pan et.al. 2403.16803 link
2024-03-25 Diff-Def: Diffusion-Generated Deformation Fields for Conditional Atlases Sophie Starck et.al. 2403.16776 null
2024-03-25 Improving Diffusion Models’s Data-Corruption Resistance using Scheduled Pseudo-Huber Loss Artem Khrapov et.al. 2403.16728 link
2024-03-25 SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions Yuda Song et.al. 2403.16627 link
2024-03-25 SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation Aysim Toker et.al. 2403.16605 null
2024-03-25 Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization Xiangxin Zhou et.al. 2403.16576 null
2024-03-25 An Intermediate Fusion ViT Enables Efficient Text-Image Alignment in Diffusion Models Zizhao Hu et.al. 2403.16530 null
2024-03-25 Let Real Images be as a Judger, Spotting Fake Images Synthesized with Generative Models Ziyou Liang et.al. 2403.16513 null
2024-03-25 Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework Ziyao Huang et.al. 2403.16510 link
2024-03-25 Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation Sanyam Lakhanpal et.al. 2403.16422 null
2024-03-25 FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models Lin Zhao et.al. 2403.16379 null
2024-03-24 Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis Atefeh Khoshkhahtinat et.al. 2403.16258 null
2024-03-24 Skull-to-Face: Anatomy-Guided 3D Facial Reconstruction and Editing Yongqing Liang et.al. 2403.16207 null
2024-03-24 Diffusion Model is a Good Pose Estimator from 3D RF-Vision Junqiao Fan et.al. 2403.16198 null
2024-03-24 Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised Landmark Discovery Siddharth Tourani et.al. 2403.16194 link
2024-03-26 Gaze-guided Hand-Object Interaction Synthesis: Benchmark and Method Jie Tian et.al. 2403.16169 null
2024-03-24 Robust Diffusion Models for Adversarial Purification Guang Lin et.al. 2403.16067 null
2024-03-24 A Unified Module for Accelerating STABLE-DIFFUSION: LCM-LORA Ayush Thakur et.al. 2403.16024 null
2024-03-23 Feature Manipulation for DDPM based Change Detection Zhenglin Li et.al. 2403.15943 null
2024-03-26 X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention You Xie et.al. 2403.15931 null
2024-03-21 GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation Yinghao Xu et.al. 2403.14621 link
2024-03-21 DreamReward: Text-to-3D Generation with Human Preference Junliang Ye et.al. 2403.14613 null
2024-03-21 ReNoise: Real Image Inversion Through Iterative Noising Daniel Garibi et.al. 2403.14602 null
2024-03-21 Denoising Diffusion Models for 3D Healthy Brain Tissue Inpainting Alicia Durrer et.al. 2403.14499 link
2024-03-21 Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation Mathias Öttl et.al. 2403.14429 null
2024-03-21 DP-RDM: Adapting Diffusion Models to Private Domains Without Fine-Tuning Jonathan Lebensold et.al. 2403.14421 link
2024-03-21 Physics-Informed Diffusion Models Jan-Hendrik Bastek et.al. 2403.14404 link
2024-03-21 Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models Pablo Marcos-Manchón et.al. 2403.14291 link
2024-03-21 Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation Francesco Di Felice et.al. 2403.14279 null
2024-03-21 Diffusion Models with Ensembled Structure-Based Anomaly Scoring for Unsupervised Anomaly Detection Finn Behrendt et.al. 2403.14262 link
2024-03-21 Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition Sihyun Yu et.al. 2403.14148 null
2024-03-21 Protein Conformation Generation via Force-Guided SE(3) Diffusion Models Yan Wang et.al. 2403.14088 link
2024-03-21 QSMDiff: Unsupervised 3D Diffusion Models for Quantitative Susceptibility Mapping Zhuang Xiong et.al. 2403.14070 null
2024-03-21 LeFusion: Synthesizing Myocardial Pathology on Cardiac MRI via Lesion-Focus Diffusion Models Hantao Zhang et.al. 2403.14066 link
2024-03-21 DiffSTOCK: Probabilistic relational Stock Market Predictions using Diffusion Models Divyanshu Daiya et.al. 2403.14063 null
2024-03-20 Enhancing Fingerprint Image Synthesis with GANs, Diffusion Models, and Style Transfer Techniques W. Tang et.al. 2403.13916 null
2024-03-20 Towards Learning Contrast Kinetics with Multi-Condition Latent Diffusion Models Richard Osuala et.al. 2403.13890 link
2024-03-20 Editing Massive Concepts in Text-to-Image Diffusion Models Tianwei Xiong et.al. 2403.13807 link
2024-03-20 ZigMa: Zigzag Mamba Diffusion Model Vincent Tao Hu et.al. 2403.13802 link
2024-03-20 TimeRewind: Rewinding Time with Image-and-Events Video Diffusion Jingxi Chen et.al. 2403.13800 null
2024-03-20 DepthFM: Fast Monocular Depth Estimation with Flow Matching Ming Gui et.al. 2403.13788 link
2024-03-20 Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation Fu-Yun Wang et.al. 2403.13745 link
2024-03-20 DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance Zixuan Wang et.al. 2403.13667 link
2024-03-20 ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer Hiroki Azuma et.al. 2403.13652 link
2024-03-20 ReGround: Improving Textual and Spatial Grounding at No Cost Yuseung Lee et.al. 2403.13589 null
2024-03-20 Ground-A-Score: Scaling Up the Score Distillation for Multi-Attribute Editing Hangeol Chang et.al. 2403.13551 link
2024-03-20 Compress3D: a Compressed Latent Space for 3D Generation from a Single Image Bowen Zhang et.al. 2403.13524 null
2024-03-20 VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis Yumeng Li et.al. 2403.13501 link
2024-03-20 Scaling Diffusion Models to Real-World 3D LiDAR Scene Completion Lucas Nunes et.al. 2403.13470 link
2024-03-20 S2DM: Sector-Shaped Diffusion Models for Video Generation Haoran Lang et.al. 2403.13408 null
2024-03-20 IIDM: Image-to-Image Diffusion Model for Semantic Image Synthesis Feng Liu et.al. 2403.13378 link
2024-03-20 AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation Jingkun An et.al. 2403.13352 null
2024-03-20 LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment Peishan Cong et.al. 2403.13307 link
2024-03-20 DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception Yibo Wang et.al. 2403.13304 null
2024-03-20 Building Optimal Neural Architectures using Interpretable Knowledge Keith G. Mills et.al. 2403.13293 link
2024-03-20 Beyond Skeletons: Integrative Latent Mapping for Coherent 4D Sequence Generation Qitong Yang et.al. 2403.13238 null
2024-03-20 A Contact Model based on Denoising Diffusion to Learn Variable Impedance Control for Contact-rich Manipulation Masashi Okada et.al. 2403.13221 null
2024-03-18 Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models Emilian Postolache et.al. 2403.11706 link
2024-03-19 Urban Scene Diffusion through Semantic Occupancy Map Junge Zhang et.al. 2403.11697 null
2024-03-18 Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection Julia Wolleb et.al. 2403.11667 link
2024-03-18 Arc2Face: A Foundation Model of Human Faces Foivos Paraperas Papantoniou et.al. 2403.11641 link
2024-03-18 LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models Yang Yang et.al. 2403.11627 link
2024-03-18 CRS-Diff: Controllable Generative Remote Sensing Foundation Model Datao Tang et.al. 2403.11614 link
2024-03-18 EffiVED:Efficient Video Editing via Text-instruction Diffusion Models Zhenghao Zhang et.al. 2403.11568 link
2024-03-18 EchoReel: Enhancing Action Generation of Existing Video Diffusion Models Jianzhi liu et.al. 2403.11535 link
2024-03-18 Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors Ruicheng Wang et.al. 2403.11503 null
2024-03-18 SeisFusion: Constrained Diffusion Model with Input Guidance for 3D Seismic Data Interpolation and Reconstruction Shuang Wang et.al. 2403.11482 link
2024-03-18 ALDM-Grasping: Diffusion-aided Zero-Shot Sim-to-Real Transfer for Robot Grasping Yiwei Li et.al. 2403.11459 null
2024-03-18 CasSR: Activating Image Power for Real-World Image Super-Resolution Haolan Chen et.al. 2403.11451 null
2024-03-18 VmambaIR: Visual State Space Model for Image Restoration Yuan Shi et.al. 2403.11423 link
2024-03-18 DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation Jeongsol Kim et.al. 2403.11415 link
2024-03-18 Divide-and-Conquer Posterior Sampling for Denoising Diffusion Priors Yazid Janati et.al. 2403.11407 link
2024-03-17 StainDiffuser: MultiTask Dual Diffusion Model for Virtual Staining Tushar Kataria et.al. 2403.11340 null
2024-03-17 Fast Personalized Text-to-Image Syntheses With Attention Injection Yuxuan Zhang et.al. 2403.11284 null
2024-03-17 Understanding Diffusion Models by Feynman’s Path Integral Yuji Hirono et.al. 2403.11262 null
2024-03-17 THOR: Text to Human-Object Interaction Diffusion via Relation Intervention Qianyang Wu et.al. 2403.11208 null
2024-03-17 MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation Yasufumi Kawano et.al. 2403.11194 link
2024-03-14 SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior Huan-ang Gao et.al. 2403.09638 null
2024-03-14 3D-VLA: A 3D Vision-Language-Action Generative World Model Haoyu Zhen et.al. 2403.09631 null
2024-03-14 Generalized Predictive Model for Autonomous Driving Jiazhi Yang et.al. 2403.09630 link
2024-03-14 Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation Fangfu Liu et.al. 2403.09625 null
2024-03-14 Score-Guided Diffusion for 3D Human Recovery Anastasis Stathopoulos et.al. 2403.09623 link
2024-03-14 Explore In-Context Segmentation via Latent Diffusion Models Chaoyang Wang et.al. 2403.09616 null
2024-03-14 MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models Zunnan Xu et.al. 2403.09471 link
2024-03-14 Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing Wonjun Kang et.al. 2403.09468 link
2024-03-14 Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk Zhangheng Li et.al. 2403.09450 link
2024-03-14 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation Frank Zhang et.al. 2403.09439 null
2024-03-14 LM2D: Lyrics- and Music-Driven Dance Synthesis Wenjie Yin et.al. 2403.09407 null
2024-03-14 Mitigating Data Consistency Induced Discrepancy in Cascaded Diffusion Models for Sparse-view CT Reconstruction Hanyu Chen et.al. 2403.09355 null
2024-03-14 HeadEvolver: Text to Head Avatars via Locally Learnable Mesh Deformation Duotun Wang et.al. 2403.09326 null
2024-03-14 Regularity and trend to equilibrium for a non-local advection-diffusion model of active particles Luca Alasio et.al. 2403.09282 null
2024-03-14 XReal: Realistic Anatomy and Pathology-Aware X-ray Generation via Controllable Diffusion Model Anees Ur Rehman Hashmi et.al. 2403.09240 link
2024-03-14 Intention-driven Ego-to-Exo Video Generation Hongchen Luo et.al. 2403.09194 null
2024-03-14 Intention-aware Denoising Diffusion Model for Trajectory Prediction Chen Liu et.al. 2403.09190 null
2024-03-14 Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts Byeongjun Park et.al. 2403.09176 link
2024-03-14 Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior Cheng Chen et.al. 2403.09140 null
2024-03-14 Rethinking Referring Object Removal Xiangtian Xue et.al. 2403.09128 null
2024-03-13 VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Enric Corona et.al. 2403.08764 null
2024-03-13 Spatiotemporal Diffusion Model with Paired Sampling for Accelerated Cardiac Cine MRI Shihan Qiu et.al. 2403.08758 null
2024-03-13 Clinically Feasible Diffusion Reconstruction for Highly-Accelerated Cardiac Cine MRI Shihan Qiu et.al. 2403.08749 null
2024-03-14 GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing Jing Wu et.al. 2403.08733 link
2024-03-13 Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data Asad Aali et.al. 2403.08728 link
2024-03-13 Data Augmentation in Human-Centric Vision Wentao Jiang et.al. 2403.08650 null
2024-03-13 ActionDiffusion: An Action-aware Diffusion Model for Procedure Planning in Instructional Videos Lei Shi et.al. 2403.08591 null
2024-03-13 Federated Knowledge Graph Unlearning via Diffusion Model Bingchen Liu et.al. 2403.08554 null
2024-03-13 Model Will Tell: Training Membership Inference for Diffusion Models Xiaomeng Fu et.al. 2403.08487 null
2024-03-13 MD-Dose: A Diffusion Model based on the Mamba for Radiotherapy Dose Prediction Linjie Fu et.al. 2403.08479 link
2024-03-13 An Analysis of Human Alignment of Latent Diffusion Models Lorenz Linhardt et.al. 2403.08469 null
2024-03-13 Diffusion Models with Implicit Guidance for Medical Anomaly Detection Cosmin I. Bercea et.al. 2403.08464 link
2024-03-13 Towards Dense and Accurate Radar Perception Via Efficient Cross-Modal Diffusion Model Ruibin Zhang et.al. 2403.08460 link
2024-03-13 PFStorer: Personalized Face Restoration and Super-Resolution Tuomas Varanka et.al. 2403.08436 null
2024-03-13 Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification Shuhan Li et.al. 2403.08407 null
2024-03-13 Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models Pengze Zhang et.al. 2403.08381 link
2024-03-13 Mitigate Target-level Insensitivity of Infrared Small Target Detection via Posterior Distribution Modeling Haoqing Li et.al. 2403.08380 link
2024-03-13 VIGFace: Virtual Identity Generation Model for Face Image Synthesis Minsoo Kim et.al. 2403.08277 link
2024-03-13 Sketch2Manga: Shaded Manga Screening from Sketch with Diffusion Models Jian Lin et.al. 2403.08266 null
2024-03-13 Make Me Happier: Evoking Emotions Through Image Diffusion Models Qing Lin et.al. 2403.08255 null
2024-03-11 BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion Xuan Ju et.al. 2403.06976 link
2024-03-11 Bayesian Diffusion Models for 3D Shape Reconstruction Haiyang Xu et.al. 2403.06973 null
2024-03-11 POD-ROM methods: from a finite set of snapshots to continuous-in-time approximations Bosco Garcia-Archilla et.al. 2403.06967 null
2024-03-11 SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data Jialu Li et.al. 2403.06952 null
2024-03-12 DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations Tianhao Qi et.al. 2403.06951 link
2024-03-11 Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction Qing Xiao et.al. 2403.06940 null
2024-03-11 Estimation of parameters and local times in a discretely observed threshold diffusion model Sara Mazzonetto et.al. 2403.06858 null
2024-03-11 Multistep Consistency Models Jonathan Heek et.al. 2403.06807 null
2024-03-11 Distribution-Aware Data Expansion with Diffusion Models Haowei Zhu et.al. 2403.06741 link
2024-03-11 V3D: Video Diffusion Models are Effective 3D Generators Zilong Chen et.al. 2403.06738 link
2024-03-11 Active Generation for Image Classification Tao Huang et.al. 2403.06517 link
2024-03-11 Advancing Text-Driven Chest X-Ray Generation with Policy-Based Reinforcement Learning Woojung Han et.al. 2403.06516 null
2024-03-11 Incorporating Improved Sinusoidal Threshold-based Semi-supervised Method and Diffusion Models for Osteoporosis Diagnosis Wenchi Ke et.al. 2403.06498 null
2024-03-11 Are you sure? Modelling Drivers’ Confidence Judgments in Left-Turn Gap Acceptance Decisions Arkady Zgonnikov et.al. 2403.06496 null
2024-03-11 Text2QR: Harmonizing Aesthetic Customization and Scanning Robustness for Text-Guided QR Code Generation Guangyang Wu et.al. 2403.06452 link
2024-03-11 DivCon: Divide and Conquer for Progressive Text-to-Image Generation Yuhao Jia et.al. 2403.06400 link
2024-03-11 FSViewFusion: Few-Shots View Generation of Novel Objects Rukhshanda Hussain et.al. 2403.06394 null
2024-03-11 Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models Yang Zhang et.al. 2403.06381 link
2024-03-12 Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style Shuai Tan et.al. 2403.06365 null
2024-03-10 Transferable Reinforcement Learning via Generalized Occupancy Models Chuning Zhu et.al. 2403.06328 null
2024-03-07 ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes Hashmat Shadab Malik et.al. 2403.04701 link
2024-03-07 Delving into the Trajectory Long-tail Distribution for Muti-object Tracking Sijia Chen et.al. 2403.04700 link
2024-03-07 PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation Junsong Chen et.al. 2403.04692 link
2024-03-07 Pix2Gif: Motion-Guided Diffusion for GIF Generation Hitesh Kandala et.al. 2403.04634 link
2024-03-07 A Domain Translation Framework with an Adversarial Denoising Diffusion Model to Generate Synthetic Datasets of Echocardiography Images Cristiana Tiago et.al. 2403.04612 null
2024-03-07 Anatomy-Guided Surface Diffusion Model for Alzheimer’s Disease Normative Modeling Jianwei Zhang et.al. 2403.04531 null
2024-03-07 Effect of turbulent diffusion in modeling anaerobic digestion Jeremy Z. Yan et.al. 2403.04457 null
2024-03-07 Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal Denoiser Qingyuan Cai et.al. 2403.04444 link
2024-03-07 StableDrag: Stable Dragging for Point-based Image Editing Yutao Cui et.al. 2403.04437 null
2024-03-07 On-demand Quantization for Green Federated Generative Diffusion in Mobile Edge Networks Bingkun Lai et.al. 2403.04430 null
2024-03-07 Controllable Generation with Text-to-Image Diffusion Models: A Survey Pu Cao et.al. 2403.04279 link
2024-03-06 PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement Zhijie Wang et.al. 2403.04014 link
2024-03-06 GUIDE: Guidance-based Incremental Learning with Diffusion Models Bartosz Cywiński et.al. 2403.03938 link
2024-03-06 Latent Dataset Distillation with Diffusion Models Brian B. Moser et.al. 2403.03881 null
2024-03-06 Accelerating Convergence of Score-Based Diffusion Models, Provably Gen Li et.al. 2403.03852 null
2024-03-06 Diffusion on language model embeddings for protein sequence generation Viacheslav Meshchaninov et.al. 2403.03726 null
2024-03-06 Efficient Search and Learning for Agile Locomotion on Stepping Stones Adithya Kumar Chinnakkonda Ravi et.al. 2403.03639 null
2024-03-06 Diffusion-based Generative Prior for Low-Complexity MIMO Channel Estimation Benedikt Fesl et.al. 2403.03545 link
2024-03-06 NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging Takahiro Shirakawa et.al. 2403.03485 link
2024-03-06 FLAME Diffuser: Grounded Wildfire Image Synthesis using Mask Guided Diffusion Hao Wang et.al. 2403.03463 link
2024-03-06 Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing Bingyan Liu et.al. 2403.03431 null
2024-03-05 Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Patrick Esser et.al. 2403.03206 null
2024-03-05 MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets Hossein Aboutalebi et.al. 2403.03194 link
2024-03-05 NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models Zeqian Ju et.al. 2403.03100 null
2024-03-05 Global N-body Simulation of Gap Edge Structures Created by Perturbations from a Small Satellite Embedded in Saturn’s Rings Naoya Torii et.al. 2403.03012 null
2024-03-05 Cross-Domain Image Conversion by CycleDM Sho Shimotsumagari et.al. 2403.02919 null
2024-03-05 MMoFusion: Multi-modal Co-Speech Motion Generation with Diffusion Model Sen Wang et.al. 2403.02905 link
2024-03-05 Enhancing the Rate-Distortion-Perception Flexibility of Learned Image Codecs with Conditional Diffusion Decoders Daniele Mari et.al. 2403.02887 null
2024-03-05 Zero-LED: Zero-Reference Lighting Estimation Diffusion Model for Low-Light Image Enhancement Jinhong He et.al. 2403.02879 null
2024-03-05 Scalable Continuous-time Diffusion Framework for Network Inference and Influence Estimation Keke Huang et.al. 2403.02867 link
2024-03-05 Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation Weijie Li et.al. 2403.02827 null
2024-03-05 Fast, Scale-Adaptive, and Uncertainty-Aware Downscaling of Earth System Model Fields with Generative Foundation Models Philipp Hess et.al. 2403.02774 null
2024-03-02 DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction Junwen Xiong et.al. 2403.01226 null
2024-03-02 TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through Diffusion Salaheldin Mohamed et.al. 2403.01212 null
2024-03-02 Training Unbiased Diffusion Models From Biased Dataset Yeongmin Kim et.al. 2403.01189 link
2024-03-02 Volume diffusion modelling of a sheared granular gas Duncan Dockar et.al. 2403.01188 null
2024-03-02 Text-guided Explorable Image Super-resolution Kanchana Vaishnavi Gandikota et.al. 2403.01124 null
2024-03-02 Face Swap via Diffusion Model Feifei Wang et.al. 2403.01108 link
2024-03-01 A time-stepping deep gradient flow method for option pricing in (rough) diffusion models Antonis Papapantoleon et.al. 2403.00746 link
2024-03-01 Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks Yuhao Liu et.al. 2403.00644 null
2024-03-01 Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived Dataset Ander Salaberria et.al. 2403.00587 link
2024-03-01 Rethinking cluster-conditioned diffusion models Nikolas Adaloglou et.al. 2403.00570 link
2024-03-01 Waves, patterns and bifurcations: a tutorial review on the vertebrate segmentation clock Paul François et.al. 2403.00457 null
2024-03-01 An Ordinal Diffusion Model for Generating Medical Images with Different Severity Levels Shumpei Takezaki et.al. 2403.00452 null
2024-03-01 LoMOE: Localized Multi-Object Editing via Multi-Diffusion Goirik Chakrabarty et.al. 2403.00437 null
2024-03-01 Abductive Ego-View Accident Video Understanding for Safe Driving Perception Jianwu Fang et.al. 2403.00436 null
2024-03-01 HyperSDFusion: Bridging Hierarchical Structures in Language and Geometry for Enhanced 3D Text2Shape Generation Zhiying Leng et.al. 2403.00372 null
2024-03-01 Robust Policy Learning via Offline Skill Diffusion Woo Kyung Kim et.al. 2403.00225 null
2024-02-29 DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models Muyang Li et.al. 2402.19481 link
2024-02-29 Towards Generalizable Tumor Synthesis Qi Chen et.al. 2402.19470 link
2024-02-29 Listening to the Noise: Blind Denoising with Gibbs Diffusion David Heurtel-Depeiges et.al. 2402.19455 link
2024-02-29 Structure Preserving Diffusion Models Haoye Lu et.al. 2402.19369 null
2024-02-29 A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation Hanxi Li et.al. 2402.19330 link
2024-02-29 DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly Gianluca Scarpellini et.al. 2402.19302 link
2024-02-29 TEncDM: Understanding the Properties of Diffusion Model in the Space of Language Model Encodings Alexander Shabalin et.al. 2402.19097 link
2024-02-29 Graph Convolutional Neural Networks for Automated Echocardiography View Recognition: A Holistic Approach Sarina Thomas et.al. 2402.19062 null
2024-02-29 WDM: 3D Wavelet Diffusion Models for High-Resolution Medical Image Synthesis Paul Friedrich et.al. 2402.19043 link
2024-02-29 Generating, Reconstructing, and Representing Discrete and Continuous Data: Generalized Diffusion with Learnable Encoding-Decoding Guangyi Liu et.al. 2402.19009 link
2024-02-29 ViewFusion: Towards Multi-View Consistency via Interpolated Denoising Xianghui Yang et.al. 2402.18842 link
2024-02-29 Extended Flow Matching: a Method of Conditional Generation with Generalized Continuity Equation Noboru Isobe et.al. 2402.18839 null
2024-02-29 A Quantitative Evaluation of Score Distillation Sampling Based Text-to-3D Xiaohan Fei et.al. 2402.18780 null
2024-02-28 Exploring Privacy and Fairness Risks in Sharing Diffusion Models: An Adversarial Perspective Xinjian Luo et.al. 2402.18607 null
2024-02-28 Logarithmic Sobolev Inequalities for Bounded Domains and Applications to Drift-Diffusion Equations Elie Abdo et.al. 2402.18572 null
2024-02-28 Dynamical Regimes of Diffusion Models Giulio Biroli et.al. 2402.18491 null
2024-02-28 Deep Confident Steps to New Pockets: Strategies for Docking Generalization Gabriele Corso et.al. 2402.18396 link
2024-02-28 Objective and Interpretable Breast Cosmesis Evaluation with Attention Guided Denoising Diffusion Anomaly Detection Model Sangjoon Park et.al. 2402.18362 null
2024-02-28 FineDiffusion: Scaling up Diffusion Models for Fine-grained Image Generation with 10,000 Classes Ziying Pan et.al. 2402.18331 link
2024-02-28 Balancing Act: Distribution-Guided Debiasing in Diffusion Models Rishubh Parihar et.al. 2402.18206 null
2024-02-28 Diffusion-based Neural Network Weights Generation Bedionita Soro et.al. 2402.18153 link
2024-02-28 Context-aware Talking Face Video Generation Meidai Xuanyuan et.al. 2402.18092 null
2024-02-28 Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis Yanzuo Lu et.al. 2402.18078 link
2024-02-28 SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model Bin Cao et.al. 2402.18068 link
2024-02-28 Diffusion Models as Constrained Samplers for Optimization with Unknown Constraints Lingkai Kong et.al. 2402.18012 null
2024-02-28 Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning Zeyang Liu et.al. 2402.17978 null
2024-02-27 Box It to Bind It: Unified Layout Control and Attribute Binding in T2I Diffusion Models Ashkan Taghipour et.al. 2402.17910 link
2024-02-27 Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning Xiaoyu Zhang et.al. 2402.17768 null
2024-02-27 Structure-Guided Adversarial Training of Diffusion Models Ling Yang et.al. 2402.17563 null
2024-02-27 Diffusion Model-Based Image Editing: A Survey Yi Huang et.al. 2402.17525 link
2024-02-27 Label-Noise Robust Diffusion Models Byeonghu Na et.al. 2402.17517 link
2024-02-27 EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions Linrui Tian et.al. 2402.17485 null
2024-02-28 DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models Shyam Marjit et.al. 2402.17412 null
2024-02-27 Generative diffusion model for surface structure discovery Nikolaj Rønne et.al. 2402.17404 null
2024-02-26 Stochastic Conditional Diffusion Models for Semantic Image Synthesis Juyeon Ko et.al. 2402.16506 link
2024-02-26 Outline-Guided Object Inpainting with Diffusion Models Markus Pobitzer et.al. 2402.16421 null
2024-02-26 Placing Objects in Context via Inpainting for Out-of-distribution Segmentation Pau de Jorge et.al. 2402.16392 link
2024-02-26 Generative AI in Vision: A Survey on Models, Metrics and Applications Gaurav Raut et.al. 2402.16369 null
2024-02-26 Feedback Efficient Online Fine-Tuning of Diffusion Models Masatoshi Uehara et.al. 2402.16359 null
2024-02-26 Graph Diffusion Policy Optimization Yijing Liu et.al. 2402.16302 link
2024-02-25 Photon-counting CT using a Conditional Diffusion Model for Super-resolution and Texture-preservation Christopher Wiedeman et.al. 2402.16212 null
2024-02-25 Towards Efficient Quantum Hybrid Diffusion Models Francesca De Falco et.al. 2402.16147 null
2024-02-25 Cinematographic Camera Diffusion Model Hongda Jiang et.al. 2402.16143 link
2024-02-25 Behavioral Refinement via Interpolant-based Policy Diffusion Kaiqi Chen et.al. 2402.16075 link
2024-02-24 HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models Li Pang et.al. 2402.15865 link
2024-02-23 Minimax Optimality of Score-based Diffusion Models: Beyond the Density Lower Bound Assumptions Kaihong Zhang et.al. 2402.15602 null
2024-02-23 Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition Chun-Hsiao Yeh et.al. 2402.15504 link
2024-02-23 ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation Yi Zhang et.al. 2402.15429 link
2024-02-23 Let’s Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models Shunyu Liu et.al. 2402.15289 link
2024-02-23 Weak Reproductive Solutions for a Convection-Diffusion Model Describing a Binary Alloy Solidification Processes Blanca Climent-Ezquerra et.al. 2402.15221 null
2024-02-23 Label-efficient Multi-organ Segmentation Method with Diffusion Model Yongzhi Huang et.al. 2402.15216 null
2024-02-23 Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control Masatoshi Uehara et.al. 2402.15194 null
2024-02-23 Dynamics-Guided Diffusion Model for Robot Manipulator Design Xiaomeng Xu et.al. 2402.15038 null
2024-02-22 Cameras as Rays: Pose Estimation via Ray Diffusion Jason Y. Zhang et.al. 2402.14817 null
2024-02-22 Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models Yixuan Ren et.al. 2402.14780 null
2024-02-22 Debiasing Text-to-Image Diffusion Models Ruifei He et.al. 2402.14577 null
2024-02-22 Model-Based Reinforcement Learning Control of Reaction-Diffusion Problems Christina Schenk et.al. 2402.14446 null
2024-02-22 Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning Haoran He et.al. 2402.14407 link
2024-02-22 Diffusion Model Based Visual Compensation Guidance and Visual Difference Analysis for No-Reference Image Quality Assessment Zhaoyang Wang et.al. 2402.14401 link
2024-02-22 Typographic Text Generation with Off-the-Shelf Diffusion Model KhayTze Peong et.al. 2402.14314 null
2024-02-22 Font Style Interpolation with Diffusion Models Tetta Kondo et.al. 2402.14311 null
2024-02-22 Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion Yujia Huang et.al. 2402.14285 link
2024-02-22 MVD $^2$ : Efficient Multiview 3D Reconstruction for Multiview Diffusion Xin-Yang Zheng et.al. 2402.14253 null
2024-02-21 T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching Zizheng Pan et.al. 2402.14167 link
2024-02-21 Non-asymptotic Convergence of Discrete-time Diffusion Models: New Approach and Improved Rate Yuchen Liang et.al. 2402.13901 null
2024-02-21 NeuralDiffuser: Controllable fMRI Reconstruction with Primary Visual Feature Guided Diffusion Haoyu Li et.al. 2402.13809 link
2024-02-22 Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions Jiayu Chen et.al. 2402.13777 link
2024-02-21 Cas-DiffCom: Cascaded diffusion model for infant longitudinal super-resolution 3D medical image completion Lianghu Guo et.al. 2402.13776 null
2024-02-21 Music Style Transfer with Time-Varying Inversion of Diffusion Models Sifei Li et.al. 2402.13763 null
2024-02-21 SRNDiff: Short-term Rainfall Nowcasting with Condition Diffusion Model Xudong Ling et.al. 2402.13737 link
2024-02-21 Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation Kihong Kim et.al. 2402.13729 null
2024-02-21 Flexible Physical Camouflage Generation Based on a Differential Approach Yang Li et.al. 2402.13575 null
2024-02-21 ToDo: Token Downsampling for Efficient Generation of High-Resolution Images Ethan Smith et.al. 2402.13573 null
2024-02-21 Generative AI for Secure Physical Layer Communications: A Survey Changyuan Zhao et.al. 2402.13553 null
2024-02-21 DiffPLF: A Conditional Diffusion Model for Probabilistic Forecasting of EV Charging Load Siyang Li et.al. 2402.13548 link
2024-02-21 Contrastive Prompts Improve Disentanglement in Text-to-Image Diffusion Models Chen Wu et.al. 2402.13490 null
2024-02-20 Layout-to-Image Generation with Localized Descriptions using ControlNet with Cross-Attention Control Denis Lukovnikov et.al. 2402.13404 null
2024-02-20 The Uncanny Valley: A Comprehensive Analysis of Diffusion Models Karam Ghanem et.al. 2402.13369 null
2024-02-20 Neural Network Diffusion Kai Wang et.al. 2402.13144 link
2024-02-20 Text-Guided Molecule Generation with Diffusion Language Model Haisong Gong et.al. 2402.13040 link
2024-02-21 Visual Style Prompting with Swapping Self-Attention Jaeseok Jeong et.al. 2402.12974 link
2024-02-20 CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection Sohail Ahmed Khan et.al. 2402.12927 link
2024-02-20 RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models Xinchen Zhang et.al. 2402.12908 link
2024-02-20 Two-stage Rainfall-Forecasting Diffusion Model XuDong Ling et.al. 2402.12779 link
2024-02-19 FiT: Flexible Vision Transformer for Diffusion Model Zeyu Lu et.al. 2402.12376 link
2024-02-19 Synthetic location trajectory generation using categorical diffusion models Simon Dirmeier et.al. 2402.12242 link
2024-02-19 Adversarial Feature Alignment: Balancing Robustness and Accuracy in Deep Learning via Adversarial Training Leo Hyun Park et.al. 2402.12187 null
2024-02-19 Human Video Translation via Query Warping Haiming Zhu et.al. 2402.12099 null
2024-02-19 Direct Consistency Optimization for Compositional Text-to-Image Personalization Kyungmin Lee et.al. 2402.12004 null
2024-02-19 Privacy-Preserving Low-Rank Adaptation for Latent Diffusion Models Zihao Luo et.al. 2402.11989 link
2024-02-19 DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation Chong Zeng et.al. 2402.11929 link
2024-02-19 A Generative Pre-Training Framework for Spatio-Temporal Graph Transfer Learning Yuan Yuan et.al. 2402.11922 link
2024-02-19 ComFusion: Personalized Subject Generation in Multiple Specific Scenes From Single Image Yan Hong et.al. 2402.11849 null
2024-02-19 UnlearnCanvas: A Stylized Image Dataset to Benchmark Machine Unlearning for Diffusion Models Yihua Zhang et.al. 2402.11846 link
2024-02-19 WildFake: A Large-scale Challenging Dataset for AI-Generated Images Detection Yan Hong et.al. 2402.11843 null
2024-02-19 Statistical Test for Generated Hypotheses by Diffusion Models Teruyuki Katsuoka et.al. 2402.11789 null
2024-02-19 Towards Theoretical Understandings of Self-Consuming Generative Models Shi Fu et.al. 2402.11778 null
2024-02-18 SDiT: Spiking Diffusion Model with Transformer Shu Yang et.al. 2402.11588 null
2024-02-18 CaloGraph: Graph-based diffusion model for fast shower generation in calorimeters with irregular geometry Dmitrii Kobylianskii et.al. 2402.11575 null
2024-02-18 Temporal Disentangled Contrastive Diffusion Model for Spatiotemporal Imputation Yakun Chen et.al. 2402.11558 null
2024-02-18 Visual Concept-driven Image Generation with Text-to-Image Diffusion Model Tanzila Rahman et.al. 2402.11487 null
2024-02-17 Partial Ly $α$ thermalization in an analytic nonlinear diffusion model Georg Wolschin et.al. 2402.11320 null
2024-02-17 TC-DiffRecon: Texture coordination MRI reconstruction method based on diffusion model and modified MF-UNet method Chenyan Zhang et.al. 2402.11274 link
2024-02-17 DiffPoint: Single and Multi-view Point Cloud Reconstruction with ViT Based Diffusion Model Yu Feng et.al. 2402.11241 null
2024-02-15 Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation Huizhuo Yuan et.al. 2402.10210 null
2024-02-15 Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment Rui Yang et.al. 2402.10207 link
2024-02-15 Radio-astronomical Image Reconstruction with Conditional Denoising Diffusion Model Mariia Drozdova et.al. 2402.10204 link
2024-02-15 Classification Diffusion Models Shahar Yadin et.al. 2402.10095 null
2024-02-15 Diffusion Models Meet Contextual Bandits with Large Action Spaces Imad Aouali et.al. 2402.10028 null
2024-02-15 Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion Hila Manor et.al. 2402.10009 null
2024-02-15 Accelerating Parallel Sampling of Diffusion Models Zhiwei Tang et.al. 2402.09970 link
2024-02-15 Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation Junjie Shentu et.al. 2402.09966 link
2024-02-15 Lester: rotoscope animation through video object segmentation and tracking Ruben Tous et.al. 2402.09883 link
2024-02-15 Diffusion Models for Audio Restoration Jean-Marie Lemercier et.al. 2402.09821 null
2024-02-15 DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization Jisu Nam et.al. 2402.09812 link
2024-02-15 Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement Tao Yang et.al. 2402.09712 null
2024-02-14 Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection Pengfei Zhou et.al. 2402.09242 link
2024-02-14 Semi-Supervised Diffusion Model for Brain Age Prediction Ayodeji Ijishakin et.al. 2402.09137 null
2024-02-14 L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects Yutaro Yamada et.al. 2402.09052 null
2024-02-14 Extreme Video Compression with Pre-trained Diffusion Models Bohan Li et.al. 2402.08934 link
2024-02-14 The Mirrored Influence Hypothesis: Efficient Data Influence Estimation by Harnessing Forward Passes Myeongseob Ko et.al. 2402.08922 link
2024-02-13 Percolating transition to turbulence without puffs or bands Sébastien Gomé et.al. 2402.08829 null
2024-02-13 LDTrack: Dynamic People Tracking by Service Robots using Diffusion Models Angus Fung et.al. 2402.08774 null
2024-02-13 Towards the Detection of AI-Synthesized Human Face Images Yuhang Lu et.al. 2402.08750 null
2024-02-13 PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models Fei Deng et.al. 2402.08714 null
2024-02-13 Zero Shot Molecular Generation via Similarity Kernels Rokas Elijošius et.al. 2402.08708 link
2024-02-13 Chain Reaction of Ideas: Can Radioactive Decay Predict Technological Innovation? Guilherme S. Y. Giardini et.al. 2402.08681 null
2024-02-13 Target Score Matching Valentin De Bortoli et.al. 2402.08667 null
2024-02-13 Learning Continuous 3D Words for Text-to-Image Generation Ta-Ying Cheng et.al. 2402.08654 link
2024-02-13 Denoising Diffusion Restoration Tackles Forward and Inverse Problems for the Laplace Operator Amartya Mukherjee et.al. 2402.08563 null
2024-02-13 Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases Ziyi Zhang et.al. 2402.08552 link
2024-02-13 A Dense Reward View on Aligning Text-to-Image Diffusion with Preference Shentao Yang et.al. 2402.08265 link
2024-02-13 Fine-Tuning Text-To-Image Diffusion Models for Class-Wise Spurious Feature Generation AprilPyone MaungMaung et.al. 2402.08200 null
2024-02-14 Convergence Analysis of Discrete Diffusion Model: Exact Implementation through Uniformization Hongrui Chen et.al. 2402.08095 null
2024-02-12 Nearest Neighbour Score Estimators for Diffusion Generative Models Matthew Niedoba et.al. 2402.08018 link
2024-02-12 Towards a mathematical theory for consistency training in diffusion models Gen Li et.al. 2402.07802 null
2024-02-12 Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models Jiacheng Ye et.al. 2402.07754 link
2024-02-12 Cosmology at the Field Level with Probabilistic Machine Learning Adam Rouhiainen et.al. 2402.07694 null
2024-02-12 Trustworthy SR: Resolving Ambiguity in Image Super-resolution via Diffusion Models and Human Feedback Cansu Korkmaz et.al. 2402.07597 null
2024-02-12 Score-based Diffusion Models via Stochastic Differential Equations – a Technical Tutorial Wenpin Tang et.al. 2402.07487 null
2024-02-12 SALAD: Smart AI Language Assistant Daily Ragib Amin Nihal et.al. 2402.07431 null
2024-02-12 Diff-RNTraj: A Structure-aware Diffusion Model for Road Network-constrained Trajectory Generation Tonglong Wei et.al. 2402.07369 link
2024-02-11 Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL Sungyoon Kim et.al. 2402.07226 link
2024-02-11 Towards Fast Stochastic Sampling in Diffusion Generative Models Kushagra Pandey et.al. 2402.07211 null
2024-02-10 Synthesizing CTA Image Data for Type-B Aortic Dissection using Stable Diffusion Models Ayman Abaid et.al. 2402.06969 null
2024-02-09 Towards Principled Assessment of Tabular Data Synthesis Algorithms Yuntao Du et.al. 2402.06806 link
2024-02-09 Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following Brian Yang et.al. 2402.06559 link
2024-02-09 Sequential Flow Matching for Generative Modeling Jongmin Yoon et.al. 2402.06461 null
2024-02-09 ControlUDA: Controllable Diffusion-assisted Unsupervised Domain Adaptation for Cross-Weather Semantic Segmentation Fengyi Shen et.al. 2402.06446 null
2024-02-09 Improving 2D-3D Dense Correspondences with Diffusion Models for 6D Object Pose Estimation Peter Hönig et.al. 2402.06436 null
2024-02-09 Particle Denoising Diffusion Sampler Angus Phillips et.al. 2402.06320 link
2024-02-09 Controllable seismic velocity synthesis using generative diffusion models Fu Wang et.al. 2402.06277 null
2024-02-09 MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models Yixiao Zhang et.al. 2402.06178 link
2024-02-08 CLR-Face: Conditional Latent Refinement for Blind Face Restoration Using Score-Based Diffusion Models Maitreya Suin et.al. 2402.06106 null
2024-02-08 Animated Stickers: Bringing Stickers to Life with Video Diffusion David Yan et.al. 2402.06088 null
2024-02-08 InstaGen: Enhancing Object Detection by Training on Synthetic Dataset Chengjian Feng et.al. 2402.05937 null
2024-02-08 Time Series Diffusion in the Frequency Domain Jonathan Crabbé et.al. 2402.05933 link
2024-02-08 AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning Wamiq Reyaz Para et.al. 2402.05803 null
2024-02-08 DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer Zhiyuan Ma et.al. 2402.05712 link
2024-02-08 Scalable Diffusion Models with State Space Backbone Zhengcong Fei et.al. 2402.05608 link
2024-02-08 Get What You Want, Not What You Don’t: Image Content Suppression for Text-to-Image Diffusion Models Senmao Li et.al. 2402.05375 link
2024-02-08 Descanning: From Scanned to the Original Images with a Color Correction Diffusion Model Junghun Cha et.al. 2402.05350 null
2024-02-07 SPAD : Spatially Aware Multiview Diffusers Yash Kant et.al. 2402.05235 null
2024-02-07 Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models Nicholas Konz et.al. 2402.05210 link
2024-02-07 $λ$ -ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space Maitreya Patel et.al. 2402.05195 null
2024-02-07 On diffusion models for amortized inference: Benchmarking and improving stochastic control and sampling Marcin Sendera et.al. 2402.05098 link
2024-02-07 NITO: Neural Implicit Fields for Resolution-free Topology Optimization Amin Heyrani Nobari et.al. 2402.05073 link
2024-02-07 LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation Jiaxiang Tang et.al. 2402.05054 null
2024-02-07 Generative Flows on Discrete State-Spaces: Enabling Multimodal Flows with Applications to Protein Co-Design Andrew Campbell et.al. 2402.04997 link
2024-02-07 Blue noise for diffusion models Xingchang Huang et.al. 2402.04930 link
2024-02-07 Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation Shivang Chopra et.al. 2402.04929 null
2024-02-07 Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints Jian Chen et.al. 2402.04754 link
2024-02-07 Cortical Surface Diffusion Generative Models Zhenshan Xie et.al. 2402.04753 null
2024-02-07 EvoSeed: Unveiling the Threat on Deep Neural Networks with Real-World Illusions Shashank Kotyan et.al. 2402.04699 link
2024-02-07 Noise Map Guidance: Inversion with Spatial Context for Real Image Editing Hansam Cho et.al. 2402.04625 link
2024-02-07 BRI3L: A Brightness Illusion Image Dataset for Identification and Localization of Regions of Illusory Perception Aniket Roy et.al. 2402.04541 link
2024-02-07 Text2Street: Controllable Text-to-image Generation for Street Views Jinming Su et.al. 2402.04504 null
2024-02-06 Fine-Tuned Language Models Generate Stable Inorganic Materials as Text Nate Gruver et.al. 2402.04379 link
2024-02-06 Bidirectional Autoregressive Diffusion Model for Dance Generation Canyu Zhang et.al. 2402.04356 link
2024-02-06 Polyp-DDPM: Diffusion-Based Semantic Polyp Synthesis for Enhanced Segmentation Zolnamar Dorjsembe et.al. 2402.04031 link
2024-02-06 Space Group Constrained Crystal Generation Rui Jiao et.al. 2402.03992 null
2024-02-06 Controllable Diverse Sampling for Diffusion Based Motion Behavior Forecasting Yiming Xu et.al. 2402.03981 null
2024-02-06 EscherNet: A Generative Model for Scalable View Synthesis Xin Kong et.al. 2402.03908 link
2024-02-06 On gauge freedom, conservativity and intrinsic dimensionality estimation in diffusion models Christian Horvat et.al. 2402.03845 null
2024-02-06 SDEMG: Score-based Diffusion Model for Surface Electromyographic Signal Denoising Yu-Tung Liu et.al. 2402.03808 link
2024-02-05 Do Diffusion Models Learn Semantically Meaningful and Efficient Representations? Qiyao Liang et.al. 2402.03305 null
2024-02-05 Zero-shot Object-Level OOD Detection with Context-Aware Inpainting Quang-Huy Nguyen et.al. 2402.03292 null
2024-02-05 InstanceDiffusion: Instance-level Control for Image Generation Xudong Wang et.al. 2402.03290 link
2024-02-05 Organic or Diffused: Can We Distinguish Human Art from AI-generated Images? Anna Yoo Jeong Ha et.al. 2402.03214 null
2024-02-05 Light and Optimal Schrödinger Bridge Matching Nikita Gushchin et.al. 2402.03207 link
2024-02-05 Guidance with Spherical Gaussian Constraint for Conditional Diffusion Lingxiao Yang et.al. 2402.03201 link
2024-02-05 Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion Shiyuan Yang et.al. 2402.03162 null
2024-02-05 PFDM: Parser-Free Virtual Try-on via Diffusion Model Yunfang Niu et.al. 2402.03047 null
2024-02-05 Diffusive Gibbs Sampling Wenlin Chen et.al. 2402.03008 link
2024-02-05 DexDiffuser: Generating Dexterous Grasps with Diffusion Models Zehang Weng et.al. 2402.02989 null
2024-02-05 Retrieval-Augmented Score Distillation for Text-to-3D Generation Junyoung Seo et.al. 2402.02972 link
2024-02-05 ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis Bernard Spiegl et.al. 2402.02906 link
2024-02-05 SynthVision – Harnessing Minimal Input for Maximal Output in Computer Vision Models using Synthetic Image data Yudara Kularathne et.al. 2402.02826 null
2024-02-05 Extreme Two-View Geometry From Object Poses with Diffusion Models Yujing Sun et.al. 2402.02800 link
2024-02-05 Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning Yixiang Shan et.al. 2402.02772 null
2024-02-05 DisDet: Exploring Detectability of Backdoor Attack on Diffusion Models Yang Sui et.al. 2402.02739 null
2024-02-04 DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing Chong Mou et.al. 2402.02583 link
2024-02-04 Latent Graph Diffusion: A Unified Framework for Generation and Prediction on Graphs Zhou Cai et.al. 2402.02518 link
2024-02-04 PoCo: Policy Composition from and for Heterogeneous Robot Learning Lirui Wang et.al. 2402.02511 null
2024-02-04 PromptRR: Diffusion Models as Prompt Generators for Single Image Reflection Removal Tao Wang et.al. 2402.02374 link
2024-02-01 ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields Jiahua Dong et.al. 2402.00864 link
2024-02-01 An Analysis of the Variance of Diffusion-based Speech Enhancement Bunlong Lay et.al. 2402.00811 null
2024-02-01 Distilling Conditional Diffusion Models for Offline Reinforcement Learning through Trajectory Stitching Shangzhe Li et.al. 2402.00807 null
2024-02-01 AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning Fu-Yun Wang et.al. 2402.00769 link
2024-01-31 SeFi-IDE: Semantic-Fidelity Identity Embedding for Personalized Diffusion-Based Generation Yang Li et.al. 2402.00631 null
2024-02-01 Cylindrically symmetric diffusion model for relativistic heavy-ion collisions Johannes Hoelck et.al. 2402.00628 null
2024-02-01 CapHuman: Capture Your Moments in Parallel Universes Chao Liang et.al. 2402.00627 link
2024-02-01 Masked Conditional Diffusion Model for Enhancing Deepfake Detection Tiewen Chen et.al. 2402.00541 null
2024-02-01 Energetic Particles in the Central Starburst, Disc, and Halo of NGC253 Yoel Rephaeli et.al. 2402.00523 null
2024-02-01 LRDif: Diffusion Models for Under-Display Camera Emotion Recognition Zhifeng Wang et.al. 2402.00250 null
2024-01-31 SuperDiff: Diffusion Models for Conditional Generation of Hypothetical New Families of Superconductors Samuel Yuan et.al. 2402.00198 link
2024-01-31 Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators Daniel Geng et.al. 2401.18085 null
2024-01-31 Ljusternik-Schnirelmann eigenvalues for the fractional $m-$Laplacian without the $Δ_2$ condition Julian Fernandez Bonder et.al. 2401.18041 null
2024-01-31 Diagnosing the particle transport mechanism in the pulsar halo via X-ray observations Qi-Zuo Wu et.al. 2401.17982 null
2024-01-31 Convergence Analysis for General Probability Flow ODEs of Diffusion Models in Wasserstein Distances Xuefeng Gao et.al. 2401.17958 null
2024-01-31 AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error Jonas Ricker et.al. 2401.17879 link
2024-01-31 Drift Diffusion Model to understand (mis)information sharing dynamic in complex networks Lucila G. Alvarez-Zuzek et.al. 2401.17846 null
2024-01-31 A new class of efficient high order semi-Lagrangian IMEX discontinuous Galerkin methods on staggered unstructured meshes M. Tavelli et.al. 2401.17806 null
2024-01-31 Dance-to-Music Generation with Encoder-based Textual Inversion of Diffusion Models Sifei Li et.al. 2401.17800 link
2024-01-31 Image Anything: Towards Reasoning-coherent and Training-free Multi-modal Image Generation Yuanhuiyi Lyu et.al. 2401.17664 null
2024-01-31 Spatial-and-Frequency-aware Restoration method for Images based on Diffusion Models Kyungsung Lee et.al. 2401.17629 null
2024-01-31 Topology-Aware Latent Diffusion for 3D Shape Generation Jiangbei Hu et.al. 2401.17603 null
2024-01-31 Head and Neck Tumor Segmentation from [18F]F-FDG PET/CT Images Based on 3D Diffusion Model Yafei Dong et.al. 2401.17593 null
2024-01-31 Task-Oriented Diffusion Model Compression Geonung Kim et.al. 2401.17547 null
2024-01-31 Enhancing Score-Based Sampling Methods with Ensembles Tobias Bischoff et.al. 2401.17539 null
2024-01-30 You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation Mehdi Noroozi et.al. 2401.17258 null
2024-01-30 ContactGen: Contact-Guided Interactive 3D Human Generation for Partners Dongjun Gu et.al. 2401.17212 null
2024-01-30 Transfer Learning for Text Diffusion Models Kehang Han et.al. 2401.17181 null
2024-01-30 PlantoGraphy: Incorporating Iterative Design Process into Generative Artificial Intelligence for Landscape Rendering Rong Huang et.al. 2401.17120 null
2024-01-30 Local modification of subdiffusion by initial Fickian diffusion: Multiscale modeling, analysis and computation Xiangcheng Zheng et.al. 2401.16885 null
2024-01-30 A Literature Review on Fetus Brain Motion Correction in MRI Haoran Zhang et.al. 2401.16782 null
2024-01-29 Using multiple Dirac delta points to describe inhomogeneous flux density over a cell boundary in a single-cell diffusion model Qiyao Peng et.al. 2401.16261 null
2024-01-29 Diffutoon: High-Resolution Editable Toon Shading via Diffusion Models Zhongjie Duan et.al. 2401.16224 null
2024-01-29 Spatial-Aware Latent Initialization for Controllable Image Generation Wenqiang Sun et.al. 2401.16157 null
2024-01-29 DMCE: Diffusion Model Channel Enhancer for Multi-User Semantic Communication Systems Youcheng Zeng et.al. 2401.16017 null
2024-01-29 Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling Xiaoyu Shi et.al. 2401.15977 null
2024-01-29 EmoDM: A Diffusion Model for Evolutionary Multi-objective Optimization Xueming Yan et.al. 2401.15931 null
2024-01-28 Object-Driven One-Shot Fine-tuning of Text-to-Image Diffusion with Prototypical Embedding Jianxiang Lu et.al. 2401.15708 null
2024-01-28 Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance Qingcheng Zhao et.al. 2401.15687 null
2024-01-28 CPDM: Content-Preserving Diffusion Model for Underwater Image Enhancement Xiaowen Shi et.al. 2401.15649 null
2024-01-28 FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models Feihong He et.al. 2401.15636 link
2024-01-28 Generative AI-enabled Blockchain Networks: Fundamentals, Applications, and Case Study Cong T. Nguyen et.al. 2401.15625 null
2024-01-28 Diffusion-based graph generative methods Hongyang Chen et.al. 2401.15617 link
2024-01-28 Neural Network-Based Score Estimation in Diffusion Models: Optimization and Generalization Yinbin Han et.al. 2401.15604 null
2024-01-28 BrepGen: A B-rep Generative Diffusion Model with Structured Latent Geometry Xiang Xu et.al. 2401.15563 link
2024-01-27 Wind speed super-resolution and validation: from ERA5 to CERRA via diffusion models Fabio Merizzi et.al. 2401.15469 link
2024-01-27 A Survey on Data Augmentation in Large Model Era Yue Zhou et.al. 2401.15422 link
2024-01-27 GEM: Boost Simple Network for Glass Surface Segmentation via Segment Anything Model and Data Synthesis Jing Hao et.al. 2401.15282 link
2024-01-26 Annotated Hands for Generative Models Yue Yang et.al. 2401.15075 link
2024-01-26 Text Image Inpainting via Global Structure-Guided Diffusion Models Shipeng Zhu et.al. 2401.14832 link
2024-01-25 Opposite variations for pore pressure on and off the fault during simulated earthquakes in the laboratory Dong Liu et.al. 2401.14506 null
2024-01-25 Deconstructing Denoising Diffusion Models for Self-Supervised Learning Xinlei Chen et.al. 2401.14404 null
2024-01-25 pix2gestalt: Amodal Segmentation by Synthesizing Wholes Ege Ozguroglu et.al. 2401.14398 link
2024-01-25 UrbanGenAI: Reconstructing Urban Landscapes using Panoptic Segmentation and Diffusion Models Timo Kapsalis et.al. 2401.14379 null
2024-01-25 Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation Minglin Chen et.al. 2401.14257 null
2024-01-25 Scene Graph to Image Synthesis: Integrating CLIP Guidance with Graph Conditioning in Diffusion Models Rameshwar Mishra et.al. 2401.14111 null
2024-01-25 CreativeSynth: Creative Blending and Synthesis of Visual Arts based on Multimodal Diffusion Nisha Huang et.al. 2401.14066 link
2024-01-25 Diffusion-based Data Augmentation for Object Counting Problems Zhen Wang et.al. 2401.13992 null
2024-01-25 BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models Senthil Purushwalkam et.al. 2401.13974 link
2024-01-25 StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models Yalong Bai et.al. 2401.13942 null
2024-01-24 Inverse Molecular Design with Multi-Conditional Diffusion Guidance Gang Liu et.al. 2401.13858 link
2024-01-24 Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All Mehmet Saygin Seyfioglu et.al. 2401.13795 null
2024-01-24 Guided Diffusion for Fast Inverse Design of Density-based Mechanical Metamaterials Yanyan Yang et.al. 2401.13570 link
2024-01-25 UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion Wei Li et.al. 2401.13388 null
2024-01-24 Generative Design of Crystal Structures by Point Cloud Representations and Diffusion Model Zhelin Li et.al. 2401.13192 link
2024-01-24 Towards Multi-domain Face Landmark Detection with Synthetic Data from Diffusion model Yuanming Li et.al. 2401.13191 null
2024-01-24 Compositional Generative Inverse Design Tailin Wu et.al. 2401.13171 link
2024-01-24 Choose Your Diffusion: Efficient and flexible ways to accelerate the diffusion model in fast high energy physics simulation Cheng Jiang et.al. 2401.13162 null
2024-01-23 GALA: Generating Animatable Layered Assets from a Single Scan Taeksoo Kim et.al. 2401.12979 null
2024-01-24 Zero-Shot Learning for the Primitives of 3D Affordance in General Objects Hyeonwoo Kim et.al. 2401.12978 link
2024-01-23 Lumiere: A Space-Time Diffusion Model for Video Generation Omer Bar-Tal et.al. 2401.12945 null
2024-01-23 UniHDA: Towards Universal Hybrid Domain Adaptation of Image Generators Hengjia Li et.al. 2401.12596 null
2024-01-23 ToDA: Target-oriented Diffusion Attacker against Recommendation System Xiaohao Liu et.al. 2401.12578 null
2024-01-23 DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations Dogyun Park et.al. 2401.12517 link
2024-01-22 DITTO: Diffusion Inference-Time T-Optimization for Music Generation Zachary Novack et.al. 2401.12179 null
2024-01-22 Single-View 3D Human Digitalization with Large Reconstruction Models Zhenzhen Weng et.al. 2401.12175 null
2024-01-22 Feature Denoising Diffusion Model for Blind Image Quality Assessment Xudong Li et.al. 2401.11949 null
2024-01-22 EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models Koichi Namekata et.al. 2401.11739 null
2024-01-22 Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs Ling Yang et.al. 2401.11708 link
2024-01-21 Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers Katherine Crowson et.al. 2401.11605 link
2024-01-20 Diffusion Model Conditioning on Gaussian Mixture Model and Negative Gaussian Mixture Gradient Weiguo Lu et.al. 2401.11261 null
2024-01-20 Product-Level Try-on: Characteristics-preserving Try-on with Realistic Clothes Shading and Wrinkles Yanlong Zang et.al. 2401.11239 null
2024-01-20 MotionMix: Weakly-Supervised Diffusion for Controllable Motion Generation Nhat M. Hoang et.al. 2401.11115 link
2024-01-20 UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with Authenticity Guided Textures Mingyuan Zhou et.al. 2401.11078 null
2024-01-20 Make-A-Shape: a Ten-Million-scale 3D Shape Model Ka-Hei Hui et.al. 2401.11067 link
2024-01-19 Synthesizing Moving People with 3D Control Boyi Li et.al. 2401.10889 null
2024-01-19 ActAnywhere: Subject-Aware Video Background Generation Boxiao Pan et.al. 2401.10822 null
2024-01-19 From Market Saturation to Social Reinforcement: Understanding the Impact of Non-Linearity in Information Diffusion Models Tobias Friedrich et.al. 2401.10818 null
2024-01-19 Sat2Scene: 3D Urban Scene Generation from Satellite Images with Diffusion Zuoyue Li et.al. 2401.10786 null
2024-01-19 Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model Yinan Zheng et.al. 2401.10700 link
2024-01-19 MAEDiff: Masked Autoencoder-enhanced Diffusion Models for Unsupervised Anomaly Detection in Brain Images Rui Xu et.al. 2401.10561 null
2024-01-18 Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution Xin Yuan et.al. 2401.10404 null
2024-01-18 A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting Wouter Van Gansbeke et.al. 2401.10227 link
2024-01-22 Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation Changgu Chen et.al. 2401.10150 null
2024-01-18 DiffusionGPT: LLM-Driven Text-to-Image Generation System Jie Qin et.al. 2401.10061 null
2024-01-18 CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects Zhao Wang et.al. 2401.09962 null
2024-01-18 BlenDA: Domain Adaptive Object Detection through diffusion-based blending Tzuhsuan Huang et.al. 2401.09921 link
2024-01-18 Exploring Latent Cross-Channel Embedding for Accurate 3D Human Pose Reconstruction in a Diffusion Framework Junkun Jiang et.al. 2401.09836 link
2024-01-18 Wavelet-Guided Acceleration of Text Inversion in Diffusion-Based Image Editing Gwanhyeong Koo et.al. 2401.09794 null
2024-01-18 Image Translation as Diffusion Visual Programmers Cheng Han et.al. 2401.09742 null
2024-01-17 Total fraction of drug released from diffusion-controlled delivery systems with binding reactions Elliot J. Carr et.al. 2401.09644 link
2024-01-17 Efficient generative adversarial networks using linear additive-attention Transformers Emilio Morales-Juarez et.al. 2401.09596 link
2024-01-17 TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion Yu-Ying Yeh et.al. 2401.09416 null
2024-01-17 Vlogger: Make Your Dream A Vlog Shaobin Zhuang et.al. 2401.09414 link
2024-01-17 On the $\varepsilon$ -Euler-Maruyama scheme for time inhomogeneous jump-driven SDEs Mireille Bossy et.al. 2401.09338 null
2024-01-17 Siamese Meets Diffusion Network: SMDNet for Enhanced Change Detection in High-Resolution RS Imagery Jia Jia et.al. 2401.09325 null
2024-01-17 T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis Yoonjin Chung et.al. 2401.09294 link
2024-01-17 Training-Free Semantic Video Composition via Pre-trained Diffusion Model Jiaqi Guo et.al. 2401.09195 null
2024-01-17 Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior Zike Wu et.al. 2401.09050 link
2024-01-17 Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis Jonghyun Lee et.al. 2401.09048 link
2024-01-17 VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models Haoxin Chen et.al. 2401.09047 link
2024-01-17 Data Attribution for Diffusion Models: Timestep-induced Bias in Influence Estimation Tong Xie et.al. 2401.09031 link
2024-01-17 3D Human Pose Analysis via Diffusion Synthesis Haorui Ji et.al. 2401.08930 null
2024-01-16 Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive Yumeng Li et.al. 2401.08815 link
2024-01-16 Fixed Point Diffusion Models Xingjian Bai et.al. 2401.08741 link
2024-01-16 SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers Nanye Ma et.al. 2401.08740 link
2024-01-16 RoHM: Robust Human Motion Reconstruction via Diffusion Siwei Zhang et.al. 2401.08570 null
2024-01-16 Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation Mathis Petrovich et.al. 2401.08559 null
2024-01-16 Modeling Spoof Noise by De-spoofing Diffusion and its Application in Face Anti-spoofing Bin Zhang et.al. 2401.08275 null
2024-01-16 Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video Localization Chongzhi Zhang et.al. 2401.08232 null
2024-01-16 Photonic Modes Prediction via Multi-Modal Diffusion Model Jinyang Sun et.al. 2401.08199 null
2024-01-16 Key-point Guided Deformable Image Manipulation Using Diffusion Model Seok-Hwan Oh et.al. 2401.08178 null
2024-01-12 A deep implicit-explicit minimizing movement method for option pricing in jump-diffusion models Emmanuil H. Georgoulis et.al. 2401.06740 null
2024-01-12 Decoupling Pixel Flipping and Occlusion Strategy for Consistent XAI Benchmarks Stefan Blücher et.al. 2401.06654 link
2024-01-12 Adversarial Examples are Misaligned in Diffusion Model Manifolds Peter Lorenz et.al. 2401.06637 null
2024-01-12 Motion2VecSets: 4D Latent Vector Set Diffusion for Non-rigid Shape Reconstruction and Tracking Wei Cao et.al. 2401.06614 null
2024-01-12 360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model Qian Wang et.al. 2401.06578 null
2024-01-12 RotationDrag: Point-based Image Editing with Rotated Diffusion Features Minxing Luo et.al. 2401.06442 link
2024-01-12 Seek for Incantations: Towards Accurate Text-to-Image Diffusion Synthesis through Prompt Engineering Chang Yu et.al. 2401.06345 null
2024-01-11 Frequency-Time Diffusion with Neural Cellular Automata John Kalkhof et.al. 2401.06291 null
2024-01-11 Demystifying Variational Diffusion Models Fabio De Sousa Ribeiro et.al. 2401.06281 null
2024-01-11 Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications Yuwen Xiong et.al. 2401.06197 link
2024-01-11 TriNeRFLet: A Wavelet Based Multiscale Triplane NeRF Representation Rajaei Khatib et.al. 2401.06191 null
2024-01-11 E $^{2}$ GAN: Efficient Training of Efficient GANs for Image-to-Image Translation Yifan Gong et.al. 2401.06127 null
2024-01-11 DiffDA: a diffusion model for weather-scale data assimilation Langwen Huang et.al. 2401.05932 link
2024-01-11 Efficient Image Deblurring Networks based on Diffusion Models Kang Chen et.al. 2401.05907 link
2024-01-11 HiCAST: Highly Customized Arbitrary Style Transfer with Adapter Enhanced Diffusion Models Hanzhang Wang et.al. 2401.05870 null
2024-01-11 EraseDiff: Erasing Data Influence in Diffusion Models Jing Wu et.al. 2401.05779 link
2024-01-10 Diffusion Priors for Dynamic View Synthesis from Monocular Videos Chaoyang Wang et.al. 2401.05583 null
2024-01-10 From Pampas to Pixels: Fine-Tuning Diffusion Models for Gaúcho Heritage Marcellus Amadeus et.al. 2401.05520 null
2024-01-10 InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes Mohamad Shahbazi et.al. 2401.05335 null
2024-01-10 Score Distillation Sampling with Learned Manifold Corrective Thiemo Alldieck et.al. 2401.05293 null
2024-01-10 PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models Junsong Chen et.al. 2401.05252 link
2024-01-10 Derm-T2IM: Harnessing Synthetic Skin Lesion Data via Stable Diffusion Models for Enhanced Skin Disease Classification using ViT and CNN Muhammad Ali Farooq et.al. 2401.05159 null
2024-01-10 CrossDiff: Exploring Self-Supervised Representation of Pansharpening via Cross-Predictive Diffusion Model Yinghui Xing et.al. 2401.05153 null
2024-01-10 SwiMDiff: Scene-wide Matching Contrastive Learning with Diffusion Constraint for Remote Sensing Image Jiayuan Tian et.al. 2401.05093 null
2024-01-10 A novel bond-based nonlocal diffusion model with matrix-valued coefficients in non-divergence form and its collocation discretization Lili Ju et.al. 2401.04973 null
2024-01-09 Transmission-eigenchannel velocity and diffusion Azriel Z. Genack et.al. 2401.04818 null
2024-01-09 DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation Junming Chen et.al. 2401.04747 null
2024-01-09 Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation Xiyi Chen et.al. 2401.04728 link
2024-01-09 Efficient estimation for ergodic diffusion processes sampled at high frequency Michael Sørensen et.al. 2401.04689 null
2024-01-09 EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models Jingyuan Yang et.al. 2401.04608 null
2024-01-09 Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models Xuewen Liu et.al. 2401.04585 link
2024-01-09 MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation Weimin Wang et.al. 2401.04468 null
2024-01-09 D3AD: Dynamic Denoising Diffusion Probabilistic Model for Anomaly Detection Justin Tebbe et.al. 2401.04463 link
2024-01-09 SonicVisionLM: Playing Sound with Vision Language Models Zhifeng Xie et.al. 2401.04394 null
2024-01-09 Representative Feature Extraction During Diffusion Process for Sketch Extraction with One Example Kwan Yun et.al. 2401.04362 null
2024-01-09 Memory-Efficient Personalization using Quantized Diffusion Model Hyogon Ryu et.al. 2401.04339 null
2024-01-08 FADI-AEC: Fast Score Based Diffusion Model Guided by Far-end Signal for Acoustic Echo Cancellation Yang Liu et.al. 2401.04283 null
2024-01-08 Robust Image Watermarking using Stable Diffusion Lijun Zhang et.al. 2401.04247 link
2024-01-08 scDiffusion: conditional generation of high-quality single-cell data using diffusion model Erpai Luo et.al. 2401.03968 link
2024-01-08 D3PRefiner: A Diffusion-based Denoise Method for 3D Human Pose Refinement Danqi Yan et.al. 2401.03914 null
2024-01-08 DDM-Lag : A Diffusion-based Decision-making Model for Autonomous Vehicles with Lagrangian Safety Enhancement Jiaqi Liu et.al. 2401.03629 null
2024-01-07 ROIC-DM: Robust Text Inference and Classification via Diffusion Model Shilong Yuan et.al. 2401.03514 null
2024-01-07 Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness Sicheng Yang et.al. 2401.03476 null
2024-01-07 Deep Learning-based Image and Video Inpainting: A Survey Weize Quan et.al. 2401.03395 null
2024-01-06 Reflected Schrödinger Bridge for Constrained Generative Modeling Wei Deng et.al. 2401.03228 null
2024-01-06 MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and Beyond Yupei Lin et.al. 2401.03221 null
2024-01-06 Fair Sampling in Diffusion Models through Switching Mechanism Yujin Choi et.al. 2401.03140 link
2024-01-05 Latte: Latent Diffusion Transformer for Video Generation Xin Ma et.al. 2401.03048 link
2024-01-05 The Rise of Diffusion Models in Time-Series Forecasting Caspar Meijer et.al. 2401.03006 link
2024-01-08 Uncovering the human motion pattern: Pattern Memory-based Diffusion Model for Trajectory Prediction Yuxin Yang et.al. 2401.02916 null
2024-01-05 Plug-in Diffusion Model for Sequential Recommendation Haokai Ma et.al. 2401.02913 link
2024-01-05 Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors Top Piriyakulkij et.al. 2401.02739 link
2024-01-05 Geometric-Facilitated Denoising Diffusion Model for 3D Molecule Generation Can Xu et.al. 2401.02683 link
2024-01-04 Comprehensive Exploration of Synthetic Data Generation: A Survey André Bauer et.al. 2401.02524 null
2024-01-04 VASE: Object-Centric Appearance and Shape Manipulation of Real Videos Elia Peruzzo et.al. 2401.02473 null
2024-01-04 Bring Metric Functions into Diffusion Models Jie An et.al. 2401.02414 null
2024-01-06 GUESS:GradUally Enriching SyntheSis for Text-Driven Human Motion Generation Xuehao Gao et.al. 2401.02142 link
2024-01-04 Preserving Image Properties Through Initializations in Diffusion Models Jeffrey Zhang et.al. 2401.02097 null
2024-01-04 Energy based diffusion generator for efficient sampling of Boltzmann distributions Yan Wang et.al. 2401.02080 null
2024-01-04 DiffusionEdge: Diffusion Probabilistic Model for Crisp Edge Detection Yunfan Ye et.al. 2401.02032 link
2024-01-04 Improving Diffusion-Based Image Synthesis with Context Prediction Ling Yang et.al. 2401.02015 null
2024-01-03 Instruct-Imagen: Image Generation with Multi-modal Instruction Hexiang Hu et.al. 2401.01952 null
2024-01-03 Can We Generate Realistic Hands Only Using Convolution? Mehran Hosseini et.al. 2401.01951 null
2024-01-03 Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions David Junhao Zhang et.al. 2401.01827 link
2024-01-03 DiffYOLO: Object Detection for Anti-Noise via YOLO and Diffusion Models Yichen Liu et.al. 2401.01659 null
2024-01-03 SIGNeRF: Scene Integrated Generation for Neural Radiance Fields Jan-Niklas Dihlmann et.al. 2401.01647 null
2024-01-03 S $^{2}$ -DMs:Skip-Step Diffusion Models Yixuan Wang et.al. 2401.01520 link
2024-01-02 ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and Text Dingkun Yan et.al. 2401.01456 link
2024-01-02 VALD-MD: Visual Attribution via Latent Diffusion for Medical Diagnostics Ammar A. Siddiqui et.al. 2401.01414 null
2024-01-01 DiffAugment: Diffusion based Long-Tailed Visual Relationship Recognition Parul Gupta et.al. 2401.01387 null
2024-01-02 VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM Fuchen Long et.al. 2401.01256 link
2024-01-02 Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation Renshuai Liu et.al. 2401.01207 null
2024-01-02 A comparative study of resistivity models for simulations of magnetic reconnection in the solar atmosphere. II. Plasmoid formation Øystein Håvard Færder et.al. 2401.01177 null
2024-01-02 Joint Generative Modeling of Scene Graphs and Images via Diffusion Models Bicheng Xu et.al. 2401.01130 null
2024-01-02 Robust single-particle cryo-EM image denoising and restoration Jing Zhang et.al. 2401.01097 null
2024-01-02 Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation Jinlong Xue et.al. 2401.01044 link
2024-01-01 DiffMorph: Text-less Image Morphing with Diffusion Models Shounak Chatterjee et.al. 2401.00739 null
2024-01-01 Diffusion Models, Image Super-Resolution And Everything: A Survey Brian B. Moser et.al. 2401.00736 null
2024-01-02 GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields Xiao Pan et.al. 2401.00616 null
2024-01-03 Diff-PCR: Diffusion-Based Correspondence Searching in Doubly Stochastic Matrix Space for Point Cloud Registration Qianliang Wu et.al. 2401.00436 null
2023-12-31 SynCDR : Training Cross Domain Retrieval Models with Synthetic Data Samarth Mishra et.al. 2401.00420 link
2023-12-31 Controllable Safety-Critical Closed-loop Traffic Simulation via Guided Diffusion Wei-Jer Chang et.al. 2401.00391 null
2023-12-30 Probing the Limits and Capabilities of Diffusion Models for the Anatomic Editing of Digital Twins Karim Kadry et.al. 2401.00247 null
2023-12-28 iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views Chin-Hsuan Wu et.al. 2312.17250 link
2023-12-28 Personalized Restoration via Dual-Pivot Tuning Pradyumna Chari et.al. 2312.17234 null
2023-12-28 4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency Yuyang Yin et.al. 2312.17225 null
2023-12-28 Restoration by Generation with Constrained Priors Zheng Ding et.al. 2312.17161 null
2023-12-28 DiffKG: Knowledge Graph Diffusion Model for Recommendation Yangqin Jiang et.al. 2312.16890 link
2023-12-28 DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaption by Combining 3D GANs and Diffusion Priors Biwen Lei et.al. 2312.16837 null
2023-12-27 I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models Xun Guo et.al. 2312.16693 link
2023-12-27 Forgery-aware Adaptive Transformer for Generalizable Synthetic Image Detection Huan Liu et.al. 2312.16649 link
2023-12-27 Image Restoration by Denoising Diffusion Models with Iteratively Preconditioned Guidance Tomer Garber et.al. 2312.16519 link
2023-12-27 PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion Guansong Lu et.al. 2312.16486 null
2023-12-27 SVGDreamer: Text Guided SVG Generation with Diffusion Model Ximing Xing et.al. 2312.16476 link
2023-12-27 Natural Adversarial Patch Generation Method Based on Latent Diffusion Model Xianyi Chen et.al. 2312.16401 null
2023-12-26 One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications Mengyao Lyu et.al. 2312.16145 null
2023-12-26 Compositional Search of Stable Crystalline Structures in Multi-Component Alloys Using Generative Diffusion Models Grzegorz Kaszuba et.al. 2312.16073 null
2023-12-26 HarmonyView: Harmonizing Consistency and Diversity in One-Image-to-3D Sangmin Woo et.al. 2312.15980 link
2023-12-26 Semantic Guidance Tuning for Text-To-Image Diffusion Models Hyun Kang et.al. 2312.15964 link
2023-12-26 Implied volatility (also) is path-dependent Hervé Andrès et.al. 2312.15950 link
2023-12-26 EnchantDance: Unveiling the Potential of Music-Driven Dance Movement Bo Han et.al. 2312.15946 link
2023-12-26 Generating and Reweighting Dense Contrastive Patterns for Unsupervised Anomaly Detection Songmin Dai et.al. 2312.15911 null
2023-12-26 Cross Initialization for Personalized Text-to-Image Generation Lianyu Pang et.al. 2312.15905 link
2023-12-21 Diffusion Reward: Learning Rewards via Conditional Video Diffusion Tao Huang et.al. 2312.14134 link
2023-12-21 Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation Philipp Schröppel et.al. 2312.14124 link
2023-12-21 HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models Hayk Manukyan et.al. 2312.14091 link
2023-12-21 Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning Desai Xie et.al. 2312.13980 null
2023-12-21 Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models Xianfang Zeng et.al. 2312.13913 link
2023-12-21 Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models Huan Ling et.al. 2312.13763 null
2023-12-21 Free-Editor: Zero-shot Text-driven 3D Scene Editing Nazmul Karim et.al. 2312.13663 link
2023-12-21 Diff-Oracle: Diffusion Model for Oracle Character Generation with Controllable Styles and Contents Jing Li et.al. 2312.13631 null
2023-12-21 Navigating the Structured What-If Spaces: Counterfactual Generation via Structured Diffusion Nishtha Madaan et.al. 2312.13616 null
2023-12-21 Front stability of infinitely steep travelling waves in population biology Matthew J Simpson et.al. 2312.13601 link
2023-12-20 Unlocking Pre-trained Image Backbones for Semantic Image Synthesis Tariq Berrada et.al. 2312.13314 null
2023-12-21 Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting Junwu Zhang et.al. 2312.13271 link
2023-12-20 Conditional Image Generation with Pretrained Generative Model Rajesh Shrestha et.al. 2312.13253 null
2023-12-20 Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model Saurabh Saxena et.al. 2312.13252 null
2023-12-20 Diffusion Models With Learned Adaptive Noise Subham Sekhar Sahoo et.al. 2312.13236 link
2023-12-21 DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis Yuming Gu et.al. 2312.13016 link
2023-12-20 RadEdit: stress-testing biomedical vision models via diffusion image editing Fernando Pérez-García et.al. 2312.12865 null
2023-12-20 ReCo-Diff: Explore Retinex-Based Condition Strategy in Diffusion Model for Low-Light Image Enhancement Yuhui Wu et.al. 2312.12826 null
2023-12-20 All but One: Surgical Concept Erasing with Model Preservation in Text-to-Image Diffusion Models Seunghoo Hong et.al. 2312.12807 null
2023-12-21 AMD:Anatomical Motion Diffusion with Interpretable Motion Decomposition and Fusion Beibei Jing et.al. 2312.12763 null
2023-12-20 How Good Are Deep Generative Models for Solving Inverse Problems? Shichong Peng et.al. 2312.12691 null
2023-12-19 Surf-CDM: Score-Based Surface Cold-Diffusion Model For Medical Image Segmentation Fahim Ahmed Zaman et.al. 2312.12649 null
2023-12-19 Fixed-point Inversion for Text-to-image diffusion models Barak Meiri et.al. 2312.12540 link
2023-12-19 StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation Akio Kodaira et.al. 2312.12491 link
2023-12-19 InstructVideo: Instructing Video Diffusion Models with Human Feedback Hangjie Yuan et.al. 2312.12490 null
2023-12-19 Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models Angela Castillo et.al. 2312.12487 null
2023-12-19 On Inference Stability for Diffusion Models Viet Nguyen et.al. 2312.12431 link
2023-12-19 Scene-Conditional 3D Object Stylization and Composition Jinghao Zhou et.al. 2312.12419 null
2023-12-19 Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models Shweta Mahajan et.al. 2312.12416 null
2023-12-19 Travelling pulses on three spatial scales in a Klausmeier-type vegetation-autotoxicity model Paul Carter et.al. 2312.12277 null
2023-12-19 Intrinsic Image Diffusion for Single-view Material Estimation Peter Kocsis et.al. 2312.12274 link
2023-12-18 A novel diffusion recommendation algorithm based on multi-scale cnn and residual lstm Yong Niu et.al. 2312.10885 null
2023-12-17 Your Student is Better Than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models Nikita Starodubcev et.al. 2312.10835 link
2023-12-17 CogCartoon: Towards Practical Story Visualization Zhongyang Zhu et.al. 2312.10718 null
2023-12-17 VidToMe: Video Token Merging for Zero-Shot Video Editing Xirui Li et.al. 2312.10656 link
2023-12-16 VecFusion: Vector Font Generation with Diffusion Vikas Thamizharasan et.al. 2312.10540 null
2023-12-16 A Unified Filter Method for Jointly Estimating State and Parameters of Stochastic Dynamical Systems via the Ensemble Score Filter Feng Bao et.al. 2312.10503 null
2023-12-16 Continuous Diffusion for Mixed-Type Tabular Data Markus Mueller et.al. 2312.10431 link
2023-12-16 Lecture Notes in Probabilistic Diffusion Models Inga Strümke et.al. 2312.10393 null
2023-12-16 Image Restoration Through Generalized Ornstein-Uhlenbeck Bridge Conghan Yue et.al. 2312.10299 link
2023-12-15 Two simple criterion to prove the existence of patterns in reaction-diffusion models of two components Francisco J. Vielma-Leal et.al. 2312.10231 null
2023-12-15 Tell Me What You See: Text-Guided Real-World Image Denoising Erez Yosef et.al. 2312.10191 null
2023-12-15 Improving new physics searches with diffusion models for event observables and jet constituents Debajyoti Sengupta et.al. 2312.10130 null
2023-12-15 MVHuman: Tailoring 2D Diffusion with Multi-view Sampling For Realistic 3D Human Generation Suyi Jiang et.al. 2312.10120 null
2023-12-15 Plasticine3D: Non-rigid 3D editting with text guidance Yige Chen et.al. 2312.10111 null
2023-12-15 Latent Diffusion Models with Image-Derived Annotations for Enhanced AI-Assisted Cancer Diagnosis in Histopathology Pedro Osorio et.al. 2312.09792 null
2023-12-15 DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models Yifeng Ma et.al. 2312.09767 link
2023-12-15 PPFM: Image denoising in photon-counting CT using single-step posterior sampling Poisson flow generative models Dennis Hein et.al. 2312.09754 link
2023-12-15 Positivity and global existence for nonlocal advection-diffusion models of interacting populations Valeria Giunta et.al. 2312.09692 null
2023-12-15 Exploring the Feasibility of Generating Realistic 3D Models of Endangered Species Using DreamGaussian: An Analysis of Elevation Angle’s Impact on Model Generation Selcuk Anil Karatopak et.al. 2312.09682 null
2023-12-15 Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models Senmao Li et.al. 2312.09608 link
2023-12-14 LIME: Localized Image Editing via Attention Regularization in Diffusion Models Enis Simsar et.al. 2312.09256 null
2023-12-14 FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Injection Hongsuk Choi et.al. 2312.09252 null
2023-12-14 Single Mesh Diffusion Models with Field Latents for Texture Generation Thomas W. Mitchel et.al. 2312.09250 null
2023-12-14 A framework for conditional diffusion modelling with applications in motif scaffolding for protein design Kieran Didi et.al. 2312.09236 null
2023-12-14 Mosaic-SDF for 3D Generative Models Lior Yariv et.al. 2312.09222 null
2023-12-14 Fast Sampling via De-randomization for Discrete Diffusion Models Zixiang Chen et.al. 2312.09193 null
2023-12-14 Improving Efficiency of Diffusion Models via Multi-Stage Framework and Tailored Multi-Decoder Architectures Huijie Zhang et.al. 2312.09181 link
2023-12-14 DiffusionLight: Light Probes for Free by Painting a Chrome Ball Pakkapon Phongthawee et.al. 2312.09168 link
2023-12-14 Triplane Meets Gaussian Splatting: Fast and Generalizable Single-View 3D Reconstruction with Transformers Zi-Xin Zou et.al. 2312.09147 null
2023-12-14 VideoLCM: Video Latent Consistency Model Xiang Wang et.al. 2312.09109 null
2023-12-14 PI3D: Efficient Text-to-3D Generation with Pseudo-Image Diffusion Ying-Tian Liu et.al. 2312.09069 null
2023-12-14 Brain Diffuser with Hierarchical Transformer for MCI Causality Analysis Qiankun Zuo et.al. 2312.09022 null
2023-12-14 OMG: Towards Open-vocabulary Motion Generation via Mixture of Controllers Han Liang et.al. 2312.08985 null
2023-12-14 Motion Flow Matching for Human Motion Synthesis and Editing Vincent Tao Hu et.al. 2312.08895 null
2023-12-14 VaLID: Variable-Length Input Diffusion for Novel View Synthesis Shijie Li et.al. 2312.08892 null
2023-12-14 Diffusion-C: Unveiling the Generative Challenges of Diffusion Models through Corrupted Data Keywoong Bae et.al. 2312.08843 null
2023-12-14 Speeding up Photoacoustic Imaging using Diffusion Models Irem Loc et.al. 2312.08834 link
2023-12-14 Guided Diffusion from Self-Supervised Diffusion Features Vincent Tao Hu et.al. 2312.08825 null
2023-12-14 Reconstruction of Sound Field through Diffusion Models Federico Miotello et.al. 2312.08821 null
2023-12-14 Local Conditional Controlling for Text-to-Image Diffusion Models Yibo Zhao et.al. 2312.08768 link
2023-12-13 PhenDiff: Revealing Invisible Phenotypes with Conditional Diffusion Models Anis Bourou et.al. 2312.08290 link
2023-12-13 Black-box Membership Inference Attacks against Fine-tuned Diffusion Models Yan Pang et.al. 2312.08207 link
2023-12-13 Concept-centric Personalization with Large-scale Diffusion Priors Pu Cao et.al. 2312.08195 link
2023-12-13 $ρ$ -Diffusion: A diffusion-based density estimation framework for computational physics Maxwell X. Cai et.al. 2312.08153 link
2023-12-13 Clockwork Diffusion: Efficient Generation With Model-Step Distillation Amirhossein Habibian et.al. 2312.08128 link
2023-12-13 Knowledge-Aware Artifact Image Synthesis with LLM-Enhanced Prompting and Multi-Source Supervision Shengguang Wu et.al. 2312.08056 null
2023-12-13 Compositional Inversion for Stable Diffusion Models Xu-Lu Zhang et.al. 2312.08048 link
2023-12-13 AdapEdit: Spatio-Temporal Guided Adaptive Editing Algorithm for Text-Based Continuity-Sensitive Image Editing Zhiyuan Ma et.al. 2312.08019 link
2023-12-13 Time Series Diffusion Method: A Denoising Diffusion Probabilistic Model for Vibration Signal Generation Haiming Yi et.al. 2312.07981 null
2023-12-13 LMD: Faster Image Reconstruction with Latent Masking Diffusion Zhiyuan Ma et.al. 2312.07971 link
2023-12-13 Semantic-aware Data Augmentation for Text-to-image Synthesis Zhaorui Tan et.al. 2312.07951 link
2023-12-13 BOTH2Hands: Inferring 3D Hands from Both Text Prompts and Body Dynamics Wenqian Zhang et.al. 2312.07937 link
2023-12-13 SimAC: A Simple Anti-Customization Method against Text-to-Image Synthesis of Diffusion Models Feifei Wang et.al. 2312.07865 link
2023-12-13 Diffusion Models Enable Zero-Shot Pose Estimation for Lower-Limb Prosthetic Users Tianxun Zhou et.al. 2312.07854 null
2023-12-13 Noise in the reverse process improves the approximation capabilities of diffusion models Karthik Elamvazhuthi et.al. 2312.07851 null
2023-12-13 Stable Rivers: A Case Study in the Application of Text-to-Image Generative Models for Earth Sciences C Kupferschmidt et.al. 2312.07833 null
2023-12-12 Brain-optimized inference improves reconstructions of fMRI brain activity Reese Kneeland et.al. 2312.07705 link
2023-12-12 FreeInit: Bridging Initialization Gap in Video Diffusion Models Tianxing Wu et.al. 2312.07537 link
2023-12-12 FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition Sicheng Mo et.al. 2312.07536 null
2023-12-12 Cosmological Field Emulation and Parameter Inference with Diffusion Models Nayantara Mudur et.al. 2312.07534 null
2023-12-11 CAD: Photorealistic 3D Generation via Adversarial Distillation Ziyu Wan et.al. 2312.06663 null
2023-12-11 Photorealistic Video Generation with Diffusion Models Agrim Gupta et.al. 2312.06662 null
2023-12-11 UpFusion: Novel View Diffusion from Unposed Sparse View Observations Bharath Raj Nagoor Kani et.al. 2312.06661 null
2023-12-11 Sherpa3D: Boosting High-Fidelity Text-to-3D Generation via Coarse 3D Prior Fangfu Liu et.al. 2312.06655 link
2023-12-11 Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution Shangchen Zhou et.al. 2312.06640 null
2023-12-11 DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection Haoyang He et.al. 2312.06607 link
2023-12-11 ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models Denis Zavadski et.al. 2312.06573 link
2023-12-11 HOI-Diff: Text-Driven Synthesis of 3D Human-Object Interactions using Diffusion Models Xiaogang Peng et.al. 2312.06553 null
2023-12-11 STDiff: Spatio-temporal Diffusion for Continuous Stochastic Video Prediction Xi Ye et.al. 2312.06486 link
2023-12-11 Semantic Image Synthesis for Abdominal CT Yan Zhuang et.al. 2312.06453 null
2023-12-11 DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior Tianyu Huang et.al. 2312.06439 link
2023-12-11 DiT-Head: High-Resolution Talking Head Synthesis using Diffusion Transformers Aaron Mir et.al. 2312.06400 null
2023-12-11 PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization Xu Peng et.al. 2312.06354 null
2023-12-11 DiffAIL: Diffusion Adversarial Imitation Learning Bingzheng Wang et.al. 2312.06348 link
2023-12-11 Compensation Sampling for Improved Convergence in Diffusion Models Hui Lu et.al. 2312.06285 link
2023-12-11 UIEDP:Underwater Image Enhancement with Diffusion Prior Dazhao Du et.al. 2312.06240 link
2023-12-11 The Journey, Not the Destination: How Data Guides Diffusion Models Kristian Georgiev et.al. 2312.06205 link
2023-12-11 Offloading and Quality Control for AI Generated Content Services in Edge Computing Networks Yitong Wang et.al. 2312.06203 null
2023-12-11 Optimized View and Geometry Distillation from Multi-view Diffuser Youjia Zhang et.al. 2312.06198 link
2023-12-11 SP-DiffDose: A Conditional Diffusion Model for Radiation Dose Prediction Based on Multi-Scale Fusion of Anatomical Structures, Guided by SwinTransformer and Projector Linjie Fu et.al. 2312.06187 null
2023-12-07 Gen2Det: Generate to Detect Saksham Suri et.al. 2312.04566 null
2023-12-07 NeRFiller: Completing Scenes via Generative 3D Inpainting Ethan Weber et.al. 2312.04560 null
2023-12-07 PrimDiffusion: Volumetric Primitives Diffusion for 3D Human Generation Zhaoxi Chen et.al. 2312.04559 link
2023-12-07 GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation Shoufa Chen et.al. 2312.04557 null
2023-12-07 Generating Illustrated Instructions Sachit Menon et.al. 2312.04552 link
2023-12-07 PlayFusion: Skill Acquisition via Diffusion from Language-Annotated Play Lili Chen et.al. 2312.04549 null
2023-12-07 Diffusion Reflectance Map: Single-Image Stochastic Inverse Rendering of Illumination and Reflectance Yuto Enyo et.al. 2312.04529 null
2023-12-07 RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models Ozgur Kara et.al. 2312.04524 link
2023-12-07 Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation Zhiwu Qing et.al. 2312.04483 link
2023-12-07 Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion Kiran Chhatre et.al. 2312.04466 link
2023-12-07 FitDiff: Robust monocular 3D facial shape and reflectance estimation using Diffusion Models Stathis Galanakis et.al. 2312.04465 null
2023-12-07 DreamVideo: Composing Your Dream Videos with Customized Subject and Motion Yujie Wei et.al. 2312.04433 link
2023-12-07 Approximate Caching for Efficiently Serving Diffusion Models Shubham Agarwal et.al. 2312.04429 null
2023-12-07 Cascade-Zero123: One Image to Highly Consistent 3D with Self-Prompted Nearby Views Yabo Chen et.al. 2312.04424 null
2023-12-07 Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models Jiayi Guo et.al. 2312.04410 link
2023-12-07 Adversarial Denoising Diffusion Model for Unsupervised Anomaly Detection Jongmin Yu et.al. 2312.04382 null
2023-12-07 Generating Multiphase Fluid Configurations in Fractures using Diffusion Models Jaehong Chung et.al. 2312.04375 null
2023-12-07 Investigating the Design Space of Diffusion Models for Speech Enhancement Philippe Gonzalez et.al. 2312.04370 link
2023-12-07 Improved Efficient Two-Stage Denoising Diffusion Power System Measurement Recovery Against False Data Injection Attacks and Data Losses Jianhua Pei et.al. 2312.04346 null
2023-12-07 Multi-View Unsupervised Image Generation with Cross Attention Guidance Llukman Cerkezi et.al. 2312.04337 null
2023-12-06 Self-conditioned Image Generation via Generating Representations Tianhong Li et.al. 2312.03701 link
2023-12-06 Memory Triggers: Unveiling Memorization in Text-To-Image Generative Models through Word-Level Duplication Ali Naseh et.al. 2312.03692 null
2023-12-06 WarpDiffusion: Efficient Diffusion Model for High-Fidelity Virtual Try-on xujie zhang et.al. 2312.03667 null
2023-12-06 TokenCompose: Grounding Diffusion with Token-level Supervision Zirui Wang et.al. 2312.03626 link
2023-12-06 DreamComposer: Controllable 3D Object Generation via Multi-View Conditions Yunhan Yang et.al. 2312.03611 link
2023-12-06 DiffusionSat: A Generative Foundation Model for Satellite Imagery Samar Khanna et.al. 2312.03606 null
2023-12-06 MMM: Generative Masked Motion Model Ekkasit Pinyoanuntapong et.al. 2312.03596 link
2023-12-06 Personalized Face Inpainting with Diffusion Models by Parallel Visual Attention Jianjin Xu et.al. 2312.03556 null
2023-12-06 FoodFusion: A Latent Diffusion Model for Realistic Food Image Generation Olivia Markham et.al. 2312.03540 null
2023-12-06 FRDiff: Feature Reuse for Exquisite Zero-shot Acceleration of Diffusion Models Junhyuk So et.al. 2312.03517 null
2023-12-06 Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis Zehua Chen et.al. 2312.03491 null
2023-12-06 F3-Pruning: A Training-Free and Generalized Pruning Strategy towards Faster and Finer Text-to-Video Synthesis Sitong Su et.al. 2312.03459 null
2023-12-06 Generalized Contrastive Divergence: Joint Training of Energy-Based Model and Diffusion Model through Inverse Reinforcement Learning Sangwoong Yoon et.al. 2312.03397 null
2023-12-06 Diffused Task-Agnostic Milestone Planner Mineui Hong et.al. 2312.03395 null
2023-12-06 DiffPMAE: Diffusion Masked Autoencoders for Point Cloud Reconstruction Yanlong Li et.al. 2312.03298 link
2023-12-06 Cache Me if You Can: Accelerating Diffusion Models through Block Caching Felix Wimbauer et.al. 2312.03209 null
2023-12-05 ViscoNet: Bridging and Harmonizing Visual and Textual Conditioning for ControlNet Soon Yau Cheong et.al. 2312.03154 link
2023-12-05 DiffusionPCR: Diffusion Models for Robust Multi-Step Point Cloud Registration Zhi Chen et.al. 2312.03053 link
2023-12-05 Alchemist: Parametric Control of Material Properties with Diffusion Models Prafull Sharma et.al. 2312.02970 null
2023-12-05 AmbiGen: Generating Ambigrams from Pre-trained Diffusion Model Boheng Zhao et.al. 2312.02967 null
2023-12-04 Latent Feature-Guided Diffusion Models for Shadow Removal Kangfu Mei et.al. 2312.02156 null
2023-12-04 Readout Guidance: Learning Control from Diffusion Features Grace Luo et.al. 2312.02150 null
2023-12-04 Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation Bingxin Ke et.al. 2312.02145 link
2023-12-04 DiffiT: Diffusion Vision Transformers for Image Generation Ali Hatamizadeh et.al. 2312.02139 link
2023-12-04 Stochastic Optimal Control Matching Carles Domingo-Enrich et.al. 2312.02027 link
2023-12-04 UniGS: Unified Representation for Image Generation and Segmentation Lu Qi et.al. 2312.01985 link
2023-12-04 Generalization by Adaptation: Diffusion-Based Domain Extension for Domain-Generalized Semantic Segmentation Joshua Niemeijer et.al. 2312.01850 link
2023-12-04 Collaborative Neural Painting Nicola Dall’Asen et.al. 2312.01800 null
2023-12-04 Open-DDVM: A Reproduction and Extension of Diffusion Model for Optical Flow Estimation Qiaole Dong et.al. 2312.01746 link
2023-12-04 Fully Spiking Denoising Diffusion Implicit Models Ryo Watanabe et.al. 2312.01742 link
2023-12-04 StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On Jeongho Kim et.al. 2312.01725 link
2023-12-04 ResEnsemble-DDPM: Residual Denoising Diffusion Probabilistic Models for Ensemble Learning Shi Zhenning et.al. 2312.01682 null
2023-12-03 CalliPaint: Chinese Calligraphy Inpainting with Diffusion Model Qisheng Liao et.al. 2312.01536 null
2023-12-03 CityGen: Infinite and Controllable 3D City Layout Generation Jie Deng et.al. 2312.01508 null
2023-12-03 Existence of finite time blow-up in Keller-Segel system Federico Buseghin et.al. 2312.01475 null
2023-12-03 Distilling Functional Rearrangement Priors from Large Models Yiming Zeng et.al. 2312.01474 null
2023-12-03 Diffusion Posterior Sampling for Nonlinear CT Reconstruction Shudong Li et.al. 2312.01464 null
2023-12-03 Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models Shengqu Cai et.al. 2312.01409 null
2023-12-03 Improving In-Context Learning in Diffusion Models with Visual Context-Modulated Prompts Tianqi Chen et.al. 2312.01408 null
2023-12-03 ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models Jeong-gi Kwak et.al. 2312.01305 null
2023-11-30 VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models Zhen Xing et.al. 2311.18837 null
2023-11-30 ART $\boldsymbol{\cdot}$ V: Auto-Regressive Text-to-Video Generation with Diffusion Models Wenming Weng et.al. 2311.18834 null
2023-11-30 Exploiting Diffusion Prior for Generalizable Pixel-Level Semantic Prediction Hsin-Ying Lee et.al. 2311.18832 link
2023-11-30 MotionEditor: Editing Video Motion via Content-Aware Diffusion Shuyuan Tu et.al. 2311.18830 link
2023-11-30 MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation Yanhui Wang et.al. 2311.18829 null
2023-11-30 One-step Diffusion with Distribution Matching Distillation Tianwei Yin et.al. 2311.18828 null
2023-11-30 ElasticDiffusion: Training-free Arbitrary Size Image Generation Moayed Haji-Ali et.al. 2311.18822 link
2023-11-30 Continual Diffusion with STAMINA: STack-And-Mask INcremental Adapters James Seale Smith et.al. 2311.18763 null
2023-11-30 Detailed Human-Centric Text Description-Driven Large Scene Synthesis Gwanghyun Kim et.al. 2311.18654 null
2023-11-30 Contrastive Denoising Score for Text-guided Latent Diffusion Image Editing Hyelin Nam et.al. 2311.18608 null
2023-11-30 DifAugGAN: A Practical Diffusion-style Data Augmentation for GAN-based Single Image Super-resolution Axi Niu et.al. 2311.18508 null
2023-11-30 Layered Rendering Diffusion Model for Zero-Shot Guided Image Synthesis Zipeng Qi et.al. 2311.18435 null
2023-11-30 CAT-DM: Controllable Accelerated Virtual Try-on with Diffusion Model Jianhao Zeng et.al. 2311.18405 link
2023-11-30 Age Effects on Decision-Making, Drift Diffusion Model Zahra Kavian et.al. 2311.18376 null
2023-11-30 Prompt-Based Exemplar Super-Compression and Regeneration for Class-Incremental Learning Ruxiao Duan et.al. 2311.18266 link
2023-11-30 Diffusion Models Without Attention Jing Nathan Yan et.al. 2311.18257 null
2023-11-30 SMaRt: Improving GANs with Score Matching Regularity Mengfei Xia et.al. 2311.18208 null
2023-11-30 HiPA: Enabling One-Step Text-to-Image Diffusion Models via High-Frequency-Promoting Adaptation Yifan Zhang et.al. 2311.18158 null
2023-11-29 Zooming Out on Zooming In: Advancing Super-Resolution for Remote Sensing Piper Wolters et.al. 2311.18082 link
2023-11-29 DiffGEPCI: 3D MRI Synthesis from mGRE Signals using 2.5D Diffusion Model Yuyang Hu et.al. 2311.18073 null
2023-11-29 Do text-free diffusion models learn discriminative visual representations? Soumik Mukhopadhyay et.al. 2311.17921 link
2023-11-29 Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models Daniel Geng et.al. 2311.17919 null
2023-11-29 AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text Jianfeng Zhang et.al. 2311.17917 null
2023-11-29 CG3D: Compositional Generation for Text-to-3D via Gaussian Splatting Alexander Vilesov et.al. 2311.17907 null
2023-11-29 SODA: Bottleneck Diffusion Models for Representation Learning Drew A. Hudson et.al. 2311.17901 null
2023-11-29 Leveraging Graph Diffusion Models for Network Refinement Tasks Puja Trivedi et.al. 2311.17856 null
2023-11-29 SPiC-E : Structural Priors in 3D Diffusion Models using Cross Entity Attention Etai Sella et.al. 2311.17834 null
2023-11-29 Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers Chi-Pin Huang et.al. 2311.17717 link
2023-11-29 Fair Text-to-Image Diffusion via Fair Mapping Jia Li et.al. 2311.17695 null
2023-11-29 AnyLens: A Generative Diffusion Model with Any Rendering Lens Andrey Voynov et.al. 2311.17609 null
2023-11-29 Query-Relevant Images Jailbreak Large Multi-Modal Models Xin Liu et.al. 2311.17600 link
2023-11-29 Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning Liang Peng et.al. 2311.17536 link
2023-11-29 HiDiffusion: Unlocking High-Resolution Creativity and Efficiency in Low-Resolution Trained Diffusion Models Shen Zhang et.al. 2311.17528 null
2023-11-29 MMA-Diffusion: MultiModal Attack on Diffusion Models Yijun Yang et.al. 2311.17516 link
2023-11-29 When StyleGAN Meets Stable Diffusion: a $\mathscr{W}_+$ Adapter for Personalized Image Generation Xiaoming Li et.al. 2311.17461 link
2023-11-29 DifFlow3D: Toward Robust Uncertainty-Aware Scene Flow Estimation with Diffusion Model Jiuming Liu et.al. 2311.17456 link
2023-11-29 Wireless Network Digital Twin for 6G: Generative AI as A Key Enabler Zhenyu Tao et.al. 2311.17451 null
2023-11-29 VideoAssembler: Identity-Consistent Video Generation with Reference Entities using Diffusion Model Haoyu Zhao et.al. 2311.17338 link
2023-11-28 Self-Discovering Interpretable Diffusion Latent Directions for Responsible Text-to-Image Generation Hang Li et.al. 2311.17216 null
2023-11-28 A point cloud approach to generative modeling for galaxy surveys at the field level Carolina Cuesta-Lazaro et.al. 2311.17141 link
2023-11-27 Test-time Adaptation of Discriminative Models via Diffusion Generative Feedback Mihir Prabhudesai et.al. 2311.16102 null
2023-11-27 Self-correcting LLM-controlled Diffusion Models Tsung-Han Wu et.al. 2311.16090 link
2023-11-27 DiffSLVA: Harnessing Diffusion Models for Sign Language Video Anonymization Zhaoyang Xia et.al. 2311.16060 link
2023-11-27 Exploring Attribute Variations in Style-based GANs using Diffusion Models Rishubh Parihar et.al. 2311.16052 null
2023-11-27 GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions Jiemin Fang et.al. 2311.16037 null
2023-11-27 Closing the ODE-SDE gap in score-based diffusion models through the Fokker-Planck equation Teo Deveney et.al. 2311.15996 null
2023-11-27 DiffAnt: Diffusion Models for Action Anticipation Zeyun Zhong et.al. 2311.15991 null
2023-11-27 Direct2.5: Diverse Text-to-3D Generation via Multi-view 2.5D Diffusion Yuanxun Lu et.al. 2311.15980 null
2023-11-27 Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models Claudio Rota et.al. 2311.15908 link
2023-11-27 InterControl: Generate Human Motion Interactions by Controlling Every Joint Zhenzhi Wang et.al. 2311.15864 link
2023-11-27 SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion Hsuan-I Ho et.al. 2311.15855 link
2023-11-27 FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax Yu Lu et.al. 2311.15813 null
2023-11-27 Check, Locate, Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation Biao Gong et.al. 2311.15773 null
2023-11-27 One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls Minghui Hu et.al. 2311.15744 null
2023-11-27 SceneDM: Scene-level Multi-agent Trajectory Generation with Consistent Diffusion Models Zhiming Guo et.al. 2311.15736 null
2023-11-27 Regularization by Texts for Latent Diffusion Inverse Solvers Jeongsol Kim et.al. 2311.15658 link
2023-11-27 Enhancing Diffusion Models with Text-Encoder Reinforcement Learning Chaofeng Chen et.al. 2311.15657 link
2023-11-27 ET3D: Efficient Text-to-3D Generation via Multi-View Distillation Yiming Chen et.al. 2311.15561 null
2023-11-27 Instruct2Attack: Language-Guided Semantic Adversarial Attacks Jiang Liu et.al. 2311.15551 null
2023-11-27 Efficient Dataset Distillation via Minimax Diffusion Jianyang Gu et.al. 2311.15529 link
2023-11-22 WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space Katja Schwarz et.al. 2311.13570 null
2023-11-22 ADriver-I: A General World Model for Autonomous Driving Fan Jia et.al. 2311.13549 null
2023-11-22 DiffusionMat: Alpha Matting as Sequential Refinement Learning Yangyang Xu et.al. 2311.13535 null
2023-11-22 Accelerating Inference in Molecular Diffusion Models with Latent Representations of Protein Structure Ian Dunn et.al. 2311.13466 link
2023-11-22 Guided Flows for Generative Modeling and Decision Making Qinqing Zheng et.al. 2311.13443 null
2023-11-22 Recognition-Guided Diffusion Model for Scene Text Image Super-Resolution Yuxuan Zhou et.al. 2311.13317 null
2023-11-22 Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model Kai Yang et.al. 2311.13231 link
2023-11-22 Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models Mengyang Feng et.al. 2311.13141 link
2023-11-22 Toward Robust Imperceptible Perturbation against Unauthorized Text-to-image Diffusion-based Synthesis Yixin Liu et.al. 2311.13127 link
2023-11-22 On the Limitation of Diffusion Models for Synthesizing Training Datasets Shin’ya Yamaguchi et.al. 2311.13090 null
2023-11-22 FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline Vladimir Arkhipkin et.al. 2311.13073 link
2023-11-21 Diffusion Model Alignment Using Direct Preference Optimization Bram Wallace et.al. 2311.12908 null
2023-11-21 Text-Guided Texturing by Synchronized Multi-View Diffusion Yuxin Liu et.al. 2311.12891 link
2023-11-21 Fine-Grained Open Domain Image Animation with Motion Guidance Zuozhuo Dai et.al. 2311.12886 link
2023-11-21 GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning Jiaxi Lv et.al. 2311.12631 null
2023-11-21 Stable Diffusion For Aerial Object Detection Yanan Jian et.al. 2311.12345 null
2023-11-21 LoCo: Locally Constrained Training-Free Layout-to-Image Synthesis Peiang Zhao et.al. 2311.12342 null
2023-11-20 NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation Shachar Rosenman et.al. 2311.12229 link
2023-11-20 Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models Rohit Gandikota et.al. 2311.12092 link
2023-11-20 An Image is Worth Multiple Words: Multi-attribute Inversion for Constrained Text-to-Image Synthesis Aishwarya Agarwal et.al. 2311.11919 null
2023-11-20 Multiplicative noise removal based on a variable-order fractional diffusion model Yuhang Li et.al. 2311.11680 null
2023-11-20 Reti-Diff: Illumination Degradation Image Restoration with Retinex-based Latent Diffusion Model Chunming He et.al. 2311.11638 link
2023-11-20 Generating Realistic Counterfactuals for Retinal Fundus and OCT Images using Diffusion Models Indu Ilanchezian et.al. 2311.11629 link
2023-11-20 Deep Equilibrium Diffusion Restoration with Parallel Sampling Jiezhang Cao et.al. 2311.11600 link
2023-11-20 Advancing Urban Renewal: An Automated Approach to Generating Historical Arcade Facades with Stable Diffusion Models Zheyuan Kuang et.al. 2311.11590 null
2023-11-19 DiffSCI: Zero-Shot Snapshot Compressive Imaging via Iterative Spectral Diffusion Model Zhenghao Pan et.al. 2311.11417 link
2023-11-19 A Survey of Emerging Applications of Diffusion Probabilistic Models in MRI Yuheng Fan et.al. 2311.11383 null
2023-11-19 MoVideo: Motion-Aware Video Generation with Diffusion Models Jingyun Liang et.al. 2311.11325 null
2023-11-19 GaussianDiffusion: 3D Gaussian Splatting for Denoising Diffusion Probabilistic Models with Structured Noise Xinhai Li et.al. 2311.11221 null
2023-11-19 On the Noise Scheduling for Generating Plausible Designs with Diffusion Models Jiajie Fan et.al. 2311.11207 null
2023-11-18 Mitigating Exposure Bias in Discriminator Guided Diffusion Models Eleftherios Tsonis et.al. 2311.11164 null
2023-11-18 User-Centric Interactive AI for Distributed Diffusion Model-based AI-Generated Content Hongyang Du et.al. 2311.11094 null
2023-11-18 DSCom: A Data-Driven Self-Adaptive Community-Based Framework for Influence Maximization in Social Networks Yuxin Zuo et.al. 2311.11080 null
2023-11-18 Make Pixels Dance: High-Dynamic Video Generation Yan Zeng et.al. 2311.10982 null
2023-11-17 The Hidden Linear Structure in Score-Based Models and its Application Binxu Wang et.al. 2311.10892 null
2023-11-17 SDDPM: Speckle Denoising Diffusion Probabilistic Models Soumee Guha et.al. 2311.10868 null
2023-11-17 A Study on Altering the Latent Space of Pretrained Text to Speech Models for Improved Expressiveness Mathias Vogel et.al. 2311.10804 null
2023-11-17 SelfEval: Leveraging the discriminative nature of generative models for evaluation Sai Saketh Rambhatla et.al. 2311.10708 null
2023-11-17 Enhancing Object Coherence in Layout-to-Image Synthesis Yibin Wang et.al. 2311.10522 link
2023-11-16 The Chosen One: Consistent Characters in Text-to-Image Diffusion Models Omri Avrahami et.al. 2311.10093 null
2023-11-16 TransFusion – A Transparency-Based Diffusion Model for Anomaly Detection Matic Fučka et.al. 2311.09999 link
2023-11-16 DSR-Diff: Depth Map Super-Resolution with Diffusion Model Yuan Shi et.al. 2311.09919 null
2023-11-16 Diffusion-Augmented Neural Processes Lorenzo Bonito et.al. 2311.09848 null
2023-11-16 MAM-E: Mammographic synthetic image generation with diffusion models Ricardo Montoya-del-Angel et.al. 2311.09822 link
2023-11-16 Scene Text Image Super-resolution based on Text-conditional Diffusion Models Chihiro Noguchi et.al. 2311.09759 link
2023-11-16 DIFFNAT: Improving Diffusion Image Quality Using Natural Image Statistics Aniket Roy et.al. 2311.09753 null
2023-11-16 What Constitutes a Faithful Summary? Preserving Author Perspectives in News Summarization Yuhan Liu et.al. 2311.09741 link
2023-11-16 DECDM: Document Enhancement using Cycle-Consistent Diffusion Models Jiaxin Zhang et.al. 2311.09625 null
2023-11-16 3D Paintbrush: Local Stylization of 3D Shapes with Cascaded Score Distillation Dale Decatur et.al. 2311.09571 link
2023-11-15 Synthetically Enhanced: Unveiling Synthetic Data’s Potential in Medical Imaging Research Bardia Khosravi et.al. 2311.09402 link
2023-11-15 Privacy Threats in Stable Diffusion Models Thomas Cilloni et.al. 2311.09355 null
2023-11-15 Generative AI-Based Probabilistic Constellation Shaping With Diffusion Models Mehdi Letafati et.al. 2311.09349 null
2023-11-15 FastBlend: a Powerful Model-Free Toolkit Making Video Stylization Easier Zhongjie Duan et.al. 2311.09265 link
2023-11-15 Single-Image 3D Human Digitization with Shape-Guided Diffusion Badour AlBahar et.al. 2311.09221 null
2023-11-15 DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model Yinghao Xu et.al. 2311.09217 null
2023-11-15 Contrastive Transformer Learning with Proximity Data Generation for Text-Based Person Search Hefeng Wu et.al. 2311.09084 link
2023-11-15 A Spectral Diffusion Prior for Hyperspectral Image Super-Resolution Jianjun Liu et.al. 2311.08955 link
2023-11-16 One-Shot Federated Learning with Classifier-Guided Diffusion Models Mingzhao Yang et.al. 2311.08870 null
2023-11-15 A Diffusion Model Based Quality Enhancement Method for HEVC Compressed Video Zheng Liu et.al. 2311.08746 null
2023-11-15 Towards Graph-Aware Diffusion Modeling for Collaborative Filtering Yunqin Zhu et.al. 2311.08744 link
2023-11-15 EDMSound: Spectrogram Based Diffusion Models for Efficient and High-Quality Audio Synthesis Ge Zhu et.al. 2311.08667 null
2023-11-14 Probabilistic reconstruction of Dark Matter fields from biased tracers using diffusion models Core Francisco Park et.al. 2311.08558 link
2023-11-14 Mustango: Toward Controllable Text-to-Music Generation Jan Melechovsky et.al. 2311.08355 link
2023-11-15 Generative De-Quantization for Neural Speech Codec via Latent Diffusion Haici Yang et.al. 2311.08330 null
2023-11-14 Diffusion-based generation of Histopathological Whole Slide Images at a Gigapixel scale Robert Harb et.al. 2311.08199 null
2023-11-14 Influence of departures from LTE on determinations of the scandium abundances in A-B type stars L. Mashonkina et.al. 2311.07982 null
2023-11-14 Brain-Driven Representation Learning Based on Diffusion Model Soowon Kim et.al. 2311.07925 null
2023-11-14 Bayesian Conditional Diffusion Models for Versatile Spatiotemporal Turbulence Generation Han Gao et.al. 2311.07896 null
2023-11-14 One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion Minghua Liu et.al. 2311.07885 null
2023-11-13 Fast and Space-Efficient Parallel Algorithms for Influence Maximization Letong Wang et.al. 2311.07554 link
2023-11-13 Robust semi-supervised segmentation with timestep ensembling diffusion models Margherita Rosnati et.al. 2311.07421 null
2023-11-13 Zero-Shot Duet Singing Voices Separation with Diffusion Models Chin-Yun Yu et.al. 2311.07345 link
2023-11-13 A Gaussian Process Based Method with Deep Kernel Learning for Pricing High-dimensional American Options Jirong Zhuang et.al. 2311.07211 null
2023-11-13 MonoDiffusion: Self-Supervised Monocular Depth Estimation Using Diffusion Model Shuwei Shao et.al. 2311.07198 link
2023-11-13 Adversarial Purification for Data-Driven Power System Event Classifiers with Diffusion Models Yuanbin Cheng et.al. 2311.07110 null
2023-11-12 Augmented Bridge Matching Valentin De Bortoli et.al. 2311.06978 null
2023-11-12 Sampler Scheduler for Diffusion Models Zitong Cheng et.al. 2311.06845 link
2023-11-12 IMPUS: Image Morphing with Perceptually-Uniform Sampling Using Diffusion Models Zhaoyuan Yang et.al. 2311.06792 link
2023-11-11 A 3D Conditional Diffusion Model for Image Quality Transfer – An Application to Low-Field MRI Seunghoi Kim et.al. 2311.06631 link
2023-11-11 Generative AI for Space-Air-Ground Integrated Networks (SAGIN) Ruichen Zhang et.al. 2311.06523 null
2023-11-11 Adversarial Fine-tuning using Generated Respiratory Sound to Address Class Imbalance June-Woo Kim et.al. 2311.06480 link
2023-11-10 On degenerate reaction-diffusion epidemic models with mass action or standard incidence mechanism Rachidi Salako et.al. 2311.06434 null
2023-11-10 Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models Siao Tang et.al. 2311.06322 link
2023-11-10 Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization Weiyang Liu et.al. 2311.06243 null
2023-11-10 Diffusion Models for Earth Observation Use-cases: from cloud removal to urban change detection Fulvio Sanguigni et.al. 2311.06222 null
2023-11-10 Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model Jiahao Li et.al. 2311.06214 null
2023-11-10 Enhancing Rock Image Segmentation in Digital Rock Physics: A Fusion of Generative AI and State-of-the-Art Neural Networks Zhaoyang Ma et.al. 2311.06079 null
2023-11-10 Semantic Map Guided Synthesis of Wireless Capsule Endoscopy Images using Diffusion Models Haejin Lee et.al. 2311.05889 null
2023-11-10 Diffusion Shape Prior for Wrinkle-Accurate Cloth Registration Jingfan Guo et.al. 2311.05828 null
2023-11-09 LCM-LoRA: A Universal Stable-Diffusion Acceleration Module Simian Luo et.al. 2311.05556 link
2023-11-09 Onset of pattern formation for the stochastic Allen-Cahn equation Stella Brassesco et.al. 2311.05526 null
2023-11-09 3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Models Haibo Yang et.al. 2311.05464 link
2023-11-09 ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors Jingwen Chen et.al. 2311.05463 null
2023-11-09 Control3D: Towards Controllable Text-to-3D Generation Yang Chen et.al. 2311.05461 null
2023-11-09 Predicting the Position Uncertainty at the Time of Closest Approach with Diffusion Models Marta Guimarães et.al. 2311.05417 null
2023-11-09 ConRad: Image Constrained Radiance Fields for 3D Generation from a Single Image Senthil Purushwalkam et.al. 2311.05230 null
2023-11-09 Super-Resolution Emulation of Large Cosmological Fields with a 3D Conditional Diffusion Model Adam Rouhiainen et.al. 2311.05217 null
2023-11-09 BrainNetDiff: Generative AI Empowers Brain Network Generation via Multimodal Diffusion Model Yongcheng Zong et.al. 2311.05199 null
2023-11-08 Lightweight Diffusion Models with Distillation-Based Block Neural Architecture Search Siao Tang et.al. 2311.04950 null
2023-11-08 Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation Ha-Yeong Choi et.al. 2311.04693 link
2023-11-08 Weakly-supervised deepfake localization in diffusion-generated images Dragos Tantaru et.al. 2311.04584 link
2023-11-08 A 3D generative model of pathological multi-modal MR images and segmentations Virginia Fernandez et.al. 2311.04552 link
2023-11-07 3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features Chenfeng Xu et.al. 2311.04391 null
2023-11-07 Dose-aware Diffusion Model for 3D Ultra Low-dose PET Imaging Huidong Xie et.al. 2311.04248 null
2023-11-07 I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models Shiwei Zhang et.al. 2311.04145 link
2023-11-07 Generative Structural Design Integrating BIM and Diffusion Model Zhili He et.al. 2311.04052 link
2023-11-07 Formulating Discrete Probability Flow Through Optimal Transport Pengze Zhang et.al. 2311.03886 link
2023-11-07 Reducing Spatial Fitting Error in Distillation of Denoising Diffusion Models Shengzhe Zhou et.al. 2311.03830 link
2023-11-07 3DifFusionDet: Diffusion Model for 3D Object Detection with Robust LiDAR-Camera Fusion Xinhao Xiang et.al. 2311.03742 null
2023-11-06 The steady state of the boundary-driven multiparticle asymmetric diffusion model Rouven Frassek et.al. 2311.03603 null
2023-11-06 Generative Diffusion Models for Lattice Field Theory Lingxiao Wang et.al. 2311.03578 null
2023-11-06 Multi-Resolution Diffusion for Privacy-Sensitive Recommender Systems Derek Lilienthal et.al. 2311.03488 link
2023-11-06 TS-Diffusion: Generating Highly Complex Time Series with Diffusion Models Yangming Li et.al. 2311.03303 null
2023-11-06 LDM3D-VR: Latent Diffusion Model for 3D VR Gabriela Ben Melech Stan et.al. 2311.03226 null
2023-11-06 Algebraic Dynamical Systems in Machine Learning Iolo Jones et.al. 2311.03118 null
2023-11-07 AnyText: Multilingual Visual Text Generation And Editing Yuxiang Tuo et.al. 2311.03054 link
2023-11-06 Exploring the Capability of Text-to-Image Diffusion Models with Structural Edge Guidance for Multi-Spectral Satellite Image Inpainting Mikolaj Czerkawski et.al. 2311.03008 null
2023-11-06 Diffusion-based Radiotherapy Dose Prediction Guided by Inter-slice Aware Structure Encoding Zhenghao Feng et.al. 2311.02991 null
2023-11-06 Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video Yanqin Jiang et.al. 2311.02848 null
2023-11-04 From Trojan Horses to Castle Walls: Unveiling Bilateral Backdoor Effects in Diffusion Models Zhuoshi Pan et.al. 2311.02373 link
2023-11-04 Domain Transfer in Latent Space (DTLS) Wins on Image Super-Resolution – a Non-Denoising Model Chun-Chuen Hui et.al. 2311.02358 link
2023-11-04 Stable Diffusion Reference Only: Image Prompt and Blueprint Jointly Guided Multi-Condition Diffusion Model for Secondary Painting Hao Ai et.al. 2311.02343 link
2023-11-03 Patch-based Selection and Refinement for Early Object Detection Tianyi Zhang et.al. 2311.02274 link
2023-11-03 Sparse Training of Discrete Diffusion Models for Graph Generation Yiming Qin et.al. 2311.02142 link
2023-11-03 Quantum circuit synthesis with diffusion models Florian Fürrutter et.al. 2311.02041 link
2023-11-03 Latent Diffusion Model for Conditional Reservoir Facies Generation Daesoo Lee et.al. 2311.01968 link
2023-11-03 On the Generalization Properties of Diffusion Models Puheng Li et.al. 2311.01797 link
2023-11-06 CDGraph: Dual Conditional Social Graph Synthesizing via Diffusion Model Jui-Yi Tsai et.al. 2311.01729 null
2023-11-02 Improving Fairness using Vision-Language Driven Image Augmentation Moreno D’Incà et.al. 2311.01573 link
2023-11-02 Exploring the Hyperparameter Space of Image Diffusion Models for Echocardiogram Generation Hadrien Reynaud et.al. 2311.01567 null
2023-11-02 Investigating the Behavior of Diffusion Models for Accelerating Electronic Structure Calculations Daniel Rothchild et.al. 2311.01491 null
2023-11-02 Time Series Anomaly Detection using Diffusion-based Models Ioana Pintilie et.al. 2311.01452 link
2023-11-02 Constrained-Context Conditional Diffusion Models for Imitation Learning Vaibhav Saxena et.al. 2311.01419 null
2023-11-02 Sim2Real Bilevel Adaptation for Object Surface Classification using Vision-Based Tactile Sensors Gabriele M. Caddeo et.al. 2311.01380 link
2023-11-02 DP-Mix: Mixup-based Data Augmentation for Differentially Private Learning Wenxuan Bao et.al. 2311.01295 link
2023-11-02 Optimal Transport-Guided Conditional Score-Based Diffusion Models Xiang Gu et.al. 2311.01226 link
2023-11-02 Diffusion Models for Reinforcement Learning: A Survey Zhengbang Zhu et.al. 2311.01223 link
2023-11-02 Add and Thin: Diffusion for Temporal Point Processes David Lüdke et.al. 2311.01139 null
2023-11-02 Infusion: Internal Diffusion for Video Inpainting Nicolas Cherel et.al. 2311.01090 link
2023-11-02 Expanding Expressiveness of Diffusion Models with Limited Data via Self-Distillation based Fine-Tuning Jiwan Hur et.al. 2311.01018 null
2023-11-02 Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs Peng Jin et.al. 2311.01015 link
2023-11-02 Optimal Noise pursuit for Augmenting Text-to-Video Generation Shijie Ma et.al. 2311.00949 null
2023-11-02 Gaussian Mixture Solvers for Diffusion Models Hanzhong Guo et.al. 2311.00941 link
2023-11-02 Bridging the Gap: Addressing Discrepancies in Diffusion Model Training for Classifier-Free Guidance Niket Patel et.al. 2311.00938 null
2023-11-02 Towards High-quality HDR Deghosting with Conditional Diffusion Models Qingsen Yan et.al. 2311.00932 null
2023-11-01 HIDM: Emulating Large Scale HI Maps using Score-based Diffusion Models Sultan Hassan et.al. 2311.00833 null
2023-11-01 Quantum Computational Algorithms for Derivative Pricing and Credit Risk in a Regime Switching Economy Eric Ghysels et.al. 2311.00825 null
2023-11-01 De-Diffusion Makes Text a Strong Cross-Modal Interface Chen Wei et.al. 2311.00618 null
2023-11-01 Controllable Music Production with Diffusion Models and Guidance Gradients Mark Levy et.al. 2311.00613 null
2023-11-01 Intriguing Properties of Data Attribution on Diffusion Models Xiaosen Zheng et.al. 2311.00500 link
2023-11-01 Generating HSR Bogie Vibration Signals via Pulse Voltage-Guided Conditional Diffusion Model Xuan Liu et.al. 2311.00496 link
2023-11-01 Diffusion models for probabilistic programming Simon Dirmeier et.al. 2311.00474 link
2023-11-01 Dual Conditioned Diffusion Models for Out-Of-Distribution Detection: Application to Fetal Ultrasound Videos Divyanshu Mishra et.al. 2311.00469 null
2023-11-01 LatentWarp: Consistent Diffusion Latents for Zero-Shot Video-to-Video Translation Yuxiang Bao et.al. 2311.00353 null
2023-11-01 Space Narrative: Generating Images and 3D Scenes of Chinese Garden from Text using Deep Learning Jiaxi Shi1 et.al. 2311.00339 null
2023-11-01 Adaptive Latent Diffusion Model for 3D Medical Image to Image Translation: Multi-modal Magnetic Resonance Imaging Study Jonghun Kim et.al. 2311.00265 link
2023-10-31 Score Normalization for a Faster Diffusion Exponential Integrator Sampler Guoxuan Xia et.al. 2311.00157 link
2023-10-31 SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction Xinyuan Chen et.al. 2310.20700 null
2023-10-31 Diffusion Reconstruction of Ultrasound Images with Informative Uncertainty Yuxin Zhang et.al. 2310.20618 null
2023-10-31 Generate What You Prefer: Reshaping Sequential Recommendation via Guided Diffusion Zhengyi Yang et.al. 2310.20453 link
2023-10-31 In Search of Lost Online Test-time Adaptation: A Survey Zixin Wang et.al. 2310.20199 link
2023-10-31 A Perturbative Solution to the Linear Influence/Network Autocorrelation Model Under Network Dynamics Carter T. Butts et.al. 2310.20163 null
2023-10-31 Synthesizing Diabetic Foot Ulcer Images with Diffusion Model Reza Basiri et.al. 2310.20140 null
2023-10-31 Beyond U: Making Diffusion Models Faster & Lighter Sergio Calvo-Ordonez et.al. 2310.20092 null
2023-10-30 Scaling Riemannian Diffusion Models Aaron Lou et.al. 2310.20030 null
2023-10-30 DiffEnc: Variational Diffusion with a Learned Encoder Beatrix M. G. Nielsen et.al. 2310.19789 link
2023-10-30 CustomNet: Zero-shot Object Customization with Variable-Viewpoints in Text-to-Image Diffusion Models Ziyang Yuan et.al. 2310.19784 null
2023-10-29 Learning to Follow Object-Centric Image Editing Instructions Faithfully Tuhin Chakrabarty et.al. 2310.19145 link
2023-10-29 Adversarial Examples Are Not Real Features Ang Li et.al. 2310.18936 link
2023-10-28 Customizing 360-Degree Panoramas through Text-to-Image Diffusion Models Hai Wang et.al. 2310.18840 link
2023-10-28 Successfully Applying Lottery Ticket Hypothesis to Diffusion Model Chao Jiang et.al. 2310.18823 link
2023-10-28 Purify++: Improving Diffusion-Purification with Advanced Diffusion Models and Control of Randomness Boya Zhang et.al. 2310.18762 null
2023-10-27 From Generative AI to Generative Internet of Things: Fundamentals, Framework, and Outlooks Jinbo Wen et.al. 2310.18382 null
2023-10-27 Gen2Sim: Scaling up Robot Learning in Simulation with Generative Models Pushkal Katara et.al. 2310.18308 null
2023-10-27 ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image Kyle Sargent et.al. 2310.17994 link
2023-10-26 6-DoF Stability Field via Diffusion Models Takuma Yoneda et.al. 2310.17649 null
2023-10-26 Generative Fractional Diffusion Models Gabriel Nobis et.al. 2310.17638 link
2023-10-26 Noise-Free Score Distillation Oren Katzir et.al. 2310.17590 null
2023-10-26 Convergence of flow-based generative models via proximal gradient descent in Wasserstein space Xiuyuan Cheng et.al. 2310.17582 link
2023-10-27 Global Structure-Aware Diffusion Process for Low-Light Image Enhancement Jinhui Hou et.al. 2310.17577 link
2023-10-26 DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation Yongxin Zhu et.al. 2310.17570 null
2023-10-26 SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching Xinghui Li et.al. 2310.17569 null
2023-10-27 The Expressive Power of Low-Rank Adaptation Yuchen Zeng et.al. 2310.17513 link
2023-10-26 The statistical thermodynamics of generative diffusion models Luca Ambrogioni et.al. 2310.17467 null
2023-10-26 Likelihood-based Out-of-Distribution Detection with Denoising Diffusion Probabilistic Models Joseph Goodier et.al. 2310.17432 null
2023-10-26 Causal Modeling with Stationary Diffusions Lars Lorch et.al. 2310.17405 link
2023-10-26 Towards Unifying Diffusion Models for Probabilistic Spatio-Temporal Graph Learning Junfeng Hu et.al. 2310.17360 null
2023-10-26 SE(3) Diffusion Model-based Point Cloud Registration for Robust 6D Object Pose Estimation Haobo Jiang et.al. 2310.17359 null
2023-10-26 CADS: Unleashing the Diversity of Diffusion Models through Condition-Annealed Sampling Seyedmorteza Sadat et.al. 2310.17347 null
2023-10-26 Attribute Based Interpretable Evaluation Metrics for Generative Models Dongkyun Kim et.al. 2310.17261 link
2023-10-26 Exploring Iterative Refinement with Diffusion Models for Video Grounding Xiao Liang et.al. 2310.17189 link
2023-10-26 Improving Denoising Diffusion Models via Simultaneous Estimation of Image and Noise Zhenkai Zhang et.al. 2310.17167 null
2023-10-26 Hierarchical Semi-Implicit Variational Inference with Application to Diffusion Model Acceleration Longlin Yu et.al. 2310.17153 link
2023-10-25 Discrete Diffusion Language Modeling by Estimating the Ratios of the Data Distribution Aaron Lou et.al. 2310.16834 link
2023-10-25 PERF: Panoramic Neural Radiance Field from a Single Panorama Guangcong Wang et.al. 2310.16831 link
2023-10-25 CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images Aaron Gokaslan et.al. 2310.16825 link
2023-10-26 DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior Jingxiang Sun et.al. 2310.16818 link
2023-10-25 Using Diffusion Models to Generate Synthetic Labelled Data for Medical Image Segmentation Daniel Saragih et.al. 2310.16794 link
2023-10-26 Multi-scale Diffusion Denoised Smoothing Jongheon Jeong et.al. 2310.16779 link
2023-10-25 Local Statistics for Generative Image Detection Yung Jer Wong et.al. 2310.16684 null
2023-10-25 A Picture is Worth a Thousand Words: Principled Recaptioning Improves Image Generation Eyal Segalis et.al. 2310.16656 null
2023-10-25 Constraining the slow-diffusion zone size and electron injection spectral index for the Geminga pulsar halo Kun Fang et.al. 2310.16594 null
2023-10-25 Adapt Anything: Tailor Any Image Classifiers across Domains And Categories Using Text-to-Image Diffusion Models Weijie Chen et.al. 2310.16573 null
2023-10-25 Open Knowledge Base Canonicalization with Multi-task Unlearning Bingchen Liu et.al. 2310.16419 null
2023-10-25 Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models Tianyi Lu et.al. 2310.16400 link
2023-10-25 DiffRef3D: A Diffusion-based Proposal Refinement Framework for 3D Object Detection Se-Ho Kim et.al. 2310.16349 null
2023-10-25 Diffusion model approach to simulating electron-proton scattering events Peter Devlin et.al. 2310.16308 null
2023-10-25 Dolfin: Diffusion Layout Transformers without Autoencoder Yilin Wang et.al. 2310.16305 null
2023-10-25 Removing Dust from CMB Observations with Diffusion Models David Heurtel-Depeiges et.al. 2310.16285 null
2023-10-24 iNVS: Repurposing Diffusion Inpainters for Novel View Synthesis Yash Kant et.al. 2310.16167 null
2023-10-24 RePoseDM: Recurrent Pose Alignment and Gradient Guidance for Pose Guided Image Synthesis Anant Khandelwal et.al. 2310.16074 null
2023-10-25 Improving Robustness and Reliability in Medical Image Classification with Latent-Guided Diffusion and Nested-Ensembles Xing Shen et.al. 2310.15952 null
2023-10-24 Language-driven Scene Synthesis using Multi-conditional Diffusion Model An Vuong et.al. 2310.15948 link
2023-10-23 FreeNoise: Tuning-Free Longer Video Diffusion Via Noise Rescheduling Haonan Qiu et.al. 2310.15169 link
2023-10-23 Matryoshka Diffusion Models Jiatao Gu et.al. 2310.15111 link
2023-10-23 Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model Ruoxi Shi et.al. 2310.15110 link
2023-10-24 Wonder3D: Single Image to 3D using Cross-Domain Diffusion Xiaoxiao Long et.al. 2310.15008 null
2023-10-23 Orientation-Aware Leg Movement Learning for Action-Driven Human Motion Prediction Chunzhi Gu et.al. 2310.14907 null
2023-10-23 Joint Non-Linear MRI Inversion with Diffusion Priors Moritz Erlacher et.al. 2310.14842 null
2023-10-23 MAS: Multi-view Ancestral Sampling for 3D motion generation using 2D diffusion Roy Kapon et.al. 2310.14729 null
2023-10-23 $Λ$ -Split: A Privacy-Preserving Split Computing Framework for Cloud-Powered Generative AI Shoki Ohta et.al. 2310.14651 link
2023-10-23 DICE: Diverse Diffusion Model with Scoring for Trajectory Prediction Younwoo Choi et.al. 2310.14570 null
2023-10-22 Diffusion-Model-Assisted Supervised Learning of Generative Models for Density Estimation Yanfang Liu et.al. 2310.14458 null
2023-10-22 Diffusion-based Data Augmentation for Nuclei Image Segmentation Xinyi Yu et.al. 2310.14197 link
2023-10-22 Improved Techniques for Training Consistency Models Yang Song et.al. 2310.14189 null
2023-10-21 Composer Style-specific Symbolic Music Generation Using Vector Quantized Discrete Diffusion Models Jincheng Zhang et.al. 2310.14044 link
2023-10-21 Fast Diffusion GAN Model for Symbolic Music Generation Controlled by Emotions Jincheng Zhang et.al. 2310.14040 null
2023-10-21 Cold Diffusion on the Replay Buffer: Learning to Plan from Known Good States Zidan Wang et.al. 2310.13914 null
2023-10-20 GraphMaker: Can Diffusion Models Generate Large Attributed Graphs? Mufei Li et.al. 2310.13833 link
2023-10-20 TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models Tianshi Cao et.al. 2310.13772 null
2023-10-20 Localizing and Editing Knowledge in Text-to-Image Generative Models Samyadeep Basu et.al. 2310.13730 null
2023-10-20 ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection Zhongzhan Huang et.al. 2310.13545 link
2023-10-19 CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation Sihan Xu et.al. 2310.13165 link
2023-10-19 EMIT-Diff: Enhancing Medical Image Segmentation via Text-Guided Diffusion Model Zheyuan Zhang et.al. 2310.12868 link
2023-10-19 Energy-Based Models For Speech Synthesis Wanli Sun et.al. 2310.12765 null
2023-10-19 TapMo: Shape-aware Motion Generation of Skeleton-free Characters Jiaxu Zhang et.al. 2310.12678 null
2023-10-19 Product of Gaussian Mixture Diffusion Models Martin Zach et.al. 2310.12653 link
2023-10-19 Denoising Heat-inspired Diffusion with Insulators for Collision Free Motion Planning Junwoo Chang et.al. 2310.12609 null
2023-10-19 Diverse Diffusion: Enhancing Image Diversity in Text-to-Image Generation Mariia Zameshina et.al. 2310.12583 null
2023-10-19 SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation Chongyu Fan et.al. 2310.12508 link
2023-10-19 Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping Zijie Pan et.al. 2310.12474 link
2023-10-19 Closed-Form Diffusion Models Christopher Scarvelis et.al. 2310.12395 null
2023-10-18 DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors Jinbo Xing et.al. 2310.12190 link
2023-10-18 Quality Diversity through Human Feedback Li Ding et.al. 2310.12103 link
2023-10-20 Image Super-resolution Via Latent Diffusion: A Sampling-space Mixture Of Experts And Frequency-augmented Decoder Approach Feng Luo et.al. 2310.12004 link
2023-10-18 Bayesian Flow Networks in Continual Learning Mateusz Pyla et.al. 2310.12001 null
2023-10-18 InfoDiffusion: Information Entropy Aware Diffusion Process for Non-Autoregressive Text Generation Renzhi Wang et.al. 2310.11976 link
2023-10-18 To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images … For Now Yimeng Zhang et.al. 2310.11868 link
2023-10-20 Equivariant Bootstrapping for Uncertainty Quantification in Imaging Inverse Problems Julian Tachella et.al. 2310.11838 link
2023-10-18 Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts Xinhua Cheng et.al. 2310.11784 null
2023-10-18 Language Agents for Detecting Implicit Stereotypes in Text-to-image Models at Scale Qichao Wang et.al. 2310.11778 null
2023-10-18 On the Evaluation of Generative Models in Distributed Learning Tasks Zixiao Wang et.al. 2310.11714 null
2023-10-17 Reflection-Equivariant Diffusion for 3D Structure Determination from Isotopologue Rotational Spectra in Natural Abundance Austin Cheng et.al. 2310.11609 link
2023-10-17 GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment Dhruba Ghosh et.al. 2310.11513 link
2023-10-17 Elucidating The Design Space of Classifier-Guided Diffusion Generation Jiajun Ma et.al. 2310.11311 link
2023-10-17 BayesDiff: Estimating Pixel-wise Uncertainty in Diffusion via Bayesian Inference Siqi Kou et.al. 2310.11142 link
2023-10-17 3D Structure-guided Network for Tooth Alignment in 2D Photograph Yulong Dou et.al. 2310.11106 link
2023-10-16 LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation Ruiqi Wu et.al. 2310.10769 link
2023-10-18 BiomedJourney: Counterfactual Biomedical Image Generation by Instruction-Learning from Multimodal Patient Journeys Yu Gu et.al. 2310.10765 null
2023-10-16 MOFDiff: Coarse-grained Diffusion for Metal-Organic Framework Design Xiang Fu et.al. 2310.10732 null
2023-10-16 A Survey on Video Diffusion Models Zhen Xing et.al. 2310.10647 link
2023-10-16 LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts Hanan Gani et.al. 2310.10640 link
2023-10-16 Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion Models Kevin Black et.al. 2310.10639 link
2023-10-16 ForceGen: End-to-end de novo protein generation based on nonlinear mechanical unfolding responses using a protein language diffusion model Bo Ni et.al. 2310.10605 null
2023-10-16 Generation or Replication: Auscultating Audio Latent Diffusion Models Dimitrios Bralios et.al. 2310.10604 null
2023-10-16 Model Selection of Anomaly Detectors in the Absence of Labeled Validation Data Clement Fung et.al. 2310.10461 null
2023-10-16 ConsistNet: Enforcing 3D Consistency for Multi-view Images Diffusion Jiayu Yang et.al. 2310.10343 link
2023-10-16 Scene Graph Conditioning in Latent Diffusion Frank Fundel et.al. 2310.10338 link
2023-10-16 Towards image compression with perfect realism at ultra-low bitrates Marlène Careil et.al. 2310.10325 null
2023-10-16 Self-supervised Fetal MRI 3D Reconstruction Based on Radiation Diffusion Generation Model Junpeng Tan et.al. 2310.10209 null
2023-10-16 Ring-A-Bell! How Reliable are Concept Removal Methods for Diffusion Models? Yu-Lin Tsai et.al. 2310.10012 link
2023-10-15 Unsupervised Discovery of Interpretable Directions in h-space of Pre-trained Diffusion Models Zijian Zhang et.al. 2310.09912 null
2023-10-15 Image Augmentation with Controlled Diffusion for Weakly-Supervised Semantic Segmentation Wangyu Wu et.al. 2310.09760 null
2023-10-15 LOVECon: Text-driven Training-Free Long Video Editing with ControlNet Zhenyi Liao et.al. 2310.09711 link
2023-10-14 Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space Hengrui Zhang et.al. 2310.09656 link
2023-10-14 Adaptive Online Replanning with Diffusion Models Siyuan Zhou et.al. 2310.09629 null
2023-10-14 JSMoCo: Joint Coil Sensitivity and Motion Correction in Parallel MRI with a Self-Calibrating Score-Based Diffusion Model Lixuan Chen et.al. 2310.09625 null
2023-10-14 Neural Network for valuing Bitcoin options under jump-diffusion and market sentiment model Edson Pindza et.al. 2310.09622 null
2023-10-14 Unified High-binding Watermark for Unconditional Image Generation Models Ruinan Ma et.al. 2310.09479 null
2023-10-14 Towards More Accurate Diffusion Model Acceleration with A Timestep Aligner Mengfei Xia et.al. 2310.09469 null
2023-10-12 HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion Xian Liu et.al. 2310.08579 null
2023-10-12 NetDiffusion: Network Data Augmentation Through Protocol-Constrained Traffic Generation Xi Jiang et.al. 2310.08543 null
2023-10-12 GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors Taoran Yi et.al. 2310.08529 link
2023-10-12 MotionDirector: Motion Customization of Text-to-Video Diffusion Models Rui Zhao et.al. 2310.08465 link
2023-10-12 Debias the Training of Diffusion Models Hu Yu et.al. 2310.08442 link
2023-10-12 A new local and explicit kinetic method for linear and non-linear convection-diffusion problems with finite kinetic speeds: I. One-dimensional case Gauthier Wissocq et.al. 2310.08356 null
2023-10-12 Neural Diffusion Models Grigory Bartosh et.al. 2310.08337 null
2023-10-12 Consistent123: Improve Consistency for One Image to 3D Object Synthesis Haohan Weng et.al. 2310.08092 null
2023-10-12 Interpretable Diffusion via Information Decomposition Xianghao Kong et.al. 2310.07972 link
2023-10-11 NoMaD: Goal Masked Diffusion Policies for Navigation and Exploration Ajay Sridhar et.al. 2310.07896 link
2023-10-11 Efficient Integrators for Diffusion Generative Models Kushagra Pandey et.al. 2310.07894 link
2023-10-13 Generative Modeling with Phase Stochastic Bridges Tianrong Chen et.al. 2310.07805 link
2023-10-11 Quantum sequential scattering model for quantum state learning Mingrui Jing et.al. 2310.07797 null
2023-10-11 DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model Xiaofan Li et.al. 2310.07771 link
2023-10-11 ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models Yingqing He et.al. 2310.07702 link
2023-10-12 Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models Zeqiang Lai et.al. 2310.07653 link
2023-10-11 Boosting Black-box Attack to Deep Neural Networks with Conditional Diffusion Models Renyang Liu et.al. 2310.07492 link
2023-10-11 Multi-Concept T2I-Zero: Tweaking Only The Text Embeddings and Nothing Else Hazarapet Tunanyan et.al. 2310.07419 null
2023-10-12 WiGenAI: The Symphony of Wireless and Generative AI via Diffusion Models Mehdi Letafati et.al. 2310.07312 null
2023-10-12 Score Regularized Policy Optimization through Diffusion Behavior Huayu Chen et.al. 2310.07297 link
2023-10-11 Uni-paint: A Unified Framework for Multimodal Image Inpainting with Pretrained Diffusion Model Shiyuan Yang et.al. 2310.07222 link
2023-10-11 Generative Modeling on Manifolds Through Mixture of Riemannian Diffusion Processes Jaehyeong Jo et.al. 2310.07216 link
2023-10-11 State of the Art on Diffusion Models for Visual Computing Ryan Po et.al. 2310.07204 null
2023-10-11 The Ubiquity of Diffusiophoresis: Exploring Human Population Dynamics While Including Concentration Gradient-Driven Advection Benjamin M. Alessio et.al. 2310.07185 null
2023-10-11 Imitation Learning from Purified Demonstration Yunke Wang et.al. 2310.07143 link
2023-10-11 Denoising Task Routing for Diffusion Models Byeongjun Park et.al. 2310.07138 link
2023-10-11 Echocardiography video synthesis from end diastolic semantic map via diffusion model Phi Nguyen Van et.al. 2310.07131 null
2023-10-10 Investigating the Adversarial Robustness of Density Estimation Using the Probability Flow ODE Marius Arvinte et.al. 2310.07084 null
2023-10-10 ObjectComposer: Consistent Generation of Multiple Objects Without Fine-tuning Alec Helbling et.al. 2310.06968 null
2023-10-10 Monsters in the Dark: Sanitizing Hidden Threats with Diffusion Models Preston K. Robinette et.al. 2310.06951 null
2023-10-10 Stochastic Super-resolution of Cosmological Simulations with Denoising Diffusion Models Andreas Schanz et.al. 2310.06929 null
2023-10-10 HiFi-123: Towards High-fidelity One Image to 3D Content Generation Wangbo Yu et.al. 2310.06744 null
2023-10-10 Tweedie Moment Projected Diffusions For Inverse Problems Benjamin Boys et.al. 2310.06721 null
2023-10-10 Latent Diffusion Counterfactual Explanations Karim Farid et.al. 2310.06668 null
2023-10-09 FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing Yuren Cong et.al. 2310.05922 null
2023-10-10 Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models Zhili Liu et.al. 2310.05873 null
2023-10-09 A Bias-Variance-Covariance Decomposition of Kernel Scores for Generative Models Sebastian G. Gruber et.al. 2310.05833 link
2023-10-09 DiffuSeq-v2: Bridging Discrete and Continuous Text Spaces for Accelerated Seq2Seq Diffusion Models Shansan Gong et.al. 2310.05793 link
2023-10-09 Language Model Beats Diffusion – Tokenizer is Key to Visual Generation Lijun Yu et.al. 2310.05737 link
2023-10-09 DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning Longxiang He et.al. 2310.05333 link
2023-10-08 Image Compression and Decompression Framework Based on Latent Diffusion Model for Breast Mammography InChan Hwang et.al. 2310.05299 link
2023-10-08 Fast protein backbone generation with SE(3) flow matching Jason Yim et.al. 2310.05297 null
2023-10-08 The Emergence of Reproducibility and Consistency in Diffusion Models Huijie Zhang et.al. 2310.05264 null
2023-10-08 Latent Diffusion Model for Medical Image Standardization and Enhancement Md Selim et.al. 2310.05237 null
2023-10-07 Prompt-to-OS (P2OS): Revolutionizing Operating Systems and Human-Computer Interaction with Integrated AI Generative Models Gabriele Tolomei et.al. 2310.04875 null
2023-10-07 Conditional Diffusion Model for Target Speaker Extraction Theodor Nguyen et.al. 2310.04791 null
2023-10-10 DiffNAS: Bootstrapping Diffusion Models by Prompting for Better Architectures Wenhao Li et.al. 2310.04750 null
2023-10-07 SeeDS: Semantic Separable Diffusion Synthesizer for Zero-shot Food Detection Pengfei Zhou et.al. 2310.04689 link
2023-10-07 Understanding and Improving Adversarial Attacks on Latent Diffusion Model Boyang Zheng et.al. 2310.04687 link
2023-10-07 VoiceExtender: Short-utterance Text-independent Speaker Verification with Guided Diffusion Model Yayun He et.al. 2310.04681 null
2023-10-07 EasyPhoto: Your Smart AI Photo Generator Ziheng Wu et.al. 2310.04672 link
2023-10-07 Score-based Diffusion Models With Self-supervised Learning For Accelerated 3D Multi-contrast Cardiac Magnetic Resonance Imaging Yuanyuan Liu et.al. 2310.04669 null
2023-10-06 DragD3D: Vertex-based Editing for Realistic Mesh Deformations using 2D Diffusion Priors Tianhao Xie et.al. 2310.04561 null
2023-10-06 Generative Diffusion From An Action Principle Akhil Premkumar et.al. 2310.04490 null
2023-10-05 Aligning Text-to-Image Diffusion Models with Reward Backpropagation Mihir Prabhudesai et.al. 2310.03739 link
2023-10-05 Certification of Deep Learning Models for Medical Image Segmentation Othmane Laousy et.al. 2310.03664 link
2023-10-05 Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints Chuan Fang et.al. 2310.03602 null
2023-10-05 Deep Generative Models of Music Expectation Ninon Lizé Masclef et.al. 2310.03500 null
2023-10-05 FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators Haiping Wang et.al. 2310.03420 link
2023-10-05 ACT-Net: Anchor-context Action Detection in Surgery Videos Luoying Hao et.al. 2310.03377 null
2023-10-05 Realistic Speech-to-Face Generation with Speech-Conditioned Latent Diffusion Model with Face Prior Jinting Wang et.al. 2310.03363 null
2023-10-05 Denoising Diffusion Step-aware Models Shuai Yang et.al. 2310.03337 link
2023-10-05 EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models Yefei He et.al. 2310.03270 link
2023-10-04 Low-Energy Radiative Backgrounds in CCD-Based Dark-Matter Detectors Peizhi Du et.al. 2310.03068 null
2023-10-04 Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models Jianglong Ye et.al. 2310.03020 null
2023-10-04 Efficient-3DiM: Learning a Generalizable Single-image Novel-view Synthesizer in One Day Yifan Jiang et.al. 2310.03015 null
2023-10-04 Probing Intersectional Biases in Vision-Language Models with Counterfactual Examples Phillip Howard et.al. 2310.02988 null
2023-10-04 T $^3$ Bench: Benchmarking Current Progress in Text-to-3D Generation Yuze He et.al. 2310.02977 link
2023-10-04 Fast, Expressive SE $(n)$ Equivariant Networks through Weight-Sharing in Position-Orientation Space Erik J Bekkers et.al. 2310.02970 link
2023-10-04 Boosting Dermatoscopic Lesion Segmentation via Diffusion Models with Visual and Textual Prompts Shiyi Du et.al. 2310.02906 null
2023-10-04 Magicremover: Tuning-free Text-guided Image inpainting with Diffusion Models Siyuan Yang et.al. 2310.02848 null
2023-10-04 ED-NeRF: Efficient Text-Guided Editing of 3D Scene using Latent Space NeRF Jangho Park et.al. 2310.02712 null
2023-10-04 On Memorization in Diffusion Models Xiangming Gu et.al. 2310.02664 link
2023-10-05 MagicDrive: Street View Generation with Diverse 3D Geometry Control Ruiyuan Gao et.al. 2310.02601 null
2023-10-04 SweetDreamer: Aligning Geometric Priors in 2D Diffusion for Consistent Text-to-3D Weiyu Li et.al. 2310.02596 link
2023-10-04 Generalization in diffusion models arises from geometry-adaptive harmonic representation Zahra Kadkhodaie et.al. 2310.02557 link
2023-10-04 Prepare Ansatz for VQE with Diffusion Model Yilin Shen et.al. 2310.02511 null
2023-10-04 Learning to Reach Goals via Diffusion Vineet Jain et.al. 2310.02505 link
2023-10-03 FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models Yingqian Cui et.al. 2310.02401 null
2023-10-03 Generalized Schrödinger Bridge Matching Guan-Horng Liu et.al. 2310.02233 link
2023-10-03 A Variable Eddington Factor Model for Thermal Radiative Transfer with Closure based on Data-Driven Shape Function Joseph M. Coale et.al. 2310.02072 null
2023-10-03 Global Attractor for a Reaction-Diffusion Model Arising in Biological Dynamic in 3D Soil Structure Mohamed Elghandouri et.al. 2310.02060 null
2023-10-03 AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model Zibin Dong et.al. 2310.02054 null
2023-10-03 Amazing Combinatorial Creation: Acceptable Swap-Sampling for Text-to-Image Generation Jun Li et.al. 2310.01819 null
2023-10-02 LLM-grounded Video Diffusion Models Long Lian et.al. 2309.17444 null
2023-09-29 Directly Fine-Tuning Diffusion Models on Differentiable Rewards Kevin Clark et.al. 2309.17400 null
2023-09-29 Navigating the Design Space of Equivariant Diffusion-Based Generative Models for De Novo 3D Molecule Generation Tuan Le et.al. 2309.17296 null
2023-09-29 In search of dispersed memories: Generative diffusion models are associative memory networks Luca Ambrogioni et.al. 2309.17290 null
2023-09-29 Consistent123: One Image to Highly Consistent 3D Asset Using Case-Aware Diffusion Priors Yukang Lin et.al. 2309.17261 null
2023-09-29 ResBit: Residual Bit Vector for Categorical Values Masane Fuchi et.al. 2309.17196 null
2023-09-29 Advances in Kidney Biopsy Structural Assessment through Dense Instance Segmentation Zhan Xiong et.al. 2309.17166 null
2023-09-29 Reconstruction of Patient-Specific Confounders in AI-based Radiologic Image Interpretation using Generative Pretraining Tianyu Han et.al. 2309.17123 link
2023-09-29 Diffusion Models as Stochastic Quantization in Lattice Field Theory Lingxiao Wang et.al. 2309.17082 link
2023-09-29 DeeDiff: Dynamic Uncertainty-Aware Early Exiting for Accelerating Diffusion Model Generation Shengkun Tang et.al. 2309.17074 null
2023-09-29 ReFlow-TTS: A Rectified Flow Model for High-fidelity Text-to-Speech Wenhao Guan et.al. 2309.17056 null
2023-09-29 Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning Zihan Ding et.al. 2309.16984 link
2023-09-29 Leveraging Optimization for Adaptive Attacks on Image Watermarks Nils Lukas et.al. 2309.16952 link
2023-09-29 Denoising Diffusion Bridge Models Linqi Zhou et.al. 2309.16948 link
2023-09-28 SatDM: Synthesizing Realistic Satellite Image with Semantic Layout Conditioning using Diffusion Models Orkhan Baghirli et.al. 2309.16812 link
2023-09-28 Memory in Plain Sight: A Survey of the Uncanny Resemblances between Diffusion Models and Associative Memories Benjamin Hoover et.al. 2309.16750 null
2023-09-28 KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing Jiancheng Huang et.al. 2309.16608 null
2023-09-28 CCEdit: Creative and Controllable Video Editing via Diffusion Models Ruoyu Feng et.al. 2309.16496 null
2023-09-28 Distilling ODE Solvers of Diffusion Models into Smaller Steps Sanghwan Kim et.al. 2309.16421 null
2023-09-28 DeepPCR: Parallelizing Sequential Operations in Neural Networks Federico Danieli et.al. 2309.16318 null
2023-09-28 Long time behavior of the field-road diffusion model: an entropy method and a finite volume scheme Matthieu Alfaro et.al. 2309.16242 null
2023-09-28 Object Motion Guided Human Motion Synthesis Jiaman Li et.al. 2309.16237 null
2023-09-28 Compositional Sculpting of Iterative Generative Processes Timur Garipov et.al. 2309.16115 link
2023-09-27 High Perceptual Quality Wireless Image Delivery with Denoising Diffusion Models Selim F. Yilmaz et.al. 2309.15889 link
2023-09-27 Exploiting the Signal-Leak Bias in Diffusion Models Martin Nicolas Everaert et.al. 2309.15842 null
2023-09-27 Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation David Junhao Zhang et.al. 2309.15818 link
2023-09-27 Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack Xiaoliang Dai et.al. 2309.15807 null
2023-09-27 Factorized Diffusion Architectures for Unsupervised Image Generation and Segmentation Xin Yuan et.al. 2309.15726 null
2023-09-27 Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing Kai Wang et.al. 2309.15664 link
2023-09-27 Uncertainty Quantification via Neural Posterior Principal Components Elias Nehme et.al. 2309.15533 null
2023-09-27 High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models Chunyu Qiang et.al. 2309.15512 null
2023-09-27 DreamCom: Finetuning Text-guided Inpainting Model for Image Composition Lingxiao Lu et.al. 2309.15508 null
2023-09-27 LD4MRec: Simplifying and Powering Diffusion Model for Multimedia Recommendation Penghang Yu et.al. 2309.15363 null
2023-09-26 Learning Using Generated Privileged Information by Text-to-Image Diffusion Models Rafael-Edy Menadil et.al. 2309.15238 null
2023-09-27 LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models Yaohui Wang et.al. 2309.15103 link
2023-09-26 The ATM implied skew in the ADO-Heston model Andrey Itkin et.al. 2309.15044 null
2023-09-26 FEC: Three Finetuning-free Methods to Enhance Consistency for Real Image Editing Songyan Chen et.al. 2309.14934 null
2023-09-27 ITEM3D: Illumination-Aware Directional Texture Editing for 3D Models Shengqi Liu et.al. 2309.14872 null
2023-09-26 On a class of solvable stationary non equilibrium states for mass exchange models Monia Capanna et.al. 2309.14836 null
2023-09-26 Diffusion-based Holistic Texture Rectification and Synthesis Guoqing Hao et.al. 2309.14759 null
2023-09-26 On quantifying and improving realism of images generated with diffusion Yunzhuo Chen et.al. 2309.14756 null
2023-09-26 Text-image guided Diffusion Model for generating Deepfake celebrity interactions Yunzhuo Chen et.al. 2309.14751 null
2023-09-26 Bootstrap Diffusion Model Curve Estimation for High Resolution Low-Light Image Enhancement Jiancheng Huang et.al. 2309.14709 null
2023-09-26 Efficient Post-training Quantization with FP8 Formats Haihao Shen et.al. 2309.14592 link
2023-09-25 Bayesian parameter estimation for characterising mobile ion vacancies in perovskite solar cells Samuel G. McCallum et.al. 2309.14302 null
2023-09-25 Soft Mixture Denoising: Beyond the Expressive Bottleneck of Diffusion Models Yangming Li et.al. 2309.14068 null
2023-09-24 VoiceLDM: Text-to-Speech with Environmental Context Yeonghyeon Lee et.al. 2309.13664 null
2023-09-26 Adaptation of the super resolution SOTA for Art Restoration in camera capture images Sandeep Nagar et.al. 2309.13655 link
2023-09-23 Dream the Impossible: Outlier Imagination with Diffusion Models Xuefeng Du et.al. 2309.13415 link
2023-09-23 GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER Mingzhen Sun et.al. 2309.13274 link
2023-09-22 Invisible Watermarking for Audio Generation Diffusion Models Xirong Cao et.al. 2309.13166 link
2023-09-22 AntiBARTy Diffusion for Property Guided Antibody Design Jordan Venderley et.al. 2309.13129 null
2023-09-22 MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation Jiahao Xie et.al. 2309.13042 link
2023-09-22 Diffusion Augmentation for Sequential Recommendation Qidong Liu et.al. 2309.12858 link
2023-09-22 Synthetic Boost: Leveraging Synthetic Data for Enhanced Vision-Language Segmentation in Echocardiography Rabin Adhikari et.al. 2309.12829 link
2023-09-21 A Diffusion-Model of Joint Interactive Navigation Matthew Niedoba et.al. 2309.12508 null
2023-09-21 License Plate Super-Resolution Using Diffusion Models Sawsan AlHalawani et.al. 2309.12506 null
2023-09-21 Synthetic Image Detection: Highlights from the IEEE Video and Image Processing Cup 2022 Student Competition Davide Cozzolino et.al. 2309.12428 null
2023-09-21 Deshadow-Anything: When Segment Anything Model Meets Zero-shot shadow removal Xiao Feng Zhang et.al. 2309.11715 null
2023-09-24 Latent Diffusion Models for Structural Component Design Ethan Herron et.al. 2309.11601 null
2023-09-20 Light Field Diffusion for Single-View Novel View Synthesis Yifeng Xiong et.al. 2309.11525 null
2023-09-20 FreeU: Free Lunch in Diffusion U-Net Chenyang Si et.al. 2309.11497 link
2023-09-20 Generative Agent-Based Modeling: Unveiling Social System Dynamics through Coupling Mechanistic Models with Generative Artificial Intelligence Navid Ghaffarzadegan et.al. 2309.11456 null
2023-09-20 Deep Networks as Denoising Algorithms: Sample-Efficient Learning of Diffusion Models in High-Dimensional Graphical Models Song Mei et.al. 2309.11420 null
2023-09-20 Face Aging via Diffusion-based Editing Xiangyi Chen et.al. 2309.11321 link
2023-09-20 Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates Ka Chun Shum et.al. 2309.11281 link
2023-09-20 TwinTex: Geometry-aware Texture Generation for Abstracted 3D Architectural Models Weidan Xiong et.al. 2309.11258 null
2023-09-20 Investigating Personalization Methods in Text to Music Generation Manos Plitsis et.al. 2309.11140 link
2023-09-20 PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement Chengyou Jia et.al. 2309.11125 null
2023-09-19 Language-Conditioned Affordance-Pose Detection in 3D Point Clouds Toan Nguyen et.al. 2309.10911 null
2023-09-19 Assessing the capacity of a denoising diffusion probabilistic model to reproduce spatial context Rucha Deshpande et.al. 2309.10817 null
2023-09-19 PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance Peiqing Yang et.al. 2309.10810 link
2023-09-19 Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation Yatong Bai et.al. 2309.10740 link
2023-09-19 Reconstruct-and-Generate Diffusion Model for Detail-Preserving Image Denoising Yujin Wang et.al. 2309.10714 null
2023-09-19 Forgedit: Text Guided Image Editing via Learning and Forgetting Shiwen Zhang et.al. 2309.10556 link
2023-09-19 Towards Generative Modeling of Urban Flow through Knowledge-enhanced Denoising Diffusion Zhilun Zhou et.al. 2309.10547 link
2023-09-21 Learning End-to-End Channel Coding with Diffusion Models Muah Kim et.al. 2309.10505 null
2023-09-19 Unsupervised speech enhancement with diffusion-based generative models Berné Nortier et.al. 2309.10450 link
2023-09-19 Posterior sampling algorithms for unsupervised speech enhancement with recurrent variational autoencoder Mostafa Sadeghi et.al. 2309.10439 null
2023-09-19 AutoDiffusion: Training-Free Optimization of Time Steps and Architectures for Automated Diffusion Model Acceleration Lijiang Li et.al. 2309.10438 link
2023-09-19 $Γ$ -convergence of Nonlocal Dirichlet Energies With Penalty Formulations of Dirichlet Boundary Data Weiye Gan et.al. 2309.10352 null
2023-09-18 What is a Fair Diffusion Model? Designing Generative Text-To-Image Models to Incorporate Various Worldviews Zoe De Simone et.al. 2309.09944 link
2023-09-18 DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving Xiaofeng Wang et.al. 2309.09777 null
2023-09-18 Application-driven Validation of Posteriors in Inverse Problems Tim J. Adler et.al. 2309.09764 null
2023-09-18 Single and Few-step Diffusion for Generative Speech Enhancement Bunlong Lay et.al. 2309.09677 link
2023-09-18 Speeding Up Speech Synthesis In Diffusion Models By Reducing Data Distribution Recovery Steps Via Content Transfer Peter Ochieng et.al. 2309.09652 null
2023-09-18 Gradpaint: Gradient-Guided Inpainting with Diffusion Models Asya Grechka et.al. 2309.09614 null
2023-09-18 Causal-Story: Local Causal Attention Utilizing Parameter-Efficient Tuning For Visual Story Synthesis Tianyi Song et.al. 2309.09553 link
2023-09-18 Progressive Text-to-Image Diffusion with Soft Latent Direction YuTeng Ye et.al. 2309.09466 link
2023-09-17 Enhancing Knee Osteoarthritis severity level classification using diffusion augmented images Paleti Nikhil Chowdary et.al. 2309.09328 null
2023-09-17 PromptVC: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts Jixun Yao et.al. 2309.09262 null
2023-09-16 CDDM: Channel Denoising Diffusion Models for Wireless Semantic Communications Tong Wu et.al. 2309.08895 null
2023-09-15 Probabilistic Constellation Shaping With Denoising Diffusion Probabilistic Models: A Novel Approach Mehdi Letafati et.al. 2309.08688 null
2023-09-15 Compositional Foundation Models for Hierarchical Planning Anurag Ajay et.al. 2309.08587 null
2023-09-15 Denoising Diffusion Probabilistic Models for Hardware-Impaired Communications Mehdi Letafati et.al. 2309.08568 null
2023-09-15 Breathing New Life into 3D Assets with Generative Repainting Tianfu Wang et.al. 2309.08523 link
2023-09-15 Generalised Probabilistic Diffusion Scale-Spaces Pascal Peter et.al. 2309.08511 null
2023-09-15 Biological invasions and epidemics with nonlocal diffusion along a line Henri Berestycki et.al. 2309.08298 null
2023-09-15 Large Intestine 3D Shape Refinement Using Point Diffusion Models for Digital Phantom Generation Kaouther Mouheb et.al. 2309.08289 null
2023-09-15 Unsupervised Disentangling of Facial Representations with 3D-aware Latent Diffusion Models Ruian He et.al. 2309.08273 link
2023-09-15 Cartoondiff: Training-free Cartoon Image Generation with Diffusion Transformer Models Feihong He et.al. 2309.08251 null
2023-09-15 Large-Vocabulary 3D Diffusion Model with Transformer Ziang Cao et.al. 2309.07920 null
2023-09-14 Beta Diffusion Mingyuan Zhou et.al. 2309.07867 link
2023-09-14 EMOCONV-DIFF: Diffusion-based Speech Emotion Conversion for Non-parallel and In-the-wild Data Navin Raj Prabhu et.al. 2309.07828 null
2023-09-14 DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks Zipeng Qi et.al. 2309.07509 null
2023-09-14 Masked Diffusion with Task-awareness for Procedure Planning in Instructional Videos Fen Fang et.al. 2309.07409 link
2023-09-14 Semantic Adversarial Attacks via Diffusion Models Chenan Wang et.al. 2309.07398 link
2023-09-14 Beta quantile regression for robust estimation of uncertainty in the presence of outliers Haleh Akrami et.al. 2309.07374 null
2023-09-13 Unbiased Face Synthesis With Diffusion Models: Are We There Yet? Harrison Rosenberg et.al. 2309.07277 link
2023-09-13 Mitigate Replication and Copying in Diffusion Models with Generalized Caption and Dual Fusion Enhancement Chenghao Li et.al. 2309.07254 link
2023-09-13 Diffusion models for audio semantic communication Eleonora Grassucci et.al. 2309.07195 null
2023-09-13 UnifiedGesture: A Unified Gesture Synthesis Model for Multiple Skeletons Sicheng Yang et.al. 2309.07051 link
2023-09-13 VRDMG: Vocal Restoration via Diffusion Posterior Sampling with Multiple Guidance Carlos Hernandez-Olivan et.al. 2309.06934 null
2023-09-13 DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models Namhyuk Ahn et.al. 2309.06933 null
2023-09-13 DCTTS: Discrete Diffusion Model with Contrastive Learning for Text-to-speech Generation Zhichao Wu et.al. 2309.06787 null
2023-09-12 Adapt and Diffuse: Sample-adaptive Reconstruction via Latent Diffusion Models Zalan Fabian et.al. 2309.06642 link
2023-09-12 InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation Xingchao Liu et.al. 2309.06380 link
2023-09-12 Fg-T2M: Fine-Grained Text-Driven Human Motion Generation via Diffusion Model Yin Wang et.al. 2309.06284 null
2023-09-15 Spreading speeds of a nonlocal diffusion model with free boundaries in the time almost periodic media Chengcheng Cheng et.al. 2309.06190 null
2023-09-12 Dynamics and spreading speeds of a nonlocal diffusion model with advection and free boundaries Chengcheng Cheng et.al. 2309.06185 null
2023-09-12 Elucidating the solution space of extended reverse-time SDE for diffusion models Qinpeng Cui et.al. 2309.06169 link
2023-09-12 Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts Zhi-Yi Chin et.al. 2309.06135 link
2023-09-12 A monotone numerical integration method for mean-variance portfolio optimization under jump-diffusion models Hanwen Zhang et.al. 2309.05977 null
2023-09-12 Introducing Shape Prior Module in Diffusion Model for Medical Image Segmentation Zhiqing Zhang et.al. 2309.05929 null
2023-09-11 Predicting the Radiation Field of Molecular Clouds using Denoising Diffusion Probabilistic Models Duo Xu et.al. 2309.05811 null
2023-09-11 Revisiting Energy Based Models as Policies: Ranking Noise Contrastive Estimation and Interpolating Energy Models Sumeet Singh et.al. 2309.05803 null
2023-09-11 Diffusion-based Adversarial Purification for Robust Deep MRI Reconstruction Ismail Alkhouri et.al. 2309.05794 link
2023-09-11 PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models Li Chen et.al. 2309.05793 null
2023-09-11 CaloClouds II: Ultra-Fast Geometry-Independent Highly-Granular Calorimeter Simulation Erik Buhmann et.al. 2309.05704 link
2023-09-11 PAI-Diffusion: Constructing and Serving a Family of Open Chinese Diffusion Models for Text-to-image Synthesis on the Cloud Chengyu Wang et.al. 2309.05534 null
2023-09-14 Treatment-aware Diffusion Probabilistic Model for Longitudinal MRI Generation and Diffuse Glioma Growth Prediction Qinghui Liu et.al. 2309.05406 null
2023-09-11 Diff-Privacy: Diffusion-based Face Privacy Protection Xiao He et.al. 2309.05330 null
2023-09-10 Learning Energy-Based Models by Cooperative Diffusion Recovery Likelihood Yaxuan Zhu et.al. 2309.05153 link
2023-09-10 VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching Yiwei Guo et.al. 2309.05027 link
2023-09-10 SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models Shuchen Xue et.al. 2309.05019 link
2023-09-10 Prefix-diffusion: A Lightweight Diffusion Model for Diverse Image Captioning Guisheng Liu et.al. 2309.04965 null
2023-09-10 Seismic Data Strong Noise Attenuation Based on Diffusion Model and Principal Component Analysis Junheng Peng et.al. 2309.04944 link
2023-09-10 Text-driven Editing of 3D Scenes without Retraining Shuangkang Fang et.al. 2309.04917 link
2023-09-09 Global Convergence of Receding-Horizon Policy Search in Learning Estimator Designs Xiangyuan Zhang et.al. 2309.04831 link
2023-09-09 Influence Maximization in Social Networks: A Survey Hui Li et.al. 2309.04668 null
2023-09-08 The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion Yujin Jeong et.al. 2309.04509 null
2023-09-08 Create Your World: Lifelong Text-to-Image Diffusion Gan Sun et.al. 2309.04430 null
2023-09-08 MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask Yupeng Zhou et.al. 2309.04399 null
2023-09-08 MoEController: Instruction-based Arbitrary Image Manipulation with Mixture-of-Expert Controllers Sijia Li et.al. 2309.04372 null
2023-09-08 From Text to Mask: Localizing Entities Using the Attention of Text-to-Image Diffusion Models Changming Xiao et.al. 2309.04109 link
2023-09-07 DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection Manlin Zhang et.al. 2309.03893 null
2023-09-07 Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption Teng Hu et.al. 2309.03729 link
2023-09-07 DiffDefense: Defending against Adversarial Attacks via Diffusion Models Hondamunige Prasanna Silva et.al. 2309.03702 link
2023-09-07 Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model Sungwon Hwang et.al. 2309.03550 null
2023-09-07 Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation Jiaxi Gu et.al. 2309.03549 null
2023-09-07 SyncDreamer: Generating Multiview-consistent Images from a Single-view Image Yuan Liu et.al. 2309.03453 link
2023-09-07 Underwater Image Enhancement by Transformer-based Diffusion Model with Non-uniform Sampling for Skip Strategy Yi Tang et.al. 2309.03445 link
2023-09-07 Mean field limits of particle-based stochastic reaction-drift-diffusion models Max Heldman et.al. 2309.03431 null
2023-09-06 SADIR: Shape-Aware Diffusion Models for 3D Image Reconstruction Nivetha Jayakumar et.al. 2309.03335 null
2023-09-06 My Art My Choice: Adversarial Protection Against Unruly AI Anthony Rhodes et.al. 2309.03198 null
2023-09-06 Optical pulse induced ultrafast antiferrodistortive transition in SrTiO3 Saqeeb Adnan et.al. 2309.03172 null
2023-09-06 MCM: Multi-condition Motion Synthesis Framework for Multi-scenario Zeyu Ling et.al. 2309.03031 null
2023-09-06 Predicting the emergence of localised dihedral patterns in models for dryland vegetation Dan J. Hill et.al. 2309.02956 link
2023-09-06 Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter Jinglong Wang et.al. 2309.02773 link
2023-09-05 Generative AI-aided Joint Training-free Secure Semantic Communications via Multi-modal Prompts Hongyang Du et.al. 2309.02616 null
2023-09-05 Diffusion on the Probability Simplex Griffin Floto et.al. 2309.02530 null
2023-09-05 Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models Haixu Song et.al. 2309.02218 link
2023-09-05 Hierarchical Masked 3D Diffusion Model for Video Outpainting Fanda Fan et.al. 2309.02119 null
2023-09-05 Diffusion-based 3D Object Detection with Random Boxes Xin Zhou et.al. 2309.02049 null
2023-09-05 Diffusion Generative Inverse Design Marin Vlastelica et.al. 2309.02040 null
2023-09-05 sasdim: self-adaptive noise scaling diffusion model for spatial time series imputation Shunyang Zhang et.al. 2309.01988 null
2023-09-05 Efficient Bayesian Computational Imaging with a Surrogate Score-Based Prior Berthy T. Feng et.al. 2309.01949 link
2023-09-05 Gradient Domain Diffusion Models for Image Synthesis Yuanhao Gong et.al. 2309.01875 null
2023-09-04 Turbulent Flow Simulation using Autoregressive Conditional Diffusion Models Georg Kohl et.al. 2309.01745 link
2023-09-07 Generative-based Fusion Mechanism for Multi-Modal Tracking Zhangyong Tang et.al. 2309.01728 link
2023-09-04 ControlMat: A Controlled Generative Approach to Material Capture Giuseppe Vecchio et.al. 2309.01700 null
2023-09-07 Improving Visual Quality and Transferability of Adversarial Attacks on Face Recognition Simultaneously with Adversarial Restoration Fengfan Zhou et.al. 2309.01582 null
2023-09-04 DiffHPE: Robust, Coherent 3D Human Pose Lifting with Diffusion Cédric Rommel et.al. 2309.01575 null
2023-09-04 Image denoising in photon-counting CT using PFGM++ with hijacked regularized sampling Dennis Hein et.al. 2309.01553 link
2023-09-01 Iterative Multi-granular Image Editing using Diffusion Models K J Joseph et.al. 2309.00613 null
2023-09-01 VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation Xin Li et.al. 2309.00398 null
2023-09-01 Fast Diffusion EM: a diffusion model for blind inverse problems with application to deconvolution Charles Laroche et.al. 2309.00287 link
2023-09-01 DiffuGen: Adaptable Approach for Generating Labeled Image Datasets using Stable Diffusion Models Michael Shenoda et.al. 2309.00248 link
2023-09-01 Diffusion Model with Clustering-based Conditioning for Food Image Generation Yue Han et.al. 2309.00199 null
2023-09-01 Breakdown of the drift-diffusion model for transverse spin transport in a disordered Pt film K. D. Belashchenko et.al. 2309.00183 null
2023-08-31 BuilDiff: 3D Building Shape Generation using Single-Image Conditional Point Cloud Diffusion Models Yao Wei et.al. 2309.00158 null
2023-08-31 InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion Sirui Xu et.al. 2308.16905 link
2023-08-31 Diffusion Models for Interferometric Satellite Aperture Radar Alexandre Tuel et.al. 2308.16847 link
2023-08-31 Unsupervised CT Metal Artifact Reduction by Plugging Diffusion Priors in Dual Domains Xuan Liu et.al. 2308.16742 link
2023-08-31 Modelling of highly extended Gamma-ray emission around the Geminga Pulsar as detected with H.E.S.S A. M. W. Mitchell et.al. 2308.16669 null
2023-08-31 Generate Your Own Scotland: Satellite Image Generation Conditioned on Maps Miguel Espinosa et.al. 2308.16648 link
2023-08-31 MFR-Net: Multi-faceted Responsive Listening Head Generation via Denoising Diffusion Model Jin Liu et.al. 2308.16635 null
2023-08-31 Any-Size-Diffusion: Toward Efficient Text-Driven Synthesis for Any-Size HD Images Qingping Zheng et.al. 2308.16582 null
2023-08-31 Conditioning Score-Based Generative Models by Neuro-Symbolic Constraints Davide Scassola et.al. 2308.16534 link
2023-08-31 MVDream: Multi-view Diffusion for 3D Generation Yichun Shi et.al. 2308.16512 null
2023-08-30 A Recycling Training Strategy for Medical Image Segmentation with Diffusion Denoising Models Yunguan Fu et.al. 2308.16355 link
2023-08-30 Ten Years of Generative Adversarial Nets (GANs): A survey of the state-of-the-art Tanujit Chakraborty et.al. 2308.16316 null
2023-08-30 Modality Cycles with Masked Conditional Diffusion for Unsupervised Anomaly Segmentation in MRI Ziyun Liang et.al. 2308.16150 link
2023-08-30 SignDiff: Learning Diffusion Models for American Sign Language Production Sen Fang et.al. 2308.16082 null
2023-08-30 DiffuVolume: Diffusion Model for Volume based Stereo Matching Dian Zheng et.al. 2308.15989 null
2023-08-30 Stage-by-stage Wavelet Optimization Refinement Diffusion Model for Sparse-View CT Reconstruction Kai Xu et.al. 2308.15942 link
2023-08-30 Physics-Informed DeepMRI: Bridging the Gap from Heat Diffusion to k-Space Interpolation Zhuo-Xu Cui et.al. 2308.15918 null
2023-08-30 Zero-shot Inversion Process for Image Attribute Editing with Diffusion Models Zhanbo Feng et.al. 2308.15854 link
2023-08-30 A Dual-Zone Diffusion Model for High Energy Emissions of the Cygnus Cocoon Shihong Zhan et.al. 2308.15831 null
2023-08-30 Intriguing Properties of Diffusion Models: A Large-Scale Dataset for Evaluating Natural Attack Capability in Text-to-Image Generative Models Takami Sato et.al. 2308.15692 null
2023-08-30 Asymptotics for Short Maturity Asian Options in a Jump-Diffusion model with Local Volatility Dan Pirjol et.al. 2308.15672 null
2023-08-29 ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer Zachary Horvitz et.al. 2308.15459 link
2023-08-30 Elucidating the Exposure Bias in Diffusion Models Mang Ning et.al. 2308.15321 link
2023-08-29 DiffusionVMR: Diffusion Model for Video Moment Retrieval Henghao Zhao et.al. 2308.15109 null
2023-08-29 DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior Xinqi Lin et.al. 2308.15070 link
2023-08-29 C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model Longbin Ji et.al. 2308.15016 link
2023-08-28 Identifying and Mitigating the Security Risks of Generative AI Clark Barrett et.al. 2308.14840 null
2023-08-28 Generating tabular datasets under differential privacy Gianluca Truda et.al. 2308.14784 link
2023-08-30 Priority-Centric Human Motion Generation in Discrete Latent Space Hanyang Kong et.al. 2308.14480 null
2023-08-28 Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization Tao Yang et.al. 2308.14469 link
2023-08-28 Data-iterative Optimization Score Model for Stable Ultra-Sparse-View CT Reconstruction Weiwen Wu et.al. 2308.14437 null
2023-08-28 Steerable Conditional Diffusion for Out-of-Distribution Adaptation in Imaging Inverse Problems Riccardo Barbano et.al. 2308.14409 link
2023-08-28 InstructME: An Instruction Guided Music Edit And Remix Framework with Latent Diffusion Models Bing Han et.al. 2308.14360 null
2023-08-28 DiffSmooth: Certifiably Robust Learning via Diffusion Models and Local Smoothing Jiawei Zhang et.al. 2308.14333 link
2023-08-27 SketchDreamer: Interactive Text-Augmented Creative Sketch Ideation Zhiyu Qu et.al. 2308.14191 link
2023-08-27 Diffusion Schrödinger Bridges for Bayesian Computation Jeremy Heng et.al. 2308.14106 null
2023-08-27 Sparse3D: Distilling Multiview-Consistent Diffusion for Object Reconstruction from Sparse Views Zi-Xin Zou et.al. 2308.14078 null
2023-08-26 Unsupervised Domain Adaptation via Domain-Adaptive Diffusion Duo Peng et.al. 2308.13893 null
2023-08-26 The DiffuseStyleGesture+ entry to the GENEA Challenge 2023 Sicheng Yang et.al. 2308.13879 link
2023-08-26 Empowering Dynamics-aware Text-to-Video Diffusion with Large Language Models Hao Fei et.al. 2308.13812 null
2023-08-26 DiffI2I: Efficient Diffusion Model for Image-to-Image Translation Bin Xia et.al. 2308.13767 null
2023-08-25 Residual Denoising Diffusion Models Jiawei Liu et.al. 2308.13712 link
2023-08-25 Region-Disentangled Diffusion Model for High-Fidelity PPG-to-ECG Translation Debaditya Shome et.al. 2308.13568 link
2023-08-25 Distribution-Aligned Diffusion for Human Mesh Recovery Lin Geng Foo et.al. 2308.13369 null
2023-08-25 EfficientDreamer: High-Fidelity and Robust 3D Creation via Orthogonal-view Diffusion Prior Minda Zhao et.al. 2308.13223 link
2023-08-25 Diff-Retinex: Rethinking Low-light Image Enhancement with A Generative Diffusion Model Xunpeng Yi et.al. 2308.13164 null
2023-08-25 A Survey of Diffusion Based Image Generation Models: Issues and Their Solutions Tianyi Zhang et.al. 2308.13142 null
2023-08-24 Full-dose PET Synthesis from Low-dose PET Using High-efficiency Diffusion Denoising Probabilistic Model Shaoyan Pan et.al. 2308.13072 link
2023-08-24 Dense Text-to-Image Generation with Attention Modulation Yunji Kim et.al. 2308.12964 link
2023-08-24 Hydrogen jet diffusion modeling by using physics-informed graph neural network and sparsely-distributed sensor data Xinqi Zhang et.al. 2308.12621 null
2023-08-24 APLA: Additional Perturbation for Latent Noise with Adversarial Training Enables Consistency Yupu Yao et.al. 2308.12605 null
2023-08-23 Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion Junjiao Tian et.al. 2308.12469 link
2023-08-23 InverseSR: 3D Brain MRI Super-Resolution Using a Latent Diffusion Model Jueqi Wang et.al. 2308.12465 link
2023-08-23 Augmenting medical image classifiers with synthetic data from latent diffusion models Luke W. Sagers et.al. 2308.12453 null
2023-08-23 Renormalizing Diffusion Models Jordan Cotler et.al. 2308.12355 null
2023-08-23 Improving Generative Model-based Unfolding with Schrödinger Bridges Sascha Diefenbacher et.al. 2308.12351 link
2023-08-23 Score diffusion models without early stopping: finite Fisher information is all you need Giovanni Conforti et.al. 2308.12240 null
2023-08-25 Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning Jiasheng Ye et.al. 2308.12219 link
2023-08-23 Quantum-Noise-driven Generative Diffusion Models Marco Parigi et.al. 2308.12013 null
2023-08-23 High-quality Image Dehazing with Diffusion Model Hu Yu et.al. 2308.11949 link
2023-08-23 Efficient Transfer Learning in Diffusion Models via Adversarial Noise Xiyu Wang et.al. 2308.11948 null
2023-08-23 LongDanceDiff: Long-term Dance Generation with Conditional Diffusion Model Siqi Yang et.al. 2308.11945 null
2023-08-23 Boosting Diffusion Models with an Adaptive Momentum Sampler Xiyu Wang et.al. 2308.11941 null
2023-08-23 Audio Generation with Multiple Conditional Diffusion Model Zhifang Guo et.al. 2308.11940 null
2023-08-23 Shape-conditioned 3D Molecule Generation via Equivariant Diffusion Models Ziqi Chen et.al. 2308.11890 null
2023-08-22 IT3D: Improved Text-to-3D Generation with Explicit View Synthesis Yiwen Chen et.al. 2308.11473 link
2023-08-22 Convergence guarantee for consistency models Junlong Lyu et.al. 2308.11449 null
2023-08-22 MatFuse: Controllable Material Generation with Diffusion Models Giuseppe Vecchio et.al. 2308.11408 link
2023-08-22 MusicJam: Visualizing Music Insights via Generated Narrative Illustrations Chuer Chen et.al. 2308.11329 null
2023-08-22 DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment Xujie Zhang et.al. 2308.11206 null
2023-08-22 Hey That’s Mine Imperceptible Watermarks are Preserved in Diffusion Generated Outputs Luke Ditria et.al. 2308.11123 null
2023-08-21 TADA! Text to Animatable Digital Avatars Tingting Liao et.al. 2308.10899 null
2023-08-23 Backdooring Textual Inversion for Concept Censorship Yutong Wu et.al. 2308.10718 null
2023-08-21 EVE: Efficient zero-shot text-based Video Editing with Depth Map Guidance and Temporal Consistency Constraints Yutao Chen et.al. 2308.10648 null
2023-08-21 Frequency Compensated Diffusion Model for Real-scene Dehazing Jing Wang et.al. 2308.10510 link
2023-08-21 Texture Generation on 3D Meshes with Point-UV Diffusion Xin Yu et.al. 2308.10490 null
2023-08-21 DySuse: Susceptibility Estimation in Dynamic Social Networks Yingdan Shi et.al. 2308.10442 null
2023-08-21 Multi-GradSpeech: Towards Diffusion-based Multi-Speaker Text-to-speech Using Consistent Diffusion Models Heyang Xue et.al. 2308.10428 null
2023-08-20 Turning Waste into Wealth: Leveraging Low-Quality Samples for Enhancing Continuous Conditional Generative Adversarial Networks Xin Ding et.al. 2308.10273 link
2023-08-20 Make-It-4D: Synthesizing a Consistent Long-Term Dynamic Scene Video from a Single Image Liao Shen et.al. 2308.10257 null
2023-08-20 Spiking-Diffusion: Vector Quantized Discrete Diffusion Model with Spiking Neural Networks Mingxuan Liu et.al. 2308.10187 link
2023-08-20 Contrastive Diffusion Model with Auxiliary Guidance for Coarse-to-Fine PET Reconstruction Zeyu Han et.al. 2308.10157 link
2023-08-20 SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-form Layout-to-Image Generation Chengyou Jia et.al. 2308.10156 null
2023-08-20 Disorder-induced linear magnetoresistance in Al $_2$O$_3$/SrTiO$_3$ heterostructures Gao Kuang Hong et.al. 2308.10152 null
2023-08-19 MeDM: Mediating Image Diffusion Models for Video-to-Video Translation with Temporal Correspondence Guidance Ernie Chu et.al. 2308.10079 null
2023-08-19 ControlCom: Controllable Image Composition using Diffusion Model Bo Zhang et.al. 2308.10040 link
2023-08-19 AltDiffusion: A Multilingual Text-to-Image Diffusion Model Fulong Ye et.al. 2308.09991 link
2023-08-19 Physics-Guided Human Motion Capture with Pose Probability Modeling Jingyi Ju et.al. 2308.09910 link
2023-08-19 DiffusionTrack: Diffusion Model For Multi-Object Tracking Run Luo et.al. 2308.09905 link
2023-08-18 DiffCharge: Generating EV Charging Scenarios via a Denoising Diffusion Model Siyang Li et.al. 2308.09857 link
2023-08-18 Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization Soumik Mukhopadhyay et.al. 2308.09716 link
2023-08-16 TeCH: Text-guided Reconstruction of Lifelike Clothed Humans Yangyi Huang et.al. 2308.08545 link
2023-08-16 Diff-CAPTCHA: An Image-based CAPTCHA with Security Enhanced by Denoising Diffusion Model Ran Jiang et.al. 2308.08367 null
2023-08-18 Dual-Stream Diffusion Net for Text-to-Video Generation Binhui Liu et.al. 2308.08316 null
2023-08-15 Interplay between particle trapping and heterogeneity in anomalous diffusion Haroldo V. Ribeiro et.al. 2308.07989 null
2023-08-15 Monte Carlo guided Diffusion for Bayesian linear inverse problems Gabriel Cardoso et.al. 2308.07983 link
2023-08-15 StyleDiffusion: Controllable Disentangled Style Transfer via Diffusion Models Zhizhong Wang et.al. 2308.07863 null
2023-08-15 CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction Yan Di et.al. 2308.07837 null
2023-08-15 Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with Image Diffusion Model Bosheng Qin et.al. 2308.07749 null
2023-08-16 DiffGuard: Semantic Mismatch-Guided Out-of-Distribution Detection using Pre-trained Diffusion Models Ruiyuan Gao et.al. 2308.07687 link
2023-08-15 Maat: Performance Metric Anomaly Anticipation for Cloud Services with Conditional Diffusion Cheryl Lee et.al. 2308.07676 link
2023-08-15 Inversion-by-Inversion: Exemplar-based Sketch-to-Photo Synthesis via Stochastic Differential Equations without Training Ximing Xing et.al. 2308.07665 link
2023-08-15 SGDiff: A Style Guided Diffusion Model for Fashion Synthesis Zhengwentai Sun et.al. 2308.07605 link
2023-08-14 UniBrain: Unify Image Reconstruction and Captioning All in One Diffusion Model from Human Brain Activity Weijian Mai et.al. 2308.07428 null
2023-08-14 U-Turn Diffusion Hamidreza Behjoo et.al. 2308.07421 null
2023-08-14 DiffHopp: A Graph Diffusion Model for Novel Drug Design via Scaffold Hopping Jos Torge et.al. 2308.07416 link
2023-08-14 Jurassic World Remake: Bringing Ancient Fossils Back to Life via Zero-Shot Long Image-to-Image Translation Alexander Martin et.al. 2308.07316 link
2023-08-14 Bayesian Flow Networks Alex Graves et.al. 2308.07037 link
2023-08-14 Discrete Conditional Diffusion for Reranking in Recommendation Xiao Lin et.al. 2308.06982 null
2023-08-13 Well-posedness of a reaction-diffusion model with stochastic dynamical boundary conditions Mario Maurelli et.al. 2308.06847 null
2023-08-13 Shape-guided Conditional Latent Diffusion Models for Synthesising Brain Vasculature Yash Deo et.al. 2308.06781 null
2023-08-13 TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image Super-Resolution Baolin Liu et.al. 2308.06743 link
2023-08-13 Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks David Junhao Zhang et.al. 2308.06739 null
2023-08-13 Precipitation nowcasting with generative diffusion models Andrea Asperti et.al. 2308.06733 link
2023-08-13 CLE Diffusion: Controllable Light Enhancement Diffusion Model Yuyang Yin et.al. 2308.06725 null
2023-08-13 IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models Hu Ye et.al. 2308.06721 null
2023-08-13 LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts Binbin Yang et.al. 2308.06713 null
2023-08-12 Accelerating Diffusion-based Combinatorial Optimization Solvers by Progressive Distillation Junwei Huang et.al. 2308.06644 link
2023-08-12 CMR exploration II – filament identification with machine learning Duo Xu et.al. 2308.06641 null
2023-08-12 EquiDiff: A Conditional Equivariant Diffusion Model For Trajectory Prediction Kehua Chen et.al. 2308.06564 null
2023-08-11 White-box Membership Inference Attacks against Diffusion Models Yan Pang et.al. 2308.06405 null
2023-08-11 Mirror Diffusion Models Jaesung Tae et.al. 2308.06342 null
2023-08-11 DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models Weijia Wu et.al. 2308.06160 link
2023-08-11 Taming the Power of Diffusion Models for High-Quality Virtual Try-On with Appearance Flow Junhong Gou et.al. 2308.06101 link
2023-08-11 Head Rotation in Denoising Diffusion Models Andrea Asperti et.al. 2308.06057 link
2023-08-11 Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning Chun-Mei Feng et.al. 2308.06038 link
2023-08-10 AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining Haohe Liu et.al. 2308.05734 link
2023-08-10 PDE-Refiner: Achieving Accurate Long Rollouts with Neural PDE Solvers Phillip Lippe et.al. 2308.05732 null
2023-08-10 Masked Diffusion as Self-supervised Representation Learner Zixuan Pan et.al. 2308.05695 link
2023-08-10 Generative Diffusion Models for Radio Wireless Channel Modelling and Sampling Ushnish Sengupta et.al. 2308.05583 null
2023-08-10 Beyond Deep Reinforcement Learning: A Tutorial on Generative Diffusion Models in Network Optimization Hongyang Du et.al. 2308.05384 link
2023-08-09 Do Diffusion Models Suffer Error Propagation? Theoretical Analysis and Consistency Regularization Yangming Li et.al. 2308.05021 null
2023-08-10 IDiff-Face: Synthetic-based Face Recognition through Fizzy Identity-Conditioned Diffusion Models Fadi Boutros et.al. 2308.04995 link
2023-08-09 JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models Peike Li et.al. 2308.04729 null
2023-08-08 Semi-Supervised Semantic Segmentation of Cell Nuclei via Diffusion-based Large-Scale Pre-Training and Collaborative Learning Zhuchen Shao et.al. 2308.04578 null
2023-08-08 3D Scene Diffusion Guidance using Scene Graphs Mohammad Naanaa et.al. 2308.04468 null
2023-08-08 DiffCR: A Fast Conditional Diffusion Framework for Cloud Removal from Optical Satellite Images Xuechao Zou et.al. 2308.04417 link
2023-08-08 Cloth2Tex: A Customized Cloth Texture Generation Pipeline for 3D Virtual Try-On Daiheng Gao et.al. 2308.04288 null
2023-08-08 Synthetic Augmentation with Large-scale Unconditional Pre-training Jiarong Ye et.al. 2308.04020 link
2023-08-08 Target Speech Extraction with Conditional Diffusion Model Naoyuki Kamo et.al. 2308.03987 null
2023-08-07 A staggered-in-time and non-conforming-in-space numerical framework for realistic cardiac electrophysiology outputs Elena Zappon et.al. 2308.03884 null
2023-08-07 CaloDiffusion with GLaM for High Fidelity Calorimeter Simulation Oz Amram et.al. 2308.03876 link
2023-08-07 CaloScore v2: Single-shot Calorimeter Shower Simulation with Diffusion Models Vinicius Mikuni et.al. 2308.03847 link
2023-08-07 Linear Convergence Bounds for Diffusion Models via Stochastic Localization Joe Benton et.al. 2308.03686 null
2023-08-07 Diffusion Model in Causal Inference with Unmeasured Confounders Tatsuhiro Shimizu et.al. 2308.03669 link
2023-08-07 AvatarVerse: High-quality & Stable 3D Avatar Creation from Text and Pose Huichao Zhang et.al. 2308.03610 link
2023-08-10 DiffSynth: Latent In-Iteration Deflickering for Realistic Video Synthesis Zhongjie Duan et.al. 2308.03463 link
2023-08-07 Energy-Guided Diffusion Model for CBCT-to-CT Synthesis Linjie Fu et.al. 2308.03354 null
2023-08-06 Photorealistic and Identity-Preserving Image-Based Emotion Manipulation with Latent Diffusion Models Ioannis Pikoulis et.al. 2308.03183 link
2023-08-05 Generative Approach for Probabilistic Human Mesh Recovery using Diffusion Models Hanbyel Cho et.al. 2308.02963 link
2023-08-05 DermoSegDiff: A Boundary-aware Segmentation Diffusion Model for Skin Lesion Delineation Afshin Bozorgpour et.al. 2308.02959 link
2023-08-05 DiffDance: Cascaded Human Motion Diffusion Model for Dance Generation Qiaosong Qi et.al. 2308.02915 null
2023-08-05 Sketch and Text Guided Diffusion Model for Colored Point Cloud Generation Zijie Wu et.al. 2308.02874 null
2023-08-05 Thin On-Sensor Nanophotonic Array Cameras Praneeth Chakravarthula et.al. 2308.02797 null
2023-08-04 A geometric singular perturbation analysis of generalised shock selection rules in reaction-nonlinear diffusion models Bronwyn H Bradshaw-Hajek et.al. 2308.02719 null
2023-08-04 Diffusion-Augmented Depth Prediction with Sparse Annotations Jiaqi Li et.al. 2308.02283 null
2023-08-04 Painterly Image Harmonization using Diffusion Model Lingxiao Lu et.al. 2308.02228 link
2023-08-04 Towards Personalized Prompt-Model Retrieval for Generative Recommendation Yuanhe Guo et.al. 2308.02205 link
2023-08-04 Optimal Control of Stationary Doubly Diffusive Flows on Two and Three Dimensional Bounded Lipschitz Domains: A Theoretical Study Jai Tushar et.al. 2308.02178 null
2023-08-04 Improved Order Analysis and Design of Exponential Integrator for Diffusion Models Sampling Qinsheng Zhang et.al. 2308.02157 null
2023-08-04 SDDM: Score-Decomposed Diffusion Models on Manifolds for Unpaired Image-to-Image Translation Shikun Sun et.al. 2308.02154 null
2023-08-03 On the Biometric Capacity of Generative Face Models Vishnu Naresh Boddeti et.al. 2308.02065 null
2023-08-03 Diffusion Models for Counterfactual Generation and Anomaly Detection in Brain Images Alessandro Fontanella et.al. 2308.02062 link
2023-08-03 Synthesizing Long-Term Human Motions with Diffusion Models via Coherent Sampling Zhao Yang et.al. 2308.01850 link
2023-08-03 DiffColor: Toward High Fidelity Text-Guided Image Colorization with Diffusion Models Jianxin Lin et.al. 2308.01655 null
2023-08-03 Reference-Free Isotropic 3D EM Reconstruction using Diffusion Models Kyungryun Lee et.al. 2308.01594 null
2023-08-03 Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS Myeongjin Ko et.al. 2308.01573 link
2023-08-03 Motion Planning Diffusion: Learning and Planning of Robot Motions with Diffusion Models Joao Carvalho et.al. 2308.01557 null
2023-08-03 MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies Ke Chen et.al. 2308.01546 link
2023-08-02 Reverse Stable Diffusion: What prompt was used to generate this image? Florinel-Alin Croitoru et.al. 2308.01472 link
2023-08-02 Patched Denoising Diffusion Models For High-Resolution Image Synthesis Zheng Ding et.al. 2308.01316 link
2023-08-02 Contrast-augmented Diffusion Model with Fine-grained Sequence Alignment for Markup-to-Image Generation Guojin Zhong et.al. 2308.01147 link
2023-08-02 Exploiting Synthetic Data for Data Imbalance Problems: Baselines from a Data Perspective Moon Ye-Bin et.al. 2308.00994 null
2023-08-01 Radial Evolution in a Reaction-Diffusion Model Sofia M. Silveira et.al. 2308.00671 null
2023-08-01 Diffusion Model for Camouflaged Object Detection Zhennan Chen et.al. 2308.00303 null
2023-08-02 EC-Conf: An Ultra-fast Diffusion Model for Molecular Conformation Generation with Equivariant Consistency Zhiguang Fan et.al. 2308.00237 link
2023-07-31 DAVIS: High-Quality Audio-Visual Separation with Generative Diffusion Models Chao Huang et.al. 2308.00122 null
2023-08-02 Universal Adversarial Defense in Remote Sensing Based on Pre-trained Denoising Diffusion Models Weikang Yu et.al. 2307.16865 link
2023-07-31 DiffPose: SpatioTemporal Diffusion Model for Video-Based Human Pose Estimation Runyang Feng et.al. 2307.16687 null
2023-08-03 On the Trustworthiness Landscape of State-of-the-art Generative Models: A Comprehensive Survey Mingyuan Fan et.al. 2307.16680 null
2023-07-31 Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech Guangyan Zhang et.al. 2307.16679 null
2023-07-31 Contrastive Conditional Latent Diffusion for Audio-visual Segmentation Yuxin Mao et.al. 2307.16579 null
2023-07-31 DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial Training Hyung-Seok Oh et.al. 2307.16549 link
2023-07-31 Don’t be so negative! Score-based Generative Modeling with Oracle-assisted Guidance Saeid Naderiparizi et.al. 2307.16463 null
2023-07-31 MetaDiff: Meta-Learning with Conditional Diffusion for Few-Shot Learning Baoquan Zhang et.al. 2307.16424 null
2023-07-31 Mapping brain microstructure in vivo in health and disease using diffusion MRI Ying Liao et.al. 2307.16386 link
2023-07-31 MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text Junchen Zhu et.al. 2307.16371 null
2023-07-30 TransFusion: A Practical and Effective Transformer-based Diffusion Model for 3D Human Motion Prediction Sibo Tian et.al. 2307.16106 link
2023-07-29 UniBriVL: Robust Universal Representation and Generation of Audio Driven Diffusion Models Sen Fang et.al. 2307.15898 null
2023-07-29 Parameter identifiability in PDE models of fluorescence recovery after photobleaching Maria-Veronica Ciocanel et.al. 2307.15857 null
2023-07-28 Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding Chunyu Qiang et.al. 2307.15484 null
2023-07-27 Generative AI for Medical Imaging: extending the MONAI Framework Walter H. L. Pinaya et.al. 2307.15208 link
2023-07-27 LLDiffusion: Learning Degradation Representations in Diffusion Models for Low-Light Image Enhancement Tao Wang et.al. 2307.14659 link
2023-07-29 Imitating Complex Trajectories: Bridging Low-Level Stability and High-Level Behavior Adam Block et.al. 2307.14619 null
2023-07-26 Visual Instruction Inversion: Image Editing via Visual Prompting Thao Nguyen et.al. 2307.14331 link
2023-07-26 Founding a mathematical diffusion model in linguistics. The case study of German syntactic features in the North-Eastern Italian dialects I. Lazzizzera et.al. 2307.14291 null
2023-07-26 VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by Using Diffusion Model with ControlNet Zhihao Hu et.al. 2307.14073 null
2023-07-27 Pre-Training with Diffusion models for Dental Radiography segmentation Jérémy Rousseau et.al. 2307.14066 null
2023-07-26 MCMC-Correction of Score-Based Diffusion Models for Model Composition Anders Sjöberg et.al. 2307.14012 link
2023-07-26 How Does Diffusion Influence Pretrained Language Models on Out-of-Distribution Data? Huazheng Wang et.al. 2307.13949 link
2023-07-26 Points-to-3D: Bridging the Gap between Sparse Points and Shape-Controllable Text-to-3D Generation Chaohui Yu et.al. 2307.13908 null
2023-07-25 **Composite Diffusion whole >= Σparts** Vikram Jamwal et.al. 2307.13720
2023-07-25 Score-based Diffusion Models for Generating Liquid Argon Time Projection Chamber Images Zeviel Imani et.al. 2307.13687 link
2023-07-25 Fake It Without Making It: Conditioned Face Generation for Accurate 3D Face Shape Estimation Will Rowan et.al. 2307.13639 null
2023-07-25 XDLM: Cross-lingual Diffusion Language Model for Machine Translation Linyao Chen et.al. 2307.13560 null
2023-07-25 Not with my name! Inferring artists’ names of input strings employed by Diffusion Models Roberto Leotta et.al. 2307.13527 link
2023-07-25 Modelling functionalized drug release for a spherical capsule Elliot J. Carr et.al. 2307.13224 link
2023-07-24 Deep Learning Approaches for Data Augmentation in Medical Imaging: A Review Aghiles Kebaili et.al. 2307.13125 null
2023-07-24 Data-free Black-box Attack based on Diffusion Model Mingwen Shao et.al. 2307.12872 link
2023-07-24 Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry Yong-Hyun Park et.al. 2307.12868 link
2023-07-24 TransFusion: Generating Long, High Fidelity Time Series using Diffusion Models with Transformers Md Fahim Sikder et.al. 2307.12667 link
2023-07-24 Interpolating between Images with Diffusion Models Clinton J. Wang et.al. 2307.12560 null
2023-07-24 AdvDiff: Generating Unrestricted Adversarial Examples using Diffusion Models Xuelong Dai et.al. 2307.12499 link
2023-07-25 TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition Shilin Lu et.al. 2307.12493 link
2023-07-25 ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting Zongsheng Yue et.al. 2307.12348 link
2023-07-23 TabADM: Unsupervised Tabular Anomaly Detection with Diffusion Models Guy Zamberg et.al. 2307.12336 null
2023-07-23 An axiomatized PDE model of deep neural networks Tangjun Wang et.al. 2307.12333 null
2023-07-22 PLANTAIN: Diffusion-inspired Pose Score Minimization for Fast and Accurate Molecular Docking Michael Brocidiacono et.al. 2307.12090 link
2023-07-22 Iterative Reconstruction Based on Latent Diffusion Model for Sparse Data Reconstruction Linchao He et.al. 2307.12070 null
2023-07-22 FSDiffReg: Feature-wise and Score-wise Diffusion-guided Unsupervised Deformable Image Registration for Cardiac Images Yi Qin et.al. 2307.12035 link
2023-07-21 PartDiff: Image Super-resolution with Partial Diffusion Models Kai Zhao et.al. 2307.11926 null
2023-07-21 Learning minimal representations of stochastic processes with variational autoencoders Gabriel Fernández-Fernández et.al. 2307.11608 link
2023-07-21 Predict, Refine, Synthesize: Self-Guiding Diffusion Models for Probabilistic Time Series Forecasting Marcel Kollovieh et.al. 2307.11494 link
2023-07-21 Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning Jian Ma et.al. 2307.11410 link
2023-07-20 Dehazing Ultrasound using Diffusion Models Tristan S. W. Stevens et.al. 2307.11204 null
2023-07-20 Diffusion Models for Probabilistic Deconvolution of Galaxy Images Zhiwei Xue et.al. 2307.11122 link
2023-07-20 Diffusion Sampling with Momentum for Mitigating Divergence Artifacts Suttisak Wizadwongsa et.al. 2307.11118 link
2023-07-20 Progressive distillation diffusion for raw music generation Svetlana Pavlova et.al. 2307.10994 null
2023-07-20 Structure-preserving schemes for drift-diffusion systems on general meshes: DDFV vs HFV Stella Krell et.al. 2307.10911 null
2023-07-20 BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion Jinheng Xie et.al. 2307.10816 link
2023-07-21 AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models Jiachun Pan et.al. 2307.10711 link
2023-07-20 Reference-based Painterly Inpainting via Diffusion: Crossing the Wild Reference Domain Gap Dejia Xu et.al. 2307.10584 null
2023-07-19 PreDiff: Precipitation Nowcasting with Latent Diffusion Models Zhihan Gao et.al. 2307.10422 link
2023-07-19 TokenFlow: Consistent Diffusion Features for Consistent Video Editing Michal Geyer et.al. 2307.10373 null
2023-07-19 Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls Lejun Min et.al. 2307.10304 link
2023-07-18 Modeling pattern formation in communities by using information particles Junichi Miyakoshi et.al. 2307.10270 null
2023-07-19 FABRIC: Personalizing Diffusion Models with Iterative Feedback Dimitri von Rütte et.al. 2307.10159 link
2023-07-19 Make-A-Volume: Leveraging Latent Diffusion Models for Cross-Modality 3D Brain MRI Synthesis Lingting Zhu et.al. 2307.10094 null
2023-07-19 Modelling the Spatial Spread of COVID-19 in aGerman District using a Diffusion Model Moritz Schäfer et.al. 2307.09956 null
2023-07-19 BSDM: Background Suppression Diffusion Model for Hyperspectral Anomaly Detection Jitao Ma et.al. 2307.09861 link
2023-07-19 A Siamese-based Verification System for Open-set Architecture Attribution of Synthetic Images Lydia Abady et.al. 2307.09822 link
2023-07-19 DiffDP: Radiotherapy Dose Prediction via a Diffusion Model Zhenghao Feng et.al. 2307.09794 null
2023-07-19 Text2Layer: Layered Image Generation using Latent Diffusion Model Xinyang Zhang et.al. 2307.09781 null
2023-07-18 An approximate maximum likelihood estimator of drift parameters in a multidimensional diffusion model Miljenko Huzak et.al. 2307.09199 null
2023-07-18 DiTTO: Diffusion-inspired Temporal Transformer Operator Oded Ovadia et.al. 2307.09072 null
2023-07-18 Towards Authentic Face Restoration with Iterative Diffusion Models and Beyond Yang Zhao et.al. 2307.08996 null
2023-07-17 Autoregressive Diffusion Model for Graph Generation Lingkai Kong et.al. 2307.08849 null
2023-07-17 Diffusion Models Beat GANs on Image Classification Soumik Mukhopadhyay et.al. 2307.08702 null
2023-07-17 SEMI-DiffusionInst: A Diffusion Model Based Approach for Semiconductor Defect Classification and Segmentation Vic De Ridder et.al. 2307.08693 null
2023-07-17 Identity-Preserving Aging of Face Images via Latent Diffusion Models Sudipta Banerjee et.al. 2307.08585 link
2023-07-17 Synthetic Lagrangian Turbulence by Generative Diffusion Models Tianyi Li et.al. 2307.08529 link
2023-07-17 Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation Luozhou Wang et.al. 2307.08448 link
2023-07-18 Unstoppable Attack: Label-Only Model Inversion via Conditional Diffusion Model Rongke Liu et.al. 2307.08424 null
2023-07-17 Complexity Matters: Rethinking the Latent Space for Generative Modeling Tianyang Hu et.al. 2307.08283 null
2023-07-17 Manifold-Guided Sampling in Diffusion Models for Unbiased Image Generation Xingzhe Su et.al. 2307.08199 null
2023-07-16 Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency Bowen Song et.al. 2307.08123 link
2023-07-16 Discovering a reaction-diffusion model for Alzheimer’s disease by combining PINNs with symbolic regression Zhen Zhang et.al. 2307.08107 null
2023-07-16 Diffusion to Confusion: Naturalistic Adversarial Patch Generation Based on Diffusion Model for Object Detector Shuo-Yen Lin et.al. 2307.08076 null
2023-07-16 LafitE: Latent Diffusion Model with Feature Editing for Unsupervised Multi-class Anomaly Detection Haonan Yin et.al. 2307.08059 null
2023-07-16 Noise-aware Speech Enhancement using Diffusion Probabilistic Model Yuchen Hu et.al. 2307.08029 link
2023-07-15 ExposureDiffusion: Learning to Expose for Low-light Image Enhancement Yufei Wang et.al. 2307.07710 link
2023-07-14 NIFTY: Neural Object Interaction Fields for Guided Human Motion Synthesis Nilesh Kulkarni et.al. 2307.07511 null
2023-07-14 Inverse Evolution Layers: Physics-informed Regularizers for Deep Neural Networks Chaoyu Liu et.al. 2307.07344 null
2023-07-14 Multimodal Motion Conditioned Diffusion Model for Skeleton-based Video Anomaly Detection Alessandro Flaborea et.al. 2307.07205 link
2023-07-14 Federated Learning-Empowered AI-Generated Content in Wireless Networks Xumin Huang et.al. 2307.07146 null
2023-07-13 Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement Hui Yuan et.al. 2307.07055 null
2023-07-13 HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models Nataniel Ruiz et.al. 2307.06949 null
2023-07-14 PC-Droid: Faster diffusion and improved quality for particle cloud generation Matthew Leigh et.al. 2307.06836 null
2023-07-13 AvatarFusion: Zero-shot Generation of Clothing-Decoupled 3D Avatars Using 2D Diffusion Shuo Huang et.al. 2307.06526 null
2023-07-13 Improving Nonalcoholic Fatty Liver Disease Classification Performance With Latent Diffusion Models Romain Hardy et.al. 2307.06507 null
2023-07-12 Exposing the Fake: Effective Diffusion-Generated Images Detection Ruipeng Ma et.al. 2307.06272 null
2023-07-12 Diffusion Based Multi-Agent Adversarial Tracking Sean Ye et.al. 2307.06244 null
2023-07-12 Towards Safe Self-Distillation of Internet-Scale Text-to-Image Diffusion Models Sanghyun Kim et.al. 2307.05977 link
2023-07-11 WHFast512: A symplectic N-body integrator for planetary systems optimized with AVX512 instructions Pejvak Javaheri et.al. 2307.05683 link
2023-07-07 AutoDecoding Latent 3D Diffusion Models Evangelos Ntavelis et.al. 2307.05445 link
2023-07-11 Metropolis Sampling for Constrained Diffusion Models Nic Fishman et.al. 2307.05439 null
2023-07-11 Geometric Neural Diffusion Processes Emile Mathieu et.al. 2307.05431 link
2023-07-11 On the Vulnerability of DeepFake Detectors to Attacks Generated by Denoising Diffusion Models Marija Ivanovska et.al. 2307.05397 null
2023-07-11 Diffusion idea exploration for art generation Nikhil Verma et.al. 2307.04978 null
2023-07-10 Articulated 3D Head Avatar Generation using Text-to-Image Diffusion Models Alexander W. Bergman et.al. 2307.04859 null
2023-07-10 Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback Jaskirat Singh et.al. 2307.04749 null
2023-07-10 Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning Suzan Ece Ada et.al. 2307.04726 null
2023-07-10 AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning Yuwei Guo et.al. 2307.04725 link
2023-07-10 Timbre transfer using image-to-image denoising diffusion models Luca Comanducci et.al. 2307.04586 null
2023-07-10 Enhancing Adversarial Robustness via Score-Based Optimization Boya Zhang et.al. 2307.04333 link
2023-07-11 DIFF-NST: Diffusion Interleaving For deFormable Neural Style Transfer Dan Ruta et.al. 2307.04157 null
2023-07-08 Measuring the Success of Diffusion Models at Imitating Human Artists Stephen Casper et.al. 2307.04028 null
2023-07-08 Stimulating the Diffusion Model for Image Denoising via Adaptive Embedding and Ensembling Tong Li et.al. 2307.03992 link
2023-07-07 Nonresonant scattering of energetic electrons by electromagnetic ion cyclotron waves: spacecraft observations and theoretical framework Xin An et.al. 2307.03795 null
2023-07-07 Unsupervised 3D out-of-distribution detection with latent diffusion models Mark S. Graham et.al. 2307.03777 link
2023-07-07 IPO-LDM: Depth-aided 360-degree Indoor RGB Panorama Outpainting via Latent Diffusion Model Tianhao Wu et.al. 2307.03177 null
2023-07-06 Patterning of nonlocal transport models in biology: the impact of spatial dimension Thomas Jun Jewell et.al. 2307.03117 null
2023-07-06 How to Detect Unauthorized Data Usages in Text-to-image Diffusion Models Zhenting Wang et.al. 2307.03108 link
2023-07-06 On the Cultural Gap in Text-to-Image Generation Bingshuai Liu et.al. 2307.02971 null
2023-07-06 Probabilistic and Semantic Descriptions of Image Manifolds and Their Applications Peter Tu et.al. 2307.02881 null
2023-07-06 A Critical Look at the Current Usage of Foundation Model for Dense Recognition Task Shiqi Yang et.al. 2307.02862 null
2023-07-06 Censored Sampling of Diffusion Models Using 3 Minutes of Human Feedback TaeHo Yoon et.al. 2307.02770 link
2023-07-06 Towards Symmetry-Aware Generation of Periodic Materials Youzhi Luo et.al. 2307.02707 link
2023-07-06 Applying a Color Palette with Local Control using Diffusion Models Vaibhav Vavilala et.al. 2307.02698 link
2023-07-05 Pattern formation and bifurcation analysis of delay induced fractional-order epidemic spreading on networks Jiaying Zhou et.al. 2307.02669 null
2023-07-05 Diffusion Models for Computational Design at the Example of Floor Plans Joern Ploennigs et.al. 2307.02511 link
2023-07-05 DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models Chong Mou et.al. 2307.02421 link
2023-07-05 RADiff: Controllable Diffusion Models for Radio Astronomical Maps Generation Renato Sortino et.al. 2307.02392 null
2023-07-05 Detecting Images Generated by Deep Diffusion Models using their Local Intrinsic Dimensionality Peter Lorenz et.al. 2307.02347 link
2023-07-05 SVDM: Single-View Diffusion Model for Pseudo-Stereo 3D Object Detection Yuguang Shi et.al. 2307.02270 null
2023-07-05 Self-supervised learning with diffusion-based multichannel speech enhancement for speaker verification under noisy conditions Sandipana Dowerah et.al. 2307.02244 null
2023-07-05 DiffFlow: A Unified SDE Framework for Score-Based Diffusion Models and Generative Adversarial Networks Jingwei Zhang et.al. 2307.02159 null
2023-07-05 Prompting Diffusion Representations for Cross-Domain Semantic Segmentation Rui Gong et.al. 2307.02138 null
2023-07-05 Monte Carlo Sampling without Isoperimetry: A Reverse Diffusion Approach Xunpeng Huang et.al. 2307.02037 null
2023-07-04 Hybrid Neural Diffeomorphic Flow for Shape Representation and Generation via Triplane Kun Han et.al. 2307.01957 null
2023-07-04 SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis Dustin Podell et.al. 2307.01952 link
2023-07-04 ProtoDiffusion: Classifier-Free Diffusion Guidance with Prototype Learning Gulcin Baykal et.al. 2307.01924 link
2023-07-04 Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning Xiang Li et.al. 2307.01849 link
2023-07-04 Stochastic and self-consistent 3D modeling of streamer discharge trees with Kinetic Monte Carlo Robert Marskar et.al. 2307.01797 link
2023-07-04 On the Constrained Time-Series Generation Problem Andrea Coletta et.al. 2307.01717 null
2023-07-04 Disentanglement in a GAN for Unconditional Speech Synthesis Matthew Baas et.al. 2307.01673 link
2023-07-04 SwinGNN: Rethinking Permutation Invariance in Diffusion Models for Graph Generation Qi Yan et.al. 2307.01646 link
2023-07-04 Unsupervised Video Anomaly Detection with Diffusion Models Conditioned on Compact Motion Representations Anil Osman Tur et.al. 2307.01533 link
2023-07-04 LEAT: Towards Robust Deepfake Disruption in Real-World Scenarios via Latent Ensemble Attack Joonkyo Shim et.al. 2307.01520 null
2023-07-04 Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning Zhuoran Li et.al. 2307.01472 null
2023-07-03 Squeezing Large-Scale Diffusion Models for Mobile Jiwoong Choi et.al. 2307.01193 null
2023-06-30 Practical and Asymptotically Exact Conditional Sampling in Diffusion Models Luhuan Wu et.al. 2306.17775 link
2023-06-30 Content-Preserving Diffusion Model for Unsupervised AS-OCT image Despeckling Li Sanqian et.al. 2306.17717 null
2023-06-30 Counting Guidance for High Fidelity Text-to-Image Synthesis Wonjun Kang et.al. 2306.17567 null
2023-06-30 Class-Incremental Learning using Diffusion Model for Distillation and Replay Quentin Jodelet et.al. 2306.17560 null
2023-06-29 Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models Simian Luo et.al. 2306.17203 link
2023-06-29 Generate Anything Anywhere in Any Scene Yuheng Li et.al. 2306.17154 null
2023-06-29 Filtered-Guided Diffusion: Fast Filter Guidance for Black-Box Diffusion Models Zeqi Gu et.al. 2306.17141 link
2023-06-29 ID-Pose: Sparse-view Camera Pose Estimation by Inverting Diffusion Models Weihao Cheng et.al. 2306.17140 null
2023-07-03 Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation Zibo Zhao et.al. 2306.17115 link
2023-06-29 Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation Zhongwei Qiu et.al. 2306.17074 null
2023-06-29 One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization Minghua Liu et.al. 2306.16928 link
2023-06-28 PFB-Diff: Progressive Feature Blending Diffusion for Text-driven Image Editing Wenjing Huang et.al. 2306.16894 link
2023-06-29 SaGess: Sampling Graph Denoising Diffusion Model for Scalable Graph Generation Stratis Limnios et.al. 2306.16827 null
2023-06-29 Graph Denoising Diffusion for Inverse Protein Folding Kai Yi et.al. 2306.16819 link
2023-06-29 DiffusionSTR: Diffusion Model for Scene Text Recognition Masato Fujitake et.al. 2306.16707 null
2023-06-29 Self-Supervised MRI Reconstruction with Unrolled Diffusion Models Yilmaz Korkmaz et.al. 2306.16654 link
2023-06-28 DoseDiff: Distance-aware Diffusion Model for Dose Prediction in Radiotherapy Yiwen Zhang et.al. 2306.16324 link
2023-06-28 SVNR: Spatially-variant Noise Removal with Denoising Diffusion Naama Pearl et.al. 2306.16052 null
2023-06-28 GeXSe (Generative Explanatory Sensor System): An Interpretable Deep Generative Model for Human Activity Recognition in Smart Spaces Yuan Sun et.al. 2306.15857 null
2023-06-27 Easing Color Shifts in Score-Based Diffusion Models Katherine Deck et.al. 2306.15832 link
2023-06-26 Restart Sampling for Improving Generative Processes Yilun Xu et.al. 2306.14878 link
2023-06-26 ProtoDiff: Learning to Learn Prototypical Networks by Task-Guided Diffusion Yingjun Du et.al. 2306.14770 link
2023-06-26 DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models Ximing Xing et.al. 2306.14685 link
2023-06-26 A-STAR: Test-time Attention Segregation and Retention for Text-to-image Synthesis Aishwarya Agarwal et.al. 2306.14544 null
2023-06-27 DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing Yujun Shi et.al. 2306.14435 link
2023-06-26 Decompose and Realign: Tackling Condition Misalignment in Text-to-Image Diffusion Models Luozhou Wang et.al. 2306.14408 link
2023-06-25 CDiffMR: Can We Replace the Gaussian Noise with K-Space Undersampling for Fast MRI? Jiahao Huang et.al. 2306.14350 link
2023-06-25 Diffusion Model Based Low-Light Image Enhancement for Space Satellite Yiman Zhu et.al. 2306.14227 null
2023-06-25 DomainStudio: Fine-Tuning Diffusion Models for Domain-Driven Image Generation using Limited Data Jingyuan Zhu et.al. 2306.14153 null
2023-06-25 YOLO-based Semantic Communication with Generative AI-aided Resource Allocation for Digital Twins Construction Baoxia Du et.al. 2306.14138 null
2023-06-25 DiffMix: Diffusion Model-based Data Synthesis for Nuclei Segmentation and Classification in Imbalanced Pathology Image Datasets Hyun-Jic Oh et.al. 2306.14132 null
2023-06-24 SEEDS: Emulation of Weather Forecast Ensembles with Diffusion Models Lizao Li et.al. 2306.14066 null
2023-06-24 DiffDTM: A conditional structure-free framework for bioactive molecules generation targeted for dual proteins Lei Huang et.al. 2306.13957 null
2023-06-23 The role of convection in the limit shape of the critical front profile for Born-Infeld diffusion models Maurizio Garrione et.al. 2306.13806 null
2023-06-23 Asymptotic study of critical wave fronts for parameter-dependent Born-Infeld models: physically predicted behaviors and new phenomena Maurizio Garrione et.al. 2306.13788 null
2023-06-23 Zero-shot spatial layout conditioning for text-to-image diffusion models Guillaume Couairon et.al. 2306.13754 null
2023-06-23 Decoupled Diffusion Models with Explicit Transition Probability Yuhang Huang et.al. 2306.13720 link
2023-06-23 DreamEditor: Text-Driven 3D Scene Editing with Neural Fields Jingyu Zhuang et.al. 2306.13455 link
2023-06-23 DiffInfinite: Large Mask-Image Synthesis via Parallel Random Patch Diffusion in Histopathology Marco Aversa et.al. 2306.13384 link
2023-06-22 Directional diffusion models for graph representation learning Run Yang et.al. 2306.13210 null
2023-06-22 Continuous Layout Editing of Single Images with Diffusion Models Zhiyuan Zhang et.al. 2306.13078 null
2023-06-22 Towards More Realistic Membership Inference Attacks on Large Diffusion Models Jan Dubiński et.al. 2306.12983 null
2023-06-22 DiffWA: Diffusion Models for Watermark Attack Xinyu Li et.al. 2306.12790 null
2023-06-22 A prior regularized full waveform inversion using generative diffusion models Fu Wang et.al. 2306.12776 null
2023-06-22 One at A Time: Multi-step Volumetric Probability Distribution Diffusion for Depth Estimation Bohan Li et.al. 2306.12681 null
2023-06-23 Semi-Implicit Denoising Diffusion Models (SIDDMs) Yanwu Xu et.al. 2306.12511 link
2023-06-21 DreamTime: An Improved Optimization Strategy for Text-to-3D Content Creation Yukun Huang et.al. 2306.12422 null
2023-06-21 Diffusion Posterior Sampling for Informed Single-Channel Dereverberation Jean-Marie Lemercier et.al. 2306.12286 link
2023-06-21 HumanDiffusion: diffusion model using perceptual gradients Yota Ueda et.al. 2306.12169 null
2023-06-21 DiffuseIR:Diffusion Models For Isotropic Reconstruction of 3D Microscopic Images Mingjie Pan et.al. 2306.12109 null
2023-06-21 HSR-Diff:Hyperspectral Image Super-Resolution via Conditional Diffusion Models Chanyue Wu et.al. 2306.12085 null
2023-06-21 Ambigram Generation by A Diffusion Model Takahiro Shirakawa et.al. 2306.12049 link
2023-06-22 Corrector Operator to Enhance Accuracy and Reliability of Neural Operator Surrogates of Nonlinear Variational Boundary-Value Problems Prashant K. Jha et.al. 2306.12047 null
2023-06-21 TauPETGen: Text-Conditional Tau PET Image Synthesis Based on Latent Diffusion Models Se-In Jang et.al. 2306.11984 null
2023-06-20 Mercury’s chaotic secular evolution as a subdiffusive process Dorian S. Abbot et.al. 2306.11870 null
2023-06-20 Exploring the Effectiveness of Dataset Synthesis: An application of Apple Detection in Orchards Alexander van Meekeren et.al. 2306.11763 null
2023-06-20 Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning Huiguo He et.al. 2306.11731 null
2023-06-20 Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision Ayush Tewari et.al. 2306.11719 null
2023-06-20 Improving visual image reconstruction from human brain activity using latent diffusion models via multiple decoded inputs Yu Takagi et.al. 2306.11536 link
2023-06-20 Align, Adapt and Inject: Sound-guided Unified Image Generation Yue Yang et.al. 2306.11504 null
2023-06-20 EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model Lianying Yin et.al. 2306.11496 null
2023-06-20 Hierarchical GNNs for Large Graph Generation Alex O. Davies et.al. 2306.11412 null
2023-06-20 Masked Diffusion Models are Fast Learners Jiachen Lei et.al. 2306.11363 link
2023-06-20 RS5M: A Large Scale Vision-Language Dataset for Remote Sensing Vision-Language Foundation Model Zilun Zhang et.al. 2306.11300 link
2023-06-20 Eliminating Lipschitz Singularities in Diffusion Models Zhantao Yang et.al. 2306.11251 null
2023-06-19 GD-VDM: Generated Depth for better Diffusion-based Video Generation Ariel Lapid et.al. 2306.11173 link
2023-06-16 Group Orthogonalization Regularization For Vision Models Adaptation and Robustness Yoav Kurtz et.al. 2306.10001 link
2023-06-16 Towards Better Certified Segmentation via Diffusion Models Othmane Laousy et.al. 2306.09949 link
2023-06-16 Drag-guided diffusion models for vehicle image generation Nikos Arechiga et.al. 2306.09935 null
2023-06-16 Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models Geon Yeong Park et.al. 2306.09869 link
2023-06-16 AvatarBooth: High-Quality and Customizable 3D Human Avatar Generation Yifei Zeng et.al. 2306.09864 null
2023-06-16 Understanding Deep Generative Models with Generalized Empirical Likelihoods Suman Ravuri et.al. 2306.09780 link
2023-06-16 The Big Data Myth: Using Diffusion Models for Dataset Generation to Train Deep Detection Models Roy Voetman et.al. 2306.09762 null
2023-06-16 CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models Hao-Wen Dong et.al. 2306.09635 null
2023-06-15 Edit-DiffNeRF: Editing 3D Neural Radiance Fields using 2D Diffusion Model Lu Yu et.al. 2306.09551 null
2023-06-15 Hierarchical Planning and Control for Box Loco-Manipulation Zhaoming Xie et.al. 2306.09532 null
2023-06-15 R2-Diff: Denoising by diffusion as a refinement of retrieved motion for image-based motion prediction Takeru Oba et.al. 2306.09483 null
2023-06-15 Generative Proxemics: A Prior for 3D Social Interaction from Images Lea Müller et.al. 2306.09337 link
2023-06-19 ArtFusion: Controllable Arbitrary Style Transfer using Dual Conditional Latent Diffusion Models Dar-Yen Chen et.al. 2306.09330 link
2023-06-15 Diffusion Models for Zero-Shot Open-Vocabulary Segmentation Laurynas Karazija et.al. 2306.09316 null
2023-06-15 Fast Training of Diffusion Models with Masked Transformers Hongkai Zheng et.al. 2306.09305 link
2023-06-15 A Score-based Nonlinear Filter for Data Assimilation Feng Bao et.al. 2306.09282 null
2023-06-15 Conditional Human Sketch Synthesis with Explicit Abstraction Control Dar-Yen Chen et.al. 2306.09274 null
2023-06-15 Towards Faster Non-Asymptotic Convergence for Diffusion-Based Generative Models Gen Li et.al. 2306.09251 null
2023-06-15 Training Diffusion Classifiers with Denoising Assistance Chandramouli Sastry et.al. 2306.09192 null
2023-06-15 DIFFender: Diffusion-Based Adversarial Defense against Patch Attacks in the Physical World Caixin Kang et.al. 2306.09124 link
2023-06-15 Relation-Aware Diffusion Model for Controllable Poster Layout Generation Fengheng Li et.al. 2306.09086 link
2023-06-15 Parameterizing Vertical Mixing Coefficients in the Ocean Surface Boundary Layer using Neural Networks Aakash Sane et.al. 2306.09045 null
2023-06-15 Annotator Consensus Prediction for Medical Image Segmentation with Diffusion Models Tomer Amit et.al. 2306.09004 link
2023-06-15 When Hyperspectral Image Classification Meets Diffusion Models: An Unsupervised Feature Learning Framework Jingyi Zhou et.al. 2306.08964 link
2023-06-15 RecFusion: A Binomial Diffusion Process for 1D Data for Recommendation Gabriel Bénédict et.al. 2306.08947 link
2023-06-15 Linguistic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment Royi Rassin et.al. 2306.08877 link
2023-06-15 OMS-DPM: Optimizing the Model Schedule for Diffusion Probabilistic Models Enshu Liu et.al. 2306.08860 link
2023-06-14 InfoDiffusion: Representation Learning Using Information Maximizing Diffusion Models Yingheng Wang et.al. 2306.08757 null
2023-06-14 VidEdit: Zero-Shot and Spatially Aware Text-Driven Video Editing Paul Couairon et.al. 2306.08707 null
2023-06-14 GHP-MOFassemble: Diffusion modeling, high throughput screening, and molecular dynamics for rational discovery of novel metal-organic frameworks for carbon capture at scale Hyun Park et.al. 2306.08695 link
2023-06-14 Norm-guided latent space exploration for text-to-image generation Dvir Samuel et.al. 2306.08687 link
2023-06-13 Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation Shuai Yang et.al. 2306.07954 null
2023-06-13 Viewset Diffusion: (0-)Image-Conditioned 3D Generative Models from 2D Data Stanislaw Szymanowicz et.al. 2306.07881 null
2023-06-13 StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models Yinghao Aaron Li et.al. 2306.07691 link
2023-06-15 Hyperbolic Graph Diffusion Model for Molecule Generation Lingfeng Wen et.al. 2306.07618 link
2023-06-13 Paste, Inpaint and Harmonize via Denoising: Subject-Driven Image Editing with Pre-Trained Diffusion Model Xin Zhang et.al. 2306.07596 null
2023-06-13 User-defined Event Sampling and Uncertainty Quantification in Diffusion Models for Physical Dynamical Systems Marc Finzi et.al. 2306.07526 null
2023-06-13 Multi-objective Molecular Optimization for Opioid Use Disorder Treatment Using Generative Network Complex Hongsong Feng et.al. 2306.07484 null
2023-06-13 3D molecule generation by denoising voxel grids Pedro O. Pinheiro et.al. 2306.07473 link
2023-06-12 Controlling Text-to-Image Diffusion by Orthogonal Finetuning Zeju Qiu et.al. 2306.07280 null
2023-06-12 MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images Junchen Zhu et.al. 2306.07257 null
2023-06-12 Diffusion Models for Black-Box Optimization Siddarth Krishnamoorthy et.al. 2306.07180 link
2023-06-12 InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions Jiale Xu et.al. 2306.07154 null
2023-06-12 Fast Diffusion Model Zike Wu et.al. 2306.06991 link
2023-06-13 VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion Models Sheng-Yen Chou et.al. 2306.06874 link
2023-06-12 HiddenSinger: High-Quality Singing Voice Synthesis via Neural Audio Codec and Latent Diffusion Models Ji-Sang Hwang et.al. 2306.06814 null
2023-06-11 Stable Remaster: Bridging the Gap Between Old Content and New Displays Nathan Paull et.al. 2306.06803 link
2023-06-10 How movement bias to attractive regions determines population spread and critical habitat size Vivian Dornelas et.al. 2306.06450 link
2023-06-10 Language-Guided Traffic Simulation via Scene-Level Diffusion Ziyuan Zhong et.al. 2306.06344 null
2023-06-09 Boosting GUI Prototyping with Diffusion Models Jialiang Wei et.al. 2306.06233 null
2023-06-09 Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene Descriptions Ian Huang et.al. 2306.06212 link
2023-06-09 Extraction and Recovery of Spatio-Temporal Structure in Latent Dynamics Alignment with Diffusion Model Yule Wang et.al. 2306.06138 link
2023-06-09 Beyond Diffusion: A Generalized Mean-Field Theory of Turbulent Dust Transport in Protoplanetary Disks Fabian Binkert et.al. 2306.06103 null
2023-06-09 Beyond Surface Statistics: Scene Representations in a Latent Diffusion Model Yida Chen et.al. 2306.05720 link
2023-06-12 Boosting Fast and High-Quality Speech Synthesis with Linear Diffusion Haogeng Liu et.al. 2306.05708 null
2023-06-09 RePaint-NeRF: NeRF Editting via Semantic Masks and Diffusion Models Xingchen Zhou et.al. 2306.05668 null
2023-06-08 BOOT: Data-free Distillation of Denoising Diffusion Models with Bootstrapping Jiatao Gu et.al. 2306.05544 null
2023-06-08 Grounded Text-to-Image Synthesis with Attention Refocusing Quynh Phung et.al. 2306.05427 null
2023-06-08 Stochastic Multi-Person 3D Motion Forecasting Sirui Xu et.al. 2306.05421 link
2023-06-08 PriSampler: Mitigating Property Inference of Diffusion Models Hailong Hu et.al. 2306.05208 null
2023-06-08 A cognitive process approach to modeling gap acceptance in overtaking Samir H. A. Mohammad et.al. 2306.05203 null
2023-06-08 SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions Yuseung Lee et.al. 2306.05178 null
2023-06-08 Non-autoregressive Conditional Diffusion Models for Time Series Prediction Lifeng Shen et.al. 2306.05043 null
2023-06-08 Multi-Architecture Multi-Expert Diffusion Models Yunsung Lee et.al. 2306.04990 null
2023-06-08 Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning Jifeng Hu et.al. 2306.04875 null
2023-06-09 Complexity-aware Large Scale Origin-Destination Network Generation via Diffusion Model Can Rong et.al. 2306.04873 null
2023-06-08 Ground states for aggregation-diffusion models on Cartan-Hadamard manifolds Razvan C. Fetecau et.al. 2306.04856 null
2023-06-08 Interpreting and Improving Diffusion Models Using the Euclidean Distance Function Frank Permenter et.al. 2306.04848 link
2023-06-07 WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models Changhoon Kim et.al. 2306.04744 link
2023-06-07 ConceptBed: Evaluating Concept Learning Abilities of Text-to-Image Diffusion Models Maitreya Patel et.al. 2306.04695 link
2023-06-07 Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models George Stein et.al. 2306.04675 link
2023-06-07 Designing a Better Asymmetric VQGAN for StableDiffusion Zixin Zhu et.al. 2306.04632 link
2023-06-07 ARTIC3D: Learning Robust Articulated 3D Shapes from Noisy Web Image Collections Chun-Han Yao et.al. 2306.04619 null
2023-06-09 Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt Kai Chen et.al. 2306.04607 null
2023-06-07 On the Design Fundamentals of Diffusion Models: A Survey Ziyi Chang et.al. 2306.04542 null
2023-06-07 Multi-modal Latent Diffusion Mustapha Bounoua et.al. 2306.04445 link
2023-06-07 Improving Diffusion-based Image Translation using Asymmetric Gradient Guidance Gihyun Kwon et.al. 2306.04396 link
2023-06-07 Generative Semantic Communication: Diffusion Models Beyond Bit Recovery Eleonora Grassucci et.al. 2306.04321 link
2023-06-07 A Survey on Generative Diffusion Models for Structured Data Heejoon Koo et.al. 2306.04139 null
2023-06-07 Phoenix: A Federated Generative Diffusion Model Fiona Victoria Stanley Jothiraj et.al. 2306.04098 null
2023-06-07 Professional Basketball Player Behavior Synthesis via Planning with Diffusion Xiusi Chen et.al. 2306.04090 link
2023-06-06 A machine learning potential-based generative algorithm for on-lattice crystal structure prediction Vadim Sotskov et.al. 2306.03989 null
2023-06-06 High-dimensional and Permutation Invariant Anomaly Detection Vinicius Mikuni et.al. 2306.03933 link
2023-06-06 Emergent Correspondence from Image Diffusion Luming Tang et.al. 2306.03881 link
2023-06-06 Conditional Diffusion Models for Weakly Supervised Medical Image Segmentation Xinrong Hu et.al. 2306.03878 link
2023-06-06 Towards Visual Foundational Models of Physical Scenes Chethan Parameshwara et.al. 2306.03727 null
2023-06-06 Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias Ziyue Jiang et.al. 2306.03509 null
2023-06-08 DFormer: Diffusion-guided Transformer for Universal Image Segmentation Hefeng Wang et.al. 2306.03437 link
2023-06-06 Protecting the Intellectual Property of Diffusion Models by the Watermark Diffusion Process Sen Peng et.al. 2306.03436 link
2023-06-06 Change Diffusion: Change Detection Map Generation Based on Difference-Feature Guided DDPM Yihan Wen et.al. 2306.03424 link
2023-06-08 DreamSparse: Escaping from Plato’s Cave with 2D Diffusion Model Given Sparse Views Paul Yoo et.al. 2306.03414 null
2023-06-05 Brain Diffusion for Visual Exploration: Cortical Discovery using Large Scale Generative Models Andrew F. Luo et.al. 2306.03089 null
2023-06-05 HeadSculpt: Crafting 3D Head Avatars with Text Xiao Han et.al. 2306.03038 null
2023-06-05 Brain tumor segmentation using synthetic MR images – A comparison of GANs and diffusion models Muhammad Usman Akbar et.al. 2306.02986 link
2023-06-05 Complex Preferences for Different Convergent Priors in Discrete Graph Diffusion Alex M. Tseng et.al. 2306.02957 null
2023-06-05 INDigo: An INN-Guided Probabilistic Diffusion Algorithm for Inverse Problems Di You et.al. 2306.02949 null
2023-06-05 Instruct-Video2Avatar: Video-to-Avatar Generation with Instructions Shaoxu Li et.al. 2306.02903 link
2023-06-06 Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark Shuyu Yang et.al. 2306.02898 link
2023-06-05 User-friendly Image Editing with Minimal Text Input: Leveraging Captioning and Injection Techniques Sunwoo Kim et.al. 2306.02717 null
2023-06-05 Faster Training of Diffusion Models and Improved Density Estimation via Parallel Score Matching Etrit Haxholli et.al. 2306.02658 null
2023-06-05 Physics-Informed Kernel Function Neural Networks for Solving Partial Differential Equations Zhuojia Fu et.al. 2306.02606 null
2023-06-05 Video Diffusion Models with Local-Global Context Guidance Siyuan Yang et.al. 2306.02562 link
2023-06-05 PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model Yizhe Zhang et.al. 2306.02531 link
2023-06-04 Spear or Shield: Leveraging Generative AI to Tackle Security Threats of Intelligent Network Services Hongyang Du et.al. 2306.02384 null
2023-06-04 Temporal Dynamic Quantization for Diffusion Models Junhyuk So et.al. 2306.02316 null
2023-06-04 Detector Guidance for Multi-Object Text-to-Image Generation Luping Liu et.al. 2306.02236 link
2023-06-03 Training Data Attribution for Diffusion Models Zheng Dai et.al. 2306.02174 link
2023-06-03 Efficient Text-Guided 3D-Aware Portrait Generation with Score Distillation Sampling on Distribution Yiji Cheng et.al. 2306.02083 null
2023-06-03 Exploring the Optimal Choice for Generative Processes in Diffusion Models: Ordinary vs Stochastic Differential Equations Yu Cao et.al. 2306.02063 null
2023-06-03 DYffusion: A Dynamics-informed Diffusion Model for Spatiotemporal Forecasting Salva Rühling Cachay et.al. 2306.01984 link
2023-06-02 Generative Autoencoders as Watermark Attackers: Analyses of Vulnerabilities and Threats Xuandong Zhao et.al. 2306.01953 link
2023-06-02 Video Colorization with Pre-trained Text-to-Image Diffusion Models Hanyuan Liu et.al. 2306.01732 null
2023-06-02 Denoising Diffusion Semantic Segmentation with Mask Prior Modeling Zeqiang Lai et.al. 2306.01721 link
2023-06-02 DiffusEmp: A Diffusion Model-Based Framework with Multi-Grained Control for Empathetic Response Generation Guanqun Bi et.al. 2306.01657 null
2023-06-02 PolyDiffuse: Polygonal Shape Reconstruction via Guided Set Diffusion Models Jiacheng Chen et.al. 2306.01461 link
2023-06-02 Zero-Shot Blind Audio Bandwidth Extension Eloi Moliner et.al. 2306.01433 link
2023-06-02 Audio-Visual Speech Enhancement with Score-Based Generative Models Julius Richter et.al. 2306.01432 null
2023-06-02 Quantifying Sample Anonymity in Score-Based Generative Models with Adversarial Fingerprinting Mischa Dombrowski et.al. 2306.01363 null
2023-06-02 Privacy Distillation: Reducing Re-identification Risk of Multimodal Diffusion Models Virginia Fernandez et.al. 2306.01322 null
2023-06-02 Diffusion Self-Guidance for Controllable Image Generation Dave Epstein et.al. 2306.00986 null
2023-06-01 SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds Yanyu Li et.al. 2306.00980 link
2023-06-01 Intriguing Properties of Text-guided Diffusion Models Qihao Liu et.al. 2306.00974 link
2023-06-01 Intelligent Grimm – Open-ended Visual Storytelling via Latent Diffusion Models Chang Liu et.al. 2306.00973 link
2023-06-01 ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation Shaozhe Hao et.al. 2306.00971 link
2023-06-01 The Hidden Language of Diffusion Models Hila Chefer et.al. 2306.00966 link
2023-06-01 Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image Generation Minghui Hu et.al. 2306.00964 null
2023-06-01 Differential Diffusion: Giving Each Pixel Its Strength Eran Levin et.al. 2306.00950 link
2023-06-01 Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance Jinbo Xing et.al. 2306.00943 null
2023-06-01 Inserting Anybody in Diffusion Models via Celeb Basis Ge Yuan et.al. 2306.00926 link
2023-06-01 Conditioning Diffusion Models via Attributes and Semantic Masks for Face Generation Nico Giambi et.al. 2306.00914 null
2023-06-01 Robust Backdoor Attack with Visible, Semantic, Sample-Specific, and Compatible Triggers Ruotong Wang et.al. 2306.00816 null
2023-06-01 UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning Xiao Dong et.al. 2306.00813 null
2023-06-01 FDNeRF: Semantics-Driven Face Reconstruction, Prompt Editing and Relighting with Diffusion Models Hao Zhang et.al. 2306.00783 link
2023-06-01 UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model Anastasiia Iashchenko et.al. 2306.00721 link
2023-06-01 EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis Haobin Tang et.al. 2306.00648 null
2023-06-01 AvatarStudio: Text-driven Editing of 3D Dynamic Human Head Avatars Mohit Mendiratta. Xingang Pan et.al. 2306.00547 null
2023-06-01 Image generation with shortest path diffusion Ayan Das et.al. 2306.00501 link
2023-06-01 Random advection-diffusion models and their statistics Stefano Lepri et.al. 2306.00463 null
2023-06-01 Controllable Motion Diffusion Model Yi Shi et.al. 2306.00416 link

semantic segmentation

Publish Date Title Authors PDF Code
2025-06-30 Controllable Reference-Based Real-World Remote Sensing Image Super-Resolution with Generative Diffusion Priors Ce Wang et.al. 2506.23801 null
2025-06-30 Deep Learning-Based Semantic Segmentation for Real-Time Kidney Imaging and Measurements with Augmented Reality-Assisted Ultrasound Gijs Luijten et.al. 2506.23721 null
2025-06-30 PGOV3D: Open-Vocabulary 3D Semantic Segmentation with Partial-to-Global Curriculum Shiqi Zhang et.al. 2506.23607 null
2025-06-30 Interactive Interface For Semantic Segmentation Dataset Synthesis Ngoc-Do Tran et.al. 2506.23470 null
2025-06-30 Contrastive Learning with Diffusion Features for Weakly Supervised Medical Image Segmentation Dewen Zeng et.al. 2506.23460 null
2025-06-29 Layer Decomposition and Morphological Reconstruction for Task-Oriented Infrared Image Enhancement Siyuan Chai et.al. 2506.23353 null
2025-06-29 FastSeg: Efficient Training-Free Open-Vocabulary Segmentation via Hierarchical Attention Refinement Method Quang-Huy Che et.al. 2506.23323 null
2025-06-29 BPD-Neo: An MRI Dataset for Lung-Trachea Segmentation with Clinical Data for Neonatal Bronchopulmonary Dysplasia Rachit Saluja et.al. 2506.23305 null
2025-06-29 High-quality Pseudo-labeling for Point Cloud Segmentation with Scene-level Annotation Lunhao Duan et.al. 2506.23227 null
2025-06-28 Probabilistic Prototype Calibration of Vision-Language Models for Generalized Few-shot Semantic Segmentation Jie Liu et.al. 2506.22979 null
2025-06-28 Region-Aware CAM: High-Resolution Weakly-Supervised Defect Segmentation via Salient Region Perception Hang-Cheng Dong et.al. 2506.22866 null
2025-06-28 Unleashing the Multi-View Fusion Potential: Noise Correction in VLM for Open-Vocabulary 3D Scene Understanding Xingyilang Yin et.al. 2506.22817 null
2025-06-27 Dual Atrous Separable Convolution for Improving Agricultural Semantic Segmentation Chee Mei Ling et.al. 2506.22570 null
2025-06-27 Partial CLIP is Enough: Chimera-Seg for Zero-shot Semantic Segmentation Jialei Chen et.al. 2506.22032 null
2025-06-27 TASeg: Text-aware RGB-T Semantic Segmentation based on Fine-tuning Vision Foundation Models Meng Yu et.al. 2506.21975 null
2025-06-27 SDRNET: Stacked Deep Residual Network for Accurate Semantic Segmentation of Fine-Resolution Remotely Sensed Images Naftaly Wambugu et.al. 2506.21945 null
2025-06-26 Towards Reliable Detection of Empty Space: Conditional Marked Point Processes for Object Detection Tobias J. Riedlinger et.al. 2506.21486 null
2025-06-27 ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation Xiwei Xuan et.al. 2506.21233 null
2025-06-26 Performance improvement of spatial semantic segmentation with enriched audio features and agent-based error correction for DCASE 2025 Challenge Task 4 Jongyeon Park et.al. 2506.21174 null
2025-06-27 DidSee: Diffusion-Based Depth Completion for Material-Agnostic Robotic Perception and Manipulation Wenzhou Lyu et.al. 2506.21034 null
2025-06-26 TSDASeg: A Two-Stage Model with Direct Alignment for Interactive Point Cloud Segmentation Chade Li et.al. 2506.20991 null
2025-06-26 Segment Anything in Pathology Images with Natural Language Zhixuan Chen et.al. 2506.20988 null
2025-06-25 U-R-VEDA: Integrating UNET, Residual Links, Edge and Dual Attention, and Vision Transformer for Accurate Semantic Segmentation of CMRs Racheal Mukisa et.al. 2506.20689 null
2025-06-25 Building Lightweight Semantic Segmentation Models for Aerial Images Using Dual Relation Distillation Minglong Li et.al. 2506.20688 null
2025-06-25 A Deep Learning Approach to Identify Rock Bolts in Complex 3D Point Clouds of Underground Mines Captured Using Mobile Laser Scanners Dibyayan Patra et.al. 2506.20464 null
2025-06-26 Towards Scalable and Generalizable Earth Observation Data Mining via Foundation Model Composition Man Duc Chuc et.al. 2506.20174 null
2025-06-24 A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects Shulan Ruan et.al. 2506.19769 null
2025-06-24 A Global-Local Cross-Attention Network for Ultra-high Resolution Remote Sensing Image Semantic Segmentation Chen Yi et.al. 2506.19406 null
2025-06-25 AnchorDP3: 3D Affordance Guided Sparse Diffusion Policy for Robotic Manipulation Ziyan Zhao et.al. 2506.19269 null
2025-06-23 Orthogonal Projection Subspace to Aggregate Online Prior-knowledge for Continual Test-time Adaptation Jinlong Li et.al. 2506.19022 null
2025-06-23 Multi-Scale Spectral Attention Module-based Hyperspectral Segmentation in Autonomous Driving Scenarios Imad Ali Shah et.al. 2506.18682 null
2025-06-22 OSDMamba: Enhancing Oil Spill Detection from Remote Sensing Images Using Selective State Space Model Shuaiyu Chen et.al. 2506.18006 null
2025-06-22 Cross-modal State Space Modeling for Real-time RGB-thermal Wild Scene Semantic Segmentation Xiaodong Guo et.al. 2506.17869 null
2025-06-20 ForestFormer3D: A Unified Framework for End-to-End Segmentation of Forest LiDAR 3D Point Clouds Binbin Xiang et.al. 2506.16991 null
2025-06-19 From Semantic To Instance: A Semi-Self-Supervised Learning Approach Keyhan Najafian et.al. 2506.16563 null
2025-06-19 Structured Semantic 3D Reconstruction (S23DR) Challenge 2025 – Winning solution Jan Skvrna et.al. 2506.16421 null
2025-06-19 LBMamba: Locally Bi-directional Mamba Jingwei Zhang et.al. 2506.15976 null
2025-06-19 Heterogeneous-Modal Unsupervised Domain Adaptation via Latent Space Bridging Jiawen Yang et.al. 2506.15971 null
2025-06-19 Polyline Path Masked Attention for Vision Transformer Zhongchen Zhao et.al. 2506.15940 link
2025-06-18 MapFM: Foundation Model-Driven HD Mapping with Multi-Task Contextual Learning Leonid Ivanov et.al. 2506.15313 link
2025-06-18 Enhancing point cloud analysis via neighbor aggregation correction based on cross-stage structure correlation Jiaqi Shi et.al. 2506.15160 link
2025-06-17 Scaling-Up the Pretraining of the Earth Observation Foundation Model PhilEO to the MajorTOM Dataset Nikolaos Dionelis et.al. 2506.14765 link
2025-06-17 VisLanding: Monocular 3D Perception for UAV Safe Landing via Depth-Normal Synergy Zhuoyue Tan et.al. 2506.14525 null
2025-06-17 DepthSeg: Depth prompting in remote sensing semantic segmentation Ning Zhou et.al. 2506.14382 null
2025-06-16 HierVL: Semi-Supervised Segmentation leveraging Hierarchical Vision-Language Synergy with Dynamic Text-Spatial Query Alignment Numair Nadeem et.al. 2506.13925 null
2025-06-16 A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects Guohuan Xie et.al. 2506.13552 null
2025-06-16 Open-Set LiDAR Panoptic Segmentation Guided by Uncertainty-Aware Learning Rohit Mohan et.al. 2506.13265 null
2025-06-16 ViewPCL: a point cloud based active learning method for multi-view segmentation Christian Hilaire et.al. 2506.13043 null
2025-06-15 A large-scale, physically-based synthetic dataset for satellite pose estimation Szabolcs Velkei et.al. 2506.12782 null
2025-06-15 Unleashing Diffusion and State Space Models for Medical Image Segmentation Rong Wu et.al. 2506.12747 null
2025-06-15 Combining Self-attention and Dilation Convolutional for Semantic Segmentation of Coal Maceral Groups Zhenghao Xi et.al. 2506.12712 null
2025-06-13 A $^2$ LC: Active and Automated Label Correction for Semantic Segmentation Youjin Jeon et.al. 2506.11599 null
2025-06-12 GynSurg: A Comprehensive Gynecology Laparoscopic Surgery Dataset Sahar Nasirihaghighi et.al. 2506.11356 null
2025-06-11 FARCLUSS: Fuzzy Adaptive Rebalancing and Contrastive Uncertainty Learning for Semi-Supervised Semantic Segmentation Ebenezer Tarubinga et.al. 2506.11142 link
2025-06-12 Description and Discussion on DCASE 2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes Masahiro Yasuda et.al. 2506.10676 link
2025-06-12 Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models Francisco Caetano et.al. 2506.10634 null
2025-06-12 Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration Jun Wang et.al. 2506.10573 null
2025-06-12 Semantic Localization Guiding Segment Anything Model For Reference Remote Sensing Image Segmentation Shuyang Li et.al. 2506.10503 null
2025-06-12 Demonstrating Multi-Suction Item Picking at Scale via Multi-Modal Learning of Pick Success Che Wang et.al. 2506.10359 null
2025-06-11 Deep Semantic Segmentation for Multi-Source Localization Using Angle of Arrival Measurements Mustafa Atahan Nuhoglu et.al. 2506.10107 null
2025-06-11 Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation Siyu Chen et.al. 2506.09881 link
2025-06-11 The Four Color Theorem for Cell Instance Segmentation Ye Zhang et.al. 2506.09724 link
2025-06-11 Enhancing Human-Robot Collaboration: A Sim2Real Domain Adaptation Algorithm for Point Cloud Segmentation in Industrial Environments Fatemeh Mohammadi Amin et.al. 2506.09552 null
2025-06-12 Urban1960SatSeg: Unsupervised Semantic Segmentation of Mid-20 $^{th}$ century Urban Landscapes with Satellite Imageries Tianxiang Hao et.al. 2506.09476 link
2025-06-11 MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning Tong Wang et.al. 2506.09327 null
2025-06-10 WetCat: Automating Skill Assessment in Wetlab Cataract Surgery Videos Negin Ghamsarian et.al. 2506.08896 null
2025-06-11 RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation Jiayi Song et.al. 2506.08772 link
2025-06-10 ATAS: Any-to-Any Self-Distillation for Enhanced Open-Vocabulary Dense Prediction Juan Yeo et.al. 2506.08678 null
2025-06-10 ECMNet:Lightweight Semantic Segmentation with Efficient CNN-Mamba Network Feixiang Du et.al. 2506.08629 null
2025-06-10 DCD: A Semantic Segmentation Model for Fetal Ultrasound Four-Chamber View Donglian Li et.al. 2506.08534 null
2025-06-11 IGraSS: Learning to Identify Infrastructure Networks from Satellite Imagery by Iterative Graph-constrained Semantic Segmentation Oishee Bintey Hoque et.al. 2506.08137 null
2025-06-09 LogoSP: Local-global Grouping of Superpoints for Unsupervised Semantic Segmentation of 3D Point Clouds Zihui Zhang et.al. 2506.07857 link
2025-06-09 F2Net: A Frequency-Fused Network for Ultra-High Resolution Remote Sensing Segmentation Hengzhi Chen et.al. 2506.07847 null
2025-06-09 Trend-Aware Fashion Recommendation with Visual Segmentation and Semantic Similarity Mohamed Djilani et.al. 2506.07773 link
2025-06-09 Adapter Naturally Serves as Decoupler for Cross-Domain Few-Shot Semantic Segmentation Jintao Tong et.al. 2506.07376 null
2025-06-09 Multiple Object Stitching for Unsupervised Representation Learning Chengchao Shen et.al. 2506.07364 link
2025-06-08 BRIGHT+: Upgrading the BRIGHT Benchmark with MARCUS, a Multi-Agent RAG Clean-Up Suite Liyang Chen et.al. 2506.07116 null
2025-06-08 Technical Report for ICRA 2025 GOOSE 3D Semantic Segmentation Challenge: Adaptive Point Cloud Understanding for Heterogeneous Robotic Systems Xiaoya Zhang et.al. 2506.06995 null
2025-06-07 Position Prediction Self-Supervised Learning for Multimodal Satellite Imagery Semantic Segmentation John Waithaka et.al. 2506.06852 null
2025-06-07 EndoARSS: Adapting Spatially-Aware Foundation Model for Efficient Activity Recognition and Semantic Segmentation in Endoscopic Surgery Guankun Wang et.al. 2506.06830 null
2025-06-06 GS4: Generalizable Sparse Splatting Semantic SLAM Mingqi Jiang et.al. 2506.06517 null
2025-06-06 NeurNCD: Novel Class Discovery via Implicit Neural Representation Junming Wang et.al. 2506.06412 null
2025-06-06 Rethinking Semi-supervised Segmentation Beyond Accuracy: Reliability and Robustness Steven Landgraf et.al. 2506.05917 null
2025-06-05 FRAME: Pre-Training Video Feature Representations via Anticipation and Memory Sethuraman TV et.al. 2506.05543 null
2025-06-05 U-NetMN and SegNetMN: Modified U-Net and SegNet models for bimodal SAR image segmentation Marwane Kzadri et.al. 2506.05444 null
2025-06-05 Point Cloud Segmentation of Agricultural Vehicles using 3D Gaussian Splatting Alfred T. Christiansen et.al. 2506.05009 null
2025-06-04 You Only Train Once Christos Sakaridis et.al. 2506.04349 null
2025-06-04 AetherVision-Bench: An Open-Vocabulary RGB-Infrared Benchmark for Multi-Angle Segmentation across Aerial and Ground Perspectives Aniruddh Sikdar et.al. 2506.03709 null
2025-06-04 OV-COAST: Cost Aggregation with Optimal Transport for Open-Vocabulary Semantic Segmentation Aditya Gandhamal et.al. 2506.03706 null
2025-06-04 BiXFormer: A Robust Framework for Maximizing Modality Effectiveness in Multi-Modal Semantic Segmentation Jialei Chen et.al. 2506.03675 null
2025-06-03 Cross-Modal Urban Sensing: Evaluating Sound-Vision Alignment Across Street-Level and Aerial Imagery Pengyu Chen et.al. 2506.03388 null
2025-06-03 Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding Weiqing Xiao et.al. 2506.03134 link
2025-06-03 GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal Shufan Qing et.al. 2506.02736 link
2025-06-03 Towards Explicit Geometry-Reflectance Collaboration for Generalized LiDAR Segmentation in Adverse Weather Longyu Yang et.al. 2506.02396 null
2025-06-04 SAB3R: Semantic-Augmented Backbone in 3D Reconstruction Xuweiyi Chen et.al. 2506.02112 null
2025-06-02 SEMNAV: A Semantic Segmentation-Driven Approach to Visual Semantic Navigation Rafael Flor-Rodríguez et.al. 2506.01418 link
2025-06-01 Perceptual Inductive Bias Is What You Need Before Contrastive Learning Tianqin Li et.al. 2506.01201 null
2025-06-01 GeoChain: Multimodal Chain-of-Thought for Geographic Reasoning Sahiti Yerramilli et.al. 2506.00785 null
2025-05-31 BAGNet: A Boundary-Aware Graph Attention Network for 3D Point Cloud Semantic Segmentation Wei Tao et.al. 2506.00475 null
2025-05-30 Bi-Manual Joint Camera Calibration and Scene Representation Haozhan Tang et.al. 2505.24819 null
2025-06-02 NUC-Net: Non-uniform Cylindrical Partition Network for Efficient LiDAR Semantic Segmentation Xuzhi Wang et.al. 2505.24634 link
2025-05-30 Revisiting Cross-Modal Knowledge Distillation: A Disentanglement Approach for RGBD Semantic Segmentation Roger Ferrod et.al. 2505.24361 link
2025-05-30 Weakly-Supervised Affordance Grounding Guided by Part-Level Semantic Priors Peiran Xu et.al. 2505.24103 link
2025-05-29 MaskAdapt: Unsupervised Geometry-Aware Domain Adaptation Using Multimodal Contextual Learning and RGB-Depth Masking Numair Nadeem et.al. 2505.24026 null
2025-05-29 Semantics-Guided Generative Image Compression Cheng-Lin Wu et.al. 2505.24015 link
2025-05-29 Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts Xuweiyi Chen et.al. 2505.23926 null
2025-05-29 TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models Yao Xiao et.al. 2505.23769 link
2025-05-29 Bridging Classical and Modern Computer Vision: PerceptiveNet for Tree Crown Semantic Segmentation Georgios Voulgaris et.al. 2505.23597 null
2025-05-29 VITON-DRR: Details Retention Virtual Try-on via Non-rigid Registration Ben Li et.al. 2505.23439 link
2025-05-29 Adaptive Spatial Augmentation for Semi-supervised Semantic Segmentation Lingyan Ran et.al. 2505.23438 null
2025-05-29 Federated Unsupervised Semantic Segmentation Evangelos Charalampakis et.al. 2505.23292 null
2025-05-29 LeMoRe: Learn More Details for Lightweight Semantic Segmentation Mian Muhammad Naeem Abid et.al. 2505.23093 link
2025-05-28 ConfLUNet: Multiple sclerosis lesion instance segmentation in presence of confluent lesions Maxence Wynen et.al. 2505.22537 null
2025-05-28 Universal Domain Adaptation for Semantic Segmentation Seun-An Choe et.al. 2505.22458 null
2025-05-28 LiDAR Based Semantic Perception for Forklifts in Outdoor Environments Benjamin Serfling et.al. 2505.22258 null
2025-05-29 YH-MINER: Multimodal Intelligent System for Natural Ecological Reef Metric Extraction Mingzhuang Wang et.al. 2505.22250 null
2025-05-28 Enjoying Information Dividend: Gaze Track-based Medical Weakly Supervised Segmentation Zhisong Wang et.al. 2505.22230 null
2025-05-28 A Survey on Training-free Open-Vocabulary Semantic Segmentation Naomi Kombol et.al. 2505.22209 null
2025-05-28 S2AFormer: Strip Self-Attention for Efficient Vision Transformer Guoan Xu et.al. 2505.22195 null
2025-05-28 LiDARDustX: A LiDAR Dataset for Dusty Unstructured Road Environments Chenfeng Wei et.al. 2505.21914 null
2025-05-28 Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation Mehrdad Noori et.al. 2505.21844 link
2025-05-27 Object-Centric Action-Enhanced Representations for Robot Visuo-Motor Policy Learning Nikos Giannakakis et.al. 2505.20962 null
2025-05-27 DSOcc: Leveraging Depth Awareness and Semantic Aid to Boost Camera-Based 3D Semantic Occupancy Prediction Naiyu Fang et.al. 2505.20951 null
2025-05-26 Vision-Based Risk Aware Emergency Landing for UAVs in Complex Urban Environments Julio de la Torre-Vanegas et.al. 2505.20423 null
2025-05-26 A fully automated urban PV parameterization framework for improved estimation of energy production profiles Bowen Tian et.al. 2505.19876 null
2025-05-29 Zero-Shot Pseudo Labels Generation Using SAM and CLIP for Semi-Supervised Semantic Segmentation Nagito Saito et.al. 2505.19846 null
2025-05-26 The Missing Point in Vision Transformers for Universal Image Segmentation Sajjad Shahabodini et.al. 2505.19795 null
2025-05-26 ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting Wenhua Wu et.al. 2505.19420 null
2025-05-25 A Joint Learning Framework with Feature Reconstruction and Prediction for Incomplete Satellite Image Time Series in Agricultural Semantic Segmentation Yuze Wang et.al. 2505.19159 link
2025-05-25 SPARS: Self-Play Adversarial Reinforcement Learning for Segmentation of Liver Tumours Catalina Tan et.al. 2505.18989 link
2025-05-25 LLM-Guided Taxonomy and Hierarchical Uncertainty for 3D Point CLoud Active Learning Chenxi Li et.al. 2505.18924 null
2025-05-23 REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders Savya Khosla et.al. 2505.18153 link
2025-05-23 SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification Shashank Agnihotri et.al. 2505.18015 link
2025-05-23 Semantic segmentation with reward Xie Ting et.al. 2505.17905 null
2025-05-23 Hephaestus Minicubes: A Global, Multi-Modal Dataset for Volcanic Unrest Monitoring Nikolas Papadopoulos et.al. 2505.17782 null
2025-05-23 EMRA-proxy: Enhancing Multi-Class Region Semantic Segmentation in Remote Sensing Images with Attention Proxy Yichun Yu et.al. 2505.17665 null
2025-05-22 Deep mineralogical segmentation of thin section images based on QEMSCAN maps Jean Pablo Vieira de Mello et.al. 2505.17008 link
2025-05-22 OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning Zongyan Han et.al. 2505.16974 link
2025-05-25 NovelSeek: When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification NovelSeek Team et.al. 2505.16938 link
2025-05-22 TextureSAM: Towards a Texture Aware Foundation Model for Segmentation Inbal Cohen et.al. 2505.16540 null
2025-05-22 Style Transfer with Diffusion Models for Synthetic-to-Real Domain Adaptation Estelle Chigot et.al. 2505.16360 link
2025-05-21 VP Lab: a PEFT-Enabled Visual Prompting Laboratory for Semantic Segmentation Niccolo Avogaro et.al. 2505.15592 null
2025-05-21 seg_3D_by_PC2D: Multi-View Projection for Domain Generalization and Adaptation in 3D Semantic Segmentation Andrew Caunes et.al. 2505.15545 link
2025-05-21 Spectral-Aware Global Fusion for RGB-Thermal Semantic Segmentation Ce Zhang et.al. 2505.15491 null
2025-05-21 From Pixels to Images: Deep Learning Advances in Remote Sensing Image Semantic Segmentation Quanwei Liu et.al. 2505.15147 null
2025-05-20 Scan, Materialize, Simulate: A Generalizable Framework for Physically Grounded Robot Planning Amine Elhafsi et.al. 2505.14938 null
2025-05-20 LOD1 3D City Model from LiDAR: The Impact of Segmentation Accuracy on Quality of Urban 3D Modeling and Morphology Extraction Fatemeh Chajaei et.al. 2505.14747 link
2025-05-19 Enhancing Shape Perception and Segmentation Consistency for Industrial Image Inspection Guoxuan Mao et.al. 2505.14718 null
2025-05-20 Instance Segmentation for Point Sets Abhimanyu Talwar et.al. 2505.14583 null
2025-05-20 ReservoirTTA: Prolonged Test-time Adaptation for Evolving and Recurring Domains Guillaume Vray et.al. 2505.14511 null
2025-05-20 Intra-class Patch Swap for Self-Distillation Hongjun Choi et.al. 2505.14124 link
2025-05-20 Generalizable Multispectral Land Cover Classification via Frequency-Aware Mixture of Low-Rank Token Experts Xi Chen et.al. 2505.14088 null
2025-05-20 Scaling Vision Mamba Across Resolutions via Fractal Traversal Bo Li et.al. 2505.14062 null
2025-05-20 EGFormer: Towards Efficient and Generalizable Multimodal Semantic Segmentation Zelin Zhang et.al. 2505.14014 null
2025-05-19 Self-Supervised Learning for Image Segmentation: A Comprehensive Survey Thangarajah Akilan et.al. 2505.13584 null
2025-05-19 Robust Multimodal Segmentation with Representation Regularization and Hybrid Prototype Distillation Jiaqi Tan et.al. 2505.12861 link
2025-05-18 Temporal-Spectral-Spatial Unified Remote Sensing Dense Prediction Sijie Zhao et.al. 2505.12280 link
2025-05-17 EarthSynth: Generating Informative Earth Observation with Diffusion Models Jiancheng Pan et.al. 2505.12108 null
2025-05-17 Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Boosting Off-Road Segmentation via Photometric Distortion and Exponential Moving Average Wonjune Kim et.al. 2505.11769 null
2025-05-16 DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation Ziyu Zhao et.al. 2505.11676 null
2025-05-16 Completely Weakly Supervised Class-Incremental Learning for Semantic Segmentation David Minkwan Kim et.al. 2505.10781 null
2025-05-15 Mapping Semantic Segmentation to Point Clouds Using Structure from Motion for Forest Analysis Francisco Raverta Capua et.al. 2505.10751 link
2025-05-15 TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation Manthan Patel et.al. 2505.10696 null
2025-05-15 SpikeVideoFormer: An Efficient Spike-Driven Video Transformer with Hamming Attention and $\mathcal{O}(T)$ Complexity Shihao Zou et.al. 2505.10352 null
2025-05-15 APCoTTA: Continual Test-Time Adaptation for Semantic Segmentation of Airborne LiDAR Point Clouds Yuan Gao et.al. 2505.09971 link
2025-05-14 FedSaaS: Class-Consistency Federated Semantic Segmentation via Global Prototype Supervision and Local Adversarial Harmonization Xiaoyang Yu et.al. 2505.09385 null
2025-05-14 MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning Bin-Bin Gao et.al. 2505.09265 null
2025-05-13 MESSI: A Multi-Elevation Semantic Segmentation Image Dataset of an Urban Environment Barak Pinkovich et.al. 2505.08589 null
2025-05-13 Dynamic Snake Upsampling Operater and Boundary-Skeleton Weighted Loss for Tubular Structure Segmentation Yiqi Chen et.al. 2505.08525 null
2025-05-13 Optimizing Retrieval-Augmented Generation: Analysis of Hyperparameter Impact on Performance and Efficiency Adel Ammar et.al. 2505.08445 null
2025-05-13 GNCAF: A GNN-based Neighboring Context Aggregation Framework for Tertiary Lymphoid Structures Semantic Segmentation in WSI Lei Su et.al. 2505.08430 null
2025-05-12 Privacy Risks of Robot Vision: A User Study on Image Modalities and Resolution Xuying Huang et.al. 2505.07766 null
2025-05-12 Feedback-Driven Pseudo-Label Reliability Assessment: Redefining Thresholding for Semi-Supervised Semantic Segmentation Negin Ghamsarian et.al. 2505.07691 null
2025-05-13 TUM2TWIN: Introducing the Large-Scale Multimodal Urban Digital Twin Benchmark Dataset Olaf Wysocki et.al. 2505.07396 null
2025-05-11 Semantic-Guided Diffusion Model for Single-Step Image Super-Resolution Zihang Liu et.al. 2505.07071 link
2025-05-11 Depth-Sensitive Soft Suppression with RGB-D Inter-Modal Stylization Flow for Domain Generalization Semantic Segmentation Binbin Wei et.al. 2505.07050 null
2025-05-11 Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Leveraging Color Shift Correction, RoPE-Swin Backbone, and Quantile-based Label Denoising Strategy for Robust Outdoor Scene Understanding Chih-Chung Hsu et.al. 2505.06991 null
2025-05-11 Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation Seokjun Kwon et.al. 2505.06951 null
2025-05-10 Reducing Unimodal Bias in Multi-Modal Semantic Segmentation with Multi-Scale Functional Entropy Regularization Xu Zheng et.al. 2505.06635 null
2025-05-10 RESAR-BEV: An Explainable Progressive Residual Autoregressive Approach for Camera-Radar Fusion in BEV Segmentation Zhiwen Zeng et.al. 2505.06515 null
2025-05-06 Show or Tell? A Benchmark To Evaluate Visual and Textual Prompts in Semantic Segmentation Gabriele Rosi et.al. 2505.06280 link
2025-05-13 Brain Hematoma Marker Recognition Using Multitask Learning: SwinTransformer and Swin-Unet Kodai Hirata et.al. 2505.06185 null
2025-05-09 UncertainSAM: Fast and Efficient Uncertainty Quantification of the Segment Anything Model Timo Kaiser et.al. 2505.05049 link
2025-05-08 Split Matching for Inductive Zero-shot Semantic Segmentation Jialei Chen et.al. 2505.05023 null
2025-05-07 Are Synthetic Corruptions A Reliable Proxy For Real-World Corruptions? Shashank Agnihotri et.al. 2505.04835 link
2025-05-07 Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer Sainath Dey et.al. 2505.04740 null
2025-05-07 DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception Junjie Wang et.al. 2505.04410 link
2025-05-07 MFSeg: Efficient Multi-frame 3D Semantic Segmentation Chengjie Huang et.al. 2505.04408 null
2025-05-06 CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting Huawei Sun et.al. 2505.03679 null
2025-05-06 Panoramic Out-of-Distribution Segmentation Mengfei Duan et.al. 2505.03539 link
2025-05-06 3D Can Be Explored In 2D: Pseudo-Label Generation for LiDAR Point Clouds Using Sensor-Intensity-Based 2D Semantic Segmentation Andrew Caunes et.al. 2505.03300 null
2025-05-05 Platelet enumeration in dense aggregates H. Martin Gillis et.al. 2505.02751 null
2025-05-04 Segment Any RGB-Thermal Model with Language-aided Distillation Dong Xing et.al. 2505.01950 null
2025-05-03 OODTE: A Differential Testing Engine for the ONNX Optimizer Nikolaos Louloudakis et.al. 2505.01892 null
2025-05-02 A Sensor Agnostic Domain Generalization Framework for Leveraging Geospatial Foundation Models: Enhancing Semantic Segmentation viaSynergistic Pseudo-Labeling and Generative Learning Anan Yaghmour et.al. 2505.01558 link
2025-05-02 Rethinking RGB-Event Semantic Segmentation with a Novel Bidirectional Motion-enhanced Event Representation Zhen Yao et.al. 2505.01548 link
2025-05-02 GeloVec: Higher Dimensional Geometric Smoothing for Coherent Visual Feature Extraction in Image Segmentation Boris Kriuk et.al. 2505.01057 null
2025-05-03 Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook Muyi Bao et.al. 2505.00630 link
2025-05-01 Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation Feng Xue et.al. 2505.00378 null
2025-04-30 Real Time Semantic Segmentation of High Resolution Automotive LiDAR Scans Hannes Reichert et.al. 2504.21602 link
2025-05-04 Make Both Ends Meet: A Synergistic Optimization Infrared Small Target Detection with Streamlined Computational Overhead Yuxin Jing et.al. 2504.21581 null
2025-04-30 ClassWise-CRF: Category-Specific Fusion for Enhanced Semantic Segmentation of Remote Sensing Imagery Qinfeng Zhu et.al. 2504.21491 null
2025-04-29 DeepVoid: A Deep Learning Void Detector Sam Kumagai et.al. 2504.21134 null
2025-04-29 Learning a General Model: Folding Clothing with Topological Dynamics Yiming Liu et.al. 2504.20720 null
2025-04-28 DeepAndes: A Self-Supervised Vision Foundation Model for Multi-Spectral Remote Sensing Imagery of the Andes Junlin Guo et.al. 2504.20303 null
2025-04-28 SRMF: A Data Augmentation and Multimodal Fusion Approach for Long-Tail UHR Satellite Image Segmentation Yulong Guo et.al. 2504.19839 null
2025-04-28 Open-set Anomaly Segmentation in Complex Scenarios Song Xia et.al. 2504.19706 null
2025-04-28 Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding Yan Wang et.al. 2504.19500 null
2025-04-28 GSFF-SLAM: 3D Semantic Gaussian Splatting SLAM via Feature Field Zuxing Lu et.al. 2504.19409 null
2025-04-27 DeepSPG: Exploring Deep Semantic Prior Guidance for Low-light Image Enhancement with Multimodal Learning Jialang Lu et.al. 2504.19127 null
2025-04-26 Federated Learning-based Semantic Segmentation for Lane and Object Detection in Autonomous Driving Gharbi Khamis Alshammari et.al. 2504.18939 null
2025-04-25 A Data-Centric Approach to 3D Semantic Segmentation of Railway Scenes Nicolas Münger et.al. 2504.18213 null
2025-04-25 Multi-Grained Compositional Visual Clue Learning for Image Intent Recognition Yin Tang et.al. 2504.18201 null
2025-04-25 What is the Added Value of UDA in the VFM Era? Brunó B. Englert et.al. 2504.18190 null
2025-04-25 Back to Fundamentals: Low-Level Visual Features Guided Progressive Token Pruning Yuanbing Ouyang et.al. 2504.17996 null
2025-04-24 Virtual Roads, Smarter Safety: A Digital Twin Framework for Mixed Autonomous Traffic Safety Analysis Hao Zhang et.al. 2504.17968 null
2025-04-24 Masked strategies for images with small objects H. Martin Gillis et.al. 2504.17935 null
2025-04-24 Occlusion-Aware Self-Supervised Monocular Depth Estimation for Weak-Texture Endoscopic Images Zebo Huang et.al. 2504.17582 null
2025-04-23 SemanticSugarBeets: A Multi-Task Framework and Dataset for Inspecting Harvest and Storage Characteristics of Sugar Beets Gerardus Croonen et.al. 2504.16684 link
2025-04-23 Federated EndoViT: Pretraining Vision Transformers via Federated Learning on Endoscopic Image Collections Max Kirchner et.al. 2504.16612 null
2025-04-23 SAIP-Net: Enhancing Remote Sensing Image Segmentation via Spectral Adaptive Information Propagation Zhongtao Wang et.al. 2504.16564 null
2025-04-22 Efficient Adaptation of Deep Neural Networks for Semantic Segmentation in Space Applications Leonardo Olivi et.al. 2504.15991 null
2025-04-22 DINOv2-powered Few-Shot Semantic Segmentation: A Unified Framework via Cross-Model Distillation and 4D Correlation Mining Wei Zhuo et.al. 2504.15669 null
2025-04-21 Segmentation with Noisy Labels via Spatially Correlated Distributions Ryu Tadokoro et.al. 2504.14795 link
2025-04-19 Exploring Modality Guidance to Enhance VFM-based Feature Fusion for UDA in 3D Semantic Segmentation Johannes Spoecklberger et.al. 2504.14231 null
2025-04-19 Segment Any Crack: Deep Semantic Segmentation Adaptation for Crack Detection Ghodsiyeh Rostami et.al. 2504.14138 null
2025-04-19 Lightweight Road Environment Segmentation using Vector Quantization Jiyong Kwag et.al. 2504.14113 null
2025-04-18 Occlusion-Ordered Semantic Instance Segmentation Soroosh Baselizadeh et.al. 2504.14054 null
2025-04-18 HDBFormer: Efficient RGB-D Semantic Segmentation with A Heterogeneous Dual-Branch Framework Shuobin Wei et.al. 2504.13579 null
2025-04-18 Learning from Noisy Pseudo-labels for All-Weather Land Cover Mapping Wang Liu et.al. 2504.13458 link
2025-04-18 DADU: Dual Attention-based Deep Supervised UNet for Automated Semantic Segmentation of Cardiac Images Racheal Mukisa et.al. 2504.13415 null
2025-04-18 Cardiac MRI Semantic Segmentation for Ventricles and Myocardium using Deep Learning Racheal Mukisa et.al. 2504.13391 null
2025-04-17 SAR Object Detection with Self-Supervised Pretraining and Curriculum-Aware Sampling Yasin Almalioglu et.al. 2504.13310 null
2025-04-17 Digital Twin Generation from Visual Data: A Survey Andrew Melnik et.al. 2504.13159 link
2025-04-17 High-Fidelity Image Inpainting with Multimodal Guided GAN Inversion Libo Zhang et.al. 2504.12844 null
2025-04-17 Stronger, Steadier & Superior: Geometric Consistency in Depth VFM Forges Domain Generalized Semantic Segmentation Siyu Chen et.al. 2504.12753 link
2025-04-17 Parsimonious Dataset Construction for Laparoscopic Cholecystectomy Structure Segmentation Yuning Zhou et.al. 2504.12573 null
2025-04-17 Privacy-Preserving Operating Room Workflow Analysis using Digital Twins Alejandra Perez et.al. 2504.12552 null
2025-04-16 3D-PointZshotS: Geometry-Aware 3D Point Cloud Zero-Shot Semantic Segmentation Narrowing the Visual-Semantic Gap Minmin Yang et.al. 2504.12442 link
2025-04-16 Remote sensing colour image semantic segmentation of trails created by large herbivorous Mammals Jose Francisco Diez-Pastor et.al. 2504.12121 null
2025-04-12 SDIGLM: Leveraging Large Language Models and Multi-Modal Chain of Thought for Structural Damage Identification Yunkai Zhang et.al. 2504.11477 null
2025-04-15 PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation Bo-Cheng Hu et.al. 2504.10986 link
2025-04-15 LightFormer: A lightweight and efficient decoder for remote sensing image segmentation Sihang Chen et.al. 2504.10834 null
2025-04-15 OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding Dianbing Xi et.al. 2504.10825 null
2025-04-15 Efficient and Robust Remote Sensing Image Denoising Using Randomized Approximation of Geodesics’ Gramian on the Manifold Underlying the Patch Space Kelum Gajamannage et.al. 2504.10820 null
2025-04-14 Real-time Seafloor Segmentation and Mapping Michele Grimaldi et.al. 2504.10750 null
2025-04-14 FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation Yasser Benigmim et.al. 2504.10487 link
2025-04-14 The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer Weixian Lei et.al. 2504.10462 link
2025-04-14 M2S-RoAD: Multi-Modal Semantic Segmentation for Road Damage Using Camera and LiDAR Data Tzu-Yun Tseng et.al. 2504.10123 link
2025-04-14 DUDA: Distilled Unsupervised Domain Adaptation for Lightweight Semantic Segmentation Beomseok Kang et.al. 2504.09814 null
2025-04-14 IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework under Limited Annotation Scheme Dinh Dai Quan Tran et.al. 2504.09797 null
2025-04-14 Advancing RFI-Detection in Radio Astronomy with Liquid State Machines Nicholas J Pritchard et.al. 2504.09796 null
2025-04-12 Evolved Hierarchical Masking for Self-Supervised Learning Zhanzhou Feng et.al. 2504.09155 null
2025-04-11 Data-Importance-Aware Power Allocation for Adaptive Real-Time Communication in Computer Vision Applications Chunmei Xu et.al. 2504.08922 null
2025-04-11 Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing Vinal Asodia et.al. 2504.08704 null
2025-04-11 SN-LiDAR: Semantic Neural Fields for Novel Space-time View LiDAR Synthesis Yi Chen et.al. 2504.08361 link
2025-04-11 DSM: Building A Diverse Semantic Map for 3D Visual Grounding Qinghongbing Xie et.al. 2504.08307 null
2025-04-10 ChildlikeSHAPES: Semantic Hierarchical Region Parsing for Animating Figure Drawings Astitva Srivastava et.al. 2504.08022 null
2025-04-10 Distilling Knowledge from Heterogeneous Architectures for Semantic Segmentation Yanglin Huang et.al. 2504.07691 null
2025-04-10 RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Radiology with Zero-Shot Multi-Task Capability Jonggwon Park et.al. 2504.07416 null
2025-04-09 RayFronts: Open-Set Semantic Ray Frontiers for Online Scene Understanding and Exploration Omar Alama et.al. 2504.06994 null
2025-04-09 Domain Generalization through Attenuation of Domain-Specific Information Reiji Saito et.al. 2504.06781 link
2025-04-08 SemiDAViL: Semi-supervised Domain Adaptation with Vision-Language Guidance for Semantic Segmentation Hritam Basak et.al. 2504.06389 null
2025-04-09 Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation Xiaoxing Hu et.al. 2504.06220 link
2025-04-08 WoundAmbit: Bridging State-of-the-Art Semantic Segmentation and Real-World Wound Care Vanessa Borst et.al. 2504.06185 null
2025-04-08 Towards Varroa destructor mite detection using a narrow spectra illumination Samuel Bielik et.al. 2504.06099 null
2025-04-08 econSG: Efficient and Multi-view Consistent Open-Vocabulary 3D Semantic Gaussians Can Zhang et.al. 2504.06003 null
2025-04-08 Turin3D: Evaluating Adaptation Strategies under Label Scarcity in Urban LiDAR Segmentation with Semi-Supervised Techniques Luca Barco et.al. 2504.05882 null
2025-04-08 DefMamba: Deformable Visual State Space Model Leiye Liu et.al. 2504.05794 null
2025-04-08 Transferable Mask Transformer: Cross-domain Semantic Segmentation with Region-adaptive Transferability Estimation Enming Zhang et.al. 2504.05774 null
2025-04-07 Balancing Robustness and Efficiency in Embedded DNNs Through Activation Function Selection Jon Gutiérrez Zaballa et.al. 2504.05119 null
2025-04-07 DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation Bo-Wen Yin et.al. 2504.04701 link
2025-04-05 CoMBO: Conflict Mitigation via Branched Optimization for Class Incremental Segmentation Kai Fang et.al. 2504.04156 null
2025-04-05 DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning Xiao-Hui Li et.al. 2504.04085 null
2025-04-01 Input Resolution Downsizing as a Compression Technique for Vision Deep Learning Systems Jeremy Morlier et.al. 2504.03749 null
2025-04-04 Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation Xin Zhang et.al. 2504.03193 link
2025-04-02 Global Rice Multi-Class Segmentation Dataset (RiceSEG): A Comprehensive and Diverse High-Resolution RGB-Annotated Images for the Development and Benchmarking of Rice Segmentation Algorithms Junchi Zhou et.al. 2504.02880 null
2025-04-03 Adaptive Frequency Enhancement Network for Remote Sensing Image Semantic Segmentation Feng Gao et.al. 2504.02647 link
2025-04-03 Semantic segmentation of forest stands using deep learning Håkon Næss Sandum et.al. 2504.02471 null
2025-04-03 Taylor Series-Inspired Local Structure Fitting Network for Few-shot Point Cloud Semantic Segmentation Changshuo Wang et.al. 2504.02454 null
2025-04-02 Overlap-Aware Feature Learning for Robust Unsupervised Domain Adaptation for 3D Semantic Segmentation Junjie Chen et.al. 2504.01668 null
2025-04-03 Robust Unsupervised Domain Adaptation for 3D Point Cloud Segmentation Under Source Adversarial Attacks Haosheng Li et.al. 2504.01659 null
2025-04-02 ProtoGuard-guided PROPEL: Class-Aware Prototype Enhancement and Progressive Labeling for Incremental 3D Point Cloud Segmentation Haosheng Li et.al. 2504.01648 null
2025-04-02 Benchmarking the Spatial Robustness of DNNs via Natural and Adversarial Localized Corruptions Giulia Marchiori Pietrosanti et.al. 2504.01632 null
2025-04-02 Semi-Supervised Biomedical Image Segmentation via Diffusion Models and Teacher-Student Co-Training Luca Ciampi et.al. 2504.01547 link
2025-04-02 Beyond Nearest Neighbor Interpolation in Data Augmentation Olivier Rukundo et.al. 2504.01527 null
2025-04-02 Multimodal Point Cloud Semantic Segmentation With Virtual Point Enhancement Zaipeng Duan et.al. 2504.01449 null
2025-04-01 CAPE: Connectivity-Aware Path Enforcement Loss for Curvilinear Structure Delineation Elyar Esmaeilzadeh et.al. 2504.00753 null
2025-04-01 FSSUWNet: Mitigating the Fragility of Pre-trained Models with Feature Enhancement for Few-Shot Semantic Segmentation in Underwater Images Zhuohao Li et.al. 2504.00478 link
2025-03-31 Spectral-Adaptive Modulation Networks for Visual Perception Guhnoo Yun et.al. 2503.23947 link
2025-03-31 Bridge the Gap Between Visual and Linguistic Comprehension for Generalized Zero-shot Semantic Segmentation Xiaoqing Guo et.al. 2503.23806 null
2025-03-31 Decoupled Distillation to Erase: A General Unlearning Method for Any Class-centric Tasks Yu Zhou et.al. 2503.23751 null
2025-03-31 Semantic Packet Aggregation and Repeated Transmission for Text-to-Image Generation Seunghun Lee et.al. 2503.23734 null
2025-04-02 CrossFormer: Cross-Segment Semantic Fusion for Document Segmentation Tongke Ni et.al. 2503.23671 null
2025-03-30 BoundMatch: Boundary detection applied to semi-supervised segmentation for urban-driving scenes Haruya Ishikawa et.al. 2503.23519 null
2025-03-30 Improving underwater semantic segmentation with underwater image quality attention and muti-scale aggregation attention Xin Zuo et.al. 2503.23422 link
2025-03-29 Open-Vocabulary Semantic Segmentation with Uncertainty Alignment for Robotic Scene Understanding in Indoor Building Environments Yifan Xu et.al. 2503.23105 null
2025-03-28 Enhancing DeepLabV3+ to Fuse Aerial and Satellite Images for Semantic Segmentation Anas Berka et.al. 2503.22909 null
2025-03-28 The Marine Debris Forward-Looking Sonar Datasets Matias Valdenegro-Toro et.al. 2503.22880 null
2025-03-28 KEVS: Enhancing Segmentation of Visceral Adipose Tissue in Pre-Cystectomy CT with Gaussian Kernel Density Estimation Thomas Boucher et.al. 2503.22592 null
2025-03-28 A Dataset for Semantic Segmentation in the Presence of Unknowns Zakaria Laskar et.al. 2503.22309 null
2025-03-28 Concept-Aware LoRA for Domain-Aligned Segmentation Dataset Generation Minho Park et.al. 2503.22172 null
2025-03-28 Beyond Background Shift: Rethinking Instance Replay in Continual Semantic Segmentation Hongmei Yin et.al. 2503.22136 link
2025-03-28 Semantic segmentation for building houses from wooden cubes Ivan Beleacov et.al. 2503.22125 null
2025-03-28 Baseline Systems and Evaluation Metrics for Spatial Semantic Segmentation of Sound Scenes Binh Thien Nguyen et.al. 2503.22088 null
2025-03-28 A Deep Learning Framework for Boundary-Aware Semantic Segmentation Tai An et.al. 2503.22050 null
2025-03-27 Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation Reza Qorbani et.al. 2503.21780 link
2025-03-27 A Unified Image-Dense Annotation Generation Model for Underwater Scenes Hongkai Lin et.al. 2503.21771 link
2025-03-27 Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving Lucas Nunes et.al. 2503.21449 link
2025-03-26 Exploring CLIP’s Dense Knowledge for Weakly Supervised Semantic Segmentation Zhiwei Yang et.al. 2503.20826 link
2025-03-26 Exploiting Temporal State Space Sharing for Video Semantic Segmentation Syed Ariff Syed Hesham et.al. 2503.20824 link
2025-03-25 Hyperdimensional Uncertainty Quantification for Multimodal Uncertainty Fusion in Autonomous Vehicles Perception Luke Chen et.al. 2503.20011 null
2025-03-25 The Coralscapes Dataset: Semantic Scene Understanding in Coral Reefs Jonathan Sauder et.al. 2503.20000 link
2025-03-25 LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation Vladan Stojnić et.al. 2503.19777 link
2025-03-25 OpenLex3D: A New Evaluation Benchmark for Open-Vocabulary 3D Scene Representations Christina Kassab et.al. 2503.19764 null
2025-03-25 Show or Tell? Effectively prompting Vision-Language Models for semantic segmentation Niccolo Avogaro et.al. 2503.19647 null
2025-03-25 Exploring Textual Semantics Diversity for Image Transmission in Semantic Communication Systems using Visual Language Model Peishan Huang et.al. 2503.19386 null
2025-03-25 BIMII-Net: Brain-Inspired Multi-Iterative Interactive Network for RGB-T Road Scene Semantic Segmentation Hanshuo Qiu et.al. 2503.19303 null
2025-03-25 Context-Aware Semantic Segmentation: Enhancing Pixel-Level Understanding with Large Language Models for Advanced Vision Applications Ben Rahman et.al. 2503.19276 null
2025-03-24 DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation Karim Abou Zeid et.al. 2503.18944 link
2025-03-24 Exploring the Integration of Key-Value Attention Into Pure and Hybrid Transformers for Semantic Segmentation DeShin Hwa et.al. 2503.18862 null
2025-03-24 HiRes-FusedMIM: A High-Resolution RGB-DSM Pre-trained Model for Building-Level Remote Sensing Applications Guneet Mutreja et.al. 2503.18540 null
2025-03-24 Benchmarking Multi-modal Semantic Segmentation under Sensor Failures: Missing and Noisy Modality Robustness Chenfei Liao et.al. 2503.18445 link
2025-03-24 PDDM: Pseudo Depth Diffusion Model for RGB-PD Semantic Segmentation Based in Complex Indoor Scenes Xinhua Xu et.al. 2503.18393 null
2025-03-24 MaSS13K: A Matting-level Semantic Segmentation Benchmark Chenxi Xie et.al. 2503.18364 link
2025-03-23 Co-SemDepth: Fast Joint Semantic Segmentation and Depth Estimation on Aerial Images Yara AlaaEldin et.al. 2503.17982 link
2025-03-23 FisherTune: Fisher-Guided Robust Tuning of Vision Foundation Models for Domain Generalized Segmentation Dong Zhao et.al. 2503.17940 null
2025-03-23 Semi-supervised Semantic Segmentation with Multi-Constraint Consistency Learning Jianjian Yin et.al. 2503.17914 link
2025-03-22 HiLoTs: High-Low Temporal Sensitive Representation Learning for Semi-Supervised LiDAR Segmentation in Autonomous Driving R. D. Lin et.al. 2503.17752 link
2025-03-22 Multi-modality Anomaly Segmentation on the Road Heng Gao et.al. 2503.17712 link
2025-03-21 Should we pre-train a decoder in contrastive learning for dense prediction tasks? Sébastien Quetin et.al. 2503.17526 null
2025-03-21 Center-guided Classifier for Semantic Segmentation of Remote Sensing Images Wei Zhang et.al. 2503.16963 link
2025-03-21 Seg2Box: 3D Object Detection by Point-Wise Semantics Supervision Maoji Zheng et.al. 2503.16811 null
2025-03-20 SAGE: Semantic-Driven Adaptive Gaussian Splatting in Extended Reality Chiara Schiavo et.al. 2503.16747 null
2025-03-20 Panoptic-CUDAL Technical Report: Rural Australia Point Cloud Dataset in Rainy Conditions Tzu-Yun Tseng et.al. 2503.16378 null
2025-03-20 Controllable Segmentation-Based Text-Guided Style Editing Jingwen Li et.al. 2503.16129 null
2025-03-24 No Thing, Nothing: Highlighting Safety-Critical Classes for Robust LiDAR Semantic Segmentation in Adverse Weather Junsung Park et.al. 2503.15910 null
2025-03-19 High Temporal Consistency through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous Flight Cédric Vincent et.al. 2503.15676 link
2025-03-19 Transport-Related Surface Detection with Machine Learning: Analyzing Temporal Trends in Madrid and Vienna Miguel Ureña Pliego et.al. 2503.15653 link
2025-03-19 CAM-Seg: A Continuous-valued Embedding Approach for Semantic Image Generation Masud Ahmed et.al. 2503.15617 link
2025-03-21 SUM Parts: Benchmarking Part-Level Semantic Segmentation of Urban Meshes Weixiao Gao et.al. 2503.15300 null
2025-03-19 Semantic Segmentation of Transparent and Opaque Drinking Glasses with the Help of Zero-shot Learning Annalena Blänsdorf et.al. 2503.15004 null
2025-03-19 USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation using Features from a Pre-trained Image Segmentation network Joseph Emmanuel DL Dayo et.al. 2503.14950 null
2025-03-18 PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point Clouds Barza Nisar et.al. 2503.13914 null
2025-03-18 Exploiting Inherent Class Label: Towards Robust Scribble Supervised Semantic Segmentation Xinliang Zhang et.al. 2503.13895 link
2025-03-17 Let Synthetic Data Shine: Domain Reassembly and Soft-Fusion for Single Domain Generalization Hao Li et.al. 2503.13617 null
2025-03-17 3D Hierarchical Panoptic Segmentation in Real Orchard Environments Across Different Sensors Matteo Sodano et.al. 2503.13188 null
2025-03-17 DehazeMamba: SAR-guided Optical Remote Sensing Image Dehazing with Adaptive State Space Model Zhicheng Zhao et.al. 2503.13073 null
2025-03-17 Adaptive Transformer Attention and Multi-Scale Fusion for Spine 3D Segmentation Yanlin Xiang et.al. 2503.12853 null
2025-03-17 LangDA: Building Context-Awareness via Language for Domain Adaptive Semantic Segmentation Chang Liu et.al. 2503.12780 null
2025-03-17 TransDiff: Diffusion-Based Method for Manipulating Transparent Objects Using a Single RGB-D Image Haoxiao Wang et.al. 2503.12779 null
2025-03-16 Point Cloud Based Scene Segmentation: A Survey Dan Halperin et.al. 2503.12595 null
2025-03-16 BFANet: Revisiting 3D Semantic Segmentation with Boundary Feature Analysis Weiguang Zhao et.al. 2503.12539 link
2025-03-16 SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs Guibiao Liao et.al. 2503.12535 null
2025-03-16 Shape Bias and Robustness Evaluation via Cue Decomposition for Image Classification and Segmentation Edgar Heinert et.al. 2503.12453 null
2025-03-17 COIN: Confidence Score-Guided Distillation for Annotation-Free Cell Segmentation Sanghyun Jo et.al. 2503.11439 null
2025-03-14 SpaceSeg: A High-Precision Intelligent Perception Segmentation Method for Multi-Spacecraft On-Orbit Targets Hao Liu et.al. 2503.11133 null
2025-03-14 A Novel Decomposed Feature-Oriented Framework for Open-Set Semantic Segmentation on LiDAR Data Wenbang Deng et.al. 2503.11097 link
2025-03-12 Knowledge Consultation for Semi-Supervised Semantic Segmentation Thuan Than et.al. 2503.10693 null
2025-03-11 VFM-UDA++: Improving Network Architectures and Data Strategies for Unsupervised Domain Adaptive Semantic Segmentation Brunó B. Englert et.al. 2503.10685 null
2025-03-13 RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing Fengxiang Wang et.al. 2503.10392 link
2025-03-13 OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions Maxim Popov et.al. 2503.10331 null
2025-03-12 CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation Hariprasath Govindarajan et.al. 2503.09878 null
2025-03-12 Active Learning Inspired ControlNet Guidance for Augmenting Semantic Segmentation Datasets Hannah Kniesel et.al. 2503.09221 null
2025-03-07 Real-Time Semantic Segmentation of Aerial Images Using an Embedded U-Net: A Comparison of CPU, GPU, and FPGA Workflows Julien Posso et.al. 2503.08700 null
2025-03-11 SegDesicNet: Lightweight Semantic Segmentation in Remote Sensing with Geo-Coordinate Embeddings for Domain Adaptation Sachin Verma et.al. 2503.08290 null
2025-03-16 Structural and Statistical Texture Knowledge Distillation and Learning for Segmentation Deyi Ji et.al. 2503.08043 null
2025-03-11 DiffEGG: Diffusion-Driven Edge Generation as a Pixel-Annotation-Free Alternative for Instance Annotation Sanghyun Jo et.al. 2503.07982 null
2025-03-10 Can Generative Geospatial Diffusion Models Excel as Discriminative Geospatial Foundation Models? Yuru Jia et.al. 2503.07890 null
2025-03-10 REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding Yan Tai et.al. 2503.07413 link
2025-03-10 Semantic Communications with Computer Vision Sensing for Edge Video Transmission Yubo Peng et.al. 2503.07252 null
2025-03-10 OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation Ding Zhong et.al. 2503.07098 null
2025-03-10 Approximate Size Targets Are Sufficient for Accurate Semantic Segmentation Xingye Fan et.al. 2503.06954 null
2025-03-10 Aligning Instance-Semantic Sparse Representation towards Unsupervised Object Segmentation and Shape Abstraction with Repeatable Primitives Jiaxin Li et.al. 2503.06947 null
2025-03-10 HierDAMap: Towards Universal Domain Adaptive BEV Mapping via Hierarchical Perspective Priors Siyu Li et.al. 2503.06821 link
2025-03-09 CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving Rui Song et.al. 2503.06744 null
2025-03-09 MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation Chenfei Liao et.al. 2503.06700 null
2025-03-09 Asymmetric Decision-Making in Online Knowledge Distillation:Unifying Consensus and Divergence Zhaowei Chen et.al. 2503.06685 null
2025-03-09 Steerable Pyramid Weighted Loss: Multi-Scale Adaptive Weighting for Semantic Segmentation Renhao Lu et.al. 2503.06604 null
2025-03-09 MultiCo3D: Multi-Label Voxel Contrast for One-Shot Incremental Segmentation of 3D Neuroimages Hao Xu et.al. 2503.06598 null
2025-03-08 ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation Qizhen Lan et.al. 2503.06307 null
2025-03-11 PointDiffuse: A Dual-Conditional Diffusion Model for Enhanced Point Cloud Semantic Segmentation Yong He et.al. 2503.06094 null
2025-03-07 Kaiwu: A Multimodal Manipulation Dataset and Framework for Robot Learning and Human-Robot Interaction Shuo Jiang et.al. 2503.05231 null
2025-03-08 EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images Rohit Menon et.al. 2503.04441 null
2025-03-06 PointsToWood: A deep learning framework for complete canopy leaf-wood segmentation of TLS data across diverse European forests Harry J. F. Owen et.al. 2503.04420 null
2025-03-06 Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes Hui Zhang et.al. 2503.04235 null
2025-03-06 MASTER: Multimodal Segmentation with Text Prompts Fuyang Liu et.al. 2503.04199 null
2025-03-06 Towards Intelligent Transportation with Pedestrians and Vehicles In-the-Loop: A Surveillance Video-Assisted Federated Digital Twin Framework Xiaolong Li et.al. 2503.04170 null
2025-03-06 H3O: Hyper-Efficient 3D Occupancy Prediction with Heterogeneous Supervision Yunxiao Shi et.al. 2503.04059 null
2025-03-06 GaussianGraph: 3D Gaussian-based Scene Graph Generation for Open-world Scene Understanding Xihan Wang et.al. 2503.04034 null
2025-03-06 DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation Amin Karimi et.al. 2503.04006 null
2025-03-05 COARSE: Collaborative Pseudo-Labeling with Coarse Real Labels for Off-Road Semantic Segmentation Aurelio Noca et.al. 2503.03947 null
2025-03-05 SurgiSAM2: Fine-tuning a foundational model for surgical video anatomy segmentation and detection Devanish N. Kamtam et.al. 2503.03942 null
2025-03-05 Golden Cudgel Network for Real-Time Semantic Segmentation Guoyu Yang et.al. 2503.03325 link
2025-03-05 Label-Efficient LiDAR Semantic Segmentation with 2D-3D Vision Transformer Adapters Julia Hindel et.al. 2503.03299 null
2025-03-05 Car-STAGE: Automated framework for large-scale high-dimensional simulated time-series data generation based on user-defined criteria Asma A. Almutairi et.al. 2503.03100 null
2025-03-04 Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance Jiayi Zhao et.al. 2503.02581 link
2025-03-04 TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping Xinying Hong et.al. 2503.02578 link
2025-03-04 Exploring Token-Level Augmentation in Vision Transformer for Semi-Supervised Semantic Segmentation Dengke Zhang et.al. 2503.02459 link
2025-03-03 SAGE: A Framework of Precise Retrieval for RAG Jintao Zhang et.al. 2503.01713 null
2025-03-04 UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface Hao Tang et.al. 2503.01342 link
2025-03-03 Convex Hull-based Algebraic Constraint for Visual Quadric SLAM Xiaolong Yu et.al. 2503.01254 link
2025-03-03 Identity documents recognition and detection using semantic segmentation with convolutional neural network Mykola Kozlenko et.al. 2503.01085 null
2025-03-02 Using Synthetic Images to Augment Small Medical Image Datasets Minh H. Vu et.al. 2503.00962 null
2025-03-02 Unifying Light Field Perception with Field of Parallax Fei Teng et.al. 2503.00747 link
2025-03-01 Explainable LiDAR 3D Point Cloud Segmentation and Clustering for Detecting Airplane-Generated Wind Turbulence Zhan Qu et.al. 2503.00518 null
2025-02-27 Multi-Scale Neighborhood Occupancy Masked Autoencoder for Self-Supervised Learning in LiDAR Point Clouds Mohamed Abdelsamad et.al. 2502.20316 null
2025-02-27 OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels Meng Lou et.al. 2502.20087 link
2025-02-28 SegLocNet: Multimodal Localization Network for Autonomous Driving via Bird’s-Eye-View Segmentation Zijie Zhou et.al. 2502.20077 link
2025-03-04 3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds Hengshuo Chu et.al. 2502.20041 null
2025-02-27 Learning Mask Invariant Mutual Information for Masked Image Modeling Tao Huang et.al. 2502.19718 null
2025-02-26 Knowledge Distillation for Semantic Segmentation: A Label Space Unification Approach Anton Backhaus et.al. 2502.19177 null
2025-02-26 Enhanced Neuromorphic Semantic Segmentation Latency through Stream Event D. Hareb et.al. 2502.18982 null
2025-02-22 Multi-Teacher Knowledge Distillation with Reinforcement Learning for Visual Recognition Chuanguang Yang et.al. 2502.18510 null
2025-02-28 OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation Yunpeng Gao et.al. 2502.18041 null
2025-02-25 CAML: Collaborative Auxiliary Modality Learning for Multi-Agent Systems Rui Liu et.al. 2502.17821 null
2025-02-25 DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks Canyu Zhao et.al. 2502.17157 link
2025-02-24 SpecDM: Hyperspectral Dataset Synthesis with Pixel-level Semantic Annotations Wendi Liu et.al. 2502.17056 null
2025-02-25 VPNeXt – Rethinking Dense Decoding for Plain Vision Transformer Xikai Tang et.al. 2502.16654 null
2025-02-23 Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration Kim Jun-Seong et.al. 2502.16652 null
2025-02-23 OpenVox: Real-time Instance-level Open-vocabulary Probabilistic Voxel Representation Yinan Deng et.al. 2502.16528 null
2025-02-23 Deep learning approaches to surgical video segmentation and object detection: A Scoping Review Devanish N. Kamtam et.al. 2502.16459 null
2025-02-22 Importance-Aware Source-Channel Coding for Multi-Modal Task-Oriented Semantic Communication Yi Ma et.al. 2502.16194 null
2025-02-22 FeatSharp: Your Vision Model Features, Sharper Mike Ranzinger et.al. 2502.16025 null
2025-02-22 Cross-Model Transferability of Adversarial Patches in Real-time Segmentation for Autonomous Driving Prashant Shekhar et.al. 2502.16012 link
2025-02-21 Graph Attention Convolutional U-NET: A Semantic Segmentation Model for Identifying Flooded Areas Muhammad Umair Danish et.al. 2502.15907 null
2025-02-21 DOEI: Dual Optimization of Embedding Information for Attention-Enhanced Class Activation Maps Hongjie Zhu et.al. 2502.15885 link
2025-02-21 Aligning Task- and Reconstruction-Oriented Communications for Edge Intelligence Yufeng Diao et.al. 2502.15472 null
2025-02-24 DynamicGSG: Dynamic 3D Gaussian Scene Graphs for Environment Adaptation Luzhou Ge et.al. 2502.15309 link
2025-02-21 Confidence-Weighted Boundary-Aware Learning for Semi-Supervised Semantic Segmentation Ebenezer Tarubinga et.al. 2502.15152 link
2025-02-20 RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird’s Eye View Segmentation Henrique Piñeiro Monteagudo et.al. 2502.14792 null
2025-02-20 Multi-dataset synergistic in supervised learning to pre-label structural components in point clouds from shell construction scenes Lukas Rauch et.al. 2502.14721 null
2025-02-20 Reliable Explainability of Deep Learning Spatial-Spectral Classifiers for Improved Semantic Segmentation in Autonomous Driving Jon Gutiérrez-Zaballa et.al. 2502.14416 null
2025-02-20 Bayesian SegNet for Semantic Segmentation with Improved Interpretation of Microstructural Evolution During Irradiation of Materials Marjolein Oostrom et.al. 2502.14184 null
2025-02-19 SegRet: An Efficient Design for Semantic Segmentation with Retentive Network Zhiyuan Li et.al. 2502.14014 link
2025-02-19 Remote Sensing Semantic Segmentation Quality Assessment based on Vision Language Model Huiying Shi et.al. 2502.13990 null
2025-02-19 MGFI-Net: A Multi-Grained Feature Integration Network for Enhanced Medical Image Segmentation Yucheng Zeng et.al. 2502.13808 null
2025-02-19 CARE: Confidence-Aware Regression Estimation of building density fine-tuning EO Foundation Models Nikolaos Dionelis et.al. 2502.13734 null
2025-02-18 Enhancing Power Grid Inspections with Machine Learning Diogo Lavado et.al. 2502.13037 null
2025-02-18 DAMamba: Vision State Space Model with Dynamic Adaptive Scan Tanzhe Li et.al. 2502.12627 link
2025-02-17 From Open-Vocabulary to Vocabulary-Free Semantic Segmentation Klara Reichard et.al. 2502.11891 null
2025-02-16 Detecting Cadastral Boundary from Satellite Images Using U-Net model Neda Rahimpour Anaraki et.al. 2502.11044 null
2025-02-15 NPSim: Nighttime Photorealistic Simulation From Daytime Images With Monocular Inverse Rendering and Ray Tracing Shutong Zhang et.al. 2502.10720 null
2025-02-15 Deep Learning for Wound Tissue Segmentation: A Comprehensive Evaluation using A Novel Dataset Muhammad Ashad Kabir et.al. 2502.10652 link
2025-02-14 Artificial Intelligence to Assess Dental Findings from Panoramic Radiographs – A Multinational Study Yin-Chih Chelsea Wang et.al. 2502.10277 link
2025-02-13 SQ-GAN: Semantic Image Communications Using Masked Vector Quantization Francesco Pezone et.al. 2502.09520 link
2025-02-13 FLARES: Fast and Accurate LiDAR Multi-Range Semantic Segmentation Bin Yang et.al. 2502.09274 null
2025-02-17 Memory-based Ensemble Learning in CMR Semantic Segmentation Yiwei Liu et.al. 2502.09269 link
2025-02-13 Latents of latents to delineate pixels: hybrid Matryoshka autoencoder-to-U-Net pairing for segmenting large medical images in GPU-poor and low-data regimes Tahir Syed et.al. 2502.08988 null
2025-02-17 Knowledge Swapping via Learning and Unlearning Mingyu Xing et.al. 2502.08075 link
2025-02-11 Efficient Continuous Group Convolutions for Local SE(3) Equivariance in 3D Point Clouds Lisa Weijler et.al. 2502.07505 link
2025-02-11 A Survey on Mamba Architecture for Vision Applications Fady Ibrahim et.al. 2502.07161 null
2025-02-09 A Comprehensive Review of U-Net and Its Variants: Advances and Applications in Medical Image Segmentation Wang Jiangtao et.al. 2502.06895 null
2025-02-10 SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement Yuqi Lin et.al. 2502.06756 link
2025-02-11 Enhancing Ground-to-Aerial Image Matching for Visual Misinformation Detection Using Semantic Segmentation Emanuele Mule et.al. 2502.06288 link
2025-02-10 Unsupervised deep learning for semantic segmentation of multispectral LiDAR forest point clouds Lassi Ruoppa et.al. 2502.06227 null
2025-02-12 Traveling Waves Integrate Spatial Information Into Spectral Representations Mozes Jacobs et.al. 2502.06034 link
2025-02-09 LegalSeg: Unlocking the Structure of Indian Legal Judgments Through Rhetorical Role Classification Shubham Kumar Nigam et.al. 2502.05836 null
2025-02-08 Convolutional Neural Network Segmentation for Satellite Imagery Data to Identify Landforms Using U-Net Architecture Mitul Goswami et.al. 2502.05476 null
2025-02-08 LMS-Net: A Learned Mumford-Shah Network For Few-Shot Medical Image Segmentation Shengdong Zhang et.al. 2502.05473 null
2025-02-08 A Novel Convolutional-Free Method for 3D Medical Imaging Segmentation Canxuan Gang et.al. 2502.05396 null
2025-02-07 IPSeg: Image Posterior Mitigates Semantic Drift in Class-Incremental Segmentation Xiao Yu et.al. 2502.04870 link
2025-02-05 DILLEMA: Diffusion and Large Language Models for Multi-Modal Augmentation Luciano Baresi et.al. 2502.04378 link
2025-02-06 Adaptive Margin Contrastive Learning for Ambiguity-aware 3D Semantic Segmentation Yang Chen et.al. 2502.04111 null
2025-02-06 LeAP: Consistent multi-domain 3D labeling using Foundation Models Simon Gebraad et.al. 2502.03901 null
2025-02-06 Optimized Unet with Attention Mechanism for Multi-Scale Semantic Segmentation Xuan Li et.al. 2502.03813 null
2025-02-05 Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics Indrashis Das et.al. 2502.03654 link
2025-02-08 Disentangling CLIP Features for Enhanced Localized Understanding Samyak Rawlekar et.al. 2502.02977 null
2025-02-05 From DeepSense to Open RAN: AI/ML Advancements in Dynamic Spectrum Sensing and Their Applications Ryan Barker et.al. 2502.02889 null
2025-02-04 Muographic Image Upsampling with Machine Learning for Built Infrastructure Applications William O’Donnell et.al. 2502.02624 null
2025-02-04 Transfer Risk Map: Mitigating Pixel-level Negative Transfer in Medical Segmentation Shutong Duan et.al. 2502.02340 null
2025-02-04 UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation Tao Zhang et.al. 2502.02257 link
2025-02-04 Deep Ensemble approach for Enhancing Brain Tumor Segmentation in Resource-Limited Settings Jeremiah Fadugba et.al. 2502.02179 null
2025-02-04 Memory Efficient Transformer Adapter for Dense Predictions Dong Zhang et.al. 2502.01962 null
2025-02-03 Deep Unfolding Multi-modal Image Fusion Network via Attribution Analysis Haowen Bai et.al. 2502.01467 null
2025-02-03 Temporal-consistent CAMs for Weakly Supervised Video Segmentation in Waste Sorting Andrea Marelli et.al. 2502.01455 null
2025-02-03 ConceptVAE: Self-Supervised Fine-Grained Concept Disentanglement from 2D Echocardiographies Costin F. Ciusdel et.al. 2502.01335 null
2025-02-03 FSPGD: Rethinking Black-box Attacks on Semantic Segmentation Eun-Sol Park et.al. 2502.01262 link
2025-02-03 Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models Tongkun Liu et.al. 2502.01216 link
2025-02-02 SAM-guided Pseudo Label Enhancement for Multi-modal 3D Semantic Segmentation Mingyu Yang et.al. 2502.00960 null
2025-02-01 Complex Wavelet Mutual Information Loss: A Multi-Scale Loss Function for Semantic Segmentation Renhao Lu et.al. 2502.00563 link
2025-01-31 Lifting by Gaussians: A Simple, Fast and Flexible Method for 3D Instance Segmentation Rohan Chacko et.al. 2502.00173 null
2025-01-31 CerraData-4MM: A multimodal benchmark dataset on Cerrado for land use and land cover classification Mateus de Souza Miranda et.al. 2502.00083 link
2025-01-31 GO: The Great Outdoors Multimodal Dataset Peng Jiang et.al. 2501.19274 null
2025-01-31 Medical Semantic Segmentation with Diffusion Pretrain David Li et.al. 2501.19265 null
2025-01-31 ContextFormer: Redefining Efficiency in Semantic Segmentation Mian Muhammad Naeem Abid et.al. 2501.19255 null
2025-01-31 Integrating Semi-Supervised and Active Learning for Semantic Segmentation Wanli Ma et.al. 2501.19227 null
2025-01-31 SynthmanticLiDAR: A Synthetic Dataset for Semantic Segmentation on LiDAR Imaging Javier Montalvo et.al. 2501.19035 link
2025-01-31 Project-and-Fuse: Improving RGB-D Semantic Segmentation via Graph Convolution Networks Xiaoyan Jiang et.al. 2501.18851 null
2025-02-03 Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models Hao Dong et.al. 2501.18592 link
2025-01-30 Ground Awareness in Deep Learning for Large Outdoor Point Cloud Segmentation Kevin Qiu et.al. 2501.18246 null
2025-01-29 Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation Lin Chen et.al. 2501.17642 null
2025-01-29 3DSES: an indoor Lidar point cloud segmentation dataset with real and pseudo-labels from a 3D model Maxime Mérizette et.al. 2501.17534 null
2025-01-29 Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models Muhammad Atta ur Rahman et.al. 2501.16769 null
2025-01-28 AdaSemSeg: An Adaptive Few-shot Semantic Segmentation of Seismic Facies Surojit Saha et.al. 2501.16760 null
2025-01-28 SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios Yinqi Chen et.al. 2501.16754 null
2025-01-27 Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation Philip Hughes et.al. 2501.16467 null
2025-01-27 DynAlign: Unsupervised Dynamic Taxonomy Alignment for Cross-Domain Segmentation Han Sun et.al. 2501.16410 null
2025-01-27 The Linear Attention Resurrection in Vision Transformer Chuanyang Zheng et.al. 2501.16182 null
2025-01-27 D-PLS: Decoupled Semantic Segmentation for 4D-Panoptic-LiDAR-Segmentation Maik Steinhauser et.al. 2501.15870 null
2025-01-26 iFormer: Integrating ConvNet and Transformer for Mobile Application Chuanyang Zheng et.al. 2501.15369 link
2025-01-25 A Training-free Synthetic Data Selection Method for Semantic Segmentation Hao Tang et.al. 2501.15201 link
2025-01-24 3DLabelProp: Geometric-Driven Domain Generalization for LiDAR Semantic Segmentation in Autonomous Driving Jules Sanchez et.al. 2501.14605 link
2025-01-23 ME-CPT: Multi-Task Enhanced Cross-Temporal Point Transformer for Urban 3D Change Detection Luqi Zhang et.al. 2501.14004 link
2025-01-23 IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models Jiayi Lei et.al. 2501.13920 null
2025-01-23 Where Do You Go? Pedestrian Trajectory Prediction using Scene Features Mohammad Ali Rezaei et.al. 2501.13848 null
2025-01-23 Overcoming Support Dilution for Robust Few-shot Semantic Segmentation Wailing Tang et.al. 2501.13529 null
2025-01-22 Revisiting Data Augmentation for Ultrasound Images Adam Tupper et.al. 2501.13193 link
2025-01-22 A Novel Scene Coupling Semantic Mask Network for Remote Sensing Image Segmentation Xiaowen Ma et.al. 2501.13130 link
2025-01-22 Hybridization of Attention UNet with Repeated Atrous Spatial Pyramid Pooling for Improved Brain Tumour Segmentation Satyaki Roy Chowdhury et.al. 2501.13129 null
2025-01-22 Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks Alessio Quercia et.al. 2501.12824 link
2025-01-19 Comparative Analysis of Hand-Crafted and Machine-Driven Histopathological Features for Prostate Cancer Classification and Segmentation Feda Bolus Al Baqain et.al. 2501.12415 null
2025-01-21 Benchmarking Image Perturbations for Testing Automated Driving Assistance Systems Stefano Carlo Lambertenghi et.al. 2501.12269 link
2025-01-21 A margin-based replacement for cross-entropy loss Michael W. Spratling et.al. 2501.12191 null
2025-01-20 MedicoSAM: Towards foundation models for medical image segmentation Anwai Archit et.al. 2501.11734 link
2025-01-20 Automatic Labelling & Semantic Segmentation with 4D Radar Tensors Botao Sun et.al. 2501.11351 null
2025-01-20 Enhancing Uncertainty Estimation in Semantic Segmentation via Monte-Carlo Frequency Dropout Tal Zeevi et.al. 2501.11258 link
2025-01-19 Rethinking Early-Fusion Strategies for Improved Multimodal Image Segmentation Zhengwen Shen et.al. 2501.10958 null
2025-01-22 OpenEarthMap-SAR: A Benchmark Synthetic Aperture Radar Dataset for Global High-Resolution Land Cover Mapping Junshi Xia et.al. 2501.10891 null
2025-01-18 GAUDA: Generative Adaptive Uncertainty-guided Diffusion-based Augmentation for Surgical Segmentation Yannik Frisch et.al. 2501.10819 null
2025-01-18 Semi-supervised Semantic Segmentation for Remote Sensing Images via Multi-scale Uncertainty Consistency and Cross-Teacher-Student Attention Shanwen Wang et.al. 2501.10736 link
2025-01-17 Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks Michael Schwingshackl et.al. 2501.10080 link
2025-01-17 Robust Change Captioning in Remote Sensing: SECOND-CC Dataset and MModalCC Framework Ali Can Karaca et.al. 2501.10075 null
2025-01-17 One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression Keita Miwa et.al. 2501.10064 null
2025-01-17 LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks Wei Lu et.al. 2501.10040 link
2025-01-16 The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning Wonjun Jo et.al. 2501.09485 null
2025-01-16 Scaling up self-supervised learning for improved surgical foundation models Tim J. M. Jaspers et.al. 2501.09436 link
2025-01-16 SVIA: A Street View Image Anonymization Framework for Self-Driving Applications Dongyu Liu et.al. 2501.09393 link
2025-01-15 UNIR-Net: A Novel Approach for Restoring Underwater Images with Non-Uniform Illumination Using Synthetic Data Ezequiel Perez-Zarate et.al. 2501.09053 link
2025-01-15 Pseudolabel guided pixels contrast for domain adaptive semantic segmentation Jianzi Xiang et.al. 2501.09040 link
2025-01-14 FLAVARS: A Multimodal Foundational Language and Vision Alignment Model for Remote Sensing Isaac Corley et.al. 2501.08490 null
2025-01-14 Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers Efstathios Karypidis et.al. 2501.08303 link
2025-01-14 A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation Steven Landgraf et.al. 2501.08188 null
2025-01-14 Threshold Attention Network for Semantic Segmentation of Remote Sensing Images Wei Long et.al. 2501.07984 null
2025-01-14 Balance Divergence for Knowledge Distillation Yafei Qi et.al. 2501.07804 null
2025-01-13 Kolmogorov-Arnold Network for Remote Sensing Image Semantic Segmentation Xianping Ma et.al. 2501.07390 link
2025-01-13 Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion Li Liang et.al. 2501.07260 link
2025-01-12 LarvSeg: Exploring Image Classification Data For Large Vocabulary Semantic Segmentation via Category-wise Attentive Classifier Haojun Yu et.al. 2501.06862 link
2025-01-12 SAM-DA: Decoder Adapter for Efficient Medical Domain Adaptation Javier Gamazo Tejero et.al. 2501.06836 null
2025-01-11 Parking Space Detection in the City of Granada Crespo-Orti Luis et.al. 2501.06651 link
2025-01-06 The 2nd Place Solution from the 3D Semantic Segmentation Track in the 2024 Waymo Open Dataset Challenge Qing Wu et.al. 2501.05472 null
2025-01-09 Domain-Incremental Semantic Segmentation for Autonomous Driving under Adverse Driving Conditions Shishir Muralidhara et.al. 2501.05246 null
2025-01-09 Advancing ALS Applications with Large-Scale Pre-training: Dataset Development and Downstream Assessment Haoyi Xiu et.al. 2501.05095 link
2025-01-08 Test-Time Optimization for Domain Adaptive Open Vocabulary Segmentation Ulindu De Silva et.al. 2501.04696 link
2025-01-07 Superpixel Boundary Correction for Weakly-Supervised Semantic Segmentation on Histopathology Images Hongyi Wu et.al. 2501.03891 null
2025-01-07 Image Segmentation: Inducing graph-based learning Aryan Singh et.al. 2501.03765 link
2025-01-06 4D-CS: Exploiting Cluster Prior for 4D Spatio-Temporal LiDAR Semantic Segmentation Jiexi Zhong et.al. 2501.02937 null
2025-01-08 GLoG-CSUnet: Enhancing Vision Transformers with Adaptable Radiomic Features for Medical Image Segmentation Niloufar Eghbali et.al. 2501.02788 link
2025-01-04 Unsupervised Class Generation to Expand Semantic Segmentation Datasets Javier Montalvo et.al. 2501.02264 null
2025-01-03 Semantic Segmentation for Sequential Historical Maps by Learning from Only One Map Yunshuang Yuan et.al. 2501.01845 null
2025-01-03 IAM: Enhancing RGB-D Instance Segmentation with New Benchmarks Aecheon Jung et.al. 2501.01685 link
2025-01-03 Uncertainty and Energy based Loss Guided Semi-Supervised Semantic Segmentation Rini Smita Thakur et.al. 2501.01640 null
2025-01-02 A Multi-task Supervised Compression Model for Split Computing Yoshitomo Matsubara et.al. 2501.01420 link
2025-01-03 FGAseg: Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation Bingyu Li et.al. 2501.00877 link
2024-12-31 H-Net: A Multitask Architecture for Simultaneous 3D Force Estimation and Stereo Semantic Segmentation in Intracardiac Catheters Pedram Fekri et.al. 2501.00514 null
2024-12-31 PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM Runnan Chen et.al. 2501.00352 null
2024-12-31 OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies Runnan Chen et.al. 2501.00326 null
2024-12-30 HisynSeg: Weakly-Supervised Histopathological Image Segmentation via Image-Mixing Synthesis and Consistency Regularization Zijie Fang et.al. 2412.20924 link
2024-12-30 LiDAR-Camera Fusion for Video Panoptic Segmentation without Video Training Fardin Ayar et.al. 2412.20881 null
2024-12-29 Image Augmentation Agent for Weakly Supervised Semantic Segmentation Wangyu Wu et.al. 2412.20439 null
2024-12-27 Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP Zhongxing Xu et.al. 2412.19650 null
2024-12-27 An Actionable Hierarchical Scene Representation Enhancing Autonomous Inspection Missions in Unknown Environments Vignesh Kottayam Viswanathan et.al. 2412.19582 null
2024-12-27 Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation Chengyang Ye et.al. 2412.19492 link
2024-12-26 Impact of color and mixing proportion of synthetic point clouds on semantic segmentation Shaojie Zhou et.al. 2412.19145 link
2024-12-24 AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction Pufan Zou et.al. 2412.18255 null
2024-12-25 VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis Shicheng Yin et.al. 2412.18178 link
2024-12-24 UniPLV: Towards Label-Efficient Open-World 3D Scene Understanding by Regional Visual Language Supervision Yuru Wang et.al. 2412.18131 null
2024-12-24 LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding Hao Li et.al. 2412.17635 null
2024-12-25 AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic Segmentation Jiaqi Ma et.al. 2412.17601 link
2024-12-24 Uncertainty-Participation Context Consistency Learning for Semi-supervised Semantic Segmentation Jianjian Yin et.al. 2412.17331 link
2024-12-22 Multi-Scale Foreground-Background Confidence for Out-of-Distribution Segmentation Samuel Marschall et.al. 2412.16990 null
2024-12-22 Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection Yuhang Gan et.al. 2412.16918 null
2024-12-22 MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection Xu Zheng et.al. 2412.16876 null
2024-12-22 Adversarial Diffusion Model for Unsupervised Domain-Adaptive Semantic Segmentation Jongmin Yu et.al. 2412.16859 null
2024-12-21 A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection Shahid Ansari et.al. 2412.16755 null
2024-12-21 IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks Yaming Zhang et.al. 2412.16654 link
2024-12-21 V”Mean”ba: Visual State Space Models only need 1 hidden dimension Tien-Yu Chi et.al. 2412.16602 null
2024-12-21 Leveraging Contrastive Learning for Semantic Segmentation with Consistent Labels Across Varying Appearances Javier Montalvo et.al. 2412.16592 null
2024-12-20 DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment Cijo Jose et.al. 2412.16334 null
2024-12-20 SegCol Challenge: Semantic Segmentation for Tools and Fold Edges in Colonoscopy data Xinwei Ju et.al. 2412.16078 link
2024-12-20 Enhancing Generalized Few-Shot Semantic Segmentation via Effective Knowledge Transfer Xinyue Chen et.al. 2412.15835 link
2024-12-19 GIRAFE: Glottal Imaging Dataset for Advanced Segmentation, Analysis, and Facilitative Playbacks Evaluation G. Andrade-Miranda et.al. 2412.15054 link
2024-12-19 PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic Segmentation Shoumeng Qiu et.al. 2412.14821 link
2024-12-19 Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation Zhenxin Lei et.al. 2412.14587 link
2024-12-18 Split Learning in Computer Vision for Semantic Segmentation Delay Minimization Nikos G. Evgenidis et.al. 2412.14272 null
2024-12-18 Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation Jianyu Zhang et.al. 2412.14145 null
2024-12-18 Prompt Categories Cluster for Weakly Supervised Semantic Segmentation Wangyu Wu et.al. 2412.13823 null
2024-12-18 Federated Source-free Domain Adaptation for Classification: Weighted Cluster Aggregation for Unlabeled Data Junki Mori et.al. 2412.13757 null
2024-12-18 Optical aberrations in autonomous driving: Physics-informed parameterized temperature scaling for neural network uncertainty calibration Dominik Werner Wolf et.al. 2412.13695 null
2024-12-18 GAGS: Granularity-Aware Feature Distillation for Language Gaussian Splatting Yuning Peng et.al. 2412.13654 null
2024-12-17 S2S2: Semantic Stacking for Robust Semantic Segmentation in Medical Imaging Yimu Pan et.al. 2412.13156 link
2024-12-17 Efficient Event-based Semantic Segmentation with Spike-driven Lightweight Transformer-based Networks Xiaxin Zhu et.al. 2412.12843 null
2024-12-17 Open-World Panoptic Segmentation Matteo Sodano et.al. 2412.12740 null
2024-12-17 SemStereo: Semantic-Constrained Stereo Matching Network for Remote Sensing Chen Chen et.al. 2412.12685 null
2024-12-17 Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation Dongyue Wu et.al. 2412.12672 link
2024-12-17 Adaptive Prototype Replay for Class Incremental Semantic Segmentation Guilin Zhu et.al. 2412.12669 link
2024-12-17 SEG-SAM: Semantic-Guided SAM for Unified Medical Image Segmentation Shuangping Huang et.al. 2412.12660 null
2024-12-16 Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation Hongwei Niu et.al. 2412.12050 link
2024-12-16 SAMIC: Segment Anything with In-Context Spatial Prompt Engineering Savinay Nagendra et.al. 2412.11998 null
2024-12-16 SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation Yunxiang Fu et.al. 2412.11890 link
2024-12-16 Towards Adversarial Robustness of Model-Level Mixture-of-Experts Architectures for Semantic Segmentation Svetlana Pavlitska et.al. 2412.11608 link
2024-12-15 MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation Zhiwei Yang et.al. 2412.11076 link
2024-12-14 RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone Mustafa Munir et.al. 2412.10995 link
2024-12-14 DCSEG: Decoupled 3D Open-Set Segmentation using Gaussian Splatting Luis Wiedmann et.al. 2412.10972 link
2024-12-14 SegACIL: Solving the Stability-Plasticity Dilemma in Class-Incremental Semantic Segmentation Jiaxu Li et.al. 2412.10834 link
2024-12-14 Neural Network Meta Classifier: Improving the Reliability of Anomaly Segmentation Jurica Runtas et.al. 2412.10765 link
2024-12-14 OmniHD-Scenes: A Next-Generation Multimodal Dataset for Autonomous Driving Lianqing Zheng et.al. 2412.10734 null
2024-12-13 A Universal Degradation-based Bridging Technique for Domain Adaptive Semantic Segmentation Wangkai Li et.al. 2412.10339 null
2024-12-13 SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians Siyun Liang et.al. 2412.10231 null
2024-12-13 Object-Focused Data Selection for Dense Prediction Tasks Niclas Popp et.al. 2412.10032 null
2024-12-12 Towards Open-Vocabulary Video Semantic Segmentation Xinhao Li et.al. 2412.09329 link
2024-12-16 FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation Yuntian Bo et.al. 2412.09319 link
2024-12-12 VLMs meet UDA: Boosting Transferability of Open Vocabulary Segmentation with Unsupervised Domain Adaptation Roberto Alcover-Couso et.al. 2412.09240 null
2024-12-11 A Deep Semantic Segmentation Network with Semantic and Contextual Refinements Zhiyan Wang et.al. 2412.08671 null
2024-12-11 A feature refinement module for light-weight semantic segmentation network Zhiyan Wang et.al. 2412.08670 null
2024-12-11 SegFace: Face Segmentation of Long-Tail Classes Kartik Narayan et.al. 2412.08647 link
2024-12-11 EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation Hongwei Niu et.al. 2412.08628 link
2024-12-12 Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning Fan Lu et.al. 2412.08614 link
2024-12-11 Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction Bohan Li et.al. 2412.08243 null
2024-12-11 THUD++: Large-Scale Dynamic Indoor Scene Dataset and Benchmark for Mobile Robots Zeshun Li et.al. 2412.08096 null
2024-12-11 Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation Zhigang Cen et.al. 2412.08034 null
2024-12-09 SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception Yaniv Benny et.al. 2412.06968 null
2024-12-10 ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet Andrei-Robert Alexandrescu et.al. 2412.06742 null
2024-12-09 Active Learning with Context Sampling and One-vs-Rest Entropy for Semantic Segmentation Fei Wu et.al. 2412.06470 null
2024-12-09 GCUNet: A GNN-Based Contextual Learning Network for Tertiary Lymphoid Structure Semantic Segmentation in Whole Slide Image Lei Su et.al. 2412.06129 null
2024-12-12 Efficient Semantic Splatting for Remote Sensing Multi-view Segmentation Zipeng Qi et.al. 2412.05969 null
2024-12-08 CSG: A Context-Semantic Guided Diffusion Approach in De Novo Musculoskeletal Ultrasound Image Generation Elay Dahan et.al. 2412.05833 null
2024-12-10 RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts Xu Liu et.al. 2412.05679 link
2024-12-06 FogROS2-FT: Fault Tolerant Cloud Robotics Kaiyuan Chen et.al. 2412.05408 null
2024-12-06 Generative Model-Based Fusion for Improved Few-Shot Semantic Segmentation of Infrared Images Junno Yun et.al. 2412.05341 null
2024-12-05 Assessing and Learning Alignment of Unimodal Vision and Language Models Le Zhang et.al. 2412.04616 null
2024-12-05 A Hitchhiker’s Guide to Understanding Performances of Two-Class Classifiers Anaïs Halin et.al. 2412.04377 null
2024-12-05 Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts Chenyang Zhu et.al. 2412.04220 null
2024-12-05 Text Change Detection in Multilingual Documents Using Image Comparison Doyoung Park et.al. 2412.04137 null
2024-12-05 SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning Seokju Yun et.al. 2412.04077 link
2024-12-05 Quality Control in Open-Ended Crowdsourcing: A Survey Lei Chai et.al. 2412.03991 null
2024-12-05 Exact: Exploring Space-Time Perceptive Clues for Weakly Supervised Satellite Image Time Series Semantic Segmentation Hao Zhu et.al. 2412.03968 link
2024-12-05 LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model Yuan Xue et.al. 2412.03841 null
2024-12-04 Designing DNNs for a trade-off between robustness and processing performance in embedded devices Jon Gutiérrez-Zaballa et.al. 2412.03682 null
2024-12-04 Evaluating Single Event Upsets in Deep Neural Networks for Semantic Segmentation: an embedded system perspective Jon Gutiérrez-Zaballa et.al. 2412.03630 link
2024-12-04 FLAIR: VLM with Fine-grained Language-informed Image Representations Rui Xiao et.al. 2412.03561 link
2024-12-04 Benchmarking Pretrained Attention-based Models for Real-Time Recognition in Robot-Assisted Esophagectomy Ronald L. P. D. de Jong et.al. 2412.03401 null
2024-12-04 Task-driven Image Fusion with Learnable Fusion Loss Haowen Bai et.al. 2412.03240 null
2024-12-04 Biologically-inspired Semi-supervised Semantic Segmentation for Biomedical Imaging Luca Ciampi et.al. 2412.03192 null
2024-12-04 Is Foreground Prototype Sufficient? Few-Shot Medical Image Segmentation with Background-Fused Prototype Song Tang et.al. 2412.02983 null
2024-12-04 Progressive Vision-Language Prompt for Multi-Organ Multi-Class Cell Semantic Segmentation with Single Branch Qing Zhang et.al. 2412.02978 null
2024-12-04 Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution Jiahua Xiao et.al. 2412.02960 null
2024-12-03 SJTU:Spatial judgments in multimodal models towards unified segmentation through coordinate detection Joongwon Chae et.al. 2412.02565 link
2024-12-03 Multi-scale and Multi-path Cascaded Convolutional Network for Semantic Segmentation of Colorectal Polyps Malik Abdul Manan et.al. 2412.02443 null
2024-12-03 AH-OCDA: Amplitude-based Curriculum Learning and Hopfield Segmentation Model for Open Compound Domain Adaptation Jaehyun Choi et.al. 2412.02280 null
2024-12-03 Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance Jing Zeng et.al. 2412.02249 null
2024-12-02 INSIGHT: Explainable Weakly-Supervised Medical Image Analysis Wenbo Zhang et.al. 2412.02012 null
2024-12-02 Global Average Feature Augmentation for Robust Semantic Segmentation with Transformers Alberto Gonzalo Rodriguez Salgado et.al. 2412.01941 null
2024-12-02 COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training Sanghwan Kim et.al. 2412.01814 link
2024-12-02 Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior Yi Yu et.al. 2412.01646 null
2024-12-02 Epipolar Attention Field Transformers for Bird’s Eye View Semantic Segmentation Christian Witte et.al. 2412.01595 null
2024-12-01 Token Cropr: Faster ViTs for Quite a Few Tasks Benjamin Bergner et.al. 2412.00965 link
2024-12-03 DPE-Net: Dual-Parallel Encoder Based Network for Semantic Segmentation of Polyps Malik Abdul Manan et.al. 2412.00888 null
2024-12-01 2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification Jingwei Zhang et.al. 2412.00678 link
2024-11-30 Density-aware Global-Local Attention Network for Point Cloud Segmentation Chade Li et.al. 2412.00489 null
2024-11-29 LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention Zewen Du et.al. 2411.19585 link
2024-11-29 Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding Wenbo Zhang et.al. 2411.19551 link
2024-11-29 Retrieval-guided Cross-view Image Synthesis Hongji Yang et.al. 2411.19510 null
2024-11-28 GMS-VINS:Multi-category Dynamic Objects Semantic Segmentation for Enhanced Visual-Inertial Odometry Using a Promptable Foundation Model Rui Zhou et.al. 2411.19289 null
2024-11-28 MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers Jongseong Bae et.al. 2411.18995 null
2024-11-28 Textured As-Is BIM via GIS-informed Point Cloud Segmentation Mohamed S. H. Alabassy et.al. 2411.18898 null
2024-11-27 The Last Mile to Supervised Performance: Semi-Supervised Domain Adaptation for Semantic Segmentation Daniel Morales-Brotons et.al. 2411.18728 null
2024-11-27 HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior Li-Yuan Tsao et.al. 2411.18662 link
2024-11-26 Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation Sudarshan Rajagopalan et.al. 2411.17814 null
2024-12-02 Efficient Multi-modal Large Language Models via Visual Token Grouping Minbin Huang et.al. 2411.17773 null
2024-11-26 Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-based Semantic Segmentation Niharika Hegde et.al. 2411.17610 null
2024-11-26 Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving Jon Gutiérrez-Zaballa et.al. 2411.17543 null
2024-11-26 Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning Hoàng-Ân Lê et.al. 2411.17536 link
2024-11-26 TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba Xiaowen Ma et.al. 2411.17473 link
2024-11-26 MRIFE: A Mask-Recovering and Interactive-Feature-Enhancing Semantic Segmentation Network For Relic Landslide Detection Juefei He et.al. 2411.17167 null
2024-11-26 Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation Chanyoung Kim et.al. 2411.17150 null
2024-11-26 ΩSFormer: Dual-Modal Ω-like Super-Resolution Transformer Network for Cross-scale and High-accuracy Terraced Field Vectorization Extraction Chang Li et.al. 2411.17088 null
2024-11-26 SCASeg: Strip Cross-Attention for Efficient Semantic Segmentation Guoan Xu et.al. 2411.17061 null
2024-11-25 SynDiff-AD: Improving Semantic Segmentation and End-to-End Autonomous Driving with Synthetic Data from Latent Diffusion Models Harsh Goel et.al. 2411.16776 null
2024-11-25 Deformable Mamba for Wide Field of View Segmentation Jie Hu et.al. 2411.16481 link
2024-11-25 A Study on Unsupervised Domain Adaptation for Semantic Segmentation in the Era of Vision-Language Models Manuel Schwonberg et.al. 2411.16407 null
2024-11-27 An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models Wentao Qu et.al. 2411.16308 link
2024-11-25 A Performance Increment Strategy for Semantic Segmentation of Low-Resolution Images from Damaged Roads Rafael S. Toledo et.al. 2411.16295 link
2024-11-25 Learn from Foundation Model: Fruit Detection Model without Manual Annotation Yanan Wang et.al. 2411.16196 link
2024-11-25 Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training Man Yao et.al. 2411.16061 link
2024-11-24 Deep Learning for automated multi-scale functional field boundaries extraction using multi-date Sentinel-2 and PlanetScope imagery: Case Study of Netherlands and Pakistan Saba Zahid et.al. 2411.15923 null
2024-11-24 Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation Sule Bai et.al. 2411.15869 link
2024-11-24 ResCLIP: Residual Attention for Training-free Dense Vision-language Inference Yuhang Yang et.al. 2411.15851 link
2024-11-24 Integrating Deep Metric Learning with Coreset for Active Learning in 3D Segmentation Arvind Murari Vepa et.al. 2411.15763 link
2024-11-22 Effective SAM Combination for Open-Vocabulary Semantic Segmentation Minhyeok Lee et.al. 2411.14723 null
2024-11-21 Revisiting the Integration of Convolution and Attention for Vision Backbone Lei Zhu et.al. 2411.14429 link
2024-11-21 CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation Lin Sun et.al. 2411.13836 link
2024-11-21 Segment Any Class (SAC): Multi-Class Few-Shot Semantic Segmentation via Class Region Proposals Hussni Mohd Zakir et.al. 2411.13774 null
2024-11-20 FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting Ola Shorinwa et.al. 2411.13753 null
2024-11-20 BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation Umamaheswaran Raman Kumar et.al. 2411.13251 null
2024-11-20 XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation Ziyi Wang et.al. 2411.13243 link
2024-11-20 Automating Sonologists USG Commands with AI and Voice Interface Emad Mohamed et.al. 2411.13006 null
2024-11-19 A Multimodal Approach Combining Structural and Cross-domain Textual Guidance for Weakly Supervised OCT Segmentation Jiaqi Yang et.al. 2411.12615 link
2024-11-19 SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation Ron Keuth et.al. 2411.12602 link
2024-11-15 ULTra: Unveiling Latent Token Interpretability in Transformer Based Understanding Hesam Hosseini et.al. 2411.12589 null
2024-11-19 ADV2E: Bridging the Gap Between Analogue Circuit and Discrete Frames in the Video-to-Events Simulator Xiao Jiang et.al. 2411.12250 null
2024-11-18 ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements M. Arda Aydın et.al. 2411.12044 link
2024-11-18 Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation Hanieh Shojaei Miandashti et.al. 2411.11935 null
2024-11-18 MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models Harshita Sharma et.al. 2411.11362 null
2024-11-18 Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications Scarlett Raine et.al. 2411.11287 null
2024-11-16 Attention-based U-Net Method for Autonomous Lane Detection Mohammadhamed Tangestanizadeh et.al. 2411.10902 null
2024-11-16 Automatic Discovery and Assessment of Interpretable Systematic Errors in Semantic Segmentation Jaisidh Singh et.al. 2411.10845 null
2024-11-19 Diffusion-Based Semantic Segmentation of Lumbar Spine MRI Scans of Lower Back Pain Patients Maria Monzon et.al. 2411.10755 link
2024-11-15 Y-MAP-Net: Real-time depth, normals, segmentation, multi-label captioning and 2D human pose in RGB images Ammar Qammaz et.al. 2411.10334 null
2024-11-15 CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation Dengke Zhang et.al. 2411.10086 link
2024-11-14 OneNet: A Channel-Wise 1D Convolutional U-Net Sanghyun Byun et.al. 2411.09838 link
2024-11-14 Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks Zengyi Yang et.al. 2411.09387 null
2024-11-14 Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation Yuheng Shi et.al. 2411.09219 link
2024-11-14 Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery Ashim Dahal et.al. 2411.09101 link
2024-11-13 CoMiX: Cross-Modal Fusion with Deformable Convolutions for HSI-X Semantic Segmentation Xuming Zhang et.al. 2411.09023 null
2024-11-14 Masked Image Modeling Boosting Semi-Supervised Semantic Segmentation Yangyang Li et.al. 2411.08756 null
2024-11-13 Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model Jun Xie et.al. 2411.08592 null
2024-11-12 Isometric Transformations for Image Augmentation in Mueller Matrix Polarimetry Christopher Hahne et.al. 2411.07918 link
2024-11-12 Semantic segmentation on multi-resolution optical and microwave data using deep learning Jai G Singla et.al. 2411.07581 null
2024-11-11 SIESEF-FusionNet: Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network for LiDAR Point Cloud Semantic Segmentation Jiale Chen et.al. 2411.06991 null
2024-11-14 Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision Yueyang Cang et.al. 2411.06727 null
2024-11-10 Few-shot Semantic Learning for Robust Multi-Biome 3D Semantic Mapping in Off-Road Environments Deegan Atha et.al. 2411.06632 null
2024-11-09 Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing Kaixuan Lu et.al. 2411.06091 null
2024-11-08 Joint-Optimized Unsupervised Adversarial Domain Adaptation in Remote Sensing Segmentation with Prompted Foundation Model Shuchang Lyu et.al. 2411.05878 link
2024-11-08 Revisiting Network Perturbation for Semi-Supervised Semantic Segmentation Sien Li et.al. 2411.05307 link
2024-11-07 In the Era of Prompt Learning with Vision-Language Models Ankit Jha et.al. 2411.04892 null
2024-11-11 ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset Olaf Wysocki et.al. 2411.04865 link
2024-11-06 Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts Zhitong Gao et.al. 2411.03829 link
2024-11-06 Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model Yansong Qu et.al. 2411.03672 null
2024-11-05 Enhancing Weakly Supervised Semantic Segmentation for Fibrosis via Controllable Image Generation Zhiling Yue et.al. 2411.03551 null
2024-11-05 SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture Andrew Heschl et.al. 2411.03505 link
2024-11-05 Rethinking Decoders for Transformer-based Semantic Segmentation: Compression is All You Need Qishuai Wen et.al. 2411.03033 link
2024-11-05 Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation Xavier Timoneda et.al. 2411.02969 null
2024-11-05 Mapping Africa Settlements: High Resolution Urban and Rural Map by Deep Learning and Satellite Imagery Mohammad Kakooei et.al. 2411.02935 link
2024-11-05 CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation Jinchao Ge et.al. 2411.02715 link
2024-11-04 Deep Learning on 3D Semantic Segmentation: A Detailed Review Thodoris Betsas et.al. 2411.02104 null
2024-11-04 Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models Sharat Agarwal et.al. 2411.01925 null
2024-11-04 DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability Bo Gao et.al. 2411.01819 null
2024-11-04 Toward Integrating Semantic-aware Path Planning and Reliable Localization for UAV Operations Thanh Nguyen Canh et.al. 2411.01816 null
2024-11-03 PreCM: The Padding-based Rotation Equivariant Convolution Mode for Semantic Segmentation Xinyu Xu et.al. 2411.01624 null
2024-11-01 Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions Lixiao Yang et.al. 2411.01039 null
2024-11-01 Event-guided Low-light Video Semantic Segmentation Zhen Yao et.al. 2411.00639 null
2024-11-01 Cross-modal semantic segmentation for indoor environmental perception using single-chip millimeter-wave radar raw data Hairuo Hu et.al. 2411.00499 null
2024-11-01 Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with Realistic Scene Modifications via Diffusion-Based Image Editing Naufal Suryanto et.al. 2411.00425 link
2024-10-31 A Recipe for Geometry-Aware 3D Mesh Transformers Mohammad Farazi et.al. 2411.00164 null
2024-10-31 Federated Black-Box Adaptation for Semantic Segmentation Jay N. Paranjape et.al. 2410.24181 link
2024-10-31 COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes Muhammad Ali et.al. 2410.24139 link
2024-10-31 Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model Hao Zhang et.al. 2410.23905 link
2024-11-04 S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving Maciej K. Wozniak et.al. 2410.23085 null
2024-10-31 CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation Ziyang Gong et.al. 2410.22629 link
2024-11-03 Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation Zhaochong An et.al. 2410.22489 link
2024-10-29 Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation Jintao Tong et.al. 2410.22135 link
2024-10-29 Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models Imad Ali Shah et.al. 2410.22101 link
2024-10-29 Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation Ruihao Xia et.al. 2410.21708 link
2024-10-28 Domain Adaptation with a Single Vision-Language Embedding Mohammad Fahes et.al. 2410.21361 null
2024-10-28 IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks Manjunath D et.al. 2410.20953 link
2024-10-27 A Framework for Real-Time Volcano-Seismic Event Recognition Based on Multi-Station Seismograms and Semantic Segmentation Models Camilo Espinosa-Curilem et.al. 2410.20595 link
2024-10-27 Unlocking Comics: The AI4VA Dataset for Visual Understanding Peter Grönquist et.al. 2410.20459 link
2024-10-27 Historical Test-time Prompt Tuning for Vision Foundation Models Jingyi Zhang et.al. 2410.20346 null
2024-10-25 OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery Philipe Dias et.al. 2410.19965 null
2024-10-25 IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation Kaixian Qu et.al. 2410.19697 null
2024-10-25 Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation Yao Wu et.al. 2410.19446 link
2024-10-25 Context-Based Visual-Language Place Recognition Soojin Woo et.al. 2410.19341 link
2024-10-24 Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks Alexander Jaus et.al. 2410.18684 null
2024-10-24 Unsupervised semantic segmentation of urban high-density multispectral point clouds Oona Oinonen et.al. 2410.18520 null
2024-10-26 CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator Stefanos Pasios et.al. 2410.18238 link
2024-10-23 Towards Safer Planetary Exploration: A Hybrid Architecture for Terrain Traversability Analysis in Mars Rovers Achille Chiuchiarelli et.al. 2410.17738 null
2024-10-22 EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding Zhiyi Pan et.al. 2410.17207 null
2024-10-22 SERN: Simulation-Enhanced Realistic Navigation for Multi-Agent Robotic Systems in Contested Environments Jumman Hossain et.al. 2410.16686 null
2024-10-21 TIPS: Text-Image Pretraining with Spatial Awareness Kevis-Kokitsi Maninis et.al. 2410.16512 null
2024-10-21 GenGMM: Generalized Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation Nazanin Moradinasab et.al. 2410.16485 null
2024-10-21 LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training Thomas Kreutz et.al. 2410.15833 link
2024-10-21 TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of Sight Hyun-Kurl Jang et.al. 2410.15674 link
2024-10-21 Deep Learning and Machine Learning – Object Detection and Semantic Segmentation: From Theory to Applications Jintao Ren et.al. 2410.15584 null
2024-10-22 Multi-Layer Feature Fusion with Cross-Channel Attention-Based U-Net for Kidney Tumor Segmentation Fnu Neha et.al. 2410.15472 null
2024-10-18 On the Influence of Shape, Texture and Color for Learning Semantic Segmentation Annika Mütze et.al. 2410.14878 null
2024-10-18 Automated Road Extraction from Satellite Imagery Integrating Dense Depthwise Dilated Separable Spatial Pyramid Pooling with DeepLabV3+ Arpan Mahara et.al. 2410.14836 null
2024-10-17 ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding Guangda Ji et.al. 2410.13924 link
2024-10-17 Multi-style conversion for semantic segmentation of lesions in fundus images by adversarial attacks Clément Playout et.al. 2410.13822 link
2024-10-22 EP-SAM: Weakly Supervised Histopathology Segmentation via Enhanced Prompt with Segment Anything Joonhyeon Song et.al. 2410.13621 link
2024-10-17 Day-Night Adaptation: An Innovative Source-free Adaptation Framework for Medical Image Segmentation Ziyang Chen et.al. 2410.13472 null
2024-10-17 SiamSeg: Self-Training with Contrastive Learning for Unsupervised Domain Adaptation in Remote Sensing Bin Wang et.al. 2410.13471 link
2024-10-17 Railway LiDAR semantic segmentation based on intelligent semi-automated data annotation Florian Wulff et.al. 2410.13383 null
2024-10-17 Adversarial Neural Networks in Medical Imaging Advancements and Challenges in Semantic Segmentation Houze Liu et.al. 2410.13099 null
2024-10-16 Task Consistent Prototype Learning for Incremental Few-shot Semantic Segmentation Wenbo Xu et.al. 2410.13094 null
2024-10-16 Risk Assessment for Autonomous Landing in Urban Environments using Semantic Segmentation Jesús Alejandro Loera-Ponce et.al. 2410.12988 null
2024-10-16 VividMed: Vision Language Model with Versatile Visual Grounding for Medicine Lingxiao Luo et.al. 2410.12694 link
2024-10-16 Cascade learning in multi-task encoder-decoder networks for concurrent bone segmentation and glenohumeral joint assessment in shoulder CT scans Luca Marsilio et.al. 2410.12641 null
2024-10-17 SAM-Guided Masked Token Prediction for 3D Scene Understanding Zhimin Chen et.al. 2410.12158 null
2024-10-15 WeatherDG: LLM-assisted Procedural Weather Generation for Domain-Generalized Semantic Segmentation Chenghao Qian et.al. 2410.12075 link
2024-10-15 Development and Testing of a Wood Panels Bark Removal Equipment Based on Deep Learning Rijun Wang et.al. 2410.11913 null
2024-10-15 RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation Anton Antonov et.al. 2410.11722 link
2024-10-15 InvSeg: Test-Time Prompt Inversion for Semantic Segmentation Jiayi Lin et.al. 2410.11473 null
2024-10-15 MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation Xianping Ma et.al. 2410.11160 link
2024-10-14 Locality Alignment Improves Vision-Language Models Ian Covert et.al. 2410.11087 null
2024-10-14 Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes Tim Broedermann et.al. 2410.10791 link
2024-10-14 UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation Lihe Yang et.al. 2410.10777 link
2024-10-14 Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation Daniel Fusaro et.al. 2410.10510 link
2024-10-14 LKASeg:Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections Xuezhi Xiang et.al. 2410.10433 null
2024-10-14 V2M: Visual 2-Dimensional Mamba for Image Representation Learning Chengkun Wang et.al. 2410.10382 link
2024-10-14 GlobalMamba: Global Image Serialization for Vision Mamba Chengkun Wang et.al. 2410.10316 link
2024-10-13 AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model Yuchen Li et.al. 2410.09714 null
2024-10-12 An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation Wei Liang et.al. 2410.09443 null
2024-10-11 Parallel Watershed Partitioning: GPU-Based Hierarchical Image Segmentation Varduhi Yeghiazaryan et.al. 2410.08946 null
2024-10-11 Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation Hanieh Shojaei et.al. 2410.08687 null
2024-10-11 DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention Nguyen Huu Bao Long et.al. 2410.08582 link
2024-10-10 Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? Samir Abou Haidar et.al. 2410.08365 null
2024-10-10 Distribution Guidance Network for Weakly Supervised Point Cloud Semantic Segmentation Zhiyi Pan et.al. 2410.08091 null
2024-10-10 Shift and matching queries for video semantic segmentation Tsubasa Mizuno et.al. 2410.07635 null
2024-10-10 3D Vision-Language Gaussian Splatting Qucheng Peng et.al. 2410.07577 null
2024-10-11 Bridge the Points: Graph-based Few-shot Segment Anything Semantically Anqi Zhang et.al. 2410.06964 link
2024-10-09 Learning from Spatio-temporal Correlation for Semi-Supervised LiDAR Semantic Segmentation Seungho Lee et.al. 2410.06893 link
2024-10-09 Rethinking the Evaluation of Visible and Infrared Image Fusion Dayan Guan et.al. 2410.06811 link
2024-10-10 QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model Fei Xie et.al. 2410.06806 link
2024-10-09 Transesophageal Echocardiography Generation using Anatomical Models Emmanuel Oladokun et.al. 2410.06781 null
2024-10-09 Evaluating the Impact of Point Cloud Colorization on Semantic Segmentation Accuracy Qinfeng Zhu et.al. 2410.06725 null
2024-10-09 Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments Meng Yu et.al. 2410.06626 null
2024-10-09 Towards Natural Image Matting in the Wild via Real-Scenario Prior Ruihao Xia et.al. 2410.06593 link
2024-10-08 Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions Mateus Karvat et.al. 2410.06380 null
2024-10-08 Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading Fang Gao et.al. 2410.05762 null
2024-10-08 Advancements in Road Lane Mapping: Comparative Fine-Tuning Analysis of Deep Learning-based Semantic Segmentation Methods Using Aerial Imagery Xuanchen et.al. 2410.05717 null
2024-10-08 Remote Sensing Image Segmentation Using Vision Mamba and Multi-Scale Multi-Frequency Feature Fusion Yice Cao et.al. 2410.05624 null
2024-10-07 Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment Whole-Body Organs in CT with Light-Weighted Adaptation Vince Zhu et.al. 2410.04689 null
2024-10-04 SpecSAR-Former: A Lightweight Transformer-based Network for Global LULC Mapping Using Integrated Sentinel-1 and Sentinel-2 Hao Yu et.al. 2410.03962 null
2024-10-10 Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features Benyuan Meng et.al. 2410.03558 link
2024-10-04 Semantic Segmentation Based Quality Control of Histopathology Whole Slide Images Abhijeet Patil et.al. 2410.03289 link
2024-10-04 HRVMamba: High-Resolution Visual State Space Model for Dense Prediction Hao Zhang et.al. 2410.03174 null
2024-10-10 HiFiSeg: High-Frequency Information Enhanced Polyp Segmentation with Global-Local Vision Transformer Jingjing Ren et.al. 2410.02528 null
2024-10-04 Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation Muzhi Zhu et.al. 2410.02369 link
2024-10-03 RESSCAL3D++: Joint Acquisition and Semantic Segmentation of 3D Point Clouds Remco Royen et.al. 2410.02323 link
2024-10-03 Efficient Semantic Segmentation via Lightweight Multiple-Information Interaction Network Yangyang Qiu et.al. 2410.02224 null
2024-10-03 Adapting Segment Anything Model to Melanoma Segmentation in Microscopy Slide Images Qingyuan Liu et.al. 2410.02207 null
2024-10-02 SegEarth-OV: Towards Traning-Free Open-Vocabulary Segmentation for Remote Sensing Images Kaiyu Li et.al. 2410.01768 link
2024-10-02 One-Shot Robust Imitation Learning for Long-Horizon Visuomotor Tasks from Unsegmented Demonstrations Shaokang Wu et.al. 2410.01630 null
2024-10-02 Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation Zhaofeng Shi et.al. 2410.01341 link
2024-10-02 VectorGraphNET: Graph Attention Networks for Accurate Segmentation of Complex Technical Drawings Andrea Carrara et.al. 2410.01336 null
2024-10-01 RobustEMD: Domain Robust Matching for Cross-domain Few-shot Medical Image Segmentation Yazhou Zhu et.al. 2410.01110 link
2024-10-01 Semantic Segmentation of Unmanned Aerial Vehicle Remote Sensing Images using SegFormer Vlatko Spasev et.al. 2410.01092 null
2024-10-01 Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time Chiao-An Yang et.al. 2410.01083 link
2024-10-01 DeepAerialMapper: Deep Learning-based Semi-automatic HD Map Creation for Highly Automated Vehicles Robert Krajewski et.al. 2410.00769 link
2024-10-01 Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection Pengxi Zeng et.al. 2410.00582 null
2024-10-01 Precise Workcell Sketching from Point Clouds Using an AR Toolbox Krzysztof Zieliński et.al. 2410.00479 null
2024-10-01 Deep Multimodal Fusion for Semantic Segmentation of Remote Sensing Earth Observation Data Ivica Dimitrovski et.al. 2410.00469 null
2024-10-01 AARK: An Open Toolkit for Autonomous Racing Research James Bockman et.al. 2410.00358 null
2024-09-30 Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation Aleyna Kütük et.al. 2410.00266 null
2024-09-30 AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation Boyu Han et.al. 2409.20398 link
2024-09-30 Leveraging CAM Algorithms for Explaining Medical Semantic Segmentation Tillmann Rheude et.al. 2409.20287 link
2024-09-30 Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model Fulong Ma et.al. 2409.20164 null
2024-09-30 Segmenting Wood Rot using Computer Vision Models Roland Kammerbauer et.al. 2409.20137 null
2024-09-30 Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels Heeseong Shin et.al. 2409.19846 null
2024-09-27 Excavating in the Wild: The GOOSE-Ex Dataset for Semantic Segmentation Raphael Hagmanns et.al. 2409.18788 null
2024-09-27 Learning from Pattern Completion: Self-supervised Controllable Generation Zhiqiang Chen et.al. 2409.18694 link
2024-09-27 Reducing Semantic Ambiguity In Domain Adaptive Semantic Segmentation Via Probabilistic Prototypical Pixel Contrast Xiaoke Hao et.al. 2409.18543 link
2024-10-01 Get It For Free: Radar Segmentation without Expert Labels and Its Application in Odometry and Localization Siru Li et.al. 2409.18434 null
2024-09-26 Hierarchical End-to-End Autonomous Driving: Integrating BEV Perception with Deep Reinforcement Learning Siyi Lu et.al. 2409.17659 null
2024-09-26 Global-Local Medical SAM Adaptor Based on Full Adaption Meng Wang et.al. 2409.17486 null
2024-09-25 VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection Liangyu Zhong et.al. 2409.17330 null
2024-09-25 2024 BRAVO Challenge Track 1 1st Place Report: Evaluating Robustness of Vision Foundation Models for Semantic Segmentation Tommie Kerssies et.al. 2409.17208 link
2024-09-25 WasteGAN: Data Augmentation for Robotic Waste Sorting through Generative Adversarial Networks Alberto Bacchin et.al. 2409.16999 link
2024-09-25 Going Beyond U-Net: Assessing Vision Transformers for Semantic Segmentation in Microscopy Image Analysis Illia Tsiporenko et.al. 2409.16940 null
2024-09-24 A novel open-source ultrasound dataset with deep learning benchmarks for spinal cord injury localization and anatomical segmentation Avisha Kumar et.al. 2409.16441 link
2024-09-24 Instance Segmentation of Reinforced Concrete Bridges with Synthetic Point Clouds Asad Ur Rahman et.al. 2409.16381 null
2024-09-24 Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation Hannah Kerner et.al. 2409.16252 link
2024-09-24 Deep Learning for Precision Agriculture: Post-Spraying Evaluation and Deposition Estimation Harry Rogers et.al. 2409.16213 link
2024-09-24 Potential Field as Scene Affordance for Behavior Change-Based Visual Risk Object Identification Pang-Yuan Pao et.al. 2409.15846 null
2024-09-24 DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation Soojin Jang et.al. 2409.15801 null
2024-09-24 Autonomous Hiking Trail Navigation via Semantic Segmentation and Geometric Analysis Camndon Reed et.al. 2409.15671 null
2024-09-23 ZeroSCD: Zero-Shot Street Scene Change Detection Shyam Sundar Kannan et.al. 2409.15255 null
2024-09-27 Diffusion-based RGB-D Semantic Segmentation with Deformable Attention Transformer Minh Bui et.al. 2409.15117 null
2024-09-23 The BRAVO Semantic Segmentation Challenge Results in UNCV2024 Tuan-Hung Vu et.al. 2409.15107 link
2024-09-21 MOSE: Monocular Semantic Reconstruction Using NeRF-Lifted Noisy Priors Zhenhua Du et.al. 2409.14019 null
2024-09-21 Enhanced Semantic Segmentation for Large-Scale and Imbalanced Point Clouds Haoran Gong et.al. 2409.13983 null
2024-09-21 CUS3D :CLIP-based Unsupervised 3D Segmentation via Object-level Denoise Fuyang Yu et.al. 2409.13982 null
2024-09-20 Efficient Domain Augmentation for Autonomous Driving Testing Using Diffusion Models Luciano Baresi et.al. 2409.13661 null
2024-09-20 Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-Tuning Daniele Rege Cambrin et.al. 2409.13641 link
2024-09-20 Towards Semi-supervised Dual-modal Semantic Segmentation Qiulei Dong et.al. 2409.13325 null
2024-09-19 AutoPET III Challenge: PET/CT Semantic Segmentation Reza Safdari et.al. 2409.13006 null
2024-09-19 Automated Linear Disturbance Mapping via Semantic Segmentation of Sentinel-2 Imagery Andrew M. Nagel et.al. 2409.12817 null
2024-09-17 Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks Edgar Heinert et.al. 2409.11373 link
2024-09-17 MSDNet: Multi-Scale Decoder for Few-Shot Semantic Segmentation via Transformer-Guided Prototyping Amirreza Fateh et.al. 2409.11316 link
2024-09-17 Generalized Few-Shot Semantic Segmentation in Remote Sensing: Challenge and Benchmark Clifford Broni-Bediako et.al. 2409.11227 link
2024-09-17 HS3-Bench: A Benchmark and Strong Baseline for Hyperspectral Semantic Segmentation in Driving Scenarios Nick Theisen et.al. 2409.11205 link
2024-09-16 Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning Amin Karimi Monsefi et.al. 2409.10362 link
2024-09-16 BAFNet: Bilateral Attention Fusion Network for Lightweight Semantic Segmentation of Urban Remote Sensing Images Wentao Wang et.al. 2409.10269 null
2024-09-15 Semantic2D: A Semantic Dataset for 2D Lidar Semantic Segmentation Zhanteng Xie et.al. 2409.09899 null
2024-09-15 Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation Qilong Zhangli et.al. 2409.09893 null
2024-09-15 High Definition Map Mapping and Update: A General Overview and Future Directions Benny Wijaya et.al. 2409.09726 null
2024-09-14 Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation Hugo Porta et.al. 2409.09497 link
2024-09-13 AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation Zechao Sun et.al. 2409.08516 null
2024-09-13 VistaFormer: Scalable Vision Transformers for Satellite Image Time Series Segmentation Ezra MacDonald et.al. 2409.08461 link
2024-09-12 Bayesian Self-Training for Semi-Supervised 3D Segmentation Ozan Unal et.al. 2409.08102 null
2024-09-12 Depth Matters: Exploring Deep Interactions of RGB-D for Semantic Segmentation in Traffic Scenes Siyu Chen et.al. 2409.07995 null
2024-09-12 SURGIVID: Annotation-Efficient Surgical Video Object Discovery Çağhan Köksal et.al. 2409.07801 null
2024-09-12 Lagrange Duality and Compound Multi-Attention Transformer for Semi-Supervised Medical Image Segmentation Fuchen Zheng et.al. 2409.07793 link
2024-09-12 ASSNet: Adaptive Semantic Segmentation Network for Microtumors and Multi-Organ Segmentation Fuchen Zheng et.al. 2409.07779 link
2024-09-12 Open-Vocabulary Remote Sensing Image Semantic Segmentation Qinglong Cao et.al. 2409.07683 link
2024-09-11 Token Turing Machines are Efficient Vision Models Purvish Jajal et.al. 2409.07613 link
2024-09-11 AC-IND: Sparse CT reconstruction based on attenuation coefficient estimation and implicit neural distribution Wangduo Xie et.al. 2409.07171 null
2024-09-11 Brain-Inspired Stepwise Patch Merging for Vision Transformers Yonghao Yu et.al. 2409.06963 null
2024-09-10 Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds Mu Cai et.al. 2409.06827 link
2024-09-10 A Semantic Segmentation Approach on Sweet Orange Leaf Diseases Detection Utilizing YOLO Sabit Ahamed Preanto et.al. 2409.06671 null
2024-09-10 PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation Yin Hu et.al. 2409.06309 null
2024-09-10 EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation Nischal Khanal et.al. 2409.06183 link
2024-09-09 SVS-GAN: Leveraging GANs for Semantic Video Synthesis Khaled M. Seyam et.al. 2409.06074 null
2024-09-12 Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance Quang-Huy Che et.al. 2409.06002 null
2024-09-09 Segmentation by Factorization: Unsupervised Semantic Segmentation for Pathology by Factorizing Foundation Model Features Jacob Gildenblat et.al. 2409.05697 null
2024-09-09 ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions Furqan Ahmed Shaik et.al. 2409.05327 null
2024-09-08 RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network Zhiwei Lin et.al. 2409.04979 null
2024-09-06 Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation Björn Michele et.al. 2409.04409 link
2024-09-05 Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution Marga Don et.al. 2409.03754 link
2024-09-05 LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones Moritz Nottebaum et.al. 2409.03460 link
2024-09-05 Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications Tong Bu et.al. 2409.03368 link
2024-09-05 UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking Md. Mahfuzur Rahman et.al. 2409.03245 null
2024-09-05 Labeled-to-Unlabeled Distribution Alignment for Partially-Supervised Multi-Organ Medical Image Segmentation Xixi Jiang et.al. 2409.03228 link
2024-09-06 iSeg: An Iterative Refinement-based Framework for Training-free Segmentation Lin Sun et.al. 2409.03209 link
2024-09-04 iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation Hayeon Jo et.al. 2409.02838 null
2024-09-04 CLDA: Collaborative Learning for Enhanced Unsupervised Domain Adaptation Minhee Cho et.al. 2409.02699 null
2024-09-04 SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction Sumin Son et.al. 2409.02513 null
2024-09-03 K-Origins: Better Colour Quantification for Neural Networks Lewis Mason et.al. 2409.02281 link
2024-09-03 AllWeatherNet:Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions Chenghao Qian et.al. 2409.02045 link
2024-09-03 Segmenting Object Affordances: Reproducibility and Sensitivity to Scale Tommaso Apicella et.al. 2409.01814 link
2024-09-03 Efficiently Expanding Receptive Fields: Local Split Attention and Parallel Aggregation for Enhanced Large-scale Point Cloud Semantic Segmentation Haodong Wang et.al. 2409.01662 null
2024-09-02 Semantic Segmentation from Image Labels by Reconstruction from Structured Decomposition Xuanrui Zeng et.al. 2409.01472 link
2024-09-02 SOOD-ImageNet: a Large-Scale Dataset for Semantic Out-Of-Distribution Image Classification and Semantic Segmentation Alberto Bacchin et.al. 2409.01109 link
2024-09-02 Towards Robust Online Domain Adaptive Semantic Segmentation under Adverse Weather Conditions Taorong Liu et.al. 2409.01072 null
2024-09-02 From Bird’s-Eye to Street View: Crafting Diverse and Condition-Aligned Images with Latent Diffusion Model Xiaojie Xu et.al. 2409.01014 null
2024-09-02 SeCo-INR: Semantically Conditioned Implicit Neural Representations for Improved Medical Image Super-Resolution Mevan Ekanayake et.al. 2409.01013 null
2024-09-02 IVGF: The Fusion-Guided Infrared and Visible General Framework Fangcen Liu et.al. 2409.00973 null
2024-09-01 Image-to-Lidar Relational Distillation for Autonomous Driving Data Anas Mahmoud et.al. 2409.00845 null
2024-09-01 Change-Aware Siamese Network for Surface Defects Segmentation under Complex Background Biyuan Liu et.al. 2409.00589 link
2024-08-31 Plant detection from ultra high resolution remote sensing images: A Semantic Segmentation approach based on fuzzy loss Shivam Pande et.al. 2409.00513 null
2024-08-30 Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes Li Zhang et.al. 2408.17421 link
2024-08-30 Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations Ahmed Hammam et.al. 2408.17311 null
2024-08-30 Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training Zizheng Huang et.al. 2408.17081 link
2024-08-30 Transient Fault Tolerant Semantic Segmentation for Autonomous Driving Leonardo Iurada et.al. 2408.16952 link
2024-08-29 SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection Rohit Venkata Sai Dulam et.al. 2408.16645 link
2024-08-29 MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation Linyan Yang et.al. 2408.16478 null
2024-08-29 Multi-source Domain Adaptation for Panoramic Semantic Segmentation Jing Jiang et.al. 2408.16469 link
2024-08-29 EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More Kanghao Chen et.al. 2408.16254 null
2024-08-28 SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors Zhiqing Zhang et.al. 2408.15887 null
2024-08-28 DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries Yu Yang et.al. 2408.15813 null
2024-08-28 TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation Junbao Zhou et.al. 2408.15657 link
2024-08-27 Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images Silvia Seidlitz et.al. 2408.15373 link
2024-08-27 An Investigation on The Position Encoding in Vision-Based Dynamics Prediction Jiageng Zhu et.al. 2408.15201 null
2024-08-27 Applying ViT in Generalized Few-shot Semantic Segmentation Liyuan Geng et.al. 2408.14957 link
2024-08-27 Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack Naufal Suryanto et.al. 2408.14879 link
2024-08-27 MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Semantic Segmentation Yuanbing Zhu et.al. 2408.14776 null
2024-08-26 Physically Feasible Semantic Segmentation Shamik Basu et.al. 2408.14672 link
2024-08-25 OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation Muhammad Rameez ur Rahman et.al. 2408.13936 link
2024-08-25 Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation Yuwen Pan et.al. 2408.13838 null
2024-08-25 TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather Xiongwei Zhao et.al. 2408.13802 link
2024-08-25 ICFRNet: Image Complexity Prior Guided Feature Refinement for Real-time Semantic Segmentation Xin Zhang et.al. 2408.13771 null
2024-08-25 Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation Zhaoyang Li et.al. 2408.13752 null
2024-08-24 ESA: Annotation-Efficient Active Learning for Semantic Segmentation Jinchao Ge et.al. 2408.13491 link
2024-08-23 Accuracy Improvement of Cell Image Segmentation Using Feedback Former Hinako Mitsuoka et.al. 2408.12974 null
2024-08-23 Image Segmentation in Foundation Model Era: A Survey Tianfei Zhou et.al. 2408.12957 link
2024-08-23 Symmetric masking strategy enhances the performance of Masked Image Modeling Khanh-Binh Nguyen et.al. 2408.12772 null
2024-08-22 Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets Wolfgang Boettcher et.al. 2408.12489 link
2024-08-22 The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation Tuyen Tran et.al. 2408.12447 null
2024-08-26 UNetMamba: An Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images Enze Zhu et.al. 2408.11545 link
2024-08-21 Exploring Scene Coherence for Semi-Supervised 3D Semantic Segmentation Chuandong Liu et.al. 2408.11280 link
2024-08-20 NeCo: Improving DINOv2’s spatial representations in 19 GPU hours with Patch Neighbor Consistency Valentinos Pariza et.al. 2408.11054 null
2024-08-20 CO2Wounds-V2: Extended Chronic Wounds Dataset From Leprosy Patients Karen Sanchez et.al. 2408.10827 link
2024-08-20 Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended? Chen Liang et.al. 2408.10627 null
2024-08-20 Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation Jiawei Han et.al. 2408.10537 link
2024-08-19 Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network Rasha Alshawi et.al. 2408.10181 null
2024-08-19 Dynamic Label Injection for Imbalanced Industrial Defect Segmentation Emanuele Caruso et.al. 2408.10031 link
2024-08-19 Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis Kira Maag et.al. 2408.10021 null
2024-08-19 Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving Jun Yan et.al. 2408.09839 link
2024-08-18 OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras Muhammad Rameez Ur Rahman et.al. 2408.09424 link
2024-08-18 Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration Hao Ai et.al. 2408.09336 null
2024-08-17 Cross-Species Data Integration for Enhanced Layer Segmentation in Kidney Pathology Junchao Zhu et.al. 2408.09278 link
2024-08-17 GoodSAM++: Bridging Domain and Capacity Gaps via Segment Anything Model for Panoramic Semantic Segmentation Weiming Zhang et.al. 2408.09115 null
2024-08-17 Depth-guided Texture Diffusion for Image Semantic Segmentation Wei Sun et.al. 2408.09097 null
2024-08-15 5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks Dongshuo Yin et.al. 2408.08345 link
2024-08-14 MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis Nimeesha Chan et.al. 2408.07773 link
2024-08-15 MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation Beoungwoo Kang et.al. 2408.07576 link
2024-08-19 MagicFace: Training-free Universal-Style Human Image Customized Synthesis Yibin Wang et.al. 2408.07433 null
2024-08-14 Segment Using Just One Example Pratik Vora et.al. 2408.07393 null
2024-08-14 Ensemble architecture in polyp segmentation Hao-Yun Hsu et.al. 2408.07262 link
2024-08-14 Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks Raghavendra Singh et.al. 2408.07243 null
2024-08-14 Enhancing Autonomous Vehicle Perception in Adverse Weather through Image Augmentation during Semantic Segmentation Training Ethan Kou et.al. 2408.07239 link
2024-08-13 ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation Jingyun Wang et.al. 2408.06747 link
2024-08-10 Dilated Convolution with Learnable Spacings Ismail Khalfaoui-Hassani et.al. 2408.06383 null
2024-08-12 Correlation Weighted Prototype-based Self-Supervised One-Shot Segmentation of Medical Images Siladittya Manna et.al. 2408.06235 null
2024-08-12 A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting Felix Assion et.al. 2408.06071 null
2024-08-12 Enhancing 3D Transformer Segmentation Model for Medical Image with Token-level Representation Learning Xinrong Hu et.al. 2408.05889 link
2024-08-11 Seg-CycleGAN : SAR-to-optical image translation guided by a downstream task Hannuo Zhang et.al. 2408.05777 null
2024-08-11 MacFormer: Semantic Segmentation with Fine Object Boundaries Guoan Xu et.al. 2408.05699 null
2024-08-10 Multimodal generative semantic communication based on latent diffusion model Weiqi Fu et.al. 2408.05455 null
2024-08-09 In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation Dahyun Kang et.al. 2408.04961 link
2024-08-09 ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation Mengcheng Lan et.al. 2408.04883 link
2024-08-09 Extracting Signal Electron Trajectories in the COMET Phase-I Cylindrical Drift Chamber Using Deep Learning Fumihiro Kaneko et.al. 2408.04795 null
2024-08-08 SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation Jieming Yu et.al. 2408.04593 null
2024-08-08 SegXAL: Explainable Active Learning for Semantic Segmentation in Driving Scene Scenarios Sriram Mandalika et.al. 2408.04482 null
2024-08-08 What could go wrong? Discovering and describing failure modes in computer vision Gabriela Csurka et.al. 2408.04471 null
2024-08-07 CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications Tianfang Zhang et.al. 2408.03703 link
2024-08-07 SAM2-PATH: A better segment anything model for semantic segmentation in digital pathology Mingya Zhang et.al. 2408.03651 link
2024-08-06 Post-Mortem Human Iris Segmentation Analysis with Deep Learning Afzal Hossain et.al. 2408.03448 null
2024-08-06 Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression Jonas Schmitt et.al. 2408.03046 link
2024-08-05 Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation Sai Prasanna et.al. 2408.02297 null
2024-08-05 Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs Jeongkee Lim et.al. 2408.02261 link
2024-08-05 Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders Muhammad Abdullah Jamal et.al. 2408.02245 null
2024-08-04 Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation Ye Du et.al. 2408.02039 null
2024-08-03 Bayesian Active Learning for Semantic Segmentation Sima Didari et.al. 2408.01694 null
2024-08-03 A Comparative Analysis of CNN-based Deep Learning Models for Landslide Detection Omkar Oak et.al. 2408.01692 null
2024-08-03 Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation Balázs Opra et.al. 2408.01640 null
2024-08-02 Multi-Unit Floor Plan Recognition and Reconstruction Using Improved Semantic Segmentation of Raster-Wise Floor Plans Lukas Kratochvila et.al. 2408.01526 null
2024-08-02 Balanced Residual Distillation Learning for 3D Point Cloud Class-Incremental Semantic Segmentation Yuanzhi Su et.al. 2408.01356 null
2024-08-02 StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation Bingyu Li et.al. 2408.01343 null
2024-08-02 Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach Yabin Zhu et.al. 2408.00969 link
2024-08-01 Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation Siyu Jiao et.al. 2408.00744 link
2024-08-01 Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function Matias Oscar Volman Stern et.al. 2408.00707 null
2024-08-01 AMAES: Augmented Masked Autoencoder Pretraining on Public Brain MRI Data for 3D-Native Segmentation Asbjørn Munk et.al. 2408.00640 link
2024-08-01 SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation Shengbo Tan et.al. 2408.00496 link
2024-07-31 Open-Vocabulary Audio-Visual Semantic Segmentation Ruohao Guo et.al. 2407.21721 null
2024-07-31 MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment Anurag Das et.al. 2407.21654 null
2024-07-31 Small Object Few-shot Segmentation for Vision-based Industrial Inspection Zilong Zhang et.al. 2407.21351 link
2024-07-31 On-the-fly Point Feature Representation for Point Clouds Analysis Jiangyi Wang et.al. 2407.21335 null
2024-07-31 Fine-grained Metrics for Point Cloud Semantic Segmentation Zhuheng Lu et.al. 2407.21289 null
2024-07-30 PLANesT-3D: A new annotated dataset for segmentation of 3D plant point clouds Kerem Mertoğlu et.al. 2407.21150 null
2024-07-30 Learning Ordinality in Semantic Segmentation Rafael Cristino et.al. 2407.20959 null
2024-07-29 Improving 2D Feature Representations by 3D-Aware Fine-Tuning Yuanwen Yue et.al. 2407.20229 null
2024-07-29 Background Semantics Matter: Cross-Task Feature Exchange Network for Clustered Infrared Small Target Detection With Sky-Annotated Dataset Yimian Dai et.al. 2407.20078 link
2024-07-29 Language-driven Grasp Detection with Mask-guided Attention Tuan Van Vo et.al. 2407.19877 null
2024-07-29 Rethinking RGB-D Fusion for Semantic Segmentation in Surgical Datasets Muhammad Abdullah Jamal et.al. 2407.19714 null
2024-07-29 ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement Ezequiel Perez-Zarate et.al. 2407.19708 link
2024-07-28 ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding Zhen Chen et.al. 2407.19435 link
2024-07-27 Ensembling convolutional neural networks for human skin segmentation Patryk Kuban et.al. 2407.19310 null
2024-07-27 Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network Gang Pan et.al. 2407.19271 null
2024-07-26 Sparse Refinement for Efficient High-Resolution Semantic Segmentation Zhijian Liu et.al. 2407.19014 null
2024-07-29 Learning Spectral-Decomposed Tokens for Domain Generalized Semantic Segmentation Jingjun Yi et.al. 2407.18568 null
2024-07-25 Taxonomy-Aware Continual Semantic Segmentation in Hyperbolic Spaces for Open-World Perception Julia Hindel et.al. 2407.18145 null
2024-07-25 TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework Guanfeng Tang et.al. 2407.18038 null
2024-07-25 Segmentation-guided MRI reconstruction for meaningfully diverse reconstructions Jan Nikolas Morshuis et.al. 2407.18026 link
2024-07-24 Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation Hyunwoo Yu et.al. 2407.17261 link
2024-07-24 Trans2Unet: Neural fusion for Nuclei Semantic Segmentation Dinh-Phu Tran et.al. 2407.17181 null
2024-07-24 PiPa++: Towards Unification of Domain Adaptive Semantic Segmentation via Self-supervised Learning Mu Chen et.al. 2407.17101 null
2024-07-25 Enhancing Environmental Monitoring through Multispectral Imaging: The WasteMS Dataset for Semantic Segmentation of Lakeside Waste Qinfeng Zhu et.al. 2407.17028 link
2024-07-24 Progressive Query Refinement Framework for Bird’s-Eye-View Semantic Segmentation from Surrounding Images Dooseop Choi et.al. 2407.17003 link
2024-07-23 Deformable Convolution Based Road Scene Semantic Segmentation of Fisheye Images in Autonomous Driving Anam Manzoor et.al. 2407.16647 null
2024-07-23 Deep Bayesian segmentation for colon polyps: Well-calibrated predictions in medical imaging Daniela L. Ramos et.al. 2407.16608 link
2024-07-23 Augmented Efficiency: Reducing Memory Footprint and Accelerating Inference for 3D Semantic Segmentation through Hybrid Vision Aditya Krishnan et.al. 2407.16102 null
2024-07-22 MILAN: Milli-Annotations for Lidar Semantic Segmentation Nermin Samet et.al. 2407.15797 null
2024-07-22 Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond Silvio Galesso et.al. 2407.15739 link
2024-07-22 MSSPlace: Multi-Sensor Place Recognition with Visual and Text Semantics Alexander Melekhin et.al. 2407.15663 link
2024-07-22 Learning at a Glance: Towards Interpretable Data-limited Continual Semantic Segmentation via Semantic-Invariance Modelling Bo Yuan et.al. 2407.15429 link
2024-07-22 Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data Junha Song et.al. 2407.15383 link
2024-07-21 Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation Xiaoyang Wu et.al. 2407.15282 null
2024-07-20 Downstream-Pretext Domain Knowledge Traceback for Active Learning Beichen Zhang et.al. 2407.14720 null
2024-07-19 Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model Kun Zhao et.al. 2407.14326 null
2024-07-19 Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation Zhengyuan Xie et.al. 2407.14142 link
2024-07-19 GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation Florian Chabot et.al. 2407.14108 null
2024-07-18 Many Perception Tasks are Highly Redundant Functions of their Input Data Rahul Ramesh et.al. 2407.13841 null
2024-07-18 GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model Abdelrahman Shaker et.al. 2407.13772 link
2024-07-18 SegPoint: Segment Any Point Cloud via Large Language Model Shuting He et.al. 2407.13761 null
2024-07-23 MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis Ziming Zhong et.al. 2407.13675 link
2024-07-18 Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models Xiaoyu Zhu et.al. 2407.13642 null
2024-07-18 FADE: A Task-Agnostic Upsampling Operator for Encoder-Decoder Architectures Hao Lu et.al. 2407.13500 link
2024-07-18 FREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions Sohyun Lee et.al. 2407.13437 null
2024-07-18 Lightweight Uncertainty Quantification with Simplex Semantic Segmentation for Terrain Traversability Judith Dijk et.al. 2407.13392 null
2024-07-18 Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation Chang Liu et.al. 2407.13363 link
2024-07-18 Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation Shoumeng Qiu et.al. 2407.13254 link
2024-07-18 OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird’s-eye-view Vehicle Semantic Segmentation Jian Sun et.al. 2407.13137 null
2024-07-18 Tree semantic segmentation from aerial image time series Venkatesh Ramesh et.al. 2407.13102 null
2024-07-17 ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders Carlos Hinojosa et.al. 2407.13036 link
2024-07-17 Weighting Pseudo-Labels via High-Activation Feature Index Similarity and Object Detection for Semi-Supervised Segmentation Prantik Howlader et.al. 2407.12630 link
2024-07-17 Instance-wise Uncertainty for Class Imbalance in Semantic Segmentation Luís Almeida et.al. 2407.12609 null
2024-07-18 Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks Antoni Kowalczuk et.al. 2407.12588 link
2024-07-17 Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation Ruijie Xu et.al. 2407.12489 link
2024-07-17 Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation Hyun Seok Seong et.al. 2407.12463 link
2024-07-17 ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference Mengcheng Lan et.al. 2407.12442 null
2024-07-17 Serialized Point Mamba: A Serialized Point Cloud Mamba Segmentation Model Tao Wang et.al. 2407.12319 null
2024-07-16 FoodMem: Near Real-time and Precise Food Video Segmentation Ahmad AlMughrabi et.al. 2407.12121 null
2024-07-16 Mitigating Background Shift in Class-Incremental Semantic Segmentation Gilhan Park et.al. 2407.11859 link
2024-07-16 Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation Juncheng Ma et.al. 2407.11820 link
2024-07-16 XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach Truong Thanh Hung Nguyen et.al. 2407.11771 link
2024-07-16 OAM-TCD: A globally diverse dataset of high-resolution tree cover maps Josh Veitch-Michaelis et.al. 2407.11743 link
2024-07-16 SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds Yanbo Wang et.al. 2407.11569 link
2024-07-16 Leveraging Segment Anything Model in Identifying Buildings within Refugee Camps (SAM4Refugee) from Satellite Imagery for Humanitarian Operations Yunya Gao et.al. 2407.11381 link
2024-07-16 Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities Xu Zheng et.al. 2407.11351 null
2024-07-16 Centering the Value of Every Modality: Towards Efficient and Resilient Modality-agnostic Semantic Segmentation Xu Zheng et.al. 2407.11344 null
2024-07-16 TCFormer: Visual Recognition via Token Clustering Transformer Wang Zeng et.al. 2407.11321 link
2024-07-15 Distributed Semantic Segmentation with Efficient Joint Source and Task Decoding Danish Nazir et.al. 2407.11224 null
2024-07-15 Finding Meaning in Points: Weakly Supervised Semantic Segmentation for Event Cameras Hoonhee Cho et.al. 2407.11216 link
2024-07-15 No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations Walter Simoncini et.al. 2407.10964 link
2024-07-15 APC: Adaptive Patch Contrast for Weakly Supervised Semantic Segmentation Wangyu Wu et.al. 2407.10649 null
2024-07-15 Automated Label Unification for Multi-Dataset Semantic Segmentation with GNNs Rong Ma et.al. 2407.10534 null
2024-07-14 Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data Tuo Feng et.al. 2407.10200 link
2024-07-14 RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation Li Li et.al. 2407.10159 link
2024-07-14 HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation Chengjie Jiang et.al. 2407.10047 null
2024-07-13 Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation Anqi Zhang et.al. 2407.09838 null
2024-07-13 Enhancing Semantic Segmentation with Adaptive Focal Loss: A Novel Approach Md Rakibul Islam et.al. 2407.09828 null
2024-07-13 3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance Xiaoxu Xu et.al. 2407.09826 link
2024-07-13 TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation Xiaopei Wu et.al. 2407.09751 link
2024-07-12 Uplifting Range-View-based 3D Semantic Segmentation in Real-Time with Multi-Sensor Fusion Shiqi Tan et.al. 2407.09697 null
2024-07-12 SPIN: Hierarchical Segmentation with Subpart Granularity in Natural Images Josh Myers-Dean et.al. 2407.09686 null
2024-07-12 FANet: Feature Amplification Network for Semantic Segmentation in Cluttered Background Muhammad Ali et.al. 2407.09379 link
2024-07-12 Salt & Pepper Heatmaps: Diffusion-informed Landmark Detection Strategy Julian Wyatt et.al. 2407.09192 null
2024-07-12 Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off Levente Halmosi et.al. 2407.09150 link
2024-07-12 Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation Wei Cong et.al. 2407.09047 null
2024-07-12 Textual Query-Driven Mask Transformer for Domain Generalized Segmentation Byeonghyun Pak et.al. 2407.09033 link
2024-07-12 Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation Zihao Li et.al. 2407.08994 null
2024-07-11 Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation Tong Shao et.al. 2407.08268 link
2024-07-11 Enrich the content of the image Using Context-Aware Copy Paste Qiushi Guo et.al. 2407.08151 null
2024-07-10 MambaVision: A Hybrid Mamba-Transformer Vision Backbone Ali Hatamizadeh et.al. 2407.08083 link
2024-07-10 Satellite Image Time Series Semantic Change Detection: Novel Architecture and Analysis of Domain Shift Elliot Vincent et.al. 2407.07616 link
2024-07-10 H-FCBFormer Hierarchical Fully Convolutional Branch Transformer for Occlusal Contact Segmentation with Articulating Paper Ryan Banks et.al. 2407.07604 link
2024-07-11 Trainable Highly-expressive Activation Functions Irit Chelly et.al. 2407.07564 link
2024-07-10 Deformable-Heatmap-Segmentation for Automobile Visual Perception Hongyu Jin et.al. 2407.07493 null
2024-07-10 Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining Tianfang Sun et.al. 2407.07465 null
2024-07-11 HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation Guoan Xu et.al. 2407.07441 null
2024-07-09 ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation Yuyuan Liu et.al. 2407.07171 link
2024-07-08 Training-free CryoET Tomogram Segmentation Yizhou Zhao et.al. 2407.06833 link
2024-07-09 CycleSAM: One-Shot Surgical Scene Segmentation using Cycle-Consistent Feature Matching to Prompt SAM Aditya Murali et.al. 2407.06795 null
2024-07-09 LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration Jiayi Liu et.al. 2407.06512 link
2024-07-08 Leveraging image captions for selective whole slide image annotation Jingna Qiu et.al. 2407.06363 link
2024-07-08 Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots Siva Krishna Ravipati et.al. 2407.06077 link
2024-07-08 Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts Puzuo Wang et.al. 2407.06043 null
2024-07-08 RHRSegNet: Relighting High-Resolution Night-Time Semantic Segmentation Sarah Elmahdy et.al. 2407.06016 null
2024-07-07 Semantic Segmentation for Real-World and Synthetic Vehicle’s Forward-Facing Camera Images Tuan T. Nguyen et.al. 2407.05452 null
2024-07-07 Self-supervised Learning via Cluster Distance Prediction for Operating Room Context Awareness Idris Hamoud et.al. 2407.05448 null
2024-07-06 A Study of Test-time Contrastive Concepts for Open-world, Open-vocabulary Semantic Segmentation Monika Wysoczańska et.al. 2407.05061 null
2024-07-06 BlessemFlood21: Advancing Flood Analysis with a High-Resolution Georeferenced Dataset for Humanitarian Aid Support Vladyslav Polushko et.al. 2407.05007 null
2024-07-05 Explainable Metric Learning for Deflating Data Bias Emma Andrews et.al. 2407.04866 null
2024-07-10 LMSeg: A deep graph message-passing network for efficient and accurate semantic segmentation of large-scale 3D landscape meshes Zexian Huang et.al. 2407.04326 null
2024-07-04 Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier Prantik Howlader et.al. 2407.04036 link
2024-07-04 Relative Difficulty Distillation for Semantic Segmentation Dong Liang et.al. 2407.03719 link
2024-07-04 POSTURE: Pose Guided Unsupervised Domain Adaptation for Human Body Part Segmentation Arindam Dutta et.al. 2407.03549 null
2024-07-03 A Unified Framework for 3D Scene Understanding Wei Xu et.al. 2407.03263 link
2024-07-03 ISWSST: Index-space-wave State Superposition Transformers for Multispectral Remotely Sensed Imagery Semantic Segmentation Chang Li et.al. 2407.03033 null
2024-07-03 ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation Yipin Guo et.al. 2407.02881 null
2024-07-03 Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation Tao Chen et.al. 2407.02768 link
2024-07-02 Open Panoramic Segmentation Junwei Zheng et.al. 2407.02685 link
2024-07-08 Holistically-Nested Structure-Aware Graph Neural Network for Road Extraction Tinghuai Wang et.al. 2407.02639 null
2024-07-02 Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather Junsung Park et.al. 2407.02286 link
2024-07-02 MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders Baijiong Lin et.al. 2407.02228 link
2024-07-02 Occlusion-Aware Seamless Segmentation Yihong Cao et.al. 2407.02182 link
2024-07-02 VRBiom: A New Periocular Dataset for Biometric Applications of HMD Ketan Kotwal et.al. 2407.02150 null
2024-07-02 Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual Prompts Pasquale De Marinis et.al. 2407.02075 link
2024-07-02 Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning Chengchao Shen et.al. 2407.02014 link
2024-07-01 Label-free Neural Semantic Image Synthesis Jiayi Wang et.al. 2407.01790 null
2024-07-01 PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction Xuan Yu et.al. 2407.01349 null
2024-07-01 CSFNet: A Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes Danial Qashqai et.al. 2407.01328 link
2024-06-29 SolarSAM: Building-scale Photovoltaic Potential Assessment Based on Segment Anything Model (SAM) and Remote Sensing for Emerging City Guohao Wang et.al. 2407.00296 link
2024-06-28 Assistive Image Annotation Systems with Deep Learning and Natural Language Capabilities: A Review Moseli Mots’oehli et.al. 2407.00252 null
2024-07-01 Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding Yifan Tang et.al. 2406.19791 null
2024-06-28 Precision matters: Precision-aware ensemble for weakly supervised semantic segmentation Junsung Park et.al. 2406.19638 link
2024-06-28 PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation Deyi Ji et.al. 2406.19632 null
2024-06-27 Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model Haobo Yuan et.al. 2406.19369 link
2024-06-27 ProtoGMM: Multi-prototype Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation Nazanin Moradinasab et.al. 2406.19225 null
2024-06-30 Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO Fuseini Mumuni et.al. 2406.19057 null
2024-06-27 Divide, Ensemble and Conquer: The Last Mile on Unsupervised Domain Adaptation for On-Board Semantic Segmentation Tao Lian et.al. 2406.18809 null
2024-06-26 CAS: Confidence Assessments of classification algorithms for Semantic segmentation of EO data Nikolaos Dionelis et.al. 2406.18279 link
2024-06-26 The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval Meinardus Boris et.al. 2406.18113 link
2024-06-26 Few-Shot Medical Image Segmentation with High-Fidelity Prototypes Song Tang et.al. 2406.18074 link
2024-06-25 Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation Xuming Zhang et.al. 2406.17679 null
2024-06-25 DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation Ahmad Mohammadshirazi et.al. 2406.17591 link
2024-06-25 Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation Felix Stillger et.al. 2406.17541 null
2024-06-25 Investigating Self-Supervised Methods for Label-Efficient Learning Srinivasa Rao Nandam et.al. 2406.17460 null
2024-06-25 Pseudo Labelling for Enhanced Masked Autoencoders Srinivasa Rao Nandam et.al. 2406.17450 null
2024-06-25 Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model Zhuoyuan Li et.al. 2406.17442 null
2024-06-25 Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes Qi Ma et.al. 2406.17438 link
2024-06-24 Instance Consistency Regularization for Semi-Supervised 3D Instance Segmentation Yizheng Wu et.al. 2406.16776 link
2024-06-24 μ-Net: A Deep Learning-Based Architecture for μ-CT Segmentation Pierangela Bruno et.al. 2406.16724 null
2024-06-24 GATSBI: An Online GTSP-Based Algorithm for Targeted Surface Bridge Inspection and Defect Detection Harnaik Dhami et.al. 2406.16625 link
2024-06-24 LOGCAN++: Local-global class-aware network for semantic segmentation of remote sensing images Xiaowen Ma et.al. 2406.16502 link
2024-06-24 Cascade Reward Sampling for Efficient Decoding-Time Alignment Bolian Li et.al. 2406.16306 link
2024-06-24 SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments Neng Wang et.al. 2406.16279 link
2024-06-23 UDHF2-Net: An Uncertainty-diffusion-model-based High-Frequency TransFormer Network for High-accuracy Interpretation of Remotely Sensed Imagery Pengfei Zhang et.al. 2406.16129 null
2024-06-22 Fine-grained Background Representation for Weakly Supervised Semantic Segmentation Xu Yin et.al. 2406.15755 link
2024-06-20 Evaluation of Deep Learning Semantic Segmentation for Land Cover Mapping on Multispectral, Hyperspectral and High Spatial Aerial Imagery Ilham Adi Panuntun et.al. 2406.14220 null
2024-06-20 Trusting Semantic Segmentation Networks Samik Some et.al. 2406.14201 null
2024-06-20 EvSegSNN: Neuromorphic Semantic Segmentation for Event Data Dalia Hareb et.al. 2406.14178 null
2024-06-20 Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images Qinfeng Zhu et.al. 2406.14086 link
2024-06-19 Search-based DNN Testing and Retraining with GAN-enhanced Simulations Mohammed Oualid Attaoui et.al. 2406.13359 null
2024-06-19 Deep Learning-Based 3D Instance and Semantic Segmentation: A Review Siddiqui Muhammad Yasir et.al. 2406.13308 null
2024-06-18 Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation Guoyu Yang et.al. 2406.12496 link
2024-06-18 Agriculture-Vision Challenge 2024 – The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble Wang Liu et.al. 2406.12271 null
2024-06-17 OoDIS: Anomaly Instance Segmentation Benchmark Alexey Nekrasov et.al. 2406.11835 link
2024-06-17 Multimodal Learning To Improve Segmentation With Intraoperative CBCT & Preoperative CT Maximilian E. Tschuchnig et.al. 2406.11650 null
2024-06-17 SWCF-Net: Similarity-weighted Convolution and Local-global Fusion for Efficient Large-scale Point Cloud Semantic Segmentation Zhenchao Lin et.al. 2406.11441 link
2024-06-17 Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding Yunsong Wang et.al. 2406.11283 null
2024-06-17 Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation Bingfeng Zhang et.al. 2406.11189 link
2024-06-21 $α$ -SSC: Uncertainty-Aware Camera-based 3D Semantic Scene Completion Sanbao Su et.al. 2406.11021 null
2024-06-16 PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery Libo Wang et.al. 2406.10828 link
2024-06-15 GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR Bharat Singh et.al. 2406.10722 null
2024-06-15 A Late-Stage Bitemporal Feature Fusion Network for Semantic Change Detection Chenyao Zhou et.al. 2406.10678 link
2024-06-14 ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers Narges Norouzi et.al. 2406.09936 link
2024-06-14 Label-Efficient Semantic Segmentation of LiDAR Point Clouds in Adverse Weather Conditions Aldi Piroli et.al. 2406.09906 null
2024-06-17 Exploring the Benefits of Vision Foundation Models for Unsupervised Domain Adaptation Brunó B. Englert et.al. 2406.09896 link
2024-06-14 Open-Vocabulary Semantic Segmentation with Image Embedding Balancing Xiangheng Shan et.al. 2406.09829 link
2024-06-13 Instance-level quantitative saliency in multiple sclerosis lesion segmentation Federico Spagnolo et.al. 2406.09335 link
2024-06-13 APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation Weizhao He et.al. 2406.08372 null
2024-06-12 Dataset Enhancement with Instance-Level Augmentations Orest Kupyn et.al. 2406.08249 link
2024-06-16 A $^{2}$ -MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder Lixian Zhang et.al. 2406.08079 null
2024-06-12 OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding Yinan Deng et.al. 2406.08009 link
2024-06-12 SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation Chanda Grover Kamra et.al. 2406.07986 link
2024-06-12 Small Scale Data-Free Knowledge Distillation He Liu et.al. 2406.07876 link
2024-06-11 Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph Sergey Linok et.al. 2406.07113 null
2024-06-11 PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving Yining Shi et.al. 2406.07037 null
2024-06-12 LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection Jiahua Xu et.al. 2406.07023 null
2024-06-10 Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation Dong Zhao et.al. 2406.06813 link
2024-06-09 Transforming Heart Chamber Imaging: Self-Supervised Learning for Whole Heart Reconstruction and Segmentation Abdul Qayyum et.al. 2406.06643 null
2024-06-10 Merlin: A Vision Language Foundation Model for 3D Computed Tomography Louis Blankemeier et.al. 2406.06512 null
2024-06-10 UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving Daniel Bogdoll et.al. 2406.06370 null
2024-06-09 Scaling Graph Convolutions for Mobile Vision William Avery et.al. 2406.05850 link
2024-06-09 Solution for CVPR 2024 UG2+ Challenge Track on All Weather Semantic Segmentation Jun Yu et.al. 2406.05837 null
2024-06-09 Convolution and Attention-Free Mamba-based Cardiac Image Segmentation Abbas Khan et.al. 2406.05786 link
2024-06-09 Separating the “Chirp” from the “Chat”: Self-supervised Visual Grounding of Sound and Language Mark Hamilton et.al. 2406.05629 link
2024-06-08 A Two-Stage Adverse Weather Semantic Segmentation Method for WeatherProof Challenge CVPR 2024 Workshop UG2+ Jianzhao Wang et.al. 2406.05513 null
2024-06-08 Layered Image Vectorization via Semantic Simplification Zhenyu Wang et.al. 2406.05404 null
2024-06-08 1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR’24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation Qingfeng Liu et.al. 2406.05352 null
2024-06-07 USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation Xiaoqi Wang et.al. 2406.05271 null
2024-06-07 Semantic Segmentation on VSPW Dataset through Masked Video Consistency Chen Liang et.al. 2406.04979 null
2024-06-07 Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment Venkanna Babu Guthula et.al. 2406.04949 null
2024-06-06 Characterizing segregation in blast rock piles a deep-learning approach leveraging aerial image analysis Chengeng Liu et.al. 2406.04149 null
2024-06-06 Frequency-based Matcher for Long-tailed Semantic Segmentation Shan Li et.al. 2406.03917 link
2024-06-07 Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge Nan Zhang et.al. 2406.03799 link
2024-06-06 DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation Zilu Guo et.al. 2406.03702 link
2024-06-05 Comparative Benchmarking of Failure Detection Methods in Medical Image Segmentation: Unveiling the Role of Confidence Aggregation Maximilian Zenk et.al. 2406.03323 null
2024-06-05 Learning Semantic Traversability with Egocentric Video and Automated Annotation Strategy Yunho Kim et.al. 2406.02989 null
2024-06-04 W-RIZZ: A Weakly-Supervised Framework for Relative Traversability Estimation in Mobile Robotics Andre Schreiber et.al. 2406.02822 link
2024-06-04 Window to Wall Ratio Detection using SegFormer Zoe De Simone et.al. 2406.02706 link
2024-06-04 Detecting Endangered Marine Species in Autonomous Underwater Vehicle Imagery Using Point Annotations and Few-Shot Learning Heather Doig et.al. 2406.01932 null
2024-06-03 EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding Thanh-Dat Truong et.al. 2406.01429 null
2024-06-03 TE-NeXt: A LiDAR-Based 3D Sparse Convolutional Network for Traversability Estimation Antonio Santo et.al. 2406.01395 link
2024-06-03 ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds Ka Lung Cheung et.al. 2406.01337 link
2024-06-03 LSKSANet: A Novel Architecture for Remote Sensing Image Semantic Segmentation Leveraging Large Selective Kernel and Sparse Attention Mechanism Miao Fu et.al. 2406.01228 null
2024-06-04 GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer Ding Jia et.al. 2406.01210 link
2024-06-03 S-CycleGAN: Semantic Segmentation Enhanced CT-Ultrasound Image-to-Image Translation for Robotic Ultrasonography Yuhan Song et.al. 2406.01191 link
2024-06-02 Diffusion Features to Bridge Domain Gap for Semantic Segmentation Yuxiang Ji et.al. 2406.00777 link
2024-06-06 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation Yunheng Li et.al. 2406.00670 link
2024-06-02 Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024 Biao Wu et.al. 2406.00587 null
2024-06-01 Memory-guided Network with Uncertainty-based Feature Augmentation for Few-shot Semantic Segmentation Xinyue Chen et.al. 2406.00545 null
2024-06-01 2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation Biao Wu et.al. 2406.00500 null
2024-06-01 DSCA: A Digital Subtraction Angiography Sequence Dataset and Spatio-Temporal Model for Cerebral Artery Segmentation Qihang Xie et.al. 2406.00341 null
2024-06-01 Complex Style Image Transformations for Domain Generalization in Medical Images Nikolaos Spanos et.al. 2406.00298 null
2024-05-31 TotalVibeSegmentator: Full Torso Segmentation for the NAKO and UK Biobank in Volumetric Interpolated Breath-hold Examination Body Images Robert Graf et.al. 2406.00125 link
2024-05-31 Uncertainty Quantification for Bird’s Eye View Semantic Segmentation: Methods and Benchmarks Linlin Yu et.al. 2405.20986 null
2024-05-31 Revisiting and Maximizing Temporal Knowledge in Semi-supervised Semantic Segmentation Wooseok Shin et.al. 2405.20610 link
2024-05-30 P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation Qi Zhang et.al. 2405.20443 link
2024-05-30 SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow Chaoyang Wang et.al. 2405.20282 link
2024-05-30 MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion Angel Villar-Corrales et.al. 2405.19921 link
2024-05-30 Open-Set Domain Adaptation for Semantic Segmentation Seun-An Choe et.al. 2405.19899 link
2024-05-30 DenseSeg: Joint Learning for Semantic Segmentation and Landmark Detection Using Dense Image-to-Shape Representation Ron Keuth et.al. 2405.19746 link
2024-05-30 Twin Deformable Point Convolutions for Point Cloud Semantic Segmentation in Remote Sensing Scenes Yong-Qiang Mao et.al. 2405.19735 null
2024-05-30 CRIS: Collaborative Refinement Integrated with Segmentation for Polyp Segmentation Ankush Gajanan Arudkar et.al. 2405.19672 null
2024-05-29 Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation Lianlei Shan et.al. 2405.19568 null
2024-05-29 Enabling Visual Recognition at Radio Frequency Haowen Lai et.al. 2405.19516 null
2024-05-29 Reasoning3D – Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models Tianrun Chen et.al. 2405.19326 null
2024-05-29 A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation Niclas Vödisch et.al. 2405.19035 link
2024-05-29 Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation Zelin Peng et.al. 2405.18840 null
2024-05-28 Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation JuneHyoung Kwon et.al. 2405.18148 null
2024-05-28 Edge-guided and Class-balanced Active Learning for Semantic Segmentation of Aerial Images Lianlei Shan et.al. 2405.18078 null
2024-05-28 RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields Mihnea-Bogdan Jurca et.al. 2405.18033 link
2024-05-28 DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture Shentong Mo et.al. 2405.17995 link
2024-05-28 The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention Xingyu Ding et.al. 2405.17776 null
2024-05-27 Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation Steven Landgraf et.al. 2405.17097 null
2024-05-27 DSU-Net: Dynamic Snake U-Net for 2-D Seismic First Break Picking Hongtao Wang et.al. 2405.16980 null
2024-05-27 Collective Perception Datasets for Autonomous Driving: A Comprehensive Review Sven Teufel et.al. 2405.16973 null
2024-05-27 Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models Qian Wang et.al. 2405.16947 link
2024-05-27 A re-calibration method for object detection with multi-modal alignment bias in autonomous driving Zhihang Song et.al. 2405.16848 null
2024-05-25 BOLD: Boolean Logic Deep Learning Van Minh Nguyen et.al. 2405.16339 null
2024-05-25 Improving 3D Occupancy Prediction through Class-balancing Loss and Multi-scale Representation Huizhou Chen et.al. 2405.16099 null
2024-05-25 Intensity and Texture Correction of Omnidirectional Image Using Camera Images for Indirect Augmented Reality Hakim Ikebayashi et.al. 2405.16008 null
2024-05-24 Visualize and Paint GAN Activations Rudolf Herdt et.al. 2405.15636 null
2024-05-24 Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasets Hoàng-Ân Lê et.al. 2405.15394 link
2024-05-24 U3M: Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation Bingyu Li et.al. 2405.15365 link
2024-05-24 Cross-Domain Few-Shot Semantic Segmentation via Doubly Matching Transformation Jiayi Chen et.al. 2405.15265 link
2024-05-23 Mamba-R: Vision Mamba ALSO Needs Registers Feng Wang et.al. 2405.14858 null
2024-05-23 Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation Daniel Kienzle et.al. 2405.14467 link
2024-05-23 MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models Jiuming Liu et.al. 2405.14338 null
2024-05-23 Tuning-free Universally-Supervised Semantic Segmentation Xiaobo Yang et.al. 2405.14294 null
2024-05-23 SCMix: Stochastic Compound Mixing for Open Compound Domain Adaptation in Semantic Segmentation Kai Yao et.al. 2405.14278 null
2024-05-23 Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations Mohammed Baharoon et.al. 2405.14239 link
2024-05-24 Leveraging Semantic Segmentation Masks with Embeddings for Fine-Grained Form Classification Taylor Archibald et.al. 2405.14162 null
2024-05-23 Skip-SCAR: A Modular Approach to ObjectGoal Navigation with Sparsity and Adaptive Skips Yaotian Liu et.al. 2405.14154 null
2024-05-22 TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System Diogo Lavado et.al. 2405.13989 null
2024-05-22 Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer Qihang Fan et.al. 2405.13337 link
2024-05-22 Vision Transformer with Sparse Scan Prior Qihang Fan et.al. 2405.13335 link
2024-05-22 Deep Learning-Driven State Correction: A Hybrid Architecture for Radar-Based Dynamic Occupancy Grid Mapping Max Peter Ronecker et.al. 2405.13307 null
2024-05-21 Transparency Distortion Robustness for SOTA Image Segmentation Tasks Volker Knauthe et.al. 2405.12864 null
2024-05-20 A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation Sushmita Sarker et.al. 2405.11903 null
2024-05-20 Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments Jooyong Park et.al. 2405.11855 null
2024-05-20 Universal Organizer of SAM for Unsupervised Semantic Segmentation Tingting Li et.al. 2405.11742 link
2024-05-19 Interpreting a Semantic Segmentation Model for Coastline Detection Conor O’Sullivan et.al. 2405.11500 link
2024-05-17 CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation Mushui Liu et.al. 2405.10530 link
2024-05-16 Towards Task-Compatible Compressible Representations Anderson de Andrade et.al. 2405.10244 link
2024-05-16 A Preprocessing and Postprocessing Voxel-based Method for LiDAR Semantic Segmentation Improvement in Long Distance Andrea Matteazzi et.al. 2405.10046 null
2024-05-16 Towards Realistic Incremental Scenario in Class Incremental Semantic Segmentation Jihwan Kwak et.al. 2405.09858 link
2024-05-15 Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation Guo Yachan et.al. 2405.09682 null
2024-05-14 CLIP with Quality Captions: A Strong Pretraining for Vision Tasks Pavan Kumar Anasosalu Vasu et.al. 2405.08911 null
2024-05-14 Rethinking Scanning Strategies with Vision Mamba in Semantic Segmentation of Remote Sensing Imagery: An Experimental Study Qinfeng Zhu et.al. 2405.08493 null
2024-05-14 TEDNet: Twin Encoder Decoder Neural Network for 2D Camera and LiDAR Road Detection Martín Bayón-Gutiérrez et.al. 2405.08429 link
2024-05-13 IMAFD: An Interpretable Multi-stage Approach to Flood Detection from time series Multispectral Data Ziyang Zhang et.al. 2405.07916 null
2024-05-12 Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception Haoming Chen et.al. 2405.07201 link
2024-05-10 GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs Mustafa Munir et.al. 2405.06849 link
2024-05-10 Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach Elham Ravanbakhsh et.al. 2405.06586 null
2024-05-10 Semantic and Spatial Adaptive Pixel-level Classifier for Semantic Segmentation Xiaowen Ma et.al. 2405.06525 link
2024-05-10 Multi-Target Unsupervised Domain Adaptation for Semantic Segmentation without External Data Yonghao Xu et.al. 2405.06502 link
2024-05-10 Multi-level Personalized Federated Learning on Heterogeneous and Long-Tailed Data Rongyu Zhang et.al. 2405.06413 null
2024-05-10 Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation Zhenliang Ni et.al. 2405.06228 link
2024-05-10 Zero-shot Degree of Ill-posedness Estimation for Active Small Object Change Detection Koji Takeda et.al. 2405.06185 null
2024-05-10 Prior-guided Diffusion Model for Cell Segmentation in Quantitative Phase Imaging Zhuchen Shao et.al. 2405.06175 null
2024-05-09 Mask-TS Net: Mask Temperature Scaling Uncertainty Calibration for Polyp Segmentation Yudian Zhang et.al. 2405.05830 null
2024-05-08 OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies Lingdong Kong et.al. 2405.05259 link
2024-05-08 Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving Lingdong Kong et.al. 2405.05258 link
2024-05-08 Weakly-supervised Semantic Segmentation via Dual-stream Contrastive Learning of Cross-image Contextual Information Qi Lai et.al. 2405.04913 null
2024-05-08 DeepDamageNet: A two-step deep-learning model for multi-disaster building damage segmentation and classification using satellite imagery Irene Alisjahbana et.al. 2405.04800 null
2024-05-13 FRACTAL: An Ultra-Large-Scale Aerial Lidar Dataset for 3D Semantic Segmentation of Diverse Landscapes Charles Gaydon et.al. 2405.04634 link
2024-05-07 A New Dataset and Comparative Study for Aphid Cluster Detection and Segmentation in Sorghum Fields Raiyan Rahman et.al. 2405.04305 null
2024-05-07 ELiTe: Efficient Image-to-LiDAR Knowledge Transfer for Semantic Segmentation Zhibo Zhang et.al. 2405.04121 null
2024-05-06 PTQ4SAM: Post-Training Quantization for Segment Anything Chengtao Lv et.al. 2405.03144 link
2024-05-04 MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning Vishal Nedungadi et.al. 2405.02771 link
2024-05-04 Few-Shot Fruit Segmentation via Transfer Learning Jordan A. James et.al. 2405.02556 link
2024-05-03 DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model Peijin Jia et.al. 2405.02008 null
2024-05-02 Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey Guoping Xu et.al. 2405.01725 link
2024-05-02 Explainable AI (XAI) in Image Segmentation in Medicine, Industry, and Beyond: A Survey Rokas Gipiškis et.al. 2405.01636 null
2024-05-02 CromSS: Cross-modal pre-training with noisy labels for remote sensing image segmentation Chenying Liu et.al. 2405.01217 null
2024-05-02 Uncertainty-aware self-training with expectation maximization basis transformation Zijia Wang et.al. 2405.01175 null
2024-05-01 Exploring Self-Supervised Vision Transformers for Deepfake Detection: A Comparative Analysis Huy H. Nguyen et.al. 2405.00355 link
2024-04-30 Masked Multi-Query Slot Attention for Unsupervised Object Discovery Rishav Pramanik et.al. 2404.19654 link
2024-04-30 DELINE8K: A Synthetic Data Pipeline for the Semantic Segmentation of Historical Documents Taylor Archibald et.al. 2404.19259 null
2024-04-29 Swin2-MoSE: A New Single Image Super-Resolution Model for Remote Sensing Leonardo Rossi et.al. 2404.18924 link
2024-04-29 IPixMatch: Boost Semi-supervised Semantic Segmentation with Inter-Pixel Relation Kebin Wu et.al. 2404.18891 null
2024-04-29 Towards Long-term Robotics in the Wild Stephen Hausler et.al. 2404.18477 null
2024-04-27 Multi-Stream Cellular Test-Time Adaptation of Real-Time Models Evolving in Dynamic Environments Benoît Gérin et.al. 2404.17930 link
2024-04-27 GLIMS: Attention-Guided Lightweight Multi-Scale Hybrid Network for Volumetric Semantic Segmentation Ziya Ata Yazıcı et.al. 2404.17854 link
2024-04-27 CLFT: Camera-LiDAR Fusion Transformer for Semantic Segmentation in Autonomous Driving Junyi Gu et.al. 2404.17793 link
2024-04-26 Optimizing Universal Lesion Segmentation: State Space Model-Guided Hierarchical Networks with Feature Importance Adjustment Kazi Shahriar Sanjid et.al. 2404.17235 null
2024-04-25 Calculation of Femur Caput Collum Diaphyseal angle for X-Rays images using Semantic Segmentation Deepak Bhatia et.al. 2404.17083 null
2024-04-25 Boosting Unsupervised Semantic Segmentation with Principal Mask Proposals Oliver Hahn et.al. 2404.16818 link
2024-04-26 Multi-Scale Representations by Varying Window Attention for Semantic Segmentation Haotian Yan et.al. 2404.16573 link
2024-04-25 360SFUDA++: Towards Source-free UDA for Panoramic Segmentation by Learning Reliable Category Prototypes Xu Zheng et.al. 2404.16501 null
2024-04-25 Semantic Segmentation Refiner for Ultrasound Applications with Zero-Shot Foundation Models Hedda Cohen Indelman et.al. 2404.16325 null
2024-04-25 Style Adaptation for Domain-adaptive Semantic Segmentation Ting Li et.al. 2404.16301 null
2024-04-29 A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic Segmentation Yifan Zhao et.al. 2404.16266 link
2024-04-24 3D Freehand Ultrasound using Visual Inertial and Deep Inertial Odometry for Measuring Patellar Tracking Russell Buchanan et.al. 2404.15847 null
2024-04-24 Vision Transformer-based Adversarial Domain Adaptation Yahan Li et.al. 2404.15817 link
2024-04-22 OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks Sophia Sirko-Galouchenko et.al. 2404.14027 link
2024-04-21 Semantic-Rearrangement-Based Multi-Level Alignment for Domain Generalized Segmentation Guanlong Jiao et.al. 2404.13701 null
2024-04-21 PV-S3: Advancing Automatic Photovoltaic Defect Detection using Semi-Supervised Semantic Segmentation of Electroluminescence Images Abhishek Jha et.al. 2404.13693 link
2024-04-21 A Complete System for Automated 3D Semantic-Geometric Mapping of Corrosion in Industrial Environments Rui Pimentel de Figueiredo et.al. 2404.13691 null
2024-04-21 LMFNet: An Efficient Multimodal Fusion Approach for Semantic Segmentation in High-Resolution Remote Sensing Tong Wang et.al. 2404.13659 null
2024-04-21 Towards Unified Representation of Multi-Modal Pre-training for 3D Understanding via Differentiable Rendering Ben Fei et.al. 2404.13619 null
2024-04-20 AMMUNet: Multi-Scale Attention Map Merging for Remote Sensing Image Segmentation Yang Yang et.al. 2404.13408 link
2024-04-19 BACS: Background Aware Continual Semantic Segmentation Mostafa ElAraby et.al. 2404.13148 link
2024-04-19 ToNNO: Tomographic Reconstruction of a Neural Network’s Output for Weakly Supervised Segmentation of 3D Medical Images Marius Schmidt-Mengin et.al. 2404.13103 null
2024-04-19 Foundation Model assisted Weakly Supervised LiDAR Semantic Segmentation Yilong Chen et.al. 2404.12861 null
2024-04-19 COIN: Counterfactual inpainting for weakly supervised semantic segmentation for medical images Dmytro Shvetsov et.al. 2404.12832 link
2024-04-19 A Point-Based Approach to Efficient LiDAR Multi-Task Perception Christopher Lang et.al. 2404.12798 null
2024-04-19 Generalized Few-Shot Meets Remote Sensing: Discovering Novel Classes in Land Cover Mapping via Hybrid Semantic Segmentation Framework Zhuohong Li et.al. 2404.12721 link
2024-04-19 Improving Prediction Accuracy of Semantic Segmentation Methods Using Convolutional Autoencoder Based Pre-processing Layers Hisashi Shimodaira et.al. 2404.12718 null
2024-04-19 Show and Grasp: Few-shot Semantic Segmentation for Robot Grasping through Zero-shot Foundation Models Leonardo Barcellona et.al. 2404.12717 null
2024-04-18 A Perspective on Deep Vision Performance with Standard Image and Video Codecs Christoph Reich et.al. 2404.12330 null
2024-04-18 Deep Gaussian mixture model for unsupervised image segmentation Matthias Schwab et.al. 2404.12252 link
2024-04-18 Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training Jin Gao et.al. 2404.12210 link
2024-04-18 How to Benchmark Vision Foundation Models for Semantic Segmentation? Tommie Kerssies et.al. 2404.12172 link
2024-04-19 Tendency-driven Mutual Exclusivity for Weakly Supervised Incremental Semantic Segmentation Chongjie Si et.al. 2404.11981 null
2024-04-18 Group-On: Boosting One-Shot Segmentation with Supportive Query Hanjing Zhou et.al. 2404.11871 null
2024-04-17 Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach Mir Rayat Imtiaz Hossain et.al. 2404.11732 null
2024-04-17 A Semantic Segmentation-guided Approach for Ground-to-Aerial Image Matching Francesco Pro et.al. 2404.11302 link
2024-04-17 Learning from Unlabelled Data with Transformers: Domain Adaptation for Semantic Segmentation of High Resolution Aerial Images Nikolaos Dionelis et.al. 2404.11299 link
2024-04-16 A Concise Tiling Strategy for Preserving Spatial Context in Earth Observation Imagery Ellianna Abrahams et.al. 2404.10927 link
2024-04-16 Vocabulary-free Image Classification and Semantic Segmentation Alessandro Conti et.al. 2404.10864 link
2024-04-16 Gasformer: A Transformer-based Architecture for Segmenting Methane Emissions from Livestock in Optical Gas Imaging Toqi Tahamid Sarker et.al. 2404.10841 link
2024-04-16 Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark Jiangning Zhang et.al. 2404.10760 link
2024-04-16 ECLAIR: A High-Fidelity Aerial LiDAR Dataset for Semantic Segmentation Iaroslav Melekhov et.al. 2404.10699 link
2024-04-16 Contextrast: Contextual Contrastive Learning for Semantic Segmentation Changki Sung et.al. 2404.10633 null
2024-04-16 Label merge-and-split: A graph-colouring approach for memory-efficient brain parcellation Aaron Kujawa et.al. 2404.10572 null
2024-04-16 LAECIPS: Large Vision Model Assisted Adaptive Edge-Cloud Collaboration for IoT-based Perception System Shijing Hu et.al. 2404.10498 null
2024-04-16 Adversarial Identity Injection for Semantic Face Image Synthesis Giuseppe Tarollo et.al. 2404.10408 null
2024-04-16 Domain-Rectifying Adapter for Cross-Domain Few-Shot Segmentation Jiapeng Su et.al. 2404.10322 link
2024-04-16 Learnable Prompt for Few-Shot Semantic Segmentation in Remote Sensing Domain Steve Andreas Immanuel et.al. 2404.10307 link
2024-04-15 Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL Fangwei Zhong et.al. 2404.09857 null
2024-04-15 In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation Han Xue et.al. 2404.09633 null
2024-04-15 The revenge of BiSeNet: Efficient Multi-Task Image Segmentation Gabriele Rosi et.al. 2404.09570 null
2024-04-16 Human-in-the-Loop Segmentation of Multi-species Coral Imagery Scarlett Raine et.al. 2404.09406 link
2024-04-14 Bridging Data Islands: Geographic Heterogeneity-Aware Federated Learning for Collaborative Remote Sensing Semantic Segmentation Jieyi Tan et.al. 2404.09292 null
2024-04-12 Analyzing Decades-Long Environmental Changes in Namibia Using Archival Aerial Photography and Deep Learning Girmaw Abebe Tadesse et.al. 2404.08544 null
2024-04-12 LaSagnA: Language-based Segmentation Assistant for Complex Queries Cong Wei et.al. 2404.08506 link
2024-04-12 Tackling Ambiguity from Perspective of Uncertainty Inference and Affinity Diversification for Weakly Supervised Semantic Segmentation Zhiwei Yang et.al. 2404.08195 link
2024-04-12 Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation Sina Hajimiri et.al. 2404.08181 link
2024-04-10 AI-Guided Feature Segmentation Techniques to Model Features from Single Crystal Diamond Growth Rohan Reddy Mekala et.al. 2404.08017 null
2024-04-11 Exploiting Object-based and Segmentation-based Semantic Features for Deep Learning-based Indoor Scene Classification Ricardo Pereira et.al. 2404.07739 null
2024-04-11 OpenTrench3D: A Photogrammetric 3D Point Cloud Dataset for Semantic Segmentation of Underground Utilities Lasse H. Hansen et.al. 2404.07711 link
2024-04-11 Implicit and Explicit Language Guidance for Diffusion-based Visual Perception Hefeng Wang et.al. 2404.07600 null
2024-04-11 Improving Shift Invariance in Convolutional Neural Networks with Translation Invariant Polyphase Sampling Sourajit Saha et.al. 2404.07410 link
2024-04-10 AI-Guided Defect Detection Techniques to Model Single Crystal Diamond Growth Rohan Reddy Mekala et.al. 2404.07306 null
2024-04-10 RESSCAL3D: Resolution Scalable 3D Semantic Segmentation of Point Clouds Remco Royen et.al. 2404.06863 null
2024-04-10 O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation Muer Tie et.al. 2404.06836 null
2024-04-10 Convolution-based Probability Gradient Loss for Semantic Segmentation Guohang Shan et.al. 2404.06704 link
2024-04-09 Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation Luca Barsellotti et.al. 2404.06542 null
2024-04-09 QueSTMaps: Queryable Semantic Topological Maps for 3D Scene Understanding Yash Mehan et.al. 2404.06442 null
2024-04-09 DaF-BEVSeg: Distortion-aware Fisheye Camera based Bird’s Eye View Segmentation with Occlusion Reasoning Senthil Yogamani et.al. 2404.06352 null
2024-04-09 Hierarchical Insights: Exploiting Structural Similarities for Reliable 3D Semantic Segmentation Mariella Dreissig et.al. 2404.06124 null
2024-04-09 Improving Facial Landmark Detection Accuracy and Efficiency with Knowledge Distillation Zong-Wei Hong et.al. 2404.06029 null
2024-04-08 Evaluating the Efficacy of Cut-and-Paste Data Augmentation in Semantic Segmentation for Satellite Imagery Ionut M. Motoi et.al. 2404.05693 link
2024-04-08 AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation Jiannan Ge et.al. 2404.05667 null
2024-04-08 Impact of LiDAR visualisations on semantic segmentation of archaeological objects Raveerat Jaturapitpornchai et.al. 2404.05512 null
2024-04-08 Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance Dazhong Shen et.al. 2404.05384 link
2024-04-08 GPS-free Autonomous Navigation in Cluttered Tree Rows with Deep Semantic Segmentation Alessandro Navone et.al. 2404.05338 null
2024-04-08 Human Detection from 4D Radar Data in Low-Visibility Field Conditions Mikael Skog et.al. 2404.05307 null
2024-04-08 iVPT: Improving Task-relevant Information Sharing in Visual Prompt Tuning by Cross-layer Dynamic Connection Nan Zhou et.al. 2404.05207 null
2024-04-08 UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather Haimei Zhao et.al. 2404.05145 null
2024-04-07 D2SL: Decouple Defogging and Semantic Learning for Foggy Domain-Adaptive Segmentation Xuan Sun et.al. 2404.04807 null
2024-04-06 HawkDrive: A Transformer-driven Visual Perception System for Autonomous Driving in Night Scene Ziang Guo et.al. 2404.04653 link
2024-04-06 Panoptic Perception: A Novel Task and Fine-grained Dataset for Universal Remote Sensing Image Interpretation Danpei Zhao et.al. 2404.04608 null
2024-04-06 PIE: Physics-inspired Low-light Enhancement Dong Liang et.al. 2404.04586 null
2024-04-06 Frequency Decomposition-Driven Unsupervised Domain Adaptation for Remote Sensing Image Semantic Segmentation Xianping Ma et.al. 2404.04531 link
2024-04-05 Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation Zifu Wan et.al. 2404.04256 link
2024-04-05 Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation Ji-Jia Wu et.al. 2404.04231 link
2024-04-05 MarsSeg: Mars Surface Semantic Segmentation with Multi-level Extractor and Connector Junbo Li et.al. 2404.04155 null
2024-04-04 Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation Elham Amin Mansour et.al. 2404.03799 null
2024-04-04 Flattening the Parent Bias: Hierarchical Semantic Segmentation in the Poincaré Ball Simon Weber et.al. 2404.03778 link
2024-04-09 Background Noise Reduction of Attention Map for Weakly Supervised Semantic Segmentation Izumi Fujimori et.al. 2404.03394 null
2024-04-03 GPU-Accelerated RSF Level Set Evolution for Large-Scale Microvascular Segmentation Meher Niger et.al. 2404.02813 null
2024-04-03 RS-Mamba for Large Remote Sensing Image Dense Prediction Sijie Zhao et.al. 2404.02668 link
2024-04-03 A Satellite Band Selection Framework for Amazon Forest Deforestation Detection Task Eduardo Neto et.al. 2404.02659 null
2024-04-03 SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation Junyan Ye et.al. 2404.02638 link
2024-04-03 Active learning for efficient annotation in precision agriculture: a use-case on crop-weed semantic segmentation Bart M. van Marrewijk et.al. 2404.02580 null
2024-04-03 HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras Zhongyu Xia et.al. 2404.02517 link
2024-04-03 Optimizing traffic signs and lights visibility for the teleoperation of autonomous vehicles through ROI compression I. Dror et.al. 2404.02481 null
2024-04-03 RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation Xianping Ma et.al. 2404.02457 link
2024-04-02 Constrained Robotic Navigation on Preferred Terrains Using LLMs and Speech Instruction: Exploiting the Power of Adverbs Faraz Lotfi et.al. 2404.02294 null
2024-04-01 Versatile Navigation under Partial Observability via Value-guided Diffusion Policy Gengyu Zhang et.al. 2404.02176 null
2024-04-02 Multi-Level Label Correction by Distilling Proximate Patterns for Semi-supervised Semantic Segmentation Hui Xiao et.al. 2404.02065 null
2024-04-02 Synthetic Data for Robust Stroke Segmentation Liam Chalcroft et.al. 2404.01946 link
2024-04-02 Improving Bird’s Eye View Semantic Segmentation by Task Decomposition Tianhao Zhao et.al. 2404.01925 link
2024-04-02 Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model Qinfeng Zhu et.al. 2404.01705 link
2024-04-04 Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss Jaeha Kim et.al. 2404.01692 link
2024-04-01 PDF: A Probability-Driven Framework for Open World 3D Point Cloud Semantic Segmentation Jinfeng Xu et.al. 2404.00979 link
2024-04-01 GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields Yunsong Wang et.al. 2404.00931 link
2024-04-02 Rethinking Saliency-Guided Weakly-Supervised Semantic Segmentation Beomyoung Kim et.al. 2404.00918 link
2024-03-31 Training-Free Semantic Segmentation via LLM-Supervision Wenfang Sun et.al. 2404.00701 null
2024-03-31 LAESI: Leaf Area Estimation with Synthetic Imagery Jacek Kałużny et.al. 2404.00593 null
2024-03-29 Modeling Weather Uncertainty for Multi-weather Co-Presence Estimation Qi Bi et.al. 2403.20092 null
2024-03-29 MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection Ali Behrouz et.al. 2403.19888 null
2024-03-28 Segmentation Re-thinking Uncertainty Estimation Metrics for Semantic Segmentation Qitian Ma et.al. 2403.19826 null
2024-03-28 ENet-21: An Optimized light CNN Structure for Lane Detection Seyed Rasoul Hosseini et.al. 2403.19782 null
2024-03-29 Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers Pingcheng Dong et.al. 2403.19591 link
2024-03-28 DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs Donghyun Kim et.al. 2403.19588 link
2024-03-28 Learning Multiple Representations with Inconsistency-Guided Detail Regularization for Mask-Guided Matting Weihao Jiang et.al. 2403.19213 null
2024-03-27 Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D Mukund Varma T et.al. 2403.18922 null
2024-03-27 I2CKD : Intra- and Inter-Class Knowledge Distillation for Semantic Segmentation Ayoub Karine et.al. 2403.18490 null
2024-03-28 ViTAR: Vision Transformer with Any Resolution Qihang Fan et.al. 2403.18361 null
2024-03-27 Generating Diverse Agricultural Data for Vision-Based Farming Applications Mikolaj Cieslak et.al. 2403.18351 null
2024-03-27 Road Obstacle Detection based on Unknown Objectness Scores Chihiro Noguchi et.al. 2403.18207 null
2024-03-26 The Need for Speed: Pruning Transformers with One Recipe Samir Khaki et.al. 2403.17921 link
2024-03-26 Compressed Multi-task embeddings for Data-Efficient Downstream training and inference in Earth Observation Carlos Gomes et.al. 2403.17886 link
2024-03-26 PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition Chenhongyi Yang et.al. 2403.17695 link
2024-03-26 Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Legion Kazi Shahriar Sanjid et.al. 2403.17432 null
2024-03-25 Optimizing LiDAR Placements for Robust Driving Perception in Adverse Conditions Ye Li et.al. 2403.17009 link
2024-03-25 DreamLIP: Language-Image Pre-training with Long Captions Kecheng Zheng et.al. 2403.17007 link
2024-03-25 TwinLiteNetPlus: A Stronger Model for Real-time Drivable Area and Lane Segmentation Quang-Huy Che et.al. 2403.16958 link
2024-03-25 HPL-ESS: Hybrid Pseudo-Labeling for Unsupervised Event-based Semantic Segmentation Linglin Jing et.al. 2403.16788 null
2024-03-25 SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation Aysim Toker et.al. 2403.16605 null
2024-03-25 Self-Supervised Learning for Medical Image Data with Anatomy-Oriented Imaging Planes Tianwei Zhang et.al. 2403.16499 null
2024-03-25 GoodSAM: Bridging Domain and Capacity Gaps via Segment Anything Model for Distortion-aware Panoramic Semantic Segmentation Weiming Zhang et.al. 2403.16370 null
2024-03-24 Dual-modal Prior Semantic Guided Infrared and Visible Image Fusion for Intelligent Transportation System Jing Li et.al. 2403.16227 null
2024-03-24 Segment Anything Model for Road Network Graph Extraction Congrui Hetang et.al. 2403.16051 link
2024-03-24 SM2C: Boost the Semi-supervised Segmentation for Medical Image by using Meta Pseudo Labels and Mixed Images Yifei Wang et.al. 2403.16009 null
2024-03-22 Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting Jun Guo et.al. 2403.15624 null
2024-03-22 A2DMN: Anatomy-Aware Dilated Multiscale Network for Breast Ultrasound Semantic Segmentation Kyle Lucke et.al. 2403.15560 null
2024-03-22 InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding Yi Wang et.al. 2403.15377 link
2024-03-22 Anytime, Anywhere, Anyone: Investigating the Feasibility of Segment Anything Model for Crowd-Sourcing Medical Image Annotations Pranav Kulkarni et.al. 2403.15218 link
2024-03-22 Your Image is My Video: Reshaping the Receptive Field via Image-To-Video Differentiable AutoAugmentation and Fusion Sofia Casarin et.al. 2403.15194 null
2024-03-22 Improve Cross-domain Mixed Sampling with Guidance Training for Adaptive Segmentation Wenlve Zhou et.al. 2403.14995 link
2024-03-21 WeatherProof: Leveraging Language Guidance for Semantic Segmentation in Adverse Weather Blake Gella et.al. 2403.14874 null
2024-03-21 Learning to Project for Cross-Task Knowledge Distillation Dylan Auty et.al. 2403.14494 null
2024-03-21 OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation Bohao Peng et.al. 2403.14418 link
2024-03-21 Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models Pablo Marcos-Manchón et.al. 2403.14291 link
2024-03-21 OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation Kwanyoung Kim et.al. 2403.14183 link
2024-03-21 Evidential Semantic Mapping in Off-road Environments with Uncertainty-aware Bayesian Kernel Inference Junyoung Kim et.al. 2403.14138 null
2024-03-21 Soft Masked Transformer for Point Cloud Processing with Skip Attention-Based Upsampling Yong He et.al. 2403.14124 null
2024-03-21 Semantics from Space: Satellite-Guided Thermal Semantic Segmentation Annotation for Aerial Field Robots Connor Lee et.al. 2403.14056 null
2024-03-20 When Cars meet Drones: Hyperbolic Federated Learning for Source-Free Domain Adaptation in Adverse Weather Giulia Rizzoli et.al. 2403.13762 link
2024-03-20 Next day fire prediction via semantic segmentation Konstantinos Alexis et.al. 2403.13545 null
2024-03-20 MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining Di Wang et.al. 2403.13430 link
2024-03-20 AMCO: Adaptive Multimodal Coupling of Vision and Proprioception for Quadruped Robot Navigation in Outdoor Environments Mohamed Elnoor et.al. 2403.13235 null
2024-03-20 Modeling the Label Distributions for Weakly-Supervised Semantic Segmentation Linshan Wu et.al. 2403.13225 link
2024-03-19 Reflectivity Is All You Need!: Advancing LiDAR Semantic Segmentation Kasi Viswanath et.al. 2403.13188 link
2024-03-19 As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks? Anjun Hu et.al. 2403.12693 null
2024-03-19 PCT: Perspective Cue Training Framework for Multi-Camera BEV Segmentation Haruya Ishikawa et.al. 2403.12530 null
2024-03-19 Semantics, Distortion, and Style Matter: Towards Source-free UDA for Panoramic Segmentation Xu Zheng et.al. 2403.12505 null
2024-03-18 Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation Wangbo Zhao et.al. 2403.11808 link
2024-03-22 LSKNet: A Foundation Lightweight Backbone for Remote Sensing Yuxuan Li et.al. 2403.11735 link
2024-03-18 TTT-KD: Test-Time Training for 3D Semantic Segmentation through Knowledge Distillation from Foundation Models Lisa Weijler et.al. 2403.11691 null
2024-03-18 OurDB: Ouroboric Domain Bridging for Multi-Target Domain Adaptive Semantic Segmentation Seungbeom Woo et.al. 2403.11582 null
2024-03-18 MCD: Diverse Large-Scale Multi-Campus Dataset for Robot Perception Thien-Minh Nguyen et.al. 2403.11496 null
2024-03-18 Uncertainty-Calibrated Test-Time Model Adaptation without Forgetting Mingkui Tan et.al. 2403.11491 null
2024-03-17 TAG: Guidance-free Open-Vocabulary Semantic Segmentation Yasufumi Kawano et.al. 2403.11197 link
2024-03-17 MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation Yasufumi Kawano et.al. 2403.11194 link
2024-03-17 DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation Yuanchen Wu et.al. 2403.11184 link
2024-03-17 LERENet: Eliminating Intra-class Differences for Metal Surface Defect Few-shot Semantic Segmentation Hanze Ding et.al. 2403.11122 null
2024-03-17 Adaptive Semantic-Enhanced Denoising Diffusion Probabilistic Model for Remote Sensing Image Super-Resolution Jialu Sui et.al. 2403.11078 link
2024-03-17 Intelligent Railroad Grade Crossing: Leveraging Semantic Segmentation and Object Detection for Enhanced Safety Al Amin et.al. 2403.11060 null
2024-03-16 Fuzzy Rank-based Late Fusion Technique for Cytology image Segmentation Soumyajyoti Dey et.al. 2403.10884 null
2024-03-16 Active Label Correction for Semantic Segmentation with Foundation Models Hoyoung Kim et.al. 2403.10820 link
2024-03-15 SwinMTL: A Shared Architecture for Simultaneous Depth Estimation and Semantic Segmentation from Monocular Camera Images Pardis Taghavi et.al. 2403.10662 link
2024-03-15 FeatUp: A Model-Agnostic Framework for Features at Any Resolution Stephanie Fu et.al. 2403.10516 link
2024-03-15 Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search Hongyuan Yu et.al. 2403.10413 link
2024-03-15 Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning Meixuan Li et.al. 2403.10252 null
2024-03-15 Exploring Optical Flow Inclusion into nnU-Net Framework for Surgical Instrument Segmentation Marcos Fernández-Rodríguez et.al. 2403.10216 null
2024-03-15 TransLandSeg: A Transfer Learning Approach for Landslide Semantic Segmentation Based on Vision Foundation Model Changhong Hou et.al. 2403.10127 null
2024-03-15 Visual Foundation Models Boost Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation Jingyi Xu et.al. 2403.10001 link
2024-03-14 WeakSurg: Weakly supervised surgical instrument segmentation using temporal equivariance and semantic continuity Qiyuan Wang et.al. 2403.09551 null
2024-03-14 Annotation Free Semantic Segmentation with Vision Foundation Models Soroush Seifi et.al. 2403.09307 null
2024-03-14 When Semantic Segmentation Meets Frequency Aliasing Linwei Chen et.al. 2403.09065 link
2024-03-13 CART: Caltech Aerial RGB-Thermal Dataset in the Wild Connor Lee et.al. 2403.08997 link
2024-03-13 SLCF-Net: Sequential LiDAR-Camera Fusion for Semantic Scene Completion using a 3D Recurrent U-Net Helin Cao et.al. 2403.08885 link
2024-03-13 Segmentation of Knee Bones for Osteoarthritis Assessment: A Comparative Analysis of Supervised, Few-Shot, and Zero-Shot Learning Approaches Yun Xin Teoh et.al. 2403.08761 null
2024-03-13 Real-time 3D semantic occupancy prediction for autonomous vehicles using memory-efficient sparse convolution Samuel Sze et.al. 2403.08748 null
2024-03-13 Semantic Segmentation of Solar Radio Spikes at Low Frequencies Pearse C. Murphy et.al. 2403.08546 null
2024-03-13 Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation Zicheng Zhang et.al. 2403.08426 null
2024-03-13 LIX: Implicitly Infusing Spatial Geometric Prior Knowledge into Visual Semantic Segmentation for Autonomous Driving Sicen Guo et.al. 2403.08215 null
2024-03-13 Multiscale Low-Frequency Memory Network for Improved Feature Extraction in Convolutional Neural Networks Fuzhi Wu et.al. 2403.08157 link
2024-03-12 Mitigating the Impact of Attribute Editing on Face Recognition Sudipta Banerjee et.al. 2403.08092 null
2024-03-12 Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation Feilong Tang et.al. 2403.07630 link
2024-03-12 PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution Honghao Chen et.al. 2403.07589 null
2024-03-12 Open-World Semantic Segmentation Including Class Similarity Matteo Sodano et.al. 2403.07532 link
2024-03-11 Average Calibration Error: A Differentiable Loss for Improved Reliability in Image Segmentation Theodore Barfoot et.al. 2403.06759 link
2024-03-11 Forest Inspection Dataset for Aerial Semantic Segmentation and Depth Estimation Bianca-Cerasela-Zelia Blaga et.al. 2403.06621 null
2024-03-11 OMH: Structured Sparsity via Optimally Matched Hierarchy for Unsupervised Semantic Segmentation Baran Ozaydin et.al. 2403.06546 null
2024-03-11 3D Semantic Segmentation-Driven Representations for 3D Object Detection Hayeon O et.al. 2403.06501 link
2024-03-11 Point Mamba: A Novel Point Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy Jiuming Liu et.al. 2403.06467 link
2024-03-14 Towards the Uncharted: Density-Descending Feature Perturbation for Semi-supervised Semantic Segmentation Xiaoyang Wang et.al. 2403.06462 link
2024-03-11 Refining Segmentation On-the-Fly: An Interactive Framework for Point Cloud Semantic Segmentation Peng Zhang et.al. 2403.06401 null
2024-03-10 Style Blind Domain Generalized Semantic Segmentation via Covariance Alignment and Semantic Consistence Contrastive Learning Woo-Jin Ahn et.al. 2403.06122 link
2024-03-09 Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation Hairong Shi et.al. 2403.05912 link
2024-03-08 Attention-guided Feature Distillation for Semantic Segmentation Amir M. Mansourian et.al. 2403.05451 link
2024-03-08 Generalized Correspondence Matching via Flexible Hierarchical Refinement and Patch Descriptor Distillation Yu Han et.al. 2403.05388 null
2024-03-12 Frequency-Adaptive Dilated Convolution for Semantic Segmentation Linwei Chen et.al. 2403.05369 link
2024-03-08 Embedded Deployment of Semantic Segmentation in Medicine through Low-Resolution Inputs Erik Ostrowski et.al. 2403.05340 null
2024-03-08 LVIC: Multi-modality segmentation by Lifting Visual Info as Cue Zichao Dong et.al. 2403.05159 null
2024-03-06 ECAP: Extensive Cut-and-Paste Augmentation for Unsupervised Domain Adaptive Semantic Segmentation Erik Brorsson et.al. 2403.03854 link
2024-03-06 Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision Yajie Liu et.al. 2403.03707 null
2024-03-06 Causal Prototype-inspired Contrast Adaptation for Unsupervised Domain Adaptive Semantic Segmentation of High-resolution Remote Sensing Imagery Jingru Zhu et.al. 2403.03704 null
2024-03-06 GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding Zi-Ting Chou et.al. 2403.03608 null
2024-03-06 Multi-task Learning for Real-time Autonomous Driving Leveraging Task-adaptive Attention Generator Wonhyeok Choi et.al. 2403.03468 null
2024-03-05 Improved LiDAR Odometry and Mapping using Deep Semantic Segmentation and Novel Outliers Detection Mohamed Afifi et.al. 2403.03111 null
2024-03-05 ActiveAD: Planning-Oriented Active Learning for End-to-End Autonomous Driving Han Lu et.al. 2403.02877 null
2024-03-05 DDF: A Novel Dual-Domain Image Fusion Strategy for Remote Sensing Image Semantic Segmentation with Unsupervised Domain Adaptation Lingyan Ran et.al. 2403.02784 null
2024-03-08 Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical Labels Zhuohong Li et.al. 2403.02746 link
2024-03-05 FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird’s-Eye View and Perspective View Jiawei Hou et.al. 2403.02710 null
2024-03-05 Deep Common Feature Mining for Efficient Video Semantic Segmentation Yaoyan Zheng et.al. 2403.02689 link
2024-03-04 Self-Supervised Facial Representation Learning with Facial Region Awareness Zheng Gao et.al. 2403.02138 null
2024-03-04 Semi-Supervised Semantic Segmentation Based on Pseudo-Labels: A Survey Lingyan Ran et.al. 2403.01909 null
2024-03-04 Map-aided annotation for pole base detection Benjamin Missaoui et.al. 2403.01868 null
2024-03-06 AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation Haonan Wang et.al. 2403.01818 link
2024-03-03 EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation Chanyoung Kim et.al. 2403.01482 link
2024-03-02 Benchmarking Segmentation Models with Mask-Preserved Attribute Editing Zijin Yin et.al. 2403.01231 link
2024-03-02 Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation Lian Xu et.al. 2403.01156 null
2024-03-01 Rethinking Few-shot 3D Point Cloud Semantic Segmentation Zhaochong An et.al. 2403.00592 link
2024-03-01 Small, Versatile and Mighty: A Range-View Perception Framework Qiang Meng et.al. 2403.00325 null
2024-03-01 YOLO-MED : Multi-Task Interaction Network for Biomedical Images Suizhi Huang et.al. 2403.00245 null
2024-02-29 FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything Safouane El Ghazouali et.al. 2403.00175 link
2024-02-29 RSAM-Seg: A SAM-based Approach with Prior Knowledge Integration for Remote Sensing Image Semantic Segmentation Jie Zhang et.al. 2402.19004 null
2024-02-28 Spatial Coherence Loss for Salient and Camouflaged Object Detection and Beyond Ziyun Yang et.al. 2402.18698 null
2024-02-29 Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation Zhiwei Yang et.al. 2402.18467 link
2024-02-29 A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation Francesco Barbato et.al. 2402.18402 link
2024-02-28 Enhancing Roadway Safety: LiDAR-based Tree Clearance Analysis Miriam Louise Carnot et.al. 2402.18309 null
2024-02-28 Self-Supervised Learning in Electron Microscopy: Towards a Foundation Model for Advanced Image Analysis Bashir Kazimi et.al. 2402.18286 null
2024-02-28 PRCL: Probabilistic Representation Contrastive Learning for Semi-Supervised Semantic Segmentation Haoyu Xie et.al. 2402.18117 null
2024-02-28 Spannotation: Enhancing Semantic Segmentation for Autonomous Navigation with Efficient Image Annotation Samuel O. Folorunsho et.al. 2402.18084 link
2024-02-27 Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation Xinyu Yang et.al. 2402.17891 link
2024-02-27 Mitigating Distributional Shift in Semantic Segmentation via Uncertainty Estimation from Unlabelled Data David S. W. Williams et.al. 2402.17653 null
2024-02-27 Masked Gamma-SSL: Learning Uncertainty Estimation via Masked Image Modeling David S. W. Williams et.al. 2402.17622 null
2024-02-27 A Large-scale Evaluation of Pretraining Paradigms for the Detection of Defects in Electroluminescence Solar Cell Images David Torpey et.al. 2402.17611 null
2024-02-27 Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation with Its Class Label Xinliang Zhang et.al. 2402.17555 link
2024-02-26 ConSept: Continual Semantic Segmentation via Adapter-based Vision Transformer Bowen Dong et.al. 2402.16674 null
2024-02-26 UN-SAM: Universal Prompt-Free Segmentation for Generalized Nuclei Images Zhen Chen et.al. 2402.16663 link
2024-02-26 Placing Objects in Context via Inpainting for Out-of-distribution Segmentation Pau de Jorge et.al. 2402.16392 link
2024-02-29 BLO-SAM: Bi-level Optimization Based Overfitting-Preventing Finetuning of SAM Li Zhang et.al. 2402.16338 link
2024-02-23 Modified CycleGAN for the synthesization of samples for wheat head segmentation Jaden Myers et.al. 2402.15135 null
2024-02-22 Semantic Image Synthesis with Unconditional Generator Jungwoo Chae et.al. 2402.14395 null
2024-02-22 Think before You Leap: Content-Aware Low-Cost Edge-Assisted Video Semantic Segmentation Mingxuan Yan et.al. 2402.14326 null
2024-02-21 Tumor segmentation on whole slide images: training or prompting? Huaqian Wu et.al. 2402.13932 null
2024-02-26 BenchCloudVision: A Benchmark Analysis of Deep Learning Approaches for Cloud Detection and Segmentation in Remote Sensing Imagery Loddo Fabio et.al. 2402.13918 link
2024-02-21 Zero-BEV: Zero-shot Projection of Any First-Person Modality to BEV Maps Gianluca Monaci et.al. 2402.13848 null
2024-02-21 Generalizable Semantic Vision Query Generation for Zero-shot Panoptic and Semantic Segmentation Jialei Chen et.al. 2402.13697 null
2024-02-20 Cross-Domain Transfer Learning with CoRTe: Consistent and Reliable Transfer from Black-Box to Lightweight Segmentation Model Claudia Cuttano et.al. 2402.13122 null
2024-02-19 LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks Truong Thanh Hung Nguyen et.al. 2402.12525 link
2024-02-19 Towards Explainable LiDAR Point Cloud Semantic Segmentation via Gradient Based Target Localization Abhishek Kuriyal et.al. 2402.12098 link
2024-02-19 ISCUTE: Instance Segmentation of Cables Using Text Embedding Shir Kozlovsky et.al. 2402.11996 null
2024-02-18 Key Patch Proposer: Key Patches Contain Rich Information Jing Xu et.al. 2402.11458 link
2024-02-17 ChatEarthNet: A Global-Scale, High-Quality Image-Text Dataset for Remote Sensing Zhenghang Yuan et.al. 2402.11325 link
2024-02-17 A Decoding Scheme with Successive Aggregation of Multi-Level Features for Light-Weight Semantic Segmentation Jiwon Yoo et.al. 2402.11201 null
2024-02-16 HistoSegCap: Capsules for Weakly-Supervised Semantic Segmentation of Histological Tissue Type in Whole Slide Images Mobina Mansoori et.al. 2402.10851 null
2024-02-16 Selective Prediction for Semantic Segmentation using Post-Hoc Confidence Estimation and Its Performance under Distribution Shift Bruno Laboissiere Camargos Borges et.al. 2402.10665 null
2024-02-16 Efficient Multi-task Uncertainties for Joint Semantic Segmentation and Monocular Depth Estimation Steven Landgraf et.al. 2402.10580 null
2024-02-15 Is Continual Learning Ready for Real-world Challenges? Theodora Kontogianni et.al. 2402.10130 null
2024-02-15 Robust semi-automatic vessel tracing in the human retinal image by an instance segmentation neural network Siyi Chen et.al. 2402.10055 null
2024-02-22 MM-Point: Multi-View Information-Enhanced Multi-Modal Self-Supervised 3D Point Cloud Understanding Hai-Tao Yu et.al. 2402.10002 link
2024-02-14 Automated Plaque Detection and Agatston Score Estimation on Non-Contrast CT Scans: A Multicenter Study Andrew M. Nguyen et.al. 2402.09569 null
2024-02-14 Reducing Texture Bias of Deep Neural Networks via Edge Enhancing Diffusion Edgar Heinert et.al. 2402.09530 link
2024-02-13 Adaptive Hierarchical Certification for Segmentation using Randomized Smoothing Alaa Anani et.al. 2402.08400 link
2024-02-13 Improving Image Coding for Machines through Optimizing Encoder via Auxiliary Loss Kei Iino et.al. 2402.08267 null
2024-02-12 Semantic segmentation for recognition of epileptiform patterns recorded via Microelectrode Arrays in vitro Gabriel Galeote-Checa et.al. 2402.08099 null
2024-02-11 Data Quality Aware Approaches for Addressing Model Drift of Semantic Segmentation Models Samiha Mirza et.al. 2402.07258 null
2024-02-09 More than the Sum of Its Parts: Ensembling Backbone Networks for Few-Shot Segmentation Nico Catalano et.al. 2402.06581 null
2024-02-09 Hybridnet for depth estimation and semantic segmentation Dalila Sánchez-Escobedo et.al. 2402.06539 null
2024-02-09 Classifying point clouds at the facade-level using geometric features and deep learning networks Yue Tan et.al. 2402.06506 link
2024-02-09 ControlUDA: Controllable Diffusion-assisted Unsupervised Domain Adaptation for Cross-Weather Semantic Segmentation Fengyi Shen et.al. 2402.06446 null
2024-02-08 Early Fusion of Features for Semantic Segmentation Anupam Gupta et.al. 2402.06091 null
2024-02-08 Privacy-Preserving Synthetic Continual Semantic Segmentation for Robotic Surgery Mengya Xu et.al. 2402.05860 link
2024-02-08 On the Effect of Image Resolution on Semantic Segmentation Ritambhara Singh et.al. 2402.05398 null
2024-02-07 Multi-Scale Semantic Segmentation with Modified MBConv Blocks Xi Chen et.al. 2402.04618 null
2024-02-06 Energy-based Domain-Adaptive Segmentation with Depth Guidance Jinjing Zhu et.al. 2402.03795 null
2024-02-05 SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM Mingrui Li et.al. 2402.03246 link
2024-02-05 RRWNet: Recursive Refinement Network for Effective Retinal Artery/Vein Segmentation and Classification José Morano et.al. 2402.03166 link
2024-02-05 Unsupervised semantic segmentation of high-resolution UAV imagery for road scene parsing Zihan Ma et.al. 2402.02985 link
2024-02-04 M $^3$ Face: A Unified Multi-Modal Multilingual Framework for Human Face Generation and Editing Mohammadreza Mofayezi et.al. 2402.02369 null
2024-02-04 Exploring Intrinsic Properties of Medical Images for Self-Supervised Binary Semantic Segmentation Pranav Singh et.al. 2402.02367 null
2024-02-04 Region-Based Representations Revisited Michal Shlapentokh-Rothman et.al. 2402.02352 link
2024-02-03 Multi-Level Feature Aggregation and Recursive Alignment Network for Real-Time Semantic Segmentation Yanhua Zhang et.al. 2402.02286 link
2024-02-03 Revisiting Generative Adversarial Networks for Binary Semantic Segmentation on Imbalanced Datasets Lei Xu et.al. 2402.02245 link
2024-02-03 Evaluating the Robustness of Off-Road Autonomous Driving Segmentation against Adversarial Attacks: A Dataset-Centric analysis Pankaj Deoli et.al. 2402.02154 link
2024-02-03 Decomposition-based and Interference Perception for Infrared and Visible Image Fusion in Complex Scenes Xilai Li et.al. 2402.02096 null
2024-02-03 MLIP: Enhancing Medical Visual Representation with Divergence Encoder and Knowledge-guided Contrastive Learning Zhe Li et.al. 2402.02045 null
2024-02-02 Convolution kernel adaptation to calibrated fisheye Bruno Berenguel-Baeta et.al. 2402.01456 link
2024-02-02 Delving into Decision-based Black-box Attacks on Semantic Segmentation Zhaoyu Chen et.al. 2402.01220 null
2024-02-02 Scale Equalization for Multi-Level Feature Fusion Bum Jun Kim et.al. 2402.01149 link
2024-02-06 We’re Not Using Videos Effectively: An Updated Domain Adaptive Video Segmentation Baseline Simar Kareer et.al. 2402.00868 link
2024-02-01 Automatic Segmentation of the Spinal Cord Nerve Rootlets Jan Valosek et.al. 2402.00724 link
2024-02-01 A Framework for Building Point Cloud Cleaning, Plane Detection and Semantic Segmentation Ilyass Abouelaziz et.al. 2402.00692 null
2024-01-31 Convolution Meets LoRA: Parameter Efficient Finetuning for Segment Anything Model Zihan Zhong et.al. 2401.17868 link
2024-01-31 Leveraging Swin Transformer for Local-to-Global Weakly Supervised Semantic Segmentation Rozhan Ahmadi et.al. 2401.17828 link
2024-02-01 Tiered approach for rapid damage characterisation of infrastructure enabled by remote sensing and deep learning technologies Nadiia Kopiika et.al. 2401.17759 null
2024-01-31 Towards Image Semantics and Syntax Sequence Learning Chun Tao et.al. 2401.17515 link
2024-01-30 Evaluation of Out-of-Distribution Detection Performance on Autonomous Driving Datasets Jens Henriksson et.al. 2401.17013 null
2024-01-30 CAFCT: Contextual and Attentional Feature Fusions of Convolutional Neural Networks and Transformer for Liver Tumor Segmentation Ming Kang et.al. 2401.16886 null
2024-01-29 Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors Shiyin Dong et.al. 2401.16459 null
2024-01-28 SERNet-Former: Semantic Segmentation by Efficient Residual Network with Attention-Boosting Gates and Attention-Fusion Networks Serdar Erisen et.al. 2401.15741 link
2024-01-28 UP-CrackNet: Unsupervised Pixel-Wise Road Crack Detection via Adversarial Image Restoration Nachuan Ma et.al. 2401.15647 null
2024-01-27 Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes Diandian Guo et.al. 2401.15261 link
2024-01-26 Biological Valuation Map of Flanders: A Sentinel-2 Imagery Analysis Mingshi Li et.al. 2401.15223 null
2024-01-26 Kitchen Food Waste Image Segmentation and Classification for Compost Nutrients Estimation Raiyan Rahman et.al. 2401.15175 null
2024-01-26 SSR: SAM is a Strong Regularizer for domain adaptive semantic segmentation Yanqi Ge et.al. 2401.14686 null
2024-01-25 CloudTracks: A Dataset for Localizing Ship Tracks in Satellite Images of Clouds Muhammad Ahmed Chaudhry et.al. 2401.14486 null
2024-01-25 Unlocking Past Information: Temporal Embeddings in Cooperative Bird’s Eye View Prediction Dominik Rößle et.al. 2401.14325 null
2024-01-24 Segment Any Cell: A SAM-based Auto-prompting Fine-tuning Framework for Nuclei Segmentation Saiyang Na et.al. 2401.13220 null
2024-01-24 Boundary and Relation Distillation for Semantic Segmentation Dong Zhang et.al. 2401.13174 null
2024-01-23 DatUS^2: Data-driven Unsupervised Semantic Segmentation with Pre-trained Self-supervised Vision Transformer Sonal Kumar et.al. 2401.12820 link
2024-01-23 Self-Supervised Vision Transformers Are Efficient Segmentation Learners for Imperfect Labels Seungho Lee et.al. 2401.12535 null
2024-01-23 Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration Yifan Zhang et.al. 2401.12452 link
2024-01-22 Scaling Up Quantization-Aware Neural Architecture Search for Efficient Deep Learning on the Edge Yao Lu et.al. 2401.12350 null
2024-01-22 Exploring Simple Open-Vocabulary Semantic Segmentation Zihang Lai et.al. 2401.12217 link
2024-01-22 Out-of-Distribution Detection & Applications With Ablated Learned Temperature Energy Will LeVine et.al. 2401.12129 link
2024-01-22 HomeRobot Open Vocabulary Mobile Manipulation Challenge 2023 Participant Report (Team KuzHum) Volodymyr Kuzma et.al. 2401.12048 null
2024-01-22 SemPLeS: Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation Ci-Siang Lin et.al. 2401.11791 link
2024-01-22 EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models Koichi Namekata et.al. 2401.11739 null
2024-01-22 MetaSeg: Content-Aware Meta-Net for Omni-Supervised Semantic Segmentation Shenwang Jiang et.al. 2401.11738 null
2024-01-22 SFC: Shared Feature Calibration in Weakly Supervised Semantic Segmentation Xinqiao Zhao et.al. 2401.11719 link
2024-01-21 A Survey on African Computer Vision Datasets, Topics and Researchers Abdul-Hakeem Omotayo et.al. 2401.11617 link
2024-01-21 Embedded Hyperspectral Band Selection with Adaptive Optimization for Image Semantic Segmentation Yaniv Zimmer et.al. 2401.11420 null
2024-01-21 S $^3$ M-Net: Joint Learning of Semantic Segmentation and Stereo Matching for Autonomous Driving Zhiyuan Wu et.al. 2401.11414 null
2024-01-21 ANNA: A Deep Learning Based Dataset in Heterogeneous Traffic for Autonomous Vehicles Mahedi Kamal et.al. 2401.11358 link
2024-01-20 Weakly-Supervised Semantic Segmentation of Circular-Scan, Synthetic-Aperture-Sonar Imagery Isaac J. Sledge et.al. 2401.11313 null
2024-01-20 A Novel Benchmark for Few-Shot Semantic Segmentation in the Era of Foundation Models Reda Bensaid et.al. 2401.11311 link
2024-01-20 Spatial Structure Constraints for Weakly Supervised Semantic Segmentation Tao Chen et.al. 2401.11122 link
2024-01-19 One Step Learning, One Step Review Xiaolong Huang et.al. 2401.10962 link
2024-01-19 RAD-DINO: Exploring Scalable Medical Image Encoders Beyond Text Supervision Fernando Pérez-García et.al. 2401.10815 null
2024-01-19 Exploring Color Invariance through Image-Level Ensemble Learning Yunpeng Gong et.al. 2401.10512 link
2024-01-18 RAP-SAM: Towards Real-Time All-Purpose Segment Anything Shilin Xu et.al. 2401.10228 link
2024-01-18 Ventricular Segmentation: A Brief Comparison of U-Net Derivatives Ketan Suhaas Saichandran et.al. 2401.09980 null
2024-01-18 XAI-Enhanced Semantic Segmentation Models for Visual Quality Inspection Tobias Clement et.al. 2401.09900 null
2024-01-18 Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation Songhe Deng et.al. 2401.09883 link
2024-01-18 Boosting Few-Shot Semantic Segmentation Via Segment Anything Model Chen-Bin Feng et.al. 2401.09826 null
2024-01-18 P2Seg: Pointly-supervised Segmentation via Mutual Distillation Zipeng Wang et.al. 2401.09709 null
2024-01-17 Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model Lianghui Zhu et.al. 2401.09417 link
2024-01-17 POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images Antonin Vobecky et.al. 2401.09413 null
2024-01-17 PixelDINO: Semi-Supervised Semantic Segmentation for Detecting Permafrost Disturbances Konrad Heidler et.al. 2401.09271 link
2024-01-17 Uncertainty estimates for semantic segmentation: providing enhanced reliability for automated motor claims handling Jan Küchler et.al. 2401.09245 null
2024-01-17 Learning to detect cloud and snow in remote sensing images from noisy labels Zili Liu et.al. 2401.08932 null
2024-01-16 Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive Yumeng Li et.al. 2401.08815 link
2024-01-16 ValUES: A Framework for Systematic Validation of Uncertainty Estimation in Semantic Segmentation Kim-Celine Kahl et.al. 2401.08501 link
2024-01-16 Faster ISNet for Background Bias Mitigation on Deep Neural Networks Pedro R. A. S. Bassi et.al. 2401.08409 link
2024-01-17 Generative Denoise Distillation: Simple Stochastic Noises Induce Efficient Knowledge Transfer for Dense Prediction Zhaoge Liu et.al. 2401.08332 link
2024-01-16 End-to-End Optimized Image Compression with the Frequency-Oriented Transform Yuefeng Zhang et.al. 2401.08194 null
2024-01-16 S3M: Semantic Segmentation Sparse Mapping for UAVs with RGB-D Camera Thanh Nguyen Canh et.al. 2401.08134 null
2024-01-16 UV-SAM: Adapting Segment Anything Model for Urban Village Identification Xin Zhang et.al. 2401.08083 link
2024-01-15 Semantic Scene Segmentation for Robotics Juana Valeria Hurtado et.al. 2401.07589 null
2024-01-15 Compositional Oil Spill Detection Based on Object Detector and Adapted Segment Anything Model from SAR Images Wenhui Wu et.al. 2401.07502 null
2024-01-15 Semantic Segmentation in Multiple Adverse Weather Conditions with Domain Knowledge Retention Xin Yang et.al. 2401.07459 null
2024-01-14 Semi-supervised Semantic Segmentation using Redesigned Self-Training for White Blood Cel Vinh Quoc Luu et.al. 2401.07278 null
2024-01-13 Weak Labeling for Cropland Mapping in Africa Gilles Quentin Hacheme et.al. 2401.07014 null
2024-01-13 Class-Imbalanced Semi-Supervised Learning for Large-Scale Point Cloud Semantic Segmentation via Decoupling Optimization Mengtian Li et.al. 2401.06975 null
2024-01-12 Seeing the roads through the trees: A benchmark for modeling spatial dependencies with aerial imagery Caleb Robinson et.al. 2401.06762 link
2024-01-12 UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding Bowen Shi et.al. 2401.06397 link
2024-01-11 Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications Yuwen Xiong et.al. 2401.06197 link
2024-01-09 Generic Knowledge Boosted Pre-training For Remote Sensing Images Ziyue Huang et.al. 2401.04614 link
2024-01-08 Fully Attentional Networks with Self-emerging Token Labeling Bingyin Zhao et.al. 2401.03844 link
2024-01-07 SeTformer is What You Need for Vision and Language Pourya Shamsolmoali et.al. 2401.03540 null
2024-01-06 Multi-View 3D Instance Segmentation of Structural Anomalies for Enhanced Structural Inspection of Concrete Bridges Christian Benz et.al. 2401.03298 link
2024-01-02 Unsupervised Federated Domain Adaptation for Segmentation of MRI Images Navapat Nananukul et.al. 2401.02941 null
2024-01-04 ClassWise-SAM-Adapter: Parameter Efficient Fine-tuning Adapts Segment Anything to SAR Domain for Semantic Segmentation Xinyang Pu et.al. 2401.02326 link
2024-01-04 Source-Free Online Domain Adaptive Semantic Segmentation of Satellite Images under Image Degradation Fahim Faisal Niloy et.al. 2401.02113 null
2024-01-03 Towards Robust Semantic Segmentation against Patch-based Attack via Attention Refinement Zheng Yuan et.al. 2401.01750 null
2024-01-03 S3Net: Innovating Stereo Matching and Semantic Segmentation with a Single-Branch Semantic Stereo Network in Satellite Epipolar Imagery Qingyuan Yang et.al. 2401.01643 link
2024-01-03 Context-Aware Interaction Network for RGB-T Semantic Segmentation Ying Lv et.al. 2401.01624 link
2024-01-02 Off-Road LiDAR Intensity Based Semantic Segmentation Kasi Viswanath et.al. 2401.01439 link
2024-01-02 Integrating Edges into U-Net Models with Explainable Activation Maps for Brain Tumor Segmentation using MR Images Subin Sahayam et.al. 2401.01303 null
2024-01-02 Physics-informed Generalizable Wireless Channel Modeling with Segmentation and Deep Learning: Fundamentals, Methodologies, and Challenges Ethan Zhu et.al. 2401.01288 null
2024-01-02 GBSS:a global building semantic segmentation dataset for large-scale remote sensing building extraction Yuping Hu et.al. 2401.01178 null
2024-01-02 DTBS: Dual-Teacher Bi-directional Self-training for Domain Adaptation in Nighttime Semantic Segmentation Fanding Huang et.al. 2401.01066 link
2024-01-02 Online Continual Domain Adaptation for Semantic Image Segmentation Using Internal Representations Serban Stan et.al. 2401.01035 link
2023-12-31 Analyzing Local Representations of Self-supervised Vision Transformers Ani Vanyan et.al. 2401.00463 null
2023-12-28 Learning Vision from Models Rivals Learning Vision from Data Yonglong Tian et.al. 2312.17742 link
2024-01-04 HEAP: Unsupervised Object Discovery and Localization with Contrastive Grouping Xin Zhang et.al. 2312.17492 null
2023-12-28 Unsupervised Universal Image Segmentation Dantong Niu et.al. 2312.17243 link
2024-01-03 An Improved Baseline for Reasoning Segmentation with Large Language Model Senqiao Yang et.al. 2312.17240 null
2023-12-28 SCTNet: Single-Branch CNN with Transformer Semantic Information for Real-Time Segmentation Zhengze Xu et.al. 2312.17071 link
2023-12-28 EvPlug: Learn a Plug-and-Play Module for Event and Image Fusion Jianping Jiang et.al. 2312.16933 null
2023-12-29 Multi-modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation Xiawei Li et.al. 2312.16578 link
2023-12-27 ConstScene: Dataset and Model for Advancing Robust Semantic Segmentation in Construction Environments Maghsood Salimi et.al. 2312.16516 link
2023-12-26 VirtualPainting: Addressing Sparsity with Virtual Points and Distance-Aware Data Augmentation for 3D Object Detection Sudip Dhakal et.al. 2312.16141 null
2023-12-26 LangSplat: 3D Language Gaussian Splatting Minghan Qin et.al. 2312.16084 link
2023-12-23 WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Large-scale Natural Environments Kavisha Vidanapathirana et.al. 2312.15364 link
2023-12-23 Make Me a BNN: A Simple Strategy for Estimating Bayesian Uncertainty from Pre-trained Models Gianni Franchi et.al. 2312.15297 null
2023-12-22 Harnessing Diffusion Models for Visual Perception with Meta Prompts Qiang Wan et.al. 2312.14733 link
2023-12-22 Variance-insensitive and Target-preserving Mask Refinement for Interactive Image Segmentation Chaowei Fang et.al. 2312.14387 null
2023-12-26 TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification Qinying Liu et.al. 2312.14149 link
2023-12-21 Dual Attention U-Net with Feature Infusion: Pushing the Boundaries of Multiclass Defect Segmentation Rasha Alshawi et.al. 2312.14053 link
2023-12-21 Few Shot Part Segmentation Reveals Compositional Logic for Industrial Anomaly Detection Soopil Kim et.al. 2312.13783 link
2023-12-22 Weakly Supervised Semantic Segmentation for Driving Scenes Dongseob Kim et.al. 2312.13646 link
2023-12-20 DVIS++: Improved Decoupled Framework for Universal Video Segmentation Tao Zhang et.al. 2312.13305 link
2023-12-20 BEVSeg2TP: Surround View Camera Bird’s-Eye-View Based Joint Vehicle Segmentation and Ego Vehicle Trajectory Prediction Sushil Sharma et.al. 2312.13081 link
2023-12-20 Multi-task Learning To Improve Semantic Segmentation Of CBCT Scans Using Image Reconstruction Maximilian Ernst Tschuchnig et.al. 2312.12990 null
2023-12-20 TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training Yuqi Lin et.al. 2312.12828 link
2023-12-20 Spectral Prompt Tuning:Unveiling Unseen Classes for Zero-Shot Semantic Segmentation Wenhao Xu et.al. 2312.12754 link
2023-12-20 MetaSegNet: Metadata-collaborative Vision-Language Representation Learning for Semantic Segmentation of Remote Sensing Images Libo Wang et.al. 2312.12735 null
2023-12-20 Segment Anything Model Meets Image Harmonization Haoxing Chen et.al. 2312.12729 null
2023-12-19 DDOS: The Drone Depth and Obstacle Segmentation Dataset Benedikt Kolbeinsson et.al. 2312.12494 null
2023-12-19 SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process Mengyu Wang et.al. 2312.12425 link
2023-12-19 CLIP-DINOiser: Teaching CLIP a few DINO tricks Monika Wysoczańska et.al. 2312.12359 link
2023-12-19 All for One, and One for All: UrbanSyn Dataset, the third Musketeer of Synthetic Driving Scenes Jose L. Gómez et.al. 2312.12176 null
2023-12-19 Domain Generalization in LiDAR Semantic Segmentation Leveraged by Density Discriminative Feature Embedding Jaeyeul Kim et.al. 2312.12098 null
2023-12-18 Detecting the edges of galaxies with deep learning Jesús Fernández et.al. 2312.11654 null
2023-12-18 PlaNet-S: Automatic Semantic Segmentation of Placenta Shinnosuke Yamamoto et.al. 2312.11580 null
2023-12-18 Language-Assisted 3D Scene Understanding Yanmin Wu et.al. 2312.11451 link
2023-12-18 Research on Multilingual Natural Scene Text Detection Algorithm Tao Wang et.al. 2312.11153 null
2023-12-18 SeeBel: Seeing is Believing Sourajit Saha et.al. 2312.10933 link
2023-12-17 Artificial intelligence optical hardware empowers high-resolution hyperspectral video understanding at 1.2 Tb/s Maksim Makarenko et.al. 2312.10639 null
2023-12-16 Transformers in Unsupervised Structure-from-Motion Hemang Chawla et.al. 2312.10529 link
2023-12-16 All Attention U-NET for Semantic Segmentation of Intracranial Hemorrhages In Head CT Images Chia Shuo Chang et.al. 2312.10483 null
2023-12-16 Semantic-Aware Autoregressive Image Modeling for Visual Representation Learning Kaiyou Song et.al. 2312.10457 link
2023-12-15 Forging Tokens for Improved Storage-efficient Training Minhyun Lee et.al. 2312.10105 link
2023-12-15 Collaborating Foundation models for Domain Generalized Semantic Segmentation Yasser Benigmim et.al. 2312.09788 link
2023-12-15 Density Matters: Improved Core-set for Active Domain Adaptive Segmentation Shizhan Liu et.al. 2312.09595 null
2023-12-15 AEGIS-Net: Attention-guided Multi-Level Feature Aggregation for Indoor Place Recognition Yuhang Ming et.al. 2312.09538 link
2023-12-15 WeatherProof: A Paired-Dataset Approach to Semantic Segmentation in Adverse Weather Blake Gella et.al. 2312.09534 null
2023-12-14 LIME: Localized Image Editing via Attention Regularization in Diffusion Models Enis Simsar et.al. 2312.09256 null
2023-12-14 Reliability in Semantic Segmentation: Can We Use Synthetic Data? Thibaut Loiseau et.al. 2312.09231 link
2023-12-18 Progressive Feature Self-reinforcement for Weakly Supervised Semantic Segmentation Jingxuan He et.al. 2312.08916 link
2023-12-14 Agent Attention: On the Integration of Softmax and Linear Attention Dongchen Han et.al. 2312.08874 link
2023-12-14 Achelous++: Power-Oriented Water-Surface Panoptic Perception Framework on Edge Devices based on Vision-Radar Fusion and Pruning of Heterogeneous Modalities Runwei Guan et.al. 2312.08851 link
2023-12-14 Offshore Wind Plant Instance Segmentation Using Sentinel-1 Time Series, GIS, and Semantic Segmentation Models Osmar Luiz Ferreira de Carvalho et.al. 2312.08773 null
2023-12-14 Segment Beyond View: Handling Partially Missing Modality for Audio-Visual Semantic Segmentation Renjie Wu et.al. 2312.08673 null
2023-12-14 Semi-supervised Semantic Segmentation Meets Masked Modeling:Fine-grained Locality Learning Matters in Consistency Regularization Wentao Pan et.al. 2312.08631 null
2023-12-11 DFGET: Displacement-Field Assisted Graph Energy Transmitter for Gland Instance Segmentation Caiqing Jian et.al. 2312.07584 null
2023-12-12 X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-modal Knowledge Transfer Linglin Jing et.al. 2312.07378 link
2023-12-12 Adversarial Semi-Supervised Domain Adaptation for Semantic Segmentation: A New Role for Labeled Target Samples Marwa Kechaou et.al. 2312.07370 null
2023-12-12 Expand-and-Quantize: Unsupervised Semantic Segmentation Using High-Dimensional Space and Product Quantization Jiyoung Kim et.al. 2312.07342 null
2023-12-12 Transferring CLIP’s Knowledge into Zero-Shot Point Cloud Semantic Segmentation Yuanbin Wang et.al. 2312.07221 null
2023-12-12 MCFNet: Multi-scale Covariance Feature Fusion Network for Real-time Semantic Segmentation Xiaojie Fang et.al. 2312.07207 null
2023-12-11 Densify Your Labels: Unsupervised Clustering with Bipartite Matching for Weakly Supervised Point Cloud Segmentation Shaobo Xia et.al. 2312.06799 null
2023-12-11 Deciphering ‘What’ and ‘Where’ Visual Pathways from Spectral Clustering of Layer-Distributed Neural Representations Xiao Zhang et.al. 2312.06716 link
2023-12-10 AM-RADIO: Agglomerative Model – Reduce All Domains Into One Mike Ranzinger et.al. 2312.06709 link
2023-12-11 Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic Segmentation Xiaoyi Bao et.al. 2312.06474 null
2023-12-11 Semantic Connectivity-Driven Pseudo-labeling for Cross-domain Segmentation Dong Zhao et.al. 2312.06331 link
2023-12-11 U-MixFormer: UNet-like Transformer with Mix-Attention for Efficient Semantic Segmentation Seul-Ki Yeom et.al. 2312.06272 link
2023-12-11 Adaptive Annotation Distribution for Weakly Supervised Point Cloud Semantic Segmentation Zhiyi Pan et.al. 2312.06259 link
2023-12-10 Deep-Learning-Assisted Analysis of Cataract Surgery Videos Negin Ghamsarian et.al. 2312.05900 null
2023-12-09 CSL: Class-Agnostic Structure-Constrained Learning for Segmentation Including the Unseen Hao Zhang et.al. 2312.05538 null
2023-12-08 Loss Functions in the Era of Semantic Segmentation: A Survey and Outlook Reza Azad et.al. 2312.05391 link
2023-12-08 Lyrics: Boosting Fine-grained Language-Vision Alignment and Comprehension via Semantic-aware Visual Objects Junyu Lu et.al. 2312.05278 null
2023-12-08 Datasets, Models, and Algorithms for Multi-Sensor, Multi-agent Autonomy Using AVstack R. Spencer Hallyburton et.al. 2312.04970 null
2023-12-07 Point2CAD: Reverse Engineering CAD Models from 3D Point Clouds Yujia Liu et.al. 2312.04962 null
2023-12-08 Segmentation of Kidney Tumors on Non-Contrast CT Images using Protuberance Detection Network Taro Hatsutani et.al. 2312.04796 null
2023-12-07 gcDLSeg: Integrating Graph-cut into Deep Learning for Binary Semantic Segmentation Hui Xie et.al. 2312.04713 null
2023-12-07 HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image Tong Wu et.al. 2312.04543 null
2023-12-07 Self-Guided Open-Vocabulary Semantic Segmentation Osman Ülger et.al. 2312.04539 link
2023-12-07 Semi-Supervised Active Learning for Semantic Segmentation in Unknown Environments Using Informative Path Planning Julius Rückin et.al. 2312.04402 link
2023-12-07 Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation Zhixiang Wei et.al. 2312.04265 link
2023-12-07 Fine-tune vision foundation model for crack segmentation in civil infrastructures Kang Ge et.al. 2312.04233 null
2023-12-07 Augmentation-Free Dense Contrastive Knowledge Distillation for Efficient Semantic Segmentation Jiawei Fan et.al. 2312.04168 link
2023-12-07 Residual Graph Convolutional Network for Bird’s-Eye-View Semantic Segmentation Qiuxiao Chen et.al. 2312.04044 null
2023-12-06 Novel class discovery meets foundation models for 3D semantic segmentation Luigi Riz et.al. 2312.03782 null
2023-12-10 Foundation Model Assisted Weakly Supervised Semantic Segmentation Xiaobo Yang et.al. 2312.03585 link
2023-12-06 ShareCMP: Polarization-Aware RGB-P Semantic Segmentation Zhuoyan Liu et.al. 2312.03430 link
2023-12-06 DeepPyramid+: Medical Image Segmentation using Pyramid View Fusion and Deformable Pyramid Reception Negin Ghamsarian et.al. 2312.03409 null
2023-12-06 Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields Shijie Zhou et.al. 2312.03203 link
2023-12-05 AI-SAM: Automatic and Interactive Segment Anything Model Yimu Pan et.al. 2312.03119 link
2023-12-05 DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control Yuru Jia et.al. 2312.03048 null
2023-12-05 Uni3DL: Unified Model for 3D and Language Understanding Xiang Li et.al. 2312.03026 null
2023-12-05 6D Assembly Pose Estimation by Point Cloud Registration for Robot Manipulation K. Samarawickrama et.al. 2312.02593 link
2023-12-05 Towards More Unified In-context Visual Understanding Dianmo Sheng et.al. 2312.02520 null
2023-12-05 SAM-Assisted Remote Sensing Imagery Semantic Segmentation with Object and Boundary Constraints Xianping Ma et.al. 2312.02464 link
2023-12-05 Towards Granularity-adjusted Pixel-level Semantic Annotation Rohit Kundu et.al. 2312.02420 null
2023-12-04 Class-Discriminative Attention Maps for Vision Transformers Lennart Brocki et.al. 2312.02364 link
2023-12-04 Geometrically-driven Aggregation for Zero-shot 3D Point Cloud Understanding Guofeng Mei et.al. 2312.02244 link
2023-12-04 Contrastive Learning-Based Spectral Knowledge Distillation for Multi-Modality and Missing Modality Scenarios in Semantic Segmentation Aniruddh Sikdar et.al. 2312.02240 null
2023-12-04 VLTSeg: Simple Transfer of CLIP-Based Vision-Language Representations for Domain Generalized Semantic Segmentation Christoph Hümmer et.al. 2312.02021 null
2023-12-04 Generalization by Adaptation: Diffusion-Based Domain Extension for Domain-Generalized Semantic Segmentation Joshua Niemeijer et.al. 2312.01850 link
2023-12-04 Few Clicks Suffice: Active Test-Time Adaptation for Semantic Segmentation Longhui Yuan et.al. 2312.01835 null
2023-12-04 SE-LIO: Semantics-enhanced Solid-State-LiDAR-Inertial Odometry for Tree-rich Environments Tisheng Zhang et.al. 2312.01809 null
2023-12-04 SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference Feng Wang et.al. 2312.01597 link
2023-12-03 G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training Che Liu et.al. 2312.01522 link
2023-12-03 A Review and A Robust Framework of Data-Efficient 3D Scene Parsing with Traditional/Learned 3D Descriptors Kangcheng Liu et.al. 2312.01262 null
2023-12-02 Virtual Category Learning: A Semi-Supervised Learning Method for Dense Prediction with Extremely Limited Labels Changrui Chen et.al. 2312.01169 link
2023-12-01 Improve Supervised Representation Learning with Masked Image Modeling Kaifeng Chen et.al. 2312.00950 null
2023-12-01 Grounding Everything: Emerging Localization Properties in Vision-Language Transformers Walid Bousselham et.al. 2312.00878 link
2023-12-01 Sequential Modeling Enables Scalable Learning for Large Vision Models Yutong Bai et.al. 2312.00785 link
2023-12-01 GIFT: Generative Interpretable Fine-Tuning Transformers Chinmay Savadikar et.al. 2312.00700 link
2023-12-01 CellMixer: Annotation-free Semantic Cell Segmentation of Heterogeneous Cell Populations Mehdi Naouar et.al. 2312.00671 null
2023-12-01 SCHEME: Scalable Channer Mixer for Vision Transformers Deepak Sridhar et.al. 2312.00412 null
2023-12-04 Efficient Multimodal Semantic Segmentation via Dual-Prompt Learning Shaohua Dong et.al. 2312.00360 link
2023-12-01 Improving Normalization with the James-Stein Estimator Seyedalireza Khoshsirat et.al. 2312.00313 null
2023-12-01 A knowledge-based data-driven (KBDD) framework for all-day identification of cloud types using satellite remote sensing Longfeng Nie et.al. 2312.00308 null
2023-11-30 InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation Rongyao Fang et.al. 2311.18835 link
2023-11-30 Exploiting Diffusion Prior for Generalizable Pixel-Level Semantic Prediction Hsin-Ying Lee et.al. 2311.18832 link
2023-11-30 Semi-supervised Semantic Segmentation via Boosting Uncertainty on Unlabeled Data Daoan Zhang et.al. 2311.18758 null
2023-11-30 Learning Part Segmentation from Synthetic Animals Jiawei Peng et.al. 2311.18661 null
2023-11-30 A Lightweight Clustering Framework for Unsupervised Semantic Segmentation Yau Shing Jonathan Cheung et.al. 2311.18628 null
2023-11-30 Each Test Image Deserves A Specific Prompt: Continual Test-Time Adaptation for 2D Medical Image Segmentation Ziyang Chen et.al. 2311.18363 link
2023-11-30 MRFP: Learning Generalizable Semantic Segmentation from Sim-2-Real with Multi-Resolution Feature Perturbation Sumanth Udupa et.al. 2311.18331 link
2023-11-30 Beyond Entropy: Style Transfer Guided Single Image Continual Test-Time Adaptation Younggeol Cho et.al. 2311.18270 null
2023-11-29 ALSTER: A Local Spatio-Temporal Expert for Online 3D Semantic Reconstruction Silvan Weder et.al. 2311.18068 null
2023-11-29 A Simple Recipe for Language-guided Domain Generalized Segmentation Mohammad Fahes et.al. 2311.17922 link
2023-11-30 Do text-free diffusion models learn discriminative visual representations? Soumik Mukhopadhyay et.al. 2311.17921 link
2023-11-29 Spherical Frustum Sparse Convolution Network for LiDAR Point Cloud Semantic Segmentation Yu Zheng et.al. 2311.17491 link
2023-11-29 Continual Learning for Image Segmentation with Dynamic Query Weijia Wu et.al. 2311.17450 link
2023-11-28 TransNeXt: Robust Foveal Visual Perception for Vision Transformers Dai Shi et.al. 2311.17132 link
2023-11-28 Generative Data Augmentation Improves Scribble-supervised Semantic Segmentation Jacob Schnell et.al. 2311.17121 null
2023-11-28 Plug-and-Play, Dense-Label-Free Extraction of Open-Vocabulary Semantic Segmentation from Vision-Language Models Luo Jiayun et.al. 2311.17095 link
2023-11-28 ContextSeg: Sketch Semantic Segmentation by Querying the Context with Attention Jiawei Wang et.al. 2311.16682 null
2023-11-27 Segment Every Out-of-Distribution Object Wenjie Zhao et.al. 2311.16516 link
2023-11-27 SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance Lukas Hoyer et.al. 2311.16241 link
2023-11-27 Seeing Beyond Cancer: Multi-Institutional Validation of Object Localization and 3D Semantic Segmentation using Deep Learning for Breast MRI Arda Pekis et.al. 2311.16213 null
2023-11-27 Street TryOn: Learning In-the-Wild Virtual Try-On from Unpaired Person Images Aiyu Cui et.al. 2311.16094 null
2023-11-27 FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding in Open World Thanh-Dat Truong et.al. 2311.15965 null
2023-11-27 2D Feature Distillation for Weakly- and Semi-Supervised 3D Semantic Segmentation Ozan Unal et.al. 2311.15605 null
2023-11-27 An Ensemble of 2.5D ResUnet Based Models for Segmentation for Kidney and Masses Cancan Chen et.al. 2311.15586 null
2023-11-27 SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation Bin Xie et.al. 2311.15537 link
2023-11-26 Advancing Vision Transformers with Group-Mix Attention Chongjian Ge et.al. 2311.15157 link
2023-11-25 Can SAM recognize crops? Quantifying the zero-shot performance of a semantic segmentation foundation model on generating crop-type maps using satellite imagery for precision agriculture Rutuja Gurav et.al. 2311.15138 null
2023-11-25 Adapter is All You Need for Tuning Visual Tasks Dongshuo Yin et.al. 2311.15010 link
2023-11-28 Uncertainty Aware AI for 2D MRI Segmentation Lohith Konathala et.al. 2311.14875 null
2023-11-24 Understanding Self-Supervised Features for Learning Unsupervised Instance Segmentation Paul Engstler et.al. 2311.14665 null
2023-11-24 IDD-AW: A Benchmark for Safe and Robust Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Furqan Ahmed Shaik et.al. 2311.14459 null
2023-11-24 Segment (Almost) Nothing: Prompt-Agnostic Adversarial Attacks on Segmentation Models Francesco Croce et.al. 2311.14450 null
2023-11-24 OneFormer3D: One Transformer for Unified Point Cloud Segmentation Maxim Kolodiazhnyi et.al. 2311.14405 link
2023-11-23 Class Balanced Dynamic Acquisition for Domain Adaptive Semantic Segmentation using Active Learning Marc Schachtsiek et.al. 2311.14146 null
2023-11-23 Language-guided Few-shot Semantic Segmentation Jing Wang et.al. 2311.13865 null
2023-11-22 DiverseNet: Decision Diversified Semi-supervised Semantic Segmentation Networks for Remote Sensing Imagery Wanli Ma et.al. 2311.13716 null
2023-11-22 BenthIQ: a Transformer-Based Benthic Classification Model for Coral Restoration Rupa Kurinchi-Vendhan et.al. 2311.13661 null
2023-11-22 DA-STC: Domain Adaptive Video Semantic Segmentation via Spatio-Temporal Consistency Zhe Zhang et.al. 2311.13254 link
2023-11-22 Self-guided Few-shot Semantic Segmentation for Remote Sensing Imagery Based on Large Vision Models Xiyu Qi et.al. 2311.13200 null
2023-11-22 FuseNet: Self-Supervised Dual-Path Network for Medical Image Segmentation Amirhossein Kazerouni et.al. 2311.13069 link
2023-11-21 AI for Agriculture: the Comparison of Semantic Segmentation Methods for Crop Mapping with Sentinel-2 Imagery Irina Korotkova et.al. 2311.12993 null
2023-11-21 Mobile-Seed: Joint Semantic Segmentation and Boundary Detection for Mobile Robots Youqi Liao et.al. 2311.12651 link
2023-11-21 Instance-aware 3D Semantic Segmentation powered by Shape Generators and Classifiers Bo Sun et.al. 2311.12291 null
2023-11-20 Disentangling Structure and Appearance in ViT Feature Space Narek Tumanyan et.al. 2311.12193 null
2023-11-20 Model-aware 3D Eye Gaze from Weak and Few-shot Supervisions Nikola Popovic et.al. 2311.12157 link
2023-11-20 GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding Hao Li et.al. 2311.11863 null
2023-11-20 Predicting urban tree cover from incomplete point labels and limited background information Hui Zhang et.al. 2311.11592 null
2023-11-20 Generalized Category Discovery in Semantic Segmentation Zhengyuan Peng et.al. 2311.11525 link
2023-11-19 SOccDPT: Semi-Supervised 3D Semantic Occupancy from Dense Prediction Transformers trained under memory constraints Aditya Nalgunda Ganesh et.al. 2311.11371 null
2023-11-19 Optimizing rgb-d semantic segmentation through multi-modal interaction and pooling attention Shuai Zhang et.al. 2311.11312 null
2023-11-18 Low-Precision Floating-Point for Efficient On-Board Deep Neural Network Processing Cédric Gernigon et.al. 2311.11172 null
2023-11-18 SNI-SLAM: Semantic Neural Implicit SLAM Siting Zhu et.al. 2311.11016 link
2023-11-17 Labeling Indoor Scenes with Fusion of Out-of-the-Box Perception Models Yimeng Li et.al. 2311.10883 null
2023-11-17 Self-trained Panoptic Segmentation Shourya Verma et.al. 2311.10648 null
2023-11-17 A Framework of Landsat-8 Band Selection based on UMDA for Deforestation Detection Eduardo B. Neto et.al. 2311.10513 null
2023-11-15 NormNet: Scale Normalization for 6D Pose Estimation in Stacked Scenarios En-Te Lin et.al. 2311.09269 link
2023-11-15 Correlation-aware active learning for surgery video segmentation Fei Wu et.al. 2311.08811 null
2023-11-14 Efficient Rotation Invariance in Deep Neural Networks through Artificial Mental Rotation Lukas Tuggener et.al. 2311.08525 null
2023-11-14 LocaliseBot: Multi-view 3D object localisation with differentiable rendering for robot grasping Sujal Vijayaraghavan et.al. 2311.08438 null
2023-11-14 Test-Time Training for Semantic Segmentation with Output Contrastive Loss Yunlong Zhang et.al. 2311.07877 link
2023-11-13 Temporal Performance Prediction for Deep Convolutional Long Short-Term Memory Networks Laura Fieback et.al. 2311.07477 null
2023-11-14 Simultaneous Clutter Detection and Semantic Segmentation of Moving Objects for Automotive Radar Data Johannes Kopp et.al. 2311.07247 null
2023-11-13 SpectralGPT: Spectral Foundation Model Danfeng Hong et.al. 2311.07113 null
2023-11-11 Unsupervised and semi-supervised co-salient object detection via segmentation frequency statistics Souradeep Chakraborty et.al. 2311.06654 null
2023-11-10 Lidar-based Norwegian tree species detection using deep learning Martijn Vermeer et.al. 2311.06066 null
2023-11-09 PolyMaX: General Dense Prediction with Mask Transformer Xuan Yang et.al. 2311.05770 link
2023-11-09 TLCFuse: Temporal Multi-Modality Fusion Towards Occlusion-Aware Semantic Segmentation-Aided Motion Planning Gustavo Salazar-Gomez et.al. 2311.05319 null
2023-11-09 Reducing the Side-Effects of Oscillations in Training of Quantized YOLO Networks Kartik Gupta et.al. 2311.05109 null
2023-11-07 Data exploitation: multi-task learning of object detection and semantic segmentation on partially annotated data Hoàng-Ân Lê et.al. 2311.04040 link
2023-11-07 A Comparative Study of Knowledge Transfer Methods for Misaligned Urban Building Labels Bipul Neupane et.al. 2311.03867 null
2023-11-07 Autonomous Exploration and General Visual Inspection of Ship Ballast Water Tanks using Aerial Robots Mihir Dharmadhikari et.al. 2311.03838 null
2023-11-06 Leveraging point annotations in segmentation learning with boundary loss Eva Breznik et.al. 2311.03537 null
2023-11-06 TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding Shuo Wang et.al. 2311.03427 link
2023-11-06 SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis Hanrong Ye et.al. 2311.03355 null
2023-11-06 Segmentation of Drone Collision Hazards in Airborne RADAR Point Clouds Using PointNet Hector Arroyo et.al. 2311.03221 null
2023-11-06 Pelvic floor MRI segmentation based on semi-supervised deep learning Jianwei Zuo et.al. 2311.03105 null
2023-11-06 COLA: COarse-LAbel multi-source LiDAR semantic segmentation for autonomous driving Jules Sanchez et.al. 2311.03017 null
2023-11-08 Deep Image Semantic Communication Model for Artificial Intelligent Internet of Things Li Ping Qian et.al. 2311.02926 link
2023-11-05 PotholeGuard: A Pothole Detection Approach by Point Cloud Semantic Segmentation Sahil Nawale et.al. 2311.02641 null
2023-11-05 TFNet: Tuning Fork Network with Neighborhood Pixel Aggregation for Improved Building Footprint Extraction Muhammad Ahmad Waseem et.al. 2311.02617 null
2023-11-03 Image Recognition of Oil Leakage Area Based on Logical Semantic Discrimination Weiying Lin et.al. 2311.02256 null
2023-11-03 MineSegSAT: An automated system to evaluate mining disturbed area extents from Sentinel-2 imagery Ezra MacDonald et.al. 2311.01676 link
2023-11-02 MemorySeg: Online LiDAR Semantic Segmentation with a Latent Memory Enxu Li et.al. 2311.01556 null
2023-11-02 AiluRus: A Scalable ViT Framework for Dense Prediction Jin Li et.al. 2311.01197 link
2023-11-02 A deep learning experiment for semantic segmentation of overlapping characters in palimpsests Michela Perino et.al. 2311.01130 null
2023-11-02 Overhead Line Defect Recognition Based on Unsupervised Semantic Segmentation Weixi Wang et.al. 2311.00979 null
2023-11-01 PAUMER: Patch Pausing Transformer for Semantic Segmentation Evann Courdier et.al. 2311.00586 null
2023-10-31 Joint Depth Prediction and Semantic Segmentation with Multi-View SAM Mykhailo Shvets et.al. 2311.00134 null
2023-10-31 Bilateral Network with Residual U-blocks and Dual-Guided Attention for Real-time Semantic Segmentation Liang Liao et.al. 2310.20305 link
2023-10-31 Annotator: A Generic Active Learning Baseline for LiDAR Semantic Segmentation Binhui Xie et.al. 2310.20293 null
2023-10-30 Dynamic Gaussian Splatting from Markerless Motion Capture can Reconstruct Infants Movements R. James Cotton et.al. 2310.19441 null
2023-10-30 Resource Constrained Semantic Segmentation for Waste Sorting Elisa Cascina et.al. 2310.19407 link
2023-10-30 L2T-DLN: Learning to Teach with Dynamic Loss Network Zhoyang Hai et.al. 2310.19313 null
2023-10-30 Revisiting Evaluation Metrics for Semantic Segmentation: Optimization and Evaluation of Fine-grained Intersection over Union Zifu Wang et.al. 2310.19252 link
2023-10-30 Modular Anti-noise Deep Learning Network for Robotic Grasp Detection Based on RGB Images Zhaocong Li et.al. 2310.19223 link
2023-10-29 Dynamic Task and Weight Prioritization Curriculum Learning for Multimodal Imagery Huseyin Fuat Alsan et.al. 2310.19109 link
2023-10-29 Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation Fei Zhang et.al. 2310.19001 null
2023-10-29 Mask Propagation for Efficient Video Semantic Segmentation Yuetian Weng et.al. 2310.18954 link
2023-10-28 Exploring Data Augmentations on Self-/Semi-/Fully- Supervised Pre-trained Models Shentong Mo et.al. 2310.18850 null
2023-10-28 One-shot Localization and Segmentation of Medical Images with Foundation Models Deepa Anand et.al. 2310.18642 null
2023-10-28 Switching Temporary Teachers for Semi-Supervised Semantic Segmentation Jaemin Na et.al. 2310.18640 link
2023-10-27 A Self-Supervised Approach to Land Cover Segmentation Charles Moore et.al. 2310.18251 null
2023-10-27 SmooSeg: Smoothness Prior for Unsupervised Semantic Segmentation Mengcheng Lan et.al. 2310.17874 link
2023-10-26 Image Prior and Posterior Conditional Probability Representation for Efficient Damage Assessment Jie Wei et.al. 2310.17801 null
2023-10-26 Revisiting the Distillation of Image Representations into Point Clouds for Autonomous Driving Gilles Puy et.al. 2310.17504 link
2023-10-26 Uncertainty-weighted Loss Functions for Improved Adversarial Attacks on Semantic Segmentation Kira Maag et.al. 2310.17436 link
2023-10-26 BEVContrast: Self-Supervision in BEV Space for Automotive Lidar Point Clouds Corentin Sautier et.al. 2310.17281 link
2023-10-26 Virtual Accessory Try-On via Keypoint Hallucination Junhong Gou et.al. 2310.17131 null
2023-10-26 Automating lichen monitoring in ecological studies using instance segmentation of time-lapse images Safwen Naimi et.al. 2310.17080 null
2023-10-25 Unsupervised Domain Adaptation for Semantic Segmentation with Pseudo Label Self-Refinement Xingchen Zhao et.al. 2310.16979 null
2023-10-25 4D-Editor: Interactive Object-level Editing in Dynamic Neural Radiance Fields via 4D Semantic Segmentation Dadong Jiang et.al. 2310.16858 null
2023-10-25 Gramian Attention Heads are Strong yet Efficient Vision Learners Jongbin Ryu et.al. 2310.16483 link
2023-10-24 Pixel-Level Clustering Network for Unsupervised Image Segmentation Cuong Manh Hoang et.al. 2310.16234 null
2023-10-26 CPSeg: Finer-grained Image Semantic Segmentation via Chain-of-Thought Language Prompting Lei Li et.al. 2310.16069 null
2023-10-26 ConvBKI: Real-Time Probabilistic Semantic Mapping Network with Quantifiable Uncertainty Joey Wilson et.al. 2310.16020 null
2023-10-24 Semantic-preserving image coding based on Conditional Diffusion models Francesco Pezone et.al. 2310.15737 link
2023-10-26 GNeSF: Generalizable Neural Semantic Fields Hanlin Chen et.al. 2310.15712 null
2023-10-23 SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding Haoxiang Wang et.al. 2310.15308 null
2023-10-23 FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models Lihe Yang et.al. 2310.15160 link
2023-10-23 P2AT: Pyramid Pooling Axial Transformer for Real-time Semantic Segmentation Mohammed A. M. Elhassan et.al. 2310.15025 link
2023-10-22 A Survey on Continual Semantic Segmentation: Theory, Challenge, Method and Application Bo Yuan et.al. 2310.14277 link
2023-10-22 Partition Speeds Up Learning Implicit Neural Representations Based on Exponential-Increase Hypothesis Ke Liu et.al. 2310.14184 link
2023-10-20 Longer-range Contextualized Masked Autoencoder Taekyung Kim et.al. 2310.13593 link
2023-10-20 ROSS: Radar Off-road Semantic Segmentation Peng Jiang et.al. 2310.13551 null
2023-10-20 Technical Report for ICCV 2023 Visual Continual Learning Challenge: Continuous Test-time Adaptation for Semantic Segmentation Damian Sójka et.al. 2310.13533 null
2023-10-20 A review of individual tree crown detection and delineation from optical remote sensing images Juepeng Zheng et.al. 2310.13481 null
2023-10-20 FLAIR: a Country-Scale Land Cover Semantic Segmentation Dataset From Multi-Source Optical Imagery Anatol Garioud et.al. 2310.13336 link
2023-10-19 LeTFuser: Light-weight End-to-end Transformer-Based Sensor Fusion for Autonomous Driving with Multi-Task Learning Pedram Agand et.al. 2310.13135 link
2023-10-19 Using Logic Programming and Kernel-Grouping for Improving Interpretability of Convolutional Neural Networks Parth Padalkar et.al. 2310.13073 null
2023-10-19 Weakly-Supervised Semantic Segmentation with Image-Level Labels: from Traditional Models to Foundation Models Zhaozheng Chen et.al. 2310.13026 link
2023-10-19 Minimalist and High-Performance Semantic Segmentation with Plain Vision Transformers Yuanduo Hong et.al. 2310.12755 link
2023-10-19 Cross-attention Spatio-temporal Context Transformer for Semantic Segmentation of Historical Maps Sidi Wu et.al. 2310.12616 link
2023-10-19 RecolorCloud: A Point Cloud Tool for Recoloring, Segmentation, and Conversion Esteban Segarra Martinez et.al. 2310.12470 null
2023-10-19 Lidar Panoptic Segmentation and Tracking without Bells and Whistles Abhinav Agarwalla et.al. 2310.12464 link
2023-10-18 SegmATRon: Embodied Adaptive Semantic Segmentation for Indoor Environment Tatiana Zemskova et.al. 2310.12031 link
2023-10-16 IDRNet: Intervention-Driven Relation Network for Semantic Segmentation Zhenchao Jin et.al. 2310.10755 link
2023-10-16 Motion2Language, Unsupervised learning of synchronized semantic motion segmentation Karim Radouane et.al. 2310.10594 link
2023-10-16 RefConv: Re-parameterized Refocusing Convolution for Powerful ConvNets Zhicheng Cai et.al. 2310.10563 link
2023-10-17 Label-efficient Segmentation via Affinity Propagation Wentong Li et.al. 2310.10533 link
2023-10-16 On the Transferability of Learning Models for Semantic Segmentation for Remote Sensing Data Rongjun Qin et.al. 2310.10490 link
2023-10-15 Top-K Pooling with Patch Contrastive Learning for Weakly-Supervised Semantic Segmentation Wangyu Wu et.al. 2310.09828 null
2023-10-15 Image Augmentation with Controlled Diffusion for Weakly-Supervised Semantic Segmentation Wangyu Wu et.al. 2310.09760 null
2023-10-13 Equirectangular image construction method for standard CNNs for Semantic Segmentation Haoqian Chen et.al. 2310.09122 null
2023-10-13 Faster 3D cardiac CT segmentation with Vision Transformers Lee Jollans et.al. 2310.09099 link
2023-10-13 Revisiting Multi-modal 3D Semantic Segmentation in Real-world Autonomous Driving Feng Jiang et.al. 2310.08826 null
2023-10-12 SSG2: A new modelling paradigm for semantic segmentation Foivos I. Diakogiannis et.al. 2310.08671 link
2023-10-16 SegLoc: Novel Visual Self-supervised Learning Scheme for Dense Prediction Tasks of Security Inspection X-ray Images Shervin Halat et.al. 2310.08421 null
2023-10-12 UniPAD: A Universal Pre-training Paradigm for Autonomous Driving Honghui Yang et.al. 2310.08370 link
2023-10-12 NSM4D: Neural Scene Model Based Online 4D Point Cloud Sequence Understanding Yuhao Dong et.al. 2310.08326 null
2023-10-12 GraphAlign: Enhancing Accurate Feature Alignment by Graph matching for Multi-Modal 3D Object Detection Ziying Song et.al. 2310.08261 null
2023-10-12 BaSAL: Size Balanced Warm Start Active Learning for LiDAR Semantic Segmentation Jiarong Wei et.al. 2310.08035 null
2023-10-11 HaarNet: Large-scale Linear-Morphological Hybrid Network for RGB-D Semantic Segmentation Rick Groenendijk et.al. 2310.07669 null
2023-10-11 Context-Enhanced Detector For Building Detection From Remote Sensing Images Ziyue Huang et.al. 2310.07638 null
2023-10-11 PeP: a Point enhanced Painting method for unified point cloud tasks Zichao Dong et.al. 2310.07591 null
2023-10-11 Heuristic Vision Pre-Training with Self-Supervised and Supervised Multi-Task Learning Zhiming Qian et.al. 2310.07510 null
2023-10-11 CLIP for Lightweight Semantic Segmentation Ke Jin et.al. 2310.07394 null
2023-10-11 Causal Unsupervised Semantic Segmentation Junho Kim et.al. 2310.07379 link
2023-10-11 Distilling Efficient Vision Transformers from CNNs for Semantic Segmentation Xu Zheng et.al. 2310.07265 null
2023-10-11 Robust Unsupervised Domain Adaptation by Retaining Confident Entropy via Edge Concatenation Hye-Seong Hong et.al. 2310.07149 null
2023-10-10 Utilizing Synthetic Data for Medical Vision-Language Pre-training: Bypassing the Need for Real Images Che Liu et.al. 2310.07027 link
2023-10-10 CoinSeg: Contrast Inter- and Intra- Class Representations for Incremental Segmentation Zekang Zhang et.al. 2310.06368 link
2023-10-09 CoBEVFusion: Cooperative Perception with LiDAR-Camera Bird’s-Eye View Fusion Donghao Qiao et.al. 2310.06008 null
2023-10-09 Unleashing the power of Neural Collapse for Transferability Estimation Yuhe Ding et.al. 2310.05754 null
2023-10-10 Hierarchical Side-Tuning for Vision Transformers Weifeng Lin et.al. 2310.05393 link
2023-10-11 A Critical Look at Classic Test-Time Adaptation Methods in Semantic Segmentation Chang’an Yi et.al. 2310.05341 link
2023-10-08 Geometry Aware Field-to-field Transformations for 3D Semantic Segmentation Dominik Hollidt et.al. 2310.05133 null
2023-10-08 Bidirectional Knowledge Reconfiguration for Lightweight Point Cloud Analysis Peipei Li et.al. 2310.05125 null
2023-10-08 Enhancing Representations through Heterogeneous Self-Supervised Learning Zhong-Yu Li et.al. 2310.05108 null
2023-10-08 OV-PARTS: Towards Open-Vocabulary Part Segmentation Meng Wei et.al. 2310.05107 link
2023-10-08 Low-Resolution Self-Attention for Semantic Segmentation Yu-Huan Wu et.al. 2310.05026 link
2023-10-08 Human-in-the-loop: The future of Machine Learning in Automated Electron Microscopy Sergei V. Kalinin et.al. 2310.05018 null
2023-10-08 SemST: Semantically Consistent Multi-Scale Image Translation via Structure-Texture Alignment Ganning Zhao et.al. 2310.04995 null
2023-10-07 Federated Self-Supervised Learning of Monocular Depth Estimators for Autonomous Vehicles Elton F. de S. Soares et.al. 2310.04837 null
2023-10-07 Combining UPerNet and ConvNeXt for Contrails Identification to reduce Global Warming Zhenkuan Wang et.al. 2310.04808 link
2023-10-07 Towards Dynamic and Small Objects Refinement for Unsupervised Domain Adaptative Nighttime Semantic Segmentation Jingyi Pan et.al. 2310.04747 null
2023-10-07 Activate and Reject: Towards Safe Domain Generalization under Category Shift Chaoqi Chen et.al. 2310.04724 null
2023-10-07 Memory-Constrained Semantic Segmentation for Ultra-High Resolution UAV Imagery Qi Li et.al. 2310.04721 null
2023-10-06 VTON-IT: Virtual Try-On using Image Translation Santosh Adhikari et.al. 2310.04558 link
2023-10-06 Semantic segmentation of longitudinal thermal images for identification of hot and cool spots in urban areas Vasantha Ramani et.al. 2310.04247 null
2023-10-06 DiffPrompter: Differentiable Implicit Visual Prompts for Semantic-Segmentation in Adverse Conditions Sanket Kalwar et.al. 2310.04181 null
2023-10-06 A Deeply Supervised Semantic Segmentation Method Based on GAN Wei Zhao et.al. 2310.04081 null
2023-10-06 Robust Multimodal Learning with Missing Modalities via Parameter-Efficient Adaptation Md Kaykobad Reza et.al. 2310.03986 null
2023-10-05 Ammonia-Net: A Multi-task Joint Learning Model for Multi-class Segmentation and Classification in Tooth-marked Tongue Diagnosis Shunkai Shi et.al. 2310.03472 null
2023-10-03 CLIP Is Also a Good Teacher: A New Learning Framework for Inductive Zero-shot Semantic Segmentation Jialei Chen et.al. 2310.02296 null
2023-10-03 TransRadar: Adaptive-Directional Transformer for Real-Time Multi-View Radar Semantic Segmentation Yahia Dalbah et.al. 2310.02260 link
2023-10-03 Exploring Model Learning Heterogeneity for Boosting Ensemble Robustness Yanzhao Wu et.al. 2310.02237 link
2023-10-03 TreeScope: An Agricultural Robotics Dataset for LiDAR-Based Mapping of Trees in Forests and Orchards Derek Cheng et.al. 2310.02162 link
2023-10-03 Trainable Noise Model as an XAI evaluation method: application on Sobol for remote sensing image segmentation Hossein Shreim et.al. 2310.01828 link
2023-10-03 Predicting Future Spatiotemporal Occupancy Grids with Semantics for Autonomous Driving Maneekwan Toyungyernsub et.al. 2310.01723 null
2023-10-02 CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction Size Wu et.al. 2310.01403 link
2023-10-02 Efficient Remote Sensing Segmentation With Generative Adversarial Transformer Luyi Qiu et.al. 2310.01292 null
2023-10-02 LoCUS: Learning Multiscale 3D-consistent Features from Posed Images Dominik A. Kloepfer et.al. 2310.01095 null
2023-10-02 Improved Crop and Weed Detection with Diverse Data Ensemble Learning in Agriculture Muhammad Hamza Asad et.al. 2310.01055 null
2023-10-02 Multi-task Learning with 3D-Aware Regularization Wei-Hong Li et.al. 2310.00986 link
2023-10-01 Propagating Semantic Labels in Video Data David Balaban et.al. 2310.00783 null
2023-10-01 Counterfactual Image Generation for adversarially robust and interpretable Classifiers Rafael Bischof et.al. 2310.00761 null
2023-10-01 Win-Win: Training High-Resolution Vision Transformers from Two Windows Vincent Leroy et.al. 2310.00632 null
2023-09-30 Technical Report of 2023 ABO Fine-grained Semantic Segmentation Competition Zeyu Dong et.al. 2310.00427 null
2023-09-30 An easy zero-shot learning combination: Texture Sensitive Semantic Segmentation IceHrNet and Advanced Style Transfer Learning Strategy Zhiyong Yang et.al. 2310.00310 link
2023-09-30 Dual-Augmented Transformer Network for Weakly Supervised Semantic Segmentation Jingliang Deng et.al. 2310.00307 null
2023-10-04 Text-image Alignment for Diffusion-based Perception Neehar Kondapaneni et.al. 2310.00031 link
2023-09-29 APNet: Urban-level Scene Segmentation of Aerial Images and Point Clouds Weijie Wei et.al. 2309.17162 link
2023-09-29 SegRCDB: Semantic Segmentation via Formula-Driven Supervised Learning Risa Shinoda et.al. 2309.17083 link
2023-09-29 Synthetic Data Generation and Deep Learning for the Topological Analysis of 3D Data Dylan Peek et.al. 2309.16968 null
2023-09-29 COMNet: Co-Occurrent Matching for Weakly Supervised Semantic Segmentation Yukun Su et.al. 2309.16959 null
2023-09-29 Model2Scene: Learning 3D Scene Representation via Contrastive Language-CAD Models Pre-training Runnan Chen et.al. 2309.16956 null
2023-09-29 YOLOR-Based Multi-Task Learning Hung-Shuo Chang et.al. 2309.16921 link
2023-10-02 Superpixel Transformers for Efficient Semantic Segmentation Alex Zihao Zhu et.al. 2309.16889 null
2023-10-03 Cross-City Matters: A Multimodal Remote Sensing Benchmark Dataset for Cross-City Semantic Segmentation using High-Resolution Domain Adaptation Networks Danfeng Hong et.al. 2309.16499 null
2023-09-28 Open Compound Domain Adaptation with Object Style Compensation for Semantic Segmentation Tingliang Feng et.al. 2309.16127 null
2023-09-27 Rapid Network Adaptation: Learning to Adapt Neural Networks Using Test-Time Feedback Teresa Yeo et.al. 2309.15762 null
2023-09-27 CAIT: Triple-Win Compression towards High Accuracy, Fast Inference, and Favorable Transferability For ViTs Ao Wang et.al. 2309.15755 null
2023-09-27 InfraParis: A multi-modal and multi-task autonomous driving dataset Gianni Franchi et.al. 2309.15751 link
2023-09-27 Factorized Diffusion Architectures for Unsupervised Image Generation and Segmentation Xin Yuan et.al. 2309.15726 null
2023-09-27 Learning from SAM: Harnessing a Segmentation Foundation Model for Sim2Real Domain Adaptation through Regularization Mayara E. Bonani et.al. 2309.15562 null
2023-09-27 Investigating the changes in BOLD responses during viewing of images with varied complexity: An fMRI time-series based analysis on human vision Naveen Kanigiri et.al. 2309.15495 link
2023-09-27 The Robust Semantic Segmentation UNCV2023 Challenge Results Xuanlong Yu et.al. 2309.15478 null
2023-09-27 Inherit with Distillation and Evolve with Contrast: Exploring Class Incremental Semantic Segmentation Without Exemplar Memory Danpei Zhao et.al. 2309.15413 null
2023-09-27 Seeing Beyond the Patch: Scale-Adaptive Semantic Segmentation of High-resolution Remote Sensing Imagery based on Reinforcement Learning Yinhe Liu et.al. 2309.15372 null
2023-09-26 M $^{3}$ 3D: Learning 3D priors using Multi-Modal Masked Autoencoders for 2D image and video understanding Muhammad Abdullah Jamal et.al. 2309.15313 null
2023-09-26 ZiCo-BC: A Bias Corrected Zero-Shot NAS for Vision Tasks Kartikeya Bhardwaj et.al. 2309.14666 null
2023-09-25 Dynamic Scene Graph Representation for Surgical Video Felix Holm et.al. 2309.14538 null
2023-09-29 Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic Segmentation Quang Nguyen et.al. 2309.14303 link
2023-09-25 CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free Monika Wysoczańska et.al. 2309.14289 link
2023-09-25 Calibration-based Dual Prototypical Contrastive Learning Approach for Domain Generalization Semantic Segmentation Muxin Liao et.al. 2309.14282 link
2023-09-25 Informative Data Mining for One-Shot Cross-Domain Semantic Segmentation Yuxi Wang et.al. 2309.14241 null
2023-09-25 Masked Image Residual Learning for Scaling Deeper Vision Transformers Guoxi Huang et.al. 2309.14136 link
2023-09-25 Small Objects Matters in Weakly-supervised Semantic Segmentation Cheolhyun Mun et.al. 2309.14117 null
2023-09-26 AsymFormer: Asymmetrical Cross-Modal Representation Learning for Mobile Platform Real-Time RGB-D Semantic Segmentation Siqi Du et.al. 2309.14065 link
2023-09-25 Weakly Supervised Semantic Segmentation by Knowledge Graph Inference Jia Zhang et.al. 2309.14057 link
2023-09-24 Distribution-Aware Continual Test Time Adaptation for Semantic Segmentation Jiayi Ni et.al. 2309.13604 link
2023-09-24 LOGICSEG: Parsing Visual Semantics with Neural Logic Learning and Reasoning Liulei Li et.al. 2309.13556 null
2023-09-24 Towards Robust Robot 3D Perception in Urban Environments: The UT Campus Object Dataset Arthur Zhang et.al. 2309.13549 link
2023-09-24 Bridging Semantic Gaps for Language-Supervised Semantic Segmentation Yun Xing et.al. 2309.13505 link
2023-09-23 A Unified Scheme of ResNet and Softmax Zhao Song et.al. 2309.13482 null
2023-09-23 FedDrive v2: an Analysis of the Impact of Label Skewness in Federated Semantic Segmentation for Autonomous Driving Eros Fanì et.al. 2309.13336 link
2023-09-23 Discwise Active Learning for LiDAR Semantic Segmentation Ozan Unal et.al. 2309.13276 null
2023-09-22 ClusterFormer: Clustering As A Universal Visual Learner James C. Liang et.al. 2309.13196 link
2023-09-22 Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation Wei Zhai et.al. 2309.12943 link
2023-09-22 Scalable Semantic 3D Mapping of Coral Reefs with Deep Learning Jonathan Sauder et.al. 2309.12804 null
2023-09-22 Triple-View Knowledge Distillation for Semi-Supervised Semantic Segmentation Ping Li et.al. 2309.12557 null
2023-09-21 DualToken-ViT: Position-aware Efficient Vision Transformer with Dual Token Fusion Zhenzhen Chu et.al. 2309.12424 null
2023-09-21 MoPA: Multi-Modal Prior Aided Domain Adaptation for 3D Semantic Segmentation Haozhi Cao et.al. 2309.11839 link
2023-09-21 2DDATA: 2D Detection Annotations Transmittable Aggregation for Semantic Segmentation on Point Cloud Guan-Cheng Lee et.al. 2309.11755 null
2023-09-21 MoDA: Leveraging Motion Priors from Videos for Advancing Unsupervised Domain Adaptation in Semantic Segmentation Fei Pan et.al. 2309.11711 link
2023-09-20 EPTQ: Enhanced Post-Training Quantization via Label-Free Hessian Ofir Gordon et.al. 2309.11531 link
2023-09-20 RMT: Retentive Networks Meet Vision Transformers Qihang Fan et.al. 2309.11523 link
2023-09-20 Towards Robust Few-shot Point Cloud Semantic Segmentation Yating Xu et.al. 2309.11228 link
2023-09-20 Dense 2D-3D Indoor Prediction with Sound via Aligned Cross-Modal Distillation Heeseung Yun et.al. 2309.11081 link
2023-09-21 CaveSeg: Deep Semantic Segmentation and Scene Parsing for Autonomous Underwater Cave Exploration A. Abdullah et.al. 2309.11038 null
2023-09-19 Change of Scenery: Unsupervised LiDAR Change Detection for Mobile Robots Alexander Krawciw et.al. 2309.10924 null
2023-09-19 Few-Shot Panoptic Segmentation With Foundation Models Markus Käppeler et.al. 2309.10726 link
2023-09-19 Cross-modal and Cross-domain Knowledge Transfer for Label-free 3D Segmentation Jingyu Zhang et.al. 2309.10649 null
2023-09-19 Adversarial Attacks Against Uncertainty Quantification Emanuele Ledda et.al. 2309.10586 null
2023-09-19 SPOT: Scalable 3D Pre-training via Occupancy Prediction for Autonomous Driving Xiangchao Yan et.al. 2309.10527 link
2023-09-19 Spatial-Assistant Encoder-Decoder Network for Real Time Semantic Segmentation Yalun Wang et.al. 2309.10519 link
2023-09-19 RECALL+: Adversarial Web-based Replay for Continual Learning in Semantic Segmentation Chang Liu et.al. 2309.10479 null
2023-09-19 LineMarkNet: Line Landmark Detection for Valet Parking Zizhang Wu et.al. 2309.10475 null
2023-09-19 An Empirical Study of Attention Networks for Semantic Segmentation Hao Guo et.al. 2309.10217 null
2023-09-18 DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation Bowen Yin et.al. 2309.09668 link
2023-09-18 Heterogeneous Generative Knowledge Distillation with Masked Image Modeling Ziming Wang et.al. 2309.09571 null
2023-09-18 PanoMixSwap Panorama Mixing via Structural Swapping for Indoor Scene Understanding Yu-Cheng Hsieh et.al. 2309.09514 null
2023-09-18 Target-aware Bi-Transformer for Few-shot Segmentation Xianglin Wang et.al. 2309.09492 null
2023-09-17 Active Learning for Semantic Segmentation with Multi-class Label Query Sehyun Hwang et.al. 2309.09319 null
2023-09-17 CLIPUNetr: Assisting Human-robot Interface for Uncalibrated Visual Servoing Control with CLIP-driven Referring Expression Segmentation Chen Jiang et.al. 2309.09183 null
2023-09-15 T-UDA: Temporal Unsupervised Domain Adaptation in Sequential Point Clouds Awet Haileslassie Gebrehiwot et.al. 2309.08302 link
2023-09-14 Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation Zhaochong An et.al. 2309.08020 link
2023-09-17 TFNet: Exploiting Temporal Cues for Fast and Accurate LiDAR Semantic Segmentation Rong Li et.al. 2309.07849 null
2023-09-14 Large-scale Weakly Supervised Learning for Road Extraction from Satellite Imagery Shiqiao Meng et.al. 2309.07823 null
2023-09-14 Neural Field Representations of Articulated Objects for Robotic Manipulation Planning Phillip Grote et.al. 2309.07620 null
2023-09-14 JSMNet Improving Indoor Point Cloud Semantic and Instance Segmentation through Self-Attention and Multiscale Shuochen Xu et.al. 2309.07425 null
2023-09-13 Automated Assessment of Critical View of Safety in Laparoscopic Cholecystectomy Yunfan Li et.al. 2309.07330 null
2023-09-13 Lavender Autonomous Navigation with Semantic Segmentation at the Edge Alessandro Navone et.al. 2309.06863 null
2023-09-15 Dynamic Spectrum Mixer for Visual Recognition Zhiqiang Hu et.al. 2309.06721 null
2023-09-12 Padding-free Convolution based on Preservation of Differential Characteristics of Kernels Kuangdai Leng et.al. 2309.06370 null
2023-09-12 Exploring Flat Minima for Domain Generalization with Large Learning Rates Jian Zhang et.al. 2309.06337 null
2023-09-12 IBAFormer: Intra-batch Attention Transformer for Domain Generalized Semantic Segmentation Qiyu Sun et.al. 2309.06282 null
2023-09-12 Active Label Refinement for Semantic Segmentation of Satellite Images Tuan Pham Minh et.al. 2309.06159 null
2023-09-12 A2V: A Semi-Supervised Domain Adaptation Framework for Brain Vessel Segmentation via Two-Phase Training Angiography-to-Venography Translation Francesco Galati et.al. 2309.06075 null
2023-09-12 Real-Time Semantic Segmentation: A Brief Survey & Comparative Study in Remote Sensing Clifford Broni-Bediako et.al. 2309.06047 null
2023-09-15 Self-Correlation and Cross-Correlation Learning for Few-Shot Remote Sensing Image Semantic Segmentation Linhan Wang et.al. 2309.05840 link
2023-09-11 UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase Youquan Liu et.al. 2309.05573 link
2023-09-11 Learning Semantic Segmentation with Query Points Supervision on Aerial Images Santiago Rivier et.al. 2309.05490 link
2023-09-11 Panoptic Vision-Language Feature Fields Haoran Chen et.al. 2309.05448 link
2023-09-11 Towards Content-based Pixel Retrieval in Revisited Oxford and Paris Guoyuan An et.al. 2309.05438 link
2023-09-15 DeCUR: decoupling common & unique representations for multimodal self-supervision Yi Wang et.al. 2309.05300 link
2023-09-12 MFPNet: Multi-scale Feature Propagation Network For Lightweight Semantic Segmentation Guoan Xu et.al. 2309.04914 null
2023-09-12 Mask2Anomaly: Mask Transformer for Universal Open-set Segmentation Shyam Nandan Rai et.al. 2309.04573 null
2023-09-08 Long-Range Correlation Supervision for Land-Cover Classification from Remote Sensing Images Dawen Yu et.al. 2309.04225 null
2023-09-08 From Text to Mask: Localizing Entities Using the Attention of Text-to-Image Diffusion Models Changming Xiao et.al. 2309.04109 link
2023-09-08 Weakly Supervised Point Clouds Transformer for 3D Object Detection Zuojin Tang et.al. 2309.04105 null
2023-09-07 Towards Comparable Knowledge Distillation in Semantic Image Segmentation Onno Niemann et.al. 2309.03659 null
2023-09-07 BroadCAM: Outcome-agnostic Class Activation Mapping for Small-scale Weakly Supervised Applications Jiatai Lin et.al. 2309.03509 link
2023-09-06 EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation Nikolai Körber et.al. 2309.03244 link
2023-09-11 Exploring Semantic Consistency in Unpaired Image Translation to Generate Data for Surgical Applications Danush Kumar Venkatesh et.al. 2309.03048 link
2023-09-06 Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter Jinglong Wang et.al. 2309.02773 link
2023-09-05 Compressing Vision Transformers for Low-Resource Visual Learning Eric Youn et.al. 2309.02617 link
2023-09-05 Building a Winning Team: Selecting Source Model Ensembles using a Submodular Transferability Estimation Approach Vimal K B et.al. 2309.02429 null
2023-09-05 DCP-Net: A Distributed Collaborative Perception Network for Remote Sensing Semantic Segmentation Zhechao Wang et.al. 2309.02230 null
2023-09-06 Large Separable Kernel Attention: Rethinking the Large Kernel Attention Design in CNN Kin Wai Lau et.al. 2309.01439 link
2023-09-04 DAT++: Spatially Dynamic Vision Transformer with Deformable Attention Zhuofan Xia et.al. 2309.01430 link
2023-09-04 Attention as Annotation: Generating Images and Pseudo-masks for Weakly Supervised Semantic Segmentation with Diffusion Ryota Yoshihashi et.al. 2309.01369 null
2023-09-03 FOR-instance: a UAV laser scanning benchmark dataset for semantic and instance segmentation of individual trees Stefano Puliti et.al. 2309.01279 null
2023-09-02 RevColV2: Exploring Disentangled Representations in Masked Image Modeling Qi Han et.al. 2309.01005 link
2023-09-07 Exploring the Robustness of Human Parsers Towards Common Corruptions Sanyi Zhang et.al. 2309.00938 null
2023-09-02 Fearless Luminance Adaptation: A Macro-Micro-Hierarchical Transformer for Exposure Correction Gehui Li et.al. 2309.00872 null
2023-09-02 Deep Learning and Inverse Problems Ali Mohammad-Djafari et.al. 2309.00802 null
2023-09-01 dacl10k: Benchmark for Semantic Bridge Damage Segmentation Johannes Flotzinger et.al. 2309.00460 null
2023-09-01 Dense Voxel 3D Reconstruction Using a Monocular Event Camera Haodong Chen et.al. 2309.00385 null
2023-08-31 Self-supervised Semantic Segmentation: Consistency over Transformation Sanaz Karimijafarbigloo et.al. 2309.00143 link
2023-08-31 Laplacian-Former: Overcoming the Limitations of Vision Transformers in Local Texture Detection Reza Azad et.al. 2309.00108 link
2023-08-31 Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation Chaofan Ma et.al. 2309.00096 link
2023-08-31 PointOcc: Cylindrical Tri-Perspective View for Point-based 3D Semantic Occupancy Prediction Sicheng Zuo et.al. 2308.16896 link
2023-08-31 BTSeg: Barlow Twins Regularization for Domain Adaptation in Semantic Segmentation Johannes Künzel et.al. 2308.16819 link
2023-08-31 Towards Optimal Patch Size in Vision Transformers for Tumor Segmentation Ramtin Mojtahedi et.al. 2308.16598 link
2023-09-01 Self-Sampling Meta SAM: Enhancing Few-shot Medical Image Segmentation with Meta-Learning Yiming Zhang et.al. 2308.16466 link
2023-09-04 Deep Video Codec Control Christoph Reich et.al. 2308.16215 null
2023-08-30 Semi-supervised Domain Adaptation with Inter and Intra-domain Mixing for Semantic Segmentation Weifu Fu et.al. 2308.15855 null
2023-08-31 CongNaMul: A Dataset for Advanced Image Processing of Soybean Sprouts Byunghyun Ban et.al. 2308.15690 null
2023-08-29 3D Adversarial Augmentations for Robust Out-of-Domain Predictions Alexander Lehner et.al. 2308.15479 null
2023-08-29 Complementing Onboard Sensors with Satellite Map: A New Perspective for HD Map Construction Wenjie Gao et.al. 2308.15427 link
2023-08-29 Learning to Upsample by Learning to Sample Wenze Liu et.al. 2308.15085 link
2023-08-28 Maturity-Aware Active Learning for Semantic Segmentation with Hierarchically-Adaptive Sample Assessment Amirsaeed Yazdani et.al. 2308.14904 link
2023-08-29 Compositional Semantic Mix for Domain Adaptation in Point Cloud Segmentation Cristiano Saltori et.al. 2308.14619 link
2023-08-28 Semi-Supervised Learning for Visual Bird’s Eye View Semantic Segmentation Junyu Zhu et.al. 2308.14525 link
2023-08-28 Attention-Guided Lidar Segmentation and Odometry Using Image-to-Point Cloud Saliency Transfer Guanqun Ding et.al. 2308.14332 null
2023-08-27 Rethinking Exemplars for Continual Semantic Segmentation in Endoscopy Scenes: Entropy-based Mini-Batch Pseudo-Replay Guankun Wang et.al. 2308.14100 null
2023-08-26 Semi-Supervised Semantic Segmentation via Marginal Contextual Information Moshe Kimhi et.al. 2308.13900 link
2023-08-26 ReFuSeg: Regularized Multi-Modal Fusion for Precise Brain Tumour Segmentation Aditya Kasliwal et.al. 2308.13883 null
2023-08-25 RestNet: Boosting Cross-Domain Few-Shot Segmentation with Residual Transformation Network Xinyang Huang et.al. 2308.13469 link
2023-08-25 A Re-Parameterized Vision Transformer (ReVT) for Domain-Generalized Semantic Segmentation Jan-Aike Termöhlen et.al. 2308.13331 link
2023-08-25 SVQNet: Sparse Voxel-Adjacent Query Network for 4D Spatio-Temporal LiDAR Semantic Segmentation Xuechao Chen et.al. 2308.13323 null
2023-08-25 Black-box Unsupervised Domain Adaptation with Bi-directional Atkinson-Shiffrin Memory Jingyi Zhang et.al. 2308.13236 link
2023-08-24 Enhancing Perception and Immersion in Pre-Captured Environments through Learning-Based Eye Height Adaptation Qi Feng et.al. 2308.13042 null
2023-08-24 Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks Xiangyang Zhu et.al. 2308.12961 link
2023-08-25 Efficient assessment of window views in high-rise, high-density urban areas using 3D color City Information Models Maosu Li et.al. 2308.12909 null
2023-08-24 Boosting Semantic Segmentation from the Perspective of Explicit Class Embeddings Yuhe Liu et.al. 2308.12894 null
2023-08-24 Logic-induced Diagnostic Reasoning for Semi-supervised Semantic Segmentation Chen Liang et.al. 2308.12595 null
2023-08-24 Channel and Spatial Relation-Propagation Network for RGB-Thermal Semantic Segmentation Zikun Zhou et.al. 2308.12534 null
2023-08-23 A Spatiotemporal Correspondence Approach to Unsupervised LiDAR Segmentation with Traffic Applications Xiao Li et.al. 2308.12433 null
2023-08-23 Diffusion-based Image Translation with Label Guidance for Domain Adaptive Semantic Segmentation Duo Peng et.al. 2308.12350 null
2023-08-24 ACLS: Adaptive and Conditional Label Smoothing for Network Calibration Hyekang Park et.al. 2308.11911 null
2023-08-23 SUMMIT: Source-Free Adaptation of Uni-Modal Models to Multi-Modal Targets Cody Simons et.al. 2308.11880 link
2023-08-22 Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations Mohammadreza Salehi et.al. 2308.11796 link
2023-08-22 G3Reg: Pyramid Graph-based Global Registration using Gaussian Ellipsoid Model Zhijian Qiao et.al. 2308.11573 link
2023-08-22 Food Image Classification and Segmentation with Attention-based Multiple Instance Learning Valasia Vlachopoulou et.al. 2308.11452 null
2023-08-22 Masked Momentum Contrastive Learning for Zero-shot Semantic Understanding Jiantao Wu et.al. 2308.11448 null
2023-08-22 Semantic RGB-D Image Synthesis Shijie Li et.al. 2308.11356 null
2023-08-22 DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment Xujie Zhang et.al. 2308.11206 null
2023-08-22 A three in one bottom-up framework for simultaneous semantic segmentation, instance segmentation and classification of multi-organ nuclei in digital cancer histology Ibtihaj Ahmad et.al. 2308.11179 null
2023-08-22 Hierarchical Point-based Active Learning for Semi-supervised Point Cloud Semantic Segmentation Zongyi Xu et.al. 2308.11166 link
2023-08-21 Beyond Discriminative Regions: Saliency Maps as Alternatives to CAMs for Weakly Supervised Semantic Segmentation M. Maruf et.al. 2308.11052 null
2023-08-21 Diffusion Model as Representation Learner Xingyi Yang et.al. 2308.10916 link
2023-08-21 Dataset Quantization Daquan Zhou et.al. 2308.10524 link
2023-08-21 PHE-SICH-CT-IDS: A Benchmark CT Image Dataset for Evaluation Semantic Segmentation, Object Detection and Radiomic Feature Extraction of Perihematomal Edema in Spontaneous Intracerebral Hemorrhage Deguo Ma et.al. 2308.10521 null
2023-08-21 SynDrone – Multi-modal UAV Dataset for Urban Scenarios Giulia Rizzoli et.al. 2308.10491 link
2023-08-21 CVFC: Attention-Based Cross-View Feature Consistency for Weakly Supervised Semantic Segmentation of Pathology Images Liangrui Pan et.al. 2308.10449 null
2023-08-20 Hyper Association Graph Matching with Uncertainty Quantification for Coronary Artery Semantic Labeling Chen Zhao et.al. 2308.10320 null
2023-08-20 Efficient-VRNet: An Exquisite Fusion Network for Riverway Panoptic Perception based on Asymmetric Fair Fusion of Vision and 4D mmWave Radar Runwei Guan et.al. 2308.10287 link
2023-08-20 EDDense-Net: Fully Dense Encoder Decoder Network for Joint Segmentation of Optic Cup and Disc Mehwish Mehmood et.al. 2308.10192 null
2023-08-19 Anomaly-Aware Semantic Segmentation via Style-Aligned OoD Augmentation Dan Zhang et.al. 2308.09965 null
2023-08-19 Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos Rui Qian et.al. 2308.09951 link
2023-08-18 ResQ: Residual Quantization for Video Perception Davide Abati et.al. 2308.09511 null
2023-08-18 Metadata Improves Segmentation Through Multitasking Elicitation Iaroslav Plutenko et.al. 2308.09411 link
2023-08-18 Single Frame Semantic Segmentation Using Multi-Modal Spherical Images Suresh Guttikonda et.al. 2308.09369 link
2023-08-18 Retro-FPN: Retrospective Feature Pyramid Network for Point Cloud Semantic Segmentation Peng Xiang et.al. 2308.09314 link
2023-08-18 A review of technical factors to consider when designing neural networks for semantic segmentation of Earth Observation imagery Sam Khallaghi et.al. 2308.09221 null
2023-08-16 ECPC-IDS:A benchmark endometrail cancer PET/CT image dataset for evaluation of semantic segmentation and detection of hypermetabolic regions Dechao Tang et.al. 2308.08313 null
2023-08-16 MEDOE: A Multi-Expert Decoder and Output Ensemble Framework for Long-tailed Semantic Segmentation Junao Shen et.al. 2308.08213 null
2023-08-16 AATCT-IDS: A Benchmark Abdominal Adipose Tissue CT Image Dataset for Image Denoising, Semantic Segmentation, and Radiomics Evaluation Zhiyu Ma et.al. 2308.08172 null
2023-08-15 Future Video Prediction from a Single Frame for Video Anomaly Detection Mohammad Baradaran et.al. 2308.07783 null
2023-08-15 Graph-Segmenter: Graph Transformer with Boundary-aware Attention for Semantic Segmentation Zizhang Wu et.al. 2308.07592 null
2023-08-15 Confidence Contours: Uncertainty-Aware Annotation for Medical Semantic Segmentation Andre Ye et.al. 2308.07528 link
2023-08-14 SAM Meets Robotic Surgery: An Empirical Study on Generalization, Robustness and Adaptation An Wang et.al. 2308.07156 null
2023-08-14 ICPC: Instance-Conditioned Prompting with Contrastive Learning for Semantic Segmentation Chaohui Yu et.al. 2308.07078 null
2023-08-14 A One Stop 3D Target Reconstruction and multilevel Segmentation Method Jiexiong Xu et.al. 2308.06974 link
2023-08-14 Towards Open-Set Test-Time Adaptation Utilizing the Wisdom of Crowds in Entropy Minimization Jungsoo Lee et.al. 2308.06879 null
2023-08-12 LadleNet: Translating Thermal Infrared Images to Visible Light Images Using A Scalable Two-stage U-Net Tonghui Zou et.al. 2308.06603 link
2023-08-12 BEV-DG: Cross-Modal Learning under Bird’s-Eye View for Domain Generalization of 3D Semantic Segmentation Miaoyu Li et.al. 2308.06530 null
2023-08-12 Seed Feature Maps-based CNN Models for LEO Satellite Remote Sensing Services Zhichao Lu et.al. 2308.06515 null
2023-08-11 R2S100K: Road-Region Segmentation Dataset For Semi-Supervised Autonomous Driving in the Wild Muhammad Atif Butt et.al. 2308.06393 null
2023-08-11 Defensive Perception: Estimation and Monitoring of Neural Network Performance under Deployment Hendrik Vogt et.al. 2308.06299 null
2023-08-11 Physical Adversarial Attacks For Camera-based Smart Systems: Current Trends, Categorization, Applications, Research Challenges, and Future Outlook Amira Guesmi et.al. 2308.06173 null
2023-08-11 DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models Weijia Wu et.al. 2308.06160 link
2023-08-11 Spatial-information Guided Adaptive Context-aware Network for Efficient RGB-D Semantic Segmentation Yang Zhang et.al. 2308.06024 link
2023-08-11 FoodSAM: Any Food Segmentation Xing Lan et.al. 2308.05938 link
2023-08-11 Semantic-embedded Similarity Prototype for Scene Recognition Chuanxin Song et.al. 2308.05896 null
2023-08-10 SegDA: Maximum Separable Segment Mask with Pseudo Labels for Domain Adaptive Semantic Segmentation Anant Khandelwal et.al. 2308.05851 null
2023-08-10 DiLogics: Creating Web Automation Programs With Diverse Logics Kevin Pu et.al. 2308.05828 null
2023-08-10 Masked Diffusion as Self-supervised Representation Learner Zixuan Pan et.al. 2308.05695 link
2023-08-10 Category Feature Transformer for Semantic Segmentation Quan Tang et.al. 2308.05581 link
2023-08-10 Look at the Neighbor: Distortion-aware Unsupervised Domain Adaptation for Panoramic Semantic Segmentation Xu Zheng et.al. 2308.05493 null
2023-08-10 Deep Semantic Graph Matching for Large-scale Outdoor Point Clouds Registration Shaocong Liu et.al. 2308.05314 null
2023-08-09 SegMatch: A semi-supervised learning method for surgical instrument segmentation Meng Wei et.al. 2308.05232 null
2023-08-10 Prototypical Kernel Learning and Open-set Foreground Perception for Generalized Few-shot Semantic Segmentation Kai Huang et.al. 2308.04952 null
2023-08-09 Branches Mutual Promotion for End-to-End Weakly Supervised Semantic Segmentation Lei Zhu et.al. 2308.04949 null
2023-08-09 MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation Kaixin Cai et.al. 2308.04829 null
2023-08-09 Continual Road-Scene Semantic Segmentation via Feature-Aligned Symmetric Multi-Modal Network Francesco Barbato et.al. 2308.04702 null
2023-08-08 Semi-Supervised Semantic Segmentation of Cell Nuclei via Diffusion-based Large-Scale Pre-Training and Collaborative Learning Zhuchen Shao et.al. 2308.04578 null
2023-08-08 All-pairs Consistency Learning for Weakly Supervised Semantic Segmentation Weixuan Sun et.al. 2308.04321 link
2023-08-08 AICSD: Adaptive Inter-Class Similarity Distillation for Semantic Segmentation Amir M. Mansourian et.al. 2308.04243 link
2023-08-08 PAIF: Perception-Aware Infrared-Visible Image Fusion for Attack-Tolerant Semantic Segmentation Zhu Liu et.al. 2308.03979 link
2023-08-07 FeatEnHancer: Enhancing Hierarchical Features for Object Detection and Beyond Under Low-Light Vision Khurram Azeem Hashmi et.al. 2308.03594 link
2023-08-11 DiT: Efficient Vision Transformers with Dynamic Token Routing Yuchen Ma et.al. 2308.03409 link
2023-08-06 Syn-Mediverse: A Multimodal Synthetic Dataset for Intelligent Scene Understanding of Healthcare Facilities Rohit Mohan et.al. 2308.03193 null
2023-08-06 High-Resolution Vision Transformers for Pixel-Level Identification of Structural Components and Damage Kareem Eltouny et.al. 2308.03006 null
2023-08-06 MCTformer+: Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation Lian Xu et.al. 2308.03005 link
2023-08-06 Cal-SFDA: Source-Free Domain-adaptive Semantic Segmentation with Differentiable Expected Calibration Error Zixin Wang et.al. 2308.03003 link
2023-08-05 Cross-modal & Cross-domain Learning for Unsupervised LiDAR Semantic Segmentation Yiyang Chen et.al. 2308.02883 null
2023-08-05 NP-SemiSeg: When Neural Processes meet Semi-Supervised Semantic Segmentation Jianfeng Wang et.al. 2308.02866 link
2023-08-05 Few-shot Class-Incremental Semantic Segmentation via Pseudo-Labeling and Knowledge Distillation Chengjia Jiang et.al. 2308.02790 link
2023-08-04 Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP Qihang Yu et.al. 2308.02487 link
2023-08-04 Frustratingly Easy Model Generalization by Dummy Risk Minimization Juncheng Wang et.al. 2308.02287 null
2023-08-04 On the Calibration of Uncertainty Estimation in LiDAR-based Semantic Segmentation Mariella Dreissig et.al. 2308.02248 null
2023-08-04 Deep Semantic Model Fusion for Ancient Agricultural Terrace Detection Yi Wang et.al. 2308.02225 link
2023-08-04 ES-MVSNet: Efficient Framework for End-to-end Self-supervised Multi-View Stereo Qiang Zhou et.al. 2308.02191 null
2023-08-04 Synthetic outlier generation for anomaly detection in autonomous driving Martin Bikandi et.al. 2308.02184 null
2023-08-04 Semantics-guided Transformer-based Sensor Fusion for Improved Waypoint Prediction Hwan-Soo Choi et.al. 2308.02126 link
2023-08-04 Rethinking Class Activation Maps for Segmentation: Revealing Semantic Information in Shallow Layers by Reducing Noise Hang-Cheng Dong et.al. 2308.02118 null
2023-08-03 Dynamic Token-Pass Transformers for Semantic Segmentation Yuang Liu et.al. 2308.01944 null
2023-08-03 LiDAR-Camera Panoptic Segmentation via Geometry-Consistent and Semantic-Aware Alignment Zhiwei Zhang et.al. 2308.01686 link
2023-08-03 Assessing Systematic Weaknesses of DNNs using Counterfactuals Sujan Sai Gannamaneni et.al. 2308.01614 null
2023-08-03 Target-point Attention Transformer: A novel trajectory predict network for end-to-end autonomous driving Jingyu Du et.al. 2308.01496 null
2023-08-02 DiffusePast: Diffusion-based Generative Replay for Class Incremental Semantic Segmentation Jingfan Chen et.al. 2308.01127 null
2023-08-02 Dynamic Token Pruning in Plain Vision Transformers for Semantic Segmentation Quan Tang et.al. 2308.01045 null
2023-08-02 Training-Free Instance Segmentation from Semantic Image Segmentation Masks Yuchen Shen et.al. 2308.00949 link
2023-08-01 MonoNext: A 3D Monocular Object Detection with ConvNext Marcelo Eduardo Pederiva et.al. 2308.00596 null
2023-08-01 A Satellite Imagery Dataset for Long-Term Sustainable Development in United States Cities Yanxin Xi et.al. 2308.00465 link
2023-08-01 Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding Runyu Ding et.al. 2308.00353 null
2023-08-01 Improving Pixel-based MIM by Reducing Wasted Modeling Capability Yuan Liu et.al. 2308.00261 link
2023-07-31 Multispectral Image Segmentation in Agriculture: A Comprehensive Study on Fusion Approaches Nuno Cunha et.al. 2308.00159 link
2023-07-29 A 3D deep learning classifier and its explainability when assessing coronary artery disease Wing Keung Cheung et.al. 2308.00009 null
2023-08-02 Universal Adversarial Defense in Remote Sensing Based on Pre-trained Denoising Diffusion Models Weikang Yu et.al. 2307.16865 link
2023-07-31 Transferable Attack for Semantic Segmentation Mengqi He et.al. 2307.16572 link
2023-07-29 CMDA: Cross-Modality Domain Adaptation for Nighttime Semantic Segmentation Ruihao Xia et.al. 2307.15942 link
2023-07-28 OAFuser: Towards Omni-Aperture Fusion for Light Field Semantic Segmentation of Road Scenes Fei Teng et.al. 2307.15588 link
2023-07-27 To Adapt or Not to Adapt? Real-Time Adaptation for Semantic Segmentation Marc Botet Colomer et.al. 2307.15063 link
2023-07-31 pCTFusion: Point Convolution-Transformer Fusion with Semantic Aware Loss for Outdoor LiDAR Point Cloud Segmentation Abhishek Kuriyal et.al. 2307.14777 link
2023-07-27 GenCo: An Auxiliary Generator from Contrastive Learning for Enhanced Few-Shot Learning in Remote Sensing Jing Wu et.al. 2307.14612 null
2023-07-27 MCPA: Multi-scale Cross Perceptron Attention Network for 2D Medical Image Segmentation Liang Xu et.al. 2307.14588 link
2023-07-26 Self-supervised Few-shot Learning for Semantic Segmentation: An Annotation-free Approach Sanaz Karimijafarbigloo et.al. 2307.14446 link
2023-07-26 Fluorescent Neuronal Cells v2: Multi-Task, Multi-Format Annotations for Deep Learning in Microscopy Luca Clissa et.al. 2307.14243 null
2023-07-26 Resolution-Aware Design of Atrous Rates for Semantic Segmentation Networks Bum Jun Kim et.al. 2307.14179 null
2023-07-27 Pre-Training with Diffusion models for Dental Radiography segmentation Jérémy Rousseau et.al. 2307.14066 null
2023-07-31 Causal reasoning in typical computer vision tasks Kexuan Zhang et.al. 2307.13992 null
2023-07-26 Topology-aware Robust Optimization for Out-of-distribution Generalization Fengchun Qiao et.al. 2307.13943 link
2023-07-26 Improving Semi-Supervised Semantic Segmentation with Dual-Level Siamese Structure Network Zhibo Tain et.al. 2307.13938 link
2023-07-25 Optical Flow boosts Unsupervised Localization and Segmentation Xinyu Zhang et.al. 2307.13640 link
2023-07-25 Fashion Matrix: Editing Photos by Just Talking Zheng Chong et.al. 2307.13240 link
2023-07-25 Image Segmentation Keras : Implementation of Segnet, FCN, UNet, PSPNet and other models in Keras Divam Gupta et.al. 2307.13215 link
2023-07-24 Compact & Capable: Harnessing Graph Neural Networks and Edge Convolution for Medical Image Classification Aryan Singh et.al. 2307.12790 link
2023-07-24 CarPatch: A Synthetic Benchmark for Radiance Field Evaluation on Vehicle Components Davide Di Nucci et.al. 2307.12718 null
2023-07-24 MC-JEPA: A Joint-Embedding Predictive Architecture for Self-Supervised Learning of Motion and Content Features Adrien Bardes et.al. 2307.12698 null
2023-07-24 Damage Vision Mining Opportunity for Imbalanced Anomaly Detection Takato Yasuno et.al. 2307.12676 null
2023-07-24 PRIOR: Prototype Representation Joint Learning from Medical Images and Reports Pujin Cheng et.al. 2307.12577 link
2023-07-24 A Good Student is Cooperative and Reliable: CNN-Transformer Collaborative Learning for Semantic Segmentation Jinjing Zhu et.al. 2307.12574 null
2023-07-23 EnTri: Ensemble Learning with Tri-level Representations for Explainable Scene Recognition Amirhossein Aminimehr et.al. 2307.12442 null
2023-07-23 ComPtr: Towards Diverse Bi-source Dense Prediction Tasks via A Simple yet General Complementary Transformer Youwei Pang et.al. 2307.12349 link
2023-07-22 Morphology-inspired Unsupervised Gland Segmentation via Selective Semantic Grouping Qixiang Zhang et.al. 2307.11989 link
2023-07-25 CORE: Cooperative Reconstruction for Multi-Agent Perception Binglu Wang et.al. 2307.11514 link
2023-07-21 SA-BEV: Generating Semantic-Aware Bird’s-Eye-View Feature for Multi-view 3D Object Detection Jinqing Zhang et.al. 2307.11477 link
2023-07-20 Spinal nerve segmentation method and dataset construction in endoscopic surgical scenarios Shaowu Peng et.al. 2307.10955 link
2023-07-20 Label Calibration for Semantic Segmentation Under Domain Shift Ondrej Bohdal et.al. 2307.10842 null
2023-07-20 Gradient-Semantic Compensation for Incremental Semantic Segmentation Wei Cong et.al. 2307.10822 null
2023-07-22 TwinLiteNet: An Efficient and Lightweight Model for Driveable Area and Lane Segmentation in Self-Driving Cars Quang Huy Che et.al. 2307.10705 link
2023-07-19 CPCM: Contextual Point Cloud Modeling for Weakly-supervised Point Cloud Semantic Segmentation Lizhao Liu et.al. 2307.10316 link
2023-07-18 Towards Automated Semantic Segmentation in Mammography Images Cesar A. Sierra-Franco et.al. 2307.10296 null
2023-07-17 On the Real-Time Semantic Segmentation of Aphid Clusters in the Wild Raiyan Rahman et.al. 2307.10267 null
2023-07-19 Boundary-Refined Prototype Generation: A General End-to-End Paradigm for Semi-Supervised Semantic Segmentation Junhao Dong et.al. 2307.10097 link
2023-07-19 U-CE: Uncertainty-aware Cross-Entropy for Semantic Segmentation Steven Landgraf et.al. 2307.09947 null
2023-07-19 Space Engage: Collaborative Space Supervision for Contrastive-based Semi-Supervised Semantic Segmentation Changqi Wang et.al. 2307.09755 null
2023-07-19 ClickSeg: 3D Instance Segmentation with Click-Level Weak Annotations Leyao Liu et.al. 2307.09732 null
2023-07-14 LEST: Large-scale LiDAR Semantic Segmentation with Transformer Chuanyu Luo et.al. 2307.09367 null
2023-07-19 Disentangle then Parse:Night-time Semantic Segmentation with Illumination Disentanglement Zhixiang Wei et.al. 2307.09362 link
2023-07-18 MarS3D: A Plug-and-Play Motion-Aware Model for Semantic Segmentation on Multi-Scan 3D Point Clouds Jiahui Liu et.al. 2307.09316 link
2023-07-18 CG-fusion CAM: Online segmentation of laser-induced damage on large-aperture optics Yueyue Han et.al. 2307.09161 null
2023-07-18 Mining of Single-Class by Active Learning for Semantic Segmentation Hugues Lambert et.al. 2307.09109 null
2023-07-18 EgoVM: Achieving Precise Ego-Localization using Lightweight Vectorized Maps Yuzhe He et.al. 2307.08991 null
2023-07-19 Similarity Min-Max: Zero-Shot Day-Night Domain Adaptation Rundong Luo et.al. 2307.08779 null
2023-07-17 A Nested U-Structure for Instrument Segmentation in Robotic Surgery Yanjie Xia et.al. 2307.08630 null
2023-07-17 Scale-Aware Modulation Meet Transformer Weifeng Lin et.al. 2307.08579 link
2023-07-17 Variational Probabilistic Fusion Network for RGB-T Semantic Segmentation Baihong Lin et.al. 2307.08536 null
2023-07-17 On Point Affiliation in Feature Upsampling Wenze Liu et.al. 2307.08198 link
2023-07-16 HRHD-HK: A benchmark dataset of high-rise and high-density urban scenes for 3D semantic segmentation of photogrammetric point clouds Maosu Li et.al. 2307.07976 link
2023-07-16 Dual-level Interaction for Domain Adaptive Semantic Segmentation Dongyu Yao et.al. 2307.07972 link
2023-07-15 Improving Translation Invariance in Convolutional Neural Networks with Peripheral Prediction Padding Kensuke Mukai et.al. 2307.07725 null
2023-07-15 PSGformer: Enhancing 3D Point Cloud Instance Segmentation via Precise Semantic Guidance Lei Pan et.al. 2307.07708 null
2023-07-14 A scoping review on multimodal deep learning in biomedical images and texts Zhaoyi Sun et.al. 2307.07362 null
2023-07-14 Inverse Evolution Layers: Physics-informed Regularizers for Deep Neural Networks Chaoyu Liu et.al. 2307.07344 null
2023-07-14 HEAL-SWIN: A Vision Transformer On The Sphere Oscar Carlsson et.al. 2307.07313 link
2023-07-14 Adaptive Region Selection for Active Learning in Whole Slide Image Semantic Segmentation Jingna Qiu et.al. 2307.07168 link
2023-07-13 YOLIC: An Efficient Method for Object Localization and Classification on Edge Devices Kai Su et.al. 2307.06689 link
2023-07-13 WaterScenes: A Multi-Task 4D Radar-Camera Fusion Dataset and Benchmark for Autonomous Driving on Water Surfaces Shanliang Yao et.al. 2307.06505 link
2023-07-12 Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution Mostafa Dehghani et.al. 2307.06304 null
2023-07-12 OG: Equip vision occupancy with instance segmentation and visual grounding Zichao Dong et.al. 2307.05873 null
2023-07-11 Automatic Generation of Semantic Parts for Face Image Synthesis Tomaso Fontanini et.al. 2307.05317 link
2023-07-11 Estimating label quality and errors in semantic segmentation data via any model Vedang Lad et.al. 2307.05080 link
2023-07-10 Test-Time Adaptation for Nighttime Color-Thermal Semantic Segmentation Yexin Liu et.al. 2307.04470 null
2023-07-10 Stroke Extraction of Chinese Character Based on Deep Structure Deformable Image Registration Meng Li et.al. 2307.04341 link
2023-07-09 Mx2M: Masked Cross-Modality Modeling in Domain Adaptation for 3D Semantic Segmentation Boxiang Zhang et.al. 2307.04231 null
2023-07-11 Parametric Depth Based Feature Representation Learning for Object Detection and Segmentation in Bird’s Eye View Jiayu Yang et.al. 2307.04106 null
2023-07-09 Enhancing Building Semantic Segmentation Accuracy with Super Resolution and Deep Learning: Investigating the Impact of Spatial Resolution on Various Datasets Zhiling Guo et.al. 2307.04101 null
2023-07-09 CMDFusion: Bidirectional Fusion Network with Cross-modality Knowledge Distillation for LIDAR Semantic Segmentation Jun Cen et.al. 2307.04091 link
2023-07-08 Building and Road Segmentation Using EffUNet and Transfer Learning Approach Sahil Gangurde et.al. 2307.03980 null
2023-07-07 Tranfer Learning of Semantic Segmentation Methods for Identifying Buried Archaeological Structures on LiDAR Data Paolo Soleni et.al. 2307.03512 null
2023-07-07 Large AI Model-Based Semantic Communications Feibo Jiang et.al. 2307.03492 null
2023-07-07 A Deep Active Contour Model for Delineating Glacier Calving Fronts Konrad Heidler et.al. 2307.03461 null
2023-07-07 General-Purpose Multimodal Transformer meets Remote Sensing Semantic Segmentation Nhi Kieu et.al. 2307.03388 link
2023-07-06 To pretrain or not to pretrain? A case study of domain-specific pretraining for semantic segmentation in histopathology Tushar Kataria et.al. 2307.03275 link
2023-07-10 Art Authentication with Vision Transformers Ludovica Schaerf et.al. 2307.03039 null
2023-07-05 Spherical Feature Pyramid Networks For Semantic Segmentation Thomas Walker et.al. 2307.02658 null
2023-07-05 AxonCallosumEM Dataset: Axon Semantic Segmentation of Whole Corpus Callosum cross section from EM Images Ao Cheng et.al. 2307.02464 null
2023-07-05 RADiff: Controllable Diffusion Models for Radio Astronomical Maps Generation Renato Sortino et.al. 2307.02392 null
2023-07-05 Prompting Diffusion Representations for Cross-Domain Semantic Segmentation Rui Gong et.al. 2307.02138 null
2023-07-05 Line Graphics Digitization: A Step Towards Full Automation Omar Moured et.al. 2307.02065 link
2023-07-05 Multi-Modal Prototypes for Open-Set Semantic Segmentation Yuhuan Yang et.al. 2307.02003 null
2023-07-05 The KiTS21 Challenge: Automatic segmentation of kidneys, renal tumors, and renal cysts in corticomedullary-phase CT Nicholas Heller et.al. 2307.01984 link
2023-07-04 Augment Features Beyond Color for Domain Generalized Segmentation Qiyu Sun et.al. 2307.01703 null
2023-07-04 Exploiting Richness of Learned Compressed Representation of Images for Semantic Segmentation Ravi Kakaiya et.al. 2307.01524 null
2023-07-04 Semantic Segmentation on 3D Point Clouds with High Density Variations Ryan Faulkner et.al. 2307.01489 null
2023-07-03 MeT: A Graph Transformer for Semantic Segmentation of 3D Meshes Giuseppe Vecchio et.al. 2307.01115 null
2023-07-03 TomatoDIFF: On-plant Tomato Segmentation with Denoising Diffusion Models Marija Ivanovska et.al. 2307.01064 link
2023-07-03 DifFSS: Diffusion Model for Few-Shot Semantic Segmentation Weimin Tan et.al. 2307.00773 link
2023-07-03 Hierarchical Open-vocabulary Universal Image Segmentation Xudong Wang et.al. 2307.00764 link
2023-07-02 Intra- & Extra-Source Exemplar-Based Style Synthesis for Improved Domain Generalization Yumeng Li et.al. 2307.00648 link
2023-07-01 Learning Content-enhanced Mask Transformer for Domain Generalized Urban-Scene Segmentation Qi Bi et.al. 2307.00371 link
2023-07-01 SyMFM6D: Symmetry-aware Multi-directional Fusion for Multi-View 6D Object Pose Estimation Fabian Duffhauss et.al. 2307.00306 link
2023-07-01 Efficient Subclass Segmentation in Medical Images Linrui Dai et.al. 2307.00257 link
2023-07-01 Internal-External Boundary Attention Fusion for Glass Surface Segmentation Dongshen Han et.al. 2307.00212 null
2023-06-30 Obscured Wildfire Flame Detection By Temporal Analysis of Smoke Patterns Captured by Unmanned Aerial Systems Uma Meleti et.al. 2307.00104 null
2023-06-30 Prompting classes: Exploring the Power of Prompt Class Learning in Weakly Supervised Semantic Segmentation Balamurali Murugesan et.al. 2307.00097 link
2023-06-30 Achieving RGB-D level Segmentation Performance from a Single ToF Camera Pranav Sharma et.al. 2306.17636 null
2023-06-28 Analysis of LiDAR Configurations on Off-road Semantic Segmentation Performance Jinhee Yu et.al. 2306.16551 null
2023-06-28 Land Cover Segmentation with Sparse Annotations from Sentinel-2 Imagery Marco Galatola et.al. 2306.16252 link
2023-07-03 GraSS: Contrastive Learning with Gradient Guided Sampling Strategy for Remote Sensing Image Semantic Segmentation Zhaoyang Zhang et.al. 2306.15868 link
2023-06-27 What a MESS: Multi-Domain Evaluation of Zero-Shot Semantic Segmentation Benedikt Blumenstiel et.al. 2306.15521 link
2023-06-27 Enhancing Navigation Benchmarking and Perception Data Generation for Row-based Crops in Simulation Mauro Martini et.al. 2306.15517 null
2023-06-27 SSC-RS: Elevate LiDAR Semantic Scene Completion with Representation Separation and BEV Fusion Jianbiao Mei et.al. 2306.15349 link
2023-06-27 Hierarchical Dense Correlation Distillation for Few-Shot Segmentation-Extended Abstract Bohao Peng et.al. 2306.15278 null
2023-06-27 Semantic Segmentation Using Super Resolution Technique as Pre-Processing Chih-Chia Chen et.al. 2306.15218 null
2023-06-28 MIMIC: Masked Image Modeling with Image Correspondences Kalyani Marathe et.al. 2306.15128 link
2023-06-26 Localized Text-to-Image Generation for Free via Cross Attention Control Yutong He et.al. 2306.14636 null
2023-06-26 AME-CAM: Attentive Multiple-Exit CAM for Weakly Supervised Segmentation on MRI Brain Tumor Yu-Jen Chen et.al. 2306.14505 link
2023-06-25 On Evaluating the Adversarial Robustness of Semantic Segmentation Models Levente Halmosi et.al. 2306.14217 null
2023-06-25 The Second-place Solution for CVPR VISION 23 Challenge Track 1 – Data Effificient Defect Detection Xian Tao et.al. 2306.14116 link
2023-06-25 When SAM Meets Sonar Images Lin Wang et.al. 2306.14109 link
2023-06-24 Semantic Segmentation of Porosity in 4D Spatio-Temporal X-ray μCT of Titanium Coated Ni wires using Deep Learning Pradyumna Elavarthi et.al. 2306.14039 null
2023-06-23 OpenMask3D: Open-Vocabulary 3D Instance Segmentation Ayça Takmaz et.al. 2306.13631 link
2023-06-23 3DSAM-adapter: Holistic Adaptation of SAM from 2D to 3D for Promptable Medical Image Segmentation Shizhan Gong et.al. 2306.13465 link
2023-06-22 Robust Semantic Segmentation: Strong Adversarial Attacks and Fast Training of Robust Models Francesco Croce et.al. 2306.12941 link
2023-06-21 Multi-Task Consistency for Active Learning Aral Hekimoglu et.al. 2306.12398 null
2023-06-20 No Wrong Turns: The Simple Geometry Of Neural Networks Optimization Paths Charles Guille-Escuret et.al. 2306.11922 link
2023-06-20 Using super-resolution for enhancing visual perception and segmentation performance in veterinary cytology Jakub Caputa et.al. 2306.11848 null
2023-06-26 Hyperbolic Active Learning for Semantic Segmentation under Domain Shift Luca Franco et.al. 2306.11180 link
2023-06-19 Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation Shuting He et.al. 2306.11087 link
2023-06-19 A spatio-temporal network for video semantic segmentation in surgical videos Maria Grammatikopoulou et.al. 2306.11052 null
2023-06-18 Balanced Energy Regularization Loss for Out-of-distribution Detection Hyunjun Choi et.al. 2306.10485 link
2023-06-17 Residual Spatial Fusion Network for RGB-Thermal Semantic Segmentation Ping Li et.al. 2306.10364 null
2023-06-17 Benchmarking Deep Learning Architectures for Urban Vegetation Points Segmentation Aditya et.al. 2306.10274 null
2023-06-16 ALP: Action-Aware Embodied Learning for Perception Xinran Liang et.al. 2306.10190 null
2023-06-16 Enhancing Visual Domain Adaptation with Source Preparation Anirudha Ramesh et.al. 2306.10142 null
2023-06-16 PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation Yuqi Wang et.al. 2306.10013 link
2023-06-15 SSL4EO-L: Datasets and Foundation Models for Landsat Imagery Adam J. Stewart et.al. 2306.09424 link
2023-06-15 Infinite Photorealistic Worlds using Procedural Generation Alexander Raistrick et.al. 2306.09310 link
2023-06-15 Neural World Models for Computer Vision Anthony Hu et.al. 2306.09179 null
2023-06-15 Contrast, Stylize and Adapt: Unsupervised Contrastive Learning Framework for Domain Adaptive Semantic Segmentation Tianyu Li et.al. 2306.09098 link
2023-06-15 A Self-Supervised Miniature One-Shot Texture Segmentation (MOSTS) Model for Real-Time Robot Navigation and Embedded Applications Yu Chen et.al. 2306.08814 link
2023-06-13 BPKD: Boundary Privileged Knowledge Distillation For Semantic Segmentation Liyang Liu et.al. 2306.08075 link
2023-06-13 Efficient 3D Semantic Segmentation with Superpoint Transformer Damien Robert et.al. 2306.08045 link
2023-06-13 Low-Resource White-Box Semantic Segmentation of Supporting Towers on 3D Point Clouds via Signature Shape Identification Diogo Lavado et.al. 2306.07809 null
2023-06-12 Video-to-Music Recommendation using Temporal Alignment of Segments Laure Prétet et.al. 2306.07187 null
2023-06-12 Volume-DROID: A Real-Time Implementation of Volumetric Mapping with DROID-SLAM Peter Stratton et.al. 2306.06850 link
2023-06-12 AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation Kashu Yamazaki et.al. 2306.06842 link
2023-06-11 3rd Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation Jinming Su et.al. 2306.06753 null
2023-06-09 SegViTv2: Exploring Efficient and Continual Semantic Segmentation with Plain Vision Transformers Bowen Zhang et.al. 2306.06289 link
2023-06-09 Data-Link: High Fidelity Manufacturing Datasets for Model2Real Transfer under Industrial Settings Sunny Katyara et.al. 2306.05766 null
2023-06-09 Illumination Controllable Dehazing Network based on Unsupervised Retinex Embedding Jie Gui et.al. 2306.05675 link
2023-06-08 A Novel Confidence Induced Class Activation Mapping for MRI Brain Tumor Segmentation Yu-Jen Chen et.al. 2306.05476 link
2023-06-08 Mesh-MLP: An all-MLP Architecture for Mesh Classification and Semantic Segmentation Qiujie Dong et.al. 2306.05246 null
2023-06-08 Unsupervised augmentation optimization for few-shot medical image segmentation Quan Quan et.al. 2306.05107 null
2023-06-08 Improving Visual Prompt Tuning for Self-supervised Vision Transformers Seungryong Yoo et.al. 2306.05067 link
2023-06-08 A Dynamic Feature Interaction Framework for Multi-task Visual Perception Yuling Xi et.al. 2306.05061 null
2023-06-08 Neighborhood Attention Makes the Encoder of ResUNet Stronger for Accurate Road Extraction Ali Jamali et.al. 2306.04947 link
2023-06-07 UniBoost: Unsupervised Unimodal Pre-training for Boosting Zero-shot Vision-Language Tasks Yanan Sun et.al. 2306.04715 null
2023-06-06 DenseDINO: Boosting Dense Self-Supervised Learning with Token-Based Point-Level Consistency Yike Yuan et.al. 2306.04654 null
2023-06-07 PhenoBench – A Large Dataset and Benchmarks for Semantic Image Interpretation in the Agricultural Domain Jan Weyler et.al. 2306.04557 link
2023-06-14 CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation Boyuan Sun et.al. 2306.04300 link
2023-06-07 Randomized 3D Scene Generation for Generalizable Self-supervised Pre-training Lanxiao Li et.al. 2306.04237 null
2023-06-06 Accurate Fine-Grained Segmentation of Human Anatomy in Radiographs via Volumetric Pseudo-Labeling Constantin Seibold et.al. 2306.03934 link
2023-06-06 Towards Label-free Scene Understanding by Vision Foundation Models Runnan Chen et.al. 2306.03899 link
2023-06-06 Conditional Diffusion Models for Weakly Supervised Medical Image Segmentation Xinrong Hu et.al. 2306.03878 link
2023-06-06 Single-Shot Global Localization via Graph-Theoretic Correspondence Matching Shigemichi Matsuzaki et.al. 2306.03641 null
2023-06-06 Semantic Segmentation on VSPW Dataset through Contrastive Loss and Multi-dataset Training Approach Min Yan et.al. 2306.03508 null
2023-06-08 DFormer: Diffusion-guided Transformer for Universal Image Segmentation Hefeng Wang et.al. 2306.03437 link
2023-06-06 SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation Xuewei Li et.al. 2306.03403 link
2023-06-05 Recyclable Semi-supervised Method Based on Multi-model Ensemble for Video Scene Parsing Biao Wu et.al. 2306.02894 null
2023-06-05 Learning from Multi-View Representation for Point-Cloud Pre-Training Siming Yan et.al. 2306.02558 null
2023-06-04 Using Unreliable Pseudo-Labels for Label-Efficient Semantic Segmentation Haochen Wang et.al. 2306.02314 null
2023-06-04 Cross-CBAM: A Lightweight network for Scene Segmentation Zhengbin Zhang et.al. 2306.02306 null
2023-06-06 3rd Place Solution for PVUW2023 VSS Track: A Large Model for Semantic Segmentation on VSPW Shijie Chang et.al. 2306.02291 link
2023-06-03 Content-aware Token Sharing for Efficient Semantic Segmentation with Vision Transformers Chenyang Lu et.al. 2306.02095 link
2023-06-03 Balancing Logit Variation for Long-tailed Semantic Segmentation Yuchao Wang et.al. 2306.02061 link
2023-06-03 Efficient Multi-Grained Knowledge Reuse for Class Incremental Segmentation Zhihe Lu et.al. 2306.02027 link
2023-06-02 Denoising Diffusion Semantic Segmentation with Mask Prior Modeling Zeqiang Lai et.al. 2306.01721 link
2023-06-02 Towards In-context Scene Understanding Ivana Balažević et.al. 2306.01667 null
2023-06-02 Towards Source-free Domain Adaptive Semantic Segmentation via Importance-aware and Prototype-contrast Learning Yihong Cao et.al. 2306.01598 link
2023-06-05 Robust and Generalisable Segmentation of Subtle Epilepsy-causing Lesions: a Graph Convolutional Approach Hannah Spitzer et.al. 2306.01375 link
2023-06-01 Geo-Tiles for Semantic Segmentation of Earth Observation Imagery Sebastian Bullinger et.al. 2306.00823 link
2023-06-01 Exploring Open-Vocabulary Semantic Segmentation without Human Labels Jun Chen et.al. 2306.00450 null
2023-05-31 Fast-SNN: Fast Spiking Neural Network by Converting Quantized ANN Yangfan Hu et.al. 2305.19868 link
2023-06-01 Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards Guian Fang et.al. 2305.19599 link
2023-05-30 TrueDeep: A systematic approach of crack detection with less data Ram Krishna Pandey et.al. 2305.19088 null
2023-05-28 Semantic Segmentation with Bidirectional Language Models Improves Long-form ASR W. Ronny Huang et.al. 2305.18419 null
2023-05-29 Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising Fu-Yun Wang et.al. 2305.18264 link
2023-05-29 Contrastive Learning Based Recursive Dynamic Multi-Scale Network for Image Deraining Zhiying Jiang et.al. 2305.18092 null
2023-05-29 CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion Models Zhongxi Chen et.al. 2305.17932 link
2023-05-27 Condition-Invariant Semantic Segmentation Christos Sakaridis et.al. 2305.17349 link
2023-05-26 SSSegmenation: An Open Source Supervised Semantic Segmentation Toolbox Based on PyTorch Zhenchao Jin et.al. 2305.17091 link
2023-05-26 Maskomaly:Zero-Shot Mask Anomaly Segmentation Jan Ackermann et.al. 2305.16972 null
2023-05-26 Semantic segmentation of sparse irregular point clouds for leaf/wood discrimination Yuchen Bai et.al. 2305.16963 link
2023-05-26 Localization under consistent assumptions over dynamics Matti Pekkanen et.al. 2305.16702 null
2023-05-25 GrowSP: Unsupervised Semantic Segmentation of 3D Point Clouds Zihui Zhang et.al. 2305.16404 link
2023-05-25 Making Vision Transformers Truly Shift-Equivariant Renan A. Rojas-Gomez et.al. 2305.16316 null
2023-05-25 Interactive Segment Anything NeRF with Feature Imitation Xiaokang Chen et.al. 2305.16233 null
2023-05-26 Energy-based Detection of Adverse Weather Effects in LiDAR Data Aldi Piroli et.al. 2305.16129 link
2023-05-25 DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification Sitian Shen et.al. 2305.15957 null
2023-05-25 Knowledge Diffusion for Distillation Tao Huang et.al. 2305.15712 link

image restoration

Publish Date Title Authors PDF Code
2025-06-29 Double-Diffusion: Diffusion Conditioned Diffusion Probabilistic Model For Air Quality Prediction Hanlin Dong et.al. 2506.23053 null
2025-06-27 EAMamba: Efficient All-Around Vision State Space Model for Image Restoration Yu-Cheng Lin et.al. 2506.22246 null
2025-06-26 Elucidating and Endowing the Diffusion Training Paradigm for General Image Restoration Xin Lu et.al. 2506.21722 null
2025-06-26 Wild refitting for black box prediction Martin J. Wainwright et.al. 2506.21460 null
2025-06-25 TDiR: Transformer based Diffusion for Image Restoration Tasks Abbas Anwar et.al. 2506.20302 null
2025-06-24 A Comparative Study of NAFNet Baselines for Image Restoration Vladislav Esaulov et.al. 2506.19845 null
2025-06-24 NAADA: A Noise-Aware Attention Denoising Autoencoder for Dental Panoramic Radiographs Khuram Naveed et.al. 2506.19387 null
2025-06-23 Enhancing Image Restoration Transformer via Adaptive Translation Equivariance JiaKui Hu et.al. 2506.18520 null
2025-06-23 BSMamba: Brightness and Semantic Modeling for Long-Range Interaction in Low-Light Image Enhancement Tongshun Zhang et.al. 2506.18346 null
2025-06-20 Reversing Flow for Image Restoration Haina Qin et.al. 2506.16961 null
2025-06-20 Visual-Instructed Degradation Diffusion for All-in-One Image Restoration Wenyang Luo et.al. 2506.16960 link
2025-06-23 RealSR-R1: Reinforcement Learning for Real-World Image Super-Resolution with Vision-Language Chain-of-Thought Junbo Qiao et.al. 2506.16796 link
2025-06-19 MoiréXNet: Adaptive Multi-Scale Demoiréing with Linear Attention Test-Time Training and Truncated Flow Matching Prior Liangyan Li et.al. 2506.15929 null
2025-06-16 ADAM-Dehaze: Adaptive Density-Aware Multi-Stage Dehazing for Improved Object Detection in Foggy Conditions Fatmah AlHindaassi et.al. 2506.15837 null
2025-06-17 Optimization-Based Image Restoration under Implementation Constraints in Optical Analog Circuits Taisei Kato et.al. 2506.14624 null
2025-06-17 Unsupervised Imaging Inverse Problems with Diffusion Distribution Matching Giacomo Meanti et.al. 2506.14605 link
2025-06-22 Exploring Diffusion with Test-Time Training on Efficient Image Restoration Rongchang Lu et.al. 2506.14541 null
2025-06-16 Exploiting the Exact Denoising Posterior Score in Training-Free Guidance of Diffusion Models Gregory Bellchambers et.al. 2506.13614 null
2025-06-15 Adaptive Dropout: Unleashing Dropout across Layers for Generalizable Image Super-Resolution Hang Xu et.al. 2506.12738 null
2025-06-14 UniDet-D: A Unified Dynamic Spectral Attention Model for Object Detection under Adverse Weathers Yuantao Wang et.al. 2506.12324 null
2025-06-10 Adaptive Object Detection with ESRGAN-Enhanced Resolution & Faster R-CNN Divya Swetha K et.al. 2506.11122 null
2025-06-11 Text-Aware Image Restoration with Diffusion Models Jaewon Min et.al. 2506.09993 null
2025-06-09 M2Restore: Mixture-of-Experts-based Mamba-CNN Fusion Framework for All-in-One Image Restoration Yongzhen Wang et.al. 2506.07814 null
2025-06-08 Multi-Step Guided Diffusion for Image Restoration on Edge Devices: Toward Lightweight Perception in Embodied AI Aditya Chakravarty et.al. 2506.07286 null
2025-06-08 A PDE-Based Image Restoration Method: Mathematical Analysis and Implementation Dragos-Patru Covei et.al. 2506.07132 null
2025-06-06 NTIRE 2025 Challenge on HR Depth from Images of Specular and Transparent Surfaces Pierluigi Zama Ramirez et.al. 2506.05815 null
2025-06-05 UniRes: Universal Image Restoration for Complex Degradations Mo Zhou et.al. 2506.05599 null
2025-06-05 SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training Jianyi Wang et.al. 2506.05301 null
2025-06-03 NTIRE 2025 XGC Quality Assessment Challenge: Methods and Results Xiaohong Liu et.al. 2506.02875 null
2025-06-03 ControlMambaIR: Conditional Controls with State-Space Model for Image Restoration Cheng Yang et.al. 2506.02633 null
2025-06-04 NTIRE 2025 Challenge on RAW Image Restoration and Super-Resolution Marcos V. Conde et.al. 2506.02197 null
2025-06-02 RAW Image Reconstruction from RGB on Smartphones. NTIRE 2025 Challenge Report Marcos V. Conde et.al. 2506.01947 null
2025-06-02 NTIRE 2025 the 2nd Restore Any Image Model (RAIM) in the Wild Challenge Jie Liang et.al. 2506.01394 null
2025-05-31 Image Restoration Learning via Noisy Supervision in the Fourier Domain Haosen Liu et.al. 2506.00564 null
2025-05-30 IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models Hanting Wang et.al. 2505.24406 link
2025-05-30 Boosting All-in-One Image Restoration via Self-Improved Privilege Learning Gang Wu et.al. 2505.24207 link
2025-05-29 Proximal Algorithm Unrolling: Flexible and Efficient Reconstruction Networks for Single-Pixel Imaging Ping Wang et.al. 2505.23180 link
2025-05-29 URWKV: Unified RWKV Model with Multi-state Perspective for Low-light Image Restoration Rui Xu et.al. 2505.23068 link
2025-05-29 EquiReg: Equivariance Regularized Diffusion for Inverse Problems Bahareh Tolooshams et.al. 2505.22973 null
2025-05-28 From Controlled Scenarios to Real-World: Cross-Domain Degradation Pattern Matching for All-in-One Image Restoration Junyu Fan et.al. 2505.22284 null
2025-05-28 Reference-Guided Identity Preserving Face Restoration Mo Zhou et.al. 2505.21905 null
2025-05-27 BaryIR: Learning Multi-Source Unified Representation in Continuous Barycenter Space for Generalizable All-in-One Image Restoration Xiaole Tang et.al. 2505.21637 null
2025-05-23 UniDB++: Fast Sampling of Unified Diffusion Bridge Mokai Pan et.al. 2505.21528 null
2025-05-28 PreP-OCR: A Complete Pipeline for Document Image Restoration and Enhanced OCR Accuracy Shuhao Guan et.al. 2505.20429 null
2025-05-26 A Regularization-Guided Equivariant Approach for Image Restoration Yulu Bai et.al. 2505.19799 link
2025-05-25 Benchmarking Laparoscopic Surgical Image Restoration and Beyond Jialun Pei et.al. 2505.19161 link
2025-05-25 Freqformer: Image-Demoiréing Transformer via Efficient Frequency Decomposition Xiaoyang Liu et.al. 2505.19120 link
2025-05-24 Manifold-aware Representation Learning for Degradation-agnostic Image Restoration Bin Ren et.al. 2505.18679 null
2025-05-23 RestoreVAR: Visual Autoregressive Generation for All-in-One Image Restoration Sudarshan Rajagopalan et.al. 2505.18047 null
2025-05-23 MODEM: A Morton-Order Degradation Estimation Mechanism for Adverse Weather Image Recovery Hainuo Wang et.al. 2505.17581 link
2025-05-23 Dual Ascent Diffusion for Inverse Problems Minseo Kim et.al. 2505.17353 null
2025-05-22 Forward-only Diffusion Probabilistic Models Ziwei Luo et.al. 2505.16733 link
2025-05-22 Clear Nights Ahead: Towards Multi-Weather Nighttime Image Restoration Yuetong Liu et.al. 2505.16479 null
2025-05-22 NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment Shuhao Han et.al. 2505.16314 null
2025-05-22 Deep Learning-Driven Ultra-High-Definition Image Restoration: A Survey Liyan Wang et.al. 2505.16161 link
2025-05-22 Breaking Complexity Barriers: High-Resolution Image Restoration with Rank Enhanced Linear Attention Yuang Ai et.al. 2505.16157 null
2025-05-22 Continuous Representation Methods, Theories, and Applications: An Overview and Perspectives Yisi Luo et.al. 2505.15222 link
2025-05-20 UHD Image Dehazing via anDehazeFormer with Atmospheric-aware KV Cache Pu Wang et.al. 2505.14010 null
2025-05-19 Adaptive Image Restoration for Video Surveillance: A Real-Time Approach Muhammad Awais Amin et.al. 2505.13130 null
2025-05-19 LatentINDIGO: An INN-Guided Latent Diffusion Algorithm for Image Restoration Di You et.al. 2505.12935 null
2025-05-19 Towards a Universal Image Degradation Model via Content-Degradation Disentanglement Wenbo Yang et.al. 2505.12860 null
2025-05-19 Degradation-Aware Feature Perturbation for All-in-One Image Restoration Xiangpeng Tian et.al. 2505.12630 link
2025-05-18 Trustworthy Image Super-Resolution via Generative Pseudoinverse Andreas Floros et.al. 2505.12375 link
2025-05-20 Diff-Unfolding: A Model-Based Score Learning Framework for Inverse Problems Yuanhao Wang et.al. 2505.11393 null
2025-05-15 torchmfbd: a flexible multi-object multi-frame blind deconvolution code A. Asensio Ramos et.al. 2505.10639 link
2025-05-13 Behind the Noise: Conformal Quantile Regression Reveals Emergent Representations Petrus H. Zwart et.al. 2505.08176 null
2025-05-12 Image Restoration via Integration of Optimal Control Techniques and the Hamilton-Jacobi-Bellman Equation Dragos-Patru Covei et.al. 2505.07699 null
2025-05-12 Generalizable Pancreas Segmentation via a Dual Self-Supervised Learning Framework Jun Li et.al. 2505.07165 null
2025-05-10 UnfoldIR: Rethinking Deep Unfolding Network in Illumination Degradation Image Restoration Chunming He et.al. 2505.06683 null
2025-05-17 A Preliminary Study for GPT-4o on Image Restoration Hao Yang et.al. 2505.05621 link
2025-05-07 Image Restoration via Multi-domain Learning Xingyu Jiang et.al. 2505.05504 link
2025-05-08 SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation Yonwoo Choi et.al. 2505.05475 link
2025-05-08 EAM: Enhancing Anything with Diffusion Transformers for Blind Super-Resolution Haizhen Xie et.al. 2505.05209 null
2025-05-03 Multi-Scale Target-Aware Representation Learning for Fundus Image Enhancement Haofan Wu et.al. 2505.01831 null
2025-05-02 Deblurring fission fragment mass distributions Pierre Nzabahimana et.al. 2505.01294 null
2025-05-01 GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution Aditya Arora et.al. 2505.00687 null
2025-05-08 DGSolver: Diffusion Generalist Solver with Universal Posterior Sampling for Image Restoration Hebaixu Wang et.al. 2504.21487 link
2025-04-27 Marine Snow Removal Using Internally Generated Pseudo Ground Truth Alexandra Malyugina et.al. 2504.19289 null
2025-04-27 Rendering Anywhere You See: Renderability Field-guided Gaussian Splatting Xiaofeng Jin et.al. 2504.19261 null
2025-04-24 Dual Prompting Image Restoration with Diffusion Transformers Dehong Kong et.al. 2504.17825 null
2025-04-24 DPMambaIR:All-in-One Image Restoration via Degradation-Aware Prompt State Space Model Zhanwen Liu et.al. 2504.17732 null
2025-04-24 Inverse-Designed Metasurfaces for Wavefront Restoration in Under-Display Camera Systems Jaegang Jo et.al. 2504.17368 null
2025-04-24 I-INR: Iterative Implicit Neural Representations Ali Haider et.al. 2504.17364 null
2025-04-23 RouteWinFormer: A Route-Window Transformer for Middle-range Attention in Image Restoration Qifan Li et.al. 2504.16637 null
2025-04-23 Cross Paradigm Representation and Alignment Transformer for Image Deraining Shun Zou et.al. 2504.16455 null
2025-04-21 Acquire and then Adapt: Squeezing out Text-to-Image Model for Image Restoration Junyuan Deng et.al. 2504.15159 null
2025-04-21 Distribution-aware Dataset Distillation for Efficient Image Restoration Zhuoran Zheng et.al. 2504.14826 null
2025-04-19 Any Image Restoration via Efficient Spatial-Frequency Degradation Adaptation Bin Ren et.al. 2504.14249 null
2025-04-21 Circular Image Deturbulence using Quasi-conformal Geometry Chu Chen et.al. 2504.13432 null
2025-04-17 Saliency-Aware Diffusion Reconstruction for Effective Invisible Watermark Removal Inzamamul Alam et.al. 2504.12809 link
2025-04-17 AdaQual-Diff: Diffusion-Based Image Restoration via Adaptive Quality Prompting Xin Su et.al. 2504.12605 null
2025-04-16 Deep Generative Models for Bayesian Inference on High-Rate Sensor Data: Applications in Automotive Radar and Medical Imaging Tristan S. W. Stevens et.al. 2504.12154 null
2025-04-16 HyperKING: Quantum-Classical Generative Adversarial Networks for Hyperspectral Image Restoration Chia-Hsiang Lin et.al. 2504.11782 null
2025-04-15 Efficient Medical Image Restoration via Reliability Guided Learning in Frequency Domain Pengcheng Zheng et.al. 2504.11286 null
2025-04-20 An Efficient and Mixed Heterogeneous Model for Image Restoration Yubin Gu et.al. 2504.10967 link
2025-04-14 Enhancing Image Restoration through Learning Context-Rich and Detail-Accurate Features Hu Gao et.al. 2504.10558 link
2025-04-14 PG-DPIR: An efficient plug-and-play method for high-count Poisson-Gaussian inverse problems Maud Biquard et.al. 2504.10375 null
2025-04-14 VibrantLeaves: A principled parametric image generator for training deep restoration models Raphael Achddou et.al. 2504.10201 link
2025-04-14 Progressive Transfer Learning for Multi-Pass Fundus Image Restoration Uyen Phan et.al. 2504.10025 null
2025-04-14 Beyond Degradation Redundancy: Contrastive Prompt Learning for All-in-One Image Restoration Gang Wu et.al. 2504.09973 link
2025-04-13 Computationally iterative methods for salt-and-pepper denoising Jianwei Ke et.al. 2504.09408 null
2025-04-12 Beyond Degradation Conditions: All-in-One Image Restoration via HOG Transformers Jiawei Wu et.al. 2504.09377 link
2025-04-11 ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration Yongsheng Yu et.al. 2504.08591 null
2025-04-11 VL-UR: Vision-Language-guided Universal Restoration of Images Degraded by Adverse Weather Conditions Ziyan Liu et.al. 2504.08219 null
2025-04-09 Q-Agent: Quality-Driven Chain-of-Thought Image Restoration Agent through Robust Multimodal Large Language Model Yingjie Zhou et.al. 2504.07148 null
2025-04-09 Rethinking LayerNorm in Image Restoration Transformers MinKyu Lee et.al. 2504.06629 null
2025-04-08 AstroClearNet: Deep image prior for multi-frame astronomical image restoration Yashil Sukurdeep et.al. 2504.06463 null
2025-04-07 DA2Diff: Exploring Degradation-aware Adaptive Diffusion Priors for All-in-One Weather Restoration Jiamei Xiong et.al. 2504.05135 null
2025-04-08 Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision Yuandong Pu et.al. 2504.04903 null
2025-04-07 Content-Aware Transformer for All-in-one Image Restoration Gang Wu et.al. 2504.04869 link
2025-04-05 JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration Yunlong Lin et.al. 2504.04158 null
2025-04-04 Multimodal Diffusion Bridge with Attention-Based SAR Fusion for Satellite Image Cloud Removal Yuyang Hu et.al. 2504.03607 null
2025-04-04 Finding the Reflection Point: Unpadding Images to Remove Data Augmentation Artifacts in Large Open Source Image Datasets for Machine Learning Lucas Choi et.al. 2504.03168 null
2025-04-03 RoSMM: A Robust and Secure Multi-Modal Watermarking Framework for Diffusion Models ZhongLi Fang et.al. 2504.02640 null
2025-04-02 Bridge the Gap between SNN and ANN for Image Restoration Xin Su et.al. 2504.01755 null
2025-04-01 Deconver: A Deconvolutional Network for Medical Image Segmentation Pooya Ashtari et.al. 2504.00302 link
2025-03-31 InstructRestore: Region-Customized Image Restoration with Human Instructions Shuaizheng Liu et.al. 2503.24357 link
2025-03-29 indiSplit: Bringing Severity Cognizance to Image Decomposition in Fluorescence Microscopy Ashesh Ashesh et.al. 2503.22983 null
2025-03-28 RELD: Regularization by Latent Diffusion Models for Image Restoration Pasquale Cascarano et.al. 2503.22563 null
2025-04-02 Q-MambaIR: Accurate Quantized Mamba for Efficient Image Restoration Yujie Chen et.al. 2503.21970 null
2025-03-27 Invert2Restore: Zero-Shot Degradation-Blind Image Restoration Hamadi Chihaoui et.al. 2503.21486 null
2025-03-27 Diffusion Image Prior Hamadi Chihaoui et.al. 2503.21410 null
2025-03-26 Devil is in the Uniformity: Exploring Diverse Learners within Transformer for Image Restoration Shihao Zhou et.al. 2503.20174 null
2025-03-23 Cat-AIR: Content and Task-Aware All-in-One Image Restoration Jiachen Jiang et.al. 2503.17915 null
2025-03-22 Fractal-IR: A Unified Framework for Efficient and Scalable Image Restoration Yawei Li et.al. 2503.17825 null
2025-03-21 Vision-Language Gradient Descent-driven All-in-One Deep Unfolding Networks Haijin Zeng et.al. 2503.16930 null
2025-03-20 Efficient Bayesian Computation Using Plug-and-Play Priors for Poisson Inverse Problems Teresa Klatzer et.al. 2503.16222 null
2025-03-20 DIPLI: Deep Image Prior Lucky Imaging for Blind Astronomical Image Restoration Suraj Singh et.al. 2503.15984 null
2025-03-21 UniCoRN: Latent Diffusion-based Unified Controllable Image Restoration Network across Multiple Degradations Debabrata Mandal et.al. 2503.15868 null
2025-03-19 Image Restoration Models with Optimal Transport and Total Variation Regularization Weijia Huang et.al. 2503.14947 null
2025-03-18 SIR-DIFF: Sparse Image Sets Restoration with Multi-View Diffusion Model Yucheng Mao et.al. 2503.14463 null
2025-03-18 Towards properties of adversarial image perturbations Egor Kuznetsov et.al. 2503.14111 null
2025-03-18 Intra and Inter Parser-Prompted Transformers for Effective Image Restoration Cong Wang et.al. 2503.14037 link
2025-03-17 From Zero to Detail: Deconstructing Ultra-High-Definition Image Restoration from Progressive Spectral Perspective Chen Zhao et.al. 2503.13165 null
2025-03-17 Decouple to Reconstruct: High Quality UHD Restoration via Active Feature Disentanglement and Reversible Fusion Yidi Liu et.al. 2503.12764 null
2025-03-16 Pathology Image Restoration via Mixture of Prompts Jiangdong Cai et.al. 2503.12399 link
2025-03-14 InverseBench: Benchmarking Plug-and-Play Diffusion Priors for Inverse Problems in Physical Sciences Hongkai Zheng et.al. 2503.11043 null
2025-03-13 Hybrid Agents for Image Restoration Bingchen Li et.al. 2503.10120 null
2025-03-13 Dream-IF: Dynamic Relative EnhAnceMent for Image Fusion Xingxin Xu et.al. 2503.10109 null
2025-03-17 Multi-Agent Image Restoration Xu Jiang et.al. 2503.09403 null
2025-03-12 MP-HSIR: A Multi-Prompt Framework for Universal Hyperspectral Image Restoration Zhehui Wu et.al. 2503.09131 link
2025-03-12 Prompt to Restore, Restore to Prompt: Cyclic Prompting for Universal Adverse Weather Removal Rongxin Liao et.al. 2503.09013 link
2025-03-11 QUIET-SR: Quantum Image Enhancement Transformer for Single Image Super-Resolution Siddhant Dutta et.al. 2503.08759 null
2025-03-11 Boosting Diffusion-Based Text Image Super-Resolution Model Towards Generalized Real-World Scenarios Chenglu Pan et.al. 2503.07232 null
2025-03-03 Hyperspectral Image Restoration and Super-resolution with Physics-Aware Deep Learning for Biomedical Applications Yuchen Xiang et.al. 2503.02908 null
2025-03-04 ERetinex: Event Camera Meets Retinex Theory for Low-Light Image Enhancement Xuejian Guo et.al. 2503.02484 link
2025-03-18 Semantic Prior Distillation with Vision Foundation Model for Enhanced Rapid Bone Scintigraphy Image Restoration Pengchen Liang et.al. 2503.02321 null
2025-03-03 MRI super-resolution reconstruction using efficient diffusion probabilistic model with residual shifting Mojtaba Safari et.al. 2503.01576 link
2025-03-03 Wavelet-Enhanced Desnowing: A Novel Single Image Restoration Approach for Traffic Surveillance under Adverse Weather Conditions Zihan Shen et.al. 2503.01339 null
2025-03-03 Reconciling Stochastic and Deterministic Strategies for Zero-shot Image Restoration using Diffusion Model in Dual Chong Wang et.al. 2503.01288 link
2025-02-28 Diffusion Restoration Adapter for Real-World Image Restoration Hanbang Liang et.al. 2502.20679 null
2025-02-26 Self-supervised conformal prediction for uncertainty quantification in Poisson imaging problems Bernardin Tamo Amougou et.al. 2502.19194 null
2025-02-26 Multi-level Attention-guided Graph Neural Network for Image Restoration Jiatao Jiang et.al. 2502.19181 null
2025-02-27 RetinaRegen: A Hybrid Model for Readability and Detail Restoration in Fundus Images Yuhan Tang et.al. 2502.19153 null
2025-03-08 Dynamic Degradation Decomposition Network for All-in-One Image Restoration Huiqiang Wang et.al. 2502.19068 null
2025-02-24 Splitting Regularized Wasserstein Proximal Algorithms for Nonsmooth Sampling Problems Fuqun Han et.al. 2502.16773 link
2025-02-19 RestoreGrad: Signal Restoration Using Conditional Denoising Diffusion Models with Jointly Learned Prior Ching-Hua Lee et.al. 2502.13574 null
2025-02-19 Compression-Aware One-Step Diffusion Model for JPEG Artifact Removal Jinpei Guo et.al. 2502.09873 link
2025-02-13 Source function from two-particle correlation function through entropy-regularized Richardson-Lucy deblurring C. K. Tam et.al. 2502.09478 null
2025-02-19 MRS: A Fast Sampler for Mean Reverting Diffusion based on ODE and SDE Solvers Ao Li et.al. 2502.07856 null
2025-02-10 UniDemoiré: Towards Universal Image Demoiréing with Data Generation and Synthesis Zemin Yang et.al. 2502.06324 null
2025-02-21 UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal Control Kaizhen Zhu et.al. 2502.05749 link
2025-02-07 Self-supervised Conformal Prediction for Uncertainty Quantification in Imaging Problems Jasper M. Everink et.al. 2502.05127 null
2025-02-05 All-in-One Image Compression and Restoration Huimin Zeng et.al. 2502.03649 link
2025-02-05 Efficient Image Restoration via Latent Consistency Flow Matching Elad Cohen et.al. 2502.03500 null
2025-02-04 Blind Visible Watermark Removal with Morphological Dilation Preston K. Robinette et.al. 2502.02676 null
2025-02-03 Human Body Restoration with One-Step Diffusion Model and A New Benchmark Jue Gong et.al. 2502.01411 null
2025-02-10 Compressed Image Generation with Denoising Diffusion Codebook Models Guy Ohayon et.al. 2502.01189 null
2025-02-01 Shape from Semantics: 3D Shape Generation from Multi-View Semantics Liangchen Li et.al. 2502.00360 null
2025-01-30 Integrating Spatial and Frequency Information for Under-Display Camera Image Restoration Kyusu Ahn et.al. 2501.18517 null
2025-01-31 MatIR: A Hybrid Mamba-Transformer Image Restoration Model Juan Wen et.al. 2501.18401 link
2025-01-27 Directing Mamba to Complex Textures: An Efficient Texture-Aware State Space Model for Image Restoration Long Peng et.al. 2501.16583 null
2025-01-27 CausalSR: Structural Causal Model-Driven Super-Resolution with Counterfactual Inference Zhengyang Lu et.al. 2501.15852 link
2025-01-26 Universal Image Restoration Pre-training via Degradation Classification JiaKui Hu et.al. 2501.15510 link
2025-01-24 CDI: Blind Image Restoration Fidelity Evaluation based on Consistency with Degraded Image Xiaojun Tang et.al. 2501.14264 null
2025-01-23 INDIGO+: A Unified INN-Guided Probabilistic Diffusion Algorithm for Blind and Non-Blind Image Restoration Di You et.al. 2501.14014 null
2025-01-23 Binary Diffusion Probabilistic Model Vitaliy Kinakh et.al. 2501.13915 null
2025-01-22 UniRestore: Unified Perceptual and Task-Oriented Image Restoration Model Using Diffusion Prior I-Hsiang Chen et.al. 2501.13134 null
2025-01-22 Deep Learning-Based Image Recovery and Pose Estimation for Resident Space Objects Louis Aberdeen et.al. 2501.13009 null
2025-01-22 UniUIR: Considering Underwater Image Restoration as An All-in-One Learner Xu Zhang et.al. 2501.12981 null
2025-01-22 FDG-Diff: Frequency-Domain-Guided Diffusion Framework for Compressed Hazy Image Restoration Ruicheng Zhang et.al. 2501.12832 link
2025-01-21 Proxies for Distortion and Consistency with Applications for Real-World Image Restoration Sean Man et.al. 2501.12102 null
2025-01-20 SILO: Solving Inverse Problems with Latent Operators Ron Raphaeli et.al. 2501.11746 null
2025-01-17 DiffStereo: High-Frequency Aware Diffusion Model for Stereo Image Restoration Huiyun Cao et.al. 2501.10325 null
2025-01-16 Soft Knowledge Distillation with Multi-Dimensional Cross-Net Attention for Image Restoration Models Compression Yongheng Zhang et.al. 2501.09321 null
2025-01-16 Knowledge Distillation for Image Restoration : Simultaneous Learning from Degraded and Clean Images Yongheng Zhang et.al. 2501.09268 null
2025-01-08 Color Correction Meets Cross-Spectral Refinement: A Distribution-Aware Diffusion for Underwater Image Restoration Laibin Chang et.al. 2501.04740 null
2025-01-08 MB-TaylorFormer V2: Improved Multi-branch Linear Transformer Expanded by Taylor Formula for Image Restoration Zhi Jin et.al. 2501.04486 link
2025-01-07 Fixed Points of Deep Neural Networks: Emergence, Stability, and Applications L. Berlyand et.al. 2501.04182 null
2025-01-07 Convergent Primal-Dual Plug-and-Play Image Restoration: A General Algorithm and Applications Yodai Suzuki et.al. 2501.03780 link
2025-01-06 ImageMM: Joint multi-frame image restoration and super-resolution Yashil Sukurdeep et.al. 2501.03002 null
2025-01-06 Underwater Image Restoration Through a Prior Guided Hybrid Sense Approach and Extensive Benchmark Analysis Xiaojiao Guo et.al. 2501.02701 link
2024-12-30 Varformer: Adapting VAR’s Generative Prior for Image Restoration Siyang Wang et.al. 2412.21063 link
2024-12-29 Zero-Shot Image Restoration Using Few-Step Guidance of Consistency Models (and Beyond) Tomer Garber et.al. 2412.20596 link
2024-12-28 UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper Granularity Jingbo Lin et.al. 2412.20157 link
2024-12-28 MaIR: A Locality- and Continuity-Preserving Mamba for Image Restoration Boyun Li et.al. 2412.20066 link
2024-12-28 An Ordinary Differential Equation Sampler with Stochastic Start for Diffusion Bridge Models Yuang Wang et.al. 2412.19992 null
2024-12-27 Generative Adversarial Network on Motion-Blur Image Restoration Zhengdong Li et.al. 2412.19479 null
2024-12-24 Underwater Image Restoration via Polymorphic Large Kernel CNNs Xiaojiao Guo et.al. 2412.18459 link
2024-12-24 UNet–: Memory-Efficient and Feature-Enhanced Network Architecture based on U-Net with Reduced Skip-Connections Lingxiao Yin et.al. 2412.18276 null
2024-12-21 Optoelectronic generative adversarial networks Jumin Qiu et.al. 2412.16672 link
2025-01-11 NeuroPump: Simultaneous Geometric and Color Rectification for Underwater Images Yue Guo et.al. 2412.15890 null
2024-12-20 Multi-dimensional Visual Prompt Enhanced Image Restoration via Mamba-Transformer Aggregation Aiwen Jiang et.al. 2412.15845 link
2024-12-19 Unified Image Restoration and Enhancement: Degradation Calibrated Cycle Reconstruction Diffusion Model Minglong Xue et.al. 2412.14630 link
2024-12-18 Personalized Generative Low-light Image Denoising and Enhancement Xijun Wang et.al. 2412.14327 null
2024-12-18 Distilled Pooling Transformer Encoder for Efficient Realistic Image Dehazing Le-Anh Tran et.al. 2412.14220 link
2024-12-18 DarkIR: Robust Low-Light Image Restoration Daniel Feijoo et.al. 2412.13443 link
2024-12-17 Consistent Diffusion: Denoising Diffusion Model with Data-Consistent Training for Image Restoration Xinlong Cheng et.al. 2412.12550 null
2024-12-15 Towards Context-aware Convolutional Network for Image Restoration Fangwei Hao et.al. 2412.11008 null
2024-12-14 Boosting ViT-based MRI Reconstruction from the Perspectives of Frequency Modulation, Spatial Purification, and Scale Diversification Yucong Meng et.al. 2412.10776 null
2024-12-16 Matrix Completion via Residual Spectral Matching Ziyuan Chen et.al. 2412.10005 null
2024-12-12 OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs Yuanzhi Zhu et.al. 2412.09465 link
2024-12-13 Are Conditional Latent Diffusion Models Effective for Image Restoration? Yunchen Yuan et.al. 2412.09324 null
2024-12-12 ExpRDiff: Short-exposure Guided Diffusion Model for Realistic Local Motion Deblurring Zhongbao Yang et.al. 2412.09193 null
2024-12-17 Dynamic Contrastive Knowledge Distillation for Efficient Image Restoration Yunshuai Zhou et.al. 2412.08939 link
2024-12-11 Convergence Analysis of a Proximal Stochastic Denoising Regularization Algorithm Marien Renaud et.al. 2412.08262 null
2024-12-10 Modeling Dual-Exposure Quad-Bayer Patterns for Joint Denoising and Deblurring Yuzhi Zhao et.al. 2412.07256 link
2024-12-10 EchoIR: Advancing Image Restoration with Echo Upsampling and Bi-Level Optimization Yuhan He et.al. 2412.07225 null
2024-12-10 A Progressive Image Restoration Network for High-order Degradation Imaging in Remote Sensing Yujie Feng et.al. 2412.07195 null
2024-12-09 InstantRestore: Single-Step Personalized Face Restoration with Shared-Image Attention Howard Zhang et.al. 2412.06753 null
2024-12-07 Enhancing Sample Generation of Diffusion Models using Noise Level Correction Abulikemu Abuduweili et.al. 2412.05488 null
2024-12-06 Equivariant Denoisers for Image Restoration Marien Renaud et.al. 2412.05343 null
2024-12-06 ReF-LDM: A Latent Diffusion Model for Reference-based Face Image Restoration Chi-Wei Hsiao et.al. 2412.05043 null
2024-12-05 Generalized Recorrupted-to-Recorrupted: Self-Supervised Learning Beyond Gaussian Noise Brayan Monroy et.al. 2412.04648 link
2024-12-05 MetaFormer: High-fidelity Metalens Imaging via Aberration Correcting Transformers Byeonghyeon Lee et.al. 2412.04591 null
2024-12-05 Deep priors for satellite image restoration with accurate uncertainties Biquard Maud et.al. 2412.04130 null
2024-12-05 Blind Underwater Image Restoration using Co-Operational Regressor Networks Ozer Can Devecioglu et.al. 2412.03995 null
2024-12-05 LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model Yuan Xue et.al. 2412.03841 null
2024-12-11 Exploring Real&Synthetic Dataset and Linear Attention in Image Restoration Yuzhen Du et.al. 2412.03814 null
2024-12-04 Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution Jiahua Xiao et.al. 2412.02960 null
2024-12-03 Relaxed and Inertial Nonlinear Forward-Backward with Momentum Fernando Roldán et.al. 2412.02045 link
2024-12-02 Phaseformer: Phase-based Attention Mechanism for Underwater Image Restoration and Beyond MD Raqib Khan et.al. 2412.01456 link
2024-12-02 FoundIR: Unleashing Million-scale Training Data to Advance Foundation Models for Image Restoration Hao Li et.al. 2412.01427 null
2024-12-06 Beyond Pixels: Text Enhances Generalization in Real-World Image Restoration Haoze Sun et.al. 2412.00878 null
2024-11-30 Blind Inverse Problem Solving Made Easy by Text-to-Image Latent Diffusion Michail Dontas et.al. 2412.00557 null
2024-11-27 Hierarchical Information Flow for Generalized Efficient Image Restoration Yawei Li et.al. 2411.18588 null
2024-11-27 Complexity Experts are Task-Discriminative Learners for Any Image Restoration Eduard Zamfir et.al. 2411.18466 null
2024-11-27 Adaptive Blind All-in-One Image Restoration David Serrano-Lozano et.al. 2411.18412 link
2024-11-27 TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution Linwei Dong et.al. 2411.18263 link
2024-11-26 Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation Sudarshan Rajagopalan et.al. 2411.17814 null
2024-11-26 GenDeg: Diffusion-Based Degradation Synthesis for Generalizable All-in-One Image Restoration Sudarshan Rajagopalan et.al. 2411.17687 null
2024-11-26 Puzzle Similarity: A Perceptually-guided No-Reference Metric for Artifact Detection in 3D Scene Reconstructions Nicolai Hermann et.al. 2411.17489 null
2024-11-26 MWFormer: Multi-Weather Image Restoration Using Degradation-Aware Transformers Ruoxi Zhu et.al. 2411.17226 link
2024-11-23 Gradient-Guided Parameter Mask for Multi-Scenario Image Restoration Under Adverse Weather Jilong Guo et.al. 2411.16739 link
2024-11-25 Mixed Degradation Image Restoration via Local Dynamic Optimization and Conditional Embedding Yubin Gu et.al. 2411.16217 null
2024-11-25 U2NeRF: Unsupervised Underwater Image Restoration and Neural Radiance Fields Vinayak Gupta et.al. 2411.16172 null
2024-11-29 PromptHSI: Universal Hyperspectral Image Restoration Framework for Composite Degradation Chia-Ming Lee et.al. 2411.15922 link
2024-11-24 LTCF-Net: A Transformer-Enhanced Dual-Channel Fourier Framework for Low-Light Image Restoration Gaojing Zhang et.al. 2411.15740 null
2024-11-22 Frequency-Guided Posterior Sampling for Diffusion-Based Image Restoration Darshan Thaker et.al. 2411.15295 null
2024-11-22 MambaIRv2: Attentive State Space Restoration Hang Guo et.al. 2411.15269 link
2024-11-20 Analysis and Synthesis Denoisers for Forward-Backward Plug-and-Play Algorithms Matthieu Kowalski et.al. 2411.13276 null
2024-11-19 Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models Jun Xiao et.al. 2411.12450 null
2024-11-19 Versatile Cataract Fundus Image Restoration Model Utilizing Unpaired Cataract and High-quality Images Zheng Gong et.al. 2411.12278 null
2024-11-19 TSFormer: A Robust Framework for Efficient UHD Image Restoration Xin Su et.al. 2411.10951 null
2024-11-16 AllRestorer: All-in-One Transformer for Image Restoration under Composite Degradations Jiawei Mao et.al. 2411.10708 null
2024-11-15 Probabilistic Prior Driven Attention Mechanism Based on Diffusion Model for Imaging Through Atmospheric Turbulence Guodong Sun et.al. 2411.10321 null
2024-11-12 Joint multi-dimensional dynamic attention and transformer for general image restoration Huan Zhang et.al. 2411.07893 link
2024-11-12 All-in-one Weather-degraded Image Restoration via Adaptive Degradation-aware Self-prompting Model Yuanbo Wen et.al. 2411.07445 null
2024-11-11 Multi-scale Frequency Enhancement Network for Blind Image Deblurring Yawen Xiang et.al. 2411.06893 null
2024-11-10 Dropout the High-rate Downsampling: A Novel Design Paradigm for UHD Image Restoration Chen Wu et.al. 2411.06456 null
2024-11-08 A Modular Conditional Diffusion Framework for Image Reconstruction Magauiya Zhussip et.al. 2411.05993 null
2024-11-03 Degradation-Aware Residual-Conditioned Optimal Transport for Unified Image Restoration Xiaole Tang et.al. 2411.01656 link
2024-10-31 Aquatic-GS: A Hybrid 3D Representation for Underwater Scenes Shaohua Liu et.al. 2411.00239 null
2024-10-31 Chasing Better Deep Image Priors between Over- and Under-parameterization Qiming Wu et.al. 2410.24187 link
2024-10-31 Cycle-Constrained Adversarial Denoising Convolutional Network for PET Image Denoising: Multi-Dimensional Validation on Large Datasets with Reader Study and Real Low-Dose Data Yucun Hou et.al. 2410.23628 null
2024-10-31 MS-Glance: Non-semantic context vectors and the applications in supervising image reconstruction Ziqi Gao et.al. 2410.23577 link
2024-10-30 EnsIR: An Ensemble Algorithm for Image Restoration via Gaussian Mixture Models Shangquan Sun et.al. 2410.22959 link
2024-10-29 DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation Yuang Ai et.al. 2410.18666 link
2024-10-23 DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection Qingpeng Li et.al. 2410.17822 link
2024-10-23 An Intelligent Agentic System for Complex Image Restoration Problems Kaiwen Zhu et.al. 2410.17809 link
2024-10-23 A variational approach to nonlocal image restoration flows Harsh Prasad et.al. 2410.17649 null
2024-10-23 Diffusion Priors for Variational Likelihood Estimation and Image Denoising Jun Cheng et.al. 2410.17521 link
2024-11-16 LoRA-IR: Taming Low-Rank Experts for Efficient All-in-One Image Restoration Yuang Ai et.al. 2410.15385 link
2024-10-19 A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends Junjun Jiang et.al. 2410.15067 link
2024-10-16 Fusion from Decomposition: A Self-Supervised Approach for Image Fusion and Beyond Pengwei Liang et.al. 2410.12274 null
2024-10-15 Analysis and Benchmarking of Extending Blind Face Image Restoration to Videos Zhouxia Wang et.al. 2410.11828 null
2024-10-11 Chain-of-Restoration: Multi-Task Image Restoration Models are Zero-Shot Step-by-Step Universal Image Restorers Jin Cao et.al. 2410.08688 link
2024-10-10 TANet: Triplet Attention Network for All-In-One Adverse Weather Image Restoration Hsing-Hua Wang et.al. 2410.08177 link
2024-10-09 InstantIR: Blind Image Restoration with Instant Generative Reference Jen-Yuan Huang et.al. 2410.06551 null
2024-10-08 ReFIR: Grounding Large Restoration Models with Retrieval Augmentation Hang Guo et.al. 2410.05601 link
2024-10-07 Learning Efficient and Effective Trajectories for Differential Equation-based Image Restoration Zhiyu Zhu et.al. 2410.04811 link
2024-10-06 SITCOM: Step-wise Triple-Consistent Diffusion Sampling for Inverse Problems Ismail Alkhouri et.al. 2410.04479 link
2024-10-05 Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model Keda Tao et.al. 2410.04161 null
2024-10-04 Diffusion State-Guided Projected Gradient for Inverse Problems Rayhan Zirvi et.al. 2410.03463 link
2024-10-03 PnP-Flow: Plug-and-Play Image Restoration with Flow Matching Ségolène Martin et.al. 2410.02423 link
2024-10-02 Posterior sampling via Langevin dynamics based on generative priors Vishal Purohit et.al. 2410.02078 null
2024-10-01 Three-Operator Splitting Method with Two-Step Inertial Extrapolation Olaniyi S. Iyiola et.al. 2410.01099 null
2024-10-01 Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration Guy Ohayon et.al. 2410.00418 link
2024-10-01 GLMHA A Guided Low-rank Multi-Head Self-Attention for Efficient Image Restoration and Spectral Reconstruction Zaid Ilyas et.al. 2410.00380 null
2024-09-30 A Survey on Diffusion Models for Inverse Problems Giannis Daras et.al. 2410.00083 null
2024-09-30 UIR-LoRA: Achieving Universal Image Restoration through Multiple Low-Rank Adaptation Cheng Zhang et.al. 2409.20197 link
2024-09-28 Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration Chu-Jie Qin et.al. 2409.19403 link
2024-09-26 Toward Efficient Deep Blind RAW Image Restoration Marcos V. Conde et.al. 2409.18204 link
2024-09-26 Taming Diffusion Prior for Image Super-Resolution with Domain Shift SDEs Qinpeng Cui et.al. 2409.17778 link
2024-10-05 PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions Weifeng Lin et.al. 2409.15278 link
2024-09-18 Denoising diffusion models for high-resolution microscopy image restoration Pamela Osuna-Vargas et.al. 2409.12078 null
2024-09-16 Taming Diffusion Models for Image Restoration: A Review Ziwei Luo et.al. 2409.10353 null
2024-09-12 Quaternion Nuclear Norm minus Frobenius Norm Minimization for color image reconstruction Yu Guo et.al. 2409.07797 null
2024-09-11 PanAdapter: Two-Stage Fine-Tuning with Spatial-Spectral Priors Injecting for Pansharpening RuoCheng Wu et.al. 2409.06980 null
2024-09-24 Lightweight single-image super-resolution network based on dual paths Li Ke et.al. 2409.06590 null
2024-09-10 Multi-Weather Image Restoration via Histogram-Based Transformer Feature Enhancement Yang Wen et.al. 2409.06334 null
2024-09-10 AgileIR: Memory-Efficient Group Shifted Windows Attention for Agile Image Restoration Hongyi Cai et.al. 2409.06206 null
2024-09-07 Power Line Aerial Image Restoration under dverse Weather: Datasets and Baselines Sai Yang et.al. 2409.04812 link
2024-09-06 Empirical Bayesian image restoration by Langevin sampling with a denoising diffusion implicit prior Charlesquin Kemajou Mbakam et.al. 2409.04384 null
2024-09-05 Data-free Distillation with Degradation-prompt Diffusion for Multi-weather Image Restoration Pei Wang et.al. 2409.03455 null
2024-09-05 Multiple weather images restoration using the task transformer and adaptive mixup strategy Yang Wen et.al. 2409.03249 null
2024-09-05 Perceptual-Distortion Balanced Image Super-Resolution is a Multi-Objective Optimization Problem Qiwen Zhu et.al. 2409.03179 link
2024-09-03 Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language Models Jiaqi Xu et.al. 2409.02101 link
2024-09-03 F2former: When Fractional Fourier Meets Deep Wiener Deconvolution and Selective Frequency Transformer for Image Deblurring Subhajit Paul et.al. 2409.02056 null
2024-09-03 GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting Zixuan Guo et.al. 2409.01581 null
2024-09-01 Accurate Forgetting for All-in-One Image Restoration Model Xin Su et.al. 2409.00685 null
2024-08-30 AWRaCLe: All-Weather Image Restoration using Visual In-Context Learning Sudarshan Rajagopalan et.al. 2409.00263 null
2024-08-30 Efficient Image Restoration through Low-Rank Adaptation and Stable Diffusion XL Haiyang Zhao et.al. 2408.17060 null
2024-08-29 GameIR: A Large-Scale Synthesized Ground-Truth Dataset for Image Restoration over Gaming Content Lebin Zhou et.al. 2408.16866 null
2024-08-29 Enhanced Control for Diffusion Bridge in Image Restoration Conghan Yue et.al. 2408.16303 link
2024-08-28 Perceive-IR: Learning to Perceive Degradation Better for All-in-One Image Restoration Xu Zhang et.al. 2408.15994 null
2024-08-27 A Preliminary Exploration Towards General Image Restoration Xiangtao Kong et.al. 2408.15143 null
2024-08-22 CODE: Confident Ordinary Differential Editing Bastien van Delft et.al. 2408.12418 link
2024-08-21 OAPT: Offset-Aware Partition Transformer for Double JPEG Artifacts Removal Qiao Mo et.al. 2408.11480 link
2024-08-21 Taming Generative Diffusion for Universal Blind Image Restoration Siwei Tu et.al. 2408.11287 null
2024-08-19 Multi-Scale Representation Learning for Image Restoration with State-Space Model Yuhong He et.al. 2408.10145 null
2024-08-19 Harnessing Multi-resolution and Multi-scale Attention for Underwater Image Restoration Alik Pramanick et.al. 2408.09912 link
2024-08-17 Re-boosting Self-Collaboration Parallel Prompt GAN for Unsupervised Image Restoration Xin Lin et.al. 2408.09241 link
2024-08-15 Unsupervised Variational Translator for Bridging Image Restoration and High-Level Vision Tasks Jiawei Wu et.al. 2408.08149 link
2024-08-28 HAIR: Hypernetworks-based All-in-One Image Restoration Jin Cao et.al. 2408.08091 link
2024-08-13 Review Learning: Advancing All-in-One Ultra-High-Definition Image Restoration Training Method Xin Su et.al. 2408.06709 null
2024-08-12 Wavelet based inpainting detection Barglazan Adrian-Alin et.al. 2408.06429 null
2024-08-10 Greedy randomized block Kaczmarz method for matrix equation AXB=C and its applications in color image restoration Wenli Wang et.al. 2408.05444 null
2024-08-08 Physical prior guided cooperative learning framework for joint turbulence degradation estimation and infrared video restoration Ziran Zhang et.al. 2408.04227 null
2024-08-08 MultiColor: Image Colorization by Learning from Multiple Color Spaces Xiangcheng Du et.al. 2408.04172 null
2024-08-28 Multi-weather Cross-view Geo-localization Using Denoising Diffusion Models Tongtong Feng et.al. 2408.02408 null
2024-08-02 Contribution-based Low-Rank Adaptation with Pre-training Model for Real Image Restoration Donwon Park et.al. 2408.01099 null
2024-08-01 A Prior Embedding-Driven Architecture for Long Distance Blind Iris Recognition Qi Xiong et.al. 2408.00210 null
2024-07-30 UniProcessor: A Text-induced Unified Low-level Image Processor Huiyu Duan et.al. 2407.20928 link
2024-07-27 Inverse Problems with Diffusion Models: A MAP Estimation Perspective Sai bharath chandra Gutha et.al. 2407.20784 link
2024-07-27 Multi-Expert Adaptive Selection: Task-Balancing for All-in-One Image Restoration Xiaoyan Yu et.al. 2407.19139 link
2024-07-19 GroupCDL: Interpretable Denoising and Compressed Sensing MRI via Learned Group-Sparsity and Circulant Attention Nikola Janjusevic et.al. 2407.18967 null
2024-07-26 Dilated Strip Attention Network for Image Restoration Fangwei Hao et.al. 2407.18613 null
2024-07-25 RestoreAgent: Autonomous Image Restoration Agent via Multimodal Large Language Models Haoyu Chen et.al. 2407.18035 null
2024-07-23 CLII: Visual-Text Inpainting via Cross-Modal Predictive Interaction Liang Zhao et.al. 2407.16204 null
2024-07-23 Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems Sojin Lee et.al. 2407.16125 link
2024-07-20 Deep Learning CT Image Restoration using System Blur and Noise Models Yijie Yuan et.al. 2407.14983 null
2024-07-20 Dual High-Order Total Variation Model for Underwater Image Restoration Yuemei Li et.al. 2407.14868 link
2024-07-18 Any Image Restoration with Efficient Automatic Degradation Adaptation Bin Ren et.al. 2407.13372 link
2024-07-18 Training-Free Large Model Priors for Multiple-in-One Image Restoration Xuanhua He et.al. 2407.13181 null
2024-07-21 HPPP: Halpern-type Preconditioned Proximal Point Algorithms and Applications to Image Restoration Shuchang Zhang et.al. 2407.13120 link
2024-07-17 GRIDS: Grouped Multiple-Degradation Restoration with Image Degradation Similarity Shuo Cao et.al. 2407.12273 null
2024-07-16 Haze-Aware Attention Network for Single-Image Dehazing Lihan Tong et.al. 2407.11505 null
2024-07-31 Restore-RWKV: Efficient and Effective Medical Image Restoration with RWKV Zhiwen Yang et.al. 2407.11087 link
2024-07-15 In-Loop Filtering via Trained Look-Up Tables Zhuoyuan Li et.al. 2407.10926 null
2024-07-15 MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration Yulin Ren et.al. 2407.10833 null
2024-07-25 Restoring Images in Adverse Weather Conditions via Histogram Transformer Shangquan Sun et.al. 2407.10172 link
2024-07-12 Region Attention Transformer for Medical Image Restoration Zhiwen Yang et.al. 2407.09268 link
2024-07-12 Exploring Richer and More Accurate Information via Frequency Selection for Image Restoration Hu Gao et.al. 2407.08950 link
2024-07-11 Single-Image Shadow Removal Using Deep Learning: A Comprehensive Survey Laniqng Guo et.al. 2407.08865 link
2024-07-11 Haar Nuclear Norms with Applications to Remote Sensing Imagery Restoration Shuang Xu et.al. 2407.08509 null
2024-07-10 Aging-Resistant Wideband Precoding in 5G and Beyond Using 3D Convolutional Neural Networks Alejandro Villena-Rodriguez et.al. 2407.07434 null
2024-07-15 Asymmetric Mask Scheme for Self-Supervised Real Image Denoising Xiangyu Liao et.al. 2407.06514 link
2024-07-07 Multi-scale Conditional Generative Modeling for Microscopic Image Restoration Luzhe Huang et.al. 2407.05259 null
2024-07-06 Robust Skin Color Driven Privacy Preserving Face Recognition via Function Secret Sharing Dong Han et.al. 2407.05045 null
2024-07-05 On a nonlinear nonlocal reaction-diffusion system applied to image restoration Yuhang Li et.al. 2407.04347 null
2024-07-04 Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration Yuhong Zhang et.al. 2407.03636 null
2024-07-04 MRIR: Integrating Multimodal Insights for Diffusion-based Realistic Image Restoration Yuhong Zhang et.al. 2407.03635 null
2024-07-02 Zero-shot Video Restoration and Enhancement Using Pre-Trained Image Diffusion Model Cong Cao et.al. 2407.01960 null
2024-06-30 Learning Frequency-Aware Dynamic Transformers for All-In-One Image Restoration Zenglin Shi et.al. 2407.01636 null
2024-07-01 Improving Diffusion Inverse Problem Solving with Decoupled Noise Annealing Bingliang Zhang et.al. 2407.01521 link
2024-07-01 DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models Chang-Han Yeh et.al. 2407.01519 link
2024-07-01 Unrolling Plug-and-Play Gradient Graph Laplacian Regularizer for Image Restoration Jianghe Cai et.al. 2407.01469 null
2024-07-01 Blind Inversion using Latent Diffusion Priors Weimin Bai et.al. 2407.01027 null
2024-06-30 Instruct-IPT: All-in-One Image Processing Transformer via Weight Modulation Yuchuan Tian et.al. 2407.00676 link
2024-06-27 Using diffusion model as constraint: Empower Image Restoration Network Training with Diffusion Model Jiangtong Tan et.al. 2406.19030 link
2024-06-26 Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration Kang Liao et.al. 2406.18516 link
2024-06-26 ConStyle v2: A Strong Prompter for All-in-One Image Restoration Dongqi Fan et.al. 2406.18242 link
2024-06-26 MFDNet: Multi-Frequency Deflare Network for Efficient Nighttime Flare Removal Yiguo Jiang et.al. 2406.18079 link
2024-06-24 DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution Aiwen Jiang et.al. 2406.16477 link
2024-06-22 Ultra-High-Definition Restoration: New Benchmarks and A Dual Interaction Prior-Driven Solution Liyan Wang et.al. 2406.13607 link
2024-06-19 Diffusion Model-based FOD Restoration from High Distortion in dMRI Shuo Huang et.al. 2406.13209 null
2024-06-18 Restorer: Solving Multiple Image Restoration Tasks with One Set of Parameters Jiawei Mao et.al. 2406.12587 link
2024-06-13 DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer Wei-Ting Chen et.al. 2406.09622 null
2024-06-13 Blind Super-Resolution via Meta-learning and Markov Chain Monte Carlo Simulation Jingyuan Xia et.al. 2406.08896 link
2024-06-12 LayeredDoc: Domain Adaptive Document Restoration with a Layer Separation Approach Maria Pilligua et.al. 2406.08610 link
2024-06-12 DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor Juncheng Wu et.al. 2406.08377 link
2024-06-14 One-Step Effective Diffusion Network for Real-World Image Super-Resolution Rongyuan Wu et.al. 2406.08177 link
2024-06-12 3D CBCT Challenge 2024: Improved Cone Beam CT Reconstruction using SwinIR-Based Sinogram and Image Enhancement Sasidhar Alavala et.al. 2406.08048 null
2024-06-12 DemosaicFormer: Coarse-to-Fine Demosaicing Network for HybridEVS Camera Senyan Xu et.al. 2406.07951 link
2024-06-11 Beware of Aliases – Signal Preservation is Crucial for Robust Image Restoration Shashank Agnihotri et.al. 2406.07435 null
2024-06-11 Unleashing the Denoising Capability of Diffusion Prior for Solving Inverse Problems Jiawei Zhang et.al. 2406.06959 link
2024-06-07 Optimal Eye Surgeon: Finding Image Priors through Sparse Generators at Initialization Avrajit Ghosh et.al. 2406.05288 link
2024-06-06 Diffusion-based image inpainting with internal learning Nicolas Cherel et.al. 2406.04206 link
2024-06-04 Deep Block Proximal Linearised Minimisation Algorithm for Non-convex Inverse Problems Chaoyan Huang et.al. 2406.02458 null
2024-06-02 Correlation Matching Transformation Transformers for UHD Image Restoration Cong Wang et.al. 2406.00629 link
2024-05-30 Sharing Key Semantics in Transformer Makes Efficient Image Restoration Bin Ren et.al. 2405.20008 link
2024-05-30 All-In-One Medical Image Restoration via Task-Adaptive Routing Zhiwen Yang et.al. 2405.19769 link
2024-05-29 Blind Image Restoration via Fast Diffusion Inversion Hamadi Chihaoui et.al. 2405.19572 link
2024-05-27 Fast Samplers for Inverse Problems in Iterative Refinement Models Kushagra Pandey et.al. 2405.17673 link
2024-06-04 Looks Too Good To Be True: An Information-Theoretic Analysis of Hallucinations in Generative Restoration Models Regev Cohen et.al. 2405.16475 null
2024-05-24 Hierarchical Uncertainty Exploration via Feedforward Posterior Trees Elias Nehme et.al. 2405.15719 null
2024-06-01 Efficient Degradation-aware Any Image Restoration Eduard Zamfir et.al. 2405.15475 null
2024-05-24 Blaze3DM: Marry Triplane Representation with Diffusion for 3D Medical Inverse Problem Solving Jia He et.al. 2405.15241 null
2024-05-23 Efficient Visual State Space Model for Image Deblurring Lingshun Kong et.al. 2405.14343 link
2024-05-22 Perceptual Fairness in Image Restoration Guy Ohayon et.al. 2405.13805 null
2024-05-21 DARK: Denoising, Amplification, Restoration Kit Zhuoheng Li et.al. 2405.12891 link
2024-05-21 Spatial-aware Attention Generative Adversarial Network for Semi-supervised Anomaly Detection in Medical Image Zerui Zhang et.al. 2405.12872 link
2024-05-20 A New Cross-Space Total Variation Regularization Model for Color Image Restoration with Quaternion Blur Operator Zhigang Jia et.al. 2405.12114 null
2024-05-19 Unsupervised Image Prior via Prompt Learning and CLIP Semantic Guidance for Low-Light Image Enhancement Igor Morawski et.al. 2405.11478 null
2024-05-19 Emphasizing Crucial Features for Efficient Image Restoration Hu Gao et.al. 2405.11468 link
2024-05-17 A Versatile Framework for Analyzing Galaxy Image Data by Implanting Human-in-the-loop on a Large Vision Model Mingxiang Fu et.al. 2405.10890 null
2024-05-16 RSDehamba: Lightweight Vision Mamba for Remote Sensing Satellite Image Dehazing Huiling Zhou et.al. 2405.10030 null
2024-05-16 NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge Jie Liang et.al. 2405.09923 null
2024-05-15 Inference in higher-order undirected graphical models and binary polynomial optimization Aida Khajavirad et.al. 2405.09727 null
2024-05-13 FRRffusion: Unveiling Authenticity with Diffusion-Based Face Retouching Reversal Fengchuang Xing et.al. 2405.07582 link
2024-05-09 RPBG: Towards Robust Neural Point-based Graphics in the Wild Qingtian Zhu et.al. 2405.05663 link
2024-05-07 DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks Jiaxin Zhang et.al. 2405.04408 link
2024-05-11 Residual-Conditioned Optimal Transport: Towards Structure-Preserving Unpaired and Paired Image Restoration Xiaole Tang et.al. 2405.02843 link
2024-05-04 Deep Image Restoration For Image Anti-Forensics Eren Tahir et.al. 2405.02751 link
2024-05-23 SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising Guanyiman Fu et.al. 2405.01726 link
2024-04-29 Reconstructing Satellites in 3D from Amateur Telescope Images Zhiming Chang et.al. 2404.18394 null
2024-04-26 PromptCIR: Blind Compressed Image Restoration with Prompt Learning Bingchen Li et.al. 2404.17433 link
2024-04-26 One-Shot Image Restoration Deborah Pereg et.al. 2404.17426 null
2024-05-07 NTIRE 2024 Quality Assessment of AI-Generated Content Challenge Xiaohong Liu et.al. 2404.16687 null
2024-04-26 A Survey on Visual Mamba Hanwei Zhang et.al. 2404.15956 null
2024-04-26 A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-Resolution Zhixiong Yang et.al. 2404.15620 link
2024-04-22 Face2Face: Label-driven Facial Retouching Restoration Guanhua Zhao et.al. 2404.14177 null
2024-04-22 CRNet: A Detail-Preserving Network for Unified Image Restoration and Enhancement Task Kangzhen Yang et.al. 2404.14132 link
2024-04-24 Bracketing Image Restoration and Enhancement with High-Low Frequency Decomposition Genggeng Chen et.al. 2404.13537 link
2024-04-20 PCQA: A Strong Baseline for AIGC Quality Assessment Based on Prompt Condition Xi Fang et.al. 2404.13299 null
2024-04-17 CU-Mamba: Selective State Space Models with Channel Learning for Image Restoration Rui Deng et.al. 2404.11778 null
2024-04-17 AdaIR: Exploiting Underlying Similarities of Image Restoration Tasks with Adapters Hao-Wei Chen et.al. 2404.11475 null
2024-04-16 Improving Bracket Image Restoration and Enhancement with Flow-guided Alignment and Enhanced Feature Aggregation Wenjie Lin et.al. 2404.10358 null
2024-04-16 Referring Flexible Image Restoration Runwei Guan et.al. 2404.10342 link
2024-04-17 OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model Runyi Li et.al. 2404.10312 null
2024-04-15 The Problem Of Image Super-Resolution, Denoising And Some Image Restoration Methods In Deep Learning Models Ngoc-Giau Pham et.al. 2404.09817 null
2024-04-15 Equipping Diffusion Models with Differentiable Spatial Entropy for Low-Light Image Enhancement Wenyi Lian et.al. 2404.09735 link
2024-04-15 Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models Ziwei Luo et.al. 2404.09732 link
2024-04-11 TBSN: Transformer-Based Blind-Spot Network for Self-Supervised Image Denoising Junyi Li et.al. 2404.07846 link
2024-04-11 Joint Conditional Diffusion Model for Image Restoration with Mixed Degradations Yufeng Yue et.al. 2404.07770 null
2024-04-10 Unfolding ADMM for Enhanced Subspace Clustering of Hyperspectral Images Xianlu Li et.al. 2404.07112 link
2024-04-07 STAIC regularization for spatio-temporal image reconstruction Deepak G Skariah et.al. 2404.05070 null
2024-04-09 Empowering Image Recovery_ A Multi-Attention Approach Juan Wen et.al. 2404.04617 null
2024-04-04 DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior Yiming Zhang et.al. 2404.03642 null
2024-04-02 Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration Akshay Dudhane et.al. 2404.02154 link
2024-03-31 GAMA-IR: Global Additive Multidimensional Averaging for Fast Image Restoration Youssef Mansour et.al. 2404.00807 null
2024-03-31 IPT-V2: Efficient Image Processing Transformer using Hierarchical Attentions Zhijun Tu et.al. 2404.00633 null
2024-03-30 Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration Shihao Zhou et.al. 2404.00288 null
2024-03-30 Look-Around Before You Leap: High-Frequency Injected Transformer for Image Restoration Shihao Zhou et.al. 2404.00279 null
2024-03-29 Deeper, Sharper, Faster: Application of Efficient Transformer to Galaxy Image Restoration Hyosun Park et.al. 2404.00102 link
2024-03-27 Towards Image Ambient Lighting Normalization Florin-Alexandru Vasluianu et.al. 2403.18730 link
2024-03-26 Serpent: Scalable and Efficient Image Restoration via Multi-scale Structured State Space Models Mohammad Shahab Sepehri et.al. 2403.17902 null
2024-03-26 SeNM-VAE: Semi-Supervised Noise Modeling with Hierarchical Variational Autoencoder Dihan Zheng et.al. 2403.17502 link
2024-03-26 Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance Donghoon Ahn et.al. 2403.17377 link
2024-04-02 Distilling Semantic Priors from SAM to Efficient Image Restoration Models Quan Zhang et.al. 2403.16368 null
2024-03-23 Graph Image Prior for Unsupervised Dynamic MRI Reconstruction Zhongsen Li et.al. 2403.15770 link
2024-03-22 Latent Neural Cellular Automata for Resource-Efficient Image Restoration Andrea Menta et.al. 2403.15525 null
2024-03-21 Osmosis: RGBD Diffusion Prior for Underwater Image Restoration Opher Bar Nathan et.al. 2403.14837 null
2024-03-21 AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation Yuning Cui et.al. 2403.14614 link
2024-03-26 Step-Calibrated Diffusion for Biomedical Optical Image Restoration Yiwei Lyu et.al. 2403.13680 link
2024-03-20 A multilevel framework for accelerating uSARA in radio-interferometric imaging Guillaume Lauga et.al. 2403.13385 null
2024-03-19 Multispectral Image Restoration by Generalized Opponent Transformation Total Variation Zhantao Ma et.al. 2403.12770 null
2024-03-18 CasSR: Activating Image Power for Real-World Image Super-Resolution Haolan Chen et.al. 2403.11451 null
2024-03-18 VmambaIR: Visual State Space Model for Image Restoration Yuan Shi et.al. 2403.11423 link
2024-03-18 Divide-and-Conquer Posterior Sampling for Denoising Diffusion Priors Yazid Janati et.al. 2403.11407 link
2024-03-17 Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model Dian Zheng et.al. 2403.11157 link
2024-03-16 A Spectrum-based Image Denoising Method with Edge Feature Enhancement Peter Luvton et.al. 2403.11036 null
2024-03-15 Solving General Noisy Inverse Problem via Posterior Sampling: A Policy Gradient Viewpoint Haoyue Tang et.al. 2403.10585 null
2024-03-15 How Powerful Potential of Attention on Image Restoration? Cong Wang et.al. 2403.10336 null
2024-03-15 BlindDiff: Empowering Degradation Modelling in Diffusion Models for Blind Image Super-Resolution Feng Li et.al. 2403.10211 link
2024-03-20 D-YOLO a robust framework for object detection in adverse weather conditions Zihan Chu et.al. 2403.09233 null
2024-03-13 Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data Asad Aali et.al. 2403.08728 link
2024-03-12 Efficient Diffusion Model for Image Restoration by Residual Shifting Zongsheng Yue et.al. 2403.07319 link
2024-03-12 Continual All-in-One Adverse Weather Removal with Knowledge Replay on a Unified Network Structure De Cheng et.al. 2403.07292 link
2024-03-19 Boosting Image Restoration via Priors from Pre-trained Models Xiaogang Xu et.al. 2403.06793 null
2024-03-10 Implicit Image-to-Image Schrodinger Bridge for CT Super-Resolution and Denoising Yuang Wang et.al. 2403.06069 link
2024-03-12 Decoupled Data Consistency with Diffusion Purification for Image Restoration Xiang Li et.al. 2403.06054 link
2024-03-09 Segmentation Guided Sparse Transformer for Under-Display Camera Image Restoration Jingyun Xue et.al. 2403.05906 null
2024-03-09 Generalizing to Out-of-Sample Degradations via Model Reprogramming Runhua Jiang et.al. 2403.05886 link
2024-03-08 Decoupling Degradations with Recurrent Network for Video Restoration in Under-Display Camera Chengxu Liu et.al. 2403.05660 link
2024-03-07 FriendNet: Detection-Friendly Dehazing Network Yihua Fan et.al. 2403.04443 link
2024-03-02 Extrapolated Plug-and-Play Three-Operator Splitting Methods for Nonconvex Optimization with Applications to Image Restoration Zhongming Wu et.al. 2403.01144 link
2024-02-26 Randomized Algorithms for Solving Singular Value Decomposition Problems with Matlab Toolbox Xiaowen Li et.al. 2402.17794 null
2024-02-25 Diffusion Posterior Proximal Sampling for Image Restoration Hongjie Wu et.al. 2402.16907 link
2024-03-04 Learning to See Through Dazzle Xiaopeng Peng et.al. 2402.15919 null
2024-02-24 HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models Li Pang et.al. 2402.15865 link
2024-03-07 IRConStyle: Image Restoration Framework Using Contrastive Learning and Style Transfer Dongqi Fan et.al. 2402.15784 link
2024-02-23 MambaIR: A Simple Baseline for Image Restoration with State-Space Model Hang Guo et.al. 2402.15648 link
2024-02-21 Adversarial Purification and Fine-tuning for Robust UDC Image Restoration Zhenbo Song et.al. 2402.13629 null
2024-02-14 DestripeCycleGAN: Stripe Simulation CycleGAN for Unsupervised Infrared Image Destriping Shiqi Yang et.al. 2402.09101 null
2024-02-10 Gyroscope-Assisted Motion Deblurring Network Simin Luan et.al. 2402.06854 link
2024-02-08 Descanning: From Scanned to the Original Images with a Color Correction Diffusion Model Junghun Cha et.al. 2402.05350 null
2024-02-16 U-shaped Vision Mamba for Single Image Dehazing Zhuoran Zheng et.al. 2402.04139 link
2024-02-08 Analysis of Deep Image Prior and Exploiting Self-Guidance for Image Reconstruction Shijun Liang et.al. 2402.04097 null
2024-02-05 Rethinking RGB Color Representation for Image Restoration Models Jaerin Lee et.al. 2402.03399 null
2024-02-05 Knowledge-driven deep learning for fast MR imaging: undersampled MR image reconstruction from supervised to un-supervised learning Shanshan Wang et.al. 2402.02704 null
2024-02-04 Key-Graph Transformer for Image Restoration Bin Ren et.al. 2402.02634 null
2024-03-04 RecNet: An Invertible Point Cloud Encoding through Range Image Embeddings for Multi-Robot Map Sharing and Reconstruction Nikolaos Stathoulopoulos et.al. 2402.02192 null
2024-02-01 Plug-and-Play image restoration with Stochastic deNOising REgularization Marien Renaud et.al. 2402.01779 link
2024-02-29 LIR: A Lightweight Baseline for Image Restoration Dongqi Fan et.al. 2402.01368 link
2024-01-31 Spatial-and-Frequency-aware Restoration method for Images based on Diffusion Models Kyungsung Lee et.al. 2401.17629 null
2024-01-31 Task-Oriented Diffusion Model Compression Geonung Kim et.al. 2401.17547 null
2024-02-21 InstructIR: High-Quality Image Restoration Following Human Instructions Marcos V. Conde et.al. 2401.16468 link
2024-01-28 UP-CrackNet: Unsupervised Pixel-Wise Road Crack Detection via Adversarial Image Restoration Nachuan Ma et.al. 2401.15647 null
2024-01-26 CascadedGaze: Efficiency in Global Context Extraction for Image Restoration Amirhosein Ghasemabadi et.al. 2401.15235 link
2024-01-24 Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild Fanghua Yu et.al. 2401.13627 null
2024-01-24 Unified-Width Adaptive Dynamic Network for All-In-One Image Restoration Yimin Xu et.al. 2401.13221 link
2024-01-21 LLMRA: Multi-modal Large Language Model based Restoration Assistant Xiaoyu Jin et.al. 2401.11401 null
2024-01-19 MixNet: Towards Effective and Efficient UHD Low-Light Image Enhancement Chen Wu et.al. 2401.10666 link
2024-01-03 Image Restoration: A Comparative Analysis of Image De noising Using Different Spatial Filtering Techniques E. G. Onyedinma et.al. 2401.09460 null
2024-01-16 Deep Linear Array Pushbroom Image Restoration: A Degradation Pipeline and Jitter-Aware Restoration Network Zida Chen et.al. 2401.08171 link
2024-01-12 LiDAR Depth Map Guided Image Compression Model Alessandro Gnutti et.al. 2401.06517 null
2024-01-10 Content-Aware Depth-Adaptive Image Restoration Tom Richard Vargis et.al. 2401.05049 null
2024-01-07 Towards Effective Multiple-in-One Image Restoration: A Sequential and Prompt Learning Strategy Xiangtao Kong et.al. 2401.03379 link
2024-01-06 MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and Beyond Yupei Lin et.al. 2401.03221 null
2024-01-05 Analysis of a wavelet frame based two-scale model for enhanced edges Bin Dong et.al. 2401.02688 null
2024-01-04 Enhancing RAW-to-sRGB with Decoupled Style Structure in Fourier Domain Xuanhua He et.al. 2401.02161 link
2024-01-01 Bracketing is All You Need: Unifying Image Restoration and Enhancement Tasks with Multi-Exposure Images Zhilu Zhang et.al. 2401.00766 link
2023-12-31 UGPNet: Universal Generative Prior for Image Restoration Hwayoon Lee et.al. 2401.00370 null
2023-12-28 Improving Image Restoration through Removing Degradations in Textual Representations Jingbo Lin et.al. 2312.17334 link
2023-12-28 Personalized Restoration via Dual-Pivot Tuning Pradyumna Chari et.al. 2312.17234 null
2023-12-28 Restoration by Generation with Constrained Priors Zheng Ding et.al. 2312.17161 null
2024-01-10 DarkShot: Lighting Dark Images with Low-Compute and High-Quality Jiazhang Zheng et.al. 2312.16805 null
2023-12-27 Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation Rongyu Zhang et.al. 2312.16610 null
2023-12-27 Image Restoration by Denoising Diffusion Models with Iteratively Preconditioned Guidance Tomer Garber et.al. 2312.16519 link
2023-12-25 Rotation Equivariant Proximal Operator for Deep Unfolding Methods in Image Restoration Jiahong Fu et.al. 2312.15701 link
2023-12-25 MuLA-GAN: Multi-Level Attention GAN for Enhanced Underwater Visibility Ahsan Baidar Bakht et.al. 2312.15633 null
2023-12-24 Perception-Distortion Balanced Super-Resolution: A Multi-Objective Optimization Perspective Lingchen Sun et.al. 2312.15408 link
2023-12-19 Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion Fan Zhang et.al. 2312.12471 link
2023-12-18 TIP: Text-Driven Image Processing with Semantic and Restoration Instructions Chenyang Qi et.al. 2312.11595 null
2023-12-17 Bengali License Plate Recognition: Unveiling Clarity with CNN and GFP-GAN Noushin Afrin et.al. 2312.10701 link
2023-12-16 Image Restoration Through Generalized Ornstein-Uhlenbeck Bridge Conghan Yue et.al. 2312.10299 link
2023-12-15 Image Deblurring using GAN Zhengdong Li et.al. 2312.09496 null
2023-12-12 AdaptIR: Parameter Efficient Multi-task Adaptation for Pre-trained Image Restoration Models Hang Guo et.al. 2312.08881 link
2023-12-14 Guided Image Restoration via Simultaneous Feature and Image Guided Fusion Xinyi Liu et.al. 2312.08853 null
2023-12-16 VQCNIR: Clearer Night Image Restoration with Vector-Quantized Codebook Wenbin Zou et.al. 2312.08606 link
2023-12-12 Uncertainty Visualization via Low-Dimensional Posterior Projections Omer Yair et.al. 2312.07804 link
2023-12-12 Hyper-Restormer: A General Hyperspectral Image Restoration Transformer for Remote Sensing Imaging Yo-Yu Lai et.al. 2312.07016 null
2023-12-12 WaterHE-NeRF: Water-ray Tracing Neural Radiance Fields for Underwater Scene Reconstruction Jingchun Zhou et.al. 2312.06946 null
2023-12-11 Textual Prompt Guided Image Restoration Qiuhai Yan et.al. 2312.06162 link
2023-12-08 Fine Dense Alignment of Image Bursts through Camera Pose and Depth Estimation Bruno Lecouat et.al. 2312.05190 null
2023-12-08 Prompt-In-Prompt Learning for Universal Image Restoration Zilong Li et.al. 2312.05038 link
2023-12-08 Decoupling Degradation and Content Processing for Adverse Weather Image Restoration Xi Wang et.al. 2312.05006 null
2023-12-06 Training Neural Networks on RAW and HDR Images for Restoration Tasks Lei Luo et.al. 2312.03640 link
2023-12-05 Multimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and Fidelity for All-in-One Image Restoration Yuang Ai et.al. 2312.02918 null
2023-12-05 Deep-learning-driven end-to-end metalens imaging Joonhyuk Seo et.al. 2312.02669 link
2023-12-02 Exploiting Diffusion Priors for All-in-One Image Restoration Yuanbiao Gou et.al. 2312.02197 link
2023-12-05 Multi-task Image Restoration Guided By Robust DINO Features Xin Lin et.al. 2312.01677 null
2023-12-05 T3D: Towards 3D Medical Image Understanding through Vision-Language Pre-training Che Liu et.al. 2312.01529 null
2023-12-03 An Augmented Lagrangian Primal-Dual Semismooth Newton Method for Multi-Block Composite Optimization Zhanwang Deng et.al. 2312.01273 null
2023-12-01 Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution Xi Yang et.al. 2312.00853 link
2023-11-30 A Novel Variational Approach for Multiphoton Microscopy Image Restoration: from PSF Estimation to 3D Deconvolution Julien Ajdenbaum et.al. 2311.18386 null
2023-11-29 Variational Bayes image restoration with compressive autoencoders Maud Biquard et.al. 2311.17744 null
2023-11-29 Improving Stability during Upsampling – on the Importance of Spatial Context Shashank Agnihotri et.al. 2311.17524 null
2023-11-28 Wavelet-based Fourier Information Interaction with Frequency Diffusion Adjustment for Underwater Image Restoration Chen Zhao et.al. 2311.16845 link
2023-11-28 Decomposer: Semi-supervised Learning of Image Restoration and Image Decomposition Boris Meinardus et.al. 2311.16829 null
2023-11-28 Full-resolution MLPs Empower Medical Dense Prediction Mingyuan Meng et.al. 2311.16707 link
2023-11-27 Joint Deep Image Restoration and Unsupervised Quality Assessment Hakan Emre Gedik et.al. 2311.16372 null
2023-11-26 FLAIR: A Conditional Diffusion Framework with Applications to Face Video Restoration Zihao Zou et.al. 2311.15445 null
2023-11-20 Clarity ChatGPT: An Interactive and Adaptive Processing System for Image Restoration and Enhancement Yanyan Wei et.al. 2311.11695 null
2023-11-20 Reti-Diff: Illumination Degradation Image Restoration with Retinex-based Latent Diffusion Model Chunming He et.al. 2311.11638 link
2023-11-20 Deep Equilibrium Diffusion Restoration with Parallel Sampling Jiezhang Cao et.al. 2311.11600 link
2023-11-14 The Perception-Robustness Tradeoff in Deterministic Image Restoration Guy Ohayon et.al. 2311.09253 null
2023-11-09 Dynamic Association Learning of Self-Attention and Convolution in Image Restoration Kui Jiang et.al. 2311.05147 null
2023-11-08 LuminanceL1Loss: A loss function which measures percieved brightness and colour differences Dominic De Jonge et.al. 2311.04614 null
2023-11-21 Energy-Calibrated VAE with Test Time Free Lunch Yihong Luo et.al. 2311.04071 link
2023-11-07 Constrained Regularization by Denoising with Automatic Parameter Selection Pasquale Cascarano et.al. 2311.03819 null
2023-11-22 Pelvic floor MRI segmentation based on semi-supervised deep learning Jianwei Zuo et.al. 2311.03105 null
2023-11-06 A New Extrapolation Economy Cascadic Multigrid Method for Image Restoration Problems Zhaoteng Chu et.al. 2311.03010 null
2023-11-08 Deep Image Semantic Communication Model for Artificial Intelligent Internet of Things Li Ping Qian et.al. 2311.02926 link
2023-11-03 Cascadic Tensor Multigrid Method and Economic Cascadic Tensor Multigrid Method for Image Restoration Problems Ziqi Yan et.al. 2311.01924 null
2023-11-02 Convergent plug-and-play with proximal denoiser and unconstrained regularization parameter Samuel Hurault et.al. 2311.01216 null
2023-10-31 Image Restoration with Point Spread Function Regularization and Active Learning Peng Jia et.al. 2311.00186 null
2023-10-27 Always Clear Days: Degradation Type and Severity Aware All-In-One Adverse Weather Removal Yu-Wei Chen et.al. 2310.18293 link
2023-10-24 From Posterior Sampling to Meaningful Diversity in Image Restoration Noa Cohen et.al. 2310.16047 null
2023-10-19 Neural Degradation Representation Learning for All-In-One Image Restoration Mingde Yao et.al. 2310.12848 link
2023-10-18 A Comparative Study of Image Restoration Networks for General Backbone Network Design Xiangyu Chen et.al. 2310.11881 link
2023-10-16 Unifying Image Processing as Visual Prompting Question Answering Yihao Liu et.al. 2310.10513 null
2023-11-19 AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion Yitong Jiang et.al. 2310.10123 null
2023-10-12 Frequency-Aware Re-Parameterization for Over-Fitting Based Image Compression Yun Ye et.al. 2310.08068 null
2023-10-10 Tweedie Moment Projected Diffusions For Inverse Problems Benjamin Boys et.al. 2310.06721 null
2023-10-06 Degradation-Aware Self-Attention Based Transformer for Blind Image Super-Resolution Qingguo Liu et.al. 2310.04180 link
2023-11-07 Deformation-Invariant Neural Network and Its Applications in Distorted Image Restoration and Analysis Han Zhang et.al. 2310.02641 null
2023-10-03 Leveraging Classic Deconvolution and Feature Extraction in Zero-Shot Image Restoration Tomáš Chobola et.al. 2310.02097 link
2023-10-02 A Restoration Network as an Implicit Prior Yuyang Hu et.al. 2310.01391 null
2023-10-02 Controlling Vision-Language Models for Universal Image Restoration Ziwei Luo et.al. 2310.01018 link
2023-10-02 JPEG Information Regularized Deep Image Prior for Denoising Tsukasa Takagi et.al. 2310.00894 null
2023-10-22 Guided Frequency Loss for Image Restoration Bilel Benjdira et.al. 2309.15563 null
2023-09-27 Uncertainty Quantification via Neural Posterior Principal Components Elias Nehme et.al. 2309.15533 null
2023-10-09 Survey on Deep Face Restoration: From Non-blind to Blind and Beyond Wenjie Li et.al. 2309.15490 link
2023-09-21 License Plate Super-Resolution Using Diffusion Models Sawsan AlHalawani et.al. 2309.12506 null
2023-09-21 Deshadow-Anything: When Segment Anything Model Meets Zero-shot shadow removal Xiao Feng Zhang et.al. 2309.11715 null
2023-09-19 Local Lipschitz continuity for energy integrals with slow growth and lower order terms Michela Eleuteri et.al. 2309.10727 null
2023-09-19 Reconstruct-and-Generate Diffusion Model for Detail-Preserving Image Denoising Yujin Wang et.al. 2309.10714 null
2023-09-16 AOSR-Net: All-in-One Sandstorm Removal Network Yazhong Si et.al. 2309.08838 null
2023-09-14 A Multi-scale Generalized Shrinkage Threshold Network for Image Blind Deblurring in Remote Sensing Yujie Feng et.al. 2309.07524 null
2023-09-13 FAIR: Frequency-aware Image Restoration for Industrial Visual Anomaly Detection Tongkun Liu et.al. 2309.07068 link
2023-09-12 Learning from History: Task-agnostic Model Contrastive Learning for Image Restoration Gang Wu et.al. 2309.06023 link
2023-09-11 HAT: Hybrid Attention Transformer for Image Restoration Xiangyu Chen et.al. 2309.05239 link
2023-10-10 Prompt-based Ingredient-Oriented All-in-One Image Restoration Hu Gao et.al. 2309.03063 link
2023-09-05 SAM-Deblur: Let Segment Anything Boost Image Deblurring Siwei Li et.al. 2309.02270 link
2023-09-05 Advanced Underwater Image Restoration in Complex Illumination Conditions Yifan Song et.al. 2309.02217 null
2023-09-04 Memory augment is All You Need for image restoration Xiao Feng Zhang et.al. 2309.01377 link
2023-09-04 Restoration Guarantee of Image Inpainting via Low Rank Patch Matrix Completion Jian-Feng Cai et.al. 2309.01328 null
2023-09-03 Holistic Dynamic Frequency Transformer for Image Fusion and Exposure Correction Xiaoke Shang et.al. 2309.01183 null
2023-08-29 DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior Xinqi Lin et.al. 2308.15070 link
2023-09-05 MetaWeather: Few-Shot Weather-Degraded Image Restoration via Degradation Pattern Matching Youngrae Kim et.al. 2308.14334 link
2023-08-27 Hierarchical Contrastive Learning for Pattern-Generalizable Image Corruption Detection Xin Feng et.al. 2308.14061 link
2023-08-25 Residual Denoising Diffusion Models Jiawei Liu et.al. 2308.13712 link
2023-08-24 MOFA: A Model Simplification Roadmap for Image Restoration on Mobile Devices Xiangyu Chen et.al. 2308.12494 link
2023-08-23 Synergistic Multiscale Detail Refinement via Intrinsic Supervision for Underwater Image Enhancement Dehuan Zhang et.al. 2308.11932 link
2023-08-20 Blind Face Restoration for Under-Display Camera via Dictionary Guided Transformer Jingfan Tan et.al. 2308.10196 null
2023-08-22 WMFormer++: Nested Transformer for Visible Watermark Removal via Implict Joint Learning Dongjian Huo et.al. 2308.10195 null
2023-08-18 Diffusion Models for Image Restoration and Enhancement – A Comprehensive Survey Xin Li et.al. 2308.09388 link
2023-08-29 Learning A Coarse-to-Fine Diffusion Transformer for Image Restoration Liyan Wang et.al. 2308.08730 link
2023-08-08 Under-Display Camera Image Restoration with Scattering Effect Binbin Song et.al. 2308.04163 link
2023-08-06 Nest-DGIL: Nesterov-optimized Deep Geometric Incremental Learning for CS Image Reconstruction Xiaohong Fan et.al. 2308.03807 link
2023-08-06 PNN: From proximal algorithms to robust unfolded image denoising networks and Plug-and-Play methods Hoang Trieu Vy Le et.al. 2308.03139 null
2023-08-06 All-in-one Multi-degradation Image Restoration Network via Hierarchical Degradation Representation Cheng Zhang et.al. 2308.03021 null
2023-08-06 Recurrent Spike-based Image Restoration under General Illumination Lin Zhu et.al. 2308.03018 link
2023-08-01 Decomposition Ascribed Synergistic Learning for Unified Image Restoration Jinghao Zhang et.al. 2308.00759 null
2023-07-27 The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation Lingdong Kong et.al. 2307.15061 link
2023-07-26 SuperInpaint: Learning Detail-Enhanced Attentional Implicit Representation for Super-resolutional Image Inpainting Canyu Zhang et.al. 2307.14489 null
2023-08-22 Phenotype-preserving metric design for high-content image reconstruction by generative inpainting Vaibhav Sharma et.al. 2307.14436 link
2023-07-25 On the unreasonable vulnerability of transformers for image restoration – and an easy fix Shashank Agnihotri et.al. 2307.13856 null
2023-07-24 A Theoretically Guaranteed Quaternion Weighted Schatten p-norm Minimization Method for Color Image Restoration Qing-Hua Zhang et.al. 2307.12656 link
2023-07-20 Physics-Driven Turbulence Image Restoration with Stochastic Refinement Ajay Jaiswal et.al. 2307.10603 link
2023-07-19 NTIRE 2023 Quality Assessment of Video Enhancement Challenge Xiaohong Liu et.al. 2307.09729 null
2023-07-18 Unleashing the Imagination of Text: A Novel Framework for Text-to-image Person Retrieval via Exploring the Power of Words Delong Liu et.al. 2307.09059 link
2023-07-18 Soft-IntroVAE for Continuous Latent space Image Super-Resolution Zhi-Song Liu et.al. 2307.09008 null
2023-07-16 LUCYD: A Feature-Driven Richardson-Lucy Deconvolution Network Tomáš Chobola et.al. 2307.07998 link
2023-07-15 DRM-IR: Task-Adaptive Deep Unfolding Network for All-In-One Image Restoration Yuanshuo Cheng et.al. 2307.07688 null
2023-07-12 Latent Graph Attention for Enhanced Spatial Context Ayush Singh et.al. 2307.04149 null
2023-06-29 FarSight: A Physics-Driven Whole-Body Biometric System at Large Distance and Altitude Feng Liu et.al. 2306.17206 null
2023-06-27 Cutting-Edge Techniques for Depth Map Super-Resolution Ryan Peterson et.al. 2306.15244 null
2023-06-23 ProRes: Exploring Degradation-aware Visual Prompt for Universal Image Restoration Jiaqi Ma et.al. 2306.13653 link
2023-06-22 PromptIR: Prompting for All-in-One Blind Image Restoration Vaishnav Potlapalli et.al. 2306.13090 link
2023-06-22 Restoration of the JPEG Maximum Lossy Compressed Face Images with Hourglass Block based on Early Stopping Discriminator Jongwook Si et.al. 2306.12757 null
2023-06-21 Accelerating Multiframe Blind Deconvolution via Deep Learning A. Asensio Ramos et.al. 2306.12078 link
2023-06-21 TransRef: Multi-Scale Reference Embedding Transformer for Reference-Guided Image Inpainting Liang Liao et.al. 2306.11528 link
2023-07-31 Enlighten Anything: When Segment Anything Model Meets Low-Light Image Enhancement Qihan Zhao et.al. 2306.10286 link
2023-06-15 Exploring the Application of Large-scale Pre-trained Models on Adverse Weather Removal Zhentao Tan et.al. 2306.09008 null
2023-06-14 Investigation of the Challenges of Underwater-Visual-Monocular-SLAM Michele Grimaldi et.al. 2306.08738 null
2023-06-13 Learning Image-Adaptive Codebooks for Class-Agnostic Image Restoration Kechun Liu et.al. 2306.06513 null
2023-06-09 Illumination Controllable Dehazing Network based on Unsupervised Retinex Embedding Jie Gui et.al. 2306.05675 link
2023-06-08 HQ-50K: A Large-scale, High-quality Dataset for Image Restoration Qinhong Yang et.al. 2306.05390 link
2023-06-06 BokehOrNot: Transforming Bokeh Effect with Image Transformer and Lens Metadata Embedding Zhihao Yang et.al. 2306.04032 link
2023-06-06 Convergent Bregman Plug-and-Play Image Restoration for Poisson Inverse Problems Samuel Hurault et.al. 2306.03466 null
2023-06-05 Zero shot framework for satellite image restoration Praveen Kandula et.al. 2306.02921 null
2023-06-04 ESTISR: Adapting Efficient Scene Text Image Super-resolution for Real-Scenes Minghao Fu et.al. 2306.02443 link
2023-06-04 Deep Optimal Transport: A Practical Algorithm for Photo-realistic Image Restoration Theo Adrai et.al. 2306.02342 link
2023-06-03 Unsupervised Low Light Image Enhancement Using SNR-Aware Swin Transformer Zhijian Luo et.al. 2306.02082 null
2023-06-02 Fast and Interpretable Nonlocal Neural Networks for Image Denoising via Group-Sparse Convolutional Dictionary Learning Nikola Janjušević et.al. 2306.01950 link
2023-06-02 Counting Crowds in Bad Weather Zhi-Kai Huang et.al. 2306.01209 null
2023-06-01 Wavelet Image Restoration Using Multifractal Priors Karl Young et.al. 2306.00309 null
2023-06-01 Low-Light Image Enhancement with Wavelet-based Diffusion Models Hai Jiang et.al. 2306.00306 link
2023-05-31 A Unified Conditional Framework for Diffusion-based Image Restoration Yi Zhang et.al. 2305.20049 null
2023-05-30 Wide & deep learning for spatial & intensity adaptive image restoration Yadong Wang et.al. 2305.18708 link
2023-05-29 GridFormer: Residual Dense Transformer with Grid Structure for Image Restoration in Adverse Weather Conditions Tao Wang et.al. 2305.17863 link
2023-05-28 PND-Net: Physics based Non-local Dual-domain Network for Metal Artifact Reduction Jinqiu Xia et.al. 2305.17778 link
2023-05-27 Rethinking PRL: A Multiscale Progressively Residual Learning Network for Inverse Halftoning Feiyu Li et.al. 2305.17355 link
2023-05-24 Learning INR for Event-guided Rolling Shutter Frame Correction, Deblur, and Interpolation Yunfan Lu et.al. 2305.15078 link
2023-05-23 Generalized Expectation Maximization Framework for Blind Image Super Resolution Yuxiao Li et.al. 2305.13880 null
2023-05-23 WaveDM: Wavelet-Based Diffusion Models for Image Restoration Yi Huang et.al. 2305.13819 link
2023-05-23 A Dive into SAM Prior in Image Restoration Zeyu Xiao et.al. 2305.13620 null
2023-05-22 Restore Anything Pipeline: Segment Anything Meets Image Restoration Jiaxi Jiang et.al. 2305.13093 link
2023-05-19 SIDAR: Synthetic Image Dataset for Alignment & Restoration Monika Kwiatkowski et.al. 2305.12036 link
2023-05-15 Neural information coding for efficient spike-based image denoising Andrea Castagnetti et.al. 2305.11898 null
2023-05-22 RAMiT: Reciprocal Attention Mixing Transformer for Lightweight Image Restoration Haram Choi et.al. 2305.11474 link
2023-05-17 Principal Uncertainty Quantification with Spatial Correlation for Image Restoration Problems Omer Belhasin et.al. 2305.10124 link
2023-05-17 Restoring Images Captured in Arbitrary Hybrid Adverse Weather Conditions in One Go Ye-Cong Wan et.al. 2305.09996 link
2023-05-15 Denoising Diffusion Models for Plug-and-Play Image Restoration Yuanzhi Zhu et.al. 2305.08995 link
2023-05-15 Toward Moiré-Free and Detail-Preserving Demosaicking Xuanchen Li et.al. 2305.08585 null
2023-05-13 A Two-Stage Real Image Deraining Method for GT-RAIN Challenge CVPR 2023 Workshop UG $^{\textbf{2}}$ + Track 3 Yun Guo et.al. 2305.07979 link

SAM

Publish Date Title Authors PDF Code
2025-06-30 Foundation Models for Zero-Shot Segmentation of Scientific Images without AI-Ready Data Shubhabrata Mukherjee et.al. 2506.24039 null
2025-06-30 Diffusion Model-based Data Augmentation Method for Fetal Head Ultrasound Segmentation Fangyijie Wang et.al. 2506.23664 null
2025-07-01 SurgTPGS: Semantic 3D Surgical Scene Understanding with Text Promptable Gaussian Splatting Yiming Huang et.al. 2506.23309 null
2025-06-29 DC-TTA: Divide-and-Conquer Framework for Test-Time Adaptation of Interactive Segmentation Jihun Kim et.al. 2506.23104 null
2025-06-28 VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding Minchao Jiang et.al. 2506.22799 null
2025-06-26 Detection of Breast Cancer Lumpectomy Margin with SAM-incorporated Forward-Forward Contrastive Learning Tyler Ward et.al. 2506.21006 null
2025-06-25 AI-Driven MRI-based Brain Tumour Segmentation Benchmarking Connor Ludwig et.al. 2506.20786 null
2025-06-24 SAM2-SGP: Enhancing SAM2 for Medical Image Segmentation via Support-Set Guided Prompting Yang Xing et.al. 2506.19658 null
2025-06-24 Open-Vocabulary Camouflaged Object Segmentation with Cascaded Vision Language Models Kai Zhao et.al. 2506.19300 null
2025-06-24 PicoSAM2: Low-Latency Segmentation In-Sensor for Edge Vision Applications Pietro Bonazzi et.al. 2506.18807 null
2025-06-23 MedSeg-R: Medical Image Segmentation with Clinical Reasoning Hao Shao et.al. 2506.18669 null
2025-06-23 Segment Anything for Satellite Imagery: A Strong Baseline and a Regional Dataset for Automatic Field Delineation Carmelo Scribano et.al. 2506.16318 link
2025-06-16 MorphSAM: Learning the Morphological Prompts from Atlases for Spine Image Segmentation Dingwei Fan et.al. 2506.13094 null
2025-06-13 Prohibited Items Segmentation via Occlusion-aware Bilayer Modeling Yunhan Ren et.al. 2506.11661 link
2025-06-12 Generalist Models in Medical Image Segmentation: A Survey and Performance Comparison with Task-Specific Approaches Andrea Moglia et.al. 2506.10825 null
2025-06-12 Semantic Localization Guiding Segment Anything Model For Reference Remote Sensing Image Segmentation Shuyang Li et.al. 2506.10503 null
2025-06-11 Q-SAM2: Accurate Quantization for Segment Anything Model 2 Nicola Farronato et.al. 2506.09782 null
2025-06-11 SRPL-SFDA: SAM-Guided Reliable Pseudo-Labels for Source-Free Domain Adaptation in Medical Image Segmentation Xinya Liu et.al. 2506.09403 link
2025-06-10 SAMSelect: A Spectral Index Search for Marine Debris Visualization using Segment Anything Joost van Dalen et.al. 2506.08613 link
2025-06-10 Discovery of Odd Radio Circles and Other Peculiars in the First Year of the EMU Survey using Object Detection Nikhel Gupta et.al. 2506.08439 null
2025-06-09 Design and Evaluation of Deep Learning-Based Dual-Spectrum Image Fusion Methods Beining Xu et.al. 2506.07779 null
2025-06-09 OpenSplat3D: Open-Vocabulary 3D Instance Segmentation using Gaussian Splatting Jens Piekenbrinck et.al. 2506.07697 null
2025-06-06 Textile Analysis for Recycling Automation using Transfer Learning and Zero-Shot Foundation Models Yannis Spyridis et.al. 2506.06569 null
2025-06-03 Talk2SAM: Text-Guided Semantic Enhancement for Complex-Shaped Object Segmentation Luka Vetoshkin et.al. 2506.05396 null
2025-06-05 SAM-aware Test-time Adaptation for Universal Medical Image Segmentation Jianghao Wu et.al. 2506.05221 null
2025-06-05 Bringing SAM to new heights: Leveraging elevation data for tree crown segmentation from drone imagery Mélisande Teng et.al. 2506.04970 null
2025-06-03 Extremely large oblate deformation of the first excited state in $^{12}$ C: a new challenge to modern nuclear theory C. Ngwetsheni et.al. 2506.03236 null
2025-06-03 Zero-Shot Tree Detection and Segmentation from Aerial Forest Imagery Michelle Chen et.al. 2506.03114 link
2025-06-05 GaRA-SAM: Robustifying Segment Anything Model with Gated-Rank Adaptation Sohyun Lee et.al. 2506.02882 null
2025-06-03 Hierarchical Self-Prompting SAM: A Prompt-Free Medical Image Segmentation Framework Mengmeng Zhang et.al. 2506.02854 null
2025-06-03 SAMJ: Fast Image Annotation on ImageJ/Fiji via Segment Anything Model Carlos Garcia-Lopez-de-Haro et.al. 2506.02783 null
2025-06-02 SAM2-LOVE: Segment Anything Model 2 in Language-aided Audio-Visual Scenes Yuji Wang et.al. 2506.01558 null
2025-06-02 Computing Diverse and Nice Triangulations Waldo Gálvez et.al. 2506.01323 null
2025-06-02 SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training Cost Haiyang Mei et.al. 2506.01304 link
2025-06-01 AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting Yuyuan Liu et.al. 2506.01015 link
2025-05-30 KairosAD: A SAM-Based Model for Industrial Anomaly Detection on Embedded Devices Uzair Khan et.al. 2505.24334 link
2025-05-28 SAM-R1: Leveraging SAM for Reward Feedback in Multimodal Segmentation via Reinforcement Learning Jiaqi Huang et.al. 2505.22596 null
2025-05-28 Adapting Segment Anything Model for Power Transmission Corridor Hazard Segmentation Hang Chen et.al. 2505.22105 link
2025-06-03 InfoSAM: Fine-Tuning the Segment Anything Model from An Information-Theoretic Perspective Yuanhong Zhang et.al. 2505.21920 null
2025-05-27 Geometric Feature Prompting of Image Segmentation Models Kenneth Ball et.al. 2505.21644 null
2025-05-29 Zero-Shot Pseudo Labels Generation Using SAM and CLIP for Semi-Supervised Semantic Segmentation Nagito Saito et.al. 2505.19846 null
2025-05-25 Domain and Task-Focused Example Selection for Data-Efficient Contrastive Medical Image Segmentation Tyler Ward et.al. 2505.19208 link
2025-05-24 SAMA: Towards Multi-Turn Referential Grounded Video Chat with Large Language Models Ye Sun et.al. 2505.18812 null
2025-05-23 Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking Cheng-Yen Yang et.al. 2505.18111 null
2025-05-22 Assessing the generalization performance of SAM for ureteroscopy scene understanding Martin Villagrana et.al. 2505.17210 null
2025-05-22 TextureSAM: Towards a Texture Aware Foundation Model for Segmentation Inbal Cohen et.al. 2505.16540 null
2025-05-21 VP Lab: a PEFT-Enabled Visual Prompting Laboratory for Semantic Segmentation Niccolo Avogaro et.al. 2505.15592 null
2025-05-21 UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark Dataset Hua Li et.al. 2505.15581 link
2025-05-21 Zero-Shot Gaze-based Volumetric Medical Image Segmentation Tatyana Shmykova et.al. 2505.15256 null
2025-05-19 IPENS:Interactive Unsupervised Framework for Rapid Plant Phenotyping Extraction via NeRF-SAM2 Fusion Wentao Song et.al. 2505.13633 null
2025-05-20 Industrial Synthetic Segment Pre-training Shinichi Mae et.al. 2505.13099 null
2025-05-17 Beluga Whale Detection from Satellite Imagery with Point Labels Yijie Zheng et.al. 2505.12066 link
2025-05-17 AoP-SAM: Automation of Prompts for Efficient Segmentation Yi Chen et.al. 2505.11980 null
2025-05-16 SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision Utsav Rai et.al. 2505.11439 null
2025-05-16 Unifying Segment Anything in Microscopy with Multimodal Large Language Model Manyu Li et.al. 2505.10769 null
2025-05-14 Promoting SAM for Camouflaged Object Detection via Selective Key Point-based Guidance Guoying Liang et.al. 2505.09123 null
2025-05-13 Parameter-Efficient Fine-Tuning of Vision Foundation Model for Forest Floor Segmentation from UAV Imagery Mohammad Wasil et.al. 2505.08932 link
2025-05-13 ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking Haofeng Liu et.al. 2505.08581 link
2025-05-14 Leveraging Segment Anything Model for Source-Free Domain Adaptation via Dual Feature Guided Auto-Prompting Zheang Huai et.al. 2505.08527 link
2025-05-12 ABS-Mamba: SAM2-Driven Bidirectional Spiral Mamba Network for Medical Image Translation Feng Yuan et.al. 2505.07687 null
2025-05-12 MAIS: Memory-Attention for Interactive Segmentation Mauricio Orbes-Arteaga et.al. 2505.07511 null
2025-05-11 MarkMatch: Same-Hand Stuffing Detection Fei Zhao et.al. 2505.07032 null
2025-05-10 Causal Prompt Calibration Guided Segment Anything Model for Open-Vocabulary Multi-Entity Segmentation Jingyao Wang et.al. 2505.06524 link
2025-05-09 The 76Cu conundrum remains unsolved B. Olaizola et.al. 2505.06400 null
2025-05-09 Adapting a Segmentation Foundation Model for Medical Image Classification Pengfei Gu et.al. 2505.06217 null
2025-05-09 UncertainSAM: Fast and Efficient Uncertainty Quantification of the Segment Anything Model Timo Kaiser et.al. 2505.05049 link
2025-05-08 Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization Xi Yang et.al. 2505.04905 null
2025-05-08 Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model Navin Ranjan et.al. 2505.04861 null
2025-05-07 Cross-organ all-in-one parallel compressed sensing magnetic resonance imaging Baoshun Shi et.al. 2505.04658 link
2025-05-09 MAISY: Motion-Aware Image SYnthesis for Medical Image Motion Correction Andrew Zhang et.al. 2505.04105 null
2025-05-06 CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting Huawei Sun et.al. 2505.03679 null
2025-05-04 Segment Any RGB-Thermal Model with Language-aided Distillation Dong Xing et.al. 2505.01950 null
2025-05-03 Accelerating Volumetric Medical Image Annotation via Short-Long Memory SAM 2 Yuwen Chen et.al. 2505.01854 link
2025-04-30 MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection Qiushi Yang et.al. 2505.00739 null
2025-05-05 AI-Driven Segmentation and Analysis of Microbial Cells Shuang Zhang et.al. 2505.00578 null
2025-04-30 SAM4EM: Efficient memory-based two stage prompt-free segment anything model adapter for complex 3D neuroscience electron microscopy stacks Uzair Shah et.al. 2504.21544 link
2025-04-30 UniBiomed: A Universal Foundation Model for Grounded Biomedical Image Interpretation Linshan Wu et.al. 2504.21336 link
2025-04-29 RadSAM: Segmenting 3D radiological images with a 2D promptable model Julien Khlaut et.al. 2504.20837 null
2025-04-29 SAM-Guided Robust Representation Learning for One-Shot 3D Medical Image Segmentation Jia Wang et.al. 2504.20501 null
2025-04-26 Reservoir-enhanced Segment Anything Model for Subsurface Diagnosis Xiren Zhou et.al. 2504.18802 link
2025-04-25 RSFR: A Coarse-to-Fine Reconstruction Framework for Diffusion Tensor Cardiac MRI with Semantic-Aware Refinement Jiahao Huang et.al. 2504.18520 null
2025-04-23 Prompt-Tuning SAM: From Generalist to Specialist with only 2048 Parameters and 16 Training Images Tristan Piater et.al. 2504.16739 null
2025-04-23 RGB-D Video Object Segmentation via Enhanced Multi-store Feature Memory Boyue Xu et.al. 2504.16471 null
2025-04-19 Segment Any Crack: Deep Semantic Segmentation Adaptation for Crack Detection Ghodsiyeh Rostami et.al. 2504.14138 null
2025-04-18 HSACNet: Hierarchical Scale-Aware Consistency Regularized Semi-Supervised Change Detection Qi’ao Xu et.al. 2504.13428 null
2025-04-24 Putting the Segment Anything Model to the Test with 3D Knee MRI - A Comparison with State-of-the-Art Performance Oliver Mills et.al. 2504.13340 link
2025-04-17 SAM-Based Building Change Detection with Distribution-Aware Fourier Adaptation and Edge-Constrained Warping Yun-Cheng Li et.al. 2504.12619 null
2025-04-17 Contour Field based Elliptical Shape Prior for the Segment Anything Model Xinyu Zhao et.al. 2504.12556 null
2025-04-17 DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency Mengshi Qi et.al. 2504.12080 link
2025-04-14 Zero-shot Autonomous Microscopy for Scalable and Intelligent Characterization of 2D Materials Jingyun Yang et.al. 2504.10281 null
2025-04-13 Mixture-of-Shape-Experts (MoSE): End-to-End Shape Dictionary Framework to Prompt SAM for Generalizable Medical Segmentation Jia Wei et.al. 2504.09601 null
2025-04-12 AerOSeg: Harnessing SAM for Open-Vocabulary Segmentation in Remote Sensing Images Saikat Dutta et.al. 2504.09203 null
2025-04-11 Parameter-Free Fine-tuning via Redundancy Elimination for Vision Foundation Models Jiahuan Long et.al. 2504.08915 null
2025-04-11 Robust SAM: On the Adversarial Robustness of Vision Foundation Models Jiahuan Long et.al. 2504.08906 null
2025-04-11 FMLGS: Fast Multilevel Language Embedded Gaussians for Part-level Interactive Agents Xin Tan et.al. 2504.08581 null
2025-04-11 SynthFM: Training Modality-agnostic Foundation Models for Medical Image Segmentation without Real Medical Data Sourya Sengupta et.al. 2504.08177 null
2025-04-09 Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting Daiwei Zhang et.al. 2504.06978 null
2025-04-09 A Comparison of Deep Learning Methods for Cell Detection in Digital Cytology Marco Acerbis et.al. 2504.06957 link
2025-04-09 MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking Chang Nie et.al. 2504.06863 null
2025-04-08 HRMedSeg: Unlocking High-resolution Medical Image segmentation via Memory-efficient Attention Modeling Qing Xu et.al. 2504.06205 link
2025-04-08 KAN-SAM: Kolmogorov-Arnold Network Guided Segment Anything Model for RGB-T Salient Object Detection Xingyuan Li et.al. 2504.05878 null
2025-04-07 S^4M: Boosting Semi-Supervised Instance Segmentation with SAM Heeji Yoon et.al. 2504.05301 null
2025-04-07 CMaP-SAM: Contraction Mapping Prior for SAM-driven Few-shot Segmentation Shuai Chen et.al. 2504.05049 null
2025-04-05 PIORF: Physics-Informed Ollivier-Ricci Flow for Long-Range Interactions in Mesh Graph Neural Networks Youn-Yeol Yu et.al. 2504.04052 null
2025-04-05 UCS: A Universal Model for Curvilinear Structure Segmentation Dianshuo Li et.al. 2504.04034 null
2025-04-04 MedSAM2: Segment Anything in 3D Medical Images and Videos Jun Ma et.al. 2504.03600 link
2025-04-03 APSeg: Auto-Prompt Model with Acquired and Injected Knowledge for Nuclear Instance Segmentation and Classification Liying Xu et.al. 2504.02222 null
2025-04-02 BiSeg-SAM: Weakly-Supervised Post-Processing Framework for Boosting Binary Segmentation in Segment Anything Models Encheng Su et.al. 2504.01452 null
2025-04-01 CamoSAM2: Motion-Appearance Induced Auto-Refining Prompts for Video Camouflaged Object Detection Xin Zhang et.al. 2504.00375 null
2025-04-01 Hybrid Global-Local Representation with Augmented Spatial Guidance for Zero-Shot Referring Image Segmentation Ting Liu et.al. 2504.00356 link
2025-03-31 SmartScan: An AI-based Interactive Framework for Automated Region Extraction from Satellite Images Savinay Nagendra et.al. 2504.00200 null
2025-04-03 IMPACT: A Generic Semantic Loss for Multimodal Medical Image Registration Valentin Boussot et.al. 2503.24121 link
2025-03-31 MGD-SAM2: Multi-view Guided Detail-enhanced Segment Anything Model 2 for High-Resolution Class-agnostic Segmentation Haoran Shen et.al. 2503.23786 link
2025-03-28 SCHNet: SAM Marries CLIP for Human Parsing Kunliang Liu et.al. 2503.22237 null
2025-03-28 Synergistic Bleeding Region and Point Detection in Surgical Videos Jialun Pei et.al. 2503.22174 null
2025-03-27 Semantic Consistent Language Gaussian Splatting for Point-Level Open-vocabulary Querying Hairong Yin et.al. 2503.21767 null
2025-03-27 AMA-SAM: Adversarial Multi-Domain Alignment of Segment Anything Model for High-Fidelity Histology Nuclei Segmentation Jiahe Qian et.al. 2503.21695 null
2025-03-31 Context-Aware Weakly Supervised Image Manipulation Localization with SAM Refinement Xinghao Wang et.al. 2503.20294 null
2025-03-26 Assessing SAM for Tree Crown Instance Segmentation from Drone Imagery Mélisande Teng et.al. 2503.20199 null
2025-03-25 BiPrompt-SAM: Enhancing Image Segmentation via Explicit Selection between Point and Text Prompts Suzhe Xu et.al. 2503.19769 null
2025-03-24 Towards Human-Understandable Multi-Dimensional Concept Discovery Arne Grobrügge et.al. 2503.18629 link
2025-03-26 PG-SAM: Prior-Guided SAM with Medical for Multi-organ Segmentation Yiheng Zhong et.al. 2503.18227 link
2025-03-23 Cost-effective multi-fidelity strategy for the optimization of high-Reynolds number turbine flows guided by LES Camille Matar et.al. 2503.17977 null
2025-03-18 Organ-aware Multi-scale Medical Image Segmentation Using Text Prompt Engineering Wenjie Zhang et.al. 2503.13806 null
2025-03-17 Integrating AI for Human-Centric Breast Cancer Diagnostics: A Multi-Scale and Multi-View Swin Transformer Framework Farnoush Bayatmakou et.al. 2503.13309 null
2025-03-17 3DAxisPrompt: Promoting the 3D Grounding and Reasoning in GPT-4o Dingning Liu et.al. 2503.13185 null
2025-03-17 SAM2 for Image and Video Segmentation: A Comprehensive Survey Zhang Jiaxing et.al. 2503.12781 null
2025-03-16 Segment Any-Quality Images with Generative Latent Space Enhancement Guangqian Guo et.al. 2503.12507 null
2025-03-16 SAM2-ELNet: Label Enhancement and Automatic Annotation for Remote Sensing Segmentation Jianhao Yang et.al. 2503.12404 null
2025-03-15 E-SAM: Training-Free Segment Every Entity Model Weiming Zhang et.al. 2503.12094 null
2025-03-12 NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language Model Yuzhi Lai et.al. 2503.09335 link
2025-03-10 Visual and Text Prompt Segmentation: A Novel Multi-Model Framework for Remote Sensing Xing Zi et.al. 2503.07911 null
2025-03-10 Customized SAM 2 for Referring Remote Sensing Image Segmentation Fu Rong et.al. 2503.07266 null
2025-03-10 Multi-Modal 3D Mesh Reconstruction from Images and Text Melvin Reka et.al. 2503.07190 null
2025-03-10 OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation Ding Zhong et.al. 2503.07098 null
2025-03-20 MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation Chenfei Liao et.al. 2503.06700 null
2025-03-09 SAQ-SAM: Semantically-Aligned Quantization for Segment Anything Model Jing Zhang et.al. 2503.06515 null
2025-03-08 Segment Anything, Even Occluded Wei-En Tai et.al. 2503.06261 null
2025-03-08 Dynamically evolving segment anything model with continuous learning for medical image segmentation Zhaori Liu et.al. 2503.06236 null
2025-03-08 Improving SAM for Camouflaged Object Detection via Dual Stream Adapters Jiaming Liu et.al. 2503.06042 null
2025-03-08 Towards Universal Text-driven CT Image Segmentation Yuheng Li et.al. 2503.06030 null
2025-03-07 S4M: Segment Anything with 4 Extreme Points Adrien Meyer et.al. 2503.05534 null
2025-03-05 Rethinking Few-Shot Medical Image Segmentation by SAM2: A Training-Free Framework with Augmentative Prompting and Dynamic Matching Haiyue Zu et.al. 2503.04826 null
2025-03-06 Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation Aishik Konwer et.al. 2503.04639 null
2025-03-07 GBT-SAM: A Parameter-Efficient Depth-Aware Model for Generalizable Brain tumour Segmentation on mp-MRI Cecilia Diana-Albelda et.al. 2503.04325 link
2025-03-06 WeakMedSAM: Weakly-Supervised Medical Image Segmentation via SAM with Sub-Class Exploration and Prompt Affinity Mining Haoran Wang et.al. 2503.04106 link
2025-03-05 Tackling Few-Shot Segmentation in Remote Sensing via Inpainting Diffusion Model Steve Andreas Immanuel et.al. 2503.03785 link
2025-03-05 AHCPTQ: Accurate and Hardware-Compatible Post-Training Quantization for Segment Anything Model Wenlun Zhang et.al. 2503.03088 null
2025-03-04 Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance Jiayi Zhao et.al. 2503.02581 link
2025-03-04 Semantic Prior Distillation with Vision Foundation Model for Enhanced Rapid Bone Scintigraphy Image Restoration Pengchen Liang et.al. 2503.02321 null
2025-03-03 Every SAM Drop Counts: Embracing Semantic Priors for Multi-Modality Image Fusion and Beyond Guanyao Wu et.al. 2503.01210 null
2025-02-25 An Analysis of Segment Anything 2 Clayton Bromley et.al. 2503.00042 null
2025-02-28 SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models Yichi Zhang et.al. 2502.20749 link
2025-02-27 Energy-carbon comprehensive efficiency evaluation of hydrogen metallurgy system considering low-temperature waste heat recovery Qiang Ji et.al. 2502.20131 null
2025-02-25 VesselSAM: Leveraging SAM for Aortic Vessel Segmentation with LoRA and Atrous Attention Adnan Iltaf et.al. 2502.18185 link
2025-02-23 Lightweight Vision Model-based Multi-user Semantic Communication Systems Feibo Jiang et.al. 2502.16424 null
2025-02-22 USegMix: Unsupervised Segment Mix for Efficient Data Augmentation in Pathology Images Jiamu Wang et.al. 2502.16160 null
2025-02-21 UrbanSAM: Learning Invariance-Inspired Adapters for Segment Anything Models in Urban Construction Chenyu Li et.al. 2502.15199 null
2025-02-16 Surgical Scene Understanding in the Era of Foundation AI Models: A Comprehensive Review Ufaq Khan et.al. 2502.14886 null
2025-02-21 Vision Foundation Models in Medical Image Analysis: Advances and Challenges Pengchen Liang et.al. 2502.14584 null
2025-02-19 MaizeEar-SAM: Zero-Shot Maize Ear Phenotyping Hossein Zaremehrjerdi et.al. 2502.13399 link
2025-02-18 SpeHeatal: A Cluster-Enhanced Segmentation Method for Sperm Morphology Analysis Yi Shi et.al. 2502.13192 link
2025-02-17 Medical Image Registration Meets Vision Foundation Model: Prototype Learning and Contour Awareness Hao Xu et.al. 2502.11440 link
2025-02-17 WRT-SAM: Foundation Model-Driven Segmentation for Generalized Weld Radiographic Testing Yunyi Zhou et.al. 2502.11338 null
2025-02-14 MITO: Enabling Non-Line-of-Sight Perception using Millimeter-waves through Real-World Datasets and Simulation Tools Laura Dodds et.al. 2502.10259 link
2025-02-12 Towards Fine-grained Interactive Segmentation in Images and Videos Yuan Yao et.al. 2502.09660 null
2025-02-10 SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement Yuqi Lin et.al. 2502.06756 link
2025-02-10 FunduSAM: A Specialized Deep Learning Model for Enhanced Optic Disc and Cup Segmentation in Fundus Images Jinchen Yu et.al. 2502.06220 null
2025-02-05 ZISVFM: Zero-Shot Object Instance Segmentation in Indoor Robotic Environments with Vision Foundation Models Ying Zhang et.al. 2502.03266 link
2025-02-04 Rethinking Vision Transformer for Object Centric Foundation Models Manuel Traub et.al. 2502.02763 null
2025-02-04 RFMedSAM 2: Automatic Prompt Refinement for Enhanced Volumetric Medical Image Segmentation with SAM 2 Bin Xie et.al. 2502.02741 null
2025-02-04 IMDPrompter: Adapting SAM to Image Manipulation Detection by Cross-View Automated Prompt Learning Quan Zhang et.al. 2502.02454 null
2025-02-02 SAM-guided Pseudo Label Enhancement for Multi-modal 3D Semantic Segmentation Mingyu Yang et.al. 2502.00960 null
2025-02-02 Vision and Language Reference Prompt into SAM for Few-shot Segmentation Kosuke Sakurai et.al. 2502.00719 link
2025-02-02 Self-Prompt SAM: Medical Image Segmentation via Automatic Prompt SAM Adaptation Bin Xie et.al. 2502.00630 null
2025-02-01 Parameter Efficient Fine-Tuning of Segment Anything Model Carolin Teuber et.al. 2502.00418 link
2025-02-01 Segment Anything for Histopathology Titus Griebel et.al. 2502.00408 link
2025-01-28 Efficient Knowledge Distillation of SAM for Medical Image Segmentation Kunal Dasharath Patil et.al. 2501.16740 null
2025-01-27 CLISC: Bridging clip and sam by enhanced cam for unsupervised brain tumor segmentation Xiaochuan Ma et.al. 2501.16246 null
2025-01-26 Marker Track: Accurate Fiducial Marker Tracking for Evaluation of Residual Motions During Breath-Hold Radiotherapy Aimee Guo et.al. 2501.15660 null
2025-01-27 Gland Segmentation Using SAM With Cancer Grade as a Prompt Yijie Zhu et.al. 2501.14718 null
2025-01-23 MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation Fu Rong et.al. 2501.13667 null
2025-01-23 Auto-Prompting SAM for Weakly Supervised Landslide Extraction Jian Wang et.al. 2501.13426 null
2025-01-21 fabSAM: A Farmland Boundary Delineation Method Based on the Segment Anything Model Yufeng Xie et.al. 2501.12487 null
2025-01-17 Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks Michael Schwingshackl et.al. 2501.10080 link
2025-01-15 Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures Pengru Deng et.al. 2501.09203 null
2025-01-15 Few-Shot Adaptation of Training-Free Foundation Model for 3D Medical Image Segmentation Xingxin He et.al. 2501.09138 null
2025-01-15 SuperSAM: Crafting a SAM Supernetwork via Structured Pruning and Unstructured Parameter Prioritization Waqwoya Abebe et.al. 2501.08504 link
2025-01-13 Guided SAM: Label-Efficient Part Segmentation S. B. van Rooij et.al. 2501.07434 null
2025-01-13 OCORD: Open-Campus Object Removal Dataset Shuo Zhang et.al. 2501.07397 null
2025-01-13 EdgeTAM: On-Device Track Anything Model Chong Zhou et.al. 2501.07256 link
2025-01-12 Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation Zhenyang Feng et.al. 2501.06749 null
2025-01-12 PGP-SAM: Prototype-Guided Prompt Learning for Efficient Few-Shot Medical Image Segmentation Zhonghao Yan et.al. 2501.06692 null
2025-01-10 Weakly Supervised Segmentation of Hyper-Reflective Foci with Compact Convolutional Transformers and SAM2 Olivier Morelle et.al. 2501.05933 null
2025-01-10 Zero-shot Shark Tracking and Biometrics from Aerial Imagery Chinmay K Lalgudi et.al. 2501.05717 null
2025-01-07 MedFocusCLIP : Improving few shot classification in medical datasets using pixel wise attention Aadya Arora et.al. 2501.03839 null
2025-01-07 AutoFish: Dataset and Benchmark for Fine-grained Analysis of Fish Stefan Hein Bengtson et.al. 2501.03767 null
2025-01-06 Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron Microscopy Risha Goel et.al. 2501.03153 link
2025-01-02 ScarNet: A Novel Foundation Model for Automated Myocardial Scar Quantification from LGE in Cardiac MRI Neda Tavakoli et.al. 2501.01372 link
2025-01-02 Evidential Calibrated Uncertainty-Guided Interactive Segmentation paradigm for Ultrasound Images Jiang Shang et.al. 2501.01072 null
2024-12-31 Advanced Lung Nodule Segmentation and Classification for Early Detection of Lung Cancer using SAM and Transfer Learning Asha V et.al. 2501.00586 null
2024-12-31 Is Segment Anything Model 2 All You Need for Surgery Video Segmentation? A Systematic Evaluation Cheng Yuan et.al. 2501.00525 null
2024-12-27 Char-SAM: Turning Segment Anything Model into Scene Text Segmentation Annotator with Character-level Visual Prompts Enze Xie et.al. 2412.19917 null
2024-12-26 When SAM2 Meets Video Shadow and Mirror Detection Leiping Jie et.al. 2412.19293 link
2024-12-28 Optimizing Prompt Strategies for SAM: Advancing lesion Segmentation Across Diverse Medical Imaging Modalities Yuli Wang et.al. 2412.17943 null
2024-12-16 Machine Learning-Based Automated Assessment of Intracorporeal Suturing in Laparoscopic Fundoplication Shekhar Madhav Khairnar et.al. 2412.16195 null
2024-12-18 Memorizing SAM: 3D Medical Segment Anything Model with Memorizing Transformer Xinyuan Shao et.al. 2412.13908 link
2024-12-18 Learnable Prompting SAM-induced Knowledge Distillation for Semi-supervised Medical Image Segmentation Kaiwen Huang et.al. 2412.13742 link
2024-12-17 Fruit Deformity Classification through Single-Input and Multi-Input Architectures based on CNN Models using Real and Synthetic Images Tommy D. Beltran et.al. 2412.12966 null
2024-12-17 Synthetic Data Generation for Anomaly Detection on Table Grapes Ionut Marian Motoi et.al. 2412.12949 link
2024-12-17 SAUGE: Taming SAM for Uncertainty-Aligned Multi-Granularity Edge Detection Xing Liufu et.al. 2412.12892 link
2024-12-17 PolSAM: Polarimetric Scattering Mechanism Informed Segment Anything Model Yuqing Wang et.al. 2412.12737 link
2024-12-17 SEG-SAM: Semantic-Guided SAM for Unified Medical Image Segmentation Shuangping Huang et.al. 2412.12660 null
2024-12-17 SAModified: A Foundation Model-Based Zero-Shot Approach for Refining Noisy Land-Use Land-Cover Maps Sparsh Pekhale et.al. 2412.12552 null
2024-12-16 Adapting Segment Anything Model (SAM) to Experimental Datasets via Fine-Tuning on GAN-based Simulation: A Case Study in Additive Manufacturing Anika Tabassum et.al. 2412.11381 link
2024-12-15 Efficient Quantization-Aware Training on Segment Anything Model in Medical Images and Its Deployment Haisheng Lu et.al. 2412.11186 link
2024-12-15 SAM-IF: Leveraging SAM for Incremental Few-Shot Instance Segmentation Xudong Zhou et.al. 2412.11034 null
2024-12-13 TSGaussian: Semantic and Depth-Guided Target-Specific Gaussian Splatting from Sparse Views Liang Zhao et.al. 2412.10051 link
2024-12-11 SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation Tapas Kumar Dutta et.al. 2412.08482 link
2024-12-11 Lightweight Method for Interactive 3D Medical Image Segmentation with Multi-Round Result Fusion Bingzhi Shen et.al. 2412.08315 null
2024-12-13 Crack-EdgeSAM Self-Prompting Crack Segmentation System for Edge Devices Yingchu Wang et.al. 2412.07205 null
2024-12-17 Continual Learning for Segment Anything Model Adaptation Jinglong Yang et.al. 2412.06418 link
2024-12-18 Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and Annotation Framework Jiuyi Xu et.al. 2412.06268 null
2024-12-08 MCP-MedSAM: A Powerful Lightweight Medical Segment Anything Model Trained with a Single GPU in Just One Day Donghang Lyu et.al. 2412.05888 link
2024-12-07 RefSAM3D: Adapting SAM with Cross-modal Reference for 3D Medical Image Segmentation Xiang Gao et.al. 2412.05605 null
2024-12-06 SAMCL: Empowering SAM to Continually Learn from Dynamic Domains Zeqing Wang et.al. 2412.05012 null
2024-12-06 HOLa: HoloLens Object Labeling Michael Schwimmbeck et.al. 2412.04945 link
2024-12-05 Quantifying the Limits of Segment Anything Model: Analyzing Challenges in Segmenting Tree-Like and Low-Contrast Structures Yixin Zhang et.al. 2412.04243 link
2024-12-05 Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts Chenyang Zhu et.al. 2412.04220 null
2024-12-04 Automated galaxy sizes in Euclid images using the Segment Anything Model J. Vega-Ferrero et.al. 2412.03642 link
2024-12-04 Measure Anything: Real-time, Multi-stage Vision-based Dimensional Measurement using Segment Anything Yongkyu Lee et.al. 2412.03472 link
2024-12-04 MRNet: Multifaceted Resilient Networks for Medical Image-to-Image Translation Hyojeong Lee et.al. 2412.03039 null
2024-12-02 CellSeg1: Robust Cell Segmentation with One Training Image Peilin Zhou et.al. 2412.01410 link
2024-12-02 A Bottom-Up Approach to Optimizing the Solar Organic Rankine Cycle for Transactive Energy Trading Silvia Anna Cordieri et.al. 2412.01359 null
2024-12-02 Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dependent Concepts under Different Scenes Xiaoqi Zhao et.al. 2412.01240 null
2024-12-02 Referring Video Object Segmentation via Language-aligned Track Selection Seongchan Kim et.al. 2412.01136 link
2024-11-27 In Search of Truth: In memory of Balraj Singh José Nicolás Orce et.al. 2412.00097 null
2024-11-28 SADG: Segment Any Dynamic Gaussian Without Object Trackers Yun-Jin Li et.al. 2411.19290 link
2024-12-02 Det-SAM2:Technical Report on the Self-Prompting Segmentation Framework Based on Segment Anything Model 2 Zhiting Wang et.al. 2411.18977 link
2024-11-28 Efficient Track Anything Yunyang Xiong et.al. 2411.18933 null
2024-11-28 COMPrompter: reconceptualized segment anything model with multiprompt network for camouflaged object detection Xiaoqin Zhang et.al. 2411.18858 link
2024-11-27 SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality Chenyang Lei et.al. 2411.18669 link
2024-11-26 “Nuclear thermometers” reveal the origin of the universal r-process nucleosynthesis José Nicolás Orce et.al. 2411.17852 null
2024-11-26 SAM-MPA: Applying SAM to Few-shot Medical Image Segmentation using Mask Propagation and Auto-prompting Jie Xu et.al. 2411.17363 null
2024-11-26 MeerKAT discovery of a MIGHTEE Odd Radio Circle Ray P. Norris et.al. 2411.17311 null
2024-11-29 Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning Hui-Yue Yang et.al. 2411.17217 null
2024-11-25 UltraSam: A Foundation Model for Ultrasound using Large Open-Access Segmentation Datasets Adrien Meyer et.al. 2411.16222 link
2024-11-25 Weakly supervised image segmentation for defect-based grading of fresh produce Manuel Knott et.al. 2411.16219 link
2024-11-25 Med-PerSAM: One-Shot Visual Prompt Tuning for Personalized Segment Anything Model in Medical Domain Hangyul Yoon et.al. 2411.16123 link
2024-11-22 There is no SAMantics! Exploring SAM as a Backbone for Visual Understanding Tasks Miguel Espinosa et.al. 2411.15288 link
2024-11-22 Effective SAM Combination for Open-Vocabulary Semantic Segmentation Minhyeok Lee et.al. 2411.14723 null
2024-11-21 Data Formats in Analytical DBMSs: Performance Trade-offs and Future Directions Chunwei Liu et.al. 2411.14331 null
2024-11-21 Segment Anything in Light Fields for Real-Time Applications via Constrained Prompting Nikolai Goncharov et.al. 2411.13840 link
2024-11-21 Segment Any Class (SAC): Multi-Class Few-Shot Semantic Segmentation via Class Region Proposals Hussni Mohd Zakir et.al. 2411.13774 null
2024-11-24 ClickTrack: Towards Real-time Interactive Single Object Tracking Kuiran Wang et.al. 2411.13183 null
2024-11-13 SAM-I2I: Unleash the Power of Segment Anything Model for Medical Image Translation Jiayu Huo et.al. 2411.12755 null
2024-11-19 SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation Ron Keuth et.al. 2411.12602 link
2024-11-30 SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory Cheng-Yen Yang et.al. 2411.11922 link
2024-11-18 Zero-Shot Automatic Annotation and Instance Segmentation using LLM-Generated Datasets: Eliminating Field Imaging and Manual Annotation for Deep Learning Model Development Ranjan Sapkota et.al. 2411.11285 null
2024-11-15 Large quadrupole deformation in $^{20}$Ne challenges rotor model and modern theory: urging for $α$ clusters in nuclei C. V. Mehl et.al. 2411.10598 null
2024-11-15 SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning Zewen Chen et.al. 2411.10161 link
2024-11-15 CoSAM: Self-Correcting SAM for Domain Generalization in 2D Medical Image Segmentation Yihang Fu et.al. 2411.10136 null
2024-11-15 CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation Dengke Zhang et.al. 2411.10086 link
2024-11-14 Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation Yuheng Shi et.al. 2411.09219 link
2024-11-13 Zero-shot capability of SAM-family models for bone segmentation in CT scans Caroline Magg et.al. 2411.08629 null
2024-11-13 Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model Jun Xie et.al. 2411.08592 null
2024-11-13 Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model Yutao Shen et.al. 2411.08453 null
2024-11-12 Triaxial nuclear shapes from simple ratios of electric-quadrupole matrix elements Elena Atanassova Lawrie et.al. 2411.08130 null
2024-11-12 INTRABENCH: Interactive Radiological Benchmark Constantin Ulrich et.al. 2411.07885 null
2024-11-14 MSEG-VCUQ: Multimodal SEGmentation with Enhanced Vision Foundation Models, Convolutional Neural Networks, and Uncertainty Quantification for High-Speed Video Phase Detection Data Chika Maduabuchi et.al. 2411.07463 link
2024-11-11 MapSAM: Adapting Segment Anything Model for Automated Feature Detection in Historical Maps Xue Xia et.al. 2411.06971 link
2024-11-10 Superpixel Segmentation: A Long-Lasting Ill-Posed Problem Rémi Giraud et.al. 2411.06478 null
2024-11-08 Assessing Foundational Medical ‘Segment Anything’ (Med-SAM1, Med-SAM2) Deep Learning Models for Left Atrial Segmentation in 3D LGE MRI Mehri Mehrnia et.al. 2411.05963 null
2024-11-18 Joint-Optimized Unsupervised Adversarial Domain Adaptation in Remote Sensing Segmentation with Prompted Foundation Model Shuchang Lyu et.al. 2411.05878 link
2024-11-07 UEVAVD: A Dataset for Developing UAV’s Eye View Active Object Detection Xinhua Jiang et.al. 2411.04348 null
2024-11-06 SA3DIP: Segment Any 3D Instance with Potential 3D Priors Xi Yang et.al. 2411.03819 link
2024-11-05 Exploiting the Segment Anything Model (SAM) for Lung Segmentation in Chest X-ray Images Gabriel Bellon de Carvalho et.al. 2411.03064 null
2024-11-08 Region-Guided Attack on the Segment Anything Model (SAM) Xiaoliang Liu et.al. 2411.02974 null
2024-11-05 Foundation AI Model for Medical Image Segmentation Rina Bao et.al. 2411.02745 null
2024-11-04 UnSegMedGAT: Unsupervised Medical Image Segmentation using Graph Attention Networks Clustering A. Mudit Adityaja et.al. 2411.01966 link
2024-11-01 ZIM: Zero-Shot Image Matting for Anything Beomyoung Kim et.al. 2411.00626 link
2024-11-01 Generative AI-based Pipeline Architecture for Increasing Training Efficiency in Intelligent Weed Control Systems Sourav Modak et.al. 2411.00548 null
2024-10-29 Performance of the Segment Anything Model in Various RFI/Events Detection in Radio Astronomy Yanbin Yang et.al. 2410.22497 null
2024-10-30 Benchmarking Human and Automated Prompting in the Segment Anything Model Jorge Quesada et.al. 2410.22048 link
2024-10-29 SAM-Swin: SAM-Driven Dual-Swin Transformers with Adaptive Lesion Enhancement for Laryngo-Pharyngeal Tumor Detection Jia Wei et.al. 2410.21813 link
2024-11-03 VideoSAM: A Large Vision Foundation Model for High-Speed Video Segmentation Chika Maduabuchi et.al. 2410.21304 link
2024-10-29 Transferable Adversarial Attacks on SAM and Its Downstream Models Song Xia et.al. 2410.20197 link
2024-10-11 A SAM based Tool for Semi-Automatic Food Annotation Lubnaa Abdur Rahman et.al. 2410.19756 null
2024-10-24 Segmentation-aware Prior Assisted Joint Global Information Aggregated 3D Building Reconstruction Hongxin Peng et.al. 2410.18433 null
2024-10-23 Gaze-Assisted Medical Image Segmentation Leila Khaertdinova et.al. 2410.17920 link
2024-10-22 Subshell gaps and onsets of collectivity from proton and neutron pairing gap correlations José Nicolás Orce et.al. 2410.17436 null
2024-10-22 Multi Kernel Estimation based Object Segmentation Haim Goldfisher et.al. 2410.17064 link
2024-10-21 PlaneSAM: Multimodal Plane Instance Segmentation Using the Segment Anything Model Zhongchen Deng et.al. 2410.16545 null
2024-10-21 SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree Shuangrui Ding et.al. 2410.16268 link
2024-10-17 SAMReg: SAM-enabled Image Registration with ROI-based Correspondence Shiqi Huang et.al. 2410.14083 link
2024-10-22 EP-SAM: Weakly Supervised Histopathology Segmentation via Enhanced Prompt with Segment Anything Joonhyeon Song et.al. 2410.13621 link
2024-10-16 Adaptive Prompt Learning with SAM for Few-shot Scanning Probe Microscope Image Segmentation Yao Shen et.al. 2410.12562 null
2024-10-15 MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation Xianping Ma et.al. 2410.11160 link
2024-10-13 UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation Ye Sun et.al. 2410.09909 null
2024-10-13 AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model Yuchen Li et.al. 2410.09714 null
2024-10-12 Distribution-aware Noisy-label Crack Segmentation Xiaoyan Jiang et.al. 2410.09409 link
2024-10-11 VideoSAM: Open-World Video Segmentation Pinxue Guo et.al. 2410.08781 null
2024-10-11 Bridge the Points: Graph-based Few-shot Segment Anything Semantically Anqi Zhang et.al. 2410.06964 link
2024-10-08 Prompting DirectSAM for Semantic Contour Extraction in Remote Sensing Images Shiyu Miao et.al. 2410.06194 link
2024-10-08 Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts Zhiwei Lin et.al. 2410.05963 null
2024-10-18 On Efficient Variants of Segment Anything Model: A Survey Xiaorui Sun et.al. 2410.04960 null
2024-10-07 Next Best Sense: Guiding Vision and Touch with FisherRF for 3D Gaussian Splatting Matthew Strong et.al. 2410.04680 link
2024-10-05 DB-SAM: Delving into High Quality Universal Medical Image Segmentation Chao Qin et.al. 2410.04172 link
2024-10-03 Adapting Segment Anything Model to Melanoma Segmentation in Microscopy Slide Images Qingyuan Liu et.al. 2410.02207 null
2024-10-02 SinkSAM: A Monocular Depth-Guided SAM Framework for Automatic Sinkhole Segmentation Osher Rafaeli et.al. 2410.01473 link
2024-10-02 Recovering Manifold Structure Using Ollivier-Ricci Curvature Tristan Luca Saidi et.al. 2410.01149 link
2024-09-30 Automating MedSAM by Learning Prompts with Weak Few-Shot Supervision Mélanie Gaillochet et.al. 2409.20293 link
2024-09-30 Medical Image Segmentation with SAM-generated Annotations Iira Häkkinen et.al. 2409.20253 null
2024-09-29 One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos Zechen Bai et.al. 2409.19603 link
2024-09-29 RoboNurse-VLA: Robotic Scrub Nurse System based on Vision-Language-Action Model Shunlei Li et.al. 2409.19590 null
2024-10-10 MedCLIP-SAMv2: Towards Universal Text-Driven Medical Image Segmentation Taha Koleilat et.al. 2409.19483 link
2024-09-27 When SAM2 Meets Video Camouflaged Object Segmentation: A Comprehensive Evaluation and Adaptation Yuli Zhou et.al. 2409.18653 link
2024-09-26 AI-Powered Augmented Reality for Satellite Assembly, Integration and Test Alvaro Patricio et.al. 2409.18101 null
2024-09-26 DarkSAM: Fooling Segment Anything Model to Segment Nothing Ziqi Zhou et.al. 2409.17874 link
2024-09-26 Global-Local Medical SAM Adaptor Based on Full Adaption Meng Wang et.al. 2409.17486 null
2024-09-25 Going Beyond U-Net: Assessing Vision Transformers for Semantic Segmentation in Microscopy Image Analysis Illia Tsiporenko et.al. 2409.16940 null
2024-09-25 Towards Underwater Camouflaged Object Tracking: An Experimental Evaluation of SAM and SAM 2 Chunhui Zhang et.al. 2409.16902 link
2024-09-24 Articulated Object Manipulation using Online Axis Estimation with SAM2-Based Tracking Xi Wang et.al. 2409.16287 null
2024-09-24 Open-World Object Detection with Instance Representation Learning Sunoh Lee et.al. 2409.16073 null
2024-09-23 Adapting Segment Anything Model for Unseen Object Instance Segmentation Rui Cao et.al. 2409.15481 null
2024-09-24 Towards Ground-truth-free Evaluation of Any Segmentation in Medical Images Ahjol Senbi et.al. 2409.14874 link
2024-09-23 SAMEdge: An Edge-cloud Video Analytics Architecture for the Segment Anything Model Rui Lu et.al. 2409.14784 null
2024-09-23 An Adverse Weather-Immune Scheme with Unfolded Regularization and Foundation Model Knowledge Distillation for Street Scene Understanding Wei-Bin Kou et.al. 2409.14737 null
2024-09-23 Video-to-Audio Generation with Fine-grained Temporal Semantics Yuchen Hu et.al. 2409.14709 null
2024-09-21 Foundation Models for Amodal Video Instance Segmentation in Automated Driving Jasmin Breitenstein et.al. 2409.14095 link
2024-09-20 Deep learning for fast segmentation and critical dimension metrology & characterization enabling AR/VR design and fabrication Kundan Chaudhary et.al. 2409.13951 null
2024-09-20 PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images Nanqing Liu et.al. 2409.13401 link
2024-09-20 MCICSAM: Monte Carlo-guided Interpolation Consistency Segment Anything Model for Semi-Supervised Prostate Zone Segmentation Guantian Huang et.al. 2409.13371 null
2024-09-19 Prompting Segment Anything Model with Domain-Adaptive Prototype for Generalizable Medical Image Segmentation Zhikai Wei et.al. 2409.12522 link
2024-09-23 GraspSAM: When Segment Anything Model Meets Grasp Detection Sangjun Noh et.al. 2409.12521 null
2024-09-19 Frequency-Guided Spatial Adaptation for Camouflaged Object Detection Shizhou Zhang et.al. 2409.12421 null
2024-09-14 Target Speaker ASR with Whisper Alexander Polok et.al. 2409.09543 link
2024-09-14 An Augmentation-based Model Re-adaptation Framework for Robust Image Segmentation Zheming Zuo et.al. 2409.09530 null
2024-09-14 Enhancing Skin Disease Diagnosis: Interpretable Visual Concept Discovery with SAM Empowerment Xin Hu et.al. 2409.09520 null
2024-09-14 Self-Prompting Polyp Segmentation in Colonoscopy using Hybrid Yolo-SAM 2 Model Mobina Mansoori et.al. 2409.09484 null
2024-09-14 SAM-OCTA2: Layer Sequence OCTA Segmentation with Fine-tuned Segment Anything Model 2 Xinrun Chen et.al. 2409.09286 link
2024-09-13 Tri-Plane Mamba: Efficiently Adapting Segment Anything Model for 3D Medical Images Hualiang Wang et.al. 2409.08492 null
2024-09-12 SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality Chenyang Lei et.al. 2409.08083 link
2024-09-11 Swin-LiteMedSAM: A Lightweight Box-Based Segment Anything Model for Large-Scale Medical Image Datasets Ruochen Gao et.al. 2409.07172 link
2024-09-10 Sam2Rad: A Segmentation Model for Medical Images with Learnable Prompts Assefa Seyoum Wahd et.al. 2409.06821 link
2024-09-11 Segmenting sea ice floes in close-range optical imagery with active contour and foundation models Giulio Passerotti et.al. 2409.06641 null
2024-09-10 Towards Generalizable Scene Change Detection Jaewoo Kim et.al. 2409.06214 link
2024-09-09 AnomalyCD: A benchmark for Earth anomaly change detection with high-resolution and time-series observations Jingtao Li et.al. 2409.05679 null
2024-09-09 TAVP: Task-Adaptive Visual Prompt for Cross-domain Few-shot Segmentation Jiaqi Yang et.al. 2409.05393 null
2024-09-07 SSFam: Scribble Supervised Salient Object Detection Family Zhengyi Liu et.al. 2409.04817 link
2024-09-07 Unleashing the Power of Generic Segmentation Models: A Simple Baseline for Infrared Small Target Detection Mingjin Zhang et.al. 2409.04714 link
2024-09-06 FS-MedSAM2: Exploring the Potential of SAM2 for Few-Shot Medical Image Segmentation without Fine-tuning Yunhao Bai et.al. 2409.04298 link
2024-09-06 Reprojection Errors as Prompts for Efficient Scene Coordinate Regression Ting-Ru Liu et.al. 2409.04178 null
2024-09-04 Evaluation Study on SAM 2 for Class-agnostic Instance-level Segmentation Tiantian Zhang et.al. 2409.02567 link
2024-09-03 When 3D Partial Points Meets SAM: Tooth Point Cloud Segmentation with Sparse Labels Yifan Liu et.al. 2409.01691 null
2024-09-02 MedSAM-U: Uncertainty-Guided Auto Multi-Prompt Adaptation for Reliable MedSAM Nan Zhou et.al. 2409.00924 null
2024-08-29 SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners Ziyu Guo et.al. 2408.16768 link
2024-08-27 SAM & SAM 2 in 3D Slicer: SegmentWithSAM Extension for Annotating Medical Images Zafer Yildiz et.al. 2408.15224 link
2024-09-02 Adapting Segment Anything Model to Multi-modal Salient Object Detection with Semantic Feature Fusion Guidance Kunpeng Wang et.al. 2408.15063 link
2024-08-27 Intraoperative Glioma Segmentation with YOLO + SAM for Improved Accuracy in Tumor Resection Samir Kassam et.al. 2408.14847 null
2024-08-26 FusionSAM: Latent Space driven Segment Anything Model for Multimodal Fusion and Segmentation Daixun Li et.al. 2408.13980 null
2024-08-23 Unleashing the Potential of SAM2 for Biomedical Images and Videos: A Survey Yichi Zhang et.al. 2408.12889 link
2024-08-23 S3Simulator: A benchmarking Side Scan Sonar Simulator dataset for Underwater Image Analysis Kamal Basha S et.al. 2408.12833 link
2024-08-23 VALE: A Multimodal Visual and Language Explanation Framework for Image Classifiers using eXplainable AI and Language Models Purushothaman Natarajan et.al. 2408.12808 link
2024-08-22 Segment Anything Model for Grain Characterization in Hard Drive Design Kai Nichols et.al. 2408.12732 null
2024-08-22 The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation Tuyen Tran et.al. 2408.12447 null
2024-08-22 Generalized SAM: Efficient Fine-Tuning of SAM for Variable Input Image Sizes Sota Kato et.al. 2408.12406 link
2024-08-22 SAM-SP: Self-Prompting Makes SAM Great Again Chunpeng Zhou et.al. 2408.12364 null
2024-08-21 EmbodiedSAM: Online Segment Any 3D Thing in Real Time Xiuwei Xu et.al. 2408.11811 null
2024-08-25 NuSegDG: Integration of Heterogeneous Space and Gaussian Kernel for Domain-Generalized Nuclei Segmentation Zhenye Lou et.al. 2408.11787 link
2024-08-22 SAM-REF: Rethinking Image-Prompt Synergy for Refinement in Segment Anything Chongkai Yu et.al. 2408.11535 null
2024-08-20 SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection Huafeng Chen et.al. 2408.10760 null
2024-08-24 Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track Feiyu Pan et.al. 2408.10125 null
2024-08-19 LCE: A Framework for Explainability of DNNs for Ultrasound Image Based on Concept Discovery Weiji Kong et.al. 2408.09899 null
2024-08-19 SAM-UNet:Enhancing Zero-Shot Segmentation of SAM for Universal Medical Images Sihan Yang et.al. 2408.09886 link
2024-08-19 Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving Jun Yan et.al. 2408.09839 link
2024-08-17 GoodSAM++: Bridging Domain and Capacity Gaps via Segment Anything Model for Panoramic Semantic Segmentation Weiming Zhang et.al. 2408.09115 null
2024-08-17 Segment Anything with Multiple Modalities Aoran Xiao et.al. 2408.09085 link
2024-08-16 SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation Xinyu Xiong et.al. 2408.08870 link
2024-08-16 Retrieval-augmented Few-shot Medical Image Segmentation with Foundation Models Lin Zhao et.al. 2408.08813 null
2024-08-16 Extracting polygonal footprints in off-nadir images with Segment Anything Model Kai Li et.al. 2408.08645 link
2024-08-16 Tuning a SAM-Based Model with Multi-Cognitive Visual Adapter to Remote Sensing Instance Segmentation Linghao Zheng et.al. 2408.08576 null
2024-08-15 Surgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame Pruning Haofeng Liu et.al. 2408.07931 link
2024-08-14 MeerKAT reveals a ghostly thermal radio ring towards the Galactic Centre C. Bordiu et.al. 2408.07727 null
2024-08-14 Domain-invariant Representation Learning via Segment Anything Model for Blood Cell Classification Yongcheng Li et.al. 2408.07467 link
2024-08-15 Prompt-Based Segmentation at Multiple Resolutions and Lighting Conditions using Segment Anything Model 2 Osher Rafaeli et.al. 2408.06970 null
2024-08-13 Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Segment Anything Model Yongcheng Li et.al. 2408.06716 link
2024-08-13 Specialized Change Detection using Segment Anything Tahir Ahmad et.al. 2408.06644 null
2024-08-12 S-SAM: SVD-based Fine-Tuning of Segment Anything Model for Medical Image Segmentation Jay N. Paranjape et.al. 2408.06447 link
2024-08-12 From SAM to SAM 2: Exploring Improvements in Meta’s Segment Anything Model Athulya Sundaresan Geetha et.al. 2408.06305 null
2024-08-12 Zero-shot 3D Segmentation of Abdominal Organs in CT Scans Using Segment Anything Model 2: Adapting Video Tracking Capabilities for 3D Medical Imaging Yosuke Yamagishi et.al. 2408.06170 null
2024-08-12 Multi-scale Contrastive Adaptor Learning for Segmenting Anything in Underperformed Scenes Ke Zhou et.al. 2408.05936 null
2024-08-12 Polyp SAM 2: Advancing Zero shot Polyp Segmentation in Colorectal Cancer Detection Mobina Mansoori et.al. 2408.05892 link
2024-08-15 SAM-FNet: SAM-Guided Fusion Network for Laryngo-Pharyngeal Tumor Detection Jia Wei et.al. 2408.05426 link
2024-08-09 One Shot is Enough for Sequential Infrared Small Target Segmentation Bingbing Dan et.al. 2408.04823 link
2024-08-08 Novel adaptation of video segmentation to 3D MRI: efficient zero-shot knee segmentation with SAM2 Andrew Seohwan Yu et.al. 2408.04762 null
2024-08-08 SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation Jieming Yu et.al. 2408.04593 null
2024-08-08 Multi-Scale and Detail-Enhanced Segment Anything Model for Salient Object Detection Shixuan Gao et.al. 2408.04326 link
2024-08-12 Is SAM 2 Better than SAM in Medical Image Segmentation? Sourya Sengupta et.al. 2408.04212 null
2024-08-07 PaveCap: The First Multimodal Framework for Comprehensive Pavement Condition Assessment with Dense Captioning and PCI Estimation Blessing Agyei Kyem et.al. 2408.04110 link
2024-08-16 Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation Yiqing Shen et.al. 2408.04098 null
2024-08-07 SAM2-PATH: A better segment anything model for semantic segmentation in digital pathology Mingya Zhang et.al. 2408.03651 link
2024-08-06 Segment Anything in Medical Images and Videos: Benchmark and Deployment Jun Ma et.al. 2408.03322 link
2024-08-06 Biomedical SAM 2: Segment Anything in Biomedical Images and Videos Zhiling Yan et.al. 2408.03286 link
2024-08-06 Evaluation of Segment Anything Model 2: The Role of SAM2 in the Underwater Environment Shijie Lian et.al. 2408.02924 link
2024-08-05 Interactive 3D Medical Image Segmentation with SAM 2 Chuyun Shen et.al. 2408.02635 link
2024-08-04 PromptSAM+: Malware Detection based on Prompt Segment Anything Model Xingyuan Wei et.al. 2408.02066 null
2024-08-04 PanicleNeRF: low-cost, high-precision in-field phenotypingof rice panicles with smartphone Xin Yang et.al. 2408.02053 null
2024-08-03 TS-SAM: Fine-Tuning Segment-Anything Model for Downstream Tasks Yang Yu et.al. 2408.01835 link
2024-08-03 Zero-Shot Surgical Tool Segmentation in Monocular Video Using Segment Anything Model 2 Ange Lou et.al. 2408.01648 link
2024-08-01 Medical SAM 2: Segment medical images as video via Segment Anything Model 2 Jiayuan Zhu et.al. 2408.00874 link
2024-08-06 Segment anything model 2: an application to 2D and 3D medical images Haoyu Dong et.al. 2408.00756 link
2024-08-01 SAM 2: Segment Anything in Images and Videos Nikhila Ravi et.al. 2408.00714 link
2024-08-01 Point-supervised Brain Tumor Segmentation with Box-prompted MedSAM Xiaofeng Liu et.al. 2408.00706 null
2024-08-01 DMESA: Densely Matching Everything by Segmenting Anything Yesheng Zhang et.al. 2408.00279 link
2024-07-31 CC-SAM: SAM with Cross-feature Attention and Context for Ultrasound Image Segmentation Shreyank N Gowda et.al. 2408.00181 null
2024-07-31 A Federated Learning-Friendly Approach for Parameter-Efficient Fine-Tuning of SAM in 3D Segmentation Mothilal Asokan et.al. 2407.21739 null
2024-07-31 Evaluating SAM2’s Role in Camouflaged Object Detection: From SAM to SAM2 Lv Tang et.al. 2407.21596 null
2024-07-31 Robust Box Prompt based SAM for Medical Image Segmentation Yuhao Huang et.al. 2407.21284 null
2024-07-31 Weakly Supervised Intracranial Hemorrhage Segmentation with YOLO and an Uncertainty Rectified Segment Anything Model Pascal Spiegler et.al. 2407.20461 null
2024-07-28 ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding Zhen Chen et.al. 2407.19435 link
2024-07-25 SSTD: Stripe-Like Space Target Detection using Single-Point Supervision Zijian Zhu et.al. 2407.18097 null
2024-07-25 Segmentation by registration-enabled SAM prompt engineering using five reference images Yaxi Chen et.al. 2407.17933 link
2024-07-25 SAM-MIL: A Spatial Contextual Aware Multiple Instance Learning Approach for Whole Slide Image Classification Heng Fang et.al. 2407.17689 link
2024-07-23 SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation Pengfei Chen et.al. 2407.16682 null
2024-07-23 Integrating Meshes and 3D Gaussians for Indoor Scene Reconstruction with SAM Mask Guidance Jiyeop Kim et.al. 2407.16173 null
2024-07-23 SAM2CLIP2SAM: Vision Language Model for Segmentation of 3D CT Scans for Covid-19 Detection Dimitrios Kollias et.al. 2407.15728 null
2024-07-21 MedSAGa: Few-shot Memory Efficient Medical Image Segmentation using Gradient Low-Rank Projection in SAM Navyansh Mahla et.al. 2407.15042 null
2024-07-19 ESP-MedSAM: Efficient Self-Prompting SAM for Universal Domain-Generalized Medical Image Segmentation Qing Xu et.al. 2407.14153 link
2024-07-19 Seismic Fault SAM: Adapting SAM with Lightweight Modules and 2.5D Strategy for Fault Detection Ran Chen et.al. 2407.14121 null
2024-07-25 MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis Ziming Zhong et.al. 2407.13675 link
2024-07-18 Hybrid Deep Learning-Based for Enhanced Occlusion Segmentation in PICU Patient Monitoring Mario Francisco Munoz et.al. 2407.13341 null
2024-07-17 OMG-Net: A Deep Learning Framework Deploying Segment Anything to Detect Pan-Cancer Mitotic Figures from Haematoxylin and Eosin-Stained Slides Zhuoyan Shen et.al. 2407.12773 null
2024-07-17 FastSAM-3DSlicer: A 3D-Slicer Extension for 3D Volumetric Segment Anything Model with Uncertainty Quantification Yiqing Shen et.al. 2407.12658 link
2024-07-17 Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection Zhenni Yu et.al. 2407.12339 link
2024-07-19 Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes Zhi Cai et.al. 2407.11464 link
2024-07-17 Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts Jianhao Li et.al. 2407.11382 null
2024-07-16 Leveraging Segment Anything Model in Identifying Buildings within Refugee Camps (SAM4Refugee) from Satellite Imagery for Humanitarian Operations Yunya Gao et.al. 2407.11381 link
2024-07-14 WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models Xinjian Wu et.al. 2407.10131 link
2024-07-12 Region Attention Transformer for Medical Image Restoration Zhiwen Yang et.al. 2407.09268 link
2024-07-11 Knowledge distillation to effectively attain both region-of-interest and global semantics from an image where multiple objects appear Seonwhee Jin et.al. 2407.08257 link
2024-07-11 Enrich the content of the image Using Context-Aware Copy Paste Qiushi Guo et.al. 2407.08151 null
2024-07-10 Interactive Segmentation Model for Placenta Segmentation from 3D Ultrasound images Hao Li et.al. 2407.08020 link
2024-07-10 IRSAM: Advancing Segment Anything Model for Infrared Small Target Detection Mingjin Zhang et.al. 2407.07520 link
2024-07-18 ProtoSAM: One-Shot Medical Image Segmentation With Foundational Models Lev Ayzenberg et.al. 2407.07042 link
2024-07-09 CycleSAM: One-Shot Surgical Scene Segmentation using Cycle-Consistent Feature Matching to Prompt SAM Aditya Murali et.al. 2407.06795 null
2024-07-08 Unsupervised Fault Detection using SAM with a Moving Window Approach Ahmed Maged et.al. 2407.06303 null
2024-07-08 MBA-Net: SAM-driven Bidirectional Aggregation Network for Ovarian Tumor Segmentation Yifan Gao et.al. 2407.05984 null
2024-07-07 Addressing single object tracking in satellite imagery through prompt-engineered solutions Athena Psalta et.al. 2407.05518 null
2024-07-07 Cross Prompting Consistency with Segment Anything Model for Semi-supervised Medical Image Segmentation Juzheng Miao et.al. 2407.05416 link
2024-07-06 SAM-Med3D-MoE: Towards a Non-Forgetting Segment Anything Model via Mixture of Experts for 3D Medical Image Segmentation Guoan Wang et.al. 2407.04938 null
2024-07-06 Revolutionizing Alloy Microstructure Segmentation through SAM and Domain Knowledge without Extra Training Xudong Ma et.al. 2407.04922 null
2024-07-05 Graph Pooling via Ricci Flow Amy Feng et.al. 2407.04236 null
2024-07-09 CS3: Cascade SAM for Sperm Segmentation Yi Shi et.al. 2407.03772 link
2024-07-02 Lung-CADex: Fully automatic Zero-Shot Detection and Classification of Lung Nodules in Thoracic CT Images Furqan Shaukat et.al. 2407.02625 null
2024-07-02 Virtually Objective Quantification of in vitro Wound Healing Scratch Assays with the Segment Anything Model Katja Löwenstein et.al. 2407.02187 null
2024-07-02 HRSAM: Efficiently Segment Anything in High-Resolution Images You Huang et.al. 2407.02109 link
2024-07-03 SAVE: Segment Audio-Visual Easy way using Segment Anything Model Khanh-Binh Nguyen et.al. 2407.02004 null
2024-07-01 Investigating the Segment Anything Foundation Model for Mapping Smallholder Agriculture Field Boundaries Without Training Labels Pratyush Tripathy et.al. 2407.01846 null
2024-07-01 Efficient Cutting Tool Wear Segmentation Based on Segment Anything Model Zongshuo Li et.al. 2407.01211 null
2024-06-30 ASPS: Augmented Segment Anything Model for Polyp Segmentation Huiqian Li et.al. 2407.00718 link
2024-06-30 HATs: Hierarchical Adaptive Taxonomy Segmentation for Panoramic Pathology Image Analysis Ruining Deng et.al. 2407.00596 link
2024-06-29 SolarSAM: Building-scale Photovoltaic Potential Assessment Based on Segment Anything Model (SAM) and Remote Sensing for Emerging City Guohao Wang et.al. 2407.00296 link
2024-06-28 Segment Anything without Supervision XuDong Wang et.al. 2406.20081 link
2024-07-03 EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model Yuxuan Zhang et.al. 2406.20076 link
2024-06-28 Parallax-tolerant Image Stitching via Segmentation-guided Multi-homography Warping Tianli Liao et.al. 2406.19922 link
2024-06-27 Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model Haobo Yuan et.al. 2406.19369 link
2024-06-30 Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO Fuseini Mumuni et.al. 2406.19057 null
2024-06-27 Structural Attention: Rethinking Transformer for Unpaired Medical Image Synthesis Vu Minh Hieu Phan et.al. 2406.18967 link
2024-06-07 Composition Vision-Language Understanding via Segment and Depth Anything Model Mingxiao Huo et.al. 2406.18591 link
2024-06-25 Point-SAM: Promptable 3D Segmentation Model for Point Clouds Yuchen Zhou et.al. 2406.17741 link
2024-06-22 TP-DRSeg: Improving Diabetic Retinopathy Lesion Segmentation with Explicit Text-Prompts Assisted SAM Wenxue Li et.al. 2406.15764 link
2024-06-21 TraceNet: Segment one thing efficiently Mingyuan Wu et.al. 2406.14874 null
2024-06-21 SAM-EG: Segment Anything Model with Egde Guidance framework for efficient Polyp Segmentation Quoc-Huy Trinh et.al. 2406.14819 null
2024-06-18 An Empirical Study on the Fairness of Foundation Models for Multi-Organ Image Segmentation Qin Li et.al. 2406.12646 null
2024-06-16 Boosting Medical Image Classification with Segmentation Foundation Model Pengfei Gu et.al. 2406.11026 null
2024-06-16 ALPS: An Auto-Labeling and Pre-training Scheme for Remote Sensing Segmentation With Segment Anything Model Song Zhang et.al. 2406.10855 link
2024-06-13 RobustSAM: Segment Anything Robustly on Degraded Images Wei-Ting Chen et.al. 2406.09627 link
2024-06-13 APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation Weizhao He et.al. 2406.08372 null
2024-06-11 Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation Jinyuan Li et.al. 2406.07268 link
2024-06-10 Extending Segment Anything Model into Auditory and Temporal Dimensions for Audio-Visual Segmentation Juhyeong Seon et.al. 2406.06163 link
2024-06-10 Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset Shijie Lian et.al. 2406.06039 link
2024-06-09 SAM-PM: Enhancing Video Camouflaged Object Detection using Spatio-Temporal Attention Muhammad Nawfal Meeran et.al. 2406.05802 link
2024-06-08 Training-Free Robust Interactive Video Object Segmentation Xiaoli Wei et.al. 2406.05485 null
2024-06-07 USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation Xiaoqi Wang et.al. 2406.05271 null
2024-06-06 Matching Anything by Segmenting Anything Siyuan Li et.al. 2406.04221 link
2024-06-03 Immunocto: a massive immune cell database auto-generated for histopathology Mikaël Simard et.al. 2406.02618 null
2024-06-04 FastLGS: Speeding up Language Embedded Gaussians with Feature Grid Mapping Yuzhou Ji et.al. 2406.01916 null
2024-06-03 SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation Danni Yang et.al. 2406.01451 link
2024-06-03 Improving Segment Anything on the Fly: Auxiliary Online Learning and Adaptive Fusion for Medical Image Segmentation Tianyu Huang et.al. 2406.00956 null
2024-06-02 SimSAM: Zero-shot Medical Image Segmentation via Simulated Interaction Benjamin Towle et.al. 2406.00663 link
2024-06-05 SAM-LAD: Segment Anything Model Meets Zero-Shot Logic Anomaly Detection Yun Peng et.al. 2406.00625 null
2024-06-12 Artificial General Intelligence (AGI) for the oil and gas industry: a review Jimmy Xuekai Li et.al. 2406.00594 null
2024-06-01 AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning Duojun Huang et.al. 2406.00480 link
2024-05-29 FocSAM: Delving Deeply into Focused Objects in Segmenting Anything You Huang et.al. 2405.18706 link
2024-05-28 Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation Yangxiao Lu et.al. 2405.17859 link
2024-05-27 Part123: Part-aware 3D Reconstruction from a Single-view Image Anran Liu et.al. 2405.16888 null
2024-05-27 PP-SAM: Perturbed Prompts for Robust Adaptation of Segment Anything Model for Polyp Segmentation Md Mostafijur Rahman et.al. 2405.16740 link
2024-05-24 Open-Vocabulary SAM3D: Understand Any 3D Scene Hanchen Tai et.al. 2405.15580 null
2024-05-22 Accelerated Evaluation of Ollivier-Ricci Curvature Lower Bounds: Bridging Theory and Computation Wonwoo Kang et.al. 2405.13302 null
2024-05-20 Improving the Explain-Any-Concept by Introducing Nonlinearity to the Trainable Surrogate Model Mounes Zaval et.al. 2405.11837 null
2024-05-20 Universal Organizer of SAM for Unsupervised Semantic Segmentation Tingting Li et.al. 2405.11742 link
2024-05-17 One registration is worth two segmentations Shiqi Huang et.al. 2405.10879 link
2024-05-12 Zero Shot Context-Based Object Segmentation using SLIP (SAM+CLIP) Saaketh Koundinya Gundavarapu et.al. 2405.07284 link
2024-05-10 SAM3D: Zero-Shot Semi-Automatic Segmentation in 3D Medical Images with the Segment Anything Model Trevor J. Chan et.al. 2405.06786 null
2024-05-10 Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach Elham Ravanbakhsh et.al. 2405.06586 null
2024-05-10 Automated Cell Structure Extraction for 3D Electron Microscopy by Deep Learning Jin Kousaka et.al. 2405.06303 null
2024-05-07 ELiTe: Efficient Image-to-LiDAR Knowledge Transfer for Semantic Segmentation Zhibo Zhang et.al. 2405.04121 null
2024-05-06 PTQ4SAM: Post-Training Quantization for Segment Anything Chengtao Lv et.al. 2405.03144 link
2024-05-04 UnSAMFlow: Unsupervised Optical Flow Guided by Segment Anything Model Shuai Yuan et.al. 2405.02608 link
2024-05-02 Active Learning Enabled Low-cost Cell Image Segmentation Using Bounding Box Annotation Yu Zhu et.al. 2405.01701 null
2024-05-01 Beyond Human Vision: The Role of Large Vision Language Models in Microscope Image Analysis Prateek Verma et.al. 2405.00876 null
2024-05-01 MoPEFT: A Mixture-of-PEFTs for the Segment Anything Model Rajat Sahay et.al. 2405.00293 null
2024-05-01 ASAM: Boosting Segment Anything Model with Adversarial Tuning Bo Li et.al. 2405.00256 link
2024-04-29 Innovative Integration of Visual Foundation Model with a Robotic Arm on a Mobile Platform Shimian Zhang et.al. 2404.18720 null
2024-04-25 Auto-Generating Weak Labels for Real & Synthetic Data to Improve Label-Scarce Medical Image Segmentation Tanvi Deshpande et.al. 2404.17033 link
2024-04-25 Dr-SAM: An End-to-End Framework for Vascular Segmentation, Diameter Estimation, and Anomaly Detection on Angiography Images Vazgen Zohranyan et.al. 2404.17029 link
2024-04-25 OMEGAS: Object Mesh Extraction from Large Scenes Guided by Gaussian Segmentation Lizhi Wang et.al. 2404.15891 link
2024-05-09 MAS-SAM: Segment Any Marine Animal with Aggregated Features Tianyu Yan et.al. 2404.15700 link
2024-04-23 Ultrasound SAM Adapter: Adapting SAM for Breast Lesion Segmentation in Ultrasound Images Zhengzheng Tu et.al. 2404.14837 link
2024-04-22 UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation Siru Zhong et.al. 2404.14241 null
2024-04-22 Surgical-DeSAM: Decoupling SAM for Instrument Segmentation in Robotic Surgery Yuyang Sheng et.al. 2404.14040 link
2024-04-22 PM-VIS: High-Performance Box-Supervised Video Instance Segmentation Zhangjing Yang et.al. 2404.13863 null
2024-04-20 Beyond Pixel-Wise Supervision for Medical Image Segmentation: From Traditional Models to Foundation Models Yuyan Shi et.al. 2404.13239 null
2024-04-19 ELEV-VISION-SAM: Integrated Vision Language and Foundation Model for Automated Estimation of Building Lowest Floor Elevation Yu-Hsuan Ho et.al. 2404.12606 null
2024-04-18 Moving Object Segmentation: All You Need Is SAM (and Flow) Junyu Xie et.al. 2404.12389 link
2024-04-18 SOHES: Self-supervised Open-world Hierarchical Entity Segmentation Shengcao Cao et.al. 2404.12386 null
2024-04-18 Performance Evaluation of Segment Anything Model with Variational Prompting for Application to Non-Visible Spectrum Imagery Yona Falinie A. Gaus et.al. 2404.12285 null
2024-04-17 When are Foundation Models Effective? Understanding the Suitability for Pixel-Level Classification Using Multispectral Imagery Yiqun Xie et.al. 2404.11797 null
2024-04-15 How to build the best medical image segmentation algorithm using foundation models: a comprehensive empirical study with Segment Anything Model Hanxue Gu et.al. 2404.09957 link
2024-04-15 The Physalis system: Discovery of ORC-like radio shells around a massive pair of interacting early-type galaxies with offset X-ray emission Bärbel S. Koribalski et.al. 2404.09522 null
2024-04-15 VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection Bonan Ding et.al. 2404.09431 null
2024-04-12 LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning Junchi Wang et.al. 2404.08767 link
2024-04-12 Pathological Primitive Segmentation Based on Visual Foundation Model with Zero-Shot Mask Generation Abu Bakor Hayat Arnob et.al. 2404.08584 link
2024-04-12 Adapting the Segment Anything Model During Usage in Novel Situations Robin Schön et.al. 2404.08421 null
2024-04-12 Practical Region-level Attack against Segment Anything Models Yifan Shen et.al. 2404.08255 link
2024-04-11 Streamlined Photoacoustic Image Processing with Foundation Models: A Training-Free Solution Handi Deng et.al. 2404.07833 null
2024-04-09 SAM-I-Am: Semantic Boosting for Zero-shot Atomic-Scale Electron Micrograph Segmentation Waqwoya Abebe et.al. 2404.06638 link
2024-04-09 Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero shot Medical Image Segmentation Sidra Aleem et.al. 2404.06362 link
2024-04-08 Rendering-Enhanced Automatic Image-to-Point Cloud Registration for Roadside Scenes Yu Sheng et.al. 2404.05164 null
2024-04-07 Fantastic Animals and Where to Find Them: Segment Any Marine Animal with Dual SAM Pingping Zhang et.al. 2404.04996 link
2024-04-05 Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models Sangwon Jang et.al. 2404.04243 null
2024-04-02 Red-Teaming Segment Anything Model Krzysztof Jankowski et.al. 2404.02067 link
2024-04-01 Detect2Interact: Localizing Object Key Field in Visual Question Answering (VQA) with LLMs Jialou Wang et.al. 2404.01151 null
2024-03-31 Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts Qin Liu et.al. 2404.00741 link
2024-03-31 Deep Instruction Tuning for Segment Anything Model Xiaorui Huang et.al. 2404.00650 link
2024-03-29 MedCLIP-SAM: Bridging Text and Image Towards Universal Medical Image Segmentation Taha Koleilat et.al. 2403.20253 link
2024-03-29 Mixed-precision Supernet Training from Vision Foundation Models using Low Rank Adapter Yuiko Sakuma et.al. 2403.20080 null
2024-03-30 Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction Xiaoyang Lyu et.al. 2403.19314 link
2024-03-27 Unleashing the Potential of SAM for Medical Adaptation via Hierarchical Decoding Zhiheng Cheng et.al. 2403.18271 link
2024-03-26 EgoLifter: Open-world 3D Segmentation for Egocentric Perception Qiao Gu et.al. 2403.18118 link
2024-03-26 Segment Any Medical Model Extended Yihao Liu et.al. 2403.18114 link
2024-03-25 GoodSAM: Bridging Domain and Capacity Gaps via Segment Anything Model for Distortion-aware Panoramic Semantic Segmentation Weiming Zhang et.al. 2403.16370 null
2024-04-02 Distilling Semantic Priors from SAM to Efficient Image Restoration Models Quan Zhang et.al. 2403.16368 null
2024-03-31 Segment Anything Model for Road Network Graph Extraction Congrui Hetang et.al. 2403.16051 link
2024-03-22 Anytime, Anywhere, Anyone: Investigating the Feasibility of Segment Anything Model for Crowd-Sourcing Medical Image Annotations Pranav Kulkarni et.al. 2403.15218 link
2024-03-22 Towards a Comprehensive, Efficient and Promptable Anatomic Structure Segmentation Model using 3D Whole-body CT Scans Heng Guo et.al. 2403.15063 link
2024-03-21 Empowering Segmentation Ability to Multi-modal Large Language Models Yuqi Yang et.al. 2403.14141 null
2024-03-21 MaskSAM: Towards Auto-prompt SAM with Mask Classification for Medical Image Segmentation Bin Xie et.al. 2403.14103 null
2024-03-20 SAMCT: Segment Any CT Allowing Labor-Free Task-Indicator Prompts Xian Lin et.al. 2403.13258 link
2024-03-19 Segment Anything for comprehensive analysis of grapevine cluster architecture and berry properties Efrain Torres-Lomas et.al. 2403.12935 null
2024-03-27 LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything Model Yuxin Cao et.al. 2403.11656 null
2024-03-18 CCC++: Optimized Color Classified Colorization with Segment Anything Model (SAM) Empowered Object Selective Color Harmonization Mrityunjoy Gain et.al. 2403.11494 null
2024-03-17 Concatenate, Fine-tuning, Re-training: A SAM-enabled Framework for Semi-supervised 3D Medical Image Segmentation Shumeng Li et.al. 2403.11229 link
2024-03-16 Task-Aware Low-Rank Adaptation of Segment Anything Model Xuehao Wang et.al. 2403.10971 null
2024-03-19 Uncertainty-Aware Adapter: Adapting Segment Anything Model (SAM) for Ambiguous Medical Image Segmentation Mingzhou Jiang et.al. 2403.10931 null
2024-03-16 Unsupervised Collaborative Metric Learning with Mixed-Scale Groups for General Object Retrieval Shichao Kan et.al. 2403.10798 link
2024-03-16 Segment Any Object Model (SAOM): Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation Mariia Khan et.al. 2403.10780 null
2024-03-15 Few-Shot Image Classification and Segmentation as Visual Question Answering Using Vision-Language Models Tian Meng et.al. 2403.10287 null
2024-03-15 Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning Meixuan Li et.al. 2403.10252 null
2024-03-15 Grasp Anything: Combining Teacher-Augmented Policy Gradient Learning with Instance Segmentation to Grasp Arbitrary Objects Malte Mosbach et.al. 2403.10187 null
2024-03-15 TransLandSeg: A Transfer Learning Approach for Landslide Semantic Segmentation Based on Vision Foundation Model Changhong Hou et.al. 2403.10127 null
2024-03-15 Group-Mix SAM: Lightweight Solution for Industrial Assembly Line Applications Wu Liang et.al. 2403.10053 null
2024-03-15 Cardiac Magnetic Resonance 2D+T Short- and Long-axis Segmentation via Spatio-temporal SAM Adaptation Zhennong Chen et.al. 2403.10009 null
2024-03-14 FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images Yiqing Shen et.al. 2403.09827 link
2024-03-14 The galaxy group merger origin of the Cloverleaf odd radio circle system E. Bulbul et.al. 2403.09808 null
2024-03-14 PosSAM: Panoptic Open-vocabulary Segment Anything Vibashan VS et.al. 2403.09620 link
2024-03-14 DF4LCZ: A SAM-Empowered Data Fusion Framework for Scene-Level Local Climate Zone Classification Qianqian Wu et.al. 2403.09367 link
2024-03-17 WSI-SAM: Multi-resolution Segment Anything Model (SAM) for histopathology whole-slide images Hong Liu et.al. 2403.09257 link
2024-03-14 Customizing Segmentation Foundation Model via Prompt Learning for Instance Segmentation Hyung-Il Kim et.al. 2403.09199 null
2024-03-18 SAM-Lightening: A Lightweight Segment Anything Model with Dilated Flash Attention to Achieve 30 times Acceleration Yanfei Song et.al. 2403.09195 null
2024-03-12 FluoroSAM: A Language-aligned Foundation Model for X-ray Image Segmentation Benjamin D. Killeen et.al. 2403.08059 link
2024-03-12 Real-time Surgical Instrument Segmentation in Video Using Point Tracking and Segment Anything Zijian Wu et.al. 2403.08003 link
2024-03-12 SAMDA: Leveraging SAM on Few-Shot Domain Adaptation for Electronic Microscopy Segmentation Yiran Wang et.al. 2403.07951 null
2024-03-09 Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation Hairong Shi et.al. 2403.05912 link
2024-03-09 Large Generative Model Assisted 3D Semantic Communication Feibo Jiang et.al. 2403.05783 null
2024-03-14 OmniCount: Multi-label Object Counting with Semantic-Geometric Priors Anindya Mondal et.al. 2403.05435 null
2024-03-08 Part-aware Personalized Segment Anything Model for Patient-Specific Segmentation Chenhui Zhao et.al. 2403.05433 link
2024-03-08 FedFMS: Exploring Federated Foundation Models for Medical Image Segmentation Yuxi Liu et.al. 2403.05408 link
2024-03-07 SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising Tao Zhou et.al. 2403.04194 link
2024-03-07 ProMISe: Promptable Medical Image Segmentation using SAM Jinfeng Wang et.al. 2403.04164 link
2024-03-06 Popeye: A Unified Visual-Language Model for Multi-Source Ship Detection from Remote Sensing Imagery Wei Zhang et.al. 2403.03790 null
2024-03-03 A Simple-but-effective Baseline for Training-free Class-Agnostic Counting Yuhao Lin et.al. 2403.01418 null
2024-02-29 RSAM-Seg: A SAM-based Approach with Prior Knowledge Integration for Remote Sensing Image Semantic Segmentation Jie Zhang et.al. 2402.19004 null
2024-02-28 From Generalization to Precision: Exploring SAM for Tool Segmentation in Surgical Environments Kanyifeechukwu J. Oguine et.al. 2402.17972 null
2024-02-27 VRP-SAM: SAM with Visual Reference Prompt Yanpeng Sun et.al. 2402.17726 link
2024-02-27 Robust Unsupervised Crowd Counting and Localization with Adaptive Resolution SAM Jia Wan et.al. 2402.17514 null
2024-02-27 Segment anything model for head and neck tumor segmentation with CT, PET and MRI multi-modality images Jintao Ren et.al. 2402.17454 link
2024-02-27 SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution Chengcheng Wang et.al. 2402.17133 link
2024-02-26 UN-SAM: Universal Prompt-Free Segmentation for Generalized Nuclei Images Zhen Chen et.al. 2402.16663 link
2024-03-11 BLO-SAM: Bi-level Optimization Based Overfitting-Preventing Finetuning of SAM Li Zhang et.al. 2402.16338 link
2024-02-24 Increasing SAM Zero-Shot Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation Zekun Jiang et.al. 2402.15759 link
2024-02-22 WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition Lianghui Zhu et.al. 2402.14812 link
2024-02-22 Subobject-level Image Tokenization Delong Chen et.al. 2402.14327 link
2024-02-20 Object-level Geometric Structure Preserving for Natural Image Stitching Wenxiao Cai et.al. 2402.12677 link
2024-02-27 ISCUTE: Instance Segmentation of Cables Using Text Embedding Shir Kozlovsky et.al. 2402.11996 null
2024-02-18 A Multispectral Automated Transfer Technique (MATT) for machine-driven image labeling utilizing the Segment Anything Model (SAM) James E. Gallagher et.al. 2402.11413 null
2024-02-16 Dynamic Patch-aware Enrichment Transformer for Occluded Person Re-Identification Xin Zhang et.al. 2402.10435 null
2024-02-15 LaserSAM: Zero-Shot Change Detection Using Visual Segmentation of Spinning LiDAR Alexander Krawciw et.al. 2402.10321 null
2024-02-15 Lester: rotoscope animation through video object segmentation and tracking Ruben Tous et.al. 2402.09883 link
2024-02-15 Are Odd Radio Circles phoenixes of powerful radio galaxies? Stanislav Shabala et.al. 2402.09708 null
2024-02-10 Domain Adaptable Fine-Tune Distillation Framework For Advancing Farm Surveillance Raza Imam et.al. 2402.07059 link
2024-02-09 Iris-SAM: Iris Segmentation Using a Foundational Model Parisa Farmanifard et.al. 2402.06497 link
2024-02-25 ClickSAM: Fine-tuning Segment Anything Model using click prompts for ultrasound image segmentation Aimee Guo et.al. 2402.05902 null
2024-02-07 EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss Zhuoyang Zhang et.al. 2402.05008 link
2024-02-06 CAT-SAM: Conditional Tuning Network for Few-Shot Adaptation of Segmentation Anything Model Aoran Xiao et.al. 2402.03631 link
2024-02-03 Polyp-DAM: Polyp segmentation via depth anything model Zhuoran Zheng et.al. 2402.02298 null
2024-02-15 Segment Any Change Zhuo Zheng et.al. 2402.01188 link
2024-02-01 Comparative Evaluation of Traditional and Deep Learning-Based Segmentation Methods for Spoil Pile Delineation Using UAV Images Sureka Thiruchittampalam et.al. 2402.00295 null
2024-01-31 Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation Maoyuan Ye et.al. 2401.17904 link
2024-01-31 Convolution Meets LoRA: Parameter Efficient Finetuning for Segment Anything Model Zihan Zhong et.al. 2401.17868 link
2024-01-31 SimAda: A Simple Unified Framework for Adapting Segment Anything Model in Underperformed Scenes Yiran Song et.al. 2401.17803 link
2024-01-29 MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection Yuxue Yang et.al. 2401.16305 link
2024-01-27 GEM: Boost Simple Network for Glass Surface Segmentation via Segment Anything Model and Data Synthesis Jing Hao et.al. 2401.15282 link
2024-01-30 SAM-based instance segmentation models for the automation of structural damage detection Zehao Ye et.al. 2401.15266 null
2024-01-25 On generalisability of segment anything model for nuclear instance segmentation in histology images Kesi Xu et.al. 2401.14248 null
2024-01-25 Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks Tianhe Ren et.al. 2401.14159 link
2024-01-24 Segment Any Cell: A SAM-based Auto-prompting Fine-tuning Framework for Nuclei Segmentation Saiyang Na et.al. 2401.13220 null
2024-01-23 PA-SAM: Prompt Adapter SAM for High-Quality Image Segmentation Zhaozhi Xie et.al. 2401.13051 link
2024-01-23 SegmentAnyBone: A Universal Model that Segments Any Bone at Any Location on MRI Hanxue Gu et.al. 2401.12974 link
2024-01-18 RAP-SAM: Towards Real-Time All-Purpose Segment Anything Shilin Xu et.al. 2401.10228 link
2024-01-20 Boosting Few-Shot Semantic Segmentation Via Segment Anything Model Chen-Bin Feng et.al. 2401.09826 null
2024-01-17 Change Detection Between Optical Remote Sensing Imagery and Map Data via Segment Anything Model (SAM) Hongruixuan Chen et.al. 2401.09019 null
2024-01-16 Segment Anything Model Can Not Segment Anything: Assessing AI Foundation Model’s Generalizability in Permafrost Mapping Wenwen Li et.al. 2401.08787 null
2024-01-16 AGN jet-inflated bubbles as possible origin of odd radio circles Yen-Hsing Lin et.al. 2401.08207 null
2024-02-01 UV-SAM: Adapting Segment Anything Model for Urban Village Identification Xin Zhang et.al. 2401.08083 link
2024-01-16 Achieve Fairness without Demographics for Dermatological Disease Diagnosis Ching-Hao Chiu et.al. 2401.08066 link
2024-01-15 Foundation Models for Biomedical Image Segmentation: A Survey Ho Hin Lee et.al. 2401.07654 null
2024-01-15 Compositional Oil Spill Detection Based on Object Detector and Adapted Segment Anything Model from SAR Images Wenhui Wu et.al. 2401.07502 null
2024-01-12 SD-MVS: Segmentation-Driven Deformation Multi-View Stereo with Spherical Refinement and EM optimization Zhenlong Yuan et.al. 2401.06385 null
2024-01-12 SamLP: A Customized Segment Anything Model for License Plate Detection Haoxuan Ding et.al. 2401.06374 link
2024-01-11 MatSAM: Efficient Materials Microstructure Extraction via Visual Large Model Changtai Li et.al. 2401.05638 link
2024-01-09 Skin Cancer Segmentation and Classification Using Vision Transformer for Automatic Analysis in Dermatoscopy-based Non-invasive Digital System Galib Muhammad Shahriar Himel et.al. 2401.04746 null
2024-01-09 Segment anything model (SAM) for brain extraction in fMRI studies Dwith Chenna et.al. 2401.04740 link
2024-01-09 Learning to Prompt Segment Anything Models Jiaxing Huang et.al. 2401.04651 null
2024-01-07 Segment Anything Model for Medical Image Segmentation: Current Applications and Future Directions Yichi Zhang et.al. 2401.03495 link
2024-01-05 Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively Haobo Yuan et.al. 2401.02955 link
2024-01-04 ClassWise-SAM-Adapter: Parameter Efficient Fine-tuning Adapts Segment Anything to SAR Domain for Semantic Segmentation Xinyang Pu et.al. 2401.02326 link
2024-01-08 BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model Yiran Song et.al. 2401.02317 link
2024-01-04 Leveraging SAM for Single-Source Domain Generalization in Medical Image Segmentation Hanhui Wang et.al. 2401.02076 link
2024-01-06 Discovery of a circularly symmetric extended diffuse radio emission around an elliptical galaxy with the VLA FIRST survey Shobha Kumari et.al. 2401.01278 null
2024-01-02 Unsupervised Continual Anomaly Detection with Contrastively-learned Prompt Jiaqi Liu et.al. 2401.01010 link
2023-12-30 Promoting Segment Anything Model towards Highly Accurate Dichotomous Image Segmentation Xianjie Liu et.al. 2401.00248 null
2023-12-28 Generalizable Visual Reinforcement Learning with Segment Anything Model Ziyu Wang et.al. 2312.17116 link
2023-12-27 Segment Change Model (SCM) for Unsupervised Change detection in VHR Remote Sensing Images: a Case Study of Buildings Xiaoliang Tan et.al. 2312.16410 link
2023-12-24 Segment Any Events via Weighted Adaptation of Pivotal Tokens Zhiwen Chen et.al. 2312.16222 link
2023-12-26 Medical Report Generation based on Segment-Enhanced Contrastive Representation Learning Ruoqing Zhao et.al. 2312.15869 null
2023-12-26 Video Frame Interpolation with Region-Distinguishable Priors from SAM Yan Han et.al. 2312.15868 null
2023-12-22 Part to Whole: Collaborative Prompting for Surgical Instrument Segmentation Wenxi Yue et.al. 2312.14481 link
2023-12-22 FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection Dongmei Zhang et.al. 2312.14465 null
2023-12-21 TinySAM: Pushing the Envelope for Efficient Segment Anything Model Han Shu et.al. 2312.13789 link
2023-12-20 Testing the Segment Anything Model on radiology data José Guilherme de Almeida et.al. 2312.12880 null
2023-12-20 Segment Anything Model Meets Image Harmonization Haoxing Chen et.al. 2312.12729 null
2023-12-19 Weakly Supervised Open-Vocabulary Object Detection Jianghang Lin et.al. 2312.12437 null
2023-12-19 Towards SAMBA: Segment Anything Model for Brain Tumor Segmentation in Sub-Sharan African Populations Mohannad Barakat et.al. 2312.11775 null
2023-12-17 SAI3D: Segment Any Instance in 3D Scenes Yingda Yin et.al. 2312.11557 null
2023-12-18 Appearance-based Refinement for Object-Centric Motion Segmentation Junyu Xie et.al. 2312.11463 null
2023-12-20 How to Efficiently Annotate Images for Best-Performing Deep Learning Based Segmentation Models: An Empirical Study with Weak and Noisy Annotations and Segment Anything Model Yixin Zhang et.al. 2312.10600 link
2023-12-16 Mapping Housing Stock Characteristics from Drone Images for Climate Resilience in the Caribbean Isabelle Tingzon et.al. 2312.10306 null
2023-12-25 Osprey: Pixel Understanding with Visual Instruction Tuning Yuqian Yuan et.al. 2312.10032 link
2023-12-15 SQA-SAM: Segmentation Quality Assessment for Medical Images Utilizing the Segment Anything Model Yizhe Zhang et.al. 2312.09899 null
2023-12-15 Collaborating Foundation models for Domain Generalized Semantic Segmentation Yasser Benigmim et.al. 2312.09788 link
2023-12-15 MobileSAMv2: Faster Segment Anything to Everything Chaoning Zhang et.al. 2312.09579 link
2023-12-21 Enhancing Data Lakes with GraphAr: Efficient Graph Data Management with a Specialized Storage Scheme Xue Li et.al. 2312.09577 link
2023-12-14 Influence of Prompting Strategies on Segment Anything Model (SAM) for Short-axis Cardiac MRI segmentation Josh Stein et.al. 2312.08932 null
2023-12-13 ASLseg: Adapting SAM in the Loop for Semi-supervised Liver Tumor Segmentation Shiyun Chen et.al. 2312.07969 null
2023-12-18 Relax Image-Specific Prompt Requirement in SAM: A Single Generic Prompt for Segmenting Camouflaged Objects Jian Hu et.al. 2312.07374 link
2023-12-11 SqueezeSAM: User friendly mobile interactive segmentation Balakrishnan Varadarajan et.al. 2312.06736 null
2023-12-11 EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM Chong Zhou et.al. 2312.06660 link
2023-12-11 The Intrinsic Sizes of Odd Radio Circles David Rupke et.al. 2312.06387 null
2023-12-11 Semantic Connectivity-Driven Pseudo-labeling for Cross-domain Segmentation Dong Zhao et.al. 2312.06331 link
2023-12-11 SemiSAM: Exploring SAM for Enhancing Semi-Supervised Medical Image Segmentation with Extremely Limited Annotations Yichi Zhang et.al. 2312.06316 link
2023-12-10 RepViT-SAM: Towards Real-Time Segmenting Anything Ao Wang et.al. 2312.05760 link
2023-12-12 0.1% Data Makes Segment Anything Slim Zigeng Chen et.al. 2312.05284 link
2023-12-15 Fine-tuning vision foundation model for crack segmentation in civil infrastructures Kang Ge et.al. 2312.04233 null
2023-12-07 SAMBA: A Trainable Segmentation Web-App with Smart Labelling Ronan Docherty et.al. 2312.04197 link
2023-12-07 An unsupervised approach towards promptable defect segmentation in laser-based additive manufacturing by Segment Anything Israt Zarin Era et.al. 2312.04063 null
2023-12-06 Boosting Segment Anything Model Towards Open-Vocabulary Learning Xumeng Han et.al. 2312.03628 link
2023-12-10 Foundation Model Assisted Weakly Supervised Semantic Segmentation Xiaobo Yang et.al. 2312.03585 link
2023-12-05 AI-SAM: Automatic and Interactive Segment Anything Model Yimu Pan et.al. 2312.03119 link
2023-12-05 SAM-Assisted Remote Sensing Imagery Semantic Segmentation with Object and Boundary Constraints Xianping Ma et.al. 2312.02464 link
2023-12-05 Towards Granularity-adjusted Pixel-level Semantic Annotation Rohit Kundu et.al. 2312.02420 null
2023-12-03 SANeRF-HQ: Segment Anything for NeRF in High Quality Yichen Liu et.al. 2312.01531 null
2023-12-01 Segment and Caption Anything Xiaoke Huang et.al. 2312.00869 link
2023-12-01 EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything Yunyang Xiong et.al. 2312.00863 link
2023-12-01 Segment Anything Model-guided Collaborative Learning Network for Scribble-supervised Polyp Segmentation Yiming Zhao et.al. 2312.00312 null
2023-11-29 SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Scene Segmentation Mutian Xu et.al. 2311.17707 link
2023-11-28 Parameter Efficient Fine-tuning via Cross Block Orchestration for Segment Anything Model Zelin Peng et.al. 2311.17112 null
2023-11-28 I-MedSAM: Implicit Medical Image Segmentation with Segment Anything Xiaobao Wei et.al. 2311.17081 link
2023-12-01 Self-Supervised Learning of Whole and Component-Based Semantic Representations for Person Re-Identification Siyuan Huang et.al. 2311.17074 null
2023-11-27 Unleashing the Power of Prompt-driven Nucleus Instance Segmentation Zhongyi Shui et.al. 2311.15939 link
2023-12-05 Stable Segment Anything Model Qi Fan et.al. 2311.15776 link
2023-11-27 MARIS: Referring Image Segmentation via Mutual-Aware Attention Features Mengxi Zhang et.al. 2311.15727 null
2023-11-27 SAM-6D: Segment Anything Model Meets Zero-Shot 6D Object Pose Estimation Jiehong Lin et.al. 2311.15707 link
2023-11-27 Where to Begin? From Random to Foundation Model Instructed Initialization in Federated Learning for Medical Image Segmentation Ming Li et.al. 2311.15463 null
2023-11-26 Obj-NeRF: Extract Object NeRFs from Multi-view Images Zhiyi Li et.al. 2311.15291 null
2023-12-04 Can SAM recognize crops? Quantifying the zero-shot performance of a semantic segmentation foundation model on generating crop-type maps using satellite imagery for precision agriculture Rutuja Gurav et.al. 2311.15138 null
2023-11-22 Self-guided Few-shot Semantic Segmentation for Remote Sensing Imagery Based on Large Vision Models Xiyu Qi et.al. 2311.13200 null
2023-11-21 Novel OCT mosaicking pipeline with Feature- and Pixel-based registration Jiacheng Wang et.al. 2311.13052 link
2023-11-21 GMISeg: General Medical Image Segmentation without Re-Training Jing Xu et.al. 2311.12539 null
2023-11-20 Broadband non-thermal emission of odd radio circles induced by galactic outflow remnants and their evolution Yutaka Fujita et.al. 2311.12099 null
2023-11-19 Few-Shot Classification & Segmentation Using Large Language Models Agent Tian Meng et.al. 2311.12065 null
2023-11-20 SA-Med2D-20M Dataset: Segment Anything in 2D Medical Imaging with 20 Million masks Jin Ye et.al. 2311.11969 link
2023-11-19 GeoSAM: Fine-tuning SAM with Sparse and Dense Visual Prompting for Automated Segmentation of Mobility Infrastructure Rafi Ibn Sultan et.al. 2311.11319 link
2023-11-18 A Foundation Model for Cell Segmentation Uriah Israel et.al. 2311.11004 null
2023-11-17 Zero-Shot Digital Rock Image Segmentation with a Fine-Tuned Segment Anything Model Zhaoyang Ma et.al. 2311.10865 null
2023-11-17 Segment Anything Model with Uncertainty Rectification for Auto-Prompting Medical Image Segmentation Yichi Zhang et.al. 2311.10529 null
2023-11-16 Slide-SAM: Medical SAM Meets Sliding Window Quan Quan et.al. 2311.10121 link
2023-11-15 AdapterShadow: Adapting Segment Anything Model for Shadow Detection Leiping Jie et.al. 2311.08891 link
2023-11-15 Discovery of Diffuse Radio Source in Abell 1060 Kohei Kurahara et.al. 2311.08693 null
2023-11-14 Uni-COAL: A Unified Framework for Cross-Modality Synthesis and Super-Resolution of MR Images Zhiyun Song et.al. 2311.08225 null
2023-11-14 SAMIHS: Adaptation of Segment Anything Model for Intracranial Hemorrhage Segmentation Yinuo Wang et.al. 2311.08190 link
2023-11-14 Zero-Shot Segmentation of Eye Features Using the Segment Anything Model (SAM) Virmarie Maquiling et.al. 2311.08077 link
2023-11-14 GlanceSeg: Real-time microaneurysm lesion segmentation with gaze-map-guided foundation model for early detection of diabetic retinopathy Hongyang Jiang et.al. 2311.08075 null
2023-11-10 EviPrompt: A Training-Free Evidential Prompt Generation Method for Segment Anything Model in Medical Images Yinsong Xu et.al. 2311.06400 null
2023-11-09 SAMVG: A Multi-stage Image Vectorization Model with the Segment-Anything Model Haokun Zhu et.al. 2311.05276 null
2023-11-08 Are foundation models efficient for medical image segmentation? Danielle Ferreira et.al. 2311.04847 null
2023-11-06 Masking Hyperspectral Imaging Data with Pretrained Models Elias Arbash et.al. 2311.03053 link
2023-11-06 Leveraging Large-Scale Pretrained Vision Foundation Models for Label-Efficient 3D Point Cloud Segmentation Shichao Dong et.al. 2311.01989 null
2023-11-02 Sam-Guided Enhanced Fine-Grained Encoding with Mixed Semantic Learning for Medical Image Captioning Gaoang Wang et.al. 2311.01004 link
2023-10-31 Joint Depth Prediction and Semantic Segmentation with Multi-View SAM Mykhailo Shvets et.al. 2311.00134 null
2023-10-31 Team I2R-VI-FF Technical Report on EPIC-KITCHENS VISOR Hand Object Segmentation Challenge 2023 Fen Fang et.al. 2310.20120 null
2023-11-13 Promise:Prompt-driven 3D Medical Image Segmentation Using Pretrained Image Foundation Models Hao Li et.al. 2310.19721 link
2023-10-30 A High-Resolution Dataset for Instance Detection with Multi-View Instance Capture Qianqian Shen et.al. 2310.19257 link
2023-10-28 Audio-Visual Instance Segmentation Ruohao Guo et.al. 2310.18709 link
2023-10-26 Task-driven Prompt Evolution for Foundation Models Rachana Sathish et.al. 2310.17128 null
2023-10-25 Open-NeRF: Towards Open Vocabulary NeRF Decomposition Hao Zhang et.al. 2310.16383 null
2023-10-23 SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding Haoxiang Wang et.al. 2310.15308 null
2023-10-23 Ionized Gas Extended Over 40 kpc in an Odd Radio Circle Host Galaxy Alison L. Coil et.al. 2310.15162 null
2023-10-29 SAM-Med3D Haoyu Wang et.al. 2310.15161 link
2023-10-19 Weakly-Supervised Semantic Segmentation with Image-Level Labels: from Traditional Models to Foundation Models Zhaozheng Chen et.al. 2310.13026 link
2023-10-04 Comprehensive Multimodal Segmentation in Medical Imaging: Combining YOLOv8 with SAM and HQ-SAM Models Sumit Pandey et.al. 2310.12995 null
2023-10-19 Segment Anything Meets Universal Adversarial Perturbation Dongshen Han et.al. 2310.12431 null
2023-10-17 Towards Training-free Open-world Segmentation via Image Prompting Foundation Models Lv Tang et.al. 2310.10912 link
2023-10-16 Electric dipole polarizability of low-lying excited states in atomic nuclei José Nicolás Orce et.al. 2310.10775 null
2023-10-16 Evaluation and improvement of Segment Anything Model for interactive histopathology image segmentation SeungKyu Kim et.al. 2310.10493 null
2023-11-07 Recursive Segmentation Living Image: An eXplainable AI (XAI) Approach for Computing Structural Beauty of Images or the Livingness of Space Yao Qianxiang et.al. 2310.10149 null
2023-10-16 Black-box Targeted Adversarial Attack on Segment Anything (SAM) Sheng Zheng et.al. 2310.10010 null
2023-10-24 Segment Anything Model for Pedestrian Infrastructure Inventory: Assessing Zero-Shot Segmentation on Multi-Mode Geospatial Data Jiahao Xia et.al. 2310.09918 null
2023-10-17 Prototype-oriented Unsupervised Change Detection for Disaster Management Youngtack Oh et.al. 2310.09759 null
2023-10-13 Generative AI-driven Semantic Communication Framework for NextG Wireless Network Avi Deb Raha et.al. 2310.09021 null
2023-10-12 Virtual Augmented Reality for Atari Reinforcement Learning Christian A. Schiller et.al. 2310.08683 link
2023-10-12 Fine-Grained Annotation for Face Anti-Spoofing Xu Chen et.al. 2310.08142 null
2023-10-10 Machine Eye for Defects: Machine Learning-Based Solution to Identify and Characterize Topological Defects in Textured Images of Nematic Materials Haijie Ren et.al. 2310.06406 null
2023-10-09 Empirical Evaluation of the Segment Anything Model (SAM) for Brain Tumor Segmentation Mohammad Peivandi et.al. 2310.06162 null
2023-10-07 Tree-GPT: Modular Large Language Model Expert System for Forest Remote Sensing Image Understanding and Interactive Analysis Siqi Du et.al. 2310.04698 null
2023-10-06 TiC: Exploring Vision Transformer in Convolution Song Zhang et.al. 2310.04134 link
2023-10-03 Multi-Prompt Fine-Tuning of Foundation Models for Enhanced Medical Image Segmentation Xiangru Li et.al. 2310.02381 null
2023-10-03 Zero-Shot Refinement of Buildings’ Segmentation Models using SAM Ali Mayladan et.al. 2310.01845 link
2023-10-01 Propagating Semantic Labels in Video Data David Balaban et.al. 2310.00783 null
2023-09-30 Exploring SAM Ablations for Enhancing Medical Segmentation in Radiology and Pathology Amin Ranem et.al. 2310.00504 null
2023-09-29 Are Odd Radio Circles virial shocks around massive galaxies? Implications for cosmic-ray diffusion in the circumgalactic medium Shotaro Yamasaki et.al. 2309.17451 null
2023-10-02 UniQuadric: A SLAM Backend for Unknown Rigid Object 3D Tracking and Light-Weight Modeling Linghao Yang et.al. 2309.17036 null
2023-09-29 Segment Anything Model is a Good Teacher for Local Feature Learning Jingqian Wu et.al. 2309.16992 link
2023-10-02 nnSAM: Plug-and-play Segment Anything Model Improves nnUNet Performance Yunxiang Li et.al. 2309.16967 link
2023-09-28 Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization Thilo von Neumann et.al. 2309.16482 null
2023-09-27 Learning from SAM: Harnessing a Segmentation Foundation Model for Sim2Real Domain Adaptation through Regularization Mayara E. Bonani et.al. 2309.15562 null
2023-09-24 A SAM-based Solution for Hierarchical Panoptic Segmentation of Crops and Weeds Competition Khoa Dang Nguyen et.al. 2309.13578 null
2023-09-24 MediViSTA-SAM: Zero-shot Medical Video Analysis with Spatio-temporal SAM Adaptation Sekeun Kim et.al. 2309.13539 link
2023-09-22 NOC: High-Quality Neural Object Cloning with 3D Lifting of Segment Anything Xiaobao Wei et.al. 2309.12790 link
2023-09-21 Deshadow-Anything: When Segment Anything Model Meets Zero-shot shadow removal Xiao Feng Zhang et.al. 2309.11715 null
2023-09-18 An Accurate and Efficient Neural Network for OCTA Vessel Segmentation and a New Dataset Haojian Ning et.al. 2309.09483 link
2023-09-16 MA-SAM: Modality-agnostic SAM Adaptation for 3D Medical Image Segmentation Cheng Chen et.al. 2309.08842 link
2023-09-15 Global trends of the electric dipole polarizability from shell-model calculations José Nicolás Orce et.al. 2309.08810 null
2023-09-15 Segment Anything Model for Brain Tumor Segmentation Peng Zhang et.al. 2309.08434 null
2023-09-13 SAMUS: Adapting Segment Anything Model for Clinically-Friendly and Generalizable Ultrasound Image Segmentation Xian Lin et.al. 2309.06824 link
2023-09-07 SAM3D: Segment Anything Model in Volumetric Medical Images Nhat-Tan Bui et.al. 2309.03493 link
2023-09-05 Artificial General Intelligence for Radiation Oncology Chenbin Liu et.al. 2309.02590 null
2023-09-05 SAM-Deblur: Let Segment Anything Boost Image Deblurring Siwei Li et.al. 2309.02270 link
2023-09-04 Prompt me a Dataset: An investigation of text-image prompting for historical image dataset creation using foundation models Hassan El-Hajj et.al. 2309.01674 link
2023-09-04 Adapting Segment Anything Model for Change Detection in HR Remote Sensing Images Lei Ding et.al. 2309.01429 link
2023-09-01 Self-Sampling Meta SAM: Enhancing Few-shot Medical Image Segmentation with Meta-Learning Yiming Zhang et.al. 2308.16466 link
2023-08-30 SAM-Med2D Junlong Cheng et.al. 2308.16184 link
2023-08-28 Auto-Prompting SAM for Mobile Friendly 3D Medical Image Segmentation Chengyin Li et.al. 2308.14936 link
2023-08-31 SAM-PARSER: Fine-tuning SAM Efficiently by Parameter Space Reconstruction Zelin Peng et.al. 2308.14604 null
2023-08-27 Cheap Lunch for Medical Image Segmentation by Fine-tuning SAM on Few Exemplars Weijia Feng et.al. 2308.14133 null
2023-08-27 Enhancing Bloodstain Analysis Through AI-Based Segmentation: Leveraging Segment Anything Model for Crime Scene Investigation Zihan Dong et.al. 2308.13979 link
2023-08-26 Zero-Shot Edge Detection with SCESAME: Spectral Clustering-based Ensemble for Segment Anything Model Estimation Hiroaki Yamagiwa et.al. 2308.13779 link
2023-08-26 SamDSK: Combining Segment Anything Model with Domain-Specific Knowledge for Semi-Supervised Learning in Medical Image Segmentation Yizhe Zhang et.al. 2308.13759 link
2023-08-23 SPPNet: A Single-Point Prompt Network for Nuclei Image Segmentation Qing Xu et.al. 2308.12231 link
2023-08-22 SAMSNeRF: Segment Anything Model (SAM) Guides Dynamic Surgical Scene Reconstruction by Neural Radiance Field (NeRF) Ange Lou et.al. 2308.11774 null
2023-08-20 False Negative/Positive Control for SAM on Noisy Medical Images Xing Yao et.al. 2308.10382 link
2023-08-31 SAMedOCT: Adapting Segment Anything Model (SAM) for Retinal OCT Botond Fazekas et.al. 2308.09331 null
2023-08-17 SurgicalSAM: Efficient Class Promptable Surgical Instrument Segmentation Wenxi Yue et.al. 2308.08746 link
2023-08-15 Self-Prompting Large Vision Models for Few-Shot Medical Image Segmentation Qi Wu et.al. 2308.07624 link
2023-08-14 SAM Meets Robotic Surgery: An Empirical Study on Generalization, Robustness and Adaptation An Wang et.al. 2308.07156 null
2023-08-14 A One Stop 3D Target Reconstruction and multilevel Segmentation Method Jiexiong Xu et.al. 2308.06974 link
2023-08-14 CEmb-SAM: Segment Anything Model with Condition Embedding for Joint Learning from Heterogeneous Datasets Dongik Shin et.al. 2308.06957 null
2023-08-28 CLE Diffusion: Controllable Light Enhancement Diffusion Model Yuyang Yin et.al. 2308.06725 null
2023-08-12 Polyp-SAM++: Can A Text Guided SAM Perform Better for Polyp Segmentation? Risab Biswas et.al. 2308.06623 link
2023-08-12 TongueSAM: An Universal Tongue Segmentation Model Based on SAM with Zero-Shot Shan Cao et.al. 2308.06444 link
2023-08-11 FoodSAM: Any Food Segmentation Xing Lan et.al. 2308.05938 link
2023-08-10 Leverage Weakly Annotation to Pixel-wise Annotation via Zero-shot Segment Anything Model for Molecular-empowered Learning Xueyuan Li et.al. 2308.05785 null
2023-08-10 Adaptive Low Rank Adaptation of Segment Anything to Salient Object Detection Ruikai Cui et.al. 2308.05426 link
2023-08-08 AquaSAM: Underwater Image Foreground Segmentation Muduo Xu et.al. 2308.04218 link
2023-08-05 Surrogate Empowered Sim2Real Transfer of Deep Reinforcement Learning for ORC Superheat Control Runze Lin et.al. 2308.02765 null
2023-08-02 Push the Boundary of SAM: A Pseudo-label Correction Framework for Medical Segmentation Ziyi Huang et.al. 2308.00883 null
2023-08-16 SAMFlow: Eliminating Any Fragmentation in Optical Flow with Segment Anything Model Shili Zhou et.al. 2307.16586 null
2023-07-26 Tracking Anything in High Quality Jiawen Zhu et.al. 2307.13974 link
2023-07-21 MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems Thilo von Neumann et.al. 2307.11394 link
2023-07-12 SAM-Path: A Segment Anything Model for Semantic Segmentation in Digital Pathology Jingwei Zhang et.al. 2307.09570 null
2023-07-15 Open Scene Understanding: Grounded Situation Recognition Meets Segment Anything for Helping People with Visual Impairments Ruiping Liu et.al. 2307.07757 link
2023-07-11 $\mathrm{SAM^{Med}}$ : A medical image annotation framework based on large vision model Chenglong Wang et.al. 2307.05617 null
2023-07-07 Large AI Model-Based Semantic Communications Feibo Jiang et.al. 2307.03492 null
2023-07-10 ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: TREK-150 Single Object Tracking Yuanyou Xu et.al. 2307.02508 null
2023-07-05 AxonCallosumEM Dataset: Axon Semantic Segmentation of Whole Corpus Callosum cross section from EM Images Ao Cheng et.al. 2307.02464 null
2023-07-03 Segment Anything Meets Point Tracking Frano Rajič et.al. 2307.01197 link
2023-07-03 SAMAug: Point Prompt Augmentation for Segment Anything Model Haixing Dai et.al. 2307.01187 link
2023-07-03 SAM-DA: UAV Tracks Anything at Night with SAM-Powered Domain Adaptation Liangliang Yao et.al. 2307.01024 link
2023-07-03 RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation Yonglin Li et.al. 2307.00997 link
2023-07-01 All-in-SAM: from Weak Annotation to Pixel-wise Nuclei Segmentation with Prompt-based Finetuning Can Cui et.al. 2307.00290 null
2023-06-30 Training-free Object Counting with Prompts Zenglin Shi et.al. 2307.00038 link
2023-06-30 Topological Data Analysis Guided Segment Anything Model Prompt Optimization for Zero-Shot Segmentation in Biological Imaging Ruben Glatt et.al. 2306.17400 null
2023-06-29 Detect Any Deepfakes: Segment Anything Meets Face Forgery Detection and Localization Yingxin Lai et.al. 2306.17075 link
2023-06-29 The Segment Anything Model (SAM) for Remote Sensing Applications: From Zero to One Shot Lucas Prado Osco et.al. 2306.16623 link
2023-06-28 RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation based on Visual Foundation Model Keyan Chen et.al. 2306.16269 link
2023-06-28 Effective Transfer of Pretrained Large Visual Model for Fabric Defect Segmentation via Specifc Knowledge Injection Zhewei Chen et.al. 2306.16186 null
2023-06-24 Utilizing Segment Anything Model For Assessing Localization of GRAD-CAM in Medical Imaging Evan Kellener et.al. 2306.15692 null
2023-06-27 CellViT: Vision Transformers for Precise Cell Segmentation and Classification Fabian Hörst et.al. 2306.15350 link
2023-06-30 MedLSAM: Localize and Segment Anything Model for 3D Medical Images Wenhui Lei et.al. 2306.14752 link
2023-07-01 Faster Segment Anything: Towards Lightweight SAM for Mobile Applications Chaoning Zhang et.al. 2306.14289 link
2023-06-25 When SAM Meets Sonar Images Lin Wang et.al. 2306.14109 link
2023-06-23 Curvature-enhanced Graph Convolutional Network for Biomolecular Interaction Prediction Cong Shen et.al. 2306.13699 link
2023-06-23 3DSAM-adapter: Holistic Adaptation of SAM from 2D to 3D for Promptable Medical Image Segmentation Shizhan Gong et.al. 2306.13465 link
2023-06-23 Robustness of Segment Anything Model (SAM) for Autonomous Driving in Adverse Weather Conditions Xinru Shan et.al. 2306.13290 null
2023-06-22 Ladder Fine-tuning approach for SAM integrating complementary network Shurong Chai et.al. 2306.12737 link
2023-06-21 Comparative Analysis of Segment Anything Model and U-Net for Breast Tumor Detection in Ultrasound and Mammography Images Mohsen Ahmadi et.al. 2306.12510 null
2023-06-21 Fast Segment Anything Xu Zhao et.al. 2306.12156 link
2023-06-20 Segment Anything Model (SAM) for Radiation Oncology Lian Zhang et.al. 2306.11730 null
2023-06-22 Enlighten Anything: When Segment Anything Model Meets Low-Light Image Enhancement Qihan Zhao et.al. 2306.10286 link
2023-06-15 Temporally-Extended Prompts Optimization for SAM in Interactive Medical Image Segmentation Chuyun Shen et.al. 2306.08958 null
2023-06-14 TomoSAM: a 3D Slicer extension using SAM for tomography segmentation Federico Semeraro et.al. 2306.08609 link
2023-06-13 Robustness of SAM: Segment Anything Under Corruptions and Beyond Yu Qiao et.al. 2306.07713 null