CV Arxiv Daily

GitPages on https://theneao.github.io/CV-SAR-Seg-arxiv-daily

Updated on 2026.03.30

Usage instructions: here

self-supervised

Publish Date	Title	Authors	PDF	Code
2021-09-09	Fast, Effective, and Self-Supervised: Transforming Masked Language Models into Universal Lexical and Sentence Encoders	Fangyu Liu et.al.	2104.08027	link
2022-07-11	Learned Camera Gain and Exposure Control for Improved Visual Feature Detection and Matching	Justin Tomasi et.al.	2102.04341	null

edge detection

Publish Date	Title	Authors	PDF	Code
2025-07-19	GTPBD: A Fine-Grained Global Terraced Parcel and Boundary Dataset	Zhiwei Zhang et.al.	2507.14697	null
2025-07-17	Multiresolution local smoothness detection in non-uniformly sampled multivariate signals	Sara Avesani et.al.	2507.13480	null
2025-07-15	Towards a Utility-Scale Quantum Edge Detection for Real-World Medical Image Data	Emmanuel Billias et.al.	2507.10939	null
2025-07-14	Improved upper limits on the 21-cm signal power spectrum at $z=17.0$ and $z=20.3$ from an optimal field observed with NenuFAR	S. Munshi et.al.	2507.10533	null
2025-07-09	Quantum algorithm for edge detection in digital grayscale images	Mohit Rohida et.al.	2507.06642	null
2025-07-09	Edge-Boundary-Texture Loss: A Tri-Class Generalization of Weighted Binary Cross-Entropy for Enhanced Edge Detection	Hao Shu et.al.	2507.06569	null
2025-06-29	Multimodal image registration for effective thermographic fever screening	C. Y. N. Dwith et.al.	2507.02955	null
2025-07-01	LIGHTS. A robust technique to identify galaxy edges	Giulia Golini et.al.	2507.01085	null
2025-06-25	U-R-VEDA: Integrating UNET, Residual Links, Edge and Dual Attention, and Vision Transformer for Accurate Semantic Segmentation of CMRs	Racheal Mukisa et.al.	2506.20689	null
2025-06-23	Programmable electro-optic frequency comb empowers integrated parallel convolution processing	Jinze He et.al.	2506.18310	null
2025-06-22	Mobile Image Analysis Application for Mantoux Skin Test	Liong Gele et.al.	2506.17954	null
2025-06-04	Mechanistic Interpretability of Diffusion Models: Circuit-Level Analysis and Causal Validation	Dip Roy et.al.	2506.17237	null
2025-06-20	Self-supervised Feature Extraction for Enhanced Ball Detection on Soccer Robots	Can Lin et.al.	2506.16821	null
2025-06-14	Binarization-Aware Adjuster: Bridging Continuous Optimization and Binary Inference in Edge Detection	Hao Shu et.al.	2506.12460	null
2025-06-13	Exploring the Effectiveness of Deep Features from Domain-Specific Foundation Models in Retinal Image Synthesis	Zuzanna Skorniewska et.al.	2506.11753	null
2025-06-11	A new approach for image segmentation based on diffeomorphic registration and gradient fields	Junchao Zhou et.al.	2506.09357	null
2025-06-10	Machine Learning for the Cluster Reconstruction in the CALIFA Calorimeter at R3B	Tobias Jenegger et.al.	2506.09088	null
2025-06-06	Elementary Cellular Automata as Non-Cryptographic Hash Functions	Daniel McKinley et.al.	2506.06551	null
2025-06-18	Statistical microlocal analysis in two-dimensional X-ray CT	Anuj Abhishek et.al.	2506.05113	null
2025-06-03	Heliostat Optical Error Inspection with Polarimetric Imaging Drone	Mo Tian et.al.	2506.02333	null
2025-06-01	Hybridizing Expressive Rendering: Stroke-Based Rendering with Classic and Neural Methods	Kapil Dev et.al.	2506.00870	null
2025-05-28	Depth to magnetic source estimation using TDX contour	Hammed Oyekan et.al.	2505.22780	null
2025-05-24	Tropical Geometry Based Edge Detection Using Min-Plus and Max-Plus Algebra	Shivam Kumar Jha S et.al.	2505.18625	null
2025-05-07	Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer	Sainath Dey et.al.	2505.04740	null
2025-05-06	Rethinking Boundary Detection in Deep Learning-Based Medical Image Segmentation	Yi Lin et.al.	2505.04652	link
2025-05-03	Seeing Heat with Color – RGB-Only Wildfire Temperature Inference from SAM-Guided Multimodal Distillation using Radiometric Ground Truth	Michael Marinaccio et.al.	2505.01638	null
2025-05-02	Edge Detection based on Channel Attention and Inter-region Independence Test	Ru-yu Yan et.al.	2505.01040	null
2025-05-02	Edge-preserving Image Denoising via Multi-scale Adaptive Statistical Independence Testing	Ruyu Yan et.al.	2505.01032	null
2025-04-22	DeepCS-TRD, a Deep Learning-based Cross-Section Tree Ring Detector	Henry Marichal et.al.	2504.16242	null
2025-04-22	Multi-Scale Tensorial Summation and Dimensional Reduction Guided Neural Network for Edge Detection	Lei Xu et.al.	2504.15770	null
2025-04-21	Novel Concept-Oriented Synthetic Data approach for Training Generative AI-Driven Crystal Grain Analysis Using Diffusion Model	Ahmed Sobhi Saleh et.al.	2504.14782	null
2025-04-18	DADU: Dual Attention-based Deep Supervised UNet for Automated Semantic Segmentation of Cardiac Images	Racheal Mukisa et.al.	2504.13415	null
2025-04-07	Advanced Knife-Edge free Self-Aligned Colour Schlieren Imaging with Extended Measuring Range	Shubham Saxena et.al.	2504.05433	null
2025-04-06	Evaluation framework for Image Segmentation Algorithms	Tatiana Merkulova et.al.	2504.04435	null
2025-03-26	Hybrid Multi-Stage Learning Framework for Edge Detection: A Survey	Mark Phil Pacot et.al.	2503.21827	null
2025-03-21	Model reduction of convection-dominated viscous conservation laws using implicit feature tracking and landmark image registration	Victor Zucatti et.al.	2503.17463	null
2025-03-21	Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection	Gensheng Pei et.al.	2503.17080	null
2025-03-19	Benchmarking Brain Connectivity Graph Inference: A Novel Validation Approach	Alice Chevaux et.al.	2503.15012	null
2025-03-04	Robust Detection of Extremely Thin Lines Using 0.2mm Piano Wire	Jisoo Hong et.al.	2503.13473	null
2025-03-14	Refining Image Edge Detection via Linear Canonical Riesz Transforms	Shuhui Yang et.al.	2503.11148	null
2025-03-12	Polygonizing Roof Segments from High-Resolution Aerial Images Using Yolov8-Based Edge Detection	Qipeng Mei et.al.	2503.09187	null
2025-03-02	STAR-Edge: Structure-aware Local Spherical Curve Representation for Thin-walled Edge Extraction from Unstructured Point Clouds	Zikuan Li et.al.	2503.00801	link
2025-02-24	Theory-guided Pseudo-spectral Full Waveform Inversion via Deep Neural Networks	Christopher Zerafa et.al.	2502.17624	null
2025-02-23	Subpixel Edge Localization Based on Converted Intensity Summation under Stable Edge Region	Yingyuan Yang et.al.	2502.16502	null
2025-02-17	Enhancing Transparent Object Pose Estimation: A Fusion of GDR-Net and Edge Detection	Tessa Pulli et.al.	2502.12027	null
2025-02-14	Edge detection with polynomial frames on the sphere	Frederic Schoppert et.al.	2502.09979	null
2025-02-08	Multifunctional meta-optic azimuthal shear interferometer	Linzhi Yu et.al.	2502.05569	null
2025-02-06	Agricultural Field Boundary Detection through Integration of “Simple Non-Iterative Clustering (SNIC) Super Pixels” and “Canny Edge Detection Method”	Artughrul Gayibov et.al.	2502.04529	link
2025-01-31	Training-free Quantum-Inspired Image Edge Extraction Method	Arti Jain et.al.	2501.18929	null
2025-01-27	Autonomous Horizon-based Asteroid Navigation With Observability-constrained Maneuvers	Aditya Arjun Anibha et.al.	2501.15806	null
2025-01-25	Snapshot Compressed Imaging Based Single-Measurement Computer Vision for Videos	Fengpu Pan et.al.	2501.15122	null
2025-01-29	Stroke classification using Virtual Hybrid Edge Detection from in silico electrical impedance tomography data	Juan Pablo Agnelli et.al.	2501.14704	null
2025-01-23	Enhanced Extractor-Selector Framework and Symmetrization Weighted Binary Cross-Entropy for Edge Detections	Hao Shu et.al.	2501.13365	null
2025-01-20	Wafer-scale waveguide sidewall roughness scattering loss characterization by image processing	Mohit Khurana et.al.	2501.11590	null
2025-01-08	EDMB: Edge Detector with Mamba	Yachuan Li et.al.	2501.04846	link
2025-01-06	Gaussian Masked Autoencoders	Jathushan Rajasegaran et.al.	2501.03229	null
2025-01-05	Pixel-Wise Feature Selection for Perceptual Edge Detection without post-processing	Hao Shu et.al.	2501.02534	null
2025-01-03	Structural and Statistical Audio Texture Knowledge Distillation (SSATKD) for Passive Sonar Classification	Jarin Ritu et.al.	2501.01921	link
2024-12-24	Efficient Detection Framework Adaptation for Edge Computing: A Plug-and-play Neural Network Toolbox Enabling Edge Deployment	Jiaqi Wu et.al.	2412.18230	link
2024-12-22	Phase-change metasurfaces for reconfigurable image processing	Tingting Liu et.al.	2412.16856	null
2024-12-17	Synthetic Data Generation for Anomaly Detection on Table Grapes	Ionut Marian Motoi et.al.	2412.12949	link
2024-12-17	SAUGE: Taming SAM for Uncertainty-Aligned Multi-Granularity Edge Detection	Xing Liufu et.al.	2412.12892	link
2025-02-03	Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining	Zhiqi Ge et.al.	2412.10342	null
2024-12-13	Deep Gaussian Process Priors for Bayesian Image Reconstruction	Jonas Latz et.al.	2412.10248	link
2024-12-06	Spinal ligaments detection on vertebrae meshes using registration and 3D edge detection	Ivanna Kramer et.al.	2412.05081	null
2024-11-29	Simultaneous two-dimensional velocity and distance measurements based on laser triangulation	Hao Zhang et.al.	2411.19669	null
2024-11-27	Fall Leaf Adversarial Attack on Traffic Sign Classification	Anthony Etim et.al.	2411.18776	null
2024-11-22	Deep Learning-Based Automatic Delineation of Liver Domes in kV Triggered Images for Online Breath-hold Reproducibility Verification of Liver Stereotactic Body Radiation Therapy	Sugandima Weragoda et.al.	2411.15322	null
2024-12-24	Defective Edge Detection Using Cascaded Ensemble Canny Operator	Anjali Nambiyar Rajkumar Kannan et.al.	2411.14868	null
2024-11-21	Transforming Engineering Diagrams: A Novel Approach for P&ID Digitization using Transformers	Jan Marius Stürmer et.al.	2411.13929	null
2024-11-20	Edge-Detected 4DSTEM – effective low-dose diffraction data acquisition method for nanopowder samples in a SEM instrument	Nikita Denisov et.al.	2411.13265	null
2024-11-12	Well-posedness of a Variable-Exponent Telegraph Equation Applied to Image Despeckling	Sudeb Majee et.al.	2411.08175	null
2024-11-12	WavShadow: Wavelet Based Shadow Segmentation and Removal	Shreyans Jain et.al.	2411.05747	null
2024-11-06	Mapping reionization bubbles in the JWST era I: empirical edge detection with Lyman alpha emission from galaxies	Ting-Yi Lu et.al.	2411.04176	null
2024-11-04	Deep Learning for Leopard Individual Identification: An Adaptive Angular Margin Approach	David Colomer Matachana et.al.	2411.01962	link
2024-10-29	Assessment of Abrupt Shifts in CMIP6 Models using Edge Detection	Sjoerd Terpstra et.al.	2410.19498	null
2024-10-19	Cutting-Edge Detection of Fatigue in Drivers: A Comparative Study of Object Detection Models	Amelia Jones et.al.	2410.15030	null
2024-10-17	Co-Segmentation without any Pixel-level Supervision with Application to Large-Scale Sketch Classification	Nikolaos-Antonios Ypsilantis et.al.	2410.13582	null
2024-10-16	Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization	Nanda Febri Istighfarin et.al.	2410.12240	null
2024-10-13	Energy-Efficient and Fast Memristor-based Serial Multipliers Applicable in Image Processing	Seyed Erfan Fatemieh et.al.	2410.09953	null
2024-10-04	Generative Edge Detection with Stable Diffusion	Caixia Zhou et.al.	2410.03080	null
2024-11-07	Learning from Pattern Completion: Self-supervised Controllable Generation	Zhiqiang Chen et.al.	2409.18694	link
2024-09-26	Photon Inhibition for Energy-Efficient Single-Photon Imaging	Lucas J. Koerner et.al.	2409.18337	null
2024-09-26	EfficientCrackNet: A Lightweight Model for Crack Segmentation	Abid Hasan Zim et.al.	2409.18099	null
2024-09-24	Nonlinear Analog Processing with Anisotropic Nonlinear Films	Michele Cotrufo et.al.	2409.16448	null
2024-11-24	A new baseline for edge detection: Make Encoder-Decoder great again	Yachuan Li et.al.	2409.14976	link
2024-09-17	OmniGen: Unified Image Generation	Shitao Xiao et.al.	2409.11340	link
2024-09-17	Nonlocal phase-change metaoptics for reconfigurable nonvolatile image processing	Guoce Yang et.al.	2409.10976	null
2024-08-26	Automated Quantification of White Blood Cells in Light Microscopic Images of Injured Skeletal Muscle	Yang Jiao et.al.	2409.06722	null
2024-09-11	A Machine Learning Based Approach for Statistical Analysis of Detonation Cells from Soot Foils	Vansh Sharma et.al.	2409.06466	null
2024-09-10	Contour Analysis Tool: an interactive tool for background and morphology analysis	Mark A. Hutchison et.al.	2409.06421	null
2024-09-06	Cycle Pixel Difference Network for Crisp Edge Detection	Changsong Liu et.al.	2409.04272	null
2024-09-04	Image Registration with Averaging Network and Edge-Based Loss for Low-SNR Cardiac MRI	Xuan Lei et.al.	2409.02348	null
2024-09-03	EDCSSM: Edge Detection with Convolutional State Space Model	Qinghui Hong et.al.	2409.01609	null
2024-08-29	Android Malware Detection Based on RGB Images and Multi-feature Fusion	Zhiqiang Wang et.al.	2408.16555	null
2024-09-15	Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks	Sierra Bonilla et.al.	2408.16445	link
2024-08-28	Image Triangulation Using the Sobel Operator for Vertex Selection	Olivia Laske et.al.	2408.16112	null
2024-08-27	Optimizing Lung Cancer Detection in CT Imaging: A Wavelet Multi-Layer Perceptron (WMLP) Approach Enhanced by Dragonfly Algorithm (DA)	Bitasadat Jamshidi et.al.	2408.15355	null
2024-09-03	A Multiscale Gradient Fusion Method for Edge Detection in Color Images Utilizing the CBM3D Filter	Zhuoyue Wang et.al.	2408.14013	null
2024-08-20	EdgeNAT: Transformer for Efficient Edge Detection	Jinghuai Jie et.al.	2408.10527	link
2024-08-19	Edge detection imaging by quasi-bound states in the continuum	Tingting Liu et.al.	2408.10106	null
2024-08-08	UHNet: An Ultra-Lightweight and High-Speed Edge Detection Network	Fuzhang Li et.al.	2408.04258	null
2024-08-07	GUI Element Detection Using SOTA YOLO Deep Learning Models	Seyed Shayan Daneshvar et.al.	2408.03507	null
2024-07-19	How Homogenizing the Channel-wise Magnitude Can Enhance EEG Classification Model?	Huyen Ngo et.al.	2407.20247	null
2024-07-29	More precise edge detections	Hao Shu et.al.	2407.19992	link
2024-06-28	DCSM 2.0: Deep Conditional Shape Models for Data Efficient Segmentation	Athira J Jacob et.al.	2407.00186	null
2024-06-19	Advancements in Orthopaedic Arm Segmentation: A Comprehensive Review	Abhishek Swami et.al.	2406.13266	null
2024-06-14	Research on Edge Detection of LiDAR Images Based on Artificial Intelligence Technology	Haowei Yang et.al.	2406.09773	null
2024-06-14	An alternate approach for estimating grain-growth kinetics	Manoj Prabakar et.al.	2406.09653	link
2024-06-12	A New Class Biorthogonal Spline Wavelet for Image Edge Detection	Dujuan Zhou et.al.	2406.08285	null
2024-06-28	Learning to utilize image second-order derivative information for crisp edge detection	Changsong Liu et.al.	2406.05779	null
2024-06-04	RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting	Qi Wang et.al.	2406.02461	null
2024-06-02	An Optimized Toolbox for Advanced Image Processing with Tsetlin Machine Composites	Ylva Grønningsæter et.al.	2406.00704	link
2024-06-01	A Review of Pulse-Coupled Neural Network Applications in Computer Vision and Image Processing	Nurul Rafi et.al.	2406.00239	null
2024-05-28	Enhanced infrared vision by nonlinear up-conversion in nonlocal metasurfaces	Laura Valencia Molina et.al.	2405.17726	null
2024-04-02	Improving and Evaluating Machine Learning Methods for Forensic Shoeprint Matching	Divij Jain et.al.	2405.14878	null
2024-05-21	Automating Attendance Management in Human Resources: A Design Science Approach Using Computer Vision and Facial Recognition	Bao-Thien Nguyen-Tat et.al.	2405.12633	null
2024-05-19	The Effectiveness of Edge Detection Evaluation Metrics for Automated Coastline Detection	Conor O’Sullivan et.al.	2405.11498	link
2024-05-19	Automated Coastline Extraction Using Edge Detection Algorithms	Conor O’Sullivan et.al.	2405.11494	link
2024-05-18	Quantum Edge Detection	Santiago Llorens et.al.	2405.11373	null
2024-05-14	NAFRSSR: a Lightweight Recursive Network for Efficient Stereo Image Super-Resolution	Yihong Chen et.al.	2405.08423	link
2024-05-13	AnomalyLLM: Few-shot Anomaly Edge Detection for Dynamic Graphs using Large Language Models	Shuo Liu et.al.	2405.07626	link
2024-05-07	Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map	Yuxuan Xia et.al.	2405.04290	null
2024-05-06	Statistical Edge Detection And UDF Learning For Shape Representation	Virgile Foy et.al.	2405.03381	null
2024-04-14	Change Guiding Network: Incorporating Change Prior to Guide Change Detection in Remote Sensing Imagery	Chengxi Han et.al.	2404.09179	link
2024-04-10	Edge Detection Quantumized: A Novel Quantum Algorithm For Image Processing	Syed Emad Uddin Shubha et.al.	2404.06889	null
2024-06-01	Leveraging edge detection and neural networks for better UAV localization	Theo Di Piazza et.al.	2404.06207	link
2024-04-07	Msmsfnet: a multi-stream and multi-scale fusion net for edge detection	Chenguang Liu et.al.	2404.04856	null
2024-03-30	The Devil is in the Edges: Monocular Depth Estimation with Edge-aware Consistency Fusion	Pengzhi Li et.al.	2404.00373	null
2024-03-30	Radio Frequency Interference Detection Using Efficient Multi-Scale Convolutional Attention UNet	Fei Gu et.al.	2404.00277	null
2024-03-28	Learning Multiple Representations with Inconsistency-Guided Detail Regularization for Mask-Guided Matting	Weihao Jiang et.al.	2403.19213	null
2024-03-27	Colour and Brush Stroke Pattern Recognition in Abstract Art using Modified Deep Convolutional Generative Adversarial Networks	Srinitish Srinivasan et.al.	2403.18397	link
2024-03-23	An edge detection-based deep learning approach for tear meniscus height measurement	Kesheng Wang et.al.	2403.15853	null
2024-03-18	Logistic regression to boost exoplanet detection performances	Hadrien Cambazard et.al.	2403.11571	null
2024-03-17	Advanced Knowledge Extraction of Physical Design Drawings, Translation and conversion to CAD formats using Deep Learning	Jesher Joshua M et.al.	2403.11291	null
2024-03-16	Texture Edge detection by Patch consensus (TEP)	Guangyu Cui et.al.	2403.11038	null
2024-03-14	Temporal Signal Processing with Nonlocal Optical Metasurfaces	Michele Cotrufo et.al.	2403.09087	null
2024-03-13	RAF-GI: Towards Robust, Accurate and Fast-Convergent Gradient Inversion Attack in Federated Learning	Can Liu et.al.	2403.08383	link
2024-03-13	MGIC: A Multi-Label Gradient Inversion Attack based on Canny Edge Detection on Federated Learning	Can Liu et.al.	2403.08284	null
2024-03-07	RankED: Addressing Imbalance and Uncertainty in Edge Detection Using Ranking-based Losses	Bedrettin Cetinkaya et.al.	2403.01795	link
2024-03-03	CDSE-UNet: Enhancing COVID-19 CT Image Segmentation with Canny Edge Detection and Dual-Path SENet Feature Fusion	Jiao Ding et.al.	2403.01513	null
2024-02-28	On the Accuracy of Edge Detectors in Number Plate Extraction	Bashir Olaniyi Sadiq et.al.	2402.18251	null
2024-03-20	Lightweight, error-tolerant edge detection using memristor-enabled stochastic logics	Lekai Song et.al.	2402.16908	null
2024-02-22	SHM-Traffic: DRL and Transfer learning based UAV Control for Structural Health Monitoring of Bridges with Traffic	Divija Swetha Gadiraju et.al.	2402.14757	null
2024-02-18	Near-infrared metalens empowered dual-mode high resolution and large FOV microscope	Chuang Sun et.al.	2402.11554	null
2024-02-07	Color Recognition in Challenging Lighting Environments: CNN Approach	Nizamuddin Maitlo et.al.	2402.04762	null
2024-02-01	Lightweight Pixel Difference Networks for Efficient Visual Representation Learning	Zhuo Su et.al.	2402.00422	link
2024-01-27	Applications of Tao General Difference in Discrete Domain	Linmi Tao et.al.	2401.15287	null
2024-01-18	False Discovery Rate Control for Gaussian Graphical Models via Neighborhood Screening	Taulant Koka et.al.	2401.09979	null
2024-01-14	Photonic real time video image signal processor at 17Tb/s based on a Kerr microcomb	Mengxi Tan et.al.	2401.07197	null
2024-01-12	Space-Time Nonlocal Metasurfaces for Event-Based Image Processing	Sedigheh Esfahani et.al.	2401.06586	null
2024-01-07	Real-Time Asphalt Pavement Layer Thickness Prediction Using Ground-Penetrating Radar Based on a Modified Extended Common Mid-Point (XCMP) Approach	Siqi Wang et.al.	2401.03375	null
2024-01-05	Systematic review of image segmentation using complex networks	Amin Rezaei et.al.	2401.02758	null
2024-01-04	SuperEdge: Towards a Generalization Model for Self-Supervised Edge Detection	Leng Kai et.al.	2401.02313	link
2024-01-09	DiffusionEdge: Diffusion Probabilistic Model for Crisp Edge Detection	Yunfan Ye et.al.	2401.02032	link
2023-12-21	Dual Attention U-Net with Feature Infusion: Pushing the Boundaries of Multiclass Defect Segmentation	Rasha Alshawi et.al.	2312.14053	link
2023-12-14	Automated Grain Boundary Detection for Bright-Field Transmission Electron Microscopy Images via U-Net	Matthew J. Patrick et.al.	2312.09392	null
2023-12-10	Polar Linear Canonical Wavelet Transform: Theory and Its Application	Hui Zhao et.al.	2312.06702	null
2023-12-09	A fast numerical algorithm for finding all real solutions to a system of N nonlinear equations in a finite domain	Fernando Chueca-Diez et.al.	2312.03927	null
2023-12-04	Cable Slack Detection for Arresting Gear Application using Machine Vision	Ari Goodman et.al.	2312.02320	null
2023-12-03	Meta ControlNet: Enhancing Task Adaptation via Meta Learning	Junjie Yang et.al.	2312.01255	link
2023-10-28	Vision-Based Incoming Traffic Estimator Using Deep Neural Network on General Purpose Embedded Hardware	K. G. Zoysa et.al.	2311.16125	null
2023-11-27	DiffSLVA: Harnessing Diffusion Models for Sign Language Video Anonymization	Zhaoyang Xia et.al.	2311.16060	link
2023-11-22	Reconfigurable Image Processing Metasurfaces with Phase-Change Materials	Michele Cotrufo et.al.	2311.13109	null
2023-11-21	Unveiling the cosmic dawn and epoch of reionization using cosmic 21-cm signal	Ankita Bera et.al.	2311.13019	null
2023-11-16	Depth Insight – Contribution of Different Features to Indoor Single-image Depth Estimation	Yihong Wu et.al.	2311.10042	null
2023-11-14	RoboSense At Edge: Detecting Slip, Crumple and Shape of the Object in Robotic Hand for Teleoperations	Sudev Kumar Padhi et.al.	2311.07888	null
2023-10-28	Tracking and fast imaging of a translational object via Fourier modulation	Shijian Li et.al.	2310.18732	null
2024-01-09	FaultSeg Swin-UNETR: Transformer-Based Self-Supervised Pretraining Model for Fault Recognition	Zeren Zhang et.al.	2310.17974	null
2023-11-08	Constraining exotic dark matter models with the dark ages 21-cm signal	Rajesh Mondal et.al.	2310.15530	null
2023-10-22	Research on Key Technologies of Infrastructure Digitalization based on Multimodal Spatial Data	Zhanyuan Tian et.al.	2310.14296	null
2023-10-01	Quantum image edge detection based on eight-direction Sobel operator for NEQR	Wenjie Liu et.al.	2310.03037	null
2023-09-26	3D Density-Gradient based Edge Detection on Neural Radiance Fields (NeRFs) for Geometric Reconstruction	Miriam Jäger et.al.	2309.14800	null
2023-09-13	Temporal compressive edge imaging enabled by a lensless diffuser camera	Ze Zheng et.al.	2309.07198	null
2023-11-05	MEGANet: Multi-Scale Edge-Guided Attention Network for Weak Boundary Polyp Segmentation	Nhat-Tan Bui et.al.	2309.03329	link
2023-09-05	DeNISE: Deep Networks for Improved Segmentation Edges	Sander Riisøen Jyhne et.al.	2309.02091	null
2023-08-29	A Pseudo-Boolean Polynomials Approach for Image Edge Detection	Tendai Mapungwana Chikake et.al.	2308.15557	link
2023-08-29	Pseudo-Boolean Polynomials Approach To Edge Detection And Image Segmentation	Tendai Mapungwana Chikake et.al.	2308.15453	null
2023-08-27	Practical Edge Detection via Robust Collaborative Learning	Yuanbin Fu et.al.	2308.14084	link
2023-11-18	Zero-Shot Edge Detection with SCESAME: Spectral Clustering-based Ensemble for Segment Anything Model Estimation	Hiroaki Yamagiwa et.al.	2308.13779	link
2023-08-19	R-C-P Method: An Autonomous Volume Calculation Method Using Image Processing and Machine Vision	MA Muktadir et.al.	2308.10058	null
2023-08-19	TSAR-MVS: Textureless-aware Segmentation and Correlative Refinement Guided Multi-View Stereo	Zhenlong Yuan et.al.	2308.09990	null
2023-08-12	The Color Clifford Hardy Signal: Application to Color Edge Detection and Optical Flow	Xiaoxiao Hu et.al.	2308.06485	null
2023-08-12	Tiny and Efficient Model for the Edge Detection Generalization	Xavier Soria et.al.	2308.06468	link
2023-08-05	Electromagnetic Spatiotemporal Differentiators	Yi Zhou et.al.	2308.03797	null
2023-08-06	ECT: Fine-grained Edge Detection with Learned Cause Tokens	Shaocong Xu et.al.	2308.03092	link
2023-08-08	Generation of Realistic Synthetic Raw Radar Data for Automated Driving Applications using Generative Adversarial Networks	Eduardo C. Fidelis et.al.	2308.02632	link
2023-08-23	MSECNet: Accurate and Robust Normal Estimation for 3D Point Clouds by Multi-Scale Edge Conditioning	Haoyi Xiu et.al.	2308.02237	link
2023-07-31	Multispectral Image Segmentation in Agriculture: A Comprehensive Study on Fusion Approaches	Nuno Cunha et.al.	2308.00159	link
2023-07-31	Hybrid quantum transfer learning for crack image classification on NISQ hardware	Alexander Geng et.al.	2307.16723	null
2023-10-16	PNT-Edge: Towards Robust Edge Detection with Noisy Labels by Learning Pixel-level Noise Transitions	Wenjie Xuan et.al.	2307.14070	link
2023-07-20	Integrated Photonic Fractional Convolution Accelerator	Kevin Zelaya et.al.	2307.10976	null
2023-07-11	Compact Twice Fusion Network for Edge Detection	Yachuan Li et.al.	2307.04952	link
2023-07-08	Edge-Aware Mirror Network for Camouflaged Object Detection	Dongyue Sun et.al.	2307.03932	link
2023-07-08	On a cylindrical scanning modality in three-dimensional Compton scatter tomography	James W. Webber et.al.	2307.03896	null
2023-07-07	Polarization Imaging and Edge Detection with Image-Processing Metasurfaces	Michele Cotrufo et.al.	2307.03548	null
2023-07-07	A Deep Active Contour Model for Delineating Glacier Calving Fronts	Konrad Heidler et.al.	2307.03461	null
2023-06-29	Pupil-driven quantitative differential phase contrast imaging	Shuhe Zhang et.al.	2306.17088	null
2023-06-27	Delving into Crispness: Guided Label Refinement for Crisp Edge Detection	Yunfan Ye et.al.	2306.15172	link
2023-06-26	Integrated lithium niobate microwave photonic processing engine	Hanke Feng et.al.	2306.14415	null
2023-06-22	XAI-TRIS: Non-linear benchmarks to quantify ML explanation performance	Benedict Clark et.al.	2306.12816	link
2023-07-03	A Survey on Segment Anything Model (SAM): Vision Foundation Model Meets Prompt Engineering	Chaoning Zhang et.al.	2306.06211	null
2023-06-03	Hierarchical Multiresolution Feature- and Prior-based Graphs for Classification	Faezeh Fallah et.al.	2306.02143	null
2023-05-31	SPAC-Net: Synthetic Pose-aware Animal ControlNet for Enhanced Pose Estimation	Le Jiang et.al.	2305.17845	link
2023-05-16	A Geometric Calibration of the Tip of the Red Giant Branch in the Milky Way using Gaia DR3	M. Dixon et.al.	2305.09215	null
2023-05-12	Vision and Control for Grasping Clear Plastic Bags	Joohwan Seo et.al.	2305.07631	link
2023-07-28	Edge-Enhanced Microscopy of Complex Object using Scalar and Vectorial Vortex Filtering	Jigme Zangpo et.al.	2305.07225	null
2023-05-10	Novel Quantum Information Processing Methods and Investigation	Zhang Ze Yu et.al.	2305.05953	null
2023-05-10	Low-Light Image Enhancement via Structure Modeling and Guidance	Xiaogang Xu et.al.	2305.05839	link
2023-04-30	Multi-directional Sobel operator kernel on GPUs	Qiong Chang et.al.	2305.00515	null
2023-04-30	Continuous motion of an electrically actuated water droplet over a PDMS-coated surface	Supriya Upadhyay et.al.	2305.00420	null
2023-04-13	CATS: The Hubble Constant from Standardized TRGB and Type Ia Supernova Measurements	D. Scolnic et.al.	2304.06693	null
2023-04-10	Reconstruction-driven Dynamic Refinement based Unsupervised Domain Adaptation for Joint Optic Disc and Cup Segmentation	Ziyang Chen et.al.	2304.04581	null
2023-03-28	Vision based UAV Navigation through Narrow Passages	Jayakant Kumar et.al.	2303.15803	null
2023-03-21	The Treasure Beneath Multiple Annotations: An Uncertainty-aware Edge Detector	Caixia Zhou et.al.	2303.11828	link
2023-03-15	PENet: A Joint Panoptic Edge Detection Network	Yang Zhou et.al.	2303.08848	link
2023-05-08	SILOP: An Automated Framework for Semantic Segmentation Using Image Labels Based on Object Perimeters	Erik Ostrowski et.al.	2303.07892	link
2023-03-16	NEF: Neural Edge Fields for 3D Parametric Curve Reconstruction from Multi-view Images	Yunfan Ye et.al.	2303.07653	link
2023-03-10	Automatic Detection and Rectification of Paper Receipts on Smartphones	Edward Whittaker et.al.	2303.05763	null
2023-03-09	When Optical Microscopy Meets All-Optical Analog Computing: A Brief Review	Yichang Shou et.al.	2303.04988	null
2023-03-06	Optimal Periodic Control of Unmanned Aerial Vehicles Based on Fourier Integral Pseudospectral and Edge-Detection Methods	Kareem T. Elgindy et.al.	2303.02969	null
2023-03-02	Scalable optical neural networks based on temporal computing	Shuang Zheng et.al.	2303.01287	null
2023-03-26	Attention-based Point Cloud Edge Sampling	Chengzhi Wu et.al.	2302.14673	link

transfer learning

Publish Date	Title	Authors	PDF	Code
2025-07-21	Sufficiency-principled Transfer Learning via Model Averaging	Xiyuan Zhang et.al.	2507.15416	null
2025-07-21	Universal crystal material property prediction via multi-view geometric fusion in graph transformers	Liang Zhang et.al.	2507.15303	null
2025-07-20	A Case Against Implicit Standards: Homophone Normalization in Machine Translation for Languages that use the Ge’ez Script	Hellina Hailu Nigatu et.al.	2507.15142	null
2025-07-20	Omni-Think: Scaling Cross-Domain Generalization in LLMs via Multi-Task RL with Hybrid Rewards	Derek Li et.al.	2507.14783	null
2025-07-19	Rethinking Suicidal Ideation Detection: A Trustworthy Annotation Framework and Cross-Lingual Model Evaluation	Amina Dzafic et.al.	2507.14693	null
2025-07-19	Depthwise-Dilated Convolutional Adapters for Medical Object Tracking and Segmentation Using the Segment Anything Model 2	Guoping Xu et.al.	2507.14613	null
2025-07-19	Towards a Proactive Autoscaling Framework for Data Stream Processing at the Edge using GRU and Transfer Learning	Eugene Armah et.al.	2507.14597	null
2025-07-19	Parameter-transfer in spatial autoregressive models via model averaging	Fen Jiang et.al.	2507.14453	null
2025-07-19	IRGPT: Understanding Real-world Infrared Image with Bi-cross-modal Curriculum on Large-scale Benchmark	Zhe Cao et.al.	2507.14449	null
2025-07-18	Language Models as Ontology Encoders	Hui Yang et.al.	2507.14334	null
2025-07-17	Disentangling coincident cell events using deep transfer learning and compressive sensing	Moritz Leuthner et.al.	2507.13176	null
2025-07-17	Improving Diagnostic Accuracy of Pigmented Skin Lesions With CNNs: an Application on the DermaMNIST Dataset	Nerma Kadric et.al.	2507.12961	null
2025-07-17	IDS-Net: A novel framework for few-shot photovoltaic power prediction with interpretable dynamic selection and feature information fusion	Hang Fan et.al.	2507.12745	null
2025-07-16	Improving physics-informed neural network extrapolation via transfer learning and adaptive activation functions	Athanasios Papastathopoulos-Katsaros et.al.	2507.12659	null
2025-07-16	Best Practices for Large-Scale, Pixel-Wise Crop Mapping and Transfer Learning Workflows	Judy Long et.al.	2507.12590	null
2025-07-14	Quantum Transfer Learning to Boost Dementia Detection	Sounak Bhowmik et.al.	2507.12485	null
2025-07-16	Hybrid Ensemble Approaches: Optimal Deep Feature Fusion and Hyperparameter-Tuned Classifier Ensembling for Enhanced Brain Tumor Classification	Zahid Ullah et.al.	2507.12177	null
2025-07-15	Two intersecting radio shells: relics of galaxy merger shocks ?	Bärbel S. Koribalski et.al.	2507.11781	null
2025-07-15	Physics-Informed Transfer Learning for Data-Driven Sound Source Reconstruction in Near-Field Acoustic Holography	Xinmeng Luan et.al.	2507.11070	null
2025-07-14	HEIMDALL: a grapH-based sEIsMic Detector And Locator for microseismicity	Matteo Bagagli et.al.	2507.10850	null
2025-07-20	Supporting SENCOTEN Language Documentation Efforts with Automatic Speech Recognition	Mengzhe Geng et.al.	2507.10827	null
2025-07-14	National level satellite-based crop field inventories in smallholder landscapes	Philippe Rufin et.al.	2507.10499	null
2025-07-14	A Transfer Learning-Based Method for Water Body Segmentation in Remote Sensing Imagery: A Case Study of the Zhada Tulin Area	Haonan Chen et.al.	2507.10084	null
2025-07-14	Leveraging Swin Transformer for enhanced diagnosis of Alzheimer’s disease using multi-shell diffusion MRI	Quentin Dessain et.al.	2507.09996	null
2025-07-13	Low-Rank Adaptation of Deep Prior Neural Networks For Room Impulse Response Reconstruction	Mirco Pezzoli et.al.	2507.09806	null
2025-07-13	Pre-trained Under Noise: A Framework for Robust Bone Fracture Detection in Medical Imaging	Robby Hoover et.al.	2507.09731	null
2025-07-13	Hybrid Quantum-Classical Generative Adversarial Networks with Transfer Learning	Asma Al-Othni et.al.	2507.09706	null
2025-07-13	Enhancing ALS Progression Tracking with Semi-Supervised ALSFRS-R Scores Estimated from Ambient Home Health Monitoring	Noah Marchal et.al.	2507.09460	null
2025-07-12	Calibrated and Robust Foundation Models for Vision-Language and Medical Image Tasks Under Distribution Shift	Behraj Khan et.al.	2507.09222	null
2025-07-12	CycleGAN-Driven Transfer Learning for Electronics Response Emulation in High-Purity Germanium Detectors	Kevin Bhimani et.al.	2507.09106	null
2025-07-11	The Bayesian Approach to Continual Learning: An Overview	Tameem Adel et.al.	2507.08922	null
2025-07-15	Dually Hierarchical Drift Adaptation for Online Configuration Performance Learning	Zezhen Xiang et.al.	2507.08730	null
2025-07-11	MM-Gesture: Towards Precise Micro-Gesture Recognition through Multimodal Fusion	Jihao Gu et.al.	2507.08344	null
2025-07-11	Transfer Learning and Mixup for Fine-Grained Few-Shot Fungi Classification	Jason Kahei Tam et.al.	2507.08248	null
2025-07-10	An Embedded Real-time Object Alert System for Visually Impaired: A Monocular Depth Estimation based Approach through Computer Vision	Jareen Anjom et.al.	2507.08165	null
2025-07-10	An Object-Based Deep Learning Approach for Building Height Estimation from Single SAR Images	Babak Memar et.al.	2507.08096	null
2025-07-10	BEAVER: Building Environments with Assessable Variation for Evaluating Multi-Objective Reinforcement Learning	Ruohong Liu et.al.	2507.07769	null
2025-07-09	Deep Brain Net: An Optimized Deep Learning Model for Brain tumor Detection in MRI Images Using EfficientNetB0 and ResNet50 with Transfer Learning	Daniel Onah et.al.	2507.07011	null
2025-07-08	DS@GT at CheckThat! 2025: Detecting Subjectivity via Transfer-Learning and Corrective Data Augmentation	Maximilian Heil et.al.	2507.06189	null
2025-07-09	A Survey on Prompt Tuning	Zongqian Li et.al.	2507.06085	null
2025-07-08	Contrastive and Transfer Learning for Effective Audio Fingerprinting through a Real-World Evaluation Protocol	Christos Nikou et.al.	2507.06070	null
2025-07-08	PSAT: Pediatric Segmentation Approaches via Adult Augmentations and Transfer Learning	Tristan Kirscher et.al.	2507.05764	null
2025-07-07	Predicting mutational effects on protein binding from folding energy	Arthur Deng et.al.	2507.05502	null
2025-07-07	Conditional Graph Neural Network for Predicting Soft Tissue Deformation and Forces	Madina Kojanazarova et.al.	2507.05315	null
2025-07-07	HGNet: High-Order Spatial Awareness Hypergraph and Multi-Scale Context Attention Network for Colorectal Polyp Detection	Xiaofang Liu et.al.	2507.04880	null
2025-07-07	Towards Human-in-the-Loop Onset Detection: A Transfer Learning Approach for Maracatu	António Sá Pinto et.al.	2507.04858	null
2025-07-07	Model Compression using Progressive Channel Pruning	Jinyang Guo et.al.	2507.04792	null
2025-07-06	Transfer Learning in Infinite Width Feature Learning Networks	Clarissa Lauditi et.al.	2507.04448	null
2025-07-06	Mixed-Sample SGD: an End-to-end Analysis of Supervised Transfer Learning	Yuyang Deng et.al.	2507.04194	null
2025-07-05	When Data-Free Knowledge Distillation Meets Non-Transferable Teacher: Escaping Out-of-Distribution Trap is All You Need	Ziming Hong et.al.	2507.04119	null
2025-07-05	Generate, Refine, and Encode: Leveraging Synthesized Novel Samples for On-the-Fly Fine-Grained Category Discovery	Xiao Liu et.al.	2507.04051	null
2025-07-04	ChestGPT: Integrating Large Language Models and Vision Transformers for Disease Detection and Localization in Chest X-Rays	Shehroz S. Khan et.al.	2507.03739	null
2025-07-04	SciVid: Cross-Domain Evaluation of Video Models in Scientific Applications	Yana Hasson et.al.	2507.03578	null
2025-07-03	Understanding Knowledge Transferability for Transfer Learning: A Survey	Haohua Wang et.al.	2507.03175	null
2025-07-02	Generalized Adaptive Transfer Network: Enhancing Transfer Learning in Reinforcement Learning Across Domains	Abhishek Verma et.al.	2507.03026	null
2025-07-03	Wildlife Target Re-Identification Using Self-supervised Learning in Non-Urban Settings	Mufhumudzi Muthivhi et.al.	2507.02403	null
2025-07-03	Transfer Learning for Matrix Completion	Dali Liu et.al.	2507.02248	null
2025-07-03	Domain-Adversarial Transfer Learning for Fault Root Cause Identification in Cloud Computing Systems	Bruce Fang et.al.	2507.02233	null
2025-07-02	Transfer Learning for VLC-based indoor Localization: Addressing Environmental Variability	Masood Jan et.al.	2507.01575	null
2025-07-02	How Weight Resampling and Optimizers Shape the Dynamics of Continual Learning and Forgetting in Neural Networks	Lapo Frati et.al.	2507.01559	null
2025-07-02	Automated Classification of Volcanic Earthquakes Using Transformer Encoders: Insights into Data Quality and Model Interpretability	Y. Suzuki et.al.	2507.01260	null
2025-07-01	Phase Transition in Nonparametric Minimax Rates for Covariate Shifts on Approximate Manifolds	Yuyao Wang et.al.	2507.00889	null
2025-06-30	Natural language processing for African languages	David Ifeoluwa Adelani et.al.	2507.00297	null
2025-06-28	An efficient plant disease detection using transfer learning approach	Bosubabu Sambana et.al.	2507.00070	null
2025-06-30	CoMMiT: Co-informed inference of microbiome-metabolome interactions via transfer learning	Leiyue Li et.al.	2506.24013	null
2025-06-30	Pruning by Block Benefit: Exploring the Properties of Vision Transformer Blocks during Domain Adaptation	Patrick Glandorf et.al.	2506.23675	null
2025-06-30	AI-Generated Lecture Slides for Improving Slide Element Detection and Retrieval	Suyash Maniyar et.al.	2506.23605	null
2025-06-29	FedRef: Communication-Efficient Bayesian Fine Tuning with Reference Model	Taehwan Yoon et.al.	2506.23210	null
2025-06-29	Self-Supervised Contrastive Learning for Multi-Label Images	Jiale Chen et.al.	2506.23156	null
2025-06-28	Towards Time Series Generation Conditioned on Unstructured Natural Language	Jaeyun Woo et.al.	2506.22927	null
2025-06-28	ReasonBridge: Efficient Reasoning Transfer from Closed to Open-Source Language Models	Ziqi Zhong et.al.	2506.22865	null
2025-06-27	Are Fast Methods Stable in Adversarially Robust Transfer Learning?	Joshua C. Zhao et.al.	2506.22602	null
2025-06-25	How Can Multimodal Remote Sensing Datasets Transform Classification via SpatialNet-ViT?	Gautam Siddharth Kashyap et.al.	2506.22501	null
2025-06-27	Multi-View Contrastive Learning for Robust Domain Adaptation in Medical Time Series Analysis	YongKyung Oh et.al.	2506.22393	null
2025-06-27	Transfer Learning for Assessing Heavy Metal Pollution in Seaports Sediments	Tin Lai et.al.	2506.22096	null
2025-06-27	Visual Content Detection in Educational Videos with Transfer Learning and Dataset Enrichment	Dipayan Biswas et.al.	2506.21903	null
2025-06-26	Offensive Language Detection on Social Media Using XLNet	Reem Alothman et.al.	2506.21795	null
2025-06-26	Benchmarking Deep Learning and Vision Foundation Models for Atypical vs. Normal Mitosis Classification with Cross-Dataset Evaluation	Sweta Banerjee et.al.	2506.21444	null
2025-06-25	Brain2Model Transfer: Training sensory and decision models with human neural activity as a teacher	Tomas Gallo Aquino et.al.	2506.20834	null
2025-06-25	Physics-Informed Machine Learning Regulated by Finite Element Analysis for Simulation Acceleration of Laser Powder Bed Fusion	R. Sharma et.al.	2506.20537	null
2025-06-25	Comparative Analysis of Deep Learning Models for Crop Disease Detection: A Transfer Learning Approach	Saundarya Subramaniam et.al.	2506.20323	null
2025-06-25	FundaQ-8: A Clinically-Inspired Scoring Framework for Automated Fundus Image Quality Assessment	Lee Qi Zun et.al.	2506.20303	null
2025-06-24	General Methods Make Great Domain-specific Foundation Models: A Case-study on Fetal Ultrasound	Jakob Ambsdorf et.al.	2506.19552	null
2025-06-24	From High-SNR Radar Signal to ECG: A Transfer Learning Model with Cardio-Focusing Algorithm for Scenarios with Limited Data	Yuanyuan Zhang et.al.	2506.19358	null
2025-06-23	Focus Your Attention: Towards Data-Intuitive Lightweight Vision Transformers	Suyash Gaurav et.al.	2506.18791	null
2025-06-23	Leveraging Transfer Learning to Overcome Data Limitations in Czochralski Crystal Growth	Milena Petkovic et.al.	2506.18774	null
2025-06-23	Benchmarking histopathology foundation models in a multi-center dataset for skin cancer subtyping	Pablo Meseguer et.al.	2506.18668	null
2025-06-23	When Fine-Tuning Fails: Lessons from MS MARCO Passage Ranking	Manu Pande et.al.	2506.18535	null
2025-06-23	Generalizing Vision-Language Models to Novel Domains: A Comprehensive Survey	Xinyao Li et.al.	2506.18504	null
2025-06-23	Leveraging neural network interatomic potentials for a foundation model of chemistry	So Yeon Kim et.al.	2506.18497	null
2025-06-26	These Are Not All the Features You Are Looking For: A Fundamental Bottleneck in Supervised Pretraining	Xingyu Alice Yang et.al.	2506.18221	null
2025-06-22	Deep Supervised LSTM for 3D morphology estimation from Multi-View RGB Images of Wheat Spikes	Olivia Zumsteg et.al.	2506.18060	null
2025-06-22	Classification of Tents in Street Bazaars Using CNN	Azamat Ibragimov et.al.	2506.17946	null
2025-06-21	Rethinking the Role of Operating Conditions for Learning-based Multi-condition Fault Diagnosis	Pengyu Han et.al.	2506.17740	null
2025-06-21	Numerical simulation of transient heat conduction with moving heat source using Physics Informed Neural Networks	Anirudh Kalyan et.al.	2506.17726	null
2025-06-21	Unveiling Factors for Enhanced POS Tagging: A Study of Low-Resource Medieval Romance Languages	Matthias Schöffel et.al.	2506.17715	null
2025-06-20	Trustworthy Few-Shot Transfer of Medical VLMs through Split Conformal Prediction	Julio Silva-Rodríguez et.al.	2506.17503	null
2025-06-19	Energy-Based Transfer for Reinforcement Learning	Zeyun Deng et.al.	2506.16590	null
2025-06-17	Large Language Models – the Future of Fundamental Physics?	Caroline Heneka et.al.	2506.14757	null
2025-06-17	DiFuse-Net: RGB and Dual-Pixel Depth Estimation using Window Bi-directional Parallax Attention and Cross-modal Transfer Learning	Kunal Swami et.al.	2506.14709	null
2025-06-17	Bayesian Knowledge Transfer for a Kalman Fixed-Lag Interval Smoother	Ondřej Skalský et.al.	2506.14572	null
2025-06-17	Adjustment for Confounding using Pre-Trained Representations	Rickmer Schulte et.al.	2506.14329	link
2025-06-17	Less is More: Undertraining Experts Improves Model Upcycling	Stefan Horoi et.al.	2506.14126	null
2025-06-17	Leveraging Transfer Learning and User-Specific Updates for Rapid Training of BCI Decoders	Ziheng Chen et.al.	2506.14120	null
2025-06-16	Understand the Implication: Learning to Think for Pragmatic Understanding	Settaluri Lakshmi Sravanthi et.al.	2506.13559	null
2025-06-16	Advancing Image-Based Grapevine Variety Classification with a New Benchmark and Evaluation of Masked Autoencoders	Gabriel A. Carneiro et.al.	2506.13335	null
2025-06-16	Evolution of ReID: From Early Methods to LLM Integration	Amran Bhuiyan et.al.	2506.13039	null
2025-06-16	Geometric Embedding Alignment via Curvature Matching in Transfer Learning	Sung Moon Ko et.al.	2506.13015	null
2025-06-14	Konooz: Multi-domain Multi-dialect Corpus for Named Entity Recognition	Nagham Hamad et.al.	2506.12615	null
2025-06-14	A Transfer Learning Framework for Multilayer Networks via Model Averaging	Yongqin Qiu et.al.	2506.12455	null
2025-06-14	Hierarchical Deep Feature Fusion and Ensemble Learning for Enhanced Brain Tumor MRI Classification	Zahid Ullah et.al.	2506.12363	null
2025-06-13	Interpretable Classification of Levantine Ceramic Thin Sections via Neural Networks	Sara Capriotti et.al.	2506.12250	null
2025-06-13	Coefficient Shape Transfer Learning for Functional Linear Regression	Shuhao Jiao et.al.	2506.11367	null
2025-06-12	Many-Body Neural Network Wavefunction for a Non-Hermitian Ising Chain	Lavoisier Wah et.al.	2506.11222	null
2025-06-12	PromptTSS: A Prompting-Based Approach for Interactive Multi-Granularity Time Series Segmentation	Ching Chang et.al.	2506.11170	null
2025-06-12	Instance-Based Transfer Learning with Similarity-Aware Subject Selection for Cross-Subject SSVEP-Based BCIs	Ziwen Wang et.al.	2506.10933	null
2025-06-12	Efficient nanophotonic devices optimization using deep neural network trained with physics-based transfer learning (PBTL) methodology	Gibaek Kim et.al.	2506.10418	null
2025-06-12	Uncertainty-Aware Deep Learning for Automated Skin Cancer Classification: A Comprehensive Evaluation	Hamzeh Asgharnezhad et.al.	2506.10302	null
2025-06-11	Going beyond density functional theory accuracy: Leveraging experimental data to refine pre-trained machine learning interatomic potentials	Shriya Gumber et.al.	2506.10211	null
2025-06-11	Attention on flow control: transformer-based reinforcement learning for lift regulation in highly disturbed flows	Zhecheng Liu et.al.	2506.10153	null
2025-06-11	Auto-Compressing Networks	Vaggelis Dorovatas et.al.	2506.09714	null
2025-06-11	An Effective End-to-End Solution for Multimodal Action Recognition	Songping Wang et.al.	2506.09345	null
2025-06-10	An Explainable Deep Learning Framework for Brain Stroke and Tumor Progression via MRI Interpretation	Rajan Das Gupta et.al.	2506.09161	null
2025-06-07	Exploring Image Transforms derived from Eye Gaze Variables for Progressive Autism Diagnosis	Abigail Copiaco et.al.	2506.09065	null
2025-06-11	Do Multiple Instance Learning Models Transfer?	Daniel Shao et.al.	2506.09022	link
2025-06-10	Data-Efficient Challenges in Visual Inductive Priors: A Retrospective	Robert-Jan Bruintjes et.al.	2506.08612	null
2025-06-10	Robust Evolutionary Multi-Objective Network Architecture Search for Reinforcement Learning (EMNAS-RL)	Nihal Acharya Adde et.al.	2506.08533	null
2025-06-10	Discovery of Odd Radio Circles and Other Peculiars in the First Year of the EMU Survey using Object Detection	Nikhel Gupta et.al.	2506.08439	null
2025-06-09	CrosswalkNet: An Optimized Deep Learning Framework for Pedestrian Crosswalk Detection in Aerial Images with High-Performance Computing	Zubin Bhuyan et.al.	2506.07885	null
2025-06-09	The Catechol Benchmark: Time-series Solvent Selection Data for Few-shot Machine Learning	Toby Boyne et.al.	2506.07619	link
2025-06-09	Flowing Datasets with Wasserstein over Wasserstein Gradient Flows	Clément Bonet et.al.	2506.07534	link
2025-06-09	Variational Supervised Contrastive Learning	Ziwen Wang et.al.	2506.07413	null
2025-06-08	Transfer Learning and Explainable AI for Brain Tumor Classification: A Study Using MRI Data from Bangladesh	Shuvashis Sarker et.al.	2506.07228	null
2025-06-08	State Entropy Regularization for Robust Reinforcement Learning	Uri Koren et.al.	2506.07085	null
2025-06-07	Exploring Visual Prompting: Robustness Inheritance and Beyond	Qi Li et.al.	2506.06823	null
2025-06-06	Textile Analysis for Recycling Automation using Transfer Learning and Zero-Shot Foundation Models	Yannis Spyridis et.al.	2506.06569	null
2025-06-03	CR-BLEA: Contrastive Ranking for Adaptive Resource Allocation in Bilevel Evolutionary Algorithms	Dejun Xu et.al.	2506.06362	null
2025-06-06	Full Conformal Adaptation of Medical Vision-Language Models	Julio Silva-Rodríguez et.al.	2506.06076	null
2025-06-05	DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning	Tanmay Parekh et.al.	2506.05128	null
2025-06-05	GEX: Democratizing Dexterity with Fully-Actuated Dexterous Hand and Exoskeleton Glove	Yunlong Dong et.al.	2506.04982	link
2025-06-05	Scaling Laws for Robust Comparison of Open Foundation Language-Vision Models and Datasets	Marianna Nezhurina et.al.	2506.04598	link
2025-06-05	OpenAg: Democratizing Agricultural Intelligence	Srikanth Thudumu et.al.	2506.04571	null
2025-06-04	Neurosymbolic Artificial Intelligence for Robust Network Intrusion Detection: From Scratch to Transfer Learning	Huynh T. T. Tran et.al.	2506.04454	null
2025-06-08	Beamforming and Resource Allocation for Delay Optimization in RIS-Assisted OFDM Systems	Yu Ma et.al.	2506.03586	null
2025-06-03	Culture Matters in Toxic Language Detection in Persian	Zahra Bokaei et.al.	2506.03458	null
2025-06-06	StARS DCM: A Sleep Stage-Decoding Forehead EEG Patch for Real-time Modulation of Sleep Physiology	William G. Coon et.al.	2506.03442	null
2025-06-03	Semiconductor SEM Image Defect Classification Using Supervised and Semi-Supervised Learning with Vision Transformers	Chien-Fu et.al.	2506.03345	null
2025-06-03	Extremely large oblate deformation of the first excited state in $^{12}$C: a new challenge to modern nuclear theory	C. Ngwetsheni et.al.	2506.03236	null
2025-05-31	Human Fall Detection using Transfer Learning-based 3D CNN	Ekram Alam et.al.	2506.03193	null
2025-06-04	MMM4Rec: A Transfer-Efficient Framework for Multi-modal Sequential Recommendation	Hao Fan et.al.	2506.02916	null
2025-06-03	MVTD: A Benchmark Dataset for Maritime Visual Object Tracking	Ahsan Baidar Bakht et.al.	2506.02866	null
2025-06-03	Self-attention U-Net decoder for toric codes	Wei-Wei Zhang et.al.	2506.02734	link
2025-06-03	MLaGA: Multimodal Large Language and Graph Assistant	Dongzhe Fan et.al.	2506.02568	null
2025-06-02	Benchmarking Large Language Models for Polymer Property Predictions	Sonakshi Gupta et.al.	2506.02129	null
2025-06-02	Principled data augmentation for learning to solve quadratic programming problems	Chendi Qian et.al.	2506.01728	null
2025-06-02	Computing Diverse and Nice Triangulations	Waldo Gálvez et.al.	2506.01323	null
2025-06-01	Advancing from Automated to Autonomous Beamline by Leveraging Computer Vision	Baolu Li et.al.	2506.00836	null
2025-05-31	Getting More from Less: Transfer Learning Improves Sleep Stage Decoding Accuracy in Peripheral Wearable Devices	William G Coon et.al.	2506.00730	null
2025-05-31	Temporal Chunking Enhances Recognition of Implicit Sequential Patterns	Jayanta Dey et.al.	2506.00588	null
2025-05-31	COGNATE: Acceleration of Sparse Tensor Programs on Emerging Hardware using Transfer Learning	Chamika Sudusinghe et.al.	2506.00424	null
2025-05-31	Neuro2Semantic: A Transfer Learning Framework for Semantic Reconstruction of Continuous Language from Human Intracranial EEG	Siavash Shams et.al.	2506.00381	link
2025-05-30	Conformal Prediction for Zero-Shot Models	Julio Silva-Rodríguez et.al.	2505.24693	link
2025-05-30	Density Ratio Permutation Tests with connections to distributional shifts and conditional two-sample testing	Alberto Bordino et.al.	2505.24529	null
2025-05-30	Attractor learning for spatiotemporally chaotic dynamical systems using echo state networks with transfer learning	Mohammad Shah Alam et.al.	2505.24099	null
2025-05-29	BIRD: Behavior Induction via Representation-structure Distillation	Galen Pogoncheff et.al.	2505.23933	null
2025-05-29	To Trust Or Not To Trust Your Vision-Language Model’s Prediction	Hao Dong et.al.	2505.23745	link
2025-05-29	Epistemic Errors of Imperfect Multitask Learners When Distributions Shift	Sabina J. Sloman et.al.	2505.23496	null
2025-05-29	Graph Positional Autoencoders as Self-supervised Learners	Yang Liu et.al.	2505.23345	null
2025-05-29	FreRA: A Frequency-Refined Augmentation for Contrastive Learning on Time Series Classification	Tian Tian et.al.	2505.23181	link
2025-05-28	When Does Neuroevolution Outcompete Reinforcement Learning in Transfer Learning Tasks?	Eleni Nisioti et.al.	2505.22696	link
2025-05-28	Chest Disease Detection In X-Ray Images Using Deep Learning Classification Method	Alanna Hazlett et.al.	2505.22609	null
2025-05-28	GLAMP: An Approximate Message Passing Framework for Transfer Learning with Applications to Lasso-based Estimators	Longlin Wang et.al.	2505.22594	null
2025-05-27	A Joint Reconstruction-Triplet Loss Autoencoder Approach Towards Unseen Attack Detection in IoV Networks	Julia Boone et.al.	2505.21703	null
2025-05-27	LLMPR: A Novel LLM-Driven Transfer Learning based Petition Ranking Model	Avijit Gayen et.al.	2505.21689	null
2025-05-27	Optimizing Deep Learning for Skin Cancer Classification: A Computationally Efficient CNN with Minimal Accuracy Trade-Off	Abdullah Al Mamun et.al.	2505.21597	null
2025-05-26	Streamlining Resilient Kubernetes Autoscaling with Multi-Agent Systems via an Automated Online Design Framework	Julien Soulé et.al.	2505.21559	null
2025-05-27	Data-Driven Cellular Mobility Management via Bayesian Optimization and Reinforcement Learning	Mohamed Benzaghta et.al.	2505.21249	null
2025-05-27	Transfer learning for multifidelity simulation-based inference in cosmology	Alex A. Saoulis et.al.	2505.21215	null
2025-05-27	Intelligent Incident Hypertension Prediction in Obstructive Sleep Apnea	Omid Halimi Milani et.al.	2505.20615	null
2025-05-26	Solving Euler equations with Multiple Discontinuities via Separation-Transfer Physics-Informed Neural Networks	Chuanxing Wang et.al.	2505.20361	null
2025-05-26	ViTaPEs: Visuotactile Position Encodings for Cross-Modal Alignment in Multimodal Transformers	Fotios Lygerakis et.al.	2505.20032	null
2025-05-26	Advancements in Medical Image Classification through Fine-Tuning Natural Domain Foundation Models	Mobina Mansoori et.al.	2505.19779	link
2025-05-25	Omni-Perception: Omnidirectional Collision Avoidance for Legged Locomotion in Dynamic Environments	Zifan Wang et.al.	2505.19214	null
2025-05-25	A Smart Healthcare System for Monkeypox Skin Lesion Detection and Tracking	Huda Alghoraibi et.al.	2505.19023	null
2025-05-29	Pessimism Principle Can Be Effective: Towards a Framework for Zero-Shot Transfer Reinforcement Learning	Chi Zhang et.al.	2505.18447	null
2025-05-23	X-MethaneWet: A Cross-scale Global Wetland Methane Emission Benchmark Dataset for Advancing Science Discovery with AI	Yiming Sun et.al.	2505.18355	null
2025-05-21	Reinforcement Twinning for Hybrid Control of Flapping-Wing Drones	Romain Poletti et.al.	2505.18201	null
2025-05-23	TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations	Alan Arazi et.al.	2505.18125	null
2025-05-23	Wasserstein Transfer Learning	Kaicheng Zhang et.al.	2505.17404	null
2025-05-22	Transfer Faster, Price Smarter: Minimax Dynamic Pricing under Cross-Market Preference Shift	Yi Zhang et.al.	2505.17203	null
2025-05-22	Mitigating Overfitting in Medical Imaging: Self-Supervised Pretraining vs. ImageNet Transfer Learning for Dermatological Diagnosis	Iván Matas et.al.	2505.16773	null
2025-05-24	End-to-End Framework for Predicting the Remaining Useful Life of Lithium-Ion Batteries	Khoa Tran et.al.	2505.16664	null
2025-05-22	WikiDBGraph: Large-Scale Database Graph of Wikidata for Collaborative Learning	Zhaomin Wu et.al.	2505.16635	null
2025-05-22	Reward-Aware Proto-Representations in Reinforcement Learning	Hon Tik Tse et.al.	2505.16217	null
2025-05-22	Scalable Graph Generative Modeling via Substructure Sequences	Zehong Wang et.al.	2505.16130	link
2025-05-21	An Exploratory Approach Towards Investigating and Explaining Vision Transformer and Transfer Learning for Brain Disease Detection	Shuvashis Sarker et.al.	2505.16039	null
2025-05-21	An Approach Towards Identifying Bangladeshi Leaf Diseases through Transfer Learning and XAI	Faika Fairuj Preotee et.al.	2505.16033	null
2025-05-21	Comprehensive Lung Disease Detection Using Deep Learning Models and Hybrid Chest X-ray Data with Explainable AI	Shuvashis Sarker et.al.	2505.16028	null
2025-05-21	Transfer of Structural Knowledge from Synthetic Languages	Mikhail Budnikov et.al.	2505.15769	link
2025-05-21	Inter-Subject Variance Transfer Learning for EMG Pattern Classification Based on Bayesian Inference	Seitaro Yoneda et.al.	2505.15381	null
2025-05-21	Scaling Diffusion Transformers Efficiently via $μ$P	Chenyu Zheng et.al.	2505.15270	link
2025-05-21	GAMA++: Disentangled Geometric Alignment with Adaptive Contrastive Perturbation for Reliable Domain Transfer	Kim Yun et.al.	2505.15241	null
2025-05-21	Geometrically Regularized Transfer Learning with On-Manifold and Off-Manifold Perturbation	Hana Satou et.al.	2505.15191	null
2025-05-21	AnyBody: A Benchmark Suite for Cross-Embodiment Manipulation	Meenal Parakh et.al.	2505.14986	null
2025-05-20	MultiMAE Meets Earth Observation: Pre-training Multi-modal Multi-task Masked Autoencoders for Earth Observation Tasks	Jose Sosa et.al.	2505.14951	link
2025-05-20	LOD1 3D City Model from LiDAR: The Impact of Segmentation Accuracy on Quality of Urban 3D Modeling and Morphology Extraction	Fatemeh Chajaei et.al.	2505.14747	link
2025-05-20	Vulnerability of Transfer-Learned Neural Networks to Data Reconstruction Attacks in Small-Data Regime	Tomasz Maciążek et.al.	2505.14323	link
2025-05-20	Data-Efficient Hate Speech Detection via Cross-Lingual Nearest Neighbor Retrieval with Limited Labeled Data	Faeze Ghorbanpour et.al.	2505.14272	null
2025-05-20	Contrastive Consolidation of Top-Down Modulations Achieves Sparsely Supervised Continual Learning	Viet Anh Khoa Tran et.al.	2505.14125	null
2025-05-20	Domain Adaptation of VLM for Soccer Video Understanding	Tiancheng Jiang et.al.	2505.13860	null
2025-05-19	Adaptive Image Restoration for Video Surveillance: A Real-Time Approach	Muhammad Awais Amin et.al.	2505.13130	null
2025-05-19	Cross-modal Knowledge Transfer Learning as Graph Matching Based on Optimal Transport for ASR	Xugang Lu et.al.	2505.13079	null
2025-05-19	Mamba-Adaptor: State Space Model Adaptor for Visual Recognition	Fei Xie et.al.	2505.12685	null
2025-05-19	On the Mechanisms of Adversarial Data Augmentation for Robust and Adaptive Transfer Learning	Hana Satou et.al.	2505.12681	null
2025-05-18	InnateCoder: Learning Programmatic Options with Foundation Models	Rubens O. Moraes et.al.	2505.12508	link
2025-05-18	Depth Transfer: Learning to See Like a Simulator for Real-World Drone Navigation	Hang Yu et.al.	2505.12428	null
2025-05-17	Relation-Aware Graph Foundation Model	Jianxiang Yu et.al.	2505.12027	null
2025-05-17	Residual Feature Integration is Sufficient to Prevent Negative Transfer	Yichen Xu et.al.	2505.11771	link
2025-05-16	Evaluation and optimization of deep learning models for enhanced detection of brain cancer using transmission optical microscopy of thin brain tissue samples	Mohnish Sao et.al.	2505.11735	null
2025-05-16	Humble your Overconfident Networks: Unlearning Overfitting via Sequential Monte Carlo Tempered Deep Ensembles	Andrew Millard et.al.	2505.11671	null
2025-05-16	Programmable metasurfaces for future photonic artificial intelligence	Loubnan Abou-Hamdan et.al.	2505.11659	null
2025-05-16	Towards Cultural Bridge by Bahnaric-Vietnamese Translation Using Transfer Learning of Sequence-To-Sequence Pre-training Language Model	Phan Tran Minh Dat et.al.	2505.11421	null
2025-05-16	Assessing the Performance of Analog Training for Transfer Learning	Omobayode Fagbohungbe et.al.	2505.11067	null
2025-05-19	Bias and Generalizability of Foundation Models across Datasets in Breast Mammography	Elodie Germani et.al.	2505.10579	null
2025-05-15	An AI-driven framework for the prediction of personalised health response to air pollution	Nazanin Zounemat Kermani et.al.	2505.10556	null
2025-05-15	Logos as a Well-Tempered Pre-train for Sign Language Recognition	Ilya Ovodov et.al.	2505.10481	null
2025-05-15	MMRL++: Parameter-Efficient and Interaction-Aware Representation Learning for Vision-Language Models	Yuncheng Guo et.al.	2505.10088	link
2025-05-15	Automated grading and staging of ovarian cancer using deep learning on the transmission optical microscopy bright-field images of thin biopsy tissue samples	Ashmit K Mishra et.al.	2505.09993	null
2025-05-14	Community-based Multi-Agent Reinforcement Learning with Transfer and Active Exploration	Zhaoyang Shi et.al.	2505.09756	null
2025-05-14	Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis	Bingxin Ke et.al.	2505.09358	link
2025-05-13	GNN-based Precoder Design and Fine-tuning for Cell-free Massive MIMO with Real-world CSI	Tianzheng Miao et.al.	2505.08788	null
2025-05-13	Revealing economic facts: LLMs know more than they say	Marcus Buckmann et.al.	2505.08662	null
2025-05-13	A computer vision-based model for occupancy detection using low-resolution thermal images	Xue Cui et.al.	2505.08336	null
2025-05-13	Knowledge-Informed Deep Learning for Irrigation Type Mapping from Remote Sensing	Oishee Bintey Hoque et.al.	2505.08302	null
2025-05-12	Sleep Position Classification using Transfer Learning for Bed-based Pressure Sensors	Olivier Papillon et.al.	2505.08111	null
2025-05-12	Multi-modal wound classification using wound image and location by Xception and Gaussian Mixture Recurrent Neural Network (GMRNN)	Ramin Mousa et.al.	2505.08086	null
2025-05-10	Development of a WAZOBIA-Named Entity Recognition System	S. E Emedem et.al.	2505.07884	null
2025-05-12	Gameplay Highlights Generation	Vignesh Edithal et.al.	2505.07721	null
2025-05-12	Transfer Learning Across Fixed-Income Product Classes	Nicolas Camenzind et.al.	2505.07676	null
2025-05-12	Automated Visual Attention Detection using Mobile Eye Tracking in Behavioral Classroom Studies	Efe Bozkir et.al.	2505.07552	null
2025-05-12	Linux Kernel Configurations at Scale: A Dataset for Performance and Evolution Analysis	Heraldo Borges et.al.	2505.07487	link
2025-05-11	Enhancing Inference for Small Cohorts via Transfer Learning and Weighted Integration of Multiple Datasets	Subharup Guha et.al.	2505.07153	null
2025-05-15	A systematic review of challenges and proposed solutions in modeling multimodal data	Maryam Farhadizadeh et.al.	2505.06945	null
2025-05-11	A Split-then-Join Approach to Abstractive Summarization for Very Long Documents in a Low Resource Setting	Lhuqita Fazry et.al.	2505.06862	link
2025-05-10	Deep Neural Networks for Cross-Energy Particle Identification at RHIC and LHC	Omar M. Khalaf et.al.	2505.06732	null
2025-05-10	Mixer-Informer-Based Two-Stage Transfer Learning for Long-Sequence Load Forecasting in Newly Constructed Electric Vehicle Charging Stations	Zhenhua Zhou et.al.	2505.06657	null
2025-05-09	The 76Cu conundrum remains unsolved	B. Olaizola et.al.	2505.06400	null
2025-05-09	NSF-MAP: Neurosymbolic Multimodal Fusion for Robust and Interpretable Anomaly Prediction in Assembly Pipelines	Chathurangi Shyalika et.al.	2505.06333	link
2025-05-09	The Application of Deep Learning for Lymph Node Segmentation: A Systematic Review	Jingguo Qu et.al.	2505.06118	null
2025-05-09	Discovery of the Polar Ring Galaxies with deep learning	D. V. Dobrycheva et.al.	2505.05890	null
2025-05-09	Automated Knot Detection and Pairing for Wood Analysis in the Timber Industry	Guohao Lin et.al.	2505.05845	null
2025-05-09	HyperspectralMAE: The Hyperspectral Imagery Classification Model using Fourier-Encoded Dual-Branch Masked Autoencoder	Wooyoung Jeong et.al.	2505.05710	null
2025-05-08	Fast and Fourier Features for Transfer Learning of Interatomic Potentials	Pietro Novelli et.al.	2505.05652	null
2025-05-08	Improved Brain Tumor Detection in MRI: Fuzzy Sigmoid Convolution in Deep Learning	Muhammad Irfan et.al.	2505.05208	null
2025-05-08	Structural Alignment in Link Prediction	Jeffrey Seathrún Sardina et.al.	2505.04939	link
2025-05-08	VaCDA: Variational Contrastive Alignment-based Scalable Human Activity Recognition	Soham Khisa et.al.	2505.04907	null
2025-05-05	Advanced Clustering Framework for Semiconductor Image Analytics Integrating Deep TDA with Self-Supervised and Transfer Learning Techniques	Janhavi Giri et.al.	2505.03848	null
2025-05-06	Sustainable Smart Farm Networks: Enhancing Resilience and Efficiency with Decision Theory-Guided Deep Reinforcement Learning	Dian Chen et.al.	2505.03721	null
2025-05-07	Multi-modal cascade feature transfer for polymer property prediction	Kiichi Obuchi et.al.	2505.03704	null
2025-05-06	Comparative Analysis of Lightweight Deep Learning Models for Memory-Constrained Devices	Tasnim Shahriar et.al.	2505.03303	null
2025-05-06	HMAE: Self-Supervised Few-Shot Learning for Quantum Spin Systems	Ibne Farabi Shihab et.al.	2505.03140	null
2025-05-05	Early Prediction of Sepsis: Feature-Aligned Transfer Learning	Oyindolapo O. Komolafe et.al.	2505.02889	null
2025-05-05	Aerodynamic and structural airfoil shape optimisation via Transfer Learning-enhanced Deep Reinforcement Learning	David Ramos et.al.	2505.02634	null
2025-05-04	Local Herb Identification Using Transfer Learning: A CNN-Powered Mobile Application for Nepalese Flora	Prajwal Thapa et.al.	2505.02147	null
2025-05-03	Low-Complexity Acoustic Scene Classification with Device Information in the DCASE 2025 Challenge	Florian Schmid et.al.	2505.01747	link
2025-05-02	Transfer Learning-Based Deep Residual Learning for Speech Recognition in Clean and Noisy Environments	Noussaiba Djeffal et.al.	2505.01632	null
2025-05-02	A Physics-preserved Transfer Learning Method for Differential Equations	Hao-Ran Yang et.al.	2505.01281	null
2025-05-01	A Robust Deep Networks based Multi-Object MultiCamera Tracking System for City Scale Traffic	Muhammad Imran Zaman et.al.	2505.00534	null
2025-05-01	AI-Assisted Decision-Making for Clinical Assessment of Auto-Segmented Contour Quality	Biling Wang et.al.	2505.00308	null
2025-05-01	Explorative Curriculum Learning for Strongly Correlated Electron Systems	Kimihiro Yamazaki et.al.	2505.00233	null
2025-04-30	Convergence rate for Nearest Neighbour matching: geometry of the domain and higher-order regularity	Simon Viel et.al.	2504.21633	null
2025-04-30	Multi-level datasets training method in Physics-Informed Neural Networks	Yao-Hsuan Tsai et.al.	2504.21328	null
2025-04-30	Multi-modal Transfer Learning for Dynamic Facial Emotion Recognition in the Wild	Ezra Engel et.al.	2504.21248	null
2025-04-29	A Brief Review for Compression and Transfer Learning Techniques in DeepFake Detection	Andreas Karathanasis et.al.	2504.21066	null
2025-04-29	SVD Based Least Squares for X-Ray Pneumonia Classification Using Deep Features	Mete Erdogan et.al.	2504.20970	null
2025-04-29	Transfer Learning Under High-Dimensional Network Convolutional Regression Model	Liyuan Wang et.al.	2504.19979	null
2025-04-28	Comments on the minimal training set for CNN: a case study of the frustrated $J_1$-$J_2$ Ising model on the square lattice	Shang-Wei Li et.al.	2504.19795	null
2025-04-26	Improving Pretrained YAMNet for Enhanced Speech Command Detection via Transfer Learning	Sidahmed Lachenani et.al.	2504.19030	null
2025-04-26	Predicting Stress in Two-phase Random Materials and Super-Resolution Method for Stress Images by Embedding Physical Information	Tengfei Xing et.al.	2504.18854	null
2025-04-26	FiberKAN: Kolmogorov-Arnold Networks for Nonlinear Fiber Optics	Xiaotian Jiang et.al.	2504.18833	null
2025-04-23	Speaker Diarization for Low-Resource Languages Through Wav2vec Fine-Tuning	Abdulhady Abas Abdullah et.al.	2504.18582	null
2025-04-25	Unifying Direct and Indirect Learning for Safe Control of Linear Systems	Amir Modares et.al.	2504.18331	null
2025-04-25	Post-Transfer Learning Statistical Inference in High-Dimensional Regression	Nguyen Vu Khai Tam et.al.	2504.18212	null
2025-04-25	A Model Zoo on Phase Transitions in Neural Networks	Konstantin Schürholt et.al.	2504.18072	null
2025-04-24	FlexPINN: Modeling Fluid Dynamics and Mass Transfer in 3D Micromixer Geometries Using a Flexible Physics-Informed Neural Network	Meraj Hassanzadeh et.al.	2504.17896	null
2025-04-22	Research on Cloud Platform Network Traffic Monitoring and Anomaly Detection System based on Large Language Models	Ze Yang et.al.	2504.17807	null
2025-04-24	An Explainable Nature-Inspired Framework for Monkeypox Diagnosis: Xception Features Combined with NGBoost and African Vultures Optimization Algorithm	Ahmadreza Shateri et.al.	2504.17540	null
2025-04-25	On the workflow, opportunities and challenges of developing foundation model in geophysics	Hanlin Sheng et.al.	2504.17384	null
2025-04-24	The Riemannian Means Field Classifier for EEG-Based BCI Data	Anton Andreev et.al.	2504.17352	null
2025-04-24	Low-Resource Neural Machine Translation Using Recurrent Neural Networks and Transfer Learning: A Case Study on English-to-Igbo	Ocheme Anthony Ekle et.al.	2504.17252	null
2025-04-23	A Systematic Approach to Design Real-World Human-in-the-Loop Deep Reinforcement Learning: Salient Features, Challenges and Trade-offs	Jalal Arabneydi et.al.	2504.17006	null
2025-04-23	An Adaptive ML Framework for Power Converter Monitoring via Federated Transfer Learning	Panagiotis Kakosimos et.al.	2504.16866	null
2025-04-22	SparseJEPA: Sparse Representation Learning of Joint Embedding Predictive Architectures	Max Hartman et.al.	2504.16140	null
2025-04-21	Active Learning Methods for Efficient Data Utilization and Model Performance Enhancement	Chiung-Yi Tseng et.al.	2504.16136	null
2025-04-22	Efficient Adaptation of Deep Neural Networks for Semantic Segmentation in Space Applications	Leonardo Olivi et.al.	2504.15991	null
2025-04-23	MedNNS: Supernet-based Medical Task-Adaptive Neural Network Search	Lotfi Abdelkrim Mecharbat et.al.	2504.15865	null
2025-04-22	Transfer Learning for High-dimensional Reduced Rank Time Series Models	Mingliang Ma et.al.	2504.15691	null
2025-04-21	Fourier analysis of the physics of transfer learning for data-driven subgrid-scale models of ocean turbulence	Moein Darman et.al.	2504.15487	null
2025-04-21	Transferable Learning of Reaction Pathways from Geometric Priors	Juno Nam et.al.	2504.15370	link
2025-04-22	Histogram-based Parameter-efficient Tuning for Passive Sonar Classification	Amirmohammad Mohammadi et.al.	2504.15214	link
2025-04-21	Is Intelligence the Right Direction in New OS Scheduling for Multiple Resources in Cloud Environments?	Xinglei Dou et.al.	2504.15021	null
2025-04-21	PIV-FlowDiffuser: Transfer-learning-based denoising diffusion models for PIV	Qianyu Zhu et.al.	2504.14952	link
2025-04-18	CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning	Yang Yue et.al.	2504.13820	link
2025-04-18	Enhancing Pothole Detection and Characterization: Integrated Segmentation and Depth Estimation in Road Anomaly Systems	Uthman Baroudi et.al.	2504.13648	null
2025-04-18	MetaDSE: A Few-shot Meta-learning Framework for Cross-workload CPU Design Space Exploration	Runzhen Xue et.al.	2504.13568	null
2025-04-18	A Deep Learning-Based Supervised Transfer Learning Framework for DOA Estimation with Array Imperfections	Bo Zhou et.al.	2504.13394	link
2025-04-17	Non-Uniform Class-Wise Coreset Selection: Characterizing Category Difficulty for Data-Efficient Transfer Learning	Hanyu Zhang et.al.	2504.13234	null
2025-04-17	Scaling Laws for Data-Efficient Visual Transfer Learning	Wenxuan Yang et.al.	2504.13219	null
2025-04-17	Transfer Learning via Auxiliary Labels with Application to Cold-Hardiness Prediction	Kristen Goebel et.al.	2504.13142	null
2025-04-17	All-in-One Transferring Image Compression from Human Perception to Multi-Machine Perception	Jiancheng Zhao et.al.	2504.12997	null
2025-04-17	Enhancing Cocoa Pod Disease Classification via Transfer Learning and Ensemble Methods: Toward Robust Predictive Modeling	Devina Anduyan et.al.	2504.12992	null
2025-04-17	Quantum Computing Supported Adversarial Attack-Resilient Autonomous Vehicle Perception Module for Traffic Sign Classification	Reek Majumder et.al.	2504.12644	link
2025-04-17	Privacy-Preserving CNN Training with Transfer Learning: Two Hidden Layers	John Chiang et.al.	2504.12623	null
2025-04-15	TransST: Transfer Learning Embedded Spatial Factor Modeling of Spatial Transcriptomics Data	Shuo Shuo Liu et.al.	2504.12353	link
2025-04-16	Secure Transfer Learning: Training Clean Models Against Backdoor in (Both) Pre-trained Encoders and Downstream Datasets	Yechao Zhang et.al.	2504.11990	null
2025-04-15	Towards a Universal Vibration Analysis Dataset: A Framework for Transfer Learning in Predictive Maintenance and Structural Health Monitoring	Mert Sehri et.al.	2504.11581	null
2025-04-15	Rank-based transfer learning for high-dimensional survival data with application to sepsis data	Nan Qiao et.al.	2504.11270	null
2025-04-15	Meta-learning For Few-Shot Time Series Crop Type Classification: A Benchmark On The EuroCropsML Dataset	Joana Reuss et.al.	2504.11022	null
2025-04-17	Transfer Learning for Temporal Link Prediction	Ayan Chatterjee et.al.	2504.10925	link
2025-04-14	Transfer Learning Assisted XgBoost For Adaptable Cyberattack Detection In Battery Packs	Sanchita Ghosh et.al.	2504.10658	null
2025-04-14	Inferring genotype-phenotype maps using attention models	Krishna Rijal et.al.	2504.10388	link
2025-04-14	UP-Person: Unified Parameter-Efficient Transfer Learning for Text-based Person Retrieval	Yating Liu et.al.	2504.10084	link
2025-04-14	Learning to Harmonize Cross-vendor X-ray Images by Non-linear Image Dynamics Correction	Yucheng Lu et.al.	2504.10080	null
2025-04-14	Progressive Transfer Learning for Multi-Pass Fundus Image Restoration	Uyen Phan et.al.	2504.10025	null
2025-04-14	Masked Autoencoder Self Pre-Training for Defect Detection in Microelectronics	Nikolai Röhrich et.al.	2504.10021	null
2025-04-13	Comorbidity-Informed Transfer Learning for Neuro-developmental Disorder Diagnosis	Xin Wen et.al.	2504.09463	null
2025-04-12	Beyond Glucose-Only Assessment: Advancing Nocturnal Hypoglycemia Prediction in Children with Type 1 Diabetes	Marco Voegeli et.al.	2504.09299	null
2025-04-12	Query-based Knowledge Transfer for Heterogeneous Learning Environments	Norah Alballa et.al.	2504.09205	null
2025-04-12	Towards On-Device Learning and Reconfigurable Hardware Implementation for Encoded Single-Photon Signal Processing	Zhenya Zang et.al.	2504.09028	null
2025-04-11	Distilling and exploiting quantitative insights from Large Language Models for enhanced Bayesian optimization of chemical reactions	Roshan Patel et.al.	2504.08874	null
2025-04-11	Boosting multi-demographic federated learning for chest x-ray analysis using general-purpose self-supervised representations	Mahshad Lotfinia et.al.	2504.08584	null
2025-04-11	Banana Ripeness Level Classification using a Simple CNN Model Trained with Real and Synthetic Datasets	Luis Chuquimarca et.al.	2504.08568	null
2025-04-10	Deep Reinforcement Learning for Day-to-day Dynamic Tolling in Tradable Credit Schemes	Xiaoyi Wu et.al.	2504.08074	null
2025-04-14	Pushing the Accuracy Limit of Foundation Neural Network Models with Quantum Monte Carlo Forces and Path Integrals	Anouar Benali et.al.	2504.07948	null
2025-04-10	Focal Cortical Dysplasia Type II Detection Using Cross Modality Transfer Learning and Grad-CAM in 3D-CNNs for MRI Analysis	Lorenzo Lasagni et.al.	2504.07775	null
2025-04-10	Benchmarking Image Embeddings for E-Commerce: Evaluating Off-the Shelf Foundation Models, Fine-Tuning Strategies and Practical Trade-offs	Urszula Czerwinska et.al.	2504.07567	null
2025-04-10	Conditional Data Synthesis Augmentation	Xinyu Tian et.al.	2504.07426	null
2025-04-09	Identifying regions of interest in whole slide images of renal cell carcinoma	Mohammed Lamine Benomar et.al.	2504.07313	null
2025-04-09	Data Fusion of Deep Learned Molecular Embeddings for Property Prediction	Robert J Appleton et.al.	2504.07297	null
2025-04-09	EDIT: Enhancing Vision Transformers by Mitigating Attention Sink through an Encoder-Decoder Architecture	Wenfeng Feng et.al.	2504.06738	null
2025-04-09	TabKAN: Advancing Tabular Data Analysis using Kolmogorov-Arnold Network	Ali Eslamian et.al.	2504.06559	null
2025-04-08	High-Resource Translation: Turning Abundance into Accessibility	Abhiram Reddy Yanampally et.al.	2504.05914	null
2025-04-07	Cross-functional transferability in universal machine learning interatomic potentials	Xu Huang et.al.	2504.05565	null
2025-04-07	Cellular Network Design for UAV Corridors via Data-driven High-dimensional Bayesian Optimization	Mohamed Benzaghta et.al.	2504.05176	null
2025-04-07	Sparse Optimization for Transfer Learning: A L0-Regularized Framework for Multi-Source Domain Adaptation	Chenqi Gong et.al.	2504.04812	null
2025-04-05	ADA-Net: Attention-Guided Domain Adaptation Network with Contrastive Learning for Standing Dead Tree Segmentation Using Aerial Imagery	Mete Ahishali et.al.	2504.04271	link
2025-04-05	Quantum parallel information exchange (QPIE) hybrid network with transfer learning	Ziqing Guo et.al.	2504.04235	null
2025-04-05	PIORF: Physics-Informed Ollivier-Ricci Flow for Long-Range Interactions in Mesh Graph Neural Networks	Youn-Yeol Yu et.al.	2504.04052	null
2025-04-04	Optimizing Specific and Shared Parameters for Efficient Parameter Tuning	Van-Anh Nguyen et.al.	2504.03450	null
2025-04-04	Early detection of diabetes through transfer learning-based eye (vision) screening and improvement of machine learning model performance and advanced parameter setting algorithms	Mohammad Reza Yousefi et.al.	2504.03439	null
2025-04-04	Block Toeplitz Sparse Precision Matrix Estimation for Large-Scale Interval-Valued Time Series Forecasting	Wan Tian et.al.	2504.03322	null
2025-04-04	A model-free feature extraction procedure for interval-valued time series prediction	Wan Tian et.al.	2504.03310	null
2025-04-04	Mitigating the Impact of Electrode Shift on Classification Performance in Electromyography-Based Motion Prediction Using Sliding-Window Normalization	Taichi Tanaka et.al.	2504.03196	null
2025-04-03	Data-Driven Design of 3GPP Handover Parameters with Bayesian Optimization and Transfer Learning	Mohamed Benzaghta et.al.	2504.02633	null
2025-04-02	Instruction-Guided Autoregressive Neural Network Parameter Generation	Soro Bedionita et.al.	2504.02012	null
2025-04-02	Q-Adapt: Adapting LMM for Visual Quality Assessment with Progressive Instruction Tuning	Yiting Lu et.al.	2504.01655	link
2025-04-01	Privacy-Preserving Transfer Learning for Community Detection using Locally Distributed Multiple Networks	Xiao Guo et.al.	2504.00890	null
2025-04-01	Data-driven Optimization and Transfer Learning for Cellular Network Antenna Configurations	Mohamed Benzaghta et.al.	2504.00825	null
2025-04-01	Transfer Learning in Financial Time Series with Gramian Angular Field	Hou-Wan Long et.al.	2504.00378	null
2025-04-01	Spatiotemporal Attention Learning Framework for Event-Driven Object Recognition	Tiantian Xie et.al.	2504.00370	null
2025-04-01	CopyQNN: Quantum Neural Network Extraction Attack under Varying Quantum Noise	Zhenxiao Fu et.al.	2504.00366	null
2025-03-31	Detecting Glioma, Meningioma, and Pituitary Tumors, and Normal Brain Tissues based on Yolov11 and Yolov8 Deep Learning Models	Ahmed M. Taha et.al.	2504.00189	null
2025-03-31	From Colors to Classes: Emergence of Concepts in Vision Transformers	Teresa Dorszewski et.al.	2503.24071	link
2025-03-29	A QUBO Framework for Team Formation	Karan Vombatkere et.al.	2503.23209	null
2025-03-29	Multi-label classification for multi-temporal, multi-spatial coral reef condition monitoring using vision foundation model with adapter learning	Xinlei Shao et.al.	2503.23012	link
2025-04-01	Nonhuman Primate Brain Tissue Segmentation Using a Transfer Learning Approach	Zhen Lin et.al.	2503.22829	null
2025-03-28	Accelerated VQE: Parameter Recycling for Similar Recurring Problem Instances	Tobias Rohe et.al.	2503.22590	null
2025-03-28	Beyond Vanilla Fine-Tuning: Leveraging Multistage, Multilingual, and Domain-Specific Methods for Low-Resource Machine Translation	Sarubi Thillainathan et.al.	2503.22582	null
2025-03-28	Masked Self-Supervised Pre-Training for Text Recognition Transformers on Large-Scale Datasets	Martin Kišš et.al.	2503.22513	null
2025-03-28	On-site estimation of battery electrochemical parameters via transfer learning based physics-informed neural network approach	Josu Yeregui et.al.	2503.22396	null
2025-03-28	A Survey on Remote Sensing Foundation Models: From Vision to Multimodality	Ziyue Huang et.al.	2503.22081	link
2025-04-04	Low-Resource Transliteration for Roman-Urdu and Urdu Using Transformer-Based Models	Umer Butt et.al.	2503.21530	null
2025-03-27	Exploring the flavor structure of leptons via diffusion models	Satsuki Nishimura et.al.	2503.21432	null
2025-03-27	AugWard: Augmentation-Aware Representation Learning for Accurate Graph Classification	Minjun Kim et.al.	2503.21105	link
2025-03-27	Integrate Meta-analysis into Specific Study (InMASS) for Estimating Conditional Average Treatment Effect	Keisuke Hanada et.al.	2503.21091	link
2025-03-26	World Model Agents with Change-Based Intrinsic Motivation	Jeremias Ferrao et.al.	2503.21047	link
2025-03-26	A Deep Learning Pipeline for Large Earthquake Analysis using High-Rate Global Navigation Satellite System Data	Claudia Quinteros-Cartaya et.al.	2503.20584	null
2025-03-26	Low-resource Information Extraction with the European Clinical Case Corpus	Soumitra Ghosh et.al.	2503.20568	null
2025-03-26	Small Object Detection: A Comprehensive Survey on Challenges, Techniques and Real-World Applications	Mahya Nikouei et.al.	2503.20516	null
2025-03-26	Multi-dataset and Transfer Learning Using Gene Expression Knowledge Graphs	Rita T. Sousa et.al.	2503.20400	link
2025-03-25	The Coralscapes Dataset: Semantic Scene Understanding in Coral Reefs	Jonathan Sauder et.al.	2503.20000	link
2025-03-25	Untangling the Influence of Typology, Data and Model Architecture on Ranking Transfer Languages for Cross-Lingual POS Tagging	Enora Rice et.al.	2503.19979	null
2025-03-25	Optimizing Breast Cancer Detection in Mammograms: A Comprehensive Study of Transfer Learning, Resolution Reduction, and Multi-View Classification	Daniel G. P. Petrini et.al.	2503.19945	null
2025-03-25	Exploring Cultural Nuances in Emotion Perception Across 15 African Languages	Ibrahim Said Ahmad et.al.	2503.19642	null
2025-03-24	Continual Reinforcement Learning for HVAC Systems Control: Integrating Hypernetworks and Transfer Learning	Gautham Udayakumar Bekal et.al.	2503.19212	null
2025-03-24	Pitch Contour Exploration Across Audio Domains: A Vision-Based Transfer Learning Approach	Jakob Abeßer et.al.	2503.19161	null
2025-03-24	Out-of-distribution evaluations of channel agnostic masked autoencoders in fluorescence microscopy	Christian John Hurry et.al.	2503.19149	null
2025-03-24	Anomaly Detection Using Computer Vision: A Comparative Analysis of Class Distinction and Performance Metrics	Md. Barkat Ullah Tusher et.al.	2503.19100	null
2025-03-24	Convolutional neural network approach to ion Coulomb crystal image analysis	James Allsopp et.al.	2503.18846	null
2025-03-24	Natural Language Processing for Electronic Health Records in Scandinavian Languages: Norwegian, Swedish, and Danish	Ashenafi Zebene Woldaregay et.al.	2503.18539	null
2025-03-24	k-NN as a Simple and Effective Estimator of Transferability	Moein Sorkhei et.al.	2503.18528	null
2025-03-24	Similarity-Informed Transfer Learning for Multivariate Functional Censored Quantile Regression	Hua Liu et.al.	2503.18437	null
2025-03-24	PNN: A Novel Progressive Neural Network for Fault Classification in Rotating Machinery under Small Dataset Constraint	Praveen Chopra et.al.	2503.18263	null
2025-03-25	PAD: Towards Efficient Data Generation for Transfer Learning Using Phrase Alignment	Jong Myoung Kim et.al.	2503.18250	null
2025-03-23	Adaptive Multi-Fidelity Reinforcement Learning for Variance Reduction in Engineering Design Optimization	Akash Agrawal et.al.	2503.18229	null
2025-03-23	Adaptive Physics-informed Neural Networks: A Survey	Edgar Torres et.al.	2503.18181	null
2025-03-23	Training A Neural Network For Partially Occluded Road Sign Identification In The Context Of Autonomous Vehicles	Gulnaz Gimaletdinova et.al.	2503.18177	null
2025-03-23	Cost-effective multi-fidelity strategy for the optimization of high-Reynolds number turbine flows guided by LES	Camille Matar et.al.	2503.17977	null
2025-03-23	Physics-Guided Multi-Fidelity DeepONet for Data-Efficient Flow Field Prediction	Sunwoong Yang et.al.	2503.17941	null
2025-03-23	Cross-Domain Underwater Image Enhancement Guided by No-Reference Image Quality Assessment: A Transfer Learning Approach	Zhi Zhang et.al.	2503.17937	null
2025-03-22	Causal Inference based Transfer Learning with LLMs: An Efficient Framework for Industrial RUL Prediction	Yan Chen et.al.	2503.17686	null
2025-03-21	Shear-based Grasp Control for Multi-fingered Underactuated Tactile Robotic Hands	Christopher J. Ford et.al.	2503.17501	null
2025-03-21	Stream Automatic Detection with Convolutional Neural Network (SAD-CNN)	Alex Vera-Casanova et.al.	2503.17202	null
2025-03-21	Jailbreaking the Non-Transferable Barrier via Test-Time Data Disguising	Yongli Xiang et.al.	2503.17198	null
2025-03-21	Transfer Learning for EDFA Gain Modeling: A Semi-Supervised Approach Using Internal Amplifier Features	Agastya Raj et.al.	2503.17094	null
2025-03-21	PRIOT: Pruning-Based Integer-Only Transfer Learning for Embedded Systems	Honoka Anada et.al.	2503.16860	null
2025-03-21	Multi-property directed generative design of inorganic materials through Wyckoff-augmented transfer learning	Shuya Yamazaki et.al.	2503.16784	null
2025-03-20	UniCrossAdapter: Multimodal Adaptation of CLIP for Radiology Report Generation	Yaxiong Chen et.al.	2503.15940	link
2025-03-21	Sample-Efficient Bayesian Transfer Learning for Online Machine Parameter Optimization	Philipp Wagner et.al.	2503.15928	null
2025-03-20	Repurposing 2D Diffusion Models with Gaussian Atlas for 3D Generation	Tiange Xiang et.al.	2503.15877	null
2025-03-19	Sequential learning based PINNs to overcome temporal domain complexities in unsteady flow past flapping wings	Rahul Sundar et.al.	2503.15679	null
2025-03-20	Conjuring Positive Pairs for Efficient Unification of Representation Learning and Image Synthesis	Imanol G. Estepa et.al.	2503.15060	null
2025-03-19	Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene	Shengqiong Wu et.al.	2503.15019	null
2025-03-19	A Novel Channel Boosted Residual CNN-Transformer with Regional-Boundary Learning for Breast Cancer Detection	Aamir Mehmood et.al.	2503.15008	null
2025-03-18	Cross-Environment Transfer Learning for Location-Aided Beam Prediction in 5G and Beyond Millimeter-Wave Networks	Enrico Tosi et.al.	2503.14287	null
2025-03-18	Multi-task Learning for Identification of Porcelain in Song and Yuan Dynasties	Ziyao Ling et.al.	2503.14231	null
2025-03-17	MSWAL: 3D Multi-class Segmentation of Whole Abdominal Lesions Dataset	Zhaodong Wu et.al.	2503.13560	link
2025-03-17	Edit Transfer: Learning Image Editing via Vision In-Context Relations	Lan Chen et.al.	2503.13327	null
2025-03-17	Robot Policy Transfer with Online Demonstrations: An Active Reinforcement Learning Approach	Muhan Hou et.al.	2503.12993	null
2025-03-17	An Optimization Framework for Differentially Private Sparse Fine-Tuning	Mehdi Makni et.al.	2503.12822	null
2025-03-16	TuneNSearch: a hybrid transfer learning and local search approach for solving vehicle routing problems	Arthur Corrêa et.al.	2503.12662	null
2025-03-16	Realized Volatility Forecasting for New Issues and Spin-Offs using Multi-Source Transfer Learning	Andreas Teller et.al.	2503.12648	null
2025-03-16	COVID 19 Diagnosis Analysis using Transfer Learning	Anjali Dharmik et.al.	2503.12642	null
2025-03-16	Learning Privacy from Visual Entities	Alessio Xompero et.al.	2503.12464	null
2025-03-16	A Transformer-based survival model for prediction of all-cause mortality in heart failure patients: a multi-cohort study	Shishir Rao et.al.	2503.12317	null
2025-03-15	Automatic Characterization of Fluxonium Superconducting Qubits Parameters with Deep Transfer Learning	Huan-Hsuan Kung et.al.	2503.12099	null
2025-03-15	Effective and Efficient Cross-City Traffic Knowledge Transfer: A Privacy-Preserving Perspective	Zhihao Zeng et.al.	2503.11963	null
2025-03-14	Transfer Learning for Automated Feedback Generation on Small Datasets	Oscar Morris et.al.	2503.11836	null
2025-03-14	Deepfake Detection of Face Images based on a Convolutional Neural Network	Lukas Kroiß et.al.	2503.11389	null
2025-03-14	TransiT: Transient Transformer for Non-line-of-sight Videography	Ruiqian Li et.al.	2503.11328	null
2025-03-13	Automated Tomato Maturity Estimation Using an Optimized Residual Model with Pruning and Quantization Techniques	Muhammad Waseem et.al.	2503.10940	null
2025-03-13	SOLA-GCL: Subgraph-Oriented Learnable Augmentation Method for Graph Contrastive Learning	Tianhao Peng et.al.	2503.10100	null
2025-03-11	Are ECGs enough? Deep learning classification of cardiac anomalies using only electrocardiograms	Joao D. S. Marques et.al.	2503.08960	link
2025-03-11	Beam Selection in ISAC using Contextual Bandit with Multi-modal Transformer and Transfer Learning	Mohammad Farzanullah et.al.	2503.08937	null
2025-03-11	Towards species’ classification of the Anastrepha pseudoparallela group	Gabriel R. Palma et.al.	2503.08598	null
2025-03-11	MMRL: Multi-Modal Representation Learning for Vision-Language Models	Yuncheng Guo et.al.	2503.08497	link
2025-03-17	Structure-Activation Synergy: A Dual Efficiency Framework for Parameter-Memory Optimized Transfer Learning	Tian Jin et.al.	2503.08154	null
2025-03-11	Pre-trained Models Succeed in Medical Imaging with Representation Similarity Degradation	Wenqiang Zu et.al.	2503.07958	null
2025-03-11	A Study to Evaluate the Impact of LoRA Fine-tuning on the Performance of Non-functional Requirements Classification	Xia Li et.al.	2503.07927	null
2025-03-10	Elderly Activity Recognition in the Wild: Results from the EAR Challenge	Anh-Kiet Duong et.al.	2503.07821	null
2025-03-10	Real-Time Load Estimation for Load-lifting Exoskeletons Using Insole Pressure Sensors and Machine Learning	Kaida Wu et.al.	2503.07527	null
2025-03-10	Linguistic Knowledge Transfer Learning for Speech Enhancement	Kuo-Hsuan Hung et.al.	2503.07078	null
2025-03-10	Are We Truly Forgetting? A Critical Re-examination of Machine Unlearning Evaluation Protocols	Yongwoo Kim et.al.	2503.06991	null
2025-03-09	Transfer Learning for LQR Control	Taosha Guo et.al.	2503.06755	null
2025-03-09	MetaXCR: Reinforcement-Based Meta-Transfer Learning for Cross-Lingual Commonsense Reasoning	Jie He et.al.	2503.06531	null
2025-03-09	R+R: Security Vulnerability Dataset Quality Is Critical	Anurag Swarnim Yadav et.al.	2503.06387	link
2025-03-08	Adversarial Robustness of Discriminative Self-Supervised Learning in Vision	Ömer Veysel Çağatan et.al.	2503.06361	null
2025-03-08	NeuroADDA: Active Discriminative Domain Adaptation in Connectomic	Shashata Sawmya et.al.	2503.06196	null
2025-03-07	CACTUS: An Open Dataset and Framework for Automated Cardiac Assessment and Classification of Ultrasound Images Using Deep Transfer Learning	Hanae Elmekki et.al.	2503.05604	null
2025-03-10	opXRD: Open Experimental Powder X-ray Diffraction Database	Daniel Hollarek et.al.	2503.05577	null
2025-03-13	Statistical Deficiency for Task Inclusion Estimation	Loïc Fosse et.al.	2503.05491	null
2025-03-07	Quantum-PEFT: Ultra parameter-efficient fine-tuning	Toshiaki Koike-Akino et.al.	2503.05431	null
2025-03-07	Spatial Distillation based Distribution Alignment (SDDA) for Cross-Headset EEG Classification	Dingkun Liu et.al.	2503.05349	link
2025-03-06	TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation	Lin Sun et.al.	2503.04872	null
2025-03-06	DM-Adapter: Domain-Aware Mixture-of-Adapters for Text-Based Person Retrieval	Yating Liu et.al.	2503.04144	null
2025-03-05	On the Acquisition of Shared Grammatical Representations in Bilingual Language Models	Catherine Arnett et.al.	2503.03962	null
2025-03-05	Hierarchical quantum embedding by machine learning for large molecular assemblies	Moritz Bensberg et.al.	2503.03928	null
2025-03-05	Sarcasm Detection as a Catalyst: Improving Stance Detection with Cross-Target Capabilities	Gibson Nkhata et.al.	2503.03787	null
2025-03-04	A Phylogenetic Approach to Genomic Language Modeling	Carlos Albors et.al.	2503.03773	link
2025-03-10	MA-LoT: Multi-Agent Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving	Ruida Wang et.al.	2503.03205	link
2025-03-05	Intermediate-Task Transfer Learning: Leveraging Sarcasm Detection for Stance Detection	Gibson Nkhata et.al.	2503.03172	null
2025-03-04	Teaching AI to Handle Exceptions: Supervised Fine-Tuning with Human-Aligned Judgment	Matthew DosSantos DiSorbo et.al.	2503.02976	null
2025-03-03	Hyperspectral Image Restoration and Super-resolution with Physics-Aware Deep Learning for Biomedical Applications	Yuchen Xiang et.al.	2503.02908	null
2025-03-03	Diagnosis of Patients with Viral, Bacterial, and Non-Pneumonia Based on Chest X-Ray Images Using Convolutional Neural Networks	Carlos Arizmendi et.al.	2503.02906	null
2025-03-04	Remote Sensing Image Classification Using Convolutional Neural Network (CNN) and Transfer Learning Techniques	Mustafa Majeed Abd Zaid et.al.	2503.02510	null
2025-03-04	X2CT-CLIP: Enable Multi-Abnormality Detection in Computed Tomography from Chest Radiography via Tri-Modal Contrastive Learning	Jianzhong You et.al.	2503.02162	null
2025-03-03	A General Neural Network Potential for Energetic Materials with C, H, N, and O elements	Mingjie Wen et.al.	2503.01932	link
2025-03-03	Do GFlowNets Transfer? Case Study on the Game of 24/42	Adesh Gupta et.al.	2503.01819	null
2025-03-03	An Efficient Approach to Detecting Lung Nodules Using Swin Transformer	Saeed Shakuri et.al.	2503.01592	null
2025-03-03	A Zero-Shot Learning Approach for Ephemeral Gully Detection from Remote Sensing using Vision Language Models	Seyed Mohamad Ali Tousi et.al.	2503.01169	null
2025-03-01	Rapid morphology characterization of two-dimensional TMDs and lateral heterostructures based on deep learning	Junqi He et.al.	2503.00470	link
2025-03-01	Towards Understanding the Benefit of Multitask Representation Learning in Decision Process	Rui Lu et.al.	2503.00345	null
2025-02-28	Optimal Transfer Learning for Missing Not-at-Random Matrix Completion	Akhil Jalan et.al.	2503.00174	null
2025-02-28	Fine-tuning machine-learned particle-flow reconstruction for new detector geometries in future colliders	Farouk Mokhtar et.al.	2503.00131	null
2025-02-28	RuCCoD: Towards Automated ICD Coding in Russian	Aleksandr Nesterov et.al.	2502.21263	link
2025-02-28	Incorporating Long-Range Interactions via the Multipole Expansion into Ground and Excited-State Molecular Simulations	Rhyan Barrett et.al.	2502.21045	null
2025-02-27	On the Role of Individual Differences in Current Approaches to Computational Image Aesthetics	Li-Wei Chen et.al.	2502.20518	null
2025-02-27	Deep Convolutional Neural Networks for Palm Fruit Maturity Classification	Mingqiang Han et.al.	2502.20223	link
2025-02-27	An Amplitude-Encoding-Based Classical-Quantum Transfer Learning framework: Outperforming Classical Methods in Image Recognition	Shouwei Hu et.al.	2502.20184	null
2025-02-27	Transfer Learning in Latent Contextual Bandits with Covariate Shift Through Causal Transportability	Mingwei Deng et.al.	2502.20153	link
2025-02-27	Energy-carbon comprehensive efficiency evaluation of hydrogen metallurgy system considering low-temperature waste heat recovery	Qiang Ji et.al.	2502.20131	null
2025-02-27	Efficient Machine Learning Approach for Yield Prediction in Chemical Reactions	Supratim Ghosh et.al.	2502.19976	null
2025-02-27	A Principled Approach to Bayesian Transfer Learning	Adam Bretherton et.al.	2502.19796	null
2025-02-26	Deep Learning-Based Transfer Learning for Classification of Cassava Disease	Ademir G. Costa Junior et.al.	2502.19351	null
2025-02-26	Can RLHF be More Efficient with Imperfect Reward Models? A Policy Coverage Perspective	Jiawei Huang et.al.	2502.19255	link
2025-03-01	GraphBridge: Towards Arbitrary Transfer Learning in GNNs	Li Ju et.al.	2502.19252	link
2025-02-26	A Sample-Level Evaluation and Generative Framework for Model Inversion Attacks	Haoyang Li et.al.	2502.19070	link
2025-02-26	KAN-powered large-target detection for automotive radar	Vinay Kulkarni et.al.	2502.19000	null
2025-02-25	Transfer Learning Assisted Fast Design Migration Over Technology Nodes: A Study on Transformer Matching Network	Chenhao Chu et.al.	2502.18636	link
2025-02-25	Transfer Learning for Transient Classification: From Simulations to Real Data and ZTF to LSST	Rithwik Gupta et.al.	2502.18558	null
2025-02-23	Rewards-based image analysis in microscopy	Kamyar Barakati et.al.	2502.18522	null
2025-02-25	Conformal Prediction Under Generalized Covariate Shift with Posterior Drift	Baozhen Wang et.al.	2502.17744	null
2025-02-23	Multimodal Bearing Fault Classification Under Variable Conditions: A 1D CNN with Transfer Learning	Tasfiq E. Alam et.al.	2502.17524	null
2025-02-24	Leveraging recurrence in neural network wavefunctions for large-scale simulations of Heisenberg antiferromagnets: the square lattice	M. Schuyler Moss et.al.	2502.17144	link
2025-02-24	Provable Benefits of Unsupervised Pre-training and Transfer Learning via Single-Index Models	Taj Jones-McCormick et.al.	2502.16849	null
2025-02-23	Automated Keypoint Estimation for Self-Piercing Rivet Joints Using micro-CT Imaging and Transfer Learning	Wei Qin Chuah et.al.	2502.16752	null
2025-02-27	Diagnosing COVID-19 Severity from Chest X-Ray Images Using ViT and CNN Architectures	Luis Lara et.al.	2502.16622	link
2025-02-23	SDA-DDA Semi-supervised Domain Adaptation with Dynamic Distribution Alignment Network For Emotion Recognition Using EEG Signals	Jiahao Tang et.al.	2502.16485	link
2025-02-22	Iterative Auto-Annotation for Scientific Named Entity Recognition Using BERT-Based Models	Kartik Gupta et.al.	2502.16312	null
2025-02-21	Graph Attention Convolutional U-NET: A Semantic Segmentation Model for Identifying Flooded Areas	Muhammad Umair Danish et.al.	2502.15907	null
2025-02-21	Improving variable selection properties by using external data	Paul Rognon-Vael et.al.	2502.15584	null
2025-02-21	Fine-tuning foundation models of materials interatomic potentials with frozen transfer learning	Mariia Radova et.al.	2502.15582	null
2025-02-20	P2W: From Power Traces to Weights Matrix – An Unconventional Transfer Learning Approach	Roozbeh Siyadatzadeh et.al.	2502.14968	null
2025-02-20	Multi-dataset synergistic in supervised learning to pre-label structural components in point clouds from shell construction scenes	Lukas Rauch et.al.	2502.14721	null
2025-02-20	Distribution Matching for Self-Supervised Transfer Learning	Yuling Jiao et.al.	2502.14424	link
2025-02-20	A Macro- and Micro-Hierarchical Transfer Learning Framework for Cross-Domain Fake News Detection	Xuankai Yang et.al.	2502.14403	null
2025-02-20	Asymmetric Co-Training for Source-Free Few-Shot Domain Adaptation	Gengxu Li et.al.	2502.14214	link
2025-02-19	Appeal prediction for AI up-scaled Images	Steve Göring et.al.	2502.14013	link
2025-02-19	Toward Robust Non-Transferable Learning: A Survey and Benchmark	Ziming Hong et.al.	2502.13593	link
2025-02-19	Enhancing Machine Learning Potentials through Transfer Learning across Chemical Elements	Sebastien Röcken et.al.	2502.13522	null
2025-02-18	Performance Evaluation of Sentiment Analysis on Text and Emoji Data Using End-to-End, Transfer Learning, Distributed and Explainable AI Models	Sirisha Velampalli et.al.	2502.13278	null
2025-02-18	Pre-training Auto-regressive Robotic Models with 4D Representations	Dantong Niu et.al.	2502.13142	null
2025-02-18	Detection and Geographic Localization of Natural Objects in the Wild: A Case Study on Palms	Kangning Cui et.al.	2502.13023	null
2025-02-18	Universal Embedding Function for Traffic Classification via QUIC Domain Recognition Pretraining: A Transfer Learning Success	Jan Luxemburk et.al.	2502.12930	link
2025-02-18	Unsupervised optimal deep transfer learning for classification under general conditional shift	Junjun Lang et.al.	2502.12729	null
2025-02-18	NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation	Zhiyuan Liu et.al.	2502.12638	link
2025-02-17	PreAdaptFWI: Pretrained-Based Adaptive Residual Learning for Full-Waveform Inversion Without Dataset Dependency	Xintong Dong et.al.	2502.11913	null
2025-02-17	M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis	Chengyan Wu et.al.	2502.11824	link
2025-02-17	Transfer Learning of CATE with Kernel Ridge Regression	Seok-Jin Kim et.al.	2502.11331	link
2025-02-16	Detecting Cadastral Boundary from Satellite Images Using U-Net model	Neda Rahimpour Anaraki et.al.	2502.11044	null
2025-02-15	Controlling Neural Collapse Enhances Out-of-Distribution Detection and Transfer Learning	Md Yousuf Harun et.al.	2502.10691	null
2025-02-14	SPIRIT: Short-term Prediction of solar IRradIance for zero-shot Transfer learning using Foundation Models	Aditya Mishra et.al.	2502.10307	null
2025-02-19	ExoMiner++ on TESS with Transfer Learning from Kepler: Transit Classification and Vetting Catalog for 2-min Data	Hamed Valizadegan et.al.	2502.09790	null
2025-02-13	NeuralCFD: Deep Learning on High-Fidelity Automotive Aerodynamics Simulations	Maurits Bleeker et.al.	2502.09692	null
2025-02-13	A Survey of Reinforcement Learning for Optimization in Automation	Ahmad Farooq et.al.	2502.09417	null
2025-02-13	Revisiting Euclidean Alignment for Transfer Learning in EEG-Based Brain-Computer Interfaces	Dongrui Wu et.al.	2502.09203	null
2025-02-13	A Hybrid Model for Few-Shot Text Classification Using Transfer and Meta-Learning	Jia Gao et.al.	2502.09086	null
2025-02-12	CSMAE: Cataract Surgical Masked Autoencoder (MAE) based Pre-training	Nisarg A. Shah et.al.	2502.08822	null
2025-02-12	Advancing machine fault diagnosis: A detailed examination of convolutional neural networks	Govind Vashishtha et.al.	2502.08689	null
2025-02-14	Multifidelity Simulation-based Inference for Computationally Expensive Simulators	Anastasia N. Krouglova et.al.	2502.08416	null
2025-02-12	Hi-End-MAE: Hierarchical encoder-driven masked autoencoders are stronger vision learners for medical image segmentation	Fenghe Tang et.al.	2502.08347	link
2025-02-12	Knowledge-Guided Wasserstein Distributionally Robust Optimization	Zitao Wang et.al.	2502.08146	null
2025-02-11	Instance-dependent Early Stopping	Suqin Yuan et.al.	2502.07547	link
2025-02-12	Music for All: Exploring Multicultural Representations in Music Generation Models	Atharva Mehta et.al.	2502.07328	link
2025-02-11	Long-term simulation of physical and mechanical behaviors using curriculum-transfer-learning based physics-informed neural networks	Yuan Guo et.al.	2502.07325	null
2025-02-11	Robust Indoor Localization in Dynamic Environments: A Multi-source Unsupervised Domain Adaptation Framework	Jiyu Jiao et.al.	2502.07246	null
2025-02-11	Tab2Visual: Overcoming Limited Data in Tabular Data Classification Using Deep Learning with Visual Representations	Ahmed Mamdouh et.al.	2502.07181	null
2025-02-10	Cross-platform Learning-based Fault Tolerant Surfacing Controller for Underwater Robots	Yuya Hamamatsu et.al.	2502.07133	null
2025-02-10	Generative Distribution Prediction: A Unified Approach to Multimodal Learning	Xinyu Tian et.al.	2502.07090	null
2025-02-10	Model Diffusion for Certifiable Few-shot Transfer Learning	Fady Rezk et.al.	2502.06970	null
2025-02-08	Topological derivative approach for deep neural network architecture adaptation	C G Krishnanunni et.al.	2502.06885	null
2025-02-10	Institutional Preferences in the Laboratory	Qiankun Zhong et.al.	2502.06748	null
2025-02-10	Hyperparameters in Score-Based Membership Inference Attacks	Gauri Pradhan et.al.	2502.06374	link
2025-02-10	A Data-Efficient Pan-Tumor Foundation Model for Oncology CT Interpretation	Wenhui Lei et.al.	2502.06171	null
2025-02-10	Low Tensor-Rank Adaptation of Kolmogorov–Arnold Networks	Yihang Gao et.al.	2502.06153	null
2025-02-09	Estimation with missing not at random binary outcomes via exponential tilts	Subha Maity et.al.	2502.06046	link
2025-02-09	Protecting Intellectual Property of EEG-based Neural Networks with Watermarking	Ahmed Abdelaziz et.al.	2502.05931	link
2025-02-09	Target Speaker Lipreading by Audio-Visual Self-Distillation Pretraining and Speaker Adaptation	Jing-Xuan Zhang et.al.	2502.05758	null
2025-02-08	Coalition Formation for Heterogeneous Federated Learning Enabled Channel Estimation in RIS-assisted Cell-free MIMO	Nan Qi et.al.	2502.05538	null
2025-02-07	Evaluating Standard and Dialectal Frisian ASR: Multilingual Fine-tuning and Language Identification for Improved Low-resource Performance	Reihaneh Amooie et.al.	2502.04883	null
2025-02-07	Self-Supervised Learning for Pre-training Capsule Networks: Overcoming Medical Imaging Dataset Challenges	Heba El-Shimy et.al.	2502.04748	null
2025-02-07	Performance Evaluation of Image Enhancement Techniques on Transfer Learning for Touchless Fingerprint Recognition	S Sreehari et.al.	2502.04680	null
2025-02-06	Provable Sample-Efficient Transfer Learning Conditional Diffusion Models via Representation Learning	Ziheng Cheng et.al.	2502.04491	null
2025-02-06	Multi-fidelity emulator for large-scale 21 cm lightcone images: a few-shot transfer learning approach with generative adversarial network	Kangning Diao et.al.	2502.04246	null
2025-02-06	A Theoretical Framework for Data Efficient Multi-Source Transfer Learning Based on Cramér-Rao Bound	Qingyue Zhang et.al.	2502.04242	null
2025-02-06	Transfer Learning for Covert Speech Classification Using EEG Hilbert Envelope and Temporal Fine Structure	Saravanakumar Duraisamy et.al.	2502.04132	null
2025-02-06	Exploring Group Convolutional Networks for Sign Problem Mitigation via Contour Deformation	Christoph Gäntgen et.al.	2502.04104	null
2025-02-06	Generalize Drug Response Prediction by Latent Independent Projection for Asymmetric Constrained Domain Generalization	Ran Song et.al.	2502.04034	null
2025-02-06	ICGNN: Graph Neural Network Enabled Scalable Beamforming for MISO Interference Channels	Changpeng He et.al.	2502.03936	null
2025-02-06	SWIPTNet: A Unified Deep Learning Framework for SWIPT based on GNN and Transfer Learning	Hong Han et.al.	2502.03928	null
2025-02-06	Self-Supervised Learning for Solar Radio Spectrum Classification	Siqi Li et.al.	2502.03778	null
2025-02-05	Prediction of the Most Fire-Sensitive Point in Building Structures with Differentiable Agents for Thermal Simulators	Yuan Xinjie et.al.	2502.03424	null
2025-02-05	DES to HSC: Detecting low surface brightness galaxies in the Abell 194 cluster using transfer learning	H. Thuruthipilly et.al.	2502.03142	null
2025-02-05	TopoCL: Topological Contrastive Learning for Time Series	Namwoo Kim et.al.	2502.02924	null
2025-02-04	Cross-Lingual Transfer for Low-Resource Natural Language Processing	Iker García-Ferrero et.al.	2502.02722	null
2025-02-05	Beyond English: Evaluating Automated Measurement of Moral Foundations in Non-English Discourse with a Chinese Case Study	Calvin Yixiang Cheng et.al.	2502.02451	link
2025-02-04	Self-Supervised Convolutional Audio Models are Flexible Acoustic Feature Learners: A Domain Specificity and Transfer-Learning Study	Mattson Ogg et.al.	2502.02366	link
2025-02-04	Transfer Risk Map: Mitigating Pixel-level Negative Transfer in Medical Segmentation	Shutong Duan et.al.	2502.02340	null
2025-02-03	Geometric Framework for 3D Cell Segmentation Correction	Peter Chen et.al.	2502.01890	null
2025-02-03	Learning Hyperparameters via a Data-Emphasized Variational Objective	Ethan Harvey et.al.	2502.01861	link
2025-02-03	Grokking Explained: A Statistical Phenomenon	Breno W. Carvalho et.al.	2502.01774	null
2025-02-03	Towards Robust and Generalizable Lensless Imaging with Modular Learned Reconstruction	Eric Bezzam et.al.	2502.01102	null
2025-02-02	Fruit Fly Classification (Diptera: Tephritidae) in Images, Applying Transfer Learning	Erick Andrew Bustamante Flores et.al.	2502.00939	null
2025-02-02	UniGraph2: Learning a Unified Embedding Space to Bind Multimodal Graphs	Yufei He et.al.	2502.00806	link
2025-02-02	Transfer Learning in Physics-Informed Neural Networks: Full Fine-Tuning, Lightweight Fine-Tuning, and Low-Rank Adaptation	Yizheng Wang et.al.	2502.00782	null
2025-02-02	Role of Mixup in Topological Persistence Based Knowledge Distillation for Wearable Sensor Data	Eun Som Jeon et.al.	2502.00779	null
2025-02-01	SSRepL-ADHD: Adaptive Complex Representation Learning Framework for ADHD Detection from Visual Attention Tasks	Abdul Rehman et.al.	2502.00376	null
2025-02-01	Machine Learning Models for Reinforced Concrete Pipes Condition Prediction: The State-of-the-Art Using Artificial Neural Networks and Multiple Linear Regression in a Wisconsin Case Study	Mohsen Mohammadagha et.al.	2502.00363	null
2025-02-01	MonoDINO-DETR: Depth-Enhanced Monocular 3D Object Detection Using a Vision Foundation Model	Jihyeok Kim et.al.	2502.00315	null
2025-01-31	Improving Quality Control Of MRI Images Using Synthetic Motion Data	Charles Bricout et.al.	2502.00160	null
2025-01-31	Exploring Transfer Learning for Deep Learning Polyp Detection in Colonoscopy Images Using YOLOv8	Fabian Vazquez et.al.	2502.00133	null
2025-01-31	SynthmanticLiDAR: A Synthetic Dataset for Semantic Segmentation on LiDAR Imaging	Javier Montalvo et.al.	2501.19035	link
2025-01-31	Lightspeed Geometric Dataset Distance via Sliced Optimal Transport	Khai Nguyen et.al.	2501.18901	link
2025-01-31	Transfer Learning for Nonparametric Contextual Dynamic Pricing	Fan Wang et.al.	2501.18836	link
2025-01-31	Early Diagnosis and Severity Assessment of Weligama Coconut Leaf Wilt Disease and Coconut Caterpillar Infestation using Deep Learning-based Image Processing Techniques	Samitha Vidhanaarachchi et.al.	2501.18835	null
2025-01-30	Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images	Wei-Lun Chen et.al.	2501.18453	null
2025-01-30	Function Encoders: A Principled Approach to Transfer Learning in Hilbert Spaces	Tyler Ingebrand et.al.	2501.18373	null
2025-01-30	Transfer Learning of Surrogate Models: Integrating Domain Warping and Affine Transformations	Shuaiqun Pan et.al.	2501.18344	null
2025-01-30	Advancing Personalized Federated Learning: Integrative Approaches with AI for Enhanced Privacy and Customization	Kevin Cooper et.al.	2501.18174	null
2025-01-29	Digital Twin-Enabled Real-Time Control in Robotic Additive Manufacturing via Soft Actor-Critic Reinforcement Learning	Matsive Ali et.al.	2501.18016	null
2025-01-29	LEKA: LLM-Enhanced Knowledge Augmentation	Xinhao Zhang et.al.	2501.17802	null
2025-01-29	Action Recognition Using Temporal Shift Module and Ensemble Learning	Anh-Kiet Duong et.al.	2501.17550	link
2025-01-29	EMD-Fuzzy: An Empirical Mode Decomposition Based Fuzzy Model for Cross-Stimulus Transfer Learning of SSVEP	Beining Cao et.al.	2501.17475	null
2025-01-29	Fundamental Computational Limits in Pursuing Invariant Causal Prediction and Invariance-Guided Regularization	Yihong Gu et.al.	2501.17354	null
2025-01-28	Stiff Transfer Learning for Physics-Informed Neural Networks	Emilien Seiler et.al.	2501.17281	null
2025-01-28	CoRe-Net: Co-Operational Regressor Network with Progressive Transfer Learning for Blind Radar Signal Restoration	Muhammad Uzair Zahid et.al.	2501.17125	null
2025-01-31	Multimodal Magic Elevating Depression Detection with a Fusion of Text and Audio Intelligence	Lindy Gan et.al.	2501.16813	null
2025-01-28	Molecular-driven Foundation Model for Oncologic Pathology	Anurag Vaidya et.al.	2501.16652	link
2025-01-27	Automatic Machine Learning Framework to Study Morphological Parameters of AGN Host Galaxies within $z < 1.4$ in the Hyper Supreme-Cam Wide Survey	Chuan Tian et.al.	2501.15739	link
2025-01-26	Building Efficient Lightweight CNN Models	Nathan Isong et.al.	2501.15547	null
2025-01-26	Universal Image Restoration Pre-training via Degradation Classification	JiaKui Hu et.al.	2501.15510	link
2025-01-26	Expert-Free Online Transfer Learning in Multi-Agent Reinforcement Learning	Alberto Castagna et.al.	2501.15495	link
2025-01-26	Cross-Modal Transfer from Memes to Videos: Addressing Data Scarcity in Hateful Video Detection	Han Wang et.al.	2501.15438	link
2025-01-26	A Transfer Learning Framework for Anomaly Detection in Multivariate IoT Traffic Data	Mahshid Rezakhani et.al.	2501.15365	null
2025-01-25	Explainable YOLO-Based Dyslexia Detection in Synthetic Handwriting Data	Nora Fink et.al.	2501.15263	null
2025-01-25	In-Context Operator Learning for Linear Propagator Models	Tingwei Meng et.al.	2501.15106	null
2025-01-24	A Recurrent Spiking Network with Hierarchical Intrinsic Excitability Modulation for Schema Learning	Yingchao Yu et.al.	2501.14539	null
2025-01-24	Quantum Neural Networks: A Comparative Analysis and Noise Robustness Evaluation	Tasnim Ahmed et.al.	2501.14412	null
2025-01-24	Deep Learning-Powered Classification of Thoracic Diseases in Chest X-Rays	Yiming Lei et.al.	2501.14279	null
2025-01-24	Detection and Classification of Acute Lymphoblastic Leukemia Utilizing Deep Transfer Learning	Md. Abu Ahnaf Mollick et.al.	2501.14228	null
2025-01-23	On the Transfer of Knowledge in Quantum Algorithms	Esther Villar-Rodriguez et.al.	2501.14120	null
2025-01-23	Transfer Learning of Surrogate Models via Domain Affine Transformation Across Synthetic and Real-World Benchmarks	Shuaiqun Pan et.al.	2501.14012	null
2025-01-23	2-Tier SimCSE: Elevating BERT for Robust Sentence Embeddings	Yumeng Wang et.al.	2501.13758	null
2025-01-23	Skin Disease Detection and Classification of Actinic Keratosis and Psoriasis Utilizing Deep Transfer Learning	Fahud Ahmmed et.al.	2501.13713	null
2025-01-23	GenTL: A General Transfer Learning Model for Building Thermal Dynamics	Fabian Raisch et.al.	2501.13703	link
2025-01-23	WFCRL: A Multi-Agent Reinforcement Learning Benchmark for Wind Farm Control	Claire Bizon Monroc et.al.	2501.13592	link
2025-01-23	NUDT4MSTAR: A New Dataset and Benchmark Towards SAR Target Recognition in the Wild	Yongxiang Liu et.al.	2501.13354	link
2025-01-22	Multimodal AI on Wound Images and Clinical Notes for Home Patient Referral	Reza Saadati Fard et.al.	2501.13247	null
2025-01-22	LLM4WM: Adapting LLM for Wireless Multi-Tasking	Xuanyu Liu et.al.	2501.12983	null
2025-01-21	Bidirectional Brain Image Translation using Transfer Learning from Generic Pre-trained Models	Fatima Haimour et.al.	2501.12488	null
2025-01-21	Transfer learning electronic structure: millielectron volt accuracy for sub-million-atom moiré semiconductor	Ting Bao et.al.	2501.12452	null
2025-01-21	Tackling Small Sample Survival Analysis via Transfer Learning: A Study of Colorectal Cancer Prognosis	Yonghao Zhao et.al.	2501.12421	link
2025-01-21	Efficient PINNs: Multi-Head Unimodular Regularization of the Solutions Space	Pedro Tarancón-Álvarez et.al.	2501.12116	null
2025-01-21	Multi-Modal Variable-Rate CSI Reconstruction for FDD Massive MIMO Systems	Yunseo Nam et.al.	2501.11926	null
2025-01-20	Rethinking Membership Inference Attacks Against Transfer Learning	Cong Wu et.al.	2501.11577	null
2025-01-20	On the Adversarial Vulnerabilities of Transfer Learning in Remote Sensing	Tao Bai et.al.	2501.11462	null
2025-01-20	How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks?	Wenxuan Li et.al.	2501.11253	link
2025-01-20	Energy Consumption Reduction for UAV Trajectory Training: A Transfer Learning Approach	Chenrui Sun et.al.	2501.11243	null
2025-01-19	Enhancing Brain Tumor Segmentation Using Channel Attention and Transfer learning	Majid Behzadpour et.al.	2501.11196	link
2025-01-19	Transfer Learning Strategies for Pathological Foundation Models: A Systematic Evaluation in Brain Tumor Classification	Ken Enda et.al.	2501.11014	null
2025-01-19	BeST – A Novel Source Selection Metric for Transfer Learning	Ashutosh Soni et.al.	2501.10933	null
2025-01-19	Adaptive Target Localization under Uncertainty using Multi-Agent Deep Reinforcement Learning with Knowledge Transfer	Ahmed Alagha et.al.	2501.10924	null
2025-01-18	Model-Robust and Adaptive-Optimal Transfer Learning for Tackling Concept Shifts in Nonparametric Regression	Haotian Lin et.al.	2501.10870	null
2025-01-18	A Resource-Efficient Training Framework for Remote Sensing Text–Image Retrieval	Weihang Zhang et.al.	2501.10638	null
2025-01-17	Surrogate-based multiscale analysis of experiments on thermoplastic composites under off-axis loading	M. A. Maia et.al.	2501.10193	link
2025-01-17	Automatic Speech Recognition for Sanskrit with Transfer Learning	Bidit Sadhukhan et.al.	2501.10024	null
2025-01-16	Sequential PatchCore: Anomaly Detection for Surface Inspection using Synthetic Impurities	Runzhou Mao et.al.	2501.09579	null
2025-01-16	Transfer learning of many-body electronic correlation entropy from local measurements	Faluke Aikebaier et.al.	2501.09505	null
2025-01-15	An analysis of data variation and bias in image-based dermatological datasets for machine learning classification	Francisco Mauro et.al.	2501.08962	null
2025-01-15	Empowering Agricultural Insights: RiceLeafBD – A Novel Dataset and Optimal Model Selection for Rice Leaf Disease Diagnosis through Transfer Learning Technique	Sadia Afrin Rimi et.al.	2501.08912	null
2025-01-15	A Bayesian Hierarchical Model for Generating Synthetic Unbalanced Power Distribution Grids	Henrique O. Caetano et.al.	2501.08808	null
2025-01-15	Detecting Wildfire Flame and Smoke through Edge Computing using Transfer Learning Enhanced Deep Learning Models	Giovanny Vazquez et.al.	2501.08639	null
2025-01-15	Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation	Jiaqi Huang et.al.	2501.08580	link
2025-01-14	Mechanics Informatics: A paradigm for efficiently learning constitutive models	Royal C. Ihuaenyi et.al.	2501.08314	null
2025-01-14	Continual Deep Active Learning for Medical Imaging: Replay-Base Architecture for Context Adaptation	Rui Daniel et.al.	2501.08245	link
2025-01-14	Optimal Policy Adaptation under Covariate Shift	Xueqing Liu et.al.	2501.08067	null
2025-01-16	Mining Intraday Risk Factor Collections via Hierarchical Reinforcement Learning based on Transferred Options	Wenyan Xu et.al.	2501.07274	link
2025-01-13	Exploring the Use of Contrastive Language-Image Pre-Training for Human Posture Classification: Insights from Yoga Pose Analysis	Andrzej D. Dobrzycki et.al.	2501.07221	null
2025-01-13	AlgoRxplorers | Precision in Mutation – Enhancing Drug Design with Advanced Protein Stability Prediction Tools	Karishma Thakrar et.al.	2501.07014
2025-01-12	Towards Fair and Privacy-Aware Transfer Learning for Educational Predictive Modeling: A Case Study on Retention Prediction in Community Colleges	Chengyuan Yao et.al.	2501.06913	link
2025-01-12	Transfer Learning of Tabular Data by Finetuning Large Language Models	Shourav B. Rabbani et.al.	2501.06863	null
2025-01-12	Rice Leaf Disease Detection: A Comparative Study Between CNN, Transformer and Non-neural Network Architectures	Samia Mehnaz et.al.	2501.06740	null
2025-01-12	Hold On! Is My Feedback Useful? Evaluating the Usefulness of Code Review Comments	Sharif Ahmed et.al.	2501.06738	null
2025-01-11	Transforming Social Science Research with Transfer Learning: Social Science Survey Data Integration with AI	Ali Amini et.al.	2501.06577	null
2025-01-11	Mathematics of Digital Twins and Transfer Learning for PDE Models	Yifei Zong et.al.	2501.06400	null
2025-01-10	IoT Firmware Version Identification Using Transfer Learning with Twin Neural Networks	Ashley Andrews et.al.	2501.06033	null
2025-01-09	Patch-GAN Transfer Learning with Reconstructive Models for Cloud Removal	Wanli Ma et.al.	2501.05265	null
2025-01-09	Load Forecasting for Households and Energy Communities: Are Deep Learning Models Worth the Effort?	Lukas Moosbrugger et.al.	2501.05000	link
2025-01-09	A CT Image Classification Network Framework for Lung Tumors Based on Pre-trained MobileNetV2 Model and Transfer learning, And Its Application and Market Analysis in the Medical field	Ziyang Gao et.al.	2501.04996	null
2025-01-09	AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data	Haoran Zhu et.al.	2501.04969	link
2025-01-08	Deep Transfer $Q$-Learning for Offline Non-Stationary Reinforcement Learning	Jinhang Chai et.al.	2501.04870	null
2025-01-08	Cued Speech Generation Leveraging a Pre-trained Audiovisual Text-to-Speech Model	Sanjana Sankar et.al.	2501.04799	null
2025-01-08	Rapid Automated Mapping of Clouds on Titan With Instance Segmentation	Zachary Yahn et.al.	2501.04459	link
2025-01-08	A novel Facial Recognition technique with Focusing on Masked Faces	Dana A Abdullah et.al.	2501.04444	null
2025-01-08	TADFormer: Task-Adaptive Dynamic Transformer for Efficient Multi-Task Learning	Seungmin Baek et.al.	2501.04293	null
2025-01-08	Comparison of Neural Models for X-ray Image Classification in COVID-19 Detection	Jimi Togni et.al.	2501.04196	null
2025-01-07	DeepVIVONet: Using deep neural operators to optimize sensor locations with application to vortex-induced vibrations	Ruyin Wan et.al.	2501.04105	null
2025-01-07	Improving Dialectal Slot and Intent Detection with Auxiliary Tasks: A Multi-Dialectal Bavarian Case Study	Xaver Maria Krückl et.al.	2501.03863	link
2025-01-07	SelectiveFinetuning: Enhancing Transfer Learning in Sleep Staging through Selective Domain Alignment	Siyuan Zhao et.al.	2501.03764	null
2025-01-07	A Multimodal Lightweight Approach to Fault Diagnosis of Induction Motors in High-Dimensional Dataset	Usman Ali et.al.	2501.03746	null
2025-01-07	Transfer Learning for Deep-Unfolded Combinatorial Optimization Solver with Quantum Annealer	Ryo Hagiwara et.al.	2501.03518	null
2025-01-06	FTA-FTL: A Fine-Tuned Aggregation Federated Transfer Learning Scheme for Lithology Microscopic Image Classification	Keyvan RahimiZadeh et.al.	2501.03349	link
2025-01-06	CM3T: Framework for Efficient Multimodal Learning for Inhomogeneous Interaction Datasets	Tanay Agrawal et.al.	2501.03332	null
2025-01-06	Scalable Forward-Forward Algorithm	Andrii Krutsylo et.al.	2501.03176	null
2025-01-06	Offline-to-online hyperparameter transfer for stochastic bandits	Dravyansh Sharma et.al.	2501.02926	null
2025-01-06	Hybrid deep convolution model for lung cancer detection with transfer learning	Sugandha Saxena et.al.	2501.02785	null
2025-01-08	Transfer learning via Regularized Linear Discriminant Analysis	Hongzhe Zhang et.al.	2501.02411	null
2025-01-04	tCURLoRA: Tensor CUR Decomposition Based Low-Rank Parameter Adaptation for Medical Image Segmentation	Guanghua He et.al.	2501.02227	null
2025-01-03	Transfer Learning for Individualized Treatment Rules: Application to Sepsis Patients Data from eICU-CRD and MIMIC-III Databases	Andong Wang et.al.	2501.02128	null
2025-01-03	Google is all you need: Semi-Supervised Transfer Learning Strategy For Light Multimodal Multi-Task Classification Model	Haixu Liu et.al.	2501.01611	null
2025-01-02	Transfer Neyman-Pearson Algorithm for Outlier Detection	Mohammadreza M. Kalan et.al.	2501.01525	null
2025-01-02	Transfer Learning Analysis of Variational Quantum Circuits	Huan-Hsin Tseng et.al.	2501.01507	null
2025-01-02	Robust COVID-19 Detection from Cough Sounds using Deep Neural Decision Tree and Forest: A Comprehensive Cross-Datasets Evaluation	Rofiqul Islam et.al.	2501.01117	null
2025-01-02	SpecPT (Spectroscopy Pre-trained Transformer) Model for Extragalactic Spectroscopy: I. Architecture and Automated Redshift Measurement	Rohan Pattnaik et.al.	2501.01070	null
2025-01-02	Prediction of Geoeffective CMEs Using SOHO Images and Deep Learning	Khalid A. Alobaid et.al.	2501.01011	null
2025-01-02	Is It Still Fair? Investigating Gender Fairness in Cross-Corpus Speech Emotion Recognition	Shreya G. Upadhyay et.al.	2501.00995	null
2025-01-01	Active and transfer learning with partially Bayesian neural networks for materials and chemicals	Sarah I. Allec et.al.	2501.00952	link
2025-01-01	Intent-based Radio Scheduler for RAN Slicing: Learning to deal with different network scenarios	Cleverson Nahum et.al.	2501.00950	link
2025-01-01	Navigating Nuance: In Quest for Political Truth	Soumyadeep Sar et.al.	2501.00782	link
2024-12-31	Advanced Lung Nodule Segmentation and Classification for Early Detection of Lung Cancer using SAM and Transfer Learning	Asha V et.al.	2501.00586	null
2024-12-31	Addressing Challenges in Data Quality and Model Generalization for Malaria Detection	Kiswendsida Kisito Kabore et.al.	2501.00464	null
2024-12-30	Class-based Subset Selection for Transfer Learning under Extreme Label Shift	Akul Goyal et.al.	2501.00162	null
2024-12-29	On Adversarial Robustness of Language Models in Transfer Learning	Bohdan Turbal et.al.	2501.00066	null
2024-12-28	VisTabNet: Adapting Vision Transformers for Tabular Data	Witold Wydmański et.al.	2501.00057	null
2024-12-28	LLM-Virus: Evolutionary Jailbreak Attack on Large Language Models	Miao Yu et.al.	2501.00055	link
2024-12-30	Investigating layer-selective transfer learning of QAOA parameters for Max-Cut problem	Francesco Aldo Venturelli et.al.	2412.21071	null
2024-12-30	Improving Location-based Thermal Emission Side-Channel Analysis Using Iterative Transfer Learning	Tun-Chieh Lou et.al.	2412.21030	null
2024-12-30	Attention Is All You Need For Mixture-of-Depths Routing	Advait Gadhikar et.al.	2412.20875	null
2024-12-30	Sample Correlation for Fingerprinting Deep Face Recognition	Jiyang Guan et.al.	2412.20768	link
2024-12-30	Depression and Anxiety Prediction Using Deep Language Models and Transfer Learning	Tomasz Rutowski et.al.	2412.20741	null
2024-12-29	LEARNER: A Transfer Learning Method for Low-Rank Matrix Estimation	Sean McGrath et.al.	2412.20605	link
2024-12-28	Enhancing Transfer Learning for Medical Image Classification with SMOTE: A Comparative Study	Md. Zehan Alam et.al.	2412.20235	null
2024-12-28	SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object Detection	Phi Vu Tran et.al.	2412.20047	link
2024-12-28	Uncertainty Quantified Deep Learning and Regression Analysis Framework for Image Segmentation of Skin Cancer Lesions	Elhoucine Elfatimi et.al.	2412.20007	link
2024-12-27	Data-driven tool wear prediction in milling, based on a process-integrated single-sensor approach	Eric Hirsch et.al.	2412.19950	null
2024-12-27	Mouth Articulation-Based Anchoring for Improved Cross-Corpus Speech Emotion Recognition	Shreya G. Upadhyay et.al.	2412.19909	null
2024-12-27	EEG-Reptile: An Automatized Reptile-Based Meta-Learning Library for BCIs	Daniil A. Berdyshev et.al.	2412.19725	link
2024-12-27	Feature Alignment-Based Knowledge Distillation for Efficient Compression of Large Language Models	Shuo Wang et.al.	2412.19449	null
2024-12-26	Large Language Models for Market Research: A Data-augmentation Approach	Mengxin Wang et.al.	2412.19363	null
2024-12-26	Assessing Pre-trained Models for Transfer Learning through Distribution of Spectral Components	Tengxue Zhang et.al.	2412.19085	null
2024-12-26	Robust Speech and Natural Language Processing Models for Depression Screening	Y. Lu et.al.	2412.19072	null
2024-12-24	On the Applicability of Zero-Shot Cross-Lingual Transfer Learning for Sentiment Classification in Distant Language Pairs	Andre Rusli et.al.	2412.18188	link
2024-12-24	Text-Aware Adapter for Few-Shot Keyword Spotting	Youngmoon Jung et.al.	2412.18142	null
2024-12-24	Heterogeneous transfer learning for high dimensional regression with feature mismatch	Jae Ho Chang et.al.	2412.18081	null
2024-12-24	SMAC-Hard: Enabling Mixed Opponent Strategy Script and Self-play on SMAC	Yue Deng et.al.	2412.17707	link
2024-12-23	Improved Cotton Leaf Disease Classification Using Parameter-Efficient Deep Learning Framework	Aswini Kumar Patra et.al.	2412.17587	null
2024-12-23	CALLIC: Content Adaptive Learning for Lossless Image Compression	Daxin Li et.al.	2412.17464	null
2024-12-23	Feature Based Methods Domain Adaptation for Object Detection: A Review Paper	Helia Mohamadi et.al.	2412.17325	null
2024-12-23	On the Feasibility of Vision-Language Models for Time-Series Classification	Vinay Prithyani et.al.	2412.17304	link
2024-12-23	Trainingless Adaptation of Pretrained Models for Environmental Sound Classification	Noriyuki Tonami et.al.	2412.17212	null
2024-12-24	Semantic Hierarchical Prompt Tuning for Parameter-Efficient Fine-Tuning	Haowei Zhu et.al.	2412.16956	link
2024-12-22	Speech-Based Depression Prediction Using Encoder-Weight-Only Transfer Learning and a Large Corpus	Amir Harati et.al.	2412.16900	null
2024-12-21	The Master Key Filters Hypothesis: Deep Filters Are General in DS-CNNs	Zahra Babaiee et.al.	2412.16751	null
2024-12-21	Optoelectronic generative adversarial networks	Jumin Qiu et.al.	2412.16672	link
2024-12-21	IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks	Yaming Zhang et.al.	2412.16654	link
2024-12-21	Learning for Cross-Layer Resource Allocation in MEC-Aided Cell-Free Networks	Chong Zheng et.al.	2412.16565	null
2024-12-20	SeagrassFinder: Deep Learning for Eelgrass Detection and Coverage Estimation in the Wild	Jannik Elsäßer et.al.	2412.16147	null
2024-12-20	Monkey Transfer Learning Can Improve Human Pose Estimation	Bradley Scott et.al.	2412.15966	null
2024-12-20	Polaris: Multi-Fidelity Design Space Exploration of Deep Learning Accelerators	Chirag Sakhuja et.al.	2412.15548	null
2024-12-20	The First Multilingual Model For The Detection of Suicide Texts	Rodolfo Zevallos et.al.	2412.15498	null
2024-12-19	A Multi-Fidelity Graph U-Net Model for Accelerated Physics Simulations	Rini Jasmine Gladstone et.al.	2412.15372	null
2024-12-19	Transfer Learning Meets Functional Linear Regression: No Negative Transfer under Posterior Drift	Xiaoyu Hu et.al.	2412.14563	null
2024-12-19	Color Enhancement for V-PCC Compressed Point Cloud via 2D Attribute Map Optimization	Jingwei Bao et.al.	2412.14449	null
2024-12-18	Super-Resolution Generative Adversarial Network for Data Compression of Direct Numerical Simulations	Ludovico Nista et.al.	2412.14150	null
2024-12-18	Trustworthy Transfer Learning: A Survey	Jun Wu et.al.	2412.14116	null
2024-12-18	Language verY Rare for All	Ibrahim Merad et.al.	2412.13924	null
2024-12-18	Understanding and Analyzing Model Robustness and Knowledge-Transfer in Multilingual Neural Machine Translation using TX-Ray	Vageesh Saxena et.al.	2412.13881	null
2024-12-18	FlexPose: Pose Distribution Adaptation with Limited Guidance	Zixiao Wang et.al.	2412.13463	null
2024-12-17	Deep Speech Synthesis from Multimodal Articulatory Representations	Peter Wu et.al.	2412.13387	null
2024-12-16	A Digital twin for Diesel Engines: Operator-infused PINNs with Transfer Learning for Engine Health Monitoring	Kamaljyoti Nath et.al.	2412.11967	null
2024-12-16	Prediction of social dilemmas in networked populations via graph neural networks	Huaiyu Tan et.al.	2412.11775	null
2024-12-16	Classification of Spiral Galaxies by Spiral Arm Number using Convolutional Neural Network	Ming Wei Lee et.al.	2412.11696	null
2024-12-18	CiTrus: Squeezing Extra Performance out of Low-data Bio-signal Transfer Learning	Eloy Geenjaar et.al.	2412.11695	null
2024-12-16	Fast-staged CNN Model for Accurate pulmonary diseases and Lung cancer detection	Abdelbaki Souid et.al.	2412.11681	null
2024-12-16	Multilabel Classification for Lung Disease Detection: Integrating Deep Learning and Natural Language Processing	Maria Efimovich et.al.	2412.11452	null
2024-12-16	Accurate, Robust and Privacy-Preserving Brain-Computer Interface Decoding	Xiaoqing Chen et.al.	2412.11390	null
2024-12-14	Global Estimation of Subsurface Eddy Kinetic Energy of Mesoscale Eddies Using a Multiple-input Residual Neural Network	Chenyue Xie et.al.	2412.10656	null
2024-12-13	Active Poisoning: Efficient Backdoor Attacks on Transfer Learning-Based Brain-Computer Interfaces	X. Jiang et.al.	2412.09933	null
2024-12-13	Data-Driven Transfer Learning Framework for Estimating Turning Movement Counts	Xiaobo Ma et.al.	2412.09861	null
2024-12-12	BayesAdapter: enhanced uncertainty estimation in CLIP few-shot adaptation	Pablo Morales-Álvarez et.al.	2412.09718	null
2024-12-12	A Novel Ensemble-Based Deep Learning Model with Explainable AI for Accurate Kidney Disease Diagnosis	Md. Arifuzzaman et.al.	2412.09472	null
2024-12-12	Text Generation Models for Luxembourgish with Limited Data: A Balanced Multilingual Strategy	Alistair Plum et.al.	2412.09415	null
2024-12-12	Prediction Aided by Surrogate Training	Eric Xia et.al.	2412.09364	null
2024-12-12	Stop Relearning: Model Reuse via Feature Distribution Analysis for Incremental Entity Resolution	Victor Christen et.al.	2412.09355	link
2024-12-12	Computer-Aided Osteoporosis Diagnosis Using Transfer Learning with Enhanced Features from Stacked Deep Learning Modules	Ayesha Siddiqua et.al.	2412.09330	null
2024-12-12	Transfer Learning of RSSI to Improve Indoor Localisation Performance	Thanaphon Suwannaphong et.al.	2412.09292	link
2024-12-12	Evaluating Pixel Language Models on Non-Standardized Languages	Alberto Muñoz-Ortiz et.al.	2412.09084	null
2024-12-16	Improvement in Sign Language Translation Using Text CTC Alignment	Sihan Tan et.al.	2412.09014	link
2024-12-12	A Wander Through the Multimodal Landscape: Efficient Transfer Learning via Low-rank Sequence Multimodal Adapter	Zirun Guo et.al.	2412.08979	link
2024-12-11	Improving Satellite Imagery Masking using Multi-task and Transfer Learning	Rangel Daroya et.al.	2412.08545	null
2024-12-11	ALoRE: Efficient Visual Adaptation via Aggregating Low Rank Experts	Sinan Du et.al.	2412.08341	null
2024-12-11	Unified HT-CNNs Architecture: Transfer Learning for Segmenting Diverse Brain Tumors in MRI from Gliomas to Pediatric Tumors	Ramy A. Zeineldin et.al.	2412.08240	null
2024-12-10	PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition	Kartik Narayan et.al.	2412.07771	null
2024-12-10	Real-time Sign Language Recognition Using MobileNetV2 and Transfer Learning	Smruti Jagtap et.al.	2412.07486	null
2024-12-10	T-TIME: Test-Time Information Maximization Ensemble for Plug-and-Play BCIs	Siyang Li et.al.	2412.07228	link
2024-12-10	Monte Carlo Tree Search based Space Transfer for Black-box Optimization	Shukuan Wang et.al.	2412.07186	link
2024-12-10	An Enhancement of CNN Algorithm for Rice Leaf Disease Image Classification in Mobile Applications	Kayne Uriel K. Rodrigo et.al.	2412.07182	null
2024-12-10	Annotation Techniques for Judo Combat Phase Classification from Tournament Footage	Anthony Miyaguchi et.al.	2412.07155	null
2024-12-10	Enhancing radioisotope identification in gamma spectra with transfer learning	Peter Lalor et.al.	2412.07069	null
2024-12-09	Using optimal control to guide neural-network interpolation of continuously-parameterized gates	Bikrant Bhattacharyya et.al.	2412.06623	link
2024-12-09	Representational Transfer Learning for Matrix Completion	Yong He et.al.	2412.06233	null
2024-12-09	SGIA: Enhancing Fine-Grained Visual Classification with Sequence Generative Image Augmentation	Qiyu Liao et.al.	2412.06138	null
2024-12-08	Self-Supervised Learning with Probabilistic Density Labeling for Rainfall Probability Estimation	Junha Lee et.al.	2412.05825	link
2024-12-07	Finite Element Neural Network Interpolation. Part I: Interpretable and Adaptive Discretization for Solving PDEs	Kateřina Škardová et.al.	2412.05719	link
2024-12-07	Finite Element Neural Network Interpolation. Part II: Hybridisation with the Proper Generalised Decomposition for non-linear surrogate modelling	Alexandre Daby-Seesaram et.al.	2412.05714	link
2024-12-05	Assessing and Learning Alignment of Unimodal Vision and Language Models	Le Zhang et.al.	2412.04616	null
2024-12-05	Moto: Latent Motion Token as the Bridging Language for Robot Manipulation	Yi Chen et.al.	2412.04445	link
2024-12-05	Adult Glioma Segmentation in Sub-Saharan Africa using Transfer Learning on Stratified Finetuning Data	Abhijeet Parida et.al.	2412.04111	null
2024-12-04	Automated galaxy sizes in Euclid images using the Segment Anything Model	J. Vega-Ferrero et.al.	2412.03642	link
2024-12-04	Streaming Detection of Queried Event Start	Cristobal Eyzaguirre et.al.	2412.03567	link
2024-12-04	Hybrid deep learning-based strategy for the hepatocellular carcinoma cancer grade classification of H&E stained liver histopathology images	Ajinkya Deshpande et.al.	2412.03084	null
2024-12-04	Bayesian Transfer Learning for Enhanced Estimation and Inference	Daoyuan Lai et.al.	2412.02986	null
2024-12-02	Pooling Solvent Mixtures for Solvation Free Energy Predictions	Roel J. Leenhouts et.al.	2412.01982	null
2024-12-02	The Evolution and Future Perspectives of Artificial Intelligence Generated Content	Chengzhang Zhu et.al.	2412.01948	null
2024-12-01	Pairwise Discernment of AffectNet Expressions with ArcFace	Dylan Waldner et.al.	2412.01860	null
2024-12-02	Transfer Learning for Control Systems via Neural Simulation Relations	Alireza Nadali et.al.	2412.01783	null
2024-12-02	FathomVerse: A community science dataset for ocean animal discovery	Genevieve Patterson et.al.	2412.01701	null
2024-12-02	Command-line Risk Classification using Transformer-based Neural Architectures	Paolo Notaro et.al.	2412.01655	null
2024-12-02	Task Adaptation of Reinforcement Learning-based NAS Agents through Transfer Learning	Amber Cassimon et.al.	2412.01420	null
2024-12-02	A Bottom-Up Approach to Optimizing the Solar Organic Rankine Cycle for Transactive Energy Trading	Silvia Anna Cordieri et.al.	2412.01359	null
2024-12-02	SiTSE: Sinhala Text Simplification Dataset and Evaluation	Surangika Ranathunga et.al.	2412.01293	link
2024-11-30	Pruned Convolutional Attention Network Based Wideband Spectrum Sensing with Sub-Nyquist Sampling	Peihao Dong et.al.	2412.00562	link
2024-11-29	Transfer Learning for High-dimensional Quantile Regression with Distribution Shift	Ruiqi Bai et.al.	2411.19933	null
2024-11-29	Towards Santali Linguistic Inclusion: Building the First Santali-to-English Translation Model using mT5 Transformer and Data Augmentation	Syed Mohammed Mostaque Billah et.al.	2411.19726	null
2024-11-28	Parameter-Efficient Transfer Learning for Music Foundation Models	Yiwei Ding et.al.	2411.19371	link
2024-11-28	Pre-Training Graph Contrastive Masked Autoencoders are Strong Distillers for EEG	Xinxu Wei et.al.	2411.19230	null
2024-11-28	TAMT: Temporal-Aware Model Tuning for Cross-Domain Few-Shot Action Recognition	Yilong Wang et.al.	2411.19041	link
2024-11-28	Data Augmentation with Diffusion Models for Colon Polyp Localization on the Low Data Regime: How much real data is enough?	Adrian Tormos et.al.	2411.18926	null
2024-11-27	Exponential Moving Average of Weights in Deep Learning: Dynamics and Benefits	Daniel Morales-Brotons et.al.	2411.18704	null
2024-11-27	What do physics-informed DeepONets learn? Understanding and improving training for scientific computing applications	Emily Williams et.al.	2411.18459	null
2024-11-27	Synthetic ECG Generation for Data Augmentation and Transfer Learning in Arrhythmia Classification	José Fernando Núñez et.al.	2411.18456	null
2024-11-27	Deep learning-based spatio-temporal fusion for high-fidelity ultra-high-speed x-ray radiography	Songyuan Tang et.al.	2411.18441	link
2024-11-27	Transfer Learning for Deep Learning-based Prediction of Lattice Thermal Conductivity	L. Klochko et.al.	2411.18259	link
2024-11-27	Leveraging Transfer Learning for Astronomical Image Analysis	Stefano Cavuoti et.al.	2411.18206	null
2024-11-27	Spectral-Spatial Transformer with Active Transfer Learning for Hyperspectral Image Classification	Muhammad Ahmad et.al.	2411.18115	link
2024-11-27	Using different sources of ground truths and transfer learning to improve the generalization of photometric redshift estimation	Jonathan Soriano et.al.	2411.18054	null
2024-11-27	Can bidirectional encoder become the ultimate winner for downstream applications of foundation models?	Lewen Yang et.al.	2411.18021	null
2024-11-26	Breast Tumor Classification Using EfficientNet Deep Learning Model	Majid Behzadpour et.al.	2411.17870	link
2024-11-26	“Nuclear thermometers” reveal the origin of the universal r-process nucleosynthesis	José Nicolás Orce et.al.	2411.17852	null
2024-11-26	On the Generalization of Handwritten Text Recognition Models	Carlos Garrido-Munoz et.al.	2411.17332	null
2024-11-26	MeerKAT discovery of a MIGHTEE Odd Radio Circle	Ray P. Norris et.al.	2411.17311	null
2024-11-26	Learning Hierarchical Polynomials of Multiple Nonlinear Features with Three-Layer Networks	Hengyu Fu et.al.	2411.17201	null
2024-11-26	Crack Detection in Infrastructure Using Transfer Learning, Spatial Attention, and Genetic Algorithm Optimization	Feng Ding et.al.	2411.17140	null
2024-11-25	Glo-In-One-v2: Holistic Identification of Glomerular Cells, Tissues, and Lesions in Human and Mouse Histopathology	Lining Yu et.al.	2411.16961	link
2024-11-25	SHuBERT: Self-Supervised Sign Language Representation Learning via Multi-Stream Cluster Prediction	Shester Gueuwou et.al.	2411.16765	null
2024-11-25	Towards Foundation Models for Critical Care Time Series	Manuel Burger et.al.	2411.16346	null
2024-11-25	Deep Learning for Motion Classification in Ankle Exoskeletons Using Surface EMG and IMU Signals	Silas Ruhrberg Estévez et.al.	2411.16273	null
2024-11-24	Deep Learning for automated multi-scale functional field boundaries extraction using multi-date Sentinel-2 and PlanetScope imagery: Case Study of Netherlands and Pakistan	Saba Zahid et.al.	2411.15923	null
2024-11-23	Trans-Glasso: A Transfer Learning Approach to Precision Matrix Estimation	Boxin Zhao et.al.	2411.15624	null
2024-11-23	MulModSeg: Enhancing Unpaired Multi-Modal Medical Image Segmentation with Modality-Conditioned Text Embedding and Alternating Training	Chengyin Li et.al.	2411.15576	link
2024-11-22	Personalization of Wearable Sensor-Based Joint Kinematic Estimation Using Computer Vision for Hip Exoskeleton Applications	Changseob Song et.al.	2411.15366	null
2024-11-21	Parameter Efficient Mamba Tuning via Projector-targeted Diagonal-centric Linear Transformation	Seokil Ham et.al.	2411.15224	null
2024-11-22	Towards Speaker Identification with Minimal Dataset and Constrained Resources using 1D-Convolution Neural Network	Irfan Nafiz Shahan et.al.	2411.15082	link
2024-11-22	Implementation of Real-Time Lane Detection on Autonomous Mobile Robot	Midriem Mirdanies et.al.	2411.14873	null
2024-11-22	Self-Supervised Learning for Ordered Three-Dimensional Structures	Matthew Spellings et.al.	2411.14680	null
2024-11-21	Variable Extraction for Model Recovery in Scientific Literature	Chunwei Liu et.al.	2411.14569	null
2024-11-21	SegBook: A Simple Baseline and Cookbook for Volumetric Medical Image Segmentation	Jin Ye et.al.	2411.14525	null
2024-11-21	POS-tagging to highlight the skeletal structure of sentences	Grigorii Churakov et.al.	2411.14393	link
2024-11-21	Data Formats in Analytical DBMSs: Performance Trade-offs and Future Directions	Chunwei Liu et.al.	2411.14331	null
2024-11-21	BERT-Based Approach for Automating Course Articulation Matrix Construction with Explainable AI	Natenaile Asmamaw Shiferaw et.al.	2411.14254	link
2024-11-21	Uncertainty-Aware Regression for Socio-Economic Estimation via Multi-View Remote Sensing	Fan Yang et.al.	2411.14119	link
2024-11-20	Machine Learning Domain Adaptation in Spin Models with Continuous Phase Transitions	Vladislav Chertenkov et.al.	2411.13027	null
2024-11-15	FedCL-Ensemble Learning: A Framework of Federated Continual Learning with Ensemble Transfer Learning Enhanced for Alzheimer’s MRI Classifications while Preserving Privacy	Rishit Kapoor et.al.	2411.12756	null
2024-11-19	Multivariate and Online Transfer Learning with Uncertainty Quantification	Jimmy Hickey et.al.	2411.12555	null
2024-11-19	Probe-Me-Not: Protecting Pre-trained Encoders from Malicious Probing	Ruyi Ding et.al.	2411.12508	null
2024-11-19	Classification of Geographical Land Structure Using Convolution Neural Network and Transfer Learning	Mustafa M. Abd Zaid et.al.	2411.12415	null
2024-11-19	Adversarial Multi-Agent Reinforcement Learning for Proactive False Data Injection Detection	Kejun Chen et.al.	2411.12130	null
2024-11-18	In-Situ Melt Pool Characterization via Thermal Imaging for Defect Detection in Directed Energy Deposition Using Vision Transformers	Israt Zarin Era et.al.	2411.12028	null
2024-11-18	Compression of Higher Order Ambisonics with Multichannel RVQGAN	Toni Hirvonen et.al.	2411.12008	null
2024-11-18	TL-CLIP: A Power-specific Multimodal Pre-trained Visual Foundation Model for Transmission Line Defect Recognition	Ke Zhang et.al.	2411.11370	null
2024-11-18	Efficient Transfer Learning for Video-language Foundation Models	Haoxing Chen et.al.	2411.11223	link
2024-11-16	Adaptive Learning of Design Strategies over Non-Hierarchical Multi-Fidelity Models via Policy Alignment	Akash Agrawal et.al.	2411.10841	null
2024-11-15	Large quadrupole deformation in $^{20}$Ne challenges rotor model and modern theory: urging for $\alpha$ clusters in nuclei	C. V. Mehl et.al.	2411.10598	null
2024-11-15	Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review	Hossein Hassani et.al.	2411.10268	null
2024-11-15	Causal Time-Series Synchronization for Multi-Dimensional Forecasting	Michael Mayr et.al.	2411.10152	null
2024-11-15	Unlocking Transfer Learning for Open-World Few-Shot Recognition	Byeonggeun Kim et.al.	2411.09986	null
2024-11-15	mmSpyVR: Exploiting mmWave Radar for Penetrating Obstacles to Uncover Privacy Vulnerability of Virtual Reality	Luoyu Mei et.al.	2411.09914	link
2024-11-14	Edge Caching Optimization with PPO and Transfer Learning for Dynamic Environments	Farnaz Niknia et.al.	2411.09812	null
2024-11-14	Assessing the Performance of the DINOv2 Self-supervised Learning Vision Transformer Model for the Segmentation of the Left Atrium from MRI Images	Bipasha Kundu et.al.	2411.09598	null
2024-11-14	A Practical Guide to Fine-tuning Language Models with Limited Data	Márton Szép et.al.	2411.09539	null
2024-11-14	A Centralized-Distributed Transfer Model for Cross-Domain Recommendation Based on Multi-Source Heterogeneous Transfer Learning	Ke Xu et.al.	2411.09286	null
2024-11-14	Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery	Ashim Dahal et.al.	2411.09101	link
2024-11-13	Zero-shot Cross-lingual Transfer Learning with Multiple Source and Target Languages for Information Extraction: Language Selection and Adversarial Training	Nghia Trung Ngo et.al.	2411.08785	null
2024-11-13	MVKTrans: Multi-View Knowledge Transfer for Robust Multiomics Classification	Shan Cong et.al.	2411.08703	null
2024-11-13	Transfer Learning Guided Noise Reduction for Automatic Modulation Classification	Zelin Ji et.al.	2411.08376	null
2024-11-13	DEEGITS: Deep Learning based Framework for Measuring Heterogenous Traffic State in Challenging Traffic Scenarios	Muttahirul Islam et.al.	2411.08335	null
2024-11-12	Comprehensive and Comparative Analysis between Transfer Learning and Custom Built VGG and CNN-SVM Models for Wildfire Detection	Aditya V. Jonnalagadda et.al.	2411.08171	null
2024-11-12	Triaxial nuclear shapes from simple ratios of electric-quadrupole matrix elements	Elena Atanassova Lawrie et.al.	2411.08130	null
2024-11-11	High-Fidelity Cellular Network Control-Plane Traffic Generation without Domain Knowledge	Z. Jonny Kong et.al.	2411.07345	null
2024-11-11	DeepONet as a Multi-Operator Extrapolation Model: Distributed Pretraining with Physics-Informed Fine-Tuning	Zecheng Zhang et.al.	2411.07239	null
2024-11-10	Foundation Model for Composite Materials and Microstructural Analysis	Ting-Ju Wei et.al.	2411.06565	link
2024-11-10	MBL-CPDP: A Multi-objective Bilevel Method for Cross-Project Defect Prediction via Automated Machine Learning	Jiaxin Chen et.al.	2411.06491	null
2024-11-10	Do you want to play a game? Learning to play Tic-Tac-Toe in Hypermedia Environments	Katharine Beaumont et.al.	2411.06398	null
2024-11-10	A Hybrid Approach for COVID-19 Detection: Combining Wasserstein GAN with Transfer Learning	Sumera Rounaq et.al.	2411.06397	null
2024-11-09	Deep Nonparametric Conditional Independence Tests for Images	Marco Simnacher et.al.	2411.06140	link
2024-11-12	Cross-Domain Transfer Learning using Attention Latent Features for Multi-Agent Trajectory Prediction	Jia Quan Loh et.al.	2411.06087	null
2024-11-09	Predicting band structures for 2D Photonic Crystals via Deep Learning	Yueqi Wang et.al.	2411.06063	null
2024-11-08	Towards Equitable ASD Diagnostics: A Comparative Study of Machine and Deep Learning Models Using Behavioral and Facial Data	Mohammed Aledhari et.al.	2411.05880	null
2024-11-08	Predicting Stroke through Retinal Graphs and Multimodal Self-supervised Learning	Yuqing Huang et.al.	2411.05597	link
2024-11-07	AGE2HIE: Transfer Learning from Brain Age to Predicting Neurocognitive Outcome for Infant Brain Injury	Rina Bao et.al.	2411.05188	null
2024-11-07	High Entropy Alloy property predictions using Transformer-based language model	Spyros Kamnis et.al.	2411.04861	null
2024-11-07	SpectraFM: Tuning into Stellar Foundation Models	Nolan Koblischke et.al.	2411.04750	link
2024-11-07	wav2sleep: A Unified Multi-Modal Approach to Sleep Stage Classification from Physiological Signals	Jonathan F. Carter et.al.	2411.04644	link
2024-11-07	Enhancing Bronchoscopy Depth Estimation through Synthetic-to-Real Domain Adaptation	Qingyao Tian et.al.	2411.04404	null
2024-11-06	Fine-tuning – a Transfer Learning approach	Joseph Arul Raj et.al.	2411.03941	null
2024-11-06	Cross Feature Fusion of Fundus Image and Generated Lesion Map for Referable Diabetic Retinopathy Classification	Dahyun Mok et.al.	2411.03618	null
2024-11-05	Energy Price Modelling: A Comparative Evaluation of four Generations of Forecasting Methods	Alexandru-Victor Andrei et.al.	2411.03372	null
2024-11-05	Proxy-informed Bayesian transfer learning with unknown sources	Sabina J. Sloman et.al.	2411.03263	null
2024-11-05	Exploiting the Segment Anything Model (SAM) for Lung Segmentation in Chest X-ray Images	Gabriel Bellon de Carvalho et.al.	2411.03064	null
2024-11-05	A Mamba Foundation Model for Time Series Forecasting	Haoyu Ma et.al.	2411.02941	null
2024-11-04	Supervised Transfer Learning Framework for Fault Diagnosis in Wind Turbines	Kenan Weber et.al.	2411.02127	null
2024-11-04	AM Flow: Adapters for Temporal Processing in Action Recognition	Tanay Agrawal et.al.	2411.02065	null
2024-11-04	V-CAS: A Realtime Vehicle Anti Collision System Using Vision Transformer on Multi-Camera Streams	Muhammad Waqas Ashraf et.al.	2411.01963	null
2024-11-03	Interaction-Aware Trajectory Prediction for Safe Motion Planning in Autonomous Driving: A Transformer-Transfer Learning Approach	Jinhao Liang et.al.	2411.01475	null
2024-11-02	Transfer Learning for Finetuning Large Language Models	Tobias Strangmann et.al.	2411.01195	null
2024-11-02	Transfer Learning Between U.S. Presidential Elections: How Should We Learn From A 2020 Ad Campaign To Inform 2024 Ad Campaigns?	Xinran Miao et.al.	2411.01100	null
2024-11-01	Magnitude Pruning of Large Pretrained Transformer Models with a Mixture Gaussian Prior	Mingxuan Zhang et.al.	2411.00969	null
2024-10-31	Denoising study of Fluoroscopic Images in real time tumor tracking System based on Statistical model of noise	Yongxuan Yan et.al.	2411.00199	null
2024-10-31	Attention is All You Need to Optimize Wind Farm Operations and Maintenance	Iman Kazemian et.al.	2410.24052	null
2024-10-31	Rethinking Inverse Reinforcement Learning: from Data Alignment to Task Alignment	Weichao Zhou et.al.	2410.23680	link
2024-10-31	BioNCERE: Non-Contrastive Enhancement For Relation Extraction In Biomedical Texts	Farshad Noravesh et.al.	2410.23583	null
2024-10-30	Mind the Gap: A Generalized Approach for Cross-Modal Embedding Alignment	Arihan Yadav et.al.	2410.23437	null
2024-10-30	Domain-decomposed image classification algorithms using linear discriminant analysis and convolutional neural networks	Axel Klawonn et.al.	2410.23359	null
2024-10-30	Sequential Order-Robust Mamba for Time Series Forecasting	Seunghan Lee et.al.	2410.23356	null
2024-10-30	Transfer Learning in Vocal Education: Technical Evaluation of Limited Samples Describing Mezzo-soprano	Zhenyi Hou et.al.	2410.23325	null
2024-10-30	Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-in Gamma Probe	Songyu Xu et.al.	2410.23154	null
2024-10-30	Don’t Just Pay Attention, PLANT It: Transfer L2R Models to Fine-tune Attention in Extreme Multi-Label Text Classification	Debjyoti Saharoy et.al.	2410.23066	null
2024-10-30	MutaPLM: Protein Language Modeling for Mutation Explanation and Engineering	Yizhen Luo et.al.	2410.22949	link
2024-10-30	Self-Driving Car Racing: Application of Deep Reinforcement Learning	Florentiana Yuwono et.al.	2410.22766	null
2024-10-29	Towards Neural-Network-based optical temperature sensing of Semiconductor Membrane External Cavity Laser	Jakob Mannstadt et.al.	2410.22528	null
2024-10-29	The PV-ALE Dataset: Enhancing Apple Leaf Disease Classification Through Transfer Learning with Convolutional Neural Networks	Joseph Damilola Akinyemi et.al.	2410.22490	null
2024-10-30	Feature distribution Adaptation Network for Speech Emotion Recognition	Shaokai Li et.al.	2410.22023	link
2024-10-29	Advancing Efficient Brain Tumor Multi-Class Classification – New Insights from the Vision Mamba Model in Transfer Learning	Yinyi Lai et.al.	2410.21872	null
2024-10-29	Cross-Domain Transfer Learning Method for Thermal Adaptive Behavior Recognition with WiFi	Zhaohe Lv et.al.	2410.21827	null
2024-10-30	Adaptive Transfer Clustering: A Unified Framework	Yuqi Gu et.al.	2410.21263	link
2024-10-28	Breccia and basalt classification of thin sections of Apollo rocks with deep learning	Freja Thoresen et.al.	2410.21024	null
2024-10-28	KANsformer for Scalable Beamforming	Xinke Xie et.al.	2410.20690	null
2024-10-27	Causal Modeling in Multi-Context Systems: Distinguishing Multiple Context-Specific Causal Graphs which Account for Observational Support	Martin Rabel et.al.	2410.20405	null
2024-10-27	Uncovering Capabilities of Model Pruning in Graph Contrastive Learning	Wu Junran et.al.	2410.20356	null
2024-10-26	Detection-Guided Deep Learning-Based Model with Spatial Regularization for Lung Nodule Segmentation	Jiasen Zhang et.al.	2410.20154	null
2024-10-26	Sensor2Text: Enabling Natural Language Interactions for Daily Activity Tracking Using Wearable Sensors	Wenqiang Chen et.al.	2410.20034	null
2024-10-25	Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language Models	Zheng Zhao et.al.	2410.20008	null
2024-10-25	The Galaxy Zoo Catalogs for the Galaxy And Mass Assembly (GAMA) Survey	Benne W. Holwerda et.al.	2410.19985	null
2024-10-25	A Review of Deep Learning Approaches for Non-Invasive Cognitive Impairment Detection	Muath Alsuhaibani et.al.	2410.19898	null
2024-10-25	Learning the Regularization Strength for Deep Fine-Tuning via a Data-Emphasized Variational Objective	Ethan Harvey et.al.	2410.19675	link
2024-10-25	Transferring Knowledge from High-Quality to Low-Quality MRI for Adult Glioma Diagnosis	Yanguang Zhao et.al.	2410.18698	null
2024-10-23	Deep learning for model correction of dynamical systems with data scarcity	Caroline Tatsuoka et.al.	2410.17913	null
2024-10-23	New Insight in Cervical Cancer Diagnosis Using Convolution Neural Network Architecture	Ach. Khozaimi et.al.	2410.17735	null
2024-10-22	Subshell gaps and onsets of collectivity from proton and neutron pairing gap correlations	José Nicolás Orce et.al.	2410.17436	null
2024-10-23	Understanding Transfer Learning via Mean-field Analysis	Gholamali Aminian et.al.	2410.17128	null
2024-10-22	Development of CNN Architectures using Transfer Learning Methods for Medical Image Classification	Ganga Prasad Basyal et.al.	2410.16711	null
2024-10-22	Enhancing Two-Player Performance Through Single-Player Knowledge Transfer: An Empirical Study on Atari 2600 Games	Kimiya Saadat et.al.	2410.16653	link
2024-10-21	Towards Optimal Adapter Placement for Efficient Transfer Learning	Aleksandra I. Nowak et.al.	2410.15858	null
2024-10-21	SSMT: Few-Shot Traffic Forecasting with Single Source Meta-Transfer	Kishor Kumar Bhaumik et.al.	2410.15589	null
2024-10-20	Improving 3D Medical Image Segmentation at Boundary Regions using Local Self-attention and Global Volume Mixing	Daniya Najiha Abdul Kareem et.al.	2410.15360	link
2024-10-20	FoMo: A Foundation Model for Mobile Traffic Forecasting with Diffusion Model	Haoye Chai et.al.	2410.15322	null
2024-10-19	Less is More: Parameter-Efficient Selection of Intermediate Tasks for Transfer Learning	David Schulte et.al.	2410.15148	link
2024-10-19	Generalizable Prediction Model of Molten Salt Mixture Density with Chemistry-Informed Transfer Learning	Julian Barra et.al.	2410.15120	null
2024-10-19	Water quality polluted by total suspended solids classified within an Artificial Neural Network approach	I. Luviano Soto et.al.	2410.14929	null
2024-10-18	A novel approach towards the classification of Bone Fracture from Musculoskeletal Radiography images using Attention Based Transfer Learning	Sayeda Sanzida Ferdous Ruhi et.al.	2410.14833	null
2024-10-18	Effects of Soft-Domain Transfer and Named Entity Information on Deception Detection	Steven Triplett et.al.	2410.14814	null
2024-10-18	How Does Data Diversity Shape the Weight Landscape of Neural Networks?	Yang Ba et.al.	2410.14602	null
2024-10-18	Transfer Reinforcement Learning in Heterogeneous Action Spaces using Subgoal Mapping	Kavinayan P. Sivakumar et.al.	2410.14484	null
2024-10-18	Predicting the trajectory of intracranial pressure in patients with traumatic brain injury: evaluation of a foundation model for time series	Florian D. van Leeuwen et.al.	2410.14333	null
2024-10-18	Transfer Learning on Transformers for Building Energy Consumption Forecasting – A Comparative Study	Robert Spencer et.al.	2410.14107	null
2024-10-18	ST-MoE-BERT: A Spatial-Temporal Mixture-of-Experts Framework for Long-Term Cross-City Mobility Prediction	Haoyu He et.al.	2410.14099	link
2024-10-16	FedGTST: Boosting Global Transferability of Federated Models via Statistics Tuning	Evelyn Ma et.al.	2410.13045	null
2024-10-15	Exploring transfer learning for Deep NLP systems on rarely annotated languages	Dipendra Yadav et.al.	2410.12879	null
2024-10-17	Local transfer learning Gaussian process modeling, with applications to surrogate modeling of expensive computer simulators	Xinming Wang et.al.	2410.12690	null
2024-10-16	Tracking Universal Features Through Fine-Tuning and Model Merging	Niels Horn et.al.	2410.12391	null
2024-10-16	iFuzzyTL: Interpretable Fuzzy Transfer Learning for SSVEP BCI System	Xiaowei Jiang et.al.	2410.12267	null
2024-10-16	Transfer Learning on Multi-Dimensional Data: A Novel Approach to Neural Network-Based Surrogate Modeling	Adrienne M. Propp et.al.	2410.12241	null
2024-10-16	TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration	Yiwei Guo et.al.	2410.12183	link
2024-10-15	Learning to rumble: Automated elephant call classification, detection and endpointing using deep architectures	Christiaan M. Geldenhuys et.al.	2410.12082	null
2024-10-15	A Survey on Deep Tabular Learning	Shriyank Somvanshi et.al.	2410.12034	null
2024-10-15	Transfer Learning Adapts to Changing PSD in Gravitational Wave Data	Beka Modrekiladze et.al.	2410.11911	null
2024-10-15	YOLO-ELA: Efficient Local Attention Modeling for High-Performance Real-Time Insulator Defect Detection	Olalekan Akindele et.al.	2410.11727	null
2024-10-15	Transfer Learning with Foundational Models for Time Series Forecasting using Low-Rank Adaptations	M. Germán-Morales et.al.	2410.11539	null
2024-10-15	Improving Bias in Facial Attribute Classification: A Combined Impact of KL Divergence induced Loss Function and Dual Attention	Shweta Patel et.al.	2410.11176	null
2024-10-14	TL-PCA: Transfer Learning of Principal Component Analysis	Sharon Hendy et.al.	2410.10805	null
2024-10-14	Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework	Zhengwei Yang et.al.	2410.10663	null
2024-10-14	SpeGCL: Self-supervised Graph Spectrum Contrastive Learning without Positive Samples	Yuntao Shou et.al.	2410.10365	null
2024-10-12	Bayesian Transfer Learning for Artificially Intelligent Geospatial Systems: A Predictive Stacking Approach	Luca Presicce et.al.	2410.09504	link
2024-10-12	Deep Transfer Learning: Model Framework and Error Analysis	Yuling Jiao et.al.	2410.09383	null
2024-10-12	Hey AI Can You Grade My Essay?: Automatic Essay Grading	Maisha Maliha et.al.	2410.09319	null
2024-10-11	Meta-Transfer Learning Empowered Temporal Graph Networks for Cross-City Real Estate Appraisal	Weijia Zhang et.al.	2410.08947	null
2024-10-10	Features are fate: a theory of transfer learning in high-dimensional regression	Javan Tahir et.al.	2410.08194	null
2024-10-10	Non-transferable Pruning	Ruyi Ding et.al.	2410.08015	null
2024-10-10	CL3: A Collaborative Learning Framework for the Medical Data Ensuring Data Privacy in the Hyperconnected Environment	Mohamamd Zavid Parvez et.al.	2410.07900	link
2024-10-10	Unsupervised Data Validation Methods for Efficient Model Training	Yurii Paniv et.al.	2410.07880	null
2024-10-10	Robustness and Security Enhancement of Radio Frequency Fingerprint Identification in Time-Varying Channels	Lu Yang et.al.	2410.07591	null
2024-10-10	Physics-informed neural networks for multi-field visualization with single-color laser induced fluorescence	Nagahiro Ohashi et.al.	2410.07568	null
2024-10-09	Collusion Detection with Graph Neural Networks	Lucas Gomes et.al.	2410.07091	null
2024-10-09	Z-upscaling: Optical Flow Guided Frame Interpolation for Isotropic Reconstruction of 3D EM Volumes	Fisseha A. Ferede et.al.	2410.07043	link
2024-10-09	Selecting the Best Sequential Transfer Path for Medical Image Segmentation with Limited Labeled Data	Jingyun Yang et.al.	2410.06892	link
2024-10-09	Transfer Learning for a Class of Cascade Dynamical Systems	Shima Rabiei et.al.	2410.06828	null
2024-10-09	Seg2Act: Global Context-aware Action Generation for Document Logical Structuring	Zichao Li et.al.	2410.06802	link
2024-10-09	Utilizing Transfer Learning and pre-trained Models for Effective Forest Fire Detection: A Case Study of Uttarakhand	Hari Prabhat Gupta et.al.	2410.06743	null
2024-10-09	On The Relationship between Visual Anomaly-free and Anomalous Representations	Riya Sadrani et.al.	2410.06576	null
2024-10-09	Model-assisted and Knowledge-guided Transfer Regression for the Underrepresented Population	Doudou Zhou et.al.	2410.06484	null
2024-10-08	Advancements in Road Lane Mapping: Comparative Fine-Tuning Analysis of Deep Learning-based Semantic Segmentation Methods Using Aerial Imagery	Xuanchen et.al.	2410.05717	null
2024-10-08	Robust Transfer Learning for Active Level Set Estimation with Locally Adaptive Gaussian Process Prior	Giang Ngo et.al.	2410.05660	null
2024-10-08	Deep Transfer Learning-based Detection for Flash Memory Channels	Zhen Mei et.al.	2410.05618	null
2024-10-07	Pre-Ictal Seizure Prediction Using Personalized Deep Learning	Shriya Jaddu et.al.	2410.05491	null
2024-10-07	Deep learning-based Visual Measurement Extraction within an Adaptive Digital Twin Framework from Limited Data Using Transfer Learning	Mehrdad Shafiei Dizaji et.al.	2410.05403	null
2024-10-07	Hyper-Representations: Learning from Populations of Neural Networks	Konstantin Schürholt et.al.	2410.05107	link
2024-10-07	Learning Interpretable Hierarchical Dynamical Systems Models from Time Series Data	Manuel Brenner et.al.	2410.04814	null
2024-10-06	Learning De-Biased Representations for Remote-Sensing Imagery	Zichen Tian et.al.	2410.04546	link
2024-10-06	Transfer Learning with General Estimating Equations	Han Yan et.al.	2410.04398	null
2024-10-05	Deep Transfer Learning Based Peer Review Aggregation and Meta-review Generation for Scientific Articles	Md. Tarek Hasan et.al.	2410.04202	null
2024-10-04	Interpolation-Free Deep Learning for Meteorological Downscaling on Unaligned Grids Across Multiple Domains with Application to Wind Power	Jean-Sébastien Giroux et.al.	2410.03945	null
2024-10-03	Reconstructing Human Mobility Pattern: A Semi-Supervised Approach for Cross-Dataset Transfer Learning	Xishun Liao et.al.	2410.03788	null
2024-10-04	SAG: Style-Aligned Article Generation via Model Collaboration	Chenning Xu et.al.	2410.03137	null
2024-10-04	Remaining Useful Life Prediction: A Study on Multidimensional Industrial Signal Processing and Efficient Transfer Learning Based on Large Language Models	Yan Chen et.al.	2410.03134	null
2024-10-03	Ethio-Fake: Cutting-Edge Approaches to Combat Fake News in Under-Resourced Languages Using Explainable AI	Mesay Gemeda Yigezu et.al.	2410.02609	null
2024-10-03	Source Data Selection for Brain-Computer Interfaces based on Simple Features	Frida Heskebeck et.al.	2410.02360	null
2024-10-03	QDGset: A Large Scale Grasping Dataset Generated with Quality-Diversity	Johann Huber et.al.	2410.02319	null
2024-10-03	The Comparison of Individual Cat Recognition Using Neural Networks	Mingxuan Li et.al.	2410.02305	null
2024-10-03	A Novel Method for Accurate & Real-time Food Classification: The Synergistic Integration of EfficientNetB7, CBAM, Transfer Learning, and Data Augmentation	Shayan Rokhva et.al.	2410.02304	null
2024-10-03	Universality in Transfer Learning for Linear Models	Reza Ghane et.al.	2410.02164	null
2024-10-02	In-Context Transfer Learning: Demonstration Synthesis by Transferring Similar Tasks	Dingzirui Wang et.al.	2410.01548	link
2024-10-02	RS-FME-SwinT: A Novel Feature Map Enhancement Framework Integrating Customized SwinT with Residual and Spatial CNN for Monkeypox Diagnosis	Saddam Hussain Khan et.al.	2410.01216	null
2024-10-02	Recovering Manifold Structure Using Ollivier-Ricci Curvature	Tristan Luca Saidi et.al.	2410.01149	link
2024-09-30	On the topology and geometry of population-based SHM	Keith Worden et.al.	2410.00923	null
2024-10-01	Advanced Arabic Alphabet Sign Language Recognition Using Transfer Learning and Transformer Models	Mazen Balat et.al.	2410.00681	null
2024-10-01	EMGTTL: Transformers-Based Transfer Learning for Classification of ADL using Raw Surface EMG Signals	Ashraf Ali Kareemulla et.al.	2410.00586	null
2024-10-01	Scalable Multi-Task Transfer Learning for Molecular Property Prediction	Chanhui Lee et.al.	2410.00432	null
2024-09-30	FireLite: Leveraging Transfer Learning for Efficient Fire Detection in Resource-Constrained Environments	Mahamudul Hasan et.al.	2409.20384	null
2024-09-30	UIR-LoRA: Achieving Universal Image Restoration through Multiple Low-Rank Adaptation	Cheng Zhang et.al.	2409.20197	link
2024-09-30	SurgPETL: Parameter-Efficient Image-to-Surgical-Video Transfer Learning for Surgical Phase Recognition	Shu Yang et.al.	2409.20083	null
2024-09-30	Model Selection with a Shapelet-based Distance Measure for Multi-source Transfer Learning in Time Series Classification	Jiseok Lee et.al.	2409.20005	link
2024-09-29	MedViLaM: A multimodal large language model with advanced generalizability and explainability for medical data understanding and generation	Lijian Xu et.al.	2409.19684	link
2024-09-29	Brain Tumor Classification on MRI in Light of Molecular Markers	Jun Liu et.al.	2409.19583	null
2024-09-29	A Universal Deep Learning Framework for Materials X-ray Absorption Spectra	Shubha R. Kharel et.al.	2409.19552	link
2024-09-28	Accelerating Malware Classification: A Vision Transformer Solution	Shrey Bavishi et.al.	2409.19461	link
2024-09-28	On the universality of neural encodings in CNNs	Florentin Guth et.al.	2409.19460	null
2024-09-27	Meta-RTL: Reinforcement-Based Meta-Transfer Learning for Low-Resource Commonsense Reasoning	Yu Fu et.al.	2409.19075	null
2024-09-27	Audio-Based Linguistic Feature Extraction for Enhancing Multi-lingual and Low-Resource Text-to-Speech	Youngjae Kim et.al.	2409.18622	null
2024-09-27	How Effective is Pre-training of Large Masked Autoencoders for Downstream Earth Observation Tasks?	Jose Sosa et.al.	2409.18536	null
2024-10-01	Automated Segmentation and Analysis of Microscopy Images of Laser Powder Bed Fusion Melt Tracks	Aagam Shah et.al.	2409.18326	null
2024-09-26	Jump Diffusion-Informed Neural Networks with Transfer Learning for Accurate American Option Pricing under Data Scarcity	Qiguo Sun et.al.	2409.18168	null
2024-09-26	Transfer Learning in $\ell_1$ Regularized Regression: Hyperparameter Selection Strategy based on Sharp Asymptotic Analysis	Koki Okajima et.al.	2409.17704	null
2024-09-26	T3: A Novel Zero-shot Transfer Learning Framework Iteratively Training on an Assistant Task for a Target Task	Xindi Tong et.al.	2409.17640	null
2024-09-26	MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models	Gongfan Fang et.al.	2409.17481	link
2024-09-24	Transfer learning for financial data predictions: a systematic review	V. Lanzetta et.al.	2409.17183	null
2024-09-25	Cross-lingual Speech Emotion Recognition: Humans vs. Self-Supervised Models	Zhichen Han et.al.	2409.16920	link
2024-09-25	GraphLoRA: Structure-Aware Contrastive Low-Rank Adaptation for Cross-Graph Transfer Learning	Zhe-Rui Yang et.al.	2409.16670	link
2024-09-25	Graph Pruning Based Spatial and Temporal Graph Convolutional Network with Transfer Learning for Traffic Prediction	Zihao Jing et.al.	2409.16532	link
2024-09-24	Lessons Learned from a Unifying Empirical Study of Parameter-Efficient Transfer Learning (PETL) in Visual Recognition	Zheda Mai et.al.	2409.16434	link
2024-09-24	Stable Survival Extrapolation via Transfer Learning	Anastasios Apsemidis et.al.	2409.16044	null
2024-09-24	Unleashing the Potential of Synthetic Images: A Study on Histopathology Image Classification	Leire Benito-Del-Valle et.al.	2409.16002	link
2024-09-24	Machine Translation Advancements of Low-Resource Indian Languages by Transfer Learning	Bin Wei et.al.	2409.15879	null
2024-09-21	Generalization in birdsong classification: impact of transfer learning methods and dataset characteristics	Burooj Ghani et.al.	2409.15383	null
2024-09-22	From Lazy to Rich: Exact Learning Dynamics in Deep Linear Networks	Clémentine C. J. Dominé et.al.	2409.14623	null
2024-09-21	Multiple-Exit Tuning: Towards Inference-Efficient Adaptation for Vision Transformer	Zheng Liu et.al.	2409.13999	null
2024-09-20	Transfer Learning with Clinical Concept Embeddings from Large Language Models	Yuhe Gao et.al.	2409.13893	null
2024-09-20	Transfer Learning for Passive Sonar Classification using Pre-trained Audio and ImageNet Models	Amirmohammad Mohammadi et.al.	2409.13878	null
2024-09-20	Transfer Learning and Double U-Net Empowered Wave Propagation Model in Complex Indoor Environment	Ziheng Fu et.al.	2409.13833	null
2024-09-20	MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension	Ting Liu et.al.	2409.13609	link
2024-09-20	Deep Learning and Machine Learning, Advancing Big Data Analytics and Management: Tensorflow Pretrained Models	Keyu Chen et.al.	2409.13566	null
2024-09-20	Overcoming Data Limitations in Internet Traffic Forecasting: LSTM Models with Transfer Learning and Wavelet Augmentation	Sajal Saha et.al.	2409.13181	null
2024-09-20	Bilateral Sharpness-Aware Minimization for Flatter Minima	Jiaxin Deng et.al.	2409.13173	null
2024-09-19	Recognition of Harmful Phytoplankton from Microscopic Images using Deep Learning	Aymane Khaldi et.al.	2409.12900	null
2024-09-19	Rapid aerodynamic prediction of swept wings via physics-embedded transfer learning	Yunjia Yang et.al.	2409.12711	null
2024-09-19	Exploring bat song syllable representations in self-supervised audio encoders	Marianne de Heer Kloots et.al.	2409.12634	null
2024-09-19	Using Large Language Models to Generate Clinical Trial Tables and Figures	Yumeng Yang et.al.	2409.12046	null
2024-09-18	All-in-one foundational models learning across quantum chemical levels	Yuxinxin Chen et.al.	2409.12015	link
2024-09-18	Location based Probabilistic Load Forecasting of EV Charging Sites: Deep Transfer Learning with Multi-Quantile Temporal Convolutional Network	Mohammad Wazed Ali et.al.	2409.11862	null
2024-09-18	Bridging Domain Gap for Flight-Ready Spaceborne Vision	Tae Ha Park et.al.	2409.11661	null
2024-09-17	Leveraging Reviewer Experience in Code Review Comment Generation	Hong Yi Lin et.al.	2409.10959	null
2024-09-16	Can Transfer Learning be Used to Identify Tropical State-Dependent Bias Relevant to Midlatitude Subseasonal Predictability?	Kirsten J. Mayer et.al.	2409.10755	null
2024-09-16	RF-GML: Reference-Free Generative Machine Listener	Arijit Biswas et.al.	2409.10210	null
2024-09-16	A Comparative Study of Open Source Computer Vision Models for Application on Small Data: The Case of CFRP Tape Laying	Thomas Fraunholz et.al.	2409.10104	null
2024-09-14	Target Speaker ASR with Whisper	Alexander Polok et.al.	2409.09543	link
2024-09-14	On the Generalizability of Foundation Models for Crop Type Mapping	Yi-Chia Chang et.al.	2409.09451	link
2024-09-14	The T05 System for The VoiceMOS Challenge 2024: Transfer Learning from Deep Image Classifier to Naturalness MOS Prediction of High-Quality Synthetic Speech	Kaito Baba et.al.	2409.09305	link
2024-09-22	Train-On-Request: An On-Device Continual Learning Workflow for Adaptive Real-World Brain Machine Interfaces	Lan Mei et.al.	2409.09161	link
2024-09-11	Distributed Convolutional Neural Network Training on Mobile and Edge Clusters	Pranav Rama et.al.	2409.09083	null
2024-09-13	Comparative Analysis of Pretrained Audio Representations in Music Recommender Systems	Yan-Martin Tamm et.al.	2409.08987	link
2024-09-13	Data Efficient Child-Adult Speaker Diarization with Simulated Conversations	Anfeng Xu et.al.	2409.08881	link
2024-09-13	Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages	Yao-Fei Cheng et.al.	2409.08872	null
2024-09-12	Identification of head impact locations, speeds, and force based on head kinematics	Xianghao Zhan et.al.	2409.08177	link
2024-09-12	SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality	Chenyang Lei et.al.	2409.08083	link
2024-09-12	SPARK: Self-supervised Personalized Real-time Monocular Face Capture	Kelian Baert et.al.	2409.07984	null
2024-09-12	Data-efficient multi-fidelity training for high-fidelity machine learning interatomic potentials	Jaesun Kim et.al.	2409.07947	null
2024-09-12	Reimagining Linear Probing: Kolmogorov-Arnold Networks in Transfer Learning	Sheng Shen et.al.	2409.07763	null
2024-09-12	Transfer Learning Applied to Computer Vision Problems: Survey on Current Progress, Limitations, and Opportunities	Aaryan Panda et.al.	2409.07736	null
2024-09-17	Music auto-tagging in the long tail: A few-shot approach	T. Aleksandra Ma et.al.	2409.07730	null
2024-09-11	Deep Neural Network-Based Sign Language Recognition: A Comprehensive Approach Using Transfer Learning with Explainability	A. E. M Ridwan et.al.	2409.07426	null
2024-09-11	Deep Learning Techniques for Hand Vein Biometrics: A Comprehensive Review	Mustapha Hemis et.al.	2409.07128	null
2024-09-13	A Bayesian framework for active object recognition, pose estimation and shape transfer learning through touch	Haodong Zheng et.al.	2409.06912	null
2024-09-10	Adaptive Meta-Domain Transfer Learning (AMDTL): A Novel Approach for Knowledge Transfer in AI	Michele Laurelli et.al.	2409.06800	link
2024-09-10	A study on Deep Convolutional Neural Networks, Transfer Learning and Ensemble Model for Breast Cancer Detection	Md Taimur Ahad et.al.	2409.06699	null
2024-09-10	A comprehensive study on Blood Cancer detection and classification using Convolutional Neural Network	Md Taimur Ahad et.al.	2409.06689	null
2024-09-10	Advancements in Gesture Recognition Techniques and Machine Learning for Enhanced Human-Robot Interaction: A Comprehensive Review	Sajjad Hussain et.al.	2409.06503	null
2024-09-10	Inference is All You Need: Self Example Retriever for Cross-domain Dialogue State Tracking with ChatGPT	Jihyun Lee et.al.	2409.06243	null
2024-09-09	Robust Real-time Segmentation of Bio-Morphological Features in Human Cherenkov Imaging during Radiotherapy via Deep Learning	Shiru Wang et.al.	2409.05666	null
2024-09-09	Preparing Schrödinger cat states in a microwave cavity using a neural network	Hector Hutin et.al.	2409.05557	null
2024-09-13	Federated Transfer Learning Based Cooperative Wideband Spectrum Sensing with Model Pruning	Jibin Jia et.al.	2409.05462	null
2024-09-09	Sample-Efficient Bayesian Optimization with Transfer Learning for Heterogeneous Search Spaces	Aryan Deshwal et.al.	2409.05325	link
2024-09-07	Collaborative Learning with Shared Linear Representations: Statistical Rates and Optimal Algorithms	Xiaochun Niu et.al.	2409.04919	null
2024-09-07	Urban traffic analysis and forecasting through shared Koopman eigenmodes	Chuhan Yang et.al.	2409.04728	null
2024-09-06	A Unified Framework for Cross-Domain Recommendation	Jiangxia Cao et.al.	2409.04540	null
2024-09-06	Incorporating external data for analyzing randomized clinical trials: A transfer learning approach	Yujia Gu et.al.	2409.04126	null
2024-09-09	AnyMatch – Efficient Zero-Shot Entity Matching with a Small Language Model	Zeyu Zhang et.al.	2409.04073	link
2024-09-05	Deep Clustering of Remote Sensing Scenes through Heterogeneous Transfer Learning	Isaac Ray et.al.	2409.03938	null
2024-09-05	The Role of Generative Systems in Historical Photography Management: A Case Study on Catalan Archives	Èric Śanchez et.al.	2409.03911	link
2024-09-05	Threat Classification on Deployed Optical Networks Using MIMO Digital Fiber Sensing, Wavelets, and Machine Learning	Khouloud Abdelli et.al.	2409.03667	null
2024-09-05	Shuffle Vision Transformer: Lightweight, Fast and Efficient Recognition of Driver Facial Expression	Ibtissam Saadi et.al.	2409.03438	null
2024-09-05	Non-stationary and Sparsely-correlated Multi-output Gaussian Process with Spike-and-Slab Prior	Wang Xinming et.al.	2409.03149	null
2024-09-04	Knowledge Transfer for Collaborative Misbehavior Detection in Untrusted Vehicular Environments	Roshan Sedar et.al.	2409.02844	null
2024-09-04	iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation	Hayeon Jo et.al.	2409.02838	null
2024-09-04	Regularized Multi-output Gaussian Convolution Process with Domain Adaptation	Wang Xinming et.al.	2409.02778	null
2024-09-04	A design of magnetic tunnel junctions for the deployment of neuromorphic hardware for edge computing	Davi Rodrigues et.al.	2409.02528	null
2024-09-05	Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR	Xugang Lu et.al.	2409.02239	null
2024-09-04	When Does Visual Prompting Outperform Linear Probing for Vision-Language Models? A Likelihood Perspective	Hsi-Ai Tsao et.al.	2409.01821	link
2024-09-03	METcross: A framework for short-term forecasting of cross-city metro passenger flow	Wenbo Lu et.al.	2409.01515	null
2024-09-02	A multilingual training strategy for low resource Text to Speech	Asma Amalas et.al.	2409.01217	null
2024-09-02	Beyond Efficiency: Molecular Data Pruning for Enhanced Generalization	Dingshuo Chen et.al.	2409.01081	null
2024-09-01	Equitable Skin Disease Prediction Using Transfer Learning and Domain Adaptation	Sajib Acharjee Dip et.al.	2409.00873	null
2024-09-01	Multiscale Color Guided Attention Ensemble Classifier for Age-Related Macular Degeneration using Concurrent Fundus and Optical Coherence Tomography Images	Pragya Gupta et.al.	2409.00718	null
2024-08-31	Comparative Analysis of Modality Fusion Approaches for Audio-Visual Person Identification and Verification	Aref Farhadipour et.al.	2409.00562	null
2024-08-31	Foundations of Multivariate Distributional Reinforcement Learning	Harley Wiltzer et.al.	2409.00328	null
2024-08-30	Self-Supervised Learning for Building Robust Pediatric Chest X-ray Classification Models	Sheng Cheng et.al.	2409.00231	null
2024-08-30	Transfer Learning Based Hybrid Quantum Neural Network Model for Surface Anomaly Detection	Sounak Bhowmik et.al.	2409.00228	null
2024-09-02	Disease Classification and Impact of Pretrained Deep Convolution Neural Networks on Diverse Medical Imaging Datasets across Imaging Modalities	Jutika Borah et.al.	2408.17011	null
2024-08-30	Contrastive Learning with Synthetic Positives	Dewen Zeng et.al.	2408.16965	link
2024-08-30	An Empirical Study of Scaling Laws for Transfer	Matthew Barnett et.al.	2408.16947	null
2024-08-29	Comparative Analysis of Transfer Learning Models for Breast Cancer Classification	Sania Eskandari et.al.	2408.16859	link
2024-08-29	CNN Based Detection of Cardiovascular Diseases from ECG Images	Irem Sayin et.al.	2408.16800	null
2024-08-29	Data Quality Monitoring through Transfer Learning on Anomaly Detection for the Hadron Calorimeters	Mulugeta Weldezgina Asres et.al.	2408.16612	null
2024-08-29	On Transfer Learning for a Fully Convolutional Deep Neural SIMO Receiver	Uyoata E. Uyoata et.al.	2408.16401	null
2024-08-29	Efficient Transfer Learning Framework for Cross-Domain Click-Through Rate Prediction	Qi Liu et.al.	2408.16238	null
2024-08-29	A More Unified Theory of Transfer Learning	Steve Hanneke et.al.	2408.16189	null
2024-08-28	Q-MRS: A Deep Learning Framework for Quantitative Magnetic Resonance Spectra Analysis	Christopher J. Wu et.al.	2408.15999	null
2024-08-28	Auxiliary Input in Training: Incorporating Catheter Features into Deep Learning Models for ECG-Free Dynamic Coronary Roadmapping	Yikang Liu et.al.	2408.15947	null
2024-08-28	Emulating Brain-like Rapid Learning in Neuromorphic Edge Computing	Kenneth Stewart et.al.	2408.15800	link
2024-08-28	Transfer Learning from Simulated to Real Scenes for Monocular 3D Object Detection	Sondos Mohamed et.al.	2408.15637	null
2024-08-27	Advancing Adversarial Suffix Transfer Learning on Aligned Large Language Models	Hongfu Liu et.al.	2408.14866	link
2024-08-27	GeoTransfer: Generalizable Few-Shot Multi-View Reconstruction via Transfer Learning	Shubhendu Jena et.al.	2408.14724	null
2024-08-26	Comparative Analysis: Violence Recognition from Videos using Transfer Learning	Dursun Dashdamirov et.al.	2408.14659	link
2024-08-23	Knowledge Graph Modeling-Driven Large Language Model Operating System (LLM OS) for Task Automation in Process Engineering Problem-Solving	Sakhinana Sagar Srinivas et.al.	2408.14494	null
2024-08-26	Model Parallel Training and Transfer Learning for Convolutional Neural Networks by Domain Decomposition	Axel Klawonn et.al.	2408.14442	null
2024-08-26	Application of Neural Ordinary Differential Equations for ITER Burning Plasma Dynamics	Zefang Liu et.al.	2408.14404	link
2024-08-26	Histology Virtual Staining with Mask-Guided Adversarial Transfer Learning for Tertiary Lymphoid Structure Detection	Qiuli Wang et.al.	2408.13978	null
2024-08-24	Advancing Gamma-Ray Burst Identification through Transfer Learning with Convolutional Neural Networks	Peng Zhang et.al.	2408.13598	null
2024-08-24	Optimal Layer Selection for Latent Data Augmentation	Tomoumi Takase et.al.	2408.13426	null
2024-08-23	Enhancing Few-Shot Transfer Learning with Optimized Multi-Task Prompt Tuning through Modular Prompt Composition	Ahmad Pouramini et.al.	2408.13227	null
2024-08-23	Deep Learning for Lung Disease Classification Using Transfer Learning and a Customized CNN Architecture with Attention	Xiaoyi Liu et.al.	2408.13180	null
2024-08-22	Time series forecasting of multiphase microstructure evolution using deep learning	Saurabh Tiwari et.al.	2408.13111	null
2024-08-23	A cost-effective strategy of enhancing machine learning potentials by transfer learning from a multicomponent dataset on ænet-PyTorch	An Niza El Aisnadaa et.al.	2408.12939	null
2024-08-23	Efficient Training Approaches for Performance Anomaly Detection Models in Edge Computing Environments	Duneesha Fernando et.al.	2408.12855	null
2024-08-23	Underwater SONAR Image Classification and Analysis using LIME-based Explainable Artificial Intelligence	Purushothaman Natarajan et.al.	2408.12837	link
2024-08-22	Enhanced Infield Agriculture with Interpretable Machine Learning Approaches for Crop Classification	Sudi Murindanyi et.al.	2408.12426	null
2024-08-22	Modularized data-driven approximation of the Koopman operator and generator	Yang Guo et.al.	2408.12277	null
2024-08-22	Accounts of using the Tustin-Net architecture on a rotary inverted pendulum	Stijn van Esch et.al.	2408.12266	link
2024-08-23	Enhanced Fine-Tuning of Lightweight Domain-Specific Q&A Model Based on Large Language Models	Shenglin Zhang et.al.	2408.12247	link
2024-08-21	Defining Boundaries: The Impact of Domain Specification on Cross-Language and Cross-Domain Transfer in Machine Translation	Lia Shahnazaryan et.al.	2408.11926	null
2024-08-19	Parameter-Efficient Transfer Learning under Federated Learning for Automatic Speech Recognition	Xuan Kan et.al.	2408.11873	null
2024-08-21	Embedding Ordinality to Binary Loss Function for Improving Solar Flare Forecasting	Chetraj Pandey et.al.	2408.11768	link
2024-08-21	Transfer Learning and the Early Estimation of Single-Photon Source Quality using Machine Learning Methods	David Jacob Kedziora et.al.	2408.11322	link
2024-08-21	RedWhale: An Adapted Korean LLM Through Efficient Continual Pretraining	Anh-Dung Vo et.al.	2408.11294	null
2024-08-20	Multichannel Attention Networks with Ensembled Transfer Learning to Recognize Bangla Handwritten Character	Farhanul Haque et.al.	2408.10955	null
2024-08-20	The Evolution of Reinforcement Learning in Quantitative Finance	Nikolaos Pippas et.al.	2408.10932	null
2024-08-20	ViLReF: A Chinese Vision-Language Retinal Foundation Model	Shengzhu Yang et.al.	2408.10894	link
2024-08-20	TDS-CLIP: Temporal Difference Side Network for Image-to-Video Transfer Learning	Bin Wang et.al.	2408.10688	link
2024-08-20	Multi-Attribute Preferences: A Transfer Learning Approach	Sjoerd Hermes et.al.	2408.10558	null
2024-08-20	Transfer Operator Learning with Fusion Frame	Haoyang Jiang et.al.	2408.10458	null
2024-08-23	Advancing Voice Cloning for Nepali: Leveraging Transfer Learning in a Low-Resource Language	Manjil Karki et.al.	2408.10128	null
2024-08-19	Weakly Supervised Pretraining and Multi-Annotator Supervised Finetuning for Facial Wrinkle Detection	Ik Jun Moon et.al.	2408.09952	null
2024-08-19	Electron-nucleus cross sections from transfer learning	Krzysztof M. Graczyk et.al.	2408.09936	null
2024-08-19	Meta-Learning on Augmented Gene Expression Profiles for Enhanced Lung Cancer Detection	Arya Hadizadeh Moghaddam et.al.	2408.09635	link
2024-08-18	CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination	Kaicheng Yang et.al.	2408.09441	null
2024-08-16	GLANCE: Graph-based Learnable Digital Twin for Communication Networks	Boning Li et.al.	2408.09040	null
2024-08-16	AdaRank: Disagreement Based Module Rank Prediction for Low-rank Adaptation	Yihe Dong et.al.	2408.09015	link
2024-08-16	A Multi-Task and Multi-Label Classification Model for Implicit Discourse Relation Recognition	Nelson Filipe Costa et.al.	2408.08971	null
2024-08-16	CAT: Caution Aware Transfer in Reinforcement Learning via Distributional Risk	Mohamad Fares El Hajj Chehade et.al.	2408.08812	null
2024-08-16	Tuning a SAM-Based Model with Multi-Cognitive Visual Adapter to Remote Sensing Instance Segmentation	Linghao Zheng et.al.	2408.08576	null
2024-08-16	Unsupervised Transfer Learning via Adversarial Contrastive Training	Chenguang Duan et.al.	2408.08533	link
2024-08-16	Inverse design with conditional cascaded diffusion models	Milad Habibi et.al.	2408.08526	null
2024-08-16	Enhancement of price trend trading strategies via image-induced importance weights	Zhoufan Zhu et.al.	2408.08483	link
2024-08-15	Training Spatial-Frequency Visual Prompts and Probabilistic Clusters for Accurate Black-Box Transfer Learning	Wonwoo Cho et.al.	2408.07944	null
2024-08-14	MeerKAT reveals a ghostly thermal radio ring towards the Galactic Centre	C. Bordiu et.al.	2408.07727	null
2024-08-14	PolyCL: Contrastive Learning for Polymer Representation Learning via Explicit and Implicit Augmentations	Jiajun Zhou et.al.	2408.07556	link
2024-08-20	Surrogate-Assisted Search with Competitive Knowledge Transfer for Expensive Optimization	Xiaoming Xue et.al.	2408.07176	link
2024-08-13	Object Tracking Incorporating Transfer Learning into Unscented and Cubature Kalman Filters	Omar Alotaibi et.al.	2408.07157	null
2024-08-12	A Unified Manifold Similarity Measure Enhancing Few-Shot, Transfer, and Reinforcement Learning in Manifold-Distributed Datasets	Sayed W Qayyumi et.al.	2408.07095	null
2024-08-07	Anatomical Foundation Models for Brain MRIs	Carlo Alberto Barbano et.al.	2408.07079	link
2024-08-13	Approaches for enhancing extrapolability in process-based and data-driven models in hydrology	Haiyang Shi et.al.	2408.07071	null
2024-08-20	Spectrum Prediction With Deep 3D Pyramid Vision Transformer Learning	Guangliang Pan et.al.	2408.06870	link
2024-08-12	InfLocNet: Enhanced Lung Infection Localization and Disease Detection from Chest X-Ray Images Using Lightweight Deep Learning	Md. Asiful Islam Miah et.al.	2408.06459	null
2024-08-12	Wireless Channel Aware Data Augmentation Methods for Deep Learning-Based Indoor Localization	Omer Gokalp Serbetci et.al.	2408.06452	null
2024-08-12	Transfer learning of state-based potential games for process optimization in decentralized manufacturing systems	Steve Yuwono et.al.	2408.05992	null
2024-08-09	ECG-FM: An Open Electrocardiogram Foundation Model	Kaden McKeen et.al.	2408.05178	link
2024-08-08	Segmentation of Mental Foramen in Orthopantomographs: A Deep Learning Approach	Haider Raza et.al.	2408.04763	null
2024-08-08	Hybrid Quantum-Classical Neural Networks for Downlink Beamforming Optimization	Juping Zhang et.al.	2408.04747	null
2024-08-08	Modelling parametric uncertainty in PDEs models via Physics-Informed Neural Networks	Milad Panahi et.al.	2408.04690	null
2024-08-08	Model-Based Transfer Learning for Contextual Reinforcement Learning	Jung-Hoon Cho et.al.	2408.04498	link
2024-08-08	Deep Transfer Learning for Kidney Cancer Diagnosis	Yassine Habchi et.al.	2408.04318	null
2024-08-07	Scaling Law of Sim2Real Transfer Learning in Expanding Computational Materials Databases for Real-World Predictions	Shunya Minami et.al.	2408.04042	null
2024-08-06	An Interactive Augmented Reality Interface for Personalized Proxemics Modeling	Massimiliano Nigro et.al.	2408.03453	null
2024-08-05	Quantum Transfer Learning for MNIST Classification Using a Hybrid Quantum-Classical Approach	Soumyadip Sarkar et.al.	2408.03351	null
2024-08-06	LLaVA-OneVision: Easy Visual Task Transfer	Bo Li et.al.	2408.03326	link
2024-08-06	Segment Anything in Medical Images and Videos: Benchmark and Deployment	Jun Ma et.al.	2408.03322	link
2024-08-06	Fast Whole-Brain MR Multi-Parametric Mapping with Scan-Specific Self-Supervised Networks	Amir Heydari et.al.	2408.02988	null
2024-08-05	FPT+: A Parameter and Memory Efficient Transfer Learning Method for High-resolution Medical Image Classification	Yijin Huang et.al.	2408.02426	link
2024-08-05	FE-Adapter: Adapting Image-based Emotion Classifiers to Videos	Shreyank N Gowda et.al.	2408.02421	null
2024-08-05	Dialogue Ontology Relation Extraction via Constrained Chain-of-Thought Decoding	Renato Vukovic et.al.	2408.02361	null
2024-08-05	Machine Learning Applications in Medical Prognostics: A Comprehensive Review	Michael Fascia et.al.	2408.02344	null
2024-08-05	Synergistic Learning with Multi-Task DeepONet for Efficient PDE Problem Solving	Varun Kumar et.al.	2408.02198	link
2024-08-04	Graph-Enabled Fast MCMC Sampling with an Unknown High-Dimensional Prior Distribution	Chenyang Zhong et.al.	2408.02122	link
2024-08-04	DNA-SE: Towards Deep Neural-Nets Assisted Semiparametric Estimation	Qinshuo Liu et.al.	2408.02045	link
2024-08-04	Unsupervised Representation Learning by Balanced Self Attention Matching	Daniel Shalam et.al.	2408.02014	link
2024-08-04	AdaCBM: An Adaptive Concept Bottleneck Model for Explainable and Accurate Diagnosis	Townim F. Chowdhury et.al.	2408.02001	link
2024-08-06	Sharpness-Aware Cross-Domain Recommendation to Cold-Start Users	Guohang Zeng et.al.	2408.01931	null
2024-08-02	PiCoGen2: Piano cover generation with transfer learning approach and weakly aligned data	Chih-Pin Tan et.al.	2408.01551	null
2024-08-02	Analyzing LLMs’ Capabilities to Establish Implicit User Sentiment of Software Desirability	Sherri Weitl-Harms et.al.	2408.01527	null
2024-08-02	IAI Group at CheckThat! 2024: Transformer Models and Data Augmentation for Checkworthy Claim Detection	Peter Røysland Aarnes et.al.	2408.01118	link
2024-08-08	Cross-domain Named Entity Recognition via Graph Matching	Junhao Zheng et.al.	2408.00981	null
2024-08-01	A deep learning-enabled smart garment for versatile sleep behaviour monitoring	Chenyu Tang et.al.	2408.00753	null
2024-08-01	Accelerating Full Waveform Inversion By Transfer Learning	Divya Shyam Singh et.al.	2408.00695	null
2024-08-03	Scaling Backwards: Minimal Synthetic Pre-training?	Ryo Nakamura et.al.	2408.00677	link
2024-08-01	Efficient Patient Fine-Tuned Seizure Detection with a Tensor Kernel Machine	Seline J. S. de Rooij et.al.	2408.00437	null
2024-08-01	Provably Efficient Adiabatic Learning for Quantum-Classical Dynamics	Changnan Peng et.al.	2408.00276	null
2024-07-31	Leveraging Self-Supervised Learning for Fetal Cardiac Planes Classification using Ultrasound Scan Videos	Joseph Geo Benjamin et.al.	2407.21738	null
2024-07-31	Shape-restricted transfer learning analysis for generalized linear regression model	Pengfei Li et.al.	2407.21682	null
2024-07-31	An Explainable Vision Transformer with Transfer Learning Combined with Support Vector Machine Based Efficient Drought Stress Identification	Aswini Kumar Patra et.al.	2407.21666	null
2024-07-31	Accurate Tunneling Splittings for Ever-Larger Molecules from Transfer-Learned, CCSD(T) Quality Energy Functions	Silvan Käser et.al.	2407.21366	null
2024-07-30	Domain Shift Analysis in Chest Radiographs Classification in a Veterans Healthcare Administration Population	Mayanka Chandrashekar et.al.	2407.21149	null
2024-07-30	Transfer Learning for Multi-material Classification of Transition Metal Dichalcogenides with Atomic Force Microscopy	Isaiah A. Moses et.al.	2407.20975	null
2024-07-30	Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning	Norman Di Palo et.al.	2407.20798	null
2024-07-30	Image-based Detection of Segment Misalignment in Multi-mirror Satellites using Transfer Learning	C. Tanner Fredieu et.al.	2407.20582	null
2024-07-30	DuA: Dual Attentive Transformer in Long-Term Continuous EEG Emotion Analysis	Yue Pan et.al.	2407.20519	null
2024-07-26	Robust and Efficient Transfer Learning via Supernet Transfer in Warm-started Neural Architecture Search	Prabhant Singh et.al.	2407.20279	null
2024-07-29	Enhancing Anti-spoofing Countermeasures Robustness through Joint Optimization and Transfer Learning	Yikang Wang et.al.	2407.20111	null
2024-07-29	Transfer Learning Targeting Mixed Population: A Distributional Robust Perspective	Keyao Zhan et.al.	2407.20073	null
2024-07-29	ProRuka: A highly efficient HMI algorithm for controlling a novel prosthetic hand with 6-DOF using sonomyography	Vaheh Nazari et.al.	2407.19859	null
2024-07-29	Online Multi-Source Domain Adaptation through Gaussian Mixtures and Dataset Dictionary Learning	Eduardo Fernandes Montesuma et.al.	2407.19853	null
2024-07-29	Unmasking unlearnable models: a classification challenge for biomedical images without visible cues	Shivam Kumar et.al.	2407.19773	null
2024-07-28	Deep Generative Models-Assisted Automated Labeling for Electron Microscopy Images Segmentation	Wenhao Yuan et.al.	2407.19544	link
2024-07-25	Adapting Mouse Pathological Model to Human Glomerular Lesion Segmentation	Lining Yu et.al.	2407.18390	null
2024-07-25	Detection of manatee vocalisations using the Audio Spectrogram Transformer	Stefano Schiappacasse et.al.	2407.18083	link
2024-07-25	Difficulty Estimation and Simplification of French Text Using LLMs	Henri Jamet et.al.	2407.18061	null
2024-07-26	Exploring the Effect of Dataset Diversity in Self-Supervised Learning for Surgical Computer Vision	Tim J. M. Jaspers et.al.	2407.17904	link
2024-07-25	Advancing 3D Point Cloud Understanding through Deep Transfer Learning: A Comprehensive Survey	Shahab Saquib Sohail et.al.	2407.17877	null
2024-07-25	Innovative Speech-Based Deep Learning Approaches for Parkinson’s Disease Classification: A Systematic Review	Lisanne van Gelderen et.al.	2407.17844	null
2024-07-25	How Lightweight Can A Vision Transformer Be	Jen Hong Tan et.al.	2407.17783	null
2024-07-24	Traditional Methods Outperform Generative LLMs at Forecasting Credit Ratings	Felix Drinkall et.al.	2407.17624	link
2024-07-24	Wavelet-based Autoencoder and EfficientNet for Schizophrenia Detection from EEG Signals	Umesh Kumar Naik M et.al.	2407.17540	null
2024-07-24	Federated Automatic Latent Variable Selection in Multi-output Gaussian Processes	Jingyi Gao et.al.	2407.16935	null
2024-07-24	Cross-Domain Policy Transfer by Representation Alignment via Multi-Domain Behavioral Cloning	Hayato Watahiki et.al.	2407.16912	link
2024-07-23	AbdomenAtlas: A Large-Scale, Detailed-Annotated, & Multi-Center Dataset for Efficient Transfer Learning and Open Algorithmic Benchmarking	Wenxuan Li et.al.	2407.16697	link
2024-07-23	Towards scalable efficient on-device ASR with transfer learning	Laxmi Pandey et.al.	2407.16664	null
2024-07-23	EffiSegNet: Gastrointestinal Polyp Segmentation through a Pre-Trained EfficientNet-based Network with a Simplified Decoder	Ioannis A. Vezakis et.al.	2407.16298	link
2024-07-23	Exploring the Effectiveness and Consistency of Task Selection in Intermediate-Task Transfer Learning	Pin-Jie Lin et.al.	2407.16245	link
2024-07-23	ODGR: Online Dynamic Goal Recognition	Matan Shamir et.al.	2407.16220	null
2024-07-20	Enhancing Wildfire Forecasting Through Multisource Spatio-Temporal Data, Deep Learning, Ensemble Models and Transfer Learning	Ayoub Jadouli et.al.	2407.15878	null
2024-07-22	Reconstructing Training Data From Real World Models Trained with Transfer Learning	Yakir Oz et.al.	2407.15845	null
2024-07-22	TreeSBA: Tree-Transformer for Self-Supervised Sequential Brick Assembly	Mengqi Guo et.al.	2407.15648	link
2024-07-22	Affordance Labeling and Exploration: A Manifold-Based Approach	İsmail Özçil et.al.	2407.15479	null
2024-07-21	Practical multi-fidelity machine learning: fusion of deterministic and Bayesian models	Jiaxiang Yi et.al.	2407.15110	link
2024-07-20	Enhancing Skin Disease Classification Leveraging Transformer-based Deep Learning Architectures and Explainable AI	Jayanth Mohan et.al.	2407.14757	null
2024-07-19	A Comparative Study of Transfer Learning for Emotion Recognition using CNN and Modified VGG16 Models	Samay Nathani et.al.	2407.14576	null
2024-07-22	Vision-Based Power Line Cables and Pylons Detection for Low Flying Aircrafts	Jakub Gwizdała et.al.	2407.14352	null
2024-07-19	Quantifying the value of positive transfer: An experimental case study	Aidan J. Hughes et.al.	2407.14342	null
2024-07-19	Straightforward Layer-wise Pruning for More Efficient Visual Adaptation	Ruizi Han et.al.	2407.14330	null
2024-07-23	Dyn-Adapter: Towards Disentangled Representation for Efficient Visual Recognition	Yurong Zhang et.al.	2407.14302	null
2024-07-19	Enhancing Data-Limited Graph Neural Networks by Actively Distilling Knowledge from Large Language Models	Quan Li et.al.	2407.13989	null
2024-07-18	PowerTrain: Fast, Generalizable Time and Power Prediction Models to Optimize DNN Training on Accelerated Edges	Prashanthi S. K. et.al.	2407.13944	null
2024-07-18	Semi-Supervised Contrastive Learning of Musical Representations	Julien Guinot et.al.	2407.13840	link
2024-07-18	AROhI: An Interactive Tool for Estimating ROI of Data Analytics	Noopur Zambar et.al.	2407.13839	null
2024-07-18	Are We Ready for Out-of-Distribution Detection in Digital Pathology?	Ji-Hun Oh et.al.	2407.13708	null
2024-07-17	On Initializing Transformers with Pre-trained Embeddings	Ha Young Kim et.al.	2407.12514	null
2024-07-16	Novel Artistic Scene-Centric Datasets for Effective Transfer Learning in Fragrant Spaces	Shumei Liu et.al.	2407.11701	null
2024-07-16	Green Resource Allocation in Cloud-Native O-RAN Enabled Small Cell Networks	Rana M. Sohaib et.al.	2407.11563	null
2024-07-16	Genomic Language Models: Opportunities and Challenges	Gonzalo Benegas et.al.	2407.11435	null
2024-07-16	MRIo3DS-Net: A Mutually Reinforcing Images to 3D Surface RNN-like framework for model-adaptation indoor 3D reconstruction	Chang Li et.al.	2407.11431	null
2024-07-16	Exploring connections of spectral analysis and transfer learning in medical imaging	Yucheng Lu et.al.	2407.11379	null
2024-07-19	LoRA-PT: Low-Rank Adapting UNETR for Hippocampus Segmentation Using Principal Tensor Singular Values and Vectors	Guanghua He et.al.	2407.11292	link
2024-07-15	Exploration in Knowledge Transfer Utilizing Reinforcement Learning	Adam Jedlička et.al.	2407.10835	null
2024-07-15	Detecting Omissions in Geographic Maps through Computer Vision	Phuc D. A. Nguyen et.al.	2407.10709	link
2024-07-15	Deep-Learning-Based Markerless Pose Estimation Systems in Gait Analysis: DeepLabCut Custom Training and the Refinement Function	Giulia Panconi et.al.	2407.10590	null
2024-07-13	Automated detection of gibbon calls from passive acoustic monitoring data using convolutional neural networks in the “torch for R” ecosystem	Dena J. Clink et.al.	2407.09976	null
2024-07-11	Improve Load Forecasting in Energy Communities through Transfer Learning using Open-Access Synthetic Profiles	Lukas Moosbrugger et.al.	2407.08434	null
2024-07-11	A Cantor-Kantorovich Metric Between Markov Decision Processes with Application to Transfer Learning	Adrien Banse et.al.	2407.08324	null
2024-07-11	AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization	Shixiong Xu et.al.	2407.08156	link
2024-07-10	Prediction of Frequency-Dependent Optical Spectrum for Solid Materials: A Multi-Output & Multi-Fidelity Machine Learning Approach	Akram Ibrahim et.al.	2407.07736	null
2024-07-10	SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning	Haiwen Diao et.al.	2407.07523	link
2024-07-10	Fine-Grained Classification for Poisonous Fungi Identification with Transfer Learning	Christopher Chiu et.al.	2407.07492	link
2024-07-10	Towards a text-based quantitative and explainable histopathology image analysis	Anh Tien Nguyen et.al.	2407.07360	link
2024-07-09	Estimating centrality in heavy-ion collisions using Transfer Learning technique	Dipankar Basak et.al.	2407.07210	null
2024-07-09	Statistical mechanics of transfer learning in fully-connected networks in the proportional limit	Alessandro Ingrosso et.al.	2407.07168	null
2024-07-14	Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer: A Disentangled Approach	Taolin Zhang et.al.	2407.06964	null
2024-07-09	Spanish TrOCR: Leveraging Transfer Learning for Language Adaptation	Filipe Lauar et.al.	2407.06950	link
2024-07-09	Rethinking Image-to-Video Adaptation: An Object-centric Perspective	Rui Qian et.al.	2407.06871	null
2024-07-09	Robust and Explainable Framework to Address Data Scarcity in Diagnostic Imaging	Zehui Zhao et.al.	2407.06566	null
2024-07-09	Using Graph Neural Networks and Frequency Domain Data for Automated Operational Modal Analysis of Populations of Structures	Xudong Jian et.al.	2407.06492	link
2024-07-09	CrowdTransfer: Enabling Crowd Knowledge Transfer in AIoT Community	Yan Liu et.al.	2407.06485	null
2024-07-08	Multi-Label Plant Species Classification with Self-Supervised Vision Transformers	Murilo Gustineli et.al.	2407.06298	link
2024-07-08	Transfer Learning with Pseudo Multi-Label Birdcall Classification for DS@GT BirdCLEF 2024	Anthony Miyaguchi et.al.	2407.06291	link
2024-07-08	Transfer Learning with Self-Supervised Vision Transformers for Snake Identification	Anthony Miyaguchi et.al.	2407.06178	link
2024-07-08	Multi-Fidelity Bayesian Neural Network for Uncertainty Quantification in Transonic Aerodynamic Loads	Andrea Vaiuso et.al.	2407.05684	null
2024-07-08	An Experimental Comparison of Transfer Learning against Self-supervised Learning	Zehui Zhao et.al.	2407.05592	null
2024-07-09	CBM: Curriculum by Masking	Andrei Jarca et.al.	2407.05193	link
2024-07-06	Recent Advancements and Challenges of Turkic Central Asian Language Processing	Yana Veitsman et.al.	2407.05006	null
2024-07-05	Improving Knowledge Distillation in Transfer Learning with Layer-wise Learning Rates	Shirley Kokane et.al.	2407.04871	null
2024-07-05	TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR	Shashi Kumar et.al.	2407.04444	null
2024-07-05	Understanding the Role of Invariance in Transfer Learning	Till Speicher et.al.	2407.04325	link
2024-07-05	Graph Pooling via Ricci Flow	Amy Feng et.al.	2407.04236	null
2024-07-08	A Computer Vision Approach to Estimate the Localized Sea State	Aleksandar Vorkapic et.al.	2407.03755	null
2024-07-04	On-Device Training Empowered Transfer Learning For Human Activity Recognition	Pixi Kang et.al.	2407.03644	null
2024-07-03	Iris and Palmprint Multimodal Biometric Recognition using Novel Preactivated Inverted ResNet and Hybrid Metaheuristic Optimized DenseNet	Indu Singh et.al.	2407.03498	null
2024-07-03	DACB-Net: Dual Attention Guided Compact Bilinear Convolution Neural Network for Skin Disease Classification	Belal Ahmad et.al.	2407.03439	null
2024-07-03	Artificial Inductive Bias for Synthetic Tabular Data Generation in Data-Scarce Scenarios	Patricia A. Apellániz et.al.	2407.03080	link
2024-07-02	MomentsNeRF: Leveraging Orthogonal Moments for Few-Shot Neural Rendering	Ahmad AlMughrabi et.al.	2407.02668	null
2024-07-02	ECAT: A Entire space Continual and Adaptive Transfer Learning Framework for Cross-Domain Recommendation	Chaoqun Hou et.al.	2407.02542	null
2024-07-02	AXIAL: Attention-based eXplainability for Interpretable Alzheimer’s Localized Diagnosis using 2D CNNs on 3D MRI brain scans	Gabriele Lozupone et.al.	2407.02418	link
2024-07-03	MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing	Shangda Wu et.al.	2407.02277	link
2024-07-02	MIREncoder: Multi-modal IR-based Pretrained Embeddings for Performance Optimizations	Akash Dutta et.al.	2407.02238	null
2024-07-02	Towards Training Music Taggers on Synthetic Data	Nadine Kroher et.al.	2407.02156	link
2024-07-01	Deepfake Audio Detection Using Spectrogram-based Feature and Ensemble of Deep Learning Models	Lam Pham et.al.	2407.01777	null
2024-06-30	A Deep Generative Framework for Joint Households and Individuals Population Synthesis	Xiao Qian et.al.	2407.01643	null
2024-07-01	Bridging the Gap: Transfer Learning from English PLMs to Malaysian English	Mohan Raj Chanthran et.al.	2407.01374	null
2024-07-01	M$^2$IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension	Xuyang Liu et.al.	2407.01131	null
2024-07-01	Cross-Lingual Transfer Learning for Speech Translation	Rao Ma et.al.	2407.01130	null
2024-07-01	Deep Image-to-Recipe Translation	Jiangqin Ma et.al.	2407.00911	link
2024-06-30	Image Classification for Snow Detection to Improve Pedestrian Safety	Ricardo de Deijn et.al.	2407.00818	null
2024-06-30	Establishing Deep InfoMax as an effective self-supervised learning methodology in materials informatics	Michael Moran et.al.	2407.00671	link
2024-06-30	LegalTurk Optimized BERT for Multi-Label Text Classification and NER	Farnaz Zeidi et.al.	2407.00648	null
2024-06-29	Resource Allocation and Secure Wireless Communication in the Large Model-based Mobile Edge Computing System	Zefan Wang et.al.	2407.00347	null
2024-06-28	Minimax And Adaptive Transfer Learning for Nonparametric Classification under Distributed Differential Privacy Constraints	Arnab Auddy et.al.	2406.20088	null
2024-06-28	Malaria Cell Detection Using Deep Neural Networks	Saurabh Sawant et.al.	2406.20005	null
2024-06-28	Fine-tuning of Geospatial Foundation Models for Aboveground Biomass Estimation	Michal Muszynski et.al.	2406.19888	null
2024-06-27	T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings	Björn Deiseroth et.al.	2406.19223	link
2024-06-27	Towards Learning Abductive Reasoning using VSA Distributed Representations	Giacomo Camposampiero et.al.	2406.19121	link
2024-07-01	RouteLLM: Learning to Route LLMs with Preference Data	Isaac Ong et.al.	2406.18665	link
2024-07-01	VIPriors 4: Visual Inductive Priors for Data-Efficient Deep Learning Challenges	Robert-Jan Bruintjes et.al.	2406.18176	null
2024-06-25	LABOR-LLM: Language-Based Occupational Representations with Large Language Models	Tianyu Du et.al.	2406.17972	null
2024-06-25	Transfer Learning for High Dimensional Robust Regression	Xiaohui Yuan et.al.	2406.17567	null
2024-06-25	Leveraging Parameter-Efficient Transfer Learning for Multi-Lingual Text-to-Speech Adaptation	Yingting Li et.al.	2406.17257	null
2024-06-24	Convolutional neural network for Lyman break galaxies classification and redshift regression in DESI (Dark Energy Spectroscopic Instrument)	Julien Taran et.al.	2406.16730	null
2024-06-24	Robust NLoS Localization in 5G mmWave Networks: Data-based Methods and Performance	Roman Klus et.al.	2406.16519	null
2024-06-23	Accelerating Matrix Diagonalization through Decision Transformers with Epsilon-Greedy Optimization	Kshitij Bhatta et.al.	2406.16191	null
2024-06-23	Evaluation and Comparison of Emotionally Evocative Image Augmentation Methods	Jan Ignatowicz et.al.	2406.16187	null
2024-06-23	Federated Transfer Learning Aided Interference Classification in GNSS Signals	Min Jiang et.al.	2406.16102	null
2024-06-22	Bone Fracture Classification using Transfer Learning	Shyam Gupta et.al.	2406.15958	link
2024-06-21	Flat Posterior Does Matter For Bayesian Transfer Learning	Sungjun Lim et.al.	2406.15664	link
2024-06-21	GOAL: A Generalist Combinatorial Optimization Agent Learner	Darko Drakulic et.al.	2406.15079	link
2024-06-20	Depth $F_1$: Improving Evaluation of Cross-Domain Text Classification by Measuring Semantic Generalizability	Parker Seegmiller et.al.	2406.14695	link
2024-06-19	Modeling & Evaluating the Performance of Convolutional Neural Networks for Classifying Steel Surface Defects	Nadeem Jabbar Chaudhry et.al.	2406.14583	null
2024-06-20	Robust Few-shot Transfer Learning for Knowledge Base Question Answering with Unanswerable Questions	Riya Sawhney et.al.	2406.14313	null
2024-06-20	Multi-modal Transfer Learning between Biological Foundation Models	Juan Jose Garau-Luis et.al.	2406.14150	null
2024-06-21	Information Guided Regularization for Fine-tuning Language Models	Mandar Sharma et.al.	2406.14005	link
2024-06-20	Generalization error of min-norm interpolators in transfer learning	Yanke Song et.al.	2406.13944	null
2024-06-20	Semi-supervised Regression Analysis with Model Misspecification and High-dimensional Data	Ye Tian et.al.	2406.13906	null
2024-06-19	Neuro-symbolic Training for Reasoning over Spatial Language	Tanawan Premsri et.al.	2406.13828	link
2024-06-19	CNN Based Flank Predictor for Quadruped Animal Species	Vanessa Suessle et.al.	2406.13588	null
2024-06-19	Robust Melanoma Thickness Prediction via Deep Transfer Learning enhanced by XAI Techniques	Miguel Nogales et.al.	2406.13441	null
2024-06-19	Representation Transfer Learning for Semiparametric Regression	Baihua He et.al.	2406.13197	null
2024-06-19	Optimal pre-train/fine-tune strategies for accurate material property predictions	Reshma Devi et.al.	2406.13142	link
2024-06-18	Skin Cancer Images Classification using Transfer Learning Techniques	Md Sirajul Islam et.al.	2406.12954	null
2024-06-18	Latent Intuitive Physics: Learning to Transfer Hidden Physics from A 3D Video	Xiangming Zhu et.al.	2406.12769	null
2024-06-18	BIOSCAN-5M: A Multimodal Dataset for Insect Biodiversity	Zahra Gharaee et.al.	2406.12723	link
2024-06-18	Online-Adaptive Anomaly Detection for Defect Identification in Aircraft Assembly	Siddhant Shete et.al.	2406.12698	null
2024-06-18	Spatial Sequence Attention Network for Schizophrenia Classification from Structural Brain MR Images	Nagur Shareef Shaik et.al.	2406.12683	null
2024-06-18	Automated MRI Quality Assessment of Brain T1-weighted MRI in Clinical Data Warehouses: A Transfer Learning Approach Relying on Artefact Simulation	Sophie Loizillon et.al.	2406.12448	link
2024-06-18	The Wisdom of a Crowd of Brains: A Universal Brain Encoder	Roman Beliy et.al.	2406.12179	null
2024-06-17	UniGLM: Training One Unified Language Model for Text-Attributed Graphs	Yi Fang et.al.	2406.12052	link
2024-06-17	Large Scale Transfer Learning for Tabular Data via Language Modeling	Josh Gardner et.al.	2406.12031	link
2024-06-15	A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges	Yuqi Nie et.al.	2406.11903	null
2024-06-17	Faces of Experimental Pain: Transferability of Deep Learned Heat Pain Features to Electrical Pain	Pooja Prajod et.al.	2406.11808	null
2024-06-16	A Unified View of Abstract Visual Reasoning Problems	Mikołaj Małkiński et.al.	2406.11068	null
2024-06-16	Generalization and Knowledge Transfer in Abstract Visual Reasoning Models	Mikołaj Małkiński et.al.	2406.11061	null
2024-06-16	Physics-Informed Deep Learning and Partial Transfer Learning for Bearing Fault Diagnosis in the Presence of Highly Missing Data	Mohammadreza Kavianpour et.al.	2406.11023	null
2024-06-16	ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers under Domain Shifts	Samar Khanna et.al.	2406.10973	null
2024-06-16	On the Effectiveness of Supervision in Asymmetric Non-Contrastive Learning	Jeongheon Oh et.al.	2406.10815	link
2024-06-16	ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation	Yurun Song et.al.	2406.10785	link
2024-06-18	Augmenting Biomedical Named Entity Recognition with General-domain Resources	Yu Yin et.al.	2406.10671	link
2024-06-15	ADSNet: Cross-Domain LTV Prediction with an Adaptive Siamese Network in Advertising	Ruize Wang et.al.	2406.10517	null
2024-06-14	Comparison of fine-tuning strategies for transfer learning in medical image classification	Ana Davila et.al.	2406.10050	null
2024-06-14	Deep Learning Models to Automate the Scoring of Hand Radiographs for Rheumatoid Arthritis	Zhiyan Bo et.al.	2406.09980	null
2024-06-17	UniBridge: A Unified Approach to Cross-Lingual Transfer Learning for Low-Resource Languages	Trinh Pham et.al.	2406.09717	link
2024-06-14	RASPNet: A Benchmark Dataset for Radar Adaptive Signal Processing Applications	Shyam Venkatasubramanian et.al.	2406.09638	null
2024-06-14	Industrial Language-Image Dataset (ILID): Adapting Vision Foundation Models for Industrial Settings	Keno Moenck et.al.	2406.09637	link
2024-06-13	Q-Mamba: On First Exploration of Vision Mamba for Image Quality Assessment	Fengbin Guan et.al.	2406.09546	link
2024-06-12	Quantum Hardware-Enabled Molecular Dynamics via Transfer Learning	Abid Khan et.al.	2406.08554	null
2024-06-12	Strategies for Pretraining Neural Operators	Anthony Zhou et.al.	2406.08473	link
2024-06-12	PRIBOOT: A New Data-Driven Expert for Improved Driving Simulations	Daniel Coelho et.al.	2406.08421	link
2024-06-12	Measuring model variability using robust non-parametric testing	Sinjini Banerjee et.al.	2406.08307	null
2024-06-12	Beyond the Mean: Differentially Private Prototypes for Private Transfer Learning	Dariush Wahdany et.al.	2406.08039	null
2024-06-11	Unleashing the Power of Transfer Learning Model for Sophisticated Insect Detection: Revolutionizing Insect Classification	Md. Mahmudul Hasan et.al.	2406.07716	null
2024-06-11	Transferring Knowledge from Large Foundation Models to Small Downstream Models	Shikai Qiu et.al.	2406.07337	null
2024-06-10	SecureNet: A Comparative Study of DeBERTa and Large Language Models for Phishing Detection	Sakshi Mahendru et.al.	2406.06663	null
2024-06-10	Network-Based Transfer Learning Helps Improve Short-Term Crime Prediction Accuracy	Jiahui Wu et.al.	2406.06645	null
2024-06-10	Contrastive learning of T cell receptor representations	Yuta Nagano et.al.	2406.06397	link
2024-06-09	Few-Shot Load Forecasting Under Data Scarcity in Smart Grids: A Meta-Learning Approach	Georgios Tsoumplekas et.al.	2406.05887	null
2024-06-09	Utilizing Grounded SAM for self-supervised frugal camouflaged human detection	Matthias Pijarowski et.al.	2406.05776	null
2024-06-11	MSAGPT: Neural Prompting Protein Structure Prediction via MSA Generative Pre-Training	Bo Chen et.al.	2406.05347	link
2024-06-08	Hidden Question Representations Tell Non-Factuality Within and Across Large Language Models	Yanling Wang et.al.	2406.05328	null
2024-06-08	DeviceBERT: Applied Transfer Learning With Targeted Annotations and Vocabulary Enrichment to Identify Medical Device and Component Terminology in FDA Recall Summaries	Miriam Farrington et.al.	2406.05307	null
2024-06-07	Accelerating evolutionary exploration through language model-based transfer learning	Maximilian Reissmann et.al.	2406.05166	null
2024-06-07	Labeled Data Selection for Category Discovery	Bingchen Zhao et.al.	2406.04898	null
2024-06-07	FunBO: Discovering Acquisition Functions for Bayesian Optimization with FunSearch	Virginia Aglietti et.al.	2406.04824	null
2024-06-07	Low-Resource Cross-Lingual Summarization through Few-Shot Learning with Large Language Models	Gyutae Park et.al.	2406.04630	null
2024-06-06	InaGVAD : a Challenging French TV and Radio Corpus Annotated for Speech Activity Detection and Speaker Gender Segmentation	David Doukhan et.al.	2406.04429	link
2024-06-06	UrbanSARFloods: Sentinel-1 SLC-Based Benchmark Dataset for Urban and Open-Area Flood Mapping	Jie Zhao et.al.	2406.04111	null
2024-06-06	Optimizing Multi-User Semantic Communication via Transfer Learning and Knowledge Distillation	Loc X. Nguyen et.al.	2406.03773	null
2024-06-06	LLMEmbed: Rethinking Lightweight LLM’s Genuine Function in Text Classification	Chun Liu et.al.	2406.03725	link
2024-06-06	Transfer Learning for Latent Variable Network Models	Akhil Jalan et.al.	2406.03437	null
2024-06-08	Randomized Geometric Algebra Methods for Convex Neural Networks	Yifei Wang et.al.	2406.02806	link
2024-06-04	CADE: Cosine Annealing Differential Evolution for Spiking Neural Network	Runhua Jiang et.al.	2406.02349	link
2024-06-04	Towards Neural Architecture Search for Transfer Learning in 6G Networks	Adam Orucu et.al.	2406.02333	null
2024-06-04	M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation	Daisuke Niizumi et.al.	2406.02032	link
2024-06-04	Enhancing Trust in LLMs: Algorithms for Comparing and Interpreting LLMs	Nik Bear Brown et.al.	2406.01943	null
2024-06-03	Multi-Agent Transfer Learning via Temporal Contrastive Learning	Weihao Zeng et.al.	2406.01377	null
2024-06-04	Towards Practical Single-shot Motion Synthesis	Konstantinos Roditakis et.al.	2406.01136	null
2024-06-03	Understanding the Cross-Domain Capabilities of Video-Based Few-Shot Action Recognition Models	Georgia Markham et.al.	2406.01073	null
2024-06-03	Satellites swarm cooperation for pursuit-attachment tasks with transformer-based reinforcement learning	Yonghao Li et.al.	2406.01061	null
2024-06-02	Phonetic Error Analysis of Raw Waveform Acoustic Models with Parametric and Non-Parametric CNNs	Erfan Loweimi et.al.	2406.00898	null
2024-06-02	Using 3-D LiDAR Data for Safe Physical Human-Robot Interaction	Sarthak Arora et.al.	2406.00869	null
2024-06-06	Diffusion Tuning: Transferring Diffusion Models via Chain of Forgetting	Jincheng Zhong et.al.	2406.00773	null
2024-06-05	Profiled Transfer Learning for High Dimensional Linear Model	Ziqian Lin et.al.	2406.00701	null
2024-05-29	On the Condition Monitoring of Bolted Joints through Acoustic Emission and Deep Transfer Learning: Generalization, Ordinal Loss and Super-Convergence	Emmanuel Ramasso et.al.	2405.20887	null
2024-05-30	Learning 3D Robotics Perception using Inductive Priors	Muhammad Zubair Irshad et.al.	2405.20364	null
2024-05-30	Who Writes the Review, Human or AI?	Panagiotis C. Theocharopoulos et.al.	2405.20285	null
2024-05-30	Image-to-Joint Inverse Kinematic of a Supportive Continuum Arm Using Deep Learning	Shayan Sepahvand et.al.	2405.20248	null
2024-05-30	Federated and Transfer Learning for Cancer Detection Based on Image Analysis	Amine Bechar et.al.	2405.20126	null
2024-05-30	Chemical Space-Informed Machine Learning Models for Rapid Predictions of X-ray Photoelectron Spectra of Organic Molecules	Susmita Tripathy et.al.	2405.20033	link
2024-05-30	Breaking Indistinguishability with Transfer Learning: A First Look at SPECK32/64 Lightweight Block Ciphers	Jimmy Dani et.al.	2405.19683	null
2024-05-30	Few-shot fault diagnosis based on multi-scale graph convolution filtering for industry	Mengjie Gan et.al.	2405.19642	null
2024-05-30	Source Code Foundation Models are Transferable Binary Analysis Knowledge Bases	Zian Su et.al.	2405.19581	link
2024-05-29	MDS-ViTNet: Improving saliency prediction for Eye-Tracking with Vision Transformer	Polezhaev Ignat et.al.	2405.19501	link
2024-05-29	RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter	Meng Cao et.al.	2405.19465	null
2024-05-29	Domain adaptation in small-scale and heterogeneous biological datasets	Seyedmehdi Orouji et.al.	2405.19221	null
2024-05-28	Recent Advances of Foundation Language Models-based Continual Learning: A Survey	Yutao Yang et.al.	2405.18653	null
2024-05-28	Transfer Learning for Emulating Ocean Climate Variability across $CO_2$ forcing	Surya Dheeshjith et.al.	2405.18585	null
2024-05-28	Deep Learning-based Epicenter Localization using Single-Station Strong Motion Records	Melek Türkmen et.al.	2405.18451	null
2024-05-28	Adaptive Multiscale Retinal Diagnosis: A Hybrid Trio-Model Approach for Comprehensive Fundus Multi-Disease Detection Leveraging Transfer Learning and Siamese Networks	Yavuz Selim Inan et.al.	2405.18449	null
2024-05-28	A Review and Implementation of Object Detection Models and Optimizations for Real-time Medical Mask Detection during the COVID-19 Pandemic	Ioanna Gogou et.al.	2405.18387	link
2024-05-28	An adaptive transfer learning perspective on classification in non-stationary environments	Henry W J Reeve et.al.	2405.18091	null
2024-05-28	A Survey of Latent Factor Models in Recommender Systems	Hind I. Alshbanat et.al.	2405.18068	null
2024-05-28	MultiADE: A Multi-domain Benchmark for Adverse Drug Event Extraction	Xiang Dai et.al.	2405.18015	null
2024-05-28	Self-supervised Pre-training for Transferable Multi-modal Perception	Xiaohao Xu et.al.	2405.17942	link
2024-05-28	Cost-Sensitive Multi-Fidelity Bayesian Optimization with Transfer of Learning Curve Extrapolation	Dong Bok Lee et.al.	2405.17918	null
2024-05-28	Gradually Vanishing Gap in Prototypical Network for Unsupervised Domain Adaptation	Shanshan Wang et.al.	2405.17774	null
2024-05-27	Flow control of three-dimensional cylinders transitioning to turbulence via multi-agent reinforcement learning	P. Suárez et.al.	2405.17210	null
2024-05-27	Harnessing the Power of Vicinity-Informed Analysis for Classification under Covariate Shift	Mitsuhiro Fujikawa et.al.	2405.16906	null
2024-05-28	Transfer Learning for Diffusion Models	Yidong Ouyang et.al.	2405.16876	null
2024-05-27	Enhancing Accuracy in Generative Models via Knowledge Transfer	Xinyu Tian et.al.	2405.16837	null
2024-05-27	Dual-State Personalized Knowledge Tracing with Emotional Incorporation	Shanshan Wang et.al.	2405.16799	null
2024-05-26	Transfer Learning Under High-Dimensional Graph Convolutional Regression Model for Node Classification	Jiachen Chen et.al.	2405.16672	null
2024-05-26	Mixture of Experts Using Tensor Products	Zhan Su et.al.	2405.16671	link
2024-05-26	Acceleration of Grokking in Learning Arithmetic Operations via Kolmogorov-Arnold Representation	Yeachan Park et.al.	2405.16658	null
2024-05-26	From Macro to Micro: Boosting micro-expression recognition via pre-training on macro-expression videos	Hanting Li et.al.	2405.16451	null
2024-05-26	Daily Physical Activity Monitoring – Adaptive Learning from Multi-source Motion Sensor Data	Haoting Zhang et.al.	2405.16395	null
2024-05-25	LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters	Xinyu Zhou et.al.	2405.16287	link
2024-05-25	Generation of synthetic data using breast cancer dataset and classification with resnet18	Dilsat Berin Aytar et.al.	2405.16286	null
2024-05-25	Transfer learning in predicting quantum many-body dynamics: from physical observables to entanglement entropy	Philipp Schmidt et.al.	2405.16254	null
2024-05-25	A statistical framework for weak-to-strong generalization	Seamus Somerstep et.al.	2405.16236	null
2024-05-24	Disease-informed Adaptation of Vision-Language Models	Jiajin Zhang et.al.	2405.15728	link
2024-05-28	The Impact of Geometric Complexity on Neural Collapse in Transfer Learning	Michael Munn et.al.	2405.15706	null
2024-05-24	Transfer Learning with Informative Priors: Simple Baselines Better than Previously Reported	Ethan Harvey et.al.	2405.15583	link
2024-05-24	Unsteady aerodynamic prediction using limited samples based on transfer learning	Wen Ji et.al.	2405.15470	null
2024-05-24	Environment Sensing-aided Beam Prediction with Transfer Learning for Smart Factory	Yuan Feng et.al.	2405.15339	null
2024-05-24	Detection and Positive Reconstruction of Cognitive Distortion sentences: Mandarin Dataset and Evaluation	Shuya Lin et.al.	2405.15334	link
2024-05-23	Deep learning lattice gauge theories	Anuj Apte et.al.	2405.14830	null
2024-05-23	Implicit In-context Learning	Zhuowei Li et.al.	2405.14660	link
2024-05-23	SolNet: Open-source deep learning models for photovoltaic power forecasting across the globe	Joris Depoortere et.al.	2405.14472	null
2024-05-23	Combining Denoising Autoencoders with Contrastive Learning to fine-tune Transformer Models	Alejo Lopez-Avila et.al.	2405.14437	link
2024-05-22	Just rotate it! Uncertainty estimation in closed-source models via multiple queries	Konstantinos Pitas et.al.	2405.13864	null
2024-05-22	Multi-Dataset Multi-Task Learning for COVID-19 Prognosis	Filippo Ruffini et.al.	2405.13771	null
2024-05-22	Transfer of Safety Controllers Through Learning Deep Inverse Dynamics Model	Alireza Nadali et.al.	2405.13735	null
2024-05-22	Identifying type II quasars at intermediate redshift with few-shot learning photometric classification	P. A. C. Cunha et.al.	2405.13650	link
2024-05-22	Dynamically enhanced static handwriting representation for Parkinson’s disease detection	Moises Diaz et.al.	2405.13438	null
2024-05-22	Boosted Neural Decoders: Achieving Extreme Reliability of LDPC Codes for 6G Networks	Hee-Youl Kwak et.al.	2405.13413	link
2024-05-22	Accelerated Evaluation of Ollivier-Ricci Curvature Lower Bounds: Bridging Theory and Computation	Wonwoo Kang et.al.	2405.13302	null
2024-05-22	Traffic control using intelligent timing of traffic lights with reinforcement learning technique and real-time processing of surveillance camera images	Mahdi Jamebozorg et.al.	2405.13256	null
2024-05-21	Transfer Learning Approach for Railway Technical Map (RTM) Component Identification	Obadage Rochana Rumalshan et.al.	2405.13229	null
2024-05-21	Accelerating Resonance Searches via Signature-Oriented Pre-training	Congqiao Li et.al.	2405.12972	null
2024-05-21	Prompt-Enhanced Spatio-Temporal Graph Transfer Learning	Junfeng Hu et.al.	2405.12452	link
2024-05-15	Fully Distributed Fog Load Balancing with Multi-Agent Reinforcement Learning	Maad Ebrahim et.al.	2405.12236	null
2024-05-20	Modeling citation worthiness by using attention-based bidirectional long short-term memory networks and interpretable models	Tong Zeng et.al.	2405.12206	link
2024-05-20	Towards Graph Contrastive Learning: A Survey and Beyond	Wei Ju et.al.	2405.11868	null
2024-05-20	Transfer Learning for CSI-based Positioning with Multi-environment Meta-learning	Anastasios Foliadis et.al.	2405.11816	null
2024-05-20	Foundation Model for Chemical Process Modeling: Meta-Learning with Physics-Informed Adaptation	Zihao Wang et.al.	2405.11752	link
2024-05-19	Computer Vision in the Food Industry: Accurate, Real-time, and Automatic Food Recognition with Pretrained MobileNetV2	Shayan Rokhva et.al.	2405.11621	null
2024-05-19	Learning More Generalized Experts by Merging Experts in Mixture-of-Experts	Sejik Park et.al.	2405.11530	null
2024-05-17	Probabilistic transfer learning methodology to expedite high fidelity simulation of reactive flows	Bruno S. Soriano et.al.	2405.10944	null
2024-05-17	Multicenter Privacy-Preserving Model Training for Deep Learning Brain Metastases Autosegmentation	Yixing Huang et.al.	2405.10870	link
2024-05-17	DeepPavlov at SemEval-2024 Task 8: Leveraging Transfer Learning for Detecting Boundaries of Machine-Generated Texts	Anastasia Voznyuk et.al.	2405.10629	link
2024-05-17	Dynamic data sampler for cross-language transfer learning in large language models	Yudong Li et.al.	2405.10626	link
2024-05-16	Continuous Transfer Learning for UAV Communication-aware Trajectory Design	Chenrui Sun et.al.	2405.10087	null
2024-05-16	Monaural speech enhancement on drone via Adapter based transfer learning	Xingyu Chen et.al.	2405.10022	null
2024-05-16	A Unified Deep Transfer Learning Model for Accurate IoT Localization in Diverse Environments	Abdullahi Isa Ahmed et.al.	2405.09960	null
2024-05-16	Confidence Estimation in Unsupervised Deep Change Vector Analysis	Sudipan Saha et.al.	2405.09896	null
2024-05-15	SA-FedLora: Adaptive Parameter Allocation for Efficient Federated Learning with LoRA Tuning	Yuning Yang et.al.	2405.09394	null
2024-05-15	Transfer Learning in Pre-Trained Large Language Models for Malware Detection Based on System Calls	Pedro Miguel Sánchez Sánchez et.al.	2405.09318	null
2024-05-15	Deep Learning in Earthquake Engineering: A Comprehensive Review	Yazhou Xie et.al.	2405.09021	null
2024-05-15	Feature-based Federated Transfer Learning: Communication Efficiency, Robustness and Privacy	Feng Wang et.al.	2405.09014	link
2024-05-16	Neural Collapse Meets Differential Privacy: Curious Behaviors of NoisyGD with Near-perfect Representation Learning	Chendi Wang et.al.	2405.08920	null
2024-05-14	FLEXIBLE: Forecasting Cellular Traffic by Leveraging Explicit Inductive Graph-Based Learning	Duc Thinh Ngo et.al.	2405.08843	null
2024-05-14	Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs	P. Mas-Buitrago et.al.	2405.08703	link
2024-05-13	Modeling of Time-varying Wireless Communication Channel with Fading and Shadowing	Lee Youngmin et.al.	2405.08199	link
2024-05-13	Enhancing Clinically Significant Prostate Cancer Prediction in T2-weighted Images through Transfer Learning from Breast Cancer	Chi-en Amy Tai et.al.	2405.07869	null
2024-05-13	Automatic Recognition of Food Ingestion Environment from the AIM-2 Wearable Sensor	Yuning Huang et.al.	2405.07827	null
2024-05-11	Fractals as Pre-training Datasets for Anomaly Detection and Localization	C. I. Ugwu et.al.	2405.06980	null
2024-05-13	MRSegmentator: Robust Multi-Modality Segmentation of 40 Classes in MRI and CT Sequences	Hartmut Häntze et.al.	2405.06463	link
2024-05-10	DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding	Ting Liu et.al.	2405.06217	link
2024-05-09	Scalable Learning of Segment-Level Traffic Congestion Functions	Shushman Choudhury et.al.	2405.06080	null
2024-05-09	Robust and Explainable Fine-Grained Visual Classification with Transfer Learning: A Dual-Carriageway Framework	Zheming Zuo et.al.	2405.05853	null
2024-05-17	Identification of problematic epochs in astronomical time series through transfer learning	Stefano Cavuoti et.al.	2405.05591	link
2024-05-09	Model Inversion Robustness: Can Transfer Learning Help?	Sy-Tuyen Ho et.al.	2405.05588	null
2024-05-08	Large Language Model Enhanced Machine Learning Estimators for Classification	Yuhang Wu et.al.	2405.05445	link
2024-05-08	Deep Learning Method to Predict Wound Healing Progress Based on Collagen Fibers in Wound Tissue	Juan He et.al.	2405.05297	null
2024-05-08	Deep learning-based variational autoencoder for classification of quantum and classical states of light	Mahesh Bhupati et.al.	2405.05243	null
2024-05-08	Encoder-Decoder Framework for Interactive Free Verses with Generation with Controllable High-Quality Rhyming	Tommaso Pasini et.al.	2405.05176	null
2024-05-08	Exploring Vision Transformers for 3D Human Motion-Language Models with Motion Patches	Qing Yu et.al.	2405.04771	null
2024-05-09	Large Language Models for Cyber Security: A Systematic Literature Review	HanXiang Xu et.al.	2405.04760	link
2024-05-07	SingIt! Singer Voice Transformation	Amit Eliav et.al.	2405.04627	null
2024-05-07	Neural network based approach for solving problems in plane wave duct acoustics	D. Veerababu et.al.	2405.04603	null
2024-05-07	Cross-Platform Autonomous Control of Minimal Kitaev Chains	David van Driel et.al.	2405.04596	null
2024-05-07	Enriched BERT Embeddings for Scholarly Publication Classification	Benjamin Wolff et.al.	2405.04136	link
2024-05-07	A Stealthy Wrongdoer: Feature-Oriented Reconstruction Attack against Split Learning	Xiaoyang Xu et.al.	2405.04115	link
2024-05-07	Predicting Lung Disease Severity via Image-Based AQI Analysis using Deep Learning Techniques	Anvita Mahajan et.al.	2405.03981	null
2024-05-05	Spatial Transfer Learning with Simple MLP	Hongjian Yang et.al.	2405.03720	null
2024-05-06	Mind the Gap Between Synthetic and Real: Utilizing Transfer Learning to Probe the Boundaries of Stable Diffusion Generated Data	Leonhard Hennicke et.al.	2405.03243	null
2024-05-04	Stable Diffusion Dataset Generation for Downstream Classification Tasks	Eugenio Lomurno et.al.	2405.02698	null
2024-05-04	Few-Shot Fruit Segmentation via Transfer Learning	Jordan A. James et.al.	2405.02556	link
2024-05-04	CNN-LSTM and Transfer Learning Models for Malware Classification based on Opcodes and API Calls	Ahmed Bensaoud et.al.	2405.02548	null
2024-05-03	Spatio-Temporal SwinMAE: A Swin Transformer based Multiscale Representation Learner for Temporal Satellite Imagery	Yohei Nakayama et.al.	2405.02512	null
2024-05-03	Deep Learning and Transfer Learning Architectures for English Premier League Player Performance Forecasting	Daniel Frees et.al.	2405.02412	link
2024-05-03	GMP-ATL: Gender-augmented Multi-scale Pseudo-label Enhanced Adaptive Transfer Learning for Speech Emotion Recognition via HuBERT	Yu Pan et.al.	2405.02151	null
2024-05-03	Creation of Novel Soft Robot Designs using Generative AI	Wee Kiat Chan et.al.	2405.01824	null
2024-05-02	Diabetic Retinopathy Detection Using Quantum Transfer Learning	Ankush Jain et.al.	2405.01734	null
2024-05-02	Individual Fairness Through Reweighting and Tuning	Abdoul Jalil Djiberou Mahamadou et.al.	2405.01711	null
2024-05-01	KITE: A Kernel-based Improved Transferability Estimation Method	Yunhui Guo et.al.	2405.01603	null
2024-05-02	CromSS: Cross-modal pre-training with noisy labels for remote sensing image segmentation	Chenying Liu et.al.	2405.01217	null
2024-05-01	Transformer-Based Self-Supervised Learning for Histopathological Classification of Ischemic Stroke Clot Origin	K. Yeh et.al.	2405.00908	null
2024-05-01	Koopman-based Deep Learning for Nonlinear System Estimation	Zexin Sun et.al.	2405.00627	null
2024-05-01	Self-supervised Pre-training of Text Recognizers	Martin Kišš et.al.	2405.00420	link
2024-05-01	Employing Federated Learning for Training Autonomous HVAC Systems	Fredrik Hagström et.al.	2405.00389	null
2024-04-30	Expanding the Horizon: Enabling Hybrid Quantum Transfer Learning for Long-Tailed Chest X-Ray Classification	Skylar Chan et.al.	2405.00156	link
2024-04-30	ThangDLU at #SMM4H 2024: Encoder-decoder models for classifying text data on social disorders in children and adolescents	Hoang-Thang Ta et.al.	2404.19714	null
2024-04-30	Let’s Focus: Focused Backdoor Attack against Federated Transfer Learning	Marco Arazzi et.al.	2404.19420	null
2024-04-29	What Drives Performance in Multilingual Language Models?	Sina Bagheri Nezhad et.al.	2404.19159	link
2024-04-27	Remote Sensing Image Enhancement through Spatiotemporal Filtering	Hessah Albanwan et.al.	2404.18950	null
2024-04-29	Adaptive Reinforcement Learning for Robot Control	Yu Tang Liu et.al.	2404.18713	link
2024-04-29	Generation of Uncorrelated Residual Variables for Chemical Process Fault Diagnosis via Transfer Learning-based Input-Output Decoupled Network	Zhuofu Pan et.al.	2404.18528	null
2024-05-02	Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment	Tengjun Huang et.al.	2404.18253	link
2024-04-28	EkoHate: Abusive Language and Hate Speech Detection for Code-switched Political Discussions on Nigerian Twitter	Comfort Eseohen Ilevbare et.al.	2404.18180	link
2024-04-27	Transfer Learning Enhanced Single-choice Decision for Multi-choice Question Answering	Chenhao Cui et.al.	2404.17949	null
2024-04-26	Causally Abstracted Multi-armed Bandits	Fabio Massimo Zennaro et.al.	2404.17493	link
2024-04-26	FTL: Transfer Learning Nonlinear Plasma Dynamic Transitions in Low Dimensional Embeddings via Deep Neural Networks	Zhe Bai et.al.	2404.17466	link
2024-04-26	Comparison of self-supervised in-domain and supervised out-domain transfer learning for bird species recognition	Houtan Ghaffari et.al.	2404.17252	null
2024-04-26	Self-supervised visual learning in the low-data regime: a comparative evaluation	Sotirios Konstantakos et.al.	2404.17202	null
2024-04-26	2M-NER: Contrastive Learning for Multilingual and Multimodal NER with Language and Modal Fusion	Dongsheng Wang et.al.	2404.17122	null
2024-04-26	Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection	Daisuke Niizumi et.al.	2404.17107	link
2024-04-29	On TinyML and Cybersecurity: Electric Vehicle Charging Infrastructure Use Case	Fatemeh Dehrouyeh et.al.	2404.16894	link
2024-04-25	Meta-Transfer Derm-Diagnosis: Exploring Few-Shot Learning and Transfer Learning for Skin Disease Classification in Long-Tail Distribution	Zeynep Özdemir et.al.	2404.16814	null
2024-04-25	Probabilistic Multi-Layer Perceptrons for Wind Farm Condition Monitoring	Filippo Fiocchi et.al.	2404.16496	null
2024-04-25	Leveraging tropical reef, bird and unrelated sounds for superior transfer learning in marine bioacoustics	Ben Williams et.al.	2404.16436	link
2024-04-25	Asking and Answering Questions to Extract Event-Argument Structures	Md Nayem Uddin et.al.	2404.16413	link
2024-04-24	Employing Two-Dimensional Word Embedding for Difficult Tabular Data Stream Classification	Paweł Zyblewski et.al.	2404.15836	link
2024-04-24	Where to Mask: Structure-Guided Masking for Graph Masked Autoencoders	Chuang Liu et.al.	2404.15806	link
2024-04-24	No Train but Gain: Language Arithmetic for training-free Language Adapters enhancement	Mateusz Klimaszewski et.al.	2404.15737	link
2024-04-24	MDDD: Manifold-based Domain Adaptation with Dynamic Distribution for Non-Deep Transfer Learning in Cross-subject and Cross-session EEG-based Emotion Recognition	Ting Luo et.al.	2404.15615	null
2024-04-19	KATO: Knowledge Alignment and Transfer for Transistor Sizing of Different Design and Technology	Wei W. Xing et.al.	2404.14433	null
2024-04-22	Machine Learning Techniques for MRI Data Processing at Expanding Scale	Taro Langner et.al.	2404.14326	null
2024-04-22	Automated Long Answer Grading with RiceChem Dataset	Shashank Sonkar et.al.	2404.14316	link
2024-04-26	ArtNeRF: A Stylized Neural Field for 3D-Aware Cartoonized Face Synthesis	Zichen Tang et.al.	2404.13711	link
2024-04-20	MultiConfederated Learning: Inclusive Non-IID Data handling with Decentralized Federated Learning	Michael Duchesne et.al.	2404.13421	null
2024-04-20	Transfer Learning for Molecular Property Predictions from Small Data Sets	Thorren Kirschbaum et.al.	2404.13393	link
2024-04-20	Federated Transfer Learning with Task Personalization for Condition Monitoring in Ultrasonic Metal Welding	Ahmadreza Eslaminia et.al.	2404.13278	null
2024-04-19	Explainable AI for Fair Sepsis Mortality Predictive Model	Chia-Hsuan Chang et.al.	2404.13139	null
2024-04-19	Cross-Modal Adapter: Parameter-Efficient Transfer Learning Approach for Vision-Language Models	Juncheng Yang et.al.	2404.12588	null
2024-04-18	Understanding Optimal Feature Transfer via a Fine-Grained Bias-Variance Analysis	Yufan Li et.al.	2404.12481	null
2024-04-18	sEMG-based Fine-grained Gesture Recognition via Improved LightGBM Model	Xiupeng Qiao et.al.	2404.11861	null
2024-04-17	GenFighter: A Generative and Evolutive Textual Attack Removal	Md Athikul Islam et.al.	2404.11538	null
2024-04-17	Explainable Lung Disease Classification from Chest X-Ray Images Utilizing Deep Learning and XAI	Tanzina Taher Ifty et.al.	2404.11428	null
2024-04-19	Feature Corrective Transfer Learning: End-to-End Solutions to Object Detection in Non-Ideal Visual Conditions	Chuheng Wei et.al.	2404.11214	null
2024-04-18	Supervised Contrastive Vision Transformer for Breast Histopathological Image Classification	Mohammad Shiri et.al.	2404.11052	null
2024-04-17	Control Theoretic Approach to Fine-Tuning and Transfer Learning	Erkan Bayram et.al.	2404.11013	null
2024-04-16	Tao: Re-Thinking DL-based Microarchitecture Simulation	Santosh Pandey et.al.	2404.10921	null
2024-04-21	Lighter, Better, Faster Multi-Source Domain Adaptation with Gaussian Mixture Models and Optimal Transport	Eduardo Fernandes Montesuma et.al.	2404.10261	link
2024-04-16	Privacy-Preserving Training-as-a-Service for On-Device Intelligence: Concept, Architectural Scheme, and Open Problems	Zhiyuan Wu et.al.	2404.10255	null
2024-04-15	High-Resolution Detection of Earth Structural Heterogeneities from Seismic Amplitudes using Convolutional Neural Networks with Attention layers	Luiz Schirmer et.al.	2404.10170	null
2024-04-15	Self-Supervised Learning Featuring Small-Scale Image Dataset for Treatable Retinal Diseases Classification	Luffina C. Huang et.al.	2404.10166	null
2024-04-15	Multiple-Input Fourier Neural Operator (MIFNO) for source-dependent 3D elastodynamics	Fanny Lehmann et.al.	2404.10115	link
2024-04-15	Conditional Prototype Rectification Prompt Learning	Haoxing Chen et.al.	2404.09872	link
2024-04-15	The Physalis system: Discovery of ORC-like radio shells around a massive pair of interacting early-type galaxies with offset X-ray emission	Bärbel S. Koribalski et.al.	2404.09522	null
2024-04-14	Low-Resource Named Entity Recognition with Cross-Lingual, Character-Level Neural Conditional Random Fields	Ryan Cotterell et.al.	2404.09383	null
2024-04-14	Breast Cancer Image Classification Method Based on Deep Transfer Learning	Weimin Wang et.al.	2404.09226	null
2024-04-14	Intelligent Chemical Purification Technique Based on Machine Learning	Wenchao Wu et.al.	2404.09114	null
2024-04-13	HEAT: Head-level Parameter Efficient Adaptation of Vision Transformers with Taylor-expansion Importance Scores	Yibo Zhong et.al.	2404.08894	null
2024-04-16	E3: Ensemble of Expert Embedders for Adapting Synthetic Image Detectors to New Generators Using Limited Data	Aref Azizpour et.al.	2404.08814	link
2024-04-12	Using Explainable AI and Transfer Learning to understand and predict the maintenance of Atlantic blocking with limited observational data	Huan Zhang et.al.	2404.08613	link
2024-04-12	Advanced wood species identification based on multiple anatomical sections and using deep feature transfer and fusion	Kallil M. Zielinski et.al.	2404.08585	null
2024-04-12	Convolutional neural network classification of cancer cytopathology images: taking breast cancer as an example	MingXuan Xiao et.al.	2404.08279	null
2024-04-12	Transfer Learning Study of Motion Transformer-based Trajectory Predictions	Lars Ullrich et.al.	2404.08271	null
2024-04-12	Investigating Neural Machine Translation for Low-Resource Languages: Using Bavarian as a Case Study	Wan-Hua Her et.al.	2404.08259	link
2024-04-11	Predictive Handover Strategy in 6G and Beyond: A Deep and Transfer Learning Approach	Ioannis Panitsas et.al.	2404.08113	null
2024-04-11	MSciNLI: A Diverse Benchmark for Scientific Natural Language Inference	Mobashir Sadat et.al.	2404.08066	link
2024-04-11	OpenTrench3D: A Photogrammetric 3D Point Cloud Dataset for Semantic Segmentation of Underground Utilities	Lasse H. Hansen et.al.	2404.07711	link
2024-04-11	Depth Estimation using Weighted-loss and Transfer Learning	Muhammad Adeel Hafeez et.al.	2404.07686	null
2024-04-11	PINNACLE: PINN Adaptive ColLocation and Experimental points selection	Gregory Kang Ruey Lau et.al.	2404.07662	link
2024-04-11	GLID: Pre-training a Generalist Encoder-Decoder Vision Model	Jihao Liu et.al.	2404.07603	null
2024-04-10	Transfer Learning via Latent Dependency Factor for Estimating PM 2.5	Shrey Gupta et.al.	2404.07308	link
2024-04-10	XNLIeu: a dataset for cross-lingual NLI in Basque	Maite Heredia et.al.	2404.06996	link
2024-04-10	The ‘Sandwich’ meta-framework for architecture agnostic deep privacy-preserving transfer learning for non-invasive brainwave decoding	Xiaoxi Wei et.al.	2404.06868	null
2024-04-10	Adapting LLaMA Decoder to Vision Transformer	Jiahao Wang et.al.	2404.06773	link
2024-04-09	Event Extraction in Basque: Typologically motivated Cross-Lingual Transfer-Learning Analysis	Mikel Zubillaga et.al.	2404.06392	null
2024-04-09	The impact of data set similarity and diversity on transfer learning success in time series forecasting	Claudia Ehrig et.al.	2404.06198	null
2024-04-10	Using Few-Shot Learning to Classify Primary Lung Cancer and Other Malignancy with Lung Metastasis in Cytological Imaging via Endobronchial Ultrasound Procedures	Ching-Kai Lin et.al.	2404.06080	null
2024-04-08	BatSort: Enhanced Battery Classification with Transfer Learning for Battery Sorting and Recycling	Yunyi Zhao et.al.	2404.05802	link
2024-04-08	MULTIFLOW: Shifting Towards Task-Agnostic Vision-Language Pruning	Matteo Farina et.al.	2404.05621	link
2024-04-07	DinoBloom: A Foundation Model for Generalizable Cell Embeddings in Hematology	Valentin Koch et.al.	2404.05022	link
2024-04-06	Latent-based Diffusion Model for Long-tailed Recognition	Pengxiao Han et.al.	2404.04517	link
2024-04-05	Open vocabulary keyword spotting through transfer learning from speech synthesis	Kesavaraj V et.al.	2404.03914	null
2024-04-05	VoltaVision: A Transfer Learning model for electronic component classification	Anas Mohammad Ishfaqul Muktadir Osmani et.al.	2404.03898	link
2024-04-09	Enhancing Breast Cancer Diagnosis in Mammography: Evaluation and Integration of Convolutional Neural Networks and Explainable AI	Maryam Ahmed et.al.	2404.03892	null
2024-04-04	Free Energy Calculations using Smooth Basin Classification	Sander Vandenhaute et.al.	2404.03777	null
2024-04-04	How does Multi-Task Training Affect Transformer In-Context Capabilities? Investigations with Function Classes	Harmon Bhasin et.al.	2404.03558	link
2024-04-03	Transfer learning applications for anomaly detection in wind turbines	Cyriana M. A. Roelofs et.al.	2404.03011	null
2024-04-03	Fast Diffusion Model For Seismic Data Noise Attenuation	Junheng Peng et.al.	2404.02767	null
2024-04-03	Cross-Architecture Transfer Learning for Linear-Cost Inference Transformers	Sehyun Choi et.al.	2404.02684	null
2024-04-03	What Are We Measuring When We Evaluate Large Vision-Language Models? An Analysis of Latent Factors and Biases	Anthony Meng Huat Tiong et.al.	2404.02415	link
2024-04-02	Is Exploration All You Need? Effective Exploration Characteristics for Transfer in Reinforcement Learning	Jonathan C. Balloch et.al.	2404.02235	null
2024-04-03	ResNet with Integrated Convolutional Block Attention Module for Ship Classification Using Transfer Learning on Optical Satellite Imagery	Ryan Donghan Kwon et.al.	2404.02135	null
2024-04-02	ImageNot: A contrast with ImageNet preserves model rankings	Olawale Salaudeen et.al.	2404.02112	link
2024-04-02	Active Exploration in Bayesian Model-based Reinforcement Learning for Robot Manipulation	Carlos Plou et.al.	2404.01867	null
2024-04-02	Transfer Learning from Whisper for Microscopic Intelligibility Prediction	Paul Best et.al.	2404.01737	null
2024-04-01	NeRF-MAE: Masked AutoEncoders for Self Supervised 3D representation Learning for Neural Radiance Fields	Muhammad Zubair Irshad et.al.	2404.01300	link
2024-04-01	LoSA: Long-Short-range Adapter for Scaling End-to-End Temporal Action Localization	Akshita Gupta et.al.	2404.01282	null
2024-04-01	Diagnosis of Skin Cancer Using VGG16 and VGG19 Based Transfer Learning Models	Amir Faghihi et.al.	2404.01160	null
2024-04-01	TransFusion: Covariate-Shift Robust Transfer Learning for High-Dimensional Regression	Zelin He et.al.	2404.01153	null
2024-04-01	Machine Learning Robustness: A Primer	Houssem Ben Braiek et.al.	2404.00897	null
2024-04-01	Bailong: Bilingual Transfer Learning based on QLoRA and Zip-tie Embedding	Lung-Chuan Chen et.al.	2404.00862	null
2024-04-01	Transfer Learning with Point Transformers	Kartik Gupta et.al.	2404.00846	null
2024-03-31	$R^2$-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding	Ye Liu et.al.	2404.00801	link
2024-03-31	Minimum-Norm Interpolation Under Covariate Shift	Neil Mallinar et.al.	2404.00522	null
2024-03-31	Transfer Learning with Reconstruction Loss	Wei Cui et.al.	2404.00505	link
2024-03-30	Noise-Aware Training of Layout-Aware Language Models	Ritesh Sarkhel et.al.	2404.00488	null
2024-03-30	From attention to profit: quantitative trading strategy based on transformer	Zhaofeng Zhang et.al.	2404.00424	link
2024-03-28	Is Synthetic Image Useful for Transfer Learning? An Investigation into Data Generation, Volume, and Utilization	Yuhang Li et.al.	2403.19866	null
2024-03-28	A Tulu Resource for Machine Translation	Manu Narayanan et.al.	2403.19142	link
2024-04-01	Quantum to Classical Neural Network Transfer Learning Applied to Drug Toxicity Prediction	Anthony M. Smaldone et.al.	2403.18997	link
2024-03-27	Direct mineral content prediction from drill core images via transfer learning	Romana Boiger et.al.	2403.18495	null
2024-03-27	Deep Learning Segmentation and Classification of Red Blood Cells Using a Large Multi-Scanner Dataset	Mohamed Elmanna et.al.	2403.18468	null
2024-03-26	Spectral Convolutional Transformer: Harmonizing Real vs. Complex Multi-View Spectral Operators for Vision Transformer	Badri N. Patro et.al.	2403.18063	link
2024-03-26	The Need for Speed: Pruning Transformers with One Recipe	Samir Khaki et.al.	2403.17921	link
2024-03-26	Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos	Akshay Paruchuri et.al.	2403.17915	null
2024-03-26	To Supervise or Not to Supervise: Understanding and Addressing the Key Challenges of 3D Transfer Learning	Souhail Hadgi et.al.	2403.17869	null
2024-03-26	A Bayesian shrinkage estimator for transfer learning	Mohamed A. Abba et.al.	2403.17321	null
2024-03-25	A Hybrid Approach To Aspect Based Sentiment Analysis Using Transfer Learning	Gaurav Negi et.al.	2403.17254	null
2024-03-25	Engagement Measurement Based on Facial Landmarks and Spatial-Temporal Graph Convolutional Networks	Ali Abedi et.al.	2403.17175	null
2024-03-29	Task2Box: Box Embeddings for Modeling Asymmetric Task Relationships	Rangel Daroya et.al.	2403.17173	link
2024-03-25	Can Machine Translation Bridge Multilingual Pretraining and Cross-lingual Transfer Learning?	Shaoxiong Ji et.al.	2403.16777	null
2024-03-25	Grammatical vs Spelling Error Correction: An Investigation into the Responsiveness of Transformer-based Language Models using BART and MarianMT	Rohit Raju et.al.	2403.16655	null
2024-03-25	Enhancing Industrial Transfer Learning with Style Filter: Cost Reduction and Defect-Focus	Chen Li et.al.	2403.16607	null
2024-03-25	Exploit High-Dimensional RIS Information to Localization: What Is the Impact of Faulty Element?	Tuo Wu et.al.	2403.16529	null
2024-03-25	Employing High-Dimensional RIS Information for RIS-aided Localization Systems	Tuo Wu et.al.	2403.16521	null
2024-03-25	Self-Supervised Learning for Medical Image Data with Anatomy-Oriented Imaging Planes	Tianwei Zhang et.al.	2403.16499	null
2024-03-25	Data-Driven Extrusion Force Control Tuning for 3D Printing	Xavier Guidetti et.al.	2403.16470	null
2024-03-23	A Deep Learning Architectures for Kidney Disease Classification	Muhammad Shoaib Farooq et.al.	2403.15895	null
2024-03-23	VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding	Phong Nguyen-Thuan Do et.al.	2403.15882	null
2024-03-22	SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series	Badri N. Patro et.al.	2403.15360	link
2024-03-22	Not All Attention is Needed: Parameter and Computation Efficient Transfer Learning for Multi-modal Large Language Models	Qiong Wu et.al.	2403.15226	link
2024-03-22	Vehicle Detection Performance in Nordic Region	Hamam Mokayed et.al.	2403.15017	null
2024-03-21	A Transfer Learning Causal Approach to Evaluate Racial/Ethnic and Geographic Variation in Outcomes Following Congenital Heart Surgery	Larry Han et.al.	2403.14573	null
2024-03-21	Transfer Learning for Cross-dataset Isolated Sign Language Recognition in Under-Resourced Datasets	Ahmet Alp Kindiroglu et.al.	2403.14534	link
2024-03-21	Exploring Task Unification in Graph Representation Learning via Generative Approach	Yulan Hu et.al.	2403.14340	null
2024-03-21	Stitching for Neuroevolution: Recombining Deep Neural Networks without Breaking Them	Arthur Guijt et.al.	2403.14224	null
2024-03-21	HETAL: Efficient Privacy-preserving Transfer Learning with Homomorphic Encryption	Seewoo Lee et.al.	2403.14111	link
2024-03-20	Bayesian Physics-informed Neural Networks for System Identification of Inverter-dominated Power Systems	Simon Stock et.al.	2403.13602	null
2024-03-20	AdaTrans: Feature-wise and Sample-wise Adaptive Transfer Learning for High-dimensional Regression	Zelin He et.al.	2403.13565	null
2024-03-20	Have You Poisoned My Data? Defending Neural Networks against Data Poisoning	Fabio De Gaspari et.al.	2403.13523	null
2024-03-20	FissionFusion: Fast Geometric Generation and Hierarchical Souping for Medical Image Analysis	Santosh Sanjeev et.al.	2403.13341	link
2024-03-21	Arcee’s MergeKit: A Toolkit for Merging Large Language Models	Charles Goddard et.al.	2403.13257	link
2024-03-19	Wildfire danger prediction optimization with transfer learning	Spiros Maggioros et.al.	2403.12871	link
2024-03-19	TransformMix: Learning Transformation and Mixing Strategies from Data	Tsz-Him Cheung et.al.	2403.12429	null
2024-03-19	Improving Generalizability of Extracting Social Determinants of Health Using Large Language Models through Prompt-tuning	Cheng Peng et.al.	2403.12374	null
2024-03-18	Transfer Learning for T-Cell Response Prediction	Josua Stadelmaier et.al.	2403.12117	link
2024-03-18	Sub-photon accuracy noise reduction of single shot coherent diffraction pattern with atomic model trained autoencoder	Takuto Ishikawa et.al.	2403.11992	null
2024-03-18	Transfer Learning Beyond Bounded Density Ratios	Alkis Kalavasis et.al.	2403.11963	null
2024-03-18	SuperLoRA: Parameter-Efficient Unified Adaptation of Multi-Layer Attention Modules	Xiangyu Chen et.al.	2403.11887	null
2024-03-18	S-JEPA: towards seamless cross-dataset transfer through dynamic spatial attention	Pierre Guetschel et.al.	2403.11772	null
2024-03-18	Revisiting Tensor Basis Neural Networks for Reynolds stress modeling: application to plane channel and square duct flows	Jiayi Cai et.al.	2403.11746	null
2024-03-18	MedMerge: Merging Models for Effective Transfer Learning to Medical Imaging Tasks	Ibrahim Almakky et.al.	2403.11646	null
2024-03-18	Augment Before Copy-Paste: Data and Memory Efficiency-Oriented Instance Segmentation Framework for Sport-scenes	Chih-Chung Hsu et.al.	2403.11572	null
2024-03-17	Federated Transfer Learning with Differential Privacy	Mengchu Li et.al.	2403.11343	null
2024-03-16	Automatic location detection based on deep learning	Anjali Karangiya et.al.	2403.10912	link
2024-03-15	On the low-shot transferability of [V]-Mamba	Diganta Misra et.al.	2403.10696	null
2024-03-15	Latent Object Characteristics Recognition with Visual to Haptic-Audio Cross-modal Transfer Learning	Namiko Saito et.al.	2403.10689	null
2024-03-14	Achieving Pareto Optimality using Efficient Parameter Reduction for DNNs in Resource-Constrained Edge Environment	Atah Nuh Mih et.al.	2403.10569	null
2024-03-15	FeatUp: A Model-Agnostic Framework for Features at Any Resolution	Stephanie Fu et.al.	2403.10516	link
2024-03-15	TransLandSeg: A Transfer Learning Approach for Landslide Semantic Segmentation Based on Vision Foundation Model	Changhong Hou et.al.	2403.10127	null
2024-03-14	The galaxy group merger origin of the Cloverleaf odd radio circle system	E. Bulbul et.al.	2403.09808	null
2024-03-14	GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding	Chengyao Wang et.al.	2403.09639	link
2024-03-14	The Neural-SRP method for positional sound source localization	Eric Grinstein et.al.	2403.09455	link
2024-03-13	A Physics-driven GraphSAGE Method for Physical Process Simulations Described by Partial Differential Equations	Hang Hu et.al.	2403.08569	null
2024-03-13	HOLMES: HOLonym-MEronym based Semantic inspection for Convolutional Image Classifiers	Francesco Dibitonto et.al.	2403.08536	link
2024-03-13	Unleashing the Power of Meta-tuning for Few-shot Generalization Through Sparse Interpolated Experts	Shengzhuang Chen et.al.	2403.08477	link
2024-03-12	Authorship Style Transfer with Policy Optimization	Shuai Liu et.al.	2403.08043	link
2024-03-12	Conditional computation in neural networks: principles and research trends	Simone Scardapane et.al.	2403.07965	null
2024-03-12	Physics-Transfer Learning for Material Strength Screening	Yingjie Zhao et.al.	2403.07526	null
2024-03-12	DALSA: Domain Adaptation for Supervised Learning From Sparsely Annotated MR Images	Michael Götz et.al.	2403.07434	null
2024-03-12	Knowledge Transfer across Multiple Principal Component Analysis Studies	Zeyu Li et.al.	2403.07431	null
2024-03-12	Enhancing Transfer Learning with Flexible Nonparametric Posterior Sampling	Hyungi Lee et.al.	2403.07282	null
2024-03-11	Exploring Large Language Models and Hierarchical Frameworks for Classification of Large Unstructured Legal Documents	Nishchal Prasad et.al.	2403.06872	link
2024-03-11	LeOCLR: Leveraging Original Images for Contrastive Learning of Visual Representations	Mohammad Alkhalefi et.al.	2403.06813	null
2024-03-11	Forest Inspection Dataset for Aerial Semantic Segmentation and Depth Estimation	Bianca-Cerasela-Zelia Blaga et.al.	2403.06621	null
2024-03-11	Cross-domain and Cross-dimension Learning for Image-to-Graph Transformers	Alexander H. Berger et.al.	2403.06601	link
2024-03-11	When Crypto Economics Meet Graph Analytics and Learning	Bingqiao Luo et.al.	2403.06454	null
2024-03-11	Can LLMs’ Tuning Methods Work in Medical Multimodal Domain?	Jiawei Chen et.al.	2403.06407	link
2024-03-11	A Segmentation Foundation Model for Diverse-type Tumors	Jianhao Xie et.al.	2403.06396	null
2024-03-11	Pre-Trained Model Recommendation for Downstream Fine-tuning	Jiameng Bai et.al.	2403.06382	null
2024-03-11	See Through Their Minds: Learning Transferable Neural Representation from Cross-Subject fMRI	Yulong Liu et.al.	2403.06361	link
2024-03-10	Active Learning for Rapid Targeted Synthesis of Compositionally Complex Alloys	Nathan Johnson et.al.	2403.06329	null
2024-03-10	Large Language Models on Fine-grained Emotion Detection Dataset with Data Augmentation and Transfer Learning	Kaipeng Wang et.al.	2403.06108	null
2024-03-10	Towards In-Vehicle Multi-Task Facial Attribute Recognition: Investigating Synthetic Data and Vision Foundation Models	Esmaeil Seraj et.al.	2403.06088	null
2024-03-09	Multimodal deep learning approach to predicting neurological recovery from coma after cardiac arrest	Felix H. Krones et.al.	2403.06027	null
2024-03-08	OmniJet-$α$: The first cross-task foundation model for particle physics	Joschka Birk et.al.	2403.05618	link
2024-03-08	Authorship Attribution in Bangla Literature (AABL) via Transfer Learning using ULMFiT	Aisha Khatun et.al.	2403.05519	null
2024-03-08	JointMotion: Joint Self-supervision for Joint Motion Prediction	Royden Wagner et.al.	2403.05489	link
2024-03-08	HistGen: Histopathology Report Generation via Local-Global Feature Encoding and Cross-modal Context Interaction	Zhengrui Guo et.al.	2403.05396	link
2024-03-08	Hybridized Convolutional Neural Networks and Long Short-Term Memory for Improved Alzheimer’s Disease Diagnosis from MRI Scans	Maleka Khatun et.al.	2403.05353	null
2024-03-07	Cell reprogramming design by transfer learning of functional transcriptional networks	Thomas P. Wytock et.al.	2403.04837	link
2024-03-07	AUFormer: Vision Transformers are Parameter-Efficient Facial Action Unit Detectors	Kaishen Yuan et.al.	2403.04697	link
2024-03-07	Source Matters: Source Dataset Impact on Model Robustness in Medical Imaging	Dovile Juodelyte et.al.	2403.04484	link
2024-03-07	DA-Net: A Disentangled and Adaptive Network for Multi-Source Cross-Lingual Transfer Learning	Ling Ge et.al.	2403.04158	null
2024-03-06	Self and Mixed Supervision to Improve Training Labels for Multi-Class Medical Image Segmentation	Jianfei Liu et.al.	2403.03882	null
2024-03-06	Neural Architecture Search using Particle Swarm and Ant Colony Optimization	Séamus Lankford et.al.	2403.03781	null
2024-03-06	On Transfer in Classification: How Well do Subsets of Classes Generalize?	Raphael Baena et.al.	2403.03569	null
2024-03-06	A comparative study of cosmological constraints from weak lensing using Convolutional Neural Networks	Divij Sharma et.al.	2403.03490	null
2024-03-06	Multi-modal Deep Learning	Chen Yuhua et.al.	2403.03385	null
2024-03-05	PalmProbNet: A Probabilistic Approach to Understanding Palm Distributions in Ecuadorian Tropical Forest via Transfer Learning	Kangning Cui et.al.	2403.03161	null
2024-03-05	Zero-Shot Cross-Lingual Document-Level Event Causality Identification with Heterogeneous Graph Contrastive Transfer Learning	Zhitao He et.al.	2403.02893	null
2024-03-05	Generative Software Engineering	Yuan Huang et.al.	2403.02583	null
2024-03-04	Encodings for Prediction-based Neural Architecture Search	Yash Akhauri et.al.	2403.02484	link
2024-03-04	On Latency Predictors for Neural Architecture Search	Yash Akhauri et.al.	2403.02446	link
2024-03-04	How does Architecture Influence the Base Capabilities of Pre-trained Language Models? A Case Study Based on FFN-Wider Transformer Models	Xin Lu et.al.	2403.02436	null
2024-03-04	On the impact of measure pre-conditionings on general parametric ML models and transfer learning via domain adaptation	Joaquín Sánchez García et.al.	2403.02432	null
2024-03-04	Distilled ChatGPT Topic & Sentiment Modeling with Applications in Finance	Olivier Gandouet et.al.	2403.02185	null
2024-03-04	Self-Supervised Facial Representation Learning with Facial Region Awareness	Zheng Gao et.al.	2403.02138	null
2024-03-04	Leveraging Weakly Annotated Data for Hate Speech Detection in Code-Mixed Hinglish: A Feasibility-Driven Transfer Learning Approach with Large Language Models	Sargam Yadav et.al.	2403.02121	null
2024-03-04	A New Perspective on Smiling and Laughter Detection: Intensity Levels Matter	Hugo Bohy et.al.	2403.02112	null
2024-03-03	Is in-domain data beneficial in transfer learning for landmarks detection in x-ray images?	Roberto Di Via et.al.	2403.01470	null
2024-03-03	Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis	Xin Zhou et.al.	2403.01439	link
2024-03-03	A Comprehensive Survey of Federated Transfer Learning: Challenges, Methods and Applications	Wei Guo et.al.	2403.01387	null
2024-03-02	Fast Low-parameter Video Activity Localization in Collaborative Learning Environments	Venkatesh Jatla et.al.	2403.01281	null
2024-03-02	Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey	Hamza Kheddar et.al.	2403.01255	null
2024-03-02	Balancing Exploration and Exploitation in LLM using Soft RLLF for Enhanced Negation Understanding	Ha-Thanh Nguyen et.al.	2403.01185	null
2024-03-02	Transfer Learning-Enhanced Instantaneous Multi-Person Indoor Localization by CSI	Zhiyuan He et.al.	2403.01153	null
2024-03-01	Transfer Learning for Security: Challenges and Future Directions	Adrian Shuai Li et.al.	2403.00935	null
2024-03-01	A Regularization-based Transfer Learning Method for Information Extraction via Instructed Graph Decoder	Kedi Chen et.al.	2403.00891	link
2024-03-01	Bias Mitigation in Fine-tuning Pre-trained Models for Enhanced Fairness and Efficiency	Yixuan Zhang et.al.	2403.00625	null
2024-03-01	Generalized User Representations for Transfer Learning	Ghazal Fazelnia et.al.	2403.00584	null
2024-03-01	Cross-Lingual Learning vs. Low-Resource Fine-Tuning: A Case Study with Fact-Checking in Turkish	Recep Firat Cekinel et.al.	2403.00411	link
2024-03-01	Efficient Adapter Tuning of Pre-trained Speech Models for Automatic Speaker Verification	Mufan Sang et.al.	2403.00293	null
2024-02-29	Analysis of the Two-Step Heterogeneous Transfer Learning for Laryngeal Blood Vessel Classification: Issue and Improvement	Xinyi Fang et.al.	2402.19001	null
2024-02-28	Deep Neural Network Models Trained With A Fixed Random Classifier Transfer Better Across Domains	Hafiz Tiomoko Ali et.al.	2402.18614	null
2024-02-28	TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding	Zhihao Zhang et.al.	2402.18490	null
2024-02-28	Universal neural network potentials as descriptors: Towards scalable chemical property prediction using quantum and classical computers	Tomoya Shiota et.al.	2402.18433	null
2024-02-28	Emotion Classification in Low and Moderate Resource Languages	Shabnam Tafreshi et.al.	2402.18424	null
2024-02-29	Investigation of Adapter for Automatic Speech Recognition in Noisy Environment	Hao Shi et.al.	2402.18275	null
2024-02-28	Challenges in Pre-Training Graph Neural Networks for Context-Based Fake News Detection: An Evaluation of Current Strategies and Resource Limitations	Gregor Donabauer et.al.	2402.18179	link
2024-02-28	Diffusion-based Neural Network Weights Generation	Bedionita Soro et.al.	2402.18153	link
2024-03-03	Automated Testing of Spatially-Dependent Environmental Hypotheses through Active Transfer Learning	Nicholas Harrison et.al.	2402.18064	null
2024-03-04	OpenMEDLab: An Open-source Platform for Multi-modality Foundation Models in Medicine	Xiaosong Wang et.al.	2402.18028	null
2024-02-27	Quantum Circuit Discovery for Fault-Tolerant Logical State Preparation with Reinforcement Learning	Remmy Zen et.al.	2402.17761	link
2024-02-27	MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation	Hanan Gani et.al.	2402.17725	link
2024-02-27	Transfer Learning Bayesian Optimization to Design Competitor DNA Molecules for Use in Diagnostic Assays	Ruby Sedgwick et.al.	2402.17704	link
2024-02-27	Intensive Care as One Big Sequence Modeling Problem	Vadim Liventsev et.al.	2402.17501	link
2024-02-26	CLAP: Learning Transferable Binary Code Representations with Natural Language Supervision	Hao Wang et.al.	2402.16928	link
2024-02-26	Enhancing Continuous Domain Adaptation with Multi-Path Transfer Curriculum	Hanbing Liu et.al.	2402.16681	null
2024-02-28	Few-Shot Learning for Annotation-Efficient Nucleus Instance Segmentation	Yu Ming et.al.	2402.16280	null
2024-02-25	StochCA: A Novel Approach for Exploiting Pretrained Models with Cross-Attention	Seungwon Seo et.al.	2402.16092	link
2024-02-25	Emotion Classification in Short English Texts using Deep Learning Techniques	Siddhanth Bhat et.al.	2402.16034	null
2024-02-25	Adversarial-Robust Transfer Learning for Medical Imaging via Domain Assimilation	Xiaohui Chen et.al.	2402.16005	null
2024-02-25	Exploring the Power of Pure Attention Mechanisms in Blind Room Parameter Estimation	Chunxi Wang et.al.	2402.16003	null
2024-02-25	VOLoc: Visual Place Recognition by Querying Compressed Lidar Map	Xudong Cai et.al.	2402.15961	link
2024-02-23	Artificial Bee Colony optimization of Deep Convolutional Neural Networks in the context of Biomedical Imaging	Adri Gomez Martin et.al.	2402.15246	null
2024-02-23	Which Model to Transfer? A Survey on Transferability Estimation	Yuhe Ding et.al.	2402.15231	null
2024-02-23	Substrate Prediction for RiPP Biosynthetic Enzymes via Masked Language Modeling and Transfer Learning	Joseph D. Clark et.al.	2402.15181	link
2024-02-23	PEMT: Multi-Task Correlation Guided Mixture-of-Experts Enables Parameter-Efficient Transfer Learning	Zhisheng Lin et.al.	2402.15082	null
2024-02-22	Smoothness Adaptive Hypothesis Transfer Learning	Haotian Lin et.al.	2402.14966	null
2024-02-22	An image-based transfer learning approach for using in situ processing data to predict laser powder bed fusion additively manufactured Ti-6Al-4V mechanical properties	Qixiang Luo et.al.	2402.14945	null
2024-02-22	SHM-Traffic: DRL and Transfer learning based UAV Control for Structural Health Monitoring of Bridges with Traffic	Divija Swetha Gadiraju et.al.	2402.14757	null
2024-02-22	CLCE: An Approach to Refining Cross-Entropy and Contrastive Learning for Optimized Learning Fusion	Zijun Long et.al.	2402.14551	null
2024-02-21	Simple and Effective Transfer Learning for Neuro-Symbolic Integration	Alessandro Daniele et.al.	2402.14047	null
2024-02-21	UniGraph: Learning a Cross-Domain Graph Foundation Model From Natural Language	Yufei He et.al.	2402.13630	link
2024-02-21	ARL2: Aligning Retrievers for Black-box Large Language Models via Self-guided Adaptive Relevance Labeling	Lingxi Zhang et.al.	2402.13542	null
2024-02-20	LinkSAGE: Optimizing Job Matching Using Graph Neural Networks	Ping Liu et.al.	2402.13430	null
2024-02-20	Cross-Domain Transfer Learning with CoRTe: Consistent and Reliable Transfer from Black-Box to Lightweight Segmentation Model	Claudia Cuttano et.al.	2402.13122	null
2024-02-20	CST: Calibration Side-Tuning for Parameter and Memory Efficient Transfer Learning	Feng Chen et.al.	2402.12736	null
2024-02-20	Scalable and reliable deep transfer learning for intelligent fault detection via multi-scale neural processes embedded with knowledge	Zhongzhi Li et.al.	2402.12729	null
2024-02-20	Iterated learning and multiscale modeling of history-dependent architectured metamaterials	Yupeng Zhang et.al.	2402.12674	null
2024-02-20	Indiscriminate Data Poisoning Attacks on Pre-trained Feature Extractors	Yiwei Lu et.al.	2402.12626	null
2024-02-19	Predicting trucking accidents with truck drivers’ safety climate perception across companies: A transfer learning approach	Kailai Sun et.al.	2402.12417	null
2024-02-19	A synthetic data approach for domain generalization of NLI models	Mohammad Javad Hosseini et.al.	2402.12368	null
2024-02-19	Molecule Generation and Optimization for Efficient Fragrance Creation	Bruno C. L. Rodrigues et.al.	2402.12134	link
2024-02-19	Stealing the Invisible: Unveiling Pre-Trained CNN Models through Adversarial Examples and Timing Side-Channels	Shubhi Shukla et.al.	2402.11953	null
2024-02-20	A Generative Pre-Training Framework for Spatio-Temporal Graph Transfer Learning	Yuan Yuan et.al.	2402.11922	link
2024-02-18	Autocorrect for Estonian texts: final report from project EKTB25	Agnes Luhtaru et.al.	2402.11671	null
2024-02-17	ZeroG: Investigating Cross-dataset Zero-shot Transferability in Graphs	Yuhan Li et.al.	2402.11235	link
2024-02-17	A Question Answering Based Pipeline for Comprehensive Chinese EHR Information Extraction	Huaiyuan Ying et.al.	2402.11177	null
2024-02-16	Robust agents learn causal world models	Jonathan Richens et.al.	2402.10877	null
2024-02-16	Differential Private Federated Transfer Learning for Mental Health Monitoring in Everyday Settings: A Case Study on Stress Detection	Ziyu Wang et.al.	2402.10862	null
2024-02-16	Masked Attention is All You Need for Graphs	David Buterez et.al.	2402.10793	null
2024-02-16	Personalised Drug Identifier for Cancer Treatment with Transformers using Auxiliary Information	Aishwarya Jayagopal et.al.	2402.10551	link
2024-02-15	Data Augmentation and Transfer Learning Approaches Applied to Facial Expressions Recognition	Enrico Randellini et.al.	2402.09982	null
2024-02-15	Are Odd Radio Circles phoenixes of powerful radio galaxies?	Stanislav Shabala et.al.	2402.09708	null
2024-02-15	Towards Precision Cardiovascular Analysis in Zebrafish: The ZACAF Paradigm	Amir Mohammad Naderi et.al.	2402.09658	null
2024-02-14	Prediction of Activated Sludge Settling Characteristics from Microscopy Images with Deep Convolutional Neural Networks and Transfer Learning	Sina Borzooei et.al.	2402.09367	link
2024-02-14	Few-Shot Object Detection with Sparse Context Transformers	Jie Mei et.al.	2402.09315	null
2024-02-15	Multi-Hierarchical Surrogate Learning for Structural Dynamical Crash Simulations Using Graph Convolutional Neural Networks	Jonas Kneifl et.al.	2402.09234	null
2024-02-14	Tackling Negative Transfer on Graphs	Zehong Wang et.al.	2402.08907	link
2024-02-14	Multiscale graph neural networks with adaptive mesh refinement for accelerating mesh-based simulations	Roberto Perera et.al.	2402.08863	null
2024-02-13	Bayesian Multi-Task Transfer Learning for Soft Prompt Tuning	Haeju Lee et.al.	2402.08594	link
2024-02-13	Convolutional Neural Networks Towards Facial Skin Lesions Detection	Reza Sarshar et.al.	2402.08592	null
2024-02-13	FedLPS: Heterogeneous Federated Learning for Multiple Tasks with Local Parameter Sharing	Yongzhe Jia et.al.	2402.08578	link
2024-02-13	Enabling Multi-Agent Transfer Reinforcement Learning via Scenario Independent Representation	Ayesha Siddika Nipu et.al.	2402.08184	null
2024-02-12	A Competition Winning Deep Reinforcement Learning Agent in microRTS	Scott Goodfriend et.al.	2402.08112	link
2024-02-12	MODIPHY: Multimodal Obscured Detection for IoT using PHantom Convolution-Enabled Faster YOLO	Shubhabrata Mukherjee et.al.	2402.07894	link
2024-02-13	Comparative Analysis of ImageNet Pre-Trained Deep Learning Models and DINOv2 in Medical Imaging Classification	Yuning Huang et.al.	2402.07595	link
2024-02-11	Multi-Modal Emotion Recognition by Text, Speech and Video Using Pretrained Transformers	Minoo Shayaninasab et.al.	2402.07327	null
2024-02-10	An Optimization Framework for Processing and Transfer Learning for the Brain Tumor Segmentation	Tianyi Ren et.al.	2402.07008	null
2024-02-10	Should I try multiple optimizers when fine-tuning pre-trained Transformers for NLP tasks? Should I tune their hyperparameters?	Nefeli Gkouti et.al.	2402.06948	null
2024-02-09	Transfer learning with generative models for object detection on limited datasets	Matteo Paiano et.al.	2402.06784	null
2024-02-09	Transferring facade labels between point clouds with semantic octrees while considering change detection	Sophia Schwarz et.al.	2402.06531	link
2024-02-09	BarlowTwins-CXR : Enhancing Chest X-Ray abnormality localization in heterogeneous data with cross-domain self-supervised learning	Haoyue Sheng et.al.	2402.06499	null
2024-02-12	Text-to-Code Generation with Modality-relative Pre-training	Fenia Christopoulou et.al.	2402.05783	null
2024-02-08	Transfer learning of optimal QAOA parameters in combinatorial optimization	J. A. Montanez-Barrera et.al.	2402.05549	null
2024-02-05	Enhancing Textbook Question Answering Task with Large Language Models and Retrieval Augmented Generation	Hessa Abdulrahman Alawwad et.al.	2402.05128	link
2024-02-07	Group Distributionally Robust Dataset Distillation with Risk Minimization	Saeed Vahidian et.al.	2402.04676	link
2024-02-07	Triplet Interaction Improves Graph Transformers: Accurate Molecular Graph Learning with Triplet Graph Transformers	Md Shamim Hussain et.al.	2402.04538	link
2024-02-06	Scaling Laws for Downstream Task Performance of Large Language Models	Berivan Isik et.al.	2402.04177	null
2024-02-06	Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models	Jianyuan Guo et.al.	2402.03749	link
2024-02-06	Symbol Correctness in Deep Neural Networks Containing Symbolic Layers	Aaron Bembenek et.al.	2402.03663	null
2024-02-04	Survival and grade of the glioma prediction using transfer learning	Santiago Valbuena Rubio et.al.	2402.03384	null
2024-02-05	Constrained Decoding for Cross-lingual Label Projection	Duong Minh Le et.al.	2402.03131	link
2024-02-04	Pruner: An Efficient Cross-Platform Tensor Compiler with Dual Awareness	Liang Qiao et.al.	2402.02361	link
2024-02-03	InceptionCapsule: Inception-Resnet and CapsuleNet with self-attention for medical image Classification	Elham Sadeghnezhad et.al.	2402.02274	null
2024-02-08	Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey	Yi Xin et.al.	2402.02242	link
2024-02-03	Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties	Ekaterina Artemova et.al.	2402.02078	link
2024-02-03	Transfer Learning in ECG Diagnosis: Is It Effective?	Cuong V. Nguyen et.al.	2402.02021	link
2024-02-03	Enhancing the efficiency of protein language models with minimal wet-lab data through few-shot learning	Ziyi Zhou et.al.	2402.02004	null
2024-02-03	Online Transfer Learning for RSV Case Detection	Yiming Sun et.al.	2402.01987	null
2024-02-02	Exploring transfer learning for pathological speech feature prediction: Impact of layer selection	Daniela A. Wiepert et.al.	2402.01796	link
2024-02-02	cmaes : A Simple yet Practical Python Library for CMA-ES	Masahiro Nomura et.al.	2402.01373	link
2024-02-05	Cascaded Scaling Classifier: class incremental learning with probability scaling	Jary Pomponi et.al.	2402.01262	link
2024-02-02	Double-Dip: Thwarting Label-Only Membership Inference Attacks with Transfer Learning and Randomization	Arezoo Rajabi et.al.	2402.01114	null
2024-02-01	Graph Domain Adaptation: Challenges, Progress and Prospects	Boshen Shi et.al.	2402.00904	link
2024-02-01	Efficient Fine-tuning of Audio Spectrogram Transformers via Soft Mixture of Adapters	Umberto Cappellazzo et.al.	2402.00828	link
2024-02-01	Control-Theoretic Techniques for Online Adaptation of Deep Neural Networks in Dynamical Systems	Jacob G. Elkins et.al.	2402.00761	null
2024-02-01	HAYATE: Photometric redshift estimation by hybridising machine learning with template fitting	Shingo Tanigawa et.al.	2402.00323	null
2024-01-31	MelNet: A Real-Time Deep Learning Algorithm for Object Detection	Yashar Azadvatan et.al.	2401.17972	null
2024-01-30	Fine-tuning Transformer-based Encoder for Turkish Language Understanding Tasks	Savas Yildirim et.al.	2401.17396	null
2024-01-30	Transfer Learning for Text Diffusion Models	Kehang Han et.al.	2401.17181	null
2024-01-30	Finetuning Large Language Models for Vulnerability Detection	Alexey Shestov et.al.	2401.17010	link
2024-01-30	Quantum Transfer Learning with Adversarial Robustness for Classification of High-Resolution Image Datasets	Amena Khatun et.al.	2401.17009	null
2024-01-30	A Framework of Data Assimilation for Wind Flow Fields by Physics-informed Neural Networks	Chang Yan et.al.	2401.17001	link
2024-01-30	Multiple Yield Curve Modeling and Forecasting using Deep Learning	Ronald Richman et.al.	2401.16985	null
2024-01-29	Credit Risk Meets Large Language Models: Building a Risk Indicator from Loan Descriptions in P2P Lending	Mario Sanz-Guerrero et.al.	2401.16458	null
2024-01-29	Capturing Pertinent Symbolic Features for Enhanced Content-Based Misinformation Detection	Flavio Merenda et.al.	2401.16285	link
2024-01-29	Domain adaptation strategies for 3D reconstruction of the lumbar spine using real fluoroscopy data	Sascha Jecklin et.al.	2401.16027	null
2024-01-29	GPS: Graph Contrastive Learning via Multi-scale Augmented Views from Adversarial Pooling	Wei Ju et.al.	2401.16011	null
2024-01-29	MV2MAE: Multi-View Video Masked Autoencoders	Ketul Shah et.al.	2401.15900	null
2024-01-27	Exploring the Transferability of a Foundation Model for Fundus Images: Application to Hypertensive Retinopathy	Julio Silva-Rodriguez et.al.	2401.15526	null
2024-01-27	A New Method for Vehicle Logo Recognition Based on Swin Transformer	Yang Li et.al.	2401.15458	null
2024-01-27	GEM: Boost Simple Network for Glass Surface Segmentation via Segment Anything Model and Data Synthesis	Jing Hao et.al.	2401.15282	link
2024-01-26	Transfer Learning for the Prediction of Entity Modifiers in Clinical Text: Application to Opioid Use Disorder Case Detection	Abdullateef I. Almudaifer et.al.	2401.15222	null
2024-01-26	Additional Look into GAN-based Augmentation for Deep Learning COVID-19 Image Classification	Oleksandr Fedoruk et.al.	2401.14705	null
2024-01-26	Asymptotic Midpoint Mixup for Margin Balancing and Moderate Broadening	Hoyong Kim et.al.	2401.14696	null
2024-01-23	Multi-Agent Based Transfer Learning for Data-Driven Air Traffic Applications	Chuhao Deng et.al.	2401.14421	null
2024-01-25	Assessing the Portability of Parameter Matrices Trained by Parameter-Efficient Finetuning Methods	Mohammed Sabry et.al.	2401.14228	null
2024-01-25	Deep Learning Innovations in Diagnosing Diabetic Retinopathy: The Potential of Transfer Learning and the DiaCNN Model	Mohamed R. Shoaib et.al.	2401.13990	null
2024-01-25	StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models	Yalong Bai et.al.	2401.13942	null
2024-01-25	A comparative study of zero-shot inference with large language models and supervised modeling in breast cancer pathology classification	Madhumita Sushil et.al.	2401.13887	null
2024-01-24	Don’t Push the Button! Exploring Data Leakage Risks in Machine Learning and Transfer Learning	Andrea Apicella et.al.	2401.13796	null
2024-01-24	SEDNet: Shallow Encoder-Decoder Network for Brain Tumor Segmentation	Chollette C. Olisah et.al.	2401.13403	link
2024-01-23	TCE at Qur’an QA 2023 Shared Task: Low Resource Enhanced Transformer-based Ensemble Approach for Qur’anic QA	Mohammed Alaa Elkomy et.al.	2401.13060	link
2024-01-23	Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning?	Cheng Han et.al.	2401.12902	link
2024-01-23	Deep reinforcement transfer learning for active flow control of a 3D square cylinder under state dimension mismatch	Lei Yan et.al.	2401.12543	null
2024-01-22	Contrastive Learning and Cycle Consistency-based Transductive Transfer Learning for Target Annotation	Shoaib Meraj Sami et.al.	2401.12340	null
2024-01-22	Transfer Learning for Functional Mean Estimation: Phase Transition and Adaptive Algorithms	T. Tony Cai et.al.	2401.12331	null
2024-01-22	Cheap Learning: Maximising Performance of Language Models for Social Data Science Using Minimal Data	Leonardo Castro-Gonzalez et.al.	2401.12295	link
2024-01-22	Transfer Learning for Nonparametric Regression: Non-asymptotic Minimax Analysis and Adaptive Procedure	T. Tony Cai et.al.	2401.12272	null
2024-01-21	Transfer learning-assisted inverse modeling in nanophotonics based on mixture density networks	Liang Cheng et.al.	2401.12254	null
2024-01-22	Less Could Be Better: Parameter-efficient Fine-tuning Advances Medical Vision Foundation Models	Chenyu Lian et.al.	2401.12215	link
2024-01-22	Cross-lingual Transfer Learning for Javanese Dependency Parsing	Fadli Aulawi Al Ghiffari et.al.	2401.12072	null
2024-01-22	Feature Denoising Diffusion Model for Blind Image Quality Assessment	Xudong Li et.al.	2401.11949	null
2024-01-21	Transfer Learning under Covariate Shift: Local $k$-Nearest Neighbours Regression with Heavy-Tailed Design	Petr Zamolodtchikov et.al.	2401.11554	null
2024-01-20	A Hybrid Approach of Transfer Learning and Physics-Informed Modeling: Improving Dissolved Oxygen Concentration Prediction in an Industrial Wastewater Treatment Plant	Ece S. Koksal et.al.	2401.11217	null
2024-01-19	A Systematic Evaluation of Euclidean Alignment with Deep Learning for EEG Decoding	Bruna Junqueira et.al.	2401.10746	null
2024-01-19	Name Tagging Under Domain Shift via Metric Learning for Life Sciences	Hongyi Liu et.al.	2401.10472	link
2024-01-18	Transfer Learning in Human Activity Recognition: A Survey	Sourish Gunesh Dhekane et.al.	2401.10185	null
2024-01-18	Few-shot learning for COVID-19 Chest X-Ray Classification with Imbalanced Data: An Inter vs. Intra Domain Study	Alejandro Galán-Cuenca et.al.	2401.10129	link
2024-01-18	Material-Response-Informed DeepONet and its Application to Polycrystal Stress-strain Prediction in Crystal Plasticity	Junyan He et.al.	2401.09977	null
2024-01-12	Transcending Controlled Environments: Assessing the Transferability of ASR-Robust NLU Models to Real-World Applications	Hania Khan et.al.	2401.09354	null
2024-01-17	Material Informatics through Neural Networks on Ab-Initio Electron Charge Densities: the Role of Transfer Learning	Dario Massa et.al.	2401.09301	null
2024-01-17	Cross-lingual Offensive Language Detection: A Systematic Review of Datasets, Transfer Approaches and Challenges	Aiqi Jiang et.al.	2401.09244	link
2024-01-17	Toward Diverse Polymer Property Prediction Using Transfer Learning	Elaheh Kazemi-Khasragh et.al.	2401.09139	null
2024-01-16	Using i-vectors for subject-independent cross-session EEG transfer learning	Jonathan Lasko et.al.	2401.08851	null
2024-01-16	Surface-Enhanced Raman Spectroscopy and Transfer Learning Toward Accurate Reconstruction of the Surgical Zone	Ashutosh Raman et.al.	2401.08821	null
2024-01-16	Selecting Subsets of Source Data for Transfer Learning with Applications in Metal Additive Manufacturing	Yifan Tang et.al.	2401.08715	null
2024-01-16	N-Adaptive Ritz Method: A Neural Network Enriched Partition of Unity for Boundary Value Problems	Jonghyuk Baek et.al.	2401.08544	null
2024-01-16	AGN jet-inflated bubbles as possible origin of odd radio circles	Yen-Hsing Lin et.al.	2401.08207	null
2024-01-16	Transferring Core Knowledge via Learngenes	Fu Feng et.al.	2401.08139	null
2024-01-15	6-DoF Grasp Pose Evaluation and Optimization via Transfer Learning from NeRFs	Gergely Sóti et.al.	2401.07935	null
2024-01-15	Quantum Transfer Learning for Acceptability Judgements	Giuseppe Buonaiuto et.al.	2401.07777	null
2024-01-14	Harnessing Machine Learning for Discerning AI-Generated Synthetic Images	Yuyang Wang et.al.	2401.07358	null
2024-01-13	Concrete Surface Crack Detection with Convolutional-based Deep Learning Models	Sara Shomal Zadeh et.al.	2401.07124	null
2024-01-13	Bayesian Signal Matching for Transfer Learning in ERP-Based Brain Computer Interface	Tianwen Ma et.al.	2401.07111	null
2024-01-12	PyTy: Repairing Static Type Errors in Python	Yiu Wai Chow et.al.	2401.06619	link
2024-01-12	PersianMind: A Cross-Lingual Persian-English Large Language Model	Pedram Rostami et.al.	2401.06466	null
2024-01-11	Zero Resource Cross-Lingual Part Of Speech Tagging	Sahil Chopra et.al.	2401.05727	null
2024-01-16	POMP: Probability-driven Meta-graph Prompter for LLMs in Low-resource Unsupervised Neural Machine Translation	Shilong Pan et.al.	2401.05596	null
2024-01-10	Enhancing Blood Flow Assessment in Diffuse Correlation Spectroscopy: A Transfer Learning Approach with Noise Robustness Analysis	Xi Chen et.al.	2401.05580	null
2024-01-10	VI-PANN: Harnessing Transfer Learning and Uncertainty-Aware Variational Inference for Improved Generalization in Audio Pattern Recognition	John Fischer et.al.	2401.05531	link
2024-01-10	Consensus Focus for Object Detection and minority classes	Erik Isai Valle Salgado et.al.	2401.05530	link
2024-01-10	Taming “data-hungry” reinforcement learning? Stability in continuous state-action spaces	Yaqi Duan et.al.	2401.05233	null
2024-01-10	Neural Population Learning beyond Symmetric Zero-sum Games	Siqi Liu et.al.	2401.05133	null
2024-01-09	Arabic Text Diacritization In The Age Of Transfer Learning: Token Classification Is All You Need	Abderrahman Skiredj et.al.	2401.04848	null
2024-01-10	Low-Resource Vision Challenges for Foundation Models	Yunhua Zhang et.al.	2401.04716	null
2024-01-09	Transfer-Learning-Based Autotuning Using Gaussian Copula	Thomas Randall et.al.	2401.04669	link
2024-01-11	Tiny Time Mixers (TTMs): Fast Pretrained Models for Enhanced Zero/Few-Shot Forecasting of Multivariate Time Series	Vijay Ekambaram et.al.	2401.03955	link
2024-01-08	Attention-Guided Erasing: A Novel Augmentation Method for Enhancing Downstream Breast Density Classification	Adarsh Bhandary Panambur et.al.	2401.03912	null
2024-01-08	Anatomy of Neural Language Models	Majd Saleh et.al.	2401.03797	link
2024-01-07	Improving Transferability of Network Intrusion Detection in a Federated Learning Setup	Shreya Ghosh et.al.	2401.03560	link
2024-01-06	Efficient Bitrate Ladder Construction using Transfer Learning and Spatio-Temporal Features	Ali Falahati et.al.	2401.03195	link
2024-01-06	Transferable Learned Image Compression-Resistant Adversarial Perturbations	Yang Sui et.al.	2401.03115	null
2024-01-05	Physics-Informed Neural Networks for High-Frequency and Multi-Scale Problems using Transfer Learning	Abdul Hannan Mustajab et.al.	2401.02810	null
2024-01-05	Detection and Classification of Diabetic Retinopathy using Deep Learning Algorithms for Segmentation to Facilitate Referral Recommendation for Test and Treatment Prediction	Manoj S H et.al.	2401.02759	link
2024-01-05	Nurse-in-the-Loop Artificial Intelligence for Precision Management of Type 2 Diabetes in a Clinical Trial Utilizing Transfer-Learned Predictive Digital Twin	Syed Hasib Akhter Faruqui et.al.	2401.02661	null
2024-01-05	GTA: Guided Transfer of Spatial Attention from Object-Centric Representations	SeokHyun Seo et.al.	2401.02656	null
2024-01-04	Multi-Source Domain Adaptation with Transformer-based Feature Generation for Subject-Independent EEG-based Emotion Recognition	Shadi Sartipi et.al.	2401.02344	null
2024-01-03	A Comparative Study with Traditional and Transfer Learning-enhanced Machine Learning Algorithms for Geotechnical Characterisation of Coal Spoil	Sureka Thiruchittampalam et.al.	2401.01969	null
2024-01-03	Graph Neural Networks for Surfactant Multi-Property Prediction	Christoforos Brozos et.al.	2401.01874	link
2023-12-21	Discovery of a circular symmetry extended diffuse radio emission around an elliptical galaxy with the VLA FIRST survey	Shobha Kumari et.al.	2401.01278	null
2024-01-02	GBSS:a global building semantic segmentation dataset for large-scale remote sensing building extraction	Yuping Hu et.al.	2401.01178	null
2024-01-01	Self-supervised learning for skin cancer diagnosis with limited training data	Hamish Haggerty et.al.	2401.00692	link
2023-12-30	AClassiHonk: A System Framework to Annotate and Classify Vehicular Honk from Road Traffic	Biswajit Maitya et.al.	2401.00154	null
2023-12-29	FedLED: Label-Free Equipment Fault Diagnosis with Vertical Federated Transfer Learning	Jie Shen et.al.	2312.17451	null
2023-12-28	OmniDialog: An Omnipotent Pre-training Model for Task-Oriented Dialogue System	Mingtao Yang et.al.	2312.16864	null
2023-12-29	GRSDet: Learning to Generate Local Reverse Samples for Few-shot Object Detection	Hefei Mei et.al.	2312.16571	null
2023-12-27	Soft Contrastive Learning for Time Series	Seunghan Lee et.al.	2312.16424	link
2023-12-26	EnchantDance: Unveiling the Potential of Music-Driven Dance Movement	Bo Han et.al.	2312.15946	link
2023-12-25	TimesURL: Self-supervised Contrastive Learning for Universal Time Series Representation Learning	Jiexi Liu et.al.	2312.15709	link
2023-12-25	APTv2: Benchmarking Animal Pose Estimation and Tracking with a Large-scale Dataset and Beyond	Yuxiang Yang et.al.	2312.15612	link
2023-12-24	Leveraging Public Representations for Private Transfer Learning	Pratiksha Thaker et.al.	2312.15551	link
2023-12-24	Agent based modelling for continuously varying supply chains	Wan Wang et.al.	2312.15502	null
2023-12-22	Efficient Discrete Physics-informed Neural Networks for Addressing Evolutionary Partial Differential Equations	Siqi Chen et.al.	2312.14608	null
2023-12-21	Crystal Growth Characterization of WSe$_2$ Thin Film Using Machine Learning	Isaiah A. Moses et.al.	2312.14311	null
2023-12-25	Hierarchical Topology Isomorphism Expertise Embedded Graph Contrastive Learning	Jiangmeng Li et.al.	2312.14222	link
2023-12-21	BrainTalker: Low-Resource Brain-to-Speech Synthesis with Transfer Learning using Wav2Vec 2.0	Miseul Kim et.al.	2312.13600	null
2023-12-21	Fine-tuning Graph Neural Networks by Preserving Graph Generative Patterns	Yifei Sun et.al.	2312.13583	link
2023-12-20	Bayesian Transfer Learning	Piotr M. Suder et.al.	2312.13484	null
2023-12-20	1D-CNN Optimization for Non-contact Respiration Pattern Classification	Md Zobaer Islam et.al.	2312.13035	null
2023-12-20	Heterogeneous Transfer Learning for Building High-Dimensional Generalized Linear Models with Disparate Datasets	Ruzhang Zhao et.al.	2312.12786	link
2023-12-20	A Closer Look at the Few-Shot Adaptation of Large Vision-Language Models	Julio Silva-Rodriguez et.al.	2312.12730	link
2023-12-19	H-ensemble: An Information Theoretic Approach to Reliable Few-Shot Multi-Source-Free Transfer	Yanru Wu et.al.	2312.12489	null
2023-12-19	Value Explicit Pretraining for Goal-Based Transfer Learning	Kiran Lekkala et.al.	2312.12339	null
2023-12-19	Empowering Dual-Level Graph Self-Supervised Pretraining with Motif Discovery	Pengwei Yan et.al.	2312.11927	link
2023-12-19	Point Cloud Segmentation Using Transfer Learning with RandLA-Net: A Case Study on Urban Areas	Alperen Enes Bayar et.al.	2312.11880	null
2023-12-18	AI-Based Energy Transportation Safety: Pipeline Radial Threat Estimation Using Intelligent Sensing System	Chengyuan Zhu et.al.	2312.11583	null
2023-12-18	Ensuring Cross-Device Portability of Electromagnetic Side-Channel Analysis	Lojenaa Navanesana et.al.	2312.11301	null
2023-12-18	LaViP:Language-Grounded Visual Prompts	Nilakshan Kunananthaseelan et.al.	2312.10945	null
2023-12-18	Domain adaption and physical constrains transfer learning for shale gas production	Zhaozhong Yang et.al.	2312.10920	null
2023-12-17	Cross-Domain Robustness of Transformer-based Keyphrase Generation	Anna Glazkova et.al.	2312.10700	null
2023-12-17	p-Laplacian Adaptation for Generative Pre-trained Vision-Language Models	Haoyuan Wu et.al.	2312.10613	link
2023-12-16	Optimizing Dense Feed-Forward Neural Networks	Luis Balderas et.al.	2312.10560	null
2023-12-15	One Self-Configurable Model to Solve Many Abstract Visual Reasoning Problems	Mikołaj Małkiński et.al.	2312.09997	link
2023-12-18	Multi-Modality is All You Need for Transferable Recommender Systems	Youhua Li et.al.	2312.09602	link
2023-12-21	Enhancing Data Lakes with GraphAr: Efficient Graph Data Management with a Specialized Storage Scheme	Xue Li et.al.	2312.09577	link
2023-12-14	Weight subcloning: direct initialization of transformers using larger pretrained ones	Mohammad Samragh et.al.	2312.09299	null
2023-12-14	Bayesian Optimization for Robust State Preparation in Quantum Many-Body Systems	Tizian Blatz et.al.	2312.09253	null
2023-12-14	Applying Pre-Trained Deep-Learning Model on Wrist Angel Data – An Analysis Plan	Harald Vilhelm Skat-Rørdam et.al.	2312.09052	null
2023-12-14	Context-PEFT: Efficient Multi-Modal, Multi-Task Fine-Tuning	Avelina Asada Hadji-Kyriacou et.al.	2312.08900	null
2023-12-12	AdaptIR: Parameter Efficient Multi-task Adaptation for Pre-trained Image Restoration Models	Hang Guo et.al.	2312.08881	link
2023-12-15	VMT-Adapter: Parameter-Efficient Transfer Learning for Multi-Task Dense Scene Understanding	Yi Xin et.al.	2312.08733	null
2023-12-14	MmAP : Multi-modal Alignment Prompt for Cross-domain Multi-task Learning	Yi Xin et.al.	2312.08636	null
2023-12-13	Distributional Robustness and Transfer Learning Through Empirical Bayes	Michael Law et.al.	2312.08485	null
2023-12-13	Explainable AI in Grassland Monitoring: Enhancing Model Performance and Domain Adaptability	Shanghua Liu et.al.	2312.08408	null
2023-12-12	Taking it further: leveraging pseudo labels for field delineation across label-scarce smallholder regions	Philippe Rufin et.al.	2312.08384	null
2023-12-13	Robust Few-Shot Named Entity Recognition with Boundary Discrimination and Correlation Purification	Xiaojun Xue et.al.	2312.07961	link
2023-12-13	DTL: Disentangled Transfer Learning for Visual Recognition	Minghao Fu et.al.	2312.07856	link
2023-12-12	Automated Behavioral Analysis Using Instance Segmentation	Chen Yang et.al.	2312.07723	link
2023-12-12	Reacting like Humans: Incorporating Intrinsic Human Behaviors into NAO through Sound-Based Reactions for Enhanced Sociability	Ali Ghadami et.al.	2312.07671	null
2023-12-10	COVID-19 Detection Using Slices Processing Techniques and a Modified Xception Classifier from Computed Tomography Images	Kenan Morani et.al.	2312.07580	link
2023-12-12	Medical Image Classification Using Transfer Learning and Chaos Game Optimization on the Internet of Medical Things	Alhassan Mabrouk et.al.	2312.07437	null
2023-12-12	NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image	Yoonwoo Jeong et.al.	2312.07315	link
2023-12-12	Neural Machine Translation of Clinical Text: An Empirical Investigation into Multilingual Pre-Trained Language Models and Transfer-Learning	Lifeng Han et.al.	2312.07250	link
2023-12-12	Dynamic Corrective Self-Distillation for Better Fine-Tuning of Pretrained Models	Ibtihel Amara et.al.	2312.07028	null
2023-12-12	READ-PVLA: Recurrent Adapter with Partial Video-Language Alignment for Parameter-Efficient Transfer Learning in Low-Resource Video-Language Modeling	Thong Nguyen et.al.	2312.06950	link
2023-12-12	Enhanced Q-Learning Approach to Finite-Time Reachability with Maximum Probability for Probabilistic Boolean Control Networks	Hongyue Fan et.al.	2312.06904	null
2023-12-14	Understanding and Leveraging the Learning Phases of Neural Networks	Johannes Schneider et.al.	2312.06887	null
2023-12-11	The improved backward compatible physics-informed neural networks for reducing error accumulation and applications in data-driven higher-order rogue waves	Shuning Lin et.al.	2312.06715	null
2023-12-11	Stoch BiRo: Design and Control of a low cost bipedal robot	GVS Mothish et.al.	2312.06512	null
2023-12-11	Towards Domain-Specific Cross-Corpus Speech Emotion Recognition Approach	Yan Zhao et.al.	2312.06466	null
2023-12-11	The Intrinsic Sizes of Odd Radio Circles	David Rupke et.al.	2312.06387	null
2023-12-11	MMDesign: Multi-Modality Transfer Learning for Generative Protein Design	Jiangbin Zheng et.al.	2312.06297	null
2023-12-10	Natural Interaction Modalities for Human-CPS Interaction in Construction Progress Monitoring	Srijeet Halder et.al.	2312.05988	null
2023-12-10	Jumpstarting Surgical Computer Vision	Deepak Alapatt et.al.	2312.05968	null
2023-12-10	Initialization Matters for Adversarial Transfer Learning	Andong Hua et.al.	2312.05716	link
2023-12-09	Teamwork Dimensions Classification Using BERT	Junyoung Lee et.al.	2312.05483	null
2023-12-09	Model Evaluation for Domain Identification of Unknown Classes in Open-World Recognition: A Proposal	Gusti Ahmad Fanshuri Alfarisy et.al.	2312.05454	null
2023-12-07	Enhancing Polynomial Chaos Expansion Based Surrogate Modeling using a Novel Probabilistic Transfer Learning Strategy	Wyatt Bridgman et.al.	2312.04648	null
2023-12-07	TLCE: Transfer-Learning Based Classifier Ensembles for Few-Shot Class-Incremental Learning	Shuangmei Wang et.al.	2312.04225	null
2023-12-07	Small Area Estimation of Case Growths for Timely COVID-19 Outbreak Detection	Zhaowei She et.al.	2312.04110	link
2023-12-07	A Review and Taxonomy of Methods for Quantifying Dataset Similarity	Marieke Stolte et.al.	2312.04078	null
2023-12-06	A Scalable and Generalizable Pathloss Map Prediction	Ju-Hyung Lee et.al.	2312.03950	link
2023-12-07	Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers	Umberto Cappellazzo et.al.	2312.03694	link
2023-12-06	Transfer learning for galaxy feature detection: Finding Giant Star-forming Clumps in low redshift galaxies using Faster R-CNN	Jürgen Popp et.al.	2312.03503	link
2023-12-07	SVQ: Sparse Vector Quantization for Spatiotemporal Forecasting	Chao Chen et.al.	2312.03406	link
2023-12-06	Optimizing Two-Pass Cross-Lingual Transfer Learning: Phoneme Recognition and Phoneme to Grapheme Translation	Wonjun Lee et.al.	2312.03312	null
2023-12-06	Customizable Combination of Parameter-Efficient Modules for Multi-Task Learning	Haowen Wang et.al.	2312.03248	null
2023-12-05	Enhanced Breast Cancer Tumor Classification using MobileNetV2: A Detailed Exploration on Image Intensity, Error Mitigation, and Streamlit-driven Real-time Deployment	Aaditya Surya et.al.	2312.03020	null
2023-12-05	Applications of Domain Adversarial Neural Network in phase transition of 3D Potts model	Xiangna Chen et.al.	2312.02479	null
2023-12-02	Disentangling the Effects of Data Augmentation and Format Transform in Self-Supervised Learning of Image Representations	Neha Kalibhat et.al.	2312.02205	null
2023-12-04	VLTSeg: Simple Transfer of CLIP-Based Vision-Language Representations for Domain Generalized Semantic Segmentation	Christoph Hümmer et.al.	2312.02021	null
2023-12-03	Robust Computer Vision in an Ever-Changing World: A Survey of Techniques for Tackling Distribution Shifts	Eashan Adhikarla et.al.	2312.01540	null
2023-12-03	Facial Emotion Recognition Under Mask Coverage Using a Data Augmentation Technique	Aref Farhadipour et.al.	2312.01335	link
2023-12-02	A Comparative Analysis Towards Melanoma Classification Using Transfer Learning by Analyzing Dermoscopic Images	Md. Fahim Uddin et.al.	2312.01212	null
2023-12-02	Efficient Expansion and Gradient Based Task Inference for Replay Free Incremental Learning	Soumya Roy et.al.	2312.01188	null
2023-12-02	SASSL: Enhancing Self-Supervised Learning via Neural Style Transfer	Renan A. Rojas-Gomez et.al.	2312.01187	null
2023-12-02	Rapid Speaker Adaptation in Low Resource Text to Speech Systems using Synthetic Data and Transfer learning	Raviraj Joshi et.al.	2312.01107	null
2023-12-02	Code-Mixed Text to Speech Synthesis under Low-Resource Constraints	Raviraj Joshi et.al.	2312.01103	null
2023-12-02	On the Effects of Randomness on Stability of Learning with Limited Labelled Data: A Systematic Literature Review	Branislav Pecher et.al.	2312.01082	null
2023-12-02	Acoustic Signal Analysis with Deep Neural Network for Detecting Fault Diagnosis in Industrial Machines	Mustafa Yurdakul et.al.	2312.01062	null
2023-12-02	Scaling Whole-Chip QAOA for Higher-Order Ising Spin Glass Models on Heavy-Hex Graphs	Elijah Pelofske et.al.	2312.00997	link
2023-12-04	Simple Transferability Estimation for Regression Tasks	Cuong N. Nguyen et.al.	2312.00656	link
2023-12-01	Pathway to a fully data-driven geotechnics: lessons from materials informatics	Stephen Wu et.al.	2312.00581	null
2023-12-01	Explainable AI in Diagnosing and Anticipating Leukemia Using Transfer Learning Method	Wahidul Hasan Abir et.al.	2312.00487	null
2023-12-01	Transfer learning for predicting source terms of principal component transport in chemically reactive flow	Ki Sung Jung et.al.	2312.00356	null
2023-12-01	Student Activity Recognition in Classroom Environments using Transfer Learning	Anagha Deshpande et.al.	2312.00348	null
2023-11-30	Stochastic Vision Transformers with Wasserstein Distance-Aware Attention	Franciskus Xaverius Erick et.al.	2311.18645	null
2023-11-30	Calibration-free online test-time adaptation for electroencephalography motor imagery decoding	Martin Wimpff et.al.	2311.18520	link
2023-11-30	Transfer Learning across Different Chemical Domains: Virtual Screening of Organic Materials with Deep Learning Models Pretrained on Small Molecule and Chemical Reaction Data	Chengwei Zhang et.al.	2311.18377	null
2023-12-01	Learning Robust Precipitation Forecaster by Temporal Frame Interpolation	Lu Han et.al.	2311.18341	link
2023-11-29	Transfer Learning in Robotics: An Upcoming Breakthrough? A Review of Promises and Challenges	Noémie Jaquier et.al.	2311.18044	null
2023-11-29	Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings	Andrea W Wen-Yi et.al.	2311.18034	link
2023-11-29	Latent Alignment with Deep Set EEG Decoders	Stylianos Bakas et.al.	2311.17968	null
2023-11-29	Skilful Precipitation Nowcasting Using NowcastNet	Ajitabh Kumar et.al.	2311.17961	null
2023-11-30	Grounding Foundation Models through Federated Transfer Learning: A General Framework	Yan Kang et.al.	2311.17431	null
2023-11-27	Data Imbalance, Uncertainty Quantification, and Generalization via Transfer Learning in Data-driven Parameterizations: Lessons from the Emulation of Gravity Wave Momentum Transport in WACCM	Y. Qiang Sun et.al.	2311.17078	link
2023-11-28	Natural Language Processing Through Transfer Learning: A Case Study on Sentiment Analysis	Aman Yadav et.al.	2311.16965	null
2023-11-29	ROSO: Improving Robotic Policy Inference via Synthetic Observations	Yusuke Miyashita et.al.	2311.16680	link
2023-11-28	Empowering COVID-19 Detection: Optimizing Performance Through Fine-Tuned EfficientNet Deep Learning Architecture	Md. Alamin Talukder et.al.	2311.16593	null
2023-11-28	FedAL: Black-Box Federated Knowledge Distillation Enabled by Adversarial Learning	Pengchao Han et.al.	2311.16584	null
2023-11-29	Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos	Takehiko Ohkawa et.al.	2311.16444	null
2023-11-27	Transformer-QEC: Quantum Error Correction Code Decoding with Transferable Transformers	Hanrui Wang et.al.	2311.16082	null
2023-11-27	Towards Transfer Learning for Large-Scale Image Classification Using Annealing-based Quantum Boltzmann Machines	Daniëlle Schuman et.al.	2311.15966	null
2023-11-27	Side4Video: Spatial-Temporal Side Network for Memory-Efficient Image-to-Video Transfer Learning	Huanjin Yao et.al.	2311.15769	link
2023-11-27	Machine Learning-Based Jamun Leaf Disease Detection: A Comprehensive Review	Auvick Chandra Bhowmik et.al.	2311.15741	null
2023-11-27	Adinkra Symbol Recognition using Classical Machine Learning and Deep Learning	Michael Adjeisah et.al.	2311.15728	null
2023-11-27	Improving Adaptability and Generalizability of Efficient Transfer Learning for Vision-Language Models	Yongjin Yang et.al.	2311.15569	link
2023-11-26	Untargeted Code Authorship Evasion with Seq2Seq Transformation	Soohyeon Choi et.al.	2311.15366	null
2023-11-26	How much data do I need? A case study on medical data	Ayse Betul Cengiz et.al.	2311.15331	null
2023-11-25	nlpBDpatriots at BLP-2023 Task 2: A Transfer Learning Approach to Bangla Sentiment Analysis	Dhiman Goswami et.al.	2311.15032	null
2023-11-25	One-Shot Transfer Learning for Nonlinear ODEs	Wanzhou Lei et.al.	2311.14931	null
2023-11-24	A Reusable AI-Enabled Defect Detection System for Railway Using Ensembled CNN	Rahatara Ferdousi et.al.	2311.14824	null
2023-11-24	Data-driven Prior Learning for Bayesian Optimisation	Sigrid Passano Hellan et.al.	2311.14653	link
2023-11-24	Machine Translation for Ge’ez Language	Aman Kassahun Wassie et.al.	2311.14530	null
2023-11-23	Video Anomaly Detection using GAN	Anikeit Sethi et.al.	2311.14095	null
2023-11-23	On the Hyperparameter Landscapes of Machine Learning Algorithms	Mingyu Huang et.al.	2311.14014	null
2023-11-23	Bridging Classical and Quantum Machine Learning: Knowledge Transfer From Classical to Quantum Neural Networks Using Knowledge Distillation	Mohammad Junayed Hasan et.al.	2311.13810	null
2023-11-22	End-to-end Transfer Learning for Speaker-independent Cross-language Speech Emotion Recognition	Duowei Tang et.al.	2311.13678	null
2023-11-23	Transfer Learning-based Real-time Handgun Detection	Youssef Elmir et.al.	2311.13559	null
2023-11-22	Recurrent neural networks and transfer learning for elasto-plasticity in woven composites	Ehsan Ghane et.al.	2311.13434	link
2023-11-21	InteRACT: Transformer Models for Human Intent Prediction Conditioned on Robot Actions	Kushal Kedia et.al.	2311.12943	null
2023-11-21	Digital Twin Framework for Optimal and Autonomous Decision-Making in Cyber-Physical Systems: Enhancing Reliability and Adaptability in the Oil and Gas Industry	Carine Menezes Rebello et.al.	2311.12755	null
2023-11-21	Resilient Control of Networked Microgrids using Vertical Federated Reinforcement Learning: Designs and Real-Time Test-Bed Validations	Sayak Mukherjee et.al.	2311.12264	null
2023-11-20	Broadband non-thermal emission of odd radio circles induced by galactic outflow remnants and their evolution	Yutaka Fujita et.al.	2311.12099	null
2023-11-17	Using Guided Transfer Learning to Predispose AI Agent to Learn Efficiently from Small RNA-sequencing Datasets	Kevin Li et.al.	2311.12045	null
2023-11-17	TransCDR: a deep learning model for enhancing the generalizability of cancer drug response prediction through transfer learning and multimodal data fusion for drug representation	Xiaoqiong Xia et.al.	2311.12040	link
2023-11-20	High-performance cVEP-BCI under minimal calibration	Yining Miao et.al.	2311.11596	null
2023-11-20	Event Camera Data Dense Pre-training	Yan Yang et.al.	2311.11533	null
2023-11-19	Towards interpretable-by-design deep learning algorithms	Plamen Angelov et.al.	2311.11396	null
2023-11-19	RflyMAD: A Dataset for Multicopter Fault Detection and Health Assessment	Xiangli Le et.al.	2311.11340	null
2023-11-18	Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning	Clifton Poth et.al.	2311.11077	link
2023-11-18	Bit Cipher – A Simple yet Powerful Word Representation System that Integrates Efficiently with Language Models	Haoran Zhao et.al.	2311.11012	null
2023-11-18	Gendec: A Machine Learning-based Framework for Gender Detection from Japanese Names	Duong Tien Pham et.al.	2311.11001	null
2023-11-18	Towards Robust and Accurate Visual Prompting	Qi Li et.al.	2311.10992	null
2023-11-17	SpACNN-LDVAE: Spatial Attention Convolutional Latent Dirichlet Variational Autoencoder for Hyperspectral Pixel Unmixing	Soham Chitnis et.al.	2311.10701	null
2023-11-17	Physics-Enhanced Multi-fidelity Learning for Optical Surface Imprint	Yongchao Chen et.al.	2311.10278	null
2023-11-16	Harnessing Transformers: A Leap Forward in Lung Cancer Image Detection	Amine Bechar et.al.	2311.09942	null
2023-11-16	Network Wide Evacuation Traffic Prediction in a Rapidly Intensifying Hurricane from Traffic Detectors and Facebook Movement Data: A Deep Learning Approach	Md Mobasshir Rashid et.al.	2311.09498	null
2023-11-15	Combining Transfer Learning with In-context Learning using Blackbox LLMs for Zero-shot Knowledge Base Question Answering	Mayur Patidar et.al.	2311.08894	link
2023-11-15	Language Semantic Graph Guided Data-Efficient Learning	Wenxuan Ma et.al.	2311.08782	link
2023-11-15	Discovery of Diffuse Radio Source in Abell 1060	Kohei Kurahara et.al.	2311.08693	null
2023-11-14	Peer is Your Pillar: A Data-unbalanced Conditional GANs for Few-shot Image Generation	Ziqiang Li et.al.	2311.08217	null
2023-11-14	Residual Importance Weighted Transfer Learning For High-dimensional Linear Regression	Junlong Zhao et.al.	2311.07972	link
2023-11-14	Cross-subject dual-domain fusion network with task-related and task-discriminant component analysis enhancing one-shot SSVEP classification	Yang Deng et.al.	2311.07932	link
2023-11-13	FedOpenHAR: Federated Multi-Task Transfer Learning for Sensor-Based Human Activity Recognition	Egemen İşgüder et.al.	2311.07765	null
2023-11-13	Histopathologic Cancer Detection	Varan Singh Rohila et.al.	2311.07711	link
2023-11-16	Lattice relaxation, electronic structure and continuum model for twisted bilayer MoTe$_2$	Ning Mao et.al.	2311.07533	null
2023-11-13	Fine-Tuning the Retrieval Mechanism for Tabular Deep Learning	Felix den Breejen et.al.	2311.07343	null
2023-11-13	C-Procgen: Empowering Procgen with Controllable Contexts	Zhenxiong Tan et.al.	2311.07312	null
2023-11-13	TIAGo RL: Simulated Reinforcement Learning Environments with Tactile Data for Mobile Robots	Luca Lach et.al.	2311.07260	null
2023-11-13	Developing a Named Entity Recognition Dataset for Tagalog	Lester James V. Miranda et.al.	2311.07161	link
2023-11-13	PICS in Pics: Physics Informed Contour Selection for Rapid Image Segmentation	Vikas Dwivedi et.al.	2311.07002	null
2023-11-12	Sharing, Teaching and Aligning: Knowledgeable Transfer Learning for Cross-Lingual Machine Reading Comprehension	Tingfeng Cao et.al.	2311.06758	null
2023-11-12	Transfer Learning to Detect COVID-19 Coughs with Incremental Addition of Patient Coughs to Healthy People’s Cough Detection Models	Sudip Vhaduri et.al.	2311.06707	null
2023-11-10	Transfer Learning for Structured Pruning under Limited Task Data	Lucio Dery et.al.	2311.06382	null
2023-11-10	Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks	Bin Xiao et.al.	2311.06242	link
2023-11-10	Deep learning segmentation of fibrous cap in intravascular optical coherence tomography images	Juhwan Lee et.al.	2311.06202	null
2023-11-15	Cluster Expansion by Transfer Learning from Empirical Potentials	A. Dana et.al.	2311.06179	link
2023-11-10	Deep Fast Vision: A Python Library for Accelerated Deep Transfer Learning Vision Prototyping	Fabi Prezja et.al.	2311.06169	link
2023-11-10	Comparing Male Nyala and Male Kudu Classification using Transfer Learning with ResNet-50 and VGG-16	T. T Lemani et.al.	2311.05981	null
2023-11-10	Adaptive Variance Thresholding: A Novel Approach to Improve Existing Deep Transfer Vision Models and Advance Automatic Knee-Joint Osteoarthritis Classification	Fabi Prezja et.al.	2311.05799	null
2023-11-09	Deep Learning Architecture for Network-Efficiency at the Edge	Akrit Mudvari et.al.	2311.05739	null
2023-11-09	Enhancing Instance-Level Image Classification with Set-Level Labels	Renyu Zhang et.al.	2311.05659	null
2023-11-09	Disentangling Quantum and Classical Contributions in Hybrid Quantum Machine Learning Architectures	Michael Kölle et.al.	2311.05559	null
2023-11-09	Generalization in medical AI: a perspective on developing scalable models	Joachim A. Behar et.al.	2311.05418	null
2023-11-09	Weakly-supervised Deep Cognate Detection Framework for Low-Resourced Languages Using Morphological Knowledge of Closely-Related Languages	Koustava Goswami et.al.	2311.05155	link
2023-11-08	Active Transfer Learning for Efficient Video-Specific Human Pose Estimation	Hiromu Taketsugu et.al.	2311.05041	link
2023-11-08	Transfer learning from a sparsely annotated dataset of 3D medical images	Gabriel Efrain Humpire-Mamani et.al.	2311.05032	link
2023-11-09	On Characterizing the Evolution of Embedding Space of Neural Networks using Algebraic Topology	Suryaka Suresh et.al.	2311.04592	link
2023-11-07	Improved Child Text-to-Speech Synthesis through Fastpitch-based Transfer Learning	Rishabh Jain et.al.	2311.04313	link
2023-11-07	Elastic Information Bottleneck	Yuyan Ni et.al.	2311.03955	null
2023-11-07	Sparse Contrastive Learning of Sentence Embeddings	Ruize An et.al.	2311.03881	null
2023-11-07	Mini but Mighty: Finetuning ViTs with Mini Adapters	Imad Eddine Marouf et.al.	2311.03873	link
2023-11-03	Determination of droplet size from wide-angle light scattering image data using convolutional neural networks	Tom Kirstein et.al.	2311.03387	null
2023-11-06	Risk of Transfer Learning and its Applications in Finance	Haoyang Cao et.al.	2311.03283	null
2023-11-06	Machine Learning-Based Tea Leaf Disease Detection: A Comprehensive Review	Faruk Ahmed et.al.	2311.03240	null
2023-11-06	Quantifying the value of information transfer in population-based SHM	Aidan J. Hughes et.al.	2311.03083	null
2023-11-06	TabRepo: A Large Scale Repository of Tabular Model Evaluations and its AutoML Applications	David Salinas et.al.	2311.02971	link
2023-11-06	Understanding Deep Representation Learning via Layerwise Feature Compression and Discrimination	Peng Wang et.al.	2311.02960	link
2023-11-06	AttentioNet: Monitoring Student Attention Type in Learning with EEG-Based Measurement System	Dhruv Verma et.al.	2311.02924	null
2023-11-05	AI Techniques for Uncovering Resolved Planetary Nebula Candidates from Wide-field VPHAS+ Survey Data	Ruiqi Sun et.al.	2311.02607	null
2023-11-03	Robust Fine-Tuning of Vision-Language Models for Domain Generalization	Kevin Vogt-Lowell et.al.	2311.02236	link
2023-11-03	Active Learning-Based Species Range Estimation	Christian Lange et.al.	2311.02061	link
2023-11-03	A Data-Driven Approach to Coarse-Graining Simple Liquids in Confinement	Ishan Nadkarni et.al.	2311.02042	null
2023-11-03	Vicinal Risk Minimization for Few-Shot Cross-lingual Transfer in Abusive Language Detection	Gretel Liz De la Peña Sarracén et.al.	2311.02025	null
2023-11-03	CheX-Nomaly: Segmenting Lung Abnormalities from Chest Radiographs using Machine Learning	Sanskriti Singh et.al.	2311.01777	null
2023-11-03	Capturing Local and Global Features in Medical Images by Using Ensemble CNN-Transformer	Javad Mirzapour Kaleybar et.al.	2311.01731	null
2023-11-02	Adversary ML Resilience in Autonomous Driving Through Human Centered Perception Mechanisms	Aakriti Shah et.al.	2311.01478	null
2023-11-02	Scattering Vision Transformer: Spectral Mixing Matters	Badri N. Patro et.al.	2311.01310	null
2023-11-02	M&M3D: Multi-Dataset Training and Efficient Network for Multi-view 3D Object Detection	Hang Zhang et.al.	2311.00986	link
2023-11-02	IndoToD: A Multi-Domain Indonesian Benchmark For End-to-End Task-Oriented Dialogue Systems	Muhammad Dehan Al Kautsar et.al.	2311.00958	link
2023-11-01	The Quantum Cartpole: A benchmark environment for non-linear reinforcement learning	Kai Meinerz et.al.	2311.00756	null
2023-10-31	Investigating Relative Performance of Transfer and Meta Learning	Benji Alwis et.al.	2311.00727	null
2023-11-01	Transfer learning for improved generalizability in causal physics-informed neural networks for beam simulations	Taniya Kapoor et.al.	2311.00578	null
2023-11-01	TLMCM Network for Medical Image Hierarchical Multi-Label Classification	Meng Wu et.al.	2311.00282	null
2023-10-31	Graph Neural Networks for Road Safety Modeling: Datasets and Evaluations for Accident Analysis	Abhinav Nippani et.al.	2311.00164	link
2023-10-31	Dynamically Updating Event Representations for Temporal Relation Classification with Multi-category Learning	Fei Cheng et.al.	2310.20236	null
2023-10-31	Self-supervised Pre-training for Precipitation Post-processor	Sojung An et.al.	2310.20187	null
2023-10-30	Topological Learning for Motion Data via Mixed Coordinates	Hengrui Luo et.al.	2310.19960	link
2023-10-31	Promise:Prompt-driven 3D Medical Image Segmentation Using Pretrained Image Foundation Models	Hao Li et.al.	2310.19721	link
2023-10-30	CreoleVal: Multilingual Multitask Benchmarks for Creoles	Heather Lent et.al.	2310.19567	link
2023-10-30	On consequences of finetuning on data with highly discriminative features	Wojciech Masarczyk et.al.	2310.19537	null
2023-10-30	AdapINT: A Flexible and Adaptive In-Band Network Telemetry System Based on Deep Reinforcement Learning	Penghui Zhang et.al.	2310.19331	null
2023-10-30	Adapter Pruning using Tropical Characterization	Rishabh Bhardwaj et.al.	2310.19232	null
2023-10-29	BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping	Srikumar Sastry et.al.	2310.19168	link
2023-10-29	Transfer Learning in Transformer-Based Demand Forecasting For Home Energy Management System	Gargya Gokhale et.al.	2310.19159	null
2023-10-29	Automaton Distillation: Neuro-Symbolic Transfer Learning for Deep Reinforcement Learning	Suraj Singireddy et.al.	2310.19137	null
2023-10-29	A transfer learning approach with convolutional neural network for Face Mask Detection	Abolfazl Younesi et.al.	2310.18928	null
2023-10-29	QWID: Quantized Weed Identification Deep neural network	Parikshit Singh Rathore et.al.	2310.18921	link
2023-10-27	Parameter-Efficient Methods for Metastases Detection from Clinical Notes	Maede Ashofteh Barabadi et.al.	2310.18472	null
2023-10-27	Large-scale Foundation Models and Generative AI for BigData Neuroscience	Ran Wang et.al.	2310.18377	null
2023-10-26	Can LLMs Grade Short-answer Reading Comprehension Questions : Foundational Literacy Assessment in LMICs	Owen Henkel et.al.	2310.18373	null
2023-10-27	Transductive conformal inference with adaptive scores	Ulysse Gazin et.al.	2310.18108	link
2023-10-27	CPIA Dataset: A Comprehensive Pathological Image Analysis Dataset for Self-supervised Learning Pre-training	Nan Ying et.al.	2310.17902	link
2023-10-26	Feature Extraction and Classification from Planetary Science Datasets enabled by Machine Learning	Conor Nixon et.al.	2310.17681	null
2023-10-26	PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word Tokenization on Downstream Applications	Yang Tan et.al.	2310.17415	link
2023-10-27	De-novo Chemical Reaction Generation by Means of Temporarily Convolutional Neural Networks	Andrei Buin et.al.	2310.17341	null
2023-10-26	Deep Learning on SAR Imagery: Transfer Learning Versus Randomly Initialized Weights	Morteza Karimzadeh et.al.	2310.17126	link
2023-10-25	An Efficient Deep Learning-based approach for Recognizing Agricultural Pests in the Wild	Mohtasim Hadi Rafi et.al.	2310.16991	null
2023-10-25	Transferring a molecular foundation model for polymer property predictions	Pei Zhang et.al.	2310.16958	null
2023-10-25	Learning Transfers over Several Programming Languages	Razan Baltaji et.al.	2310.16937	null
2023-10-24	Deep Learning Models for Classification of COVID-19 Cases by Medical Images	Amir Ali et.al.	2310.16851	null
2023-10-26	Deep machine learning for meteor monitoring: advances with transfer learning and gradient-weighted class activation mapping	Eloy Peña-Asensio et.al.	2310.16826	null
2023-10-25	CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images	Aaron Gokaslan et.al.	2310.16825	link
2023-10-25	From Pointwise to Powerhouse: Initialising Neural Networks with Generative Models	Christian Harder et.al.	2310.16695	null
2023-10-24	Combining Behaviors with the Successor Features Keyboard	Wilka Carvalho et.al.	2310.15940	null
2023-10-24	Ensemble of Task-Specific Language Models for Brain Encoding	Sanjai Kumaran et.al.	2310.15720	link
2023-10-24	Transfer learning for day-ahead load forecasting: a case study on European national electricity demand time series	Alexandros-Menelaos Tzortzis et.al.	2310.15555	link
2023-10-23	Burgers’ pinns with implicit euler transfer learning	Vitória Biesek et.al.	2310.15343	null
2023-10-23	Ionized Gas Extended Over 40 kpc in an Odd Radio Circle Host Galaxy	Alison L. Coil et.al.	2310.15162	null
2023-10-23	Quantum Federated Learning With Quantum Networks	Tyler Wang et.al.	2310.15084	null
2023-10-20	A Novel Transfer Learning Method Utilizing Acoustic and Vibration Signals for Rotating Machinery Fault Diagnosis	Zhongliang Chen et.al.	2310.14796	null
2023-10-22	Mobile Traffic Prediction at the Edge through Distributed and Transfer Learning	Alfredo Petrella et.al.	2310.14456	null
2023-10-22	Cross-Domain HAR: Few Shot Transfer Learning for Human Activity Recognition	Megha Thukral et.al.	2310.14390	null
2023-10-21	On the Transferability of Visually Grounded PCFGs	Yanpeng Zhao et.al.	2310.14107	link
2023-10-21	Convolutional Bidirectional Variational Autoencoder for Image Domain Translation of Dotted Arabic Expiration	Ahmed Zidane et.al.	2310.14069	null
2023-10-21	Minimax Optimal Transfer Learning for Kernel-based Nonparametric Regression	Chao Wang et.al.	2310.13966	null
2023-10-20	Foundation Model’s Embedded Representations May Detect Distribution Shift	Adam Tsou et.al.	2310.13836	null
2023-10-20	Using Human-like Mechanism to Weaken Effect of Pre-training Weight Bias in Face-Recognition Convolutional Neural Network	Haojiang Ying et.al.	2310.13674	null
2023-10-20	Diagnosis-oriented Medical Image Compression with Efficient Transfer Learning	Guangqi Xie et.al.	2310.13250	null
2023-10-20	The Less the Merrier? Investigating Language Representation in Multilingual Models	Hellina Hailu Nigatu et.al.	2310.13228	null
2023-10-19	Streamlining Brain Tumor Classification with Custom Transfer Learning in MRI Images	Javed Hossain et.al.	2310.13108	null
2023-10-19	Unsupervised Representation Learning to Aid Semi-Supervised Meta Learning	Atik Faysal et.al.	2310.13085	link
2023-10-19	Representation Learning via Consistent Assignment of Views over Random Partitions	Thalles Silva et.al.	2310.12692	link
2023-10-18	Adaptive Fine-tuning based Transfer Learning for the Identification of MGMT Promoter Methylation Status	Erich Schmitz et.al.	2310.12373	link
2023-10-18	New Environment Adaptation with Few Shots for OFDM Receiver and mmWave Beamforming	Ouya Wang et.al.	2310.12343	null
2023-10-17	Precise influence evaluation in complex networks	Bingyu Zhu et.al.	2310.12181	link
2023-10-19	Prototype-based HyperAdapter for Sample-Efficient Multi-task Tuning	Hao Zhao et.al.	2310.11670	link
2023-10-17	Predicting polymerization reactions via transfer learning using chemical language models	Brenda S. Ferrari et.al.	2310.11423	link
2023-10-17	Relearning Forgotten Knowledge: on Forgetting, Overfit and Training-Free Ensembles of DNNs	Uri Stern et.al.	2310.11094	null
2023-10-16	Electric dipole polarizability of low-lying excited states in atomic nuclei	José Nicolás Orce et.al.	2310.10775	null
2023-10-16	UNO-DST: Leveraging Unlabelled Data in Zero-Shot Dialogue State Tracking	Chuang Li et.al.	2310.10492	link
2023-10-16	Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning	Chong Li et.al.	2310.10318	link
2023-10-16	Structural transfer learning of non-Gaussian DAG	Mingyang Ren et.al.	2310.10239	null
2023-10-15	Class-Specific Data Augmentation: Bridging the Imbalance in Multiclass Breast Cancer Classification	Kanan Mahammadli et.al.	2310.09981	null
2023-10-18	BanglaNLP at BLP-2023 Task 2: Benchmarking different Transformer Models for Sentiment Analysis of Bangla Social Media Posts	Saumajit Saha et.al.	2310.09238	link
2023-10-13	A Hybrid Transfer Learning Assisted Decision Support System for Accurate Prediction of Alzheimer Disease	Mahin Khan Mahadi et.al.	2310.08888	null
2023-10-13	A Framework for Few-Shot Policy Transfer through Observation Mapping and Behavior Cloning	Yash Shukla et.al.	2310.08836	link
2023-10-16	Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning	Yihua Zhang et.al.	2310.08782	link
2023-10-12	Defect Analysis of 3D Printed Cylinder Object Using Transfer Learning Approaches	Md Manjurul Ahsan et.al.	2310.08645	null
2023-10-15	A Survey of Heterogeneous Transfer Learning	Runxue Bao et.al.	2310.08459	link
2023-10-12	Reset It and Forget It: Relearning Last-Layer Weights Improves Continual and Transfer Learning	Lapo Frati et.al.	2310.07996	null
2023-10-12	Self-supervised visual learning for analyzing firearms trafficking activities on the Web	Sotirios Konstantakos et.al.	2310.07975	null
2023-10-12	CleftGAN: Adapting A Style-Based Generative Adversarial Network To Create Images Depicting Cleft Lip Deformity	Abdullah Hayajneh et.al.	2310.07969	link
2023-10-11	DeePref: Deep Reinforcement Learning For Video Prefetching In Content Delivery Networks	Nawras Alkassab et.al.	2310.07881	null
2023-10-11	Quantitative Analysis of MoS$_2$ Thin Film Micrographs with Machine Learning	Isaiah A. Moses et.al.	2310.07816	null
2023-10-11	A Transfer-Learning-Based Prognosis Prediction Paradigm that Bridges Data Distribution Shift across EMR Datasets	Zhongji Zhang et.al.	2310.07799	null
2023-10-11	Automatic Control of Reactive Brain Computer Interfaces	Pex Tufvesson et.al.	2310.07408	null
2023-10-12	GraphControl: Adding Conditional Control to Universal Graph Pre-trained Models for Graph Domain Transfer Learning	Yun Zhu et.al.	2310.07365	null
2023-10-11	Give and Take: Federated Transfer Learning for Industrial IoT Network Intrusion Detection	Lochana Telugu Rajesh et.al.	2310.07354	null
2023-10-10	Distributed Transfer Learning with 4th Gen Intel Xeon Processors	Lakshmi Arunachalam et.al.	2310.06916	null
2023-10-10	EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention	Yulong Shi et.al.	2310.06629	link
2023-10-10	Self-Supervised Set Representation Learning for Unsupervised Meta-Learning	Dong Bok Lee et.al.	2310.06511	link
2023-10-10	Cultural Compass: Predicting Transfer Learning Success in Offensive Language Detection with Cultural Features	Li Zhou et.al.	2310.06458	link
2023-10-10	Geometrically Aligned Transfer Encoder for Inductive Transfer in Regression Tasks	Sung Moon Ko et.al.	2310.06369	null
2023-10-10	HoloFed: Environment-Adaptive Positioning via Multi-band Reconfigurable Holographic Surfaces and Federated Learning	Jingzhi Hu et.al.	2310.06336	null
2023-10-10	Transfer learning-based physics-informed convolutional neural network for simulating flow in porous media with time-varying controls	Jungang Chen et.al.	2310.06319	link
2023-10-10	Model Tuning or Prompt Tuning? A Study of Large Language Models for Clinical Concept and Relation Extraction	Cheng Peng et.al.	2310.06239	null
2023-10-10	Efficient Adaptation of Large Vision Transformer via Adapter Re-Composing	Wei Dong et.al.	2310.06234	link
2023-10-09	Empirical Evaluation of the Segment Anything Model (SAM) for Brain Tumor Segmentation	Mohammad Peivandi et.al.	2310.06162	null
2023-10-09	Understanding Transfer Learning and Gradient-Based Meta-Learning Techniques	Mike Huisman et.al.	2310.06148	link
2023-10-09	Advancing Diagnostic Precision: Leveraging Machine Learning Techniques for Accurate Detection of Covid-19, Pneumonia, and Tuberculosis in Chest X-Ray Images	Aditya Kulkarni et.al.	2310.06080	null
2023-10-09	Transfer learning for piecewise-constant mean estimation: Optimality, $\ell_1$- and $\ell_0$-penalisation	Fan Wang et.al.	2310.05646	link
2023-10-09	A Simple and Robust Framework for Cross-Modality Medical Image Segmentation applied to Vision Transformers	Matteo Bastico et.al.	2310.05572	link
2023-10-10	Hierarchical Side-Tuning for Vision Transformers	Weifeng Lin et.al.	2310.05393	link
2023-10-09	Investigating Continuous Learning in Spiking Neural Networks	C. Tanner Fredieu et.al.	2310.05343	null
2023-10-10	Enhancing Cross-Dataset Performance of Distracted Driving Detection With Score-Softmax Classifier	Cong Duan et.al.	2310.05202	link
2023-10-08	Lifelong Learning for Fog Load Balancing: A Transfer Learning Approach	Maad Ebrahim et.al.	2310.05187	null
2023-10-10	Pushing the Limits of Pre-training for Time Series Forecasting in the CloudOps Domain	Gerald Woo et.al.	2310.05063	link
2023-10-08	Comparative Analysis of Transfer Learning in Deep Learning Text-to-Speech Models on a Few-Shot, Low-Resource, Customized Dataset	Ze Liu et.al.	2310.04982	null
2023-10-07	Transferable Deep Clustering Model	Zheng Zhang et.al.	2310.04946	null
2023-10-07	CAD Models to Real-World Images: A Practical Approach to Unsupervised Domain Adaptation in Industrial Object Classification	Dennis Ritter et.al.	2310.04757	link
2023-10-07	EdgeFD: An Edge-Friendly Drift-Aware Fault Diagnosis System for Industrial IoT	Chen Jiao et.al.	2310.04704	null
2023-10-07	Tight Rates in Supervised Outlier Transfer Learning	Mohammadreza M. Kalan et.al.	2310.04686	null
2023-10-07	Neural2Speech: A Transfer Learning Framework for Neural-Driven Speech Reconstruction	Jiawei Li et.al.	2310.04644	link
2023-10-07	X-Transfer: A Transfer Learning-Based Framework for Robust GAN-Generated Fake Image Detection	Lei Zhang et.al.	2310.04639	null
2023-10-06	Robust Transfer Learning with Unreliable Source Data	Jianqing Fan et.al.	2310.04606	null
2023-10-06	Acoustic and linguistic representations for speech continuous emotion recognition in call center conversations	Manon Macary et.al.	2310.04481	null
2023-10-06	Enhancing the Authenticity of Rendered Portraits with Identity-Consistent Transfer Learning	Luyuan Wang et.al.	2310.04194	null
2023-10-05	ECAvg: An Edge-Cloud Collaborative Learning Approach using Averaged Weights	Atah Nuh Mih et.al.	2310.03823	null
2023-10-05	LumiNet: The Bright Side of Perceptual Knowledge Distillation	Md. Ismail Hossain et.al.	2310.03669	link
2023-10-05	Network Alignment with Transferable Graph Autoencoders	Jiashu He et.al.	2310.03272	link
2023-10-05	Detecting Electricity Service Equity Issues with Transfer Counterfactual Learning on Large-Scale Outage Datasets	Song Wei et.al.	2310.03258	null
2023-10-04	Crossed-IoT device portability of Electromagnetic Side Channel Analysis: Challenges and Dataset	Tharindu Lakshan Yasarathna et.al.	2310.03119	null
2023-10-04	Hybrid Quantum Machine Learning Assisted Classification of COVID-19 from Computed Tomography Scans	Leo Sünkel et.al.	2310.02748	null
2023-10-04	Comparative Analysis of Imbalanced Malware Byteplot Image Classification using Transfer Learning	Jayasudha M et.al.	2310.02742	null
2023-10-05	Hybrid Inception Architecture with Residual Connection: Fine-tuned Inception-ResNet Deep Learning Model for Lung Inflammation Diagnosis from Chest Radiographs	Mehdi Neshat et.al.	2310.02591	null
2023-10-03	Reducing Intraspecies and Interspecies Covariate Shift in Traumatic Brain Injury EEG of Humans and Mice Using Transfer Euclidean Alignment	Manoj Vishwanath et.al.	2310.02398	null
2023-10-03	Graph Neural Network-based EEG Classification: A Survey	Dominik Klepl et.al.	2310.02152	null
2023-10-03	PAD-Phys: Exploiting Physiology for Presentation Attack Detection in Face Biometrics	Luis F. Gomez et.al.	2310.02140	null
2023-10-03	An evaluation of pre-trained models for feature extraction in image classification	Erick da Silva Puls et.al.	2310.02037	null
2023-10-02	Toward Scalable Visual Servoing Using Deep Reinforcement Learning and Optimal Control	Salar Asayesh et.al.	2310.01360	null
2023-10-02	ScaLearn: Simple and Highly Parameter-Efficient Task Transfer by Learning to Scale	Markus Frohmann et.al.	2310.01217	link
2023-10-03	A Theoretical Analysis of the Test Error of Finite-Rank Kernel Ridge Regression	Tin Sum Cheng et.al.	2310.00987	null
2023-10-06	Data-Efficient Power Flow Learning for Network Contingencies	Parikshit Pareek et.al.	2310.00763	null
2023-09-30	An easy zero-shot learning combination: Texture Sensitive Semantic Segmentation IceHrNet and Advanced Style Transfer Learning Strategy	Zhiyong Yang et.al.	2310.00310	link
2023-09-29	Fusing simulation and monitoring data for real-time settlement prediction during tunnel construction: A multi-fidelity deep operator network (DeepONet)	Chen Xu et.al.	2310.00057	null
2023-09-29	AI ensemble for signal detection of higher order gravitational wave modes of quasi-circular, spinning, non-precessing binary black hole mergers	Minyang Tian et.al.	2310.00052	link
2023-10-03	Pretrain, Prompt, and Transfer: Evolving Digital Twins for Time-to-Event Analysis in Cyber-physical Systems	Qinghua Xu et.al.	2310.00032	link
2023-09-29	Are Odd Radio Circles virial shocks around massive galaxies? Implications for cosmic-ray diffusion in the circumgalactic medium	Shotaro Yamasaki et.al.	2309.17451	null
2023-09-29	Glioma subtype classification from histopathological images using in-domain and out-of-domain transfer learning: An experimental study	Vladimir Despotovic et.al.	2309.17223	null
2023-09-29	A Survey of Incremental Transfer Learning: Combining Peer-to-Peer Federated Learning and Domain Incremental Learning for Multicenter Collaboration	Yixing Huang et.al.	2309.17192	link
2023-09-29	Mixup Your Own Pairs	Yilei Wu et.al.	2309.16633	link
2023-09-28	Transfer Learning for Bayesian Optimization on Heterogeneous Search Spaces	Zhou Fan et.al.	2309.16597	null
2023-09-28	Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization	Thilo von Neumann et.al.	2309.16482	null
2023-09-28	Nondestructive chicken egg fertility detection using CNN-transfer learning algorithms	Shoffan Saifullah et.al.	2309.16257	null
2023-09-27	Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing	Brian Yan et.al.	2309.15826	null
2023-09-27	Question answering using deep learning in low resource Indian language Marathi	Dhiraj Amin et.al.	2309.15779	null
2023-09-27	Classification of skyrmionic textures and extraction of Hamiltonian parameters via machine learning	Dushuo Feng et.al.	2309.15679	null
2023-09-27	OceanBench: The Sea Surface Height Edition	J. Emmanuel Johnson et.al.	2309.15599	link
2023-09-29	Confidence-based Visual Dispersal for Few-shot Unsupervised Domain Adaptation	Yizhe Xiong et.al.	2309.15575	link
2023-09-27	Robust Internal Representations for Domain Generalization	Mohammad Rostami et.al.	2309.15522	null
2023-09-27	VideoAdviser: Video Knowledge Distillation for Multimodal Transfer Learning	Yanan Wang et.al.	2309.15494	null
2023-09-27	Cross-Dataset Experimental Study of Radar-Camera Fusion in Bird’s-Eye View	Lukas Stäcker et.al.	2309.15465	null
2023-09-27	Detecting quantum phase transitions in a frustrated spin chain via transfer learning of a quantum classifier algorithm	André J. Ferreira-Martins et.al.	2309.15339	link
2023-09-26	Boosting High Resolution Image Classification with Scaling-up Transformers	Yi Wang et.al.	2309.15277	link
2023-09-26	Zero-Shot Constrained Motion Planning Transformers Using Learned Sampling Dictionaries	Jacob J. Johnson et.al.	2309.15272	null
2023-09-26	An Ensemble Model for Distorted Images in Real Scenarios	Boyuan Ji et.al.	2309.14998	null
2023-09-26	Transferring climate change knowledge	Francesco Immorlano et.al.	2309.14780	link
2023-09-26	BLIP-Adapter: Parameter-Efficient Transfer Learning for Mobile Screenshot Captioning	Ching-Yu Chiang et.al.	2309.14774	link
2023-09-26	XGV-BERT: Leveraging Contextualized Language Model and Graph Neural Network for Efficient Software Vulnerability Detection	Vu Le Anh Quan et.al.	2309.14677	null
2023-09-26	ALEX: Towards Effective Graph Transfer Learning with Noisy Labels	Jingyang Yuan et.al.	2309.14673	null
2023-09-25	Unveiling the Potential of Deep Learning Models for Solar Flare Prediction in Near-Limb Regions	Chetraj Pandey et.al.	2309.14483	null
2023-09-25	Incorporating Ensemble and Transfer Learning For An End-To-End Auto-Colorized Image Detection Model	Ahmed Samir Ragab et.al.	2309.14478	null
2023-09-25	Chop & Learn: Recognizing and Generating Object-State Compositions	Nirat Saini et.al.	2309.14339	null
2023-09-24	Policy Stitching: Learning Transferable Robot Policies	Pingcheng Jian et.al.	2309.13753	null
2023-09-24	Crack-Net: Prediction of Crack Propagation in Composites	Hao Xu et.al.	2309.13626	null
2023-09-24	GraphAdapter: Tuning Vision-Language Models With Dual Knowledge Graph	Xin Li et.al.	2309.13625	link
2023-09-23	Attention Is All You Need For Blind Room Volume Estimation	Chunxi Wang et.al.	2309.13504	null
2023-09-23	Randomize to Generalize: Domain Randomization for Runway FOD Detection	Javaria Farooq et.al.	2309.13264	null
2023-09-22	Understanding Calibration of Deep Neural Networks for Medical Image Classification	Abhishek Singh Sambyal et.al.	2309.13132	null
2023-09-22	Domain Adaptation for Arabic Machine Translation: The Case of Financial Texts	Emad A. Alghamdi et.al.	2309.12863	null
2023-09-22	Unsupervised Representations Improve Supervised Learning in Speech Emotion Recognition	Amirali Soltani Tehrani et.al.	2309.12714	null
2023-09-22	Multiply Robust Federated Estimation of Targeted Average Treatment Effects	Larry Han et.al.	2309.12600	null
2023-09-21	Brain Tumor Detection Using Deep Learning Approaches	Razia Sultana Misu et.al.	2309.12193	null
2023-09-21	Identification of pneumonia on chest x-ray images through machine learning	Eduardo Augusto Roeder et.al.	2309.11995	null
2023-09-21	Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition	Shuai Wang et.al.	2309.11730	link
2023-09-20	Hand Gesture Recognition with Two Stage Approach Using Transfer Learning and Deep Ensemble Learning	Serkan Savaş et.al.	2309.11610	null
2023-09-20	SkeleTR: Towards Skeleton-based Action Recognition in the Wild	Haodong Duan et.al.	2309.11445	null
2023-09-20	Using Artificial Intelligence for the Automation of Knitting Patterns	Uduak Uboh et.al.	2309.11202	null
2023-09-19	Amplifying Pathological Detection in EEG Signaling Pathways through Cross-Dataset Transfer Learning	Mohammad-Javad Darvishi-Bayazi et.al.	2309.10910	null
2023-09-19	Semi-supervised Domain Adaptation in Graph Transfer Learning	Ziyue Qiao et.al.	2309.10773	null
2023-09-19	Exploring the Influence of Information Entropy Change in Learning Systems	Xiaowei Yu et.al.	2309.10625	link
2023-09-20	PDRL: Multi-Agent based Reinforcement Learning for Predictive Monitoring	Thanveer Shaik et.al.	2309.10576	null
2023-09-19	A Hierarchical Neural Framework for Classification and its Explanation in Large Unstructured Legal Documents	Nishchal Prasad et.al.	2309.10563	null
2023-09-19	Toward efficient resource utilization at edge nodes in federated learning	Sadi Alawadi et.al.	2309.10367	null
2023-09-19	Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition	Ziyang Ma et.al.	2309.10294	null
2023-09-17	A Swin-Transformer-based Model for Efficient Compression of Turbulent Flow Data	Meng Zhang et.al.	2309.09192	null
2023-09-16	Universal Metric Learning with Parameter-Efficient Transfer Learning	Sungyeon Kim et.al.	2309.08944	null
2023-09-16	An Unified Search and Recommendation Foundation Model for Cold-Start Scenario	Yuqi Gong et.al.	2309.08939	null
2023-09-15	Global trends of the electric dipole polarizability from shell-model calculations	José Nicolás Orce et.al.	2309.08810	null
2023-09-15	Improved Breast Cancer Diagnosis through Transfer Learning on Hematoxylin and Eosin Stained Histology Images	Fahad Ahmed et.al.	2309.08745	null
2023-09-15	MIML: Multiplex Image Machine Learning for High Precision Cell Classification via Mechanical Traits within Microfluidic Systems	Khayrul Islam et.al.	2309.08421	null
2023-09-14	Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning	Zhiwu Qing et.al.	2309.07911	link
2023-09-14	Enhancing Performance, Calibration Time and Efficiency in Brain-Machine Interfaces through Transfer Learning and Wearable EEG Technology	Xiaying Wang et.al.	2309.07798	null
2023-09-20	NineRec: A Benchmark Dataset Suite for Evaluating Transferable Recommendation	Jiaqi Zhang et.al.	2309.07705	link
2023-09-14	Goal Space Abstraction in Hierarchical Reinforcement Learning via Set-Based Reachability Analysis	Mehdi Zadem et.al.	2309.07675	null
2023-09-14	Efficiently Robustify Pre-trained Models	Nishant Jain et.al.	2309.07499	null
2023-09-14	Nucleus-aware Self-supervised Pretraining Using Unpaired Image-to-image Translation for Histopathology Images	Zhiyun Song et.al.	2309.07394	link
2023-09-13	Learning from Auxiliary Sources in Argumentative Revision Classification	Tazin Afrin et.al.	2309.07334	null
2023-09-18	Safe and Accelerated Deep Reinforcement Learning-based O-RAN Slicing: A Hybrid Transfer Learning Approach	Ahmad M. Nagib et.al.	2309.07265	link
2023-09-12	Goal Space Abstraction in Hierarchical Reinforcement Learning via Reachability Analysis	Mehdi Zadem et.al.	2309.07168	null
2023-09-13	TransNet: A Transfer Learning-Based Network for Human Action Recognition	K. Alomar et.al.	2309.06951	null
2023-09-12	Distributionally Robust Transfer Learning	Xin Xiong et.al.	2309.06534	null
2023-09-12	Exploring the Benefits of Differentially Private Pre-training and Parameter-Efficient Fine-tuning for Table Transformers	Xilong Wang et.al.	2309.06526	link
2023-09-08	Adversarial attacks on hybrid classical-quantum Deep Learning models for Histopathological Cancer Detection	Biswaraj Baral et.al.	2309.06377	null
2023-09-12	Transfer learning from Hermitian to non-Hermitian quantum many-body physics	Sharareh Sayyad et.al.	2309.06303	null
2023-09-12	Transferability analysis of data-driven additive manufacturing knowledge: a case study between powder bed fusion and directed energy deposition	Mutahar Safdar et.al.	2309.06286	null
2023-09-12	A 3M-Hybrid Model for the Restoration of Unique Giant Murals: A Case Study on the Murals of Yongle Palace	Jing Yang et.al.	2309.06194	null
2023-09-12	Dynamic Visual Prompt Tuning for Parameter Efficient Transfer Learning	Chunqing Ruan et.al.	2309.06123	null
2023-09-12	Systemization of Knowledge (SoK)- Cross Impact of Transfer Learning in Cybersecurity: Offensive, Defensive and Threat Intelligence Perspectives	Sofiya Makar et.al.	2309.05889	null
2023-09-11	SCD-Net: Spatiotemporal Clues Disentanglement Network for Self-supervised Skeleton-based Action Recognition	Cong Wu et.al.	2309.05834	null
2023-09-11	MultIOD: Rehearsal-free Multihead Incremental Object Detector	Eden Belouadah et.al.	2309.05334	null
2023-09-11	Analysing Cross-Lingual Transfer in Low-Resourced African Named Entity Recognition	Michael Beukman et.al.	2309.05311	link
2023-09-11	Generalized Graphon Process: Convergence of Graph Frequencies in Stretched Cut Distance	Xingchao Jian et.al.	2309.05260	null
2023-09-11	DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning	Zhengxiang Shi et.al.	2309.05173	link
2023-09-10	Seismic Data Strong Noise Attenuation Based on Diffusion Model and Principal Component Analysis	Junheng Peng et.al.	2309.04944	link
2023-09-09	Towards Real-time Training of Physics-informed Neural Networks: Applications in Ultrafast Ultrasound Blood Flow Imaging	Haotian Guan et.al.	2309.04755	null
2023-09-09	Video and Synthetic MRI Pre-training of 3D Vision Architectures for Neuroimage Analysis	Nikhil J. Dhinagar et.al.	2309.04651	null
2023-09-08	Regret-Optimal Federated Transfer Learning for Kernel Regression with Applications in American Option Pricing	Xuwei Yang et.al.	2309.04557	link
2023-09-08	Generalized Cross-domain Multi-label Few-shot Learning for Chest X-rays	Aroof Aimen et.al.	2309.04462	null
2023-09-07	S-Adapter: Generalizing Vision Transformer for Face Anti-Spoofing with Statistical Tokens	Rizhao Cai et.al.	2309.04038	null
2023-09-06	Active shooter detection and robust tracking utilizing supplemental synthetic data	Joshua R. Waite et.al.	2309.03381	null
2023-09-06	EvoCLINICAL: Evolving Cyber-Cyber Digital Twin with Active Transfer Learning for Automated Cancer Registry System	Chengjie Lu et.al.	2309.03246	link
2023-09-06	Adaptive Growth: Real-time CNN Layer Expansion	Yunjie Zhu et.al.	2309.03049	link
2023-09-06	Leveraging ASR Pretrained Conformers for Speaker Verification through Transfer Learning and Knowledge Distillation	Danwei Cai et.al.	2309.03019	null
2023-09-06	Roulette: A Semantic Privacy-Preserving Device-Edge Collaborative Inference Framework for Deep Learning Classification Tasks	Jingyi Li et.al.	2309.02820	null
2023-09-05	A Survey of the Impact of Self-Supervised Pretraining for Diagnostic Tasks with Radiological Images	Blake VanBerlo et.al.	2309.02555	null
2023-09-04	Active flow control for three-dimensional cylinders through deep reinforcement learning	Pol Suárez et.al.	2309.02462	null
2023-09-05	Building a Winning Team: Selecting Source Model Ensembles using a Submodular Transferability Estimation Approach	Vimal K B et.al.	2309.02429	null
2023-09-05	Graph Self-Contrast Representation Learning	Minjie Chen et.al.	2309.02304	null
2023-09-05	DeepVol: A Deep Transfer Learning Approach for Universal Asset Volatility Modeling	Chen Liu et.al.	2309.02072	link
2023-09-05	Probabilistic Self-supervised Learning via Scoring Rules Minimization	Amirhossein Vahidi et.al.	2309.02048	null
2023-09-06	Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models	Qiong Wu et.al.	2309.01479	link
2023-09-04	Deep Learning Approach for Large-Scale, Real-Time Quantification of Green Fluorescent Protein-Labeled Biological Samples in Microreactors	Yuanyuan Wei et.al.	2309.01384	null
2023-09-02	Big-model Driven Few-shot Continual Learning	Ziqi Gu et.al.	2309.00862	null
2023-09-01	Zero-Shot Video Moment Retrieval from Frozen Vision-Language Models	Dezhao Luo et.al.	2309.00661	null
2023-08-31	QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning	Haohan Guo et.al.	2309.00126	null
2023-08-31	CReHate: Cross-cultural Re-annotation of English Hate Speech Dataset	Nayeon Lee et.al.	2308.16705	link
2023-08-31	Towards Optimal Patch Size in Vision Transformers for Tumor Segmentation	Ramtin Mojtahedi et.al.	2308.16598	link
2023-08-29	Multi-Transfer Learning Techniques for Detecting Auditory Brainstem Response	Fatih Ozyurt et.al.	2308.16203	null
2023-08-30	Hybrid Quantum Neural Network Structures for Image Multi-classification	Mingrui Shi et.al.	2308.16005	null
2023-08-30	Towards Earlier Detection of Oral Diseases On Smartphones Using Oral and Dental RGB Images	Ayush Garg et.al.	2308.15705	link
2023-08-29	Target PCA: Transfer Learning Large Dimensional Panel Data	Junting Duan et.al.	2308.15627	null
2023-08-29	On the Steganographic Capacity of Selected Learning Models	Rishit Agrawal et.al.	2308.15502	null
2023-08-29	A General-Purpose Self-Supervised Model for Computational Pathology	Richard J. Chen et.al.	2308.15474	null
2023-08-29	Exploring Model Transferability through the Lens of Potential Energy	Xiaotong Li et.al.	2308.15074	link
2023-08-28	Robust Activity Recognition for Adaptive Worker-Robot Interaction using Transfer Learning	Farid Shahnavaz et.al.	2308.14843	null
2023-08-31	LAC: Latent Action Composition for Skeleton-based Action Segmentation	Di Yang et.al.	2308.14500	null
2023-08-28	UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory	Haiwen Diao et.al.	2308.14316	link
2023-08-28	Parameter-Efficient Transfer Learning for Audio-Visual-Language Tasks	Hongye Liu et.al.	2308.14274	null
2023-08-27	Exploring the Transfer Learning Capabilities of CLIP in Domain Generalization for Diabetic Retinopathy	Sanoojan Baliah et.al.	2308.14212	link
2023-08-27	Revolutionizing Disease Diagnosis: A Microservices-Based Architecture for Privacy-Preserving and Efficient IoT Data Analytics Using Federated Learning	Safa Ben Atitallah et.al.	2308.14017	null
2023-08-26	Transfer Learning for Microstructure Segmentation with CS-UNet: A Hybrid Algorithm with Transformer and CNN Encoders	Khaled Alrfou et.al.	2308.13917	link
2023-08-25	An Ensemble Approach to Personalized Real Time Predictive Writing for Experts	Sourav Prosad et.al.	2308.13576	null
2023-08-25	Ultrafast-and-Ultralight ConvNet-Based Intelligent Monitoring System for Diagnosing Early-Stage Mpox Anytime and Anywhere	Yubiao Yue et.al.	2308.13492	null
2023-08-25	Mesh-Wise Prediction of Demographic Composition from Satellite Images Using Multi-Head Convolutional Neural Network	Yuta Sato et.al.	2308.13441	null
2023-08-25	Enhanced Mortality Prediction In Patients With Subarachnoid Haemorrhage Using A Deep Learning Model Based On The Initial CT Scan	Sergio Garcia-Garcia et.al.	2308.13373	null
2023-08-25	CEIMVEN: An Approach of Cutting Edge Implementation of Modified Versions of EfficientNet (V1-V2) Architecture for Breast Cancer Detection and Classification from Ultrasound Images	Sheekar Banerjee et.al.	2308.13356	link
2023-08-24	Electronic Structure Prediction of Multi-million Atom Systems Through Uncertainty Quantification Enabled Transfer Learning	Shashank Pathrudkar et.al.	2308.13096	null
2023-08-24	Motion-Guided Masking for Spatiotemporal Representation Learning	David Fan et.al.	2308.12962	null
2023-08-25	Pre-trained Model-based Automated Software Vulnerability Repair: How Far are We?	Quanjun Zhang et.al.	2308.12533	link
2023-08-24	Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval	Yuan Yuan et.al.	2308.12509	link
2023-08-23	Layer-wise Feedback Propagation	Leander Weber et.al.	2308.12053	link
2023-08-23	Efficient Transfer Learning in Diffusion Models via Adversarial Noise	Xiyu Wang et.al.	2308.11948	null
2023-08-25	Exploring the Optimization Objective of One-Class Classification for Anomaly Detection	Han Gao et.al.	2308.11898	null
2023-08-23	${\rm E}(3)$-Equivariant Actor-Critic Methods for Cooperative Multi-Agent Reinforcement Learning	Dingyang Chen et.al.	2308.11842	link
2023-08-22	Addressing Dynamic and Sparse Qualitative Data: A Hilbert Space Embedding of Categorical Variables	Anirban Mukherjee et.al.	2308.11781	null
2023-08-22	Masked Momentum Contrastive Learning for Zero-shot Semantic Understanding	Jiantao Wu et.al.	2308.11448	null
2023-08-22	Knowledge-Aware Prompt Tuning for Generalizable Vision-Language Models	Baoshuo Kan et.al.	2308.11186	null
2023-08-22	MISSRec: Pre-training and Transferring Multi-modal Interest-aware Sequence Representation for Recommendation	Jinpeng Wang et.al.	2308.11175	link
2023-08-21	Ultrafast and Ultralight Network-Based Intelligent System for Real-time Diagnosis of Ear diseases in Any Devices	Yubiao Yue et.al.	2308.10610	null
2023-08-20	VLN-PETL: Parameter-Efficient Transfer Learning for Vision-and-Language Navigation	Yanyuan Qiao et.al.	2308.10172	link
2023-08-20	ExpeL: LLM Agents Are Experiential Learners	Andrew Zhao et.al.	2308.10144	link
2023-08-19	Disposable Transfer Learning for Selective Source Task Unlearning	Seunghee Koh et.al.	2308.09971	null
2023-08-19	Bamboo: Boosting Training Efficiency for Real-Time Video Streaming via Online Grouped Federated Transfer Learning	Qianyuan Zheng et.al.	2308.09948	null
2023-08-19	Dual Branch Deep Learning Network for Detection and Stage Grading of Diabetic Retinopathy	Hossein Shakibania et.al.	2308.09945	null
2023-08-19	Evaluating Transfer Learning for Simplifying GitHub READMEs	Haoyu Gao et.al.	2308.09940	null
2023-08-19	Towards a High-Performance Object Detector: Insights from Drone Detection Using ViT and CNN-based Deep Learning Models	Junyang Zhang et.al.	2308.09899	null
2023-08-18	Deformable-Detection Transformer for Microbubble Localization in Ultrasound Localization Microscopy	Sepideh K. Gharamaleki et.al.	2308.09845	null
2023-08-18	Time Series Predictions in Unmonitored Sites: A Survey of Machine Learning Techniques in Water Resources	Jared D. Willard et.al.	2308.09766	null
2023-08-18	SimDA: Simple Diffusion Adapter for Efficient Video Generation	Zhen Xing et.al.	2308.09710	null
2023-08-18	On the Effectiveness of LayerNorm Tuning for Continual Learning in Vision Transformers	Thomas De Min et.al.	2308.09610	link
2023-08-18	Bridged-GNN: Knowledge Bridge Learning for Effective Knowledge Transfer	Wendong Bi et.al.	2308.09499	null
2023-08-18	Improving Buoy Detection with Deep Transfer Learning for Mussel Farm Automation	Carl McMillan et.al.	2308.09238	null
2023-08-18	A review of technical factors to consider when designing neural networks for semantic segmentation of Earth Observation imagery	Sam Khallaghi et.al.	2308.09221	null
2023-08-17	Multi-fidelity Fourier Neural Operator for Fast Modeling of Large-Scale Geological Carbon Storage	Hewei Tang et.al.	2308.09113	link
2023-08-16	PEvoLM: Protein Sequence Evolutionary Information Language Model	Issar Arab et.al.	2308.08578	link
2023-08-16	Sarcasm Detection in a Disaster Context	Tiberiu Sosea et.al.	2308.08156	null
2023-08-16	S2R: Exploring a Double-Win Transformer-Based Framework for Ideal and Blind Super-Resolution	Minghao She et.al.	2308.08142	link
2023-08-15	Synthesizing Political Zero-Shot Relation Classification via Codebook Knowledge, NLI, and ChatGPT	Yibo Hu et.al.	2308.07876	link
2023-08-15	Exploring Transfer Learning in Medical Image Segmentation using Vision-Language Models	Kanchan Poudel et.al.	2308.07706	link
2023-08-14	The Performance of Transferability Metrics does not Translate to Medical Tasks	Levy Chaves et.al.	2308.07444	link
2023-08-16	Interaction-Aware Personalized Vehicle Trajectory Prediction Using Temporal Graph Neural Networks	Amr Abdelraouf et.al.	2308.07439	null
2023-08-15	SEMI-CenterNet: A Machine Learning Facilitated Approach for Semiconductor Defect Inspection	Vic De Ridder et.al.	2308.07180	null
2023-08-13	Optimizing Brain Tumor Classification: A Comprehensive Study on Transfer Learning and Imbalance Handling in Deep Learning Models	Raza Imam et.al.	2308.06821	link
2023-08-12	SLoRA: Federated Parameter Efficient Fine-Tuning of Language Models	Sara Babakniya et.al.	2308.06522	null
2023-08-12	A Sequential Meta-Transfer (SMT) Learning to Combat Complexities of Physics-Informed Neural Networks: Application to Composites Autoclave Processing	Milad Ramezankhani et.al.	2308.06447	link
2023-08-11	Classification of Blood Cells Using Deep Learning Models	Rabia Asghar et.al.	2308.06300	null
2023-08-11	Hybrid-Supervised Deep Learning for Domain Transfer 3D Protoacoustic Image Reconstruction	Yankun Lang et.al.	2308.06194	null
2023-08-11	Fast and Accurate Transferability Measurement by Evaluating Intra-class Feature Variance	Huiwen Xu et.al.	2308.05986	null
2023-08-11	Tweet Sentiment Extraction using Viterbi Algorithm with Transfer Learning	Zied Baklouti et.al.	2308.05973	link
2023-08-09	Deep Learning Model Transfer in Forest Mapping using Multi-source Satellite SAR and Optical Images	Shaojia Ge et.al.	2308.05005	null
2023-08-08	Sparse Array Design for Direction Finding using Deep Learning	Kumar Vijay Mishra et.al.	2308.04615	null
2023-08-11	Deep Learning for Diverse Data Types Steganalysis: A Review	Hamza Kheddar et.al.	2308.04522	null
2023-08-08	Vascular Ageing and Smoking Habit Prediction via a Low-Cost Single-Lead ECG Module	S. Anas Ali et.al.	2308.04355	null
2023-08-07	PMU measurements based short-term voltage stability assessment of power systems via deep transfer learning	Yang Li et.al.	2308.03953	null
2023-08-07	Segmentation Framework for Heat Loss Identification in Thermal Images: Empowering Scottish Retrofitting and Thermographic Survey Companies	Md Junayed Hasan et.al.	2308.03631	null
2023-08-07	Provably Efficient Learning in Partially Observable Contextual Bandit	Xueping Gong et.al.	2308.03572	null
2023-08-07	A Transfer Learning Framework for Proactive Ramp Metering Performance Assessment	Xiaobo Ma et.al.	2308.03542	null
2023-08-07	On-ramp and Off-ramp Traffic Flows Estimation Based on A Data-driven Transfer Learning Framework	Xiaobo Ma et.al.	2308.03538	null
2023-08-07	RoadScan: A Novel and Robust Transfer Learning Framework for Autonomous Pothole Detection in Roads	Guruprasad Parasnis et.al.	2308.03467	null
2023-08-05	Surrogate Empowered Sim2Real Transfer of Deep Reinforcement Learning for ORC Superheat Control	Runze Lin et.al.	2308.02765	null
2023-08-04	Self-Normalizing Neural Network, Enabling One Shot Transfer Learning for Modeling EDFA Wavelength Dependent Gain	Agastya Raj et.al.	2308.02233	null
2023-08-07	Deep Maxout Network-based Feature Fusion and Political Tangent Search Optimizer enabled Transfer Learning for Thalassemia Detection	Hemn Barzan Abdalla et.al.	2308.02029	null
2023-08-03	Curricular Transfer Learning for Sentence Encoded Tasks	Jader Martins Camboim de Sá et.al.	2308.01849	null
2023-08-03	Deep Learning-based Prediction of Stress and Strain Maps in Arterial Walls for Improved Cardiovascular Risk Assessment	Yasin Shokrollahi et.al.	2308.01771	null
2023-08-03	IndoHerb: Indonesia Medicinal Plants Recognition using Transfer Learning and Deep Learning	Muhammad Salman Ikrar Musyaffa et.al.	2308.01604	link
2023-08-02	Grasp Stability Assessment Through Attention-Guided Cross-Modality Fusion and Transfer Learning	Zhuangzhuang Zhang et.al.	2308.00980	null
2023-08-01	Understanding Activation Patterns in Artificial Neural Networks by Exploring Stochastic Processes	Stephan Johann Lehmler et.al.	2308.00858	null
2023-07-31	Cardiac MRI Orientation Recognition and Standardization using Deep Neural Networks	Ruoxuan Zhen et.al.	2308.00615	link
2023-08-01	Scalable quantum measurement error mitigation via conditional independence and transfer learning	ChangWon Lee et.al.	2308.00320	null
2023-08-01	Pixel to policy: DQN Encoders for within & cross-game reinforcement learning	Ashrya Agrawal et.al.	2308.00318	null
2023-08-01	EEG-based Cognitive Load Classification using Feature Masked Autoencoding and Emotion Transfer Learning	Dustin Pulver et.al.	2308.00246	null
2023-07-31	Structural Transfer Learning in NL-to-Bash Semantic Parsers	Kyle Duffy et.al.	2307.16795	null
2023-07-31	Hybrid quantum transfer learning for crack image classification on NISQ hardware	Alexander Geng et.al.	2307.16723	null
2023-07-31	UDAMA: Unsupervised Domain Adaptation through Multi-discriminator Adversarial Training with Noisy Labels Improves Cardio-fitness Prediction	Yu Wu et.al.	2307.16651	link
2023-07-31	LP-MusicCaps: LLM-Based Pseudo Music Captioning	SeungHeon Doh et.al.	2307.16372	link
2023-07-30	Stylized Projected GAN: A Novel Architecture for Fast and Realistic Image Generation	Md Nurul Muttakin et.al.	2307.16275	null
2023-07-30	Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction	Pengfei Hu et.al.	2307.16253	null
2023-07-30	Gastrointestinal Mucosal Problems Classification with Deep Learning	Mohammadhasan Goharian et.al.	2307.16198	null
2023-07-29	Cross-dimensional transfer learning in medical image segmentation with deep learning	Hicham Messaoudi et.al.	2307.15872	link
2023-07-28	A deep transfer learning network for structural condition identification with limited real-world training data	Nengxin Bao et.al.	2307.15249	null
2023-07-27	Star Cluster Classification using Deep Transfer Learning with PHANGS-HST	Stephen Hannon et.al.	2307.15133	null
2023-07-26	Towards Generalist Biomedical AI	Tao Tu et.al.	2307.14334	null
2023-07-26	Reinforcement Learning by Guided Safe Exploration	Qisong Yang et.al.	2307.14316	null
2023-07-26	Fluorescent Neuronal Cells v2: Multi-Task, Multi-Format Annotations for Deep Learning in Microscopy	Luca Clissa et.al.	2307.14243	null
2023-07-25	ChildGAN: Large Scale Synthetic Child Facial Data Using Domain Adaptation in StyleGAN	Muhammad Ali Farooq et.al.	2307.13746	null
2023-07-25	Transfer Learning for Portfolio Optimization	Haoyang Cao et.al.	2307.13546	null
2023-07-25	Spectral-DP: Differentially Private Deep Learning through Spectral Perturbation and Filtering	Ce Feng et.al.	2307.13231	null
2023-07-24	End-to-End Deep Transfer Learning for Calibration-free Motor Imagery Brain Computer Interfaces	Maryam Alimardani et.al.	2307.12827	null
2023-07-24	Sparse annotation strategies for segmentation of short axis cardiac MRI	Josh Stein et.al.	2307.12619	null
2023-07-23	NCART: Neural Classification and Regression Tree for Tabular Data	Jiaqi Luo et.al.	2307.12198	null
2023-07-22	An X3D Neural Network Analysis for Runner’s Performance Assessment in a Wild Sporting Environment	David Freire-Obregón et.al.	2307.12183	null
2023-07-22	Identifying Misinformation on YouTube through Transcript Contextual Analysis with Transformer Models	Christos Christodoulou et.al.	2307.12155	link
2023-07-22	Flight Contrail Segmentation via Augmented Transfer Learning with Novel SR Loss Function in Hough Space	Junzi Sun et.al.	2307.12032	link
2023-07-22	Pick the Best Pre-trained Model: Towards Transferability Estimation for Medical Image Segmentation	Yuncheng Yang et.al.	2307.11958	link
2023-07-21	MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems	Thilo von Neumann et.al.	2307.11394	link
2023-07-20	Transfer Learning and Bias Correction with Pre-trained Audio Embeddings	Changhong Wang et.al.	2307.10834	link
2023-07-20	Predicting human motion intention for pHRI assistive control	Paolo Franceschi et.al.	2307.10743	null
2023-07-20	Transfer Learning for Inverse Design of Tunable Graphene-Based Metasurfaces	Mehdi Kiani et.al.	2307.10641	null
2023-07-20	Pluvio: Assembly Clone Search for Out-of-domain Architectures and Libraries through Transfer Learning and Conditional Variational Information Bottleneck	Zhiwei Fu et.al.	2307.10631	null
2023-07-19	Eye Disease Classification Using Deep Learning Techniques	Tareq Babaqi et.al.	2307.10501	null
2023-07-19	Novel Batch Active Learning Approach and Its Application to Synthetic Aperture Radar Datasets	James Chapman et.al.	2307.10495	link
2023-07-19	Determination of the critical points for systems of directed percolation class using machine learning	M. Ali Saif et.al.	2307.10456	null
2023-07-19	Gradient Sparsification For Masked Fine-Tuning of Transformers	James O’Neill et.al.	2307.10098	null
2023-07-19	Revisiting invariances and introducing priors in Gromov-Wasserstein distances	Pinar Demetci et.al.	2307.10093	link
2023-07-19	From West to East: Who can understand the music of the others better?	Charilaos Papaioannou et.al.	2307.09795	link
2023-07-17	Study of Vision Transformers for Covid-19 Detection from Chest X-rays	Sandeep Angara et.al.	2307.09402	null
2023-07-18	Augmenting CLIP with Improved Visio-Linguistic Reasoning	Samyadeep Basu et.al.	2307.09233	null
2023-07-18	Detecting Throat Cancer from Speech Signals Using Machine Learning: A Reproducible Literature Review	Mary Paterson et.al.	2307.09230	null
2023-07-18	A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future	Chaoyang Zhu et.al.	2307.09220	link
2023-07-18	Evaluate Fine-tuning Strategies for Fetal Head Ultrasound Image Segmentation with U-Net	Fangyijie Wang et.al.	2307.09067	link
2023-07-18	Face-PAST: Facial Pose Awareness and Style Transfer Networks	Sunder Ali Khowaja et.al.	2307.09020	null
2023-07-18	Alioth: A Machine Learning Based Interference-Aware Performance Monitor for Multi-Tenancy Applications in Public Cloud	Tianyao Shi et.al.	2307.08949	link
2023-07-17	Diffusion Models Beat GANs on Image Classification	Soumik Mukhopadhyay et.al.	2307.08702	null
2023-07-18	Revisiting the Robustness of the Minimum Error Entropy Criterion: A Transfer Learning Case Study	Luis Pedro Silvestrin et.al.	2307.08572	link
2023-07-17	Domain Adaptation using Silver Standard Masks for Lateral Ventricle Segmentation in FLAIR MRI	Owen Crystal et.al.	2307.08456	null
2023-07-17	Soft Prompt Tuning for Augmenting Dense Retrieval with Large Language Models	Zhiyuan Peng et.al.	2307.08303	link
2023-07-16	SHAMSUL: Simultaneous Heatmap-Analysis to investigate Medical Significance Utilizing Local interpretability methods	Mahbub Ul Alam et.al.	2307.08003	link
2023-07-18	S2R-ViT for Multi-Agent Cooperative Perception: Bridging the Gap from Simulation to Reality	Jinlong Li et.al.	2307.07935	null
2023-07-15	SoccerKDNet: A Knowledge Distillation Framework for Action Recognition in Soccer Videos	Sarosij Bose et.al.	2307.07768	null
2023-07-14	MGit: A Model Versioning and Management System	Wei Hao et.al.	2307.07507	null
2023-07-14	Improving Zero-Shot Generalization for CLIP with Synthesized Prompts	Zhengbo Wang et.al.	2307.07397	link
2023-07-14	Replay to Remember: Continual Layer-Specific Fine-tuning for German Speech Recognition	Theresa Pekarek Rosin et.al.	2307.07280	null
2023-07-14	Improving BERT with Hybrid Pooling Network and Drop Mask	Qian Chen et.al.	2307.07258	null
2023-07-13	A Scenario-Based Functional Testing Approach to Improving DNN Performance	Hong Zhu et.al.	2307.07083	null
2023-07-13	AnyStar: Domain randomized universal star-convex 3D instance segmentation	Neel Dey et.al.	2307.07044	link
2023-07-13	A decision framework for selecting information-transfer strategies in population-based SHM	Aidan J. Hughes et.al.	2307.06978	null
2023-07-13	Agreement Tracking for Multi-Issue Negotiation Dialogues	Amogh Mannekote et.al.	2307.06524	null
2023-07-12	Feature Embeddings from Large-Scale Acoustic Bird Classifiers Enable Few-Shot Transfer Learning	Burooj Ghani et.al.	2307.06292	link
2023-07-12	Prototypical Contrastive Transfer Learning for Multimodal Language Understanding	Seitaro Otsuki et.al.	2307.05942	null
2023-07-06	LogitMat: Zeroshot Learning Algorithm for Recommender Systems without Transfer Learning or Pretrained Models	Hao Wang et.al.	2307.05680	null
2023-07-11	A Comprehensive Survey of Deep Transfer Learning for Anomaly Detection in Industrial Time Series: Methods, Applications, and Directions	Peng Yan et.al.	2307.05638	null
2023-07-11	Channel Selection for Wi-Fi 7 Multi-Link Operation via Optimistic-Weighted VDN and Parallel Transfer Reinforcement Learning	Pedro Enrique Iturria-Rivera et.al.	2307.05419	null
2023-07-11	Multi-fidelity Emulator for Cosmological Large Scale 21 cm Lightcone Images: a Few-shot Transfer Learning Approach with GAN	Kangning Diao et.al.	2307.04976	link
2023-07-10	SimpleMTOD: A Simple Language Model for Multimodal Task-Oriented Dialogue with Symbolic Scene Representation	Bhathiya Hemanthage et.al.	2307.04907	null
2023-07-10	Advances and Challenges in Meta-Learning: A Technical Review	Anna Vettoruzzo et.al.	2307.04722	null
2023-07-11	Generalization Error of First-Order Methods for Statistical Learning with Generic Oracles	Kevin Scaman et.al.	2307.04679	null
2023-07-10	Enhancing Biomedical Text Summarization and Question-Answering: On the Utility of Domain-Specific Pre-Training	Dima Galat et.al.	2307.04412	null
2023-07-08	Building and Road Segmentation Using EffUNet and Transfer Learning Approach	Sahil Gangurde et.al.	2307.03980	null
2023-07-07	Transfer Learning of Semantic Segmentation Methods for Identifying Buried Archaeological Structures on LiDAR Data	Paolo Soleni et.al.	2307.03512	null
2023-07-06	Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment	Aref Farhadipour et.al.	2307.03296	link
2023-07-06	To pretrain or not to pretrain? A case study of domain-specific pretraining for semantic segmentation in histopathology	Tushar Kataria et.al.	2307.03275	link
2023-07-06	Vision Language Transformers: A Survey	Clayton Fields et.al.	2307.03254	null
2023-07-06	A Hybrid End-to-End Spatio-Temporal Attention Neural Network with Graph-Smooth Signals for EEG Emotion Recognition	Shadi Sartipi et.al.	2307.03068	null
2023-07-13	Self-supervised learning via inter-modal reconstruction and feature projection networks for label-efficient 3D-to-2D segmentation	José Morano et.al.	2307.03008	link
2023-07-06	Molecular Simulation for Atmospheric Reaction Exploration and Discovery: Non-Equilibrium Dynamics, Roaming and Glycolaldehyde Formation Following Photo-Induced Decomposition of syn-Acetaldehyde Oxide	Meenu Upadhyay et.al.	2307.02994	null
2023-07-06	Transfer Learning for the Efficient Detection of COVID-19 from Smartphone Audio Data	Mattia Giovanni Campana et.al.	2307.02975	link
2023-07-08	PUFFIN: A Path-Unifying Feed-Forward Interfaced Network for Vapor Pressure Prediction	Vinicius Viena Santana et.al.	2307.02903	null
2023-07-04	Pretraining Conformer with ASR or ASV for Anti-Spoofing Countermeasure	Yikang Wang et.al.	2307.01546	null
2023-07-04	On Conditional and Compositional Language Model Differentiable Prompting	Jonathan Pilault et.al.	2307.01446	null
2023-07-03	Exploring Spoken Named Entity Recognition: A Cross-Lingual Perspective	Moncef Benaicha et.al.	2307.01310	link
2023-07-03	SAM-DA: UAV Tracks Anything at Night with SAM-Powered Domain Adaptation	Liangliang Yao et.al.	2307.01024	link
2023-07-03	Autism Spectrum Disorder Classification in Children based on Structural MRI Features Extracted using Contrastive Variational Autoencoder	Ruimin Ma et.al.	2307.00976	null
2023-07-03	Analysis of Task Transferability in Large Pre-trained Classifiers	Akshay Mehra et.al.	2307.00823	link
2023-07-02	Variational Autoencoding Molecular Graphs with Denoising Diffusion Probabilistic Model	Daiki Koge et.al.	2307.00623	null
2023-07-01	Unified Transfer Learning Models for High-Dimensional Linear Regression	Shuo Shuo Liu et.al.	2307.00238	null
2023-06-30	BuildingsBench: A Large-Scale Dataset of 900K Buildings and Benchmark for Short-Term Load Forecasting	Patrick Emami et.al.	2307.00142	link
2023-06-30	Scalable method for Bayesian experimental design without integrating over posterior distribution	Vinh Hoang et.al.	2306.17615	link
2023-06-30	Towards the extraction of robust sign embeddings for low resource sign language recognition	Mathieu De Coster et.al.	2306.17558	null
2023-06-30	Why does my medical AI look at pictures of birds? Exploring the efficacy of transfer learning across domain boundaries	Frederic Jonske et.al.	2306.17555	link
2023-06-30	Audio Embeddings as Teachers for Music Classification	Yiwei Ding et.al.	2306.17424	link
2023-06-29	Prediction of COVID-19 Patients’ Emergency Room Revisit using Multi-Source Transfer Learning	Yuelyu Ji et.al.	2306.17257	null
2023-06-29	Noise-Aware Quantum Software Testing	Asmar Muqeet et.al.	2306.16992	link
2023-06-29	Obeying the Order: Introducing Ordered Transfer Hyperparameter Optimisation	Sigrid Passano Hellan et.al.	2306.16916	link
2023-06-29	Sampling weights of deep neural networks	Erik Lien Bolager et.al.	2306.16830	link
2023-06-29	Transfer Learning with Semi-Supervised Dataset Annotation for Birdcall Classification	Anthony Miyaguchi et.al.	2306.16760	link
2023-06-29	Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train	Zhao Wang et.al.	2306.16741	link
2023-06-29	Multi-Scenario Ranking with Adaptive Feature Learning	Yu Tian et.al.	2306.16732	null
2023-06-26	A Collaborative Transfer Learning Framework for Cross-domain Recommendation	Wei Zhang et.al.	2306.16425	null
2023-06-28	Theater Aid System for the Visually Impaired Through Transfer Learning of Spatio-Temporal Graph Convolution Networks	Leyla Benhamida et.al.	2306.16357	null
2023-06-28	Relevant Entity Selection: Knowledge Graph Bootstrapping via Zero-Shot Analogical Pruning	Lucas Jarnac et.al.	2306.16296	link
2023-06-28	Recent Advances in Optimal Transport for Machine Learning	Eduardo Fernandes Montesuma et.al.	2306.16156	null
2023-06-28	A serial dual-channel library occupancy detection system based on Faster RCNN	Guoqiang Yang et.al.	2306.16080	null
2023-06-30	DUET: 2D Structured and Approximately Equivariant Representations	Xavier Suau et.al.	2306.16058	link
2023-06-28	Transfer Learning with Random Coefficient Ridge Regression	Hongzhe Zhang et.al.	2306.15915	null
2023-06-27	Differentially Private Video Activity Recognition	Zelun Luo et.al.	2306.15742	null
2023-06-27	Semi-supervised Multimodal Representation Learning through a Global Workspace	Benjamin Devillers et.al.	2306.15711	link
2023-06-27	Approximated Prompt Tuning for Vision-Language Pre-trained Models	Qiong Wu et.al.	2306.15706	null
2023-06-27	CamemBERT-bio: a Tasty French Language Model Better for your Health	Rian Touchent et.al.	2306.15550	null
2023-06-27	Transferability Metrics for Object Detection	Louis Fouquet et.al.	2306.15306	link
2023-06-26	Deep Transfer Learning for Intelligent Vehicle Perception: a Survey	Xinyu Liu et.al.	2306.15110	null
2023-06-26	Transfer Learning across Several Centuries: Machine and Historian Integrated Method to Decipher Royal Secretary’s Diary	Sojung Lucia Kim et.al.	2306.14592	null
2023-06-25	GPT-assisted learning of structure-property relationships by graph neural networks: Application to rare-earth doped phosphors	Xiang Zhang et.al.	2306.14238	link
2023-06-25	A Web-based Mpox Skin Lesion Detection System Using State-of-the-art Deep Learning Models Considering Racial Diversity	Shams Nafisa Ali et.al.	2306.14169	link
2023-06-25	Semi-supervised Object Detection: A Survey on Recent Research and Progress	Yanyang Wang et.al.	2306.14106	null
2023-06-24	Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks	Maxime Chevalier-Boisvert et.al.	2306.13831	link
2023-06-23	Curvature-enhanced Graph Convolutional Network for Biomolecular Interaction Prediction	Cong Shen et.al.	2306.13699	link
2023-06-23	Variance-Covariance Regularization Improves Representation Learning	Jiachen Zhu et.al.	2306.13292	null
2023-06-20	EEG Decoding for Datasets with Heterogenous Electrode Configurations using Transfer Learning Graph Neural Networks	Jinpei Han et.al.	2306.13109	null
2023-06-22	Natural Language Processing in Electronic Health Records in Relation to Healthcare Decision-making: A Systematic Review	Elias Hossain et.al.	2306.12834	null
2023-06-22	TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter	Binjie Zhang et.al.	2306.12642	null
2023-06-21	Introspective Action Advising for Interpretable Transfer Learning	Joseph Campbell et.al.	2306.12314	null
2023-06-21	Wildfire Detection Via Transfer Learning: A Survey	Ziliang Hong et.al.	2306.12276	null
2023-06-21	Benchmark data to study the influence of pre-training on explanation performance in MR image classification	Marta Oliveira et.al.	2306.12150	link
2023-06-21	Strategies in Transfer Learning for Low-Resource Speech Synthesis: Phone Mapping, Features Input, and Source Language Selection	Phat Do et.al.	2306.12040	null
2023-06-20	DynaQuant: Compressing Deep Learning Training Checkpoints via Dynamic Quantization	Amey Agrawal et.al.	2306.11800	null
2023-06-20	Meta-Analysis of Transfer Learning for Segmentation of Brain Lesions	Sovesh Mohapatra et.al.	2306.11714	null
2023-06-20	Inter-Cell Network Slicing With Transfer Learning Empowered Multi-Agent Deep Reinforcement Learning	Tianlun Hu et.al.	2306.11552	null
2023-06-20	MuDPT: Multi-modal Deep-symphysis Prompt Tuning for Large Pre-trained Vision-Language Models	Yongzhu Miao et.al.	2306.11400	link
2023-06-20	MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian	Willy Fitra Hendria et.al.	2306.11341	link
2023-06-20	Progressive Neural Representation for Sequential Video Compilation	Haeyong Kang et.al.	2306.11305	link
2023-06-19	BioREx: Improving Biomedical Relation Extraction by Leveraging Heterogeneous Datasets	Po-Ting Lai et.al.	2306.11189	link
2023-06-19	Knowledge Transfer-Driven Few-Shot Class-Incremental Learning	Ye Wang et.al.	2306.10942	link
2023-06-19	Detailed retinal vessel segmentation without human annotations using simulated optical coherence tomography angiographs	Linus Kreitner et.al.	2306.10941	link
2023-06-19	Transformer Training Strategies for Forecasting Multiple Load Time Series	Matthias Hertel et.al.	2306.10891	link
2023-06-23	Text-Driven Foley Sound Generation With Latent Diffusion Model	Yi Yuan et.al.	2306.10359	link
2023-06-17	Persian Semantic Role Labeling Using Transfer Learning and BERT-Based Models	Saeideh Niksirat Aghdam et.al.	2306.10339	null
2023-06-16	Neural Priming for Sample-Efficient Adaptation	Matthew Wallingford et.al.	2306.10191	link
2023-06-16	LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning	Jifan Zhang et.al.	2306.09910	link
2023-06-16	Can robots mold soft plastic materials by shaping depth images?	Ege Gursoy et.al.	2306.09848	null
2023-06-16	Parameter-efficient is not sufficient: Exploring Parameter, Memory, and Time Efficient Adapter Tuning for Dense Predictions	Dongshuo Yin et.al.	2306.09729	null
2023-06-16	Cross-corpus Readability Compatibility Assessment for English Texts	Zhenzhen Li et.al.	2306.09704	link
2023-06-16	Early-times Yang-Mills dynamics and the characterization of strongly interacting matter with statistical learning	Matthew R. Heffernan et.al.	2306.09619	null
2023-06-15	Understanding and Mitigating Extrapolation Failures in Physics-Informed Neural Networks	Lukas Fesser et.al.	2306.09478	link
2023-06-15	A Comparison of Self-Supervised Pretraining Approaches for Predicting Disease Risk from Chest Radiograph Images	Yanru Chen et.al.	2306.08955	null
2023-06-14	Iterative self-transfer learning: A general methodology for response time-history prediction based on small dataset	Yongjia Xu et.al.	2306.08700	null
2023-06-14	SMC-UDA: Structure-Modal Constraint for Unsupervised Cross-Domain Renal Segmentation	Zhusi Zhong et.al.	2306.08213	null
2023-06-14	Solving Large-scale Spatial Problems with Convolutional Neural Networks	Damian Owerko et.al.	2306.08191	null
2023-06-13	PersonaPKT: Building Personalized Dialogue Agents via Parameter-efficient Knowledge Transfer	Xu Han et.al.	2306.08126	null
2023-06-13	One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning	Arnav Chavan et.al.	2306.07967	link
2023-06-13	CAMEO: A Causal Transfer Learning Approach for Performance Optimization of Configurable Computer Systems	Md Shahriar Iqbal et.al.	2306.07888	null
2023-06-13	Robustness and Generalization Performance of Deep Learning Models on Cyber-Physical Systems: A Comparative Study	Alexander Windmann et.al.	2306.07737	null
2023-06-14	Few-shot Multi-domain Knowledge Rearming for Context-aware Defence against Advanced Persistent Threats	Gaolei Li et.al.	2306.07685	null
2023-06-12	EriBERTa: A Bilingual Pre-Trained Language Model for Clinical Natural Language Processing	Iker de la Iglesia et.al.	2306.07373	null
2023-06-12	A Brief Review of Hypernetworks in Deep Learning	Vinod Kumar Chauhan et.al.	2306.06955	link
2023-06-12	Differentiable Multi-Fidelity Fusion: Efficient Learning of Physics Simulations with Neural Architecture Search and Transfer Learning	Yuwen Deng et.al.	2306.06904	null
2023-06-12	Generating Synthetic Datasets by Interpolating along Generalized Geodesics	Jiaojiao Fan et.al.	2306.06866	null
2023-06-11	VBSF-TLD: Validation-Based Approach for Soft Computing-Inspired Transfer Learning in Drone Detection	Jaskaran Singh et.al.	2306.06797	null
2023-06-11	An information-Theoretic Approach to Semi-supervised Transfer Learning	Daniel Jakubovitz et.al.	2306.06731	null
2023-06-10	Enhancing Low Resource NER Using Assisting Language And Transfer Learning	Maithili Sabane et.al.	2306.06477	null
2023-06-10	Augmentations of Forman’s Ricci Curvature and their Applications in Community Detection	Lukas Fesser et.al.	2306.06474	null
2023-06-09	Understanding the Benefits of Image Augmentations	Matthew Iceland et.al.	2306.06254	null
2023-06-09	PoET: A generative model of protein families as sequences-of-sequences	Timothy F. Truong Jr et.al.	2306.06156	link
2023-06-13	End-to-End Neural Network Compression via $\frac{\ell_1}{\ell_2}$ Regularized Latency Surrogates	Anshul Nasery et.al.	2306.05785	null
2023-06-09	Data-Link: High Fidelity Manufacturing Datasets for Model2Real Transfer under Industrial Settings	Sunny Katyara et.al.	2306.05766	null
2023-06-09	Emotion Detection from EEG using Transfer Learning	Sidharth Sidharth et.al.	2306.05680	null
2023-06-09	Customizing General-Purpose Foundation Models for Medical Report Generation	Bang Yang et.al.	2306.05642	null
2023-06-08	PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Speech Models	Tiantian Feng et.al.	2306.05350	link
2023-06-08	T3L: Translate-and-Test Transfer Learning for Cross-Lingual Text Classification	Inigo Jauregi Unanue et.al.	2306.04996	link
2023-06-09	Generalization Performance of Transfer Learning: Overparameterized and Underparameterized Regimes	Peizhong Ju et.al.	2306.04901	null
2023-06-08	ExtPerFC: An Efficient 2D and 3D Perception Hardware-Software Framework for Mobile Cobot	Tuan Dang et.al.	2306.04853	link
2023-06-07	OBSTransformer: A Deep-Learning Seismic Phase Picker for OBS Data Using Automated Labelling and Transfer Learning	Alireza Niksejel et.al.	2306.04753	link
2023-06-07	AutoML Systems For Medical Imaging	Tasmia Tahmida Jidney et.al.	2306.04750	null
2023-06-07	Prompter: Zero-shot Adaptive Prefixes for Dialogue State Tracking Domain Adaptation	Taha Aksu et.al.	2306.04724	link
2023-06-07	Zambezi Voice: A Multilingual Speech Corpus for Zambian Languages	Claytone Sikasote et.al.	2306.04428	link
2023-06-07	Transfer Learning of Transformer-based Speech Recognition Models from Czech to Slovak	Jan Lehečka et.al.	2306.04399	null
2023-06-07	Transfer Learning from Pre-trained Language Models Improves End-to-End Speech Summarization	Kohei Matsuura et.al.	2306.04233	null
2023-06-07	Transfer Learning for General M-estimators with Decomposable Regularizers in High-dimensions	Zeyu Li et.al.	2306.04182	null
2023-06-07	Physics-informed reinforcement learning for sample-efficient optimization of freeform nanophotonic devices	Chaejin Park et.al.	2306.04108	link
2023-06-07	XSemPLR: Cross-Lingual Semantic Parsing in Multiple Natural Languages and Meaning Representations	Yusen Zhang et.al.	2306.04085	link
2023-06-06	Guiding The Last Layer in Federated Learning with Pre-Trained Models	Gwen Legate et.al.	2306.03937	link
2023-06-01	On the Robustness of Arabic Speech Dialect Identification	Peter Sullivan et.al.	2306.03789	null
2023-06-06	Deep Learning-Enabled Sleep Staging From Vital Signs and Activity Measured Using a Near-Infrared Video Camera	Jonathan Carter et.al.	2306.03711	null
2023-06-06	The Creative Frontier of Generative AI: Managing the Novelty-Usefulness Tradeoff	Anirban Mukherjee et.al.	2306.03601	null
2023-06-06	“A Little is Enough”: Few-Shot Quality Estimation based Corpus Filtering improves Machine Translation	Akshay Batheja et.al.	2306.03507	null
2023-06-06	Subgraph Networks Based Contrastive Learning	Jinhuan Wang et.al.	2306.03506	null
2023-06-05	Cross-Lingual Transfer Learning for Phrase Break Prediction with Multilingual Language Model	Hoyeon Lee et.al.	2306.02579	null
2023-06-06	Training Like a Medical Resident: Universal Medical Image Segmentation via Context Prior Learning	Yunhe Gao et.al.	2306.02416	link
2023-06-02	Distilling Efficient Language-Specific Models for Cross-Lingual Transfer	Alan Ansell et.al.	2306.01709	link
2023-06-02	Resolving Interference When Merging Models	Prateek Yadav et.al.	2306.01708	link
2023-06-02	Transfer learning for atomistic simulations using GNNs and kernel mean embeddings	John Falk et.al.	2306.01589	link
2023-06-02	Speech Translation with Foundation Models and Optimal Transport: UPC at IWSLT23	Ioannis Tsiamas et.al.	2306.01327	null
2023-06-02	A new method using deep transfer learning on ECG to predict the response to cardiac resynchronization therapy	Zhuo He et.al.	2306.01210	null
2023-06-01	TMI! Finetuned Models Leak Private Information from their Pretraining Data	John Abascal et.al.	2306.01181	link
2023-06-01	Improved Cross-Lingual Transfer Learning For Automatic Speech Translation	Sameer Khurana et.al.	2306.00789	null
2023-06-01	Improving Polish to English Neural Machine Translation with Transfer Learning: Effects of Data Volume and Language Similarity	Juuso Eronen et.al.	2306.00660	null
2023-06-01	The Effects of Input Type and Pronunciation Dictionary Usage in Transfer Learning for Low-Resource Text-to-Speech	Phat Do et.al.	2306.00535	null
2023-06-01	Divide, Conquer, and Combine: Mixture of Semantic-Independent Experts for Zero-Shot Dialogue State Tracking	Qingyue Wang et.al.	2306.00434	null
2023-06-01	Adapting Pre-trained Language Models to Vision-Language Tasks via Dynamic Visual Prompting	Shubin Huang et.al.	2306.00409	link
2023-06-01	Autism Disease Detection Using Transfer Learning Techniques: Performance Comparison Between Central Processing Unit vs Graphics Processing Unit Functions for Neural Networks	Mst Shapna Akter et.al.	2306.00283	null
2023-06-01	Transfer Learning for Underrepresented Music Generation	Anahita Doosti et.al.	2306.00281	null
2023-06-01	Maximal Domain Independent Representations Improve Transfer Learning	Adrian Shuai Li et.al.	2306.00262	null
2023-06-01	Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior	Shashank Subramanian et.al.	2306.00258	null
2023-05-31	Pre-Trained Language-Meaning Models for Multilingual Parsing and Generation	Chunliu Wang et.al.	2306.00124	link
2023-05-31	Additional Positive Enables Better Representation Learning for Medical Images	Dewen Zeng et.al.	2306.00112	null
2023-05-31	MetaXLR – Mixed Language Meta Representation Transformation for Low-resource Cross-lingual Learning based on Multi-Armed Bandit	Liat Bezalel et.al.	2306.00100	link
2023-05-31	A Survey of Label-Efficient Deep Learning for 3D Point Clouds	Aoran Xiao et.al.	2305.19812	link
2023-05-31	Simple yet Effective Code-Switching Language Identification with Multitask Pre-Training and Transfer Learning	Shuyue Stella Li et.al.	2305.19759	null
2023-05-31	Hypothesis Transfer Learning with Surrogate Classification Losses	Anass Aghbalou et.al.	2305.19694	null
2023-05-31	VIPriors 3: Visual Inductive Priors for Data-Efficient Deep Learning Challenges	Robert-Jan Bruintjes et.al.	2305.19688	null
2023-06-01	Point-GCC: Universal Self-supervised 3D Scene Pre-training via Geometry-Color Contrast	Guofan Fan et.al.	2305.19623	link
2023-05-31	SLABERT Talk Pretty One Day: Modeling Second Language Acquisition with BERT	Aditya Yadavalli et.al.	2305.19589	null
2023-05-31	Deep into The Domain Shift: Transfer Learning through Dependence Regularization	Shumin Ma et.al.	2305.19499	link
2023-05-30	Transfer Learning With Efficient Estimators to Optimally Leverage Historical Data in Analysis of Randomized Trials	Lauren D. Liao et.al.	2305.19180	link

diffusion model

Publish Date	Title	Authors	PDF	Code
2025-07-21	Diffusion Beats Autoregressive in Data-Constrained Settings	Mihir Prabhudesai et.al.	2507.15857	null
2025-07-21	Diffusion models for multivariate subsurface generation and efficient probabilistic inversion	Roberto Miele et.al.	2507.15809	null
2025-07-21	DiffuMeta: Algebraic Language Models for Inverse Design of Metamaterials via Diffusion Transformers	Li Zheng et.al.	2507.15753	null
2025-07-21	TokensGen: Harnessing Condensed Tokens for Long Video Generation	Wenqi Ouyang et.al.	2507.15728	null
2025-07-21	DiffPF: Differentiable Particle Filtering with Generative Sampling via Conditional Diffusion Models	Ziyu Wan et.al.	2507.15716	null
2025-07-21	SustainDiffusion: Optimising the Social and Environmental Sustainability of Stable Diffusion Models	Giordano d’Aloisio et.al.	2507.15663	null
2025-07-21	SegDT: A Diffusion Transformer-Based Segmentation Model for Medical Imaging	Salah Eddine Bekhouche et.al.	2507.15595	null
2025-07-21	Ultrafast Spatial Hole Burning Dynamics in Monolayer WS2: Insights from Time-resolved Photoluminescence Spectroscopy	Yichun Pan et.al.	2507.15538	null
2025-07-21	Blended Point Cloud Diffusion for Localized Text-guided Shape Editing	Etai Sella et.al.	2507.15399	null
2025-07-21	Latent Space Synergy: Text-Guided Data Augmentation for Direct Diffusion Biomedical Segmentation	Muhammad Aqeel et.al.	2507.15361	null
2025-07-21	RAD: Retrieval High-quality Demonstrations to Enhance Decision-making	Lu Guo et.al.	2507.15356	null
2025-07-21	RoadFusion: Latent Diffusion Model for Pavement Defect Detection	Muhammad Aqeel et.al.	2507.15346	null
2025-07-22	Exponential Runge-Kutta Galerkin finite element method for a reaction-diffusion system with nonsmooth initial data	Runjie Zhang et.al.	2507.15345	null
2025-07-21	ExDD: Explicit Dual Distribution Learning for Surface Defect Detection via Diffusion Synthesis	Muhammad Aqeel et.al.	2507.15335	null
2025-07-21	Conditional Video Generation for High-Efficiency Video Compression	Fangqiu Yi et.al.	2507.15269	null
2025-07-21	CHORDS: Diffusion Sampling Accelerator with Multi-core Hierarchical ODE Solvers	Jiaqi Han et.al.	2507.15260	null
2025-07-21	Improving Joint Embedding Predictive Architecture with Diffusion Noise	Yuping Qiu et.al.	2507.15216	null
2025-07-21	MeshMamba: State Space Models for Articulated 3D Mesh Generation and Reconstruction	Yusuke Yoshiyasu et.al.	2507.15212	null
2025-07-20	PET Image Reconstruction Using Deep Diffusion Image Prior	Fumio Hashimoto et.al.	2507.15078	null
2025-07-20	StableAnimator++: Overcoming Pose Misalignment and Face Distortion for Human Image Animation	Shuyuan Tu et.al.	2507.15064	null
2025-07-17	Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models	Yudong Jin et.al.	2507.13344	null
2025-07-17	FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization	Chuancheng Shi et.al.	2507.13311	null
2025-07-17	DiffClean: Diffusion-based Makeup Removal for Accurate Age Estimation	Ekta Balkrishna Gavas et.al.	2507.13292	null
2025-07-17	fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting	Alicia Durrer et.al.	2507.13146	null
2025-07-17	DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model	Han Zhang et.al.	2507.13087	null
2025-07-17	Label-Consistent Dataset Distillation with Detector-Guided Refinement	Yawen Zou et.al.	2507.13074	null
2025-07-17	Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities	Liuyi Wang et.al.	2507.13019	null
2025-07-17	From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation	Jinseo An et.al.	2507.12985	null
2025-07-17	Non-differentiable Reward Optimization for Diffusion-based Autonomous Motion Planning	Giwon Lee et.al.	2507.12977	null
2025-07-17	RGB Pre-Training Enhanced Unobservable Feature Latent Diffusion Model for Spectral Reconstruction	Keli Deng et.al.	2507.12967	null
2025-07-17	DMQ: Dissecting Outliers of Diffusion Models for Post-Training Quantization	Dongyeun Lee et.al.	2507.12933	null
2025-07-17	Energy-Efficient RSMA-enabled Low-altitude MEC Optimization Via Generative AI-enhanced Deep Reinforcement Learning	Xudong Wang et.al.	2507.12910	null
2025-07-17	Generalist Bimanual Manipulation via Foundation Video Diffusion Models	Yao Feng et.al.	2507.12898	null
2025-07-16	Reconstruct, Inpaint, Finetune: Dynamic Novel-view Synthesis from Monocular Videos	Kaihua Chen et.al.	2507.12646	null
2025-07-16	Zero Forcing on Iterated Graph Models	Christopher Brice et.al.	2507.12579	null
2025-07-16	Sample Variance Denoising in Cylindrical 21-cm Power Spectra	Daniela Breitman et.al.	2507.12545	null
2025-07-16	Unsupervised Monocular 3D Keypoint Discovery from Multi-View Diffusion Priors	Subin Jeon et.al.	2507.12336	null
2025-07-17	Compositional Discrete Latent Code for High Fidelity, Productive Diffusion Models	Samuel Lavoie et.al.	2507.12318	null
2025-07-16	FADE: Adversarial Concept Erasure in Flow Models	Zixuan Fu et.al.	2507.12283	null
2025-07-16	Generate to Ground: Multimodal Text Conditioning Boosts Phrase Grounding in Medical Vision-Language Models	Felix Nützel et.al.	2507.12236	null
2025-07-16	RODS: Robust Optimization Inspired Diffusion Sampling for Detecting and Reducing Hallucination in Generative Models	Yiqi Tian et.al.	2507.12201	null
2025-07-16	RadioDiff-3D: A 3D $\times$ 3D Radio Map Dataset and Generative Diffusion Based Benchmark for 6G Environment-Aware Communication	Xiucheng Wang et.al.	2507.12166	null
2025-07-16	SmokeSVD: Smoke Reconstruction from A Single View via Progressive Novel View Synthesis and Refinement with Diffusion Models	Chen Li et.al.	2507.12156	null
2025-07-16	RiemannLoRA: A Unified Riemannian Framework for Ambiguity-Free LoRA Optimization	Vladimir Bogachev et.al.	2507.12142	null
2025-07-16	LidarPainter: One-Step Away From Any Lidar View To Novel Guidance	Yuzhou Ji et.al.	2507.12114	null
2025-07-16	Robust Planning for Autonomous Vehicles with Diffusion-Based Failure Samplers	Juanran Wang et.al.	2507.11991	null
2025-07-16	ID-EA: Identity-driven Text Enhancement and Adaptation with Textual Inversion for Personalized Text-to-Image Generation	Hyun-Jun Jin et.al.	2507.11990	null
2025-07-16	EC-Diff: Fast and High-Quality Edge-Cloud Collaborative Inference for Diffusion Models	Jiajian Xie et.al.	2507.11980	null
2025-07-16	A Review of Generative AI in Aquaculture: Foundations, Applications, and Future Directions for Smart and Sustainable Farming	Waseem Akram et.al.	2507.11974	null
2025-07-16	Schrödinger Bridge Consistency Trajectory Models for Speech Enhancement	Shuichiro Nishigori et.al.	2507.11925	null
2025-07-16	Analytic estimation of parameters of stochastic volatility diffusion models with exponential-affine characteristic function for currency option pricing	Mikołaj Łabędzki et.al.	2507.11868	null
2025-07-16	Similarity-Guided Diffusion for Contrastive Sequential Recommendation	Jinkyeong Choi et.al.	2507.11866	null
2025-07-15	Deep Generative Methods and Tire Architecture Design	Fouad Oubari et.al.	2507.11639	null
2025-07-15	CATVis: Context-Aware Thought Visualization	Tariq Mehmood et.al.	2507.11522	null
2025-07-15	HUG-VAS: A Hierarchical NURBS-Based Generative Model for Aortic Geometry Synthesis and Controllable Editing	Pan Du et.al.	2507.11474	null
2025-07-15	Implementing Adaptations for Vision AutoRegressive Model	Kaif Shaikh et.al.	2507.11441	null
2025-07-14	MP1: Mean Flow Tames Policy Learning in 1-step for Robotic Manipulation	Juyi Sheng et.al.	2507.10543	null
2025-07-14	Solving the compute crisis with physics-based ASICs	Maxwell Aifer et.al.	2507.10463	null
2025-07-14	Parallel Sampling of Diffusion Models on $SO(3)$	Yan-Ting Chen et.al.	2507.10347	null
2025-07-15	Text Embedding Knows How to Quantize Text-Guided Diffusion Models	Hongjae Lee et.al.	2507.10340	null
2025-07-14	Mind the Gap: Aligning Vision Foundation Models to Image Feature Matching	Yuhan Liu et.al.	2507.10318	null
2025-07-14	Synthesizing Near-Boundary OOD Samples for Out-of-Distribution Detection	Jinglun Li et.al.	2507.10225	null
2025-07-14	From Wardrobe to Canvas: Wardrobe Polyptych LoRA for Part-level Controllable Human Image Generation	Jeongho Kim et.al.	2507.10217	null
2025-07-14	FIX-CLIP: Dual-Branch Hierarchical Contrastive Learning via Synthetic Captions for Better Understanding of Long Text	Bingchao Wang et.al.	2507.10095	null
2025-07-14	Frequency Regulation for Exposure Bias Mitigation in Diffusion Models	Meng Yu et.al.	2507.10072	null
2025-07-14	Memory-Efficient Personalization of Text-to-Image Diffusion Models via Selective Optimization Strategies	Seokeon Choi et.al.	2507.10029	null
2025-07-14	Latent Diffusion Models with Masked AutoEncoders	Junho Lee et.al.	2507.09984	null
2025-07-14	Solving dynamic portfolio selection problems via score-based diffusion models	Ahmad Aghapour et.al.	2507.09916	null
2025-07-14	Crucial-Diff: A Unified Diffusion Model for Crucial Image and Annotation Synthesis in Data-scarce Scenarios	Siyue Yao et.al.	2507.09915	null
2025-07-14	IGD: Instructional Graphic Design with Multimodal Layer Generation	Yadong Qu et.al.	2507.09910	null
2025-07-14	Generative Audio Language Modeling with Continuous-valued Tokens and Masked Next-Token Prediction	Shu-wen Yang et.al.	2507.09834	null
2025-07-13	Advancing Text-to-3D Generation with Linearized Lookahead Variational Score Distillation	Yu Lei et.al.	2507.09748	null
2025-07-13	Generate Aligned Anomaly: Region-Guided Few-Shot Anomaly Image-Mask Pair Synthesis for Industrial Inspection	Yilin Lu et.al.	2507.09619	null
2025-07-13	I2I-PR: Deep Iterative Refinement for Phase Retrieval using Image-to-Image Diffusion Models	Mehmet Onurcan Kaya et.al.	2507.09609	null
2025-07-13	WordCraft: Interactive Artistic Typography with Attention Awareness and Noise Blending	Zhe Wang et.al.	2507.09573	null
2025-07-13	Consistency Trajectory Planning: High-Quality and Efficient Trajectory Optimization for Offline Model-Based Reinforcement Learning	Guanquan Wang et.al.	2507.09534	null
2025-07-10	Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling	Haoyu Wu et.al.	2507.07982	null
2025-07-10	Low Resource Reconstruction Attacks Through Benign Prompts	Sol Yarkoni et.al.	2507.07947	null
2025-07-11	Single-Step Latent Diffusion for Underwater Image Restoration	Jiayi Wu et.al.	2507.07878	null
2025-07-10	Re-Bottleneck: Latent Re-Structuring for Neural Audio Autoencoders	Dimitrios Bralios et.al.	2507.07867	null
2025-07-10	Benchmarking Content-Based Puzzle Solvers on Corrupted Jigsaw Puzzles	Richard Dirauf et.al.	2507.07828	null
2025-07-10	Phase-Space Synchronization Driven by Moon-Magnetosphere Coupling in Gas Giants	Adnane Osmane et.al.	2507.07739	null
2025-07-10	Capture Stage Environments: A Guide to Better Matting	Hannah Dröge et.al.	2507.07623	null
2025-07-10	Stable-Hair v2: Real-World Hair Transfer via Multiple-View Diffusion Model	Kuiyuan Sun et.al.	2507.07591	null
2025-07-10	Divergence Minimization Preference Optimization for Diffusion Model Alignment	Binxu Li et.al.	2507.07510	null
2025-07-10	Degradation-Agnostic Statistical Facial Feature Transformation for Blind Face Restoration in Adverse Weather Conditions	Chang-Hwan Son et.al.	2507.07464	null
2025-07-10	EscherNet++: Simultaneous Amodal Completion and Scalable View Synthesis through Masked Fine-Tuning and Enhanced Feed-Forward 3D Reconstruction	Xinan Zhang et.al.	2507.07410	null
2025-07-09	SonicMotion: Dynamic Spatial Audio Soundscapes with Latent Diffusion Models	Christian Templin et.al.	2507.07318	null
2025-07-09	MODA: A Unified 3D Diffusion Framework for Multi-Task Target-Aware Molecular Generation	Dong Xu et.al.	2507.07201	null
2025-07-09	Bridging the Last Mile of Prediction: Enhancing Time Series Forecasting with Conditional Guided Flow Matching	Huibo Xu et.al.	2507.07192	null
2025-07-09	Interpretable EEG-to-Image Generation with Semantic Prompts	Arshak Rezvani et.al.	2507.07157	null
2025-07-09	Towards Multimodal Understanding via Stable Diffusion as a Task-Aware Feature Extractor	Vatsal Agarwal et.al.	2507.07106	null
2025-07-11	Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models	Tiezheng Zhang et.al.	2507.07104	null
2025-07-09	Exact Evaluation of the Accuracy of Diffusion Models for Inverse Problems with Gaussian Data Distributions	Emile Pierret et.al.	2507.07008	null
2025-07-09	DiffSpectra: Molecular Structure Elucidation from Spectra using Diffusion Models	Liang Wang et.al.	2507.06853	null
2025-07-09	Democratizing High-Fidelity Co-Speech Gesture Video Generation	Xu Yang et.al.	2507.06812	null
2025-07-09	Enhancing Diffusion Model Stability for Image Restoration via Gradient Management	Hongjie Wu et.al.	2507.06656	null
2025-07-09	Diff$^2$I2P: Differentiable Image-to-Point Cloud Registration with Diffusion Prior	Juncheng Mu et.al.	2507.06651	null
2025-07-09	Denoising Multi-Beta VAE: Representation Learning for Disentanglement and Generation	Anshuk Uppal et.al.	2507.06613	null
2025-07-09	MOST: Motion Diffusion Model for Rare Text via Temporal Clip Banzhaf Interaction	Yin Wang et.al.	2507.06590	null
2025-07-09	Concept-TRAK: Understanding how diffusion models learn concepts through concept-level attribution	Yonghyun Park et.al.	2507.06547	null
2025-07-10	Concept Unlearning by Modeling Key Steps of Diffusion Process	Chaoshuo Zhang et.al.	2507.06526	null
2025-07-09	FedDifRC: Unlocking the Potential of Text-to-Image Diffusion Models in Heterogeneous Federated Learning	Huan Wang et.al.	2507.06482	null
2025-07-08	FedPhD: Federated Pruning with Hierarchical Learning of Diffusion Models	Qianyu Long et.al.	2507.06449	null
2025-07-08	Mitigating Multi-Sequence 3D Prostate MRI Data Scarcity through Domain Adaptation using Locally-Trained Latent Diffusion Models for Prostate Cancer Detection	Emerson P. Grabke et.al.	2507.06384	null
2025-07-08	Too Human to Model: The Uncanny Valley of LLMs in Social Simulation – When Generative Language Agents Misalign with Modelling Principles	Yongchao Zeng et.al.	2507.06310	null
2025-07-08	Modern Methods in Associative Memory	Dmitry Krotov et.al.	2507.06211	null
2025-07-08	CultureCLIP: Empowering CLIP with Cultural Awareness through Synthetic Images and Contextualized Captions	Yuchen Huang et.al.	2507.06210	null
2025-07-10	A Survey on Latent Reasoning	Rui-Jie Zhu et.al.	2507.06203	null
2025-07-08	Prompt-Free Conditional Diffusion for Multi-object Image Augmentation	Haoyu Wang et.al.	2507.06146	null
2025-07-08	Bridging Sequential Deep Operator Network and Video Diffusion: Residual Refinement of Spatio-Temporal PDE Solutions	Jaewan Park et.al.	2507.06133	null
2025-07-07	EmbodieDreamer: Advancing Real2Sim2Real Transfer for Policy Training via Embodied World Modeling	Boyuan Wang et.al.	2507.05198	null
2025-07-07	SV-DRR: High-Fidelity Novel View X-Ray Synthesis Using Diffusion Model	Chun Xie et.al.	2507.05148	null
2025-07-07	VERITAS: Verification and Explanation of Realness in Images for Transparency in AI Systems	Aadi Srivastava et.al.	2507.05146	null
2025-07-07	MoDiT: Learning Highly Consistent 3D Motion Coefficients with Diffusion Transformer for Talking Head Generation	Yucheng Wang et.al.	2507.05092	null
2025-07-07	AI-Driven Cytomorphology Image Synthesis for Medical Diagnostics	Jan Carreras Boada et.al.	2507.05063	null
2025-07-08	A COMPASS to Model Comparison and Simulation-Based Inference in Galactic Chemical Evolution	Berkay Gunes et.al.	2507.05060	null
2025-07-07	A Generative Diffusion Model for Amorphous Materials	Kai Yang et.al.	2507.05024	null
2025-07-07	TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation	Zonglin Lyu et.al.	2507.04984	null
2025-07-07	A diffusion model for light scattering in ejecta	J. A. Don Jayamanne et.al.	2507.04972	null
2025-07-07	LAPS-Diff: A Diffusion-Based Framework for Singing Voice Synthesis With Language Aware Prosody-Style Guided Learning	Sandipan Dhar et.al.	2507.04966	null
2025-07-07	DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer	Yecheng Wu et.al.	2507.04947	null
2025-07-07	Taming the Tri-Space Tension: ARC-Guided Hallucination Modeling and Control for Text-to-Image Generation	Jianjiang Yang et.al.	2507.04946	null
2025-07-07	RainShift: A Benchmark for Precipitation Downscaling Across Geographies	Paula Harder et.al.	2507.04930	null
2025-07-07	Object-centric Denoising Diffusion Models for Physical Reasoning	Moritz Lange et.al.	2507.04920	null
2025-07-07	Music Boomerang: Reusing Diffusion Models for Data Augmentation and Audio Manipulation	Alexander Fichtinger et.al.	2507.04864	null
2025-07-07	Discrete Diffusion Trajectory Alignment via Stepwise Decomposition	Jiaqi Han et.al.	2507.04832	null
2025-07-07	GraphBrep: Learning B-Rep in Graph Structure for Efficient CAD Generation	Weilin Lai et.al.	2507.04765	null
2025-07-07	Losing Control: Data Poisoning Attack on Guided Diffusion via ControlNet	Raz Lapid et.al.	2507.04726	null
2025-07-07	Structure-Guided Diffusion Models for High-Fidelity Portrait Shadow Removal	Wanchang Yu et.al.	2507.04692	null
2025-07-07	TeethGenerator: A two-stage framework for paired pre- and post-orthodontic 3D dental data generation	Changsong Lei et.al.	2507.04685	null
2025-07-03	Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching	Xin Zhou et.al.	2507.02860	null
2025-07-03	AnyI2V: Animating Any Conditional Image with Motion Control	Ziye Li et.al.	2507.02857	null
2025-07-03	USAD: An Unsupervised Data Augmentation Spatio-Temporal Attention Diffusion Network	Ying Yu et.al.	2507.02827	null
2025-07-03	LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion	Fangfu Liu et.al.	2507.02813	null
2025-07-03	RichControl: Structure- and Appearance-Rich Training-Free Spatial Control for Text-to-Image Generation	Liheng Zhang et.al.	2507.02792	null
2025-07-03	FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models	Yuxuan Wang et.al.	2507.02714	null
2025-07-03	UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation	Qin Guo et.al.	2507.02713	null
2025-07-03	APT: Adaptive Personalized Training for Diffusion Models with Limited Data	JungWoo Chae et.al.	2507.02687	null
2025-07-03	Learning few-step posterior samplers by unfolding and distillation of diffusion models	Charlesquin Kemajou Mbakam et.al.	2507.02686	null
2025-07-03	Guided Generation for Developable Antibodies	Siqi Zhao et.al.	2507.02670	null
2025-07-03	Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics Emulation	François Rozet et.al.	2507.02608	null
2025-07-03	AC-Refiner: Efficient Arithmetic Circuit Optimization Using Conditional Diffusion Models	Chenhao Xue et.al.	2507.02598	null
2025-07-03	Reconstructing Close Human Interaction with Appearance and Proxemics Reasoning	Buzhen Huang et.al.	2507.02565	null
2025-07-03	AvatarMakeup: Realistic Makeup Transfer for 3D Animatable Head Avatars	Yiming Zhong et.al.	2507.02419	null
2025-07-03	PosDiffAE: Position-aware Diffusion Auto-encoder For High-Resolution Brain Tissue Classification Incorporating Artifact Restoration	Ayantika Das et.al.	2507.02405	null
2025-07-03	Posterior Transition Modeling for Unsupervised Diffusion-Based Speech Enhancement	Mostafa Sadeghi et.al.	2507.02391	null
2025-07-03	Offline Reinforcement Learning with Penalized Action Noise Injection	JunHyeok Oh et.al.	2507.02356	null
2025-07-03	Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback	Nina Konovalova et.al.	2507.02321	null
2025-07-03	Transformer-based EEG Decoding: A Survey	Haodong Zhang et.al.	2507.02320	null
2025-07-03	DreamComposer++: Empowering Diffusion Models with Multi-View Conditions for 3D Content Generation	Yunhan Yang et.al.	2507.02299	null
2025-07-02	FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model	Yukang Cao et.al.	2507.01953	null
2025-07-02	Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning	Qingdong He et.al.	2507.01908	null
2025-07-02	Frontiers of Generative AI for Network Optimization: Theories, Limits, and Visions	Bo Yang et.al.	2507.01773	null
2025-07-02	Vision-Aided ISAC in Low-Altitude Economy Networks via De-Diffused Visual Priors	Yulan Gao et.al.	2507.01574	null
2025-07-02	Loss Functions in Diffusion Models: A Comparative Study	Dibyanshu Kumar et.al.	2507.01516	null
2025-07-02	ReFlex: Text-Guided Editing of Real Images in Rectified Flow via Mid-Step Feature Extraction and Attention Adaptation	Jimyeong Kim et.al.	2507.01496	null
2025-07-02	Representation Entanglement for Generation: Training Diffusion Transformers Is Much Easier Than You Think	Ge Wu et.al.	2507.01467	null
2025-07-02	DiffMark: Diffusion-based Robust Watermark Against Deepfakes	Chen Sun et.al.	2507.01428	null
2025-07-02	DocShaDiffusion: Diffusion Model in Latent Space for Document Image Shadow Removal	Wenjie Liu et.al.	2507.01422	null
2025-07-03	Distributional Soft Actor-Critic with Diffusion Policy	Tong Liu et.al.	2507.01381	null
2025-07-02	Efficient Kilometer-Scale Precipitation Downscaling with Conditional Wavelet Diffusion	Chugang Yi et.al.	2507.01354	null
2025-07-02	Multi-User Generative Semantic Communication with Intent-Aware Semantic-Splitting Multiple Access	Jiayi Lu et.al.	2507.01333	null
2025-07-02	SD-Acc: Accelerating Stable Diffusion through Phase-aware Sampling and Hardware Co-Optimizations	Zhican Wang et.al.	2507.01309	null
2025-07-02	DiffusionLight-Turbo: Accelerated Light Probes for Free via Single-Pass Chrome Ball Inpainting	Worameth Chinchuthakun et.al.	2507.01305	null
2025-07-02	Frequency Domain-Based Diffusion Model for Unpaired Image Dehazing	Chengxu Liu et.al.	2507.01275	null
2025-07-01	Diffusion Explorer: Interactive Exploration of Diffusion Models	Alec Helbling et.al.	2507.01178	null
2025-07-01	Bayesian Regression Analysis with the Drift-Diffusion Model	Zekai Jin et.al.	2507.01177	null
2025-07-01	DAM-VSR: Disentanglement of Appearance and Motion for Video Super-Resolution	Zhe Kong et.al.	2507.01012	null
2025-07-02	UniGlyph: Unified Segmentation-Conditioned Diffusion for Precise Visual Text Synthesis	Yuanrui Wang et.al.	2507.00992	null
2025-07-01	Robotic Manipulation by Imitating Generated Videos Without Physical Demonstrations	Shivansh Patel et.al.	2507.00990	null
2025-06-30	Epona: Autoregressive Diffusion World Model for Autonomous Driving	Kaiwen Zhang et.al.	2506.24113	null
2025-06-30	Navigating with Annealing Guidance Scale in Diffusion Space	Shai Yehezkel et.al.	2506.24108	null
2025-06-30	Imagine for Me: Creative Conceptual Blending of Real Images and Text via Blended Attention	Wonwoong Cho et.al.	2506.24085	null
2025-06-30	Faster Diffusion Models via Higher-Order Approximation	Gen Li et.al.	2506.24042	null
2025-06-30	Supervised Diffusion-Model-Based PET Image Reconstruction	George Webber et.al.	2506.24034	null
2025-06-30	VMoBA: Mixture-of-Block Attention for Video Diffusion Models	Jianzong Wu et.al.	2506.23858	null
2025-06-30	Controllable Reference-Based Real-World Remote Sensing Image Super-Resolution with Generative Diffusion Priors	Ce Wang et.al.	2506.23801	null
2025-06-30	Radioactive Watermarks in Diffusion and Autoregressive Image Generative Models	Michel Meintz et.al.	2506.23731	null
2025-06-30	Proteus-ID: ID-Consistent and Motion-Coherent Video Customization	Guiyu Zhang et.al.	2506.23729	null
2025-06-30	MDPG: Multi-domain Diffusion Prior Guidance for MRI Reconstruction	Lingtong Zhang et.al.	2506.23701	null
2025-06-30	A Unified Framework for Stealthy Adversarial Generation via Latent Optimization and Transferability Enhancement	Gaozheng Pei et.al.	2506.23676	null
2025-06-30	Diffusion Model-based Data Augmentation Method for Fetal Head Ultrasound Segmentation	Fangyijie Wang et.al.	2506.23664	null
2025-06-30	Blending Concepts with Text-to-Image Diffusion Models	Lorenzo Olearo et.al.	2506.23630	null
2025-06-30	TurboVSR: Fantastic Video Upscalers and Where to Find Them	Zhongdao Wang et.al.	2506.23618	null
2025-06-30	SG-LDM: Semantic-Guided LiDAR Generation via Latent-Aligned Diffusion	Zhengkang Xiang et.al.	2506.23606	null
2025-06-30	Metadata, Wavelet, and Time Aware Diffusion Models for Satellite Image Super Resolution	Luigi Sigillo et.al.	2506.23566	null
2025-06-30	Uncertainty-aware Diffusion and Reinforcement Learning for Joint Plane Localization and Anomaly Diagnosis in 3D Ultrasound	Yuhao Huang et.al.	2506.23538	null
2025-06-30	WAVE: Warp-Based View Guidance for Consistent Novel View Synthesis Using a Single Image	Jiwoo Park et.al.	2506.23518	null
2025-06-30	ViewPoint: Panoramic Video Generation with Pretrained Diffusion Models	Zixun Fang et.al.	2506.23513	null
2025-06-30	MTADiffusion: Mask Text Alignment Diffusion Model for Object Inpainting	Jun Huang et.al.	2506.23482	null
2025-06-26	SmoothSinger: A Conditional Diffusion Model for Singing Voice Synthesis with Multi-Resolution Architecture	Kehan Sui et.al.	2506.21478	null
2025-06-26	Rethinking Oversaturation in Classifier-Free Guidance via Low Frequency	Kaiyu Song et.al.	2506.21452	null
2025-06-26	Controllable 3D Placement of Objects with Scene-Aware Diffusion Models	Mohamed Omran et.al.	2506.21446	null
2025-06-26	HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation	Diego Biagini et.al.	2506.21287	null
2025-06-27	FairyGen: Storied Cartoon Video from a Single Child-Drawn Character	Jiayi Zheng et.al.	2506.21272	null
2025-06-27	Alternating Spintronics: Capacitive Behavior of Spin Valves and Resonator Applications	Yunwen Liu et.al.	2506.21176	null
2025-06-26	Compressed and Smooth Latent Space for Text Diffusion Modeling	Viacheslav Meshchaninov et.al.	2506.21170	null
2025-06-26	Geometry and Perception Guided Gaussians for Multiview-consistent 3D Generation from a Single Image	Pufan Li et.al.	2506.21152	null
2025-06-26	Learning to See in the Extremely Dark	Hai Jiang et.al.	2506.21132	null
2025-06-26	Unlasting: Unpaired Single-Cell Multi-Perturbation Estimation by Dual Conditional Diffusion Implicit Bridges	Changxi Chi et.al.	2506.21107	null
2025-06-26	Improving Diffusion-Based Image Editing Faithfulness via Guidance and Scheduling	Hansam Cho et.al.	2506.21045	null
2025-06-26	Boosting Domain Generalized and Adaptive Detection with Diffusion Models: Fitness, Generalization, and Transferability	Boyong He et.al.	2506.21042	null
2025-06-27	DidSee: Diffusion-Based Depth Completion for Material-Agnostic Robotic Perception and Manipulation	Wenzhou Lyu et.al.	2506.21034	null
2025-06-26	From Cradle to Cane: A Two-Pass Framework for High-Fidelity Lifespan Face Aging	Tao Liu et.al.	2506.20977	null
2025-06-26	ThermalDiffusion: Visual-to-Thermal Image-to-Image Translation for Autonomous Navigation	Shruti Bansal et.al.	2506.20969	null
2025-06-26	Antibody Design and Optimization with Multi-scale Equivariant Graph Diffusion Models for Accurate Complex Antigen Binding	Jiameng Chen et.al.	2506.20957	null
2025-06-25	Leveraging Vision-Language Models to Select Trustworthy Super-Resolution Samples Generated by Diffusion Models	Cansu Korkmaz et.al.	2506.20832	null
2025-06-25	Stochastic and Non-local Closure Modeling for Nonlinear Dynamical Systems via Latent Score-based Generative Models	Xinghao Dong et.al.	2506.20771	null
2025-06-25	StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation	Haodong Li et.al.	2506.20756	null
2025-06-25	On Convolutions, Intrinsic Dimension, and Diffusion Models	Kin Kwan Leung et.al.	2506.20705	null
2025-06-25	EditP23: 3D Editing via Propagation of Image Prompts to Multi-View	Roi Bar-On et.al.	2506.20652	null
2025-06-25	Telegrapher’s Generative Model via Kac Flows	Richard Duong et.al.	2506.20641	null
2025-06-26	DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation	Shansan Gong et.al.	2506.20639	null
2025-06-25	MC for Agriculture: A Framework for Nature-inspired Sustainable Pest Control	Fardad Vakilipoor et.al.	2506.20637	null
2025-06-25	Shape2Animal: Creative Animal Generation from Natural Silhouettes	Quoc-Duy Tran et.al.	2506.20616	null
2025-06-25	Pay Less Attention to Deceptive Artifacts: Robust Detection of Compressed Deepfakes on Online Social Networks	Manyi Li et.al.	2506.20548	null
2025-06-25	HiWave: Training-Free High-Resolution Image Generation via Wavelet-Based Diffusion Sampling	Tobias Vontobel et.al.	2506.20452	null
2025-06-25	TDiR: Transformer based Diffusion for Image Restoration Tasks	Abbas Anwar et.al.	2506.20302	null
2025-06-25	Ctrl-Z Sampling: Diffusion Sampling with Controlled Random Zigzag Explorations	Shunqi Mao et.al.	2506.20294	null
2025-06-25	Recognizing Surgical Phases Anywhere: Few-Shot Test-time Adaptation and Task-graph Guided Refinement	Kun Yuan et.al.	2506.20254	null
2025-06-25	Towards Efficient Exemplar Based Image Editing with Multimodal VLMs	Avadhoot Jadhav et.al.	2506.20155	null
2025-06-24	Robust Robotic Exploration and Mapping Using Generative Occupancy Map Synthesis	Lorin Achey et.al.	2506.20049	null
2025-06-24	Elucidated Rolling Diffusion Models for Probabilistic Weather Forecasting	Salva Rühling Cachay et.al.	2506.20024	null
2025-06-24	Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture	Shuchen Xue et.al.	2506.19935	null
2025-06-24	Radial Attention: $O(n\log n)$ Sparse Attention with Energy Decay for Long Video Generation	Xingyang Li et.al.	2506.19852	null
2025-06-24	AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models	Zehuan Huang et.al.	2506.19851	null
2025-06-24	GenHSI: Controllable Generation of Human-Scene Interaction Videos	Zekun Li et.al.	2506.19840	null
2025-06-24	Improving Progressive Generation with Decomposable Flow Matching	Moayed Haji-Ali et.al.	2506.19839	null
2025-06-24	SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution	Liangbin Xie et.al.	2506.19838	null
2025-06-24	Machine Learning with Privacy for Protected Attributes	Saeed Mahloujifar et.al.	2506.19836	null
2025-06-23	Audit & Repair: An Agentic Framework for Consistent Story Visualization in Text-to-Image Diffusion Models	Kiymet Akdemir et.al.	2506.18900	null
2025-06-23	MinD: Unified Visual Imagination and Control via Hierarchical World Models	Xiaowei Chi et.al.	2506.18897	null
2025-06-23	Let Your Video Listen to Your Music!	Xinyu Zhang et.al.	2506.18881	null
2025-06-23	ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs	Michal Nazarczuk et.al.	2506.18792	null
2025-06-23	TCDiff++: An End-to-end Trajectory-Controllable Diffusion Model for Harmonious Music-Driven Group Choreography	Yuqin Dai et.al.	2506.18671	null
2025-06-23	GANs vs. Diffusion Models for virtual staining with the HER2match dataset	Pascal Klöckner et.al.	2506.18484	null
2025-06-23	DIP: Unsupervised Dense In-Context Post-training of Visual Representations	Sophia Sirko-Galouchenko et.al.	2506.18463	null
2025-06-23	CPAM: Context-Preserving Adaptive Manipulation for Zero-Shot Real Image Editing	Dinh-Khoi Vo et.al.	2506.18438	null
2025-06-23	How Robust is Model Editing after Fine-Tuning? An Empirical Study on Text-to-Image Diffusion Models	Feng He et.al.	2506.18428	null
2025-06-23	Generative Diffusion Receivers: Achieving Pilot-Efficient MIMO-OFDM Communications	Yuzhi Yang et.al.	2506.18419	null
2025-06-23	Large-Scale Training Data Attribution for Music Generative Models via Unlearning	Woosung Choi et.al.	2506.18312	null
2025-06-23	Instability in Diffusion ODEs: An Explanation for Inaccurate Image Reconstruction	Han Zhang et.al.	2506.18290	null
2025-06-23	Adaptive Mask-guided K-space Diffusion for Accelerated MRI Reconstruction	Qinrong Cai et.al.	2506.18270	null
2025-06-23	Morse: Dual-Sampling for Lossless Acceleration of Diffusion Models	Chao Li et.al.	2506.18251	null
2025-06-23	Exact Conditional Score-Guided Generative Modeling for Amortized Inference in Uncertainty Quantification	Zezhong Zhang et.al.	2506.18227	null
2025-06-23	American options valuation in time-dependent jump-diffusion models via integral equations and characteristic functions	Andrey Itkin et.al.	2506.18210	null
2025-06-22	CDG-MAE: Learning Correspondences from Diffusion Generated Views	Varun Belagali et.al.	2506.18164	null
2025-06-22	Targeted False Positive Synthesis via Detector-guided Adversarial Diffusion Attacker for Robust Polyp Detection	Quan Zhou et.al.	2506.18134	null
2025-06-22	Enabling PSO-Secure Synthetic Data Sharing Using Diversity-Aware Diffusion Models	Mischa Dombrowski et.al.	2506.17975	null
2025-06-24	GD-Retriever: Controllable Generative Text-Music Retrieval with Diffusion Models	Julien Guinot et.al.	2506.17886	null
2025-06-18	Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards	Qingming Liu et.al.	2506.15684	null
2025-06-18	Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model	Anirud Aggarwal et.al.	2506.15682	link
2025-06-18	UniRelight: Learning Joint Decomposition and Synthesis for Video Relighting	Kai He et.al.	2506.15673	null
2025-06-18	HOIDiNi: Human-Object Interaction through Diffusion Noise Optimization	Roey Ron et.al.	2506.15625	null
2025-06-18	One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution	Yujing Sun et.al.	2506.15591	link
2025-06-18	Control and Realism: Best of Both Worlds in Layout-to-Image without Training	Bonan Li et.al.	2506.15563	null
2025-06-18	Diff-TONE: Timestep Optimization for iNstrument Editing in Text-to-Music Diffusion Models	Teysir Baoueb et.al.	2506.15530	null
2025-06-18	GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects	Shujia Li et.al.	2506.15483	null
2025-06-18	Provable Maximum Entropy Manifold Exploration via Diffusion Models	Riccardo De Santi et.al.	2506.15385	null
2025-06-18	When Model Knowledge meets Diffusion Model: Diffusion-assisted Data-free Image Synthesis with Alignment of Domain and Class	Yujin Kim et.al.	2506.15381	null
2025-06-18	Acoustic Waveform Inversion with Image-to-Image Schrödinger Bridges	A. S. Stankevich et.al.	2506.15346	link
2025-06-19	Naive parton picture for color transparency of kaon in the electronuclear reaction $A(e,e'K^+)$	Kook-Jin Kong et.al.	2506.15331	null
2025-06-18	One-shot Face Sketch Synthesis in the Wild via Generative Diffusion Prior and Instruction Tuning	Han Wu et.al.	2506.15312	link
2025-06-18	Human Motion Capture from Loose and Sparse Inertial Sensors with Garment-aware Diffusion Models	Andela Ilic et.al.	2506.15290	null
2025-06-18	DM-FNet: Unified multimodal medical image fusion via diffusion process-trained encoder-decoder	Dan He et.al.	2506.15218	link
2025-06-18	Echo-DND: A dual noise diffusion model for robust and precise left ventricle segmentation in echocardiography	Abdur Rahman et.al.	2506.15166	null
2025-06-18	Fundamentals of the metal contact to p-type GaN: new multilayer design	Konrad Sakowski et.al.	2506.15163	null
2025-06-18	Generative thermodynamic computing	Stephen Whitelam et.al.	2506.15121	null
2025-06-17	Frequency-Calibrated Membership Inference Attacks on Medical Image Diffusion Models	Xinkai Zhao et.al.	2506.14919	null
2025-06-17	CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion	Jiahua Ma et.al.	2506.14769	null
2025-06-16	Diagnosing and Improving Diffusion Models by Estimating the Optimal Loss Value	Yixian Xu et.al.	2506.13763	null
2025-06-17	VideoPDE: Unified Generative PDE Solving via Video Inpainting Diffusion Models	Edward Li et.al.	2506.13754	null
2025-06-16	MultiViT2: A Data-augmented Multimodal Neuroimaging Prediction Framework via Latent Diffusion Model	Bi Yuda et.al.	2506.13667	null
2025-06-16	Exploiting the Exact Denoising Posterior Score in Training-Free Guidance of Diffusion Models	Gregory Bellchambers et.al.	2506.13614	null
2025-06-16	Dive3D: Diverse Distillation-based Text-to-3D Generation via Score Implicit Matching	Weimin Bai et.al.	2506.13594	null
2025-06-16	Flexible-length Text Infilling for Discrete Diffusion Models	Andrew Zhang et.al.	2506.13579	null
2025-06-16	X-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability	Yu Yang et.al.	2506.13558	null
2025-06-16	Seismic Acoustic Impedance Inversion Framework Based on Conditional Latent Generative Diffusion Model	Jie Chen et.al.	2506.13529	null
2025-06-16	Deep Diffusion Models and Unsupervised Hyperspectral Unmixing for Realistic Abundance Map Synthesis	Martina Pastorino et.al.	2506.13484	null
2025-06-16	PRO: Projection Domain Synthesis for CT Imaging	Kang Chen et.al.	2506.13443	null
2025-06-16	Zero-Shot Solving of Imaging Inverse Problems via Noise-Refined Likelihood Guided Diffusion Models	Zhen Wang et.al.	2506.13391	null
2025-06-16	LapDDPM: A Conditional Graph Diffusion Model for scRNA-seq Generation with Spectral Adversarial Perturbations	Lorenzo Bini et.al.	2506.13344	null
2025-06-16	Quantitative Comparison of Fine-Tuning Techniques for Pretrained Latent Diffusion Models in the Generation of Unseen SAR Image Concepts	Solène Debuysère et.al.	2506.13307	null
2025-06-16	AttentionDrag: Exploiting Latent Correlation Knowledge in Pre-trained Diffusion Models for Image Editing	Biao Yang et.al.	2506.13301	null
2025-06-16	Overcoming Overfitting in Reinforcement Learning via Gaussian Process Diffusion Policy	Amornyos Horprasert et.al.	2506.13111	link
2025-06-16	DualFast: Dual-Speedup Framework for Fast Sampling of Diffusion Models	Hu Yu et.al.	2506.13058	null
2025-06-16	A Comprehensive Survey on Continual Learning in Generative Models	Haiyang Guo et.al.	2506.13045	link
2025-06-15	Generative modeling of seismic data using diffusion models and its application to multi-purpose posterior sampling for noisy inverse problems	Chuangji Meng et.al.	2506.12897	null
2025-06-15	EraserDiT: Fast Video Inpainting with Diffusion Transformer Model	Jie Liu et.al.	2506.12853	null
2025-06-15	DiffS-NOCS: 3D Point Cloud Reconstruction through Coloring Sketches to NOCS Maps Using Diffusion Models	Di Kong et.al.	2506.12835	null
2025-06-12	SceneCompleter: Dense 3D Scene Completion for Generative Novel View Synthesis	Weiliang Chen et.al.	2506.10981	null
2025-06-12	Fine-Grained Perturbation Guidance via Attention Head Selection	Donghoon Ahn et.al.	2506.10978	null
2025-06-12	What Exactly Does Guidance Do in Masked Discrete Diffusion Models	He Ye et.al.	2506.10971	null
2025-06-13	MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning	Yuxuan Luo et.al.	2506.10963	null
2025-06-12	SpectralAR: Spectral Autoregressive Visual Generation	Yuanhui Huang et.al.	2506.10962	null
2025-06-12	ReGuidance: A Simple Diffusion Wrapper for Boosting Sample Quality on Hard Inverse Problems	Aayush Karan et.al.	2506.10955	null
2025-06-12	The Diffusion Duality	Subham Sekhar Sahoo et.al.	2506.10892	link
2025-06-12	ME: Trigger Element Combination Backdoor Attack on Copyright Infringement	Feiyu Yang et.al.	2506.10776	null
2025-06-13	PDESpectralRefiner: Achieving More Accurate Long Rollouts with Spectral Adjustment	Li Luo et.al.	2506.10711	null
2025-06-12	Unsourced Adversarial CAPTCHA: A Bi-Phase Adversarial CAPTCHA Framework	Xia Du et.al.	2506.10685	null
2025-06-12	GigaVideo-1: Advancing Video Generation via Automatic Feedback with 4 GPU-Hours Fine-Tuning	Xiaoyi Bao et.al.	2506.10639	null
2025-06-12	Anatomy-Grounded Weakly Supervised Prompt Tuning for Chest X-ray Latent Diffusion Models	Konstantinos Vilouras et.al.	2506.10633	null
2025-06-12	Hessian Geometry of Latent Space in Generative Models	Alexander Lobashev et.al.	2506.10632	link
2025-06-12	TexTailor: Customized Text-aligned Texturing via Effective Resampling	Suin Lee et.al.	2506.10612	link
2025-06-12	High-resolution efficient image generation from WiFi CSI using a pretrained latent diffusion model	Eshan Ramesh et.al.	2506.10605	null
2025-06-12	Harmonizing Geometry and Uncertainty: Diffusion with Hyperspheres	Muskan Dosi et.al.	2506.10576	null
2025-06-12	Equivariant Neural Diffusion for Molecule Generation	François Cornet et.al.	2506.10532	link
2025-06-12	Edit360: 2D Image Edits to 3D Assets from Any Angle	Junchao Huang et.al.	2506.10507	null
2025-06-12	A Crack in the Bark: Leveraging Public Knowledge to Remove Tree-Ring Watermarks	Junhua Lin et.al.	2506.10502	null
2025-06-12	Measuring Semantic Information Production in Generative Diffusion Models	Florian Handke et.al.	2506.10433	null
2025-06-09	StableMTL: Repurposing Latent Diffusion Models for Multi-Task Learning from Partially Annotated Synthetic Datasets	Anh-Quan Cao et.al.	2506.08013	link
2025-06-09	Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion	Xun Huang et.al.	2506.08009	null
2025-06-09	Dynamic View Synthesis as an Inverse Problem	Hidir Yesiltepe et.al.	2506.08004	null
2025-06-09	MADFormer: Mixed Autoregressive and Diffusion Transformers for Continuous Image Generation	Junhao Chen et.al.	2506.07999	null
2025-06-09	Generative Modeling of Weights: Generalization or Memorization?	Boya Zeng et.al.	2506.07998	link
2025-06-09	Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers	Zhengyao Lv et.al.	2506.07986	link
2025-06-09	Gradients: When Markets Meet Fine-tuning – A Distributed Approach to Model Optimisation	Christopher Subia-Waud et.al.	2506.07940	null
2025-06-09	Efficient Seismic Data Interpolation via Sparse Attention Transformer and Diffusion Model	Xiaoli Wei et.al.	2506.07923	null
2025-06-09	Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces	Kevin Rojas et.al.	2506.07903	link
2025-06-09	FunDiff: Diffusion Models over Function Spaces for Physics-Informed Generative Modeling	Sifan Wang et.al.	2506.07902	link
2025-06-09	Video Unlearning via Low-Rank Refusal Vector	Simone Facchiano et.al.	2506.07891	null
2025-06-09	Diffusion Counterfactual Generation with Semantic Abduction	Rajat Rasal et.al.	2506.07883	link
2025-06-09	Jarzynski Reweighting and Sampling Dynamics for Training Energy-Based Models: Theoretical Analysis of Different Transition Kernels	Davide Carbone et.al.	2506.07843	null
2025-06-09	Diffusion models under low-noise regime	Elizabeth Pavlova et.al.	2506.07841	link
2025-06-09	R3D2: Realistic 3D Asset Insertion via Diffusion for Autonomous Driving Simulation	William Ljungbergh et.al.	2506.07826	null
2025-06-09	Accelerating Diffusion Models in Offline RL via Reward-Aware Consistency Trajectory Distillation	Xintong Duan et.al.	2506.07822	null
2025-06-09	Self-Cascaded Diffusion Models for Arbitrary-Scale Image Super-Resolution	Junseo Bang et.al.	2506.07813	null
2025-06-09	Diffusion Models-Aided Uplink Channel Estimation for RIS-Assisted Systems	Yang Wang et.al.	2506.07770	null
2025-06-09	Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy Generation	Hyunsoo Kim et.al.	2506.07750	null
2025-06-09	Consistent Video Editing as Flow-Driven Image-to-Video Generation	Ge Wang et.al.	2506.07713	null
2025-06-05	Contrastive Flow Matching	George Stoica et.al.	2506.05350	link
2025-06-06	Exploring Diffusion Transformer Designs via Grafting	Keshigeyan Chandrasegaran et.al.	2506.05340	link
2025-06-05	Progressive Tempering Sampler with Diffusion	Severi Rissanen et.al.	2506.05231	link
2025-06-05	OGGSplat: Open Gaussian Growing for Generalizable Reconstruction with Expanded Field-of-View	Yanbo Wang et.al.	2506.05204	link
2025-06-05	Quantifying Cross-Modality Memorization in Vision-Language Models	Yuxin Wen et.al.	2506.05198	null
2025-06-05	Associative Memory and Generative Diffusion in the Zero-noise Limit	Joshua Hess et.al.	2506.05178	null
2025-06-05	Neural Jumps for Option Pricing	Duosi Zheng et.al.	2506.05137	null
2025-06-06	SeedEdit 3.0: Fast and High-Quality Generative Image Editing	Peng Wang et.al.	2506.05083	null
2025-06-05	FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing	Guangzhao Li et.al.	2506.05046	null
2025-06-05	Invisible Backdoor Triggers in Image Editing Model via Deep Watermarking	Yu-Feng Chen et.al.	2506.04879	link
2025-06-06	Sparse Autoencoders, Again?	Yin Lu et.al.	2506.04859	null
2025-06-05	Learning dissection trajectories from expert surgical videos via imitation learning with equivariant diffusion	Hongyu Wang et.al.	2506.04716	null
2025-06-05	Text-Aware Real-World Image Super-Resolution via Diffusion Model with Joint Segmentation Decoders	Qiming Hu et.al.	2506.04641	null
2025-06-05	Perfecting Depth: Uncertainty-Aware Enhancement of Metric Depth	Jinyoung Jun et.al.	2506.04612	null
2025-06-05	SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents	Alexander Huang-Menders et.al.	2506.04606	null
2025-06-04	HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation	Hermann Kumbong et.al.	2506.04421	null
2025-06-04	Is Perturbation-Based Image Protection Disruptive to Image Editing?	Qiuyu Tang et.al.	2506.04394	null
2025-06-04	HuGeDiff: 3D Human Generation via Diffusion with Gaussian Splatting	Maksym Ivashechkin et.al.	2506.04351	null
2025-06-04	Sounding that Object: Interactive Object-Aware Image to Audio Generation	Tingle Li et.al.	2506.04214	null
2025-06-04	Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector	Boyong He et.al.	2506.04211	link
2025-06-04	Image Editing As Programs with Diffusion Models	Yujia Hu et.al.	2506.04158	null
2025-06-04	Global convergence rates in the relaxation limits for the compressible Euler and Euler-Maxwell systems in Sobolev spaces	Timothée Crin-Barat et.al.	2506.04103	null
2025-06-04	A Generative Adaptive Replay Continual Learning Model for Temporal Knowledge Graph Reasoning	Zhiyu Zhang et.al.	2506.04083	null
2025-06-04	Beyond water limitation in vegetation-autotoxicity patterning: a cross-diffusion model	Francesco Giannino et.al.	2506.03981	null
2025-06-05	Solving Inverse Problems via Diffusion-Based Priors: An Approximation-Free Ensemble Sampling Approach	Haoxuan Chen et.al.	2506.03979	null
2025-06-04	DiffCAP: Diffusion-based Cumulative Adversarial Purification for Vision Language Models	Jia Fu et.al.	2506.03933	null
2025-06-04	Personalized MR-Informed Diffusion Models for 3D PET Image Reconstruction	George Webber et.al.	2506.03804	null
2025-06-04	EmoArt: A Multidimensional Dataset for Emotion-Aware Artistic Generation	Cheng Zhang et.al.	2506.03652	null
2025-06-04	DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models	Ziyi Wu et.al.	2506.03517	null
2025-06-04	CHIME: Conditional Hallucination and Integrated Multi-scale Enhancement for Time Series Diffusion Model	Yuxuan Chen et.al.	2506.03502	null
2025-06-04	Facial Appearance Capture at Home with Patch-Level Reflectance Prior	Yuxuan Han et.al.	2506.03478	link
2025-06-03	A Data-Driven Diffusion-based Approach for Audio Deepfake Explanations	Petr Grinberg et.al.	2506.03425	null
2025-06-03	Robustness in Both Domains: CLIP Needs a Robust Text Encoder	Elias Abad Rocamora et.al.	2506.03355	null
2025-06-03	AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation	Lu Qiu et.al.	2506.03126	null
2025-06-03	DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation	Zhengyao Lv et.al.	2506.03123	null
2025-06-03	Rectified Flows for Fast Multiscale Fluid Flow Modeling	Victor Armegioiu et.al.	2506.03111	null
2025-06-03	TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models	Chetwin Low et.al.	2506.03099	null
2025-06-03	EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models	Mingzhe Li et.al.	2506.03067	null
2025-05-30	AdaHuman: Animatable Detailed 3D Human Generation with Compositional Multiview Diffusion	Yangyi Huang et.al.	2505.24877	null
2025-05-30	MiniMax-Remover: Taming Bad Noise Helps Video Object Removal	Bojia Zi et.al.	2505.24873	null
2025-05-30	Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking	Heli Ben-Hamu et.al.	2505.24857	null
2025-05-30	RealDrive: Retrieval-Augmented Driving with Diffusion Models	Wenhao Ding et.al.	2505.24808	null
2025-05-30	Generalization Dynamics of Linear Diffusion Models	Claudia Merger et.al.	2505.24769	null
2025-05-30	A Composite Predictive-Generative Approach to Monaural Universal Speech Enhancement	Jie Zhang et.al.	2505.24576	null
2025-05-30	UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation	Yang-Tian Sun et.al.	2505.24521	null
2025-05-30	EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering	Runnan Lu et.al.	2505.24417	link
2025-05-30	IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models	Hanting Wang et.al.	2505.24406	link
2025-06-03	Interpreting Large Text-to-Image Diffusion Models with Dictionary Learning	Stepan Shabalin et.al.	2505.24360	link
2025-05-30	InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing	Jinlu Zhang et.al.	2505.24315	null
2025-05-30	Category-aware EEG image generation based on wavelet transform and contrast semantic loss	Enshang Zhang et.al.	2505.24301	link
2025-05-30	Large Language Models are Locally Linear Mappings	James R. Golden et.al.	2505.24293	link
2025-05-30	MUSE: Model-Agnostic Tabular Watermarking via Multi-Sample Selection	Liancheng Fang et.al.	2505.24267	null
2025-05-30	Generative AI for Urban Design: A Stepwise Approach Integrating Human Expertise with Multimodal Diffusion Models	Mingyi He et.al.	2505.24260	null
2025-05-30	Interactive Video Generation via Domain Adaptation	Ishaan Rawal et.al.	2505.24253	null
2025-05-30	LTM3D: Bridging Token Spaces for Conditional 3D Generation with Auto-Regressive Diffusion Framework	Xin Kang et.al.	2505.24245	null
2025-05-30	Unleashing High-Quality Image Generation in Diffusion Sampling Using Second-Order Levenberg-Marquardt-Langevin	Fangyikang Wang et.al.	2505.24222	link
2025-05-30	STORK: Improving the Fidelity of Mid-NFE Sampling for Diffusion and Flow Matching Models	Zheng Tan et.al.	2505.24210	link
2025-05-30	Aligning Protein Conformation Ensemble Generation with Physical Feedback	Jiarui Lu et.al.	2505.24203	null
2025-05-29	LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers	Yusuf Dalva et.al.	2505.23758	null
2025-05-29	DarkDiff: Advancing Low-Light Raw Enhancement by Retasking Diffusion Models for Camera ISP	Amber Yijia Zheng et.al.	2505.23743	null
2025-05-29	LayerPeeler: Autoregressive Peeling for Layer-wise Image Vectorization	Ronghuan Wu et.al.	2505.23740	null
2025-05-29	How Animals Dance (When You’re Not Looking)	Xiaojuan Wang et.al.	2505.23738	null
2025-05-29	DiffER: Categorical Diffusion for Chemical Retrosynthesis	Sean Current et.al.	2505.23721	link
2025-05-29	ImmunoDiff: A Diffusion Model for Immunotherapy Response Prediction in Lung Cancer	Moinak Bhattacharya et.al.	2505.23675	null
2025-05-30	OpenUni: A Simple Baseline for Unified Multimodal Understanding and Generation	Size Wu et.al.	2505.23661	link
2025-05-29	VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models	Xiangdong Zhang et.al.	2505.23656	link
2025-05-29	Optimization-Free Diffusion Model – A Perturbation Theory Approach	Yuehaw Khoo et.al.	2505.23652	null
2025-05-29	ZeroSep: Separate Anything in Audio with Zero Training	Chao Huang et.al.	2505.23625	null
2025-05-29	Inference-time Scaling of Diffusion Models through Classical Search	Xiangcheng Zhang et.al.	2505.23614	null
2025-05-29	Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model	Qingyu Shi et.al.	2505.23606	link
2025-05-29	Normalizing Flows are Capable Models for RL	Raj Ghugare et.al.	2505.23527	link
2025-05-29	LAFR: Efficient Diffusion-based Blind Face Restoration via Latent Codebook Alignment Adapter	Runyi Li et.al.	2505.23462	null
2025-05-29	Diffusion Guidance Is a Controllable Policy Improvement Operator	Kevin Frans et.al.	2505.23458	link
2025-05-29	CryoCCD: Conditional Cycle-consistent Diffusion with Biophysical Modeling for Cryo-EM Synthesis	Runmin Jiang et.al.	2505.23444	null
2025-05-29	Enhanced DACER Algorithm with High Diffusion Efficiency	Yinuo Wang et.al.	2505.23426	null
2025-05-29	Diffusion Sampling Path Tells More: An Efficient Plug-and-Play Strategy for Sample Filtering	Sixian Wang et.al.	2505.23343	link
2025-05-29	TRACE: Trajectory-Constrained Concept Erasure in Diffusion Models	Finn Carter et.al.	2505.23312	null
2025-05-29	MGE-LDM: Joint Latent Diffusion for Simultaneous Music Generation and Source Extraction	Yunkee Chae et.al.	2505.23305	null
2025-05-28	SPIRAL: Semantic-Aware Progressive LiDAR Scene Generation	Dekai Zhu et.al.	2505.22643	null
2025-05-28	Principled Out-of-Distribution Generalization via Simplicity	Jiawei Ge et.al.	2505.22622	null
2025-05-28	Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding	Chengyue Wu et.al.	2505.22618	null
2025-05-28	ImageReFL: Balancing Quality and Diversity in Human-Aligned Diffusion Models	Dmitrii Sorokin et.al.	2505.22569	null
2025-05-28	Test-Time Alignment of Discrete Diffusion Models with Sequential Monte Carlo	Chinmay Pani et.al.	2505.22524	null
2025-05-28	PrismLayers: Open Data for High-Quality Multi-Layer Transparent Image Generative Models	Junwen Chen et.al.	2505.22523	null
2025-05-28	Cascaded 3D Diffusion Models for Whole-body 3D 18-F FDG PET/CT synthesis from Demographics	Siyeop Yoon et.al.	2505.22489	null
2025-05-28	Self-Reflective Reinforcement Learning for Diffusion-based Image Reasoning Generation	Jiadong Pan et.al.	2505.22407	null
2025-05-28	Physics-Informed Distillation of Diffusion Models for PDE-Constrained Generation	Yi Zhang et.al.	2505.22391	null
2025-05-28	A Closer Look on Memorization in Tabular Diffusion Model: A Data-Centric Perspective	Zhengyu Fang et.al.	2505.22322	null
2025-05-28	StateSpaceDiffuser: Bringing Long Context to Diffusion World Models	Nedko Savov et.al.	2505.22246	null
2025-05-28	Physics-inspired Generative AI models via real hardware-based noisy quantum diffusion	Marco Parigi et.al.	2505.22193	null
2025-05-28	Unifying Continuous and Discrete Text Diffusion with Non-simultaneous Diffusion Processes	Bocheng Li et.al.	2505.22165	null
2025-05-28	What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?	Jinhong Ni et.al.	2505.22129	null
2025-05-28	SridBench: Benchmark of Scientific Research Illustration Drawing of Image Generation Model	Yifan Chang et.al.	2505.22126	null
2025-05-28	Autoregression-free video prediction using diffusion model for mitigating error propagation	Woonho Ko et.al.	2505.22111	link
2025-05-28	AudioTurbo: Fast Text-to-Audio Generation with Rectified Diffusion	Junqi Zhao et.al.	2505.22106	null
2025-05-28	High Volume Rate 3D Ultrasound Reconstruction with Diffusion Models	Tristan S. W. Stevens et.al.	2505.22090	null
2025-05-28	Align-DA: Align Score-based Atmospheric Data Assimilation with Multiple Preferences	Jing-An Sun et.al.	2505.22008	null
2025-05-28	D-Fusion: Direct Preference Optimization for Aligning Diffusion Models with Visually Consistent Samples	Zijing Hu et.al.	2505.22002	null
2025-05-26	MolEditRL: Structure-Preserving Molecular Editing via Discrete Diffusion and Reinforcement Learning	Yuanxin Zhuang et.al.	2505.20131	null
2025-05-26	Understanding Generalization in Diffusion Models via Probability Flow Distance	Huijie Zhang et.al.	2505.20123	null
2025-05-26	Refining Few-Step Text-to-Multiview Diffusion via Reinforcement Learning	Ziyi Zhang et.al.	2505.20107	link
2025-05-26	PAMD: Plausibility-Aware Motion Diffusion Model for Long Dance Generation	Hongsong Wang et.al.	2505.20056	null
2025-05-26	Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion	Zheqi Lv et.al.	2505.20053	link
2025-05-26	ICDM: Interference Cancellation Diffusion Models for Wireless Semantic Communications	Tong Wu et.al.	2505.19983	null
2025-05-26	UltraVSR: Achieving Ultra-Realistic Video Super-Resolution with Efficient One-Step Diffusion Space	Yong Liu et.al.	2505.19958	null
2025-05-26	Harnessing the Power of Training-Free Techniques in Text-to-2D Generation for Text-to-3D Generation via Score Distillation Sampling	Junhong Lee et.al.	2505.19868	null
2025-05-26	On a retarded stochastic system with discrete diffusion modeling life tables	Tomás Caraballo et.al.	2505.19835	null
2025-05-26	TeViR: Text-to-Video Reward with Diffusion Models for Efficient Reinforcement Learning	Yuhui Chen et.al.	2505.19769	null
2025-05-26	On some coupled local and nonlocal diffusion models	Juan Pablo Borthagaray et.al.	2505.19765	null
2025-05-27	SAIL: Self-supervised Albedo Estimation from Real Images with a Latent Diffusion Model	Hala Djeghim et.al.	2505.19751	null
2025-05-26	Extremum Flow Matching for Offline Goal Conditioned Reinforcement Learning	Quentin Rouxel et.al.	2505.19717	null
2025-05-26	Knowledge-Aligned Counterfactual-Enhancement Diffusion Perception for Unsupervised Cross-Domain Visual Emotion Recognition	Wen Yin et.al.	2505.19694	null
2025-05-26	Graph Guided Diffusion: Unified Guidance for Conditional Graph Generation	Victor M. Tenorio et.al.	2505.19685	null
2025-05-26	Calibrating Pre-trained Language Classifiers on LLM-generated Noisy Labels via Iterative Refinement	Liqin Ye et.al.	2505.19675	link
2025-05-26	ReDDiT: Rehashing Noise for Discrete Visual Generation	Tianren Ma et.al.	2505.19656	null
2025-05-26	Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment	Jeongsoo Choi et.al.	2505.19595	link
2025-05-26	On scalable and efficient training of diffusion samplers	Minkyu Kim et.al.	2505.19552	null
2025-05-26	Unlocking the Power of Diffusion Models in Sequential Recommendation: A Simple and Effective Approach	Jialei Chen et.al.	2505.19544	link
2025-05-22	When Are Concepts Erased From Diffusion Models?	Kevin Lu et.al.	2505.17013	link
2025-05-22	Guided Diffusion Sampling on Function Spaces with Applications to PDEs	Jiachen Yao et.al.	2505.17004	link
2025-05-22	Pursuing Temporal-Consistent Video Virtual Try-On via Dynamic Pose Interaction	Dong Li et.al.	2505.16980	null
2025-05-22	Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On	Siqi Wan et.al.	2505.16977	link
2025-05-22	Creatively Upscaling Images with Global-Regional Priors	Yurui Qian et.al.	2505.16976	null
2025-05-22	Bigger Isn’t Always Memorizing: Early Stopping Overparameterized Diffusion Models	Alessandro Favero et.al.	2505.16959	null
2025-05-22	LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning	Zebin You et.al.	2505.16933	null
2025-05-22	T2I-ConBench: Text-to-Image Benchmark for Continual Post-training	Zhehao Huang et.al.	2505.16875	null
2025-05-22	Training-Free Efficient Video Generation via Dynamic Token Carving	Yuechen Zhang et.al.	2505.16864	link
2025-05-22	Conditional Panoramic Image Generation via Masked Autoregressive Modeling	Chaoyang Wang et.al.	2505.16862	null
2025-05-23	LaViDa: A Large Diffusion Language Model for Multimodal Understanding	Shufan Li et.al.	2505.16839	link
2025-05-22	From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization	Haonian Ji et.al.	2505.16832	link
2025-05-22	SEED: Speaker Embedding Enhancement Diffusion Model	KiHyun Nam et.al.	2505.16798	link
2025-05-22	Learning Flexible Forward Trajectories for Masked Molecular Diffusion	Hyunjin Seo et.al.	2505.16790	null
2025-05-22	Forward-only Diffusion Probabilistic Models	Ziwei Luo et.al.	2505.16733	link
2025-05-22	Masked Conditioning for Deep Generative Models	Phillip Mueller et.al.	2505.16725	null
2025-05-22	Towards Coordinate- and Dimension-Agnostic Machine Learning for Partial Differential Equations	Trung V. Phan et.al.	2505.16549	null
2025-05-22	Joint Relational Database Generation via Graph-Conditional Diffusion Models	Mohamed Amine Ketata et.al.	2505.16527	null
2025-05-22	Beyond Face Swapping: A Diffusion-Based Digital Human Benchmark for Multimodal Deepfake Detection	Jiaxin Liu et.al.	2505.16512	null
2025-05-22	Consistent World Models via Foresight Diffusion	Yu Zhang et.al.	2505.16474	null
2025-05-19	Faster Video Diffusion with Trainable Sparse Attention	Peiyuan Zhang et.al.	2505.13389	null
2025-05-19	Restoration Score Distillation: From Corrupted Diffusion Pretraining to One-Step High-Quality Generation	Yasi Zhang et.al.	2505.13377	null
2025-05-20	Minimum-Excess-Work Guidance	Christopher Kolloff et.al.	2505.13375	null
2025-05-20	One-Step Offline Distillation of Diffusion-based Models via Koopman Modeling	Nimrod Berman et.al.	2505.13358	link
2025-05-19	FlowPure: Continuous Normalizing Flows for Adversarial Purification	Elias Collaert et.al.	2505.13280	link
2025-05-19	Seeing the Unseen: How EMoE Unveils Bias in Text-to-Image Diffusion Models	Lucas Berry et.al.	2505.13273	null
2025-05-19	Diffusion Models with Double Guidance: Generate with aggregated datasets	Yanfeng Yang et.al.	2505.13213	null
2025-05-19	Higher fidelity perceptual image and video compression with a latent conditioned residual denoising diffusion model	Jonas Brenig et.al.	2505.13152	link
2025-05-19	Neurosymbolic Diffusion Models	Emile van Krieken et.al.	2505.13138	link
2025-05-19	Constraint-Aware Diffusion Guidance for Robotics: Real-Time Obstacle Avoidance for Autonomous Racing	Hao Ma et.al.	2505.13131	null
2025-05-19	Touch2Shape: Touch-Conditioned 3D Diffusion for Shape Exploration and Reconstruction	Yuanbo Wang et.al.	2505.13091	null
2025-05-19	Anti-Inpainting: A Proactive Defense against Malicious Diffusion-based Inpainters under Unknown Conditions	Yimao Guo et.al.	2505.13023	null
2025-05-19	LatentINDIGO: An INN-Guided Latent Diffusion Algorithm for Image Restoration	Di You et.al.	2505.12935	null
2025-05-19	PhyDA: Physics-Guided Diffusion Models for Data Assimilation in Atmospheric Systems	Hao Wang et.al.	2505.12882	null
2025-05-19	Confidence-Regulated Generative Diffusion Models for Reliable AI Agent Migration in Vehicular Metaverses	Yingkai Kang et.al.	2505.12710	null
2025-05-19	CURE: Concept Unlearning via Orthogonal Representation Editing in Diffusion Models	Shristi Das Biswas et.al.	2505.12677	null
2025-05-19	Few-Step Diffusion via Score identity Distillation	Mingyuan Zhou et.al.	2505.12674	link
2025-05-19	Multi-View Wireless Sensing via Conditional Generative Learning: Framework and Model Design	Ziqing Xing et.al.	2505.12664	null
2025-05-19	MVPainter: Accurate and Detailed 3D Texture Generation via Multi-View Diffusion with Geometric Control	Mingqi Shao et.al.	2505.12635	null
2025-05-18	FreqSelect: Frequency-Aware fMRI-to-Image Reconstruction	Junliang Ye et.al.	2505.12552	null
2025-05-15	3D-Fixup: Advancing Photo Editing with 3D Priors	Yen-Chi Cheng et.al.	2505.10566	null
2025-05-15	Style Customization of Text-to-Vector Generation with Image Diffusion Priors	Peiying Zhang et.al.	2505.10558	null
2025-05-15	Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data	Yiwen Liu et.al.	2505.10551	link
2025-05-15	Pharmacophore-Conditioned Diffusion Model for Ligand-Based De Novo Drug Design	Amira Alakhdar et.al.	2505.10545	null
2025-05-15	Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps	Ningyuan Yang et.al.	2505.10482	null
2025-05-15	Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models	Zemin Huang et.al.	2505.10446	null
2025-05-15	Score-based diffusion nowcasting of GOES imagery	Randy J. Chase et.al.	2505.10432	null
2025-05-16	Whitened Score Diffusion: A Structured Prior for Imaging Inverse Problems	Jeffrey Alido et.al.	2505.10311	link
2025-05-15	FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation	Jun Guo et.al.	2505.10075	null
2025-05-15	ORL-LDM: Offline Reinforcement Learning Guided Latent Diffusion Model Super-Resolution Reconstruction	Shijie Lyu et.al.	2505.10027	null
2025-05-15	From Air to Wear: Personalized 3D Digital Fashion with AR/VR Immersive 3D Sketching	Ying Zang et.al.	2505.09998	null
2025-05-15	Ordered-subsets Multi-diffusion Model for Sparse-view CT Reconstruction	Pengfei Yu et.al.	2505.09985	null
2025-05-15	Improving the Euclidean Diffusion Generation of Manifold Data by Mitigating Score Function Singularity	Zichen Liu et.al.	2505.09922	null
2025-05-15	Diffusion-SAFE: Shared Autonomy Framework with Diffusion for Safe Human-to-Robot Driving Handover	Yunxin Fan et.al.	2505.09889	null
2025-05-15	Unsupervised Radar Point Cloud Enhancement via Arbitrary LiDAR Guided Diffusion Prior	Yanlong Yang et.al.	2505.09887	null
2025-05-14	Mission Balance: Generating Under-represented Class Samples using Video Diffusion Models	Danush Kumar Venkatesh et.al.	2505.09858	link
2025-05-14	On the Well-Posedness of Green’s Function Reconstruction via the Kirchhoff-Helmholtz Equation for One-Speed Neutron Diffusion	Roberto Ponciroli et.al.	2505.09766	null
2025-05-14	EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models	Hu Yue et.al.	2505.09694	link
2025-05-14	LightLab: Controlling Light Sources in Images with Diffusion Models	Nadav Magar et.al.	2505.09608	null
2025-05-14	Don’t Forget your Inverse DDIM for Image Editing	Guillermo Gomez-Trenado et.al.	2505.09571	null
2025-05-14	BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset	Jiuhai Chen et.al.	2505.09568	link
2025-05-14	Diffusion Recommender Models and the Illusion of Progress: A Concerning Study of Reproducibility and a Conceptual Mismatch	Michael Benigni et.al.	2505.09364	null
2025-05-14	Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis	Bingxin Ke et.al.	2505.09358	link
2025-05-14	TransDiffuser: End-to-end Trajectory Generation with Decorrelated Multi-modal Representation for Autonomous Driving	Xuefeng Jiang et.al.	2505.09315	null
2025-05-14	Generating Full-field Evolution of Physical Dynamics from Irregular Sparse Observations	Panqi Chen et.al.	2505.09284	null
2025-05-14	A Note on Semantic Diffusion	Alexander P. Ryjov et.al.	2505.09283	null
2025-05-14	Few-Shot Anomaly-Driven Generation for Anomaly Classification and Segmentation	Guan Gui et.al.	2505.09263	link
2025-05-15	Generating time-consistent dynamics with discriminator-guided image diffusion models	Philipp Hess et.al.	2505.09089	null
2025-05-13	Predictive Digital Twins with Quantified Uncertainty for Patient-Specific Decision Making in Oncology	Graham Pash et.al.	2505.08927	link
2025-05-15	IntrinsicEdit: Precise generative image manipulation in intrinsic space	Linjie Lyu et.al.	2505.08889	null
2025-05-13	Generative AI for Autonomous Driving: Frontiers and Opportunities	Yuping Wang et.al.	2505.08854	link
2025-05-13	Controllable Image Colorization with Instance-aware Texts and Masks	Yanru An et.al.	2505.08705	null
2025-05-13	Boosting Zero-shot Stereo Matching using Large-scale Mixed Images Sources in the Real World	Yuran Wang et.al.	2505.08607	null
2025-05-15	Diffusion-assisted Model Predictive Control Optimization for Power System Real-Time Operation	Linna Xu et.al.	2505.08535	null
2025-05-13	Building-Block Aware Generative Modeling for 3D Crystals of Metal Organic Frameworks	Chenru Duan et.al.	2505.08531	link
2025-05-14	Improving Data Fidelity via Diffusion Model-based Correction and Super-Resolution	Wuzhe Xu et.al.	2505.08526	null
2025-05-13	ConDiSim: Conditional Diffusion Models for Simulation Based Inference	Mayank Nautiyal et.al.	2505.08403	null
2025-05-13	Adaptive Diffusion Policy Optimization for Robotic Manipulation	Huiyun Jiang et.al.	2505.08376	null
2025-05-12	DanceGRPO: Unleashing GRPO on Visual Generation	Zeyue Xue et.al.	2505.07818	null
2025-05-12	Pixel Motion as Universal Representation for Robot Control	Kanchana Ranasinghe et.al.	2505.07817	null
2025-05-12	LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention	Jiangling Zhang et.al.	2505.07734	null
2025-05-12	ShotAdapter: Text-to-Multi-Shot Video Generation with Diffusion Models	Ozgur Kara et.al.	2505.07652	null
2025-05-12	Diffused Responsibility: Analyzing the Energy Consumption of Generative Text-to-Audio Diffusion Models	Riccardo Passoni et.al.	2505.07615	null
2025-05-12	Noise Optimized Conditional Diffusion for Domain Adaptation	Lingkun Luo et.al.	2505.07548	null
2025-05-12	Addressing degeneracies in latent interpolation for diffusion models	Erik Landolsi et.al.	2505.07481	null
2025-05-12	You Only Look One Step: Accelerating Backpropagation in Diffusion Sampling with Gradient Shortcuts	Hongkun Dou et.al.	2505.07477	link
2025-05-12	DiffCrysGen: A Score-Based Diffusion Model for Design of Diverse Inorganic Crystalline Materials	Sourav Mal et.al.	2505.07442	null
2025-05-12	Diffusion-driven SpatioTemporal Graph KANsformer for Medical Examination Recommendation	Jianan Li et.al.	2505.07431	null
2025-05-12	GAN-based synthetic FDG PET images from T1 brain MRI can serve to improve performance of deep unsupervised anomaly detection models	Daria Zotova et.al.	2505.07364	null
2025-05-11	Semantic-Guided Diffusion Model for Single-Step Image Super-Resolution	Zihang Liu et.al.	2505.07071	link
2025-05-11	DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models	Junhao Xia et.al.	2505.07057	null
2025-05-11	CMD: Controllable Multiview Diffusion for 3D Editing and Progressive Generation	Peng Li et.al.	2505.07003	null
2025-05-11	Replay-Based Continual Learning with Dual-Layered Distillation and a Streamlined U-Net for Efficient Text-to-Image Generation	Md. Naimur Asif Borno et.al.	2505.06995	null
2025-05-11	Unsupervised Learning for Class Distribution Mismatch	Pan Du et.al.	2505.06948	link
2025-05-11	Near-Field Channel Estimation for XL-MIMO: A Deep Generative Model Guided by Side Information	Zhenzhou Jin et.al.	2505.06900	null
2025-05-11	Image Classification Using a Diffusion Model as a Pre-Training Model	Kosuke Ukita et.al.	2505.06890	null
2025-05-11	Topology Guidance: Controlling the Outputs of Generative Models via Vector Field Topology	Xiaohan Wang et.al.	2505.06804	null
2025-05-11	HistDiST: Histopathological Diffusion-based Stain Transfer	Erik Großkopf et.al.	2505.06793	null
2025-05-08	SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation	Yonwoo Choi et.al.	2505.05475	link
2025-05-08	3D Scene Generation: A Survey	Beichen Wen et.al.	2505.05474	link
2025-05-08	DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion	Qitao Zhao et.al.	2505.05473	null
2025-05-08	Mogao: An Omni Foundation Model for Interleaved Multi-Modal Generation	Chao Liao et.al.	2505.05472	null
2025-05-08	Denoising Diffusion Probabilistic Models for Coastal Inundation Forecasting	Kazi Ashik Islam et.al.	2505.05381	null
2025-05-08	Diffusion Model Quantization: A Review	Qian Zeng et.al.	2505.05215	link
2025-05-08	EAM: Enhancing Anything with Diffusion Transformers for Blind Super-Resolution	Haizhen Xie et.al.	2505.05209	null
2025-05-08	Overcoming Dimensional Factorization Limits in Discrete Diffusion Models through Quantum Joint Distribution Learning	Chuangtao Chen et.al.	2505.05151	link
2025-05-08	Research on Anomaly Detection Methods Based on Diffusion Models	Yi Chen et.al.	2505.05137	null
2025-05-08	MDAA-Diff: CT-Guided Multi-Dose Adaptive Attention Diffusion Model for PET Denoising	Xiaolong Niu et.al.	2505.05112	null
2025-05-08	MDE-Edit: Masked Dual-Editing for Multi-Object Image Editing via Diffusion Models	Hongyang Zhu et.al.	2505.05101	null
2025-05-08	ItDPDM: Information-Theoretic Discrete Poisson Diffusion Model	Sagnik Bhattacharya et.al.	2505.05082	null
2025-05-08	PIDiff: Image Customization for Personalized Identities with Diffusion Models	Jinyu Gu et.al.	2505.05081	null
2025-05-08	Divide-and-Conquer: Cold-Start Bundle Recommendation via Mixture of Diffusion Experts	Ming Li et.al.	2505.05035	null
2025-05-08	SOAP: Style-Omniscient Animatable Portraits	Tingting Liao et.al.	2505.05022	link
2025-05-08	Inter-Diffusion Generation Model of Speakers and Listeners for Effective Communication	Jinhe Huang et.al.	2505.04996	null
2025-05-08	ReAlign: Bilingual Text-to-Motion Generation via Step-Aware Reward-Guided Alignment	Wanjiang Weng et.al.	2505.04974	null
2025-05-08	Graffe: Graph Representation Learning via Diffusion Probabilistic Models	Dingshuo Chen et.al.	2505.04956	null
2025-05-08	Accurate and Fast Channel Estimation for Fluid Antenna Systems with Diffusion Models	Erqiang Tang et.al.	2505.04930	null
2025-05-08	GlyphMastero: A Glyph Encoder for High-Fidelity Scene Text Editing	Tong Wang et.al.	2505.04915	null
2025-05-07	Score Distillation Sampling for Audio: Source Separation, Synthesis, and Beyond	Jessie Richter-Powell et.al.	2505.04621	null
2025-05-07	Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model	Pengfei Guo et.al.	2505.04522	null
2025-05-07	Efficient Flow Matching using Latent Variables	Anirban Samaddar et.al.	2505.04486	null
2025-05-07	Localized Diffusion Models for High Dimensional Distributions Generation	Georg A. Gottwald et.al.	2505.04417	null
2025-05-07	CountDiffusion: Text-to-Image Synthesis with Training-Free Counting-Guidance Diffusion	Yanyu Li et.al.	2505.04347	null
2025-05-07	MoDE: Mixture of Diffusion Experts for Any Occluded Face Recognition	Qiannan Fan et.al.	2505.04306	null
2025-05-07	TS-Diff: Two-Stage Diffusion Model for Low-Light RAW Image Enhancement	Yi Li et.al.	2505.04281	link
2025-05-07	HDiffTG: A Lightweight Hybrid Diffusion-Transformer-GCN Architecture for 3D Human Pose Estimation	Yajie Fu et.al.	2505.04276	link
2025-05-07	Bridging Geometry-Coherent Text-to-3D Generation with Multi-View Diffusion Priors and Gaussian Splatting	Feng Yang et.al.	2505.04262	null
2025-05-07	DiffPattern-Flex: Efficient Layout Pattern Generation via Discrete Diffusion	Zixiao Wang et.al.	2505.04173	null
2025-05-07	Person-In-Situ: Scene-Consistent Human Image Insertion with Occlusion-Aware Pose Control	Shun Masuda et.al.	2505.04052	null
2025-05-07	BuildingBlock: A Hybrid Approach for Structured Building Generation	Junming Huang et.al.	2505.04051	null
2025-05-07	TerraFusion: Joint Generation of Terrain Geometry and Texture Using Latent Diffusion Models	Kazuki Higo et.al.	2505.04050	null
2025-05-06	Diffusion Models are Secretly Exchangeable: Parallelizing DDPMs via Autospeculation	Hengyuan Hu et.al.	2505.03983	null
2025-05-06	nuGAN: Generative Adversarial Emulator for Cosmic Web with Neutrinos	Neerav Kaushal et.al.	2505.03936	null
2025-05-06	CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting	Huawei Sun et.al.	2505.03679	null
2025-05-06	Distribution-Conditional Generation: From Class Distribution to Creative Generation	Fu Feng et.al.	2505.03667	null
2025-05-06	Bounding Box-Guided Diffusion for Synthesizing Industrial Images and Segmentation Map	Alessandro Simoni et.al.	2505.03623	link
2025-05-07	PAHA: Parts-Aware Audio-Driven Human Animation with Diffusion Model	Y. B. Wang et.al.	2505.03603	null
2025-05-06	A Comprehensive Survey of Large AI Models for Future Communications: Foundations, Applications and Challenges	Feibo Jiang et.al.	2505.03556	link
2025-05-05	Towards Dataset Copyright Evasion Attack against Personalized Text-to-Image Diffusion Models	Kuofeng Gao et.al.	2505.02824	link
2025-05-05	Advancing Generalizable Tumor Segmentation with Anomaly-Aware Open-Vocabulary Attention Maps and Frozen Foundation Diffusion Models	Yankai Jiang et.al.	2505.02753	link
2025-05-06	MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation	Mingcheng Li et.al.	2505.02648	null
2025-05-06	Resolving Memorization in Empirical Diffusion Model for Manifold Data in High-Dimensional Spaces	Yang Lyu et.al.	2505.02508	null
2025-05-05	Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction	Biao Gong et.al.	2505.02471	link
2025-05-05	Predicting the Dynamics of Complex System via Multiscale Diffusion Autoencoder	Ruikun Li et.al.	2505.02450	null
2025-05-05	T2S: High-resolution Time Series Generation with Text-to-Series Diffusion Models	Yunfeng Ge et.al.	2505.02417	link
2025-05-04	Enhancing AI Face Realism: Cost-Efficient Quality Improvement in Distilled Diffusion Models with a Fully Synthetic Dataset	Jakub Wąsala et.al.	2505.02255	null
2025-05-04	Quantizing Diffusion Models from a Sampling-Aware Perspective	Qian Zeng et.al.	2505.02242	null
2025-05-06	Regression is all you need for medical image translation	Sebastian Rassmann et.al.	2505.02048	link
2025-05-03	Discrete Spatial Diffusion: Intensity-Preserving Diffusion Modeling	Javier E. Santos et.al.	2505.01917	null
2025-05-03	Rethinking Score Distilling Sampling for 3D Editing and Generation	Xingyu Miao et.al.	2505.01888	null
2025-05-03	DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion	Haoteng Li et.al.	2505.01857	null
2025-05-03	Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning	Jifeng Hu et.al.	2505.01822	null
2025-05-02	The DCR Delusion: Measuring the Privacy Risk of Synthetic Data	Zexi Yao et.al.	2505.01524	null
2025-05-02	WorldGenBench: A World-Knowledge-Integrated Benchmark for Reasoning-Driven Text-to-Image Generation	Daoan Zhang et.al.	2505.01490	null
2025-05-02	VIDSTAMP: A Temporally-Aware Watermark for Ownership and Integrity in Video Diffusion Models	Mohammadreza Teymoorianfard et.al.	2505.01406	link
2025-05-02	Provable Efficiency of Guidance in Diffusion Models for General Data Distribution	Gen Li et.al.	2505.01382	null
2025-05-02	FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors	Chenxi Li et.al.	2505.01322	null
2025-05-02	Model See Model Do: Speech-Driven Facial Animation with Style Control	Yifang Pan et.al.	2505.01319	null
2025-05-01	Controllable Weather Synthesis and Removal with Video Diffusion Models	Chih-Hao Lin et.al.	2505.00704	null
2025-05-01	GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution	Aditya Arora et.al.	2505.00687	null
2025-05-01	ParkDiffusion: Heterogeneous Multi-Agent Multi-Modal Trajectory Prediction for Automated Parking using Diffusion Models	Jiarong Wei et.al.	2505.00586	null
2025-05-01	Safety-Critical Traffic Simulation with Guided Latent Diffusion Model	Mingxing Peng et.al.	2505.00515	null
2025-05-01	Leveraging Pretrained Diffusion Models for Zero-Shot Part Assembly	Ruiyuan Zhang et.al.	2505.00426	null
2025-05-01	Denoising weak lensing mass maps with diffusion model: systematic comparison with generative adversarial network	Shohei D. Aoyama et.al.	2505.00345	null
2025-05-01	Quaternion Wavelet-Conditioned Diffusion Models for Image Super-Resolution	Luigi Sigillo et.al.	2505.00334	null
2025-04-30	Generative Multimodal Multiscale Data Fusion for Digital Twins in Aerosol Jet Electronics Printing	Fatemeh Elhambakhsh et.al.	2505.00176	null
2025-04-30	Materials discovery acceleration by using condition generative methodology	Caiyuan Ye et.al.	2505.00076	link
2025-04-30	ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction	Qihao Liu et.al.	2504.21855	null
2025-04-30	HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation	Haiyang Zhou et.al.	2504.21650	link
2025-04-30	Diffusion-based Adversarial Identity Manipulation for Facial Privacy Protection	Liqin Wang et.al.	2504.21646	null
2025-04-30	ODE and PDE models for COVID-19, with reinfection and vaccination process for Cameroon and Germany	Hamadjam Abboubakar et.al.	2504.21613	null
2025-04-30	Latent Feature-Guided Conditional Diffusion for High-Fidelity Generative Image Semantic Communication	Zehao Chen et.al.	2504.21577	null
2025-04-30	MagicPortrait: Temporally Consistent Face Reenactment with 3D Geometric Guidance	Mengting Wei et.al.	2504.21497	link
2025-04-30	DGSolver: Diffusion Generalist Solver with Universal Posterior Sampling for Image Restoration	Hebaixu Wang et.al.	2504.21487	link
2025-04-30	Diff-Prompt: Diffusion-Driven Prompt Generator with Mask Supervision	Weicai Yan et.al.	2504.21423	null
2025-04-30	IDDM: Bridging Synthetic-to-Real Domain Gap from Physics-Guided Diffusion for Real-world Image Dehazing	Shijun Zhou et.al.	2504.21385	null
2025-04-30	Sparse-to-Sparse Training of Diffusion Models	Inês Cardoso Oliveira et.al.	2504.21380	null
2025-04-30	Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing	Hong Zhang et.al.	2504.21356	link
2025-04-30	Text-Conditioned Diffusion Model for High-Fidelity Korean Font Generation	Abdul Sami et.al.	2504.21325	null
2025-04-30	Capturing Conditional Dependence via Auto-regressive Diffusion Models	Xunpeng Huang et.al.	2504.21314	null
2025-04-30	The Dual Power of Interpretable Token Embeddings: Jailbreaking Attacks and Defenses for Diffusion Model Unlearning	Siyi Chen et.al.	2504.21307	null
2025-04-30	Can We Achieve Efficient Diffusion without Self-Attention? Distilling Self-Attention into Convolutions	ZiYi Dong et.al.	2504.21292	null
2025-04-30	CoCoDiff: Diversifying Skeleton Action Features via Coarse-Fine Text-Co-Guided Latent Diffusion	Zhifu Zhao et.al.	2504.21266	null
2025-04-29	T2ID-CAS: Diffusion Model and Class Aware Sampling to Mitigate Class Imbalance in Neck Ultrasound Anatomical Landmark Detection	Manikanta Varaganti et.al.	2504.21231	null
2025-04-29	ProT-GFDM: A Generative Fractional Diffusion Model for Protein Generation	Xiao Liang et.al.	2504.21092	null
2025-04-29	Erased but Not Forgotten: How Backdoors Compromise Concept Erasure	Jonas Henry Grebe et.al.	2504.21072	null
2025-04-29	AI-GenBench: A New Ongoing Benchmark for AI-Generated Image Detection	Lorenzo Pellegrini et.al.	2504.20865	null
2025-04-28	DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images	Mamadou Keita et.al.	2504.19876	link
2025-04-28	CoherenDream: Boosting Holistic Text Coherence in 3D Generation via Multimodal Large Language Models Feedback	Chenhan Jiang et.al.	2504.19860	null
2025-04-28	Multimodal Conditioned Diffusive Time Series Forecasting	Chen Su et.al.	2504.19669	null
2025-04-28	Robot Motion Planning using One-Step Diffusion with Noise-Optimized Approximate Motions	Tomoharu Aizu et.al.	2504.19652	null
2025-04-28	AI Alignment in Medical Imaging: Unveiling Hidden Biases Through Counterfactual Analysis	Haroui Ma et.al.	2504.19621	link
2025-04-28	Image Generation Method Based on Heat Diffusion Models	Pengfei Zhang et.al.	2504.19600	null
2025-04-28	GenPTW: In-Generation Image Watermarking for Provenance Tracing and Tamper Localization	Zhenliang Gan et.al.	2504.19567	null
2025-04-28	SynergyAmodal: Deocclude Anything with Text Control	Xinyang Li et.al.	2504.19506	null
2025-04-28	Simultaneous Pick and Place Detection by Combining SE(3) Diffusion Models with Differential Kinematics	Tianyi Ko et.al.	2504.19502	null
2025-04-28	GTSD: Generative Text Steganography Based on Diffusion Model	Zhengxian Wu et.al.	2504.19433	null
2025-04-28	Boosting 3D Liver Shape Datasets with Diffusion Models and Implicit Neural Representations	Khoa Tuan Nguyen et.al.	2504.19402	null
2025-04-27	Sketch2Anim: Towards Transferring Sketch Storyboards into 3D Animation	Lei Zhong et.al.	2504.19189	null
2025-04-27	Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions	Mohammad Mahdi Abootorabi et.al.	2504.19056	link
2025-04-26	Learning Stochastic Thermodynamics Directly from Correlation and Trajectory-Fluctuation Currents	Jinghao Lyu et.al.	2504.19007	null
2025-04-26	REED-VAE: RE-Encode Decode Training for Iterative Image Editing with Diffusion Models	Gal Almog et.al.	2504.18989	link
2025-04-25	Dream-Box: Object-wise Outlier Generation for Out-of-Distribution Detection	Brian K. S. Isaac-Medina et.al.	2504.18746	null
2025-04-25	Appa: Bending Weather Dynamics with Latent Diffusion Models for Global Data Assimilation	Gérôme Andry et.al.	2504.18720	null
2025-04-25	SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations	Shuting Zhao et.al.	2504.18332	null
2025-04-25	STP4D: Spatio-Temporal-Prompt Consistent Modeling for Text-to-4D Gaussian Splatting	Yunze Deng et.al.	2504.18318	null
2025-04-25	Optimizing Multi-Round Enhanced Training in Diffusion Models for Improved Preference Understanding	Kun Li et.al.	2504.18204	null
2025-04-24	LiDPM: Rethinking Point Diffusion for Lidar Scene Completion	Tetiana Martyniuk et.al.	2504.17791	null
2025-04-24	Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models	Xu Ma et.al.	2504.17789	null
2025-04-24	polyGen: A Learning Framework for Atomic-level Polymer Structure Generation	Ayush Jain et.al.	2504.17656	null
2025-04-24	Beyond Labels: Zero-Shot Diabetic Foot Ulcer Wound Segmentation with Self-attention Diffusion Models and the Potential for Text-Guided Customization	Abderrachid Hamrani et.al.	2504.17628	null
2025-04-24	ESDiff: Encoding Strategy-inspired Diffusion Model with Few-shot Learning for Color Image Inpainting	Junyan Zhang et.al.	2504.17524	null
2025-04-24	3DV-TON: Textured 3D-Guided Consistent Video Try-on via Diffusion Models	Min Wei et.al.	2504.17414	null
2025-04-24	DRC: Enhancing Personalized Image Generation via Disentangled Representation Composition	Yiyan Xu et.al.	2504.17349	null
2025-04-24	CKMDiff: A Generative Diffusion Model for CKM Construction via Inverse Problems with Learned Priors	Shen Fu et.al.	2504.17323	null
2025-04-24	Towards Generalized and Training-Free Text-Guided Semantic Manipulation	Yu Hong et.al.	2504.17269	null
2025-04-24	DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks	Yinqi Li et.al.	2504.17253	link
2025-04-24	AUTHENTICATION: Identifying Rare Failure Modes in Autonomous Vehicle Perception Systems using Adversarially Guided Diffusion Models	Mohammad Zarei et.al.	2504.17179	null
2025-04-23	Physics-guided and fabrication-aware inverse design of photonic devices using diffusion models	Dongjin Seo et.al.	2504.17077	link
2025-04-23	Diffusion Probabilistic Models for Compressive SAR Imaging	Odysseas Pappas et.al.	2504.17053	null
2025-04-23	Practical approaches for crystal structure predictions with inpainting generation and universal interatomic potentials	Peichen Zhong et.al.	2504.16893	null
2025-04-23	Planning with Diffusion Models for Target-Oriented Dialogue Systems	Hanwen Du et.al.	2504.16858	null
2025-04-23	Physically Consistent Humanoid Loco-Manipulation using Latent Diffusion Models	Ilyass Taouil et.al.	2504.16843	null
2025-04-24	Simple Graph Contrastive Learning via Fractional-order Neural Diffusion Networks	Yanan Zhao et.al.	2504.16748	null
2025-04-23	MOSAIC: A Skill-Centric Algorithmic Framework for Long-Horizon Manipulation Planning	Itamar Mishani et.al.	2504.16738	null
2025-04-24	Hyper-Transforming Latent Diffusion Models	Ignacio Peis et.al.	2504.16580	null
2025-04-23	A Comprehensive Survey of Synthetic Tabular Data Generation	Ruxue Shi et.al.	2504.16506	link
2025-04-23	The Dance of Atoms-De Novo Protein Design with Diffusion Model	Yujie Qin et.al.	2504.16479	null
2025-04-23	Target Concrete Score Matching: A Holistic Framework for Discrete Diffusion	Ruixiang Zhang et.al.	2504.16431	null
2025-04-23	VideoMark: A Distortion-Free Robust Watermarking Framework for Video Diffusion Models	Xuming Hu et.al.	2504.16359	null
2025-04-22	SignX: The Foundation Model for Sign Recognition	Sen Fang et.al.	2504.16315	null
2025-04-22	Aerial Active STAR-RIS-assisted Satellite-Terrestrial Covert Communications	Chuang Zhang et.al.	2504.16146	null
2025-04-22	Survey of Video Diffusion Models: Foundations, Implementations, and Applications	Yimu Wang et.al.	2504.16081	link
2025-04-22	From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning	Le Zhuo et.al.	2504.16080	null
2025-04-22	Intent-aware Diffusion with Contrastive Learning for Sequential Recommendation	Yuanpeng Qu et.al.	2504.16077	link
2025-04-22	Boosting Generative Image Modeling via Joint Image-Feature Synthesis	Theodoros Kouzelis et.al.	2504.16064	null
2025-04-22	Efficient Temporal Consistency in Diffusion-Based Video Editing with Adaptor Modules: A Theoretical Framework	Xinyuan Song et.al.	2504.16016	null
2025-04-22	Adversarial Observations in Weather Forecasting	Erik Imgrund et.al.	2504.15942	link
2025-04-22	Text-based Animatable 3D Avatars with Morphable Model Alignment	Yiqian Wu et.al.	2504.15835	link
2025-04-22	Satellite to GroundScape – Large-scale Consistent Ground View Generation from Satellite Views	Ningli Xu et.al.	2504.15786	null
2025-04-21	Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction	Vaishnavh Nagarajan et.al.	2504.15266	link
2025-04-21	Bringing Diversity from Diffusion Models to Semantic-Guided Face Asset Generation	Yunxuan Cai et.al.	2504.15259	null
2025-04-21	DRAGON: Distributional Rewards Optimize Diffusion Generative Models	Yatong Bai et.al.	2504.15217	null
2025-04-21	FaceCraft4D: Animated 3D Facial Avatar Generation from a Single Image	Fei Yin et.al.	2504.15179	null
2025-04-21	DSPO: Direct Semantic Preference Optimization for Real-World Image Super-Resolution	Miaomiao Cai et.al.	2504.15176	null
2025-04-21	Automatic Generation of Aerobatic Flight in Complex Environments via Diffusion Models	Yuhang Zhong et.al.	2504.15138	null
2025-04-22	VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation	Mingxia Zhan et.al.	2504.15095	null
2025-04-21	Generative Artificial Intelligence for Beamforming in Low-Altitude Economy	Geng Sun et.al.	2504.15079	null
2025-04-21	SOLIDO: A Robust Watermarking Method for Speech Synthesis via Low-Rank Adaptation	Yue Li et.al.	2504.15035	null
2025-04-21	Gaussian Shading++: Rethinking the Realistic Deployment Challenge of Performance-Lossless Image Watermark for Diffusion Models	Zijin Yang et.al.	2504.15026	null
2025-04-21	PIV-FlowDiffuser: Transfer-learning-based denoising diffusion models for PIV	Qianyu Zhu et.al.	2504.14952	link
2025-04-21	TWIG: Two-Step Image Generation using Segmentation Masks in Diffusion Models	Mazharul Islam Rakib et.al.	2504.14933	null
2025-04-21	What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale	Xiaoyong Yuan et.al.	2504.14815	null
2025-04-21	When Cloud Removal Meets Diffusion Model in Remote Sensing	Zhenyu Yu et.al.	2504.14785	null
2025-04-21	Novel Concept-Oriented Synthetic Data approach for Training Generative AI-Driven Crystal Grain Analysis Using Diffusion Model	Ahmed Sobhi Saleh et.al.	2504.14782	null
2025-04-20	Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens	Kaihang Pan et.al.	2504.14666	null
2025-04-20	REDEditing: Relationship-Driven Precise Backdoor Poisoning on Text-to-Image Diffusion Models	Chongye Guo et.al.	2504.14554	null
2025-04-20	FlowLoss: Dynamic Flow-Conditioned Loss Strategy for Video Diffusion Models	Kuanting Wu et.al.	2504.14535	null
2025-04-20	SUDO: Enhancing Text-to-Image Diffusion Models with Self-Supervised Direct Preference Optimization	Liang Peng et.al.	2504.14534	link
2025-04-20	DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning	Fulong Ye et.al.	2504.14509	link
2025-04-17	Personalized Text-to-Image Generation with Auto-Regressive Models	Kaiyue Sun et.al.	2504.13162	link
2025-04-17	UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models	Guanlong Jiao et.al.	2504.13109	null
2025-04-18	SkyReels-V2: Infinite-length Film Generative Model	Guibin Chen et.al.	2504.13074	link
2025-04-17	TTRD3: Texture Transfer Residual Denoising Dual Diffusion Model for Remote Sensing Image Super-Resolution	Yide Liu et.al.	2504.13026	link
2025-04-17	Image-Editing Specialists: An RLAIF Approach for Diffusion Models	Elior Benarous et.al.	2504.12833	link
2025-04-17	Privacy Protection Against Personalized Text-to-Image Synthesis via Cross-image Consistency Constraints	Guanyu Wang et.al.	2504.12747	null
2025-04-17	A0: An Affordance-Aware Hierarchical Model for General Robotic Manipulation	Rongtao Xu et.al.	2504.12636	null
2025-04-17	Packing Input Frame Context in Next-Frame Prediction Models for Video Generation	Lvmin Zhang et.al.	2504.12626	link
2025-04-17	Prompt-Driven and Training-Free Forgetting Approach and Dataset for Large Language Models	Zhenyu Yu et.al.	2504.12574	null
2025-04-16	Generalization through variance: how noise shapes inductive biases in diffusion models	John J. Vastola et.al.	2504.12532	link
2025-04-16	Diffusion Based Robust LiDAR Place Recognition	Benjamin Krummenacher et.al.	2504.12412	null
2025-04-16	Cobra: Efficient Line Art COlorization with BRoAder References	Junhao Zhuang et.al.	2504.12240	null
2025-04-16	Coding-Prior Guided Diffusion Network for Video Deblurring	Yike Liu et.al.	2504.12222	null
2025-04-16	Anti-Aesthetics: Protecting Facial Privacy against Customized Text-to-Image Synthesis	Songping Wang et.al.	2504.12129	null
2025-04-16	A Diffusion-Based Framework for Terrain-Aware Remote Sensing Image Reconstruction	Zhenyu Yu et.al.	2504.12112	null
2025-04-16	Generalized Visual Relation Detection with Diffusion Models	Kaifeng Gao et.al.	2504.12100	null
2025-04-16	Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM	Zirui Pan et.al.	2504.12048	null
2025-04-17	Understanding Attention Mechanism in Video Diffusion Models	Bingyan Liu et.al.	2504.12027	null
2025-04-17	Dual-Energy Cone-Beam CT Using Two Orthogonal Projection Views: A Phantom Study	Junbo Peng et.al.	2504.12010	null
2025-04-16	Generative Recommendation with Continuous-Token Diffusion	Haohao Qu et.al.	2504.12007	null
2025-04-16	R-Meshfusion: Reinforcement Learning Powered Sparse-View Mesh Reconstruction with Diffusion Priors	Haoyang Wang et.al.	2504.11946	null
2025-04-16	SemDiff: Generating Natural Unrestricted Adversarial Examples via Semantic Attributes Optimization in Diffusion Models	Zeyu Dai et.al.	2504.11923	null
2025-04-16	A Bidirectional DeepParticle Method for Efficiently Solving Low-dimensional Transport Map Problems	Tan Zhang et.al.	2504.11851	null
2025-04-16	ACE: Attentional Concept Erasure in Diffusion Models	Finn Carter et.al.	2504.11850	null
2025-04-16	TextDiffSeg: Text-guided Latent Diffusion Model for 3d Medical Images Segmentation	Kangbo Ma et.al.	2504.11825	null
2025-04-16	PCDiff: Proactive Control for Ownership Protection in Diffusion Models with Watermark Compatibility	Keke Gai et.al.	2504.11774	null
2025-04-16	EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos	Jilan Xu et.al.	2504.11732	null
2025-04-16	Towards Safe Synthetic Image Generation On the Web: A Multimodal Robust NSFW Defense and Million Scale Dataset	Muhammad Shahid Muneer et.al.	2504.11707	link
2025-04-16	DM-OSVP++: One-Shot View Planning Using 3D Diffusion Models for Active RGB-Based Object Reconstruction	Sicong Pan et.al.	2504.11674	link
2025-04-15	Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception	Ziqi Pang et.al.	2504.11457	link
2025-04-16	Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion	An Zhao et.al.	2504.11447	link
2025-04-14	REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers	Xingjian Leng et.al.	2504.10483	null
2025-04-14	Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing	Taihang Hu et.al.	2504.10434	link
2025-04-14	MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model	Jian Liu et.al.	2504.10433	link
2025-04-14	Improving diffusion modeling in all-solid-state lithium batteries: a novel approach for grain boundary effects	Lena Scholz et.al.	2504.10348	null
2025-04-14	DiffMOD: Progressive Diffusion Point Denoising for Moving Object Detection in Remote Sensing	Jinyue Zhang et.al.	2504.10278	null
2025-04-14	Efficient Generative Model Training via Embedded Representation Warmup	Deyuan Liu et.al.	2504.10188	link
2025-04-14	NaviDiffusor: Cost-Guided Diffusion Model for Visual Navigation	Yiming Zeng et.al.	2504.10003	null
2025-04-15	OctGPT: Octree-based Multiscale Autoregressive Models for 3D Shape Generation	Si-Tong Wei et.al.	2504.09975	link
2025-04-14	Semi-implicit-explicit Runge-Kutta method for nonlinear differential equations	Lingyun Ding et.al.	2504.09969	link
2025-04-14	Efficient Task-specific Conditional Diffusion Policies: Shortcut Model Acceleration and SO(3) Optimization	Haiyong Yu et.al.	2504.09927	null
2025-04-14	Separate to Collaborate: Dual-Stream Diffusion Model for Coordinated Piano Hand Motion Synthesis	Zihao Liu et.al.	2504.09885	null
2025-04-14	EquiVDM: Equivariant Video Diffusion Models with Temporally Consistent Noise	Chao Liu et.al.	2504.09789	null
2025-04-13	Stochastic generative methods for stable and accurate closure modeling of chaotic dynamical systems	Emily Williams et.al.	2504.09750	null
2025-04-13	SPICE: A Synergistic, Precise, Iterative, and Customizable Image Editing Workflow	Kenan Tang et.al.	2504.09697	link
2025-04-13	Early-Bird Diffusion: Investigating and Leveraging Timestep-Aware Early-Bird Tickets in Diffusion Models for Efficient Training	Lexington Whalen et.al.	2504.09606	null
2025-04-13	Mitigating Long-tail Distribution in Oracle Bone Inscriptions: Dataset, Model, and Benchmark	Jinhao Li et.al.	2504.09555	null
2025-04-13	DiffuMural: Restoring Dunhuang Murals with Multi-scale Diffusion	Puyu Han et.al.	2504.09513	null
2025-04-13	CamMimic: Zero-Shot Image To Camera Motion Personalized Video Generation Using Diffusion Models	Pooja Guhan et.al.	2504.09472	null
2025-04-13	D$^2$iT: Dynamic Diffusion Transformer for Accurate Image Generation	Weinan Jia et.al.	2504.09454	null
2025-04-13	Structure-Accurate Medical Image Translation based on Dynamic Frequency Balance and Knowledge Guidance	Jiahua Xu et.al.	2504.09441	null
2025-04-10	Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction	Zeren Jiang et.al.	2504.07961	link
2025-04-10	VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning	Zhong-Yu Li et.al.	2504.07960	null
2025-04-10	GenEAva: Generating Cartoon Avatars with Fine-Grained Facial Expressions from Realistic Diffusion-based Faces	Hao Yu et.al.	2504.07945	null
2025-04-10	Optimal Control For Anti-Abeta Treatment in Alzheimer’s Disease using a Reaction-Diffusion Model	Wenrui Hao et.al.	2504.07913	null
2025-04-10	Revisiting Likelihood-Based Out-of-Distribution Detection by Modeling Representations	Yifan Ding et.al.	2504.07793	link
2025-04-10	Virtual-mask Informed Prior for Sparse-view Dual-Energy CT Reconstruction	Zini Chen et.al.	2504.07753	null
2025-04-10	PhaseGen: A Diffusion-Based Approach for Complex-Valued MRI Data Generation	Moritz Rempe et.al.	2504.07560	link
2025-04-10	STeP: A General and Scalable Framework for Solving Video Inverse Problems with Spatiotemporal Diffusion Priors	Bingliang Zhang et.al.	2504.07549	link
2025-04-10	A mass conserved reaction-diffusion system reveals switching between coexisting polar and oscillatory cell motility states	Jack M. Hughes et.al.	2504.07446	null
2025-04-10	Unifying and extending Diffusion Models through PDEs for solving Inverse Problems	Agnimitra Dasgupta et.al.	2504.07437	null
2025-04-10	Conditional Data Synthesis Augmentation	Xinyu Tian et.al.	2504.07426	null
2025-04-10	Routing to the Right Expertise: A Trustworthy Judge for Instruction-based Image Editing	Chenxi Sun et.al.	2504.07424	null
2025-04-10	ID-Booth: Identity-consistent Face Generation with Diffusion Models	Darian Tomašević et.al.	2504.07392	link
2025-04-10	Novel Diffusion Models for Multimodal 3D Hand Trajectory Prediction	Junyi Ma et.al.	2504.07375	link
2025-04-09	MoEDiff-SR: Mixture of Experts-Guided Diffusion Model for Region-Adaptive MRI Super-Resolution	Zhe Wang et.al.	2504.07308	link
2025-04-09	MESA: Text-Driven Terrain Generation Using Latent Diffusion and Global Copernicus Data	Paul Borne–Pons et.al.	2504.07210	link
2025-04-09	Latent Diffusion U-Net Representations Contain Positional Embeddings and Anomalies	Jonas Loos et.al.	2504.07008	link
2025-04-09	PathSegDiff: Pathology Segmentation using Diffusion model representations	Sachin Kumar Danisetty et.al.	2504.06950	null
2025-04-09	MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs	Jiawei Mao et.al.	2504.06897	null
2025-04-09	EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation	Diljeet Jagpal et.al.	2504.06861	null
2025-04-09	CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading	Mishan Aliev et.al.	2504.06856	null
2025-04-09	DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation	Wangbo Zhao et.al.	2504.06803	link
2025-04-09	DIMA: DIffusing Motion Artifacts for unsupervised correction in brain MRI images	Paolo Angella et.al.	2504.06767	null
2025-04-10	Compass Control: Multi Object Orientation Control for Text-to-Image Generation	Rishubh Parihar et.al.	2504.06752	null
2025-04-09	Probability Density Geodesics in Image Diffusion Latent Space	Qingtao Yu et.al.	2504.06675	null
2025-04-09	RAGME: Retrieval Augmented Video Generation for Enhanced Motion Realism	Elia Peruzzo et.al.	2504.06672	null
2025-04-09	Diffusion Factor Models: Generating High-Dimensional Returns with Factor Structure	Minshuo Chen et.al.	2504.06566	link
2025-04-09	DiffusionCom: Structure-Aware Multimodal Diffusion Model for Multimodal Knowledge Graph Completion	Wei Huang et.al.	2504.06543	null
2025-04-08	D-Feat Occlusions: Diffusion Features for Robustness to Partial Visual Occlusions in Object Recognition	Rupayan Mallick et.al.	2504.06432	null
2025-04-08	Unifying Autoregressive and Diffusion-Based Sequence Generation	Nima Fathi et.al.	2504.06416	null
2025-04-08	Transfer between Modalities with MetaQueries	Xichen Pan et.al.	2504.06256	null
2025-04-08	OSDM-MReg: Multimodal Image Registration based One Step Diffusion Model	Xiaochen Wei et.al.	2504.06027	null
2025-04-08	CamContextI2V: Context-aware Controllable Video Generation	Luis Denninger et.al.	2504.06022	link
2025-04-08	An Empirical Study of GPT-4o Image Generation Capabilities	Sixiang Chen et.al.	2504.05979	link
2025-04-08	Diffusion Based Ambiguous Image Segmentation	Jakob Lønborg Christensen et.al.	2504.05977	null
2025-04-08	Physics-aware generative models for turbulent fluid flows through energy-consistent stochastic interpolants	Nikolaj T. Mücke et.al.	2504.05852	link
2025-04-07	CREA: A Collaborative Multi-Agent Framework for Creative Content Generation with Diffusion Models	Kavana Venkatesh et.al.	2504.05306	null
2025-04-07	Gaussian Mixture Flow Matching Models	Hansheng Chen et.al.	2504.05304	link
2025-04-07	Dimension-Free Convergence of Diffusion Models for Approximate Gaussian Mixtures	Gen Li et.al.	2504.05300	null
2025-04-07	DA2Diff: Exploring Degradation-aware Adaptive Diffusion Priors for All-in-One Weather Restoration	Jiamei Xiong et.al.	2504.05135	null
2025-04-07	Graph-based Diffusion Model for Collaborative Filtering	Xuan Zhang et.al.	2504.05029	null
2025-04-08	REWIND: Real-Time Egocentric Whole-Body Motion Diffusion with Exemplar-Based Identity Conditioning	Jihyun Lee et.al.	2504.04956	null
2025-04-08	TabRep: a Simple and Effective Continuous Representation for Training Tabular Diffusion Models	Jacob Si et.al.	2504.04798	link
2025-04-07	Disentangling Instruction Influence in Diffusion Transformers for Parallel Multi-Instruction-Guided Image Editing	Hui Liu et.al.	2504.04784	null
2025-04-07	Continuous Locomotive Crowd Behavior Generation	Inhwan Bae et.al.	2504.04756	link
2025-04-07	Unsupervised Estimation of Nonlinear Audio Effects: Comparing Diffusion-Based and Adversarial approaches	Eloi Moliner et.al.	2504.04751	null
2025-04-06	Diffusion-Based Approximate MPC: Fast and Consistent Imitation of Multi-Modal Action Distributions	Pau Marquez Julbe et.al.	2504.04603	null
2025-04-08	Your Image Generator Is Your New Private Dataset	Nicolo Resmini et.al.	2504.04582	null
2025-04-06	Cramer-Rao Bounds for Laplacian Matrix Estimation	Morad Halihal et.al.	2504.04576	null
2025-04-06	BrainMRDiff: A Diffusion Model for Anatomically Consistent Brain MRI Synthesis	Moinak Bhattacharya et.al.	2504.04532	null
2025-04-06	PRISM: Probabilistic Representation for Integrated Shape Modeling and Generation	Lei Cheng et.al.	2504.04454	null
2025-04-06	From Coarse to Fine: A Physics-Informed Self-Guided Flow Diffusion Model	Ruoyan Li et.al.	2504.04375	null
2025-04-06	DDPT: Diffusion-Driven Prompt Tuning for Large Language Model Code Generation	Jinyang Li et.al.	2504.04351	null
2025-04-05	Multi-resolution Score-Based Variational Graphical Diffusion for Causal Disaster System Modeling and Inference	Xuechun Li et.al.	2504.04015	link
2025-04-05	DiTaiListener: Controllable High Fidelity Listener Video Generation with Diffusion	Maksim Siniukov et.al.	2504.04010	null
2025-04-04	Enhancing Causal Effect Estimation with Diffusion-Generated Data	Li Chen et.al.	2504.03630	null
2025-04-03	Concept Lancet: Image Editing with Compositional Representation Transplant	Jinqi Luo et.al.	2504.02828	null
2025-04-03	F-ViTA: Foundation Model Guided Visible to Thermal Translation	Jay N. Paranjape et.al.	2504.02801	link
2025-04-03	Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model	Shengjun Zhang et.al.	2504.02764	null
2025-04-03	MD-ProjTex: Texturing 3D Shapes with Multi-Diffusion Projection	Ahmet Burak Yildirim et.al.	2504.02762	null
2025-04-04	RBT4DNN: Requirements-based Testing of Neural Networks	Nusrat Jahan Mozumder et.al.	2504.02737	link
2025-04-03	RoSMM: A Robust and Secure Multi-Modal Watermarking Framework for Diffusion Models	ZhongLi Fang et.al.	2504.02640	null
2025-04-03	Bridging the Gap between Gaussian Diffusion Models and Universal Quantization for Image Compression	Lucas Relic et.al.	2504.02579	null
2025-04-03	MAD: Makeup All-in-One with Cross-Domain Diffusion Model	Bo-Kai Ruan et.al.	2504.02545	null
2025-04-03	Translation of Fetal Brain Ultrasound Images into Pseudo-MRI Images using Artificial Intelligence	Naomi Silverstein et.al.	2504.02408	null
2025-04-03	Marine Saliency Segmenter: Object-Focused Conditional Diffusion with Region-Level Semantic Knowledge Distillation	Laibin Chang et.al.	2504.02391	null
2025-04-03	OmniCam: Unified Multimodal Video Generation via Camera Control	Xiaoda Yang et.al.	2504.02312	null
2025-04-03	WonderTurbo: Generating Interactive 3D World in 0.72 Seconds	Chaojun Ni et.al.	2504.02261	null
2025-04-02	FreSca: Unveiling the Scaling Space in Diffusion Models	Chao Huang et.al.	2504.02154	null
2025-04-02	Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis	Niluthpol Chowdhury Mithun et.al.	2504.01960	null
2025-04-03	VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step	Hanyang Wang et.al.	2504.01956	null
2025-04-02	A Unified Approach to Analysis and Design of Denoising Markov Models	Yinuo Ren et.al.	2504.01938	null
2025-04-03	ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement	Runhui Huang et.al.	2504.01934	null
2025-04-02	Multi-fidelity Parameter Estimation Using Conditional Diffusion Models	Caroline Tatsuoka et.al.	2504.01894	null
2025-04-02	A Diffusion-Based Framework for Occluded Object Movement	Zheng-Peng Duan et.al.	2504.01873	null
2025-04-02	Implicit Bias Injection Attacks against Text-to-Image Diffusion Models	Huayang Huang et.al.	2504.01819	link
2025-04-02	The protein escape process at the ribosomal exit tunnel has conserved mechanisms across the domains of life	Phuong Thuy Bui et.al.	2504.01731	null
2025-04-02	InvFussion: Bridging Supervised and Zero-shot Diffusion for Inverse Problems	Noam Elata et.al.	2504.01689	link
2025-04-02	Instance Migration Diffusion for Nuclear Instance Segmentation in Pathology	Lirui Qi et.al.	2504.01577	null
2025-04-02	Semi-Supervised Biomedical Image Segmentation via Diffusion Models and Teacher-Student Co-Training	Luca Ciampi et.al.	2504.01547	link
2025-04-02	Hyperbolic Diffusion Recommender Model	Meng Yuan et.al.	2504.01541	null
2025-04-02	Domain Guidance: A Simple Transfer Approach for a Pre-trained Diffusion Model	Jincheng Zhong et.al.	2504.01521	link
2025-04-02	From Easy to Hard: Building a Shortcut for Differentially Private Image Synthesis	Kecen Li et.al.	2504.01395	link
2025-04-02	Safeguarding Vision-Language Models: Mitigating Vulnerabilities to Gaussian Noise in Perturbation-based Attacks	Jiawei Wang et.al.	2504.01308	link
2025-04-01	Prompting Forgetting: Unlearning in GANs via Textual Guidance	Piyush Nagasubramaniam et.al.	2504.01218	null
2025-04-01	Articulated Kinematics Distillation from Video Diffusion Models	Xuan Li et.al.	2504.01204	null
2025-04-01	Towards Sign Distance Function based Metamaterial Design: Neural Operator Transformer for Forward Prediction and Diffusion Models for Inverse Design	Qibang Liu et.al.	2504.01195	link
2025-04-01	Neural Approaches to SAT Solving: Design Choices and Interpretability	David Mojžíšek et.al.	2504.01173	null
2025-04-01	MixerMDM: Learnable Composition of Human Motion Diffusion Models	Pablo Ruiz-Ponce et.al.	2504.01019	null
2025-03-31	Enhancing Image Resolution of Solar Magnetograms: A Latent Diffusion Model Approach	Francesco Pio Ramunno et.al.	2503.24271	link
2025-04-01	Visual Acoustic Fields	Yuelei Li et.al.	2503.24270	null
2025-03-31	Controlled Latent Diffusion Models for 3D Porous Media Reconstruction	Danilo Naiff et.al.	2503.24083	link
2025-03-31	DenseFormer: Learning Dense Depth Map from Sparse Depth and Image via Conditional Diffusion Model	Ming Yuan et.al.	2503.23993	null
2025-03-31	JointTuner: Appearance-Motion Adaptive Joint Training for Customized Video Generation	Fangda Chen et.al.	2503.23951	null
2025-03-31	DiffuSE: Cross-Layer Design Space Exploration of DNN Accelerator via Diffusion-Driven Optimization	Yi Ren et.al.	2503.23945	null
2025-03-31	Training-Free Text-Guided Image Editing with Visual Autoregressive Model	Yufei Wang et.al.	2503.23897	link
2025-03-31	DiffScale: Continuous Downscaling and Bias Correction of Subseasonal Wind Speed Forecasts using Diffusion Models	Maximilian Springenberg et.al.	2503.23893	null
2025-03-31	MuseFace: Text-driven Face Editing via Diffusion-based Mask Generation Approach	Xin Zhang et.al.	2503.23888	null
2025-03-31	ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image	Tianyi Gong et.al.	2503.23881	null
2025-03-31	Biologically Inspired Spiking Diffusion Model with Adaptive Lateral Selection Mechanism	Linghao Feng et.al.	2503.23767	null
2025-03-31	StrokeFusion: Vector Sketch Generation via Joint Stroke-UDF Encoding and Latent Sequence Diffusion	Jin Zhou et.al.	2503.23752	null
2025-03-31	Effective Cloud Removal for Remote Sensing Images by an Improved Mean-Reverting Denoising Model with Elucidated Design Space	Yi Liu et.al.	2503.23717	link
2025-03-31	Expanding-and-Shrinking Binary Neural Networks	Xulong Shi et.al.	2503.23709	link
2025-03-31	Bayesian Inference for a Time-Fractional HIV Model with Nonlinear Diffusion	Mohamed BenSalah et.al.	2503.23638	null
2025-03-30	Language-Guided Trajectory Traversal in Disentangled Stable Diffusion Latent Space for Factorized Medical Image Generation	Zahra TehraniNasab et.al.	2503.23623	null
2025-03-30	Make Autoregressive Great Again: Diffusion-Free Graph Generation with Next-Scale Prediction	Samuel Belkadi et.al.	2503.23612	null
2025-03-30	DiT4SR: Taming Diffusion Transformer for Real-World Image Super-Resolution	Zheng-Peng Duan et.al.	2503.23580	null
2025-03-30	Enhancing Creative Generation on Stable Diffusion-based Models	Jiyeon Han et.al.	2503.23538	link
2025-03-30	Diffusion Meets Few-shot Class Incremental Learning	Junsu Kim et.al.	2503.23402	null
2025-03-27	VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models	Chi-Pin Huang et.al.	2503.21781	null
2025-03-27	StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion	Ziyu Guo et.al.	2503.21775	null
2025-03-27	Optimal Stepsize for Diffusion Sampling	Jianning Pei et.al.	2503.21774	link
2025-03-27	Exploring the Evolution of Physics Cognition in Video Generation: A Survey	Minghui Lin et.al.	2503.21765	link
2025-03-27	Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data	Zhiyuan Ma et.al.	2503.21694	link
2025-03-27	Audio-driven Gesture Generation via Deviation Feature in the Latent Space	Jiahui Chen et.al.	2503.21616	null
2025-03-27	Critical Iterative Denoising: A Discrete Generative Model Applied to Graphs	Yoann Boget et.al.	2503.21592	null
2025-03-27	AlignDiff: Learning Physically-Grounded Camera Alignment via Diffusion	Liuyue Xie et.al.	2503.21581	null
2025-03-27	SyncSDE: A Probabilistic Framework for Diffusion Synchronization	Hyunjun Lee et.al.	2503.21555	null
2025-03-28	LOCATEdit: Graph Laplacian Optimized Cross Attention for Localized Text-Guided Image Editing	Achint Soni et.al.	2503.21541	link
2025-03-27	Nonlinear Stability of Large-Period Traveling Waves Bifurcating from the Heteroclinic Loop in the FitzHugh-Nagumo Equation	Ji Li et.al.	2503.21509	null
2025-03-27	Invert2Restore: Zero-Shot Degradation-Blind Image Restoration	Hamadi Chihaoui et.al.	2503.21486	null
2025-03-27	Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving	Lucas Nunes et.al.	2503.21449	link
2025-03-27	Exploring the flavor structure of leptons via diffusion models	Satsuki Nishimura et.al.	2503.21432	null
2025-03-27	Diffusion Image Prior	Hamadi Chihaoui et.al.	2503.21410	null
2025-03-27	HORT: Monocular Hand-held Objects Reconstruction with Transformers	Zerui Chen et.al.	2503.21313	null
2025-03-27	GenFusion: Closing the Loop between Reconstruction and Generation via Videos	Sibo Wu et.al.	2503.21219	null
2025-03-27	ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model	Jinwei Qi et.al.	2503.21144	null
2025-03-27	Can Video Diffusion Model Reconstruct 4D Geometry?	Jinjie Mai et.al.	2503.21082	null
2025-03-27	Efficient Multi-Instance Generation with Janus-Pro-Dirven Prompt Parsing	Fan Qi et.al.	2503.21069	null
2025-03-26	Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency	Tianqi Liu et.al.	2503.20785	link
2025-03-26	FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks	Jinwei Li et.al.	2503.20784	link
2025-03-26	RecTable: Fast Modeling Tabular Data with Rectified Flow	Masane Fuchi et.al.	2503.20731	link
2025-03-26	Dynamic Motion Blending for Versatile Motion Editing	Nan Jiang et.al.	2503.20724	null
2025-03-26	ARMO: Autoregressive Rigging for Multi-Category Objects	Mingze Sun et.al.	2503.20663	null
2025-03-26	MMGen: Unified Multi-modal Image Generation and Understanding in One Go	Jiepeng Wang et.al.	2503.20644	null
2025-03-26	Stochastic Transport Maps in Diffusion Models and Sampling	Xicheng Zhang et.al.	2503.20573	null
2025-03-26	Exploring Robustness of Cortical Morphometry in the presence of white matter lesions, using Diffusion Models for Lesion Filling	Vinzenz Uhr et.al.	2503.20571	null
2025-03-26	TD-BFR: Truncated Diffusion Model for Efficient Blind Face Restoration	Ziying Zhang et.al.	2503.20537	null
2025-03-26	Contrastive Learning Guided Latent Diffusion Model for Image-to-Image Translation	Qi Si et.al.	2503.20484	null
2025-03-26	Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability	Yingdong Shi et.al.	2503.20483	null
2025-03-26	Latent Beam Diffusion Models for Decoding Image Sequences	Guilherme Fernandes et.al.	2503.20429	null
2025-03-26	ITA-MDT: Image-Timestep-Adaptive Masked Diffusion Transformer Framework for Image-Based Virtual Try-On	Ji Woo Hong et.al.	2503.20418	null
2025-03-27	Consistency Trajectory Matching for One-Step Generative Super-Resolution	Weiyi You et.al.	2503.20349	null
2025-03-26	EGVD: Event-Guided Video Diffusion Model for Physically Realistic Large-Motion Frame Interpolation	Ziran Zhang et.al.	2503.20268	link
2025-03-26	Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models	Prin Phunyaphibarn et.al.	2503.20240	null
2025-03-26	Automated UI Interface Generation via Diffusion Models: Enhancing Personalization and Efficiency	Yifei Duan et.al.	2503.20229	null
2025-03-26	Video Motion Graphs	Haiyang Liu et.al.	2503.20218	null
2025-03-26	Beyond Words: Advancing Long-Text Image Generation via Multimodal Autoregressive Models	Alex Jinpeng Wang et.al.	2503.20198	null
2025-03-26	AIGC-assisted Federated Learning for Edge Intelligence: Architecture Design, Research Challenges and Future Directions	Xianke Qiang et.al.	2503.20166	link
2025-03-24	Target-Aware Video Diffusion Models	Taeksoo Kim et.al.	2503.18950	null
2025-03-24	Training-free Diffusion Acceleration with Bottleneck Sampling	Ye Tian et.al.	2503.18940	null
2025-03-24	SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction	Enrico Pallotta et.al.	2503.18933	link
2025-03-24	Dual-domain Multi-path Self-supervised Diffusion Model for Accelerated MRI Reconstruction	Yuxuan Zhang et.al.	2503.18836	null
2025-03-24	Thermalizer: Stable autoregressive neural emulation of spatiotemporal chaos	Chris Pedersen et.al.	2503.18731	null
2025-03-24	Human Motion Unlearning	Edoardo De Matteis et.al.	2503.18674	null
2025-03-24	Dig2DIG: Dig into Diffusion Information Gains for Image Fusion	Bing Cao et.al.	2503.18627	null
2025-03-24	Generative Dataset Distillation using Min-Max Diffusion Model	Junqiao Fan et.al.	2503.18626	null
2025-03-24	Unified Uncertainty-Aware Diffusion for Multi-Agent Trajectory Modeling	Guillem Capellera et.al.	2503.18589	null
2025-03-24	Adapting Video Diffusion Models for Time-Lapse Microscopy	Alexander Holmberg et.al.	2503.18583	link
2025-03-25	AMD-Hummingbird: Towards an Efficient Text-to-Video Model	Takashi Isobe et.al.	2503.18559	link
2025-03-24	EvAnimate: Event-conditioned Image-to-Video Generation for Human Animation	Qiang Qu et.al.	2503.18552	null
2025-03-24	Discriminative protein sequence modelling with Latent Space Diffusion	Eoin Quinn et.al.	2503.18551	null
2025-03-24	DiN: Diffusion Model for Robust Medical VQA with Semantic Noisy Labels	Erjian Guo et.al.	2503.18536	null
2025-03-25	AIM2PC: Aerial Image to 3D Building Point Cloud Reconstruction	Soulaimene Turki et.al.	2503.18527	null
2025-03-24	Uncertainty-guided Perturbation for Image Super-Resolution Diffusion Model	Leheng Zhang et.al.	2503.18512	null
2025-03-24	Hiding Images in Diffusion Models by Editing Learned Score Functions	Haoyu Chen et.al.	2503.18459	null
2025-03-24	InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model Alignment	Yunhong Lu et.al.	2503.18454	link
2025-03-25	Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models	Jinho Jeong et.al.	2503.18446	link
2025-03-24	Panorama Generation From NFoV Image Done Right	Dian Zheng et.al.	2503.18420	link
2025-03-20	DreamTexture: Shape from Virtual Texture with Analysis by Augmentation	Ananta R. Bhattarai et.al.	2503.16412	null
2025-03-20	VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness	SeungJu Cha et.al.	2503.16406	link
2025-03-20	ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos	Haolin Yang et.al.	2503.16400	null
2025-03-20	Scale-wise Distillation of Diffusion Models	Nikita Starodubcev et.al.	2503.16397	null
2025-03-21	SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation	Chun-Han Yao et.al.	2503.16396	null
2025-03-20	Do Visual Imaginations Improve Vision-and-Language Navigation Agents?	Akhil Perincherry et.al.	2503.16394	null
2025-03-20	LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images	Leyang Wang et.al.	2503.16376	null
2025-03-20	Heat transfer and mixing in initiated Chemical Vapor Deposition analyzed by in-situ gas composition sensing	Simon Shindler et.al.	2503.16373	null
2025-03-20	Ultra-Resolution Adaptation with Ease	Ruonan Yu et.al.	2503.16322	link
2025-03-20	Unleashing Vecset Diffusion Model for Fast Shape Generation	Zeqiang Lai et.al.	2503.16302	link
2025-03-20	Diffusion-augmented Graph Contrastive Learning for Collaborative Filter	Fan Huang et.al.	2503.16290	null
2025-03-20	SceneMI: Motion In-betweening for Modeling Human-Scene Interactions	Inwoo Hwang et.al.	2503.16289	null
2025-03-21	Uni-3DAR: Unified 3D Generation and Understanding via Autoregression on Compressed Spatial Tokens	Shuqi Lu et.al.	2503.16278	link
2025-03-20	Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts	Yu Cao et.al.	2503.16218	null
2025-03-20	Improving Discriminator Guidance in Diffusion Models	Alexandre Verine et.al.	2503.16117	null
2025-03-20	Universal class of exactly solvable diffusions from space-time transformations	Costantino Di Bello et.al.	2503.16090	null
2025-03-20	Shining Yourself: High-Fidelity Ornaments Virtual Try-on with Diffusion Model	Yingmao Miao et.al.	2503.16065	null
2025-03-20	Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts	Yike Yuan et.al.	2503.16057	null
2025-03-20	Animating the Uncaptured: Humanoid Mesh Animation with Video Diffusion Models	Marc Benedí San Millán et.al.	2503.15996	null
2025-03-20	A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli	Pengyu Liu et.al.	2503.15978	null
2025-03-19	FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers	Ruichen Chen et.al.	2503.15465	link
2025-03-19	Di$\mathtt{[M]}$O: Distilling Masked Diffusion Models into One-step Generator	Yuanzhi Zhu et.al.	2503.15457	null
2025-03-19	MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space	Lixing Xiao et.al.	2503.15451	null
2025-03-19	Visual Persona: Foundation Model for Full-Body Human Customization	Jisu Nam et.al.	2503.15406	null
2025-03-19	CCDP: Composition of Conditional Diffusion Policies with Guided Sampling	Amirreza Razmjoo et.al.	2503.15386	null
2025-03-19	Material Decomposition in Photon-Counting Computed Tomography with Diffusion Models: Comparative Study and Hybridization with Variational Regularizers	Corentin Vazia et.al.	2503.15383	null
2025-03-19	Euclid Quick Data Release (Q1). Active galactic nuclei identification using diffusion-based inpainting of Euclid VIS images	Euclid Collaboration et.al.	2503.15321	null
2025-03-19	Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization	Feifei Li et.al.	2503.15197	null
2025-03-19	Single-Step Bidirectional Unpaired Image Translation Using Implicit Bridge Consistency Distillation	Suhyeon Lee et.al.	2503.15056	null
2025-03-19	Exploiting Diffusion Prior for Real-World Image Dehazing with Unpaired Training	Yunwei Lan et.al.	2503.15017	link
2025-03-19	Taming Flow Matching with Unbalanced Optimal Transport into Fast Pansharpening	Zihan Cao et.al.	2503.14975	null
2025-03-19	Language-based Image Colorization: A Benchmark and Beyond	Yifan Li et.al.	2503.14974	link
2025-03-19	Ultrasound Image-to-Video Synthesis via Latent Dynamic Diffusion Models	Tingxiu Chen et.al.	2503.14966	link
2025-03-19	POSTA: A Go-to Framework for Customized Artistic Poster Generation	Haoyu Chen et.al.	2503.14908	null
2025-03-19	FetalFlex: Anatomy-Guided Diffusion Model for Flexible Control on Fetal Ultrasound Image Synthesis	Yaofei Duan et.al.	2503.14906	null
2025-03-19	Efficient Personalization of Quantized Diffusion Model without Backpropagation	Hoigi Seo et.al.	2503.14868	null
2025-03-19	Temporal-Consistent Video Restoration with Pre-trained Diffusion Models	Hengkang Wang et.al.	2503.14863	null
2025-03-19	Curiosity-Diffuser: Curiosity Guide Diffusion Models for Reliability	Zihao Liu et.al.	2503.14833	link
2025-03-18	ShapeShift: Towards Text-to-Shape Arrangement Synthesis with Content-Aware Geometric Constraints	Vihaan Misra et.al.	2503.14720	null
2025-03-18	A Simple Combination of Diffusion Models for Better Quality Trade-Offs in Image Denoising	Jonas Dornbusch et.al.	2503.14654	null
2025-03-17	One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation	Daniil Selikhanovych et.al.	2503.13358	null
2025-03-17	Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors	Katja Schwarz et.al.	2503.13272	null
2025-03-17	FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis	Luxi Chen et.al.	2503.13265	null
2025-03-17	MedLoRD: A Medical Low-Resource Diffusion Model for High-Resolution 3D CT Image Synthesis	Marvin Seyfarth et.al.	2503.13211	null
2025-03-17	Patient-specific radiomic feature selection with reconstructed healthy persona of knee MR images	Yaxi Chen et.al.	2503.13131	null
2025-03-17	DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and Geometry	Jing Li et.al.	2503.13110	link
2025-03-17	Beyond Classical Diffusion: Fractional Derivatives in Transport and Stochastic Systems	Cypres Verbeeck et.al.	2503.13096	null
2025-03-17	TFDM: Time-Variant Frequency-Based Point Cloud Diffusion with Mamba	Jiaxu Liu et.al.	2503.13004	null
2025-03-17	Training Video Foundation Models with NVIDIA NeMo	Zeeshan Patel et.al.	2503.12964	null
2025-03-17	Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait	Chaolong Yang et.al.	2503.12963	link
2025-03-17	Frame-wise Conditioning Adaptation for Fine-Tuning Diffusion Models in Text-to-Video Prediction	Zheyuan Liu et.al.	2503.12953	null
2025-03-17	FNSE-SBGAN: Far-field Speech Enhancement with Schrodinger Bridge and Generative Adversarial Networks	Tong Lei et.al.	2503.12936	link
2025-03-17	AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction	Xuying Zhang et.al.	2503.12929	null
2025-03-17	DreamLayer: Simultaneous Multi-Layer Generation via Diffusion Mode	Junjia Huang et.al.	2503.12838	null
2025-03-17	VasTSD: Learning 3D Vascular Tree-state Space Diffusion Model for Angiography Synthesis	Zhifeng Wang et.al.	2503.12758	null
2025-03-16	UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing	Tsu-Jui Fu et.al.	2503.12652	null
2025-03-16	Understanding Driver Cognition and Decision-Making Behaviors in High-Risk Scenarios: A Drift Diffusion Perspective	Heye Huang et.al.	2503.12637	null
2025-03-16	LATINO-PRO: LAtent consisTency INverse sOlver with PRompt Optimization	Alessio Spagnoletti et.al.	2503.12615	null
2025-03-16	BalancedDPO: Adaptive Multi-Metric Alignment	Dipesh Tamboli et.al.	2503.12575	null
2025-03-16	Diffusion on Graph: Augmentation of Graph Structure for Node Classification	Yancheng Wang et.al.	2503.12563	null
2025-03-13	GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing	Rongyao Fang et.al.	2503.10639	link
2025-03-13	Studying Classifier(-Free) Guidance From a Classifier-Centric Perspective	Xiaoming Zhao et.al.	2503.10638	null
2025-03-14	Distilling Diversity and Control in Diffusion Models	Rohit Gandikota et.al.	2503.10637	null
2025-03-13	HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model	Jiaming Liu et.al.	2503.10631	null
2025-03-13	NIL: No-data Imitation Learning by Leveraging Pre-trained Video Diffusion Models	Mert Albaba et.al.	2503.10626	null
2025-03-13	DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation	Chen Chen et.al.	2503.10618	null
2025-03-13	MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction	Yingshuang Zou et.al.	2503.10604	null
2025-03-13	CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models	Hao He et.al.	2503.10592	null
2025-03-13	Long Context Tuning for Video Generation	Yuwei Guo et.al.	2503.10589	null
2025-03-13	Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion	Evgeniia Vu et.al.	2503.10488	null
2025-03-13	CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance	Yufan Deng et.al.	2503.10391	null
2025-03-13	Enhancing Facial Privacy Protection via Weakening Diffusion Purification	Ali Salar et.al.	2503.10350	link
2025-03-13	DreamInsert: Zero-Shot Image-to-Video Object Insertion from A Single Image	Qi Zhao et.al.	2503.10342	null
2025-03-13	CoDiPhy: A General Framework for Applying Denoising Diffusion Models to the Physical Layer of Wireless Communication Systems	Peyman Neshaastegaran et.al.	2503.10297	null
2025-03-13	Efficient Diffusion Posterior Sampling for Noisy Inverse Problems	Ji Li et.al.	2503.10237	null
2025-03-13	Probability-Flow ODE in Infinite-Dimensional Function Spaces	Kunwoo Na et.al.	2503.10219	null
2025-03-13	Data augmentation using diffusion models to enhance inverse Ising inference	Yechan Lim et.al.	2503.10154	null
2025-03-13	Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation	Yi Wu et.al.	2503.10125	null
2025-03-13	Improving Diffusion-based Inverse Algorithms under Few-Step Constraint via Learnable Linear Extrapolation	Jiawei Zhang et.al.	2503.10103	link
2025-03-13	Light-weighted foundation model for seismic data processing based on representative and non-redundant pre-training dataset	Xintong Dong et.al.	2503.10092	null
2025-03-12	PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop	Chenyu Li et.al.	2503.09595	link
2025-03-12	Minimax Optimality of the Probability Flow ODE for Diffusion Models	Changxiao Cai et.al.	2503.09583	null
2025-03-12	Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models	Marianne Arriola et.al.	2503.09573	link
2025-03-12	TPDiff: Temporal Pyramid Video Diffusion Model	Lingmin Ran et.al.	2503.09566	null
2025-03-12	FCaS: Fine-grained Cardiac Image Synthesis based on 3D Template Conditional Diffusion Model	Jiahao Xia et.al.	2503.09560	null
2025-03-12	CM-Diff: A Single Generative Network for Bidirectional Cross-Modality Translation Diffusion Model Between Infrared and Visible Images	Bin Hu et.al.	2503.09514	null
2025-03-12	DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction	Junjie Zhou et.al.	2503.09491	link
2025-03-12	Sparse Autoencoder as a Zero-Shot Classifier for Concept Erasing in Text-to-Image Diffusion Models	Zhihua Tian et.al.	2503.09446	link
2025-03-12	SuperCarver: Texture-Consistent 3D Geometry Super-Resolution for High-Fidelity Surface Detail Generation	Qijian Zhang et.al.	2503.09439	null
2025-03-12	Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space	Yifan Zhou et.al.	2503.09419	link
2025-03-12	Diff-CL: A Novel Cross Pseudo-Supervision Method for Semi-supervised Medical Image Segmentation	Xiuzhen Guo et.al.	2503.09408	null
2025-03-12	UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer	Haoxuan Wang et.al.	2503.09277	null
2025-03-12	Active Learning Inspired ControlNet Guidance for Augmenting Semantic Segmentation Datasets	Hannah Kniesel et.al.	2503.09221	null
2025-03-12	Reangle-A-Video: 4D Video Generation as Video-to-Video Translation	Hyeonho Jeong et.al.	2503.09151	null
2025-03-12	Spiritus: An AI-Assisted Tool for Creating 2D Characters and Animations	Qirui Sun et.al.	2503.09127	null
2025-03-12	AdvAD: Exploring Non-Parametric Diffusion for Imperceptible Adversarial Attacks	Jin Li et.al.	2503.09124	null
2025-03-12	Sequential Multi-Object Grasping with One Dexterous Hand	Sicheng He et.al.	2503.09078	null
2025-03-12	Theoretical Guarantees for High Order Trajectory Refinement in Generative Flows	Chengyue Gong et.al.	2503.09069	null
2025-03-11	SICNav-Diffusion: Safe and Interactive Crowd Navigation with Diffusion Trajectory Predictions	Sepehr Samavi et.al.	2503.08858	null
2025-03-11	GarmentCrafter: Progressive Novel View Synthesis for Single-View 3D Garment Reconstruction and Editing	Yuanhao Wang et.al.	2503.08678	null
2025-03-10	Denoising Score Distillation: From Noisy Diffusion Pretraining to One-Step High-Quality Generation	Tianyu Chen et.al.	2503.07578	null
2025-03-11	Inductive Moment Matching	Linqi Zhou et.al.	2503.07565	null
2025-03-10	DRESS: Diffusion Reasoning-based Reward Shaping Scheme For Intelligent Networks	Feiran You et.al.	2503.07433	link
2025-03-10	AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion	Mingzhen Sun et.al.	2503.07418	null
2025-03-10	TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision	Shaobin Zhuang et.al.	2503.07416	null
2025-03-10	SPEED: Scalable, Precise, and Efficient Concept Erasure for Diffusion Models	Ouxiang Li et.al.	2503.07392	link
2025-03-10	PersonaBooth: Personalized Text-to-Motion Generation	Boeun Kim et.al.	2503.07390	null
2025-03-10	TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models	Ruidong Chen et.al.	2503.07389	link
2025-03-10	AttenST: A Training-Free Attention-Driven Style Transfer Framework with Pre-Trained Diffusion Models	Bo Huang et.al.	2503.07307	link
2025-03-10	Efficient Distillation of Classifier-Free Guidance using Adapters	Cristian Perez Jensen et.al.	2503.07274	link
2025-03-11	AnomalyPainter: Vision-Language-Diffusion Synergy for Zero-Shot Realistic and Diverse Industrial Anomaly Synthesis	Zhangyu Lai et.al.	2503.07253	null
2025-03-11	Boosting Diffusion-Based Text Image Super-Resolution Model Towards Generalized Real-World Scenarios	Chenglu Pan et.al.	2503.07232	null
2025-03-10	Synthetic Lung X-ray Generation through Cross-Attention and Affinity Transformation	Ruochen Pi et.al.	2503.07209	null
2025-03-10	Effective and Efficient Masked Image Generation Models	Zebin You et.al.	2503.07197	link
2025-03-10	Ideas in Inference-time Scaling can Benefit Generative Pre-training Algorithms	Jiaming Song et.al.	2503.07154	null
2025-03-10	Controllable 3D Outdoor Scene Generation via Scene Graphs	Yuheng Liu et.al.	2503.07152	link
2025-03-10	VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation	Hanzhi Chen et.al.	2503.07135	null
2025-03-10	TIDE: Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation	Victor Shea-Jay Huang et.al.	2503.07050	null
2025-03-10	Recovering Partially Corrupted Major Objects through Tri-modality Based Image Completion	Yongle Zhang et.al.	2503.07047	null
2025-03-10	EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer	Yuxuan Zhang et.al.	2503.07027	null
2025-03-06	Compositional World Knowledge leads to High Utility Synthetic data	Sachit Gaudi et.al.	2503.04687	null
2025-03-06	The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation	Aoxiong Yin et.al.	2503.04606	link
2025-03-06	How to Move Your Dragon: Text-to-Motion Synthesis for Large-Vocabulary Objects	Wonkwang Lee et.al.	2503.04257	null
2025-03-06	Synthetic Data is an Elegant GIFT for Continual Vision-Language Models	Bin Wu et.al.	2503.04229	null
2025-03-06	Energy-Guided Optimization for Personalized Image Editing with Pretrained Text-to-Image Diffusion Models	Rui Jiang et.al.	2503.04215	null
2025-03-06	CoFinDiff: Controllable Financial Diffusion Model for Time Series Generation	Yuki Tanaka et.al.	2503.04164	null
2025-03-07	Diff-Reg v2: Diffusion-Based Matching Matrix Estimation for Image Matching and 3D Registration	Qianliang Wu et.al.	2503.04127	null
2025-03-06	FREAK: Frequency-modulated High-fidelity and Real-time Audio-driven Talking Portrait Synthesis	Ziqi Ni et.al.	2503.04067	null
2025-03-06	RA-DP: Rapid Adaptive Diffusion Policy for Training-Free High-frequency Robotics Replanning	Xi Ye et.al.	2503.04051	null
2025-03-06	Underlying Semantic Diffusion for Effective and Efficient In-Context Learning	Zhong Ji et.al.	2503.04050	null
2025-03-06	Beyond Existance: Fulfill 3D Reconstructed Scenes with Pseudo Details	Yifei Gao et.al.	2503.04037	null
2025-03-06	TextDoctor: Unified Document Image Inpainting via Patch Pyramid Diffusion Models	Wanglong Lu et.al.	2503.04021	null
2025-03-05	All-atom Diffusion Transformers: Unified generative modelling of molecules and materials	Chaitanya K. Joshi et.al.	2503.03965	link
2025-03-05	Generative Learning of Densities on Manifolds	Dimitris G. Giovanis et.al.	2503.03963	null
2025-03-05	GuardDoor: Safeguarding Against Malicious Diffusion Editing via Protective Backdoors	Yaopei Zeng et.al.	2503.03944	null
2025-03-05	A non-homogeneous, non-stationary and path-dependent Markov anomalous diffusion model	Nestor Barraza et.al.	2503.03896	null
2025-03-05	Metallicity Gradients in Modern Cosmological Simulations I: Tension Between Smooth Stellar Feedback Models and Observations	Alex M. Garcia et.al.	2503.03804	null
2025-03-05	Rethinking Video Tokenization: A Conditioned Diffusion-based Approach	Nianzu Yang et.al.	2503.03708	link
2025-03-05	DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance	Zhao Yang et.al.	2503.03689	link
2025-03-05	Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias	Rui Lu et.al.	2503.03595	null
2025-03-05	Generative Artificial Intelligence in Robotic Manipulation: A Survey	Kun Zhang et.al.	2503.03464	null
2025-03-05	Top-K Maximum Intensity Projection Priors for 3D Liver Vessel Segmentation	Xiaotong Zhang et.al.	2503.03367	null
2025-03-05	Video Super-Resolution: All You Need is a Video Diffusion Model	Zhihao Zhan et.al.	2503.03355	null
2025-03-05	Optimizing for the Shortest Path in Denoising Diffusion Model	Ping Chen et.al.	2503.03265	link
2025-03-05	GenColor: Generative Color-Concept Association in Visual Design	Yihan Hou et.al.	2503.03236	null
2025-03-05	Mocap-2-to-3: Lifting 2D Diffusion-Based Pretrained Models for 3D Motion Capture	Zhumei Wang et.al.	2503.03222	null
2025-03-05	An Analytical Theory of Power Law Spectral Bias in the Learning Dynamics of Diffusion Models	Binxu Wang et.al.	2503.03206	null
2025-03-05	WarmFed: Federated Learning with Warm-Start for Globalization and Personalization Via Personalized Diffusion Models	Tao Feng et.al.	2503.03110	null
2025-03-05	From Architectural Sketch to Conceptual Representation: Using Structure-Aware Diffusion Model to Generate Renderings of School Buildings	Zhengyang Wang et.al.	2503.03090	null
2025-03-05	Multi-View Depth Consistent Image Generation Using Generative AI Models: Application on Architectural Design of University Buildings	Xusheng Du et.al.	2503.03068	null
2025-03-04	Can Diffusion Models Provide Rigorous Uncertainty Quantification for Bayesian Inverse Problems?	Evan Scope Crafts et.al.	2503.03007	link
2025-03-04	Diverse Controllable Diffusion Policy with Signal Temporal Logic	Yue Meng et.al.	2503.02924	link
2025-03-04	Straight-Line Diffusion Model for Efficient 3D Molecular Generation	Yuyan Ni et.al.	2503.02918	link
2025-03-04	Generating Reliable Initial Velocity Models for Full-waveform Inversion with Well and Structural Constraints	Qingchen Zhang et.al.	2503.02815	null
2025-03-04	StageDesigner: Artistic Stage Generation for Scenography via Theater Scripts	Zhaoxing Gan et.al.	2503.02595	null
2025-03-04	TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping	Xinying Hong et.al.	2503.02578	link
2025-03-04	SPG: Improving Motion Diffusion by Smooth Perturbation Guidance	Boseong Jeon et.al.	2503.02577	null
2025-02-28	Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos	Zhiyu Tan et.al.	2502.21314	null
2025-02-28	Does Generation Require Memorization? Creative Diffusion Models using Ambient Diffusion	Kulin Shah et.al.	2502.21278	null
2025-02-28	A Review on Generative AI For Text-To-Image and Image-To-Image Generation and Implications To Scientific Images	Zineb Sordo et.al.	2502.21151	null
2025-02-28	Generative Uncertainty in Diffusion Models	Metod Jazbec et.al.	2502.20946	null
2025-02-28	DiffBrush: Just Painting the Art by Your Hands	Jiaming Chu et.al.	2502.20904	null
2025-02-28	CADDreamer: CAD object Generation from Single-view Images	Yuan Li et.al.	2502.20732	null
2025-02-28	Diffusion Restoration Adapter for Real-World Image Restoration	Hanbang Liang et.al.	2502.20679	null
2025-02-28	Wavelet-based density sketching with functional hierarchical tensor	Xun Tang et.al.	2502.20655	null
2025-02-28	Gungnir: Exploiting Stylistic Features in Images for Backdoor Attacks on Diffusion Models	Yu Pan et.al.	2502.20650	link
2025-02-28	T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting	Yifei Qian et.al.	2502.20625	null
2025-02-27	Unifying Model Predictive Path Integral Control, Reinforcement Learning, and Diffusion Models for Optimal Control and Planning	Yankai Li et.al.	2502.20476	null
2025-02-27	Tight Inversion: Image-Conditioned Inversion for Real Image Editing	Edo Kadosh et.al.	2502.20376	null
2025-02-27	Constrained Generative Modeling with Manually Bridged Diffusion Models	Saeid Naderiparizi et.al.	2502.20371	null
2025-02-27	FlexVAR: Flexible Visual Autoregressive Modeling without Residual Prediction	Siyu Jiao et.al.	2502.20313	link
2025-02-27	Mobius: Text to Seamless Looping Video Generation via Latent Shift	Xiuli Bi et.al.	2502.20307	link
2025-02-27	Explainable, Multi-modal Wound Infection Classification from Images Augmented with Generated Captions	Palawat Busaranuvong et.al.	2502.20277	null
2025-02-27	Attention Distillation: A Unified Approach to Visual Characteristics Transfer	Yang Zhou et.al.	2502.20235	link
2025-02-27	Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think	Liang Chen et.al.	2502.20172	link
2025-02-27	Scalability of the second-order reliability method for stochastic differential equations with multiplicative noise	Timo Schorlepp et.al.	2502.20114	null
2025-02-27	Generative augmentations for improved cardiac ultrasound segmentation using diffusion models	Gilles Van De Vyver et.al.	2502.20100	link
2025-02-27	Image Referenced Sketch Colorization Based on Animation Creation Workflow	Dingkun Yan et.al.	2502.19937	link
2025-02-27	DiffCSS: Diverse and Expressive Conversational Speech Synthesis with Diffusion Models	Weihao Wu et.al.	2502.19924	null
2025-02-27	High-Fidelity Relightable Monocular Portrait Animation with Lighting-Controllable Video Diffusion Model	Mingtao Guo et.al.	2502.19894	link
2025-02-27	C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation	Yuhao Li et.al.	2502.19868	link
2025-02-27	One-for-More: Continual Diffusion Model for Anomaly Detection	Xiaofan Li et.al.	2502.19848	link
2025-02-27	Analyzing CLIP’s Performance Limitations in Multi-Object Scenarios: A Controlled High-Resolution Study	Reza Abbasi et.al.	2502.19828	null
2025-02-27	Implicit Search via Discrete Diffusion: A Study on Chess	Jiacheng Ye et.al.	2502.19805	link
2025-02-27	UIFace: Unleashing Inherent Model Capabilities to Enhance Intra-Class Diversity in Synthetic Face Recognition	Xiao Lin et.al.	2502.19803	link
2025-02-27	MFSR: Multi-fractal Feature for Super-resolution Reconstruction with Fine Details Recovery	Lianping Yang et.al.	2502.19797	null
2025-02-27	Finding Local Diffusion Schrödinger Bridge using Kolmogorov-Arnold Network	Xingyu Qiu et.al.	2502.19754	link
2025-02-27	Recent Advances on Generalizable Diffusion-generated Image Detection	Qijie Xu et.al.	2502.19716	link
2025-02-26	HDM: Hybrid Diffusion Model for Unified Image Anomaly Detection	Zekang Weng et.al.	2502.19200	null
2025-02-26	RetinaRegen: A Hybrid Model for Readability and Detail Restoration in Fundus Images	Yuhan Tang et.al.	2502.19153	null
2025-02-26	Modulation of the galactic cosmic ray spectrum in an anisotropic diffusion approach	V. D. Borisov et.al.	2502.19062	null
2025-02-26	A Dual-Purpose Framework for Backdoor Defense and Backdoor Amplification in Diffusion Models	Vu Tuan Truong Long et.al.	2502.19047	null
2025-02-26	DualSpec: Text-to-spatial-audio Generation via Dual-Spectrogram Guided Diffusion Model	Lei Zhao et.al.	2502.18952	null
2025-02-26	Physics-Aware Inverse Design for Nanowire Single-Photon Avalanche Detectors via Deep Learning	Boyang Zhang et.al.	2502.18857	null
2025-02-26	Optimal Stochastic Trace Estimation in Generative Modeling	Xinyang Liu et.al.	2502.18808	null
2025-02-26	Ptychographic Image Reconstruction from Limited Data via Score-Based Diffusion Models with Physics-Guidance	Refik Mert Cam et.al.	2502.18767	null
2025-02-25	Adaptive conditional latent diffusion maps beam loss to 2D phase space projections	Alexander Scheinker et.al.	2502.18684	null
2025-02-25	Diffusion Models for conditional MRI generation	Miguel Herencia García del Castillo et.al.	2502.18620	null
2025-02-25	K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs	Ziheng Ouyang et.al.	2502.18461	null
2025-02-25	ToMCAT: Theory-of-Mind for Cooperative Agents in Teams via Multiagent Diffusion Policies	Pedro Sequeira et.al.	2502.18438	null
2025-02-25	LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation	Pengzhi Li et.al.	2502.18302	null
2025-02-25	Synthesizing Consistent Novel Views via 3D Epipolar Attention without Re-Training	Botao Ye et.al.	2502.18219	null
2025-02-25	Training Consistency Models with Variational Noise Coupling	Gianluigi Silvestri et.al.	2502.18197	link
2025-02-25	Multi-Perspective Data Augmentation for Few-shot Object Detection	Anh-Khoa Nguyen Vu et.al.	2502.18195	link
2025-02-25	Joint Reconstruction of Spatially-Coherent and Realistic Clothed Humans and Objects from a Single Image	Ayushi Dutta et.al.	2502.18150	null
2025-02-25	PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching	Han Nie et.al.	2502.18104	link
2025-02-25	Robust Polyp Detection and Diagnosis through Compositional Prompt-Guided Diffusion Models	Jia Yu et.al.	2502.17951	link
2025-02-25	3D Anatomical Structure-guided Deep Learning for Accurate Diffusion Microstructure Imaging	Xinrui Ma et.al.	2502.17933	null
2025-02-24	GCC: Generative Color Constancy via Diffusing a Color Checker	Chen-Wei Chang et.al.	2502.17435	null
2025-02-24	S4S: Solving for a Diffusion Model Solver	Eric Frankel et.al.	2502.17423	null
2025-02-24	X-Dancer: Expressive Music to Human Dance Video Generation	Zeyuan Chen et.al.	2502.17414	null
2025-02-24	AnyTop: Character Animation Diffusion with Any Topology	Inbar Gat et.al.	2502.17327	link
2025-02-24	VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing	Xiangpeng Yang et.al.	2502.17258	null
2025-02-24	Dimitra: Audio-driven Diffusion model for Expressive Talking Head Generation	Baptiste Chopin et.al.	2502.17198	null
2025-02-24	DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks	Canyu Zhao et.al.	2502.17157	link
2025-02-24	Diffusion Models for Tabular Data: Challenges, Current Progress, and Future Directions	Zhong Li et.al.	2502.17119	link
2025-02-24	SFLD: Reducing the content bias for AI-generated Image Detection	Seoyeon Gye et.al.	2502.17105	null
2025-02-24	Generative Models in Decision Making: A Survey	Yinchuan Li et.al.	2502.17100	null
2025-02-24	Conditional Diffusion-Flow models for generating 3D cosmic density fields: applications to f(R) cosmologies	Julieth Katherine Riveros et.al.	2502.17087	link
2025-02-24	SpecDM: Hyperspectral Dataset Synthesis with Pixel-level Semantic Annotations	Wendi Liu et.al.	2502.17056	null
2025-02-24	TraFlow: Trajectory Distillation on Pre-Trained Rectified Flow	Zhangkai Wu et.al.	2502.16972	null
2025-02-24	Autoregressive Image Generation Guided by Chains of Thought	Miaomiao Cai et.al.	2502.16965	null
2025-02-24	MAD-AD: Masked Diffusion for Unsupervised Brain Anomaly Detection	Farzad Beizaee et.al.	2502.16943	link
2025-02-24	Multi-Dimensional Quality Assessment for Text-to-3D Assets: Dataset and Model	Kang Fu et.al.	2502.16915	link
2025-02-24	Mitigating Hallucinations in Diffusion Models through Adaptive Attention Modulation	Trevine Oorloff et.al.	2502.16872	null
2025-02-24	Posterior Inference with Diffusion Models for High-dimensional Black-box Optimization	Taeyoung Yun et.al.	2502.16824	link
2025-02-24	Fast, Accurate Manifold Denoising by Tunneling Riemannian Optimization	Shiyu Wang et.al.	2502.16819	null
2025-02-24	DiffKAN-Inpainting: KAN-based Diffusion model for brain tumor inpainting	Tianli Tao et.al.	2502.16771	null
2025-02-20	Improving the Diffusability of Autoencoders	Ivan Skorokhodov et.al.	2502.14831	null
2025-02-20	A Survey on Text-Driven 360-Degree Panorama Generation	Hai Wang et.al.	2502.14799	null
2025-02-20	DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models	Hongji Yang et.al.	2502.14779	null
2025-02-20	Textured 3D Regenerative Morphing with 3D Diffusion Prior	Songlin Yang et.al.	2502.14316	null
2025-02-19	DiffExp: Efficient Exploration in Reward Fine-tuning for Text-to-Image Diffusion Models	Daewon Chae et.al.	2502.14070	null
2025-02-19	d-Sketch: Improving Visual Fidelity of Sketch-to-Image Translation with Pretrained Latent Diffusion Models without Retraining	Prasun Roy et.al.	2502.14007	link
2025-02-19	Im2SurfTex: Surface Texture Generation via Neural Backprojection of Multi-View Images	Yiangos Georgiou et.al.	2502.14006	null
2025-02-19	SigStyle: Signature Style Transfer via Personalized Text-to-Image Models	Ye Wang et.al.	2502.13997	null
2025-02-19	FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation	Yunpeng Zhang et.al.	2502.13995	link
2025-02-19	Generative Detail Enhancement for Physically Based Materials	Saeed Hadadan et.al.	2502.13994	null
2025-02-19	SelfAge: Personalized Facial Age Transformation Using Self-reference Images	Taishi Ito et.al.	2502.13987	link
2025-02-19	IP-Composer: Semantic Composition of Visual Concepts	Sara Dorfman et.al.	2502.13951	null
2025-02-19	TESS 2: A Large-Scale Generalist Diffusion Language Model	Jaesung Tae et.al.	2502.13917	link
2025-02-19	Reverse Markov Learning: Multi-Step Generative Models for Complex Distributions	Xinwei Shen et.al.	2502.13747	null
2025-02-19	RestoreGrad: Signal Restoration Using Conditional Denoising Diffusion Models with Jointly Learned Prior	Ching-Hua Lee et.al.	2502.13574	null
2025-02-19	Diffusion Model Agnostic Social Influence Maximization in Hyperbolic Space	Hongliang Qiao et.al.	2502.13571	null
2025-02-19	Interleaved Gibbs Diffusion for Constrained Generation	Gautham Govind Anil et.al.	2502.13450	null
2025-02-18	Secure and Efficient Watermarking for Latent Diffusion Models in Model Distribution Scenarios	Liangqi Lei et.al.	2502.13345	null
2025-02-18	Geometry-Aware Diffusion Models for Multiview Scene Inpainting	Ahmad Salimi et.al.	2502.13335	null
2025-02-18	MotionMatcher: Motion Customization of Text-to-Video Diffusion Models via Motion Feature Matching	Yen-Siang Wu et.al.	2502.13234	null
2025-02-18	Fundus2Globe: Generative AI-Driven 3D Digital Twins for Personalized Myopia Management	Danli Shi et.al.	2502.13182	null
2025-02-18	Is Noise Conditioning Necessary for Denoising Generative Models?	Qiao Sun et.al.	2502.13129	null
2025-02-18	Score Matching Riemannian Diffusion Means	Frederik Möbius Rygaard et.al.	2502.13106	null
2025-02-18	Personalized Image Generation with Deep Generative Models: A Decade Survey	Yuxiang Wei et.al.	2502.13081	link
2025-02-18	Does Training with Synthetic Data Truly Protect Privacy?	Yunpeng Zhao et.al.	2502.12976	link
2025-02-18	Guaranteed Conditional Diffusion: 3D Block-based Models for Scientific Data Compression	Jaemoon Lee et.al.	2502.12951	null
2025-02-18	RAPID: Retrieval Augmented Training of Differentially Private Diffusion Models	Tanqiu Jiang et.al.	2502.12794	link
2025-02-18	Composition and Control with Distilled Energy Diffusion Models and Sequential Monte Carlo	James Thornton et.al.	2502.12786	null
2025-02-18	High-Fidelity Novel View Synthesis via Splatting-Guided Diffusion	Xiang Zhang et.al.	2502.12752	null
2025-02-18	3D Shape-to-Image Brownian Bridge Diffusion for Brain MRI Synthesis from Cortical Surfaces	Fabian Bongratz et.al.	2502.12742	null
2025-02-18	NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation	Zhiyuan Liu et.al.	2502.12638	link
2025-02-17	Diffusion Models without Classifier-free Guidance	Zhicong Tang et.al.	2502.12154	link
2025-02-17	Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening	Ye Tian et.al.	2502.12146	link
2025-02-17	How compositional generalization and creativity improve as diffusion models are trained	Alessandro Favero et.al.	2502.12089	null
2025-02-17	HumanGif: Single-View Human Diffusion with Generative Prior	Shoukang Hu et.al.	2502.12080	link
2025-02-17	A Survey on Bridging EEG Signals and Generative AI: From Image and Text to Beyond	Shreya Shukla et.al.	2502.12048	null
2025-02-17	Characterizing Photorealism and Artifacts in Diffusion Model-Generated Images	Negar Kamali et.al.	2502.11989	link
2025-02-17	Image Inversion: A Survey from GANs to Diffusion and Beyond	Yinan Chen et.al.	2502.11974	link
2025-02-17	Approximating a spatially-heterogeneously mass-emitting object by multiple point sources in a diffusion model	Qiyao Peng et.al.	2502.11908	null
2025-02-17	BackdoorDM: A Comprehensive Benchmark for Backdoor Learning in Diffusion Model	Weilin Lin et.al.	2502.11798	link
2025-02-17	MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow	Hanzhuo Huang et.al.	2502.11697	null
2025-02-17	GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text	Gyumin Shim et.al.	2502.11642	null
2025-02-17	Membership Inference Attacks for Face Images Against Fine-Tuned Latent Diffusion Models	Lauritz Christian Holme et.al.	2502.11619	null
2025-02-17	Maximum Entropy Reinforcement Learning with Diffusion Policy	Xiaoyi Dong et.al.	2502.11612	link
2025-02-17	Continuous Diffusion Model for Language Modeling	Jaehyeong Jo et.al.	2502.11564	link
2025-02-17	Control-CLIP: Decoupling Category and Style Guidance in CLIP for Specific-Domain Generation	Zexi Jia et.al.	2502.11532	null
2025-02-17	SayAnything: Audio-Driven Lip Synchronization with Conditional Video Diffusion	Junxian Ma et.al.	2502.11515	null
2025-02-17	Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation	Taeyoung Yun et.al.	2502.11477	link
2025-02-17	Inverse Flow and Consistency Models	Yuchen Zhang et.al.	2502.11333	null
2025-02-17	Deep Learning of Proteins with Local and Global Regions of Disorder	Oufan Zhang et.al.	2502.11326	link
2025-02-16	Collaborative Deterministic-Diffusion Model for Probabilistic Urban Spatiotemporal Prediction	Zhi Sheng et.al.	2502.11013	null
2025-02-13	Theoretical Benefit and Limitation of Diffusion Language Model	Guhao Feng et.al.	2502.09622	null
2025-02-13	RigAnything: Template-Free Autoregressive Rigging for Diverse 3D Assets	Isabella Liu et.al.	2502.09615	null
2025-02-13	Score-of-Mixture Training: Training One-Step Generative Models Made Simple	Tejas Jayashankar et.al.	2502.09609	null
2025-02-13	Rolling Ahead Diffusion for Traffic Scene Simulation	Yunpeng Liu et.al.	2502.09587	null
2025-02-13	Memorization and Generalization in Generative Diffusion under the Manifold Hypothesis	Beatrice Achilli et.al.	2502.09578	null
2025-02-13	DiffMS: Diffusion Generation of Molecules Conditioned on Mass Spectra	Montgomery Bohde et.al.	2502.09571	link
2025-02-13	Diffusing DeBias: a Recipe for Turning a Bug into a Feature	Massimiliano Ciranni et.al.	2502.09564	null
2025-02-13	Long-Term TalkingFace Generation via Motion-Prior Conditional Diffusion Model	Fei Shen et.al.	2502.09533	null
2025-02-13	Diffusion Models for Molecules: A Survey of Methods and Tasks	Liang Wang et.al.	2502.09511	link
2025-02-13	Redistribute Ensemble Training for Mitigating Memorization in Diffusion Models	Xiaoliu Guan et.al.	2502.09434	link
2025-02-13	ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation	Rotem Shalev-Arkushin et.al.	2502.09411	null
2025-02-13	Non-asymptotic Analysis of Diffusion Annealed Langevin Monte Carlo for Generative Modelling	Paula Cordero-Encinar et.al.	2502.09306	null
2025-02-13	ConsistentDreamer: View-Consistent Meshes Through Balanced Multi-View Gaussian Optimization	Onat Şahin et.al.	2502.09278	null
2025-02-13	From large language models to multimodal AI: A scoping review on the potential of generative AI in medicine	Lukas Buess et.al.	2502.09242	null
2025-02-13	E-MD3C: Taming Masked Diffusion Transformers for Efficient Zero-Shot Object Customization	Trung X. Pham et.al.	2502.09164	null
2025-02-13	Regularization can make diffusion models more efficient	Mahsa Taheri et.al.	2502.09151	null
2025-02-13	Exact Bayesian inference for Markov switching diffusions	Timothée Stumpf-Fétizon et.al.	2502.09126	null
2025-02-13	StyleBlend: Enhancing Style-Specific Content Creation in Text-to-Image Diffusion Models	Zichong Chen et.al.	2502.09064	link
2025-02-13	MTDP: Modulated Transformer Diffusion Policy Model	Qianhao Wang et.al.	2502.09029	null
2025-02-13	Dynamic watermarks in images generated by diffusion models	Yunzhuo Chen et.al.	2502.08927	null
2025-02-12	SwiftSketch: A Diffusion Model for Image-to-Vector Sketch Generation	Ellie Arar et.al.	2502.08642	null
2025-02-12	CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation	Qinghe Wang et.al.	2502.08639	null
2025-02-12	Chasing Charge Carriers: Diffusion Dynamics in Mixed-n Quasi-Two-Dimensional Colloidal MAPbBr3 Perovskites	Ronja Maria Piehler et.al.	2502.08601	null
2025-02-12	Enhancing Diffusion Models Efficiency by Disentangling Total-Variance and Signal-to-Noise Ratio	Khaled Kahouli et.al.	2502.08598	link
2025-02-12	Light-A-Video: Training-free Video Relighting via Progressive Light Fusion	Yujie Zhou et.al.	2502.08590	link
2025-02-12	Ultrasound Image Generation using Latent Diffusion Models	Benoit Freiche et.al.	2502.08580	null
2025-02-12	Mapping the Landscape of Generative AI in Network Monitoring and Management	Giampaolo Bovenzi et.al.	2502.08576	null
2025-02-12	BCDDM: Branch-Corrected Denoising Diffusion Model for Black Hole Image Generation	Ao Liu et.al.	2502.08528	null
2025-02-12	One-Shot Federated Learning with Classifier-Free Diffusion Models	Obaidullah Zaland et.al.	2502.08488	null
2025-02-12	A Survey on Pre-Trained Diffusion Model Distillations	Xuhui Fan et.al.	2502.08364	null
2025-02-12	A posteriori error control for a finite volume scheme for a cross-diffusion model of ion transport	Arne Berrens et.al.	2502.08306	null
2025-02-12	BEAM: Bridging Physically-based Rendering and Gaussian Modeling for Relightable Volumetric Video	Yu Hong et.al.	2502.08297	null
2025-02-12	FloVD: Optical Flow Meets Video Diffusion Model for Enhanced Camera-Controlled Video Synthesis	Wonjoon Jin et.al.	2502.08244	null
2025-02-12	DNNs May Determine Major Properties of Their Outputs Early, with Timing Possibly Driven by Bias	Song Park et.al.	2502.08167	null
2025-02-12	PoGDiff: Product-of-Gaussians Diffusion Models for Imbalanced Text-to-Image Generation	Ziyan Wang et.al.	2502.08106	null
2025-02-12	End-to-End Predictive Planner for Autonomous Driving with Consistency Models	Anjian Li et.al.	2502.08033	null
2025-02-11	Training-Free Safe Denoisers for Safe Use of Diffusion Models	Mingyu Kim et.al.	2502.08011	null
2025-02-11	Greed is Good: Guided Generation from a Greedy Perspective	Zander W. Blasingame et.al.	2502.08006	null
2025-02-11	Towards Training One-Step Diffusion Models Without Distillation	Mingtian Zhang et.al.	2502.08005	null
2025-02-11	SurGrID: Controllable Surgical Simulation via Scene Graph to Image Diffusion	Yannik Frisch et.al.	2502.07945	null
2025-02-10	Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions	Jaeyeon Kim et.al.	2502.06768	null
2025-02-10	History-Guided Video Diffusion	Kiwhan Song et.al.	2502.06764	null
2025-02-10	Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene	Tai-Yu Pan et.al.	2502.06682	null
2025-02-10	Unleashing the Potential of Pre-Trained Diffusion Models for Generalizable Person Re-Identification	Jiachen Li et.al.	2502.06619	link
2025-02-10	MaterialFusion: High-Quality, Zero-Shot, and Controllable Material Transfer with Diffusion Models	Kamil Garifullin et.al.	2502.06606	null
2025-02-10	A Large-scale AI-generated Image Inpainting Benchmark	Paschalis Giakoumoglou et.al.	2502.06593	null
2025-02-10	Diffusion Models for Computational Neuroimaging: A Survey	Haokai Zhao et.al.	2502.06552	link
2025-02-10	Boost-and-Skip: A Simple Guidance-Free Diffusion for Minority Generation	Soobin Um et.al.	2502.06516	link
2025-02-10	WyckoffDiff - A Generative Diffusion Model for Crystal Symmetry	Filip Ekström Kelvinius et.al.	2502.06485	link
2025-02-10	Habitizing Diffusion Planning for Efficient and Effective Decision Making	Haofei Lu et.al.	2502.06401	link
2025-02-10	TANGLED: Generating 3D Hair Strands from Images with Arbitrary Styles and Viewpoints	Pengyu Long et.al.	2502.06392	null
2025-02-10	Solving Linear-Gaussian Bayesian Inverse Problems with Decoupled Diffusion Sequential Monte Carlo	Filip Ekström Kelvinius et.al.	2502.06379	null
2025-02-10	Guidance-base Diffusion Models for Improving Photoacoustic Image Quality	Tatsuhiro Eguchi et.al.	2502.06354	null
2025-02-10	Zero-shot Depth Completion via Test-time Alignment with Affine-invariant Depth Prior	Lee Hyoseok et.al.	2502.06338	null
2025-02-10	Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance	Li Hu et.al.	2502.06145	null
2025-02-10	CDM: Contact Diffusion Model for Multi-Contact Point Localization	Seo Wook Han et.al.	2502.06109	null
2025-02-10	Debiasing Guidance for Discrete Diffusion with Sequential Monte Carlo	Cheuk Kit Lee et.al.	2502.06079	null
2025-02-09	Generating 3D Binding Molecules Using Shape-Conditioned Diffusion Models with Guidance	Ziqi Chen et.al.	2502.06027	null
2025-02-09	Dual Caption Preference Optimization for Diffusion Models	Amir Saeidi et.al.	2502.06023	link
2025-02-09	Diffusion Models for Inverse Problems in the Exponential Family	Alessandro Micheli et.al.	2502.05994	null
2025-02-06	HOG-Diff: Higher-Order Guided Diffusion for Graph Generation	Yiming Huang et.al.	2502.04308	link
2025-02-06	MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation	Jinbo Xing et.al.	2502.04299	null
2025-02-06	Diffusion-based mass map reconstruction from weak lensing data	Supranta S. Boruah et.al.	2502.04158	null
2025-02-06	Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis	Zhen Ye et.al.	2502.04128	link
2025-02-06	Generative Adversarial Networks Bridging Art and Machine Intelligence	Junhao Song et.al.	2502.04116	null
2025-02-06	TQ-DiT: Efficient Time-Aware Quantization for Diffusion Transformers	Younghye Hwang et.al.	2502.04056	null
2025-02-06	PartEdit: Fine-Grained Image Editing using Pre-Trained Diffusion Models	Aleksandar Cvejic et.al.	2502.04050	null
2025-02-06	Hierarchical Entropic Diffusion for Ransomware Detection: A Probabilistic Approach to Behavioral Anomaly Isolation	Vasili Iskorohodov et.al.	2502.03882	null
2025-02-06	DeblurDiff: Real-World Image Deblurring with Generative Diffusion Models	Lingshun Kong et.al.	2502.03810	null
2025-02-06	DICE: Distilling Classifier-Free Guidance into Text Embeddings	Zhenyu Zhou et.al.	2502.03726	null
2025-02-06	Conditional Diffusion Models are Medical Image Classifiers that Provide Explainability and Uncertainty for Free	Gian Mario Favero et.al.	2502.03687	null
2025-02-06	Variational Control for Guidance in Diffusion Models	Kushagra Pandey et.al.	2502.03686	link
2025-02-05	Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach	Yunuo Chen et.al.	2502.03639	null
2025-02-05	SymmCD: Symmetry-Preserving Crystal Generation with Diffusion Models	Daniel Levy et.al.	2502.03638	link
2025-02-05	Simultaneous Multi-Robot Motion Planning with Projected Diffusion Models	Jinhao Liang et.al.	2502.03607	null
2025-02-05	Path Planning for Masked Diffusion Model Sampling	Fred Zhangzhi Peng et.al.	2502.03540	null
2025-02-05	Dress-1-to-3: Single Image to Simulation-Ready 3D Outfit with Diffusion Prior and Differentiable Physics	Xuan Li et.al.	2502.03449	null
2025-02-05	Masked Autoencoders Are Effective Tokenizers for Diffusion Models	Hao Chen et.al.	2502.03444	null
2025-02-05	TruePose: Human-Parsing-guided Attention Diffusion for Full-ID Preserving Pose Transfer	Zhihong Xu et.al.	2502.03426	null
2025-02-05	A Mixture-Based Framework for Guiding Diffusion Models	Yazid Janati et.al.	2502.03332	link
2025-02-05	An efficient end-to-end computational framework for the generation of ECG calibrated volumetric models of human atrial electrophysiology	Elena Zappon et.al.	2502.03322	null
2025-02-05	MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent	Xinyao Liao et.al.	2502.03207	null
2025-02-05	Poisson Flow Joint Model for Multiphase contrast-enhanced CT	Rongjun Ge et.al.	2502.03079	null
2025-02-05	Direct Distributional Optimization for Provable Alignment of Diffusion Models	Ryotaro Kawata et.al.	2502.02954	null
2025-02-05	Fast T2T: Optimization Consistency Speeds Up Diffusion-Based Training-to-Testing Solving for Combinatorial Optimization	Yang Li et.al.	2502.02941	null
2025-02-05	Elucidating the Preconditioning in Consistency Distillation	Kaiwen Zheng et.al.	2502.02922	null
2025-02-04	When are Diffusion Priors Helpful in Sparse Reconstruction? A Study with Sparse-view CT	Matt Y. Cheung et.al.	2502.02771	null
2025-02-04	Calibrated Multi-Preference Optimization for Aligning Diffusion Models	Kyungmin Lee et.al.	2502.02588	null
2025-02-04	Open Materials Generation with Stochastic Interpolants	Philipp Hoellmer et.al.	2502.02582	null
2025-02-04	Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation	Jian Liu et.al.	2502.02525	link
2025-02-04	Privacy Attacks on Image AutoRegressive Models	Antoni Kowalczuk et.al.	2502.02514	link
2025-02-04	Do Graph Diffusion Models Accurately Capture and Generate Substructure Distributions?	Xiyuan Wang et.al.	2502.02488	null
2025-02-04	Distributional Diffusion Models with Scoring Rules	Valentin De Bortoli et.al.	2502.02483	null
2025-02-04	Towards Consistent and Controllable Image Synthesis for Face Editing	Mengting Wei et.al.	2502.02465	null
2025-02-04	Sparse Data Generation Using Diffusion Models	Phil Ostheimer et.al.	2502.02448	null
2025-02-04	Towards Fast Graph Generation via Autoregressive Noisy Filtration Modeling	Markus Krimmel et.al.	2502.02415	link
2025-01-31	Beyond Fixed Horizons: A Theoretical Framework for Adaptive Denoising Diffusions	Sören Christensen et.al.	2501.19373	null
2025-01-31	Pathological MRI Segmentation by Synthetic Pathological Data Generation in Fetuses and Neonates	Misha P. T Kaandorp et.al.	2501.19338	null
2025-01-31	Medical Semantic Segmentation with Diffusion Pretrain	David Li et.al.	2501.19265	null
2025-01-31	Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search	Yuta Oshima et.al.	2501.19252	null
2025-01-31	PSyDUCK: Training-Free Steganography for Latent Diffusion	Georgia Channing et.al.	2501.19172	null
2025-01-31	RMDM: Radio Map Diffusion Model with Physics Informed	Haozhe Jia et.al.	2501.19160	link
2025-01-31	Ambient Denoising Diffusion Generative Adversarial Networks for Establishing Stochastic Object Models from Noisy Image Data	Xichen Xu et.al.	2501.19094	null
2025-01-31	MotionPCM: Real-Time Motion Synthesis with Phased Consistency Model	Lei Jiang et.al.	2501.19083	null
2025-01-31	Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations	Dahye Kim et.al.	2501.19066	link
2025-01-31	Collaborative Diffusion Model for Recommender System	Gyuseok Lee et.al.	2501.18997	null
2025-01-31	OmniPhysGS: 3D Constitutive Gaussians for General Physics-Based Dynamics Generation	Yuchen Lin et.al.	2501.18982	null
2025-01-31	Fantastic Targets for Concept Erasure in Diffusion Models and Where To Find Them	Anh Bui et.al.	2501.18950	link
2025-01-31	Rethinking Diffusion Posterior Sampling: From Conditional Score Estimator to Maximizing a Posterior	Tongda Xu et.al.	2501.18913	link
2025-01-31	Trustworthy Evaluation of Generative AI Models	Zijun Gao et.al.	2501.18897	null
2025-01-31	Distorting Embedding Space for Safety: A Defense Mechanism for Adversarially Robust Diffusion Models	Jaesin Ahn et.al.	2501.18877	link
2025-01-31	REG: Rectified Gradient Guidance for Conditional Diffusion Models	Zhengqi Gao et.al.	2501.18865	null
2025-01-31	Equivariant Hypergraph Diffusion for Crystal Structure Prediction	Yang Liu et.al.	2501.18850	null
2025-01-31	Pitfalls of defacing whole-head MRI: re-identification risk with diffusion models and compromised research potential	Chenyu Gao et.al.	2501.18834	null
2025-01-30	Distillation-Driven Diffusion Model for Multi-Scale MRI Super-Resolution: Make 1.5T MRI Great Again	Zhe Wang et.al.	2501.18736	link
2025-01-30	Strong and Controllable 3D Motion Generation	Canxuan Gang et.al.	2501.18726	null
2025-01-30	DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models	Ruofan Liang et.al.	2501.18590	null
2025-01-30	Free-T2M: Frequency Enhanced Text-to-Motion Diffusion Model With Consistency Loss	Wenshuo Chen et.al.	2501.18232	link
2025-01-30	Inverse source problem of sub-diffusion of variable exponent	Zhiyuan Li et.al.	2501.18228	null
2025-01-29	SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders	Bartosz Cywiński et.al.	2501.18052	link
2025-01-28	ProcTex: Consistent and Interactive Text-to-texture Synthesis for Procedural Models	Ruiqi Xu et.al.	2501.17895	null
2025-01-29	VICCA: Visual Interpretation and Comprehension of Chest X-ray Anomalies in Generated Report Without Human Feedback	Sayeh Gholipour Picha et.al.	2501.17726	link
2025-01-29	Distinguished Quantized Guidance for Diffusion-based Sequence Recommendation	Wenyu Mao et.al.	2501.17670	null
2025-01-29	Solving Inverse Problems using Diffusion with Fast Iterative Renoising	Matt C. Bendel et.al.	2501.17468	null
2025-01-28	MDDM: A Molecular Dynamics Diffusion Model to Predict Particle Self-Assembly	Kevin Ferguson et.al.	2501.17319	null
2025-01-28	CubeDiff: Repurposing Diffusion-Based Image Models for Panorama Generation	Nikolai Kalischek et.al.	2501.17162	null
2025-01-28	IC-Portrait: In-Context Matching for View-Consistent Personalized Portrait	Han Yang et.al.	2501.17159	null
2025-01-28	Generative diffusion models from a PDE perspective	Fei Cao et.al.	2501.17054	null
2025-01-28	Adversarial Masked Autoencoder Purifier with Defense Transferability	Yuan-Chih Chen et.al.	2501.16904	null
2025-01-28	DIRIGENt: End-To-End Robotic Imitation of Human Demonstrations Based on a Diffusion Model	Josua Spisak et.al.	2501.16800	null
2025-01-28	FlexMotion: Lightweight, Physics-Aware, and Controllable Human Motion Generation	Arvin Tashakori et.al.	2501.16778	null
2025-01-28	DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation	Chenguo Lin et.al.	2501.16764	null
2025-01-28	ITVTON: Virtual Try-On Diffusion Transformer Model Based on Integrated Image and Text	Haifeng Ni et.al.	2501.16757	null
2025-01-28	Consistency Diffusion Models for Single-Image 3D Reconstruction with Priors	Chenru Jiang et.al.	2501.16737	null
2025-01-28	Separate Motion from Appearance: Customizing Motion via Customizing Text-to-Video Diffusion Models	Huijie Liu et.al.	2501.16714	null
2025-01-28	CascadeV: An Implementation of Wurstchen Architecture for Video Generation	Wenfeng Lin et.al.	2501.16612	link
2025-01-27	PackDiT: Joint Human Motion and Text Generation via Mutual Prompting	Zhongyu Jiang et.al.	2501.16551	null
2025-01-27	PhysAnimator: Physics-Guided Generative Cartoon Animation	Tianyi Xie et.al.	2501.16550	null
2025-01-27	Decrypting the temperature field in flow boiling with latent diffusion models	UngJin Na et.al.	2501.16510	null
2025-01-27	RelightVid: Temporal-Consistent Diffusion Model for Video Relighting	Ye Fang et.al.	2501.16330	null
2025-01-27	Congested Crossing Pedestrian Traffic Flow: Dispersion vs. Transport in Crowded Areas	Mariam Al Khatib et.al.	2501.16275	null
2025-01-27	UDBE: Unsupervised Diffusion-based Brightness Enhancement in Underwater Images	Tatiana Taís Schein et.al.	2501.16211	link
2025-01-27	Multi-front dynamics in spatially inhomogeneous Allen-Cahn equations	Robbin Bastiaansen et.al.	2501.16195	null
2025-01-27	BAG: Body-Aligned 3D Wearable Asset Generation	Zhongjin Luo et.al.	2501.16177	null
2025-01-27	Efficient Portrait Matte Creation With Layer Diffusion and Connectivity Priors	Zhiyuan Lu et.al.	2501.16147	null
2025-01-27	Using Generative Models to Produce Realistic Populations of UK Windstorms	Yee Chun Tsoi et.al.	2501.16110	null
2025-01-27	Improving Tropical Cyclone Forecasting With Video Diffusion Models	Zhibo Ren et.al.	2501.16003	link
2025-01-27	MatCLIP: Light- and Shape-Insensitive Assignment of PBR Material Models	Michael Birsak et.al.	2501.15981	null
2025-01-27	Generative AI for Lyapunov Optimization Theory in UAV-based Low-Altitude Economy Networking	Zhang Liu et.al.	2501.15928	null
2025-01-27	Slot-Guided Adaptation of Pre-trained Diffusion Models for Object-Centric Learning and Compositional Generation	Adil Kaan Akan et.al.	2501.15878	null
2025-01-27	Can Location Embeddings Enhance Super-Resolution of Satellite Imagery?	Daniel Panangian et.al.	2501.15847	null
2025-01-27	Memorization and Regularization in Generative Diffusion Models	Ricardo Baptista et.al.	2501.15785	link
2025-01-26	BoKDiff: Best-of-K Diffusion Alignment for Target-Specific 3D Molecule Generation	Ali Khodabandeh Yalabadi et.al.	2501.15631	link
2025-01-26	Cross-Cultural Fashion Design via Interactive Large Language Models and Diffusion Models	Spencer Ramsey et.al.	2501.15571	null
2025-01-26	CE-SDWV: Effective and Efficient Concept Erasure for Text-to-Image Diffusion Models via a Semantic-Driven Word Vocabulary	Jiahang Tu et.al.	2501.15562	null
2025-01-26	Distributionally Robust Graph Out-of-Distribution Recommendation via Diffusion Model	Chu Zhao et.al.	2501.15555	link
2025-01-26	LoRAGuard: An Effective Black-box Watermarking Approach for LoRAs	Peizhuo Lv et.al.	2501.15478	null
2025-01-26	SQ-DM: Accelerating Diffusion Models with Aggressive Quantization and Temporal Sparsity	Zichen Fan et.al.	2501.15448	null
2025-01-26	StochSync: Stochastic Diffusion Synchronization for Image Generation in Arbitrary Spaces	Kyeongmin Yeo et.al.	2501.15445	null
2025-01-23	IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models	Jiayi Lei et.al.	2501.13920	null
2025-01-23	Improving Video Generation with Human Feedback	Jie Liu et.al.	2501.13918	null
2025-01-23	Unveiling the Power of Noise Priors: Enhancing Diffusion Models for Mobile Traffic Prediction	Zhi Sheng et.al.	2501.13794	null
2025-01-23	An Efficient Diffusion-based Non-Autoregressive Solver for Traveling Salesman Problem	Mingzhao Wang et.al.	2501.13767	link
2025-01-23	Training-Free Consistency Pipeline for Fashion Repose	Potito Aghilar et.al.	2501.13692	null
2025-01-23	One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt	Tao Liu et.al.	2501.13554	link
2025-01-23	Diffusion-based Perceptual Neural Video Compression with Temporal Diffusion Information Reuse	Wenzhuo Ma et.al.	2501.13528	null
2025-01-23	LDR-Net: A Novel Framework for AI-generated Image Detection via Localized Discrepancy Representation	JiaXin Chen et.al.	2501.13475	null
2025-01-23	Zero-Shot Trajectory Planning for Signal Temporal Logic Tasks	Ruijia Liu et.al.	2501.13457	null
2025-01-23	Bridging The Multi-Modality Gaps of Audio, Visual and Linguistic for Speech Enhancement	Meng-Ping Lin et.al.	2501.13375	null
2025-01-23	MSF: Efficient Diffusion Model Via Multi-Scale Latent Factorize	Haohang Xu et.al.	2501.13349	null
2025-01-23	One Fits All: General Mobility Trajectory Modeling via Masked Conditional Diffusion	Qingyue Long et.al.	2501.13347	null
2025-01-23	Retrievals Can Be Detrimental: A Contrastive Backdoor Attack Paradigm on Retrieval-Augmented Diffusion Models	Hao Fang et.al.	2501.13340	null
2025-01-23	Gradient-Free Adversarial Purification with Diffusion Models	Xuelong Dai et.al.	2501.13336	null
2025-01-22	State Combinatorial Generalization In Decision Making With Conditional Diffusion Models	Xintong Duan et.al.	2501.13241	null
2025-01-23	Accelerate High-Quality Diffusion Models with Inner Loop Feedback	Matthew Gwilliam et.al.	2501.13107	null
2025-01-22	Robust Representation Consistency Model via Contrastive Denoising	Jiachen Lei et.al.	2501.13094	link
2025-01-22	Orchid: Image Latent Diffusion for Joint Appearance and Geometry Generation	Akshay Krishnan et.al.	2501.13087	null
2025-01-22	Robust Body Composition Analysis by Generating 3D CT Volumes from Limited 2D Slices	Lianrui Zuo et.al.	2501.13071	null
2025-01-22	Beyond the Lungs: Extending the Field of View in Chest CT with Latent Diffusion Models	Lianrui Zuo et.al.	2501.13068	null
2025-01-22	Low-dimensional adaptation of diffusion models: Convergence in total variation	Jiadong Liang et.al.	2501.12982	null
2025-01-22	3D Object Manipulation in a Single Image using Generative Models	Ruisi Zhao et.al.	2501.12935	null
2025-01-22	CrossDiff: Diffusion Probabilistic Model With Cross-conditional Encoder-Decoder for Crack Segmentation	Xianglong Shi et.al.	2501.12860	null
2025-01-22	AMM-Diff: Adaptive Multi-Modality Diffusion Network for Missing Modality Imputation	Aghiles Kebaili et.al.	2501.12840	null
2025-01-22	Certified Guidance for Planning with Deep Generative Models	Francesco Giacomarra et.al.	2501.12815	null
2025-01-22	T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation	Lijun Li et.al.	2501.12612	link
2025-01-22	Image Motion Blur Removal in the Temporal Dimension with Video Diffusion Models	Wang Pang et.al.	2501.12604	null
2025-01-21	Federated Discrete Denoising Diffusion Model for Molecular Generation with OpenFL	Kevin Ta et.al.	2501.12523	link
2025-01-21	Towards Affordance-Aware Articulation Synthesis for Rigged Objects	Yu-Chu Yu et.al.	2501.12393	null
2025-01-22	GPS as a Control Signal for Image Generation	Chao Feng et.al.	2501.12390	null
2025-01-21	Audio Texture Manipulation by Exemplar-Based Analogy	Kan Jen Cheng et.al.	2501.12385	null
2025-01-21	DiffDoctor: Diagnosing Image Diffusion Models Before Treating	Yiyang Wang et.al.	2501.12382	null
2025-01-21	VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models	Chaohao Xie et.al.	2501.12267	null
2025-01-21	Joint Reconstruction and Motion Estimation in Sparse-View 4DCT Using Diffusion Models within a Blind Inverse Problem Framework	Antoine De Paepe et.al.	2501.12249	null
2025-01-21	TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space	Daniel Garibi et.al.	2501.12224	null
2025-01-17	DiffStereo: High-Frequency Aware Diffusion Model for Stereo Image Restoration	Huiyun Cao et.al.	2501.10325	null
2025-01-17	DiffVSR: Enhancing Real-World Video Super-Resolution with Diffusion Models for Advanced Visual Quality and Temporal Consistency	Xiaohui Li et.al.	2501.10110	null
2025-01-17	Conditional Latent Diffusion-Based Speech Enhancement Via Dual Context Learning	Shengkui Zhao et.al.	2501.10052	link
2025-01-17	DiffuEraser: A Diffusion Model for Video Inpainting	Xiaowen Li et.al.	2501.10018	link
2025-01-17	Enhancing Crash Frequency Modeling Based on Augmented Multi-Type Data by Hybrid VAE-Diffusion-Based Generative Neural Networks	Junlan Chen et.al.	2501.10017	null
2025-01-17	Physics-informed DeepCT: Sinogram Wavelet Decomposition Meets Masked Diffusion	Zekun Zhou et.al.	2501.09935	link
2025-01-16	Geometry-Preserving Encoder/Decoder in Latent Generative Models	Wonjun Lee et.al.	2501.09876	null
2025-01-16	CrossModalityDiffusion: Multi-Modal Novel View Synthesis with Unified Intermediate Representation	Alex Berian et.al.	2501.09838	link
2025-01-16	PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery	Shristi Das Biswas et.al.	2501.09826	link
2025-01-16	Lossy Compression with Pretrained Diffusion Models	Jeremy Vonderfecht et.al.	2501.09815	link
2025-01-16	SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces	Sumit Chaturvedi et.al.	2501.09756	null
2025-01-16	Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps	Nanye Ma et.al.	2501.09732	null
2025-01-16	Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review	Masatoshi Uehara et.al.	2501.09685	null
2025-01-16	Pruning for Sparse Diffusion Models based on Gradient Flow	Ben Wan et.al.	2501.09464	null
2025-01-16	CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation	Hwan Heo et.al.	2501.09433	link
2025-01-16	Contract-Inspired Contest Theory for Controllable Image Generation in Mobile Edge Metaverse	Guangyuan Liu et.al.	2501.09391	null
2025-01-16	UVRM: A Scalable 3D Reconstruction Model from Unposed Videos	Shiu-hong Kao et.al.	2501.09347	null
2025-01-16	Domain-conditioned and Temporal-guided Diffusion Modeling for Accelerated Dynamic MRI Reconstruction	Liping Zhang et.al.	2501.09305	null
2025-01-16	Text Semantics to Flexible Design: A Residential Layout Generation Method Based on Stable Diffusion Model	Zijin Qiu et.al.	2501.09279	null
2025-01-16	PATCHEDSERVE: A Patch Management Framework for SLO-Optimized Hybrid Resolution Diffusion Serving	Desen Sun et.al.	2501.09253	null
2025-01-15	Grounding Text-To-Image Diffusion Models For Controlled High-Quality Image Generation	Ahmad Süleyman et.al.	2501.09194	null
2025-01-15	Generative diffusion model with inverse renormalization group flows	Kanta Masuki et.al.	2501.09064	link
2025-01-15	NeurOp-Diff: Continuous Remote Sensing Image Super-Resolution via Neural Operator Diffusion	Zihao Xu et.al.	2501.09054	link
2025-01-15	SimGen: A Diffusion-Based Framework for Simultaneous Surgical Image and Segmentation Mask Generation	Aditya Bhat et.al.	2501.09008	null
2025-01-15	RepVideo: Rethinking Cross-Layer Representation for Video Generation	Chenyang Si et.al.	2501.08994	null
2025-01-15	Boosting Diffusion Guidance via Learning Degradation-Aware Models for Blind Super Resolution	Shao-Hao Lu et.al.	2501.08819	link
2025-01-15	Transformed Low-rank Adaptation via Tensor Decomposition and Its Applications to Text-to-image Models	Zerui Tao et.al.	2501.08727	null
2025-01-15	FlexiClip: Locality-Preserving Free-Form Character Animation	Anant Khandelwal et.al.	2501.08676	null
2025-01-15	TimeFlow: Longitudinal Brain Image Registration and Aging Progression Analysis	Bailiang Jian et.al.	2501.08667	null
2025-01-15	Product of Gaussian Mixture Diffusion Model for non-linear MRI Inversion	Laurenz Nagler et.al.	2501.08662	null
2025-01-15	Joint Learning of Depth and Appearance for Portrait Image Animation	Xinya Ji et.al.	2501.08649	null
2025-01-15	Watermarking in Diffusion Model: Gaussian Shading with Exact Diffusion Inversion via Coupled Transformations (EDICT)	Krishna Panthi et.al.	2501.08604	null
2025-01-15	DynamicFace: High-Quality and Consistent Video Face Swapping using Composable 3D Facial Priors	Runqi Wang et.al.	2501.08553	null
2025-01-14	Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models	Weichen Fan et.al.	2501.08453	null
2025-01-14	DAViD: Modeling Dynamic Affordance of 3D Objects using Pre-trained Video Diffusion Models	Hyeonwoo Kim et.al.	2501.08333	null
2025-01-14	MangaNinja: Line Art Colorization with Precise Reference Following	Zhiheng Liu et.al.	2501.08332	null
2025-01-14	Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise	Ryan Burgert et.al.	2501.08331	link
2025-01-14	GameFactory: Creating New Games with Generative Interactive Videos	Jiwen Yu et.al.	2501.08325	null
2025-01-14	Diffusion Adversarial Post-Training for One-Step Video Generation	Shanchuan Lin et.al.	2501.08316	null
2025-01-14	LayerAnimate: Layer-specific Control for Animation	Yuxue Yang et.al.	2501.08295	null
2025-01-14	Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints	Jonathan Nöther et.al.	2501.08246	null
2025-01-14	FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors	Yabo Zhang et.al.	2501.08225	link
2025-01-14	D$^2$-DPM: Dual Denoising for Quantized Diffusion Probabilistic Models	Qian Zeng et.al.	2501.08180	link
2025-01-13	Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss	Xinyu Zhang et.al.	2501.07563	null
2025-01-13	Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection	Shiman Zhang et.al.	2501.07533	link
2025-01-13	IP-FaceDiff: Identity-Preserving Facial Video Editing with Diffusion	Tharun Anand et.al.	2501.07530	null
2025-01-13	PrecipDiff: Leveraging image diffusion models to enhance satellite-based precipitation observations	Ting-Yu Dai et.al.	2501.07447	null
2025-01-13	Diff-Ensembler: Learning to Ensemble 2D Diffusion Models for Volume-to-Volume Medical Image Translation	Xiyue Zhu et.al.	2501.07430	null
2025-01-13	OCORD: Open-Campus Object Removal Dataset	Shuo Zhang et.al.	2501.07397	null
2025-01-13	Bigger Isn’t Always Better: Towards a General Prior for Medical Image Reconstruction	Lukas Glaszner et.al.	2501.07376	link
2025-01-13	Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion	Li Liang et.al.	2501.07260	link
2025-01-13	D3MES: Diffusion Transformer with multihead equivariant self-attention for 3D molecule generation	Zhejun Zhang et.al.	2501.07077	link
2025-01-13	Erasing Noise in Signal Detection with Diffusion Model: From Theory to Application	Xiucheng Wang et.al.	2501.07030	null
2025-01-13	Global Search for Optimal Low Thrust Spacecraft Trajectories using Diffusion Models and the Indirect Method	Jannik Graebner et.al.	2501.07005	null
2025-01-13	Likelihood Training of Cascaded Diffusion Models via Hierarchical Volume-preserving Maps	Henry Li et.al.	2501.06999	link
2025-01-12	A General Framework for Inference-time Scaling and Steering of Diffusion Models	Raghav Singhal et.al.	2501.06848	link
2025-01-12	ODPG: Outfitting Diffusion with Pose Guided Condition	Seohyun Lee et.al.	2501.06769	null
2025-01-12	Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models	Michael Toker et.al.	2501.06751	null
2025-01-12	DRDT3: Diffusion-Refined Decision Test-Time Training Model	Xingshuai Huang et.al.	2501.06718	null
2025-01-11	Personalized Preference Fine-tuning of Diffusion Models	Meihua Dang et.al.	2501.06655	null
2025-01-11	Boundary-enhanced time series data imputation with long-term dependency diffusion models	Chunjing Xiao et.al.	2501.06585	null
2025-01-11	A Diffusive Data Augmentation Framework for Reconstruction of Complex Network Evolutionary History	En Xu et.al.	2501.06485	null
2025-01-10	MEt3R: Measuring Multi-View Consistency in Generated Images	Mohammad Asim et.al.	2501.06336	null
2025-01-09	Decentralized Diffusion Models	David McAllister et.al.	2501.05450	null
2025-01-09	Progressive Growing of Video Tokenizers for Highly Compressed Latent Spaces	Aniruddha Mahapatra et.al.	2501.05442	null
2025-01-09	The GAN is dead; long live the GAN! A Modern GAN Baseline	Yiwen Huang et.al.	2501.05441	link
2025-01-09	Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation	Xuyi Meng et.al.	2501.05427	null
2025-01-09	TimeDP: Learning to Generate Multi-Domain Time Series with Domain Prompts	Yu-Hao Huang et.al.	2501.05403	link
2025-01-09	Accelerated Diffusion Models via Speculative Sampling	Valentin De Bortoli et.al.	2501.05370	null
2025-01-09	CROPS: Model-Agnostic Training-Free Framework for Safe Image Synthesis with Latent Diffusion Models	Junha Park et.al.	2501.05359	null
2025-01-09	Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes	Ludwic Leonard et.al.	2501.05226	link
2025-01-09	FaceMe: Robust Blind Face Restoration with Personal Identification	Siyu Liu et.al.	2501.05177	null
2025-01-09	EquiBoost: An Equivariant Boosting Approach to Molecular Conformation Generation	Yixuan Yang et.al.	2501.05109	link
2025-01-09	Recovery of activation propagation and self-sustained oscillation abilities in stroke brain networks	Yingpeng Liu et.al.	2501.05099	null
2025-01-09	ResPanDiff: Diffusion Model with Disentangled Modulations for Image Fusion	Shiqi Cao et.al.	2501.05091	null
2025-01-09	D3RM: A Discrete Denoising Diffusion Refinement Model for Piano Transcription	Hounsu Kim et.al.	2501.05068	link
2025-01-09	On a reaction-diffusion virus model with general boundary conditions in heterogeneous environments	Mingxin Wang et.al.	2501.04992	null
2025-01-09	FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching	Jun-Hak Yun et.al.	2501.04926	link
2025-01-08	Geophysical inverse problems with measurement-guided diffusion models	Matteo Ravasi et.al.	2501.04881	null
2025-01-08	Using Diffusion Models for Reducing Spatiotemporal Errors of Deep Learning Based Urban Microclimate Predictions at Post-Processing Stage	Sepehrdad Tahmasebi et.al.	2501.04847	null
2025-01-08	EditAR: Unified Conditional Generation with Autoregressive Models	Jiteng Mu et.al.	2501.04699	null
2025-01-08	ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning	Yuzhou Huang et.al.	2501.04698	null
2025-01-08	SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images	Zixuan Huang et.al.	2501.04689	null
2025-01-08	A Statistical Theory of Contrastive Pre-training and Multimodal Generative AI	Kazusato Oko et.al.	2501.04641	link
2025-01-08	Disentangled Clothed Avatar Generation with Layered Representation	Weitian Zhang et.al.	2501.04631	null
2025-01-08	MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation	Daniele Molino et.al.	2501.04614	null
2025-01-08	Enhancing Low-Cost Video Editing with Lightweight Adaptors and Temporal-Aware Inversion	Yangfan He et.al.	2501.04606	link
2025-01-08	ZSVC: Zero-shot Style Voice Conversion with Disentangled Latent Diffusion Models and Adversarial Training	Xinfa Zhu et.al.	2501.04416	null
2025-01-08	Edit as You See: Image-guided Video Editing via Masked Motion Modeling	Zhi-Lin Huang et.al.	2501.04325	null
2025-01-08	DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models	Hyogon Ryu et.al.	2501.04304	link
2025-01-08	ContextMRI: Enhancing Compressed Sensing MRI through Metadata Conditioning	Hyungjin Chung et.al.	2501.04284	link
2025-01-08	DrawSpeech: Expressive Speech Synthesis Using Prosodic Sketches as Control Conditions	Weidong Chen et.al.	2501.04256	null
2025-01-07	NeuralSVG: An Implicit Representation for Text-to-Vector Generation	Sagi Polaczek et.al.	2501.03992	null
2025-01-07	Stabilising effect of generic anomalous diffusion independent of the Rayleigh number	Antonio Barletta et.al.	2501.03990	null
2025-01-07	A precise asymptotic analysis of learning diffusion models: theory and insights	Hugo Cui et.al.	2501.03937	link
2025-01-07	Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers	Yuechen Zhang et.al.	2501.03931	link
2025-01-07	Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control	Zekai Gu et.al.	2501.03847	link
2025-01-07	Impact of diffusion mechanisms on persistence and spreading	Nathanaël Boutillon et.al.	2501.03816	null
2025-01-07	Mixing by Internal Gravity Waves in Stars: Assessing Numerical Simulations Against Theory	Jack Morton et.al.	2501.03796	null
2025-01-07	Exploring Molecule Generation Using Latent Space Graph Diffusion	Prashanth Pombala et.al.	2501.03696	link
2025-01-06	MObI: Multimodal Object Inpainting Using Diffusion Models	Alexandru Buburuzan et.al.	2501.03173	null
2025-01-06	Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches	Alhassan Mumuni et.al.	2501.03151	null
2025-01-06	DDRM-PR: Fourier Phase Retrieval using Denoising Diffusion Restoration Models	Mehmet Onurcan Kaya et.al.	2501.03030	link
2025-01-06	STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution	Rui Xie et.al.	2501.02976	null
2025-01-06	SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild	Jiawei Liu et.al.	2501.02962	null
2025-01-06	Deep Generative Model-Aided Power System Dynamic State Estimation and Reconstruction with Unknown Control Inputs or Data Distributions	Jianhua Pei et.al.	2501.02928	null
2025-01-06	Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis	Thang-Anh-Quan Nguyen et.al.	2501.02913	null
2025-01-06	Conditional Mutual Information Based Diffusion Posterior Sampling for Solving Inverse Problems	Shayan Mohajer Hamidi et.al.	2501.02880	null
2025-01-06	Towards HRTF Personalization using Denoising Diffusion Models	Juan Camilo Albarracín Sánchez et.al.	2501.02871	null
2025-01-06	Diff-Lung: Diffusion-Based Texture Synthesis for Enhanced Pathological Tissue Segmentation in Lung CT Scans	Rezkellah Noureddine Khiati et.al.	2501.02867	null
2025-01-06	InpDiffusion: Image Inpainting Localization via Conditional Diffusion Models	Kai Wang et.al.	2501.02816	null
2025-01-06	Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising	Yunlong Yuan et.al.	2501.02741	null
2025-01-06	Multilevel Semantic-Aware Model for AI-Generated Video Quality Assessment	Jiaze Li et.al.	2501.02706	null
2025-01-05	From thermodynamics to protein design: Diffusion models for biomolecule generation towards autonomous protein engineering	Wen-ran Li et.al.	2501.02680	null
2025-01-05	DepthMaster: Taming Diffusion Models for Monocular Depth Estimation	Ziyang Song et.al.	2501.02576	link
2025-01-05	Decoding fMRI Data into Captions using Prefix Language Modeling	Vyacheslav Shen et.al.	2501.02570	link
2025-01-05	Unified Guidance for Geometry-Conditioned Molecular Generation	Sirine Ayadi et.al.	2501.02526	null
2025-01-05	Face-MakeUp: Multimodal Facial Prompts for Text-to-Image Generation	Dawei Dai et.al.	2501.02523	link
2025-01-05	Layout2Scene: 3D Semantic Layout Guided Scene Generation via Geometry and Appearance Diffusion Priors	Minglin Chen et.al.	2501.02519	null
2025-01-05	ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling	Chaojie Mao et.al.	2501.02487	null
2025-01-02	Object-level Visual Prompts for Compositional Image Generation	Gaurav Parmar et.al.	2501.01424	null
2025-01-02	Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models	Jingfeng Yao et.al.	2501.01423	link
2025-01-02	Test-time Controllable Image Generation by Explicit Spatial Constraint Enforcement	Z. Zhang et.al.	2501.01368	null
2025-01-02	Conditional Consistency Guided Image Translation and Enhancement	A. V. Subramanyam et.al.	2501.01223	link
2025-01-02	Semantics-Guided Diffusion for Deep Joint Source-Channel Coding in Wireless Image Transmission	Maojun Zhang et.al.	2501.01138	link
2025-01-02	EliGen: Entity-Level Controlled Image Generation with Regional Attention	Hong Zhang et.al.	2501.01097	link
2025-01-02	DiffCL: A Diffusion-Based Contrastive Learning Framework with Semantic Alignment for Multimodal Recommendations	Qiya Song et.al.	2501.01066	null
2025-01-02	Optimizing Noise Schedules of Generative Models in High Dimensions	Santiago Aranguri et.al.	2501.00988	null
2025-01-01	Cached Adaptive Token Merging: Dynamic Token Reduction and Redundant Computation Elimination in Diffusion Model	Omid Saghatchian et.al.	2501.00946	link
2025-01-01	Diffusion Prism: Enhancing Diversity and Morphology Consistency in Mask-to-Image Diffusion	Hao Wang et.al.	2501.00944	null
2025-01-01	A Novel Diffusion Model for Pairwise Geoscience Data Generation with Unbalanced Training Dataset	Junhuan Yang et.al.	2501.00941	null
2025-01-01	Hierarchical Vision-Language Alignment for Text-to-Image Generation via Diffusion Models	Emily Johnson et.al.	2501.00917	null
2025-01-01	Diffusion Policies for Generative Modeling of Spacecraft Trajectories	Julia Briden et.al.	2501.00915	null
2025-01-01	Population Aware Diffusion for Time Series Generation	Yang Li et.al.	2501.00910	link
2025-01-01	RORem: Training a Robust Object Remover with Human-in-the-Loop	Ruibin Li et.al.	2501.00740	link
2024-12-31	SoundBrush: Sound as a Brush for Visual Scene Editing	Kim Sung-Bin et.al.	2501.00645	null
2024-12-31	Flash-Split: 2D Reflection Removal with Flash Cues and Latent Diffusion Separation	Tianfu Wang et.al.	2501.00637	null
2024-12-31	DiC: Rethinking Conv3x3 Designs in Diffusion Models	Yuchuan Tian et.al.	2501.00603	link
2024-12-31	DreamDrive: Generative 4D Scene Modeling from Street View Images	Jiageng Mao et.al.	2501.00601	null
2024-12-31	Polynomial time sampling from log-smooth distributions in fixed dimension under semi-log-concavity of the forward diffusion with application to strongly dissipative distributions	Adrien Vacher et.al.	2501.00565	null
2024-12-30	Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation	Yuanbo Yang et.al.	2412.21117	null
2024-12-30	Quantum Diffusion Model for Quark and Gluon Jet Generation	Mariia Baidachna et.al.	2412.21082	link
2024-12-30	Edicho: Consistent Image Editing in the Wild	Qingyan Bai et.al.	2412.21079	link
2024-12-30	Varformer: Adapting VAR’s Generative Prior for Image Restoration	Siyang Wang et.al.	2412.21063	link
2024-12-30	E2EDiff: Direct Mapping from Noise to Data for Enhanced Diffusion Models	Zhiyu Tan et.al.	2412.21044	null
2024-12-30	Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration	Wanglong Lu et.al.	2412.21042	link
2024-12-30	AlignAb: Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies	Yibo Wen et.al.	2412.20984	null
2024-12-30	Influence Maximization in Temporal Networks with Persistent and Reactive Behaviors	Aaqib Zahoor et.al.	2412.20936	null
2024-12-30	DDIM sampling for Generative AIBIM, a faster intelligent structural design framework	Zhili He et.al.	2412.20899	null
2024-12-30	VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control	Shaojin Wu et.al.	2412.20800	link
2024-12-30	M$^3$oralBench: A MultiModal Moral Benchmark for LVLMs	Bei Yan et.al.	2412.20718	link
2024-12-30	HFI: A unified framework for training-free detection and implicit watermarking of latent diffusion model generated images	Sungik Choi et.al.	2412.20704	null
2024-12-30	Diffgrasp: Whole-Body Grasping Synthesis Guided by Object Motion Using a Diffusion Model	Yonghao Zhang et.al.	2412.20657	null
2024-12-30	Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis	Yousef Yeganeh et.al.	2412.20651	null
2024-12-29	Zero-Shot Image Restoration Using Few-Step Guidance of Consistency Models (and Beyond)	Tomer Garber et.al.	2412.20596	link
2024-12-29	Testing and Improving the Robustness of Amortized Bayesian Inference for Cognitive Models	Yufei Wu et.al.	2412.20586	link
2024-12-29	Derivations of Animal Movement Models with Explicit Memory	Tianxu Wang et.al.	2412.20568	null
2024-12-29	DPBridge: Latent Diffusion Bridge for Dense Prediction	Haorui Ji et.al.	2412.20506	null
2024-12-29	Single-image reflection removal via self-supervised diffusion models	Zhengyang Lu et.al.	2412.20466	null
2024-12-29	Image Augmentation Agent for Weakly Supervised Semantic Segmentation	Wangyu Wu et.al.	2412.20439	null
2024-12-24	PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models	Minghao Chen et.al.	2412.18608	null
2024-12-24	DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers	Yuntao Chen et.al.	2412.18607	null
2024-12-24	Explaining in Diffusion: Explaining a Classifier Through Hierarchical Semantics with Text-to-Image Diffusion Models	Tahira Kazimi et.al.	2412.18604	null
2024-12-24	DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation	Minghong Cai et.al.	2412.18597	link
2024-12-24	LatentCRF: Continuous CRF for Efficient Latent Diffusion	Kanchana Ranasinghe et.al.	2412.18596	null
2024-12-24	Resolution-Robust 3D MRI Reconstruction with 2D Diffusion Priors: Diverse-Resolution Training Outperforms Interpolation	Anselm Krainovic et.al.	2412.18584	null
2024-12-24	3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement	Yihang Luo et.al.	2412.18565	null
2024-12-24	Fashionability-Enhancing Outfit Image Editing with Conditional Diffusion Models	Qice Qin et.al.	2412.18421	null
2024-12-24	Discovery of 2D Materials via Symmetry-Constrained Diffusion Model	Shihang Xu et.al.	2412.18414	null
2024-12-24	FameBias: Embedding Manipulation Bias Attack in Text-to-Image Models	Jaechul Roh et.al.	2412.18302	null
2024-12-24	GDM4MMIMO: Generative Diffusion Models for Massive MIMO Communications	Zhenzhou Jin et.al.	2412.18281	null
2024-12-24	Schrödinger Bridge Type Diffusion Models as an Extension of Variational Autoencoders	Kentaro Kaba et.al.	2412.18237	null
2024-12-24	Expand VSR Benchmark for VLLM to Expertize in Spatial Rules	Peijin Xie et.al.	2412.18224	link
2024-12-24	Accelerating AIGC Services with Latent Action Diffusion Scheduling in Edge Networks	Changfu Xu et.al.	2412.18212	link
2024-12-24	Stochastic Control for Fine-tuning Diffusion Models: Optimality, Regularity, and Convergence	Yinbin Han et.al.	2412.18164	null
2024-12-24	Dense-Face: Personalized Face Generation Model via Dense Annotation Prediction	Xiao Guo et.al.	2412.18149	null
2024-12-24	Ensuring Consistency for In-Image Translation	Chengpeng Fu et.al.	2412.18139	null
2024-12-23	Multi-Agent Path Finding in Continuous Spaces with Projected Diffusion Models	Jinhao Liang et.al.	2412.17993	null
2024-12-23	Causal Composition Diffusion Model for Closed-loop Traffic Generation	Haohong Lin et.al.	2412.17920	null
2024-12-23	FaceLift: Single Image to 3D Head with View Generation and GS-LRM	Weijie Lyu et.al.	2412.17812	null
2024-12-23	PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion	Sophia Tang et.al.	2412.17780	null
2024-12-23	The Superposition of Diffusion Models Using the Itô Density Estimator	Marta Skreta et.al.	2412.17762	null
2024-12-23	A Bias-Free Training Paradigm for More General AI-generated Image Detection	Fabrizio Guillaro et.al.	2412.17671	null
2024-12-23	Benchmarking Generative AI Models for Deep Learning Test Input Generation	Maryam et.al.	2412.17652	link
2024-12-23	DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder	Ente Lin et.al.	2412.17644	null
2024-12-23	Retention Score: Quantifying Jailbreak Risks for Vision Language Models	Zaitang Li et.al.	2412.17544	null
2024-12-23	DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak	Hao Wang et.al.	2412.17522	null
2024-12-23	Heterogeneous carrying capacities and global extinction in metapopulations	Jakub Hesoun et.al.	2412.17461	null
2024-12-23	AeroDiT: Diffusion Transformers for Reynolds-Averaged Navier-Stokes Simulations of Airfoil Flows	Hui Xiang et.al.	2412.17394	null
2024-12-23	Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement	Hyeonjin Kim et.al.	2412.17387	link
2024-12-23	Broadband Ground Motion Synthesis by Diffusion Model with Minimal Condition	Jaeheun Jung et.al.	2412.17333	null
2024-12-23	Free-viewpoint Human Animation with Pose-correlated Reference Selection	Fa-Ting Hong et.al.	2412.17290	null
2024-12-23	Enhancing Multi-Text Long Video Generation Consistency without Tuning: Time-Frequency Analysis, Prompt Alignment, and Theory	Xingyao Li et.al.	2412.17254	null
2024-12-23	OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving	Tianyi Yan et.al.	2412.17226	null
2024-12-23	CharGen: High Accurate Character-Level Visual Text Generation Model with MultiModal Encoder	Lichen Ma et.al.	2412.17225	null
2024-12-23	Discriminative Image Generation with Diffusion Models for Zero-Shot Learning	Dingjie Fu et.al.	2412.17219	null
2024-12-22	Generative Diffusion Modeling: A Practical Handbook	Zihan Ding et.al.	2412.17162	null
2024-12-22	Similarity Trajectories: Linking Sampling Process to Artifacts in Diffusion-Generated Images	Dennis Menn et.al.	2412.17109	null
2024-12-22	Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation	Luoxu Jin et.al.	2412.17042	null
2024-12-19	LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis	Hanlin Wang et.al.	2412.15214	link
2024-12-19	Flowing from Words to Pixels: A Framework for Cross-Modality Evolution	Qihao Liu et.al.	2412.15213	null
2024-12-19	Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation	Hadi Alzayer et.al.	2412.15211	null
2024-12-19	AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation	Moayed Haji-Ali et.al.	2412.15191	null
2024-12-19	Tiled Diffusion	Or Madar et.al.	2412.15185	null
2024-12-19	OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization	Jiacheng Zhang et.al.	2412.15159	null
2024-12-19	Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM	Yatai Ji et.al.	2412.15156	link
2024-12-19	Jet: A Modern Transformer-Based Normalizing Flow	Alexander Kolesnikov et.al.	2412.15129	null
2024-12-19	Uni-Renderer: Unifying Rendering and Inverse Rendering Via Dual Stream Diffusion	Zhifei Chen et.al.	2412.15050	null
2024-12-19	DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space	Mang Ning et.al.	2412.15032	link
2024-12-19	Stable-V2A: Synthesis of Synchronized Sound Effects with Temporal and Semantic Controls	Riccardo Fosco Gramaccioni et.al.	2412.15023	null
2024-12-19	MagicNaming: Consistent Identity Generation by Finding a “Name Space” in T2I Diffusion Models	Jing Zhao et.al.	2412.14902	null
2024-12-19	Diffusion priors for Bayesian 3D reconstruction from incomplete measurements	Julian L. Möbius et.al.	2412.14897	null
2024-12-19	Generative CKM Construction using Partially Observed Data with Diffusion Model	Shen Fu et.al.	2412.14812	null
2024-12-19	Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations	Yucheng Hu et.al.	2412.14803	null
2024-12-19	EnergyMoGen: Compositional Human Motion Generation with Energy-Based Diffusion Model in Latent Space	Jianrong Zhang et.al.	2412.14706	null
2024-12-19	Event-assisted 12-stop HDR Imaging of Dynamic Scene	Shi Guo et.al.	2412.14705	null
2024-12-19	Unified Image Restoration and Enhancement: Degradation Calibrated Cycle Reconstruction Diffusion Model	Minglong Xue et.al.	2412.14630	link
2024-12-19	Qua$^2$SeDiMo: Quantifiable Quantization Sensitivity of Diffusion Models	Keith G. Mills et.al.	2412.14628	null
2024-12-19	LDP: Generalizing to Multilingual Visual Information Extraction by Language Decoupled Pretraining	Huawen Shen et.al.	2412.14596	null
2024-12-18	AniDoc: Animation Creation Made Easier	Yihao Meng et.al.	2412.14173	null
2024-12-18	E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling	Zhihang Yuan et.al.	2412.14170	null
2024-12-18	Autoregressive Video Generation without Vector Quantization	Haoge Deng et.al.	2412.14169	link
2024-12-18	VideoDPO: Omni-Preference Alignment for Video Diffusion Generation	Runtao Liu et.al.	2412.14167	null
2024-12-18	MCMat: Multiview-Consistent and Physically Accurate PBR Material Generation	Shenhao Zhu et.al.	2412.14148	null
2024-12-18	SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation	Tong Chen et.al.	2412.14018	null
2024-12-18	Comparative Analysis of Machine Learning-Based Imputation Techniques for Air Quality Datasets with High Missing Data Rates	Sen Yan et.al.	2412.13966	null
2024-12-18	IDEQ: an improved diffusion model for the TSP	Mickael Basson et.al.	2412.13858	null
2024-12-18	Object Style Diffusion for Generalized Object Detection in Urban Scene	Hao Li et.al.	2412.13815	null
2024-12-18	Text2Relight: Creative Portrait Relighting with Text Guidance	Junuk Cha et.al.	2412.13734	null
2024-12-18	Diffusion models and stochastic quantisation in lattice field theory	Gert Aarts et.al.	2412.13704	null
2024-12-18	MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing	Chuang Yang et.al.	2412.13684	null
2024-12-18	VIIS: Visible and Infrared Information Synthesis for Severe Low-light Image Enhancement	Chen Zhao et.al.	2412.13655	link
2024-12-18	TAUDiff: Improving statistical downscaling for extreme weather events using generative diffusion models	Rahul Sundar et.al.	2412.13627	null
2024-12-18	SemiDFL: A Semi-Supervised Paradigm for Decentralized Federated Learning	Xinyang Liu et.al.	2412.13589	link
2024-12-18	Urban Air Temperature Prediction using Conditional Diffusion Models	Siyang Dai et.al.	2412.13504	null
2024-12-18	VaeDiff-DocRE: End-to-end Data Augmentation Framework for Document-level Relation Extraction	Khai Phan Tran et.al.	2412.13503	link
2024-12-18	Real-time One-Step Diffusion-based Expressive Portrait Videos Generation	Hanzhong Guo et.al.	2412.13479	link
2024-12-18	SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation	Kazuki Shimada et.al.	2412.13462	null
2024-12-18	Zero-Shot Low Light Image Enhancement with Diffusion Prior	Joshua Cho et.al.	2412.13401	link
2024-12-16	Causal Diffusion Transformers for Generative Modeling	Chaorui Deng et.al.	2412.12095	link
2024-12-16	CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models	Felix Taubner et.al.	2412.12093	null
2024-12-16	Wonderland: Navigating 3D Scenes from a Single Image	Hanwen Liang et.al.	2412.12091	null
2024-12-16	A LoRA is Worth a Thousand Pictures	Chenxi Liu et.al.	2412.12048	null
2024-12-16	The entropic optimal (self-)transport problem: Limit distributions for decreasing regularization with application to score function estimation	Gilles Mordant et.al.	2412.12007	null
2024-12-16	Controllable Shadow Generation with Single-Step Diffusion Models from Synthetic Data	Onur Tasar et.al.	2412.11972	null
2024-12-16	ColorFlow: Retrieval-Augmented Image Sequence Colorization	Junhao Zhuang et.al.	2412.11815	null
2024-12-16	InterDyn: Controllable Interactive Dynamics with Video Diffusion Models	Rick Akkerman et.al.	2412.11785	null
2024-12-16	Joint Reconstruction of the Activity and the Attenuation in PET by Diffusion Posterior Sampling: a Feasibility Study	Clémentine Phung-Ngoc et.al.	2412.11776	null
2024-12-16	No More Adam: Learning Rate Scaling at Initialization is All You Need	Minghao Xu et.al.	2412.11768	link
2024-12-16	Conditional Diffusion Models Based Conditional Independence Testing	Yanfeng Yang et.al.	2412.11744	link
2024-12-16	Re-Attentional Controllable Video Diffusion Editing	Yuanzhi Wang et.al.	2412.11710	link
2024-12-16	VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting	Muhammet Furkan Ilaslan et.al.	2412.11621	link
2024-12-16	3D$^2$-Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling	Zichen Tang et.al.	2412.11599	link
2024-12-16	StrandHead: Text to Strand-Disentangled 3D Head Avatars Using Hair Geometric Priors	Xiaokun Sun et.al.	2412.11586	link
2024-12-16	MPQ-DM: Mixed Precision Quantization for Extremely Low Bit Diffusion Models	Weilun Feng et.al.	2412.11549	link
2024-12-16	EditSplat: Multi-View Fusion and Attention-Guided Optimization for View-Consistent 3D Scene Editing with 3D Gaussian Splatting	Dong In Lee et.al.	2412.11520	null
2024-12-16	LineArt: A Knowledge-guided Training-free High-quality Appearance Transfer for Design Drawing with Diffusion Model	Xi Wang et.al.	2412.11519	null
2024-12-16	IGR: Improving Diffusion Model for Garment Restoration from Person Image	Le Shen et.al.	2412.11513	null
2024-12-16	MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes	Ruijie Lu et.al.	2412.11457	null
2024-12-12	FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion	Haonan Qiu et.al.	2412.09626	null
2024-12-12	Illusion3D: 3D Multiview Illusion with 2D Diffusion Priors	Yue Feng et.al.	2412.09625	null
2024-12-12	OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation	Weiqi Li et.al.	2412.09623	null
2024-12-12	LoRACLR: Contrastive Adaptation for Customization of Diffusion Models	Enis Simsar et.al.	2412.09622	null
2024-12-12	SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training	Dongting Hu et.al.	2412.09619	null
2024-12-12	EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM	Zhuofan Zong et.al.	2412.09618	null
2024-12-12	Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG	Kavana Venkatesh et.al.	2412.09614	null
2024-12-12	LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors	Yabo Chen et.al.	2412.09597	null
2024-12-12	Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion	Zexin He et.al.	2412.09593	null
2024-12-12	SimAvatar: Simulation-Ready Avatars with Layered Hair and Clothing	Xueting Li et.al.	2412.09545	null
2024-12-12	Learned Compression for Compressed Learning	Dan Jacobellis et.al.	2412.09405	link
2024-12-12	Diffusion Model with Representation Alignment for Protein Inverse Folding	Chenglin Wang et.al.	2412.09380	null
2024-12-12	Diffusion Predictive Control with Constraints	Ralf Römer et.al.	2412.09342	link
2024-12-12	Auto-Regressive Moving Diffusion Models for Time Series Forecasting	Jiaxin Gao et.al.	2412.09328	link
2024-12-12	Are Conditional Latent Diffusion Models Effective for Image Restoration?	Yunchen Yuan et.al.	2412.09324	null
2024-12-12	GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expression	Ziqi Zhou et.al.	2412.09296	link
2024-12-12	LatentSync: Audio Conditioned Latent Diffusion Models for Lip Sync	Chunyu Li et.al.	2412.09262	link
2024-12-12	ExpRDiff: Short-exposure Guided Diffusion Model for Realistic Local Motion Deblurring	Zhongbao Yang et.al.	2412.09193	null
2024-12-12	RAD: Region-Aware Diffusion Models for Image Inpainting	Sora Kim et.al.	2412.09191	null
2024-12-12	DECOR: Decomposition and Projection of Text Embeddings for Text-to-Image Customization	Geonhui Jang et.al.	2412.09169	null
2024-12-11	Generative Semantic Communication: Architectures, Technologies, and Applications	Jinke Ren et.al.	2412.08642	null
2024-12-11	DMin: Scalable Training Data Influence Estimation for Diffusion Models	Huawei Lin et.al.	2412.08637	link
2024-12-11	TryOffAnyone: Tiled Cloth Generation from a Dressed Person	Ioannis Xarchakos et.al.	2412.08573	link
2024-12-11	Learning Flow Fields in Attention for Controllable Person Image Generation	Zijian Zhou et.al.	2412.08486	link
2024-12-11	InvDiff: Invariant Guidance for Bias Mitigation in Diffusion Models	Min Hou et.al.	2412.08480	link
2024-12-11	CC-Diff: Enhancing Contextual Coherence in Remote Sensing Image Synthesis	Mu Zhang et.al.	2412.08464	null
2024-12-11	Reliable Uncertainty Quantification for Fiber Orientation in Composite Molding Processes using Multilevel Polynomial Surrogates	Stjepan Salatovic et.al.	2412.08459	null
2024-12-11	Pragmatist: Multiview Conditional Diffusion Models for High-Fidelity 3D Reconstruction from Unposed Sparse Views	Songchun Zhang et.al.	2412.08412	null
2024-12-11	Grasp Diffusion Network: Learning Grasp Generators from Partial Point Clouds with Diffusion Models in SO(3)xR3	Joao Carvalho et.al.	2412.08398	null
2024-12-11	Digging into Intrinsic Contextual Information for High-fidelity 3D Point Cloud Completion	Jisheng Chu et.al.	2412.08326	link
2024-12-11	GDSG: Graph Diffusion-based Solution Generation for Optimization Problems in MEC Networks	Ruihuai Liang et.al.	2412.08296	link
2024-12-11	Self-Refining Diffusion Samplers: Enabling Parallelization via Parareal Iterations	Nikil Roashan Selvam et.al.	2412.08292	link
2024-12-11	Toward Near-Globally Optimal Nonlinear Model Predictive Control via Diffusion Models	Tzu-Yuan Huang et.al.	2412.08278	null
2024-12-11	Unicorn: Unified Neural Image Compression with One Number Reconstruction	Qi Zheng et.al.	2412.08210	null
2024-12-11	LatentSpeech: Latent Diffusion for Text-To-Speech Generation	Haowei Lou et.al.	2412.08117	null
2024-12-11	DAKD: Data Augmentation and Knowledge Distillation using Diffusion Models for SAR Oil Spill Segmentation	Jaeho Moon et.al.	2412.08116	null
2024-12-10	Diffusion-Based Attention Warping for Consistent 3D Scene Editing	Eyal Gomel et.al.	2412.07984	null
2024-12-10	Non-Normal Diffusion Models	Henry Li et.al.	2412.07935	null
2024-12-10	Score Change of Variables	Stephen Robbins et.al.	2412.07904	null
2024-12-10	Score-Optimal Diffusion Schedules	Christopher Williams et.al.	2412.07877	null
2024-12-09	[MASK] is All You Need	Vincent Tao Hu et.al.	2412.06787	link
2024-12-09	Tactile DreamFusion: Exploiting Tactile Sensing for 3D Generation	Ruihan Gao et.al.	2412.06785	link
2024-12-09	Diverse Score Distillation	Yanbo Xu et.al.	2412.06780	null
2024-12-09	Visual Lexicon: Rich Image Features in Language Space	XuDong Wang et.al.	2412.06774	null
2024-12-09	InstantRestore: Single-Step Personalized Face Restoration with Shared-Image Attention	Howard Zhang et.al.	2412.06753	null
2024-12-09	ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet	Andrei-Robert Alexandrescu et.al.	2412.06742	null
2024-12-09	Take Fake as Real: Realistic-like Robust Black-box Adversarial Attack to Evade AIGC Detection	Caiyun Xie et.al.	2412.06727	link
2024-12-09	You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale	Baorui Ma et.al.	2412.06699	link
2024-12-09	Gen-3Diffusion: Realistic Image-to-3D Generation via 2D & 3D Diffusion Synergy	Yuxuan Xue et.al.	2412.06698	null
2024-12-09	Diff5T: Benchmarking Human Brain Diffusion MRI with an Extensive 5.0 Tesla K-Space and Spatial Dataset	Shanshan Wang et.al.	2412.06666	null
2024-12-09	Efficiency Meets Fidelity: A Novel Quantization Framework for Stable Diffusion	Shuaiting Li et.al.	2412.06661	null
2024-12-09	MVReward: Better Aligning and Evaluating Multi-View Diffusion Models with Human Preferences	Weitao Wang et.al.	2412.06614	null
2024-12-09	Diffusion on the circle and a stochastic correlation model	Sourav Majumdar et.al.	2412.06343	null
2024-12-09	Normalizing Flows are Capable Generative Models	Shuangfei Zhai et.al.	2412.06329	link
2024-12-09	See Further When Clear: Curriculum Consistency Model	Yunpeng Liu et.al.	2412.06295	null
2024-12-09	No Annotations for Object Detection in Art through Stable Diffusion	Patrick Ramos et.al.	2412.06286	link
2024-12-09	Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction	Dongxu Wei et.al.	2412.06273	null
2024-12-09	Rendering-Refined Stable Diffusion for Privacy Compliant Synthetic Data	Kartik Patwari et.al.	2412.06248	null
2024-12-09	ASGDiffusion: Parallel High-Resolution Generation with Asynchronous Structure Guidance	Yuming Li et.al.	2412.06163	null
2024-12-09	Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters	Yuan Wang et.al.	2412.06143	link
2024-12-05	PaintScene4D: Consistent 4D Scene Generation from Text Prompts	Vinayak Gupta et.al.	2412.04471	null
2024-12-05	LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors	Yusuf Dalva et.al.	2412.04460	null
2024-12-05	Four-Plane Factorized Video Autoencoders	Mohammed Suhail et.al.	2412.04452	null
2024-12-05	MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation	Longtao Zheng et.al.	2412.04448	null
2024-12-05	DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models	Yizhuo Li et.al.	2412.04446	null
2024-12-05	Learning Artistic Signatures: Symmetry Discovery and Style Transfer	Emma Finn et.al.	2412.04441	null
2024-12-05	Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation	Yuying Ge et.al.	2412.04432	link
2024-12-05	Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis	Jian Han et.al.	2412.04431	link
2024-12-05	Reversible molecular simulation for training classical and machine learning force fields	Joe G Greener et.al.	2412.04374	link
2024-12-05	ActFusion: a Unified Diffusion Model for Action Segmentation and Anticipation	Dayoung Gong et.al.	2412.04353	null
2024-12-05	RMD: A Simple Baseline for More General Human Motion Generation via Training-free Retrieval-Augmented Motion Diffuse	Zhouyingcheng Liao et.al.	2412.04343	null
2024-12-05	Multi-Subject Image Synthesis as a Generative Prior for Single-Subject PET Image Reconstruction	George Webber et.al.	2412.04324	null
2024-12-05	Structure-Aware Stylized Image Synthesis for Robust Medical Image Segmentation	Jie Bao et.al.	2412.04296	link
2024-12-05	LMDM: Latent Molecular Diffusion Model For 3D Molecule Generation	Xiang Chen et.al.	2412.04242	null
2024-12-05	CALMM-Drive: Confidence-Aware Autonomous Driving with Large Multimodal Model	Ruoyu Yao et.al.	2412.04209	null
2024-12-05	Instructional Video Generation	Yayuan Li et.al.	2412.04189	null
2024-12-05	AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models	Xinghui Li et.al.	2412.04146	null
2024-12-05	Understanding Memorization in Generative Models via Sharpness in Probability Landscapes	Dongjae Jeon et.al.	2412.04140	null
2024-12-05	Compositional Generative Multiphysics and Multi-component Simulation	Tao Zhang et.al.	2412.04134	link
2024-12-05	IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation	Sejong Yang et.al.	2412.04000	null
2024-12-04	MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation	Zehuan Huang et.al.	2412.03558	null
2024-12-04	NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images	Lingen Li et.al.	2412.03517	null
2024-12-04	Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion	Shengyuan Zhang et.al.	2412.03515	link
2024-12-04	CleanDIFT: Diffusion Features without Noise	Nick Stracke et.al.	2412.03439	link
2024-12-04	SINGER: Vivid Audio-driven Singing Video Generation with Multi-scale Spectral Diffusion Model	Yan Li et.al.	2412.03430	null
2024-12-04	Skel3D: Skeleton Guided Novel View Synthesis	Aron Fóthi et.al.	2412.03407	null
2024-12-04	Identifiability implies consistency of MLE in partially observed diffusions on a torus	Ibrahim Ekren et.al.	2412.03380	null
2024-12-04	TASR: Timestep-Aware Diffusion Model for Image Super-Resolution	Qinwei Lin et.al.	2412.03355	link
2024-12-04	DIVE: Taming DINO for Subject-Driven Video Editing	Yi Huang et.al.	2412.03347	null
2024-12-04	Geometry-guided Cross-view Diffusion for One-to-many Cross-view Image Synthesis	Tao Jun Lin et.al.	2412.03315	null
2024-12-04	Diffusion-VLA: Scaling Robot Foundation Models via Unified Diffusion and Autoregression	Junjie Wen et.al.	2412.03293	null
2024-12-04	Black-Box Forgery Attacks on Semantic Watermarks for Diffusion Models	Andreas Müller et.al.	2412.03283	null
2024-12-04	Generating Synthetic Genotypes using Diffusion Models	Philip Kenneweg et.al.	2412.03278	link
2024-12-04	RFSR: Improving ISR Diffusion Models via Reward Feedback Learning	Xiaopeng Sun et.al.	2412.03268	link
2024-12-04	DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation	Qingdong He et.al.	2412.03255	null
2024-12-04	A seamless local-nonlocal coupling diffusion model with $H^1$ vanishing nonlocality convergence	Yanzun Meng et.al.	2412.03153	null
2024-12-04	Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis	Siyoon Jin et.al.	2412.03150	null
2024-12-04	Generalized Diffusion Model with Adjusted Offset Noise	Takuro Kutsuna et.al.	2412.03134	null
2024-12-04	MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction	Gangjian Zhang et.al.	2412.03103	null
2024-12-04	Mimir: Improving Video Diffusion Models for Precise Text Understanding	Shuai Tan et.al.	2412.03085	null
2024-11-29	MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks	Yiming Wu et.al.	2411.19786	null
2024-11-29	Riemannian Denoising Score Matching for Molecular Structure Optimization with Accurate Energy	Jeheon Woo et.al.	2411.19769	null
2024-11-29	TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting	Bojun Xiong et.al.	2411.19654	link
2024-11-29	Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing	Wenyi Mo et.al.	2411.19652	link
2024-11-29	Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook	Florinel-Alin Croitoru et.al.	2411.19537	link
2024-11-29	Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis	Tianqi Li et.al.	2411.19509	link
2024-11-29	Diffusion Models Meet Network Management: Improving Traffic Matrix Analysis with Diffusion-based Approach	Xinyu Yuan et.al.	2411.19493	link
2024-11-28	DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models	Shwetha Ram et.al.	2411.19390	null
2024-11-28	Enhancing Sketch Animation: Text-to-Video Diffusion Models with Temporal Consistency and Rigidity Constraints	Gaurav Rai et.al.	2411.19381	null
2024-11-28	Towards a Mechanistic Explanation of Diffusion Model Generalization	Matthew Niedoba et.al.	2411.19339	null
2024-11-28	Trajectory Attention for Fine-grained Video Motion Control	Zeqi Xiao et.al.	2411.19324	null
2024-11-28	Improving Multi-Subject Consistency in Open-Domain Image Generation with Isolation and Reposition Attention	Huiguo He et.al.	2411.19261	null
2024-11-28	Gaussians-to-Life: Text-Driven Animation of 3D Gaussian Splatting Scenes	Thomas Wimmer et.al.	2411.19233	link
2024-11-28	Z-STAR+: A Zero-shot Style Transfer Method via Adjusting Style Distribution	Yingying Deng et.al.	2411.19231	null
2024-11-28	Video Depth without Video Models	Bingxin Ke et.al.	2411.19189	null
2024-11-28	SOWing Information: Cultivating Contextual Coherence with MLLMs in Image Generation	Yuhan Pei et.al.	2411.19182	null
2024-11-28	Bayesian Deconvolution of Astronomical Images with Diffusion Models: Quantifying Prior-Driven Features in Reconstructions	Alessio Spagnoletti et.al.	2411.19158	link
2024-11-28	Timestep Embedding Tells: It’s Time to Cache for Video Diffusion Model	Feng Liu et.al.	2411.19108	null
2024-11-28	I Dream My Painting: Connecting MLLMs and Diffusion Models via Prompt Generation for Text-Guided Multi-Mask Inpainting	Nicola Fanelli et.al.	2411.19050	link
2024-11-28	3D-WAG: Hierarchical Wavelet-Guided Autoregressive Generation for High-Fidelity 3D Shapes	Tejaswini Medi et.al.	2411.19037	null
2024-11-27	GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data	Wentao Wang et.al.	2411.18624	null
2024-11-27	Diffusion Self-Distillation for Zero-Shot Customized Image Generation	Shengqu Cai et.al.	2411.18616	null
2024-11-27	CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models	Rundi Wu et.al.	2411.18613	null
2024-11-27	Evaluating and Improving the Effectiveness of Synthetic Chest X-Rays for Medical Image Analysis	Eva Prakash et.al.	2411.18602	null
2024-11-27	FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion	Haosen Yang et.al.	2411.18552	null
2024-11-27	Enhancing weed detection performance by means of GenAI-based image augmentation	Sourav Modak et.al.	2411.18513	null
2024-11-27	Learning the Evolution of Physical Structure of Galaxies via Diffusion Models	Andrew Lizarraga et.al.	2411.18440	link
2024-11-27	Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models	Yiming Wu et.al.	2411.18375	null
2024-11-27	TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models	Riza Velioglu et.al.	2411.18350	link
2024-11-27	HiFiVFS: High Fidelity Video Face Swapping	Xu Chen et.al.	2411.18293	null
2024-11-27	TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution	Linwei Dong et.al.	2411.18263	link
2024-11-27	Dependency-Aware CAV Task Scheduling via Diffusion-Based Reinforcement Learning	Xiang Cheng et.al.	2411.18230	null
2024-11-27	Uniqueness and regularity of weak solutions of a drift-diffusion system for perovskite solar cells	Annegret Glitzky et.al.	2411.18223	null
2024-11-27	Prediction with Action: Visual Policy Learning via Joint Denoising Process	Yanjiang Guo et.al.	2411.18179	null
2024-11-27	ModeDreamer: Mode Guiding Score Distillation for Text-to-3D Generation using Reference Image Prompts	Uy Dieu Tran et.al.	2411.18135	null
2024-11-27	Training Data Synthesis with Difficulty Controlled Diffusion Model	Zerun Wang et.al.	2411.18109	null
2024-11-27	PersonaCraft: Personalized Full-Body Image Synthesis for Multiple Identities from Single References Using 3D-Model-Conditioned Diffusion	Gwanghyun Kim et.al.	2411.18068	null
2024-11-27	Generative Semantic Communication for Joint Image Transmission and Segmentation	Weiwen Yuan et.al.	2411.18005	null
2024-11-27	Improved implicit diffusion model with knowledge distillation to estimate the spatial distribution density of carbon stock in remote sensing imagery	Zhenyu Yu et.al.	2411.17973	null
2024-11-27	ROICtrl: Boosting Instance Control for Visual Generation	Yuchao Gu et.al.	2411.17949	null
2024-11-25	Generative Omnimatte: Learning to Decompose Video into Layers	Yao-Chih Lee et.al.	2411.16683	null
2024-11-25	Diffusion Features for Zero-Shot 6DoF Object Pose Estimation	Bernd Von Gimborn et.al.	2411.16668	null
2024-11-25	LegoPET: Hierarchical Feature Guided Conditional Diffusion for PET Image Reconstruction	Yiran Sun et.al.	2411.16629	link
2024-11-25	Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models	Ronghuan Wu et.al.	2411.16602	null
2024-11-25	Unlocking The Potential of Adaptive Attacks on Diffusion-Based Purification	Andre Kassis et.al.	2411.16598	link
2024-11-25	Rethinking Diffusion for Text-Driven Human Motion Generation	Zichong Meng et.al.	2411.16575	null
2024-11-25	Representation Collapsing Problems in Vector Quantization	Wenhao Zhao et.al.	2411.16550	null
2024-11-25	ADOBI: Adaptive Diffusion Bridge For Blind Inverse Problems with Application to MRI Reconstruction	Yuyang Hu et.al.	2411.16535	null
2024-11-25	Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis	Boming Miao et.al.	2411.16503	null
2024-11-25	Model-based reinforcement corrosion prediction: Continuous calibration with Bayesian optimization and corrosion wire sensor data	A. Potnis et.al.	2411.16447	null
2024-11-25	Privacy Protection in Personalized Diffusion Models via Targeted Cross-Attention Adversarial Attack	Xide Xu et.al.	2411.16437	null
2024-11-25	Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing	Kaifeng Gao et.al.	2411.16375	link
2024-11-25	One Diffusion to Generate Them All	Duong H. Le et.al.	2411.16318	link
2024-11-25	An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models	Wentao Qu et.al.	2411.16308	link
2024-11-25	DiffDesign: Controllable Diffusion with Meta Prior for Efficient Interior Design Generation	Yuxuan Yang et.al.	2411.16301	null
2024-11-25	SMGDiff: Soccer Motion Generation using diffusion probabilistic models	Hongdi Yang et.al.	2411.16216	null
2024-11-25	Fancy123: One Image to High-Quality 3D Mesh Generation via Plug-and-Play Deformation	Qiao Yu et.al.	2411.16185	link
2024-11-25	Image Generation Diversity Issues and How to Tame Them	Mischa Dombrowski et.al.	2411.16171	link
2024-11-25	Text-to-Image Synthesis: A Decade Survey	Nonghai Zhang et.al.	2411.16164	null
2024-11-25	MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model	Chenjie Cao et.al.	2411.16157	link
2024-11-21	Stable Flow: Vital Layers for Training-Free Image Editing	Omri Avrahami et.al.	2411.14430	link
2024-11-21	Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation	Yuanhao Cai et.al.	2411.14384	null
2024-11-21	CoNFiLD-inlet: Synthetic Turbulence Inflow Using Generative Latent Diffusion Models with Neural Fields	Xin-Yang Liu et.al.	2411.14378	null
2024-11-21	Enhancing Medical Image Segmentation with Deep Learning and Diffusion Models	Houze Liu et.al.	2411.14353	null
2024-11-21	StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart	Jian Shi et.al.	2411.14295	link
2024-11-21	Guided MRI Reconstruction via Schrödinger Bridge	Yue Wang et.al.	2411.14269	null
2024-11-21	TaQ-DiT: Time-aware Quantization for Diffusion Transformers	Xinyan Liu et.al.	2411.14172	null
2024-11-21	RestorerID: Towards Tuning-Free Face Restoration with ID Preservation	Jiacheng Ying et.al.	2411.14125	link
2024-11-21	Point Cloud Resampling with Learnable Heat Diffusion	Wenqiang Xu et.al.	2411.14120	null
2024-11-21	Transforming Static Images Using Generative Models for Video Salient Object Detection	Suhwan Cho et.al.	2411.13975	link
2024-11-21	Decoupled Sparse Priors Guided Diffusion Compression Model for Point Clouds	Xiaoge Zhang et.al.	2411.13860	null
2024-11-21	Detecting Human Artifacts from Text-to-Image Models	Kaihong Wang et.al.	2411.13842	link
2024-11-21	CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation	Lin Sun et.al.	2411.13836	link
2024-11-21	MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control	Ruiyuan Gao et.al.	2411.13807	null
2024-11-20	Non-Linear Outlier Synthesis for Out-of-Distribution Detection	Lars Doorenbos et.al.	2411.13619	link
2024-11-20	REDUCIO! Generating $1024 \times 1024$ Video within 16 Seconds using Extremely Compressed Motion Latents	Rui Tian et.al.	2411.13552	link
2024-11-20	Identity Preserving 3D Head Stylization with Multiview Score Distillation	Bahri Batuhan Bilecen et.al.	2411.13536	null
2024-11-20	Heuristically Adaptive Diffusion-Model Evolutionary Strategy	Benedikt Hartl et.al.	2411.13420	null
2024-11-20	XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation	Ziyi Wang et.al.	2411.13243	link
2024-11-20	A computational framework for integrating Predictive processes with evidence Accumulation Models (PAM)	Antonino Visalli et.al.	2411.13203	link
2024-11-20	RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation	Christoph Reinders et.al.	2411.13150	link
2024-11-20	CopyrightMeter: Revisiting Copyright Protection in Text-to-image Models	Naen Xu et.al.	2411.13144	null
2024-11-20	Virtual Staining of Label-Free Tissue in Imaging Mass Spectrometry	Yijie Zhang et.al.	2411.13120	null
2024-11-19	Breaking the wire: the impact of critical length on melting pathways in silver nanowires	Kannan M Ridings et.al.	2411.12891	null
2024-11-19	From Text to Pose to Image: Improving Diffusion Model Control and Quality	Clément Bonnett et.al.	2411.12872	link
2024-11-19	CDI: Copyrighted Data Identification in Diffusion Models	Jan Dubiński et.al.	2411.12858	link
2024-11-19	Towards motion from video diffusion models	Paul Janson et.al.	2411.12831	null
2024-11-19	Stylecodes: Encoding Stylistic Information For Image Generation	Ciara Rowles et.al.	2411.12811	link
2024-11-19	PoM: Efficient Image and Video Generation with the Polynomial Mixer	David Picard et.al.	2411.12663	link
2024-11-19	Improving Controllability and Editability for Pretrained Text-to-Music Generation Models	Yixiao Zhang et.al.	2411.12641	null
2024-11-19	Data Pruning in Generative Diffusion Models	Rania Briq et.al.	2411.12523	link
2024-11-19	Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models	Jun Xiao et.al.	2411.12450	null
2024-11-19	Combinational Backdoor Attack against Customized Text-to-Image Models	Wenbo Jiang et.al.	2411.12389	null
2024-11-19	Scalable and Effective Negative Sample Generation for Hyperedge Prediction	Shilin Qu et.al.	2411.12354	null
2024-11-19	Diffusion Product Quantization	Jie Shao et.al.	2411.12306	null
2024-11-18	Aligning Few-Step Diffusion Models with Dense Reward Difference Learning	Ziyi Zhang et.al.	2411.11727	link
2024-11-18	Robust Reinforcement Learning under Diffusion Models for Data with Jumps	Chenyang Jiang et.al.	2411.11697	null
2024-11-18	Conceptwm: A Diffusion Model Watermark for Concept Protection	Liangqi Lei et.al.	2411.11688	null
2024-11-18	Cascaded Diffusion Models for 2D and 3D Microscopy Image Synthesis to Enhance Cell Segmentation	Rüveyda Yilmaz et.al.	2411.11515	link
2024-11-18	MVLight: Relightable Text-to-3D Generation via Light-conditioned Multi-View Diffusion	Dongseok Shim et.al.	2411.11475	null
2024-11-18	CLUE-MARK: Watermarking Diffusion Models using CLWE	Kareem Shehata et.al.	2411.11434	null
2024-11-18	Teaching Video Diffusion Model with Latent Physical Phenomenon Knowledge	Qinglong Cao et.al.	2411.11343	null
2024-11-18	Stochastic quantization and diffusion models	Kenji Fukushima et.al.	2411.11297	null
2024-11-17	Stealing Training Graphs from Graph Neural Networks	Minhua Lin et.al.	2411.11197	null
2024-11-17	DeepSPV: An Interpretable Deep Learning Pipeline for 3D Spleen Volume Estimation from 2D Ultrasound Images	Zhen Yuan et.al.	2411.11190	null
2024-11-17	Integrated Ising Model with global inhibition for decision making	Olga Tapinova et.al.	2411.11143	null
2024-11-17	Oscillation Inversion: Understand the structure of Large Flow Model through the Lens of Inversion Method	Yan Zheng et.al.	2411.11135	null
2024-11-17	Dynamic Dimensioning of Frequency Containment Reserves: The Case of the Nordic Grid	Jöbke Janssen et.al.	2411.11093	null
2024-11-17	D-Cube: Exploiting Hyper-Features of Diffusion Model for Robust Medical Classification	Minhee Jang et.al.	2411.11087	link
2024-11-17	Time Step Generating: A Universal Synthesized Deepfake Image Detector	Ziyue Zeng et.al.	2411.11016	link
2024-11-17	Direct and Explicit 3D Generation from a Single Image	Haoyu Wu et.al.	2411.10947	null
2024-11-17	Iterative Camera-LiDAR Extrinsic Optimization via Surrogate Diffusion	Ni Ou et.al.	2411.10936	null
2024-11-17	Constrained Diffusion with Trust Sampling	William Huang et.al.	2411.10932	link
2024-11-16	Generating Compositional Scenes via Text-to-image RGBA Instance Generation	Alessandro Fontanella et.al.	2411.10913	null
2024-11-16	MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation	Ansh Shah et.al.	2411.10886	link
2024-11-14	Golden Noise for Diffusion Models: A Learning Framework	Zikai Zhou et.al.	2411.09502	link
2024-11-14	DiffRoad: Realistic and Diverse Road Scenario Generation for Autonomous Vehicle Testing	Junjie Zhou et.al.	2411.09451	null
2024-11-14	Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models	Chutian Meng et.al.	2411.09449	null
2024-11-12	Mediffusion: Joint Diffusion for Self-Explainable Semi-Supervised Classification and Medical Image Generation	Joanna Kaleta et.al.	2411.09434	null
2024-11-14	A survey of probabilistic generative frameworks for molecular simulations	Richard John et.al.	2411.09388	link
2024-11-14	EEG-Based Speech Decoding: A Novel Approach Using Multi-Kernel Ensemble Diffusion Models	Soowon Kim et.al.	2411.09302	null
2024-11-14	Advancing Diffusion Models: Alias-Free Resampling and Enhanced Rotational Equivariance	Md Fahim Anjum et.al.	2411.09174	null
2024-11-14	VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation	Youpeng Wen et.al.	2411.09153	null
2024-11-14	General linear threshold models with application to influence maximization	Alexander Kagan et.al.	2411.09100	link
2024-11-13	Inconsistencies In Consistency Models: Better ODE Solving Does Not Imply Better Samples	Noël Vouitsis et.al.	2411.08954	link
2024-11-13	4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization	Mijeong Kim et.al.	2411.08879	null
2024-11-13	Offline Adaptation of Quadruped Locomotion using Diffusion Models	Reece O’Mahoney et.al.	2411.08832	link
2024-11-13	Towards More Accurate Fake Detection on Images Generated from Advanced Generative and Neural Rendering Models	Chengdong Dong et.al.	2411.08642	null
2024-11-13	V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion	Xun Huang et.al.	2411.08402	link
2024-11-13	Physics Informed Distillation for Diffusion Models	Joshua Tian Jin Tee et.al.	2411.08378	link
2024-11-13	Generative AI for Data Augmentation in Wireless Networks: Analysis, Applications, and Case Study	Jinbo Wen et.al.	2411.08341	null
2024-11-13	Motion Control for Enhanced Complex Action Video Generation	Qiang Zhou et.al.	2411.08328	null
2024-11-13	DNN Task Assignment in UAV Networks: A Generative AI Enhanced Multi-Agent Reinforcement Learning Approach	Xin Tang et.al.	2411.08299	null
2024-11-12	Joint Diffusion models in Continual Learning	Paweł Skierś et.al.	2411.08224	null
2024-11-12	Latent Space Disentanglement in Diffusion Transformers Enables Precise Zero-shot Semantic Editing	Zitao Shuai et.al.	2411.08196	null
2024-11-12	Well-posedness of a Variable-Exponent Telegraph Equation Applied to Image Despeckling	Sudeb Majee et.al.	2411.08175	null
2024-11-12	An age-structured diffusive model for epidemic modelling: Lie symmetries and exact solutions	Roman Cherniha et.al.	2411.08083	null
2024-11-13	Scaling Properties of Diffusion Models for Perceptual Tasks	Rahul Ravishankar et.al.	2411.08034	null
2024-11-12	GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation	Yushi Lan et.al.	2411.08033	null
2024-11-12	Diverse capability and scaling of diffusion and auto-regressive models when learning abstract rules	Binxu Wang et.al.	2411.07873	null
2024-11-12	Novel View Synthesis with Pixel-Space Diffusion Models	Noam Elata et.al.	2411.07765	null
2024-11-12	Nanosecond nanothermometry in an electron microscope	Florian Castioni et.al.	2411.07764	null
2024-11-12	Leveraging Previous Steps: A Training-free Fast Solver for Flow Diffusion	Kaiyu Song et.al.	2411.07627	null
2024-11-12	Unraveling the Connections between Flow Matching and Diffusion Probabilistic Models in Training-free Conditional Generation	Kaiyu Song et.al.	2411.07625	null
2024-11-12	Harmonizing Pixels and Melodies: Maestro-Guided Film Score Generation and Composition Style Transfer	F. Qi et.al.	2411.07539	null
2024-11-11	Score-based generative diffusion with “active” correlated noise sources	Alexandra Lamtyugina et.al.	2411.07233	null
2024-11-11	Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models	Yoad Tewel et.al.	2411.07232	null
2024-11-11	DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID	Nyle Siddiqui et.al.	2411.07205	link
2024-11-11	Crossover from inhomogeneous to homogeneous response of a resonantly driven hBN quantum emitter	Domitille Gérard et.al.	2411.07202	null
2024-11-11	OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision	Cong Wei et.al.	2411.07199	null
2024-11-11	More Expressive Attention with Negative Weights	Ang Lv et.al.	2411.07176	link
2024-11-11	Edify 3D: Scalable High-Quality 3D Asset Generation	NVIDIA et.al.	2411.07135	null
2024-11-11	Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models	NVIDIA et.al.	2411.07126	null
2024-11-11	White-Box Diffusion Transformer for single-cell RNA-seq generation	Zhuorui Cui et.al.	2411.06785	link
2024-11-11	DiffSR: Learning Radar Reflectivity Synthesis via Diffusion Model from Satellite Observations	Xuming He et.al.	2411.06714	null
2024-11-11	Layout Control and Semantic Guidance with Attention Loss Backward for T2I Diffusion Model	Guandong Li et.al.	2411.06692	null
2024-11-11	SeedEdit: Align Image Re-Generation to Image Editing	Yichun Shi et.al.	2411.06686	null
2024-11-10	Using Diffusion Models as Generative Replay in Continual Federated Learning – What will Happen?	Yongsheng Mei et.al.	2411.06618	null
2024-11-10	CASC: Condition-Aware Semantic Communication with Latent Diffusion Models	Weixuan Chen et.al.	2411.06552	null
2024-11-10	Numerical analysis of the cross-diffusion Cahn-Hilliard model in lymphangiogenesis	Boyi Wang et.al.	2411.06488	null
2024-11-10	Improved Video VAE for Latent Video Diffusion Model	Pingyu Wu et.al.	2411.06449	null
2024-11-10	Detecting AutoEncoder is Enough to Catch LDM Generated Images	Dmitry Vesnin et.al.	2411.06441	link
2024-11-10	PLM-Based Discrete Diffusion Language Models with Entropy-Adaptive Gibbs Sampling	Hyukhun Koh et.al.	2411.06438	null
2024-11-09	Exploring Out-of-distribution Detection for Sparse-view Computed Tomography with Diffusion Models	Ezgi Demircan-Tureyen et.al.	2411.06308	null
2024-11-09	Text2CAD: Text to 3D CAD Generation via Technical Drawings	Mohsen Yavartanoo et.al.	2411.06206	null
2024-11-07	SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models	Muyang Li et.al.	2411.05007	link
2024-11-07	ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing	Jun-Kun Chen et.al.	2411.05006	null
2024-11-07	Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models	Shuhong Zheng et.al.	2411.05005	null
2024-11-07	ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning	David Junhao Zhang et.al.	2411.05003	null
2024-11-07	SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation	Koichi Namekata et.al.	2411.04989	null
2024-11-07	Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification	Mischa Dombrowski et.al.	2411.04956	null
2024-11-07	DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion	Wenqiang Sun et.al.	2411.04928	null
2024-11-07	Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion Inversion	Kaizhe Hu et.al.	2411.04919	link
2024-11-06	Boosting Latent Diffusion with Perceptual Objectives	Tariq Berrada et.al.	2411.04873	null
2024-11-07	Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation	Benito Buchheim et.al.	2411.04724	null
2024-11-07	DanceFusion: A Spatio-Temporal Skeleton Diffusion Transformer for Audio-Driven Dance Motion Reconstruction	Li Zhao et.al.	2411.04646	null
2024-11-07	Brain Tumour Removing and Missing Modality Generation using 3D WDM	André Ferreira et.al.	2411.04630	link
2024-11-07	Social EgoMesh Estimation	Luca Scofano et.al.	2411.04598	link
2024-11-07	Series-to-Series Diffusion Bridge Model	Hao Yang et.al.	2411.04491	null
2024-11-07	HandCraft: Anatomically Correct Restoration of Malformed Hands in Diffusion Generated Images	Zhenyue Qin et.al.	2411.04332	null
2024-11-06	PocoLoco: A Point Cloud Diffusion Model of Human Shape in Loose Clothing	Siddharth Seth et.al.	2411.04249	link
2024-11-06	Quantum Diffusion Models for Few-Shot Learning	Ruhan Wang et.al.	2411.04217	null
2024-11-06	DiMSUM: Diffusion Mamba – A Scalable and Unified Spatial-Frequency Method for Image Generation	Hao Phung et.al.	2411.04168	link
2024-11-06	Community Forensics: Using Thousands of Generators to Train Fake Image Detectors	Jeongsoo Park et.al.	2411.04125	link
2024-11-06	Synomaly Noise and Multi-Stage Diffusion: A Novel Approach for Unsupervised Anomaly Detection in Ultrasound Imaging	Yuan Bi et.al.	2411.04004	link
2024-11-06	ET-SEED: Efficient Trajectory-Level SE(3) Equivariant Diffusion Policy	Chenrui Tie et.al.	2411.03990	null
2024-11-06	ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models	Ashutosh Srivastava et.al.	2411.03982	null
2024-11-06	ROBIN: Robust and Invisible Watermarks for Diffusion Models with Adversarial Optimization	Huayang Huang et.al.	2411.03862	link
2024-11-06	Sub-DM: Subspace Diffusion Model with Orthogonal Decomposition for MRI Reconstruction	Yu Guan et.al.	2411.03758	link
2024-11-06	Zero-shot Dynamic MRI Reconstruction with Global-to-local Diffusion Model	Yu Guan et.al.	2411.03723	link
2024-11-06	Investigating Conceptual Blending of a Diffusion Model for Improving Nonword-to-Image Generation	Chihaya Matsuhira et.al.	2411.03595	null
2024-11-05	Estimating Ego-Body Pose from Doubly Sparse Egocentric Video Data	Seunggeun Chi et.al.	2411.03561	null
2024-11-05	SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture	Andrew Heschl et.al.	2411.03505	link
2024-11-05	DM4Steal: Diffusion Model For Link Stealing Attack On Graph Neural Networks	Jinyin Chen et.al.	2411.03364	null
2024-11-05	DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models	Ying Zhou et.al.	2411.03250	null
2024-11-05	On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models	Tariq Berrada Ifriqi et.al.	2411.03177	null
2024-11-05	Unleashing the power of novel conditional generative approaches for new materials discovery	Lev Novitskiy et.al.	2411.03156	link
2024-11-05	Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising	Tao Huang et.al.	2411.03053	null
2024-11-05	GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details	Zhongjin Luo et.al.	2411.03047	null
2024-11-05	IMUDiffusion: A Diffusion Model for Multivariate Time Series Synthetisation for Inertial Motion Capturing Systems	Heiko Oppel et.al.	2411.02954	null
2024-11-05	LDPM: Towards undersampled MRI reconstruction with MR-VAE and Latent Diffusion Prior	Xingjian Tang et.al.	2411.02951	null
2024-11-05	How much is a noisy image worth? Data Scaling Laws for Ambient Diffusion	Giannis Daras et.al.	2411.02780	link
2024-11-04	Modelling Alzheimer’s Protein Dynamics: A Data-Driven Integration of Stochastic Methods, Machine Learning and Connectome Insights	Alec MacIver et.al.	2411.02644	null
2024-11-04	Training-free Regional Prompting for Diffusion Transformers	Anthony Chen et.al.	2411.02395	link
2024-11-04	Diffusion-based Generative Multicasting with Intent-aware Semantic Decomposition	Xinkai Liu et.al.	2411.02334	null
2024-11-04	LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation	Mufei Li et.al.	2411.02322	link
2024-11-04	Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation	Xianghui Yang et.al.	2411.02293	null
2024-11-04	FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training	Ruihong Yin et.al.	2411.02229	null
2024-11-04	CleAR: Robust Context-Guided Generative Lighting Estimation for Mobile Augmented Reality	Yiqin Zhao et.al.	2411.02179	null
2024-11-04	Model Integrity when Unlearning with T2I Diffusion Models	Andrea Schioppa et.al.	2411.02068	null
2024-11-04	DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability	Bo Gao et.al.	2411.01819	null
2024-11-04	MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence	Fuming You et.al.	2411.01805	null
2024-11-04	A Regressor-Guided Graph Diffusion Model for Predicting Enzyme Mutations to Enhance Turnover Number	Xiaozhu Yu et.al.	2411.01745	link
2024-11-04	xDiT: an Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism	Jiarui Fang et.al.	2411.01738	link
2024-11-04	LaGDif: Latent Graph Diffusion Model for Efficient Protein Inverse Folding with Self-Ensemble	Taoyu Wu et.al.	2411.01737	link
2024-11-03	Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation	Zhenbin Wang et.al.	2411.01647	null
2024-11-03	HC$^3$L-Diff: Hybrid conditional latent diffusion with high frequency enhancement for CBCT-to-CT synthesis	Shi Yin et.al.	2411.01575	null
2024-11-03	Conditional Controllable Image Fusion	Bing Cao et.al.	2411.01573	link
2024-11-03	Statistical guarantees for denoising reflected diffusion models	Asbjørn Holk et.al.	2411.01563	null
2024-11-03	Towards Small Object Editing: A Benchmark Dataset and A Training-Free Approach	Qihe Pan et.al.	2411.01545	link
2024-11-03	Digressions on Irreversibility and Stochastic Systems	Giorgio Picci et.al.	2411.01516	null
2024-11-03	DPCL-Diff: The Temporal Knowledge Graph Reasoning based on Graph Node Diffusion Model with Dual-Domain Periodic Contrastive Learning	Yukun Cao et.al.	2411.01477	null
2024-11-03	Two-Timescale Model Caching and Resource Allocation for Edge-Enabled AI-Generated Content Services	Zhang Liu et.al.	2411.01458	null
2024-10-31	DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion	Weicai Ye et.al.	2410.24203	link
2024-10-31	Redefining in Dictionary: Towards an Enhanced Semantic Understanding of Creative Generation	Fu Feng et.al.	2410.24160	null
2024-10-31	Scaling Concept With Text-Guided Diffusion Models	Chao Huang et.al.	2410.24151	null
2024-10-31	Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian Structure	Xiang Li et.al.	2410.24060	link
2024-10-31	TPC: Test-time Procrustes Calibration for Diffusion-based Human Image Animation	Sunjae Yoon et.al.	2410.24037	null
2024-10-31	DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination	Jia Fu et.al.	2410.24006	link
2024-10-31	Breaking Determinism: Fuzzy Modeling of Sequential Recommendation Using Discrete State Space Diffusion Model	Wenjia Xie et.al.	2410.23994	null
2024-10-31	Stochastic Reconstruction of Gappy Lagrangian Turbulent Signals by Conditional Diffusion Models	Tianyi Li et.al.	2410.23971	link
2024-10-31	Image Synthesis with Class-Aware Semantic Diffusion Models for Surgical Scene Segmentation	Yihang Zhou et.al.	2410.23962	null
2024-10-31	Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model	Hao Zhang et.al.	2410.23905	link
2024-10-31	DiffBatt: A Diffusion Model for Battery Degradation Prediction and Synthesis	Hamidreza Eivazi et.al.	2410.23893	link
2024-10-31	Denoising Diffusion Models for Anomaly Localization in Medical Images	Cosmin I. Bercea et.al.	2410.23834	null
2024-10-31	Disentangling Disentangled Representations: Towards Improved Latent Units via Diffusion Models	Youngjun Jun et.al.	2410.23820	null
2024-10-31	EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like Sketching	Xinwang Chen et.al.	2410.23788	link
2024-10-31	On Learning Multi-Modal Forgery Representation for Diffusion Generated Video Detection	Xiufeng Song et.al.	2410.23623	link
2024-10-31	There and Back Again: On the relation between noises, images, and their inversions in diffusion models	Łukasz Staniszewski et.al.	2410.23530	null
2024-10-30	MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts	Jie Zhu et.al.	2410.23332	null
2024-10-30	ReferEverything: Towards Segmenting Everything We Can Speak of in Videos	Anurag Bagchi et.al.	2410.23287	null
2024-10-30	Provable acceleration for diffusion models under minimal assumptions	Gen Li et.al.	2410.23285	null
2024-10-30	RelationBooth: Towards Relation-Aware Customized Object Generation	Qingyu Shi et.al.	2410.23280	null
2024-10-30	SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation	Yining Hong et.al.	2410.23277	null
2024-10-30	Multi-student Diffusion Distillation for Better One-step Generators	Yanke Song et.al.	2410.23274	null
2024-10-30	CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for Adversarial Defense	Mingkun Zhang et.al.	2410.23091	link
2024-10-30	Controlling Language and Diffusion Models by Transporting Activations	Pau Rodriguez et.al.	2410.23054	link
2024-10-30	Improving Musical Accompaniment Co-creation via Diffusion Transformers	Javier Nistal et.al.	2410.23005	null
2024-10-30	DexGraspNet 2.0: Learning Generative Dexterous Grasping in Large-scale Synthetic Cluttered Scenes	Jialiang Zhang et.al.	2410.23004	null
2024-10-30	LumiSculpt: A Consistency Lighting Control Network for Video Generation	Yuxin Zhang et.al.	2410.22979	null
2024-10-30	Private Synthetic Text Generation with Diffusion Models	Sebastian Ochs et.al.	2410.22971	link
2024-10-31	DiffLight: A Partial Rewards Conditioned Diffusion Model for Traffic Signal Control with Missing Data	Hanyang Chen et.al.	2410.22938	link
2024-10-30	HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models	Shengkai Zhang et.al.	2410.22901	link
2024-10-30	Latent Diffusion, Implicit Amplification: Efficient Continuous-Scale Super-Resolution for Remote Sensing Images	Hanlin Wu et.al.	2410.22830	link
2024-10-30	Diffusion Beats Autoregressive: An Evaluation of Compositional Generation in Text-to-Image Models	Arash Marioriyad et.al.	2410.22775	null
2024-10-30	FuseAnyPart: Diffusion-Driven Facial Parts Swapping via Multiple Reference Images	Zheng Yu et.al.	2410.22771	link
2024-10-31	Consistency Diffusion Bridge Models	Guande He et.al.	2410.22637	null
2024-10-29	Stochastic Trajectories and Spectral Boundary Conditions for Enhanced Diffusion in Immersed Boundary Problems	Rômulo Damasclin Chaves dos Santos et.al.	2410.22579	null
2024-10-29	Unpicking Data at the Seams: VAEs, Disentanglement and Independent Components	Carl Allen et.al.	2410.22559	null
2024-10-31	FairSkin: Fair Diffusion for Skin Disease Image Generation	Ruichen Zhang et.al.	2410.22551	null
2024-10-28	On Inductive Biases That Enable Generalization of Diffusion Transformers	Jie An et.al.	2410.21273	link
2024-10-28	One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation	Zhendong Wang et.al.	2410.21257	null
2024-10-28	On learning higher-order cumulants in diffusion models	Gert Aarts et.al.	2410.21212	null
2024-10-28	Extrapolating Prospective Glaucoma Fundus Images through Diffusion Model in Irregular Longitudinal Sequences	Zhihao Zhao et.al.	2410.21130	null
2024-10-28	Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models	Wenda Li et.al.	2410.21088	link
2024-10-28	Federated Time Series Generation on Feature and Temporally Misaligned Data	Chenrui Fan et.al.	2410.21072	null
2024-10-28	Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework	Vladimir Arkhipkin et.al.	2410.21061	link
2024-10-28	Beyond Autoregression: Fast LLMs via Self-Distillation Through Time	Justin Deschenaux et.al.	2410.21035	link
2024-10-29	EEG-Driven 3D Object Reconstruction with Color Consistency and Diffusion Prior	Xin Xiang et.al.	2410.20981	null
2024-10-28	Attention Overlap Is Responsible for The Entity Missing Problem in Text-to-image Diffusion Models!	Arash Marioriyad et.al.	2410.20972	null
2024-10-28	Diff-Instruct*: Towards Human-Preferred One-step Text-to-image Generative Models	Weijian Luo et.al.	2410.20898	link
2024-10-28	Novel Object Synthesis via Adaptive Text-Image Harmony	Zeren Xiong et.al.	2410.20823	null
2024-10-28	Development of a conditional diffusion model to predict process parameters and microstructures of dendrite crystals of matrix resin based on mechanical properties	Arisa Ikeda et.al.	2410.20822	null
2024-10-28	Reprogramming Pretrained Target-Specific Diffusion Models for Dual-Target Drug Design	Xiangxin Zhou et.al.	2410.20688	link
2024-10-27	TabDiff: a Multi-Modal Diffusion Model for Tabular Data Generation	Juntong Shi et.al.	2410.20626	link
2024-10-27	Generator Matching: Generative modeling with arbitrary Markov processes	Peter Holderrieth et.al.	2410.20587	null
2024-10-27	Hamiltonian Score Matching and Generative Flows	Peter Holderrieth et.al.	2410.20470	null
2024-10-27	Lodge++: High-quality and Long Dance Generation with Vivid Choreography Patterns	Ronghui Li et.al.	2410.20389	null
2024-10-27	Conditional GAN for Enhancing Diffusion Models in Efficient and Authentic Global Gesture Generation from Audios	Yongkang Cheng et.al.	2410.20359	null
2024-10-26	MarDini: Masked Autoregressive Diffusion for Video Generation at Scale	Haozhe Liu et.al.	2410.20280	null
2024-10-24	MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms	Ling-Hao Chen et.al.	2410.18977	null
2024-10-24	3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation	Hansheng Chen et.al.	2410.18974	link
2024-10-24	On the Crucial Role of Initialization for Matrix Factorization	Bingcong Li et.al.	2410.18965	null
2024-10-24	Stable Consistency Tuning: Understanding and Improving Consistency Models	Fu-Yun Wang et.al.	2410.18958	link
2024-10-24	Generation of synthetic financial time series by diffusion models	Tomonori Takahashi et.al.	2410.18897	null
2024-10-24	The Cat and Mouse Game: The Ongoing Arms Race Between Diffusion Models and Detection Methods	Linda Laurier et.al.	2410.18866	null
2024-10-24	Multi-Scale Diffusion: Enhancing Spatial Layout in High-Resolution Panoramic Image Generation	Xiaoyu Zhang et.al.	2410.18830	null
2024-10-24	Fast constrained sampling in pre-trained diffusion models	Alexandros Graikos et.al.	2410.18804	null
2024-10-24	Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances	Shilin Lu et.al.	2410.18775	link
2024-10-25	Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing	Haonan Lin et.al.	2410.18756	null
2024-10-24	Rectified Diffusion Guidance for Conditional Generation	Mengfei Xia et.al.	2410.18737	null
2024-10-24	Retrieval-Augmented Diffusion Models for Time Series Forecasting	Jingwei Liu et.al.	2410.18712	link
2024-10-24	Ali-AUG: Innovative Approaches to Labeled Data Augmentation using One-Step Diffusion Model	Ali Hamza et.al.	2410.18678	null
2024-10-24	DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation	Yuang Ai et.al.	2410.18666	link
2024-10-25	Diffusion Attribution Score: Evaluating Training Data Influence in Diffusion Model	Jinxu Lin et.al.	2410.18639	null
2024-10-24	SMITE: Segment Me In TimE	Amirhossein Alimohammadi et.al.	2410.18538	link
2024-10-24	Beyond Color and Lines: Zero-Shot Style-Specific Image Variations with Coordinated Semantics	Jinghao Hu et.al.	2410.18537	null
2024-10-24	Scaling up Masked Diffusion Models on Text	Shen Nie et.al.	2410.18514	link
2024-10-24	FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling	Zhengqiang Zhang et.al.	2410.18410	link
2024-10-23	DMTG: A Human-Like Mouse Trajectory Generation Bot Based on Entropy-Controlled Diffusion Networks	Jiahua Liu et.al.	2410.18233	null
2024-10-23	DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes	Hengwei Bian et.al.	2410.18084	null
2024-10-23	Prioritized Generative Replay	Renhao Wang et.al.	2410.18082	null
2024-10-23	Optical Generative Models	Shiqi Chen et.al.	2410.17970	null
2024-10-23	A Wavelet Diffusion GAN for Image Super-Resolution	Lorenzo Aloisi et.al.	2410.17966	null
2024-10-23	Addressing Asynchronicity in Clinical Multimodal Fusion via Individualized Chest X-ray Generation	Wenfang Yao et.al.	2410.17918	link
2024-10-23	Scaling Diffusion Language Models via Adaptation from Autoregressive Models	Shansan Gong et.al.	2410.17891	link
2024-10-23	Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech	Danilo de Oliveira et.al.	2410.17834	null
2024-10-23	PGDiffSeg: Prior-Guided Denoising Diffusion Model with Parameter-Shared Attention for Breast Cancer Segmentation	Feiyan Feng et.al.	2410.17812	null
2024-10-23	AdaDiffSR: Adaptive Region-aware Dynamic Acceleration Diffusion Model for Real-World Image Super-Resolution	Yuanting Fan et.al.	2410.17752	null
2024-10-23	VISAGE: Video Synthesis using Action Graphs for Surgery	Yousef Yeganeh et.al.	2410.17751	null
2024-10-23	Deep Generative Models for 3D Medical Image Synthesis	Paul Friedrich et.al.	2410.17664	null
2024-10-23	Towards Effective Data-Free Knowledge Distillation via Diverse Diffusion Augmentation	Muquan Li et.al.	2410.17606	link
2024-10-23	How to Continually Adapt Text-to-Image Diffusion Models for Flexible Customization?	Jiahua Dong et.al.	2410.17594	link
2024-10-23	GDDA: Semantic OOD Detection on Graphs under Covariate Shift via Score-Based Diffusion Models	Zhixia He et.al.	2410.17526	null
2024-10-23	Physics-driven AI for Channel Estimation in Cellular Network	Xiaoqian Qi et.al.	2410.17525	null
2024-10-23	Diffusion Priors for Variational Likelihood Estimation and Image Denoising	Jun Cheng et.al.	2410.17521	link
2024-10-23	Univariate Conditional Variational Autoencoder for Morphogenic Patterns Design in Frontal Polymerization-Based Manufacturing	Qibang Liu et.al.	2410.17518	link
2024-10-22	EEG-DIF: Early Warning of Epileptic Seizures through Generative Diffusion Model-based Multi-channel EEG Signals Forecasting	Zekun Jiang et.al.	2410.17343	link
2024-10-22	Reinforcement learning on structure-conditioned categorical diffusion for protein inverse folding	Yasha Ektefaie et.al.	2410.17173	link
2024-10-22	DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization	Haowei Zhu et.al.	2410.16942	null
2024-10-21	MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors	Honghua Chen et.al.	2410.16272	null
2024-10-21	A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data	Simon Deltadahl et.al.	2410.16177	null
2024-10-22	Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models	Giannis Daras et.al.	2410.16152	null
2024-10-21	SeaDAG: Semi-autoregressive Diffusion for Conditional Directed Acyclic Graph Generation	Xinyi Zhou et.al.	2410.16119	null
2024-10-21	Continuous Speech Synthesis using per-token Latent Diffusion	Arnon Turetzky et.al.	2410.16048	null
2024-10-22	CamI2V: Camera-Controlled Image-to-Video Diffusion Model	Guangcong Zheng et.al.	2410.15957	link
2024-10-21	Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces	Jifeng Hu et.al.	2410.15698	null
2024-10-21	Erasing Undesirable Concepts in Diffusion Models with Adversarial Preservation	Anh Bui et.al.	2410.15618	link
2024-10-20	Data Augmentation via Diffusion Model to Enhance AI Fairness	Christina Hastings Blow et.al.	2410.15470	null
2024-10-20	MedDiff-FM: A Diffusion-based Foundation Model for Versatile Medical Image Applications	Yongrui Yu et.al.	2410.15432	null
2024-10-20	ConSinger: Efficient High-Fidelity Singing Voice Generation with Minimal Steps	Yulin Song et.al.	2410.15342	null
2024-10-20	Diffusion-PINN Sampler	Zhekun Shi et.al.	2410.15336	null
2024-10-20	FoMo: A Foundation Model for Mobile Traffic Forecasting with Diffusion Model	Haoye Chai et.al.	2410.15322	null
2024-10-20	FastSTI: A Fast Conditional Pseudo Numerical Diffusion Model for Spatio-temporal Traffic Data Imputation	Shaokang Cheng et.al.	2410.15248	null
2024-10-19	Retrieval Augmented Diffusion Model for Structure-informed Antibody Design and Optimization	Zichen Wang et.al.	2410.15040	null
2024-10-19	DiffuseST: Unleashing the Capability of the Diffusion Model for Style Transfer	Ying Hu et.al.	2410.15007	link
2024-10-19	Attack as Defense: Run-time Backdoor Implantation for Image Content Protection	Haichuan Zhang et.al.	2410.14966	link
2024-10-19	Straightness of Rectified Flow: A Theoretical Insight into Wasserstein Convergence	Vansh Bansal et.al.	2410.14949	link
2024-10-19	ImmerseDiffusion: A Generative Spatial Audio Latent Diffusion Model	Mojtaba Heydari et.al.	2410.14945	null
2024-10-19	Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step	Mingyuan Zhou et.al.	2410.14919	link
2024-10-17	Diffusing States and Matching Scores: A New Framework for Imitation Learning	Runzhe Wu et.al.	2410.13855	link
2024-10-17	Influence Functions for Scalable Data Attribution in Diffusion Models	Bruno Mlodozeniec et.al.	2410.13850	null
2024-10-17	Deep Generative Models Unveil Patterns in Medical Images Through Vision-Language Conditioning	Xiaodan Xing et.al.	2410.13823	link
2024-10-17	ConsisSR: Delving Deep into Consistency in Diffusion-based Image Super-Resolution	Junhao Gu et.al.	2410.13807	null
2024-10-17	Probing the Latent Hierarchical Structure of Data via Diffusion Models	Antonio Sclocchi et.al.	2410.13770	null
2024-10-17	Theory on Score-Mismatched Diffusion Models and Zero-Shot Conditional Samplers	Yuchen Liang et.al.	2410.13746	null
2024-10-17	Improved Convergence Rate for Diffusion Probabilistic Models	Gen Li et.al.	2410.13738	null
2024-10-18	DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation	Hanbo Cheng et.al.	2410.13726	link
2024-10-18	Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion	Yijun Liang et.al.	2410.13674	link
2024-10-17	Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design	Chenyu Wang et.al.	2410.13643	link
2024-10-17	Preference Aligned Diffusion Planner for Quadrupedal Locomotion Control	Xinyi Yuan et.al.	2410.13586	null
2024-10-17	Can Medical Vision-Language Pre-training Succeed with Purely Synthetic Data?	Che Liu et.al.	2410.13523	null
2024-10-17	Solving Prior Distribution Mismatch in Diffusion Models via Optimal Transport	Zhanpeng Wang et.al.	2410.13431	null
2024-10-17	MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models	Donghao Zhou et.al.	2410.13370	null
2024-10-17	DiffImp: Efficient Diffusion Model for Probabilistic Time Series Imputation with Bidirectional Mamba Backbone	Hongfan Gao et.al.	2410.13338	null
2024-10-17	FDF: Flexible Decoupled Framework for Time Series Forecasting with Conditional Denoising and Polynomial Modeling	Jintao Zhang et.al.	2410.13253	link
2024-10-17	Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration	Yun-Yen Chuang et.al.	2410.13201	link
2024-10-17	TCP-Diffusion: A Multi-modal Diffusion Model for Global Tropical Cyclone Precipitation Forecasting with Change Awareness	Cheng Huang et.al.	2410.13175	link
2024-10-17	Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance	Jiwan Hur et.al.	2410.13136	link
2024-10-17	Boosting Imperceptibility of Stable Diffusion-based Adversarial Examples Generation with Momentum	Nashrah Haque et.al.	2410.13122	link
2024-10-16	Meta-Unlearning on Diffusion Models: Preventing Relearning Unlearned Concepts	Hongcheng Gao et.al.	2410.12777	link
2024-10-16	SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation	Jaehong Yoon et.al.	2410.12761	null
2024-10-16	Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization	Xingqi Wang et.al.	2410.12700	link
2024-10-16	AdaptiveDrag: Semantic-Driven Dragging on Diffusion-Based Image Editing	DuoSheng Chen et.al.	2410.12696	link
2024-10-16	One Step Diffusion via Shortcut Models	Kevin Frans et.al.	2410.12557	link
2024-10-16	Disentangling data distribution for Federated Learning	Xinyuan Zhao et.al.	2410.12530	null
2024-10-16	Shaping a Stabilized Video by Mitigating Unintended Changes for Concept-Augmented Video Editing	Mingce Guo et.al.	2410.12526	null
2024-10-16	Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective	Yongxin Zhu et.al.	2410.12490	link
2024-10-16	DaDiff: Domain-aware Diffusion Model for Nighttime UAV Tracking	Haobo Zuo et.al.	2410.12270	link
2024-10-16	FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio Generation	Huadai Liu et.al.	2410.12266	null
2024-10-16	Preference Optimization with Multi-Sample Comparisons	Chaoqi Wang et.al.	2410.12138	null
2024-10-15	DDIL: Improved Diffusion Distillation With Imitation Learning	Risheek Garrepalli et.al.	2410.11971	null
2024-10-15	CtrlSynth: Controllable Image Text Synthesis for Data-Efficient Multimodal Learning	Qingqing Cao et.al.	2410.11963	null
2024-10-15	High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion	Junhwa Hur et.al.	2410.11838	null
2024-10-15	On the Effectiveness of Dataset Alignment for Fake Image Detection	Anirudh Sundara Rajan et.al.	2410.11835	null
2024-10-15	Bayesian Experimental Design via Contrastive Diffusions	Jacopo Iollo et.al.	2410.11826	link
2024-10-15	Improving Long-Text Alignment for Text-to-Image Diffusion Models	Luping Liu et.al.	2410.11817	link
2024-10-15	SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing	Zhiyuan Zhang et.al.	2410.11815	null
2024-10-16	Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices	Zhiyuan Ma et.al.	2410.11795	null
2024-10-15	Patch-Based Diffusion Models Beat Whole-Image Models for Mismatched Distribution Inverse Problems	Jason Hu et.al.	2410.11730	null
2024-10-14	Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models	Jingzhi Bao et.al.	2410.10821	link
2024-10-14	Depth Any Video with Scalable Synthetic Data	Honghui Yang et.al.	2410.10815	link
2024-10-14	HART: Efficient Visual Generation with Hybrid Autoregressive Transformer	Haotian Tang et.al.	2410.10812	link
2024-10-14	TrajDiffuse: A Conditional Diffusion Model for Environment-Aware Trajectory Prediction	Qingze et.al.	2410.10804	link
2024-10-14	Boosting Camera Motion Control for Video Diffusion Transformers	Soon Yau Cheong et.al.	2410.10802	null
2024-10-14	Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations	Litu Rout et.al.	2410.10792	null
2024-10-14	ControlMM: Controllable Masked Motion Generation	Ekkasit Pinyoanuntapong et.al.	2410.10780	null
2024-10-14	Adaptive Diffusion Terrain Generator for Autonomous Uneven Terrain Navigation	Youwei Yu et.al.	2410.10766	link
2024-10-14	DragEntity: Trajectory Guided Video Generation using Entity and Positional Relationships	Zhang Wan et.al.	2410.10751	null
2024-10-14	FlexGen: Flexible Multi-View Generation from Text and Image Inputs	Xinli Xu et.al.	2410.10745	null
2024-10-14	Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models	Junyu Chen et.al.	2410.10733	link
2024-10-14	TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model	Jiazhi Guan et.al.	2410.10696	null
2024-10-14	Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation	Peiwen Sun et.al.	2410.10676	null
2024-10-14	Generating Model Parameters for Controlling: Parameter Diffusion for Controllable Multi-Task Recommendation	Chenglei Shen et.al.	2410.10639	null
2024-10-15	SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers	Enze Xie et.al.	2410.10629	null
2024-10-14	UniGEM: A Unified Approach to Generation and Property Prediction for Molecules	Shikun Feng et.al.	2410.10516	null
2024-10-14	Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing	Kejie Wang et.al.	2410.10496	link
2024-10-14	An efficient numerical method for American options and their Greeks under the two-asset Kou jump-diffusion model	Karel J. in 't Hout et.al.	2410.10444	null
2024-10-14	Towards Reliable Verification of Unauthorized Data Usage in Personalized Text-to-Image Diffusion Models	Boheng Li et.al.	2410.10437	link
2024-10-14	DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model	Songen Gu et.al.	2410.10429	null
2024-10-10	DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models	Xiaoxiao He et.al.	2410.08207	null
2024-10-10	HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation	Shanyan Guan et.al.	2410.08192	null
2024-10-10	DifFRelight: Diffusion-Based Facial Performance Relighting	Mingming He et.al.	2410.08188	null
2024-10-10	ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion	Zitian Zhang et.al.	2410.08168	link
2024-10-10	DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation	Jiatao Gu et.al.	2410.08159	null
2024-10-10	Progressive Autoregressive Video Diffusion Models	Desai Xie et.al.	2410.08151	link
2024-10-10	Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction	Jarrid Rector-Brooks et.al.	2410.08134	null
2024-10-10	Unstable Unlearning: The Hidden Risk of Concept Resurgence in Diffusion Models	Vinith M. Suriyakumar et.al.	2410.08074	null
2024-10-10	LADIMO: Face Morph Generation through Biometric Template Inversion with Latent Diffusion	Marcel Grimmer et.al.	2410.07988	link
2024-10-10	AI Surrogate Model for Distributed Computing Workloads	David K. Park et.al.	2410.07940	null
2024-10-10	Generated Bias: Auditing Internal Bias Dynamics of Text-To-Image Generative Models	Abhishek Mandal et.al.	2410.07884	null
2024-10-10	FDDM: Frequency-Decomposed Diffusion Model for Rectum Cancer Dose Prediction in Radiotherapy	Xin Liao et.al.	2410.07876	null
2024-10-10	RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation	Songming Liu et.al.	2410.07864	link
2024-10-10	MinorityPrompt: Text to Minority Image Generation via Prompt Optimization	Soobin Um et.al.	2410.07838	link
2024-10-10	Simulating images of radio galaxies with diffusion models	Tobias Vičánek Martínez et.al.	2410.07794	link
2024-10-10	$\textit{Jump Your Steps}$: Optimizing Sampling Schedule of Discrete Diffusion Models	Yong-Hyun Park et.al.	2410.07761	null
2024-10-10	Synthesizing Multi-Class Surgical Datasets with Anatomy-Aware Diffusion Models	Danush Kumar Venkatesh et.al.	2410.07753	link
2024-10-10	Flow control-oriented coherent mode prediction via Grassmann-kNN manifold learning	Hongfu Zhang et.al.	2410.07683	null
2024-10-10	Relational Diffusion Distillation for Efficient Image Generation	Weilun Feng et.al.	2410.07679	link
2024-10-10	MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion	Onkar Susladkar et.al.	2410.07659	link
2024-10-09	IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation	Xinchen Zhang et.al.	2410.07171	link
2024-10-09	AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation	Yukang Cao et.al.	2410.07164	null
2024-10-09	InstructG2I: Synthesizing Images from Multimodal Attributed Graphs	Bowen Jin et.al.	2410.07157	link
2024-10-09	Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis	Bohan Zeng et.al.	2410.07155	link
2024-10-09	Diffusion Density Estimators	Akhil Premkumar et.al.	2410.06986	null
2024-10-09	Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control	Shimon Vainer et.al.	2410.06985	null
2024-10-09	Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think	Sihyun Yu et.al.	2410.06940	link
2024-10-09	Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis	Ahmed Abdullah et.al.	2410.06841	null
2024-10-09	Diffuse or Confuse: A Diffusion Deepfake Speech Dataset	Anton Firc et.al.	2410.06796	link
2024-10-09	Diff-FMT: Diffusion Models for Fluorescence Molecular Tomography	Qianqian Xue et.al.	2410.06757	null
2024-10-10	Suppress Content Shift: Better Diffusion Features via Off-the-Shelf Generation Techniques	Benyuan Meng et.al.	2410.06719	link
2024-10-09	Decouple-Then-Merge: Towards Better Training for Diffusion Models	Qianli Ma et.al.	2410.06664	null
2024-10-09	Chemistry-Inspired Diffusion with Non-Differentiable Guidance	Yuchen Shen et.al.	2410.06502	null
2024-10-09	HFH-Font: Few-shot Chinese Font Synthesis with Higher Quality, Faster Speed, and Higher Resolution	Hua Li et.al.	2410.06488	link
2024-10-08	Generative Artificial Intelligence (GAI) for Mobile Communications: A Diffusion Model Perspective	Xiaoxia Xu et.al.	2410.06389	link
2024-10-08	SymDiff: Equivariant Diffusion via Stochastic Symmetrisation	Leo Zhang et.al.	2410.06262	null
2024-10-08	Story-Adapter: A Training-free Iterative Framework for Long Story Visualization	Jiawei Mao et.al.	2410.06244	null
2024-10-08	Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach	Sha Guo et.al.	2410.06149	null
2024-10-08	AP-LDM: Attentive and Progressive Latent Diffusion Model for Training-Free High-Resolution Image Generation	Boyuan Cao et.al.	2410.06055	link
2024-10-08	Sparse Repellency for Shielded Generation in Text-to-image Diffusion Models	Michael Kirchhof et.al.	2410.06025	null
2024-10-07	DART: A Diffusion-Based Autoregressive Motion Model for Real-Time Text-Driven Motion Control	Kaifeng Zhao et.al.	2410.05260	null
2024-10-07	GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting	Yukang Cao et.al.	2410.05259	null
2024-10-07	SePPO: Semi-Policy Preference Optimization for Diffusion Alignment	Daoan Zhang et.al.	2410.05255	link
2024-10-07	DiffuseReg: Denoising Diffusion Model for Obtaining Deformation Fields in Unsupervised Deformable Image Registration	Yongtai Zhuo et.al.	2410.05234	link
2024-10-07	Presto! Distilling Steps and Layers for Accelerating Music Generation	Zachary Novack et.al.	2410.05167	null
2024-10-07	A Simulation-Free Deep Learning Approach to Stochastic Optimal Control	Mengjian Hua et.al.	2410.05163	null
2024-10-07	Leveraging Multimodal Diffusion Models to Accelerate Imaging with Side Information	Timofey Efimov et.al.	2410.05143	null
2024-10-07	Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning	Ayano Hiranaka et.al.	2410.05116	null
2024-10-07	DreamSat: Towards a General 3D Model for Novel View Synthesis of Space Objects	Nidhi Mathihalli et.al.	2410.05097	link
2024-10-07	A nodally bound-preserving discontinuous Galerkin method for the drift-diffusion equation	Gabriel R. Barrenechea et.al.	2410.05040	null
2024-10-07	Revealing Directions for Text-guided 3D Face Editing	Zhuo Chen et.al.	2410.04965	null
2024-10-07	Low-Rank Continual Personalization of Diffusion Models	Łukasz Staniszewski et.al.	2410.04891	link
2024-10-07	Patch is Enough: Naturalistic Adversarial Patch against Vision-Language Pre-training Models	Dehong Kong et.al.	2410.04884	null
2024-10-07	Real-time cardiac cine MRI – A comparison of a diffusion probabilistic model with alternative state-of-the-art image reconstruction techniques for undersampled spiral acquisitions	Oliver Schad et.al.	2410.04843	link
2024-10-07	Learning Efficient and Effective Trajectories for Differential Equation-based Image Restoration	Zhiyu Zhu et.al.	2410.04811	link
2024-10-07	FedBiP: Heterogeneous One-Shot Federated Learning with Personalized Latent Diffusion Models	Haokun Chen et.al.	2410.04810	null
2024-10-07	Data-driven Diffusion Models for Enhancing Safety in Autonomous Vehicle Traffic Simulations	Jinxiong Lu et.al.	2410.04809	null
2024-10-07	Stochastic Runge-Kutta Methods: Provable Acceleration of Diffusion Models	Yuchen Wu et.al.	2410.04760	null
2024-10-07	Numerical analysis of American option pricing in a two-asset jump-diffusion model	Hao Zhou et.al.	2410.04745	null
2024-10-07	Diffusion Models in 3D Vision: A Survey	Zhen Wang et.al.	2410.04738	null
2024-10-03	Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models	Zhengfeng Lai et.al.	2410.02740	null
2024-10-03	SteerDiff: Steering towards Safe Text-to-Image Diffusion Models	Hongxiang Zhang et.al.	2410.02710	null
2024-10-03	ControlAR: Controllable Image Generation with Autoregressive Models	Zongming Li et.al.	2410.02705	link
2024-10-03	GUD: Generation with Unified Diffusion	Mathis Gerdes et.al.	2410.02667	null
2024-10-03	Efficient calibration of the shifted square-root diffusion model to credit default swap spreads using asymptotic approximations	Ankush Agarwal et.al.	2410.02645	null
2024-10-04	Diffusion Models are Evolutionary Algorithms	Yanbo Zhang et.al.	2410.02543	link
2024-10-03	Lightweight Diffusion Models for Resource-Constrained Semantic Communication	Giovanni Pignata et.al.	2410.02491	link
2024-10-03	Towards a Theoretical Understanding of Memorization in Diffusion Models	Yunhao Chen et.al.	2410.02467	null
2024-10-03	Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models	Seyedmorteza Sadat et.al.	2410.02416	null
2024-10-03	Diffusion Meets Options: Hierarchical Generative Skill Composition for Temporally-Extended Tasks	Zeyu Feng et.al.	2410.02389	null
2024-10-04	Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation	Muzhi Zhu et.al.	2410.02369	link
2024-10-03	Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis	Zikun Zhang et.al.	2410.02321	null
2024-10-03	Channel-aware Contrastive Conditional Diffusion for Multivariate Probabilistic Time Series Forecasting	Siyang Li et.al.	2410.02168	link
2024-10-03	SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model	Xinlei Niu et.al.	2410.02144	null
2024-10-03	MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation	Trung X. Pham et.al.	2410.02130	null
2024-10-03	SC-CDM: Enhancing Quality of Image Semantic Communication with a Compact Diffusion Model	Kexin Zhang et.al.	2410.02121	null
2024-10-02	Stochastic Deep Restoration Priors for Imaging Inverse Problems	Yuyang Hu et.al.	2410.02057	null
2024-10-02	Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data	Sreyan Ghosh et.al.	2410.02056	link
2024-10-02	Using Style Ambiguity Loss to Improve Aesthetics of Diffusion Models	James Baker et.al.	2410.02055	link
2024-10-02	Discrete Copula Diffusion	Anji Liu et.al.	2410.01949	null
2024-10-02	FabricDiffusion: High-Fidelity Texture Transfer for 3D Garments Generation from In-The-Wild Clothing Images	Cheng Zhang et.al.	2410.01801	null
2024-10-02	Dynamical-generative downscaling of climate model ensembles	Ignacio Lopez-Gomez et.al.	2410.01776	null
2024-10-02	ImageFolder: Autoregressive Image Generation with Folded Tokens	Xiang Li et.al.	2410.01756	link
2024-10-02	VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models	Kailai Feng et.al.	2410.01738	link
2024-10-02	HarmoniCa: Harmonizing Training and Inference for Better Feature Cache in Diffusion Transformer Acceleration	Yushi Huang et.al.	2410.01723	link
2024-10-02	KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models	Pouyan Navard et.al.	2410.01595	link
2024-10-02	MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video Generation	Mingzhen Sun et.al.	2410.01594	link
2024-10-02	HRTF Estimation using a Score-based Prior	Etienne Thuillier et.al.	2410.01562	null
2024-10-02	Edge-preserving noise for diffusion models	Jente Vandersanden et.al.	2410.01540	null
2024-10-02	Information-Theoretical Principled Trade-off between Jailbreakability and Stealthiness on Vision Language Models	Ching-Chia Kao et.al.	2410.01438	null
2024-10-02	Harnessing the Latent Diffusion Model for Training-Free Image Style Transfer	Kento Masui et.al.	2410.01366	null
2024-10-02	Aggregation of Multi Diffusion Models for Enhancing Learned Representations	Conghan Yue et.al.	2410.01262	link
2024-10-02	Generative Diffusion-based Contract Design for Efficient AI Twins Migration in Vehicular Embodied AI Networks	Yue Zhong et.al.	2410.01176	null
2024-10-02	Text2PDE: Latent Diffusion Models for Accessible Physics Simulation	Anthony Zhou et.al.	2410.01153	link
2024-10-02	Flex3D: Feed-Forward 3D Generation With Flexible Reconstruction Model And Input View Curation	Junlin Han et.al.	2410.00890	null
2024-10-01	Diffusion-Informed Probabilistic Contact Search for Multi-Finger Manipulation	Abhinav Kumar et.al.	2410.00841	null
2024-10-01	Absorbing State Phase Transitions and Stability of Long-Range Coherence in Dissipative Quantum State Preparation	Matthew Wampler et.al.	2410.00819	null
2024-10-01	Modeling Neural Switching via Drift-Diffusion Models	Nicholas Marco et.al.	2410.00781	link
2024-10-01	Improved Generation of Synthetic Imaging Data Using Feature-Aligned Diffusion	Lakshmi Nair et.al.	2410.00731	link
2024-10-01	NECOMIMI: Neural-Cognitive Multimodal EEG-informed Image Generation with Diffusion Models	Chi-Sheng Chen et.al.	2410.00712	null
2024-09-30	COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models	Divyanshu Daiya et.al.	2409.20502	null
2024-09-30	FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing	Lingling Cai et.al.	2409.20500	null
2024-09-30	Ensemble Kalman Diffusion Guidance: A Derivative-free Method for Inverse Problems	Hongkai Zheng et.al.	2409.20175	null
2024-09-30	Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model	Fulong Ma et.al.	2409.20164	null
2024-09-30	Conditional Diffusion Models are Minimax-Optimal and Manifold-Adaptive for Conditional Distribution Estimation	Rong Tang et.al.	2409.20124	null
2024-09-30	Reaction-diffusion model for a population structured in phenotype and space I – Criterion for persistence	Nathanaël Boutillon et.al.	2409.20118	null
2024-09-30	RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion Models	Jangyeong Kim et.al.	2409.19989	null
2024-09-30	Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function	Chenyi Zhuang et.al.	2409.19967	link
2024-09-30	Image Copy Detection for Diffusion Models	Wenhao Wang et.al.	2409.19952	null
2024-09-30	Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner	Chenyou Fan et.al.	2409.19949	null
2024-09-30	Replace Anyone in Videos	Xiang Wang et.al.	2409.19911	link
2024-09-30	GameLabel-10K: Collecting Image Preference Data Through Mobile Game Crowdsourcing	Jonathan Zhou et.al.	2409.19830	null
2024-09-29	Text-driven Human Motion Generation with Motion Masked Diffusion Model	Xingyu Chen et.al.	2409.19686	null
2024-09-29	Simple and Fast Distillation of Diffusion Models	Zhenyu Zhou et.al.	2409.19681	link
2024-09-29	SemiDDM-Weather: A Semi-supervised Learning Framework for All-in-one Adverse Weather Removal	Fang Long et.al.	2409.19679	link
2024-09-29	Storynizor: Consistent Story Generation via Inter-Frame Synchronized and Shuffled ID Injection	Yuhang Ma et.al.	2409.19624	null
2024-09-29	MCDDPM: Multichannel Conditional Denoising Diffusion Model for Unsupervised Anomaly Detection in Brain MRI	Vivek Kumar Trivedi et.al.	2409.19623	link
2024-09-29	Causal Deciphering and Inpainting in Spatio-Temporal Dynamics via Diffusion Model	Yifan Duan et.al.	2409.19608	null
2024-09-29	DiffCP: Ultra-Low Bit Collaborative Perception via Diffusion Model	Ruiqing Mao et.al.	2409.19592	null
2024-09-29	Effective Diffusion Transformer Architecture for Image Super-Resolution	Kun Cheng et.al.	2409.19589	link
2024-09-26	FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner	Wenliang Zhao et.al.	2409.18128	link
2024-09-26	Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction	Jing He et.al.	2409.18124	null
2024-09-26	EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation	Jiaxiang Tang et.al.	2409.18114	null
2024-09-26	StackGen: Generating Stable Structures from Silhouettes via Diffusion	Luzhe Sun et.al.	2409.18098	null
2024-09-26	DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models	Helin Cao et.al.	2409.18092	null
2024-09-26	Stable Video Portraits	Mirela Ostrek et.al.	2409.18083	null
2024-09-26	PhoCoLens: Photorealistic and Consistent Reconstruction in Lensless Imaging	Xin Cai et.al.	2409.17996	null
2024-09-26	Joint Localization and Planning using Diffusion	L. Lao Beyer et.al.	2409.17995	null
2024-09-26	CNCA: Toward Customizable and Natural Generation of Adversarial Camouflage for Vehicle Detectors	Linye Lyu et.al.	2409.17963	link
2024-09-26	Relativistic diffusion model for hadron production in p-Pb collisions at the LHC	Philipp Schulz et.al.	2409.17960	null
2024-09-26	Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative Criterion	Hengrui Gu et.al.	2409.17928	link
2024-09-26	Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation	Qihan Huang et.al.	2409.17920	link
2024-09-26	Continual learning with task specialist	Indu Solomon et.al.	2409.17806	null
2024-09-26	Taming Diffusion Prior for Image Super-Resolution with Domain Shift SDEs	Qinpeng Cui et.al.	2409.17778	link
2024-09-26	Text Image Generation for Low-Resource Languages with Dual Translation Learning	Chihiro Noguchi et.al.	2409.17747	null
2024-09-26	AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status	Jinghao Zhang et.al.	2409.17740	null
2024-09-26	Dark Miner: Defend against unsafe generation for text-to-image diffusion models	Zheling Meng et.al.	2409.17682	null
2024-09-26	Self-Supervised Learning of Deviation in Latent Representation for Co-speech Gesture Video Generation	Huan Yang et.al.	2409.17674	null
2024-09-26	ID$^3$: Identity-Preserving-yet-Diversified Diffusion Models for Synthetic Face Recognition	Shen Li et.al.	2409.17576	null
2024-09-26	Flexiffusion: Segment-wise Neural Architecture Search for Flexible Denoising Schedule	Hongtao Huang et.al.	2409.17566	null
2024-09-25	DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion	Yukun Huang et.al.	2409.17145	link
2024-09-25	Language-oriented Semantic Communication for Image Transmission with Fine-Tuned Diffusion Model	Xinfeng Wei et.al.	2409.17104	null
2024-09-25	Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors	Aiping Zhang et.al.	2409.17058	link
2024-09-25	ControlCity: A Multimodal Diffusion Model Based Approach for Accurate Geospatial Data Generation and Urban Morphology Analysis	Fangshuo Zhou et.al.	2409.17049	link
2024-09-25	Dynamic Obstacle Avoidance through Uncertainty-Based Adaptive Planning with Diffusion	Vineet Punyamoorty et.al.	2409.16950	null
2024-09-25	DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling	Kyuheon Jung et.al.	2409.16949	link
2024-09-25	Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model	Hongliang Zhong et.al.	2409.16938	link
2024-09-25	A Versatile and Differentiable Hand-Object Interaction Representation	Théo Morales et.al.	2409.16855	null
2024-09-25	Analytical assessment of workers’ safety concerning direct and indirect ways of getting infected by dangerous pathogen	Krzysztof Domino et.al.	2409.16809	null
2024-09-25	Layout-Corrector: Alleviating Layout Sticking Phenomenon in Discrete Diffusion Model	Shoma Iwai et.al.	2409.16689	null
2024-09-25	CasFT: Future Trend Modeling for Information Popularity Prediction with Dynamic Cues-Driven Diffusion Models	Xin Jing et.al.	2409.16619	null
2024-09-25	Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models	Deepak Sridhar et.al.	2409.16535	link
2024-09-24	Diffusion Models to Enhance the Resolution of Microscopy Images: A Tutorial	Harshith Bachimanchi et.al.	2409.16488	null
2024-09-24	Generative Factor Chaining: Coordinated Manipulation with Diffusion-based Factor Graph	Utkarsh A. Mishra et.al.	2409.16275	null
2024-09-24	MaskBit: Embedding-free Image Generation via Bit Tokens	Mark Weber et.al.	2409.16211	link
2024-09-24	MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling	Yifang Men et.al.	2409.16160	null
2024-09-24	Spreading dynamics of a Fisher-KPP nonlocal diffusion model with a free boundary	Lei Li et.al.	2409.16101	null
2024-09-24	PRESTO: Fast motion planning using diffusion models based on key-configuration environment representation	Mingyo Seo et.al.	2409.16012	null
2024-09-24	Unleashing the Potential of Synthetic Images: A Study on Histopathology Image Classification	Leire Benito-Del-Valle et.al.	2409.16002	link
2024-09-24	ASD-Diffusion: Anomalous Sound Detection with Diffusion Models	Fengrun Zhang et.al.	2409.15957	null
2024-09-18	Massively Multi-Person 3D Human Motion Forecasting with Scene Context	Felix B Mueller et.al.	2409.12189	link
2024-09-18	MoRAG – Multi-Fusion Retrieval Augmented Generation for Human Motion	Kalakonda Sai Shashank et.al.	2409.12140	link
2024-09-18	Brain-Streams: fMRI-to-Image Reconstruction with Multi-modal Guidance	Jaehoon Joo et.al.	2409.12099	null
2024-09-18	Denoising diffusion models for high-resolution microscopy image restoration	Pamela Osuna-Vargas et.al.	2409.12078	null
2024-09-18	LEMON: Localized Editing with Mesh Optimization and Neural Shaders	Furkan Mert Algan et.al.	2409.12024	null
2024-09-18	Generation of Complex 3D Human Motion by Temporal and Spatial Composition of Diffusion Models	Lorenzo Mandelli et.al.	2409.11920	null
2024-09-18	DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech	Xin Qi et.al.	2409.11835	null
2024-09-18	RaggeDi: Diffusion-based State Estimation of Disordered Rags, Sheets, Towels and Blankets	Jikai Ye et.al.	2409.11831	null
2024-09-18	InverseMeetInsert: Robust Real Image Editing via Geometric Accumulation Inversion in Guided Diffusion Models	Yan Zheng et.al.	2409.11734	null
2024-09-18	GUNet: A Graph Convolutional Network United Diffusion Model for Stable and Diversity Pose Generation	Shuowen Liang et.al.	2409.11689	link
2024-09-18	Recurrent Interpolants for Probabilistic Time Series Prediction	Yu Chen et.al.	2409.11684	null
2024-09-18	SRIF: Semantic Shape Registration Empowered by Diffusion-based Image Morphing and Flow Estimation	Mingze Sun et.al.	2409.11682	link
2024-09-18	PainDiffusion: Can robot express pain?	Quang Tien Dam et.al.	2409.11635	null
2024-09-17	Context-Generative Default Policy for Bounded Rational Agent	Durgakant Pushp et.al.	2409.11604	null
2024-09-17	DiffESM: Conditional Emulation of Temperature and Precipitation in Earth System Models with 3D Diffusion Models	Seth Bassetti et.al.	2409.11601	null
2024-09-17	Ultrasound Image Enhancement with the Variance of Diffusion Models	Yuxin Zhang et.al.	2409.11380	link
2024-09-17	OSV: One Step is Enough for High-Quality Image to Video Generation	Xiaofeng Mao et.al.	2409.11367	null
2024-09-17	Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think	Gonzalo Martin Garcia et.al.	2409.11355	link
2024-09-17	OmniGen: Unified Image Generation	Shitao Xiao et.al.	2409.11340	link
2024-09-17	fMRI-3D: A Comprehensive Dataset for Enhancing fMRI-based 3D Reconstruction	Jianxiong Gao et.al.	2409.11315	null
2024-09-16	Incorporating Classifier-Free Guidance in Diffusion Model-Based Recommendation	Noah Buchanan et.al.	2409.10494	null
2024-09-16	SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing	Qi Qian et.al.	2409.10476	null
2024-09-16	MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion	Lehong Wu et.al.	2409.10473	null
2024-09-16	Mamba-ST: State Space Model for Efficient Style Transfer	Filippo Botti et.al.	2409.10385	link
2024-09-16	Taming Diffusion Models for Image Restoration: A Review	Ziwei Luo et.al.	2409.10353	null
2024-09-16	Fairness, not Emotion, Drives Socioeconomic Decision Making	Rudra Mukhopadhyay et.al.	2409.10322	null
2024-09-16	DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis	Fa-Ting Hong et.al.	2409.10281	null
2024-09-16	RealDiff: Real-world 3D Shape Completion using Self-Supervised Diffusion Models	Başak Melis Öcal et.al.	2409.10180	null
2024-09-16	PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion	Peng Li et.al.	2409.10141	null
2024-09-16	DDoS: Diffusion Distribution Similarity for Out-of-Distribution Detection	Kun Fang et.al.	2409.10094	null
2024-09-16	MotionCom: Automatic and Motion-Aware Image Composition with LLM and Video Diffusion Prior	Weijing Tao et.al.	2409.10090	link
2024-09-16	Cross-modality image synthesis from TOF-MRA to CTA using diffusion-based models	Alexander Koch et.al.	2409.10089	null
2024-09-16	StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion	Yinghao Aaron Li et.al.	2409.10058	null
2024-09-16	AttnMod: Attention-Based New Art Styles	Shih-Chieh Su et.al.	2409.10028	null
2024-09-15	GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion	Vitor Guizilini et.al.	2409.09896	null
2024-09-15	Latent Diffusion Models for Controllable RNA Sequence Generation	Kaixuan Huang et.al.	2409.09828	null
2024-09-15	E-Commerce Inpainting with Mask Guidance in Controlnet for Reducing Overcompletion	Guandong Li et.al.	2409.09681	null
2024-09-15	EditBoard: Towards A Comprehensive Evaluation Benchmark for Text-based Video Editing Models	Yupeng Chen et.al.	2409.09668	link
2024-09-15	Conditional sampling within generative diffusion models	Zheng Zhao et.al.	2409.09650	link
2024-09-15	Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement	Yudong Yang et.al.	2409.09642	null
2024-09-12	DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors	Thomas Hanwen Zhu et.al.	2409.08278	null
2024-09-12	DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer	Runjia Li et.al.	2409.08271	null
2024-09-12	Touch2Touch: Cross-Modal Tactile Generation for Object Manipulation	Samanta Rodriguez et.al.	2409.08269	null
2024-09-12	Improving Text-guided Object Inpainting with Semantic Pre-inpainting	Yifu Chen et.al.	2409.08260	link
2024-09-12	Improving Virtual Try-On with Garment-focused Diffusion Models	Siqi Wan et.al.	2409.08258	link
2024-09-12	LoRID: Low-Rank Iterative Diffusion for Adversarial Purification	Geigh Zollicoffer et.al.	2409.08255	null
2024-09-12	Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding	Hongyu Li et.al.	2409.08251	null
2024-09-12	IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation	Yinwei Wu et.al.	2409.08240	null
2024-09-12	LT3SD: Latent Trees for 3D Scene Diffusion	Quan Meng et.al.	2409.08215	null
2024-09-12	VI3DRM: Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis	Hao Chen et.al.	2409.08207	null
2024-09-12	MagicStyle: Portrait Stylization Based on Reference Image	Zhaoli Deng et.al.	2409.08156	null
2024-09-12	EZIGen: Enhancing zero-shot subject-driven image generation with precise subject encoding and decoupled guidance	Zicheng Duan et.al.	2409.08091	link
2024-09-12	Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation	Junsung Lee et.al.	2409.08077	null
2024-09-12	AI-accelerated discovery of high critical temperature superconductors	Xiao-Qi Han et.al.	2409.08065	link
2024-09-12	Scribble-Guided Diffusion for Training-free Text-to-Image Generation	Seonho Lee et.al.	2409.08026	link
2024-09-13	Estimating Atmospheric Variables from Digital Typhoon Satellite Images via Conditional Denoising Diffusion Models	Zhangyue Ling et.al.	2409.07961	link
2024-09-12	Detecting and Defending Against Adversarial Attacks on Automatic Speech Recognition via Diffusion Models	Nikolai L. Kühne et.al.	2409.07936	link
2024-09-12	UGAD: Universal Generative AI Detector utilizing Frequency Fingerprints	Inzamamul Alam et.al.	2409.07913	null
2024-09-12	XMOL: Explainable Multi-property Optimization of Molecules	Aye Phyu Phyu Aung et.al.	2409.07786	null
2024-09-12	DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing	Zhenyuan Dong et.al.	2409.07756	link
2024-09-11	DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation	Haibo Yang et.al.	2409.07454	null
2024-09-11	Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models	Haibo Yang et.al.	2409.07452	link
2024-09-11	FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process	Yang Luo et.al.	2409.07451	null
2024-09-11	Efficient One-Step Diffusion Refinement for Snapshot Compressive Imaging	Yunzhen Wang et.al.	2409.07417	null
2024-09-11	Training-Free Guidance for Discrete Diffusion Models for Molecular Generation	Thomas J. Kerby et.al.	2409.07359	null
2024-09-11	Learning Robotic Manipulation Policies from Point Clouds with Conditional Flow Matching	Eugenio Chisari et.al.	2409.07343	null
2024-09-11	Efficient and Unbiased Sampling of Boltzmann Distributions via Consistency Models	Fengzhe Zhang et.al.	2409.07323	null
2024-09-11	Exploring User-level Gradient Inversion with a Diffusion Prior	Zhuohang Li et.al.	2409.07291	null
2024-09-11	CCFExp: Facial Image Synthesis with Cycle Cross-Fusion Diffusion Model for Facial Paralysis Individuals	Weixiang Gao et.al.	2409.07271	link
2024-09-11	Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models	Sanoojan Baliah et.al.	2409.07269	link
2024-09-11	EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion	Jian Zhang et.al.	2409.07255	link
2024-09-12	Alignment of Diffusion Models: Fundamentals, Challenges, and Future	Buhua Liu et.al.	2409.07253	link
2024-09-11	Diff-VPS: Video Polyp Segmentation via a Multi-task Diffusion Network with Adversarial Temporal Reasoning	Yingling Lu et.al.	2409.07238	link
2024-09-11	Phy124: Fast Physics-Driven 4D Content Generation from a Single Image	Jiajing Lin et.al.	2409.07179	null
2024-09-11	Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models	Jiahang Cao et.al.	2409.07163	null
2024-09-11	MVLLaVA: An Intelligent Agent for Unified and Flexible Novel View Synthesis	Hanyu Jiang et.al.	2409.07129	null
2024-09-11	Bio-Eng-LMM AI Assist chatbot: A Comprehensive Tool for Research and Education	Ali Forootani et.al.	2409.07110	link
2024-09-11	From optimal score matching to optimal sampling	Zehao Dou et.al.	2409.07032	null
2024-09-11	CPSample: Classifier Protected Sampling for Guarding Training Data During Diffusion	Joshua Kazdan et.al.	2409.07025	null
2024-09-11	Towards Predicting Temporal Changes in a Patient’s Chest X-ray Images based on Electronic Health Records	Daeun Kyung et.al.	2409.07012	link
2024-09-05	Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding	Yunze Man et.al.	2409.03757	link
2024-09-05	ArtiFade: Learning to Generate High-quality Subject from Blemished Images	Shuya Yang et.al.	2409.03745	null
2024-09-05	RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images	Benzhi Wang et.al.	2409.03644	link
2024-09-05	DiffEVC: Any-to-Any Emotion Voice Conversion with Expressive Guidance	Hsing-Hang Chou et.al.	2409.03636	null
2024-09-05	TCDiff: Triple Condition Diffusion Model with 3D Constraints for Stylizing Synthetic Faces	Bernardo Biesseck et.al.	2409.03600	link
2024-09-05	DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture	Qianlong Xiang et.al.	2409.03550	link
2024-09-05	Blended Latent Diffusion under Attention Control for Real-World Video Editing	Deyin Liu et.al.	2409.03514	null
2024-09-05	Data-free Distillation with Degradation-prompt Diffusion for Multi-weather Image Restoration	Pei Wang et.al.	2409.03455	null
2024-09-05	Enhancing User-Centric Privacy Protection: An Interactive Framework through Diffusion Models and Machine Unlearning	Huaxi Huang et.al.	2409.03326	null
2024-09-05	SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model	Weipeng Tan et.al.	2409.03270	null
2024-09-05	RoomDiffusion: A Specialized Diffusion Model in the Interior Design Industry	Zhaowei Wang et.al.	2409.03198	null
2024-09-04	Spatial Diffusion for Cell Layout Generation	Chen Li et.al.	2409.03106	link
2024-09-04	How DREAMS are made: Emulating Satellite Galaxy and Subhalo Populations with Diffusion Models and Point Clouds	Tri Nguyen et.al.	2409.02980	link
2024-09-06	HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts	Xinyu Liu et.al.	2409.02919	link
2024-09-04	Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling	Kaiwen Zheng et.al.	2409.02908	null
2024-09-04	Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models	Zhibin Liu et.al.	2409.02851	link
2024-09-04	Multi-Track MusicLDM: Towards Versatile Music Generation with Latent Diffusion Model	Tornike Karchkhadze et.al.	2409.02845	null
2024-09-04	Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects	Kyungmin Jo et.al.	2409.02653	null
2024-09-04	MADiff: Motion-Aware Mamba Diffusion Models for Hand Trajectory Prediction on Egocentric Videos	Junyi Ma et.al.	2409.02638	null
2024-09-05	Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency	Jianwen Jiang et.al.	2409.02634	null
2024-09-04	Rate-Adaptive Generative Semantic Communication Using Conditional Diffusion Models	Pujing Yang et.al.	2409.02597	null
2024-09-04	Solving Video Inverse Problems Using Image Diffusion Models	Taesung Kwon et.al.	2409.02574	null
2024-09-04	StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models	Wen Li et.al.	2409.02543	link
2024-09-04	Sample what you cant compress	Vighnesh Birodkar et.al.	2409.02529	null
2024-09-04	Continual Diffuser (CoD): Mastering Continual Offline Reinforcement Learning with Experience Rehearsal	Jifeng Hu et.al.	2409.02512	link
2024-09-04	Training-free Color-Style Disentanglement for Constrained Text-to-Image Synthesis	Aishwarya Agarwal et.al.	2409.02429	null
2024-09-04	Diffusion Models Learn Low-Dimensional Distributions via Subspace Clustering	Peng Wang et.al.	2409.02426	link
2024-09-04	Exploring Low-Dimensional Subspaces in Diffusion Models for Controllable Image Editing	Siyi Chen et.al.	2409.02374	link
2024-09-03	QID$^2$: An Image-Conditioned Diffusion Model for Q-space Up-sampling of DWI Data	Zijian Chen et.al.	2409.02309	null
2024-09-03	FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation	Takuhiro Kaneko et.al.	2409.02245	null
2024-09-05	LinFusion: 1 GPU, 1 Minute, 16K Image	Songhua Liu et.al.	2409.02097	link
2024-09-03	DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos	Wenbo Hu et.al.	2409.02095	link
2024-09-03	ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis	Wangbo Yu et.al.	2409.02048	null
2024-08-30	Subspace Diffusion Posterior Sampling for Travel-Time Tomography	Xiang Cao et.al.	2408.17333	null
2024-08-30	RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance	Avideep Mukherjee et.al.	2408.17095	null
2024-08-30	Instant Adversarial Purification with Adversarial Consistency Distillation	Chun Tong Lei et.al.	2408.17064	null
2024-08-30	Text-to-Image Generation Via Energy-Based CLIP	Roy Ganz et.al.	2408.17046	null
2024-08-30	Contrastive Learning with Synthetic Positives	Dewen Zeng et.al.	2408.16965	link
2024-08-29	Enabling Local Editing in Diffusion Models by Joint and Individual Component Analysis	Theodoros Kouzelis et.al.	2408.16845	null
2024-08-29	ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model	Fangfu Liu et.al.	2408.16767	null
2024-08-29	CSGO: Content-Style Composition in Text-to-Image Generation	Peng Xing et.al.	2408.16766	null
2024-08-29	DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving	Yongjie Fu et.al.	2408.16647	null
2024-08-29	RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model	Zhuan Shi et.al.	2408.16634	null
2024-08-29	A Score-based Generative Solver for PDE-constrained Inverse Problems with Complex Priors	Yankun Hong et.al.	2408.16626	null
2024-08-29	GRPose: Learning Graph Relations for Human Image Generation with Pose Priors	Xiangchen Yin et.al.	2408.16540	link
2024-08-29	Spiking Diffusion Models	Jiahang Cao et.al.	2408.16467	link
2024-08-29	What to Preserve and What to Transfer: Faithful, Identity-Preserving Diffusion-based Hairstyle Transfer	Chaeyeon Chung et.al.	2408.16450	link
2024-08-29	COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation	Jiefeng Li et.al.	2408.16426	null
2024-08-29	Self-Improving Diffusion Models with Synthetic Data	Sina Alemohammad et.al.	2408.16333	null
2024-08-29	Enhanced Control for Diffusion Bridge in Image Restoration	Conghan Yue et.al.	2408.16303	link
2024-08-29	Advancing Architectural Floorplan Design with Geometry-enhanced Graph Diffusion	Sizhe Hu et.al.	2408.16258	link
2024-08-29	Error analysis of conformal finite element method for nonlocal diffusion model	Zuoqiang Shi et.al.	2408.16243	null
2024-08-29	Enhancing Conditional Image Generation with Explainable Latent Space Manipulation	Kshitij Pathania et.al.	2408.16232	link
2024-08-28	TEDRA: Text-based Editing of Dynamic and Photoreal Actors	Basavaraj Sunagad et.al.	2408.15995	null
2024-08-28	Distribution Backtracking Builds A Faster Convergence Trajectory for One-step Diffusion Distillation	Shengyuan Zhang et.al.	2408.15991	link
2024-08-28	Gen-Swarms: Adapting Deep Generative Models to Swarms of Drones	Carlos Plou et.al.	2408.15899	null
2024-08-28	Airfoil Diffusion: Denoising Diffusion Model For Conditional Airfoil Generation	Reid Graves et.al.	2408.15898	link
2024-08-28	Disentangled Diffusion Autoencoder for Harmonization of Multi-site Neuroimaging Data	Ayodeji Ijishakin et.al.	2408.15890	null
2024-08-28	GenDDS: Generating Diverse Driving Video Scenarios with Prompt-to-Video Generative Model	Yongjie Fu et.al.	2408.15868	null
2024-08-28	Defending Text-to-image Diffusion Models: Surprising Efficacy of Textual Perturbations Against Backdoor Attacks	Oscar Chew et.al.	2408.15721	null
2024-08-28	Synthetic Forehead-creases Biometric Generation for Reliable User Verification	Abhishek Tandon et.al.	2408.15693	link
2024-08-28	Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas	Fabio Quattrini et.al.	2408.15660	link
2024-08-28	Grand canonical generative diffusion model for crystalline phases and grain boundaries	Bo Lei et.al.	2408.15601	null
2024-08-28	MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning	Yifu Yuan et.al.	2408.15501	null
2024-08-28	On the implementation of linear finite element method for nonlocal diffusion model over 2D domain	Zuoqiang Shi et.al.	2408.15472	null
2024-08-28	Hand1000: Generating Realistic Hands from Text with Only 1,000 Images	Haozhuo Zhang et.al.	2408.15461	null
2024-08-27	Multi-Feature Aggregation in Diffusion Models for Enhanced Face Super-Resolution	Marcelo dos Santos et.al.	2408.15386	link
2024-08-27	GenRec: Unifying Video Generation and Recognition with Diffusion Models	Zejia Weng et.al.	2408.15241	link
2024-08-27	Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation	Xiaojuan Wang et.al.	2408.15239	null
2024-08-27	Simulation of Stochastic Discrete Dislocation Dynamics in Ductile Vs Brittle Materials	Santosh Chhetri et.al.	2408.15157	null
2024-08-27	DIFR3CT: Latent Diffusion for Probabilistic 3D CT Reconstruction from Few Planar X-Rays	Yiran Sun et.al.	2408.15118	link
2024-08-27	Constrained Diffusion Models via Dual Training	Shervin Khalafi et.al.	2408.15094	null
2024-08-27	LN-Gen: Rectal Lymph Nodes Generation via Anatomical Features	Weidong Guo et.al.	2408.14977	null
2024-08-27	Foundation Models for Music: A Survey	Yinghao Ma et.al.	2408.14340	link
2024-08-26	TC-PDM: Temporally Consistent Patch Diffusion Models for Infrared-to-Visible Video Translation	Anh-Dzung Doan et.al.	2408.14227	link
2024-08-26	MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement	Xu He et.al.	2408.14211	null
2024-08-27	SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher	Trung Dao et.al.	2408.14176	link
2024-08-26	Foodfusion: A Novel Approach for Food Image Composition via Diffusion Models	Chaohua Shi et.al.	2408.14135	null
2024-08-26	SurGen: Text-Guided Diffusion Model for Surgical Video Generation	Joseph Cho et.al.	2408.14028	null
2024-08-26	Pixel-Aligned Multi-View Generation with Depth Guided Decoder	Zhenggang Tang et.al.	2408.14016	null
2024-08-25	SimpleSpeech 2: Towards Simple and Efficient Text-to-Speech with Flow-based Scalar Latent Transformer Diffusion Models	Dongchao Yang et.al.	2408.13893	null
2024-08-25	Particle-Filtering-based Latent Diffusion for Inverse Problems	Amir Nazemi et.al.	2408.13868	null
2024-08-25	Draw Like an Artist: Complex Scene Generation with Diffusion Model via Composition, Painting, and Retouching	Minghao Liu et.al.	2408.13858	null
2024-08-25	Bring the Power of Diffusion Model to Defect Detection	Xuyi Yu et.al.	2408.13845	null
2024-08-25	3D-VirtFusion: Synthetic 3D Data Augmentation through Generative Diffusion Models and Controllable Editing	Shichao Dong et.al.	2408.13788	null
2024-08-25	Guided and Fused: Efficient Frozen CLIP-ViT with Feature Guidance and Multi-Stage Feature Fusion for Generalizable Deepfake Detection	Yingjian Chen et.al.	2408.13697	null
2024-08-24	GenCA: A Text-conditioned Generative Model for Realistic and Drivable Codec Avatars	Keqiang Sun et.al.	2408.13674	null
2024-08-27	Prompt-Softbox-Prompt: A free-text Embedding Control for Image Editing	Yitong Yang et.al.	2408.13623	null
2024-08-24	DualAnoDiff: Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation	Ying Jin et.al.	2408.13509	link
2024-08-24	Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model	Chen Rao et.al.	2408.13459	link
2024-08-27	Training-free Long Video Generation with Chain of Diffusion Model Experts	Wenhao Li et.al.	2408.13423	null
2024-08-24	TVG: A Training-free Transition Video Generation Method with Diffusion Models	Rui Zhang et.al.	2408.13413	null
2024-08-23	Task-Oriented Diffusion Inversion for High-Fidelity Text-based Editing	Yangyang Xu et.al.	2408.13395	null
2024-08-22	xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations	Can Qin et.al.	2408.12590	null
2024-08-22	ssProp: Energy-Efficient Training for Convolutional Neural Networks with Scheduled Sparse Back Propagation	Lujia Zhong et.al.	2408.12561	link
2024-08-22	Show-o: One Single Transformer to Unify Multimodal Understanding and Generation	Jinheng Xie et.al.	2408.12528	null
2024-08-22	FlexEdit: Marrying Free-Shape Masks to VLLM for Flexible Image Editing	Jue Wang et.al.	2408.12429	link
2024-08-22	4D Diffusion for Dynamic Protein Structure Prediction with Reference Guided Motion Alignment	Kaihui Cheng et.al.	2408.12419	null
2024-08-22	CODE: Confident Ordinary Differential Editing	Bastien van Delft et.al.	2408.12418	link
2024-08-22	Dynamic PDB: A New Dataset and a SE(3) Model Extension by Integrating Dynamic Behaviors and Physical Properties in Protein Structures	Ce Liu et.al.	2408.12413	null
2024-08-22	LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency Distillation	Shihao Chen et.al.	2408.12354	null
2024-08-23	GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections	Shiyue Zhang et.al.	2408.12352	null
2024-08-22	Variance reduction of diffusion model’s gradients with Taylor approximation-based control variate	Paul Jeha et.al.	2408.12270	null
2024-08-22	Scalable Autoregressive Image Generation with Mamba	Haopeng Li et.al.	2408.12245	link
2024-08-22	DimeRec: A Unified Framework for Enhanced Sequential Recommendation via Generative Diffusion Models	Wuchao Li et.al.	2408.12153	null
2024-08-22	An evidence-accumulating drift-diffusion model of competing information spread on networks	Julien Corsin et.al.	2408.12127	null
2024-08-22	ZipGait: Bridging Skeleton and Silhouette with Diffusion Model for Advancing Gait Recognition	Fanxu Min et.al.	2408.12111	null
2024-08-22	Pareto Inverse Reinforcement Learning for Diverse Expert Policy Generation	Woo Kyung Kim et.al.	2408.12110	null
2024-08-22	Spin relaxation in graphite due to spin-orbital-phonon interaction from first-principles density-matrix approach	Junqing Xu et.al.	2408.12054	null
2024-08-21	CaRDiff: Video Salient Object Ranking Chain of Thought Reasoning for Saliency Prediction with Diffusion	Yunlong Tang et.al.	2408.12009	null
2024-08-21	Pixel Is Not A Barrier: An Effective Evasion Attack for Pixel-Domain Diffusion Models	Chun-Yen Shih et.al.	2408.11810	null
2024-08-21	Timeline and Boundary Guided Diffusion Network for Video Shadow Detection	Haipeng Zhou et.al.	2408.11785	link
2024-08-21	JieHua Paintings Style Feature Extracting Model using Stable Diffusion with ControlNet	Yujia Gu et.al.	2408.11744	null
2024-08-21	Iterative Object Count Optimization for Text-to-image Diffusion Models	Oz Zafar et.al.	2408.11721	null
2024-08-21	FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting	Liyao Jiang et.al.	2408.11706	null
2024-08-21	Moderate deviation principles for a reaction diffusion model in non-equilibrium	Linjie Zhao et.al.	2408.11633	null
2024-08-21	Bayesian inversion for the identification of the doping profile in unipolar semiconductor devices	Leila Taghizadeh et.al.	2408.11485	null
2024-08-21	Latent Feature and Attention Dual Erasure Attack against Multi-View Diffusion Models for 3D Assets Protection	Jingwei Sun et.al.	2408.11408	link
2024-08-21	Video Diffusion Models are Strong Video Inpainter	Minhyeok Lee et.al.	2408.11402	null
2024-08-21	Generative AI based Secure Wireless Sensing for ISAC Networks	Jiacheng Wang et.al.	2408.11398	null
2024-08-21	Gender Bias Evaluation in Text-to-image Generation: A Survey	Yankun Wu et.al.	2408.11358	null
2024-08-21	HumanCoser: Layered 3D Human Generation via Semantic-Aware Diffusion Model	Yi Wang et.al.	2408.11357	null
2024-08-21	UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation	Xiangyu Zhao et.al.	2408.11305	link
2024-08-21	Taming Generative Diffusion for Universal Blind Image Restoration	Siwei Tu et.al.	2408.11287	null
2024-08-20	Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model	Chunting Zhou et.al.	2408.11039	null
2024-08-20	MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning	Haoning Wu et.al.	2408.11001	link
2024-08-20	GreediRIS: Scalable Influence Maximization using Distributed Streaming Maximum Cover	Reet Barik et.al.	2408.10982	null
2024-08-20	Kilometer-Scale Convection Allowing Model Emulation using Generative Diffusion Modeling	Jaideep Pathak et.al.	2408.10958	null
2024-08-20	Large Point-to-Gaussian Model for Image-to-3D Generation	Longfei Lu et.al.	2408.10935	null
2024-08-20	A Grey-box Attack against Latent Diffusion Model-based Image Editing by Posterior Collapse	Zhongliang Guo et.al.	2408.10901	null
2024-08-19	MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model	Minghua Liu et.al.	2408.10198	null
2024-08-19	SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views	Chao Xu et.al.	2408.10195	null
2024-08-19	Multi-layer diffusion model of photovoltaic installations	Tomasz Weron et.al.	2408.09904	null
2024-08-19	Instruction-Based Molecular Graph Generation with Unified Text-Graph Diffusion Model	Yuran Xiang et.al.	2408.09896	link
2024-08-19	SurgicaL-CD: Generating Surgical Images via Unpaired Image Translation with Latent Consistency Diffusion Models	Danush Kumar Venkatesh et.al.	2408.09822	link
2024-08-19	Latent Diffusion for Guided Document Table Generation	Syed Jawwad Haider Hamdani et.al.	2408.09800	null
2024-08-19	Unsupervised Composable Representations for Audio	Giovanni Bindi et.al.	2408.09792	link
2024-08-19	Propagating the prior from shallow to deep with a pre-trained velocity-model Generative Transformer network	Randy Harsuko et.al.	2408.09767	null
2024-08-19	Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering	Ruofan Liang et.al.	2408.09702	null
2024-08-19	ExpoMamba: Exploiting Frequency SSM Blocks for Efficient and Effective Image Enhancement	Eashan Adhikarla et.al.	2408.09650	link
2024-08-18	Moonshine: Distilling Game Content Generators into Steerable Generative Models	Yuhe Nie et.al.	2408.09594	null
2024-08-18	Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning	Zhiwei Xu et.al.	2408.09501	null
2024-08-18	FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model	Ziyu Yao et.al.	2408.09384	null
2024-08-18	Unpaired Volumetric Harmonization of Brain MRI with Conditional Latent Diffusion	Mengqi Wu et.al.	2408.09315	null
2024-08-17	RepControlNet: ControlNet Reparameterization	Zhaoli Deng et.al.	2408.09240	null
2024-08-17	Are CLIP features all you need for Universal Synthetic Image Origin Attribution?	Dario Cioni et.al.	2408.09153	link
2024-08-17	Realistic Extreme Image Rescaling via Generative Latent Space Learning	Ce Wang et.al.	2408.09151	link
2024-08-17	Barbie: Text to Barbie-Style 3D Avatars	Xiaokun Sun et.al.	2408.09126	link
2024-08-17	Fragment-Masked Molecular Optimization	Kun Li et.al.	2408.09106	null
2024-08-16	Efficient Autoregressive Audio Modeling via Next-Scale Prediction	Kai Qiu et.al.	2408.09027	link
2024-08-15	Accelerated Image-Aware Generative Diffusion Modeling	Tanmay Asthana et.al.	2408.08306	null
2024-08-15	Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding	Xiner Li et.al.	2408.08252	link
2024-08-15	Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion	Adi Haviv et.al.	2408.08184	null
2024-08-15	Conditional Brownian Bridge Diffusion Model for VHR SAR to Optical Image Translation	Seon-Hoon Kim et.al.	2408.07947	link
2024-08-14	Moderator: Moderating Text-to-Image Diffusion Models through Fine-grained Context-based Policies	Peiran Wang et.al.	2408.07728	link
2024-08-14	Drug Discovery SMILES-to-Pharmacokinetics Diffusion Models with Deep Molecular Understanding	Bing Hu et.al.	2408.07636	null
2024-08-14	Anisotropic Diffusion Model of Communication in 2D Biofilm	Yanahan Paramalingam et.al.	2408.07626	null
2024-08-14	DifuzCam: Replacing Camera Lens with a Mask and a Diffusion Model	Erez Yosef et.al.	2408.07541	null
2024-08-14	DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency	Xiaojing Zhong et.al.	2408.07481	null
2024-08-14	One Step Diffusion-based Super-Resolution with Time-Aware Distillation	Xiao He et.al.	2408.07476	link
2024-08-14	Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models	Jean-Marie Lemercier et.al.	2408.07472	null
2024-08-14	KIND: Knowledge Integration and Diversion in Diffusion Models	Yucheng Xie et.al.	2408.07337	link
2024-08-14	GRIF-DM: Generation of Rich Impression Fonts using Diffusion Models	Lei Kang et.al.	2408.07259	link
2024-08-13	Representation-space diffusion models for generating periodic materials	Anshuman Sinha et.al.	2408.07213	null
2024-08-13	SeLoRA: Self-Expanding Low-Rank Adaptation of Latent Diffusion Model for Medical Image Synthesis	Yuchen Mao et.al.	2408.07196	null
2024-08-13	Imagen 3	Imagen-Team-Google et.al.	2408.07009	null
2024-08-13	Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models	Cheng Chen et.al.	2408.06995	null
2024-08-13	DCMSA: Multi-Head Self-Attention Mechanism Based on Deformable Convolution For Seismic Data Denoising	Wang Mingwei et.al.	2408.06963	null
2024-08-13	Diffusion Model for Slate Recommendation	Federico Tomasi et.al.	2408.06883	null
2024-08-13	DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion	Yujia Wu et.al.	2408.06740	null
2024-08-13	DiffSG: A Generative Solver for Network Optimization with Diffusion Model	Ruihuai Liang et.al.	2408.06701	link
2024-08-13	DC3DO: Diffusion Classifier for 3D Objects	Nursena Koprucu et.al.	2408.06693	link
2024-08-13	Leveraging Priors via Diffusion Bridge for Time Series Generation	Jinseong Park et.al.	2408.06672	null
2024-08-13	Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models	Chenqian Yan et.al.	2408.06646	null
2024-08-13	ViMo: Generating Motions from Casual Videos	Liangdong Qiu et.al.	2408.06614	null
2024-08-12	The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery	Chris Lu et.al.	2408.06292	link
2024-08-12	3D Reconstruction of Protein Structures from Multi-view AFM Images using Neural Radiance Fields (NeRFs)	Jaydeep Rade et.al.	2408.06244	null
2024-08-12	Novel View Synthesis from a Single Image with Pretrained Diffusion Guidance	Taewon Kang et.al.	2408.06157	null
2024-08-12	Efficient and Scalable Point Cloud Generation with Sparse Point-Voxel Diffusion Models	Ioannis Romanelis et.al.	2408.06145	link
2024-08-12	CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer	Zhuoyi Yang et.al.	2408.06072	link
2024-08-12	ControlNeXt: Powerful and Efficient Control for Image and Video Generation	Bohao Peng et.al.	2408.06070	link
2024-08-12	BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training	Xuanpu Zhang et.al.	2408.06047	link
2024-08-12	Diffuse-UDA: Addressing Unsupervised Domain Adaptation in Medical Image Segmentation with Appearance and Structure Aligned Diffusion Models	Haifan Gong et.al.	2408.05985	null
2024-08-12	UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization	Junjie He et.al.	2408.05939	link
2024-08-12	Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation	Utkarsh Nath et.al.	2408.05938	null
2024-08-12	A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models	Taehong Moon et.al.	2408.05927	link
2024-08-12	Classifier Guidance Enhances Diffusion-based Adversarial Purification by Preserving Predictive Information	Mingkun Zhang et.al.	2408.05900	null
2024-08-11	LaWa: Using Latent Space for In-Generation Image Watermarking	Ahmad Rezaei et.al.	2408.05868	link
2024-08-11	Egocentric Vision Language Planning	Zhirui Fang et.al.	2408.05802	null
2024-08-11	MTSCI: A Conditional Diffusion Model for Multivariate Time Series Consistent Imputation	Jianping Zhou et.al.	2408.05740	link
2024-08-11	SSL: A Self-similarity Loss for Improving Generative Image Super-resolution	Du Chen et.al.	2408.05713	link
2024-08-11	TC-KANRecon: High-Quality and Accelerated MRI Reconstruction via Adaptive KAN Mechanisms and Intelligent Feature Scaling	Ruiquan Ge et.al.	2408.05705	link
2024-08-11	StealthDiffusion: Towards Evading Diffusion Forensic Detection through Diffusion Model	Ziyin Zhou et.al.	2408.05669	link
2024-08-10	Speculative Diffusion Decoding: Accelerating Language Generation through Diffusion	Jacob K Christopher et.al.	2408.05636	null
2024-08-10	Diffusion Model-based Contrastive Learning for Human Activity Recognition	Chunjing Xiao et.al.	2408.05567	null
2024-08-08	Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics	Ruining Li et.al.	2408.04631	null
2024-08-08	Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User’s Casual Sketches	Yongzhi Xu et.al.	2408.04567	null
2024-08-08	Deep Generative Models in Robotics: A Survey on Learning from Multimodal Demonstrations	Julen Urain et.al.	2408.04380	null
2024-08-08	InstantStyleGaussian: Efficient Art Style Transfer with 3D Gaussian Splatting	Xin-Yi Yu et.al.	2408.04249	null
2024-08-08	LLDif: Diffusion Models for Low-light Emotion Recognition	Zhifeng Wang et.al.	2408.04235	null
2024-08-08	Connective Viewpoints of Signal-to-Noise Diffusion Models	Khanh Doan et.al.	2408.04221	null
2024-08-08	Diffusion Guided Language Modeling	Justin Lovelace et.al.	2408.04220	link
2024-08-07	Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model	Guoqing Zhu et.al.	2408.03748	link
2024-08-07	Unsupervised Detection of Fetal Brain Anomalies using Denoising Diffusion Models	Markus Ditlev Sjøgren Olsen et.al.	2408.03654	null
2024-08-07	TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization	Kien T. Pham et.al.	2408.03637	null
2024-08-07	Dirichlet forms of diffusion processes on Thoma simplex	Sergei Korotkikh et.al.	2408.03553	null
2024-08-06	Hybrid diffusion models: combining supervised and generative pretraining for label-efficient fine-tuning of segmentation models	Bruno Sauvalle et.al.	2408.03433	null
2024-08-06	Attacks and Defenses for Generative Diffusion Models: A Comprehensive Survey	Vu Tuan Truong et.al.	2408.03400	null
2024-08-06	Adversarial Domain Adaptation for Cross-user Activity Recognition Using Diffusion-based Noise-centred Learning	Xiaozhou Ye et.al.	2408.03353	link
2024-08-06	MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation	Xiaofeng Mao et.al.	2408.03312	null
2024-08-06	IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts	Ciara Rowles et.al.	2408.03209	null
2024-08-06	Iterative CT Reconstruction via Latent Variable Optimization of Shallow Diffusion Models	Sho Ozaki et.al.	2408.03156	null
2024-08-06	Training-Free Condition Video Diffusion Models for single frame Spatial-Semantic Echocardiogram Synthesis	Van Phi Nguyen et.al.	2408.03035	link
2024-08-06	Diffusion Model Meets Non-Exemplar Class-Incremental Learning and Beyond	Jichuan Zhang et.al.	2408.02983	null
2024-08-06	Data-Driven Stochastic Closure Modeling via Conditional Diffusion Model and Neural Operator	Xinghao Dong et.al.	2408.02965	null
2024-08-06	Diverse Generation while Maintaining Semantic Coordination: A Diffusion-Based Data Augmentation Method for Object Detection	Sen Nie et.al.	2408.02891	null
2024-08-05	Back-Projection Diffusion: Solving the Wideband Inverse Scattering Problem with Diffusion Models	Borong Zhang et.al.	2408.02866	link
2024-08-05	Text Conditioned Symbolic Drumbeat Generation using Latent Diffusion Models	Pushkar Jajoria et.al.	2408.02711	null
2024-08-05	RCDM: Enabling Robustness for Conditional Diffusion Model	Weifeng Xu et.al.	2408.02710	null
2024-08-05	LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba	Yunxiang Fu et.al.	2408.02615	link
2024-08-05	Multi-weather Cross-view Geo-localization Using Denoising Diffusion Models	Tongtong Feng et.al.	2408.02408	null
2024-08-05	A Sharp Convergence Theory for The Probability Flow ODEs of Diffusion Models	Gen Li et.al.	2408.02320	null
2024-08-05	Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders	Muhammad Abdullah Jamal et.al.	2408.02245	null
2024-08-04	LDFaceNet: Latent Diffusion-based Network for High-Fidelity Deepfake Generation	Dwij Mehta et.al.	2408.02078	null
2024-08-04	Step Saver: Predicting Minimum Denoising Steps for Diffusion Model Image Generation	Jean Yu et.al.	2408.02054	null
2024-08-04	Robustness of Watermarking on Text-to-Image Diffusion Models	Xiaodong Wu et.al.	2408.02035	null
2024-08-04	Faster Diffusion Action Segmentation	Shuaibing Wang et.al.	2408.02024	null
2024-08-04	AnomalySD: Few-Shot Multi-Class Anomaly Detection with Stable Diffusion Model	Zhenyu Yan et.al.	2408.01960	null
2024-08-04	Dataset Scale and Societal Consistency Mediate Facial Impression Bias in Vision-Language AI	Robert Wolfe et.al.	2408.01959	null
2024-08-04	Why Perturbing Symbolic Music is Necessary: Fitting the Distribution of Never-used Notes through a Joint Probabilistic Diffusion Model	Shipei Liu et.al.	2408.01950	null
2024-08-03	SkyDiffusion: Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm	Junyan Ye et.al.	2408.01812	null
2024-08-03	Landmark-guided Diffusion Model for High-fidelity and Temporally Coherent Talking Head Generation	Jintao Tan et.al.	2408.01732	null
2024-08-02	Conformal Diffusion Models for Individual Treatment Effect Estimation and Inference	Hengrui Cai et.al.	2408.01582	null
2024-08-02	Conditional LoRA Parameter Generation	Xiaolong Jin et.al.	2408.01415	null
2024-08-02	TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling	Dong Huo et.al.	2408.01291	null
2024-08-02	A General Framework to Boost 3D GS Initialization for Text-to-3D Generation by Lexical Richness	Lutao Jiang et.al.	2408.01269	null
2024-08-02	CLIP4Sketch: Enhancing Sketch to Mugshot Matching through Dataset Augmentation using Diffusion Models	Kushal Kumar Jain et.al.	2408.01233	null
2024-08-02	EIUP: A Training-Free Approach to Erase Non-Compliant Concepts Conditioned on Implicit Unsafe Prompts	Die Chen et.al.	2408.01014	null
2024-08-06	FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features for Highly Controllable Text-Driven Image Translation	Xiang Gao et.al.	2408.00998	link
2024-08-01	Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation	Yixiao Wang et.al.	2408.00766	null
2024-08-01	Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention	Susung Hong et.al.	2408.00760	link
2024-08-01	TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models	Gilad Deutch et.al.	2408.00735	null
2024-08-01	MotionFix: Text-Driven 3D Human Motion Editing	Nikos Athanasiou et.al.	2408.00712	null
2024-08-01	Evaluation Metrics and Methods for Generative Models in the Wireless PHY Layer	Michael Baur et.al.	2408.00634	null
2024-08-01	Illustrating Classic Brazilian Books using a Text-To-Image Diffusion Model	Felipe Mahlow et.al.	2408.00544	null
2024-08-01	Towards Reliable Advertising Image Generation Using Human Feedback	Zhenbang Du et.al.	2408.00418	link
2024-08-01	Deepfake Media Forensics: State of the Art and Challenges Ahead	Irene Amerini et.al.	2408.00388	null
2024-08-01	On the Limitations and Prospects of Machine Unlearning for Generative AI	Shiji Zhou et.al.	2408.00376	null
2024-08-01	DiM-Gesture: Co-Speech Gesture Generation with Adaptive Layer Normalization Mamba-2 framework	Fan Zhang et.al.	2408.00370	null
2024-08-01	A Simple Background Augmentation Method for Object Detection with Diffusion Model	Yuhang Li et.al.	2408.00350	null
2024-08-01	ADBM: Adversarial diffusion bridge model for reliable adversarial purification	Xiao Li et.al.	2408.00315	null
2024-08-01	Diff3DETR: Agent-based Diffusion Model for Semi-supervised 3D Object Detection	Jiacheng Deng et.al.	2408.00286	null
2024-08-01	Navigating Text-to-Image Generative Bias across Indic Languages	Surbhi Mittal et.al.	2408.00283	null
2024-08-01	Lost in Translation: Latent Concept Misalignment in Text-to-Image Diffusion Models	Juntu Zhao et.al.	2408.00230	link
2024-07-31	Hierarchical Conditioning of Diffusion Models Using Tree-of-Life for Studying Species Evolution	Mridul Khurana et.al.	2408.00160	null
2024-07-31	Generative Learning of the Solution of Parametric Partial Differential Equations Using Guided Diffusion Models and Virtual Observations	Han Gao et.al.	2408.00157	null
2024-07-31	WAS: Dataset and Methods for Artistic Text Segmentation	Xudong Xie et.al.	2408.00106	link
2024-07-31	Localized Gaussian Splatting Editing with Contextual Awareness	Hanyuan Xiao et.al.	2408.00083	null
2024-07-31	Detecting, Explaining, and Mitigating Memorization in Diffusion Models	Yuxin Wen et.al.	2407.21720	link
2024-07-31	Tora: Trajectory-oriented Diffusion Transformer for Video Generation	Zhenghao Zhang et.al.	2407.21705	link
2024-07-31	Generative Diffusion Model for Seismic Imaging Improvement of Sparsely Acquired Data and Uncertainty Quantification	Xingchen Shi et.al.	2407.21683	null
2024-07-31	Explainable and Controllable Motion Curve Guided Cardiac Ultrasound Video Generation	Junxuan Yu et.al.	2407.21490	null
2024-07-31	Fine-gained Zero-shot Video Sampling	Dengsheng Chen et.al.	2407.21475	null
2024-07-31	Deformable 3D Shape Diffusion Model	Dengsheng Chen et.al.	2407.21428	null
2024-07-31	Diff-Cleanse: Identifying and Mitigating Backdoor Attacks in Diffusion Models	Jiang Hao et.al.	2407.21316	link
2024-07-31	State-observation augmented diffusion model for nonlinear assimilation	Zhuoyuan Li et.al.	2407.21314	link
2024-07-31	DEF-oriCORN: efficient 3D scene understanding for robust language-directed manipulation without demonstrations	Dongwon Son et.al.	2407.21267	null
2024-07-30	Informed Correctors for Discrete Diffusion Models	Yixiu Zhao et.al.	2407.21243	null
2024-07-30	Diffusion-Based Generation of Neural Activity from Disentangled Latent Codes	Jonathan D. McCart et.al.	2407.21195	null
2024-07-30	Embedding Space Selection for Detecting Memorization and Fingerprinting in Generative Models	Jack He et.al.	2407.21159	null
2024-07-30	On the optimal design of a new class of proportional portfolio insurance strategies in a jump-diffusion framework	Katia Colaneri et.al.	2407.21148	null
2024-07-30	Matting by Generation	Zhixiang Wang et.al.	2407.21017	null
2024-07-30	Add-SD: Rational Generation without Manual Reference	Lingfeng Yang et.al.	2407.21016	link
2024-07-30	Vulnerabilities in AI-generated Image Detection: The Challenge of Adversarial Attacks	Yunfeng Diao et.al.	2407.20836	null
2024-07-30	Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning	Norman Di Palo et.al.	2407.20798	null
2024-08-01	SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models	Zheng Liu et.al.	2407.20756	link
2024-07-30	EgoSonics: Generating Synchronized Audio for Silent Egocentric Videos	Aashish Rai et.al.	2407.20592	null
2024-07-30	DiffusionCounterfactuals: Inferring High-dimensional Counterfactuals with Guidance of Causal Representations	Jiageng Zhu et.al.	2407.20553	null
2024-07-29	Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing	Ekaterina Iakovleva et.al.	2407.20232	null
2024-07-29	LatentArtiFusion: An Effective and Efficient Histological Artifacts Restoration Framework	Zhenqi He et.al.	2407.20172	link
2024-07-29	Diffusion Feedback Helps CLIP See Better	Wenxuan Wang et.al.	2407.20171	link
2024-07-29	DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion Models	Jing Yang et.al.	2407.20141	null
2024-07-29	Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning	Liyuan Mao et.al.	2407.20109	null
2024-07-29	Generative Diffusion Model Bootstraps Zero-shot Classification of Fetal Ultrasound Images In Underrepresented African Populations	Fangyijie Wang et.al.	2407.20072	link
2024-07-29	ImagiNet: A Multi-Content Dataset for Generalizable Synthetic Image Detection via Contrastive Learning	Delyan Boychev et.al.	2407.20020	link
2024-07-29	MambaGesture: Enhancing Co-Speech Gesture Generation with Mamba and Disentangled Multi-Modality Fusion	Chencan Fu et.al.	2407.19976	null
2024-07-29	FedDEO: Description-Enhanced One-Shot Federated Learning with Diffusion Models	Mingzhao Yang et.al.	2407.19953	null
2024-07-29	FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention	Yu Lu et.al.	2407.19918	null
2024-07-29	Map2Traj: Street Map Piloted Zero-shot Trajectory Generation with Diffusion Model	Zhenyu Tao et.al.	2407.19765	null
2024-07-30	Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture	ShahRukh Athar et.al.	2407.19593	null
2024-07-28	Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle	Zhenyu Tang et.al.	2407.19548	null
2024-07-28	Temporal Feature Matters: A Framework for Diffusion Model Quantization	Yushi Huang et.al.	2407.19547	null
2024-07-28	MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllability and Generalizability	Buyu Liu et.al.	2407.19468	link
2024-07-28	White Matter Geometry-Guided Score-Based Diffusion Model for Tissue Microstructure Imputation in Tractography Imaging	Yui Lo et.al.	2407.19460	null
2024-07-28	FIND: Fine-tuning Initial Noise Distribution with Policy Optimization for Diffusion Models	Changgu Chen et.al.	2407.19453	link
2024-07-28	ClickDiff: Click to Induce Semantic Contact Map for Controllable Grasp Generation with Diffusion Models	Peiming Li et.al.	2407.19370	link
2024-07-27	Radio Frequency Signal based Human Silhouette Segmentation: A Sequential Diffusion Approach	Penghui Wen et.al.	2407.19244	link
2024-07-27	Data Processing Techniques for Modern Multimodal Models	Yinheng Li et.al.	2407.19180	null
2024-07-25	RegionDrag: Fast Region-Based Image Editing with Diffusion Models	Jingyi Lu et.al.	2407.18247	null
2024-07-25	VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads	Orest Kupyn et.al.	2407.18245	link
2024-07-25	Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images	Roberto Di Via et.al.	2407.18125	null
2024-07-25	Segmentation-guided MRI reconstruction for meaningfully diverse reconstructions	Jan Nikolas Morshuis et.al.	2407.18026	link
2024-07-25	Self-Supervision Improves Diffusion Models for Tabular Data Imputation	Yixin Liu et.al.	2407.18013	link
2024-07-25	Lightweight Language-driven Grasp Detection using Conditional Consistency Model	Nghia Nguyen et.al.	2407.17967	null
2024-07-25	ReCorD: Reasoning and Correcting Diffusion for HOI Generation	Jian-Yu Jiang-Lin et.al.	2407.17911	link
2024-07-25	Amortized Posterior Sampling with Diffusion Prior Distillation	Abbas Mammadov et.al.	2407.17907	null
2024-07-25	Artificial Immunofluorescence in a Flash: Rapid Synthetic Imaging from Brightfield Through Residual Diffusion	Xiaodan Xing et.al.	2407.17882	null
2024-07-25	DragText: Rethinking Text Embedding in Point-based Image Editing	Gayoon Choi et.al.	2407.17843	link
2024-07-25	Mpox Detection Advanced: Rapid Epidemic Response Through Synthetic Data	Yudara Kularathne et.al.	2407.17762	null
2024-07-25	Multi-physics Simulation Guided Generative Diffusion Models with Applications in Fluid and Heat Dynamics	Naichen Shi et.al.	2407.17720	link
2024-07-24	Diffusion Models for Multi-Task Generative Modeling	Changyou Chen et.al.	2407.17571	null
2024-07-24	SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency	Yiming Xie et.al.	2407.17470	null
2024-07-24	CDDIP: Constrained Diffusion-Driven Deep Image Prior for Seismic Image Reconstruction	Paul Goyes-Peñafiel et.al.	2407.17402	link
2024-07-25	LPGen: Enhancing High-Fidelity Landscape Painting Generation through Diffusion Model	Wanggong Yang et.al.	2407.17229	null
2024-07-24	Unpaired Photo-realistic Image Deraining with Energy-informed Diffusion Model	Yuanbo Wen et.al.	2407.17193	null
2024-07-24	MemBench: Memorized Image Trigger Prompt Dataset for Diffusion Models	Chunsan Hong et.al.	2407.17095	link
2024-07-24	Sparse Inducing Points in Deep Gaussian Processes: Enhancing Modeling with Denoising Diffusion Variational Inference	Jian Xu et.al.	2407.17033	null
2024-07-24	Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model	Lirui Zhao et.al.	2407.16982	link
2024-07-24	SAR to Optical Image Translation with Color Supervised Diffusion Model	Xinyu Bai et.al.	2407.16921	null
2024-07-23	VisMin: Visual Minimal-Change Understanding	Rabiul Awal et.al.	2407.16772	null
2024-07-23	Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions	Fabio Tosi et.al.	2407.16698	link
2024-07-23	From Imitation to Refinement – Residual RL for Precise Visual Assembly	Lars Ankile et.al.	2407.16677	null
2024-07-23	MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence	Canyu Zhao et.al.	2407.16655	null
2024-07-23	DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models	Zhenyu Xie et.al.	2407.16511	null
2024-07-23	MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection	Youngmin Oh et.al.	2407.16448	link
2024-07-23	On Differentially Private 3D Medical Image Synthesis with Controllable Latent Diffusion Models	Deniz Daum et.al.	2407.16405	link
2024-07-23	DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors	Zizheng Yan et.al.	2407.16260	null
2024-07-23	OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any Person	Ke Sun et.al.	2407.16224	null
2024-07-23	Diff-Shadow: Global-guided Diffusion Model for Shadow Removal	Jinting Luo et.al.	2407.16214	link
2024-07-23	CloudFixer: Test-Time Adaptation for 3D Point Clouds via Diffusion-Guided Geometric Transformation	Hajin Shim et.al.	2407.16193	null
2024-07-23	No Re-Train, More Gain: Upgrading Backbones with Diffusion Model for Few-Shot Segmentation	Shuai Chen et.al.	2407.16182	null
2024-07-22	Artist: Aesthetically Controllable Text-Driven Stylization without Training	Ruixiang Jiang et.al.	2407.15842	link
2024-07-22	Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget	Vikash Sehwag et.al.	2407.15811	link
2024-07-22	Diffusion Model Based Resource Allocation Strategy in Ultra-Reliable Wireless Networked Control Systems	Amirhassan Babazadeh Darabi et.al.	2407.15784	null
2024-07-22	A Hamilton-Jacobi approach to road-field reaction-diffusion models	Christopher Henderson et.al.	2407.15760	null
2024-07-22	Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond	Silvio Galesso et.al.	2407.15739	link
2024-07-22	Estimating Probability Densities with Transformer and Denoising Diffusion	Henry W. Leung et.al.	2407.15703	link
2024-07-22	Voltage mapping in subcellular nanodomains using electro-diffusion modeling	Frédéric Paquin-Lefebvre et.al.	2407.15697	null
2024-07-23	Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models	Xin Ma et.al.	2407.15642	link
2024-07-23	A Diffusion Model for Simulation Ready Coronary Anatomy with Morpho-skeletal Control	Karim Kadry et.al.	2407.15631	null
2024-07-22	StylusAI: Stylistic Adaptation for Robust German Handwritten Text Generation	Nauman Riaz et.al.	2407.15608	null
2024-07-22	Discrete Flow Matching	Itai Gat et.al.	2407.15595	null
2024-07-22	SpotDiffusion: A Fast Approach For Seamless Panorama Generation Over Time	Stanislav Frolov et.al.	2407.15507	link
2024-07-22	DiffX: Guide Your Layout to Cross-Modal Generative Modeling	Zeyu Wang et.al.	2407.15488	link
2024-07-22	A New Perspective on the Diffuse Gamma-Ray Emission Excess	Ensheng Chen et.al.	2407.15474	null
2024-07-22	A vector-host epidemic model with spatial structure and seasonality	Mingxin Wang et.al.	2407.15361	null
2024-07-22	Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion Models	Xiao Liu et.al.	2407.15328	link
2024-07-21	MedEdit: Counterfactual Diffusion-based Image Editing on Brain MRI	Malek Ben Alaya et.al.	2407.15270	null
2024-07-23	CGB-DM: Content and Graphic Balance Layout Generation with Transformer-based Diffusion Model	Yu Li et.al.	2407.15233	null
2024-07-21	Thermodynamics inconsistencies in cosmological unimodular gravity models	Miguel Cruz et.al.	2407.15207	null
2024-07-21	HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions	Haiyang Zhou et.al.	2407.15187	null
2024-07-18	LogoSticker: Inserting Logos into Diffusion Models for Customized Generation	Mingkang Zhu et.al.	2407.13752	null
2024-07-18	Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review	Masatoshi Uehara et.al.	2407.13734	link
2024-07-18	MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis	Ziming Zhong et.al.	2407.13675	link
2024-07-18	Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models	Xiaoyu Zhu et.al.	2407.13642	null
2024-07-18	Training-free Composite Scene Generation for Layout-to-Image Synthesis	Jiaqi Liu et.al.	2407.13609	link
2024-07-18	EnergyDiff: Universal Time-Series Energy Data Generation using Diffusion Models	Nan Lin et.al.	2407.13538	link
2024-07-18	All Roads Lead to Rome? Exploring Representational Similarities Between Latent Spaces of Generative Image Models	Charumathi Badrinath et.al.	2407.13449	link
2024-07-18	Movement-based models for abundance data	Ricardo Carrizo Vergara et.al.	2407.13384	null
2024-07-18	URCDM: Ultra-Resolution Image Synthesis in Histopathology	Sarah Cechnicka et.al.	2407.13277	link
2024-07-18	Unveiling Structural Memorization: Structural Membership Inference Attack for Text-to-Image Diffusion Models	Qiao Li et.al.	2407.13252	null
2024-07-18	MEDIC: Zero-shot Music Editing with Disentangled Inversion Control	Huadai Liu et.al.	2407.13220	null
2024-07-18	SpaDiT: Diffusion Transformer for Spatial Gene Expression Prediction using scRNA-seq	Xiaoyu Li et.al.	2407.13182	link
2024-07-18	Training-Free Large Model Priors for Multiple-in-One Image Restoration	Xuanhua He et.al.	2407.13181	null
2024-07-18	Image Inpainting Models are Effective Tools for Instruction-guided Image Editing	Xuan Ju et.al.	2407.13139	null
2024-07-18	FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection	Jianwei Zhao et.al.	2407.13133	null
2024-07-17	Denoising Diffusions in Latent Space for Medical Image Segmentation	Fahim Ahmed Zaman et.al.	2407.12952	link
2024-07-17	DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion	Huiguo He et.al.	2407.12899	null
2024-07-17	SMooDi: Stylized Motion Diffusion Model	Lei Zhong et.al.	2407.12783	null
2024-07-17	VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control	Sherwin Bahmani et.al.	2407.12781	null
2024-07-17	Hallucination Index: An Image Quality Metric for Generative Reconstruction Models	Matthew Tivnan et.al.	2407.12780	null
2024-07-17	GroundUp: Rapid Sketch-Based 3D City Massing	Gizem Esra Unlu et.al.	2407.12739	null
2024-07-17	NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model	Zhongqun Zhang et.al.	2407.12727	null
2024-07-18	SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow	Yuanzhi Zhu et.al.	2407.12718	link
2024-07-17	IMAGDressing-v1: Customizable Virtual Dressing	Fei Shen et.al.	2407.12705	link
2024-07-17	4Dynamic: Text-to-4D Generation with Hybrid Priors	Yu-Jie Yuan et.al.	2407.12684	null
2024-07-17	Promptable Counterfactual Diffusion Model for Unified Brain Tumor Segmentation and Generation with MRIs	Yiqing Shen et.al.	2407.12678	link
2024-07-17	CoSIGN: Few-Step Guidance of ConSIstency Model to Solve General INverse Problems	Jiankun Zhao et.al.	2407.12676	link
2024-07-17	Zero-shot Text-guided Infinite Image Synthesis with LLM guidance	Soyeong Kwon et.al.	2407.12642	null
2024-07-17	VegeDiff: Latent Diffusion Model for Geospatial Vegetation Forecasting	Sijie Zhao et.al.	2407.12592	null
2024-07-17	The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation	Yi Yao et.al.	2407.12579	null
2024-07-17	High Frequency Matters: Uncertainty Guided Image Compression with Wavelet Diffusion	Juan Song et.al.	2407.12538	link
2024-07-17	Leveraging the Mahalanobis Distance to enhance Unsupervised Brain MRI Anomaly Detection	Finn Behrendt et.al.	2407.12474	link
2024-07-17	Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning	Xu-Hui Liu et.al.	2407.12448	link
2024-07-17	Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models	Chao Gong et.al.	2407.12383	link
2024-07-17	HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects	Xintao Lv et.al.	2407.12371	null
2024-07-17	I2AM: Interpreting Image-to-Image Latent Diffusion Models via Attribution Maps	Junseo Park et.al.	2407.12331	null
2024-07-17	Label-Efficient 3D Brain Segmentation via Complementary 2D Diffusion Models with Orthogonal Views	Jihoon Cho et.al.	2407.12329	null
2024-07-15	Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion	Yongyuan Liang et.al.	2407.10973	null
2024-07-15	InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models	Nirat Saini et.al.	2407.10958	null
2024-07-16	DataDream: Few-shot Guided Dataset Generation	Jae Myung Kim et.al.	2407.10910	link
2024-07-15	Optical Diffusion Models for Image Generation	Ilker Oguz et.al.	2407.10897	null
2024-07-15	R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection	Zheyuan Zhou et.al.	2407.10862	null
2024-07-15	Physics-Inspired Generative Models in Medical Imaging: A Review	Dennis Hein et.al.	2407.10856	null
2024-07-15	Conditional Guided Generative Diffusion for Particle Accelerator Beam Diagnostics	Alexander Scheinker et.al.	2407.10693	null
2024-07-15	Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval	Youngsun Lim et.al.	2407.10683	null
2024-07-15	Temporal Residual Guided Diffusion Framework for Event-Driven Video Reconstruction	Lin Zhu et.al.	2407.10636	null
2024-07-15	WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models	Zijian He et.al.	2407.10625	null
2024-07-15	InsertDiffusion: Identity Preserving Visualization of Objects through a Training-Free Diffusion Architecture	Phillip Mueller et.al.	2407.10592	link
2024-07-15	Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation	Peng Jin et.al.	2407.10528	null
2024-07-15	Kinetic Typography Diffusion Model	Seonmi Park et.al.	2407.10476	null
2024-07-15	GROOT: Generating Robust Watermark for Diffusion-Model-Based Audio Synthesis	Weizhi Liu et.al.	2407.10471	null
2024-07-15	LiteFocus: Accelerated Diffusion Inference for Long Audio Synthesis	Zhenxiong Tan et.al.	2407.10468	link
2024-07-15	DiffStega: Towards Universal Training-Free Coverless Image Steganography with Diffusion Models	Yiwei Yang et.al.	2407.10459	link
2024-07-15	Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion	Jian Ma et.al.	2407.10373	null
2024-07-14	On an age-structured model in moving boundaries: The effects of nonlocal diffusion and harvesting pulse	Haiyan Xu et.al.	2407.10363	null
2024-07-14	Addressing Class Imbalance and Data Limitations in Advanced Node Semiconductor Defect Inspection: A Generative Approach for SEM Images	Bappaditya Dey et.al.	2407.10348	null
2024-07-14	Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors	Jae Joong Lee et.al.	2407.10330	null
2024-07-11	Video Diffusion Alignment via Reward Gradients	Mihir Prabhudesai et.al.	2407.08737	link
2024-07-11	Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models	Zhening Xing et.al.	2407.08701	null
2024-07-11	Controlling the Fidelity and Diversity of Deep Generative Models via Pseudo Density	Shuangqi Li et.al.	2407.08659	null
2024-07-11	Latent Conditional Diffusion-based Data Augmentation for Continuous-Time Dynamic Graph Model	Yuxing Tian et.al.	2407.08500	null
2024-07-11	Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers	Zhengbo Zhang et.al.	2407.08394	null
2024-07-11	Wind Power Assessment based on Super-Resolution and Downscaling – A Comparison of Deep Learning Methods	Luca Schmidt et.al.	2407.08259	null
2024-07-11	Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling	Noam Elata et.al.	2407.08256	null
2024-07-11	E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors	Jinxiu Liang et.al.	2407.08231	null
2024-07-11	Survey on Fundamental Deep Learning 3D Reconstruction Techniques	Yonge Bai et.al.	2407.08137	null
2024-07-10	Geospecific View Generation – Geometry-Context Aware High-resolution Ground View Inference from Satellite Views	Ningli Xu et.al.	2407.08061	null
2024-07-10	Coherent and Multi-modality Image Inpainting via Latent Space Optimization	Lingzhi Pan et.al.	2407.08019	link
2024-07-10	Generative Image as Action Models	Mohit Shridhar et.al.	2407.07875	link
2024-07-10	Dynamical Measure Transport and Neural PDE Solvers for Sampling	Jingtong Sun et.al.	2407.07873	null
2024-07-10	Controlling Space and Time with Diffusion Models	Daniel Watson et.al.	2407.07860	null
2024-07-10	Generic Numerical Analysis of Stochastic Reaction Diffusion Model with applications in excitable media	Yahya Alnashri et.al.	2407.07834	null
2024-07-10	Universal and non-universal signatures in the scaling functions of critical variables	Gianluca Teza et.al.	2407.07782	null
2024-07-10	VEnhancer: Generative Space-Time Enhancement for Video Generation	Jingwen He et.al.	2407.07667	null
2024-07-11	MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis	Wanggui He et.al.	2407.07614	link
2024-07-10	Drantal-NeRF: Diffusion-Based Restoration for Anti-aliasing Neural Radiance Field	Ganlin Yang et.al.	2407.07461	null
2024-07-10	Secondary Structure-Guided Novel Protein Sequence Generation with Latent Graph Diffusion	Yutong Hu et.al.	2407.07443	link
2024-07-10	Deformation-Recovery Diffusion Model (DRDM): Instance Deformation for Image Manipulation and Synthesis	Jian-Qing Zheng et.al.	2407.07295	link
2024-07-09	A Very Effective and Simple Diffusion Reconstruction for the Diluted Ising Model	Stefano Bae et.al.	2407.07266	null
2024-07-09	Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion	Yu Cao et.al.	2407.07249	null
2024-07-09	Accelerating Mobile Edge Generation (MEG) by Constrained Learning	Xiaoxia Xu et.al.	2407.07245	null
2024-07-09	ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement	Muhammad Atif Butt et.al.	2407.07197	link
2024-07-09	CamFreeDiff: Camera-free Image to Panorama Generation with Diffusion Model	Xiaoding Yuan et.al.	2407.07174	null
2024-07-09	ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction	Shaozhe Hao et.al.	2407.07077	link
2024-07-11	RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models	Bowen Zhang et.al.	2407.06938	null
2024-07-09	HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance	Guian Fang et.al.	2407.06937	link
2024-07-09	A reaction-diffusion model for relapsing-remitting multiple sclerosis with a treatment term	Romina Travaglini et.al.	2407.06802	null
2024-07-09	Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning	Fanyue Wei et.al.	2407.06642	link
2024-07-08	JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation	Yu Zeng et.al.	2407.06187	null
2024-07-08	The Tug-of-War Between Deepfake Generation and Detection	Hannah Lee et.al.	2407.06174	null
2024-07-08	ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation	Ethan Chern et.al.	2407.06135	link
2024-07-08	Structured Generations: Using Hierarchical Clusters to guide Diffusion Models	Jorge da Silva Goncalves et.al.	2407.06124	link
2024-07-08	PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models	Jinhua Zhang et.al.	2407.06109	link
2024-07-08	Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation	Xinyu Bai et.al.	2407.06095	null
2024-07-08	Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis	Emaad Khwaja et.al.	2407.06079	null
2024-07-08	Analysis and finite element approximation of a diffuse interface approach to the Stokes–Biot coupling	Francis R. A. Aznaran et.al.	2407.05949	null
2024-07-08	Minutes to Seconds: Speeded-up DDPM-based Image Inpainting with Coarse-to-Fine Sampling	Lintao Zhang et.al.	2407.05875	link
2024-07-08	RadiomicsFill-Mammo: Synthetic Mammogram Mass Manipulation with Radiomics Features	Inye Na et.al.	2407.05683	link
2024-07-08	BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space	Yumeng Zhang et.al.	2407.05679	link
2024-07-08	Ada-adapter: Fast Few-shot Style Personalization of Diffusion Model with Pre-trained Image Encoder	Jia Liu et.al.	2407.05552	null
2024-07-08	Read, Watch and Scream! Sound Generation from Text and Video	Yujin Jeong et.al.	2407.05551	link
2024-07-08	LaSe-E2V: Towards Language-guided Semantic-Aware Event-to-Video Reconstruction	Kanghao Chen et.al.	2407.05547	null
2024-07-07	Diffusion as Sound Propagation: Physics-inspired Model for Ultrasound Image Generation	Marina Domínguez et.al.	2407.05428	link
2024-07-07	BiRoDiff: Diffusion policies for bipedal robot locomotion on unseen terrains	GVS Mothish et.al.	2407.05424	null
2024-07-07	Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model	Danni Yang et.al.	2407.05352	link
2024-07-07	Enhancing Label-efficient Medical Image Segmentation with Text-guided Diffusion Models	Chun-Mei Feng et.al.	2407.05323	null
2024-07-07	An Improved Method for Personalizing Diffusion Models	Yan Zeng et.al.	2407.05312	null
2024-07-07	DM-MIMO: Diffusion Models for Robust Semantic Communications over MIMO Channels	Yiheng Duan et.al.	2407.05289	null
2024-07-03	DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents	Yilun Xu et.al.	2407.03300	link
2024-07-03	Improved Noise Schedule for Diffusion Training	Tiankai Hang et.al.	2407.03297	null
2024-07-04	Spatio-Temporal Adaptive Diffusion Models for EEG Super-Resolution in Epilepsy Diagnosis	Tong Zhou et.al.	2407.03089	null
2024-07-03	Electromagnetic Property Sensing Based on Diffusion Model in ISAC System	Yuhua Jiang et.al.	2407.03075	null
2024-07-03	Semantic-Aware Power Allocation for Generative Semantic Communications with Foundation Models	Chunmei Xu et.al.	2407.03050	null
2024-07-03	SlerpFace: Face Template Protection via Spherical Linear Interpolation	Zhizhou Zhong et.al.	2407.03043	null
2024-07-03	Frequency-Controlled Diffusion Model for Versatile Text-Guided Image-to-Image Translation	Xiang Gao et.al.	2407.03006	link
2024-07-04	VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned Priors	Sungwon Hwang et.al.	2407.02945	link
2024-07-03	Single Image Rolling Shutter Removal with Diffusion Models	Zhanglei Yang et.al.	2407.02906	null
2024-07-03	Robot Shape and Location Retention in Video Generation Using Diffusion Models	Peng Wang et.al.	2407.02873	link
2024-07-03	Mirage Sources and Large TeV Halo-Pulsar Offsets: Exploring the Parameter Space	Yiwei Bao et.al.	2407.02829	null
2024-07-03	Highly Accelerated MRI via Implicit Neural Representation Guided Posterior Sampling of Diffusion Models	Jiayue Chu et.al.	2407.02744	null
2024-07-02	No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models	Seyedmorteza Sadat et.al.	2407.02687	null
2024-07-02	Diffusion Models for Tabular Data Imputation and Synthetic Data Generation	Mario Villaizán-Vallelado et.al.	2407.02549	null
2024-07-02	Magic Insert: Style-Aware Drag-and-Drop	Nataniel Ruiz et.al.	2407.02489	null
2024-07-03	Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models	Fei Shen et.al.	2407.02482	link
2024-07-02	GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models	Jian Ma et.al.	2407.02252	link
2024-07-02	LaMoD: Latent Motion Diffusion Model For Myocardial Strain Generation	Jiarui Xing et.al.	2407.02229	link
2024-07-04	UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks	Jingjing Ren et.al.	2407.02158	null
2024-07-02	Counterfactual Data Augmentation with Denoising Diffusion for Graph Anomaly Detection	Chunjing Xiao et.al.	2407.02143	link
2024-06-28	HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Model	Hieu T. Nguyen et.al.	2406.20077	null
2024-06-28	Neural Differentiable Modeling with Diffusion-Based Super-resolution for Two-Dimensional Spatiotemporal Turbulence	Xiantao Fan et.al.	2406.20047	null
2024-06-28	HAITCH: A Framework for Distortion and Motion Correction in Fetal Multi-Shell Diffusion-Weighted MRI	Haykel Snoussi et.al.	2406.20042	null
2024-06-28	Deceptive Diffusion: Generating Synthetic Adversarial Examples	Lucas Beerens et.al.	2406.19807	null
2024-06-28	Comprehensive Generative Replay for Task-Incremental Segmentation with Concurrent Appearance and Semantic Forgetting	Wei Li et.al.	2406.19796	link
2024-06-28	Decision Transformer for IRS-Assisted Systems with Diffusion-Driven Generative Channels	Jie Zhang et.al.	2406.19769	null
2024-06-28	DISCO: Efficient Diffusion Solver for Large-Scale Combinatorial Optimization Problems	Kexiong Yu et.al.	2406.19705	null
2024-06-28	Network Bending of Diffusion Models for Audio-Visual Generation	Luke Dzwonczyk et.al.	2406.19589	link
2024-06-27	A Thermal Study of Terahertz Induced Protein Interactions	Hadeel Elayan et.al.	2406.19521	null
2024-06-27	pop-cosmos: Scaleable inference of galaxy properties and redshifts with a data-driven population model	Stephen Thorp et.al.	2406.19437	null
2024-06-27	Accelerating Multiphase Flow Simulations with Denoising Diffusion Model Driven Initializations	Jaehong Chung et.al.	2406.19333	null
2024-06-27	Subtractive Training for Music Stem Insertion using Latent Diffusion Models	Ivan Villa-Renteria et.al.	2406.19328	null
2024-06-27	Compositional Image Decomposition with Diffusion Models	Jocelin Su et.al.	2406.19298	null
2024-06-27	Using diffusion model as constraint: Empower Image Restoration Network Training with Diffusion Model	Jiangtong Tan et.al.	2406.19030	link
2024-06-28	AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation	Yanan Sun et.al.	2406.18958	link
2024-06-27	Investigating and Defending Shortcut Learning in Personalized Diffusion Models	Yixin Liu et.al.	2406.18944	link
2024-06-28	AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models	Aishwarya Agarwal et.al.	2406.18893	null
2024-06-27	Chemical Continuous Time Random Walks under Anomalous Diffusion	Hong Zhang et.al.	2406.18869	null
2024-06-26	MultiDiff: Consistent Novel View Synthesis from a Single Image	Norman Müller et.al.	2406.18524	null
2024-06-26	Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration	Kang Liao et.al.	2406.18516	link
2024-06-26	DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance	Younghyun Kim et.al.	2406.18459	link
2024-06-26	Towards diffusion models for large-scale sea-ice modelling	Tobias Sebastian Finn et.al.	2406.18417	null
2024-06-27	Stable Diffusion Segmentation for Biomedical Images with Single-step Reverse Process	Tianyu Lin et.al.	2406.18361	link
2024-06-26	Molecular Diffusion Models with Virtual Receptors	Matan Halfon et.al.	2406.18330	null
2024-06-26	Galaxy spectroscopy without spectra: Galaxy properties from photometric images with conditional diffusion models	Lars Doorenbos et.al.	2406.18175	link
2024-06-26	Human-Aware 3D Scene Generation with Spatially-constrained Diffusion Models	Xiaolin Hong et.al.	2406.18159	null
2024-06-26	Leveraging Pre-trained Models for FF-to-FFPE Histopathological Image Translation	Qilai Zhang et.al.	2406.18054	link
2024-06-25	DiffusionPDE: Generative PDE-Solving Under Partial Observation	Jiahe Huang et.al.	2406.17763	link
2024-06-25	Unified Auto-Encoding with Masked Diffusion	Philippe Hansen-Estruch et.al.	2406.17688	link
2024-06-25	LaTable: Towards Large Tabular Models	Boris van Breugel et.al.	2406.17673	null
2024-06-25	Aligning Diffusion Models with Noise-Conditioned Perception	Alexander Gambashidze et.al.	2406.17636	null
2024-06-25	Diffusion-based Adversarial Purification for Intrusion Detection	Mohamed Amine Merzouk et.al.	2406.17606	link
2024-06-25	Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text	Xinyang Li et.al.	2406.17601	link
2024-06-25	Detection of Synthetic Face Images: Accuracy, Robustness, Generalization	Nela Petrzelkova et.al.	2406.17547	null
2024-06-25	Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation	Felix Stillger et.al.	2406.17541	null
2024-06-25	The Tree of Diffusion Life: Evolutionary Embeddings to Understand the Generation Process of Diffusion Models	Vidya Prasad et.al.	2406.17462	null
2024-06-25	SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing	Ruihuang Li et.al.	2406.17396	null
2024-06-25	Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers	Lei Chen et.al.	2406.17343	link
2024-06-24	FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models	Haonan Qiu et.al.	2406.16863	link
2024-06-24	Dreamitate: Real-World Visuomotor Policy Learning via Video Generation	Junbang Liang et.al.	2406.16862	null
2024-06-24	General Binding Affinity Guidance for Diffusion Models in Structure-Based Drug Design	Yue Jian et.al.	2406.16821	link
2024-06-24	Portrait3D: 3D Head Generation from Single In-the-wild Portrait Image	Jinkun Hao et.al.	2406.16710	null
2024-06-24	Geometry-Aware Score Distillation via 3D Consistent Noising and Gradient Consistency Modeling	Min-Seop Kwak et.al.	2406.16695	null
2024-06-24	Repulsive Score Distillation for Diverse Sampling of Diffusion Models	Nicolas Zilberstein et.al.	2406.16683	link
2024-06-24	OAML: Outlier Aware Metric Learning for OOD Detection Enhancement	Heng Gao et.al.	2406.16525	link
2024-06-24	DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution	Aiwen Jiang et.al.	2406.16477	link
2024-06-24	ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance	Shuwei Shi et.al.	2406.16476	null
2024-06-24	Prompt-Consistency Image Generation (PCIG): A Unified Framework Integrating LLMs, Knowledge Graphs, and Controllable Diffusion Models	Yichen Sun et.al.	2406.16333	null
2024-06-24	YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals	Sandeep Mishra et.al.	2406.16273	null
2024-06-24	Repairing Catastrophic-Neglect in Text-to-Image Diffusion Models via Attention-Guided Feature Enhancement	Zhiyuan Chang et.al.	2406.16272	link
2024-06-24	Video-Infinity: Distributed Long Video Generation	Zhenxiong Tan et.al.	2406.16260	null
2024-06-23	Provable Statistical Rates for Consistency Diffusion Models	Zehao Dou et.al.	2406.16213	null
2024-06-23	UDHF2-Net: An Uncertainty-diffusion-model-based High-Frequency TransFormer Network for High-accuracy Interpretation of Remotely Sensed Imagery	Pengfei Zhang et.al.	2406.16129	null
2024-06-23	Diffusion Spectral Representation for Reinforcement Learning	Dmitry Shribak et.al.	2406.16121	null
2024-06-23	Pose-Diversified Augmentation with Diffusion Model for Person Re-Identification	Inès Hyeonsu Kim et.al.	2406.16042	null
2024-06-23	TimeAutoDiff: Combining Autoencoder and Diffusion model for time series tabular data synthesizing	Namjoon Suh et.al.	2406.16028	link
2024-06-22	PUDD: Towards Robust Multi-modal Prototype-based Deepfake Detection	Alvaro Lopez Pellcier et.al.	2406.15921	null
2024-06-22	Soft Masked Mamba Diffusion Model for CT to MRI Conversion	Zhenbin Wang et.al.	2406.15910	link
2024-06-20	A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models	Xincheng Shuai et.al.	2406.14555	link
2024-06-21	Advancing Fine-Grained Classification by Structure and Subject Preserving Augmentation	Eyal Michaeli et.al.	2406.14551	link
2024-06-20	Consistency Models Made Easy	Zhengyang Geng et.al.	2406.14548	link
2024-06-20	Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps	Nikita Starodubcev et.al.	2406.14539	null
2024-06-20	V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data	Rotem Shalev-Arkushin et.al.	2406.14510	null
2024-06-20	SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset	Josef Dai et.al.	2406.14477	link
2024-06-20	CollaFuse: Collaborative Diffusion Models	Simeon Allmendinger et.al.	2406.14429	link
2024-06-20	Active Diffusion Subsampling	Oisin Nolan et.al.	2406.14388	link
2024-06-20	In Tree Structure Should Sentence Be Generated	Yaguang Li et.al.	2406.14189	link
2024-06-20	CriDiff: Criss-cross Injection Diffusion Framework via Generative Pre-train for Prostate Segmentation	Tingwei Liu et.al.	2406.14186	link
2024-06-20	ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning	Zhongjie Duan et.al.	2406.14130	link
2024-06-20	HeartBeat: Towards Controllable Echocardiography Video Synthesis with Multimodal Conditions-Guided Diffusion Models	Xinrui Zhou et.al.	2406.14098	null
2024-06-20	Bridging bulk and surface: An interacting particle system towards the field-road diffusion model	Matthieu Alfaro et.al.	2406.14093	null
2024-06-20	A Practical Diffusion Path for Sampling	Omar Chehab et.al.	2406.14040	null
2024-06-20	Similarity-aware Syncretic Latent Diffusion Model for Medical Image Translation with Representation Learning	Tingyi Lin et.al.	2406.13977	null
2024-06-20	Synthesizing Multimodal Electronic Health Records via Predictive Diffusion Models	Yuan Zhong et.al.	2406.13942	null
2024-06-20	EnTruth: Enhancing the Traceability of Unauthorized Dataset Usage in Text-to-image Diffusion Models with Minimal and Robust Alterations	Jie Ren et.al.	2406.13933	null
2024-06-19	INFusion: Diffusion Regularized Implicit Neural Representations for 2D and 3D accelerated MRI reconstruction	Yamin Arefeen et.al.	2406.13895	null
2024-06-19	Stability and Generalizability in SDE Diffusion Models with Measure-Preserving Dynamics	Weitong Zhang et.al.	2406.13652	null
2024-06-19	On AI-Inspired UI-Design	Jialiang Wei et.al.	2406.13631	null
2024-06-18	Evaluating the design space of diffusion-based generative models	Yuqing Wang et.al.	2406.12839	null
2024-06-18	Neural Approximate Mirror Maps for Constrained Diffusion Models	Berthy T. Feng et.al.	2406.12816	null
2024-06-18	Extracting Training Data from Unconditional Diffusion Models	Yunhao Chen et.al.	2406.12752	null
2024-06-18	Speak in the Scene: Diffusion-based Acoustic Scene Transfer toward Immersive Speech Generation	Miseul Kim et.al.	2406.12688	null
2024-06-18	GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models	Yongtao Ge et.al.	2406.12671	link
2024-06-18	Unmasking the Veil: An Investigation into Concept Ablation for Privacy and Copyright Protection in Images	Shivank Garg et.al.	2406.12592	link
2024-06-18	Training Diffusion Models with Federated Learning	Matthijs de Goede et.al.	2406.12575	null
2024-06-18	Variational Distillation of Diffusion Policies into Mixture of Experts	Hongyi Zhou et.al.	2406.12538	null
2024-06-18	HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors	Panwang Pan et.al.	2406.12459	link
2024-06-18	Planning Using Schrödinger Bridge Diffusion Models	Adarsh Srivastava et.al.	2406.12458	link
2024-06-18	Deep Temporal Deaggregation: Large-Scale Spatio-Temporal Generative Models	David Bergström et.al.	2406.12423	null
2024-06-18	TADM: Temporally-Aware Diffusion Model for Neurodegenerative Progression on Brain MRI	Mattia Litrico et.al.	2406.12411	null
2024-06-18	Effective Generation of Feasible Solutions for Integer Programming via Guided Diffusion	Hao Zeng et.al.	2406.12349	link
2024-06-18	Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment	Yiheng Li et.al.	2406.12303	null
2024-06-17	COT Flow: Learning Optimal-Transport Image Sampling and Editing by Contrastive Pairs	Xinrui Zu et.al.	2406.12140	null
2024-06-17	Adding Conditional Control to Diffusion Models with Reinforcement Learning	Yulai Zhao et.al.	2406.12120	null
2024-06-17	Optimal withdrawals in a general diffusion model with control rates subject to a state-dependent upper bound	Hélène Guérin et.al.	2406.12067	null
2024-06-17	ARTIST: Improving the Generation of Text-rich Images by Disentanglement	Jianyi Zhang et.al.	2406.12044	null
2024-06-17	Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models	Alireza Ganjdanesh et.al.	2406.12042	link
2024-06-17	Decomposed evaluations of geographic disparities in text-to-image models	Abhishek Sureddy et.al.	2406.11988	null
2024-06-17	Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models	Bingqi Ma et.al.	2406.11831	null
2024-06-17	MegaScenes: Scene-Level View Synthesis at Scale	Joseph Tung et.al.	2406.11819	link
2024-06-17	DiffMM: Multi-Modal Diffusion Model for Recommendation	Yangqin Jiang et.al.	2406.11781	link
2024-06-17	Latent Denoising Diffusion GAN: Faster sampling, Higher image quality	Luan Thanh Trinh et.al.	2406.11713	link
2024-06-17	MusicScore: A Dataset for Music Score Modeling and Generation	Yuheng Lin et.al.	2406.11462	link
2024-06-17	AnyTrans: Translate AnyText in the Image with Large Scale Models	Zhipeng Qian et.al.	2406.11432	null
2024-06-17	DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer	Keon Lee et.al.	2406.11427	null
2024-06-17	Unfolding Time: Generative Modeling for Turbulent Flows in 4D	Abdullah Saydemir et.al.	2406.11390	null
2024-06-17	Diffusion Models in Low-Level Vision: A Survey	Chunming He et.al.	2406.11138	link
2024-06-16	Exploiting Diffusion Prior for Out-of-Distribution Detection	Armando Zhu et.al.	2406.11105	null
2024-06-16	An Analysis on Quantizing Diffusion Transformers	Yuewei Yang et.al.	2406.11100	null
2024-06-16	A Bayesian Drift-Diffusion Model of Schachter-Singer’s Two Factor Theory of Emotion	Lance Ying et.al.	2406.11086	null
2024-06-16	ViD-GPT: Introducing GPT-style Autoregressive Generation in Video Diffusion Models	Kaifeng Gao et.al.	2406.10981	link
2024-06-16	Graph Neural Reaction Diffusion Models	Moshe Eliasof et.al.	2406.10871	null
2024-06-16	Diffusion Model With Optimal Covariance Matching	Zijing Ou et.al.	2406.10808	null
2024-06-16	Diffusion Models Are Promising for Ab Initio Structure Solutions from Nanocrystalline Powder Diffraction Data	Gabe Guo et.al.	2406.10796	link
2024-06-15	Beyond the Visible: Jointly Attending to Spectral and Spatial Dimensions with HSI-Diffusion for the FINCH Spacecraft	Ian Vyse et.al.	2406.10724	link
2024-06-18	A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing	Ming Meng et.al.	2406.10553	null
2024-06-15	Self-Supervised Vision Transformer for Enhanced Virtual Clothes Try-On	Lingxiao Lu et.al.	2406.10539	null
2024-06-15	Lift Your Molecules: Molecular Graph Generation in Latent Euclidean Space	Mohamed Amine Ketata et.al.	2406.10513	null
2024-06-12	Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation	Raphael Tang et.al.	2406.08482	null
2024-06-12	Human 3Diffusion: Realistic Avatar Creation via Explicit 3D Consistent Diffusion Models	Yuxuan Xue et.al.	2406.08475	null
2024-06-12	$\texttt{DiffLense}$: A Conditional Diffusion Model for Super-Resolution of Gravitational Lensing Data	Pranath Reddy et.al.	2406.08442	null
2024-06-12	Diffusion Soup: Model Merging for Text-to-Image Diffusion Models	Benjamin Biggs et.al.	2406.08431	null
2024-06-12	FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation	Xinzhi Mu et.al.	2406.08392	null
2024-06-12	Diff-A-Riff: Musical Accompaniment Co-creation via Latent Diffusion Models	Javier Nistal et.al.	2406.08384	null
2024-06-12	2.5D Multi-view Averaging Diffusion Model for 3D Medical Image Translation: Application to Low-count PET Reconstruction with CT-less Attenuation Correction	Tianqi Chen et.al.	2406.08374	null
2024-06-12	WMAdapter: Adding WaterMark Control to Latent Diffusion Models	Hai Ci et.al.	2406.08337	null
2024-06-12	Dataset Enhancement with Instance-Level Augmentations	Orest Kupyn et.al.	2406.08249	link
2024-06-12	Diffusion-Promoted HDR Video Reconstruction	Yuanshen Guan et.al.	2406.08204	null
2024-06-12	LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation	Wenhao Guan et.al.	2406.08203	link
2024-06-12	One-Step Effective Diffusion Network for Real-World Image Super-Resolution	Rongyuan Wu et.al.	2406.08177	link
2024-06-12	Defect-related Anomalous Mobility of Small polarons in Oxides: the Case of Congruent Lithium Niobate	Anton Pfannstiel et.al.	2406.08123	null
2024-06-12	Make Your Actor Talk: Generalizable and High-Fidelity Lip Sync with Motion and Appearance Disentanglement	Runyi Yu et.al.	2406.08096	null
2024-06-12	CFG++: Manifold-constrained Classifier Free Guidance for Diffusion Models	Hyungjin Chung et.al.	2406.08070	null
2024-06-12	Ablation Based Counterfactuals	Zheng Dai et.al.	2406.07908	null
2024-06-12	DiffPop: Plausibility-Guided Object Placement Diffusion for Image Composition	Jiacheng Liu et.al.	2406.07852	null
2024-06-12	Hierarchical Patch Diffusion Models for High-Resolution Video Generation	Ivan Skorokhodov et.al.	2406.07792	null
2024-06-11	HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction Awareness	Zihui Xue et.al.	2406.07754	null
2024-06-11	CUPID: Contextual Understanding of Prompt-conditioned Image Distributions	Yayan Zhao et.al.	2406.07699	null
2024-06-10	IllumiNeRF: 3D Relighting without Inverse Rendering	Xiaoming Zhao et.al.	2406.06527	null
2024-06-10	Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation	Peize Sun et.al.	2406.06525	link
2024-06-10	Monkey See, Monkey Do: Harnessing Self-attention in Motion Diffusion for Zero-shot Motion Transfer	Sigal Raab et.al.	2406.06508	link
2024-06-10	AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction	Zhen Xing et.al.	2406.06465	null
2024-06-10	Cometh: A continuous-time discrete-state graph diffusion model	Antoine Siraudin et.al.	2406.06449	null
2024-06-10	Margin-aware Preference Optimization for Aligning Diffusion Models without Reference	Jiwoo Hong et.al.	2406.06424	null
2024-06-10	Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization	Yi Gu et.al.	2406.06382	link
2024-06-10	Improving Deep Learning-based Automatic Cranial Defect Reconstruction by Heavy Data Augmentation: From Image Registration to Latent Diffusion Models	Marek Wodzinski et.al.	2406.06372	null
2024-06-10	MVGamba: Unify 3D Content Generation as State Space Sequence Modeling	Xuanyu Yi et.al.	2406.06367	link
2024-06-11	Tuning-Free Visual Customization via View Iterative Self-Attention Control	Xiaojie Li et.al.	2406.06258	link
2024-06-10	Data Augmentation in Earth Observation: A Diffusion Model Approach	Tiago Sousa et.al.	2406.06218	null
2024-06-10	The Effect of Training Dataset Size on Discriminative and Diffusion-Based Speech Enhancement Systems	Philippe Gonzalez et.al.	2406.06160	null
2024-06-10	Thunder : Unified Regression-Diffusion Speech Enhancement with a Single Reverse Step using Brownian Bridge	Thanapat Trachu et.al.	2406.06139	null
2024-06-10	DiffInject: Revisiting Debias via Synthetic Data Generation using Diffusion-based Style Injection	Donggeun Ko et.al.	2406.06134	null
2024-06-10	ExtraNeRF: Visibility-Aware View Extrapolation of Neural Radiance Fields with Diffusion Models	Meng-Li Shih et.al.	2406.06133	null
2024-06-10	Latent Representation Matters: Human-like Sketches in One-shot Drawing Tasks	Victor Boutin et.al.	2406.06079	null
2024-06-10	Generalizable Human Gaussians from Single-View Image	Jinnan Chen et.al.	2406.06050	link
2024-06-10	Synthesizing Efficient Data with Diffusion Models for Person Re-Identification Pre-Training	Ke Niu et.al.	2406.06045	link
2024-06-10	FRAG: Frequency Adapting Group for Diffusion Video Editing	Sunjae Yoon et.al.	2406.06044	link
2024-06-09	Improving Antibody Design with Force-Guided Sampling in Diffusion Models	Paulina Kulytė et.al.	2406.05832	null
2024-06-07	Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion	Fangfu Liu et.al.	2406.04338	null
2024-06-06	Coherent Zero-Shot Visual Instruction Generation	Quynh Phung et.al.	2406.04337	null
2024-06-06	BitsFusion: 1.99 bits Weight Quantization of Diffusion Model	Yang Sui et.al.	2406.04333	link
2024-06-06	Simplified and Generalized Masked Diffusion for Discrete Data	Jiaxin Shi et.al.	2406.04329	link
2024-06-06	SF-V: Single Forward Video Generation Model	Zhixing Zhang et.al.	2406.04324	link
2024-06-06	ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories	Qianlan Yang et.al.	2406.04323	null
2024-06-07	DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data	Qihao Liu et.al.	2406.04322	link
2024-06-06	Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step	Zhanhao Liang et.al.	2406.04314	link
2024-06-06	Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment	Jiayi Guo et.al.	2406.04295	link
2024-06-06	VideoTetris: Towards Compositional Text-to-Video Generation	Ye Tian et.al.	2406.04277	link
2024-06-06	A Survey on 3D Human Avatar Modeling – From Reconstruction to Generation	Ruihe Wang et.al.	2406.04253	null
2024-06-06	Diffusion-based image inpainting with internal learning	Nicolas Cherel et.al.	2406.04206	link
2024-06-06	Multistep Distillation of Diffusion Models via Moment Matching	Tim Salimans et.al.	2406.04103	null
2024-06-06	Enhancing Weather Predictions: Super-Resolution via Deep Diffusion Models	Jan Martinů et.al.	2406.04099	null
2024-06-06	LDM-RSIC: Exploring Distortion Prior with Latent Diffusion Models for Remote Sensing Image Compression	Junhui Li et.al.	2406.03961	link
2024-06-06	LLplace: The 3D Indoor Scene Layout Generation and Editing via Large Language Model	Yixuan Yang et.al.	2406.03866	null
2024-06-06	Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data	Jingyang Ou et.al.	2406.03736	link
2024-06-06	JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against Diffusion Model Edits	Minzhou Pan et.al.	2406.03720	link
2024-06-06	Pi-fusion: Physics-informed diffusion model for learning fluid dynamics	Jing Qiu et.al.	2406.03711	null
2024-06-06	Mean-variance portfolio selection in jump-diffusion model under no-shorting constraint: A viscosity solution approach	Xiaomin Shi et.al.	2406.03709	null
2024-06-05	Text-to-Events: Synthetic Event Camera Streams from Conditional Text Input	Joachim Ott et.al.	2406.03439	null
2024-06-05	Text-to-Image Rectified Flow as Plug-and-Play Priors	Xiaofeng Yang et.al.	2406.03293	link
2024-06-05	Generative Diffusion Models for Fast Simulations of Particle Collisions at CERN	Mikołaj Kita et.al.	2406.03233	null
2024-06-05	Searching Priors Makes Text-to-Video Synthesis Better	Haoran Cheng et.al.	2406.03215	null
2024-06-05	Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion	Hao Wen et.al.	2406.03184	link
2024-06-05	Tiny models from tiny data: Textual and null-text inversion for few-shot distillation	Erik Landolsi et.al.	2406.03146	link
2024-06-05	Floating Anchor Diffusion Model for Multi-motif Scaffolding	Ke Liu et.al.	2406.03141	link
2024-06-05	Phy-Diff: Physics-guided Hourglass Diffusion Model for Diffusion MRI Synthesis	Juanhua Zhang et.al.	2406.03002	null
2024-06-05	Exploring Data Efficiency in Zero-Shot Learning with Diffusion Models	Zihan Ye et.al.	2406.02929	null
2024-06-06	U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation	Chenxin Li et.al.	2406.02918	null
2024-06-05	TSPDiffuser: Diffusion Models as Learned Samplers for Traveling Salesperson Path Planning Problems	Ryo Yonetani et.al.	2406.02858	null
2024-06-04	ORACLE: Leveraging Mutual Information for Consistent Character Generation with LoRAs in Diffusion Models	Kiymet Akdemir et.al.	2406.02820	null
2024-06-04	Diffusion-Refined VQA Annotations for Semi-Supervised Gaze Following	Qiaomu Miao et.al.	2406.02774	null
2024-06-04	Neural Representations of Dynamic Visual Stimuli	Jacob Yeung et.al.	2406.02659	null
2024-06-04	Dreamguider: Improved Training free Diffusion-based Conditional Generation	Nithin Gopalakrishnan Nair et.al.	2406.02549	null
2024-06-06	Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting	Inkyu Shin et.al.	2406.02541	null
2024-06-04	CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation	Dejia Xu et.al.	2406.02509	null
2024-06-04	Guiding a Diffusion Model with a Bad Version of Itself	Tero Karras et.al.	2406.02507	link
2024-06-04	Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation	Jiajun Wang et.al.	2406.02485	link
2024-06-04	Inpainting Pathology in Lumbar Spine MRI with Latent Diffusion	Colin Hansen et.al.	2406.02477	null
2024-05-31	Mixed Diffusion for 3D Indoor Scene Synthesis	Siyi Hu et.al.	2405.21066	link
2024-05-31	Unified Directly Denoising for Both Variance Preserving and Variance Exploding Diffusion Models	Jingjing Wang et.al.	2405.21059	null
2024-05-31	Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models	Xinxi Zhang et.al.	2405.21050	null
2024-05-31	Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling	Jiatao Gu et.al.	2405.21048	null
2024-05-31	Amortizing intractable inference in diffusion models for vision, language, and control	Siddarth Venkatraman et.al.	2405.20971	link
2024-05-31	Flow matching achieves minimax optimal convergence	Kenji Fukumizu et.al.	2405.20879	null
2024-05-31	MegActor: Harness the Power of Raw Video for Vivid Portrait Animation	Shurong Yang et.al.	2405.20851	link
2024-05-31	Share Your Secrets for Privacy! Confidential Forecasting with Vertical Federated Learning	Aditya Shankar et.al.	2405.20761	link
2024-05-31	Information Theoretic Text-to-Image Alignment	Chao Wang et.al.	2405.20759	null
2024-05-31	Diffusion Models Are Innate One-Step Generators	Bowen Zheng et.al.	2405.20750	link
2024-05-31	Unleashing the Potential of Diffusion Models for Incomplete Data Imputation	Hengrui Zhang et.al.	2405.20690	link
2024-05-31	Adv-KD: Adversarial Knowledge Distillation for Faster Diffusion Sampling	Kidist Amde Mekonnen et.al.	2405.20675	link
2024-05-31	4Diffusion: Multi-view Video Diffusion Model for 4D Generation	Haiyu Zhang et.al.	2405.20674	null
2024-05-31	Fourier123: One Image to High-Quality 3D Object Generation with Hybrid Fourier Score Distillation	Shuzhou Yang et.al.	2405.20669	link
2024-05-31	GenMix: Combining Generative and Mixture Data Augmentation for Medical Image Classification	Hansang Lee et.al.	2405.20650	null
2024-06-03	Stochastic Optimal Control for Diffusion Bridges in Function Spaces	Byoungwoo Park et.al.	2405.20630	link
2024-05-31	Disrupting Diffusion: Token-Level Attention Erasure Attack against Diffusion-based Customization	Yisu Liu et.al.	2405.20584	link
2024-05-31	Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning	Linjiajie Fang et.al.	2405.20555	link
2024-05-30	Diffusion On Syntax Trees For Program Synthesis	Shreyas Kapur et.al.	2405.20519	null
2024-05-30	Slight Corruption in Pre-training Data Makes Better Diffusion Models	Hao Chen et.al.	2405.20494	null
2024-05-30	Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image	Kailu Wu et.al.	2405.20343	link
2024-05-30	VividDream: Generating 3D Scene with Ambient Dynamics	Yao-Chih Lee et.al.	2405.20334	null
2024-05-30	MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion	Shuyuan Tu et.al.	2405.20325	link
2024-05-30	Don’t drop your samples! Coherence-aware training benefits Conditional diffusion	Nicolas Dufour et.al.	2405.20324	null
2024-05-30	Improving the Training of Rectified Flows	Sangyun Lee et.al.	2405.20320	link
2024-05-30	DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music Generation	Zachary Novack et.al.	2405.20289	null
2024-05-30	MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model	Muyao Niu et.al.	2405.20222	link
2024-05-30	Boost Your Own Human Image Generation Model via Direct Preference Optimization with AI Feedback	Sanghyeon Na et.al.	2405.20216	null
2024-05-30	MotionDreamer: Zero-Shot 3D Mesh Animation from Video Diffusion Models	Lukas Uzolas et.al.	2405.20155	null
2024-05-31	DP-IQA: Utilizing Diffusion Prior for Blind Image Quality Assessment in the Wild	Honghao Fu et.al.	2405.19996	link
2024-05-30	DiffPhysBA: Diffusion-based Physical Backdoor Attack against Person Re-Identification in Real-World	Wenli Sun et.al.	2405.19990	null
2024-05-30	PLA4D: Pixel-Level Alignments for Text-to-4D Gaussian Splatting	Qiaowei Miao et.al.	2405.19957	link
2024-05-30	Exploring Diffusion Models’ Corruption Stage in Few-Shot Fine-tuning and Mitigating with Bayesian Neural Networks	Xiaoyu Wu et.al.	2405.19931	null
2024-05-30	Learning from Random Demonstrations: Offline Reinforcement Learning with Importance-Sampled Diffusion Models	Zeyu Fang et.al.	2405.19878	null
2024-05-31	HQ-DiT: Efficient Diffusion Transformer with FP4 Hybrid Quantization	Wenxuan Liu et.al.	2405.19751	null
2024-05-30	Streaming Video Diffusion: Online Video Editing with Diffusion Models	Feng Chen et.al.	2405.19726	link
2024-05-30	Text Guided Image Editing with Automatic Concept Locating and Forgetting	Jia Li et.al.	2405.19708	null
2024-05-30	Diffusion Policies creating a Trust Region for Offline Reinforcement Learning	Tianyu Chen et.al.	2405.19690	link
2024-05-30	Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models	Masatoshi Uehara et.al.	2405.19673	null
2024-05-29	Blind Image Restoration via Fast Diffusion Inversion	Hamadi Chihaoui et.al.	2405.19572	link
2024-05-29	ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning	Ruchika Chavhan et.al.	2405.19237	link
2024-05-30	$E^{3}$ Gen: Efficient, Expressive and Editable Avatars Generation	Weitian Zhang et.al.	2405.19203	null
2024-05-29	Diffusion-based Dynamics Models for Long-Horizon Rollout in Offline Reinforcement Learning	Hanye Zhao et.al.	2405.19189	link
2024-05-29	Tuning-Free Alignment of Diffusion Models with Direct Noise Optimization	Zhiwei Tang et.al.	2405.18881	link
2024-05-29	Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play Priors	Zihui Wu et.al.	2405.18782	link
2024-05-29	RNAFlow: RNA Structure & Sequence Design via Inverse Folding-Based Flow Matching	Divya Nori et.al.	2405.18768	link
2024-05-29	Stationary distribution approximations of Two-island Wright-Fisher and seed-bank models using Stein’s method	Han L. Gan et.al.	2405.18763	null
2024-05-29	Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning	Tianle Zhang et.al.	2405.18729	null
2024-05-29	Reverse the auditory processing pathway: Coarse-to-fine audio reconstruction from fMRI	Che Liu et.al.	2405.18726	null
2024-05-29	Learning Diffeomorphism for Image Registration with Time-Continuous Networks using Semigroup Regularization	Mohammadjavad Matinkia et.al.	2405.18684	link
2024-05-29	Zero-to-Hero: Enhancing Zero-Shot Novel View Synthesis via Attention Map Filtering	Ido Sobol et.al.	2405.18677	null
2024-05-28	DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention	Lianghui Zhu et.al.	2405.18428	link
2024-05-28	Phased Consistency Model	Fu-Yun Wang et.al.	2405.18407	link
2024-05-28	RACCooN: Remove, Add, and Change Video Content with Auto-Generated Narratives	Jaehong Yoon et.al.	2405.18406	link
2024-05-28	Multi-modal Generation via Cross-Modal In-Context Learning	Amandeep Kumar et.al.	2405.18304	link
2024-05-28	CT-based brain ventricle segmentation via diffusion Schrödinger Bridge without target domain ground truths	Reihaneh Teimouri et.al.	2405.18267	link
2024-05-28	EG4D: Explicit Generation of 4D Object without Score Distillation	Qi Sun et.al.	2405.18132	link
2024-05-28	Are Image Distributions Indistinguishable to Humans Indistinguishable to Classifiers?	Zebin You et.al.	2405.18029	null
2024-05-28	Unveiling the Power of Diffusion Features For Personalized Segmentation and Retrieval	Dvir Samuel et.al.	2405.18025	link
2024-05-28	MAVIN: Multi-Action Video Generation with Diffusion Models via Transition Video Infilling	Bowen Zhang et.al.	2405.18003	link
2024-05-27	Human4DiT: Free-view Human Video Generation with 4D Diffusion Transformer	Ruizhi Shao et.al.	2405.17405	null
2024-05-27	A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training	Kai Wang et.al.	2405.17403	link
2024-05-27	RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control	Litu Rout et.al.	2405.17401	null
2024-05-27	EASI-Tex: Edge-Aware Mesh Texturing from Single Image	Sai Raj Kishore Perla et.al.	2405.17393	null
2024-05-28	Controllable Longer Image Animation with Diffusion Models	Qiang Wang et.al.	2405.17306	null
2024-05-27	Does Diffusion Beat GAN in Image Super Resolution?	Denis Kuznedelev et.al.	2405.17261	link
2024-05-27	DreamMat: High-quality PBR Material Generation with Geometry- and Light-aware Diffusion Models	Yuqing Zhang et.al.	2405.17176	null
2024-05-27	Partitioned Hankel-based Diffusion Models for Few-shot Low-dose CT Reconstruction	Wenhao Zhang et.al.	2405.17167	null
2024-05-27	PatchScaler: An Efficient Patch-independent Diffusion Model for Super-Resolution	Yong Liu et.al.	2405.17158	link
2024-05-27	Ensembling Diffusion Models via Adaptive Feature Aggregation	Cong Wang et.al.	2405.17082	link
2024-05-27	The Poisson Midpoint Method for Langevin Dynamics: Provably Efficient Discretization for Diffusion Models	Saravanan Kandasamy et.al.	2405.17068	null
2024-05-27	Glauber Generative Model: Discrete Diffusion Models via Binary Classification	Harshit Varma et.al.	2405.17035	null
2024-05-27	$\text{Di}^2\text{Pose}$ : Discrete Diffusion Model for Occluded 3D Human Pose Estimation	Weiquan Wang et.al.	2405.17016	null
2024-05-28	MotionLLM: Multimodal Motion-Language Learning with Large Language Models	Qi Wu et.al.	2405.17013	link
2024-05-27	A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition	Zilu Guo et.al.	2405.16952	link
2024-05-27	Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models	Qian Wang et.al.	2405.16947	link
2024-05-27	PASTA: Pathology-Aware MRI to PET Cross-Modal Translation with Diffusion Models	Yitong Li et.al.	2405.16942	link
2024-05-28	GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning	Jaewoo Lee et.al.	2405.16907	link
2024-05-27	Anonymization Prompt Learning for Facial Privacy-Preserving Text-to-Image Generation	Liang Shi et.al.	2405.16895	null
2024-05-27	Part123: Part-aware 3D Reconstruction from a Single-view Image	Anran Liu et.al.	2405.16888	null
2024-05-23	Improved Distribution Matching Distillation for Fast Image Synthesis	Tianwei Yin et.al.	2405.14867	link
2024-05-23	Video Diffusion Models are Training-free Motion Interpreter and Controller	Zeqi Xiao et.al.	2405.14864	null
2024-05-23	Adapting to Unknown Low-Dimensional Structures in Score-Based Diffusion Models	Gen Li et.al.	2405.14861	null
2024-05-23	Semantica: An Adaptable Image-Conditioned Diffusion Model	Manoj Kumar et.al.	2405.14857	null
2024-05-23	TerDiT: Ternary Diffusion Models with Transformers	Xudong Lu et.al.	2405.14854	link
2024-05-23	Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer	Shuang Wu et.al.	2405.14832	null
2024-05-23	Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models	Katherine Xu et.al.	2405.14828	null
2024-05-23	PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher	Dongjun Kim et.al.	2405.14822	link
2024-05-24	Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation	Hongxu Jiang et.al.	2405.14802	link
2024-05-23	Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy	Shengfang Zhai et.al.	2405.14800	link
2024-05-23	EditWorld: Simulating World Dynamics for Instruction-Following Image Editing	Ling Yang et.al.	2405.14785	link
2024-05-23	Physics-informed Score-based Diffusion Model for Limited-angle Reconstruction of Cardiac Computed Tomography	Shuo Han et.al.	2405.14770	link
2024-05-23	RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance	Zhicheng Sun et.al.	2405.14677	link
2024-05-23	Reinforcement Learning for Fine-tuning Text-to-speech Diffusion Models	Jingyi Chen et.al.	2405.14632	null
2024-05-23	Neuroexplicit Diffusion Models for Inpainting of Optical Flow Fields	Tom Fischer et.al.	2405.14599	null
2024-05-23	Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation	Shiqi Yang et.al.	2405.14598	null
2024-05-23	LDM: Large Tensorial SDF Model for Textured Mesh Generation	Rengan Xie et.al.	2405.14580	link
2024-05-23	Regressor-free Molecule Generation to Support Drug Response Prediction	Kun Li et.al.	2405.14536	null
2024-05-23	LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models	Seyedmorteza Sadat et.al.	2405.14477	null
2024-05-23	TIGER: Text-Instructed 3D Gaussian Retrieval and Coherent Editing	Teng Xu et.al.	2405.14455	null
2024-05-21	Personalized Residuals for Concept-Driven Text-to-Image Generation	Cusuh Ham et.al.	2405.12978	null
2024-05-21	Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control	Yue Han et.al.	2405.12970	null
2024-05-21	Impact of inhomogeneous diffusion on secondary cosmic ray and antiproton local spectra	Álvaro Tovar-Pardo et.al.	2405.12918	null
2024-05-21	Diffusion-RSCC: Diffusion Probabilistic Model for Change Captioning in Remote Sensing Images	Xiaofei Yu et.al.	2405.12875	link
2024-05-21	Model Free Prediction with Uncertainty Assessment	Yuling Jiao et.al.	2405.12684	null
2024-05-21	CustomText: Customized Textual Image Generation using Diffusion Models	Shubham Paliwal et.al.	2405.12531	null
2024-05-21	Customize Your Own Paired Data via Few-shot Way	Jinshu Chen et.al.	2405.12490	null
2024-05-21	One-step data-driven generative model via Schrödinger Bridge	Hanwen Huang et.al.	2405.12453	null
2024-05-20	Diffusion for World Modeling: Visual Details Matter in Atari	Eloi Alonso et.al.	2405.12399	link
2024-05-20	Images that Sound: Composing Images and Sounds on a Single Canvas	Ziyang Chen et.al.	2405.12221	null
2024-05-20	Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices	Nathaniel Cohen et.al.	2405.12211	link
2024-05-20	Nonequilibrium physics of generative diffusion models	Zhendong Yu et.al.	2405.11932	null
2024-05-20	“Set It Up!”: Functional Object Arrangement with Compositional Generative Models	Yiqing Xu et.al.	2405.11928	null
2024-05-20	Diff-BGM: A Diffusion Model for Video Background Music Generation	Sizhe Li et.al.	2405.11913	link
2024-05-20	Out-of-Distribution Detection with a Single Unconditional Diffusion Model	Alvin Heng et.al.	2405.11881	link
2024-05-20	Evolving Storytelling: Benchmarks and Methods for New Character Customization with Diffusion Models	Xiyu Wang et.al.	2405.11852	null
2024-05-20	Alternators For Sequence Modeling	Mohammad Reza Rezaei et.al.	2405.11848	null
2024-05-20	ViViD: Video Virtual Try-on using Diffusion Models	Zixun Fang et.al.	2405.11794	null
2024-05-20	Guided Multi-objective Generative AI to Enhance Structure-based Drug Design	Amit Kadan et.al.	2405.11785	link
2024-05-20	Diffusion Models for Generating Ballistic Spacecraft Trajectories	Tyler Presser et.al.	2405.11738	link
2024-05-19	InterAct: Capture and Modelling of Realistic, Expressive and Interactive Activities between Two Persons in Daily Scenarios	Yinghao Huang et.al.	2405.11690	null
2024-05-19	Uncertainty-Aware PPG-2-ECG for Enhanced Cardiovascular Diagnosis using Diffusion Models	Omer Belhasin et.al.	2405.11566	null
2024-05-19	Diffusion-Based Hierarchical Image Steganography	Youmin Xu et.al.	2405.11523	null
2024-05-19	FIFO-Diffusion: Generating Infinite Videos from Text without Training	Jihwan Kim et.al.	2405.11473	link
2024-05-19	Discrete-state Continuous-time Diffusion for Graph Generation	Zhe Xu et.al.	2405.11416	link
2024-05-18	On the Trajectory Regularity of ODE-based Diffusion Sampling	Defang Chen et.al.	2405.11326	link
2024-05-18	Diffusion Model Driven Test-Time Image Adaptation for Robust Skin Lesion Classification	Ming Hu et.al.	2405.11289	null
2024-05-18	HR Human: Modeling Human Avatars with Triangular Mesh and High-Resolution Textures from Videos	Qifeng Chen et.al.	2405.11270	null
2024-05-18	AquaLoRA: Toward White-box Protection for Customized Stable Diffusion Models via Watermark LoRA	Weitao Feng et.al.	2405.11135	link
2024-05-16	Text-to-Vector Generation with Neural Path Representation	Peiying Zhang et.al.	2405.10317	null
2024-05-16	Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model	Zheng Gu et.al.	2405.10316	null
2024-05-16	CAT3D: Create Anything in 3D with Multi-View Diffusion Models	Ruiqi Gao et.al.	2405.10314	null
2024-05-16	Generating Coherent Sequences of Visual Illustrations for Real-World Manual Tasks	João Bordalo et.al.	2405.10122	null
2024-05-16	Spurious reconstruction from brain activity	Ken Shirakawa et.al.	2405.10078	link
2024-05-16	Frequency-Domain Refinement with Multiscale Diffusion for Super Resolution	Xingjian Wang et.al.	2405.10014	null
2024-05-16	VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce Marketing	Binghui Chen et.al.	2405.09985	null
2024-05-16	Language-Oriented Semantic Latent Representation for Image Transmission	Giordano Cicchetti et.al.	2405.09976	link
2024-05-16	Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models	Ziyu Wang et.al.	2405.09901	link
2024-05-16	DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy Protection	Yuhao Sun et.al.	2405.09882	link
2024-05-16	Dual3D: Efficient and Consistent Text-to-3D Generation with Dual-mode Multi-view Latent Diffusion	Xinyang Li et.al.	2405.09874	null
2024-05-16	Rethinking Multi-User Semantic Communications with Deep Generative Models	Eleonora Grassucci et.al.	2405.09866	null
2024-05-16	MediSyn: Text-Guided Diffusion Models for Broad Medical 2D and 3D Image Synthesis	Joseph Cho et.al.	2405.09806	null
2024-05-15	A Survey of Generative Techniques for Spatial-Temporal Data Mining	Qianru Zhang et.al.	2405.09592	null
2024-05-16	MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal Cancer	Chengyu Wu et.al.	2405.09539	link
2024-05-15	Diffusion-based Contrastive Learning for Sequential Recommendation	Ziqiang Cui et.al.	2405.09369	link
2024-05-15	Dance Any Beat: Blending Beats with Visuals in Dance Video Generation	Xuanchen Wang et.al.	2405.09266	null
2024-05-15	SOEDiff: Efficient Distillation for Small Object Editing	Qihe Pan et.al.	2405.09114	null
2024-05-15	RSHazeDiff: A Unified Fourier-aware Diffusion Model for Remote Sensing Image Dehazing	Jiamei Xiong et.al.	2405.09083	link
2024-05-17	Naturalistic Music Decoding from EEG Data via Latent Diffusion Models	Emilian Postolache et.al.	2405.09062	null
2024-05-15	Response Matching for generating materials and molecules	Bingqing Cheng et.al.	2405.09057	null
2024-05-15	CTS: A Consistency-Based Medical Image Segmentation Model	Kejia Zhang et.al.	2405.09056	link
2024-05-14	Expensive Multi-Objective Bayesian Optimization Based on Diffusion Models	Bingdong Li et.al.	2405.08674	null
2024-05-14	Towards Multi-Task Generative-AI Edge Services with an Attention-based Diffusion DRL Approach	Yaju Liu et.al.	2405.08328	null
2024-05-14	Compositional Text-to-Image Generation with Dense Blob Representations	Weili Nie et.al.	2405.08246	null
2024-05-13	Infinite Texture: Text-guided High Resolution Diffusion Texture Synthesis	Yifan Wang et.al.	2405.08210	null
2024-05-13	Do Bayesian imaging methods report trustworthy probabilities?	David Y. W. Thong et.al.	2405.08179	null
2024-05-13	DiffTF++: 3D-aware Diffusion Transformer for Large-Vocabulary 3D Generation	Ziang Cao et.al.	2405.08055	link
2024-05-13	Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning	Wenqi Dong et.al.	2405.08054	null
2024-05-13	Stable Diffusion-based Data Augmentation for Federated Learning with Non-IID Data	Mahdi Morafah et.al.	2405.07925	null
2024-05-13	CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models	Nick Stracke et.al.	2405.07913	null
2024-05-13	SAR Image Synthesis with Diffusion Models	Denisa Qosja et.al.	2405.07776	null
2024-05-13	CDFormer: When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution	Qingguo Liu et.al.	2405.07648	link
2024-05-13	De novo antibody design with SE(3) diffusion	Daniel Cutting et.al.	2405.07622	null
2024-05-13	Reducing Risk for Assistive Reinforcement Learning Policies with Diffusion Models	Andrii Tytarenko et.al.	2405.07603	null
2024-05-13	PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator	Hanshu Yan et.al.	2405.07510	link
2024-05-13	GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting	Haodong Chen et.al.	2405.07472	null
2024-05-12	Erasing Concepts from Text-to-Image Diffusion Models with Few-shot Unlearning	Masane Fuchi et.al.	2405.07288	link
2024-05-12	Modeling Pedestrian Intrinsic Uncertainty for Multimodal Stochastic Trajectory Prediction via Energy Plan Denoising	Yao Liu et.al.	2405.07164	null
2024-05-12	Stable Signature is Unstable: Removing Image Watermark from Diffusion Models	Yuepeng Hu et.al.	2405.07145	null
2024-05-11	Diffusion models as probabilistic neural operators for recovering unobserved states of dynamical systems	Katsiaryna Haitsiukevich et.al.	2405.07097	null
2024-05-11	Semantic Guided Large Scale Factor Remote Sensing Image Super-resolution with Generative Diffusion Prior	Ce Wang et.al.	2405.07044	link
2024-05-11	Non-confusing Generation of Customized Concepts in Diffusion Models	Wang Lin et.al.	2405.06914	null
2024-05-10	Self-Consistent Recursive Diffusion Bridge for Medical Image Translation	Fuat Arslan et.al.	2405.06789	link
2024-05-10	Shape Conditioned Human Motion Generation with Diffusion Model	Kebing Xue et.al.	2405.06778	null
2024-05-10	OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation	Jinwei Lin et.al.	2405.06547	link
2024-05-14	SketchDream: Sketch-based Text-to-3D Generation and Editing	Feng-Lin Liu et.al.	2405.06461	null
2024-05-10	PUMA: margin-based data pruning	Javier Maroto et.al.	2405.06298	null
2024-05-10	Prior-guided Diffusion Model for Cell Segmentation in Quantitative Phase Imaging	Zhuchen Shao et.al.	2405.06175	null
2024-05-09	Distilling Diffusion Models into Conditional GANs	Minguk Kang et.al.	2405.05967	null
2024-05-09	Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting Mask	Zineb Senane et.al.	2405.05959	link
2024-05-09	Frame Interpolation with Consecutive Brownian Bridge Diffusion	Zonglin Lyu et.al.	2405.05953	link
2024-05-09	Composable Part-Based Manipulation	Weiyu Liu et.al.	2405.05876	null
2024-05-09	Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control	Gunshi Gupta et.al.	2405.05852	link
2024-05-09	Could It Be Generated? Towards Practical Analysis of Memorization in Text-To-Image Diffusion Models	Zhe Ma et.al.	2405.05846	link
2024-05-09	MSDiff: Multi-Scale Diffusion Model for Ultra-Sparse View CT Reconstruction	Pinhuang Tan et.al.	2405.05814	null
2024-05-10	MasterWeaver: Taming Editability and Identity for Personalized Text-to-Image Generation	Yuxiang Wei et.al.	2405.05806	link
2024-05-09	DragGaussian: Enabling Drag-style Manipulation on 3D Gaussian Representation	Sitian Shen et.al.	2405.05800	null
2024-05-09	Sequential Amodal Segmentation via Cumulative Occlusion Learning	Jiayang Ao et.al.	2405.05791	null
2024-05-09	DP-MDM: Detail-Preserving MR Reconstruction via Multiple Diffusion Models	Mengxiao Geng et.al.	2405.05763	link
2024-05-09	LatentColorization: Latent Diffusion-Based Speaker Video Colorization	Rory Ward et.al.	2405.05707	null
2024-05-09	StableMoFusion: Towards Robust and Efficient Diffusion-based Motion Generation Framework	Yiheng Huang et.al.	2405.05691	null
2024-05-09	SubGDiff: A Subgraph Diffusion Model to Improve Molecular Representation Learning	Jiying Zhang et.al.	2405.05665	link
2024-05-09	AI in Your Toolbox: A Plugin for Generating Renderings from 3D Models	Mingming Wang et.al.	2405.05627	null
2024-05-09	Denoising Diffusion Delensing Delight: Reconstructing the Non-Gaussian CMB Lensing Potential with Diffusion Models	Thomas Flöss et.al.	2405.05598	link
2024-05-09	Vision-Language Modeling with Regularized Spatial Transformer Networks for All Weather Crosswind Landing of Aircraft	Debabrata Pal et.al.	2405.05574	null
2024-05-09	A Survey on Personalized Content Synthesis with Diffusion Models	Xulu Zhang et.al.	2405.05538	null
2024-05-08	Diffusion-HMC: Parameter Inference with Diffusion Model driven Hamiltonian Monte Carlo	Nayantara Mudur et.al.	2405.05255	link
2024-05-08	Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models	Hongjie Wang et.al.	2405.05252	null
2024-05-08	Imagine Flash: Accelerating Emu Diffusion Models with Backward Distillation	Jonas Kohler et.al.	2405.05224	null
2024-05-08	FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models	Jinglin Xu et.al.	2405.05216	link
2024-05-08	An anti-noise seismic inversion method based on diffusion model	Yingtian Liu et.al.	2405.05026	link
2024-05-08	Discrepancy-based Diffusion Models for Lesion Detection in Brain MRI	Keqiang Fan et.al.	2405.04974	null
2024-05-08	Empowering Wireless Networks with Artificial Intelligence Generated Graph	Jiacheng Wang et.al.	2405.04907	null
2024-05-08	Fast LiDAR Upsampling using Conditional Diffusion Models	Sander Elias Magnussen Helgesen et.al.	2405.04889	link
2024-05-08	FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation	Xuehai He et.al.	2405.04834	null
2024-05-08	Variational Schrödinger Diffusion Models	Wei Deng et.al.	2405.04795	null
2024-05-07	Remote Diffusion	Kunal Sunil Kasodekar et.al.	2405.04717	null
2024-05-07	TexControl: Sketch-Based Two-Stage Fashion Image Generation Using Diffusion Model	Yongming Zhang et.al.	2405.04675	null
2024-05-07	Tactile-Augmented Radiance Fields	Yiming Dou et.al.	2405.04534	link
2024-05-07	Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video Motion Editing	Yi Zuo et.al.	2405.04496	null
2024-05-07	CloudDiff: Super-resolution ensemble retrieval of cloud properties for all day using the generative diffusion model	Haixia Xiao et.al.	2405.04483	null
2024-05-07	Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos	Junyi Ma et.al.	2405.04370	link
2024-05-07	Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation	Jihyun Kim et.al.	2405.04356	link
2024-05-08	Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer	Zhuoyi Yang et.al.	2405.04312	link
2024-05-07	BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models	Eloi Moliner et.al.	2405.04272	null
2024-05-07	Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models	Fan Bao et.al.	2405.04233	null
2024-05-06	Bridging discrete and continuous state spaces: Exploring the Ehrenfest process in time-continuous diffusion models	Ludwig Winkler et.al.	2405.03549	null
2024-05-06	CCDM: Continuous Conditional Diffusion Models for Image Generation	Xin Ding et.al.	2405.03546	link
2024-05-06	LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model	Haowen Sun et.al.	2405.03485	link
2024-05-06	Exploring the Frontiers of Softmax: Provable Optimization, Applications in Diffusion Model, and Beyond	Jiuxiang Gu et.al.	2405.03251	null
2024-05-06	Hyperbolic Geometric Latent Diffusion Model for Graph Generation	Xingcheng Fu et.al.	2405.03188	link
2024-05-06	DeepMpMRI: Tensor-decomposition Regularized Learning for Fast and High-Fidelity Multi-Parametric Microstructural MR Imaging	Wenxin Fan et.al.	2405.03159	null
2024-05-06	Video Diffusion Models: A Survey	Andrew Melnik et.al.	2405.03150	link
2024-05-06	AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding	Tao Liu et.al.	2405.03121	link
2024-05-05	Matten: Video Generation with Mamba-Attention	Yu Gao et.al.	2405.03025	null
2024-05-05	Exploring Text-based Realistic Building Facades Editing Applicaiton	Jing Wang et.al.	2405.02967	null
2024-05-05	Efficient Text-driven Motion Generation via Latent Consistency Training	Mengxian Hu et.al.	2405.02791	link
2024-05-04	DiffuseTrace: A Transparent and Flexible Watermarking Scheme for Latent Diffusion Model	Liangqi Lei et.al.	2405.02696	null
2024-05-03	Functional Imaging Constrained Diffusion for Brain PET Synthesis from Structural MRI	Minhui Yu et.al.	2405.02504	link
2024-05-03	Continuous Learned Primal Dual	Christina Runkel et.al.	2405.02478	null
2024-05-03	CogDPM: Diffusion Probabilistic Models via Cognitive Predictive Coding	Kaiyuan Chen et.al.	2405.02384	null
2024-05-03	DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos	Wen-Hsuan Chu et.al.	2405.02280	link
2024-05-03	Multi-grid reaction-diffusion master equation: applications to morphogen gradient modelling	Radek Erban et.al.	2405.02117	null
2024-05-03	DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model	Peijin Jia et.al.	2405.02008	null
2024-05-03	Defect Image Sample Generation With Diffusion Prior for Steel Surface Defect Recognition	Yichun Tai et.al.	2405.01872	null
2024-05-03	Creation of Novel Soft Robot Designs using Generative AI	Wee Kiat Chan et.al.	2405.01824	null
2024-05-02	LocInv: Localization-aware Inversion for Text-Guided Image Editing	Chuanming Tang et.al.	2405.01496	link
2024-05-02	Navigating Heterogeneity and Privacy in One-Shot Federated Learning with Diffusion Models	Matias Mendieta et.al.	2405.01494	null
2024-05-02	Statistical algorithms for low-frequency diffusion data: A PDE approach	Matteo Giordano et.al.	2405.01372	link
2024-05-02	DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines	Ye Tian et.al.	2405.01248	null
2024-05-02	Automated Virtual Product Placement and Assessment in Images using Diffusion Models	Mohammad Mahmudul Alam et.al.	2405.01130	null
2024-05-02	Part-aware Shape Generation with Latent 3D Diffusion of Neural Voxel Fields	Yuhang Huang et.al.	2405.00998	null
2024-05-02	Generative manufacturing systems using diffusion models and ChatGPT	Xingyu Li et.al.	2405.00958	null
2024-05-02	EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion	Guangyao Zhai et.al.	2405.00915	null
2024-05-01	SonicDiffusion: Audio-Driven Image Generation and Editing with Pretrained Diffusion Models	Burak Can Biner et.al.	2405.00878	null
2024-05-01	Guided Conditional Diffusion Classifier (ConDiff) for Enhanced Prediction of Infection in Diabetic Foot Ulcers	Palawat Busaranuvong et.al.	2405.00858	null
2024-05-01	ADM: Accelerated Diffusion Model via Estimated Priors for Robust Motion Prediction under Uncertainties	Jiahui Li et.al.	2405.00797	link
2024-05-01	Obtaining Favorable Layouts for Multiple Object Generation	Barak Battash et.al.	2405.00791	null
2024-05-01	Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models	Xiaoshi Wu et.al.	2405.00760	null
2024-05-01	TexSliders: Diffusion-Based Texture Editing in CLIP Space	Julia Guerrero-Viu et.al.	2405.00672	null
2024-05-01	RGB $\leftrightarrow$ X: Image decomposition and synthesis using material- and lighting-aware diffusion models	Zheng Zeng et.al.	2405.00666	null
2024-05-01	Deep Metric Learning-Based Out-of-Distribution Detection with Synthetic Outlier Exposure	Assefa Seyoum Wahd et.al.	2405.00631	null
2024-05-01	Lane Segmentation Refinement with Diffusion Models	Antonio Ruiz et.al.	2405.00620	null
2024-05-01	Pricing and delta computation in jump-diffusion models with stochastic intensity by Malliavin calculus	Ayub Ahmadi et.al.	2405.00473	null
2024-05-01	Lazy Layers to Make Fine-Tuned Diffusion Models More Traceable	Haozhe Liu et.al.	2405.00466	null
2024-05-01	Detail-Enhancing Framework for Reference-Based Image Super-Resolution	Zihan Wang et.al.	2405.00431	null
2024-05-01	Streamlining Image Editing with Layered Diffusion Brushes	Peyman Gholami et.al.	2405.00313	null
2024-05-02	An Unstructured Mesh Reaction-Drift-Diffusion Master Equation with Reversible Reactions	Samuel A. Isaacson et.al.	2405.00283	null
2024-05-01	ASAM: Boosting Segment Anything Model with Adversarial Tuning	Bo Li et.al.	2405.00256	link
2024-04-30	Semantically Consistent Video Inpainting with Conditional Diffusion Models	Dylan Green et.al.	2405.00251	null
2024-04-30	IgCONDA-PET: Implicitly-Guided Counterfactual Diffusion for Detecting Anomalies in PET Images	Shadab Ahamed et.al.	2405.00239	link
2024-04-30	SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound	Haohe Liu et.al.	2405.00233	null
2024-04-30	Target-Specific De Novo Peptide Binder Design with DiffPepBuilder	Fanhao Wang et.al.	2405.00128	null
2024-04-30	MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model	Wenxun Dai et.al.	2404.19759	link
2024-04-30	Invisible Stitch: Generating Smooth 3D Scenes with Depth Inpainting	Paul Engstler et.al.	2404.19758	null
2024-04-30	Mixed Continuous and Categorical Flow Matching for 3D De Novo Molecule Generation	Ian Dunn et.al.	2404.19739	link
2024-04-30	X-Diffusion: Generating Detailed 3D MRI Volumes From a Single Image Using Cross-Sectional Diffusion Models	Emmanuelle Bourigault et.al.	2404.19604	null
2024-04-30	MicroDreamer: Zero-shot 3D Generation in $\sim$ 20 Seconds by Score-based Iterative Reconstruction	Luxi Chen et.al.	2404.19525	link
2024-04-30	TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion Models	Teng Zhou et.al.	2404.19475	link
2024-04-29	Stylus: Automatic Adapter Selection for Diffusion Models	Michael Luo et.al.	2404.18928	null
2024-04-29	TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation	Junhao Cheng et.al.	2404.18919	link
2024-04-29	Learning general Gaussian mixtures with efficient score matching	Sitan Chen et.al.	2404.18893	null
2024-04-29	A Survey on Diffusion Models for Time Series and Spatio-Temporal Data	Yiyuan Yang et.al.	2404.18886	link
2024-04-29	Learning Mixtures of Gaussians Using Diffusion Models	Khashayar Gatmiry et.al.	2404.18869	null
2024-04-29	Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior	Zhiyuan Li et.al.	2404.18820	link
2024-04-29	Bootstrap 3D Reconstructed Scenes from 3D Gaussian Splatting	Yifei Gao et.al.	2404.18669	null
2024-04-29	FlexiFilm: Long Video Generation with Flexible Conditions	Yichen Ouyang et.al.	2404.18620	link
2024-04-29	Anywhere: A Multi-Agent Framework for Reliable and Diverse Foreground-Conditioned Image Inpainting	Tianyidan Xie et.al.	2404.18598	null
2024-04-29	U-Nets as Belief Propagation: Efficient Classification, Denoising, and Diffusion in Generative Hierarchical Models	Song Mei et.al.	2404.18444	null
2024-04-28	Fisher Information Improved Training-Free Conditional Diffusion Model	Kaiyu Song et.al.	2404.18252	null
2024-04-28	Paint by Inpaint: Learning to Add Image Objects by Removing Them First	Navve Wasserman et.al.	2404.18212	link
2024-04-28	Generative AI for Visualization: State of the Art and Future Directions	Yilin Ye et.al.	2404.18144	null
2024-04-28	Generative AI for Low-Carbon Artificial Intelligence of Things	Jinbo Wen et.al.	2404.18077	null
2024-04-28	Grounded Compositional and Diverse Text-to-3D with Pretrained Multi-View Diffusion Model	Xiaolong Li et.al.	2404.18065	null
2024-04-28	Exposing Text-Image Inconsistency Using Diffusion Models	Mingzhen Huang et.al.	2404.18033	link
2024-04-30	Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching	Robert Denkert et.al.	2404.17939	null
2024-04-27	Unsupervised Anomaly Detection via Masked Diffusion Posterior Sampling	Di Wu et.al.	2404.17900	null
2024-04-27	DPER: Diffusion Prior Driven Neural Representation for Limited Angle and Sparse View CT Reconstruction	Chenhe Du et.al.	2404.17890	null
2024-04-27	Diffusion-Aided Joint Source Channel Coding For High Realism Wireless Image Transmission	Mingyu Yang et.al.	2404.17736	link
2024-04-25	Inferring solid-state diffusivity in lithium-ion battery active materials: improving upon the classical GITT method	A. Emir Gumrukcuoglu et.al.	2404.16658	null
2024-04-25	MuseumMaker: Continual Style Customization without Catastrophic Forgetting	Chenxi Liu et.al.	2404.16612	null
2024-04-25	Conditional Distribution Modelling for Few-Shot Image Synthesis with Diffusion Models	Parul Gupta et.al.	2404.16556	null
2024-04-25	DiffSeg: A Segmentation Model for Skin Lesions Based on Diffusion Difference	Zhihao Shuai et.al.	2404.16474	null
2024-04-25	TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models	Haomiao Ni et.al.	2404.16306	link
2024-04-25	CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions	Haoyuan Li et.al.	2404.16302	link
2024-04-25	One Noise to Rule Them All: Learning a Unified Model of Spatially-Varying Noise Patterns	Arman Maesumi et.al.	2404.16292	null
2024-04-24	Editable Image Elements for Controllable Synthesis	Jiteng Mu et.al.	2404.16029	null
2024-04-24	RetinaRegNet: A Versatile Approach for Retinal Image Registration	Vishal Balaji Sivaraman et.al.	2404.16017	link
2024-04-24	MYCloth: Towards Intelligent and Interactive Online T-Shirt Customization based on User’s Preference	Yexin Liu et.al.	2404.15801	null
2024-04-24	MotionMaster: Training-free Camera Motion Transfer For Video Generation	Teng Hu et.al.	2404.15789	null
2024-04-24	Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations	Kaiwen Xue et.al.	2404.15766	link
2024-04-24	DeepFeatureX Net: Deep Features eXtractors based Network for discriminating synthetic from real images	Orazio Pontorno et.al.	2404.15697	link
2024-04-24	Generative Diffusion Model (GDM) for Optimization of Wi-Fi Networks	Tie Liu et.al.	2404.15684	null
2024-04-24	AnoFPDM: Anomaly Segmentation with Forward Process of Diffusion Models for Brain MRI	Yiming Che et.al.	2404.15683	link
2024-04-24	CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models	Qinghe Wang et.al.	2404.15677	link
2024-04-24	Optimizing OOD Detection in Molecular Graphs: A Novel Approach with Diffusion Models	Xu Shen et.al.	2404.15625	null
2024-04-26	A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-Resolution	Zhixiong Yang et.al.	2404.15620	link
2024-04-23	ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning	Weifeng Chen et.al.	2404.15449	null
2024-04-23	GLoD: Composing Global Contexts and Local Details in Image Generation	Moyuru Yamada et.al.	2404.15447	null
2024-04-23	ControlTraj: Controllable Trajectory Generation with Topology-Constrained Diffusion Model	Yuanshao Zhu et.al.	2404.15380	null
2024-04-23	Heat flow, log-concavity, and Lipschitz transport maps	Giovanni Brigati et.al.	2404.15205	null
2024-04-23	CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method	Mingbao Lin et.al.	2404.15141	link
2024-04-23	Taming Diffusion Probabilistic Models for Character Control	Rui Chen et.al.	2404.15121	null
2024-04-23	Perturbing Attention Gives You More Bang for the Buck: Subtle Imaging Perturbations That Efficiently Fool Customized Diffusion Models	Jingyao Xu et.al.	2404.15081	link
2024-04-23	Music Style Transfer With Diffusion Model	Hong Huang et.al.	2404.14771	null
2024-04-23	Gradient Guidance for Diffusion Models: An Optimization Perspective	Yingqing Guo et.al.	2404.14743	link
2024-04-25	FlashSpeech: Efficient Zero-Shot Speech Synthesis	Zhen Ye et.al.	2404.14700	null
2024-04-23	DreamPBR: Text-driven Generation of High-resolution SVBRDF with Multi-modal Guidance	Linxuan Xin et.al.	2404.14676	null
2024-04-22	UVMap-ID: A Controllable and Personalized UV Map Generative Model	Weijie Wang et.al.	2404.14568	link
2024-04-22	Align Your Steps: Optimizing Sampling Schedules in Diffusion Models	Amirmojtaba Sabour et.al.	2404.14507	null
2024-04-22	Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses	Inhee Lee et.al.	2404.14410	null
2024-04-22	GeoDiffuser: Geometry-Based Image Editing with Diffusion Models	Rahul Sajnani et.al.	2404.14403	null
2024-04-22	TAVGBench: Benchmarking Text to Audible-Video Generation	Yuxin Mao et.al.	2404.14381	link
2024-04-22	Full Event Particle-Level Unfolding with Variable-Length Latent Variational Diffusion	Alexander Shmakov et.al.	2404.14332	null
2024-04-22	X-Ray: A Sequential 3D Representation for Generation	Tao Hu et.al.	2404.14329	link
2024-04-22	Collaborative Filtering Based on Diffusion Models: Unveiling the Potential of High-Order Connectivity	Yu Hou et.al.	2404.14240	link
2024-04-22	MultiBooth: Towards Generating All Your Concepts in an Image from Text	Chenyang Zhu et.al.	2404.14239	link
2024-04-22	Face2Face: Label-driven Facial Retouching Restoration	Guanhua Zhao et.al.	2404.14177	null
2024-04-22	FLDM-VTON: Faithful Latent Diffusion Model for Virtual Try-on	Chenhui Wang et.al.	2404.14162	null
2024-04-22	Generative Artificial Intelligence Assisted Wireless Sensing: Human Flow Detection in Practical Communication Environments	Jiacheng Wang et.al.	2404.14140	null
2024-04-23	RingID: Rethinking Tree-Ring Watermarking for Enhanced Multi-Key Identification	Hai Ci et.al.	2404.14055	link
2024-04-22	RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance	Chengrui Wang et.al.	2404.13984	null
2024-04-22	MaterialSeg3D: Segmenting Dense Materials from 2D Priors for 3D Assets	Zeyu Li et.al.	2404.13923	null
2024-04-23	Accelerating Image Generation with Sub-path Linear Approximation Model	Chen Xu et.al.	2404.13903	null
2024-04-22	Towards Better Text-to-Image Generation Alignment via Attention Modulation	Yihang Wu et.al.	2404.13899	null
2024-04-23	Decoherence of a charged Brownian particle in a magnetic field : an analysis of the roles of coupling via position and momentum variables	Suraka Bhattacharjee et.al.	2404.13883	null
2024-04-21	Universal Fingerprint Generation: Controllable Diffusion Model with Multimodal Conditions	Steven A. Grosz et.al.	2404.13791	null
2024-04-21	Object-Attribute Binding in Text-to-Image Generation: Evaluation and Control	Maria Mihaela Trusca et.al.	2404.13766	null
2024-04-21	A Splice Method for Local-to-Nonlocal Coupling of Weak Forms	Shuai Jiang et.al.	2404.13744	null
2024-04-21	Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models	Vitali Petsiuk et.al.	2404.13706	null
2024-04-18	G-HOP: Generative Hand-Object Prior for Interaction Reconstruction and Grasp Synthesis	Yufei Ye et.al.	2404.12383	null
2024-04-18	Learning the Domain Specific Inverse NUFFT for Accelerated Spiral MRI using Diffusion Models	Trevor J. Chan et.al.	2404.12361	null
2024-04-18	AniClipart: Clipart Animation with Text-to-Video Priors	Ronghuan Wu et.al.	2404.12347	null
2024-04-18	Guided Discrete Diffusion for Electronic Health Record Generation	Zixiang Chen et.al.	2404.12314	null
2024-04-18	StyleBooth: Image Style Editing with Multimodal Instruction	Zhen Han et.al.	2404.12154	link
2024-04-18	LD-Pruner: Efficient Pruning of Latent Diffusion Models using Task-Agnostic Insights	Thibault Castells et.al.	2404.11936	null
2024-04-18	FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models	Wei Wu et.al.	2404.11895	link
2024-04-17	Prompt-Driven Feature Diffusion for Open-World Semi-Supervised Learning	Marzi Heidari et.al.	2404.11795	null
2024-04-17	Diffusion Schrödinger Bridge Models for High-Quality MR-to-CT Synthesis for Head and Neck Proton Treatment Planning	Muheng Li et.al.	2404.11741	null
2024-04-17	Factorized Diffusion: Perceptual Illusions by Noise Decomposition	Daniel Geng et.al.	2404.11615	null
2024-04-17	IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination	Xi Chen et.al.	2404.11593	null
2024-04-17	Prompt Optimizer of Text-to-Image Diffusion Models for Abstract Concept Understanding	Zezhong Fan et.al.	2404.11589	null
2024-04-17	MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation	Kuan-Chieh et.al.	2404.11565	null
2024-04-17	Predicting Long-horizon Futures by Conditioning on Geometry and Time	Tarasha Khurana et.al.	2404.11554	null
2024-04-17	SSDiff: Spatial-spectral Integrated Diffusion Model for Remote Sensing Pansharpening	Yu Zhong et.al.	2404.11537	null
2024-04-17	Towards Highly Realistic Artistic Style Transfer via Stable Diffusion with Step-aware and Layer-aware Prompt	Zhanjie Zhang et.al.	2404.11474	link
2024-04-17	Closely Interactive Human Reconstruction with Proxemics and Physics-Guided Adaption	Buzhen Huang et.al.	2404.11291	link
2024-04-17	Optical Image-to-Image Translation Using Denoising Diffusion Models: Heterogeneous Change Detection as a Use Case	João Gabriel Vinholi et.al.	2404.11243	null
2024-04-17	RiboDiffusion: Tertiary Structure-based RNA Inverse Folding with Generative Diffusion Models	Han Huang et.al.	2404.11199	link
2024-04-19	LAPTOP-Diff: Layer Pruning and Normalized Distillation for Compressing Diffusion Models	Dingkun Zhang et.al.	2404.11098	null
2024-04-16	Molecular relaxation by reverse diffusion with time step prediction	Khaled Kahouli et.al.	2404.10935	link
2024-04-16	RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting	Ashkan Mirzaei et.al.	2404.10765	null
2024-04-16	LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?	Yuchi Wang et.al.	2404.10763	link
2024-04-16	GazeHTA: End-to-end Gaze Target Detection with Head-Target Association	Zhi-Yi Lin et.al.	2404.10718	null
2024-04-16	Efficient Conditional Diffusion Model with Probability Flow Sampling for Image Super-resolution	Yutao Yuan et.al.	2404.10688	link
2024-04-16	Generating Human Interaction Motions in Scenes with Text Control	Hongwei Yi et.al.	2404.10685	null
2024-04-16	StyleCity: Large-Scale 3D Urban Scenes Stylization with Vision-and-Text Reference via Progressive Optimization	Yingshu Chen et.al.	2404.10681	null
2024-04-18	Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay	Jinmei Liu et.al.	2404.10662	link
2024-04-16	Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences	Seungwook Kim et.al.	2404.10603	null
2024-04-15	Equipping Diffusion Models with Differentiable Spatial Entropy for Low-Light Image Enhancement	Wenyi Lian et.al.	2404.09735	link
2024-04-15	Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models	Ziwei Luo et.al.	2404.09732	link
2024-04-15	All-in-one simulation-based inference	Manuel Gloeckler et.al.	2404.09636	link
2024-04-15	TMPQ-DM: Joint Timestep Reduction and Quantization Precision Selection for Efficient Diffusion Models	Haojun Sun et.al.	2404.09532	null
2024-04-15	Magic Clothing: Controllable Garment-Driven Image Synthesis	Weifeng Chen et.al.	2404.09512	link
2024-04-15	PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI	Yandan Yang et.al.	2404.09465	null
2024-04-15	Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models	Peifei Zhu et.al.	2404.09401	null
2024-04-14	Fault Detection in Mobile Networks Using Diffusion Models	Mohamad Nabeel et.al.	2404.09240	null
2024-04-14	DreamScape: 3D Scene Creation via Gaussian Splatting joint Correlation Modeling	Xuening Yuan et.al.	2404.09227	null
2024-04-14	LoopAnimate: Loopable Salient Object Animation	Fanyi Wang et.al.	2404.09172	null
2024-04-14	RF-Diffusion: Radio Signal Generation via Time-Frequency Diffusion	Guoxuan Chi et.al.	2404.09140	link
2024-04-13	Rethinking Iterative Stereo Matching from Diffusion Bridge Model Perspective	Yuguang Shi et.al.	2404.09051	null
2024-04-13	Theoretical research on generative diffusion models: an overview	Melike Nur Yeğin et.al.	2404.09016	null
2024-04-13	Multimodal Cross-Document Event Coreference Resolution Using Linear Semantic Transfer and Mixed-Modality Ensembles	Abhijnan Nath et.al.	2404.08949	link
2024-04-13	Enforcing Paraphrase Generation via Controllable Latent Diffusion	Wei Zou et.al.	2404.08938	link
2024-04-13	Diffusion Models Meet Remote Sensing: Principles, Methods, and Perspectives	Yidan Liu et.al.	2404.08926	null
2024-04-13	ChangeAnywhere: Sample Generation for Remote Sensing Change Detection via Semantic Latent Diffusion Model	Kai Tang et.al.	2404.08892	link
2024-04-12	Semantic Approach to Quantifying the Consistency of Diffusion Model Image Generation	Brinnae Bent et.al.	2404.08799	link
2024-04-12	Diffusion-Based Joint Temperature and Precipitation Emulation of Earth System Models	Katie Christensen et.al.	2404.08797	null
2024-04-12	Lossy Image Compression with Foundation Diffusion Models	Lucas Relic et.al.	2404.08580	null
2024-04-12	PiRD: Physics-informed Residual Diffusion for Flow Field Reconstruction	Siming Shan et.al.	2404.08412	null
2024-04-12	Struggle with Adversarial Defense? Try Diffusion	Yujie Li et.al.	2404.08273	link
2024-04-12	Balanced Mixed-Type Tabular Data Synthesis with Diffusion Models	Zeyu Yang et.al.	2404.08254	link
2024-04-12	Interest Maximization in Social Networks	Rahul Kumar Gautam et.al.	2404.08236	null
2024-04-11	ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback	Ming Li et.al.	2404.07987	link
2024-04-11	Taming Stable Diffusion for Text to 360° Panorama Image Generation	Cheng Zhang et.al.	2404.07949	link
2024-04-11	Adaptive Hyperbolic-cross-space Mapped Jacobi Method on Unbounded Domains with Applications to Solving Multidimensional Spatiotemporal Integrodifferential Equations	Yunhong Deng et.al.	2404.07844	null
2024-04-11	ConsistencyDet: Robust Object Detector with Denoising Paradigm of Consistency Model	Lifan Jiang et.al.	2404.07773	link
2024-04-11	An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization	Minshuo Chen et.al.	2404.07771	null
2024-04-11	Joint Conditional Diffusion Model for Image Restoration with Mixed Degradations	Yufeng Yue et.al.	2404.07770	null
2024-04-11	Diffusing in Someone Else’s Shoes: Robotic Perspective Taking with Diffusion	Josua Spisak et.al.	2404.07735	null
2024-04-11	Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models	Tuomas Kynkäänniemi et.al.	2404.07724	link
2024-04-11	Implicit and Explicit Language Guidance for Diffusion-based Visual Perception	Hefeng Wang et.al.	2404.07600	null
2024-04-11	ObjBlur: A Curriculum Learning Approach With Progressive Object-Level Blurring for Improved Layout-to-Image Generation	Stanislav Frolov et.al.	2404.07564	null
2024-04-11	Effects of phase separation on extinction times in population models	Janik Schüttler et.al.	2404.07563	null
2024-04-11	CAT: Contrastive Adapter Training for Personalized Image Generation	Jae Wan Park et.al.	2404.07554	link
2024-04-10	Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models	Yasi Zhang et.al.	2404.07389	null
2024-04-10	GoodDrag: Towards Good Practices for Drag Editing with Diffusion Models	Zewei Zhang et.al.	2404.07206	null
2024-04-10	RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion	Jaidev Shriram et.al.	2404.07199	null
2024-04-10	InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models	Jiale Xu et.al.	2404.07191	link
2024-04-10	Move Anything with Layered Scene Diffusion	Jiawei Ren et.al.	2404.07178	null
2024-04-10	Diffusion-based inpainting of incomplete Euclidean distance matrices of trajectories generated by a fractional Brownian motion	Alexander Lobashev et.al.	2404.07029	link
2024-04-10	DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting	Shijie Zhou et.al.	2404.06903	null
2024-04-10	Fine color guidance in diffusion models and its application to image compression at extremely low bitrates	Tom Bordin et.al.	2404.06865	null
2024-04-10	UDiFF: Generating Conditional Unsigned Distance Fields with Optimal Wavelet Diffusion	Junsheng Zhou et.al.	2404.06851	null
2024-04-10	Tuning-Free Adaptive Style Incorporation for Structure-Consistent Text-Driven Style Transfer	Yanqi Ge et.al.	2404.06835	null
2024-04-10	Zero-shot Point Cloud Completion Via 2D Priors	Tianxin Huang et.al.	2404.06814	link
2024-04-10	Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior	Fan Lu et.al.	2404.06780	null
2024-04-10	DiffusionDialog: A Diffusion Model for Diverse Dialog Generation with Latent Space	Jianxiang Xiang et.al.	2404.06760	null
2024-04-10	Disguised Copyright Infringement of Latent Diffusion Model	Yiwei Lu et.al.	2404.06737	link
2024-04-10	Efficient Denoising using Score Embedding in Score-based Diffusion Models	Andrew S. Na et.al.	2404.06661	null
2024-04-09	Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation	Luca Barsellotti et.al.	2404.06542	null
2024-04-09	GeoDirDock: Guiding Docking Along Geodesic Paths	Raúl Miñán et.al.	2404.06481	null
2024-04-09	Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion	Fan Yang et.al.	2404.06429	link
2024-04-09	ZeST: Zero-Shot Material Transfer from a Single Image	Ta-Ying Cheng et.al.	2404.06425	null
2024-04-09	Policy-Guided Diffusion	Matthew Thomas Jackson et.al.	2404.06356	link
2024-04-09	Quantum State Generation with Structure-Preserving Diffusion Model	Yuchen Zhu et.al.	2404.06336	null
2024-04-08	MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation	Kunpeng Song et.al.	2404.05674	link
2024-04-08	YaART: Yet Another ART Rendering Technology	Sergey Kastryulin et.al.	2404.05666	null
2024-04-08	BinaryDM: Towards Accurate Binarization of Diffusion Model	Xingyu Zheng et.al.	2404.05662	link
2024-04-08	Resistive Memory-based Neural Differential Equation Solver for Score-based Diffusion Model	Jichang Yang et.al.	2404.05648	link
2024-04-08	Learning a Category-level Object Pose Estimator without Pose Annotations	Fengrui Tian et.al.	2404.05626	null
2024-04-08	UniFL: Improve Stable Diffusion via Unified Feedback Learning	Jiacheng Zhang et.al.	2404.05595	null
2024-04-08	Investigating the Effectiveness of Cross-Attention to Unlock Zero-Shot Editing of Text-to-Video Diffusion Models	Saman Motamed et.al.	2404.05519	null
2024-04-08	Taming Transformers for Realistic Lidar Point Cloud Generation	Hamed Haghighi et.al.	2404.05505	link
2024-04-08	Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance	Dazhong Shen et.al.	2404.05384	link
2024-04-08	Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt	Zhiqi Huang et.al.	2404.05331	null
2024-04-08	Text-to-Image Synthesis for Any Artistic Styles: Advancements in Personalized Artistic Image Generation via Subdivision and Dual Binding	Junseo Park et.al.	2404.05256	null
2024-04-08	DiffCJK: Conditional Diffusion Model for High-Quality and Wide-coverage CJK Character Generation	Yingtao Tian et.al.	2404.05212	null
2024-04-07	Context-dependent Causality (the Non-Nonotonic Case)	Nir Billfeld et.al.	2404.05021	null
2024-04-07	Generative downscaling of PDE solvers with physics-guided diffusion models	Yulong Lu et.al.	2404.05009	link
2024-04-07	Gaussian Shading: Provable Performance-Lossless Image Watermarking for Diffusion Models	Zijin Yang et.al.	2404.04956	link
2024-04-07	Regularized Conditional Diffusion Model for Multi-Task Preference Alignment	Xudong Yu et.al.	2404.04920	null
2024-04-07	Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder	Yiyang Ma et.al.	2404.04916	null
2024-04-07	ShoeModel: Learning to Wear on the User-specified Shoes via Diffusion Model	Binghui Chen et.al.	2404.04833	null
2024-04-07	Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving	Jinlong Li et.al.	2404.04804	null
2024-04-07	Rethinking Diffusion Model for Multi-Contrast MRI Super-Resolution	Guangyuan Li et.al.	2404.04785	link
2024-04-04	MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation	Hanzhe Hu et.al.	2404.03656	null
2024-04-04	CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching	Dongzhi Jiang et.al.	2404.03653	link
2024-04-04	The More You See in 2D, the More You Perceive in 3D	Xinyang Han et.al.	2404.03652	null
2024-04-04	DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior	Yiming Zhang et.al.	2404.03642	null
2024-04-04	LCM-Lookahead for Encoder-based Text-to-Image Personalization	Rinon Gal et.al.	2404.03620	null
2024-04-04	DiffDet4SAR: Diffusion-based Aircraft Target Detection Network for SAR Images	Zhou Jie et.al.	2404.03595	link
2024-04-04	PointInfinity: Resolution-Invariant Point Diffusion Models	Zixuan Huang et.al.	2404.03566	null
2024-04-04	Segmentation-Guided Knee Radiograph Generation using Conditional Diffusion Models	Siyuan Mei et.al.	2404.03541	null
2024-04-04	A Directional Diffusion Graph Transformer for Recommendation	Zixuan Yi et.al.	2404.03326	null
2024-04-04	SiloFuse: Cross-silo Synthetic Data Generation with Latent Tabular Diffusion Models	Aditya Shankar et.al.	2404.03299	null
2024-04-04	Future-Proofing Class Incremental Learning	Quentin Jodelet et.al.	2404.03200	null
2024-04-04	HandDiff: 3D Hand Pose Estimation with Diffusion on Image-Point Cloud	Wencan Cheng et.al.	2404.03159	link
2024-04-04	DreamWalk: Style Space Exploration using Diffusion Guidance	Michelle Shu et.al.	2404.03145	null
2024-04-04	Diverse and Tailored Image Generation for Zero-shot Multi-label Classification	Kaixin Zhang et.al.	2404.03144	null
2024-04-04	The Diffusive Ultrasound Modulated Bioluminescence Tomography with Partial Data and Uncertain Optical Parameters	Tianyu Yang et.al.	2404.03124	null
2024-04-03	Many-to-many Image Generation with Auto-regressive Diffusion Models	Ying Shen et.al.	2404.03109	null
2024-04-03	Computing macroscopic reaction rates in reaction-diffusion systems using Monte Carlo simulations	Mohamed Swailem et.al.	2404.03089	null
2024-04-03	ASAP: Interpretable Analysis and Summarization of AI-generated Image Patterns at Scale	Jinbin Huang et.al.	2404.02990	null
2024-04-03	Deep Generative Models through the Lens of the Manifold Hypothesis: A Survey and New Connections	Gabriel Loaiza-Ganem et.al.	2404.02954	link
2024-04-03	LidarDM: Generative LiDAR Simulation in a Generated World	Vlas Zyrianov et.al.	2404.02903	link
2024-04-03	Fast Diffusion Model For Seismic Data Noise Attenuation	Junheng Peng et.al.	2404.02767	null
2024-04-03	Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models	Wentian Zhang et.al.	2404.02747	link
2024-04-03	Deep Privacy Funnel Model: From a Discriminative to a Generative Approach with an Application to Face Recognition	Behrooz Razeghi et.al.	2404.02696	null
2024-04-03	Diffexplainer: Towards Cross-modal Global Explanations with Diffusion Models	Matteo Pennisi et.al.	2404.02618	null
2024-04-03	A Unified Editing Method for Co-Speech Gesture Generation via Diffusion Inversion	Zeyu Zhao et.al.	2404.02411	null
2024-04-03	Enhancing Diffusion-based Point Cloud Generation with Smoothness Constraint	Yukun Li et.al.	2404.02396	null
2024-04-02	Semantic Augmentation in Images using Language	Sahiti Yerramilli et.al.	2404.02353	null
2024-04-02	Heat Death of Generative Models in Closed-Loop Learning	Matteo Marchi et.al.	2404.02325	null
2024-04-02	APEX: Ambidextrous Dual-Arm Robotic Manipulation Using Collision-Free Generative Diffusion Models	Apan Dastider et.al.	2404.02284	null
2024-04-02	Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better	Enshu Liu et.al.	2404.02241	link
2024-04-02	Diffusion$^2$: Dynamic 3D Content Generation via Score Composition of Orthogonal Diffusion Models	Zeyu Yang et.al.	2404.02148	link
2024-04-02	WcDT: World-centric Diffusion Transformer for Traffic Scene Generation	Chen Yang et.al.	2404.02082	link
2024-04-03	AUTODIFF: Autoregressive Diffusion Modeling for Structure-based Drug Design	Xinze Li et.al.	2404.02003	null
2024-04-02	Bi-LORA: A Vision-Language Approach for Synthetic Image Detection	Mamadou Keita et.al.	2404.01959	link
2024-04-02	Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model	Xu He et.al.	2404.01862	link
2024-04-02	Upsample Guidance: Scale Up Diffusion Models without Training	Juno Hwang et.al.	2404.01709	null
2024-04-02	FashionEngine: Interactive Generation and Editing of 3D Clothed Humans	Tao Hu et.al.	2404.01655	null
2024-04-02	Diffusion Deepfake	Chaitali Bhattacharyya et.al.	2404.01579	link
2024-04-01	Prior Frequency Guided Diffusion Model for Limited Angle (LA)-CBCT Reconstruction	Jiacheng Xie et.al.	2404.01448	null
2024-03-29	Relation Rectification in Diffusion Model	Yinwei Wu et.al.	2403.20249	null
2024-03-29	Motion Inversion for Video Customization	Luozhou Wang et.al.	2403.20193	null
2024-03-29	FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models	Barbara Toniella Corradini et.al.	2403.20105	null
2024-03-29	SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior	Zhongrui Yu et.al.	2403.20079	null
2024-03-29	Probing solar modulation analytic models with cosmic ray periodic spectra	Wei-Cheng Long et.al.	2403.20038	null
2024-04-01	Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting	Haipeng Liu et.al.	2403.19898	link
2024-03-28	Vision-Language Synthetic Data Enhances Echocardiography Downstream Tasks	Pooria Ashrafian et.al.	2403.19880	link
2024-03-28	ShapeFusion: A 3D diffusion model for localized shape editing	Rolandos Alexandros Potamias et.al.	2403.19773	null
2024-03-28	MIST: Mitigating Intersectional Bias with Disentangled Cross-Attention Editing in Text-to-Image Diffusion Models	Hidir Yesiltepe et.al.	2403.19738	null
2024-03-28	Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond	Katherine Xu et.al.	2403.19653	link
2024-03-28	InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction	Sirui Xu et.al.	2403.19652	null
2024-03-28	GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models	Yusuf Dalva et.al.	2403.19645	null
2024-03-28	In the driver’s mind: modeling the dynamics of human overtaking decisions in interactions with oncoming automated vehicles	Samir H. A. Mohammad et.al.	2403.19637	null
2024-03-28	Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model	Zhicai Wang et.al.	2403.19600	link
2024-03-28	Frame by Familiar Frame: Understanding Replication in Video Diffusion Models	Aimon Rahman et.al.	2403.19593	null
2024-03-28	Impact of Resin Molecular Weight on Drying Kinetics and Sag of Coatings	Marola W. Issa et.al.	2403.19544	null
2024-03-28	Debiasing Cardiac Imaging with Controlled Latent Diffusion Models	Grzegorz Skorupko et.al.	2403.19508	link
2024-03-28	Burst Super-Resolution with Diffusion Models for Improving Perceptual Quality	Kyotaro Tokoro et.al.	2403.19428	link
2024-03-28	Imperceptible Protection against Style Imitation from Diffusion Models	Namhyuk Ahn et.al.	2403.19254	null
2024-03-28	RecDiffusion: Rectangling for Image Stitching with Diffusion Models	Tianhao Zhou et.al.	2403.19164	link
2024-03-28	MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation	Seyeon Kim et.al.	2403.19144	link
2024-03-28	QNCD: Quantization Noise Correction for Diffusion Models	Huanpeng Chu et.al.	2403.19140	link
2024-03-27	Egocentric Scene-aware Human Trajectory Prediction	Weizhuo Wang et.al.	2403.19026	null
2024-03-27	TextCraftor: Your Text Encoder Can be Image Quality Controller	Yanyu Li et.al.	2403.18978	null
2024-03-27	CPR: Retrieval Augmented Generation for Copyright Protection	Aditya Golatkar et.al.	2403.18920	null
2024-03-27	A Geometric Explanation of the Likelihood OOD Detection Paradox	Hamidreza Kamkari et.al.	2403.18910	link
2024-03-27	ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion	Daniel Winter et.al.	2403.18818	null
2024-03-28	ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation	Suraj Patni et.al.	2403.18807	link
2024-03-27	Object Pose Estimation via the Aggregation of Diffusion Features	Tianfu Wang et.al.	2403.18791	link
2024-03-27	ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object	Chenshuang Zhang et.al.	2403.18775	link
2024-03-27	A Diffusion-Based Generative Equalizer for Music Restoration	Eloi Moliner et.al.	2403.18636	link
2024-03-27	HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions	Hao Xu et.al.	2403.18575	link
2024-03-27	Artifact Reduction in 3D and 4D Cone-beam Computed Tomography Images with Deep Learning – A Review	Mohammadreza Amirian et.al.	2403.18565	null
2024-03-27	CosalPure: Learning Concept from Group Images for Robust Co-Saliency Detection	Jiayi Zhu et.al.	2403.18554	null
2024-03-27	CT-3DFlow: Leveraging 3D Normalizing Flows for Unsupervised Detection of Pathological Pulmonary CT scans	Aissam Djahnine et.al.	2403.18514	null
2024-03-27	Synthesizing EEG Signals from Event-Related Potential Paradigms with Conditional Diffusion Models	Guido Klein et.al.	2403.18486	link
2024-03-27	DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis	Zhongxi Chen et.al.	2403.18471	link
2024-03-27	DiffStyler: Diffusion-based Localized Image Style Transfer	Shaoxu Li et.al.	2403.18461	link
2024-03-27	SingularTrajectory: Universal Trajectory Predictor Using Diffusion Model	Inhwan Bae et.al.	2403.18452	link
2024-03-27	U-Sketch: An Efficient Approach for Sketch to Image Diffusion Models	Ilias Mitsouras et.al.	2403.18425	null
2024-03-27	ECNet: Effective Controllable Text-to-Image Diffusion Models	Sicheng Li et.al.	2403.18417	null
2024-03-27	Ship in Sight: Diffusion Models for Ship-Image Super Resolution	Luigi Sigillo et.al.	2403.18370	link
2024-03-27	DODA: Diffusion for Object-detection Domain Adaptation in Agriculture	Shuai Xiang et.al.	2403.18334	link
2024-03-27	RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation	Yang Tian et.al.	2403.18259	null
2024-03-27	NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual Pretraining and Multi-level Modulation	Jingyang Huo et.al.	2403.18211	null
2024-03-28	Oh! We Freeze: Improving Quantized Knowledge Distillation via Signal Propagation Analysis for Large Language Models	Kartikeya Bhardwaj et.al.	2403.18159	null
2024-03-25	Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning	Sicong Pan et.al.	2403.16803	link
2024-03-25	Diff-Def: Diffusion-Generated Deformation Fields for Conditional Atlases	Sophie Starck et.al.	2403.16776	null
2024-03-25	Improving Diffusion Models’s Data-Corruption Resistance using Scheduled Pseudo-Huber Loss	Artem Khrapov et.al.	2403.16728	link
2024-03-25	SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions	Yuda Song et.al.	2403.16627	link
2024-03-25	SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation	Aysim Toker et.al.	2403.16605	null
2024-03-25	Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization	Xiangxin Zhou et.al.	2403.16576	null
2024-03-25	An Intermediate Fusion ViT Enables Efficient Text-Image Alignment in Diffusion Models	Zizhao Hu et.al.	2403.16530	null
2024-03-25	Let Real Images be as a Judger, Spotting Fake Images Synthesized with Generative Models	Ziyou Liang et.al.	2403.16513	null
2024-03-25	Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework	Ziyao Huang et.al.	2403.16510	link
2024-03-25	Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation	Sanyam Lakhanpal et.al.	2403.16422	null
2024-03-25	FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models	Lin Zhao et.al.	2403.16379	null
2024-03-24	Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis	Atefeh Khoshkhahtinat et.al.	2403.16258	null
2024-03-24	Skull-to-Face: Anatomy-Guided 3D Facial Reconstruction and Editing	Yongqing Liang et.al.	2403.16207	null
2024-03-24	Diffusion Model is a Good Pose Estimator from 3D RF-Vision	Junqiao Fan et.al.	2403.16198	null
2024-03-24	Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised Landmark Discovery	Siddharth Tourani et.al.	2403.16194	link
2024-03-26	Gaze-guided Hand-Object Interaction Synthesis: Benchmark and Method	Jie Tian et.al.	2403.16169	null
2024-03-24	Robust Diffusion Models for Adversarial Purification	Guang Lin et.al.	2403.16067	null
2024-03-24	A Unified Module for Accelerating STABLE-DIFFUSION: LCM-LORA	Ayush Thakur et.al.	2403.16024	null
2024-03-23	Feature Manipulation for DDPM based Change Detection	Zhenglin Li et.al.	2403.15943	null
2024-03-26	X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention	You Xie et.al.	2403.15931	null
2024-03-21	GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation	Yinghao Xu et.al.	2403.14621	link
2024-03-21	DreamReward: Text-to-3D Generation with Human Preference	Junliang Ye et.al.	2403.14613	null
2024-03-21	ReNoise: Real Image Inversion Through Iterative Noising	Daniel Garibi et.al.	2403.14602	null
2024-03-21	Denoising Diffusion Models for 3D Healthy Brain Tissue Inpainting	Alicia Durrer et.al.	2403.14499	link
2024-03-21	Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation	Mathias Öttl et.al.	2403.14429	null
2024-03-21	DP-RDM: Adapting Diffusion Models to Private Domains Without Fine-Tuning	Jonathan Lebensold et.al.	2403.14421	link
2024-03-21	Physics-Informed Diffusion Models	Jan-Hendrik Bastek et.al.	2403.14404	link
2024-03-21	Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models	Pablo Marcos-Manchón et.al.	2403.14291	link
2024-03-21	Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation	Francesco Di Felice et.al.	2403.14279	null
2024-03-21	Diffusion Models with Ensembled Structure-Based Anomaly Scoring for Unsupervised Anomaly Detection	Finn Behrendt et.al.	2403.14262	link
2024-03-21	Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition	Sihyun Yu et.al.	2403.14148	null
2024-03-21	Protein Conformation Generation via Force-Guided SE(3) Diffusion Models	Yan Wang et.al.	2403.14088	link
2024-03-21	QSMDiff: Unsupervised 3D Diffusion Models for Quantitative Susceptibility Mapping	Zhuang Xiong et.al.	2403.14070	null
2024-03-21	LeFusion: Synthesizing Myocardial Pathology on Cardiac MRI via Lesion-Focus Diffusion Models	Hantao Zhang et.al.	2403.14066	link
2024-03-21	DiffSTOCK: Probabilistic relational Stock Market Predictions using Diffusion Models	Divyanshu Daiya et.al.	2403.14063	null
2024-03-20	Enhancing Fingerprint Image Synthesis with GANs, Diffusion Models, and Style Transfer Techniques	W. Tang et.al.	2403.13916	null
2024-03-20	Towards Learning Contrast Kinetics with Multi-Condition Latent Diffusion Models	Richard Osuala et.al.	2403.13890	link
2024-03-20	Editing Massive Concepts in Text-to-Image Diffusion Models	Tianwei Xiong et.al.	2403.13807	link
2024-03-20	ZigMa: Zigzag Mamba Diffusion Model	Vincent Tao Hu et.al.	2403.13802	link
2024-03-20	TimeRewind: Rewinding Time with Image-and-Events Video Diffusion	Jingxi Chen et.al.	2403.13800	null
2024-03-20	DepthFM: Fast Monocular Depth Estimation with Flow Matching	Ming Gui et.al.	2403.13788	link
2024-03-20	Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation	Fu-Yun Wang et.al.	2403.13745	link
2024-03-20	DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance	Zixuan Wang et.al.	2403.13667	link
2024-03-20	ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer	Hiroki Azuma et.al.	2403.13652	link
2024-03-20	ReGround: Improving Textual and Spatial Grounding at No Cost	Yuseung Lee et.al.	2403.13589	null
2024-03-20	Ground-A-Score: Scaling Up the Score Distillation for Multi-Attribute Editing	Hangeol Chang et.al.	2403.13551	link
2024-03-20	Compress3D: a Compressed Latent Space for 3D Generation from a Single Image	Bowen Zhang et.al.	2403.13524	null
2024-03-20	VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis	Yumeng Li et.al.	2403.13501	link
2024-03-20	Scaling Diffusion Models to Real-World 3D LiDAR Scene Completion	Lucas Nunes et.al.	2403.13470	link
2024-03-20	S2DM: Sector-Shaped Diffusion Models for Video Generation	Haoran Lang et.al.	2403.13408	null
2024-03-20	IIDM: Image-to-Image Diffusion Model for Semantic Image Synthesis	Feng Liu et.al.	2403.13378	link
2024-03-20	AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation	Jingkun An et.al.	2403.13352	null
2024-03-20	LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment	Peishan Cong et.al.	2403.13307	link
2024-03-20	DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception	Yibo Wang et.al.	2403.13304	null
2024-03-20	Building Optimal Neural Architectures using Interpretable Knowledge	Keith G. Mills et.al.	2403.13293	link
2024-03-20	Beyond Skeletons: Integrative Latent Mapping for Coherent 4D Sequence Generation	Qitong Yang et.al.	2403.13238	null
2024-03-20	A Contact Model based on Denoising Diffusion to Learn Variable Impedance Control for Contact-rich Manipulation	Masashi Okada et.al.	2403.13221	null
2024-03-18	Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models	Emilian Postolache et.al.	2403.11706	link
2024-03-19	Urban Scene Diffusion through Semantic Occupancy Map	Junge Zhang et.al.	2403.11697	null
2024-03-18	Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection	Julia Wolleb et.al.	2403.11667	link
2024-03-18	Arc2Face: A Foundation Model of Human Faces	Foivos Paraperas Papantoniou et.al.	2403.11641	link
2024-03-18	LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models	Yang Yang et.al.	2403.11627	link
2024-03-18	CRS-Diff: Controllable Generative Remote Sensing Foundation Model	Datao Tang et.al.	2403.11614	link
2024-03-18	EffiVED: Efficient Video Editing via Text-instruction Diffusion Models	Zhenghao Zhang et.al.	2403.11568	link
2024-03-18	EchoReel: Enhancing Action Generation of Existing Video Diffusion Models	Jianzhi Liu et.al.	2403.11535	link
2024-03-18	Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors	Ruicheng Wang et.al.	2403.11503	null
2024-03-18	SeisFusion: Constrained Diffusion Model with Input Guidance for 3D Seismic Data Interpolation and Reconstruction	Shuang Wang et.al.	2403.11482	link
2024-03-18	ALDM-Grasping: Diffusion-aided Zero-Shot Sim-to-Real Transfer for Robot Grasping	Yiwei Li et.al.	2403.11459	null
2024-03-18	CasSR: Activating Image Power for Real-World Image Super-Resolution	Haolan Chen et.al.	2403.11451	null
2024-03-18	VmambaIR: Visual State Space Model for Image Restoration	Yuan Shi et.al.	2403.11423	link
2024-03-18	DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation	Jeongsol Kim et.al.	2403.11415	link
2024-03-18	Divide-and-Conquer Posterior Sampling for Denoising Diffusion Priors	Yazid Janati et.al.	2403.11407	link
2024-03-17	StainDiffuser: MultiTask Dual Diffusion Model for Virtual Staining	Tushar Kataria et.al.	2403.11340	null
2024-03-17	Fast Personalized Text-to-Image Syntheses With Attention Injection	Yuxuan Zhang et.al.	2403.11284	null
2024-03-17	Understanding Diffusion Models by Feynman’s Path Integral	Yuji Hirono et.al.	2403.11262	null
2024-03-17	THOR: Text to Human-Object Interaction Diffusion via Relation Intervention	Qianyang Wu et.al.	2403.11208	null
2024-03-17	MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation	Yasufumi Kawano et.al.	2403.11194	link
2024-03-14	SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior	Huan-ang Gao et.al.	2403.09638	null
2024-03-14	3D-VLA: A 3D Vision-Language-Action Generative World Model	Haoyu Zhen et.al.	2403.09631	null
2024-03-14	Generalized Predictive Model for Autonomous Driving	Jiazhi Yang et.al.	2403.09630	link
2024-03-14	Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation	Fangfu Liu et.al.	2403.09625	null
2024-03-14	Score-Guided Diffusion for 3D Human Recovery	Anastasis Stathopoulos et.al.	2403.09623	link
2024-03-14	Explore In-Context Segmentation via Latent Diffusion Models	Chaoyang Wang et.al.	2403.09616	null
2024-03-14	MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models	Zunnan Xu et.al.	2403.09471	link
2024-03-14	Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing	Wonjun Kang et.al.	2403.09468	link
2024-03-14	Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk	Zhangheng Li et.al.	2403.09450	link
2024-03-14	3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation	Frank Zhang et.al.	2403.09439	null
2024-03-14	LM2D: Lyrics- and Music-Driven Dance Synthesis	Wenjie Yin et.al.	2403.09407	null
2024-03-14	Mitigating Data Consistency Induced Discrepancy in Cascaded Diffusion Models for Sparse-view CT Reconstruction	Hanyu Chen et.al.	2403.09355	null
2024-03-14	HeadEvolver: Text to Head Avatars via Locally Learnable Mesh Deformation	Duotun Wang et.al.	2403.09326	null
2024-03-14	Regularity and trend to equilibrium for a non-local advection-diffusion model of active particles	Luca Alasio et.al.	2403.09282	null
2024-03-14	XReal: Realistic Anatomy and Pathology-Aware X-ray Generation via Controllable Diffusion Model	Anees Ur Rehman Hashmi et.al.	2403.09240	link
2024-03-14	Intention-driven Ego-to-Exo Video Generation	Hongchen Luo et.al.	2403.09194	null
2024-03-14	Intention-aware Denoising Diffusion Model for Trajectory Prediction	Chen Liu et.al.	2403.09190	null
2024-03-14	Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts	Byeongjun Park et.al.	2403.09176	link
2024-03-14	Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior	Cheng Chen et.al.	2403.09140	null
2024-03-14	Rethinking Referring Object Removal	Xiangtian Xue et.al.	2403.09128	null
2024-03-13	VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis	Enric Corona et.al.	2403.08764	null
2024-03-13	Spatiotemporal Diffusion Model with Paired Sampling for Accelerated Cardiac Cine MRI	Shihan Qiu et.al.	2403.08758	null
2024-03-13	Clinically Feasible Diffusion Reconstruction for Highly-Accelerated Cardiac Cine MRI	Shihan Qiu et.al.	2403.08749	null
2024-03-14	GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing	Jing Wu et.al.	2403.08733	link
2024-03-13	Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data	Asad Aali et.al.	2403.08728	link
2024-03-13	Data Augmentation in Human-Centric Vision	Wentao Jiang et.al.	2403.08650	null
2024-03-13	ActionDiffusion: An Action-aware Diffusion Model for Procedure Planning in Instructional Videos	Lei Shi et.al.	2403.08591	null
2024-03-13	Federated Knowledge Graph Unlearning via Diffusion Model	Bingchen Liu et.al.	2403.08554	null
2024-03-13	Model Will Tell: Training Membership Inference for Diffusion Models	Xiaomeng Fu et.al.	2403.08487	null
2024-03-13	MD-Dose: A Diffusion Model based on the Mamba for Radiotherapy Dose Prediction	Linjie Fu et.al.	2403.08479	link
2024-03-13	An Analysis of Human Alignment of Latent Diffusion Models	Lorenz Linhardt et.al.	2403.08469	null
2024-03-13	Diffusion Models with Implicit Guidance for Medical Anomaly Detection	Cosmin I. Bercea et.al.	2403.08464	link
2024-03-13	Towards Dense and Accurate Radar Perception Via Efficient Cross-Modal Diffusion Model	Ruibin Zhang et.al.	2403.08460	link
2024-03-13	PFStorer: Personalized Face Restoration and Super-Resolution	Tuomas Varanka et.al.	2403.08436	null
2024-03-13	Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification	Shuhan Li et.al.	2403.08407	null
2024-03-13	Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models	Pengze Zhang et.al.	2403.08381	link
2024-03-13	Mitigate Target-level Insensitivity of Infrared Small Target Detection via Posterior Distribution Modeling	Haoqing Li et.al.	2403.08380	link
2024-03-13	VIGFace: Virtual Identity Generation Model for Face Image Synthesis	Minsoo Kim et.al.	2403.08277	link
2024-03-13	Sketch2Manga: Shaded Manga Screening from Sketch with Diffusion Models	Jian Lin et.al.	2403.08266	null
2024-03-13	Make Me Happier: Evoking Emotions Through Image Diffusion Models	Qing Lin et.al.	2403.08255	null
2024-03-11	BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion	Xuan Ju et.al.	2403.06976	link
2024-03-11	Bayesian Diffusion Models for 3D Shape Reconstruction	Haiyang Xu et.al.	2403.06973	null
2024-03-11	POD-ROM methods: from a finite set of snapshots to continuous-in-time approximations	Bosco Garcia-Archilla et.al.	2403.06967	null
2024-03-11	SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data	Jialu Li et.al.	2403.06952	null
2024-03-12	DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations	Tianhao Qi et.al.	2403.06951	link
2024-03-11	Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction	Qing Xiao et.al.	2403.06940	null
2024-03-11	Estimation of parameters and local times in a discretely observed threshold diffusion model	Sara Mazzonetto et.al.	2403.06858	null
2024-03-11	Multistep Consistency Models	Jonathan Heek et.al.	2403.06807	null
2024-03-11	Distribution-Aware Data Expansion with Diffusion Models	Haowei Zhu et.al.	2403.06741	link
2024-03-11	V3D: Video Diffusion Models are Effective 3D Generators	Zilong Chen et.al.	2403.06738	link
2024-03-11	Active Generation for Image Classification	Tao Huang et.al.	2403.06517	link
2024-03-11	Advancing Text-Driven Chest X-Ray Generation with Policy-Based Reinforcement Learning	Woojung Han et.al.	2403.06516	null
2024-03-11	Incorporating Improved Sinusoidal Threshold-based Semi-supervised Method and Diffusion Models for Osteoporosis Diagnosis	Wenchi Ke et.al.	2403.06498	null
2024-03-11	Are you sure? Modelling Drivers’ Confidence Judgments in Left-Turn Gap Acceptance Decisions	Arkady Zgonnikov et.al.	2403.06496	null
2024-03-11	Text2QR: Harmonizing Aesthetic Customization and Scanning Robustness for Text-Guided QR Code Generation	Guangyang Wu et.al.	2403.06452	link
2024-03-11	DivCon: Divide and Conquer for Progressive Text-to-Image Generation	Yuhao Jia et.al.	2403.06400	link
2024-03-11	FSViewFusion: Few-Shots View Generation of Novel Objects	Rukhshanda Hussain et.al.	2403.06394	null
2024-03-11	Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models	Yang Zhang et.al.	2403.06381	link
2024-03-12	Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style	Shuai Tan et.al.	2403.06365	null
2024-03-10	Transferable Reinforcement Learning via Generalized Occupancy Models	Chuning Zhu et.al.	2403.06328	null
2024-03-07	ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes	Hashmat Shadab Malik et.al.	2403.04701	link
2024-03-07	Delving into the Trajectory Long-tail Distribution for Muti-object Tracking	Sijia Chen et.al.	2403.04700	link
2024-03-07	PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation	Junsong Chen et.al.	2403.04692	link
2024-03-07	Pix2Gif: Motion-Guided Diffusion for GIF Generation	Hitesh Kandala et.al.	2403.04634	link
2024-03-07	A Domain Translation Framework with an Adversarial Denoising Diffusion Model to Generate Synthetic Datasets of Echocardiography Images	Cristiana Tiago et.al.	2403.04612	null
2024-03-07	Anatomy-Guided Surface Diffusion Model for Alzheimer’s Disease Normative Modeling	Jianwei Zhang et.al.	2403.04531	null
2024-03-07	Effect of turbulent diffusion in modeling anaerobic digestion	Jeremy Z. Yan et.al.	2403.04457	null
2024-03-07	Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal Denoiser	Qingyuan Cai et.al.	2403.04444	link
2024-03-07	StableDrag: Stable Dragging for Point-based Image Editing	Yutao Cui et.al.	2403.04437	null
2024-03-07	On-demand Quantization for Green Federated Generative Diffusion in Mobile Edge Networks	Bingkun Lai et.al.	2403.04430	null
2024-03-07	Controllable Generation with Text-to-Image Diffusion Models: A Survey	Pu Cao et.al.	2403.04279	link
2024-03-06	PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement	Zhijie Wang et.al.	2403.04014	link
2024-03-06	GUIDE: Guidance-based Incremental Learning with Diffusion Models	Bartosz Cywiński et.al.	2403.03938	link
2024-03-06	Latent Dataset Distillation with Diffusion Models	Brian B. Moser et.al.	2403.03881	null
2024-03-06	Accelerating Convergence of Score-Based Diffusion Models, Provably	Gen Li et.al.	2403.03852	null
2024-03-06	Diffusion on language model embeddings for protein sequence generation	Viacheslav Meshchaninov et.al.	2403.03726	null
2024-03-06	Efficient Search and Learning for Agile Locomotion on Stepping Stones	Adithya Kumar Chinnakkonda Ravi et.al.	2403.03639	null
2024-03-06	Diffusion-based Generative Prior for Low-Complexity MIMO Channel Estimation	Benedikt Fesl et.al.	2403.03545	link
2024-03-06	NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging	Takahiro Shirakawa et.al.	2403.03485	link
2024-03-06	FLAME Diffuser: Grounded Wildfire Image Synthesis using Mask Guided Diffusion	Hao Wang et.al.	2403.03463	link
2024-03-06	Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing	Bingyan Liu et.al.	2403.03431	null
2024-03-05	Scaling Rectified Flow Transformers for High-Resolution Image Synthesis	Patrick Esser et.al.	2403.03206	null
2024-03-05	MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets	Hossein Aboutalebi et.al.	2403.03194	link
2024-03-05	NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models	Zeqian Ju et.al.	2403.03100	null
2024-03-05	Global N-body Simulation of Gap Edge Structures Created by Perturbations from a Small Satellite Embedded in Saturn’s Rings	Naoya Torii et.al.	2403.03012	null
2024-03-05	Cross-Domain Image Conversion by CycleDM	Sho Shimotsumagari et.al.	2403.02919	null
2024-03-05	MMoFusion: Multi-modal Co-Speech Motion Generation with Diffusion Model	Sen Wang et.al.	2403.02905	link
2024-03-05	Enhancing the Rate-Distortion-Perception Flexibility of Learned Image Codecs with Conditional Diffusion Decoders	Daniele Mari et.al.	2403.02887	null
2024-03-05	Zero-LED: Zero-Reference Lighting Estimation Diffusion Model for Low-Light Image Enhancement	Jinhong He et.al.	2403.02879	null
2024-03-05	Scalable Continuous-time Diffusion Framework for Network Inference and Influence Estimation	Keke Huang et.al.	2403.02867	link
2024-03-05	Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation	Weijie Li et.al.	2403.02827	null
2024-03-05	Fast, Scale-Adaptive, and Uncertainty-Aware Downscaling of Earth System Model Fields with Generative Foundation Models	Philipp Hess et.al.	2403.02774	null
2024-03-02	DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction	Junwen Xiong et.al.	2403.01226	null
2024-03-02	TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through Diffusion	Salaheldin Mohamed et.al.	2403.01212	null
2024-03-02	Training Unbiased Diffusion Models From Biased Dataset	Yeongmin Kim et.al.	2403.01189	link
2024-03-02	Volume diffusion modelling of a sheared granular gas	Duncan Dockar et.al.	2403.01188	null
2024-03-02	Text-guided Explorable Image Super-resolution	Kanchana Vaishnavi Gandikota et.al.	2403.01124	null
2024-03-02	Face Swap via Diffusion Model	Feifei Wang et.al.	2403.01108	link
2024-03-01	A time-stepping deep gradient flow method for option pricing in (rough) diffusion models	Antonis Papapantoleon et.al.	2403.00746	link
2024-03-01	Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks	Yuhao Liu et.al.	2403.00644	null
2024-03-01	Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived Dataset	Ander Salaberria et.al.	2403.00587	link
2024-03-01	Rethinking cluster-conditioned diffusion models	Nikolas Adaloglou et.al.	2403.00570	link
2024-03-01	Waves, patterns and bifurcations: a tutorial review on the vertebrate segmentation clock	Paul François et.al.	2403.00457	null
2024-03-01	An Ordinal Diffusion Model for Generating Medical Images with Different Severity Levels	Shumpei Takezaki et.al.	2403.00452	null
2024-03-01	LoMOE: Localized Multi-Object Editing via Multi-Diffusion	Goirik Chakrabarty et.al.	2403.00437	null
2024-03-01	Abductive Ego-View Accident Video Understanding for Safe Driving Perception	Jianwu Fang et.al.	2403.00436	null
2024-03-01	HyperSDFusion: Bridging Hierarchical Structures in Language and Geometry for Enhanced 3D Text2Shape Generation	Zhiying Leng et.al.	2403.00372	null
2024-03-01	Robust Policy Learning via Offline Skill Diffusion	Woo Kyung Kim et.al.	2403.00225	null
2024-02-29	DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models	Muyang Li et.al.	2402.19481	link
2024-02-29	Towards Generalizable Tumor Synthesis	Qi Chen et.al.	2402.19470	link
2024-02-29	Listening to the Noise: Blind Denoising with Gibbs Diffusion	David Heurtel-Depeiges et.al.	2402.19455	link
2024-02-29	Structure Preserving Diffusion Models	Haoye Lu et.al.	2402.19369	null
2024-02-29	A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation	Hanxi Li et.al.	2402.19330	link
2024-02-29	DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly	Gianluca Scarpellini et.al.	2402.19302	link
2024-02-29	TEncDM: Understanding the Properties of Diffusion Model in the Space of Language Model Encodings	Alexander Shabalin et.al.	2402.19097	link
2024-02-29	Graph Convolutional Neural Networks for Automated Echocardiography View Recognition: A Holistic Approach	Sarina Thomas et.al.	2402.19062	null
2024-02-29	WDM: 3D Wavelet Diffusion Models for High-Resolution Medical Image Synthesis	Paul Friedrich et.al.	2402.19043	link
2024-02-29	Generating, Reconstructing, and Representing Discrete and Continuous Data: Generalized Diffusion with Learnable Encoding-Decoding	Guangyi Liu et.al.	2402.19009	link
2024-02-29	ViewFusion: Towards Multi-View Consistency via Interpolated Denoising	Xianghui Yang et.al.	2402.18842	link
2024-02-29	Extended Flow Matching: a Method of Conditional Generation with Generalized Continuity Equation	Noboru Isobe et.al.	2402.18839	null
2024-02-29	A Quantitative Evaluation of Score Distillation Sampling Based Text-to-3D	Xiaohan Fei et.al.	2402.18780	null
2024-02-28	Exploring Privacy and Fairness Risks in Sharing Diffusion Models: An Adversarial Perspective	Xinjian Luo et.al.	2402.18607	null
2024-02-28	Logarithmic Sobolev Inequalities for Bounded Domains and Applications to Drift-Diffusion Equations	Elie Abdo et.al.	2402.18572	null
2024-02-28	Dynamical Regimes of Diffusion Models	Giulio Biroli et.al.	2402.18491	null
2024-02-28	Deep Confident Steps to New Pockets: Strategies for Docking Generalization	Gabriele Corso et.al.	2402.18396	link
2024-02-28	Objective and Interpretable Breast Cosmesis Evaluation with Attention Guided Denoising Diffusion Anomaly Detection Model	Sangjoon Park et.al.	2402.18362	null
2024-02-28	FineDiffusion: Scaling up Diffusion Models for Fine-grained Image Generation with 10,000 Classes	Ziying Pan et.al.	2402.18331	link
2024-02-28	Balancing Act: Distribution-Guided Debiasing in Diffusion Models	Rishubh Parihar et.al.	2402.18206	null
2024-02-28	Diffusion-based Neural Network Weights Generation	Bedionita Soro et.al.	2402.18153	link
2024-02-28	Context-aware Talking Face Video Generation	Meidai Xuanyuan et.al.	2402.18092	null
2024-02-28	Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis	Yanzuo Lu et.al.	2402.18078	link
2024-02-28	SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model	Bin Cao et.al.	2402.18068	link
2024-02-28	Diffusion Models as Constrained Samplers for Optimization with Unknown Constraints	Lingkai Kong et.al.	2402.18012	null
2024-02-28	Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning	Zeyang Liu et.al.	2402.17978	null
2024-02-27	Box It to Bind It: Unified Layout Control and Attribute Binding in T2I Diffusion Models	Ashkan Taghipour et.al.	2402.17910	link
2024-02-27	Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning	Xiaoyu Zhang et.al.	2402.17768	null
2024-02-27	Structure-Guided Adversarial Training of Diffusion Models	Ling Yang et.al.	2402.17563	null
2024-02-27	Diffusion Model-Based Image Editing: A Survey	Yi Huang et.al.	2402.17525	link
2024-02-27	Label-Noise Robust Diffusion Models	Byeonghu Na et.al.	2402.17517	link
2024-02-27	EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions	Linrui Tian et.al.	2402.17485	null
2024-02-28	DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models	Shyam Marjit et.al.	2402.17412	null
2024-02-27	Generative diffusion model for surface structure discovery	Nikolaj Rønne et.al.	2402.17404	null
2024-02-26	Stochastic Conditional Diffusion Models for Semantic Image Synthesis	Juyeon Ko et.al.	2402.16506	link
2024-02-26	Outline-Guided Object Inpainting with Diffusion Models	Markus Pobitzer et.al.	2402.16421	null
2024-02-26	Placing Objects in Context via Inpainting for Out-of-distribution Segmentation	Pau de Jorge et.al.	2402.16392	link
2024-02-26	Generative AI in Vision: A Survey on Models, Metrics and Applications	Gaurav Raut et.al.	2402.16369	null
2024-02-26	Feedback Efficient Online Fine-Tuning of Diffusion Models	Masatoshi Uehara et.al.	2402.16359	null
2024-02-26	Graph Diffusion Policy Optimization	Yijing Liu et.al.	2402.16302	link
2024-02-25	Photon-counting CT using a Conditional Diffusion Model for Super-resolution and Texture-preservation	Christopher Wiedeman et.al.	2402.16212	null
2024-02-25	Towards Efficient Quantum Hybrid Diffusion Models	Francesca De Falco et.al.	2402.16147	null
2024-02-25	Cinematographic Camera Diffusion Model	Hongda Jiang et.al.	2402.16143	link
2024-02-25	Behavioral Refinement via Interpolant-based Policy Diffusion	Kaiqi Chen et.al.	2402.16075	link
2024-02-24	HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models	Li Pang et.al.	2402.15865	link
2024-02-23	Minimax Optimality of Score-based Diffusion Models: Beyond the Density Lower Bound Assumptions	Kaihong Zhang et.al.	2402.15602	null
2024-02-23	Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition	Chun-Hsiao Yeh et.al.	2402.15504	link
2024-02-23	ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation	Yi Zhang et.al.	2402.15429	link
2024-02-23	Let’s Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models	Shunyu Liu et.al.	2402.15289	link
2024-02-23	Weak Reproductive Solutions for a Convection-Diffusion Model Describing a Binary Alloy Solidification Processes	Blanca Climent-Ezquerra et.al.	2402.15221	null
2024-02-23	Label-efficient Multi-organ Segmentation Method with Diffusion Model	Yongzhi Huang et.al.	2402.15216	null
2024-02-23	Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control	Masatoshi Uehara et.al.	2402.15194	null
2024-02-23	Dynamics-Guided Diffusion Model for Robot Manipulator Design	Xiaomeng Xu et.al.	2402.15038	null
2024-02-22	Cameras as Rays: Pose Estimation via Ray Diffusion	Jason Y. Zhang et.al.	2402.14817	null
2024-02-22	Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models	Yixuan Ren et.al.	2402.14780	null
2024-02-22	Debiasing Text-to-Image Diffusion Models	Ruifei He et.al.	2402.14577	null
2024-02-22	Model-Based Reinforcement Learning Control of Reaction-Diffusion Problems	Christina Schenk et.al.	2402.14446	null
2024-02-22	Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning	Haoran He et.al.	2402.14407	link
2024-02-22	Diffusion Model Based Visual Compensation Guidance and Visual Difference Analysis for No-Reference Image Quality Assessment	Zhaoyang Wang et.al.	2402.14401	link
2024-02-22	Typographic Text Generation with Off-the-Shelf Diffusion Model	KhayTze Peong et.al.	2402.14314	null
2024-02-22	Font Style Interpolation with Diffusion Models	Tetta Kondo et.al.	2402.14311	null
2024-02-22	Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion	Yujia Huang et.al.	2402.14285	link
2024-02-22	MVD$^2$: Efficient Multiview 3D Reconstruction for Multiview Diffusion	Xin-Yang Zheng et.al.	2402.14253	null
2024-02-21	T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching	Zizheng Pan et.al.	2402.14167	link
2024-02-21	Non-asymptotic Convergence of Discrete-time Diffusion Models: New Approach and Improved Rate	Yuchen Liang et.al.	2402.13901	null
2024-02-21	NeuralDiffuser: Controllable fMRI Reconstruction with Primary Visual Feature Guided Diffusion	Haoyu Li et.al.	2402.13809	link
2024-02-22	Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions	Jiayu Chen et.al.	2402.13777	link
2024-02-21	Cas-DiffCom: Cascaded diffusion model for infant longitudinal super-resolution 3D medical image completion	Lianghu Guo et.al.	2402.13776	null
2024-02-21	Music Style Transfer with Time-Varying Inversion of Diffusion Models	Sifei Li et.al.	2402.13763	null
2024-02-21	SRNDiff: Short-term Rainfall Nowcasting with Condition Diffusion Model	Xudong Ling et.al.	2402.13737	link
2024-02-21	Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation	Kihong Kim et.al.	2402.13729	null
2024-02-21	Flexible Physical Camouflage Generation Based on a Differential Approach	Yang Li et.al.	2402.13575	null
2024-02-21	ToDo: Token Downsampling for Efficient Generation of High-Resolution Images	Ethan Smith et.al.	2402.13573	null
2024-02-21	Generative AI for Secure Physical Layer Communications: A Survey	Changyuan Zhao et.al.	2402.13553	null
2024-02-21	DiffPLF: A Conditional Diffusion Model for Probabilistic Forecasting of EV Charging Load	Siyang Li et.al.	2402.13548	link
2024-02-21	Contrastive Prompts Improve Disentanglement in Text-to-Image Diffusion Models	Chen Wu et.al.	2402.13490	null
2024-02-20	Layout-to-Image Generation with Localized Descriptions using ControlNet with Cross-Attention Control	Denis Lukovnikov et.al.	2402.13404	null
2024-02-20	The Uncanny Valley: A Comprehensive Analysis of Diffusion Models	Karam Ghanem et.al.	2402.13369	null
2024-02-20	Neural Network Diffusion	Kai Wang et.al.	2402.13144	link
2024-02-20	Text-Guided Molecule Generation with Diffusion Language Model	Haisong Gong et.al.	2402.13040	link
2024-02-21	Visual Style Prompting with Swapping Self-Attention	Jaeseok Jeong et.al.	2402.12974	link
2024-02-20	CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection	Sohail Ahmed Khan et.al.	2402.12927	link
2024-02-20	RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models	Xinchen Zhang et.al.	2402.12908	link
2024-02-20	Two-stage Rainfall-Forecasting Diffusion Model	XuDong Ling et.al.	2402.12779	link
2024-02-19	FiT: Flexible Vision Transformer for Diffusion Model	Zeyu Lu et.al.	2402.12376	link
2024-02-19	Synthetic location trajectory generation using categorical diffusion models	Simon Dirmeier et.al.	2402.12242	link
2024-02-19	Adversarial Feature Alignment: Balancing Robustness and Accuracy in Deep Learning via Adversarial Training	Leo Hyun Park et.al.	2402.12187	null
2024-02-19	Human Video Translation via Query Warping	Haiming Zhu et.al.	2402.12099	null
2024-02-19	Direct Consistency Optimization for Compositional Text-to-Image Personalization	Kyungmin Lee et.al.	2402.12004	null
2024-02-19	Privacy-Preserving Low-Rank Adaptation for Latent Diffusion Models	Zihao Luo et.al.	2402.11989	link
2024-02-19	DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation	Chong Zeng et.al.	2402.11929	link
2024-02-19	A Generative Pre-Training Framework for Spatio-Temporal Graph Transfer Learning	Yuan Yuan et.al.	2402.11922	link
2024-02-19	ComFusion: Personalized Subject Generation in Multiple Specific Scenes From Single Image	Yan Hong et.al.	2402.11849	null
2024-02-19	UnlearnCanvas: A Stylized Image Dataset to Benchmark Machine Unlearning for Diffusion Models	Yihua Zhang et.al.	2402.11846	link
2024-02-19	WildFake: A Large-scale Challenging Dataset for AI-Generated Images Detection	Yan Hong et.al.	2402.11843	null
2024-02-19	Statistical Test for Generated Hypotheses by Diffusion Models	Teruyuki Katsuoka et.al.	2402.11789	null
2024-02-19	Towards Theoretical Understandings of Self-Consuming Generative Models	Shi Fu et.al.	2402.11778	null
2024-02-18	SDiT: Spiking Diffusion Model with Transformer	Shu Yang et.al.	2402.11588	null
2024-02-18	CaloGraph: Graph-based diffusion model for fast shower generation in calorimeters with irregular geometry	Dmitrii Kobylianskii et.al.	2402.11575	null
2024-02-18	Temporal Disentangled Contrastive Diffusion Model for Spatiotemporal Imputation	Yakun Chen et.al.	2402.11558	null
2024-02-18	Visual Concept-driven Image Generation with Text-to-Image Diffusion Model	Tanzila Rahman et.al.	2402.11487	null
2024-02-17	Partial Ly$α$ thermalization in an analytic nonlinear diffusion model	Georg Wolschin et.al.	2402.11320	null
2024-02-17	TC-DiffRecon: Texture coordination MRI reconstruction method based on diffusion model and modified MF-UNet method	Chenyan Zhang et.al.	2402.11274	link
2024-02-17	DiffPoint: Single and Multi-view Point Cloud Reconstruction with ViT Based Diffusion Model	Yu Feng et.al.	2402.11241	null
2024-02-15	Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation	Huizhuo Yuan et.al.	2402.10210	null
2024-02-15	Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment	Rui Yang et.al.	2402.10207	link
2024-02-15	Radio-astronomical Image Reconstruction with Conditional Denoising Diffusion Model	Mariia Drozdova et.al.	2402.10204	link
2024-02-15	Classification Diffusion Models	Shahar Yadin et.al.	2402.10095	null
2024-02-15	Diffusion Models Meet Contextual Bandits with Large Action Spaces	Imad Aouali et.al.	2402.10028	null
2024-02-15	Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion	Hila Manor et.al.	2402.10009	null
2024-02-15	Accelerating Parallel Sampling of Diffusion Models	Zhiwei Tang et.al.	2402.09970	link
2024-02-15	Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation	Junjie Shentu et.al.	2402.09966	link
2024-02-15	Lester: rotoscope animation through video object segmentation and tracking	Ruben Tous et.al.	2402.09883	link
2024-02-15	Diffusion Models for Audio Restoration	Jean-Marie Lemercier et.al.	2402.09821	null
2024-02-15	DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization	Jisu Nam et.al.	2402.09812	link
2024-02-15	Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement	Tao Yang et.al.	2402.09712	null
2024-02-14	Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection	Pengfei Zhou et.al.	2402.09242	link
2024-02-14	Semi-Supervised Diffusion Model for Brain Age Prediction	Ayodeji Ijishakin et.al.	2402.09137	null
2024-02-14	L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects	Yutaro Yamada et.al.	2402.09052	null
2024-02-14	Extreme Video Compression with Pre-trained Diffusion Models	Bohan Li et.al.	2402.08934	link
2024-02-14	The Mirrored Influence Hypothesis: Efficient Data Influence Estimation by Harnessing Forward Passes	Myeongseob Ko et.al.	2402.08922	link
2024-02-13	Percolating transition to turbulence without puffs or bands	Sébastien Gomé et.al.	2402.08829	null
2024-02-13	LDTrack: Dynamic People Tracking by Service Robots using Diffusion Models	Angus Fung et.al.	2402.08774	null
2024-02-13	Towards the Detection of AI-Synthesized Human Face Images	Yuhang Lu et.al.	2402.08750	null
2024-02-13	PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models	Fei Deng et.al.	2402.08714	null
2024-02-13	Zero Shot Molecular Generation via Similarity Kernels	Rokas Elijošius et.al.	2402.08708	link
2024-02-13	Chain Reaction of Ideas: Can Radioactive Decay Predict Technological Innovation?	Guilherme S. Y. Giardini et.al.	2402.08681	null
2024-02-13	Target Score Matching	Valentin De Bortoli et.al.	2402.08667	null
2024-02-13	Learning Continuous 3D Words for Text-to-Image Generation	Ta-Ying Cheng et.al.	2402.08654	link
2024-02-13	Denoising Diffusion Restoration Tackles Forward and Inverse Problems for the Laplace Operator	Amartya Mukherjee et.al.	2402.08563	null
2024-02-13	Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases	Ziyi Zhang et.al.	2402.08552	link
2024-02-13	A Dense Reward View on Aligning Text-to-Image Diffusion with Preference	Shentao Yang et.al.	2402.08265	link
2024-02-13	Fine-Tuning Text-To-Image Diffusion Models for Class-Wise Spurious Feature Generation	AprilPyone MaungMaung et.al.	2402.08200	null
2024-02-14	Convergence Analysis of Discrete Diffusion Model: Exact Implementation through Uniformization	Hongrui Chen et.al.	2402.08095	null
2024-02-12	Nearest Neighbour Score Estimators for Diffusion Generative Models	Matthew Niedoba et.al.	2402.08018	link
2024-02-12	Towards a mathematical theory for consistency training in diffusion models	Gen Li et.al.	2402.07802	null
2024-02-12	Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models	Jiacheng Ye et.al.	2402.07754	link
2024-02-12	Cosmology at the Field Level with Probabilistic Machine Learning	Adam Rouhiainen et.al.	2402.07694	null
2024-02-12	Trustworthy SR: Resolving Ambiguity in Image Super-resolution via Diffusion Models and Human Feedback	Cansu Korkmaz et.al.	2402.07597	null
2024-02-12	Score-based Diffusion Models via Stochastic Differential Equations – a Technical Tutorial	Wenpin Tang et.al.	2402.07487	null
2024-02-12	SALAD: Smart AI Language Assistant Daily	Ragib Amin Nihal et.al.	2402.07431	null
2024-02-12	Diff-RNTraj: A Structure-aware Diffusion Model for Road Network-constrained Trajectory Generation	Tonglong Wei et.al.	2402.07369	link
2024-02-11	Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL	Sungyoon Kim et.al.	2402.07226	link
2024-02-11	Towards Fast Stochastic Sampling in Diffusion Generative Models	Kushagra Pandey et.al.	2402.07211	null
2024-02-10	Synthesizing CTA Image Data for Type-B Aortic Dissection using Stable Diffusion Models	Ayman Abaid et.al.	2402.06969	null
2024-02-09	Towards Principled Assessment of Tabular Data Synthesis Algorithms	Yuntao Du et.al.	2402.06806	link
2024-02-09	Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following	Brian Yang et.al.	2402.06559	link
2024-02-09	Sequential Flow Matching for Generative Modeling	Jongmin Yoon et.al.	2402.06461	null
2024-02-09	ControlUDA: Controllable Diffusion-assisted Unsupervised Domain Adaptation for Cross-Weather Semantic Segmentation	Fengyi Shen et.al.	2402.06446	null
2024-02-09	Improving 2D-3D Dense Correspondences with Diffusion Models for 6D Object Pose Estimation	Peter Hönig et.al.	2402.06436	null
2024-02-09	Particle Denoising Diffusion Sampler	Angus Phillips et.al.	2402.06320	link
2024-02-09	Controllable seismic velocity synthesis using generative diffusion models	Fu Wang et.al.	2402.06277	null
2024-02-09	MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models	Yixiao Zhang et.al.	2402.06178	link
2024-02-08	CLR-Face: Conditional Latent Refinement for Blind Face Restoration Using Score-Based Diffusion Models	Maitreya Suin et.al.	2402.06106	null
2024-02-08	Animated Stickers: Bringing Stickers to Life with Video Diffusion	David Yan et.al.	2402.06088	null
2024-02-08	InstaGen: Enhancing Object Detection by Training on Synthetic Dataset	Chengjian Feng et.al.	2402.05937	null
2024-02-08	Time Series Diffusion in the Frequency Domain	Jonathan Crabbé et.al.	2402.05933	link
2024-02-08	AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning	Wamiq Reyaz Para et.al.	2402.05803	null
2024-02-08	DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer	Zhiyuan Ma et.al.	2402.05712	link
2024-02-08	Scalable Diffusion Models with State Space Backbone	Zhengcong Fei et.al.	2402.05608	link
2024-02-08	Get What You Want, Not What You Don’t: Image Content Suppression for Text-to-Image Diffusion Models	Senmao Li et.al.	2402.05375	link
2024-02-08	Descanning: From Scanned to the Original Images with a Color Correction Diffusion Model	Junghun Cha et.al.	2402.05350	null
2024-02-07	SPAD : Spatially Aware Multiview Diffusers	Yash Kant et.al.	2402.05235	null
2024-02-07	Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models	Nicholas Konz et.al.	2402.05210	link
2024-02-07	$λ$-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space	Maitreya Patel et.al.	2402.05195	null
2024-02-07	On diffusion models for amortized inference: Benchmarking and improving stochastic control and sampling	Marcin Sendera et.al.	2402.05098	link
2024-02-07	NITO: Neural Implicit Fields for Resolution-free Topology Optimization	Amin Heyrani Nobari et.al.	2402.05073	link
2024-02-07	LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation	Jiaxiang Tang et.al.	2402.05054	null
2024-02-07	Generative Flows on Discrete State-Spaces: Enabling Multimodal Flows with Applications to Protein Co-Design	Andrew Campbell et.al.	2402.04997	link
2024-02-07	Blue noise for diffusion models	Xingchang Huang et.al.	2402.04930	link
2024-02-07	Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation	Shivang Chopra et.al.	2402.04929	null
2024-02-07	Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints	Jian Chen et.al.	2402.04754	link
2024-02-07	Cortical Surface Diffusion Generative Models	Zhenshan Xie et.al.	2402.04753	null
2024-02-07	EvoSeed: Unveiling the Threat on Deep Neural Networks with Real-World Illusions	Shashank Kotyan et.al.	2402.04699	link
2024-02-07	Noise Map Guidance: Inversion with Spatial Context for Real Image Editing	Hansam Cho et.al.	2402.04625	link
2024-02-07	BRI3L: A Brightness Illusion Image Dataset for Identification and Localization of Regions of Illusory Perception	Aniket Roy et.al.	2402.04541	link
2024-02-07	Text2Street: Controllable Text-to-image Generation for Street Views	Jinming Su et.al.	2402.04504	null
2024-02-06	Fine-Tuned Language Models Generate Stable Inorganic Materials as Text	Nate Gruver et.al.	2402.04379	link
2024-02-06	Bidirectional Autoregressive Diffusion Model for Dance Generation	Canyu Zhang et.al.	2402.04356	link
2024-02-06	Polyp-DDPM: Diffusion-Based Semantic Polyp Synthesis for Enhanced Segmentation	Zolnamar Dorjsembe et.al.	2402.04031	link
2024-02-06	Space Group Constrained Crystal Generation	Rui Jiao et.al.	2402.03992	null
2024-02-06	Controllable Diverse Sampling for Diffusion Based Motion Behavior Forecasting	Yiming Xu et.al.	2402.03981	null
2024-02-06	EscherNet: A Generative Model for Scalable View Synthesis	Xin Kong et.al.	2402.03908	link
2024-02-06	On gauge freedom, conservativity and intrinsic dimensionality estimation in diffusion models	Christian Horvat et.al.	2402.03845	null
2024-02-06	SDEMG: Score-based Diffusion Model for Surface Electromyographic Signal Denoising	Yu-Tung Liu et.al.	2402.03808	link
2024-02-05	Do Diffusion Models Learn Semantically Meaningful and Efficient Representations?	Qiyao Liang et.al.	2402.03305	null
2024-02-05	Zero-shot Object-Level OOD Detection with Context-Aware Inpainting	Quang-Huy Nguyen et.al.	2402.03292	null
2024-02-05	InstanceDiffusion: Instance-level Control for Image Generation	Xudong Wang et.al.	2402.03290	link
2024-02-05	Organic or Diffused: Can We Distinguish Human Art from AI-generated Images?	Anna Yoo Jeong Ha et.al.	2402.03214	null
2024-02-05	Light and Optimal Schrödinger Bridge Matching	Nikita Gushchin et.al.	2402.03207	link
2024-02-05	Guidance with Spherical Gaussian Constraint for Conditional Diffusion	Lingxiao Yang et.al.	2402.03201	link
2024-02-05	Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion	Shiyuan Yang et.al.	2402.03162	null
2024-02-05	PFDM: Parser-Free Virtual Try-on via Diffusion Model	Yunfang Niu et.al.	2402.03047	null
2024-02-05	Diffusive Gibbs Sampling	Wenlin Chen et.al.	2402.03008	link
2024-02-05	DexDiffuser: Generating Dexterous Grasps with Diffusion Models	Zehang Weng et.al.	2402.02989	null
2024-02-05	Retrieval-Augmented Score Distillation for Text-to-3D Generation	Junyoung Seo et.al.	2402.02972	link
2024-02-05	ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis	Bernard Spiegl et.al.	2402.02906	link
2024-02-05	SynthVision – Harnessing Minimal Input for Maximal Output in Computer Vision Models using Synthetic Image data	Yudara Kularathne et.al.	2402.02826	null
2024-02-05	Extreme Two-View Geometry From Object Poses with Diffusion Models	Yujing Sun et.al.	2402.02800	link
2024-02-05	Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning	Yixiang Shan et.al.	2402.02772	null
2024-02-05	DisDet: Exploring Detectability of Backdoor Attack on Diffusion Models	Yang Sui et.al.	2402.02739	null
2024-02-04	DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing	Chong Mou et.al.	2402.02583	link
2024-02-04	Latent Graph Diffusion: A Unified Framework for Generation and Prediction on Graphs	Zhou Cai et.al.	2402.02518	link
2024-02-04	PoCo: Policy Composition from and for Heterogeneous Robot Learning	Lirui Wang et.al.	2402.02511	null
2024-02-04	PromptRR: Diffusion Models as Prompt Generators for Single Image Reflection Removal	Tao Wang et.al.	2402.02374	link
2024-02-01	ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields	Jiahua Dong et.al.	2402.00864	link
2024-02-01	An Analysis of the Variance of Diffusion-based Speech Enhancement	Bunlong Lay et.al.	2402.00811	null
2024-02-01	Distilling Conditional Diffusion Models for Offline Reinforcement Learning through Trajectory Stitching	Shangzhe Li et.al.	2402.00807	null
2024-02-01	AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning	Fu-Yun Wang et.al.	2402.00769	link
2024-01-31	SeFi-IDE: Semantic-Fidelity Identity Embedding for Personalized Diffusion-Based Generation	Yang Li et.al.	2402.00631	null
2024-02-01	Cylindrically symmetric diffusion model for relativistic heavy-ion collisions	Johannes Hoelck et.al.	2402.00628	null
2024-02-01	CapHuman: Capture Your Moments in Parallel Universes	Chao Liang et.al.	2402.00627	link
2024-02-01	Masked Conditional Diffusion Model for Enhancing Deepfake Detection	Tiewen Chen et.al.	2402.00541	null
2024-02-01	Energetic Particles in the Central Starburst, Disc, and Halo of NGC253	Yoel Rephaeli et.al.	2402.00523	null
2024-02-01	LRDif: Diffusion Models for Under-Display Camera Emotion Recognition	Zhifeng Wang et.al.	2402.00250	null
2024-01-31	SuperDiff: Diffusion Models for Conditional Generation of Hypothetical New Families of Superconductors	Samuel Yuan et.al.	2402.00198	link
2024-01-31	Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators	Daniel Geng et.al.	2401.18085	null
2024-01-31	Ljusternik-Schnirelmann eigenvalues for the fractional $m-$Laplacian without the $Δ_2$ condition	Julian Fernandez Bonder et.al.	2401.18041	null
2024-01-31	Diagnosing the particle transport mechanism in the pulsar halo via X-ray observations	Qi-Zuo Wu et.al.	2401.17982	null
2024-01-31	Convergence Analysis for General Probability Flow ODEs of Diffusion Models in Wasserstein Distances	Xuefeng Gao et.al.	2401.17958	null
2024-01-31	AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error	Jonas Ricker et.al.	2401.17879	link
2024-01-31	Drift Diffusion Model to understand (mis)information sharing dynamic in complex networks	Lucila G. Alvarez-Zuzek et.al.	2401.17846	null
2024-01-31	A new class of efficient high order semi-Lagrangian IMEX discontinuous Galerkin methods on staggered unstructured meshes	M. Tavelli et.al.	2401.17806	null
2024-01-31	Dance-to-Music Generation with Encoder-based Textual Inversion of Diffusion Models	Sifei Li et.al.	2401.17800	link
2024-01-31	Image Anything: Towards Reasoning-coherent and Training-free Multi-modal Image Generation	Yuanhuiyi Lyu et.al.	2401.17664	null
2024-01-31	Spatial-and-Frequency-aware Restoration method for Images based on Diffusion Models	Kyungsung Lee et.al.	2401.17629	null
2024-01-31	Topology-Aware Latent Diffusion for 3D Shape Generation	Jiangbei Hu et.al.	2401.17603	null
2024-01-31	Head and Neck Tumor Segmentation from [18F]F-FDG PET/CT Images Based on 3D Diffusion Model	Yafei Dong et.al.	2401.17593	null
2024-01-31	Task-Oriented Diffusion Model Compression	Geonung Kim et.al.	2401.17547	null
2024-01-31	Enhancing Score-Based Sampling Methods with Ensembles	Tobias Bischoff et.al.	2401.17539	null
2024-01-30	You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation	Mehdi Noroozi et.al.	2401.17258	null
2024-01-30	ContactGen: Contact-Guided Interactive 3D Human Generation for Partners	Dongjun Gu et.al.	2401.17212	null
2024-01-30	Transfer Learning for Text Diffusion Models	Kehang Han et.al.	2401.17181	null
2024-01-30	PlantoGraphy: Incorporating Iterative Design Process into Generative Artificial Intelligence for Landscape Rendering	Rong Huang et.al.	2401.17120	null
2024-01-30	Local modification of subdiffusion by initial Fickian diffusion: Multiscale modeling, analysis and computation	Xiangcheng Zheng et.al.	2401.16885	null
2024-01-30	A Literature Review on Fetus Brain Motion Correction in MRI	Haoran Zhang et.al.	2401.16782	null
2024-01-29	Using multiple Dirac delta points to describe inhomogeneous flux density over a cell boundary in a single-cell diffusion model	Qiyao Peng et.al.	2401.16261	null
2024-01-29	Diffutoon: High-Resolution Editable Toon Shading via Diffusion Models	Zhongjie Duan et.al.	2401.16224	null
2024-01-29	Spatial-Aware Latent Initialization for Controllable Image Generation	Wenqiang Sun et.al.	2401.16157	null
2024-01-29	DMCE: Diffusion Model Channel Enhancer for Multi-User Semantic Communication Systems	Youcheng Zeng et.al.	2401.16017	null
2024-01-29	Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling	Xiaoyu Shi et.al.	2401.15977	null
2024-01-29	EmoDM: A Diffusion Model for Evolutionary Multi-objective Optimization	Xueming Yan et.al.	2401.15931	null
2024-01-28	Object-Driven One-Shot Fine-tuning of Text-to-Image Diffusion with Prototypical Embedding	Jianxiang Lu et.al.	2401.15708	null
2024-01-28	Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance	Qingcheng Zhao et.al.	2401.15687	null
2024-01-28	CPDM: Content-Preserving Diffusion Model for Underwater Image Enhancement	Xiaowen Shi et.al.	2401.15649	null
2024-01-28	FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models	Feihong He et.al.	2401.15636	link
2024-01-28	Generative AI-enabled Blockchain Networks: Fundamentals, Applications, and Case Study	Cong T. Nguyen et.al.	2401.15625	null
2024-01-28	Diffusion-based graph generative methods	Hongyang Chen et.al.	2401.15617	link
2024-01-28	Neural Network-Based Score Estimation in Diffusion Models: Optimization and Generalization	Yinbin Han et.al.	2401.15604	null
2024-01-28	BrepGen: A B-rep Generative Diffusion Model with Structured Latent Geometry	Xiang Xu et.al.	2401.15563	link
2024-01-27	Wind speed super-resolution and validation: from ERA5 to CERRA via diffusion models	Fabio Merizzi et.al.	2401.15469	link
2024-01-27	A Survey on Data Augmentation in Large Model Era	Yue Zhou et.al.	2401.15422	link
2024-01-27	GEM: Boost Simple Network for Glass Surface Segmentation via Segment Anything Model and Data Synthesis	Jing Hao et.al.	2401.15282	link
2024-01-26	Annotated Hands for Generative Models	Yue Yang et.al.	2401.15075	link
2024-01-26	Text Image Inpainting via Global Structure-Guided Diffusion Models	Shipeng Zhu et.al.	2401.14832	link
2024-01-25	Opposite variations for pore pressure on and off the fault during simulated earthquakes in the laboratory	Dong Liu et.al.	2401.14506	null
2024-01-25	Deconstructing Denoising Diffusion Models for Self-Supervised Learning	Xinlei Chen et.al.	2401.14404	null
2024-01-25	pix2gestalt: Amodal Segmentation by Synthesizing Wholes	Ege Ozguroglu et.al.	2401.14398	link
2024-01-25	UrbanGenAI: Reconstructing Urban Landscapes using Panoptic Segmentation and Diffusion Models	Timo Kapsalis et.al.	2401.14379	null
2024-01-25	Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation	Minglin Chen et.al.	2401.14257	null
2024-01-25	Scene Graph to Image Synthesis: Integrating CLIP Guidance with Graph Conditioning in Diffusion Models	Rameshwar Mishra et.al.	2401.14111	null
2024-01-25	CreativeSynth: Creative Blending and Synthesis of Visual Arts based on Multimodal Diffusion	Nisha Huang et.al.	2401.14066	link
2024-01-25	Diffusion-based Data Augmentation for Object Counting Problems	Zhen Wang et.al.	2401.13992	null
2024-01-25	BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models	Senthil Purushwalkam et.al.	2401.13974	link
2024-01-25	StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models	Yalong Bai et.al.	2401.13942	null
2024-01-24	Inverse Molecular Design with Multi-Conditional Diffusion Guidance	Gang Liu et.al.	2401.13858	link
2024-01-24	Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All	Mehmet Saygin Seyfioglu et.al.	2401.13795	null
2024-01-24	Guided Diffusion for Fast Inverse Design of Density-based Mechanical Metamaterials	Yanyan Yang et.al.	2401.13570	link
2024-01-25	UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion	Wei Li et.al.	2401.13388	null
2024-01-24	Generative Design of Crystal Structures by Point Cloud Representations and Diffusion Model	Zhelin Li et.al.	2401.13192	link
2024-01-24	Towards Multi-domain Face Landmark Detection with Synthetic Data from Diffusion model	Yuanming Li et.al.	2401.13191	null
2024-01-24	Compositional Generative Inverse Design	Tailin Wu et.al.	2401.13171	link
2024-01-24	Choose Your Diffusion: Efficient and flexible ways to accelerate the diffusion model in fast high energy physics simulation	Cheng Jiang et.al.	2401.13162	null
2024-01-23	GALA: Generating Animatable Layered Assets from a Single Scan	Taeksoo Kim et.al.	2401.12979	null
2024-01-24	Zero-Shot Learning for the Primitives of 3D Affordance in General Objects	Hyeonwoo Kim et.al.	2401.12978	link
2024-01-23	Lumiere: A Space-Time Diffusion Model for Video Generation	Omer Bar-Tal et.al.	2401.12945	null
2024-01-23	UniHDA: Towards Universal Hybrid Domain Adaptation of Image Generators	Hengjia Li et.al.	2401.12596	null
2024-01-23	ToDA: Target-oriented Diffusion Attacker against Recommendation System	Xiaohao Liu et.al.	2401.12578	null
2024-01-23	DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations	Dogyun Park et.al.	2401.12517	link
2024-01-22	DITTO: Diffusion Inference-Time T-Optimization for Music Generation	Zachary Novack et.al.	2401.12179	null
2024-01-22	Single-View 3D Human Digitalization with Large Reconstruction Models	Zhenzhen Weng et.al.	2401.12175	null
2024-01-22	Feature Denoising Diffusion Model for Blind Image Quality Assessment	Xudong Li et.al.	2401.11949	null
2024-01-22	EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models	Koichi Namekata et.al.	2401.11739	null
2024-01-22	Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs	Ling Yang et.al.	2401.11708	link
2024-01-21	Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers	Katherine Crowson et.al.	2401.11605	link
2024-01-20	Diffusion Model Conditioning on Gaussian Mixture Model and Negative Gaussian Mixture Gradient	Weiguo Lu et.al.	2401.11261	null
2024-01-20	Product-Level Try-on: Characteristics-preserving Try-on with Realistic Clothes Shading and Wrinkles	Yanlong Zang et.al.	2401.11239	null
2024-01-20	MotionMix: Weakly-Supervised Diffusion for Controllable Motion Generation	Nhat M. Hoang et.al.	2401.11115	link
2024-01-20	UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with Authenticity Guided Textures	Mingyuan Zhou et.al.	2401.11078	null
2024-01-20	Make-A-Shape: a Ten-Million-scale 3D Shape Model	Ka-Hei Hui et.al.	2401.11067	link
2024-01-19	Synthesizing Moving People with 3D Control	Boyi Li et.al.	2401.10889	null
2024-01-19	ActAnywhere: Subject-Aware Video Background Generation	Boxiao Pan et.al.	2401.10822	null
2024-01-19	From Market Saturation to Social Reinforcement: Understanding the Impact of Non-Linearity in Information Diffusion Models	Tobias Friedrich et.al.	2401.10818	null
2024-01-19	Sat2Scene: 3D Urban Scene Generation from Satellite Images with Diffusion	Zuoyue Li et.al.	2401.10786	null
2024-01-19	Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model	Yinan Zheng et.al.	2401.10700	link
2024-01-19	MAEDiff: Masked Autoencoder-enhanced Diffusion Models for Unsupervised Anomaly Detection in Brain Images	Rui Xu et.al.	2401.10561	null
2024-01-18	Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution	Xin Yuan et.al.	2401.10404	null
2024-01-18	A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting	Wouter Van Gansbeke et.al.	2401.10227	link
2024-01-22	Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation	Changgu Chen et.al.	2401.10150	null
2024-01-18	DiffusionGPT: LLM-Driven Text-to-Image Generation System	Jie Qin et.al.	2401.10061	null
2024-01-18	CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects	Zhao Wang et.al.	2401.09962	null
2024-01-18	BlenDA: Domain Adaptive Object Detection through diffusion-based blending	Tzuhsuan Huang et.al.	2401.09921	link
2024-01-18	Exploring Latent Cross-Channel Embedding for Accurate 3D Human Pose Reconstruction in a Diffusion Framework	Junkun Jiang et.al.	2401.09836	link
2024-01-18	Wavelet-Guided Acceleration of Text Inversion in Diffusion-Based Image Editing	Gwanhyeong Koo et.al.	2401.09794	null
2024-01-18	Image Translation as Diffusion Visual Programmers	Cheng Han et.al.	2401.09742	null
2024-01-17	Total fraction of drug released from diffusion-controlled delivery systems with binding reactions	Elliot J. Carr et.al.	2401.09644	link
2024-01-17	Efficient generative adversarial networks using linear additive-attention Transformers	Emilio Morales-Juarez et.al.	2401.09596	link
2024-01-17	TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion	Yu-Ying Yeh et.al.	2401.09416	null
2024-01-17	Vlogger: Make Your Dream A Vlog	Shaobin Zhuang et.al.	2401.09414	link
2024-01-17	On the $\varepsilon$-Euler-Maruyama scheme for time inhomogeneous jump-driven SDEs	Mireille Bossy et.al.	2401.09338	null
2024-01-17	Siamese Meets Diffusion Network: SMDNet for Enhanced Change Detection in High-Resolution RS Imagery	Jia Jia et.al.	2401.09325	null
2024-01-17	T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis	Yoonjin Chung et.al.	2401.09294	link
2024-01-17	Training-Free Semantic Video Composition via Pre-trained Diffusion Model	Jiaqi Guo et.al.	2401.09195	null
2024-01-17	Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior	Zike Wu et.al.	2401.09050	link
2024-01-17	Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis	Jonghyun Lee et.al.	2401.09048	link
2024-01-17	VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models	Haoxin Chen et.al.	2401.09047	link
2024-01-17	Data Attribution for Diffusion Models: Timestep-induced Bias in Influence Estimation	Tong Xie et.al.	2401.09031	link
2024-01-17	3D Human Pose Analysis via Diffusion Synthesis	Haorui Ji et.al.	2401.08930	null
2024-01-16	Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive	Yumeng Li et.al.	2401.08815	link
2024-01-16	Fixed Point Diffusion Models	Xingjian Bai et.al.	2401.08741	link
2024-01-16	SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers	Nanye Ma et.al.	2401.08740	link
2024-01-16	RoHM: Robust Human Motion Reconstruction via Diffusion	Siwei Zhang et.al.	2401.08570	null
2024-01-16	Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation	Mathis Petrovich et.al.	2401.08559	null
2024-01-16	Modeling Spoof Noise by De-spoofing Diffusion and its Application in Face Anti-spoofing	Bin Zhang et.al.	2401.08275	null
2024-01-16	Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video Localization	Chongzhi Zhang et.al.	2401.08232	null
2024-01-16	Photonic Modes Prediction via Multi-Modal Diffusion Model	Jinyang Sun et.al.	2401.08199	null
2024-01-16	Key-point Guided Deformable Image Manipulation Using Diffusion Model	Seok-Hwan Oh et.al.	2401.08178	null
2024-01-12	A deep implicit-explicit minimizing movement method for option pricing in jump-diffusion models	Emmanuil H. Georgoulis et.al.	2401.06740	null
2024-01-12	Decoupling Pixel Flipping and Occlusion Strategy for Consistent XAI Benchmarks	Stefan Blücher et.al.	2401.06654	link
2024-01-12	Adversarial Examples are Misaligned in Diffusion Model Manifolds	Peter Lorenz et.al.	2401.06637	null
2024-01-12	Motion2VecSets: 4D Latent Vector Set Diffusion for Non-rigid Shape Reconstruction and Tracking	Wei Cao et.al.	2401.06614	null
2024-01-12	360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model	Qian Wang et.al.	2401.06578	null
2024-01-12	RotationDrag: Point-based Image Editing with Rotated Diffusion Features	Minxing Luo et.al.	2401.06442	link
2024-01-12	Seek for Incantations: Towards Accurate Text-to-Image Diffusion Synthesis through Prompt Engineering	Chang Yu et.al.	2401.06345	null
2024-01-11	Frequency-Time Diffusion with Neural Cellular Automata	John Kalkhof et.al.	2401.06291	null
2024-01-11	Demystifying Variational Diffusion Models	Fabio De Sousa Ribeiro et.al.	2401.06281	null
2024-01-11	Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications	Yuwen Xiong et.al.	2401.06197	link
2024-01-11	TriNeRFLet: A Wavelet Based Multiscale Triplane NeRF Representation	Rajaei Khatib et.al.	2401.06191	null
2024-01-11	E$^{2}$GAN: Efficient Training of Efficient GANs for Image-to-Image Translation	Yifan Gong et.al.	2401.06127	null
2024-01-11	DiffDA: a diffusion model for weather-scale data assimilation	Langwen Huang et.al.	2401.05932	link
2024-01-11	Efficient Image Deblurring Networks based on Diffusion Models	Kang Chen et.al.	2401.05907	link
2024-01-11	HiCAST: Highly Customized Arbitrary Style Transfer with Adapter Enhanced Diffusion Models	Hanzhang Wang et.al.	2401.05870	null
2024-01-11	EraseDiff: Erasing Data Influence in Diffusion Models	Jing Wu et.al.	2401.05779	link
2024-01-10	Diffusion Priors for Dynamic View Synthesis from Monocular Videos	Chaoyang Wang et.al.	2401.05583	null
2024-01-10	From Pampas to Pixels: Fine-Tuning Diffusion Models for Gaúcho Heritage	Marcellus Amadeus et.al.	2401.05520	null
2024-01-10	InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes	Mohamad Shahbazi et.al.	2401.05335	null
2024-01-10	Score Distillation Sampling with Learned Manifold Corrective	Thiemo Alldieck et.al.	2401.05293	null
2024-01-10	PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models	Junsong Chen et.al.	2401.05252	link
2024-01-10	Derm-T2IM: Harnessing Synthetic Skin Lesion Data via Stable Diffusion Models for Enhanced Skin Disease Classification using ViT and CNN	Muhammad Ali Farooq et.al.	2401.05159	null
2024-01-10	CrossDiff: Exploring Self-Supervised Representation of Pansharpening via Cross-Predictive Diffusion Model	Yinghui Xing et.al.	2401.05153	null
2024-01-10	SwiMDiff: Scene-wide Matching Contrastive Learning with Diffusion Constraint for Remote Sensing Image	Jiayuan Tian et.al.	2401.05093	null
2024-01-10	A novel bond-based nonlocal diffusion model with matrix-valued coefficients in non-divergence form and its collocation discretization	Lili Ju et.al.	2401.04973	null
2024-01-09	Transmission-eigenchannel velocity and diffusion	Azriel Z. Genack et.al.	2401.04818	null
2024-01-09	DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation	Junming Chen et.al.	2401.04747	null
2024-01-09	Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation	Xiyi Chen et.al.	2401.04728	link
2024-01-09	Efficient estimation for ergodic diffusion processes sampled at high frequency	Michael Sørensen et.al.	2401.04689	null
2024-01-09	EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models	Jingyuan Yang et.al.	2401.04608	null
2024-01-09	Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models	Xuewen Liu et.al.	2401.04585	link
2024-01-09	MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation	Weimin Wang et.al.	2401.04468	null
2024-01-09	D3AD: Dynamic Denoising Diffusion Probabilistic Model for Anomaly Detection	Justin Tebbe et.al.	2401.04463	link
2024-01-09	SonicVisionLM: Playing Sound with Vision Language Models	Zhifeng Xie et.al.	2401.04394	null
2024-01-09	Representative Feature Extraction During Diffusion Process for Sketch Extraction with One Example	Kwan Yun et.al.	2401.04362	null
2024-01-09	Memory-Efficient Personalization using Quantized Diffusion Model	Hyogon Ryu et.al.	2401.04339	null
2024-01-08	FADI-AEC: Fast Score Based Diffusion Model Guided by Far-end Signal for Acoustic Echo Cancellation	Yang Liu et.al.	2401.04283	null
2024-01-08	Robust Image Watermarking using Stable Diffusion	Lijun Zhang et.al.	2401.04247	link
2024-01-08	scDiffusion: conditional generation of high-quality single-cell data using diffusion model	Erpai Luo et.al.	2401.03968	link
2024-01-08	D3PRefiner: A Diffusion-based Denoise Method for 3D Human Pose Refinement	Danqi Yan et.al.	2401.03914	null
2024-01-08	DDM-Lag: A Diffusion-based Decision-making Model for Autonomous Vehicles with Lagrangian Safety Enhancement	Jiaqi Liu et.al.	2401.03629	null
2024-01-07	ROIC-DM: Robust Text Inference and Classification via Diffusion Model	Shilong Yuan et.al.	2401.03514	null
2024-01-07	Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness	Sicheng Yang et.al.	2401.03476	null
2024-01-07	Deep Learning-based Image and Video Inpainting: A Survey	Weize Quan et.al.	2401.03395	null
2024-01-06	Reflected Schrödinger Bridge for Constrained Generative Modeling	Wei Deng et.al.	2401.03228	null
2024-01-06	MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and Beyond	Yupei Lin et.al.	2401.03221	null
2024-01-06	Fair Sampling in Diffusion Models through Switching Mechanism	Yujin Choi et.al.	2401.03140	link
2024-01-05	Latte: Latent Diffusion Transformer for Video Generation	Xin Ma et.al.	2401.03048	link
2024-01-05	The Rise of Diffusion Models in Time-Series Forecasting	Caspar Meijer et.al.	2401.03006	link
2024-01-08	Uncovering the human motion pattern: Pattern Memory-based Diffusion Model for Trajectory Prediction	Yuxin Yang et.al.	2401.02916	null
2024-01-05	Plug-in Diffusion Model for Sequential Recommendation	Haokai Ma et.al.	2401.02913	link
2024-01-05	Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors	Top Piriyakulkij et.al.	2401.02739	link
2024-01-05	Geometric-Facilitated Denoising Diffusion Model for 3D Molecule Generation	Can Xu et.al.	2401.02683	link
2024-01-04	Comprehensive Exploration of Synthetic Data Generation: A Survey	André Bauer et.al.	2401.02524	null
2024-01-04	VASE: Object-Centric Appearance and Shape Manipulation of Real Videos	Elia Peruzzo et.al.	2401.02473	null
2024-01-04	Bring Metric Functions into Diffusion Models	Jie An et.al.	2401.02414	null
2024-01-06	GUESS: GradUally Enriching SyntheSis for Text-Driven Human Motion Generation	Xuehao Gao et.al.	2401.02142	link
2024-01-04	Preserving Image Properties Through Initializations in Diffusion Models	Jeffrey Zhang et.al.	2401.02097	null
2024-01-04	Energy based diffusion generator for efficient sampling of Boltzmann distributions	Yan Wang et.al.	2401.02080	null
2024-01-04	DiffusionEdge: Diffusion Probabilistic Model for Crisp Edge Detection	Yunfan Ye et.al.	2401.02032	link
2024-01-04	Improving Diffusion-Based Image Synthesis with Context Prediction	Ling Yang et.al.	2401.02015	null
2024-01-03	Instruct-Imagen: Image Generation with Multi-modal Instruction	Hexiang Hu et.al.	2401.01952	null
2024-01-03	Can We Generate Realistic Hands Only Using Convolution?	Mehran Hosseini et.al.	2401.01951	null
2024-01-03	Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions	David Junhao Zhang et.al.	2401.01827	link
2024-01-03	DiffYOLO: Object Detection for Anti-Noise via YOLO and Diffusion Models	Yichen Liu et.al.	2401.01659	null
2024-01-03	SIGNeRF: Scene Integrated Generation for Neural Radiance Fields	Jan-Niklas Dihlmann et.al.	2401.01647	null
2024-01-03	S$^{2}$-DMs: Skip-Step Diffusion Models	Yixuan Wang et.al.	2401.01520	link
2024-01-02	ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and Text	Dingkun Yan et.al.	2401.01456	link
2024-01-02	VALD-MD: Visual Attribution via Latent Diffusion for Medical Diagnostics	Ammar A. Siddiqui et.al.	2401.01414	null
2024-01-01	DiffAugment: Diffusion based Long-Tailed Visual Relationship Recognition	Parul Gupta et.al.	2401.01387	null
2024-01-02	VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM	Fuchen Long et.al.	2401.01256	link
2024-01-02	Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation	Renshuai Liu et.al.	2401.01207	null
2024-01-02	A comparative study of resistivity models for simulations of magnetic reconnection in the solar atmosphere. II. Plasmoid formation	Øystein Håvard Færder et.al.	2401.01177	null
2024-01-02	Joint Generative Modeling of Scene Graphs and Images via Diffusion Models	Bicheng Xu et.al.	2401.01130	null
2024-01-02	Robust single-particle cryo-EM image denoising and restoration	Jing Zhang et.al.	2401.01097	null
2024-01-02	Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation	Jinlong Xue et.al.	2401.01044	link
2024-01-01	DiffMorph: Text-less Image Morphing with Diffusion Models	Shounak Chatterjee et.al.	2401.00739	null
2024-01-01	Diffusion Models, Image Super-Resolution And Everything: A Survey	Brian B. Moser et.al.	2401.00736	null
2024-01-02	GD$^{2}$-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields	Xiao Pan et.al.	2401.00616	null
2024-01-03	Diff-PCR: Diffusion-Based Correspondence Searching in Doubly Stochastic Matrix Space for Point Cloud Registration	Qianliang Wu et.al.	2401.00436	null
2023-12-31	SynCDR: Training Cross Domain Retrieval Models with Synthetic Data	Samarth Mishra et.al.	2401.00420	link
2023-12-31	Controllable Safety-Critical Closed-loop Traffic Simulation via Guided Diffusion	Wei-Jer Chang et.al.	2401.00391	null
2023-12-30	Probing the Limits and Capabilities of Diffusion Models for the Anatomic Editing of Digital Twins	Karim Kadry et.al.	2401.00247	null
2023-12-28	iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views	Chin-Hsuan Wu et.al.	2312.17250	link
2023-12-28	Personalized Restoration via Dual-Pivot Tuning	Pradyumna Chari et.al.	2312.17234	null
2023-12-28	4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency	Yuyang Yin et.al.	2312.17225	null
2023-12-28	Restoration by Generation with Constrained Priors	Zheng Ding et.al.	2312.17161	null
2023-12-28	DiffKG: Knowledge Graph Diffusion Model for Recommendation	Yangqin Jiang et.al.	2312.16890	link
2023-12-28	DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaption by Combining 3D GANs and Diffusion Priors	Biwen Lei et.al.	2312.16837	null
2023-12-27	I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models	Xun Guo et.al.	2312.16693	link
2023-12-27	Forgery-aware Adaptive Transformer for Generalizable Synthetic Image Detection	Huan Liu et.al.	2312.16649	link
2023-12-27	Image Restoration by Denoising Diffusion Models with Iteratively Preconditioned Guidance	Tomer Garber et.al.	2312.16519	link
2023-12-27	PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion	Guansong Lu et.al.	2312.16486	null
2023-12-27	SVGDreamer: Text Guided SVG Generation with Diffusion Model	Ximing Xing et.al.	2312.16476	link
2023-12-27	Natural Adversarial Patch Generation Method Based on Latent Diffusion Model	Xianyi Chen et.al.	2312.16401	null
2023-12-26	One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications	Mengyao Lyu et.al.	2312.16145	null
2023-12-26	Compositional Search of Stable Crystalline Structures in Multi-Component Alloys Using Generative Diffusion Models	Grzegorz Kaszuba et.al.	2312.16073	null
2023-12-26	HarmonyView: Harmonizing Consistency and Diversity in One-Image-to-3D	Sangmin Woo et.al.	2312.15980	link
2023-12-26	Semantic Guidance Tuning for Text-To-Image Diffusion Models	Hyun Kang et.al.	2312.15964	link
2023-12-26	Implied volatility (also) is path-dependent	Hervé Andrès et.al.	2312.15950	link
2023-12-26	EnchantDance: Unveiling the Potential of Music-Driven Dance Movement	Bo Han et.al.	2312.15946	link
2023-12-26	Generating and Reweighting Dense Contrastive Patterns for Unsupervised Anomaly Detection	Songmin Dai et.al.	2312.15911	null
2023-12-26	Cross Initialization for Personalized Text-to-Image Generation	Lianyu Pang et.al.	2312.15905	link
2023-12-21	Diffusion Reward: Learning Rewards via Conditional Video Diffusion	Tao Huang et.al.	2312.14134	link
2023-12-21	Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation	Philipp Schröppel et.al.	2312.14124	link
2023-12-21	HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models	Hayk Manukyan et.al.	2312.14091	link
2023-12-21	Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning	Desai Xie et.al.	2312.13980	null
2023-12-21	Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models	Xianfang Zeng et.al.	2312.13913	link
2023-12-21	Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models	Huan Ling et.al.	2312.13763	null
2023-12-21	Free-Editor: Zero-shot Text-driven 3D Scene Editing	Nazmul Karim et.al.	2312.13663	link
2023-12-21	Diff-Oracle: Diffusion Model for Oracle Character Generation with Controllable Styles and Contents	Jing Li et.al.	2312.13631	null
2023-12-21	Navigating the Structured What-If Spaces: Counterfactual Generation via Structured Diffusion	Nishtha Madaan et.al.	2312.13616	null
2023-12-21	Front stability of infinitely steep travelling waves in population biology	Matthew J Simpson et.al.	2312.13601	link
2023-12-20	Unlocking Pre-trained Image Backbones for Semantic Image Synthesis	Tariq Berrada et.al.	2312.13314	null
2023-12-21	Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting	Junwu Zhang et.al.	2312.13271	link
2023-12-20	Conditional Image Generation with Pretrained Generative Model	Rajesh Shrestha et.al.	2312.13253	null
2023-12-20	Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model	Saurabh Saxena et.al.	2312.13252	null
2023-12-20	Diffusion Models With Learned Adaptive Noise	Subham Sekhar Sahoo et.al.	2312.13236	link
2023-12-21	DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis	Yuming Gu et.al.	2312.13016	link
2023-12-20	RadEdit: stress-testing biomedical vision models via diffusion image editing	Fernando Pérez-García et.al.	2312.12865	null
2023-12-20	ReCo-Diff: Explore Retinex-Based Condition Strategy in Diffusion Model for Low-Light Image Enhancement	Yuhui Wu et.al.	2312.12826	null
2023-12-20	All but One: Surgical Concept Erasing with Model Preservation in Text-to-Image Diffusion Models	Seunghoo Hong et.al.	2312.12807	null
2023-12-21	AMD: Anatomical Motion Diffusion with Interpretable Motion Decomposition and Fusion	Beibei Jing et.al.	2312.12763	null
2023-12-20	How Good Are Deep Generative Models for Solving Inverse Problems?	Shichong Peng et.al.	2312.12691	null
2023-12-19	Surf-CDM: Score-Based Surface Cold-Diffusion Model For Medical Image Segmentation	Fahim Ahmed Zaman et.al.	2312.12649	null
2023-12-19	Fixed-point Inversion for Text-to-image diffusion models	Barak Meiri et.al.	2312.12540	link
2023-12-19	StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation	Akio Kodaira et.al.	2312.12491	link
2023-12-19	InstructVideo: Instructing Video Diffusion Models with Human Feedback	Hangjie Yuan et.al.	2312.12490	null
2023-12-19	Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models	Angela Castillo et.al.	2312.12487	null
2023-12-19	On Inference Stability for Diffusion Models	Viet Nguyen et.al.	2312.12431	link
2023-12-19	Scene-Conditional 3D Object Stylization and Composition	Jinghao Zhou et.al.	2312.12419	null
2023-12-19	Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models	Shweta Mahajan et.al.	2312.12416	null
2023-12-19	Travelling pulses on three spatial scales in a Klausmeier-type vegetation-autotoxicity model	Paul Carter et.al.	2312.12277	null
2023-12-19	Intrinsic Image Diffusion for Single-view Material Estimation	Peter Kocsis et.al.	2312.12274	link
2023-12-18	A novel diffusion recommendation algorithm based on multi-scale cnn and residual lstm	Yong Niu et.al.	2312.10885	null
2023-12-17	Your Student is Better Than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models	Nikita Starodubcev et.al.	2312.10835	link
2023-12-17	CogCartoon: Towards Practical Story Visualization	Zhongyang Zhu et.al.	2312.10718	null
2023-12-17	VidToMe: Video Token Merging for Zero-Shot Video Editing	Xirui Li et.al.	2312.10656	link
2023-12-16	VecFusion: Vector Font Generation with Diffusion	Vikas Thamizharasan et.al.	2312.10540	null
2023-12-16	A Unified Filter Method for Jointly Estimating State and Parameters of Stochastic Dynamical Systems via the Ensemble Score Filter	Feng Bao et.al.	2312.10503	null
2023-12-16	Continuous Diffusion for Mixed-Type Tabular Data	Markus Mueller et.al.	2312.10431	link
2023-12-16	Lecture Notes in Probabilistic Diffusion Models	Inga Strümke et.al.	2312.10393	null
2023-12-16	Image Restoration Through Generalized Ornstein-Uhlenbeck Bridge	Conghan Yue et.al.	2312.10299	link
2023-12-15	Two simple criterion to prove the existence of patterns in reaction-diffusion models of two components	Francisco J. Vielma-Leal et.al.	2312.10231	null
2023-12-15	Tell Me What You See: Text-Guided Real-World Image Denoising	Erez Yosef et.al.	2312.10191	null
2023-12-15	Improving new physics searches with diffusion models for event observables and jet constituents	Debajyoti Sengupta et.al.	2312.10130	null
2023-12-15	MVHuman: Tailoring 2D Diffusion with Multi-view Sampling For Realistic 3D Human Generation	Suyi Jiang et.al.	2312.10120	null
2023-12-15	Plasticine3D: Non-rigid 3D editting with text guidance	Yige Chen et.al.	2312.10111	null
2023-12-15	Latent Diffusion Models with Image-Derived Annotations for Enhanced AI-Assisted Cancer Diagnosis in Histopathology	Pedro Osorio et.al.	2312.09792	null
2023-12-15	DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models	Yifeng Ma et.al.	2312.09767	link
2023-12-15	PPFM: Image denoising in photon-counting CT using single-step posterior sampling Poisson flow generative models	Dennis Hein et.al.	2312.09754	link
2023-12-15	Positivity and global existence for nonlocal advection-diffusion models of interacting populations	Valeria Giunta et.al.	2312.09692	null
2023-12-15	Exploring the Feasibility of Generating Realistic 3D Models of Endangered Species Using DreamGaussian: An Analysis of Elevation Angle’s Impact on Model Generation	Selcuk Anil Karatopak et.al.	2312.09682	null
2023-12-15	Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models	Senmao Li et.al.	2312.09608	link
2023-12-14	LIME: Localized Image Editing via Attention Regularization in Diffusion Models	Enis Simsar et.al.	2312.09256	null
2023-12-14	FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Injection	Hongsuk Choi et.al.	2312.09252	null
2023-12-14	Single Mesh Diffusion Models with Field Latents for Texture Generation	Thomas W. Mitchel et.al.	2312.09250	null
2023-12-14	A framework for conditional diffusion modelling with applications in motif scaffolding for protein design	Kieran Didi et.al.	2312.09236	null
2023-12-14	Mosaic-SDF for 3D Generative Models	Lior Yariv et.al.	2312.09222	null
2023-12-14	Fast Sampling via De-randomization for Discrete Diffusion Models	Zixiang Chen et.al.	2312.09193	null
2023-12-14	Improving Efficiency of Diffusion Models via Multi-Stage Framework and Tailored Multi-Decoder Architectures	Huijie Zhang et.al.	2312.09181	link
2023-12-14	DiffusionLight: Light Probes for Free by Painting a Chrome Ball	Pakkapon Phongthawee et.al.	2312.09168	link
2023-12-14	Triplane Meets Gaussian Splatting: Fast and Generalizable Single-View 3D Reconstruction with Transformers	Zi-Xin Zou et.al.	2312.09147	null
2023-12-14	VideoLCM: Video Latent Consistency Model	Xiang Wang et.al.	2312.09109	null
2023-12-14	PI3D: Efficient Text-to-3D Generation with Pseudo-Image Diffusion	Ying-Tian Liu et.al.	2312.09069	null
2023-12-14	Brain Diffuser with Hierarchical Transformer for MCI Causality Analysis	Qiankun Zuo et.al.	2312.09022	null
2023-12-14	OMG: Towards Open-vocabulary Motion Generation via Mixture of Controllers	Han Liang et.al.	2312.08985	null
2023-12-14	Motion Flow Matching for Human Motion Synthesis and Editing	Vincent Tao Hu et.al.	2312.08895	null
2023-12-14	VaLID: Variable-Length Input Diffusion for Novel View Synthesis	Shijie Li et.al.	2312.08892	null
2023-12-14	Diffusion-C: Unveiling the Generative Challenges of Diffusion Models through Corrupted Data	Keywoong Bae et.al.	2312.08843	null
2023-12-14	Speeding up Photoacoustic Imaging using Diffusion Models	Irem Loc et.al.	2312.08834	link
2023-12-14	Guided Diffusion from Self-Supervised Diffusion Features	Vincent Tao Hu et.al.	2312.08825	null
2023-12-14	Reconstruction of Sound Field through Diffusion Models	Federico Miotello et.al.	2312.08821	null
2023-12-14	Local Conditional Controlling for Text-to-Image Diffusion Models	Yibo Zhao et.al.	2312.08768	link
2023-12-13	PhenDiff: Revealing Invisible Phenotypes with Conditional Diffusion Models	Anis Bourou et.al.	2312.08290	link
2023-12-13	Black-box Membership Inference Attacks against Fine-tuned Diffusion Models	Yan Pang et.al.	2312.08207	link
2023-12-13	Concept-centric Personalization with Large-scale Diffusion Priors	Pu Cao et.al.	2312.08195	link
2023-12-13	$ρ$-Diffusion: A diffusion-based density estimation framework for computational physics	Maxwell X. Cai et.al.	2312.08153	link
2023-12-13	Clockwork Diffusion: Efficient Generation With Model-Step Distillation	Amirhossein Habibian et.al.	2312.08128	link
2023-12-13	Knowledge-Aware Artifact Image Synthesis with LLM-Enhanced Prompting and Multi-Source Supervision	Shengguang Wu et.al.	2312.08056	null
2023-12-13	Compositional Inversion for Stable Diffusion Models	Xu-Lu Zhang et.al.	2312.08048	link
2023-12-13	AdapEdit: Spatio-Temporal Guided Adaptive Editing Algorithm for Text-Based Continuity-Sensitive Image Editing	Zhiyuan Ma et.al.	2312.08019	link
2023-12-13	Time Series Diffusion Method: A Denoising Diffusion Probabilistic Model for Vibration Signal Generation	Haiming Yi et.al.	2312.07981	null
2023-12-13	LMD: Faster Image Reconstruction with Latent Masking Diffusion	Zhiyuan Ma et.al.	2312.07971	link
2023-12-13	Semantic-aware Data Augmentation for Text-to-image Synthesis	Zhaorui Tan et.al.	2312.07951	link
2023-12-13	BOTH2Hands: Inferring 3D Hands from Both Text Prompts and Body Dynamics	Wenqian Zhang et.al.	2312.07937	link
2023-12-13	SimAC: A Simple Anti-Customization Method against Text-to-Image Synthesis of Diffusion Models	Feifei Wang et.al.	2312.07865	link
2023-12-13	Diffusion Models Enable Zero-Shot Pose Estimation for Lower-Limb Prosthetic Users	Tianxun Zhou et.al.	2312.07854	null
2023-12-13	Noise in the reverse process improves the approximation capabilities of diffusion models	Karthik Elamvazhuthi et.al.	2312.07851	null
2023-12-13	Stable Rivers: A Case Study in the Application of Text-to-Image Generative Models for Earth Sciences	C Kupferschmidt et.al.	2312.07833	null
2023-12-12	Brain-optimized inference improves reconstructions of fMRI brain activity	Reese Kneeland et.al.	2312.07705	link
2023-12-12	FreeInit: Bridging Initialization Gap in Video Diffusion Models	Tianxing Wu et.al.	2312.07537	link
2023-12-12	FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition	Sicheng Mo et.al.	2312.07536	null
2023-12-12	Cosmological Field Emulation and Parameter Inference with Diffusion Models	Nayantara Mudur et.al.	2312.07534	null
2023-12-11	CAD: Photorealistic 3D Generation via Adversarial Distillation	Ziyu Wan et.al.	2312.06663	null
2023-12-11	Photorealistic Video Generation with Diffusion Models	Agrim Gupta et.al.	2312.06662	null
2023-12-11	UpFusion: Novel View Diffusion from Unposed Sparse View Observations	Bharath Raj Nagoor Kani et.al.	2312.06661	null
2023-12-11	Sherpa3D: Boosting High-Fidelity Text-to-3D Generation via Coarse 3D Prior	Fangfu Liu et.al.	2312.06655	link
2023-12-11	Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution	Shangchen Zhou et.al.	2312.06640	null
2023-12-11	DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection	Haoyang He et.al.	2312.06607	link
2023-12-11	ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models	Denis Zavadski et.al.	2312.06573	link
2023-12-11	HOI-Diff: Text-Driven Synthesis of 3D Human-Object Interactions using Diffusion Models	Xiaogang Peng et.al.	2312.06553	null
2023-12-11	STDiff: Spatio-temporal Diffusion for Continuous Stochastic Video Prediction	Xi Ye et.al.	2312.06486	link
2023-12-11	Semantic Image Synthesis for Abdominal CT	Yan Zhuang et.al.	2312.06453	null
2023-12-11	DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior	Tianyu Huang et.al.	2312.06439	link
2023-12-11	DiT-Head: High-Resolution Talking Head Synthesis using Diffusion Transformers	Aaron Mir et.al.	2312.06400	null
2023-12-11	PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization	Xu Peng et.al.	2312.06354	null
2023-12-11	DiffAIL: Diffusion Adversarial Imitation Learning	Bingzheng Wang et.al.	2312.06348	link
2023-12-11	Compensation Sampling for Improved Convergence in Diffusion Models	Hui Lu et.al.	2312.06285	link
2023-12-11	UIEDP: Underwater Image Enhancement with Diffusion Prior	Dazhao Du et.al.	2312.06240	link
2023-12-11	The Journey, Not the Destination: How Data Guides Diffusion Models	Kristian Georgiev et.al.	2312.06205	link
2023-12-11	Offloading and Quality Control for AI Generated Content Services in Edge Computing Networks	Yitong Wang et.al.	2312.06203	null
2023-12-11	Optimized View and Geometry Distillation from Multi-view Diffuser	Youjia Zhang et.al.	2312.06198	link
2023-12-11	SP-DiffDose: A Conditional Diffusion Model for Radiation Dose Prediction Based on Multi-Scale Fusion of Anatomical Structures, Guided by SwinTransformer and Projector	Linjie Fu et.al.	2312.06187	null
2023-12-07	Gen2Det: Generate to Detect	Saksham Suri et.al.	2312.04566	null
2023-12-07	NeRFiller: Completing Scenes via Generative 3D Inpainting	Ethan Weber et.al.	2312.04560	null
2023-12-07	PrimDiffusion: Volumetric Primitives Diffusion for 3D Human Generation	Zhaoxi Chen et.al.	2312.04559	link
2023-12-07	GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation	Shoufa Chen et.al.	2312.04557	null
2023-12-07	Generating Illustrated Instructions	Sachit Menon et.al.	2312.04552	link
2023-12-07	PlayFusion: Skill Acquisition via Diffusion from Language-Annotated Play	Lili Chen et.al.	2312.04549	null
2023-12-07	Diffusion Reflectance Map: Single-Image Stochastic Inverse Rendering of Illumination and Reflectance	Yuto Enyo et.al.	2312.04529	null
2023-12-07	RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models	Ozgur Kara et.al.	2312.04524	link
2023-12-07	Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation	Zhiwu Qing et.al.	2312.04483	link
2023-12-07	Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion	Kiran Chhatre et.al.	2312.04466	link
2023-12-07	FitDiff: Robust monocular 3D facial shape and reflectance estimation using Diffusion Models	Stathis Galanakis et.al.	2312.04465	null
2023-12-07	DreamVideo: Composing Your Dream Videos with Customized Subject and Motion	Yujie Wei et.al.	2312.04433	link
2023-12-07	Approximate Caching for Efficiently Serving Diffusion Models	Shubham Agarwal et.al.	2312.04429	null
2023-12-07	Cascade-Zero123: One Image to Highly Consistent 3D with Self-Prompted Nearby Views	Yabo Chen et.al.	2312.04424	null
2023-12-07	Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models	Jiayi Guo et.al.	2312.04410	link
2023-12-07	Adversarial Denoising Diffusion Model for Unsupervised Anomaly Detection	Jongmin Yu et.al.	2312.04382	null
2023-12-07	Generating Multiphase Fluid Configurations in Fractures using Diffusion Models	Jaehong Chung et.al.	2312.04375	null
2023-12-07	Investigating the Design Space of Diffusion Models for Speech Enhancement	Philippe Gonzalez et.al.	2312.04370	link
2023-12-07	Improved Efficient Two-Stage Denoising Diffusion Power System Measurement Recovery Against False Data Injection Attacks and Data Losses	Jianhua Pei et.al.	2312.04346	null
2023-12-07	Multi-View Unsupervised Image Generation with Cross Attention Guidance	Llukman Cerkezi et.al.	2312.04337	null
2023-12-06	Self-conditioned Image Generation via Generating Representations	Tianhong Li et.al.	2312.03701	link
2023-12-06	Memory Triggers: Unveiling Memorization in Text-To-Image Generative Models through Word-Level Duplication	Ali Naseh et.al.	2312.03692	null
2023-12-06	WarpDiffusion: Efficient Diffusion Model for High-Fidelity Virtual Try-on	Xujie Zhang et.al.	2312.03667	null
2023-12-06	TokenCompose: Grounding Diffusion with Token-level Supervision	Zirui Wang et.al.	2312.03626	link
2023-12-06	DreamComposer: Controllable 3D Object Generation via Multi-View Conditions	Yunhan Yang et.al.	2312.03611	link
2023-12-06	DiffusionSat: A Generative Foundation Model for Satellite Imagery	Samar Khanna et.al.	2312.03606	null
2023-12-06	MMM: Generative Masked Motion Model	Ekkasit Pinyoanuntapong et.al.	2312.03596	link
2023-12-06	Personalized Face Inpainting with Diffusion Models by Parallel Visual Attention	Jianjin Xu et.al.	2312.03556	null
2023-12-06	FoodFusion: A Latent Diffusion Model for Realistic Food Image Generation	Olivia Markham et.al.	2312.03540	null
2023-12-06	FRDiff: Feature Reuse for Exquisite Zero-shot Acceleration of Diffusion Models	Junhyuk So et.al.	2312.03517	null
2023-12-06	Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis	Zehua Chen et.al.	2312.03491	null
2023-12-06	F3-Pruning: A Training-Free and Generalized Pruning Strategy towards Faster and Finer Text-to-Video Synthesis	Sitong Su et.al.	2312.03459	null
2023-12-06	Generalized Contrastive Divergence: Joint Training of Energy-Based Model and Diffusion Model through Inverse Reinforcement Learning	Sangwoong Yoon et.al.	2312.03397	null
2023-12-06	Diffused Task-Agnostic Milestone Planner	Mineui Hong et.al.	2312.03395	null
2023-12-06	DiffPMAE: Diffusion Masked Autoencoders for Point Cloud Reconstruction	Yanlong Li et.al.	2312.03298	link
2023-12-06	Cache Me if You Can: Accelerating Diffusion Models through Block Caching	Felix Wimbauer et.al.	2312.03209	null
2023-12-05	ViscoNet: Bridging and Harmonizing Visual and Textual Conditioning for ControlNet	Soon Yau Cheong et.al.	2312.03154	link
2023-12-05	DiffusionPCR: Diffusion Models for Robust Multi-Step Point Cloud Registration	Zhi Chen et.al.	2312.03053	link
2023-12-05	Alchemist: Parametric Control of Material Properties with Diffusion Models	Prafull Sharma et.al.	2312.02970	null
2023-12-05	AmbiGen: Generating Ambigrams from Pre-trained Diffusion Model	Boheng Zhao et.al.	2312.02967	null
2023-12-04	Latent Feature-Guided Diffusion Models for Shadow Removal	Kangfu Mei et.al.	2312.02156	null
2023-12-04	Readout Guidance: Learning Control from Diffusion Features	Grace Luo et.al.	2312.02150	null
2023-12-04	Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation	Bingxin Ke et.al.	2312.02145	link
2023-12-04	DiffiT: Diffusion Vision Transformers for Image Generation	Ali Hatamizadeh et.al.	2312.02139	link
2023-12-04	Stochastic Optimal Control Matching	Carles Domingo-Enrich et.al.	2312.02027	link
2023-12-04	UniGS: Unified Representation for Image Generation and Segmentation	Lu Qi et.al.	2312.01985	link
2023-12-04	Generalization by Adaptation: Diffusion-Based Domain Extension for Domain-Generalized Semantic Segmentation	Joshua Niemeijer et.al.	2312.01850	link
2023-12-04	Collaborative Neural Painting	Nicola Dall’Asen et.al.	2312.01800	null
2023-12-04	Open-DDVM: A Reproduction and Extension of Diffusion Model for Optical Flow Estimation	Qiaole Dong et.al.	2312.01746	link
2023-12-04	Fully Spiking Denoising Diffusion Implicit Models	Ryo Watanabe et.al.	2312.01742	link
2023-12-04	StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On	Jeongho Kim et.al.	2312.01725	link
2023-12-04	ResEnsemble-DDPM: Residual Denoising Diffusion Probabilistic Models for Ensemble Learning	Shi Zhenning et.al.	2312.01682	null
2023-12-03	CalliPaint: Chinese Calligraphy Inpainting with Diffusion Model	Qisheng Liao et.al.	2312.01536	null
2023-12-03	CityGen: Infinite and Controllable 3D City Layout Generation	Jie Deng et.al.	2312.01508	null
2023-12-03	Existence of finite time blow-up in Keller-Segel system	Federico Buseghin et.al.	2312.01475	null
2023-12-03	Distilling Functional Rearrangement Priors from Large Models	Yiming Zeng et.al.	2312.01474	null
2023-12-03	Diffusion Posterior Sampling for Nonlinear CT Reconstruction	Shudong Li et.al.	2312.01464	null
2023-12-03	Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models	Shengqu Cai et.al.	2312.01409	null
2023-12-03	Improving In-Context Learning in Diffusion Models with Visual Context-Modulated Prompts	Tianqi Chen et.al.	2312.01408	null
2023-12-03	ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models	Jeong-gi Kwak et.al.	2312.01305	null
2023-11-30	VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models	Zhen Xing et.al.	2311.18837	null
2023-11-30	ART $\boldsymbol{\cdot}$ V: Auto-Regressive Text-to-Video Generation with Diffusion Models	Wenming Weng et.al.	2311.18834	null
2023-11-30	Exploiting Diffusion Prior for Generalizable Pixel-Level Semantic Prediction	Hsin-Ying Lee et.al.	2311.18832	link
2023-11-30	MotionEditor: Editing Video Motion via Content-Aware Diffusion	Shuyuan Tu et.al.	2311.18830	link
2023-11-30	MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation	Yanhui Wang et.al.	2311.18829	null
2023-11-30	One-step Diffusion with Distribution Matching Distillation	Tianwei Yin et.al.	2311.18828	null
2023-11-30	ElasticDiffusion: Training-free Arbitrary Size Image Generation	Moayed Haji-Ali et.al.	2311.18822	link
2023-11-30	Continual Diffusion with STAMINA: STack-And-Mask INcremental Adapters	James Seale Smith et.al.	2311.18763	null
2023-11-30	Detailed Human-Centric Text Description-Driven Large Scene Synthesis	Gwanghyun Kim et.al.	2311.18654	null
2023-11-30	Contrastive Denoising Score for Text-guided Latent Diffusion Image Editing	Hyelin Nam et.al.	2311.18608	null
2023-11-30	DifAugGAN: A Practical Diffusion-style Data Augmentation for GAN-based Single Image Super-resolution	Axi Niu et.al.	2311.18508	null
2023-11-30	Layered Rendering Diffusion Model for Zero-Shot Guided Image Synthesis	Zipeng Qi et.al.	2311.18435	null
2023-11-30	CAT-DM: Controllable Accelerated Virtual Try-on with Diffusion Model	Jianhao Zeng et.al.	2311.18405	link
2023-11-30	Age Effects on Decision-Making, Drift Diffusion Model	Zahra Kavian et.al.	2311.18376	null
2023-11-30	Prompt-Based Exemplar Super-Compression and Regeneration for Class-Incremental Learning	Ruxiao Duan et.al.	2311.18266	link
2023-11-30	Diffusion Models Without Attention	Jing Nathan Yan et.al.	2311.18257	null
2023-11-30	SMaRt: Improving GANs with Score Matching Regularity	Mengfei Xia et.al.	2311.18208	null
2023-11-30	HiPA: Enabling One-Step Text-to-Image Diffusion Models via High-Frequency-Promoting Adaptation	Yifan Zhang et.al.	2311.18158	null
2023-11-29	Zooming Out on Zooming In: Advancing Super-Resolution for Remote Sensing	Piper Wolters et.al.	2311.18082	link
2023-11-29	DiffGEPCI: 3D MRI Synthesis from mGRE Signals using 2.5D Diffusion Model	Yuyang Hu et.al.	2311.18073	null
2023-11-29	Do text-free diffusion models learn discriminative visual representations?	Soumik Mukhopadhyay et.al.	2311.17921	link
2023-11-29	Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models	Daniel Geng et.al.	2311.17919	null
2023-11-29	AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text	Jianfeng Zhang et.al.	2311.17917	null
2023-11-29	CG3D: Compositional Generation for Text-to-3D via Gaussian Splatting	Alexander Vilesov et.al.	2311.17907	null
2023-11-29	SODA: Bottleneck Diffusion Models for Representation Learning	Drew A. Hudson et.al.	2311.17901	null
2023-11-29	Leveraging Graph Diffusion Models for Network Refinement Tasks	Puja Trivedi et.al.	2311.17856	null
2023-11-29	SPiC-E: Structural Priors in 3D Diffusion Models using Cross Entity Attention	Etai Sella et.al.	2311.17834	null
2023-11-29	Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers	Chi-Pin Huang et.al.	2311.17717	link
2023-11-29	Fair Text-to-Image Diffusion via Fair Mapping	Jia Li et.al.	2311.17695	null
2023-11-29	AnyLens: A Generative Diffusion Model with Any Rendering Lens	Andrey Voynov et.al.	2311.17609	null
2023-11-29	Query-Relevant Images Jailbreak Large Multi-Modal Models	Xin Liu et.al.	2311.17600	link
2023-11-29	Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning	Liang Peng et.al.	2311.17536	link
2023-11-29	HiDiffusion: Unlocking High-Resolution Creativity and Efficiency in Low-Resolution Trained Diffusion Models	Shen Zhang et.al.	2311.17528	null
2023-11-29	MMA-Diffusion: MultiModal Attack on Diffusion Models	Yijun Yang et.al.	2311.17516	link
2023-11-29	When StyleGAN Meets Stable Diffusion: a $\mathscr{W}_+$ Adapter for Personalized Image Generation	Xiaoming Li et.al.	2311.17461	link
2023-11-29	DifFlow3D: Toward Robust Uncertainty-Aware Scene Flow Estimation with Diffusion Model	Jiuming Liu et.al.	2311.17456	link
2023-11-29	Wireless Network Digital Twin for 6G: Generative AI as A Key Enabler	Zhenyu Tao et.al.	2311.17451	null
2023-11-29	VideoAssembler: Identity-Consistent Video Generation with Reference Entities using Diffusion Model	Haoyu Zhao et.al.	2311.17338	link
2023-11-28	Self-Discovering Interpretable Diffusion Latent Directions for Responsible Text-to-Image Generation	Hang Li et.al.	2311.17216	null
2023-11-28	A point cloud approach to generative modeling for galaxy surveys at the field level	Carolina Cuesta-Lazaro et.al.	2311.17141	link
2023-11-27	Test-time Adaptation of Discriminative Models via Diffusion Generative Feedback	Mihir Prabhudesai et.al.	2311.16102	null
2023-11-27	Self-correcting LLM-controlled Diffusion Models	Tsung-Han Wu et.al.	2311.16090	link
2023-11-27	DiffSLVA: Harnessing Diffusion Models for Sign Language Video Anonymization	Zhaoyang Xia et.al.	2311.16060	link
2023-11-27	Exploring Attribute Variations in Style-based GANs using Diffusion Models	Rishubh Parihar et.al.	2311.16052	null
2023-11-27	GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions	Jiemin Fang et.al.	2311.16037	null
2023-11-27	Closing the ODE-SDE gap in score-based diffusion models through the Fokker-Planck equation	Teo Deveney et.al.	2311.15996	null
2023-11-27	DiffAnt: Diffusion Models for Action Anticipation	Zeyun Zhong et.al.	2311.15991	null
2023-11-27	Direct2.5: Diverse Text-to-3D Generation via Multi-view 2.5D Diffusion	Yuanxun Lu et.al.	2311.15980	null
2023-11-27	Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models	Claudio Rota et.al.	2311.15908	link
2023-11-27	InterControl: Generate Human Motion Interactions by Controlling Every Joint	Zhenzhi Wang et.al.	2311.15864	link
2023-11-27	SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion	Hsuan-I Ho et.al.	2311.15855	link
2023-11-27	FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax	Yu Lu et.al.	2311.15813	null
2023-11-27	Check, Locate, Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation	Biao Gong et.al.	2311.15773	null
2023-11-27	One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls	Minghui Hu et.al.	2311.15744	null
2023-11-27	SceneDM: Scene-level Multi-agent Trajectory Generation with Consistent Diffusion Models	Zhiming Guo et.al.	2311.15736	null
2023-11-27	Regularization by Texts for Latent Diffusion Inverse Solvers	Jeongsol Kim et.al.	2311.15658	link
2023-11-27	Enhancing Diffusion Models with Text-Encoder Reinforcement Learning	Chaofeng Chen et.al.	2311.15657	link
2023-11-27	ET3D: Efficient Text-to-3D Generation via Multi-View Distillation	Yiming Chen et.al.	2311.15561	null
2023-11-27	Instruct2Attack: Language-Guided Semantic Adversarial Attacks	Jiang Liu et.al.	2311.15551	null
2023-11-27	Efficient Dataset Distillation via Minimax Diffusion	Jianyang Gu et.al.	2311.15529	link
2023-11-22	WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space	Katja Schwarz et.al.	2311.13570	null
2023-11-22	ADriver-I: A General World Model for Autonomous Driving	Fan Jia et.al.	2311.13549	null
2023-11-22	DiffusionMat: Alpha Matting as Sequential Refinement Learning	Yangyang Xu et.al.	2311.13535	null
2023-11-22	Accelerating Inference in Molecular Diffusion Models with Latent Representations of Protein Structure	Ian Dunn et.al.	2311.13466	link
2023-11-22	Guided Flows for Generative Modeling and Decision Making	Qinqing Zheng et.al.	2311.13443	null
2023-11-22	Recognition-Guided Diffusion Model for Scene Text Image Super-Resolution	Yuxuan Zhou et.al.	2311.13317	null
2023-11-22	Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model	Kai Yang et.al.	2311.13231	link
2023-11-22	Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models	Mengyang Feng et.al.	2311.13141	link
2023-11-22	Toward Robust Imperceptible Perturbation against Unauthorized Text-to-image Diffusion-based Synthesis	Yixin Liu et.al.	2311.13127	link
2023-11-22	On the Limitation of Diffusion Models for Synthesizing Training Datasets	Shin’ya Yamaguchi et.al.	2311.13090	null
2023-11-22	FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline	Vladimir Arkhipkin et.al.	2311.13073	link
2023-11-21	Diffusion Model Alignment Using Direct Preference Optimization	Bram Wallace et.al.	2311.12908	null
2023-11-21	Text-Guided Texturing by Synchronized Multi-View Diffusion	Yuxin Liu et.al.	2311.12891	link
2023-11-21	Fine-Grained Open Domain Image Animation with Motion Guidance	Zuozhuo Dai et.al.	2311.12886	link
2023-11-21	GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning	Jiaxi Lv et.al.	2311.12631	null
2023-11-21	Stable Diffusion For Aerial Object Detection	Yanan Jian et.al.	2311.12345	null
2023-11-21	LoCo: Locally Constrained Training-Free Layout-to-Image Synthesis	Peiang Zhao et.al.	2311.12342	null
2023-11-20	NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation	Shachar Rosenman et.al.	2311.12229	link
2023-11-20	Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models	Rohit Gandikota et.al.	2311.12092	link
2023-11-20	An Image is Worth Multiple Words: Multi-attribute Inversion for Constrained Text-to-Image Synthesis	Aishwarya Agarwal et.al.	2311.11919	null
2023-11-20	Multiplicative noise removal based on a variable-order fractional diffusion model	Yuhang Li et.al.	2311.11680	null
2023-11-20	Reti-Diff: Illumination Degradation Image Restoration with Retinex-based Latent Diffusion Model	Chunming He et.al.	2311.11638	link
2023-11-20	Generating Realistic Counterfactuals for Retinal Fundus and OCT Images using Diffusion Models	Indu Ilanchezian et.al.	2311.11629	link
2023-11-20	Deep Equilibrium Diffusion Restoration with Parallel Sampling	Jiezhang Cao et.al.	2311.11600	link
2023-11-20	Advancing Urban Renewal: An Automated Approach to Generating Historical Arcade Facades with Stable Diffusion Models	Zheyuan Kuang et.al.	2311.11590	null
2023-11-19	DiffSCI: Zero-Shot Snapshot Compressive Imaging via Iterative Spectral Diffusion Model	Zhenghao Pan et.al.	2311.11417	link
2023-11-19	A Survey of Emerging Applications of Diffusion Probabilistic Models in MRI	Yuheng Fan et.al.	2311.11383	null
2023-11-19	MoVideo: Motion-Aware Video Generation with Diffusion Models	Jingyun Liang et.al.	2311.11325	null
2023-11-19	GaussianDiffusion: 3D Gaussian Splatting for Denoising Diffusion Probabilistic Models with Structured Noise	Xinhai Li et.al.	2311.11221	null
2023-11-19	On the Noise Scheduling for Generating Plausible Designs with Diffusion Models	Jiajie Fan et.al.	2311.11207	null
2023-11-18	Mitigating Exposure Bias in Discriminator Guided Diffusion Models	Eleftherios Tsonis et.al.	2311.11164	null
2023-11-18	User-Centric Interactive AI for Distributed Diffusion Model-based AI-Generated Content	Hongyang Du et.al.	2311.11094	null
2023-11-18	DSCom: A Data-Driven Self-Adaptive Community-Based Framework for Influence Maximization in Social Networks	Yuxin Zuo et.al.	2311.11080	null
2023-11-18	Make Pixels Dance: High-Dynamic Video Generation	Yan Zeng et.al.	2311.10982	null
2023-11-17	The Hidden Linear Structure in Score-Based Models and its Application	Binxu Wang et.al.	2311.10892	null
2023-11-17	SDDPM: Speckle Denoising Diffusion Probabilistic Models	Soumee Guha et.al.	2311.10868	null
2023-11-17	A Study on Altering the Latent Space of Pretrained Text to Speech Models for Improved Expressiveness	Mathias Vogel et.al.	2311.10804	null
2023-11-17	SelfEval: Leveraging the discriminative nature of generative models for evaluation	Sai Saketh Rambhatla et.al.	2311.10708	null
2023-11-17	Enhancing Object Coherence in Layout-to-Image Synthesis	Yibin Wang et.al.	2311.10522	link
2023-11-16	The Chosen One: Consistent Characters in Text-to-Image Diffusion Models	Omri Avrahami et.al.	2311.10093	null
2023-11-16	TransFusion – A Transparency-Based Diffusion Model for Anomaly Detection	Matic Fučka et.al.	2311.09999	link
2023-11-16	DSR-Diff: Depth Map Super-Resolution with Diffusion Model	Yuan Shi et.al.	2311.09919	null
2023-11-16	Diffusion-Augmented Neural Processes	Lorenzo Bonito et.al.	2311.09848	null
2023-11-16	MAM-E: Mammographic synthetic image generation with diffusion models	Ricardo Montoya-del-Angel et.al.	2311.09822	link
2023-11-16	Scene Text Image Super-resolution based on Text-conditional Diffusion Models	Chihiro Noguchi et.al.	2311.09759	link
2023-11-16	DIFFNAT: Improving Diffusion Image Quality Using Natural Image Statistics	Aniket Roy et.al.	2311.09753	null
2023-11-16	What Constitutes a Faithful Summary? Preserving Author Perspectives in News Summarization	Yuhan Liu et.al.	2311.09741	link
2023-11-16	DECDM: Document Enhancement using Cycle-Consistent Diffusion Models	Jiaxin Zhang et.al.	2311.09625	null
2023-11-16	3D Paintbrush: Local Stylization of 3D Shapes with Cascaded Score Distillation	Dale Decatur et.al.	2311.09571	link
2023-11-15	Synthetically Enhanced: Unveiling Synthetic Data’s Potential in Medical Imaging Research	Bardia Khosravi et.al.	2311.09402	link
2023-11-15	Privacy Threats in Stable Diffusion Models	Thomas Cilloni et.al.	2311.09355	null
2023-11-15	Generative AI-Based Probabilistic Constellation Shaping With Diffusion Models	Mehdi Letafati et.al.	2311.09349	null
2023-11-15	FastBlend: a Powerful Model-Free Toolkit Making Video Stylization Easier	Zhongjie Duan et.al.	2311.09265	link
2023-11-15	Single-Image 3D Human Digitization with Shape-Guided Diffusion	Badour AlBahar et.al.	2311.09221	null
2023-11-15	DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model	Yinghao Xu et.al.	2311.09217	null
2023-11-15	Contrastive Transformer Learning with Proximity Data Generation for Text-Based Person Search	Hefeng Wu et.al.	2311.09084	link
2023-11-15	A Spectral Diffusion Prior for Hyperspectral Image Super-Resolution	Jianjun Liu et.al.	2311.08955	link
2023-11-16	One-Shot Federated Learning with Classifier-Guided Diffusion Models	Mingzhao Yang et.al.	2311.08870	null
2023-11-15	A Diffusion Model Based Quality Enhancement Method for HEVC Compressed Video	Zheng Liu et.al.	2311.08746	null
2023-11-15	Towards Graph-Aware Diffusion Modeling for Collaborative Filtering	Yunqin Zhu et.al.	2311.08744	link
2023-11-15	EDMSound: Spectrogram Based Diffusion Models for Efficient and High-Quality Audio Synthesis	Ge Zhu et.al.	2311.08667	null
2023-11-14	Probabilistic reconstruction of Dark Matter fields from biased tracers using diffusion models	Core Francisco Park et.al.	2311.08558	link
2023-11-14	Mustango: Toward Controllable Text-to-Music Generation	Jan Melechovsky et.al.	2311.08355	link
2023-11-15	Generative De-Quantization for Neural Speech Codec via Latent Diffusion	Haici Yang et.al.	2311.08330	null
2023-11-14	Diffusion-based generation of Histopathological Whole Slide Images at a Gigapixel scale	Robert Harb et.al.	2311.08199	null
2023-11-14	Influence of departures from LTE on determinations of the scandium abundances in A-B type stars	L. Mashonkina et.al.	2311.07982	null
2023-11-14	Brain-Driven Representation Learning Based on Diffusion Model	Soowon Kim et.al.	2311.07925	null
2023-11-14	Bayesian Conditional Diffusion Models for Versatile Spatiotemporal Turbulence Generation	Han Gao et.al.	2311.07896	null
2023-11-14	One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion	Minghua Liu et.al.	2311.07885	null
2023-11-13	Fast and Space-Efficient Parallel Algorithms for Influence Maximization	Letong Wang et.al.	2311.07554	link
2023-11-13	Robust semi-supervised segmentation with timestep ensembling diffusion models	Margherita Rosnati et.al.	2311.07421	null
2023-11-13	Zero-Shot Duet Singing Voices Separation with Diffusion Models	Chin-Yun Yu et.al.	2311.07345	link
2023-11-13	A Gaussian Process Based Method with Deep Kernel Learning for Pricing High-dimensional American Options	Jirong Zhuang et.al.	2311.07211	null
2023-11-13	MonoDiffusion: Self-Supervised Monocular Depth Estimation Using Diffusion Model	Shuwei Shao et.al.	2311.07198	link
2023-11-13	Adversarial Purification for Data-Driven Power System Event Classifiers with Diffusion Models	Yuanbin Cheng et.al.	2311.07110	null
2023-11-12	Augmented Bridge Matching	Valentin De Bortoli et.al.	2311.06978	null
2023-11-12	Sampler Scheduler for Diffusion Models	Zitong Cheng et.al.	2311.06845	link
2023-11-12	IMPUS: Image Morphing with Perceptually-Uniform Sampling Using Diffusion Models	Zhaoyuan Yang et.al.	2311.06792	link
2023-11-11	A 3D Conditional Diffusion Model for Image Quality Transfer – An Application to Low-Field MRI	Seunghoi Kim et.al.	2311.06631	link
2023-11-11	Generative AI for Space-Air-Ground Integrated Networks (SAGIN)	Ruichen Zhang et.al.	2311.06523	null
2023-11-11	Adversarial Fine-tuning using Generated Respiratory Sound to Address Class Imbalance	June-Woo Kim et.al.	2311.06480	link
2023-11-10	On degenerate reaction-diffusion epidemic models with mass action or standard incidence mechanism	Rachidi Salako et.al.	2311.06434	null
2023-11-10	Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models	Siao Tang et.al.	2311.06322	link
2023-11-10	Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization	Weiyang Liu et.al.	2311.06243	null
2023-11-10	Diffusion Models for Earth Observation Use-cases: from cloud removal to urban change detection	Fulvio Sanguigni et.al.	2311.06222	null
2023-11-10	Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model	Jiahao Li et.al.	2311.06214	null
2023-11-10	Enhancing Rock Image Segmentation in Digital Rock Physics: A Fusion of Generative AI and State-of-the-Art Neural Networks	Zhaoyang Ma et.al.	2311.06079	null
2023-11-10	Semantic Map Guided Synthesis of Wireless Capsule Endoscopy Images using Diffusion Models	Haejin Lee et.al.	2311.05889	null
2023-11-10	Diffusion Shape Prior for Wrinkle-Accurate Cloth Registration	Jingfan Guo et.al.	2311.05828	null
2023-11-09	LCM-LoRA: A Universal Stable-Diffusion Acceleration Module	Simian Luo et.al.	2311.05556	link
2023-11-09	Onset of pattern formation for the stochastic Allen-Cahn equation	Stella Brassesco et.al.	2311.05526	null
2023-11-09	3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Models	Haibo Yang et.al.	2311.05464	link
2023-11-09	ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors	Jingwen Chen et.al.	2311.05463	null
2023-11-09	Control3D: Towards Controllable Text-to-3D Generation	Yang Chen et.al.	2311.05461	null
2023-11-09	Predicting the Position Uncertainty at the Time of Closest Approach with Diffusion Models	Marta Guimarães et.al.	2311.05417	null
2023-11-09	ConRad: Image Constrained Radiance Fields for 3D Generation from a Single Image	Senthil Purushwalkam et.al.	2311.05230	null
2023-11-09	Super-Resolution Emulation of Large Cosmological Fields with a 3D Conditional Diffusion Model	Adam Rouhiainen et.al.	2311.05217	null
2023-11-09	BrainNetDiff: Generative AI Empowers Brain Network Generation via Multimodal Diffusion Model	Yongcheng Zong et.al.	2311.05199	null
2023-11-08	Lightweight Diffusion Models with Distillation-Based Block Neural Architecture Search	Siao Tang et.al.	2311.04950	null
2023-11-08	Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation	Ha-Yeong Choi et.al.	2311.04693	link
2023-11-08	Weakly-supervised deepfake localization in diffusion-generated images	Dragos Tantaru et.al.	2311.04584	link
2023-11-08	A 3D generative model of pathological multi-modal MR images and segmentations	Virginia Fernandez et.al.	2311.04552	link
2023-11-07	3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features	Chenfeng Xu et.al.	2311.04391	null
2023-11-07	Dose-aware Diffusion Model for 3D Ultra Low-dose PET Imaging	Huidong Xie et.al.	2311.04248	null
2023-11-07	I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models	Shiwei Zhang et.al.	2311.04145	link
2023-11-07	Generative Structural Design Integrating BIM and Diffusion Model	Zhili He et.al.	2311.04052	link
2023-11-07	Formulating Discrete Probability Flow Through Optimal Transport	Pengze Zhang et.al.	2311.03886	link
2023-11-07	Reducing Spatial Fitting Error in Distillation of Denoising Diffusion Models	Shengzhe Zhou et.al.	2311.03830	link
2023-11-07	3DifFusionDet: Diffusion Model for 3D Object Detection with Robust LiDAR-Camera Fusion	Xinhao Xiang et.al.	2311.03742	null
2023-11-06	The steady state of the boundary-driven multiparticle asymmetric diffusion model	Rouven Frassek et.al.	2311.03603	null
2023-11-06	Generative Diffusion Models for Lattice Field Theory	Lingxiao Wang et.al.	2311.03578	null
2023-11-06	Multi-Resolution Diffusion for Privacy-Sensitive Recommender Systems	Derek Lilienthal et.al.	2311.03488	link
2023-11-06	TS-Diffusion: Generating Highly Complex Time Series with Diffusion Models	Yangming Li et.al.	2311.03303	null
2023-11-06	LDM3D-VR: Latent Diffusion Model for 3D VR	Gabriela Ben Melech Stan et.al.	2311.03226	null
2023-11-06	Algebraic Dynamical Systems in Machine Learning	Iolo Jones et.al.	2311.03118	null
2023-11-07	AnyText: Multilingual Visual Text Generation And Editing	Yuxiang Tuo et.al.	2311.03054	link
2023-11-06	Exploring the Capability of Text-to-Image Diffusion Models with Structural Edge Guidance for Multi-Spectral Satellite Image Inpainting	Mikolaj Czerkawski et.al.	2311.03008	null
2023-11-06	Diffusion-based Radiotherapy Dose Prediction Guided by Inter-slice Aware Structure Encoding	Zhenghao Feng et.al.	2311.02991	null
2023-11-06	Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video	Yanqin Jiang et.al.	2311.02848	null
2023-11-04	From Trojan Horses to Castle Walls: Unveiling Bilateral Backdoor Effects in Diffusion Models	Zhuoshi Pan et.al.	2311.02373	link
2023-11-04	Domain Transfer in Latent Space (DTLS) Wins on Image Super-Resolution – a Non-Denoising Model	Chun-Chuen Hui et.al.	2311.02358	link
2023-11-04	Stable Diffusion Reference Only: Image Prompt and Blueprint Jointly Guided Multi-Condition Diffusion Model for Secondary Painting	Hao Ai et.al.	2311.02343	link
2023-11-03	Patch-based Selection and Refinement for Early Object Detection	Tianyi Zhang et.al.	2311.02274	link
2023-11-03	Sparse Training of Discrete Diffusion Models for Graph Generation	Yiming Qin et.al.	2311.02142	link
2023-11-03	Quantum circuit synthesis with diffusion models	Florian Fürrutter et.al.	2311.02041	link
2023-11-03	Latent Diffusion Model for Conditional Reservoir Facies Generation	Daesoo Lee et.al.	2311.01968	link
2023-11-03	On the Generalization Properties of Diffusion Models	Puheng Li et.al.	2311.01797	link
2023-11-06	CDGraph: Dual Conditional Social Graph Synthesizing via Diffusion Model	Jui-Yi Tsai et.al.	2311.01729	null
2023-11-02	Improving Fairness using Vision-Language Driven Image Augmentation	Moreno D’Incà et.al.	2311.01573	link
2023-11-02	Exploring the Hyperparameter Space of Image Diffusion Models for Echocardiogram Generation	Hadrien Reynaud et.al.	2311.01567	null
2023-11-02	Investigating the Behavior of Diffusion Models for Accelerating Electronic Structure Calculations	Daniel Rothchild et.al.	2311.01491	null
2023-11-02	Time Series Anomaly Detection using Diffusion-based Models	Ioana Pintilie et.al.	2311.01452	link
2023-11-02	Constrained-Context Conditional Diffusion Models for Imitation Learning	Vaibhav Saxena et.al.	2311.01419	null
2023-11-02	Sim2Real Bilevel Adaptation for Object Surface Classification using Vision-Based Tactile Sensors	Gabriele M. Caddeo et.al.	2311.01380	link
2023-11-02	DP-Mix: Mixup-based Data Augmentation for Differentially Private Learning	Wenxuan Bao et.al.	2311.01295	link
2023-11-02	Optimal Transport-Guided Conditional Score-Based Diffusion Models	Xiang Gu et.al.	2311.01226	link
2023-11-02	Diffusion Models for Reinforcement Learning: A Survey	Zhengbang Zhu et.al.	2311.01223	link
2023-11-02	Add and Thin: Diffusion for Temporal Point Processes	David Lüdke et.al.	2311.01139	null
2023-11-02	Infusion: Internal Diffusion for Video Inpainting	Nicolas Cherel et.al.	2311.01090	link
2023-11-02	Expanding Expressiveness of Diffusion Models with Limited Data via Self-Distillation based Fine-Tuning	Jiwan Hur et.al.	2311.01018	null
2023-11-02	Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs	Peng Jin et.al.	2311.01015	link
2023-11-02	Optimal Noise pursuit for Augmenting Text-to-Video Generation	Shijie Ma et.al.	2311.00949	null
2023-11-02	Gaussian Mixture Solvers for Diffusion Models	Hanzhong Guo et.al.	2311.00941	link
2023-11-02	Bridging the Gap: Addressing Discrepancies in Diffusion Model Training for Classifier-Free Guidance	Niket Patel et.al.	2311.00938	null
2023-11-02	Towards High-quality HDR Deghosting with Conditional Diffusion Models	Qingsen Yan et.al.	2311.00932	null
2023-11-01	HIDM: Emulating Large Scale HI Maps using Score-based Diffusion Models	Sultan Hassan et.al.	2311.00833	null
2023-11-01	Quantum Computational Algorithms for Derivative Pricing and Credit Risk in a Regime Switching Economy	Eric Ghysels et.al.	2311.00825	null
2023-11-01	De-Diffusion Makes Text a Strong Cross-Modal Interface	Chen Wei et.al.	2311.00618	null
2023-11-01	Controllable Music Production with Diffusion Models and Guidance Gradients	Mark Levy et.al.	2311.00613	null
2023-11-01	Intriguing Properties of Data Attribution on Diffusion Models	Xiaosen Zheng et.al.	2311.00500	link
2023-11-01	Generating HSR Bogie Vibration Signals via Pulse Voltage-Guided Conditional Diffusion Model	Xuan Liu et.al.	2311.00496	link
2023-11-01	Diffusion models for probabilistic programming	Simon Dirmeier et.al.	2311.00474	link
2023-11-01	Dual Conditioned Diffusion Models for Out-Of-Distribution Detection: Application to Fetal Ultrasound Videos	Divyanshu Mishra et.al.	2311.00469	null
2023-11-01	LatentWarp: Consistent Diffusion Latents for Zero-Shot Video-to-Video Translation	Yuxiang Bao et.al.	2311.00353	null
2023-11-01	Space Narrative: Generating Images and 3D Scenes of Chinese Garden from Text using Deep Learning	Jiaxi Shi et.al.	2311.00339	null
2023-11-01	Adaptive Latent Diffusion Model for 3D Medical Image to Image Translation: Multi-modal Magnetic Resonance Imaging Study	Jonghun Kim et.al.	2311.00265	link
2023-10-31	Score Normalization for a Faster Diffusion Exponential Integrator Sampler	Guoxuan Xia et.al.	2311.00157	link
2023-10-31	SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction	Xinyuan Chen et.al.	2310.20700	null
2023-10-31	Diffusion Reconstruction of Ultrasound Images with Informative Uncertainty	Yuxin Zhang et.al.	2310.20618	null
2023-10-31	Generate What You Prefer: Reshaping Sequential Recommendation via Guided Diffusion	Zhengyi Yang et.al.	2310.20453	link
2023-10-31	In Search of Lost Online Test-time Adaptation: A Survey	Zixin Wang et.al.	2310.20199	link
2023-10-31	A Perturbative Solution to the Linear Influence/Network Autocorrelation Model Under Network Dynamics	Carter T. Butts et.al.	2310.20163	null
2023-10-31	Synthesizing Diabetic Foot Ulcer Images with Diffusion Model	Reza Basiri et.al.	2310.20140	null
2023-10-31	Beyond U: Making Diffusion Models Faster & Lighter	Sergio Calvo-Ordonez et.al.	2310.20092	null
2023-10-30	Scaling Riemannian Diffusion Models	Aaron Lou et.al.	2310.20030	null
2023-10-30	DiffEnc: Variational Diffusion with a Learned Encoder	Beatrix M. G. Nielsen et.al.	2310.19789	link
2023-10-30	CustomNet: Zero-shot Object Customization with Variable-Viewpoints in Text-to-Image Diffusion Models	Ziyang Yuan et.al.	2310.19784	null
2023-10-29	Learning to Follow Object-Centric Image Editing Instructions Faithfully	Tuhin Chakrabarty et.al.	2310.19145	link
2023-10-29	Adversarial Examples Are Not Real Features	Ang Li et.al.	2310.18936	link
2023-10-28	Customizing 360-Degree Panoramas through Text-to-Image Diffusion Models	Hai Wang et.al.	2310.18840	link
2023-10-28	Successfully Applying Lottery Ticket Hypothesis to Diffusion Model	Chao Jiang et.al.	2310.18823	link
2023-10-28	Purify++: Improving Diffusion-Purification with Advanced Diffusion Models and Control of Randomness	Boya Zhang et.al.	2310.18762	null
2023-10-27	From Generative AI to Generative Internet of Things: Fundamentals, Framework, and Outlooks	Jinbo Wen et.al.	2310.18382	null
2023-10-27	Gen2Sim: Scaling up Robot Learning in Simulation with Generative Models	Pushkal Katara et.al.	2310.18308	null
2023-10-27	ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image	Kyle Sargent et.al.	2310.17994	link
2023-10-26	6-DoF Stability Field via Diffusion Models	Takuma Yoneda et.al.	2310.17649	null
2023-10-26	Generative Fractional Diffusion Models	Gabriel Nobis et.al.	2310.17638	link
2023-10-26	Noise-Free Score Distillation	Oren Katzir et.al.	2310.17590	null
2023-10-26	Convergence of flow-based generative models via proximal gradient descent in Wasserstein space	Xiuyuan Cheng et.al.	2310.17582	link
2023-10-27	Global Structure-Aware Diffusion Process for Low-Light Image Enhancement	Jinhui Hou et.al.	2310.17577	link
2023-10-26	DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation	Yongxin Zhu et.al.	2310.17570	null
2023-10-26	SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching	Xinghui Li et.al.	2310.17569	null
2023-10-27	The Expressive Power of Low-Rank Adaptation	Yuchen Zeng et.al.	2310.17513	link
2023-10-26	The statistical thermodynamics of generative diffusion models	Luca Ambrogioni et.al.	2310.17467	null
2023-10-26	Likelihood-based Out-of-Distribution Detection with Denoising Diffusion Probabilistic Models	Joseph Goodier et.al.	2310.17432	null
2023-10-26	Causal Modeling with Stationary Diffusions	Lars Lorch et.al.	2310.17405	link
2023-10-26	Towards Unifying Diffusion Models for Probabilistic Spatio-Temporal Graph Learning	Junfeng Hu et.al.	2310.17360	null
2023-10-26	SE(3) Diffusion Model-based Point Cloud Registration for Robust 6D Object Pose Estimation	Haobo Jiang et.al.	2310.17359	null
2023-10-26	CADS: Unleashing the Diversity of Diffusion Models through Condition-Annealed Sampling	Seyedmorteza Sadat et.al.	2310.17347	null
2023-10-26	Attribute Based Interpretable Evaluation Metrics for Generative Models	Dongkyun Kim et.al.	2310.17261	link
2023-10-26	Exploring Iterative Refinement with Diffusion Models for Video Grounding	Xiao Liang et.al.	2310.17189	link
2023-10-26	Improving Denoising Diffusion Models via Simultaneous Estimation of Image and Noise	Zhenkai Zhang et.al.	2310.17167	null
2023-10-26	Hierarchical Semi-Implicit Variational Inference with Application to Diffusion Model Acceleration	Longlin Yu et.al.	2310.17153	link
2023-10-25	Discrete Diffusion Language Modeling by Estimating the Ratios of the Data Distribution	Aaron Lou et.al.	2310.16834	link
2023-10-25	PERF: Panoramic Neural Radiance Field from a Single Panorama	Guangcong Wang et.al.	2310.16831	link
2023-10-25	CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images	Aaron Gokaslan et.al.	2310.16825	link
2023-10-26	DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior	Jingxiang Sun et.al.	2310.16818	link
2023-10-25	Using Diffusion Models to Generate Synthetic Labelled Data for Medical Image Segmentation	Daniel Saragih et.al.	2310.16794	link
2023-10-26	Multi-scale Diffusion Denoised Smoothing	Jongheon Jeong et.al.	2310.16779	link
2023-10-25	Local Statistics for Generative Image Detection	Yung Jer Wong et.al.	2310.16684	null
2023-10-25	A Picture is Worth a Thousand Words: Principled Recaptioning Improves Image Generation	Eyal Segalis et.al.	2310.16656	null
2023-10-25	Constraining the slow-diffusion zone size and electron injection spectral index for the Geminga pulsar halo	Kun Fang et.al.	2310.16594	null
2023-10-25	Adapt Anything: Tailor Any Image Classifiers across Domains And Categories Using Text-to-Image Diffusion Models	Weijie Chen et.al.	2310.16573	null
2023-10-25	Open Knowledge Base Canonicalization with Multi-task Unlearning	Bingchen Liu et.al.	2310.16419	null
2023-10-25	Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models	Tianyi Lu et.al.	2310.16400	link
2023-10-25	DiffRef3D: A Diffusion-based Proposal Refinement Framework for 3D Object Detection	Se-Ho Kim et.al.	2310.16349	null
2023-10-25	Diffusion model approach to simulating electron-proton scattering events	Peter Devlin et.al.	2310.16308	null
2023-10-25	Dolfin: Diffusion Layout Transformers without Autoencoder	Yilin Wang et.al.	2310.16305	null
2023-10-25	Removing Dust from CMB Observations with Diffusion Models	David Heurtel-Depeiges et.al.	2310.16285	null
2023-10-24	iNVS: Repurposing Diffusion Inpainters for Novel View Synthesis	Yash Kant et.al.	2310.16167	null
2023-10-24	RePoseDM: Recurrent Pose Alignment and Gradient Guidance for Pose Guided Image Synthesis	Anant Khandelwal et.al.	2310.16074	null
2023-10-25	Improving Robustness and Reliability in Medical Image Classification with Latent-Guided Diffusion and Nested-Ensembles	Xing Shen et.al.	2310.15952	null
2023-10-24	Language-driven Scene Synthesis using Multi-conditional Diffusion Model	An Vuong et.al.	2310.15948	link
2023-10-23	FreeNoise: Tuning-Free Longer Video Diffusion Via Noise Rescheduling	Haonan Qiu et.al.	2310.15169	link
2023-10-23	Matryoshka Diffusion Models	Jiatao Gu et.al.	2310.15111	link
2023-10-23	Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model	Ruoxi Shi et.al.	2310.15110	link
2023-10-24	Wonder3D: Single Image to 3D using Cross-Domain Diffusion	Xiaoxiao Long et.al.	2310.15008	null
2023-10-23	Orientation-Aware Leg Movement Learning for Action-Driven Human Motion Prediction	Chunzhi Gu et.al.	2310.14907	null
2023-10-23	Joint Non-Linear MRI Inversion with Diffusion Priors	Moritz Erlacher et.al.	2310.14842	null
2023-10-23	MAS: Multi-view Ancestral Sampling for 3D motion generation using 2D diffusion	Roy Kapon et.al.	2310.14729	null
2023-10-23	$Λ$-Split: A Privacy-Preserving Split Computing Framework for Cloud-Powered Generative AI	Shoki Ohta et.al.	2310.14651	link
2023-10-23	DICE: Diverse Diffusion Model with Scoring for Trajectory Prediction	Younwoo Choi et.al.	2310.14570	null
2023-10-22	Diffusion-Model-Assisted Supervised Learning of Generative Models for Density Estimation	Yanfang Liu et.al.	2310.14458	null
2023-10-22	Diffusion-based Data Augmentation for Nuclei Image Segmentation	Xinyi Yu et.al.	2310.14197	link
2023-10-22	Improved Techniques for Training Consistency Models	Yang Song et.al.	2310.14189	null
2023-10-21	Composer Style-specific Symbolic Music Generation Using Vector Quantized Discrete Diffusion Models	Jincheng Zhang et.al.	2310.14044	link
2023-10-21	Fast Diffusion GAN Model for Symbolic Music Generation Controlled by Emotions	Jincheng Zhang et.al.	2310.14040	null
2023-10-21	Cold Diffusion on the Replay Buffer: Learning to Plan from Known Good States	Zidan Wang et.al.	2310.13914	null
2023-10-20	GraphMaker: Can Diffusion Models Generate Large Attributed Graphs?	Mufei Li et.al.	2310.13833	link
2023-10-20	TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models	Tianshi Cao et.al.	2310.13772	null
2023-10-20	Localizing and Editing Knowledge in Text-to-Image Generative Models	Samyadeep Basu et.al.	2310.13730	null
2023-10-20	ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection	Zhongzhan Huang et.al.	2310.13545	link
2023-10-19	CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation	Sihan Xu et.al.	2310.13165	link
2023-10-19	EMIT-Diff: Enhancing Medical Image Segmentation via Text-Guided Diffusion Model	Zheyuan Zhang et.al.	2310.12868	link
2023-10-19	Energy-Based Models For Speech Synthesis	Wanli Sun et.al.	2310.12765	null
2023-10-19	TapMo: Shape-aware Motion Generation of Skeleton-free Characters	Jiaxu Zhang et.al.	2310.12678	null
2023-10-19	Product of Gaussian Mixture Diffusion Models	Martin Zach et.al.	2310.12653	link
2023-10-19	Denoising Heat-inspired Diffusion with Insulators for Collision Free Motion Planning	Junwoo Chang et.al.	2310.12609	null
2023-10-19	Diverse Diffusion: Enhancing Image Diversity in Text-to-Image Generation	Mariia Zameshina et.al.	2310.12583	null
2023-10-19	SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation	Chongyu Fan et.al.	2310.12508	link
2023-10-19	Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping	Zijie Pan et.al.	2310.12474	link
2023-10-19	Closed-Form Diffusion Models	Christopher Scarvelis et.al.	2310.12395	null
2023-10-18	DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors	Jinbo Xing et.al.	2310.12190	link
2023-10-18	Quality Diversity through Human Feedback	Li Ding et.al.	2310.12103	link
2023-10-20	Image Super-resolution Via Latent Diffusion: A Sampling-space Mixture Of Experts And Frequency-augmented Decoder Approach	Feng Luo et.al.	2310.12004	link
2023-10-18	Bayesian Flow Networks in Continual Learning	Mateusz Pyla et.al.	2310.12001	null
2023-10-18	InfoDiffusion: Information Entropy Aware Diffusion Process for Non-Autoregressive Text Generation	Renzhi Wang et.al.	2310.11976	link
2023-10-18	To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images … For Now	Yimeng Zhang et.al.	2310.11868	link
2023-10-20	Equivariant Bootstrapping for Uncertainty Quantification in Imaging Inverse Problems	Julian Tachella et.al.	2310.11838	link
2023-10-18	Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts	Xinhua Cheng et.al.	2310.11784	null
2023-10-18	Language Agents for Detecting Implicit Stereotypes in Text-to-image Models at Scale	Qichao Wang et.al.	2310.11778	null
2023-10-18	On the Evaluation of Generative Models in Distributed Learning Tasks	Zixiao Wang et.al.	2310.11714	null
2023-10-17	Reflection-Equivariant Diffusion for 3D Structure Determination from Isotopologue Rotational Spectra in Natural Abundance	Austin Cheng et.al.	2310.11609	link
2023-10-17	GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment	Dhruba Ghosh et.al.	2310.11513	link
2023-10-17	Elucidating The Design Space of Classifier-Guided Diffusion Generation	Jiajun Ma et.al.	2310.11311	link
2023-10-17	BayesDiff: Estimating Pixel-wise Uncertainty in Diffusion via Bayesian Inference	Siqi Kou et.al.	2310.11142	link
2023-10-17	3D Structure-guided Network for Tooth Alignment in 2D Photograph	Yulong Dou et.al.	2310.11106	link
2023-10-16	LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation	Ruiqi Wu et.al.	2310.10769	link
2023-10-18	BiomedJourney: Counterfactual Biomedical Image Generation by Instruction-Learning from Multimodal Patient Journeys	Yu Gu et.al.	2310.10765	null
2023-10-16	MOFDiff: Coarse-grained Diffusion for Metal-Organic Framework Design	Xiang Fu et.al.	2310.10732	null
2023-10-16	A Survey on Video Diffusion Models	Zhen Xing et.al.	2310.10647	link
2023-10-16	LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts	Hanan Gani et.al.	2310.10640	link
2023-10-16	Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion Models	Kevin Black et.al.	2310.10639	link
2023-10-16	ForceGen: End-to-end de novo protein generation based on nonlinear mechanical unfolding responses using a protein language diffusion model	Bo Ni et.al.	2310.10605	null
2023-10-16	Generation or Replication: Auscultating Audio Latent Diffusion Models	Dimitrios Bralios et.al.	2310.10604	null
2023-10-16	Model Selection of Anomaly Detectors in the Absence of Labeled Validation Data	Clement Fung et.al.	2310.10461	null
2023-10-16	ConsistNet: Enforcing 3D Consistency for Multi-view Images Diffusion	Jiayu Yang et.al.	2310.10343	link
2023-10-16	Scene Graph Conditioning in Latent Diffusion	Frank Fundel et.al.	2310.10338	link
2023-10-16	Towards image compression with perfect realism at ultra-low bitrates	Marlène Careil et.al.	2310.10325	null
2023-10-16	Self-supervised Fetal MRI 3D Reconstruction Based on Radiation Diffusion Generation Model	Junpeng Tan et.al.	2310.10209	null
2023-10-16	Ring-A-Bell! How Reliable are Concept Removal Methods for Diffusion Models?	Yu-Lin Tsai et.al.	2310.10012	link
2023-10-15	Unsupervised Discovery of Interpretable Directions in h-space of Pre-trained Diffusion Models	Zijian Zhang et.al.	2310.09912	null
2023-10-15	Image Augmentation with Controlled Diffusion for Weakly-Supervised Semantic Segmentation	Wangyu Wu et.al.	2310.09760	null
2023-10-15	LOVECon: Text-driven Training-Free Long Video Editing with ControlNet	Zhenyi Liao et.al.	2310.09711	link
2023-10-14	Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space	Hengrui Zhang et.al.	2310.09656	link
2023-10-14	Adaptive Online Replanning with Diffusion Models	Siyuan Zhou et.al.	2310.09629	null
2023-10-14	JSMoCo: Joint Coil Sensitivity and Motion Correction in Parallel MRI with a Self-Calibrating Score-Based Diffusion Model	Lixuan Chen et.al.	2310.09625	null
2023-10-14	Neural Network for valuing Bitcoin options under jump-diffusion and market sentiment model	Edson Pindza et.al.	2310.09622	null
2023-10-14	Unified High-binding Watermark for Unconditional Image Generation Models	Ruinan Ma et.al.	2310.09479	null
2023-10-14	Towards More Accurate Diffusion Model Acceleration with A Timestep Aligner	Mengfei Xia et.al.	2310.09469	null
2023-10-12	HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion	Xian Liu et.al.	2310.08579	null
2023-10-12	NetDiffusion: Network Data Augmentation Through Protocol-Constrained Traffic Generation	Xi Jiang et.al.	2310.08543	null
2023-10-12	GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors	Taoran Yi et.al.	2310.08529	link
2023-10-12	MotionDirector: Motion Customization of Text-to-Video Diffusion Models	Rui Zhao et.al.	2310.08465	link
2023-10-12	Debias the Training of Diffusion Models	Hu Yu et.al.	2310.08442	link
2023-10-12	A new local and explicit kinetic method for linear and non-linear convection-diffusion problems with finite kinetic speeds: I. One-dimensional case	Gauthier Wissocq et.al.	2310.08356	null
2023-10-12	Neural Diffusion Models	Grigory Bartosh et.al.	2310.08337	null
2023-10-12	Consistent123: Improve Consistency for One Image to 3D Object Synthesis	Haohan Weng et.al.	2310.08092	null
2023-10-12	Interpretable Diffusion via Information Decomposition	Xianghao Kong et.al.	2310.07972	link
2023-10-11	NoMaD: Goal Masked Diffusion Policies for Navigation and Exploration	Ajay Sridhar et.al.	2310.07896	link
2023-10-11	Efficient Integrators for Diffusion Generative Models	Kushagra Pandey et.al.	2310.07894	link
2023-10-13	Generative Modeling with Phase Stochastic Bridges	Tianrong Chen et.al.	2310.07805	link
2023-10-11	Quantum sequential scattering model for quantum state learning	Mingrui Jing et.al.	2310.07797	null
2023-10-11	DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model	Xiaofan Li et.al.	2310.07771	link
2023-10-11	ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models	Yingqing He et.al.	2310.07702	link
2023-10-12	Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models	Zeqiang Lai et.al.	2310.07653	link
2023-10-11	Boosting Black-box Attack to Deep Neural Networks with Conditional Diffusion Models	Renyang Liu et.al.	2310.07492	link
2023-10-11	Multi-Concept T2I-Zero: Tweaking Only The Text Embeddings and Nothing Else	Hazarapet Tunanyan et.al.	2310.07419	null
2023-10-12	WiGenAI: The Symphony of Wireless and Generative AI via Diffusion Models	Mehdi Letafati et.al.	2310.07312	null
2023-10-12	Score Regularized Policy Optimization through Diffusion Behavior	Huayu Chen et.al.	2310.07297	link
2023-10-11	Uni-paint: A Unified Framework for Multimodal Image Inpainting with Pretrained Diffusion Model	Shiyuan Yang et.al.	2310.07222	link
2023-10-11	Generative Modeling on Manifolds Through Mixture of Riemannian Diffusion Processes	Jaehyeong Jo et.al.	2310.07216	link
2023-10-11	State of the Art on Diffusion Models for Visual Computing	Ryan Po et.al.	2310.07204	null
2023-10-11	The Ubiquity of Diffusiophoresis: Exploring Human Population Dynamics While Including Concentration Gradient-Driven Advection	Benjamin M. Alessio et.al.	2310.07185	null
2023-10-11	Imitation Learning from Purified Demonstration	Yunke Wang et.al.	2310.07143	link
2023-10-11	Denoising Task Routing for Diffusion Models	Byeongjun Park et.al.	2310.07138	link
2023-10-11	Echocardiography video synthesis from end diastolic semantic map via diffusion model	Phi Nguyen Van et.al.	2310.07131	null
2023-10-10	Investigating the Adversarial Robustness of Density Estimation Using the Probability Flow ODE	Marius Arvinte et.al.	2310.07084	null
2023-10-10	ObjectComposer: Consistent Generation of Multiple Objects Without Fine-tuning	Alec Helbling et.al.	2310.06968	null
2023-10-10	Monsters in the Dark: Sanitizing Hidden Threats with Diffusion Models	Preston K. Robinette et.al.	2310.06951	null
2023-10-10	Stochastic Super-resolution of Cosmological Simulations with Denoising Diffusion Models	Andreas Schanz et.al.	2310.06929	null
2023-10-10	HiFi-123: Towards High-fidelity One Image to 3D Content Generation	Wangbo Yu et.al.	2310.06744	null
2023-10-10	Tweedie Moment Projected Diffusions For Inverse Problems	Benjamin Boys et.al.	2310.06721	null
2023-10-10	Latent Diffusion Counterfactual Explanations	Karim Farid et.al.	2310.06668	null
2023-10-09	FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing	Yuren Cong et.al.	2310.05922	null
2023-10-10	Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models	Zhili Liu et.al.	2310.05873	null
2023-10-09	A Bias-Variance-Covariance Decomposition of Kernel Scores for Generative Models	Sebastian G. Gruber et.al.	2310.05833	link
2023-10-09	DiffuSeq-v2: Bridging Discrete and Continuous Text Spaces for Accelerated Seq2Seq Diffusion Models	Shansan Gong et.al.	2310.05793	link
2023-10-09	Language Model Beats Diffusion – Tokenizer is Key to Visual Generation	Lijun Yu et.al.	2310.05737	link
2023-10-09	DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning	Longxiang He et.al.	2310.05333	link
2023-10-08	Image Compression and Decompression Framework Based on Latent Diffusion Model for Breast Mammography	InChan Hwang et.al.	2310.05299	link
2023-10-08	Fast protein backbone generation with SE(3) flow matching	Jason Yim et.al.	2310.05297	null
2023-10-08	The Emergence of Reproducibility and Consistency in Diffusion Models	Huijie Zhang et.al.	2310.05264	null
2023-10-08	Latent Diffusion Model for Medical Image Standardization and Enhancement	Md Selim et.al.	2310.05237	null
2023-10-07	Prompt-to-OS (P2OS): Revolutionizing Operating Systems and Human-Computer Interaction with Integrated AI Generative Models	Gabriele Tolomei et.al.	2310.04875	null
2023-10-07	Conditional Diffusion Model for Target Speaker Extraction	Theodor Nguyen et.al.	2310.04791	null
2023-10-10	DiffNAS: Bootstrapping Diffusion Models by Prompting for Better Architectures	Wenhao Li et.al.	2310.04750	null
2023-10-07	SeeDS: Semantic Separable Diffusion Synthesizer for Zero-shot Food Detection	Pengfei Zhou et.al.	2310.04689	link
2023-10-07	Understanding and Improving Adversarial Attacks on Latent Diffusion Model	Boyang Zheng et.al.	2310.04687	link
2023-10-07	VoiceExtender: Short-utterance Text-independent Speaker Verification with Guided Diffusion Model	Yayun He et.al.	2310.04681	null
2023-10-07	EasyPhoto: Your Smart AI Photo Generator	Ziheng Wu et.al.	2310.04672	link
2023-10-07	Score-based Diffusion Models With Self-supervised Learning For Accelerated 3D Multi-contrast Cardiac Magnetic Resonance Imaging	Yuanyuan Liu et.al.	2310.04669	null
2023-10-06	DragD3D: Vertex-based Editing for Realistic Mesh Deformations using 2D Diffusion Priors	Tianhao Xie et.al.	2310.04561	null
2023-10-06	Generative Diffusion From An Action Principle	Akhil Premkumar et.al.	2310.04490	null
2023-10-05	Aligning Text-to-Image Diffusion Models with Reward Backpropagation	Mihir Prabhudesai et.al.	2310.03739	link
2023-10-05	Certification of Deep Learning Models for Medical Image Segmentation	Othmane Laousy et.al.	2310.03664	link
2023-10-05	Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints	Chuan Fang et.al.	2310.03602	null
2023-10-05	Deep Generative Models of Music Expectation	Ninon Lizé Masclef et.al.	2310.03500	null
2023-10-05	FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators	Haiping Wang et.al.	2310.03420	link
2023-10-05	ACT-Net: Anchor-context Action Detection in Surgery Videos	Luoying Hao et.al.	2310.03377	null
2023-10-05	Realistic Speech-to-Face Generation with Speech-Conditioned Latent Diffusion Model with Face Prior	Jinting Wang et.al.	2310.03363	null
2023-10-05	Denoising Diffusion Step-aware Models	Shuai Yang et.al.	2310.03337	link
2023-10-05	EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models	Yefei He et.al.	2310.03270	link
2023-10-04	Low-Energy Radiative Backgrounds in CCD-Based Dark-Matter Detectors	Peizhi Du et.al.	2310.03068	null
2023-10-04	Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models	Jianglong Ye et.al.	2310.03020	null
2023-10-04	Efficient-3DiM: Learning a Generalizable Single-image Novel-view Synthesizer in One Day	Yifan Jiang et.al.	2310.03015	null
2023-10-04	Probing Intersectional Biases in Vision-Language Models with Counterfactual Examples	Phillip Howard et.al.	2310.02988	null
2023-10-04	T$^3$Bench: Benchmarking Current Progress in Text-to-3D Generation	Yuze He et.al.	2310.02977	link
2023-10-04	Fast, Expressive SE$(n)$ Equivariant Networks through Weight-Sharing in Position-Orientation Space	Erik J Bekkers et.al.	2310.02970	link
2023-10-04	Boosting Dermatoscopic Lesion Segmentation via Diffusion Models with Visual and Textual Prompts	Shiyi Du et.al.	2310.02906	null
2023-10-04	Magicremover: Tuning-free Text-guided Image inpainting with Diffusion Models	Siyuan Yang et.al.	2310.02848	null
2023-10-04	ED-NeRF: Efficient Text-Guided Editing of 3D Scene using Latent Space NeRF	Jangho Park et.al.	2310.02712	null
2023-10-04	On Memorization in Diffusion Models	Xiangming Gu et.al.	2310.02664	link
2023-10-05	MagicDrive: Street View Generation with Diverse 3D Geometry Control	Ruiyuan Gao et.al.	2310.02601	null
2023-10-04	SweetDreamer: Aligning Geometric Priors in 2D Diffusion for Consistent Text-to-3D	Weiyu Li et.al.	2310.02596	link
2023-10-04	Generalization in diffusion models arises from geometry-adaptive harmonic representation	Zahra Kadkhodaie et.al.	2310.02557	link
2023-10-04	Prepare Ansatz for VQE with Diffusion Model	Yilin Shen et.al.	2310.02511	null
2023-10-04	Learning to Reach Goals via Diffusion	Vineet Jain et.al.	2310.02505	link
2023-10-03	FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models	Yingqian Cui et.al.	2310.02401	null
2023-10-03	Generalized Schrödinger Bridge Matching	Guan-Horng Liu et.al.	2310.02233	link
2023-10-03	A Variable Eddington Factor Model for Thermal Radiative Transfer with Closure based on Data-Driven Shape Function	Joseph M. Coale et.al.	2310.02072	null
2023-10-03	Global Attractor for a Reaction-Diffusion Model Arising in Biological Dynamic in 3D Soil Structure	Mohamed Elghandouri et.al.	2310.02060	null
2023-10-03	AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model	Zibin Dong et.al.	2310.02054	null
2023-10-03	Amazing Combinatorial Creation: Acceptable Swap-Sampling for Text-to-Image Generation	Jun Li et.al.	2310.01819	null
2023-10-02	LLM-grounded Video Diffusion Models	Long Lian et.al.	2309.17444	null
2023-09-29	Directly Fine-Tuning Diffusion Models on Differentiable Rewards	Kevin Clark et.al.	2309.17400	null
2023-09-29	Navigating the Design Space of Equivariant Diffusion-Based Generative Models for De Novo 3D Molecule Generation	Tuan Le et.al.	2309.17296	null
2023-09-29	In search of dispersed memories: Generative diffusion models are associative memory networks	Luca Ambrogioni et.al.	2309.17290	null
2023-09-29	Consistent123: One Image to Highly Consistent 3D Asset Using Case-Aware Diffusion Priors	Yukang Lin et.al.	2309.17261	null
2023-09-29	ResBit: Residual Bit Vector for Categorical Values	Masane Fuchi et.al.	2309.17196	null
2023-09-29	Advances in Kidney Biopsy Structural Assessment through Dense Instance Segmentation	Zhan Xiong et.al.	2309.17166	null
2023-09-29	Reconstruction of Patient-Specific Confounders in AI-based Radiologic Image Interpretation using Generative Pretraining	Tianyu Han et.al.	2309.17123	link
2023-09-29	Diffusion Models as Stochastic Quantization in Lattice Field Theory	Lingxiao Wang et.al.	2309.17082	link
2023-09-29	DeeDiff: Dynamic Uncertainty-Aware Early Exiting for Accelerating Diffusion Model Generation	Shengkun Tang et.al.	2309.17074	null
2023-09-29	ReFlow-TTS: A Rectified Flow Model for High-fidelity Text-to-Speech	Wenhao Guan et.al.	2309.17056	null
2023-09-29	Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning	Zihan Ding et.al.	2309.16984	link
2023-09-29	Leveraging Optimization for Adaptive Attacks on Image Watermarks	Nils Lukas et.al.	2309.16952	link
2023-09-29	Denoising Diffusion Bridge Models	Linqi Zhou et.al.	2309.16948	link
2023-09-28	SatDM: Synthesizing Realistic Satellite Image with Semantic Layout Conditioning using Diffusion Models	Orkhan Baghirli et.al.	2309.16812	link
2023-09-28	Memory in Plain Sight: A Survey of the Uncanny Resemblances between Diffusion Models and Associative Memories	Benjamin Hoover et.al.	2309.16750	null
2023-09-28	KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing	Jiancheng Huang et.al.	2309.16608	null
2023-09-28	CCEdit: Creative and Controllable Video Editing via Diffusion Models	Ruoyu Feng et.al.	2309.16496	null
2023-09-28	Distilling ODE Solvers of Diffusion Models into Smaller Steps	Sanghwan Kim et.al.	2309.16421	null
2023-09-28	DeepPCR: Parallelizing Sequential Operations in Neural Networks	Federico Danieli et.al.	2309.16318	null
2023-09-28	Long time behavior of the field-road diffusion model: an entropy method and a finite volume scheme	Matthieu Alfaro et.al.	2309.16242	null
2023-09-28	Object Motion Guided Human Motion Synthesis	Jiaman Li et.al.	2309.16237	null
2023-09-28	Compositional Sculpting of Iterative Generative Processes	Timur Garipov et.al.	2309.16115	link
2023-09-27	High Perceptual Quality Wireless Image Delivery with Denoising Diffusion Models	Selim F. Yilmaz et.al.	2309.15889	link
2023-09-27	Exploiting the Signal-Leak Bias in Diffusion Models	Martin Nicolas Everaert et.al.	2309.15842	null
2023-09-27	Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation	David Junhao Zhang et.al.	2309.15818	link
2023-09-27	Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack	Xiaoliang Dai et.al.	2309.15807	null
2023-09-27	Factorized Diffusion Architectures for Unsupervised Image Generation and Segmentation	Xin Yuan et.al.	2309.15726	null
2023-09-27	Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing	Kai Wang et.al.	2309.15664	link
2023-09-27	Uncertainty Quantification via Neural Posterior Principal Components	Elias Nehme et.al.	2309.15533	null
2023-09-27	High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models	Chunyu Qiang et.al.	2309.15512	null
2023-09-27	DreamCom: Finetuning Text-guided Inpainting Model for Image Composition	Lingxiao Lu et.al.	2309.15508	null
2023-09-27	LD4MRec: Simplifying and Powering Diffusion Model for Multimedia Recommendation	Penghang Yu et.al.	2309.15363	null
2023-09-26	Learning Using Generated Privileged Information by Text-to-Image Diffusion Models	Rafael-Edy Menadil et.al.	2309.15238	null
2023-09-27	LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models	Yaohui Wang et.al.	2309.15103	link
2023-09-26	The ATM implied skew in the ADO-Heston model	Andrey Itkin et.al.	2309.15044	null
2023-09-26	FEC: Three Finetuning-free Methods to Enhance Consistency for Real Image Editing	Songyan Chen et.al.	2309.14934	null
2023-09-27	ITEM3D: Illumination-Aware Directional Texture Editing for 3D Models	Shengqi Liu et.al.	2309.14872	null
2023-09-26	On a class of solvable stationary non equilibrium states for mass exchange models	Monia Capanna et.al.	2309.14836	null
2023-09-26	Diffusion-based Holistic Texture Rectification and Synthesis	Guoqing Hao et.al.	2309.14759	null
2023-09-26	On quantifying and improving realism of images generated with diffusion	Yunzhuo Chen et.al.	2309.14756	null
2023-09-26	Text-image guided Diffusion Model for generating Deepfake celebrity interactions	Yunzhuo Chen et.al.	2309.14751	null
2023-09-26	Bootstrap Diffusion Model Curve Estimation for High Resolution Low-Light Image Enhancement	Jiancheng Huang et.al.	2309.14709	null
2023-09-26	Efficient Post-training Quantization with FP8 Formats	Haihao Shen et.al.	2309.14592	link
2023-09-25	Bayesian parameter estimation for characterising mobile ion vacancies in perovskite solar cells	Samuel G. McCallum et.al.	2309.14302	null
2023-09-25	Soft Mixture Denoising: Beyond the Expressive Bottleneck of Diffusion Models	Yangming Li et.al.	2309.14068	null
2023-09-24	VoiceLDM: Text-to-Speech with Environmental Context	Yeonghyeon Lee et.al.	2309.13664	null
2023-09-26	Adaptation of the super resolution SOTA for Art Restoration in camera capture images	Sandeep Nagar et.al.	2309.13655	link
2023-09-23	Dream the Impossible: Outlier Imagination with Diffusion Models	Xuefeng Du et.al.	2309.13415	link
2023-09-23	GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER	Mingzhen Sun et.al.	2309.13274	link
2023-09-22	Invisible Watermarking for Audio Generation Diffusion Models	Xirong Cao et.al.	2309.13166	link
2023-09-22	AntiBARTy Diffusion for Property Guided Antibody Design	Jordan Venderley et.al.	2309.13129	null
2023-09-22	MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation	Jiahao Xie et.al.	2309.13042	link
2023-09-22	Diffusion Augmentation for Sequential Recommendation	Qidong Liu et.al.	2309.12858	link
2023-09-22	Synthetic Boost: Leveraging Synthetic Data for Enhanced Vision-Language Segmentation in Echocardiography	Rabin Adhikari et.al.	2309.12829	link
2023-09-21	A Diffusion-Model of Joint Interactive Navigation	Matthew Niedoba et.al.	2309.12508	null
2023-09-21	License Plate Super-Resolution Using Diffusion Models	Sawsan AlHalawani et.al.	2309.12506	null
2023-09-21	Synthetic Image Detection: Highlights from the IEEE Video and Image Processing Cup 2022 Student Competition	Davide Cozzolino et.al.	2309.12428	null
2023-09-21	Deshadow-Anything: When Segment Anything Model Meets Zero-shot shadow removal	Xiao Feng Zhang et.al.	2309.11715	null
2023-09-24	Latent Diffusion Models for Structural Component Design	Ethan Herron et.al.	2309.11601	null
2023-09-20	Light Field Diffusion for Single-View Novel View Synthesis	Yifeng Xiong et.al.	2309.11525	null
2023-09-20	FreeU: Free Lunch in Diffusion U-Net	Chenyang Si et.al.	2309.11497	link
2023-09-20	Generative Agent-Based Modeling: Unveiling Social System Dynamics through Coupling Mechanistic Models with Generative Artificial Intelligence	Navid Ghaffarzadegan et.al.	2309.11456	null
2023-09-20	Deep Networks as Denoising Algorithms: Sample-Efficient Learning of Diffusion Models in High-Dimensional Graphical Models	Song Mei et.al.	2309.11420	null
2023-09-20	Face Aging via Diffusion-based Editing	Xiangyi Chen et.al.	2309.11321	link
2023-09-20	Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates	Ka Chun Shum et.al.	2309.11281	link
2023-09-20	TwinTex: Geometry-aware Texture Generation for Abstracted 3D Architectural Models	Weidan Xiong et.al.	2309.11258	null
2023-09-20	Investigating Personalization Methods in Text to Music Generation	Manos Plitsis et.al.	2309.11140	link
2023-09-20	PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement	Chengyou Jia et.al.	2309.11125	null
2023-09-19	Language-Conditioned Affordance-Pose Detection in 3D Point Clouds	Toan Nguyen et.al.	2309.10911	null
2023-09-19	Assessing the capacity of a denoising diffusion probabilistic model to reproduce spatial context	Rucha Deshpande et.al.	2309.10817	null
2023-09-19	PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance	Peiqing Yang et.al.	2309.10810	link
2023-09-19	Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation	Yatong Bai et.al.	2309.10740	link
2023-09-19	Reconstruct-and-Generate Diffusion Model for Detail-Preserving Image Denoising	Yujin Wang et.al.	2309.10714	null
2023-09-19	Forgedit: Text Guided Image Editing via Learning and Forgetting	Shiwen Zhang et.al.	2309.10556	link
2023-09-19	Towards Generative Modeling of Urban Flow through Knowledge-enhanced Denoising Diffusion	Zhilun Zhou et.al.	2309.10547	link
2023-09-21	Learning End-to-End Channel Coding with Diffusion Models	Muah Kim et.al.	2309.10505	null
2023-09-19	Unsupervised speech enhancement with diffusion-based generative models	Berné Nortier et.al.	2309.10450	link
2023-09-19	Posterior sampling algorithms for unsupervised speech enhancement with recurrent variational autoencoder	Mostafa Sadeghi et.al.	2309.10439	null
2023-09-19	AutoDiffusion: Training-Free Optimization of Time Steps and Architectures for Automated Diffusion Model Acceleration	Lijiang Li et.al.	2309.10438	link
2023-09-19	$Γ$-convergence of Nonlocal Dirichlet Energies With Penalty Formulations of Dirichlet Boundary Data	Weiye Gan et.al.	2309.10352	null
2023-09-18	What is a Fair Diffusion Model? Designing Generative Text-To-Image Models to Incorporate Various Worldviews	Zoe De Simone et.al.	2309.09944	link
2023-09-18	DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving	Xiaofeng Wang et.al.	2309.09777	null
2023-09-18	Application-driven Validation of Posteriors in Inverse Problems	Tim J. Adler et.al.	2309.09764	null
2023-09-18	Single and Few-step Diffusion for Generative Speech Enhancement	Bunlong Lay et.al.	2309.09677	link
2023-09-18	Speeding Up Speech Synthesis In Diffusion Models By Reducing Data Distribution Recovery Steps Via Content Transfer	Peter Ochieng et.al.	2309.09652	null
2023-09-18	Gradpaint: Gradient-Guided Inpainting with Diffusion Models	Asya Grechka et.al.	2309.09614	null
2023-09-18	Causal-Story: Local Causal Attention Utilizing Parameter-Efficient Tuning For Visual Story Synthesis	Tianyi Song et.al.	2309.09553	link
2023-09-18	Progressive Text-to-Image Diffusion with Soft Latent Direction	YuTeng Ye et.al.	2309.09466	link
2023-09-17	Enhancing Knee Osteoarthritis severity level classification using diffusion augmented images	Paleti Nikhil Chowdary et.al.	2309.09328	null
2023-09-17	PromptVC: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts	Jixun Yao et.al.	2309.09262	null
2023-09-16	CDDM: Channel Denoising Diffusion Models for Wireless Semantic Communications	Tong Wu et.al.	2309.08895	null
2023-09-15	Probabilistic Constellation Shaping With Denoising Diffusion Probabilistic Models: A Novel Approach	Mehdi Letafati et.al.	2309.08688	null
2023-09-15	Compositional Foundation Models for Hierarchical Planning	Anurag Ajay et.al.	2309.08587	null
2023-09-15	Denoising Diffusion Probabilistic Models for Hardware-Impaired Communications	Mehdi Letafati et.al.	2309.08568	null
2023-09-15	Breathing New Life into 3D Assets with Generative Repainting	Tianfu Wang et.al.	2309.08523	link
2023-09-15	Generalised Probabilistic Diffusion Scale-Spaces	Pascal Peter et.al.	2309.08511	null
2023-09-15	Biological invasions and epidemics with nonlocal diffusion along a line	Henri Berestycki et.al.	2309.08298	null
2023-09-15	Large Intestine 3D Shape Refinement Using Point Diffusion Models for Digital Phantom Generation	Kaouther Mouheb et.al.	2309.08289	null
2023-09-15	Unsupervised Disentangling of Facial Representations with 3D-aware Latent Diffusion Models	Ruian He et.al.	2309.08273	link
2023-09-15	Cartoondiff: Training-free Cartoon Image Generation with Diffusion Transformer Models	Feihong He et.al.	2309.08251	null
2023-09-15	Large-Vocabulary 3D Diffusion Model with Transformer	Ziang Cao et.al.	2309.07920	null
2023-09-14	Beta Diffusion	Mingyuan Zhou et.al.	2309.07867	link
2023-09-14	EMOCONV-DIFF: Diffusion-based Speech Emotion Conversion for Non-parallel and In-the-wild Data	Navin Raj Prabhu et.al.	2309.07828	null
2023-09-14	DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks	Zipeng Qi et.al.	2309.07509	null
2023-09-14	Masked Diffusion with Task-awareness for Procedure Planning in Instructional Videos	Fen Fang et.al.	2309.07409	link
2023-09-14	Semantic Adversarial Attacks via Diffusion Models	Chenan Wang et.al.	2309.07398	link
2023-09-14	Beta quantile regression for robust estimation of uncertainty in the presence of outliers	Haleh Akrami et.al.	2309.07374	null
2023-09-13	Unbiased Face Synthesis With Diffusion Models: Are We There Yet?	Harrison Rosenberg et.al.	2309.07277	link
2023-09-13	Mitigate Replication and Copying in Diffusion Models with Generalized Caption and Dual Fusion Enhancement	Chenghao Li et.al.	2309.07254	link
2023-09-13	Diffusion models for audio semantic communication	Eleonora Grassucci et.al.	2309.07195	null
2023-09-13	UnifiedGesture: A Unified Gesture Synthesis Model for Multiple Skeletons	Sicheng Yang et.al.	2309.07051	link
2023-09-13	VRDMG: Vocal Restoration via Diffusion Posterior Sampling with Multiple Guidance	Carlos Hernandez-Olivan et.al.	2309.06934	null
2023-09-13	DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models	Namhyuk Ahn et.al.	2309.06933	null
2023-09-13	DCTTS: Discrete Diffusion Model with Contrastive Learning for Text-to-speech Generation	Zhichao Wu et.al.	2309.06787	null
2023-09-12	Adapt and Diffuse: Sample-adaptive Reconstruction via Latent Diffusion Models	Zalan Fabian et.al.	2309.06642	link
2023-09-12	InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation	Xingchao Liu et.al.	2309.06380	link
2023-09-12	Fg-T2M: Fine-Grained Text-Driven Human Motion Generation via Diffusion Model	Yin Wang et.al.	2309.06284	null
2023-09-15	Spreading speeds of a nonlocal diffusion model with free boundaries in the time almost periodic media	Chengcheng Cheng et.al.	2309.06190	null
2023-09-12	Dynamics and spreading speeds of a nonlocal diffusion model with advection and free boundaries	Chengcheng Cheng et.al.	2309.06185	null
2023-09-12	Elucidating the solution space of extended reverse-time SDE for diffusion models	Qinpeng Cui et.al.	2309.06169	link
2023-09-12	Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts	Zhi-Yi Chin et.al.	2309.06135	link
2023-09-12	A monotone numerical integration method for mean-variance portfolio optimization under jump-diffusion models	Hanwen Zhang et.al.	2309.05977	null
2023-09-12	Introducing Shape Prior Module in Diffusion Model for Medical Image Segmentation	Zhiqing Zhang et.al.	2309.05929	null
2023-09-11	Predicting the Radiation Field of Molecular Clouds using Denoising Diffusion Probabilistic Models	Duo Xu et.al.	2309.05811	null
2023-09-11	Revisiting Energy Based Models as Policies: Ranking Noise Contrastive Estimation and Interpolating Energy Models	Sumeet Singh et.al.	2309.05803	null
2023-09-11	Diffusion-based Adversarial Purification for Robust Deep MRI Reconstruction	Ismail Alkhouri et.al.	2309.05794	link
2023-09-11	PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models	Li Chen et.al.	2309.05793	null
2023-09-11	CaloClouds II: Ultra-Fast Geometry-Independent Highly-Granular Calorimeter Simulation	Erik Buhmann et.al.	2309.05704	link
2023-09-11	PAI-Diffusion: Constructing and Serving a Family of Open Chinese Diffusion Models for Text-to-image Synthesis on the Cloud	Chengyu Wang et.al.	2309.05534	null
2023-09-14	Treatment-aware Diffusion Probabilistic Model for Longitudinal MRI Generation and Diffuse Glioma Growth Prediction	Qinghui Liu et.al.	2309.05406	null
2023-09-11	Diff-Privacy: Diffusion-based Face Privacy Protection	Xiao He et.al.	2309.05330	null
2023-09-10	Learning Energy-Based Models by Cooperative Diffusion Recovery Likelihood	Yaxuan Zhu et.al.	2309.05153	link
2023-09-10	VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching	Yiwei Guo et.al.	2309.05027	link
2023-09-10	SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models	Shuchen Xue et.al.	2309.05019	link
2023-09-10	Prefix-diffusion: A Lightweight Diffusion Model for Diverse Image Captioning	Guisheng Liu et.al.	2309.04965	null
2023-09-10	Seismic Data Strong Noise Attenuation Based on Diffusion Model and Principal Component Analysis	Junheng Peng et.al.	2309.04944	link
2023-09-10	Text-driven Editing of 3D Scenes without Retraining	Shuangkang Fang et.al.	2309.04917	link
2023-09-09	Global Convergence of Receding-Horizon Policy Search in Learning Estimator Designs	Xiangyuan Zhang et.al.	2309.04831	link
2023-09-09	Influence Maximization in Social Networks: A Survey	Hui Li et.al.	2309.04668	null
2023-09-08	The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion	Yujin Jeong et.al.	2309.04509	null
2023-09-08	Create Your World: Lifelong Text-to-Image Diffusion	Gan Sun et.al.	2309.04430	null
2023-09-08	MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask	Yupeng Zhou et.al.	2309.04399	null
2023-09-08	MoEController: Instruction-based Arbitrary Image Manipulation with Mixture-of-Expert Controllers	Sijia Li et.al.	2309.04372	null
2023-09-08	From Text to Mask: Localizing Entities Using the Attention of Text-to-Image Diffusion Models	Changming Xiao et.al.	2309.04109	link
2023-09-07	DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection	Manlin Zhang et.al.	2309.03893	null
2023-09-07	Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption	Teng Hu et.al.	2309.03729	link
2023-09-07	DiffDefense: Defending against Adversarial Attacks via Diffusion Models	Hondamunige Prasanna Silva et.al.	2309.03702	link
2023-09-07	Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model	Sungwon Hwang et.al.	2309.03550	null
2023-09-07	Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation	Jiaxi Gu et.al.	2309.03549	null
2023-09-07	SyncDreamer: Generating Multiview-consistent Images from a Single-view Image	Yuan Liu et.al.	2309.03453	link
2023-09-07	Underwater Image Enhancement by Transformer-based Diffusion Model with Non-uniform Sampling for Skip Strategy	Yi Tang et.al.	2309.03445	link
2023-09-07	Mean field limits of particle-based stochastic reaction-drift-diffusion models	Max Heldman et.al.	2309.03431	null
2023-09-06	SADIR: Shape-Aware Diffusion Models for 3D Image Reconstruction	Nivetha Jayakumar et.al.	2309.03335	null
2023-09-06	My Art My Choice: Adversarial Protection Against Unruly AI	Anthony Rhodes et.al.	2309.03198	null
2023-09-06	Optical pulse induced ultrafast antiferrodistortive transition in SrTiO3	Saqeeb Adnan et.al.	2309.03172	null
2023-09-06	MCM: Multi-condition Motion Synthesis Framework for Multi-scenario	Zeyu Ling et.al.	2309.03031	null
2023-09-06	Predicting the emergence of localised dihedral patterns in models for dryland vegetation	Dan J. Hill et.al.	2309.02956	link
2023-09-06	Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter	Jinglong Wang et.al.	2309.02773	link
2023-09-05	Generative AI-aided Joint Training-free Secure Semantic Communications via Multi-modal Prompts	Hongyang Du et.al.	2309.02616	null
2023-09-05	Diffusion on the Probability Simplex	Griffin Floto et.al.	2309.02530	null
2023-09-05	Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models	Haixu Song et.al.	2309.02218	link
2023-09-05	Hierarchical Masked 3D Diffusion Model for Video Outpainting	Fanda Fan et.al.	2309.02119	null
2023-09-05	Diffusion-based 3D Object Detection with Random Boxes	Xin Zhou et.al.	2309.02049	null
2023-09-05	Diffusion Generative Inverse Design	Marin Vlastelica et.al.	2309.02040	null
2023-09-05	sasdim: self-adaptive noise scaling diffusion model for spatial time series imputation	Shunyang Zhang et.al.	2309.01988	null
2023-09-05	Efficient Bayesian Computational Imaging with a Surrogate Score-Based Prior	Berthy T. Feng et.al.	2309.01949	link
2023-09-05	Gradient Domain Diffusion Models for Image Synthesis	Yuanhao Gong et.al.	2309.01875	null
2023-09-04	Turbulent Flow Simulation using Autoregressive Conditional Diffusion Models	Georg Kohl et.al.	2309.01745	link
2023-09-07	Generative-based Fusion Mechanism for Multi-Modal Tracking	Zhangyong Tang et.al.	2309.01728	link
2023-09-04	ControlMat: A Controlled Generative Approach to Material Capture	Giuseppe Vecchio et.al.	2309.01700	null
2023-09-07	Improving Visual Quality and Transferability of Adversarial Attacks on Face Recognition Simultaneously with Adversarial Restoration	Fengfan Zhou et.al.	2309.01582	null
2023-09-04	DiffHPE: Robust, Coherent 3D Human Pose Lifting with Diffusion	Cédric Rommel et.al.	2309.01575	null
2023-09-04	Image denoising in photon-counting CT using PFGM++ with hijacked regularized sampling	Dennis Hein et.al.	2309.01553	link
2023-09-01	Iterative Multi-granular Image Editing using Diffusion Models	K J Joseph et.al.	2309.00613	null
2023-09-01	VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation	Xin Li et.al.	2309.00398	null
2023-09-01	Fast Diffusion EM: a diffusion model for blind inverse problems with application to deconvolution	Charles Laroche et.al.	2309.00287	link
2023-09-01	DiffuGen: Adaptable Approach for Generating Labeled Image Datasets using Stable Diffusion Models	Michael Shenoda et.al.	2309.00248	link
2023-09-01	Diffusion Model with Clustering-based Conditioning for Food Image Generation	Yue Han et.al.	2309.00199	null
2023-09-01	Breakdown of the drift-diffusion model for transverse spin transport in a disordered Pt film	K. D. Belashchenko et.al.	2309.00183	null
2023-08-31	BuilDiff: 3D Building Shape Generation using Single-Image Conditional Point Cloud Diffusion Models	Yao Wei et.al.	2309.00158	null
2023-08-31	InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion	Sirui Xu et.al.	2308.16905	link
2023-08-31	Diffusion Models for Interferometric Satellite Aperture Radar	Alexandre Tuel et.al.	2308.16847	link
2023-08-31	Unsupervised CT Metal Artifact Reduction by Plugging Diffusion Priors in Dual Domains	Xuan Liu et.al.	2308.16742	link
2023-08-31	Modelling of highly extended Gamma-ray emission around the Geminga Pulsar as detected with H.E.S.S	A. M. W. Mitchell et.al.	2308.16669	null
2023-08-31	Generate Your Own Scotland: Satellite Image Generation Conditioned on Maps	Miguel Espinosa et.al.	2308.16648	link
2023-08-31	MFR-Net: Multi-faceted Responsive Listening Head Generation via Denoising Diffusion Model	Jin Liu et.al.	2308.16635	null
2023-08-31	Any-Size-Diffusion: Toward Efficient Text-Driven Synthesis for Any-Size HD Images	Qingping Zheng et.al.	2308.16582	null
2023-08-31	Conditioning Score-Based Generative Models by Neuro-Symbolic Constraints	Davide Scassola et.al.	2308.16534	link
2023-08-31	MVDream: Multi-view Diffusion for 3D Generation	Yichun Shi et.al.	2308.16512	null
2023-08-30	A Recycling Training Strategy for Medical Image Segmentation with Diffusion Denoising Models	Yunguan Fu et.al.	2308.16355	link
2023-08-30	Ten Years of Generative Adversarial Nets (GANs): A survey of the state-of-the-art	Tanujit Chakraborty et.al.	2308.16316	null
2023-08-30	Modality Cycles with Masked Conditional Diffusion for Unsupervised Anomaly Segmentation in MRI	Ziyun Liang et.al.	2308.16150	link
2023-08-30	SignDiff: Learning Diffusion Models for American Sign Language Production	Sen Fang et.al.	2308.16082	null
2023-08-30	DiffuVolume: Diffusion Model for Volume based Stereo Matching	Dian Zheng et.al.	2308.15989	null
2023-08-30	Stage-by-stage Wavelet Optimization Refinement Diffusion Model for Sparse-View CT Reconstruction	Kai Xu et.al.	2308.15942	link
2023-08-30	Physics-Informed DeepMRI: Bridging the Gap from Heat Diffusion to k-Space Interpolation	Zhuo-Xu Cui et.al.	2308.15918	null
2023-08-30	Zero-shot Inversion Process for Image Attribute Editing with Diffusion Models	Zhanbo Feng et.al.	2308.15854	link
2023-08-30	A Dual-Zone Diffusion Model for High Energy Emissions of the Cygnus Cocoon	Shihong Zhan et.al.	2308.15831	null
2023-08-30	Intriguing Properties of Diffusion Models: A Large-Scale Dataset for Evaluating Natural Attack Capability in Text-to-Image Generative Models	Takami Sato et.al.	2308.15692	null
2023-08-30	Asymptotics for Short Maturity Asian Options in a Jump-Diffusion model with Local Volatility	Dan Pirjol et.al.	2308.15672	null
2023-08-29	ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer	Zachary Horvitz et.al.	2308.15459	link
2023-08-30	Elucidating the Exposure Bias in Diffusion Models	Mang Ning et.al.	2308.15321	link
2023-08-29	DiffusionVMR: Diffusion Model for Video Moment Retrieval	Henghao Zhao et.al.	2308.15109	null
2023-08-29	DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior	Xinqi Lin et.al.	2308.15070	link
2023-08-29	C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model	Longbin Ji et.al.	2308.15016	link
2023-08-28	Identifying and Mitigating the Security Risks of Generative AI	Clark Barrett et.al.	2308.14840	null
2023-08-28	Generating tabular datasets under differential privacy	Gianluca Truda et.al.	2308.14784	link
2023-08-30	Priority-Centric Human Motion Generation in Discrete Latent Space	Hanyang Kong et.al.	2308.14480	null
2023-08-28	Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization	Tao Yang et.al.	2308.14469	link
2023-08-28	Data-iterative Optimization Score Model for Stable Ultra-Sparse-View CT Reconstruction	Weiwen Wu et.al.	2308.14437	null
2023-08-28	Steerable Conditional Diffusion for Out-of-Distribution Adaptation in Imaging Inverse Problems	Riccardo Barbano et.al.	2308.14409	link
2023-08-28	InstructME: An Instruction Guided Music Edit And Remix Framework with Latent Diffusion Models	Bing Han et.al.	2308.14360	null
2023-08-28	DiffSmooth: Certifiably Robust Learning via Diffusion Models and Local Smoothing	Jiawei Zhang et.al.	2308.14333	link
2023-08-27	SketchDreamer: Interactive Text-Augmented Creative Sketch Ideation	Zhiyu Qu et.al.	2308.14191	link
2023-08-27	Diffusion Schrödinger Bridges for Bayesian Computation	Jeremy Heng et.al.	2308.14106	null
2023-08-27	Sparse3D: Distilling Multiview-Consistent Diffusion for Object Reconstruction from Sparse Views	Zi-Xin Zou et.al.	2308.14078	null
2023-08-26	Unsupervised Domain Adaptation via Domain-Adaptive Diffusion	Duo Peng et.al.	2308.13893	null
2023-08-26	The DiffuseStyleGesture+ entry to the GENEA Challenge 2023	Sicheng Yang et.al.	2308.13879	link
2023-08-26	Empowering Dynamics-aware Text-to-Video Diffusion with Large Language Models	Hao Fei et.al.	2308.13812	null
2023-08-26	DiffI2I: Efficient Diffusion Model for Image-to-Image Translation	Bin Xia et.al.	2308.13767	null
2023-08-25	Residual Denoising Diffusion Models	Jiawei Liu et.al.	2308.13712	link
2023-08-25	Region-Disentangled Diffusion Model for High-Fidelity PPG-to-ECG Translation	Debaditya Shome et.al.	2308.13568	link
2023-08-25	Distribution-Aligned Diffusion for Human Mesh Recovery	Lin Geng Foo et.al.	2308.13369	null
2023-08-25	EfficientDreamer: High-Fidelity and Robust 3D Creation via Orthogonal-view Diffusion Prior	Minda Zhao et.al.	2308.13223	link
2023-08-25	Diff-Retinex: Rethinking Low-light Image Enhancement with A Generative Diffusion Model	Xunpeng Yi et.al.	2308.13164	null
2023-08-25	A Survey of Diffusion Based Image Generation Models: Issues and Their Solutions	Tianyi Zhang et.al.	2308.13142	null
2023-08-24	Full-dose PET Synthesis from Low-dose PET Using High-efficiency Diffusion Denoising Probabilistic Model	Shaoyan Pan et.al.	2308.13072	link
2023-08-24	Dense Text-to-Image Generation with Attention Modulation	Yunji Kim et.al.	2308.12964	link
2023-08-24	Hydrogen jet diffusion modeling by using physics-informed graph neural network and sparsely-distributed sensor data	Xinqi Zhang et.al.	2308.12621	null
2023-08-24	APLA: Additional Perturbation for Latent Noise with Adversarial Training Enables Consistency	Yupu Yao et.al.	2308.12605	null
2023-08-23	Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion	Junjiao Tian et.al.	2308.12469	link
2023-08-23	InverseSR: 3D Brain MRI Super-Resolution Using a Latent Diffusion Model	Jueqi Wang et.al.	2308.12465	link
2023-08-23	Augmenting medical image classifiers with synthetic data from latent diffusion models	Luke W. Sagers et.al.	2308.12453	null
2023-08-23	Renormalizing Diffusion Models	Jordan Cotler et.al.	2308.12355	null
2023-08-23	Improving Generative Model-based Unfolding with Schrödinger Bridges	Sascha Diefenbacher et.al.	2308.12351	link
2023-08-23	Score diffusion models without early stopping: finite Fisher information is all you need	Giovanni Conforti et.al.	2308.12240	null
2023-08-25	Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning	Jiasheng Ye et.al.	2308.12219	link
2023-08-23	Quantum-Noise-driven Generative Diffusion Models	Marco Parigi et.al.	2308.12013	null
2023-08-23	High-quality Image Dehazing with Diffusion Model	Hu Yu et.al.	2308.11949	link
2023-08-23	Efficient Transfer Learning in Diffusion Models via Adversarial Noise	Xiyu Wang et.al.	2308.11948	null
2023-08-23	LongDanceDiff: Long-term Dance Generation with Conditional Diffusion Model	Siqi Yang et.al.	2308.11945	null
2023-08-23	Boosting Diffusion Models with an Adaptive Momentum Sampler	Xiyu Wang et.al.	2308.11941	null
2023-08-23	Audio Generation with Multiple Conditional Diffusion Model	Zhifang Guo et.al.	2308.11940	null
2023-08-23	Shape-conditioned 3D Molecule Generation via Equivariant Diffusion Models	Ziqi Chen et.al.	2308.11890	null
2023-08-22	IT3D: Improved Text-to-3D Generation with Explicit View Synthesis	Yiwen Chen et.al.	2308.11473	link
2023-08-22	Convergence guarantee for consistency models	Junlong Lyu et.al.	2308.11449	null
2023-08-22	MatFuse: Controllable Material Generation with Diffusion Models	Giuseppe Vecchio et.al.	2308.11408	link
2023-08-22	MusicJam: Visualizing Music Insights via Generated Narrative Illustrations	Chuer Chen et.al.	2308.11329	null
2023-08-22	DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment	Xujie Zhang et.al.	2308.11206	null
2023-08-22	Hey That’s Mine Imperceptible Watermarks are Preserved in Diffusion Generated Outputs	Luke Ditria et.al.	2308.11123	null
2023-08-21	TADA! Text to Animatable Digital Avatars	Tingting Liao et.al.	2308.10899	null
2023-08-23	Backdooring Textual Inversion for Concept Censorship	Yutong Wu et.al.	2308.10718	null
2023-08-21	EVE: Efficient zero-shot text-based Video Editing with Depth Map Guidance and Temporal Consistency Constraints	Yutao Chen et.al.	2308.10648	null
2023-08-21	Frequency Compensated Diffusion Model for Real-scene Dehazing	Jing Wang et.al.	2308.10510	link
2023-08-21	Texture Generation on 3D Meshes with Point-UV Diffusion	Xin Yu et.al.	2308.10490	null
2023-08-21	DySuse: Susceptibility Estimation in Dynamic Social Networks	Yingdan Shi et.al.	2308.10442	null
2023-08-21	Multi-GradSpeech: Towards Diffusion-based Multi-Speaker Text-to-speech Using Consistent Diffusion Models	Heyang Xue et.al.	2308.10428	null
2023-08-20	Turning Waste into Wealth: Leveraging Low-Quality Samples for Enhancing Continuous Conditional Generative Adversarial Networks	Xin Ding et.al.	2308.10273	link
2023-08-20	Make-It-4D: Synthesizing a Consistent Long-Term Dynamic Scene Video from a Single Image	Liao Shen et.al.	2308.10257	null
2023-08-20	Spiking-Diffusion: Vector Quantized Discrete Diffusion Model with Spiking Neural Networks	Mingxuan Liu et.al.	2308.10187	link
2023-08-20	Contrastive Diffusion Model with Auxiliary Guidance for Coarse-to-Fine PET Reconstruction	Zeyu Han et.al.	2308.10157	link
2023-08-20	SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-form Layout-to-Image Generation	Chengyou Jia et.al.	2308.10156	null
2023-08-20	Disorder-induced linear magnetoresistance in Al$_2$O$_3$/SrTiO$_3$ heterostructures	Gao Kuang Hong et.al.	2308.10152	null
2023-08-19	MeDM: Mediating Image Diffusion Models for Video-to-Video Translation with Temporal Correspondence Guidance	Ernie Chu et.al.	2308.10079	null
2023-08-19	ControlCom: Controllable Image Composition using Diffusion Model	Bo Zhang et.al.	2308.10040	link
2023-08-19	AltDiffusion: A Multilingual Text-to-Image Diffusion Model	Fulong Ye et.al.	2308.09991	link
2023-08-19	Physics-Guided Human Motion Capture with Pose Probability Modeling	Jingyi Ju et.al.	2308.09910	link
2023-08-19	DiffusionTrack: Diffusion Model For Multi-Object Tracking	Run Luo et.al.	2308.09905	link
2023-08-18	DiffCharge: Generating EV Charging Scenarios via a Denoising Diffusion Model	Siyang Li et.al.	2308.09857	link
2023-08-18	Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization	Soumik Mukhopadhyay et.al.	2308.09716	link
2023-08-16	TeCH: Text-guided Reconstruction of Lifelike Clothed Humans	Yangyi Huang et.al.	2308.08545	link
2023-08-16	Diff-CAPTCHA: An Image-based CAPTCHA with Security Enhanced by Denoising Diffusion Model	Ran Jiang et.al.	2308.08367	null
2023-08-18	Dual-Stream Diffusion Net for Text-to-Video Generation	Binhui Liu et.al.	2308.08316	null
2023-08-15	Interplay between particle trapping and heterogeneity in anomalous diffusion	Haroldo V. Ribeiro et.al.	2308.07989	null
2023-08-15	Monte Carlo guided Diffusion for Bayesian linear inverse problems	Gabriel Cardoso et.al.	2308.07983	link
2023-08-15	StyleDiffusion: Controllable Disentangled Style Transfer via Diffusion Models	Zhizhong Wang et.al.	2308.07863	null
2023-08-15	CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction	Yan Di et.al.	2308.07837	null
2023-08-15	Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with Image Diffusion Model	Bosheng Qin et.al.	2308.07749	null
2023-08-16	DiffGuard: Semantic Mismatch-Guided Out-of-Distribution Detection using Pre-trained Diffusion Models	Ruiyuan Gao et.al.	2308.07687	link
2023-08-15	Maat: Performance Metric Anomaly Anticipation for Cloud Services with Conditional Diffusion	Cheryl Lee et.al.	2308.07676	link
2023-08-15	Inversion-by-Inversion: Exemplar-based Sketch-to-Photo Synthesis via Stochastic Differential Equations without Training	Ximing Xing et.al.	2308.07665	link
2023-08-15	SGDiff: A Style Guided Diffusion Model for Fashion Synthesis	Zhengwentai Sun et.al.	2308.07605	link
2023-08-14	UniBrain: Unify Image Reconstruction and Captioning All in One Diffusion Model from Human Brain Activity	Weijian Mai et.al.	2308.07428	null
2023-08-14	U-Turn Diffusion	Hamidreza Behjoo et.al.	2308.07421	null
2023-08-14	DiffHopp: A Graph Diffusion Model for Novel Drug Design via Scaffold Hopping	Jos Torge et.al.	2308.07416	link
2023-08-14	Jurassic World Remake: Bringing Ancient Fossils Back to Life via Zero-Shot Long Image-to-Image Translation	Alexander Martin et.al.	2308.07316	link
2023-08-14	Bayesian Flow Networks	Alex Graves et.al.	2308.07037	link
2023-08-14	Discrete Conditional Diffusion for Reranking in Recommendation	Xiao Lin et.al.	2308.06982	null
2023-08-13	Well-posedness of a reaction-diffusion model with stochastic dynamical boundary conditions	Mario Maurelli et.al.	2308.06847	null
2023-08-13	Shape-guided Conditional Latent Diffusion Models for Synthesising Brain Vasculature	Yash Deo et.al.	2308.06781	null
2023-08-13	TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image Super-Resolution	Baolin Liu et.al.	2308.06743	link
2023-08-13	Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks	David Junhao Zhang et.al.	2308.06739	null
2023-08-13	Precipitation nowcasting with generative diffusion models	Andrea Asperti et.al.	2308.06733	link
2023-08-13	CLE Diffusion: Controllable Light Enhancement Diffusion Model	Yuyang Yin et.al.	2308.06725	null
2023-08-13	IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models	Hu Ye et.al.	2308.06721	null
2023-08-13	LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts	Binbin Yang et.al.	2308.06713	null
2023-08-12	Accelerating Diffusion-based Combinatorial Optimization Solvers by Progressive Distillation	Junwei Huang et.al.	2308.06644	link
2023-08-12	CMR exploration II – filament identification with machine learning	Duo Xu et.al.	2308.06641	null
2023-08-12	EquiDiff: A Conditional Equivariant Diffusion Model For Trajectory Prediction	Kehua Chen et.al.	2308.06564	null
2023-08-11	White-box Membership Inference Attacks against Diffusion Models	Yan Pang et.al.	2308.06405	null
2023-08-11	Mirror Diffusion Models	Jaesung Tae et.al.	2308.06342	null
2023-08-11	DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models	Weijia Wu et.al.	2308.06160	link
2023-08-11	Taming the Power of Diffusion Models for High-Quality Virtual Try-On with Appearance Flow	Junhong Gou et.al.	2308.06101	link
2023-08-11	Head Rotation in Denoising Diffusion Models	Andrea Asperti et.al.	2308.06057	link
2023-08-11	Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning	Chun-Mei Feng et.al.	2308.06038	link
2023-08-10	AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining	Haohe Liu et.al.	2308.05734	link
2023-08-10	PDE-Refiner: Achieving Accurate Long Rollouts with Neural PDE Solvers	Phillip Lippe et.al.	2308.05732	null
2023-08-10	Masked Diffusion as Self-supervised Representation Learner	Zixuan Pan et.al.	2308.05695	link
2023-08-10	Generative Diffusion Models for Radio Wireless Channel Modelling and Sampling	Ushnish Sengupta et.al.	2308.05583	null
2023-08-10	Beyond Deep Reinforcement Learning: A Tutorial on Generative Diffusion Models in Network Optimization	Hongyang Du et.al.	2308.05384	link
2023-08-09	Do Diffusion Models Suffer Error Propagation? Theoretical Analysis and Consistency Regularization	Yangming Li et.al.	2308.05021	null
2023-08-10	IDiff-Face: Synthetic-based Face Recognition through Fizzy Identity-Conditioned Diffusion Models	Fadi Boutros et.al.	2308.04995	link
2023-08-09	JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models	Peike Li et.al.	2308.04729	null
2023-08-08	Semi-Supervised Semantic Segmentation of Cell Nuclei via Diffusion-based Large-Scale Pre-Training and Collaborative Learning	Zhuchen Shao et.al.	2308.04578	null
2023-08-08	3D Scene Diffusion Guidance using Scene Graphs	Mohammad Naanaa et.al.	2308.04468	null
2023-08-08	DiffCR: A Fast Conditional Diffusion Framework for Cloud Removal from Optical Satellite Images	Xuechao Zou et.al.	2308.04417	link
2023-08-08	Cloth2Tex: A Customized Cloth Texture Generation Pipeline for 3D Virtual Try-On	Daiheng Gao et.al.	2308.04288	null
2023-08-08	Synthetic Augmentation with Large-scale Unconditional Pre-training	Jiarong Ye et.al.	2308.04020	link
2023-08-08	Target Speech Extraction with Conditional Diffusion Model	Naoyuki Kamo et.al.	2308.03987	null
2023-08-07	A staggered-in-time and non-conforming-in-space numerical framework for realistic cardiac electrophysiology outputs	Elena Zappon et.al.	2308.03884	null
2023-08-07	CaloDiffusion with GLaM for High Fidelity Calorimeter Simulation	Oz Amram et.al.	2308.03876	link
2023-08-07	CaloScore v2: Single-shot Calorimeter Shower Simulation with Diffusion Models	Vinicius Mikuni et.al.	2308.03847	link
2023-08-07	Linear Convergence Bounds for Diffusion Models via Stochastic Localization	Joe Benton et.al.	2308.03686	null
2023-08-07	Diffusion Model in Causal Inference with Unmeasured Confounders	Tatsuhiro Shimizu et.al.	2308.03669	link
2023-08-07	AvatarVerse: High-quality & Stable 3D Avatar Creation from Text and Pose	Huichao Zhang et.al.	2308.03610	link
2023-08-10	DiffSynth: Latent In-Iteration Deflickering for Realistic Video Synthesis	Zhongjie Duan et.al.	2308.03463	link
2023-08-07	Energy-Guided Diffusion Model for CBCT-to-CT Synthesis	Linjie Fu et.al.	2308.03354	null
2023-08-06	Photorealistic and Identity-Preserving Image-Based Emotion Manipulation with Latent Diffusion Models	Ioannis Pikoulis et.al.	2308.03183	link
2023-08-05	Generative Approach for Probabilistic Human Mesh Recovery using Diffusion Models	Hanbyel Cho et.al.	2308.02963	link
2023-08-05	DermoSegDiff: A Boundary-aware Segmentation Diffusion Model for Skin Lesion Delineation	Afshin Bozorgpour et.al.	2308.02959	link
2023-08-05	DiffDance: Cascaded Human Motion Diffusion Model for Dance Generation	Qiaosong Qi et.al.	2308.02915	null
2023-08-05	Sketch and Text Guided Diffusion Model for Colored Point Cloud Generation	Zijie Wu et.al.	2308.02874	null
2023-08-05	Thin On-Sensor Nanophotonic Array Cameras	Praneeth Chakravarthula et.al.	2308.02797	null
2023-08-04	A geometric singular perturbation analysis of generalised shock selection rules in reaction-nonlinear diffusion models	Bronwyn H Bradshaw-Hajek et.al.	2308.02719	null
2023-08-04	Diffusion-Augmented Depth Prediction with Sparse Annotations	Jiaqi Li et.al.	2308.02283	null
2023-08-04	Painterly Image Harmonization using Diffusion Model	Lingxiao Lu et.al.	2308.02228	link
2023-08-04	Towards Personalized Prompt-Model Retrieval for Generative Recommendation	Yuanhe Guo et.al.	2308.02205	link
2023-08-04	Optimal Control of Stationary Doubly Diffusive Flows on Two and Three Dimensional Bounded Lipschitz Domains: A Theoretical Study	Jai Tushar et.al.	2308.02178	null
2023-08-04	Improved Order Analysis and Design of Exponential Integrator for Diffusion Models Sampling	Qinsheng Zhang et.al.	2308.02157	null
2023-08-04	SDDM: Score-Decomposed Diffusion Models on Manifolds for Unpaired Image-to-Image Translation	Shikun Sun et.al.	2308.02154	null
2023-08-03	On the Biometric Capacity of Generative Face Models	Vishnu Naresh Boddeti et.al.	2308.02065	null
2023-08-03	Diffusion Models for Counterfactual Generation and Anomaly Detection in Brain Images	Alessandro Fontanella et.al.	2308.02062	link
2023-08-03	Synthesizing Long-Term Human Motions with Diffusion Models via Coherent Sampling	Zhao Yang et.al.	2308.01850	link
2023-08-03	DiffColor: Toward High Fidelity Text-Guided Image Colorization with Diffusion Models	Jianxin Lin et.al.	2308.01655	null
2023-08-03	Reference-Free Isotropic 3D EM Reconstruction using Diffusion Models	Kyungryun Lee et.al.	2308.01594	null
2023-08-03	Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS	Myeongjin Ko et.al.	2308.01573	link
2023-08-03	Motion Planning Diffusion: Learning and Planning of Robot Motions with Diffusion Models	Joao Carvalho et.al.	2308.01557	null
2023-08-03	MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies	Ke Chen et.al.	2308.01546	link
2023-08-02	Reverse Stable Diffusion: What prompt was used to generate this image?	Florinel-Alin Croitoru et.al.	2308.01472	link
2023-08-02	Patched Denoising Diffusion Models For High-Resolution Image Synthesis	Zheng Ding et.al.	2308.01316	link
2023-08-02	Contrast-augmented Diffusion Model with Fine-grained Sequence Alignment for Markup-to-Image Generation	Guojin Zhong et.al.	2308.01147	link
2023-08-02	Exploiting Synthetic Data for Data Imbalance Problems: Baselines from a Data Perspective	Moon Ye-Bin et.al.	2308.00994	null
2023-08-01	Radial Evolution in a Reaction-Diffusion Model	Sofia M. Silveira et.al.	2308.00671	null
2023-08-01	Diffusion Model for Camouflaged Object Detection	Zhennan Chen et.al.	2308.00303	null
2023-08-02	EC-Conf: An Ultra-fast Diffusion Model for Molecular Conformation Generation with Equivariant Consistency	Zhiguang Fan et.al.	2308.00237	link
2023-07-31	DAVIS: High-Quality Audio-Visual Separation with Generative Diffusion Models	Chao Huang et.al.	2308.00122	null
2023-08-02	Universal Adversarial Defense in Remote Sensing Based on Pre-trained Denoising Diffusion Models	Weikang Yu et.al.	2307.16865	link
2023-07-31	DiffPose: SpatioTemporal Diffusion Model for Video-Based Human Pose Estimation	Runyang Feng et.al.	2307.16687	null
2023-08-03	On the Trustworthiness Landscape of State-of-the-art Generative Models: A Comprehensive Survey	Mingyuan Fan et.al.	2307.16680	null
2023-07-31	Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech	Guangyan Zhang et.al.	2307.16679	null
2023-07-31	Contrastive Conditional Latent Diffusion for Audio-visual Segmentation	Yuxin Mao et.al.	2307.16579	null
2023-07-31	DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial Training	Hyung-Seok Oh et.al.	2307.16549	link
2023-07-31	Don’t be so negative! Score-based Generative Modeling with Oracle-assisted Guidance	Saeid Naderiparizi et.al.	2307.16463	null
2023-07-31	MetaDiff: Meta-Learning with Conditional Diffusion for Few-Shot Learning	Baoquan Zhang et.al.	2307.16424	null
2023-07-31	Mapping brain microstructure in vivo in health and disease using diffusion MRI	Ying Liao et.al.	2307.16386	link
2023-07-31	MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text	Junchen Zhu et.al.	2307.16371	null
2023-07-30	TransFusion: A Practical and Effective Transformer-based Diffusion Model for 3D Human Motion Prediction	Sibo Tian et.al.	2307.16106	link
2023-07-29	UniBriVL: Robust Universal Representation and Generation of Audio Driven Diffusion Models	Sen Fang et.al.	2307.15898	null
2023-07-29	Parameter identifiability in PDE models of fluorescence recovery after photobleaching	Maria-Veronica Ciocanel et.al.	2307.15857	null
2023-07-28	Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding	Chunyu Qiang et.al.	2307.15484	null
2023-07-27	Generative AI for Medical Imaging: extending the MONAI Framework	Walter H. L. Pinaya et.al.	2307.15208	link
2023-07-27	LLDiffusion: Learning Degradation Representations in Diffusion Models for Low-Light Image Enhancement	Tao Wang et.al.	2307.14659	link
2023-07-29	Imitating Complex Trajectories: Bridging Low-Level Stability and High-Level Behavior	Adam Block et.al.	2307.14619	null
2023-07-26	Visual Instruction Inversion: Image Editing via Visual Prompting	Thao Nguyen et.al.	2307.14331	link
2023-07-26	Founding a mathematical diffusion model in linguistics. The case study of German syntactic features in the North-Eastern Italian dialects	I. Lazzizzera et.al.	2307.14291	null
2023-07-26	VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by Using Diffusion Model with ControlNet	Zhihao Hu et.al.	2307.14073	null
2023-07-27	Pre-Training with Diffusion models for Dental Radiography segmentation	Jérémy Rousseau et.al.	2307.14066	null
2023-07-26	MCMC-Correction of Score-Based Diffusion Models for Model Composition	Anders Sjöberg et.al.	2307.14012	link
2023-07-26	How Does Diffusion Influence Pretrained Language Models on Out-of-Distribution Data?	Huazheng Wang et.al.	2307.13949	link
2023-07-26	Points-to-3D: Bridging the Gap between Sparse Points and Shape-Controllable Text-to-3D Generation	Chaohui Yu et.al.	2307.13908	null
2023-07-25	Composite Diffusion | whole >= Σparts	Vikram Jamwal et.al.	2307.13720
2023-07-25	Score-based Diffusion Models for Generating Liquid Argon Time Projection Chamber Images	Zeviel Imani et.al.	2307.13687	link
2023-07-25	Fake It Without Making It: Conditioned Face Generation for Accurate 3D Face Shape Estimation	Will Rowan et.al.	2307.13639	null
2023-07-25	XDLM: Cross-lingual Diffusion Language Model for Machine Translation	Linyao Chen et.al.	2307.13560	null
2023-07-25	Not with my name! Inferring artists’ names of input strings employed by Diffusion Models	Roberto Leotta et.al.	2307.13527	link
2023-07-25	Modelling functionalized drug release for a spherical capsule	Elliot J. Carr et.al.	2307.13224	link
2023-07-24	Deep Learning Approaches for Data Augmentation in Medical Imaging: A Review	Aghiles Kebaili et.al.	2307.13125	null
2023-07-24	Data-free Black-box Attack based on Diffusion Model	Mingwen Shao et.al.	2307.12872	link
2023-07-24	Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry	Yong-Hyun Park et.al.	2307.12868	link
2023-07-24	TransFusion: Generating Long, High Fidelity Time Series using Diffusion Models with Transformers	Md Fahim Sikder et.al.	2307.12667	link
2023-07-24	Interpolating between Images with Diffusion Models	Clinton J. Wang et.al.	2307.12560	null
2023-07-24	AdvDiff: Generating Unrestricted Adversarial Examples using Diffusion Models	Xuelong Dai et.al.	2307.12499	link
2023-07-25	TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition	Shilin Lu et.al.	2307.12493	link
2023-07-25	ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting	Zongsheng Yue et.al.	2307.12348	link
2023-07-23	TabADM: Unsupervised Tabular Anomaly Detection with Diffusion Models	Guy Zamberg et.al.	2307.12336	null
2023-07-23	An axiomatized PDE model of deep neural networks	Tangjun Wang et.al.	2307.12333	null
2023-07-22	PLANTAIN: Diffusion-inspired Pose Score Minimization for Fast and Accurate Molecular Docking	Michael Brocidiacono et.al.	2307.12090	link
2023-07-22	Iterative Reconstruction Based on Latent Diffusion Model for Sparse Data Reconstruction	Linchao He et.al.	2307.12070	null
2023-07-22	FSDiffReg: Feature-wise and Score-wise Diffusion-guided Unsupervised Deformable Image Registration for Cardiac Images	Yi Qin et.al.	2307.12035	link
2023-07-21	PartDiff: Image Super-resolution with Partial Diffusion Models	Kai Zhao et.al.	2307.11926	null
2023-07-21	Learning minimal representations of stochastic processes with variational autoencoders	Gabriel Fernández-Fernández et.al.	2307.11608	link
2023-07-21	Predict, Refine, Synthesize: Self-Guiding Diffusion Models for Probabilistic Time Series Forecasting	Marcel Kollovieh et.al.	2307.11494	link
2023-07-21	Subject-Diffusion: Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning	Jian Ma et.al.	2307.11410	link
2023-07-20	Dehazing Ultrasound using Diffusion Models	Tristan S. W. Stevens et.al.	2307.11204	null
2023-07-20	Diffusion Models for Probabilistic Deconvolution of Galaxy Images	Zhiwei Xue et.al.	2307.11122	link
2023-07-20	Diffusion Sampling with Momentum for Mitigating Divergence Artifacts	Suttisak Wizadwongsa et.al.	2307.11118	link
2023-07-20	Progressive distillation diffusion for raw music generation	Svetlana Pavlova et.al.	2307.10994	null
2023-07-20	Structure-preserving schemes for drift-diffusion systems on general meshes: DDFV vs HFV	Stella Krell et.al.	2307.10911	null
2023-07-20	BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion	Jinheng Xie et.al.	2307.10816	link
2023-07-21	AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models	Jiachun Pan et.al.	2307.10711	link
2023-07-20	Reference-based Painterly Inpainting via Diffusion: Crossing the Wild Reference Domain Gap	Dejia Xu et.al.	2307.10584	null
2023-07-19	PreDiff: Precipitation Nowcasting with Latent Diffusion Models	Zhihan Gao et.al.	2307.10422	link
2023-07-19	TokenFlow: Consistent Diffusion Features for Consistent Video Editing	Michal Geyer et.al.	2307.10373	null
2023-07-19	Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls	Lejun Min et.al.	2307.10304	link
2023-07-18	Modeling pattern formation in communities by using information particles	Junichi Miyakoshi et.al.	2307.10270	null
2023-07-19	FABRIC: Personalizing Diffusion Models with Iterative Feedback	Dimitri von Rütte et.al.	2307.10159	link
2023-07-19	Make-A-Volume: Leveraging Latent Diffusion Models for Cross-Modality 3D Brain MRI Synthesis	Lingting Zhu et.al.	2307.10094	null
2023-07-19	Modelling the Spatial Spread of COVID-19 in a German District using a Diffusion Model	Moritz Schäfer et.al.	2307.09956	null
2023-07-19	BSDM: Background Suppression Diffusion Model for Hyperspectral Anomaly Detection	Jitao Ma et.al.	2307.09861	link
2023-07-19	A Siamese-based Verification System for Open-set Architecture Attribution of Synthetic Images	Lydia Abady et.al.	2307.09822	link
2023-07-19	DiffDP: Radiotherapy Dose Prediction via a Diffusion Model	Zhenghao Feng et.al.	2307.09794	null
2023-07-19	Text2Layer: Layered Image Generation using Latent Diffusion Model	Xinyang Zhang et.al.	2307.09781	null
2023-07-18	An approximate maximum likelihood estimator of drift parameters in a multidimensional diffusion model	Miljenko Huzak et.al.	2307.09199	null
2023-07-18	DiTTO: Diffusion-inspired Temporal Transformer Operator	Oded Ovadia et.al.	2307.09072	null
2023-07-18	Towards Authentic Face Restoration with Iterative Diffusion Models and Beyond	Yang Zhao et.al.	2307.08996	null
2023-07-17	Autoregressive Diffusion Model for Graph Generation	Lingkai Kong et.al.	2307.08849	null
2023-07-17	Diffusion Models Beat GANs on Image Classification	Soumik Mukhopadhyay et.al.	2307.08702	null
2023-07-17	SEMI-DiffusionInst: A Diffusion Model Based Approach for Semiconductor Defect Classification and Segmentation	Vic De Ridder et.al.	2307.08693	null
2023-07-17	Identity-Preserving Aging of Face Images via Latent Diffusion Models	Sudipta Banerjee et.al.	2307.08585	link
2023-07-17	Synthetic Lagrangian Turbulence by Generative Diffusion Models	Tianyi Li et.al.	2307.08529	link
2023-07-17	Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation	Luozhou Wang et.al.	2307.08448	link
2023-07-18	Unstoppable Attack: Label-Only Model Inversion via Conditional Diffusion Model	Rongke Liu et.al.	2307.08424	null
2023-07-17	Complexity Matters: Rethinking the Latent Space for Generative Modeling	Tianyang Hu et.al.	2307.08283	null
2023-07-17	Manifold-Guided Sampling in Diffusion Models for Unbiased Image Generation	Xingzhe Su et.al.	2307.08199	null
2023-07-16	Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency	Bowen Song et.al.	2307.08123	link
2023-07-16	Discovering a reaction-diffusion model for Alzheimer’s disease by combining PINNs with symbolic regression	Zhen Zhang et.al.	2307.08107	null
2023-07-16	Diffusion to Confusion: Naturalistic Adversarial Patch Generation Based on Diffusion Model for Object Detector	Shuo-Yen Lin et.al.	2307.08076	null
2023-07-16	LafitE: Latent Diffusion Model with Feature Editing for Unsupervised Multi-class Anomaly Detection	Haonan Yin et.al.	2307.08059	null
2023-07-16	Noise-aware Speech Enhancement using Diffusion Probabilistic Model	Yuchen Hu et.al.	2307.08029	link
2023-07-15	ExposureDiffusion: Learning to Expose for Low-light Image Enhancement	Yufei Wang et.al.	2307.07710	link
2023-07-14	NIFTY: Neural Object Interaction Fields for Guided Human Motion Synthesis	Nilesh Kulkarni et.al.	2307.07511	null
2023-07-14	Inverse Evolution Layers: Physics-informed Regularizers for Deep Neural Networks	Chaoyu Liu et.al.	2307.07344	null
2023-07-14	Multimodal Motion Conditioned Diffusion Model for Skeleton-based Video Anomaly Detection	Alessandro Flaborea et.al.	2307.07205	link
2023-07-14	Federated Learning-Empowered AI-Generated Content in Wireless Networks	Xumin Huang et.al.	2307.07146	null
2023-07-13	Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement	Hui Yuan et.al.	2307.07055	null
2023-07-13	HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models	Nataniel Ruiz et.al.	2307.06949	null
2023-07-14	PC-Droid: Faster diffusion and improved quality for particle cloud generation	Matthew Leigh et.al.	2307.06836	null
2023-07-13	AvatarFusion: Zero-shot Generation of Clothing-Decoupled 3D Avatars Using 2D Diffusion	Shuo Huang et.al.	2307.06526	null
2023-07-13	Improving Nonalcoholic Fatty Liver Disease Classification Performance With Latent Diffusion Models	Romain Hardy et.al.	2307.06507	null
2023-07-12	Exposing the Fake: Effective Diffusion-Generated Images Detection	Ruipeng Ma et.al.	2307.06272	null
2023-07-12	Diffusion Based Multi-Agent Adversarial Tracking	Sean Ye et.al.	2307.06244	null
2023-07-12	Towards Safe Self-Distillation of Internet-Scale Text-to-Image Diffusion Models	Sanghyun Kim et.al.	2307.05977	link
2023-07-11	WHFast512: A symplectic N-body integrator for planetary systems optimized with AVX512 instructions	Pejvak Javaheri et.al.	2307.05683	link
2023-07-07	AutoDecoding Latent 3D Diffusion Models	Evangelos Ntavelis et.al.	2307.05445	link
2023-07-11	Metropolis Sampling for Constrained Diffusion Models	Nic Fishman et.al.	2307.05439	null
2023-07-11	Geometric Neural Diffusion Processes	Emile Mathieu et.al.	2307.05431	link
2023-07-11	On the Vulnerability of DeepFake Detectors to Attacks Generated by Denoising Diffusion Models	Marija Ivanovska et.al.	2307.05397	null
2023-07-11	Diffusion idea exploration for art generation	Nikhil Verma et.al.	2307.04978	null
2023-07-10	Articulated 3D Head Avatar Generation using Text-to-Image Diffusion Models	Alexander W. Bergman et.al.	2307.04859	null
2023-07-10	Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback	Jaskirat Singh et.al.	2307.04749	null
2023-07-10	Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning	Suzan Ece Ada et.al.	2307.04726	null
2023-07-10	AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning	Yuwei Guo et.al.	2307.04725	link
2023-07-10	Timbre transfer using image-to-image denoising diffusion models	Luca Comanducci et.al.	2307.04586	null
2023-07-10	Enhancing Adversarial Robustness via Score-Based Optimization	Boya Zhang et.al.	2307.04333	link
2023-07-11	DIFF-NST: Diffusion Interleaving For deFormable Neural Style Transfer	Dan Ruta et.al.	2307.04157	null
2023-07-08	Measuring the Success of Diffusion Models at Imitating Human Artists	Stephen Casper et.al.	2307.04028	null
2023-07-08	Stimulating the Diffusion Model for Image Denoising via Adaptive Embedding and Ensembling	Tong Li et.al.	2307.03992	link
2023-07-07	Nonresonant scattering of energetic electrons by electromagnetic ion cyclotron waves: spacecraft observations and theoretical framework	Xin An et.al.	2307.03795	null
2023-07-07	Unsupervised 3D out-of-distribution detection with latent diffusion models	Mark S. Graham et.al.	2307.03777	link
2023-07-07	IPO-LDM: Depth-aided 360-degree Indoor RGB Panorama Outpainting via Latent Diffusion Model	Tianhao Wu et.al.	2307.03177	null
2023-07-06	Patterning of nonlocal transport models in biology: the impact of spatial dimension	Thomas Jun Jewell et.al.	2307.03117	null
2023-07-06	How to Detect Unauthorized Data Usages in Text-to-image Diffusion Models	Zhenting Wang et.al.	2307.03108	link
2023-07-06	On the Cultural Gap in Text-to-Image Generation	Bingshuai Liu et.al.	2307.02971	null
2023-07-06	Probabilistic and Semantic Descriptions of Image Manifolds and Their Applications	Peter Tu et.al.	2307.02881	null
2023-07-06	A Critical Look at the Current Usage of Foundation Model for Dense Recognition Task	Shiqi Yang et.al.	2307.02862	null
2023-07-06	Censored Sampling of Diffusion Models Using 3 Minutes of Human Feedback	TaeHo Yoon et.al.	2307.02770	link
2023-07-06	Towards Symmetry-Aware Generation of Periodic Materials	Youzhi Luo et.al.	2307.02707	link
2023-07-06	Applying a Color Palette with Local Control using Diffusion Models	Vaibhav Vavilala et.al.	2307.02698	link
2023-07-05	Pattern formation and bifurcation analysis of delay induced fractional-order epidemic spreading on networks	Jiaying Zhou et.al.	2307.02669	null
2023-07-05	Diffusion Models for Computational Design at the Example of Floor Plans	Joern Ploennigs et.al.	2307.02511	link
2023-07-05	DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models	Chong Mou et.al.	2307.02421	link
2023-07-05	RADiff: Controllable Diffusion Models for Radio Astronomical Maps Generation	Renato Sortino et.al.	2307.02392	null
2023-07-05	Detecting Images Generated by Deep Diffusion Models using their Local Intrinsic Dimensionality	Peter Lorenz et.al.	2307.02347	link
2023-07-05	SVDM: Single-View Diffusion Model for Pseudo-Stereo 3D Object Detection	Yuguang Shi et.al.	2307.02270	null
2023-07-05	Self-supervised learning with diffusion-based multichannel speech enhancement for speaker verification under noisy conditions	Sandipana Dowerah et.al.	2307.02244	null
2023-07-05	DiffFlow: A Unified SDE Framework for Score-Based Diffusion Models and Generative Adversarial Networks	Jingwei Zhang et.al.	2307.02159	null
2023-07-05	Prompting Diffusion Representations for Cross-Domain Semantic Segmentation	Rui Gong et.al.	2307.02138	null
2023-07-05	Monte Carlo Sampling without Isoperimetry: A Reverse Diffusion Approach	Xunpeng Huang et.al.	2307.02037	null
2023-07-04	Hybrid Neural Diffeomorphic Flow for Shape Representation and Generation via Triplane	Kun Han et.al.	2307.01957	null
2023-07-04	SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis	Dustin Podell et.al.	2307.01952	link
2023-07-04	ProtoDiffusion: Classifier-Free Diffusion Guidance with Prototype Learning	Gulcin Baykal et.al.	2307.01924	link
2023-07-04	Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning	Xiang Li et.al.	2307.01849	link
2023-07-04	Stochastic and self-consistent 3D modeling of streamer discharge trees with Kinetic Monte Carlo	Robert Marskar et.al.	2307.01797	link
2023-07-04	On the Constrained Time-Series Generation Problem	Andrea Coletta et.al.	2307.01717	null
2023-07-04	Disentanglement in a GAN for Unconditional Speech Synthesis	Matthew Baas et.al.	2307.01673	link
2023-07-04	SwinGNN: Rethinking Permutation Invariance in Diffusion Models for Graph Generation	Qi Yan et.al.	2307.01646	link
2023-07-04	Unsupervised Video Anomaly Detection with Diffusion Models Conditioned on Compact Motion Representations	Anil Osman Tur et.al.	2307.01533	link
2023-07-04	LEAT: Towards Robust Deepfake Disruption in Real-World Scenarios via Latent Ensemble Attack	Joonkyo Shim et.al.	2307.01520	null
2023-07-04	Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning	Zhuoran Li et.al.	2307.01472	null
2023-07-03	Squeezing Large-Scale Diffusion Models for Mobile	Jiwoong Choi et.al.	2307.01193	null
2023-06-30	Practical and Asymptotically Exact Conditional Sampling in Diffusion Models	Luhuan Wu et.al.	2306.17775	link
2023-06-30	Content-Preserving Diffusion Model for Unsupervised AS-OCT image Despeckling	Li Sanqian et.al.	2306.17717	null
2023-06-30	Counting Guidance for High Fidelity Text-to-Image Synthesis	Wonjun Kang et.al.	2306.17567	null
2023-06-30	Class-Incremental Learning using Diffusion Model for Distillation and Replay	Quentin Jodelet et.al.	2306.17560	null
2023-06-29	Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models	Simian Luo et.al.	2306.17203	link
2023-06-29	Generate Anything Anywhere in Any Scene	Yuheng Li et.al.	2306.17154	null
2023-06-29	Filtered-Guided Diffusion: Fast Filter Guidance for Black-Box Diffusion Models	Zeqi Gu et.al.	2306.17141	link
2023-06-29	ID-Pose: Sparse-view Camera Pose Estimation by Inverting Diffusion Models	Weihao Cheng et.al.	2306.17140	null
2023-07-03	Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation	Zibo Zhao et.al.	2306.17115	link
2023-06-29	Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation	Zhongwei Qiu et.al.	2306.17074	null
2023-06-29	One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization	Minghua Liu et.al.	2306.16928	link
2023-06-28	PFB-Diff: Progressive Feature Blending Diffusion for Text-driven Image Editing	Wenjing Huang et.al.	2306.16894	link
2023-06-29	SaGess: Sampling Graph Denoising Diffusion Model for Scalable Graph Generation	Stratis Limnios et.al.	2306.16827	null
2023-06-29	Graph Denoising Diffusion for Inverse Protein Folding	Kai Yi et.al.	2306.16819	link
2023-06-29	DiffusionSTR: Diffusion Model for Scene Text Recognition	Masato Fujitake et.al.	2306.16707	null
2023-06-29	Self-Supervised MRI Reconstruction with Unrolled Diffusion Models	Yilmaz Korkmaz et.al.	2306.16654	link
2023-06-28	DoseDiff: Distance-aware Diffusion Model for Dose Prediction in Radiotherapy	Yiwen Zhang et.al.	2306.16324	link
2023-06-28	SVNR: Spatially-variant Noise Removal with Denoising Diffusion	Naama Pearl et.al.	2306.16052	null
2023-06-28	GeXSe (Generative Explanatory Sensor System): An Interpretable Deep Generative Model for Human Activity Recognition in Smart Spaces	Yuan Sun et.al.	2306.15857	null
2023-06-27	Easing Color Shifts in Score-Based Diffusion Models	Katherine Deck et.al.	2306.15832	link
2023-06-26	Restart Sampling for Improving Generative Processes	Yilun Xu et.al.	2306.14878	link
2023-06-26	ProtoDiff: Learning to Learn Prototypical Networks by Task-Guided Diffusion	Yingjun Du et.al.	2306.14770	link
2023-06-26	DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models	Ximing Xing et.al.	2306.14685	link
2023-06-26	A-STAR: Test-time Attention Segregation and Retention for Text-to-image Synthesis	Aishwarya Agarwal et.al.	2306.14544	null
2023-06-27	DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing	Yujun Shi et.al.	2306.14435	link
2023-06-26	Decompose and Realign: Tackling Condition Misalignment in Text-to-Image Diffusion Models	Luozhou Wang et.al.	2306.14408	link
2023-06-25	CDiffMR: Can We Replace the Gaussian Noise with K-Space Undersampling for Fast MRI?	Jiahao Huang et.al.	2306.14350	link
2023-06-25	Diffusion Model Based Low-Light Image Enhancement for Space Satellite	Yiman Zhu et.al.	2306.14227	null
2023-06-25	DomainStudio: Fine-Tuning Diffusion Models for Domain-Driven Image Generation using Limited Data	Jingyuan Zhu et.al.	2306.14153	null
2023-06-25	YOLO-based Semantic Communication with Generative AI-aided Resource Allocation for Digital Twins Construction	Baoxia Du et.al.	2306.14138	null
2023-06-25	DiffMix: Diffusion Model-based Data Synthesis for Nuclei Segmentation and Classification in Imbalanced Pathology Image Datasets	Hyun-Jic Oh et.al.	2306.14132	null
2023-06-24	SEEDS: Emulation of Weather Forecast Ensembles with Diffusion Models	Lizao Li et.al.	2306.14066	null
2023-06-24	DiffDTM: A conditional structure-free framework for bioactive molecules generation targeted for dual proteins	Lei Huang et.al.	2306.13957	null
2023-06-23	The role of convection in the limit shape of the critical front profile for Born-Infeld diffusion models	Maurizio Garrione et.al.	2306.13806	null
2023-06-23	Asymptotic study of critical wave fronts for parameter-dependent Born-Infeld models: physically predicted behaviors and new phenomena	Maurizio Garrione et.al.	2306.13788	null
2023-06-23	Zero-shot spatial layout conditioning for text-to-image diffusion models	Guillaume Couairon et.al.	2306.13754	null
2023-06-23	Decoupled Diffusion Models with Explicit Transition Probability	Yuhang Huang et.al.	2306.13720	link
2023-06-23	DreamEditor: Text-Driven 3D Scene Editing with Neural Fields	Jingyu Zhuang et.al.	2306.13455	link
2023-06-23	DiffInfinite: Large Mask-Image Synthesis via Parallel Random Patch Diffusion in Histopathology	Marco Aversa et.al.	2306.13384	link
2023-06-22	Directional diffusion models for graph representation learning	Run Yang et.al.	2306.13210	null
2023-06-22	Continuous Layout Editing of Single Images with Diffusion Models	Zhiyuan Zhang et.al.	2306.13078	null
2023-06-22	Towards More Realistic Membership Inference Attacks on Large Diffusion Models	Jan Dubiński et.al.	2306.12983	null
2023-06-22	DiffWA: Diffusion Models for Watermark Attack	Xinyu Li et.al.	2306.12790	null
2023-06-22	A prior regularized full waveform inversion using generative diffusion models	Fu Wang et.al.	2306.12776	null
2023-06-22	One at A Time: Multi-step Volumetric Probability Distribution Diffusion for Depth Estimation	Bohan Li et.al.	2306.12681	null
2023-06-23	Semi-Implicit Denoising Diffusion Models (SIDDMs)	Yanwu Xu et.al.	2306.12511	link
2023-06-21	DreamTime: An Improved Optimization Strategy for Text-to-3D Content Creation	Yukun Huang et.al.	2306.12422	null
2023-06-21	Diffusion Posterior Sampling for Informed Single-Channel Dereverberation	Jean-Marie Lemercier et.al.	2306.12286	link
2023-06-21	HumanDiffusion: diffusion model using perceptual gradients	Yota Ueda et.al.	2306.12169	null
2023-06-21	DiffuseIR: Diffusion Models For Isotropic Reconstruction of 3D Microscopic Images	Mingjie Pan et.al.	2306.12109	null
2023-06-21	HSR-Diff: Hyperspectral Image Super-Resolution via Conditional Diffusion Models	Chanyue Wu et.al.	2306.12085	null
2023-06-21	Ambigram Generation by A Diffusion Model	Takahiro Shirakawa et.al.	2306.12049	link
2023-06-22	Corrector Operator to Enhance Accuracy and Reliability of Neural Operator Surrogates of Nonlinear Variational Boundary-Value Problems	Prashant K. Jha et.al.	2306.12047	null
2023-06-21	TauPETGen: Text-Conditional Tau PET Image Synthesis Based on Latent Diffusion Models	Se-In Jang et.al.	2306.11984	null
2023-06-20	Mercury’s chaotic secular evolution as a subdiffusive process	Dorian S. Abbot et.al.	2306.11870	null
2023-06-20	Exploring the Effectiveness of Dataset Synthesis: An application of Apple Detection in Orchards	Alexander van Meekeren et.al.	2306.11763	null
2023-06-20	Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning	Huiguo He et.al.	2306.11731	null
2023-06-20	Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision	Ayush Tewari et.al.	2306.11719	null
2023-06-20	Improving visual image reconstruction from human brain activity using latent diffusion models via multiple decoded inputs	Yu Takagi et.al.	2306.11536	link
2023-06-20	Align, Adapt and Inject: Sound-guided Unified Image Generation	Yue Yang et.al.	2306.11504	null
2023-06-20	EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model	Lianying Yin et.al.	2306.11496	null
2023-06-20	Hierarchical GNNs for Large Graph Generation	Alex O. Davies et.al.	2306.11412	null
2023-06-20	Masked Diffusion Models are Fast Learners	Jiachen Lei et.al.	2306.11363	link
2023-06-20	RS5M: A Large Scale Vision-Language Dataset for Remote Sensing Vision-Language Foundation Model	Zilun Zhang et.al.	2306.11300	link
2023-06-20	Eliminating Lipschitz Singularities in Diffusion Models	Zhantao Yang et.al.	2306.11251	null
2023-06-19	GD-VDM: Generated Depth for better Diffusion-based Video Generation	Ariel Lapid et.al.	2306.11173	link
2023-06-16	Group Orthogonalization Regularization For Vision Models Adaptation and Robustness	Yoav Kurtz et.al.	2306.10001	link
2023-06-16	Towards Better Certified Segmentation via Diffusion Models	Othmane Laousy et.al.	2306.09949	link
2023-06-16	Drag-guided diffusion models for vehicle image generation	Nikos Arechiga et.al.	2306.09935	null
2023-06-16	Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models	Geon Yeong Park et.al.	2306.09869	link
2023-06-16	AvatarBooth: High-Quality and Customizable 3D Human Avatar Generation	Yifei Zeng et.al.	2306.09864	null
2023-06-16	Understanding Deep Generative Models with Generalized Empirical Likelihoods	Suman Ravuri et.al.	2306.09780	link
2023-06-16	The Big Data Myth: Using Diffusion Models for Dataset Generation to Train Deep Detection Models	Roy Voetman et.al.	2306.09762	null
2023-06-16	CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models	Hao-Wen Dong et.al.	2306.09635	null
2023-06-15	Edit-DiffNeRF: Editing 3D Neural Radiance Fields using 2D Diffusion Model	Lu Yu et.al.	2306.09551	null
2023-06-15	Hierarchical Planning and Control for Box Loco-Manipulation	Zhaoming Xie et.al.	2306.09532	null
2023-06-15	R2-Diff: Denoising by diffusion as a refinement of retrieved motion for image-based motion prediction	Takeru Oba et.al.	2306.09483	null
2023-06-15	Generative Proxemics: A Prior for 3D Social Interaction from Images	Lea Müller et.al.	2306.09337	link
2023-06-19	ArtFusion: Controllable Arbitrary Style Transfer using Dual Conditional Latent Diffusion Models	Dar-Yen Chen et.al.	2306.09330	link
2023-06-15	Diffusion Models for Zero-Shot Open-Vocabulary Segmentation	Laurynas Karazija et.al.	2306.09316	null
2023-06-15	Fast Training of Diffusion Models with Masked Transformers	Hongkai Zheng et.al.	2306.09305	link
2023-06-15	A Score-based Nonlinear Filter for Data Assimilation	Feng Bao et.al.	2306.09282	null
2023-06-15	Conditional Human Sketch Synthesis with Explicit Abstraction Control	Dar-Yen Chen et.al.	2306.09274	null
2023-06-15	Towards Faster Non-Asymptotic Convergence for Diffusion-Based Generative Models	Gen Li et.al.	2306.09251	null
2023-06-15	Training Diffusion Classifiers with Denoising Assistance	Chandramouli Sastry et.al.	2306.09192	null
2023-06-15	DIFFender: Diffusion-Based Adversarial Defense against Patch Attacks in the Physical World	Caixin Kang et.al.	2306.09124	link
2023-06-15	Relation-Aware Diffusion Model for Controllable Poster Layout Generation	Fengheng Li et.al.	2306.09086	link
2023-06-15	Parameterizing Vertical Mixing Coefficients in the Ocean Surface Boundary Layer using Neural Networks	Aakash Sane et.al.	2306.09045	null
2023-06-15	Annotator Consensus Prediction for Medical Image Segmentation with Diffusion Models	Tomer Amit et.al.	2306.09004	link
2023-06-15	When Hyperspectral Image Classification Meets Diffusion Models: An Unsupervised Feature Learning Framework	Jingyi Zhou et.al.	2306.08964	link
2023-06-15	RecFusion: A Binomial Diffusion Process for 1D Data for Recommendation	Gabriel Bénédict et.al.	2306.08947	link
2023-06-15	Linguistic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment	Royi Rassin et.al.	2306.08877	link
2023-06-15	OMS-DPM: Optimizing the Model Schedule for Diffusion Probabilistic Models	Enshu Liu et.al.	2306.08860	link
2023-06-14	InfoDiffusion: Representation Learning Using Information Maximizing Diffusion Models	Yingheng Wang et.al.	2306.08757	null
2023-06-14	VidEdit: Zero-Shot and Spatially Aware Text-Driven Video Editing	Paul Couairon et.al.	2306.08707	null
2023-06-14	GHP-MOFassemble: Diffusion modeling, high throughput screening, and molecular dynamics for rational discovery of novel metal-organic frameworks for carbon capture at scale	Hyun Park et.al.	2306.08695	link
2023-06-14	Norm-guided latent space exploration for text-to-image generation	Dvir Samuel et.al.	2306.08687	link
2023-06-13	Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation	Shuai Yang et.al.	2306.07954	null
2023-06-13	Viewset Diffusion: (0-)Image-Conditioned 3D Generative Models from 2D Data	Stanislaw Szymanowicz et.al.	2306.07881	null
2023-06-13	StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models	Yinghao Aaron Li et.al.	2306.07691	link
2023-06-15	Hyperbolic Graph Diffusion Model for Molecule Generation	Lingfeng Wen et.al.	2306.07618	link
2023-06-13	Paste, Inpaint and Harmonize via Denoising: Subject-Driven Image Editing with Pre-Trained Diffusion Model	Xin Zhang et.al.	2306.07596	null
2023-06-13	User-defined Event Sampling and Uncertainty Quantification in Diffusion Models for Physical Dynamical Systems	Marc Finzi et.al.	2306.07526	null
2023-06-13	Multi-objective Molecular Optimization for Opioid Use Disorder Treatment Using Generative Network Complex	Hongsong Feng et.al.	2306.07484	null
2023-06-13	3D molecule generation by denoising voxel grids	Pedro O. Pinheiro et.al.	2306.07473	link
2023-06-12	Controlling Text-to-Image Diffusion by Orthogonal Finetuning	Zeju Qiu et.al.	2306.07280	null
2023-06-12	MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images	Junchen Zhu et.al.	2306.07257	null
2023-06-12	Diffusion Models for Black-Box Optimization	Siddarth Krishnamoorthy et.al.	2306.07180	link
2023-06-12	InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions	Jiale Xu et.al.	2306.07154	null
2023-06-12	Fast Diffusion Model	Zike Wu et.al.	2306.06991	link
2023-06-13	VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion Models	Sheng-Yen Chou et.al.	2306.06874	link
2023-06-12	HiddenSinger: High-Quality Singing Voice Synthesis via Neural Audio Codec and Latent Diffusion Models	Ji-Sang Hwang et.al.	2306.06814	null
2023-06-11	Stable Remaster: Bridging the Gap Between Old Content and New Displays	Nathan Paull et.al.	2306.06803	link
2023-06-10	How movement bias to attractive regions determines population spread and critical habitat size	Vivian Dornelas et.al.	2306.06450	link
2023-06-10	Language-Guided Traffic Simulation via Scene-Level Diffusion	Ziyuan Zhong et.al.	2306.06344	null
2023-06-09	Boosting GUI Prototyping with Diffusion Models	Jialiang Wei et.al.	2306.06233	null
2023-06-09	Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene Descriptions	Ian Huang et.al.	2306.06212	link
2023-06-09	Extraction and Recovery of Spatio-Temporal Structure in Latent Dynamics Alignment with Diffusion Model	Yule Wang et.al.	2306.06138	link
2023-06-09	Beyond Diffusion: A Generalized Mean-Field Theory of Turbulent Dust Transport in Protoplanetary Disks	Fabian Binkert et.al.	2306.06103	null
2023-06-09	Beyond Surface Statistics: Scene Representations in a Latent Diffusion Model	Yida Chen et.al.	2306.05720	link
2023-06-12	Boosting Fast and High-Quality Speech Synthesis with Linear Diffusion	Haogeng Liu et.al.	2306.05708	null
2023-06-09	RePaint-NeRF: NeRF Editting via Semantic Masks and Diffusion Models	Xingchen Zhou et.al.	2306.05668	null
2023-06-08	BOOT: Data-free Distillation of Denoising Diffusion Models with Bootstrapping	Jiatao Gu et.al.	2306.05544	null
2023-06-08	Grounded Text-to-Image Synthesis with Attention Refocusing	Quynh Phung et.al.	2306.05427	null
2023-06-08	Stochastic Multi-Person 3D Motion Forecasting	Sirui Xu et.al.	2306.05421	link
2023-06-08	PriSampler: Mitigating Property Inference of Diffusion Models	Hailong Hu et.al.	2306.05208	null
2023-06-08	A cognitive process approach to modeling gap acceptance in overtaking	Samir H. A. Mohammad et.al.	2306.05203	null
2023-06-08	SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions	Yuseung Lee et.al.	2306.05178	null
2023-06-08	Non-autoregressive Conditional Diffusion Models for Time Series Prediction	Lifeng Shen et.al.	2306.05043	null
2023-06-08	Multi-Architecture Multi-Expert Diffusion Models	Yunsung Lee et.al.	2306.04990	null
2023-06-08	Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning	Jifeng Hu et.al.	2306.04875	null
2023-06-09	Complexity-aware Large Scale Origin-Destination Network Generation via Diffusion Model	Can Rong et.al.	2306.04873	null
2023-06-08	Ground states for aggregation-diffusion models on Cartan-Hadamard manifolds	Razvan C. Fetecau et.al.	2306.04856	null
2023-06-08	Interpreting and Improving Diffusion Models Using the Euclidean Distance Function	Frank Permenter et.al.	2306.04848	link
2023-06-07	WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models	Changhoon Kim et.al.	2306.04744	link
2023-06-07	ConceptBed: Evaluating Concept Learning Abilities of Text-to-Image Diffusion Models	Maitreya Patel et.al.	2306.04695	link
2023-06-07	Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models	George Stein et.al.	2306.04675	link
2023-06-07	Designing a Better Asymmetric VQGAN for StableDiffusion	Zixin Zhu et.al.	2306.04632	link
2023-06-07	ARTIC3D: Learning Robust Articulated 3D Shapes from Noisy Web Image Collections	Chun-Han Yao et.al.	2306.04619	null
2023-06-09	Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt	Kai Chen et.al.	2306.04607	null
2023-06-07	On the Design Fundamentals of Diffusion Models: A Survey	Ziyi Chang et.al.	2306.04542	null
2023-06-07	Multi-modal Latent Diffusion	Mustapha Bounoua et.al.	2306.04445	link
2023-06-07	Improving Diffusion-based Image Translation using Asymmetric Gradient Guidance	Gihyun Kwon et.al.	2306.04396	link
2023-06-07	Generative Semantic Communication: Diffusion Models Beyond Bit Recovery	Eleonora Grassucci et.al.	2306.04321	link
2023-06-07	A Survey on Generative Diffusion Models for Structured Data	Heejoon Koo et.al.	2306.04139	null
2023-06-07	Phoenix: A Federated Generative Diffusion Model	Fiona Victoria Stanley Jothiraj et.al.	2306.04098	null
2023-06-07	Professional Basketball Player Behavior Synthesis via Planning with Diffusion	Xiusi Chen et.al.	2306.04090	link
2023-06-06	A machine learning potential-based generative algorithm for on-lattice crystal structure prediction	Vadim Sotskov et.al.	2306.03989	null
2023-06-06	High-dimensional and Permutation Invariant Anomaly Detection	Vinicius Mikuni et.al.	2306.03933	link
2023-06-06	Emergent Correspondence from Image Diffusion	Luming Tang et.al.	2306.03881	link
2023-06-06	Conditional Diffusion Models for Weakly Supervised Medical Image Segmentation	Xinrong Hu et.al.	2306.03878	link
2023-06-06	Towards Visual Foundational Models of Physical Scenes	Chethan Parameshwara et.al.	2306.03727	null
2023-06-06	Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias	Ziyue Jiang et.al.	2306.03509	null
2023-06-08	DFormer: Diffusion-guided Transformer for Universal Image Segmentation	Hefeng Wang et.al.	2306.03437	link
2023-06-06	Protecting the Intellectual Property of Diffusion Models by the Watermark Diffusion Process	Sen Peng et.al.	2306.03436	link
2023-06-06	Change Diffusion: Change Detection Map Generation Based on Difference-Feature Guided DDPM	Yihan Wen et.al.	2306.03424	link
2023-06-08	DreamSparse: Escaping from Plato’s Cave with 2D Diffusion Model Given Sparse Views	Paul Yoo et.al.	2306.03414	null
2023-06-05	Brain Diffusion for Visual Exploration: Cortical Discovery using Large Scale Generative Models	Andrew F. Luo et.al.	2306.03089	null
2023-06-05	HeadSculpt: Crafting 3D Head Avatars with Text	Xiao Han et.al.	2306.03038	null
2023-06-05	Brain tumor segmentation using synthetic MR images – A comparison of GANs and diffusion models	Muhammad Usman Akbar et.al.	2306.02986	link
2023-06-05	Complex Preferences for Different Convergent Priors in Discrete Graph Diffusion	Alex M. Tseng et.al.	2306.02957	null
2023-06-05	INDigo: An INN-Guided Probabilistic Diffusion Algorithm for Inverse Problems	Di You et.al.	2306.02949	null
2023-06-05	Instruct-Video2Avatar: Video-to-Avatar Generation with Instructions	Shaoxu Li et.al.	2306.02903	link
2023-06-06	Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark	Shuyu Yang et.al.	2306.02898	link
2023-06-05	User-friendly Image Editing with Minimal Text Input: Leveraging Captioning and Injection Techniques	Sunwoo Kim et.al.	2306.02717	null
2023-06-05	Faster Training of Diffusion Models and Improved Density Estimation via Parallel Score Matching	Etrit Haxholli et.al.	2306.02658	null
2023-06-05	Physics-Informed Kernel Function Neural Networks for Solving Partial Differential Equations	Zhuojia Fu et.al.	2306.02606	null
2023-06-05	Video Diffusion Models with Local-Global Context Guidance	Siyuan Yang et.al.	2306.02562	link
2023-06-05	PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model	Yizhe Zhang et.al.	2306.02531	link
2023-06-04	Spear or Shield: Leveraging Generative AI to Tackle Security Threats of Intelligent Network Services	Hongyang Du et.al.	2306.02384	null
2023-06-04	Temporal Dynamic Quantization for Diffusion Models	Junhyuk So et.al.	2306.02316	null
2023-06-04	Detector Guidance for Multi-Object Text-to-Image Generation	Luping Liu et.al.	2306.02236	link
2023-06-03	Training Data Attribution for Diffusion Models	Zheng Dai et.al.	2306.02174	link
2023-06-03	Efficient Text-Guided 3D-Aware Portrait Generation with Score Distillation Sampling on Distribution	Yiji Cheng et.al.	2306.02083	null
2023-06-03	Exploring the Optimal Choice for Generative Processes in Diffusion Models: Ordinary vs Stochastic Differential Equations	Yu Cao et.al.	2306.02063	null
2023-06-03	DYffusion: A Dynamics-informed Diffusion Model for Spatiotemporal Forecasting	Salva Rühling Cachay et.al.	2306.01984	link
2023-06-02	Generative Autoencoders as Watermark Attackers: Analyses of Vulnerabilities and Threats	Xuandong Zhao et.al.	2306.01953	link
2023-06-02	Video Colorization with Pre-trained Text-to-Image Diffusion Models	Hanyuan Liu et.al.	2306.01732	null
2023-06-02	Denoising Diffusion Semantic Segmentation with Mask Prior Modeling	Zeqiang Lai et.al.	2306.01721	link
2023-06-02	DiffusEmp: A Diffusion Model-Based Framework with Multi-Grained Control for Empathetic Response Generation	Guanqun Bi et.al.	2306.01657	null
2023-06-02	PolyDiffuse: Polygonal Shape Reconstruction via Guided Set Diffusion Models	Jiacheng Chen et.al.	2306.01461	link
2023-06-02	Zero-Shot Blind Audio Bandwidth Extension	Eloi Moliner et.al.	2306.01433	link
2023-06-02	Audio-Visual Speech Enhancement with Score-Based Generative Models	Julius Richter et.al.	2306.01432	null
2023-06-02	Quantifying Sample Anonymity in Score-Based Generative Models with Adversarial Fingerprinting	Mischa Dombrowski et.al.	2306.01363	null
2023-06-02	Privacy Distillation: Reducing Re-identification Risk of Multimodal Diffusion Models	Virginia Fernandez et.al.	2306.01322	null
2023-06-02	Diffusion Self-Guidance for Controllable Image Generation	Dave Epstein et.al.	2306.00986	null
2023-06-01	SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds	Yanyu Li et.al.	2306.00980	link
2023-06-01	Intriguing Properties of Text-guided Diffusion Models	Qihao Liu et.al.	2306.00974	link
2023-06-01	Intelligent Grimm – Open-ended Visual Storytelling via Latent Diffusion Models	Chang Liu et.al.	2306.00973	link
2023-06-01	ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation	Shaozhe Hao et.al.	2306.00971	link
2023-06-01	The Hidden Language of Diffusion Models	Hila Chefer et.al.	2306.00966	link
2023-06-01	Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image Generation	Minghui Hu et.al.	2306.00964	null
2023-06-01	Differential Diffusion: Giving Each Pixel Its Strength	Eran Levin et.al.	2306.00950	link
2023-06-01	Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance	Jinbo Xing et.al.	2306.00943	null
2023-06-01	Inserting Anybody in Diffusion Models via Celeb Basis	Ge Yuan et.al.	2306.00926	link
2023-06-01	Conditioning Diffusion Models via Attributes and Semantic Masks for Face Generation	Nico Giambi et.al.	2306.00914	null
2023-06-01	Robust Backdoor Attack with Visible, Semantic, Sample-Specific, and Compatible Triggers	Ruotong Wang et.al.	2306.00816	null
2023-06-01	UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning	Xiao Dong et.al.	2306.00813	null
2023-06-01	FDNeRF: Semantics-Driven Face Reconstruction, Prompt Editing and Relighting with Diffusion Models	Hao Zhang et.al.	2306.00783	link
2023-06-01	UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model	Anastasiia Iashchenko et.al.	2306.00721	link
2023-06-01	EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis	Haobin Tang et.al.	2306.00648	null
2023-06-01	AvatarStudio: Text-driven Editing of 3D Dynamic Human Head Avatars	Mohit Mendiratta et.al.	2306.00547	null
2023-06-01	Image generation with shortest path diffusion	Ayan Das et.al.	2306.00501	link
2023-06-01	Random advection-diffusion models and their statistics	Stefano Lepri et.al.	2306.00463	null
2023-06-01	Controllable Motion Diffusion Model	Yi Shi et.al.	2306.00416	link

semantic segmentation

Publish Date	Title	Authors	PDF	Code
2025-07-21	ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction	Danhui Chen et.al.	2507.15803	null
2025-07-21	Rethinking Occlusion in FER: A Semantic-Aware Perspective and Go Beyond	Huiyu Zhai et.al.	2507.15401	null
2025-07-20	A Novel Downsampling Strategy Based on Information Complementarity for Medical Image Segmentation	Wenbo Yue et.al.	2507.14790	null
2025-07-19	GTPBD: A Fine-Grained Global Terraced Parcel and Boundary Dataset	Zhiwei Zhang et.al.	2507.14697	null
2025-07-19	Artificial Intelligence in the Food Industry: Food Waste Estimation based on Computer Vision, a Brief Case Study in a University Dining Hall	Shayan Rokhva et.al.	2507.14662	null
2025-07-19	Multispectral State-Space Feature Fusion: Bridging Shared and Cross-Parametric Interactions for Object Detection	Jifeng Shen et.al.	2507.14643	null
2025-07-19	DiSCO-3D: Discovering and segmenting Sub-Concepts from Open-vocabulary queries in NeRF	Doriand Petit et.al.	2507.14596	null
2025-07-18	Semantic Segmentation based Scene Understanding in Autonomous Vehicles	Ehsan Rassekh et.al.	2507.14303	null
2025-07-17	A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique	Homare Sueyoshi et.al.	2507.12730	null
2025-07-16	NLI4VolVis: Natural Language Interaction for Volume Visualization via LLM Multi-Agents and Editable 3D Gaussian Splatting	Kuangshi Ai et.al.	2507.12621	null
2025-07-16	Out-of-distribution data supervision towards biomedical semantic segmentation	Yiquan Gao et.al.	2507.12105	null
2025-07-16	Frequency-Dynamic Attention Modulation for Dense Prediction	Linwei Chen et.al.	2507.12006	null
2025-07-16	SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation	Jun Yin et.al.	2507.11994	null
2025-07-16	Prototypical Progressive Alignment and Reweighting for Generalizable Semantic Segmentation	Yuhang Zhang et.al.	2507.11955	null
2025-07-16	Spatial Frequency Modulation for Semantic Segmentation	Linwei Chen et.al.	2507.11893	null
2025-07-15	SToFM: a Multi-scale Foundation Model for Spatial Transcriptomics	Suyuan Zhao et.al.	2507.11588	null
2025-07-15	Personalized OVSS: Understanding Personal Concept in Open-Vocabulary Semantic Segmentation	Sunghyun Park et.al.	2507.11030	null
2025-07-15	Graph Aggregation Prototype Learning for Semantic Change Detection in Remote Sensing	Zhengyi Xu et.al.	2507.10938	null
2025-07-14	Static or Temporal? Semantic Scene Simplification to Aid Wayfinding in Immersive Simulations of Bionic Vision	Justin M. Kasowski et.al.	2507.10813	null
2025-07-14	FGSSNet: Feature-Guided Semantic Segmentation of Real World Floorplans	Hugo Norrby et.al.	2507.10343	null
2025-07-14	Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks	Ben Hamscher et.al.	2507.10239	null
2025-07-14	Spatial Lifting for Dense Prediction	Mingzhi Xu et.al.	2507.10222	null
2025-07-14	DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation	Ivan Martinović et.al.	2507.10118	null
2025-07-11	Multimodal HD Mapping for Intersections by Intelligent Roadside Units	Zhongzhang Chen et.al.	2507.08903	null
2025-07-11	Image Translation with Kernel Prediction Networks for Semantic Segmentation	Cristina Mata et.al.	2507.08554	null
2025-07-11	From Enhancement to Understanding: Build a Generalized Bridge for Low-light Vision via Semantically Consistent Unsupervised Fine-tuning	Sen Wang et.al.	2507.08380	null
2025-07-08	3D forest semantic segmentation using multispectral LiDAR and 3D deep learning	Narges Takhtkeshha et.al.	2507.08025	null
2025-07-10	Diffusion-Guided Knowledge Distillation for Weakly-Supervised Low-Light Semantic Segmentation	Chunyan Wang et.al.	2507.07578	null
2025-07-08	CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings	Cristina Mata et.al.	2507.07125	null
2025-07-09	Know Your Attention Maps: Class-specific Token Masking for Weakly Supervised Semantic Segmentation	Joelle Hanna et.al.	2507.06848	null
2025-07-09	Ambiguity-aware Point Cloud Segmentation by Adaptive Margin Contrastive Learning	Yang Chen et.al.	2507.06592	null
2025-07-08	Centralized Copy-Paste: Enhanced Data Augmentation Strategy for Wildland Fire Semantic Segmentation	Joon Tai Kim et.al.	2507.06321	null
2025-07-08	FineGrasp: Towards Robust Grasping for Delicate Objects	Yun Du et.al.	2507.05978	null
2025-07-08	I$^2$R: Inter and Intra-image Refinement in Few Shot Segmentation	Ourui Fu et.al.	2507.05838	null
2025-07-09	Empowering Bridge Digital Twins by Bridging the Data Gap with a Unified Synthesis Framework	Wang Wang et.al.	2507.05814	null
2025-07-07	Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations	Xiang Xu et.al.	2507.05260	null
2025-07-07	MOSU: Autonomous Long-range Robot Navigation with Multi-modal Scene Understanding	Jing Liang et.al.	2507.04686	null
2025-07-06	Street design and driving behavior: evidence from a large-scale study in Milan, Amsterdam, and Dubai	Giacomo Orsi et.al.	2507.04434	null
2025-07-06	CLIP-RL: Surgical Scene Segmentation Using Contrastive Language-Vision Pretraining & Reinforcement Learning	Fatmaelzahraa Ali Ahmed et.al.	2507.04317	null
2025-07-06	Surg-SegFormer: A Dual Transformer-Based Model for Holistic Surgical Scene Segmentation	Fatimaelzahraa Ahmed et.al.	2507.04304	null
2025-07-05	Differentiable High-Performance Ray Tracing-Based Simulation of Radio Propagation with Point Clouds	Niklas Vaara et.al.	2507.04021	null
2025-07-05	NRSeg: Noise-Resilient Learning for BEV Semantic Segmentation via Driving World Models	Siyu Li et.al.	2507.04002	null
2025-07-05	CoT-Segmenter: Enhancing OOD Detection in Dense Road Scenes via Chain-of-Thought Reasoning	Jeonghyo Song et.al.	2507.03984	null
2025-07-04	Efficient Event-Based Semantic Segmentation via Exploiting Frame-Event Fusion: A Hybrid Neural Network Approach	Hebei Li et.al.	2507.03765	null
2025-07-04	Leveraging Out-of-Distribution Unlabeled Images: Semi-Supervised Semantic Segmentation with an Open-Vocabulary Model	Wooseok Shin et.al.	2507.03302	null
2025-07-03	LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion	Fangfu Liu et.al.	2507.02813	null
2025-07-03	From Pixels to Damage Severity: Estimating Earthquake Impacts Using Semantic Segmentation of Social Media Images	Danrong Zhang et.al.	2507.02781	null
2025-07-03	MedFormer: Hierarchical Medical Vision Transformer with Content-Aware Dual Sparse Selection Attention	Zunhui Xia et.al.	2507.02488	null
2025-07-08	Continual Multiple Instance Learning with Enhanced Localization for Histopathological Whole Slide Image Analysis	Byung Hyun Lee et.al.	2507.02395	null
2025-07-02	How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks	Rahul Ramachandran et.al.	2507.01955	null
2025-07-02	A Gift from the Integration of Discriminative and Diffusion-based Generative Learning: Boundary Refinement Remote Sensing Semantic Segmentation	Hao Wang et.al.	2507.01573	null
2025-07-01	Rectifying Magnitude Neglect in Linear Attention	Qihang Fan et.al.	2507.00698	null
2025-07-02	ExPaMoE: An Expandable Parallel Mixture of Experts for Continual Test-Time Adaptation	JianChao Zhao et.al.	2507.00502	null
2025-07-01	Process-aware and high-fidelity microstructure generation using stable diffusion	Hoang Cuong Phan et.al.	2507.00459	null
2025-07-01	PlantSegNeRF: A few-shot, cross-dataset method for plant 3D instance point cloud reconstruction via joint-channel NeRF with multi-view image instance matching	Xin Yang et.al.	2507.00371	null
2025-06-30	Diffusion-Based Image Augmentation for Semantic Segmentation in Outdoor Robotics	Peter Mortimer et.al.	2507.00153	null
2025-06-30	Controllable Reference-Based Real-World Remote Sensing Image Super-Resolution with Generative Diffusion Priors	Ce Wang et.al.	2506.23801	null
2025-06-30	Deep Learning-Based Semantic Segmentation for Real-Time Kidney Imaging and Measurements with Augmented Reality-Assisted Ultrasound	Gijs Luijten et.al.	2506.23721	null
2025-06-30	PGOV3D: Open-Vocabulary 3D Semantic Segmentation with Partial-to-Global Curriculum	Shiqi Zhang et.al.	2506.23607	null
2025-06-30	Interactive Interface For Semantic Segmentation Dataset Synthesis	Ngoc-Do Tran et.al.	2506.23470	null
2025-06-30	Contrastive Learning with Diffusion Features for Weakly Supervised Medical Image Segmentation	Dewen Zeng et.al.	2506.23460	null
2025-06-29	Layer Decomposition and Morphological Reconstruction for Task-Oriented Infrared Image Enhancement	Siyuan Chai et.al.	2506.23353	null
2025-06-29	FastSeg: Efficient Training-Free Open-Vocabulary Segmentation via Hierarchical Attention Refinement Method	Quang-Huy Che et.al.	2506.23323	null
2025-06-29	BPD-Neo: An MRI Dataset for Lung-Trachea Segmentation with Clinical Data for Neonatal Bronchopulmonary Dysplasia	Rachit Saluja et.al.	2506.23305	null
2025-06-29	High-quality Pseudo-labeling for Point Cloud Segmentation with Scene-level Annotation	Lunhao Duan et.al.	2506.23227	null
2025-06-28	Probabilistic Prototype Calibration of Vision-Language Models for Generalized Few-shot Semantic Segmentation	Jie Liu et.al.	2506.22979	null
2025-06-28	Region-Aware CAM: High-Resolution Weakly-Supervised Defect Segmentation via Salient Region Perception	Hang-Cheng Dong et.al.	2506.22866	null
2025-06-28	Unleashing the Multi-View Fusion Potential: Noise Correction in VLM for Open-Vocabulary 3D Scene Understanding	Xingyilang Yin et.al.	2506.22817	null
2025-06-27	Dual Atrous Separable Convolution for Improving Agricultural Semantic Segmentation	Chee Mei Ling et.al.	2506.22570	null
2025-06-27	Partial CLIP is Enough: Chimera-Seg for Zero-shot Semantic Segmentation	Jialei Chen et.al.	2506.22032	null
2025-06-27	TASeg: Text-aware RGB-T Semantic Segmentation based on Fine-tuning Vision Foundation Models	Meng Yu et.al.	2506.21975	null
2025-06-27	SDRNET: Stacked Deep Residual Network for Accurate Semantic Segmentation of Fine-Resolution Remotely Sensed Images	Naftaly Wambugu et.al.	2506.21945	null
2025-06-26	Towards Reliable Detection of Empty Space: Conditional Marked Point Processes for Object Detection	Tobias J. Riedlinger et.al.	2506.21486	null
2025-06-27	ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation	Xiwei Xuan et.al.	2506.21233	null
2025-06-26	Performance improvement of spatial semantic segmentation with enriched audio features and agent-based error correction for DCASE 2025 Challenge Task 4	Jongyeon Park et.al.	2506.21174	null
2025-06-27	DidSee: Diffusion-Based Depth Completion for Material-Agnostic Robotic Perception and Manipulation	Wenzhou Lyu et.al.	2506.21034	null
2025-06-26	TSDASeg: A Two-Stage Model with Direct Alignment for Interactive Point Cloud Segmentation	Chade Li et.al.	2506.20991	null
2025-06-26	Segment Anything in Pathology Images with Natural Language	Zhixuan Chen et.al.	2506.20988	null
2025-06-25	U-R-VEDA: Integrating UNET, Residual Links, Edge and Dual Attention, and Vision Transformer for Accurate Semantic Segmentation of CMRs	Racheal Mukisa et.al.	2506.20689	null
2025-06-25	Building Lightweight Semantic Segmentation Models for Aerial Images Using Dual Relation Distillation	Minglong Li et.al.	2506.20688	null
2025-06-25	A Deep Learning Approach to Identify Rock Bolts in Complex 3D Point Clouds of Underground Mines Captured Using Mobile Laser Scanners	Dibyayan Patra et.al.	2506.20464	null
2025-06-26	Towards Scalable and Generalizable Earth Observation Data Mining via Foundation Model Composition	Man Duc Chuc et.al.	2506.20174	null
2025-06-24	A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects	Shulan Ruan et.al.	2506.19769	null
2025-06-24	A Global-Local Cross-Attention Network for Ultra-high Resolution Remote Sensing Image Semantic Segmentation	Chen Yi et.al.	2506.19406	null
2025-06-25	AnchorDP3: 3D Affordance Guided Sparse Diffusion Policy for Robotic Manipulation	Ziyan Zhao et.al.	2506.19269	null
2025-06-23	Orthogonal Projection Subspace to Aggregate Online Prior-knowledge for Continual Test-time Adaptation	Jinlong Li et.al.	2506.19022	null
2025-06-23	Multi-Scale Spectral Attention Module-based Hyperspectral Segmentation in Autonomous Driving Scenarios	Imad Ali Shah et.al.	2506.18682	null
2025-06-22	OSDMamba: Enhancing Oil Spill Detection from Remote Sensing Images Using Selective State Space Model	Shuaiyu Chen et.al.	2506.18006	null
2025-06-22	Cross-modal State Space Modeling for Real-time RGB-thermal Wild Scene Semantic Segmentation	Xiaodong Guo et.al.	2506.17869	null
2025-06-20	ForestFormer3D: A Unified Framework for End-to-End Segmentation of Forest LiDAR 3D Point Clouds	Binbin Xiang et.al.	2506.16991	null
2025-06-19	From Semantic To Instance: A Semi-Self-Supervised Learning Approach	Keyhan Najafian et.al.	2506.16563	null
2025-06-19	Structured Semantic 3D Reconstruction (S23DR) Challenge 2025 – Winning solution	Jan Skvrna et.al.	2506.16421	null
2025-06-19	LBMamba: Locally Bi-directional Mamba	Jingwei Zhang et.al.	2506.15976	null
2025-06-19	Heterogeneous-Modal Unsupervised Domain Adaptation via Latent Space Bridging	Jiawen Yang et.al.	2506.15971	null
2025-06-19	Polyline Path Masked Attention for Vision Transformer	Zhongchen Zhao et.al.	2506.15940	link
2025-06-18	MapFM: Foundation Model-Driven HD Mapping with Multi-Task Contextual Learning	Leonid Ivanov et.al.	2506.15313	link
2025-06-18	Enhancing point cloud analysis via neighbor aggregation correction based on cross-stage structure correlation	Jiaqi Shi et.al.	2506.15160	link
2025-06-17	Scaling-Up the Pretraining of the Earth Observation Foundation Model PhilEO to the MajorTOM Dataset	Nikolaos Dionelis et.al.	2506.14765	link
2025-06-17	VisLanding: Monocular 3D Perception for UAV Safe Landing via Depth-Normal Synergy	Zhuoyue Tan et.al.	2506.14525	null
2025-06-17	DepthSeg: Depth prompting in remote sensing semantic segmentation	Ning Zhou et.al.	2506.14382	null
2025-06-16	HierVL: Semi-Supervised Segmentation leveraging Hierarchical Vision-Language Synergy with Dynamic Text-Spatial Query Alignment	Numair Nadeem et.al.	2506.13925	null
2025-06-16	A Comprehensive Survey on Video Scene Parsing: Advances, Challenges, and Prospects	Guohuan Xie et.al.	2506.13552	null
2025-06-16	Open-Set LiDAR Panoptic Segmentation Guided by Uncertainty-Aware Learning	Rohit Mohan et.al.	2506.13265	null
2025-06-16	ViewPCL: a point cloud based active learning method for multi-view segmentation	Christian Hilaire et.al.	2506.13043	null
2025-06-15	A large-scale, physically-based synthetic dataset for satellite pose estimation	Szabolcs Velkei et.al.	2506.12782	null
2025-06-15	Unleashing Diffusion and State Space Models for Medical Image Segmentation	Rong Wu et.al.	2506.12747	null
2025-06-15	Combining Self-attention and Dilation Convolutional for Semantic Segmentation of Coal Maceral Groups	Zhenghao Xi et.al.	2506.12712	null
2025-06-13	A$^2$LC: Active and Automated Label Correction for Semantic Segmentation	Youjin Jeon et.al.	2506.11599	null
2025-06-12	GynSurg: A Comprehensive Gynecology Laparoscopic Surgery Dataset	Sahar Nasirihaghighi et.al.	2506.11356	null
2025-06-11	FARCLUSS: Fuzzy Adaptive Rebalancing and Contrastive Uncertainty Learning for Semi-Supervised Semantic Segmentation	Ebenezer Tarubinga et.al.	2506.11142	link
2025-06-12	Description and Discussion on DCASE 2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes	Masahiro Yasuda et.al.	2506.10676	link
2025-06-12	Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models	Francisco Caetano et.al.	2506.10634	null
2025-06-12	Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration	Jun Wang et.al.	2506.10573	null
2025-06-12	Semantic Localization Guiding Segment Anything Model For Reference Remote Sensing Image Segmentation	Shuyang Li et.al.	2506.10503	null
2025-06-12	Demonstrating Multi-Suction Item Picking at Scale via Multi-Modal Learning of Pick Success	Che Wang et.al.	2506.10359	null
2025-06-11	Deep Semantic Segmentation for Multi-Source Localization Using Angle of Arrival Measurements	Mustafa Atahan Nuhoglu et.al.	2506.10107	null
2025-06-11	Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation	Siyu Chen et.al.	2506.09881	link
2025-06-11	The Four Color Theorem for Cell Instance Segmentation	Ye Zhang et.al.	2506.09724	link
2025-06-11	Enhancing Human-Robot Collaboration: A Sim2Real Domain Adaptation Algorithm for Point Cloud Segmentation in Industrial Environments	Fatemeh Mohammadi Amin et.al.	2506.09552	null
2025-06-12	Urban1960SatSeg: Unsupervised Semantic Segmentation of Mid-20$^{th}$ century Urban Landscapes with Satellite Imageries	Tianxiang Hao et.al.	2506.09476	link
2025-06-11	MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning	Tong Wang et.al.	2506.09327	null
2025-06-10	WetCat: Automating Skill Assessment in Wetlab Cataract Surgery Videos	Negin Ghamsarian et.al.	2506.08896	null
2025-06-11	RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation	Jiayi Song et.al.	2506.08772	link
2025-06-10	ATAS: Any-to-Any Self-Distillation for Enhanced Open-Vocabulary Dense Prediction	Juan Yeo et.al.	2506.08678	null
2025-06-10	ECMNet: Lightweight Semantic Segmentation with Efficient CNN-Mamba Network	Feixiang Du et.al.	2506.08629	null
2025-06-10	DCD: A Semantic Segmentation Model for Fetal Ultrasound Four-Chamber View	Donglian Li et.al.	2506.08534	null
2025-06-11	IGraSS: Learning to Identify Infrastructure Networks from Satellite Imagery by Iterative Graph-constrained Semantic Segmentation	Oishee Bintey Hoque et.al.	2506.08137	null
2025-06-09	LogoSP: Local-global Grouping of Superpoints for Unsupervised Semantic Segmentation of 3D Point Clouds	Zihui Zhang et.al.	2506.07857	link
2025-06-09	F2Net: A Frequency-Fused Network for Ultra-High Resolution Remote Sensing Segmentation	Hengzhi Chen et.al.	2506.07847	null
2025-06-09	Trend-Aware Fashion Recommendation with Visual Segmentation and Semantic Similarity	Mohamed Djilani et.al.	2506.07773	link
2025-06-09	Adapter Naturally Serves as Decoupler for Cross-Domain Few-Shot Semantic Segmentation	Jintao Tong et.al.	2506.07376	null
2025-06-09	Multiple Object Stitching for Unsupervised Representation Learning	Chengchao Shen et.al.	2506.07364	link
2025-06-08	BRIGHT+: Upgrading the BRIGHT Benchmark with MARCUS, a Multi-Agent RAG Clean-Up Suite	Liyang Chen et.al.	2506.07116	null
2025-06-08	Technical Report for ICRA 2025 GOOSE 3D Semantic Segmentation Challenge: Adaptive Point Cloud Understanding for Heterogeneous Robotic Systems	Xiaoya Zhang et.al.	2506.06995	null
2025-06-07	Position Prediction Self-Supervised Learning for Multimodal Satellite Imagery Semantic Segmentation	John Waithaka et.al.	2506.06852	null
2025-06-07	EndoARSS: Adapting Spatially-Aware Foundation Model for Efficient Activity Recognition and Semantic Segmentation in Endoscopic Surgery	Guankun Wang et.al.	2506.06830	null
2025-06-06	GS4: Generalizable Sparse Splatting Semantic SLAM	Mingqi Jiang et.al.	2506.06517	null
2025-06-06	NeurNCD: Novel Class Discovery via Implicit Neural Representation	Junming Wang et.al.	2506.06412	null
2025-06-06	Rethinking Semi-supervised Segmentation Beyond Accuracy: Reliability and Robustness	Steven Landgraf et.al.	2506.05917	null
2025-06-05	FRAME: Pre-Training Video Feature Representations via Anticipation and Memory	Sethuraman TV et.al.	2506.05543	null
2025-06-05	U-NetMN and SegNetMN: Modified U-Net and SegNet models for bimodal SAR image segmentation	Marwane Kzadri et.al.	2506.05444	null
2025-06-05	Point Cloud Segmentation of Agricultural Vehicles using 3D Gaussian Splatting	Alfred T. Christiansen et.al.	2506.05009	null
2025-06-04	You Only Train Once	Christos Sakaridis et.al.	2506.04349	null
2025-06-04	AetherVision-Bench: An Open-Vocabulary RGB-Infrared Benchmark for Multi-Angle Segmentation across Aerial and Ground Perspectives	Aniruddh Sikdar et.al.	2506.03709	null
2025-06-04	OV-COAST: Cost Aggregation with Optimal Transport for Open-Vocabulary Semantic Segmentation	Aditya Gandhamal et.al.	2506.03706	null
2025-06-04	BiXFormer: A Robust Framework for Maximizing Modality Effectiveness in Multi-Modal Semantic Segmentation	Jialei Chen et.al.	2506.03675	null
2025-06-03	Cross-Modal Urban Sensing: Evaluating Sound-Vision Alignment Across Street-Level and Aerial Imagery	Pengyu Chen et.al.	2506.03388	null
2025-06-03	Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding	Weiqing Xiao et.al.	2506.03134	link
2025-06-03	GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal	Shufan Qing et.al.	2506.02736	link
2025-06-03	Towards Explicit Geometry-Reflectance Collaboration for Generalized LiDAR Segmentation in Adverse Weather	Longyu Yang et.al.	2506.02396	null
2025-06-04	SAB3R: Semantic-Augmented Backbone in 3D Reconstruction	Xuweiyi Chen et.al.	2506.02112	null
2025-06-02	SEMNAV: A Semantic Segmentation-Driven Approach to Visual Semantic Navigation	Rafael Flor-Rodríguez et.al.	2506.01418	link
2025-06-01	Perceptual Inductive Bias Is What You Need Before Contrastive Learning	Tianqin Li et.al.	2506.01201	null
2025-06-01	GeoChain: Multimodal Chain-of-Thought for Geographic Reasoning	Sahiti Yerramilli et.al.	2506.00785	null
2025-05-31	BAGNet: A Boundary-Aware Graph Attention Network for 3D Point Cloud Semantic Segmentation	Wei Tao et.al.	2506.00475	null
2025-05-30	Bi-Manual Joint Camera Calibration and Scene Representation	Haozhan Tang et.al.	2505.24819	null
2025-06-02	NUC-Net: Non-uniform Cylindrical Partition Network for Efficient LiDAR Semantic Segmentation	Xuzhi Wang et.al.	2505.24634	link
2025-05-30	Revisiting Cross-Modal Knowledge Distillation: A Disentanglement Approach for RGBD Semantic Segmentation	Roger Ferrod et.al.	2505.24361	link
2025-05-30	Weakly-Supervised Affordance Grounding Guided by Part-Level Semantic Priors	Peiran Xu et.al.	2505.24103	link
2025-05-29	MaskAdapt: Unsupervised Geometry-Aware Domain Adaptation Using Multimodal Contextual Learning and RGB-Depth Masking	Numair Nadeem et.al.	2505.24026	null
2025-05-29	Semantics-Guided Generative Image Compression	Cheng-Lin Wu et.al.	2505.24015	link
2025-05-29	Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts	Xuweiyi Chen et.al.	2505.23926	null
2025-05-29	TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models	Yao Xiao et.al.	2505.23769	link
2025-05-29	Bridging Classical and Modern Computer Vision: PerceptiveNet for Tree Crown Semantic Segmentation	Georgios Voulgaris et.al.	2505.23597	null
2025-05-29	VITON-DRR: Details Retention Virtual Try-on via Non-rigid Registration	Ben Li et.al.	2505.23439	link
2025-05-29	Adaptive Spatial Augmentation for Semi-supervised Semantic Segmentation	Lingyan Ran et.al.	2505.23438	null
2025-05-29	Federated Unsupervised Semantic Segmentation	Evangelos Charalampakis et.al.	2505.23292	null
2025-05-29	LeMoRe: Learn More Details for Lightweight Semantic Segmentation	Mian Muhammad Naeem Abid et.al.	2505.23093	link
2025-05-28	ConfLUNet: Multiple sclerosis lesion instance segmentation in presence of confluent lesions	Maxence Wynen et.al.	2505.22537	null
2025-05-28	Universal Domain Adaptation for Semantic Segmentation	Seun-An Choe et.al.	2505.22458	null
2025-05-28	LiDAR Based Semantic Perception for Forklifts in Outdoor Environments	Benjamin Serfling et.al.	2505.22258	null
2025-05-29	YH-MINER: Multimodal Intelligent System for Natural Ecological Reef Metric Extraction	Mingzhuang Wang et.al.	2505.22250	null
2025-05-28	Enjoying Information Dividend: Gaze Track-based Medical Weakly Supervised Segmentation	Zhisong Wang et.al.	2505.22230	null
2025-05-28	A Survey on Training-free Open-Vocabulary Semantic Segmentation	Naomi Kombol et.al.	2505.22209	null
2025-05-28	S2AFormer: Strip Self-Attention for Efficient Vision Transformer	Guoan Xu et.al.	2505.22195	null
2025-05-28	LiDARDustX: A LiDAR Dataset for Dusty Unstructured Road Environments	Chenfeng Wei et.al.	2505.21914	null
2025-05-28	Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation	Mehrdad Noori et.al.	2505.21844	link
2025-05-27	Object-Centric Action-Enhanced Representations for Robot Visuo-Motor Policy Learning	Nikos Giannakakis et.al.	2505.20962	null
2025-05-27	DSOcc: Leveraging Depth Awareness and Semantic Aid to Boost Camera-Based 3D Semantic Occupancy Prediction	Naiyu Fang et.al.	2505.20951	null
2025-05-26	Vision-Based Risk Aware Emergency Landing for UAVs in Complex Urban Environments	Julio de la Torre-Vanegas et.al.	2505.20423	null
2025-05-26	A fully automated urban PV parameterization framework for improved estimation of energy production profiles	Bowen Tian et.al.	2505.19876	null
2025-05-29	Zero-Shot Pseudo Labels Generation Using SAM and CLIP for Semi-Supervised Semantic Segmentation	Nagito Saito et.al.	2505.19846	null
2025-05-26	The Missing Point in Vision Transformers for Universal Image Segmentation	Sajjad Shahabodini et.al.	2505.19795	null
2025-05-26	ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting	Wenhua Wu et.al.	2505.19420	null
2025-05-25	A Joint Learning Framework with Feature Reconstruction and Prediction for Incomplete Satellite Image Time Series in Agricultural Semantic Segmentation	Yuze Wang et.al.	2505.19159	link
2025-05-25	SPARS: Self-Play Adversarial Reinforcement Learning for Segmentation of Liver Tumours	Catalina Tan et.al.	2505.18989	link
2025-05-25	LLM-Guided Taxonomy and Hierarchical Uncertainty for 3D Point Cloud Active Learning	Chenxi Li et.al.	2505.18924	null
2025-05-23	REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders	Savya Khosla et.al.	2505.18153	link
2025-05-23	SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification	Shashank Agnihotri et.al.	2505.18015	link
2025-05-23	Semantic segmentation with reward	Xie Ting et.al.	2505.17905	null
2025-05-23	Hephaestus Minicubes: A Global, Multi-Modal Dataset for Volcanic Unrest Monitoring	Nikolas Papadopoulos et.al.	2505.17782	null
2025-05-23	EMRA-proxy: Enhancing Multi-Class Region Semantic Segmentation in Remote Sensing Images with Attention Proxy	Yichun Yu et.al.	2505.17665	null
2025-05-22	Deep mineralogical segmentation of thin section images based on QEMSCAN maps	Jean Pablo Vieira de Mello et.al.	2505.17008	link
2025-05-22	OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning	Zongyan Han et.al.	2505.16974	link
2025-05-25	NovelSeek: When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification	NovelSeek Team et.al.	2505.16938	link
2025-05-22	TextureSAM: Towards a Texture Aware Foundation Model for Segmentation	Inbal Cohen et.al.	2505.16540	null
2025-05-22	Style Transfer with Diffusion Models for Synthetic-to-Real Domain Adaptation	Estelle Chigot et.al.	2505.16360	link
2025-05-21	VP Lab: a PEFT-Enabled Visual Prompting Laboratory for Semantic Segmentation	Niccolo Avogaro et.al.	2505.15592	null
2025-05-21	seg_3D_by_PC2D: Multi-View Projection for Domain Generalization and Adaptation in 3D Semantic Segmentation	Andrew Caunes et.al.	2505.15545	link
2025-05-21	Spectral-Aware Global Fusion for RGB-Thermal Semantic Segmentation	Ce Zhang et.al.	2505.15491	null
2025-05-21	From Pixels to Images: Deep Learning Advances in Remote Sensing Image Semantic Segmentation	Quanwei Liu et.al.	2505.15147	null
2025-05-20	Scan, Materialize, Simulate: A Generalizable Framework for Physically Grounded Robot Planning	Amine Elhafsi et.al.	2505.14938	null
2025-05-20	LOD1 3D City Model from LiDAR: The Impact of Segmentation Accuracy on Quality of Urban 3D Modeling and Morphology Extraction	Fatemeh Chajaei et.al.	2505.14747	link
2025-05-19	Enhancing Shape Perception and Segmentation Consistency for Industrial Image Inspection	Guoxuan Mao et.al.	2505.14718	null
2025-05-20	Instance Segmentation for Point Sets	Abhimanyu Talwar et.al.	2505.14583	null
2025-05-20	ReservoirTTA: Prolonged Test-time Adaptation for Evolving and Recurring Domains	Guillaume Vray et.al.	2505.14511	null
2025-05-20	Intra-class Patch Swap for Self-Distillation	Hongjun Choi et.al.	2505.14124	link
2025-05-20	Generalizable Multispectral Land Cover Classification via Frequency-Aware Mixture of Low-Rank Token Experts	Xi Chen et.al.	2505.14088	null
2025-05-20	Scaling Vision Mamba Across Resolutions via Fractal Traversal	Bo Li et.al.	2505.14062	null
2025-05-20	EGFormer: Towards Efficient and Generalizable Multimodal Semantic Segmentation	Zelin Zhang et.al.	2505.14014	null
2025-05-19	Self-Supervised Learning for Image Segmentation: A Comprehensive Survey	Thangarajah Akilan et.al.	2505.13584	null
2025-05-19	Robust Multimodal Segmentation with Representation Regularization and Hybrid Prototype Distillation	Jiaqi Tan et.al.	2505.12861	link
2025-05-18	Temporal-Spectral-Spatial Unified Remote Sensing Dense Prediction	Sijie Zhao et.al.	2505.12280	link
2025-05-17	EarthSynth: Generating Informative Earth Observation with Diffusion Models	Jiancheng Pan et.al.	2505.12108	null
2025-05-17	Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Boosting Off-Road Segmentation via Photometric Distortion and Exponential Moving Average	Wonjune Kim et.al.	2505.11769	null
2025-05-16	DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation	Ziyu Zhao et.al.	2505.11676	null
2025-05-16	Completely Weakly Supervised Class-Incremental Learning for Semantic Segmentation	David Minkwan Kim et.al.	2505.10781	null
2025-05-15	Mapping Semantic Segmentation to Point Clouds Using Structure from Motion for Forest Analysis	Francisco Raverta Capua et.al.	2505.10751	link
2025-05-15	TartanGround: A Large-Scale Dataset for Ground Robot Perception and Navigation	Manthan Patel et.al.	2505.10696	null
2025-05-15	SpikeVideoFormer: An Efficient Spike-Driven Video Transformer with Hamming Attention and $\mathcal{O}(T)$ Complexity	Shihao Zou et.al.	2505.10352	null
2025-05-15	APCoTTA: Continual Test-Time Adaptation for Semantic Segmentation of Airborne LiDAR Point Clouds	Yuan Gao et.al.	2505.09971	link
2025-05-14	FedSaaS: Class-Consistency Federated Semantic Segmentation via Global Prototype Supervision and Local Adversarial Harmonization	Xiaoyang Yu et.al.	2505.09385	null
2025-05-14	MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning	Bin-Bin Gao et.al.	2505.09265	null
2025-05-13	MESSI: A Multi-Elevation Semantic Segmentation Image Dataset of an Urban Environment	Barak Pinkovich et.al.	2505.08589	null
2025-05-13	Dynamic Snake Upsampling Operator and Boundary-Skeleton Weighted Loss for Tubular Structure Segmentation	Yiqi Chen et.al.	2505.08525	null
2025-05-13	Optimizing Retrieval-Augmented Generation: Analysis of Hyperparameter Impact on Performance and Efficiency	Adel Ammar et.al.	2505.08445	null
2025-05-13	GNCAF: A GNN-based Neighboring Context Aggregation Framework for Tertiary Lymphoid Structures Semantic Segmentation in WSI	Lei Su et.al.	2505.08430	null
2025-05-12	Privacy Risks of Robot Vision: A User Study on Image Modalities and Resolution	Xuying Huang et.al.	2505.07766	null
2025-05-12	Feedback-Driven Pseudo-Label Reliability Assessment: Redefining Thresholding for Semi-Supervised Semantic Segmentation	Negin Ghamsarian et.al.	2505.07691	null
2025-05-13	TUM2TWIN: Introducing the Large-Scale Multimodal Urban Digital Twin Benchmark Dataset	Olaf Wysocki et.al.	2505.07396	null
2025-05-11	Semantic-Guided Diffusion Model for Single-Step Image Super-Resolution	Zihang Liu et.al.	2505.07071	link
2025-05-11	Depth-Sensitive Soft Suppression with RGB-D Inter-Modal Stylization Flow for Domain Generalization Semantic Segmentation	Binbin Wei et.al.	2505.07050	null
2025-05-11	Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Leveraging Color Shift Correction, RoPE-Swin Backbone, and Quantile-based Label Denoising Strategy for Robust Outdoor Scene Understanding	Chih-Chung Hsu et.al.	2505.06991	null
2025-05-11	Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation	Seokjun Kwon et.al.	2505.06951	null
2025-05-10	Reducing Unimodal Bias in Multi-Modal Semantic Segmentation with Multi-Scale Functional Entropy Regularization	Xu Zheng et.al.	2505.06635	null
2025-05-10	RESAR-BEV: An Explainable Progressive Residual Autoregressive Approach for Camera-Radar Fusion in BEV Segmentation	Zhiwen Zeng et.al.	2505.06515	null
2025-05-06	Show or Tell? A Benchmark To Evaluate Visual and Textual Prompts in Semantic Segmentation	Gabriele Rosi et.al.	2505.06280	link
2025-05-13	Brain Hematoma Marker Recognition Using Multitask Learning: SwinTransformer and Swin-Unet	Kodai Hirata et.al.	2505.06185	null
2025-05-09	UncertainSAM: Fast and Efficient Uncertainty Quantification of the Segment Anything Model	Timo Kaiser et.al.	2505.05049	link
2025-05-08	Split Matching for Inductive Zero-shot Semantic Segmentation	Jialei Chen et.al.	2505.05023	null
2025-05-07	Are Synthetic Corruptions A Reliable Proxy For Real-World Corruptions?	Shashank Agnihotri et.al.	2505.04835	link
2025-05-07	Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer	Sainath Dey et.al.	2505.04740	null
2025-05-07	DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception	Junjie Wang et.al.	2505.04410	link
2025-05-07	MFSeg: Efficient Multi-frame 3D Semantic Segmentation	Chengjie Huang et.al.	2505.04408	null
2025-05-06	CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting	Huawei Sun et.al.	2505.03679	null
2025-05-06	Panoramic Out-of-Distribution Segmentation	Mengfei Duan et.al.	2505.03539	link
2025-05-06	3D Can Be Explored In 2D: Pseudo-Label Generation for LiDAR Point Clouds Using Sensor-Intensity-Based 2D Semantic Segmentation	Andrew Caunes et.al.	2505.03300	null
2025-05-05	Platelet enumeration in dense aggregates	H. Martin Gillis et.al.	2505.02751	null
2025-05-04	Segment Any RGB-Thermal Model with Language-aided Distillation	Dong Xing et.al.	2505.01950	null
2025-05-03	OODTE: A Differential Testing Engine for the ONNX Optimizer	Nikolaos Louloudakis et.al.	2505.01892	null
2025-05-02	A Sensor Agnostic Domain Generalization Framework for Leveraging Geospatial Foundation Models: Enhancing Semantic Segmentation via Synergistic Pseudo-Labeling and Generative Learning	Anan Yaghmour et.al.	2505.01558	link
2025-05-02	Rethinking RGB-Event Semantic Segmentation with a Novel Bidirectional Motion-enhanced Event Representation	Zhen Yao et.al.	2505.01548	link
2025-05-02	GeloVec: Higher Dimensional Geometric Smoothing for Coherent Visual Feature Extraction in Image Segmentation	Boris Kriuk et.al.	2505.01057	null
2025-05-03	Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook	Muyi Bao et.al.	2505.00630	link
2025-05-01	Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation	Feng Xue et.al.	2505.00378	null
2025-04-30	Real Time Semantic Segmentation of High Resolution Automotive LiDAR Scans	Hannes Reichert et.al.	2504.21602	link
2025-05-04	Make Both Ends Meet: A Synergistic Optimization Infrared Small Target Detection with Streamlined Computational Overhead	Yuxin Jing et.al.	2504.21581	null
2025-04-30	ClassWise-CRF: Category-Specific Fusion for Enhanced Semantic Segmentation of Remote Sensing Imagery	Qinfeng Zhu et.al.	2504.21491	null
2025-04-29	DeepVoid: A Deep Learning Void Detector	Sam Kumagai et.al.	2504.21134	null
2025-04-29	Learning a General Model: Folding Clothing with Topological Dynamics	Yiming Liu et.al.	2504.20720	null
2025-04-28	DeepAndes: A Self-Supervised Vision Foundation Model for Multi-Spectral Remote Sensing Imagery of the Andes	Junlin Guo et.al.	2504.20303	null
2025-04-28	SRMF: A Data Augmentation and Multimodal Fusion Approach for Long-Tail UHR Satellite Image Segmentation	Yulong Guo et.al.	2504.19839	null
2025-04-28	Open-set Anomaly Segmentation in Complex Scenarios	Song Xia et.al.	2504.19706	null
2025-04-28	Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding	Yan Wang et.al.	2504.19500	null
2025-04-28	GSFF-SLAM: 3D Semantic Gaussian Splatting SLAM via Feature Field	Zuxing Lu et.al.	2504.19409	null
2025-04-27	DeepSPG: Exploring Deep Semantic Prior Guidance for Low-light Image Enhancement with Multimodal Learning	Jialang Lu et.al.	2504.19127	null
2025-04-26	Federated Learning-based Semantic Segmentation for Lane and Object Detection in Autonomous Driving	Gharbi Khamis Alshammari et.al.	2504.18939	null
2025-04-25	A Data-Centric Approach to 3D Semantic Segmentation of Railway Scenes	Nicolas Münger et.al.	2504.18213	null
2025-04-25	Multi-Grained Compositional Visual Clue Learning for Image Intent Recognition	Yin Tang et.al.	2504.18201	null
2025-04-25	What is the Added Value of UDA in the VFM Era?	Brunó B. Englert et.al.	2504.18190	null
2025-04-25	Back to Fundamentals: Low-Level Visual Features Guided Progressive Token Pruning	Yuanbing Ouyang et.al.	2504.17996	null
2025-04-24	Virtual Roads, Smarter Safety: A Digital Twin Framework for Mixed Autonomous Traffic Safety Analysis	Hao Zhang et.al.	2504.17968	null
2025-04-24	Masked strategies for images with small objects	H. Martin Gillis et.al.	2504.17935	null
2025-04-24	Occlusion-Aware Self-Supervised Monocular Depth Estimation for Weak-Texture Endoscopic Images	Zebo Huang et.al.	2504.17582	null
2025-04-23	SemanticSugarBeets: A Multi-Task Framework and Dataset for Inspecting Harvest and Storage Characteristics of Sugar Beets	Gerardus Croonen et.al.	2504.16684	link
2025-04-23	Federated EndoViT: Pretraining Vision Transformers via Federated Learning on Endoscopic Image Collections	Max Kirchner et.al.	2504.16612	null
2025-04-23	SAIP-Net: Enhancing Remote Sensing Image Segmentation via Spectral Adaptive Information Propagation	Zhongtao Wang et.al.	2504.16564	null
2025-04-22	Efficient Adaptation of Deep Neural Networks for Semantic Segmentation in Space Applications	Leonardo Olivi et.al.	2504.15991	null
2025-04-22	DINOv2-powered Few-Shot Semantic Segmentation: A Unified Framework via Cross-Model Distillation and 4D Correlation Mining	Wei Zhuo et.al.	2504.15669	null
2025-04-21	Segmentation with Noisy Labels via Spatially Correlated Distributions	Ryu Tadokoro et.al.	2504.14795	link
2025-04-19	Exploring Modality Guidance to Enhance VFM-based Feature Fusion for UDA in 3D Semantic Segmentation	Johannes Spoecklberger et.al.	2504.14231	null
2025-04-19	Segment Any Crack: Deep Semantic Segmentation Adaptation for Crack Detection	Ghodsiyeh Rostami et.al.	2504.14138	null
2025-04-19	Lightweight Road Environment Segmentation using Vector Quantization	Jiyong Kwag et.al.	2504.14113	null
2025-04-18	Occlusion-Ordered Semantic Instance Segmentation	Soroosh Baselizadeh et.al.	2504.14054	null
2025-04-18	HDBFormer: Efficient RGB-D Semantic Segmentation with A Heterogeneous Dual-Branch Framework	Shuobin Wei et.al.	2504.13579	null
2025-04-18	Learning from Noisy Pseudo-labels for All-Weather Land Cover Mapping	Wang Liu et.al.	2504.13458	link
2025-04-18	DADU: Dual Attention-based Deep Supervised UNet for Automated Semantic Segmentation of Cardiac Images	Racheal Mukisa et.al.	2504.13415	null
2025-04-18	Cardiac MRI Semantic Segmentation for Ventricles and Myocardium using Deep Learning	Racheal Mukisa et.al.	2504.13391	null
2025-04-17	SAR Object Detection with Self-Supervised Pretraining and Curriculum-Aware Sampling	Yasin Almalioglu et.al.	2504.13310	null
2025-04-17	Digital Twin Generation from Visual Data: A Survey	Andrew Melnik et.al.	2504.13159	link
2025-04-17	High-Fidelity Image Inpainting with Multimodal Guided GAN Inversion	Libo Zhang et.al.	2504.12844	null
2025-04-17	Stronger, Steadier & Superior: Geometric Consistency in Depth VFM Forges Domain Generalized Semantic Segmentation	Siyu Chen et.al.	2504.12753	link
2025-04-17	Parsimonious Dataset Construction for Laparoscopic Cholecystectomy Structure Segmentation	Yuning Zhou et.al.	2504.12573	null
2025-04-17	Privacy-Preserving Operating Room Workflow Analysis using Digital Twins	Alejandra Perez et.al.	2504.12552	null
2025-04-16	3D-PointZshotS: Geometry-Aware 3D Point Cloud Zero-Shot Semantic Segmentation Narrowing the Visual-Semantic Gap	Minmin Yang et.al.	2504.12442	link
2025-04-16	Remote sensing colour image semantic segmentation of trails created by large herbivorous Mammals	Jose Francisco Diez-Pastor et.al.	2504.12121	null
2025-04-12	SDIGLM: Leveraging Large Language Models and Multi-Modal Chain of Thought for Structural Damage Identification	Yunkai Zhang et.al.	2504.11477	null
2025-04-15	PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation	Bo-Cheng Hu et.al.	2504.10986	link
2025-04-15	LightFormer: A lightweight and efficient decoder for remote sensing image segmentation	Sihang Chen et.al.	2504.10834	null
2025-04-15	OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding	Dianbing Xi et.al.	2504.10825	null
2025-04-15	Efficient and Robust Remote Sensing Image Denoising Using Randomized Approximation of Geodesics’ Gramian on the Manifold Underlying the Patch Space	Kelum Gajamannage et.al.	2504.10820	null
2025-04-14	Real-time Seafloor Segmentation and Mapping	Michele Grimaldi et.al.	2504.10750	null
2025-04-14	FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation	Yasser Benigmim et.al.	2504.10487	link
2025-04-14	The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer	Weixian Lei et.al.	2504.10462	link
2025-04-14	M2S-RoAD: Multi-Modal Semantic Segmentation for Road Damage Using Camera and LiDAR Data	Tzu-Yun Tseng et.al.	2504.10123	link
2025-04-14	DUDA: Distilled Unsupervised Domain Adaptation for Lightweight Semantic Segmentation	Beomseok Kang et.al.	2504.09814	null
2025-04-14	IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework under Limited Annotation Scheme	Dinh Dai Quan Tran et.al.	2504.09797	null
2025-04-14	Advancing RFI-Detection in Radio Astronomy with Liquid State Machines	Nicholas J Pritchard et.al.	2504.09796	null
2025-04-12	Evolved Hierarchical Masking for Self-Supervised Learning	Zhanzhou Feng et.al.	2504.09155	null
2025-04-11	Data-Importance-Aware Power Allocation for Adaptive Real-Time Communication in Computer Vision Applications	Chunmei Xu et.al.	2504.08922	null
2025-04-11	Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing	Vinal Asodia et.al.	2504.08704	null
2025-04-11	SN-LiDAR: Semantic Neural Fields for Novel Space-time View LiDAR Synthesis	Yi Chen et.al.	2504.08361	link
2025-04-11	DSM: Building A Diverse Semantic Map for 3D Visual Grounding	Qinghongbing Xie et.al.	2504.08307	null
2025-04-10	ChildlikeSHAPES: Semantic Hierarchical Region Parsing for Animating Figure Drawings	Astitva Srivastava et.al.	2504.08022	null
2025-04-10	Distilling Knowledge from Heterogeneous Architectures for Semantic Segmentation	Yanglin Huang et.al.	2504.07691	null
2025-04-10	RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Radiology with Zero-Shot Multi-Task Capability	Jonggwon Park et.al.	2504.07416	null
2025-04-09	RayFronts: Open-Set Semantic Ray Frontiers for Online Scene Understanding and Exploration	Omar Alama et.al.	2504.06994	null
2025-04-09	Domain Generalization through Attenuation of Domain-Specific Information	Reiji Saito et.al.	2504.06781	link
2025-04-08	SemiDAViL: Semi-supervised Domain Adaptation with Vision-Language Guidance for Semantic Segmentation	Hritam Basak et.al.	2504.06389	null
2025-04-09	Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation	Xiaoxing Hu et.al.	2504.06220	link
2025-04-08	WoundAmbit: Bridging State-of-the-Art Semantic Segmentation and Real-World Wound Care	Vanessa Borst et.al.	2504.06185	null
2025-04-08	Towards Varroa destructor mite detection using a narrow spectra illumination	Samuel Bielik et.al.	2504.06099	null
2025-04-08	econSG: Efficient and Multi-view Consistent Open-Vocabulary 3D Semantic Gaussians	Can Zhang et.al.	2504.06003	null
2025-04-08	Turin3D: Evaluating Adaptation Strategies under Label Scarcity in Urban LiDAR Segmentation with Semi-Supervised Techniques	Luca Barco et.al.	2504.05882	null
2025-04-08	DefMamba: Deformable Visual State Space Model	Leiye Liu et.al.	2504.05794	null
2025-04-08	Transferable Mask Transformer: Cross-domain Semantic Segmentation with Region-adaptive Transferability Estimation	Enming Zhang et.al.	2504.05774	null
2025-04-07	Balancing Robustness and Efficiency in Embedded DNNs Through Activation Function Selection	Jon Gutiérrez Zaballa et.al.	2504.05119	null
2025-04-07	DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation	Bo-Wen Yin et.al.	2504.04701	link
2025-04-05	CoMBO: Conflict Mitigation via Branched Optimization for Class Incremental Segmentation	Kai Fang et.al.	2504.04156	null
2025-04-05	DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning	Xiao-Hui Li et.al.	2504.04085	null
2025-04-01	Input Resolution Downsizing as a Compression Technique for Vision Deep Learning Systems	Jeremy Morlier et.al.	2504.03749	null
2025-04-04	Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation	Xin Zhang et.al.	2504.03193	link
2025-04-02	Global Rice Multi-Class Segmentation Dataset (RiceSEG): A Comprehensive and Diverse High-Resolution RGB-Annotated Images for the Development and Benchmarking of Rice Segmentation Algorithms	Junchi Zhou et.al.	2504.02880	null
2025-04-03	Adaptive Frequency Enhancement Network for Remote Sensing Image Semantic Segmentation	Feng Gao et.al.	2504.02647	link
2025-04-03	Semantic segmentation of forest stands using deep learning	Håkon Næss Sandum et.al.	2504.02471	null
2025-04-03	Taylor Series-Inspired Local Structure Fitting Network for Few-shot Point Cloud Semantic Segmentation	Changshuo Wang et.al.	2504.02454	null
2025-04-02	Overlap-Aware Feature Learning for Robust Unsupervised Domain Adaptation for 3D Semantic Segmentation	Junjie Chen et.al.	2504.01668	null
2025-04-03	Robust Unsupervised Domain Adaptation for 3D Point Cloud Segmentation Under Source Adversarial Attacks	Haosheng Li et.al.	2504.01659	null
2025-04-02	ProtoGuard-guided PROPEL: Class-Aware Prototype Enhancement and Progressive Labeling for Incremental 3D Point Cloud Segmentation	Haosheng Li et.al.	2504.01648	null
2025-04-02	Benchmarking the Spatial Robustness of DNNs via Natural and Adversarial Localized Corruptions	Giulia Marchiori Pietrosanti et.al.	2504.01632	null
2025-04-02	Semi-Supervised Biomedical Image Segmentation via Diffusion Models and Teacher-Student Co-Training	Luca Ciampi et.al.	2504.01547	link
2025-04-02	Beyond Nearest Neighbor Interpolation in Data Augmentation	Olivier Rukundo et.al.	2504.01527	null
2025-04-02	Multimodal Point Cloud Semantic Segmentation With Virtual Point Enhancement	Zaipeng Duan et.al.	2504.01449	null
2025-04-01	CAPE: Connectivity-Aware Path Enforcement Loss for Curvilinear Structure Delineation	Elyar Esmaeilzadeh et.al.	2504.00753	null
2025-04-01	FSSUWNet: Mitigating the Fragility of Pre-trained Models with Feature Enhancement for Few-Shot Semantic Segmentation in Underwater Images	Zhuohao Li et.al.	2504.00478	link
2025-03-31	Spectral-Adaptive Modulation Networks for Visual Perception	Guhnoo Yun et.al.	2503.23947	link
2025-03-31	Bridge the Gap Between Visual and Linguistic Comprehension for Generalized Zero-shot Semantic Segmentation	Xiaoqing Guo et.al.	2503.23806	null
2025-03-31	Decoupled Distillation to Erase: A General Unlearning Method for Any Class-centric Tasks	Yu Zhou et.al.	2503.23751	null
2025-03-31	Semantic Packet Aggregation and Repeated Transmission for Text-to-Image Generation	Seunghun Lee et.al.	2503.23734	null
2025-04-02	CrossFormer: Cross-Segment Semantic Fusion for Document Segmentation	Tongke Ni et.al.	2503.23671	null
2025-03-30	BoundMatch: Boundary detection applied to semi-supervised segmentation for urban-driving scenes	Haruya Ishikawa et.al.	2503.23519	null
2025-03-30	Improving underwater semantic segmentation with underwater image quality attention and multi-scale aggregation attention	Xin Zuo et.al.	2503.23422	link
2025-03-29	Open-Vocabulary Semantic Segmentation with Uncertainty Alignment for Robotic Scene Understanding in Indoor Building Environments	Yifan Xu et.al.	2503.23105	null
2025-03-28	Enhancing DeepLabV3+ to Fuse Aerial and Satellite Images for Semantic Segmentation	Anas Berka et.al.	2503.22909	null
2025-03-28	The Marine Debris Forward-Looking Sonar Datasets	Matias Valdenegro-Toro et.al.	2503.22880	null
2025-03-28	KEVS: Enhancing Segmentation of Visceral Adipose Tissue in Pre-Cystectomy CT with Gaussian Kernel Density Estimation	Thomas Boucher et.al.	2503.22592	null
2025-03-28	A Dataset for Semantic Segmentation in the Presence of Unknowns	Zakaria Laskar et.al.	2503.22309	null
2025-03-28	Concept-Aware LoRA for Domain-Aligned Segmentation Dataset Generation	Minho Park et.al.	2503.22172	null
2025-03-28	Beyond Background Shift: Rethinking Instance Replay in Continual Semantic Segmentation	Hongmei Yin et.al.	2503.22136	link
2025-03-28	Semantic segmentation for building houses from wooden cubes	Ivan Beleacov et.al.	2503.22125	null
2025-03-28	Baseline Systems and Evaluation Metrics for Spatial Semantic Segmentation of Sound Scenes	Binh Thien Nguyen et.al.	2503.22088	null
2025-03-28	A Deep Learning Framework for Boundary-Aware Semantic Segmentation	Tai An et.al.	2503.22050	null
2025-03-27	Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation	Reza Qorbani et.al.	2503.21780	link
2025-03-27	A Unified Image-Dense Annotation Generation Model for Underwater Scenes	Hongkai Lin et.al.	2503.21771	link
2025-03-27	Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving	Lucas Nunes et.al.	2503.21449	link
2025-03-26	Exploring CLIP’s Dense Knowledge for Weakly Supervised Semantic Segmentation	Zhiwei Yang et.al.	2503.20826	link
2025-03-26	Exploiting Temporal State Space Sharing for Video Semantic Segmentation	Syed Ariff Syed Hesham et.al.	2503.20824	link
2025-03-25	Hyperdimensional Uncertainty Quantification for Multimodal Uncertainty Fusion in Autonomous Vehicles Perception	Luke Chen et.al.	2503.20011	null
2025-03-25	The Coralscapes Dataset: Semantic Scene Understanding in Coral Reefs	Jonathan Sauder et.al.	2503.20000	link
2025-03-25	LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation	Vladan Stojnić et.al.	2503.19777	link
2025-03-25	OpenLex3D: A New Evaluation Benchmark for Open-Vocabulary 3D Scene Representations	Christina Kassab et.al.	2503.19764	null
2025-03-25	Show or Tell? Effectively prompting Vision-Language Models for semantic segmentation	Niccolo Avogaro et.al.	2503.19647	null
2025-03-25	Exploring Textual Semantics Diversity for Image Transmission in Semantic Communication Systems using Visual Language Model	Peishan Huang et.al.	2503.19386	null
2025-03-25	BIMII-Net: Brain-Inspired Multi-Iterative Interactive Network for RGB-T Road Scene Semantic Segmentation	Hanshuo Qiu et.al.	2503.19303	null
2025-03-25	Context-Aware Semantic Segmentation: Enhancing Pixel-Level Understanding with Large Language Models for Advanced Vision Applications	Ben Rahman et.al.	2503.19276	null
2025-03-24	DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation	Karim Abou Zeid et.al.	2503.18944	link
2025-03-24	Exploring the Integration of Key-Value Attention Into Pure and Hybrid Transformers for Semantic Segmentation	DeShin Hwa et.al.	2503.18862	null
2025-03-24	HiRes-FusedMIM: A High-Resolution RGB-DSM Pre-trained Model for Building-Level Remote Sensing Applications	Guneet Mutreja et.al.	2503.18540	null
2025-03-24	Benchmarking Multi-modal Semantic Segmentation under Sensor Failures: Missing and Noisy Modality Robustness	Chenfei Liao et.al.	2503.18445	link
2025-03-24	PDDM: Pseudo Depth Diffusion Model for RGB-PD Semantic Segmentation Based in Complex Indoor Scenes	Xinhua Xu et.al.	2503.18393	null
2025-03-24	MaSS13K: A Matting-level Semantic Segmentation Benchmark	Chenxi Xie et.al.	2503.18364	link
2025-03-23	Co-SemDepth: Fast Joint Semantic Segmentation and Depth Estimation on Aerial Images	Yara AlaaEldin et.al.	2503.17982	link
2025-03-23	FisherTune: Fisher-Guided Robust Tuning of Vision Foundation Models for Domain Generalized Segmentation	Dong Zhao et.al.	2503.17940	null
2025-03-23	Semi-supervised Semantic Segmentation with Multi-Constraint Consistency Learning	Jianjian Yin et.al.	2503.17914	link
2025-03-22	HiLoTs: High-Low Temporal Sensitive Representation Learning for Semi-Supervised LiDAR Segmentation in Autonomous Driving	R. D. Lin et.al.	2503.17752	link
2025-03-22	Multi-modality Anomaly Segmentation on the Road	Heng Gao et.al.	2503.17712	link
2025-03-21	Should we pre-train a decoder in contrastive learning for dense prediction tasks?	Sébastien Quetin et.al.	2503.17526	null
2025-03-21	Center-guided Classifier for Semantic Segmentation of Remote Sensing Images	Wei Zhang et.al.	2503.16963	link
2025-03-21	Seg2Box: 3D Object Detection by Point-Wise Semantics Supervision	Maoji Zheng et.al.	2503.16811	null
2025-03-20	SAGE: Semantic-Driven Adaptive Gaussian Splatting in Extended Reality	Chiara Schiavo et.al.	2503.16747	null
2025-03-20	Panoptic-CUDAL Technical Report: Rural Australia Point Cloud Dataset in Rainy Conditions	Tzu-Yun Tseng et.al.	2503.16378	null
2025-03-20	Controllable Segmentation-Based Text-Guided Style Editing	Jingwen Li et.al.	2503.16129	null
2025-03-24	No Thing, Nothing: Highlighting Safety-Critical Classes for Robust LiDAR Semantic Segmentation in Adverse Weather	Junsung Park et.al.	2503.15910	null
2025-03-19	High Temporal Consistency through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous Flight	Cédric Vincent et.al.	2503.15676	link
2025-03-19	Transport-Related Surface Detection with Machine Learning: Analyzing Temporal Trends in Madrid and Vienna	Miguel Ureña Pliego et.al.	2503.15653	link
2025-03-19	CAM-Seg: A Continuous-valued Embedding Approach for Semantic Image Generation	Masud Ahmed et.al.	2503.15617	link
2025-03-21	SUM Parts: Benchmarking Part-Level Semantic Segmentation of Urban Meshes	Weixiao Gao et.al.	2503.15300	null
2025-03-19	Semantic Segmentation of Transparent and Opaque Drinking Glasses with the Help of Zero-shot Learning	Annalena Blänsdorf et.al.	2503.15004	null
2025-03-19	USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation using Features from a Pre-trained Image Segmentation network	Joseph Emmanuel DL Dayo et.al.	2503.14950	null
2025-03-18	PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point Clouds	Barza Nisar et.al.	2503.13914	null
2025-03-18	Exploiting Inherent Class Label: Towards Robust Scribble Supervised Semantic Segmentation	Xinliang Zhang et.al.	2503.13895	link
2025-03-17	Let Synthetic Data Shine: Domain Reassembly and Soft-Fusion for Single Domain Generalization	Hao Li et.al.	2503.13617	null
2025-03-17	3D Hierarchical Panoptic Segmentation in Real Orchard Environments Across Different Sensors	Matteo Sodano et.al.	2503.13188	null
2025-03-17	DehazeMamba: SAR-guided Optical Remote Sensing Image Dehazing with Adaptive State Space Model	Zhicheng Zhao et.al.	2503.13073	null
2025-03-17	Adaptive Transformer Attention and Multi-Scale Fusion for Spine 3D Segmentation	Yanlin Xiang et.al.	2503.12853	null
2025-03-17	LangDA: Building Context-Awareness via Language for Domain Adaptive Semantic Segmentation	Chang Liu et.al.	2503.12780	null
2025-03-17	TransDiff: Diffusion-Based Method for Manipulating Transparent Objects Using a Single RGB-D Image	Haoxiao Wang et.al.	2503.12779	null
2025-03-16	Point Cloud Based Scene Segmentation: A Survey	Dan Halperin et.al.	2503.12595	null
2025-03-16	BFANet: Revisiting 3D Semantic Segmentation with Boundary Feature Analysis	Weiguang Zhao et.al.	2503.12539	link
2025-03-16	SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs	Guibiao Liao et.al.	2503.12535	null
2025-03-16	Shape Bias and Robustness Evaluation via Cue Decomposition for Image Classification and Segmentation	Edgar Heinert et.al.	2503.12453	null
2025-03-17	COIN: Confidence Score-Guided Distillation for Annotation-Free Cell Segmentation	Sanghyun Jo et.al.	2503.11439	null
2025-03-14	SpaceSeg: A High-Precision Intelligent Perception Segmentation Method for Multi-Spacecraft On-Orbit Targets	Hao Liu et.al.	2503.11133	null
2025-03-14	A Novel Decomposed Feature-Oriented Framework for Open-Set Semantic Segmentation on LiDAR Data	Wenbang Deng et.al.	2503.11097	link
2025-03-12	Knowledge Consultation for Semi-Supervised Semantic Segmentation	Thuan Than et.al.	2503.10693	null
2025-03-11	VFM-UDA++: Improving Network Architectures and Data Strategies for Unsupervised Domain Adaptive Semantic Segmentation	Brunó B. Englert et.al.	2503.10685	null
2025-03-13	RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing	Fengxiang Wang et.al.	2503.10392	link
2025-03-13	OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions	Maxim Popov et.al.	2503.10331	null
2025-03-12	CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation	Hariprasath Govindarajan et.al.	2503.09878	null
2025-03-12	Active Learning Inspired ControlNet Guidance for Augmenting Semantic Segmentation Datasets	Hannah Kniesel et.al.	2503.09221	null
2025-03-07	Real-Time Semantic Segmentation of Aerial Images Using an Embedded U-Net: A Comparison of CPU, GPU, and FPGA Workflows	Julien Posso et.al.	2503.08700	null
2025-03-11	SegDesicNet: Lightweight Semantic Segmentation in Remote Sensing with Geo-Coordinate Embeddings for Domain Adaptation	Sachin Verma et.al.	2503.08290	null
2025-03-16	Structural and Statistical Texture Knowledge Distillation and Learning for Segmentation	Deyi Ji et.al.	2503.08043	null
2025-03-11	DiffEGG: Diffusion-Driven Edge Generation as a Pixel-Annotation-Free Alternative for Instance Annotation	Sanghyun Jo et.al.	2503.07982	null
2025-03-10	Can Generative Geospatial Diffusion Models Excel as Discriminative Geospatial Foundation Models?	Yuru Jia et.al.	2503.07890	null
2025-03-10	REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding	Yan Tai et.al.	2503.07413	link
2025-03-10	Semantic Communications with Computer Vision Sensing for Edge Video Transmission	Yubo Peng et.al.	2503.07252	null
2025-03-10	OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation	Ding Zhong et.al.	2503.07098	null
2025-03-10	Approximate Size Targets Are Sufficient for Accurate Semantic Segmentation	Xingye Fan et.al.	2503.06954	null
2025-03-10	Aligning Instance-Semantic Sparse Representation towards Unsupervised Object Segmentation and Shape Abstraction with Repeatable Primitives	Jiaxin Li et.al.	2503.06947	null
2025-03-10	HierDAMap: Towards Universal Domain Adaptive BEV Mapping via Hierarchical Perspective Priors	Siyu Li et.al.	2503.06821	link
2025-03-09	CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving	Rui Song et.al.	2503.06744	null
2025-03-09	MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation	Chenfei Liao et.al.	2503.06700	null
2025-03-09	Asymmetric Decision-Making in Online Knowledge Distillation: Unifying Consensus and Divergence	Zhaowei Chen et.al.	2503.06685	null
2025-03-09	Steerable Pyramid Weighted Loss: Multi-Scale Adaptive Weighting for Semantic Segmentation	Renhao Lu et.al.	2503.06604	null
2025-03-09	MultiCo3D: Multi-Label Voxel Contrast for One-Shot Incremental Segmentation of 3D Neuroimages	Hao Xu et.al.	2503.06598	null
2025-03-08	ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation	Qizhen Lan et.al.	2503.06307	null
2025-03-11	PointDiffuse: A Dual-Conditional Diffusion Model for Enhanced Point Cloud Semantic Segmentation	Yong He et.al.	2503.06094	null
2025-03-07	Kaiwu: A Multimodal Manipulation Dataset and Framework for Robot Learning and Human-Robot Interaction	Shuo Jiang et.al.	2503.05231	null
2025-03-08	EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images	Rohit Menon et.al.	2503.04441	null
2025-03-06	PointsToWood: A deep learning framework for complete canopy leaf-wood segmentation of TLS data across diverse European forests	Harry J. F. Owen et.al.	2503.04420	null
2025-03-06	Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes	Hui Zhang et.al.	2503.04235	null
2025-03-06	MASTER: Multimodal Segmentation with Text Prompts	Fuyang Liu et.al.	2503.04199	null
2025-03-06	Towards Intelligent Transportation with Pedestrians and Vehicles In-the-Loop: A Surveillance Video-Assisted Federated Digital Twin Framework	Xiaolong Li et.al.	2503.04170	null
2025-03-06	H3O: Hyper-Efficient 3D Occupancy Prediction with Heterogeneous Supervision	Yunxiao Shi et.al.	2503.04059	null
2025-03-06	GaussianGraph: 3D Gaussian-based Scene Graph Generation for Open-world Scene Understanding	Xihan Wang et.al.	2503.04034	null
2025-03-06	DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation	Amin Karimi et.al.	2503.04006	null
2025-03-05	COARSE: Collaborative Pseudo-Labeling with Coarse Real Labels for Off-Road Semantic Segmentation	Aurelio Noca et.al.	2503.03947	null
2025-03-05	SurgiSAM2: Fine-tuning a foundational model for surgical video anatomy segmentation and detection	Devanish N. Kamtam et.al.	2503.03942	null
2025-03-05	Golden Cudgel Network for Real-Time Semantic Segmentation	Guoyu Yang et.al.	2503.03325	link
2025-03-05	Label-Efficient LiDAR Semantic Segmentation with 2D-3D Vision Transformer Adapters	Julia Hindel et.al.	2503.03299	null
2025-03-05	Car-STAGE: Automated framework for large-scale high-dimensional simulated time-series data generation based on user-defined criteria	Asma A. Almutairi et.al.	2503.03100	null
2025-03-04	Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance	Jiayi Zhao et.al.	2503.02581	link
2025-03-04	TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping	Xinying Hong et.al.	2503.02578	link
2025-03-04	Exploring Token-Level Augmentation in Vision Transformer for Semi-Supervised Semantic Segmentation	Dengke Zhang et.al.	2503.02459	link
2025-03-03	SAGE: A Framework of Precise Retrieval for RAG	Jintao Zhang et.al.	2503.01713	null
2025-03-04	UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface	Hao Tang et.al.	2503.01342	link
2025-03-03	Convex Hull-based Algebraic Constraint for Visual Quadric SLAM	Xiaolong Yu et.al.	2503.01254	link
2025-03-03	Identity documents recognition and detection using semantic segmentation with convolutional neural network	Mykola Kozlenko et.al.	2503.01085	null
2025-03-02	Using Synthetic Images to Augment Small Medical Image Datasets	Minh H. Vu et.al.	2503.00962	null
2025-03-02	Unifying Light Field Perception with Field of Parallax	Fei Teng et.al.	2503.00747	link
2025-03-01	Explainable LiDAR 3D Point Cloud Segmentation and Clustering for Detecting Airplane-Generated Wind Turbulence	Zhan Qu et.al.	2503.00518	null
2025-02-27	Multi-Scale Neighborhood Occupancy Masked Autoencoder for Self-Supervised Learning in LiDAR Point Clouds	Mohamed Abdelsamad et.al.	2502.20316	null
2025-02-27	OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels	Meng Lou et.al.	2502.20087	link
2025-02-28	SegLocNet: Multimodal Localization Network for Autonomous Driving via Bird’s-Eye-View Segmentation	Zijie Zhou et.al.	2502.20077	link
2025-03-04	3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds	Hengshuo Chu et.al.	2502.20041	null
2025-02-27	Learning Mask Invariant Mutual Information for Masked Image Modeling	Tao Huang et.al.	2502.19718	null
2025-02-26	Knowledge Distillation for Semantic Segmentation: A Label Space Unification Approach	Anton Backhaus et.al.	2502.19177	null
2025-02-26	Enhanced Neuromorphic Semantic Segmentation Latency through Stream Event	D. Hareb et.al.	2502.18982	null
2025-02-22	Multi-Teacher Knowledge Distillation with Reinforcement Learning for Visual Recognition	Chuanguang Yang et.al.	2502.18510	null
2025-02-28	OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation	Yunpeng Gao et.al.	2502.18041	null
2025-02-25	CAML: Collaborative Auxiliary Modality Learning for Multi-Agent Systems	Rui Liu et.al.	2502.17821	null
2025-02-25	DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks	Canyu Zhao et.al.	2502.17157	link
2025-02-24	SpecDM: Hyperspectral Dataset Synthesis with Pixel-level Semantic Annotations	Wendi Liu et.al.	2502.17056	null
2025-02-25	VPNeXt – Rethinking Dense Decoding for Plain Vision Transformer	Xikai Tang et.al.	2502.16654	null
2025-02-23	Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration	Kim Jun-Seong et.al.	2502.16652	null
2025-02-23	OpenVox: Real-time Instance-level Open-vocabulary Probabilistic Voxel Representation	Yinan Deng et.al.	2502.16528	null
2025-02-23	Deep learning approaches to surgical video segmentation and object detection: A Scoping Review	Devanish N. Kamtam et.al.	2502.16459	null
2025-02-22	Importance-Aware Source-Channel Coding for Multi-Modal Task-Oriented Semantic Communication	Yi Ma et.al.	2502.16194	null
2025-02-22	FeatSharp: Your Vision Model Features, Sharper	Mike Ranzinger et.al.	2502.16025	null
2025-02-22	Cross-Model Transferability of Adversarial Patches in Real-time Segmentation for Autonomous Driving	Prashant Shekhar et.al.	2502.16012	link
2025-02-21	Graph Attention Convolutional U-NET: A Semantic Segmentation Model for Identifying Flooded Areas	Muhammad Umair Danish et.al.	2502.15907	null
2025-02-21	DOEI: Dual Optimization of Embedding Information for Attention-Enhanced Class Activation Maps	Hongjie Zhu et.al.	2502.15885	link
2025-02-21	Aligning Task- and Reconstruction-Oriented Communications for Edge Intelligence	Yufeng Diao et.al.	2502.15472	null
2025-02-24	DynamicGSG: Dynamic 3D Gaussian Scene Graphs for Environment Adaptation	Luzhou Ge et.al.	2502.15309	link
2025-02-21	Confidence-Weighted Boundary-Aware Learning for Semi-Supervised Semantic Segmentation	Ebenezer Tarubinga et.al.	2502.15152	link
2025-02-20	RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird’s Eye View Segmentation	Henrique Piñeiro Monteagudo et.al.	2502.14792	null
2025-02-20	Multi-dataset synergistic in supervised learning to pre-label structural components in point clouds from shell construction scenes	Lukas Rauch et.al.	2502.14721	null
2025-02-20	Reliable Explainability of Deep Learning Spatial-Spectral Classifiers for Improved Semantic Segmentation in Autonomous Driving	Jon Gutiérrez-Zaballa et.al.	2502.14416	null
2025-02-20	Bayesian SegNet for Semantic Segmentation with Improved Interpretation of Microstructural Evolution During Irradiation of Materials	Marjolein Oostrom et.al.	2502.14184	null
2025-02-19	SegRet: An Efficient Design for Semantic Segmentation with Retentive Network	Zhiyuan Li et.al.	2502.14014	link
2025-02-19	Remote Sensing Semantic Segmentation Quality Assessment based on Vision Language Model	Huiying Shi et.al.	2502.13990	null
2025-02-19	MGFI-Net: A Multi-Grained Feature Integration Network for Enhanced Medical Image Segmentation	Yucheng Zeng et.al.	2502.13808	null
2025-02-19	CARE: Confidence-Aware Regression Estimation of building density fine-tuning EO Foundation Models	Nikolaos Dionelis et.al.	2502.13734	null
2025-02-18	Enhancing Power Grid Inspections with Machine Learning	Diogo Lavado et.al.	2502.13037	null
2025-02-18	DAMamba: Vision State Space Model with Dynamic Adaptive Scan	Tanzhe Li et.al.	2502.12627	link
2025-02-17	From Open-Vocabulary to Vocabulary-Free Semantic Segmentation	Klara Reichard et.al.	2502.11891	null
2025-02-16	Detecting Cadastral Boundary from Satellite Images Using U-Net model	Neda Rahimpour Anaraki et.al.	2502.11044	null
2025-02-15	NPSim: Nighttime Photorealistic Simulation From Daytime Images With Monocular Inverse Rendering and Ray Tracing	Shutong Zhang et.al.	2502.10720	null
2025-02-15	Deep Learning for Wound Tissue Segmentation: A Comprehensive Evaluation using A Novel Dataset	Muhammad Ashad Kabir et.al.	2502.10652	link
2025-02-14	Artificial Intelligence to Assess Dental Findings from Panoramic Radiographs – A Multinational Study	Yin-Chih Chelsea Wang et.al.	2502.10277	link
2025-02-13	SQ-GAN: Semantic Image Communications Using Masked Vector Quantization	Francesco Pezone et.al.	2502.09520	link
2025-02-13	FLARES: Fast and Accurate LiDAR Multi-Range Semantic Segmentation	Bin Yang et.al.	2502.09274	null
2025-02-17	Memory-based Ensemble Learning in CMR Semantic Segmentation	Yiwei Liu et.al.	2502.09269	link
2025-02-13	Latents of latents to delineate pixels: hybrid Matryoshka autoencoder-to-U-Net pairing for segmenting large medical images in GPU-poor and low-data regimes	Tahir Syed et.al.	2502.08988	null
2025-02-17	Knowledge Swapping via Learning and Unlearning	Mingyu Xing et.al.	2502.08075	link
2025-02-11	Efficient Continuous Group Convolutions for Local SE(3) Equivariance in 3D Point Clouds	Lisa Weijler et.al.	2502.07505	link
2025-02-11	A Survey on Mamba Architecture for Vision Applications	Fady Ibrahim et.al.	2502.07161	null
2025-02-09	A Comprehensive Review of U-Net and Its Variants: Advances and Applications in Medical Image Segmentation	Wang Jiangtao et.al.	2502.06895	null
2025-02-10	SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement	Yuqi Lin et.al.	2502.06756	link
2025-02-11	Enhancing Ground-to-Aerial Image Matching for Visual Misinformation Detection Using Semantic Segmentation	Emanuele Mule et.al.	2502.06288	link
2025-02-10	Unsupervised deep learning for semantic segmentation of multispectral LiDAR forest point clouds	Lassi Ruoppa et.al.	2502.06227	null
2025-02-12	Traveling Waves Integrate Spatial Information Into Spectral Representations	Mozes Jacobs et.al.	2502.06034	link
2025-02-09	LegalSeg: Unlocking the Structure of Indian Legal Judgments Through Rhetorical Role Classification	Shubham Kumar Nigam et.al.	2502.05836	null
2025-02-08	Convolutional Neural Network Segmentation for Satellite Imagery Data to Identify Landforms Using U-Net Architecture	Mitul Goswami et.al.	2502.05476	null
2025-02-08	LMS-Net: A Learned Mumford-Shah Network For Few-Shot Medical Image Segmentation	Shengdong Zhang et.al.	2502.05473	null
2025-02-08	A Novel Convolutional-Free Method for 3D Medical Imaging Segmentation	Canxuan Gang et.al.	2502.05396	null
2025-02-07	IPSeg: Image Posterior Mitigates Semantic Drift in Class-Incremental Segmentation	Xiao Yu et.al.	2502.04870	link
2025-02-05	DILLEMA: Diffusion and Large Language Models for Multi-Modal Augmentation	Luciano Baresi et.al.	2502.04378	link
2025-02-06	Adaptive Margin Contrastive Learning for Ambiguity-aware 3D Semantic Segmentation	Yang Chen et.al.	2502.04111	null
2025-02-06	LeAP: Consistent multi-domain 3D labeling using Foundation Models	Simon Gebraad et.al.	2502.03901	null
2025-02-06	Optimized Unet with Attention Mechanism for Multi-Scale Semantic Segmentation	Xuan Li et.al.	2502.03813	null
2025-02-05	Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics	Indrashis Das et.al.	2502.03654	link
2025-02-08	Disentangling CLIP Features for Enhanced Localized Understanding	Samyak Rawlekar et.al.	2502.02977	null
2025-02-05	From DeepSense to Open RAN: AI/ML Advancements in Dynamic Spectrum Sensing and Their Applications	Ryan Barker et.al.	2502.02889	null
2025-02-04	Muographic Image Upsampling with Machine Learning for Built Infrastructure Applications	William O’Donnell et.al.	2502.02624	null
2025-02-04	Transfer Risk Map: Mitigating Pixel-level Negative Transfer in Medical Segmentation	Shutong Duan et.al.	2502.02340	null
2025-02-04	UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation	Tao Zhang et.al.	2502.02257	link
2025-02-04	Deep Ensemble approach for Enhancing Brain Tumor Segmentation in Resource-Limited Settings	Jeremiah Fadugba et.al.	2502.02179	null
2025-02-04	Memory Efficient Transformer Adapter for Dense Predictions	Dong Zhang et.al.	2502.01962	null
2025-02-03	Deep Unfolding Multi-modal Image Fusion Network via Attribution Analysis	Haowen Bai et.al.	2502.01467	null
2025-02-03	Temporal-consistent CAMs for Weakly Supervised Video Segmentation in Waste Sorting	Andrea Marelli et.al.	2502.01455	null
2025-02-03	ConceptVAE: Self-Supervised Fine-Grained Concept Disentanglement from 2D Echocardiographies	Costin F. Ciusdel et.al.	2502.01335	null
2025-02-03	FSPGD: Rethinking Black-box Attacks on Semantic Segmentation	Eun-Sol Park et.al.	2502.01262	link
2025-02-03	Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models	Tongkun Liu et.al.	2502.01216	link
2025-02-02	SAM-guided Pseudo Label Enhancement for Multi-modal 3D Semantic Segmentation	Mingyu Yang et.al.	2502.00960	null
2025-02-01	Complex Wavelet Mutual Information Loss: A Multi-Scale Loss Function for Semantic Segmentation	Renhao Lu et.al.	2502.00563	link
2025-01-31	Lifting by Gaussians: A Simple, Fast and Flexible Method for 3D Instance Segmentation	Rohan Chacko et.al.	2502.00173	null
2025-01-31	CerraData-4MM: A multimodal benchmark dataset on Cerrado for land use and land cover classification	Mateus de Souza Miranda et.al.	2502.00083	link
2025-01-31	GO: The Great Outdoors Multimodal Dataset	Peng Jiang et.al.	2501.19274	null
2025-01-31	Medical Semantic Segmentation with Diffusion Pretrain	David Li et.al.	2501.19265	null
2025-01-31	ContextFormer: Redefining Efficiency in Semantic Segmentation	Mian Muhammad Naeem Abid et.al.	2501.19255	null
2025-01-31	Integrating Semi-Supervised and Active Learning for Semantic Segmentation	Wanli Ma et.al.	2501.19227	null
2025-01-31	SynthmanticLiDAR: A Synthetic Dataset for Semantic Segmentation on LiDAR Imaging	Javier Montalvo et.al.	2501.19035	link
2025-01-31	Project-and-Fuse: Improving RGB-D Semantic Segmentation via Graph Convolution Networks	Xiaoyan Jiang et.al.	2501.18851	null
2025-02-03	Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models	Hao Dong et.al.	2501.18592	link
2025-01-30	Ground Awareness in Deep Learning for Large Outdoor Point Cloud Segmentation	Kevin Qiu et.al.	2501.18246	null
2025-01-29	Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation	Lin Chen et.al.	2501.17642	null
2025-01-29	3DSES: an indoor Lidar point cloud segmentation dataset with real and pseudo-labels from a 3D model	Maxime Mérizette et.al.	2501.17534	null
2025-01-29	Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models	Muhammad Atta ur Rahman et.al.	2501.16769	null
2025-01-28	AdaSemSeg: An Adaptive Few-shot Semantic Segmentation of Seismic Facies	Surojit Saha et.al.	2501.16760	null
2025-01-28	SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios	Yinqi Chen et.al.	2501.16754	null
2025-01-27	Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation	Philip Hughes et.al.	2501.16467	null
2025-01-27	DynAlign: Unsupervised Dynamic Taxonomy Alignment for Cross-Domain Segmentation	Han Sun et.al.	2501.16410	null
2025-01-27	The Linear Attention Resurrection in Vision Transformer	Chuanyang Zheng et.al.	2501.16182	null
2025-01-27	D-PLS: Decoupled Semantic Segmentation for 4D-Panoptic-LiDAR-Segmentation	Maik Steinhauser et.al.	2501.15870	null
2025-01-26	iFormer: Integrating ConvNet and Transformer for Mobile Application	Chuanyang Zheng et.al.	2501.15369	link
2025-01-25	A Training-free Synthetic Data Selection Method for Semantic Segmentation	Hao Tang et.al.	2501.15201	link
2025-01-24	3DLabelProp: Geometric-Driven Domain Generalization for LiDAR Semantic Segmentation in Autonomous Driving	Jules Sanchez et.al.	2501.14605	link
2025-01-23	ME-CPT: Multi-Task Enhanced Cross-Temporal Point Transformer for Urban 3D Change Detection	Luqi Zhang et.al.	2501.14004	link
2025-01-23	IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models	Jiayi Lei et.al.	2501.13920	null
2025-01-23	Where Do You Go? Pedestrian Trajectory Prediction using Scene Features	Mohammad Ali Rezaei et.al.	2501.13848	null
2025-01-23	Overcoming Support Dilution for Robust Few-shot Semantic Segmentation	Wailing Tang et.al.	2501.13529	null
2025-01-22	Revisiting Data Augmentation for Ultrasound Images	Adam Tupper et.al.	2501.13193	link
2025-01-22	A Novel Scene Coupling Semantic Mask Network for Remote Sensing Image Segmentation	Xiaowen Ma et.al.	2501.13130	link
2025-01-22	Hybridization of Attention UNet with Repeated Atrous Spatial Pyramid Pooling for Improved Brain Tumour Segmentation	Satyaki Roy Chowdhury et.al.	2501.13129	null
2025-01-22	Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks	Alessio Quercia et.al.	2501.12824	link
2025-01-19	Comparative Analysis of Hand-Crafted and Machine-Driven Histopathological Features for Prostate Cancer Classification and Segmentation	Feda Bolus Al Baqain et.al.	2501.12415	null
2025-01-21	Benchmarking Image Perturbations for Testing Automated Driving Assistance Systems	Stefano Carlo Lambertenghi et.al.	2501.12269	link
2025-01-21	A margin-based replacement for cross-entropy loss	Michael W. Spratling et.al.	2501.12191	null
2025-01-20	MedicoSAM: Towards foundation models for medical image segmentation	Anwai Archit et.al.	2501.11734	link
2025-01-20	Automatic Labelling & Semantic Segmentation with 4D Radar Tensors	Botao Sun et.al.	2501.11351	null
2025-01-20	Enhancing Uncertainty Estimation in Semantic Segmentation via Monte-Carlo Frequency Dropout	Tal Zeevi et.al.	2501.11258	link
2025-01-19	Rethinking Early-Fusion Strategies for Improved Multimodal Image Segmentation	Zhengwen Shen et.al.	2501.10958	null
2025-01-22	OpenEarthMap-SAR: A Benchmark Synthetic Aperture Radar Dataset for Global High-Resolution Land Cover Mapping	Junshi Xia et.al.	2501.10891	null
2025-01-18	GAUDA: Generative Adaptive Uncertainty-guided Diffusion-based Augmentation for Surgical Segmentation	Yannik Frisch et.al.	2501.10819	null
2025-01-18	Semi-supervised Semantic Segmentation for Remote Sensing Images via Multi-scale Uncertainty Consistency and Cross-Teacher-Student Attention	Shanwen Wang et.al.	2501.10736	link
2025-01-17	Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks	Michael Schwingshackl et.al.	2501.10080	link
2025-01-17	Robust Change Captioning in Remote Sensing: SECOND-CC Dataset and MModalCC Framework	Ali Can Karaca et.al.	2501.10075	null
2025-01-17	One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression	Keita Miwa et.al.	2501.10064	null
2025-01-17	LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks	Wei Lu et.al.	2501.10040	link
2025-01-16	The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning	Wonjun Jo et.al.	2501.09485	null
2025-01-16	Scaling up self-supervised learning for improved surgical foundation models	Tim J. M. Jaspers et.al.	2501.09436	link
2025-01-16	SVIA: A Street View Image Anonymization Framework for Self-Driving Applications	Dongyu Liu et.al.	2501.09393	link
2025-01-15	UNIR-Net: A Novel Approach for Restoring Underwater Images with Non-Uniform Illumination Using Synthetic Data	Ezequiel Perez-Zarate et.al.	2501.09053	link
2025-01-15	Pseudolabel guided pixels contrast for domain adaptive semantic segmentation	Jianzi Xiang et.al.	2501.09040	link
2025-01-14	FLAVARS: A Multimodal Foundational Language and Vision Alignment Model for Remote Sensing	Isaac Corley et.al.	2501.08490	null
2025-01-14	Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers	Efstathios Karypidis et.al.	2501.08303	link
2025-01-14	A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation	Steven Landgraf et.al.	2501.08188	null
2025-01-14	Threshold Attention Network for Semantic Segmentation of Remote Sensing Images	Wei Long et.al.	2501.07984	null
2025-01-14	Balance Divergence for Knowledge Distillation	Yafei Qi et.al.	2501.07804	null
2025-01-13	Kolmogorov-Arnold Network for Remote Sensing Image Semantic Segmentation	Xianping Ma et.al.	2501.07390	link
2025-01-13	Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion	Li Liang et.al.	2501.07260	link
2025-01-12	LarvSeg: Exploring Image Classification Data For Large Vocabulary Semantic Segmentation via Category-wise Attentive Classifier	Haojun Yu et.al.	2501.06862	link
2025-01-12	SAM-DA: Decoder Adapter for Efficient Medical Domain Adaptation	Javier Gamazo Tejero et.al.	2501.06836	null
2025-01-11	Parking Space Detection in the City of Granada	Crespo-Orti Luis et.al.	2501.06651	link
2025-01-06	The 2nd Place Solution from the 3D Semantic Segmentation Track in the 2024 Waymo Open Dataset Challenge	Qing Wu et.al.	2501.05472	null
2025-01-09	Domain-Incremental Semantic Segmentation for Autonomous Driving under Adverse Driving Conditions	Shishir Muralidhara et.al.	2501.05246	null
2025-01-09	Advancing ALS Applications with Large-Scale Pre-training: Dataset Development and Downstream Assessment	Haoyi Xiu et.al.	2501.05095	link
2025-01-08	Test-Time Optimization for Domain Adaptive Open Vocabulary Segmentation	Ulindu De Silva et.al.	2501.04696	link
2025-01-07	Superpixel Boundary Correction for Weakly-Supervised Semantic Segmentation on Histopathology Images	Hongyi Wu et.al.	2501.03891	null
2025-01-07	Image Segmentation: Inducing graph-based learning	Aryan Singh et.al.	2501.03765	link
2025-01-06	4D-CS: Exploiting Cluster Prior for 4D Spatio-Temporal LiDAR Semantic Segmentation	Jiexi Zhong et.al.	2501.02937	null
2025-01-08	GLoG-CSUnet: Enhancing Vision Transformers with Adaptable Radiomic Features for Medical Image Segmentation	Niloufar Eghbali et.al.	2501.02788	link
2025-01-04	Unsupervised Class Generation to Expand Semantic Segmentation Datasets	Javier Montalvo et.al.	2501.02264	null
2025-01-03	Semantic Segmentation for Sequential Historical Maps by Learning from Only One Map	Yunshuang Yuan et.al.	2501.01845	null
2025-01-03	IAM: Enhancing RGB-D Instance Segmentation with New Benchmarks	Aecheon Jung et.al.	2501.01685	link
2025-01-03	Uncertainty and Energy based Loss Guided Semi-Supervised Semantic Segmentation	Rini Smita Thakur et.al.	2501.01640	null
2025-01-02	A Multi-task Supervised Compression Model for Split Computing	Yoshitomo Matsubara et.al.	2501.01420	link
2025-01-03	FGAseg: Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation	Bingyu Li et.al.	2501.00877	link
2024-12-31	H-Net: A Multitask Architecture for Simultaneous 3D Force Estimation and Stereo Semantic Segmentation in Intracardiac Catheters	Pedram Fekri et.al.	2501.00514	null
2024-12-31	PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM	Runnan Chen et.al.	2501.00352	null
2024-12-31	OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies	Runnan Chen et.al.	2501.00326	null
2024-12-30	HisynSeg: Weakly-Supervised Histopathological Image Segmentation via Image-Mixing Synthesis and Consistency Regularization	Zijie Fang et.al.	2412.20924	link
2024-12-30	LiDAR-Camera Fusion for Video Panoptic Segmentation without Video Training	Fardin Ayar et.al.	2412.20881	null
2024-12-29	Image Augmentation Agent for Weakly Supervised Semantic Segmentation	Wangyu Wu et.al.	2412.20439	null
2024-12-27	Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP	Zhongxing Xu et.al.	2412.19650	null
2024-12-27	An Actionable Hierarchical Scene Representation Enhancing Autonomous Inspection Missions in Unknown Environments	Vignesh Kottayam Viswanathan et.al.	2412.19582	null
2024-12-27	Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation	Chengyang Ye et.al.	2412.19492	link
2024-12-26	Impact of color and mixing proportion of synthetic point clouds on semantic segmentation	Shaojie Zhou et.al.	2412.19145	link
2024-12-24	AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction	Pufan Zou et.al.	2412.18255	null
2024-12-25	VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis	Shicheng Yin et.al.	2412.18178	link
2024-12-24	UniPLV: Towards Label-Efficient Open-World 3D Scene Understanding by Regional Visual Language Supervision	Yuru Wang et.al.	2412.18131	null
2024-12-24	LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding	Hao Li et.al.	2412.17635	null
2024-12-25	AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic Segmentation	Jiaqi Ma et.al.	2412.17601	link
2024-12-24	Uncertainty-Participation Context Consistency Learning for Semi-supervised Semantic Segmentation	Jianjian Yin et.al.	2412.17331	link
2024-12-22	Multi-Scale Foreground-Background Confidence for Out-of-Distribution Segmentation	Samuel Marschall et.al.	2412.16990	null
2024-12-22	Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection	Yuhang Gan et.al.	2412.16918	null
2024-12-22	MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection	Xu Zheng et.al.	2412.16876	null
2024-12-22	Adversarial Diffusion Model for Unsupervised Domain-Adaptive Semantic Segmentation	Jongmin Yu et.al.	2412.16859	null
2024-12-21	A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection	Shahid Ansari et.al.	2412.16755	null
2024-12-21	IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks	Yaming Zhang et.al.	2412.16654	link
2024-12-21	V"Mean"ba: Visual State Space Models only need 1 hidden dimension	Tien-Yu Chi et.al.	2412.16602	null
2024-12-21	Leveraging Contrastive Learning for Semantic Segmentation with Consistent Labels Across Varying Appearances	Javier Montalvo et.al.	2412.16592	null
2024-12-20	DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment	Cijo Jose et.al.	2412.16334	null
2024-12-20	SegCol Challenge: Semantic Segmentation for Tools and Fold Edges in Colonoscopy data	Xinwei Ju et.al.	2412.16078	link
2024-12-20	Enhancing Generalized Few-Shot Semantic Segmentation via Effective Knowledge Transfer	Xinyue Chen et.al.	2412.15835	link
2024-12-19	GIRAFE: Glottal Imaging Dataset for Advanced Segmentation, Analysis, and Facilitative Playbacks Evaluation	G. Andrade-Miranda et.al.	2412.15054	link
2024-12-19	PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic Segmentation	Shoumeng Qiu et.al.	2412.14821	link
2024-12-19	Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation	Zhenxin Lei et.al.	2412.14587	link
2024-12-18	Split Learning in Computer Vision for Semantic Segmentation Delay Minimization	Nikos G. Evgenidis et.al.	2412.14272	null
2024-12-18	Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation	Jianyu Zhang et.al.	2412.14145	null
2024-12-18	Prompt Categories Cluster for Weakly Supervised Semantic Segmentation	Wangyu Wu et.al.	2412.13823	null
2024-12-18	Federated Source-free Domain Adaptation for Classification: Weighted Cluster Aggregation for Unlabeled Data	Junki Mori et.al.	2412.13757	null
2024-12-18	Optical aberrations in autonomous driving: Physics-informed parameterized temperature scaling for neural network uncertainty calibration	Dominik Werner Wolf et.al.	2412.13695	null
2024-12-18	GAGS: Granularity-Aware Feature Distillation for Language Gaussian Splatting	Yuning Peng et.al.	2412.13654	null
2024-12-17	S2S2: Semantic Stacking for Robust Semantic Segmentation in Medical Imaging	Yimu Pan et.al.	2412.13156	link
2024-12-17	Efficient Event-based Semantic Segmentation with Spike-driven Lightweight Transformer-based Networks	Xiaxin Zhu et.al.	2412.12843	null
2024-12-17	Open-World Panoptic Segmentation	Matteo Sodano et.al.	2412.12740	null
2024-12-17	SemStereo: Semantic-Constrained Stereo Matching Network for Remote Sensing	Chen Chen et.al.	2412.12685	null
2024-12-17	Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation	Dongyue Wu et.al.	2412.12672	link
2024-12-17	Adaptive Prototype Replay for Class Incremental Semantic Segmentation	Guilin Zhu et.al.	2412.12669	link
2024-12-17	SEG-SAM: Semantic-Guided SAM for Unified Medical Image Segmentation	Shuangping Huang et.al.	2412.12660	null
2024-12-16	Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation	Hongwei Niu et.al.	2412.12050	link
2024-12-16	SAMIC: Segment Anything with In-Context Spatial Prompt Engineering	Savinay Nagendra et.al.	2412.11998	null
2024-12-16	SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation	Yunxiang Fu et.al.	2412.11890	link
2024-12-16	Towards Adversarial Robustness of Model-Level Mixture-of-Experts Architectures for Semantic Segmentation	Svetlana Pavlitska et.al.	2412.11608	link
2024-12-15	MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation	Zhiwei Yang et.al.	2412.11076	link
2024-12-14	RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone	Mustafa Munir et.al.	2412.10995	link
2024-12-14	DCSEG: Decoupled 3D Open-Set Segmentation using Gaussian Splatting	Luis Wiedmann et.al.	2412.10972	link
2024-12-14	SegACIL: Solving the Stability-Plasticity Dilemma in Class-Incremental Semantic Segmentation	Jiaxu Li et.al.	2412.10834	link
2024-12-14	Neural Network Meta Classifier: Improving the Reliability of Anomaly Segmentation	Jurica Runtas et.al.	2412.10765	link
2024-12-14	OmniHD-Scenes: A Next-Generation Multimodal Dataset for Autonomous Driving	Lianqing Zheng et.al.	2412.10734	null
2024-12-13	A Universal Degradation-based Bridging Technique for Domain Adaptive Semantic Segmentation	Wangkai Li et.al.	2412.10339	null
2024-12-13	SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians	Siyun Liang et.al.	2412.10231	null
2024-12-13	Object-Focused Data Selection for Dense Prediction Tasks	Niclas Popp et.al.	2412.10032	null
2024-12-12	Towards Open-Vocabulary Video Semantic Segmentation	Xinhao Li et.al.	2412.09329	link
2024-12-16	FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation	Yuntian Bo et.al.	2412.09319	link
2024-12-12	VLMs meet UDA: Boosting Transferability of Open Vocabulary Segmentation with Unsupervised Domain Adaptation	Roberto Alcover-Couso et.al.	2412.09240	null
2024-12-11	A Deep Semantic Segmentation Network with Semantic and Contextual Refinements	Zhiyan Wang et.al.	2412.08671	null
2024-12-11	A feature refinement module for light-weight semantic segmentation network	Zhiyan Wang et.al.	2412.08670	null
2024-12-11	SegFace: Face Segmentation of Long-Tail Classes	Kartik Narayan et.al.	2412.08647	link
2024-12-11	EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation	Hongwei Niu et.al.	2412.08628	link
2024-12-12	Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning	Fan Lu et.al.	2412.08614	link
2024-12-11	Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction	Bohan Li et.al.	2412.08243	null
2024-12-11	THUD++: Large-Scale Dynamic Indoor Scene Dataset and Benchmark for Mobile Robots	Zeshun Li et.al.	2412.08096	null
2024-12-11	Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation	Zhigang Cen et.al.	2412.08034	null
2024-12-09	SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception	Yaniv Benny et.al.	2412.06968	null
2024-12-10	ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet	Andrei-Robert Alexandrescu et.al.	2412.06742	null
2024-12-09	Active Learning with Context Sampling and One-vs-Rest Entropy for Semantic Segmentation	Fei Wu et.al.	2412.06470	null
2024-12-09	GCUNet: A GNN-Based Contextual Learning Network for Tertiary Lymphoid Structure Semantic Segmentation in Whole Slide Image	Lei Su et.al.	2412.06129	null
2024-12-12	Efficient Semantic Splatting for Remote Sensing Multi-view Segmentation	Zipeng Qi et.al.	2412.05969	null
2024-12-08	CSG: A Context-Semantic Guided Diffusion Approach in De Novo Musculoskeletal Ultrasound Image Generation	Elay Dahan et.al.	2412.05833	null
2024-12-10	RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts	Xu Liu et.al.	2412.05679	link
2024-12-06	FogROS2-FT: Fault Tolerant Cloud Robotics	Kaiyuan Chen et.al.	2412.05408	null
2024-12-06	Generative Model-Based Fusion for Improved Few-Shot Semantic Segmentation of Infrared Images	Junno Yun et.al.	2412.05341	null
2024-12-05	Assessing and Learning Alignment of Unimodal Vision and Language Models	Le Zhang et.al.	2412.04616	null
2024-12-05	A Hitchhiker’s Guide to Understanding Performances of Two-Class Classifiers	Anaïs Halin et.al.	2412.04377	null
2024-12-05	Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts	Chenyang Zhu et.al.	2412.04220	null
2024-12-05	Text Change Detection in Multilingual Documents Using Image Comparison	Doyoung Park et.al.	2412.04137	null
2024-12-05	SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning	Seokju Yun et.al.	2412.04077	link
2024-12-05	Quality Control in Open-Ended Crowdsourcing: A Survey	Lei Chai et.al.	2412.03991	null
2024-12-05	Exact: Exploring Space-Time Perceptive Clues for Weakly Supervised Satellite Image Time Series Semantic Segmentation	Hao Zhu et.al.	2412.03968	link
2024-12-05	LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model	Yuan Xue et.al.	2412.03841	null
2024-12-04	Designing DNNs for a trade-off between robustness and processing performance in embedded devices	Jon Gutiérrez-Zaballa et.al.	2412.03682	null
2024-12-04	Evaluating Single Event Upsets in Deep Neural Networks for Semantic Segmentation: an embedded system perspective	Jon Gutiérrez-Zaballa et.al.	2412.03630	link
2024-12-04	FLAIR: VLM with Fine-grained Language-informed Image Representations	Rui Xiao et.al.	2412.03561	link
2024-12-04	Benchmarking Pretrained Attention-based Models for Real-Time Recognition in Robot-Assisted Esophagectomy	Ronald L. P. D. de Jong et.al.	2412.03401	null
2024-12-04	Task-driven Image Fusion with Learnable Fusion Loss	Haowen Bai et.al.	2412.03240	null
2024-12-04	Biologically-inspired Semi-supervised Semantic Segmentation for Biomedical Imaging	Luca Ciampi et.al.	2412.03192	null
2024-12-04	Is Foreground Prototype Sufficient? Few-Shot Medical Image Segmentation with Background-Fused Prototype	Song Tang et.al.	2412.02983	null
2024-12-04	Progressive Vision-Language Prompt for Multi-Organ Multi-Class Cell Semantic Segmentation with Single Branch	Qing Zhang et.al.	2412.02978	null
2024-12-04	Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution	Jiahua Xiao et.al.	2412.02960	null
2024-12-03	SJTU:Spatial judgments in multimodal models towards unified segmentation through coordinate detection	Joongwon Chae et.al.	2412.02565	link
2024-12-03	Multi-scale and Multi-path Cascaded Convolutional Network for Semantic Segmentation of Colorectal Polyps	Malik Abdul Manan et.al.	2412.02443	null
2024-12-03	AH-OCDA: Amplitude-based Curriculum Learning and Hopfield Segmentation Model for Open Compound Domain Adaptation	Jaehyun Choi et.al.	2412.02280	null
2024-12-03	Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance	Jing Zeng et.al.	2412.02249	null
2024-12-02	INSIGHT: Explainable Weakly-Supervised Medical Image Analysis	Wenbo Zhang et.al.	2412.02012	null
2024-12-02	Global Average Feature Augmentation for Robust Semantic Segmentation with Transformers	Alberto Gonzalo Rodriguez Salgado et.al.	2412.01941	null
2024-12-02	COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training	Sanghwan Kim et.al.	2412.01814	link
2024-12-02	Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior	Yi Yu et.al.	2412.01646	null
2024-12-02	Epipolar Attention Field Transformers for Bird’s Eye View Semantic Segmentation	Christian Witte et.al.	2412.01595	null
2024-12-01	Token Cropr: Faster ViTs for Quite a Few Tasks	Benjamin Bergner et.al.	2412.00965	link
2024-12-03	DPE-Net: Dual-Parallel Encoder Based Network for Semantic Segmentation of Polyps	Malik Abdul Manan et.al.	2412.00888	null
2024-12-01	2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification	Jingwei Zhang et.al.	2412.00678	link
2024-11-30	Density-aware Global-Local Attention Network for Point Cloud Segmentation	Chade Li et.al.	2412.00489	null
2024-11-29	LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention	Zewen Du et.al.	2411.19585	link
2024-11-29	Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding	Wenbo Zhang et.al.	2411.19551	link
2024-11-29	Retrieval-guided Cross-view Image Synthesis	Hongji Yang et.al.	2411.19510	null
2024-11-28	GMS-VINS:Multi-category Dynamic Objects Semantic Segmentation for Enhanced Visual-Inertial Odometry Using a Promptable Foundation Model	Rui Zhou et.al.	2411.19289	null
2024-11-28	MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers	Jongseong Bae et.al.	2411.18995	null
2024-11-28	Textured As-Is BIM via GIS-informed Point Cloud Segmentation	Mohamed S. H. Alabassy et.al.	2411.18898	null
2024-11-27	The Last Mile to Supervised Performance: Semi-Supervised Domain Adaptation for Semantic Segmentation	Daniel Morales-Brotons et.al.	2411.18728	null
2024-11-27	HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior	Li-Yuan Tsao et.al.	2411.18662	link
2024-11-26	Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation	Sudarshan Rajagopalan et.al.	2411.17814	null
2024-12-02	Efficient Multi-modal Large Language Models via Visual Token Grouping	Minbin Huang et.al.	2411.17773	null
2024-11-26	Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-based Semantic Segmentation	Niharika Hegde et.al.	2411.17610	null
2024-11-26	Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving	Jon Gutiérrez-Zaballa et.al.	2411.17543	null
2024-11-26	Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning	Hoàng-Ân Lê et.al.	2411.17536	link
2024-11-26	TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba	Xiaowen Ma et.al.	2411.17473	link
2024-11-26	MRIFE: A Mask-Recovering and Interactive-Feature-Enhancing Semantic Segmentation Network For Relic Landslide Detection	Juefei He et.al.	2411.17167	null
2024-11-26	Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation	Chanyoung Kim et.al.	2411.17150	null
2024-11-26	ΩSFormer: Dual-Modal Ω-like Super-Resolution Transformer Network for Cross-scale and High-accuracy Terraced Field Vectorization Extraction	Chang Li et.al.	2411.17088	null
2024-11-26	SCASeg: Strip Cross-Attention for Efficient Semantic Segmentation	Guoan Xu et.al.	2411.17061	null
2024-11-25	SynDiff-AD: Improving Semantic Segmentation and End-to-End Autonomous Driving with Synthetic Data from Latent Diffusion Models	Harsh Goel et.al.	2411.16776	null
2024-11-25	Deformable Mamba for Wide Field of View Segmentation	Jie Hu et.al.	2411.16481	link
2024-11-25	A Study on Unsupervised Domain Adaptation for Semantic Segmentation in the Era of Vision-Language Models	Manuel Schwonberg et.al.	2411.16407	null
2024-11-27	An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models	Wentao Qu et.al.	2411.16308	link
2024-11-25	A Performance Increment Strategy for Semantic Segmentation of Low-Resolution Images from Damaged Roads	Rafael S. Toledo et.al.	2411.16295	link
2024-11-25	Learn from Foundation Model: Fruit Detection Model without Manual Annotation	Yanan Wang et.al.	2411.16196	link
2024-11-25	Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training	Man Yao et.al.	2411.16061	link
2024-11-24	Deep Learning for automated multi-scale functional field boundaries extraction using multi-date Sentinel-2 and PlanetScope imagery: Case Study of Netherlands and Pakistan	Saba Zahid et.al.	2411.15923	null
2024-11-24	Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation	Sule Bai et.al.	2411.15869	link
2024-11-24	ResCLIP: Residual Attention for Training-free Dense Vision-language Inference	Yuhang Yang et.al.	2411.15851	link
2024-11-24	Integrating Deep Metric Learning with Coreset for Active Learning in 3D Segmentation	Arvind Murari Vepa et.al.	2411.15763	link
2024-11-22	Effective SAM Combination for Open-Vocabulary Semantic Segmentation	Minhyeok Lee et.al.	2411.14723	null
2024-11-21	Revisiting the Integration of Convolution and Attention for Vision Backbone	Lei Zhu et.al.	2411.14429	link
2024-11-21	CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation	Lin Sun et.al.	2411.13836	link
2024-11-21	Segment Any Class (SAC): Multi-Class Few-Shot Semantic Segmentation via Class Region Proposals	Hussni Mohd Zakir et.al.	2411.13774	null
2024-11-20	FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting	Ola Shorinwa et.al.	2411.13753	null
2024-11-20	BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation	Umamaheswaran Raman Kumar et.al.	2411.13251	null
2024-11-20	XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation	Ziyi Wang et.al.	2411.13243	link
2024-11-20	Automating Sonologists USG Commands with AI and Voice Interface	Emad Mohamed et.al.	2411.13006	null
2024-11-19	A Multimodal Approach Combining Structural and Cross-domain Textual Guidance for Weakly Supervised OCT Segmentation	Jiaqi Yang et.al.	2411.12615	link
2024-11-19	SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation	Ron Keuth et.al.	2411.12602	link
2024-11-15	ULTra: Unveiling Latent Token Interpretability in Transformer Based Understanding	Hesam Hosseini et.al.	2411.12589	null
2024-11-19	ADV2E: Bridging the Gap Between Analogue Circuit and Discrete Frames in the Video-to-Events Simulator	Xiao Jiang et.al.	2411.12250	null
2024-11-18	ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements	M. Arda Aydın et.al.	2411.12044	link
2024-11-18	Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation	Hanieh Shojaei Miandashti et.al.	2411.11935	null
2024-11-18	MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models	Harshita Sharma et.al.	2411.11362	null
2024-11-18	Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications	Scarlett Raine et.al.	2411.11287	null
2024-11-16	Attention-based U-Net Method for Autonomous Lane Detection	Mohammadhamed Tangestanizadeh et.al.	2411.10902	null
2024-11-16	Automatic Discovery and Assessment of Interpretable Systematic Errors in Semantic Segmentation	Jaisidh Singh et.al.	2411.10845	null
2024-11-19	Diffusion-Based Semantic Segmentation of Lumbar Spine MRI Scans of Lower Back Pain Patients	Maria Monzon et.al.	2411.10755	link
2024-11-15	Y-MAP-Net: Real-time depth, normals, segmentation, multi-label captioning and 2D human pose in RGB images	Ammar Qammaz et.al.	2411.10334	null
2024-11-15	CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation	Dengke Zhang et.al.	2411.10086	link
2024-11-14	OneNet: A Channel-Wise 1D Convolutional U-Net	Sanghyun Byun et.al.	2411.09838	link
2024-11-14	Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks	Zengyi Yang et.al.	2411.09387	null
2024-11-14	Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation	Yuheng Shi et.al.	2411.09219	link
2024-11-14	Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery	Ashim Dahal et.al.	2411.09101	link
2024-11-13	CoMiX: Cross-Modal Fusion with Deformable Convolutions for HSI-X Semantic Segmentation	Xuming Zhang et.al.	2411.09023	null
2024-11-14	Masked Image Modeling Boosting Semi-Supervised Semantic Segmentation	Yangyang Li et.al.	2411.08756	null
2024-11-13	Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model	Jun Xie et.al.	2411.08592	null
2024-11-12	Isometric Transformations for Image Augmentation in Mueller Matrix Polarimetry	Christopher Hahne et.al.	2411.07918	link
2024-11-12	Semantic segmentation on multi-resolution optical and microwave data using deep learning	Jai G Singla et.al.	2411.07581	null
2024-11-11	SIESEF-FusionNet: Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network for LiDAR Point Cloud Semantic Segmentation	Jiale Chen et.al.	2411.06991	null
2024-11-14	Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision	Yueyang Cang et.al.	2411.06727	null
2024-11-10	Few-shot Semantic Learning for Robust Multi-Biome 3D Semantic Mapping in Off-Road Environments	Deegan Atha et.al.	2411.06632	null
2024-11-09	Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing	Kaixuan Lu et.al.	2411.06091	null
2024-11-08	Joint-Optimized Unsupervised Adversarial Domain Adaptation in Remote Sensing Segmentation with Prompted Foundation Model	Shuchang Lyu et.al.	2411.05878	link
2024-11-08	Revisiting Network Perturbation for Semi-Supervised Semantic Segmentation	Sien Li et.al.	2411.05307	link
2024-11-07	In the Era of Prompt Learning with Vision-Language Models	Ankit Jha et.al.	2411.04892	null
2024-11-11	ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset	Olaf Wysocki et.al.	2411.04865	link
2024-11-06	Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts	Zhitong Gao et.al.	2411.03829	link
2024-11-06	Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model	Yansong Qu et.al.	2411.03672	null
2024-11-05	Enhancing Weakly Supervised Semantic Segmentation for Fibrosis via Controllable Image Generation	Zhiling Yue et.al.	2411.03551	null
2024-11-05	SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture	Andrew Heschl et.al.	2411.03505	link
2024-11-05	Rethinking Decoders for Transformer-based Semantic Segmentation: Compression is All You Need	Qishuai Wen et.al.	2411.03033	link
2024-11-05	Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation	Xavier Timoneda et.al.	2411.02969	null
2024-11-05	Mapping Africa Settlements: High Resolution Urban and Rural Map by Deep Learning and Satellite Imagery	Mohammad Kakooei et.al.	2411.02935	link
2024-11-05	CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation	Jinchao Ge et.al.	2411.02715	link
2024-11-04	Deep Learning on 3D Semantic Segmentation: A Detailed Review	Thodoris Betsas et.al.	2411.02104	null
2024-11-04	Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models	Sharat Agarwal et.al.	2411.01925	null
2024-11-04	DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability	Bo Gao et.al.	2411.01819	null
2024-11-04	Toward Integrating Semantic-aware Path Planning and Reliable Localization for UAV Operations	Thanh Nguyen Canh et.al.	2411.01816	null
2024-11-03	PreCM: The Padding-based Rotation Equivariant Convolution Mode for Semantic Segmentation	Xinyu Xu et.al.	2411.01624	null
2024-11-01	Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions	Lixiao Yang et.al.	2411.01039	null
2024-11-01	Event-guided Low-light Video Semantic Segmentation	Zhen Yao et.al.	2411.00639	null
2024-11-01	Cross-modal semantic segmentation for indoor environmental perception using single-chip millimeter-wave radar raw data	Hairuo Hu et.al.	2411.00499	null
2024-11-01	Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with Realistic Scene Modifications via Diffusion-Based Image Editing	Naufal Suryanto et.al.	2411.00425	link
2024-10-31	A Recipe for Geometry-Aware 3D Mesh Transformers	Mohammad Farazi et.al.	2411.00164	null
2024-10-31	Federated Black-Box Adaptation for Semantic Segmentation	Jay N. Paranjape et.al.	2410.24181	link
2024-10-31	COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes	Muhammad Ali et.al.	2410.24139	link
2024-10-31	Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model	Hao Zhang et.al.	2410.23905	link
2024-11-04	S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving	Maciej K. Wozniak et.al.	2410.23085	null
2024-10-31	CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation	Ziyang Gong et.al.	2410.22629	link
2024-11-03	Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation	Zhaochong An et.al.	2410.22489	link
2024-10-29	Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation	Jintao Tong et.al.	2410.22135	link
2024-10-29	Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models	Imad Ali Shah et.al.	2410.22101	link
2024-10-29	Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation	Ruihao Xia et.al.	2410.21708	link
2024-10-28	Domain Adaptation with a Single Vision-Language Embedding	Mohammad Fahes et.al.	2410.21361	null
2024-10-28	IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks	Manjunath D et.al.	2410.20953	link
2024-10-27	A Framework for Real-Time Volcano-Seismic Event Recognition Based on Multi-Station Seismograms and Semantic Segmentation Models	Camilo Espinosa-Curilem et.al.	2410.20595	link
2024-10-27	Unlocking Comics: The AI4VA Dataset for Visual Understanding	Peter Grönquist et.al.	2410.20459	link
2024-10-27	Historical Test-time Prompt Tuning for Vision Foundation Models	Jingyi Zhang et.al.	2410.20346	null
2024-10-25	OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery	Philipe Dias et.al.	2410.19965	null
2024-10-25	IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation	Kaixian Qu et.al.	2410.19697	null
2024-10-25	Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation	Yao Wu et.al.	2410.19446	link
2024-10-25	Context-Based Visual-Language Place Recognition	Soojin Woo et.al.	2410.19341	link
2024-10-24	Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks	Alexander Jaus et.al.	2410.18684	null
2024-10-24	Unsupervised semantic segmentation of urban high-density multispectral point clouds	Oona Oinonen et.al.	2410.18520	null
2024-10-26	CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator	Stefanos Pasios et.al.	2410.18238	link
2024-10-23	Towards Safer Planetary Exploration: A Hybrid Architecture for Terrain Traversability Analysis in Mars Rovers	Achille Chiuchiarelli et.al.	2410.17738	null
2024-10-22	EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding	Zhiyi Pan et.al.	2410.17207	null
2024-10-22	SERN: Simulation-Enhanced Realistic Navigation for Multi-Agent Robotic Systems in Contested Environments	Jumman Hossain et.al.	2410.16686	null
2024-10-21	TIPS: Text-Image Pretraining with Spatial Awareness	Kevis-Kokitsi Maninis et.al.	2410.16512	null
2024-10-21	GenGMM: Generalized Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation	Nazanin Moradinasab et.al.	2410.16485	null
2024-10-21	LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training	Thomas Kreutz et.al.	2410.15833	link
2024-10-21	TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of Sight	Hyun-Kurl Jang et.al.	2410.15674	link
2024-10-21	Deep Learning and Machine Learning – Object Detection and Semantic Segmentation: From Theory to Applications	Jintao Ren et.al.	2410.15584	null
2024-10-22	Multi-Layer Feature Fusion with Cross-Channel Attention-Based U-Net for Kidney Tumor Segmentation	Fnu Neha et.al.	2410.15472	null
2024-10-18	On the Influence of Shape, Texture and Color for Learning Semantic Segmentation	Annika Mütze et.al.	2410.14878	null
2024-10-18	Automated Road Extraction from Satellite Imagery Integrating Dense Depthwise Dilated Separable Spatial Pyramid Pooling with DeepLabV3+	Arpan Mahara et.al.	2410.14836	null
2024-10-17	ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding	Guangda Ji et.al.	2410.13924	link
2024-10-17	Multi-style conversion for semantic segmentation of lesions in fundus images by adversarial attacks	Clément Playout et.al.	2410.13822	link
2024-10-22	EP-SAM: Weakly Supervised Histopathology Segmentation via Enhanced Prompt with Segment Anything	Joonhyeon Song et.al.	2410.13621	link
2024-10-17	Day-Night Adaptation: An Innovative Source-free Adaptation Framework for Medical Image Segmentation	Ziyang Chen et.al.	2410.13472	null
2024-10-17	SiamSeg: Self-Training with Contrastive Learning for Unsupervised Domain Adaptation in Remote Sensing	Bin Wang et.al.	2410.13471	link
2024-10-17	Railway LiDAR semantic segmentation based on intelligent semi-automated data annotation	Florian Wulff et.al.	2410.13383	null
2024-10-17	Adversarial Neural Networks in Medical Imaging Advancements and Challenges in Semantic Segmentation	Houze Liu et.al.	2410.13099	null
2024-10-16	Task Consistent Prototype Learning for Incremental Few-shot Semantic Segmentation	Wenbo Xu et.al.	2410.13094	null
2024-10-16	Risk Assessment for Autonomous Landing in Urban Environments using Semantic Segmentation	Jesús Alejandro Loera-Ponce et.al.	2410.12988	null
2024-10-16	VividMed: Vision Language Model with Versatile Visual Grounding for Medicine	Lingxiao Luo et.al.	2410.12694	link
2024-10-16	Cascade learning in multi-task encoder-decoder networks for concurrent bone segmentation and glenohumeral joint assessment in shoulder CT scans	Luca Marsilio et.al.	2410.12641	null
2024-10-17	SAM-Guided Masked Token Prediction for 3D Scene Understanding	Zhimin Chen et.al.	2410.12158	null
2024-10-15	WeatherDG: LLM-assisted Procedural Weather Generation for Domain-Generalized Semantic Segmentation	Chenghao Qian et.al.	2410.12075	link
2024-10-15	Development and Testing of a Wood Panels Bark Removal Equipment Based on Deep Learning	Rijun Wang et.al.	2410.11913	null
2024-10-15	RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation	Anton Antonov et.al.	2410.11722	link
2024-10-15	InvSeg: Test-Time Prompt Inversion for Semantic Segmentation	Jiayi Lin et.al.	2410.11473	null
2024-10-15	MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation	Xianping Ma et.al.	2410.11160	link
2024-10-14	Locality Alignment Improves Vision-Language Models	Ian Covert et.al.	2410.11087	null
2024-10-14	Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes	Tim Broedermann et.al.	2410.10791	link
2024-10-14	UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation	Lihe Yang et.al.	2410.10777	link
2024-10-14	Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation	Daniel Fusaro et.al.	2410.10510	link
2024-10-14	LKASeg: Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections	Xuezhi Xiang et.al.	2410.10433	null
2024-10-14	V2M: Visual 2-Dimensional Mamba for Image Representation Learning	Chengkun Wang et.al.	2410.10382	link
2024-10-14	GlobalMamba: Global Image Serialization for Vision Mamba	Chengkun Wang et.al.	2410.10316	link
2024-10-13	AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model	Yuchen Li et.al.	2410.09714	null
2024-10-12	An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation	Wei Liang et.al.	2410.09443	null
2024-10-11	Parallel Watershed Partitioning: GPU-Based Hierarchical Image Segmentation	Varduhi Yeghiazaryan et.al.	2410.08946	null
2024-10-11	Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation	Hanieh Shojaei et.al.	2410.08687	null
2024-10-11	DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention	Nguyen Huu Bao Long et.al.	2410.08582	link
2024-10-10	Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving?	Samir Abou Haidar et.al.	2410.08365	null
2024-10-10	Distribution Guidance Network for Weakly Supervised Point Cloud Semantic Segmentation	Zhiyi Pan et.al.	2410.08091	null
2024-10-10	Shift and matching queries for video semantic segmentation	Tsubasa Mizuno et.al.	2410.07635	null
2024-10-10	3D Vision-Language Gaussian Splatting	Qucheng Peng et.al.	2410.07577	null
2024-10-11	Bridge the Points: Graph-based Few-shot Segment Anything Semantically	Anqi Zhang et.al.	2410.06964	link
2024-10-09	Learning from Spatio-temporal Correlation for Semi-Supervised LiDAR Semantic Segmentation	Seungho Lee et.al.	2410.06893	link
2024-10-09	Rethinking the Evaluation of Visible and Infrared Image Fusion	Dayan Guan et.al.	2410.06811	link
2024-10-10	QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model	Fei Xie et.al.	2410.06806	link
2024-10-09	Transesophageal Echocardiography Generation using Anatomical Models	Emmanuel Oladokun et.al.	2410.06781	null
2024-10-09	Evaluating the Impact of Point Cloud Colorization on Semantic Segmentation Accuracy	Qinfeng Zhu et.al.	2410.06725	null
2024-10-09	Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments	Meng Yu et.al.	2410.06626	null
2024-10-09	Towards Natural Image Matting in the Wild via Real-Scenario Prior	Ruihao Xia et.al.	2410.06593	link
2024-10-08	Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions	Mateus Karvat et.al.	2410.06380	null
2024-10-08	Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading	Fang Gao et.al.	2410.05762	null
2024-10-08	Advancements in Road Lane Mapping: Comparative Fine-Tuning Analysis of Deep Learning-based Semantic Segmentation Methods Using Aerial Imagery	Xuanchen et.al.	2410.05717	null
2024-10-08	Remote Sensing Image Segmentation Using Vision Mamba and Multi-Scale Multi-Frequency Feature Fusion	Yice Cao et.al.	2410.05624	null
2024-10-07	Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment Whole-Body Organs in CT with Light-Weighted Adaptation	Vince Zhu et.al.	2410.04689	null
2024-10-04	SpecSAR-Former: A Lightweight Transformer-based Network for Global LULC Mapping Using Integrated Sentinel-1 and Sentinel-2	Hao Yu et.al.	2410.03962	null
2024-10-10	Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features	Benyuan Meng et.al.	2410.03558	link
2024-10-04	Semantic Segmentation Based Quality Control of Histopathology Whole Slide Images	Abhijeet Patil et.al.	2410.03289	link
2024-10-04	HRVMamba: High-Resolution Visual State Space Model for Dense Prediction	Hao Zhang et.al.	2410.03174	null
2024-10-10	HiFiSeg: High-Frequency Information Enhanced Polyp Segmentation with Global-Local Vision Transformer	Jingjing Ren et.al.	2410.02528	null
2024-10-04	Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation	Muzhi Zhu et.al.	2410.02369	link
2024-10-03	RESSCAL3D++: Joint Acquisition and Semantic Segmentation of 3D Point Clouds	Remco Royen et.al.	2410.02323	link
2024-10-03	Efficient Semantic Segmentation via Lightweight Multiple-Information Interaction Network	Yangyang Qiu et.al.	2410.02224	null
2024-10-03	Adapting Segment Anything Model to Melanoma Segmentation in Microscopy Slide Images	Qingyuan Liu et.al.	2410.02207	null
2024-10-02	SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images	Kaiyu Li et.al.	2410.01768	link
2024-10-02	One-Shot Robust Imitation Learning for Long-Horizon Visuomotor Tasks from Unsegmented Demonstrations	Shaokang Wu et.al.	2410.01630	null
2024-10-02	Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation	Zhaofeng Shi et.al.	2410.01341	link
2024-10-02	VectorGraphNET: Graph Attention Networks for Accurate Segmentation of Complex Technical Drawings	Andrea Carrara et.al.	2410.01336	null
2024-10-01	RobustEMD: Domain Robust Matching for Cross-domain Few-shot Medical Image Segmentation	Yazhou Zhu et.al.	2410.01110	link
2024-10-01	Semantic Segmentation of Unmanned Aerial Vehicle Remote Sensing Images using SegFormer	Vlatko Spasev et.al.	2410.01092	null
2024-10-01	Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time	Chiao-An Yang et.al.	2410.01083	link
2024-10-01	DeepAerialMapper: Deep Learning-based Semi-automatic HD Map Creation for Highly Automated Vehicles	Robert Krajewski et.al.	2410.00769	link
2024-10-01	Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection	Pengxi Zeng et.al.	2410.00582	null
2024-10-01	Precise Workcell Sketching from Point Clouds Using an AR Toolbox	Krzysztof Zieliński et.al.	2410.00479	null
2024-10-01	Deep Multimodal Fusion for Semantic Segmentation of Remote Sensing Earth Observation Data	Ivica Dimitrovski et.al.	2410.00469	null
2024-10-01	AARK: An Open Toolkit for Autonomous Racing Research	James Bockman et.al.	2410.00358	null
2024-09-30	Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation	Aleyna Kütük et.al.	2410.00266	null
2024-09-30	AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation	Boyu Han et.al.	2409.20398	link
2024-09-30	Leveraging CAM Algorithms for Explaining Medical Semantic Segmentation	Tillmann Rheude et.al.	2409.20287	link
2024-09-30	Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model	Fulong Ma et.al.	2409.20164	null
2024-09-30	Segmenting Wood Rot using Computer Vision Models	Roland Kammerbauer et.al.	2409.20137	null
2024-09-30	Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels	Heeseong Shin et.al.	2409.19846	null
2024-09-27	Excavating in the Wild: The GOOSE-Ex Dataset for Semantic Segmentation	Raphael Hagmanns et.al.	2409.18788	null
2024-09-27	Learning from Pattern Completion: Self-supervised Controllable Generation	Zhiqiang Chen et.al.	2409.18694	link
2024-09-27	Reducing Semantic Ambiguity In Domain Adaptive Semantic Segmentation Via Probabilistic Prototypical Pixel Contrast	Xiaoke Hao et.al.	2409.18543	link
2024-10-01	Get It For Free: Radar Segmentation without Expert Labels and Its Application in Odometry and Localization	Siru Li et.al.	2409.18434	null
2024-09-26	Hierarchical End-to-End Autonomous Driving: Integrating BEV Perception with Deep Reinforcement Learning	Siyi Lu et.al.	2409.17659	null
2024-09-26	Global-Local Medical SAM Adaptor Based on Full Adaption	Meng Wang et.al.	2409.17486	null
2024-09-25	VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection	Liangyu Zhong et.al.	2409.17330	null
2024-09-25	2024 BRAVO Challenge Track 1 1st Place Report: Evaluating Robustness of Vision Foundation Models for Semantic Segmentation	Tommie Kerssies et.al.	2409.17208	link
2024-09-25	WasteGAN: Data Augmentation for Robotic Waste Sorting through Generative Adversarial Networks	Alberto Bacchin et.al.	2409.16999	link
2024-09-25	Going Beyond U-Net: Assessing Vision Transformers for Semantic Segmentation in Microscopy Image Analysis	Illia Tsiporenko et.al.	2409.16940	null
2024-09-24	A novel open-source ultrasound dataset with deep learning benchmarks for spinal cord injury localization and anatomical segmentation	Avisha Kumar et.al.	2409.16441	link
2024-09-24	Instance Segmentation of Reinforced Concrete Bridges with Synthetic Point Clouds	Asad Ur Rahman et.al.	2409.16381	null
2024-09-24	Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation	Hannah Kerner et.al.	2409.16252	link
2024-09-24	Deep Learning for Precision Agriculture: Post-Spraying Evaluation and Deposition Estimation	Harry Rogers et.al.	2409.16213	link
2024-09-24	Potential Field as Scene Affordance for Behavior Change-Based Visual Risk Object Identification	Pang-Yuan Pao et.al.	2409.15846	null
2024-09-24	DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation	Soojin Jang et.al.	2409.15801	null
2024-09-24	Autonomous Hiking Trail Navigation via Semantic Segmentation and Geometric Analysis	Camndon Reed et.al.	2409.15671	null
2024-09-23	ZeroSCD: Zero-Shot Street Scene Change Detection	Shyam Sundar Kannan et.al.	2409.15255	null
2024-09-27	Diffusion-based RGB-D Semantic Segmentation with Deformable Attention Transformer	Minh Bui et.al.	2409.15117	null
2024-09-23	The BRAVO Semantic Segmentation Challenge Results in UNCV2024	Tuan-Hung Vu et.al.	2409.15107	link
2024-09-21	MOSE: Monocular Semantic Reconstruction Using NeRF-Lifted Noisy Priors	Zhenhua Du et.al.	2409.14019	null
2024-09-21	Enhanced Semantic Segmentation for Large-Scale and Imbalanced Point Clouds	Haoran Gong et.al.	2409.13983	null
2024-09-21	CUS3D: CLIP-based Unsupervised 3D Segmentation via Object-level Denoise	Fuyang Yu et.al.	2409.13982	null
2024-09-20	Efficient Domain Augmentation for Autonomous Driving Testing Using Diffusion Models	Luciano Baresi et.al.	2409.13661	null
2024-09-20	Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-Tuning	Daniele Rege Cambrin et.al.	2409.13641	link
2024-09-20	Towards Semi-supervised Dual-modal Semantic Segmentation	Qiulei Dong et.al.	2409.13325	null
2024-09-19	AutoPET III Challenge: PET/CT Semantic Segmentation	Reza Safdari et.al.	2409.13006	null
2024-09-19	Automated Linear Disturbance Mapping via Semantic Segmentation of Sentinel-2 Imagery	Andrew M. Nagel et.al.	2409.12817	null
2024-09-17	Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks	Edgar Heinert et.al.	2409.11373	link
2024-09-17	MSDNet: Multi-Scale Decoder for Few-Shot Semantic Segmentation via Transformer-Guided Prototyping	Amirreza Fateh et.al.	2409.11316	link
2024-09-17	Generalized Few-Shot Semantic Segmentation in Remote Sensing: Challenge and Benchmark	Clifford Broni-Bediako et.al.	2409.11227	link
2024-09-17	HS3-Bench: A Benchmark and Strong Baseline for Hyperspectral Semantic Segmentation in Driving Scenarios	Nick Theisen et.al.	2409.11205	link
2024-09-16	Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning	Amin Karimi Monsefi et.al.	2409.10362	link
2024-09-16	BAFNet: Bilateral Attention Fusion Network for Lightweight Semantic Segmentation of Urban Remote Sensing Images	Wentao Wang et.al.	2409.10269	null
2024-09-15	Semantic2D: A Semantic Dataset for 2D Lidar Semantic Segmentation	Zhanteng Xie et.al.	2409.09899	null
2024-09-15	Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation	Qilong Zhangli et.al.	2409.09893	null
2024-09-15	High Definition Map Mapping and Update: A General Overview and Future Directions	Benny Wijaya et.al.	2409.09726	null
2024-09-14	Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation	Hugo Porta et.al.	2409.09497	link
2024-09-13	AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation	Zechao Sun et.al.	2409.08516	null
2024-09-13	VistaFormer: Scalable Vision Transformers for Satellite Image Time Series Segmentation	Ezra MacDonald et.al.	2409.08461	link
2024-09-12	Bayesian Self-Training for Semi-Supervised 3D Segmentation	Ozan Unal et.al.	2409.08102	null
2024-09-12	Depth Matters: Exploring Deep Interactions of RGB-D for Semantic Segmentation in Traffic Scenes	Siyu Chen et.al.	2409.07995	null
2024-09-12	SURGIVID: Annotation-Efficient Surgical Video Object Discovery	Çağhan Köksal et.al.	2409.07801	null
2024-09-12	Lagrange Duality and Compound Multi-Attention Transformer for Semi-Supervised Medical Image Segmentation	Fuchen Zheng et.al.	2409.07793	link
2024-09-12	ASSNet: Adaptive Semantic Segmentation Network for Microtumors and Multi-Organ Segmentation	Fuchen Zheng et.al.	2409.07779	link
2024-09-12	Open-Vocabulary Remote Sensing Image Semantic Segmentation	Qinglong Cao et.al.	2409.07683	link
2024-09-11	Token Turing Machines are Efficient Vision Models	Purvish Jajal et.al.	2409.07613	link
2024-09-11	AC-IND: Sparse CT reconstruction based on attenuation coefficient estimation and implicit neural distribution	Wangduo Xie et.al.	2409.07171	null
2024-09-11	Brain-Inspired Stepwise Patch Merging for Vision Transformers	Yonghao Yu et.al.	2409.06963	null
2024-09-10	Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds	Mu Cai et.al.	2409.06827	link
2024-09-10	A Semantic Segmentation Approach on Sweet Orange Leaf Diseases Detection Utilizing YOLO	Sabit Ahamed Preanto et.al.	2409.06671	null
2024-09-10	PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation	Yin Hu et.al.	2409.06309	null
2024-09-10	EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation	Nischal Khanal et.al.	2409.06183	link
2024-09-09	SVS-GAN: Leveraging GANs for Semantic Video Synthesis	Khaled M. Seyam et.al.	2409.06074	null
2024-09-12	Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance	Quang-Huy Che et.al.	2409.06002	null
2024-09-09	Segmentation by Factorization: Unsupervised Semantic Segmentation for Pathology by Factorizing Foundation Model Features	Jacob Gildenblat et.al.	2409.05697	null
2024-09-09	ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions	Furqan Ahmed Shaik et.al.	2409.05327	null
2024-09-08	RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network	Zhiwei Lin et.al.	2409.04979	null
2024-09-06	Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation	Björn Michele et.al.	2409.04409	link
2024-09-05	Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution	Marga Don et.al.	2409.03754	link
2024-09-05	LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones	Moritz Nottebaum et.al.	2409.03460	link
2024-09-05	Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications	Tong Bu et.al.	2409.03368	link
2024-09-05	UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking	Md. Mahfuzur Rahman et.al.	2409.03245	null
2024-09-05	Labeled-to-Unlabeled Distribution Alignment for Partially-Supervised Multi-Organ Medical Image Segmentation	Xixi Jiang et.al.	2409.03228	link
2024-09-06	iSeg: An Iterative Refinement-based Framework for Training-free Segmentation	Lin Sun et.al.	2409.03209	link
2024-09-04	iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation	Hayeon Jo et.al.	2409.02838	null
2024-09-04	CLDA: Collaborative Learning for Enhanced Unsupervised Domain Adaptation	Minhee Cho et.al.	2409.02699	null
2024-09-04	SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction	Sumin Son et.al.	2409.02513	null
2024-09-03	K-Origins: Better Colour Quantification for Neural Networks	Lewis Mason et.al.	2409.02281	link
2024-09-03	AllWeatherNet: Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions	Chenghao Qian et.al.	2409.02045	link
2024-09-03	Segmenting Object Affordances: Reproducibility and Sensitivity to Scale	Tommaso Apicella et.al.	2409.01814	link
2024-09-03	Efficiently Expanding Receptive Fields: Local Split Attention and Parallel Aggregation for Enhanced Large-scale Point Cloud Semantic Segmentation	Haodong Wang et.al.	2409.01662	null
2024-09-02	Semantic Segmentation from Image Labels by Reconstruction from Structured Decomposition	Xuanrui Zeng et.al.	2409.01472	link
2024-09-02	SOOD-ImageNet: a Large-Scale Dataset for Semantic Out-Of-Distribution Image Classification and Semantic Segmentation	Alberto Bacchin et.al.	2409.01109	link
2024-09-02	Towards Robust Online Domain Adaptive Semantic Segmentation under Adverse Weather Conditions	Taorong Liu et.al.	2409.01072	null
2024-09-02	From Bird’s-Eye to Street View: Crafting Diverse and Condition-Aligned Images with Latent Diffusion Model	Xiaojie Xu et.al.	2409.01014	null
2024-09-02	SeCo-INR: Semantically Conditioned Implicit Neural Representations for Improved Medical Image Super-Resolution	Mevan Ekanayake et.al.	2409.01013	null
2024-09-02	IVGF: The Fusion-Guided Infrared and Visible General Framework	Fangcen Liu et.al.	2409.00973	null
2024-09-01	Image-to-Lidar Relational Distillation for Autonomous Driving Data	Anas Mahmoud et.al.	2409.00845	null
2024-09-01	Change-Aware Siamese Network for Surface Defects Segmentation under Complex Background	Biyuan Liu et.al.	2409.00589	link
2024-08-31	Plant detection from ultra high resolution remote sensing images: A Semantic Segmentation approach based on fuzzy loss	Shivam Pande et.al.	2409.00513	null
2024-08-30	Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes	Li Zhang et.al.	2408.17421	link
2024-08-30	Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations	Ahmed Hammam et.al.	2408.17311	null
2024-08-30	Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training	Zizheng Huang et.al.	2408.17081	link
2024-08-30	Transient Fault Tolerant Semantic Segmentation for Autonomous Driving	Leonardo Iurada et.al.	2408.16952	link
2024-08-29	SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection	Rohit Venkata Sai Dulam et.al.	2408.16645	link
2024-08-29	MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation	Linyan Yang et.al.	2408.16478	null
2024-08-29	Multi-source Domain Adaptation for Panoramic Semantic Segmentation	Jing Jiang et.al.	2408.16469	link
2024-08-29	EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More	Kanghao Chen et.al.	2408.16254	null
2024-08-28	SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors	Zhiqing Zhang et.al.	2408.15887	null
2024-08-28	DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries	Yu Yang et.al.	2408.15813	null
2024-08-28	TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation	Junbao Zhou et.al.	2408.15657	link
2024-08-27	Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images	Silvia Seidlitz et.al.	2408.15373	link
2024-08-27	An Investigation on The Position Encoding in Vision-Based Dynamics Prediction	Jiageng Zhu et.al.	2408.15201	null
2024-08-27	Applying ViT in Generalized Few-shot Semantic Segmentation	Liyuan Geng et.al.	2408.14957	link
2024-08-27	Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack	Naufal Suryanto et.al.	2408.14879	link
2024-08-27	MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Semantic Segmentation	Yuanbing Zhu et.al.	2408.14776	null
2024-08-26	Physically Feasible Semantic Segmentation	Shamik Basu et.al.	2408.14672	link
2024-08-25	OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation	Muhammad Rameez ur Rahman et.al.	2408.13936	link
2024-08-25	Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation	Yuwen Pan et.al.	2408.13838	null
2024-08-25	TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather	Xiongwei Zhao et.al.	2408.13802	link
2024-08-25	ICFRNet: Image Complexity Prior Guided Feature Refinement for Real-time Semantic Segmentation	Xin Zhang et.al.	2408.13771	null
2024-08-25	Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation	Zhaoyang Li et.al.	2408.13752	null
2024-08-24	ESA: Annotation-Efficient Active Learning for Semantic Segmentation	Jinchao Ge et.al.	2408.13491	link
2024-08-23	Accuracy Improvement of Cell Image Segmentation Using Feedback Former	Hinako Mitsuoka et.al.	2408.12974	null
2024-08-23	Image Segmentation in Foundation Model Era: A Survey	Tianfei Zhou et.al.	2408.12957	link
2024-08-23	Symmetric masking strategy enhances the performance of Masked Image Modeling	Khanh-Binh Nguyen et.al.	2408.12772	null
2024-08-22	Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets	Wolfgang Boettcher et.al.	2408.12489	link
2024-08-22	The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation	Tuyen Tran et.al.	2408.12447	null
2024-08-26	UNetMamba: An Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images	Enze Zhu et.al.	2408.11545	link
2024-08-21	Exploring Scene Coherence for Semi-Supervised 3D Semantic Segmentation	Chuandong Liu et.al.	2408.11280	link
2024-08-20	NeCo: Improving DINOv2’s spatial representations in 19 GPU hours with Patch Neighbor Consistency	Valentinos Pariza et.al.	2408.11054	null
2024-08-20	CO2Wounds-V2: Extended Chronic Wounds Dataset From Leprosy Patients	Karen Sanchez et.al.	2408.10827	link
2024-08-20	Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended?	Chen Liang et.al.	2408.10627	null
2024-08-20	Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation	Jiawei Han et.al.	2408.10537	link
2024-08-19	Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network	Rasha Alshawi et.al.	2408.10181	null
2024-08-19	Dynamic Label Injection for Imbalanced Industrial Defect Segmentation	Emanuele Caruso et.al.	2408.10031	link
2024-08-19	Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis	Kira Maag et.al.	2408.10021	null
2024-08-19	Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving	Jun Yan et.al.	2408.09839	link
2024-08-18	OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras	Muhammad Rameez Ur Rahman et.al.	2408.09424	link
2024-08-18	Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration	Hao Ai et.al.	2408.09336	null
2024-08-17	Cross-Species Data Integration for Enhanced Layer Segmentation in Kidney Pathology	Junchao Zhu et.al.	2408.09278	link
2024-08-17	GoodSAM++: Bridging Domain and Capacity Gaps via Segment Anything Model for Panoramic Semantic Segmentation	Weiming Zhang et.al.	2408.09115	null
2024-08-17	Depth-guided Texture Diffusion for Image Semantic Segmentation	Wei Sun et.al.	2408.09097	null
2024-08-15	5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks	Dongshuo Yin et.al.	2408.08345	link
2024-08-14	MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis	Nimeesha Chan et.al.	2408.07773	link
2024-08-15	MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation	Beoungwoo Kang et.al.	2408.07576	link
2024-08-19	MagicFace: Training-free Universal-Style Human Image Customized Synthesis	Yibin Wang et.al.	2408.07433	null
2024-08-14	Segment Using Just One Example	Pratik Vora et.al.	2408.07393	null
2024-08-14	Ensemble architecture in polyp segmentation	Hao-Yun Hsu et.al.	2408.07262	link
2024-08-14	Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks	Raghavendra Singh et.al.	2408.07243	null
2024-08-14	Enhancing Autonomous Vehicle Perception in Adverse Weather through Image Augmentation during Semantic Segmentation Training	Ethan Kou et.al.	2408.07239	link
2024-08-13	ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation	Jingyun Wang et.al.	2408.06747	link
2024-08-10	Dilated Convolution with Learnable Spacings	Ismail Khalfaoui-Hassani et.al.	2408.06383	null
2024-08-12	Correlation Weighted Prototype-based Self-Supervised One-Shot Segmentation of Medical Images	Siladittya Manna et.al.	2408.06235	null
2024-08-12	A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting	Felix Assion et.al.	2408.06071	null
2024-08-12	Enhancing 3D Transformer Segmentation Model for Medical Image with Token-level Representation Learning	Xinrong Hu et.al.	2408.05889	link
2024-08-11	Seg-CycleGAN: SAR-to-optical image translation guided by a downstream task	Hannuo Zhang et.al.	2408.05777	null
2024-08-11	MacFormer: Semantic Segmentation with Fine Object Boundaries	Guoan Xu et.al.	2408.05699	null
2024-08-10	Multimodal generative semantic communication based on latent diffusion model	Weiqi Fu et.al.	2408.05455	null
2024-08-09	In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation	Dahyun Kang et.al.	2408.04961	link
2024-08-09	ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation	Mengcheng Lan et.al.	2408.04883	link
2024-08-09	Extracting Signal Electron Trajectories in the COMET Phase-I Cylindrical Drift Chamber Using Deep Learning	Fumihiro Kaneko et.al.	2408.04795	null
2024-08-08	SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation	Jieming Yu et.al.	2408.04593	null
2024-08-08	SegXAL: Explainable Active Learning for Semantic Segmentation in Driving Scene Scenarios	Sriram Mandalika et.al.	2408.04482	null
2024-08-08	What could go wrong? Discovering and describing failure modes in computer vision	Gabriela Csurka et.al.	2408.04471	null
2024-08-07	CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications	Tianfang Zhang et.al.	2408.03703	link
2024-08-07	SAM2-PATH: A better segment anything model for semantic segmentation in digital pathology	Mingya Zhang et.al.	2408.03651	link
2024-08-06	Post-Mortem Human Iris Segmentation Analysis with Deep Learning	Afzal Hossain et.al.	2408.03448	null
2024-08-06	Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression	Jonas Schmitt et.al.	2408.03046	link
2024-08-05	Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation	Sai Prasanna et.al.	2408.02297	null
2024-08-05	Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs	Jeongkee Lim et.al.	2408.02261	link
2024-08-05	Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders	Muhammad Abdullah Jamal et.al.	2408.02245	null
2024-08-04	Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation	Ye Du et.al.	2408.02039	null
2024-08-03	Bayesian Active Learning for Semantic Segmentation	Sima Didari et.al.	2408.01694	null
2024-08-03	A Comparative Analysis of CNN-based Deep Learning Models for Landslide Detection	Omkar Oak et.al.	2408.01692	null
2024-08-03	Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation	Balázs Opra et.al.	2408.01640	null
2024-08-02	Multi-Unit Floor Plan Recognition and Reconstruction Using Improved Semantic Segmentation of Raster-Wise Floor Plans	Lukas Kratochvila et.al.	2408.01526	null
2024-08-02	Balanced Residual Distillation Learning for 3D Point Cloud Class-Incremental Semantic Segmentation	Yuanzhi Su et.al.	2408.01356	null
2024-08-02	StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation	Bingyu Li et.al.	2408.01343	null
2024-08-02	Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach	Yabin Zhu et.al.	2408.00969	link
2024-08-01	Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation	Siyu Jiao et.al.	2408.00744	link
2024-08-01	Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function	Matias Oscar Volman Stern et.al.	2408.00707	null
2024-08-01	AMAES: Augmented Masked Autoencoder Pretraining on Public Brain MRI Data for 3D-Native Segmentation	Asbjørn Munk et.al.	2408.00640	link
2024-08-01	SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation	Shengbo Tan et.al.	2408.00496	link
2024-07-31	Open-Vocabulary Audio-Visual Semantic Segmentation	Ruohao Guo et.al.	2407.21721	null
2024-07-31	MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment	Anurag Das et.al.	2407.21654	null
2024-07-31	Small Object Few-shot Segmentation for Vision-based Industrial Inspection	Zilong Zhang et.al.	2407.21351	link
2024-07-31	On-the-fly Point Feature Representation for Point Clouds Analysis	Jiangyi Wang et.al.	2407.21335	null
2024-07-31	Fine-grained Metrics for Point Cloud Semantic Segmentation	Zhuheng Lu et.al.	2407.21289	null
2024-07-30	PLANesT-3D: A new annotated dataset for segmentation of 3D plant point clouds	Kerem Mertoğlu et.al.	2407.21150	null
2024-07-30	Learning Ordinality in Semantic Segmentation	Rafael Cristino et.al.	2407.20959	null
2024-07-29	Improving 2D Feature Representations by 3D-Aware Fine-Tuning	Yuanwen Yue et.al.	2407.20229	null
2024-07-29	Background Semantics Matter: Cross-Task Feature Exchange Network for Clustered Infrared Small Target Detection With Sky-Annotated Dataset	Yimian Dai et.al.	2407.20078	link
2024-07-29	Language-driven Grasp Detection with Mask-guided Attention	Tuan Van Vo et.al.	2407.19877	null
2024-07-29	Rethinking RGB-D Fusion for Semantic Segmentation in Surgical Datasets	Muhammad Abdullah Jamal et.al.	2407.19714	null
2024-07-29	ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement	Ezequiel Perez-Zarate et.al.	2407.19708	link
2024-07-28	ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding	Zhen Chen et.al.	2407.19435	link
2024-07-27	Ensembling convolutional neural networks for human skin segmentation	Patryk Kuban et.al.	2407.19310	null
2024-07-27	Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network	Gang Pan et.al.	2407.19271	null
2024-07-26	Sparse Refinement for Efficient High-Resolution Semantic Segmentation	Zhijian Liu et.al.	2407.19014	null
2024-07-29	Learning Spectral-Decomposed Tokens for Domain Generalized Semantic Segmentation	Jingjun Yi et.al.	2407.18568	null
2024-07-25	Taxonomy-Aware Continual Semantic Segmentation in Hyperbolic Spaces for Open-World Perception	Julia Hindel et.al.	2407.18145	null
2024-07-25	TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework	Guanfeng Tang et.al.	2407.18038	null
2024-07-25	Segmentation-guided MRI reconstruction for meaningfully diverse reconstructions	Jan Nikolas Morshuis et.al.	2407.18026	link
2024-07-24	Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation	Hyunwoo Yu et.al.	2407.17261	link
2024-07-24	Trans2Unet: Neural fusion for Nuclei Semantic Segmentation	Dinh-Phu Tran et.al.	2407.17181	null
2024-07-24	PiPa++: Towards Unification of Domain Adaptive Semantic Segmentation via Self-supervised Learning	Mu Chen et.al.	2407.17101	null
2024-07-25	Enhancing Environmental Monitoring through Multispectral Imaging: The WasteMS Dataset for Semantic Segmentation of Lakeside Waste	Qinfeng Zhu et.al.	2407.17028	link
2024-07-24	Progressive Query Refinement Framework for Bird’s-Eye-View Semantic Segmentation from Surrounding Images	Dooseop Choi et.al.	2407.17003	link
2024-07-23	Deformable Convolution Based Road Scene Semantic Segmentation of Fisheye Images in Autonomous Driving	Anam Manzoor et.al.	2407.16647	null
2024-07-23	Deep Bayesian segmentation for colon polyps: Well-calibrated predictions in medical imaging	Daniela L. Ramos et.al.	2407.16608	link
2024-07-23	Augmented Efficiency: Reducing Memory Footprint and Accelerating Inference for 3D Semantic Segmentation through Hybrid Vision	Aditya Krishnan et.al.	2407.16102	null
2024-07-22	MILAN: Milli-Annotations for Lidar Semantic Segmentation	Nermin Samet et.al.	2407.15797	null
2024-07-22	Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond	Silvio Galesso et.al.	2407.15739	link
2024-07-22	MSSPlace: Multi-Sensor Place Recognition with Visual and Text Semantics	Alexander Melekhin et.al.	2407.15663	link
2024-07-22	Learning at a Glance: Towards Interpretable Data-limited Continual Semantic Segmentation via Semantic-Invariance Modelling	Bo Yuan et.al.	2407.15429	link
2024-07-22	Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data	Junha Song et.al.	2407.15383	link
2024-07-21	Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation	Xiaoyang Wu et.al.	2407.15282	null
2024-07-20	Downstream-Pretext Domain Knowledge Traceback for Active Learning	Beichen Zhang et.al.	2407.14720	null
2024-07-19	Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model	Kun Zhao et.al.	2407.14326	null
2024-07-19	Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation	Zhengyuan Xie et.al.	2407.14142	link
2024-07-19	GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation	Florian Chabot et.al.	2407.14108	null
2024-07-18	Many Perception Tasks are Highly Redundant Functions of their Input Data	Rahul Ramesh et.al.	2407.13841	null
2024-07-18	GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model	Abdelrahman Shaker et.al.	2407.13772	link
2024-07-18	SegPoint: Segment Any Point Cloud via Large Language Model	Shuting He et.al.	2407.13761	null
2024-07-23	MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis	Ziming Zhong et.al.	2407.13675	link
2024-07-18	Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models	Xiaoyu Zhu et.al.	2407.13642	null
2024-07-18	FADE: A Task-Agnostic Upsampling Operator for Encoder-Decoder Architectures	Hao Lu et.al.	2407.13500	link
2024-07-18	FREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions	Sohyun Lee et.al.	2407.13437	null
2024-07-18	Lightweight Uncertainty Quantification with Simplex Semantic Segmentation for Terrain Traversability	Judith Dijk et.al.	2407.13392	null
2024-07-18	Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation	Chang Liu et.al.	2407.13363	link
2024-07-18	Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation	Shoumeng Qiu et.al.	2407.13254	link
2024-07-18	OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird’s-eye-view Vehicle Semantic Segmentation	Jian Sun et.al.	2407.13137	null
2024-07-18	Tree semantic segmentation from aerial image time series	Venkatesh Ramesh et.al.	2407.13102	null
2024-07-17	ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders	Carlos Hinojosa et.al.	2407.13036	link
2024-07-17	Weighting Pseudo-Labels via High-Activation Feature Index Similarity and Object Detection for Semi-Supervised Segmentation	Prantik Howlader et.al.	2407.12630	link
2024-07-17	Instance-wise Uncertainty for Class Imbalance in Semantic Segmentation	Luís Almeida et.al.	2407.12609	null
2024-07-18	Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks	Antoni Kowalczuk et.al.	2407.12588	link
2024-07-17	Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation	Ruijie Xu et.al.	2407.12489	link
2024-07-17	Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation	Hyun Seok Seong et.al.	2407.12463	link
2024-07-17	ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference	Mengcheng Lan et.al.	2407.12442	null
2024-07-17	Serialized Point Mamba: A Serialized Point Cloud Mamba Segmentation Model	Tao Wang et.al.	2407.12319	null
2024-07-16	FoodMem: Near Real-time and Precise Food Video Segmentation	Ahmad AlMughrabi et.al.	2407.12121	null
2024-07-16	Mitigating Background Shift in Class-Incremental Semantic Segmentation	Gilhan Park et.al.	2407.11859	link
2024-07-16	Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation	Juncheng Ma et.al.	2407.11820	link
2024-07-16	XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach	Truong Thanh Hung Nguyen et.al.	2407.11771	link
2024-07-16	OAM-TCD: A globally diverse dataset of high-resolution tree cover maps	Josh Veitch-Michaelis et.al.	2407.11743	link
2024-07-16	SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds	Yanbo Wang et.al.	2407.11569	link
2024-07-16	Leveraging Segment Anything Model in Identifying Buildings within Refugee Camps (SAM4Refugee) from Satellite Imagery for Humanitarian Operations	Yunya Gao et.al.	2407.11381	link
2024-07-16	Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities	Xu Zheng et.al.	2407.11351	null
2024-07-16	Centering the Value of Every Modality: Towards Efficient and Resilient Modality-agnostic Semantic Segmentation	Xu Zheng et.al.	2407.11344	null
2024-07-16	TCFormer: Visual Recognition via Token Clustering Transformer	Wang Zeng et.al.	2407.11321	link
2024-07-15	Distributed Semantic Segmentation with Efficient Joint Source and Task Decoding	Danish Nazir et.al.	2407.11224	null
2024-07-15	Finding Meaning in Points: Weakly Supervised Semantic Segmentation for Event Cameras	Hoonhee Cho et.al.	2407.11216	link
2024-07-15	No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations	Walter Simoncini et.al.	2407.10964	link
2024-07-15	APC: Adaptive Patch Contrast for Weakly Supervised Semantic Segmentation	Wangyu Wu et.al.	2407.10649	null
2024-07-15	Automated Label Unification for Multi-Dataset Semantic Segmentation with GNNs	Rong Ma et.al.	2407.10534	null
2024-07-14	Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data	Tuo Feng et.al.	2407.10200	link
2024-07-14	RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation	Li Li et.al.	2407.10159	link
2024-07-14	HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation	Chengjie Jiang et.al.	2407.10047	null
2024-07-13	Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation	Anqi Zhang et.al.	2407.09838	null
2024-07-13	Enhancing Semantic Segmentation with Adaptive Focal Loss: A Novel Approach	Md Rakibul Islam et.al.	2407.09828	null
2024-07-13	3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance	Xiaoxu Xu et.al.	2407.09826	link
2024-07-13	TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation	Xiaopei Wu et.al.	2407.09751	link
2024-07-12	Uplifting Range-View-based 3D Semantic Segmentation in Real-Time with Multi-Sensor Fusion	Shiqi Tan et.al.	2407.09697	null
2024-07-12	SPIN: Hierarchical Segmentation with Subpart Granularity in Natural Images	Josh Myers-Dean et.al.	2407.09686	null
2024-07-12	FANet: Feature Amplification Network for Semantic Segmentation in Cluttered Background	Muhammad Ali et.al.	2407.09379	link
2024-07-12	Salt & Pepper Heatmaps: Diffusion-informed Landmark Detection Strategy	Julian Wyatt et.al.	2407.09192	null
2024-07-12	Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off	Levente Halmosi et.al.	2407.09150	link
2024-07-12	Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation	Wei Cong et.al.	2407.09047	null
2024-07-12	Textual Query-Driven Mask Transformer for Domain Generalized Segmentation	Byeonghyun Pak et.al.	2407.09033	link
2024-07-12	Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation	Zihao Li et.al.	2407.08994	null
2024-07-11	Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation	Tong Shao et.al.	2407.08268	link
2024-07-11	Enrich the content of the image Using Context-Aware Copy Paste	Qiushi Guo et.al.	2407.08151	null
2024-07-10	MambaVision: A Hybrid Mamba-Transformer Vision Backbone	Ali Hatamizadeh et.al.	2407.08083	link
2024-07-10	Satellite Image Time Series Semantic Change Detection: Novel Architecture and Analysis of Domain Shift	Elliot Vincent et.al.	2407.07616	link
2024-07-10	H-FCBFormer Hierarchical Fully Convolutional Branch Transformer for Occlusal Contact Segmentation with Articulating Paper	Ryan Banks et.al.	2407.07604	link
2024-07-11	Trainable Highly-expressive Activation Functions	Irit Chelly et.al.	2407.07564	link
2024-07-10	Deformable-Heatmap-Segmentation for Automobile Visual Perception	Hongyu Jin et.al.	2407.07493	null
2024-07-10	Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining	Tianfang Sun et.al.	2407.07465	null
2024-07-11	HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation	Guoan Xu et.al.	2407.07441	null
2024-07-09	ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation	Yuyuan Liu et.al.	2407.07171	link
2024-07-08	Training-free CryoET Tomogram Segmentation	Yizhou Zhao et.al.	2407.06833	link
2024-07-09	CycleSAM: One-Shot Surgical Scene Segmentation using Cycle-Consistent Feature Matching to Prompt SAM	Aditya Murali et.al.	2407.06795	null
2024-07-09	LuSNAR: A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration	Jiayi Liu et.al.	2407.06512	link
2024-07-08	Leveraging image captions for selective whole slide image annotation	Jingna Qiu et.al.	2407.06363	link
2024-07-08	Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots	Siva Krishna Ravipati et.al.	2407.06077	link
2024-07-08	Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts	Puzuo Wang et.al.	2407.06043	null
2024-07-08	RHRSegNet: Relighting High-Resolution Night-Time Semantic Segmentation	Sarah Elmahdy et.al.	2407.06016	null
2024-07-07	Semantic Segmentation for Real-World and Synthetic Vehicle’s Forward-Facing Camera Images	Tuan T. Nguyen et.al.	2407.05452	null
2024-07-07	Self-supervised Learning via Cluster Distance Prediction for Operating Room Context Awareness	Idris Hamoud et.al.	2407.05448	null
2024-07-06	A Study of Test-time Contrastive Concepts for Open-world, Open-vocabulary Semantic Segmentation	Monika Wysoczańska et.al.	2407.05061	null
2024-07-06	BlessemFlood21: Advancing Flood Analysis with a High-Resolution Georeferenced Dataset for Humanitarian Aid Support	Vladyslav Polushko et.al.	2407.05007	null
2024-07-05	Explainable Metric Learning for Deflating Data Bias	Emma Andrews et.al.	2407.04866	null
2024-07-10	LMSeg: A deep graph message-passing network for efficient and accurate semantic segmentation of large-scale 3D landscape meshes	Zexian Huang et.al.	2407.04326	null
2024-07-04	Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier	Prantik Howlader et.al.	2407.04036	link
2024-07-04	Relative Difficulty Distillation for Semantic Segmentation	Dong Liang et.al.	2407.03719	link
2024-07-04	POSTURE: Pose Guided Unsupervised Domain Adaptation for Human Body Part Segmentation	Arindam Dutta et.al.	2407.03549	null
2024-07-03	A Unified Framework for 3D Scene Understanding	Wei Xu et.al.	2407.03263	link
2024-07-03	ISWSST: Index-space-wave State Superposition Transformers for Multispectral Remotely Sensed Imagery Semantic Segmentation	Chang Li et.al.	2407.03033	null
2024-07-03	ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation	Yipin Guo et.al.	2407.02881	null
2024-07-03	Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation	Tao Chen et.al.	2407.02768	link
2024-07-02	Open Panoramic Segmentation	Junwei Zheng et.al.	2407.02685	link
2024-07-08	Holistically-Nested Structure-Aware Graph Neural Network for Road Extraction	Tinghuai Wang et.al.	2407.02639	null
2024-07-02	Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather	Junsung Park et.al.	2407.02286	link
2024-07-02	MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders	Baijiong Lin et.al.	2407.02228	link
2024-07-02	Occlusion-Aware Seamless Segmentation	Yihong Cao et.al.	2407.02182	link
2024-07-02	VRBiom: A New Periocular Dataset for Biometric Applications of HMD	Ketan Kotwal et.al.	2407.02150	null
2024-07-02	Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual Prompts	Pasquale De Marinis et.al.	2407.02075	link
2024-07-02	Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning	Chengchao Shen et.al.	2407.02014	link
2024-07-01	Label-free Neural Semantic Image Synthesis	Jiayi Wang et.al.	2407.01790	null
2024-07-01	PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction	Xuan Yu et.al.	2407.01349	null
2024-07-01	CSFNet: A Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes	Danial Qashqai et.al.	2407.01328	link
2024-06-29	SolarSAM: Building-scale Photovoltaic Potential Assessment Based on Segment Anything Model (SAM) and Remote Sensing for Emerging City	Guohao Wang et.al.	2407.00296	link
2024-06-28	Assistive Image Annotation Systems with Deep Learning and Natural Language Capabilities: A Review	Moseli Mots’oehli et.al.	2407.00252	null
2024-07-01	Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding	Yifan Tang et.al.	2406.19791	null
2024-06-28	Precision matters: Precision-aware ensemble for weakly supervised semantic segmentation	Junsung Park et.al.	2406.19638	link
2024-06-28	PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation	Deyi Ji et.al.	2406.19632	null
2024-06-27	Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model	Haobo Yuan et.al.	2406.19369	link
2024-06-27	ProtoGMM: Multi-prototype Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation	Nazanin Moradinasab et.al.	2406.19225	null
2024-06-30	Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO	Fuseini Mumuni et.al.	2406.19057	null
2024-06-27	Divide, Ensemble and Conquer: The Last Mile on Unsupervised Domain Adaptation for On-Board Semantic Segmentation	Tao Lian et.al.	2406.18809	null
2024-06-26	CAS: Confidence Assessments of classification algorithms for Semantic segmentation of EO data	Nikolaos Dionelis et.al.	2406.18279	link
2024-06-26	The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval	Meinardus Boris et.al.	2406.18113	link
2024-06-26	Few-Shot Medical Image Segmentation with High-Fidelity Prototypes	Song Tang et.al.	2406.18074	link
2024-06-25	Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation	Xuming Zhang et.al.	2406.17679	null
2024-06-25	DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation	Ahmad Mohammadshirazi et.al.	2406.17591	link
2024-06-25	Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation	Felix Stillger et.al.	2406.17541	null
2024-06-25	Investigating Self-Supervised Methods for Label-Efficient Learning	Srinivasa Rao Nandam et.al.	2406.17460	null
2024-06-25	Pseudo Labelling for Enhanced Masked Autoencoders	Srinivasa Rao Nandam et.al.	2406.17450	null
2024-06-25	Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model	Zhuoyuan Li et.al.	2406.17442	null
2024-06-25	Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes	Qi Ma et.al.	2406.17438	link
2024-06-24	Instance Consistency Regularization for Semi-Supervised 3D Instance Segmentation	Yizheng Wu et.al.	2406.16776	link
2024-06-24	μ-Net: A Deep Learning-Based Architecture for μ-CT Segmentation	Pierangela Bruno et.al.	2406.16724	null
2024-06-24	GATSBI: An Online GTSP-Based Algorithm for Targeted Surface Bridge Inspection and Defect Detection	Harnaik Dhami et.al.	2406.16625	link
2024-06-24	LOGCAN++: Local-global class-aware network for semantic segmentation of remote sensing images	Xiaowen Ma et.al.	2406.16502	link
2024-06-24	Cascade Reward Sampling for Efficient Decoding-Time Alignment	Bolian Li et.al.	2406.16306	link
2024-06-24	SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments	Neng Wang et.al.	2406.16279	link
2024-06-23	UDHF2-Net: An Uncertainty-diffusion-model-based High-Frequency TransFormer Network for High-accuracy Interpretation of Remotely Sensed Imagery	Pengfei Zhang et.al.	2406.16129	null
2024-06-22	Fine-grained Background Representation for Weakly Supervised Semantic Segmentation	Xu Yin et.al.	2406.15755	link
2024-06-20	Evaluation of Deep Learning Semantic Segmentation for Land Cover Mapping on Multispectral, Hyperspectral and High Spatial Aerial Imagery	Ilham Adi Panuntun et.al.	2406.14220	null
2024-06-20	Trusting Semantic Segmentation Networks	Samik Some et.al.	2406.14201	null
2024-06-20	EvSegSNN: Neuromorphic Semantic Segmentation for Event Data	Dalia Hareb et.al.	2406.14178	null
2024-06-20	Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images	Qinfeng Zhu et.al.	2406.14086	link
2024-06-19	Search-based DNN Testing and Retraining with GAN-enhanced Simulations	Mohammed Oualid Attaoui et.al.	2406.13359	null
2024-06-19	Deep Learning-Based 3D Instance and Semantic Segmentation: A Review	Siddiqui Muhammad Yasir et.al.	2406.13308	null
2024-06-18	Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation	Guoyu Yang et.al.	2406.12496	link
2024-06-18	Agriculture-Vision Challenge 2024 – The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble	Wang Liu et.al.	2406.12271	null
2024-06-17	OoDIS: Anomaly Instance Segmentation Benchmark	Alexey Nekrasov et.al.	2406.11835	link
2024-06-17	Multimodal Learning To Improve Segmentation With Intraoperative CBCT & Preoperative CT	Maximilian E. Tschuchnig et.al.	2406.11650	null
2024-06-17	SWCF-Net: Similarity-weighted Convolution and Local-global Fusion for Efficient Large-scale Point Cloud Semantic Segmentation	Zhenchao Lin et.al.	2406.11441	link
2024-06-17	Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding	Yunsong Wang et.al.	2406.11283	null
2024-06-17	Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation	Bingfeng Zhang et.al.	2406.11189	link
2024-06-21	$\alpha$-SSC: Uncertainty-Aware Camera-based 3D Semantic Scene Completion	Sanbao Su et.al.	2406.11021	null
2024-06-16	PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery	Libo Wang et.al.	2406.10828	link
2024-06-15	GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR	Bharat Singh et.al.	2406.10722	null
2024-06-15	A Late-Stage Bitemporal Feature Fusion Network for Semantic Change Detection	Chenyao Zhou et.al.	2406.10678	link
2024-06-14	ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers	Narges Norouzi et.al.	2406.09936	link
2024-06-14	Label-Efficient Semantic Segmentation of LiDAR Point Clouds in Adverse Weather Conditions	Aldi Piroli et.al.	2406.09906	null
2024-06-17	Exploring the Benefits of Vision Foundation Models for Unsupervised Domain Adaptation	Brunó B. Englert et.al.	2406.09896	link
2024-06-14	Open-Vocabulary Semantic Segmentation with Image Embedding Balancing	Xiangheng Shan et.al.	2406.09829	link
2024-06-13	Instance-level quantitative saliency in multiple sclerosis lesion segmentation	Federico Spagnolo et.al.	2406.09335	link
2024-06-13	APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation	Weizhao He et.al.	2406.08372	null
2024-06-12	Dataset Enhancement with Instance-Level Augmentations	Orest Kupyn et.al.	2406.08249	link
2024-06-16	A$^{2}$-MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder	Lixian Zhang et.al.	2406.08079	null
2024-06-12	OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding	Yinan Deng et.al.	2406.08009	link
2024-06-12	SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation	Chanda Grover Kamra et.al.	2406.07986	link
2024-06-12	Small Scale Data-Free Knowledge Distillation	He Liu et.al.	2406.07876	link
2024-06-11	Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph	Sergey Linok et.al.	2406.07113	null
2024-06-11	PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving	Yining Shi et.al.	2406.07037	null
2024-06-12	LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection	Jiahua Xu et.al.	2406.07023	null
2024-06-10	Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation	Dong Zhao et.al.	2406.06813	link
2024-06-09	Transforming Heart Chamber Imaging: Self-Supervised Learning for Whole Heart Reconstruction and Segmentation	Abdul Qayyum et.al.	2406.06643	null
2024-06-10	Merlin: A Vision Language Foundation Model for 3D Computed Tomography	Louis Blankemeier et.al.	2406.06512	null
2024-06-10	UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving	Daniel Bogdoll et.al.	2406.06370	null
2024-06-09	Scaling Graph Convolutions for Mobile Vision	William Avery et.al.	2406.05850	link
2024-06-09	Solution for CVPR 2024 UG2+ Challenge Track on All Weather Semantic Segmentation	Jun Yu et.al.	2406.05837	null
2024-06-09	Convolution and Attention-Free Mamba-based Cardiac Image Segmentation	Abbas Khan et.al.	2406.05786	link
2024-06-09	Separating the “Chirp” from the “Chat”: Self-supervised Visual Grounding of Sound and Language	Mark Hamilton et.al.	2406.05629	link
2024-06-08	A Two-Stage Adverse Weather Semantic Segmentation Method for WeatherProof Challenge CVPR 2024 Workshop UG2+	Jianzhao Wang et.al.	2406.05513	null
2024-06-08	Layered Image Vectorization via Semantic Simplification	Zhenyu Wang et.al.	2406.05404	null
2024-06-08	1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR’24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation	Qingfeng Liu et.al.	2406.05352	null
2024-06-07	USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation	Xiaoqi Wang et.al.	2406.05271	null
2024-06-07	Semantic Segmentation on VSPW Dataset through Masked Video Consistency	Chen Liang et.al.	2406.04979	null
2024-06-07	Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment	Venkanna Babu Guthula et.al.	2406.04949	null
2024-06-06	Characterizing segregation in blast rock piles: a deep-learning approach leveraging aerial image analysis	Chengeng Liu et.al.	2406.04149	null
2024-06-06	Frequency-based Matcher for Long-tailed Semantic Segmentation	Shan Li et.al.	2406.03917	link
2024-06-07	Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge	Nan Zhang et.al.	2406.03799	link
2024-06-06	DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation	Zilu Guo et.al.	2406.03702	link
2024-06-05	Comparative Benchmarking of Failure Detection Methods in Medical Image Segmentation: Unveiling the Role of Confidence Aggregation	Maximilian Zenk et.al.	2406.03323	null
2024-06-05	Learning Semantic Traversability with Egocentric Video and Automated Annotation Strategy	Yunho Kim et.al.	2406.02989	null
2024-06-04	W-RIZZ: A Weakly-Supervised Framework for Relative Traversability Estimation in Mobile Robotics	Andre Schreiber et.al.	2406.02822	link
2024-06-04	Window to Wall Ratio Detection using SegFormer	Zoe De Simone et.al.	2406.02706	link
2024-06-04	Detecting Endangered Marine Species in Autonomous Underwater Vehicle Imagery Using Point Annotations and Few-Shot Learning	Heather Doig et.al.	2406.01932	null
2024-06-03	EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding	Thanh-Dat Truong et.al.	2406.01429	null
2024-06-03	TE-NeXt: A LiDAR-Based 3D Sparse Convolutional Network for Traversability Estimation	Antonio Santo et.al.	2406.01395	link
2024-06-03	ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds	Ka Lung Cheung et.al.	2406.01337	link
2024-06-03	LSKSANet: A Novel Architecture for Remote Sensing Image Semantic Segmentation Leveraging Large Selective Kernel and Sparse Attention Mechanism	Miao Fu et.al.	2406.01228	null
2024-06-04	GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer	Ding Jia et.al.	2406.01210	link
2024-06-03	S-CycleGAN: Semantic Segmentation Enhanced CT-Ultrasound Image-to-Image Translation for Robotic Ultrasonography	Yuhan Song et.al.	2406.01191	link
2024-06-02	Diffusion Features to Bridge Domain Gap for Semantic Segmentation	Yuxiang Ji et.al.	2406.00777	link
2024-06-06	Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation	Yunheng Li et.al.	2406.00670	link
2024-06-02	Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024	Biao Wu et.al.	2406.00587	null
2024-06-01	Memory-guided Network with Uncertainty-based Feature Augmentation for Few-shot Semantic Segmentation	Xinyue Chen et.al.	2406.00545	null
2024-06-01	2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation	Biao Wu et.al.	2406.00500	null
2024-06-01	DSCA: A Digital Subtraction Angiography Sequence Dataset and Spatio-Temporal Model for Cerebral Artery Segmentation	Qihang Xie et.al.	2406.00341	null
2024-06-01	Complex Style Image Transformations for Domain Generalization in Medical Images	Nikolaos Spanos et.al.	2406.00298	null
2024-05-31	TotalVibeSegmentator: Full Torso Segmentation for the NAKO and UK Biobank in Volumetric Interpolated Breath-hold Examination Body Images	Robert Graf et.al.	2406.00125	link
2024-05-31	Uncertainty Quantification for Bird’s Eye View Semantic Segmentation: Methods and Benchmarks	Linlin Yu et.al.	2405.20986	null
2024-05-31	Revisiting and Maximizing Temporal Knowledge in Semi-supervised Semantic Segmentation	Wooseok Shin et.al.	2405.20610	link
2024-05-30	P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation	Qi Zhang et.al.	2405.20443	link
2024-05-30	SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow	Chaoyang Wang et.al.	2405.20282	link
2024-05-30	MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion	Angel Villar-Corrales et.al.	2405.19921	link
2024-05-30	Open-Set Domain Adaptation for Semantic Segmentation	Seun-An Choe et.al.	2405.19899	link
2024-05-30	DenseSeg: Joint Learning for Semantic Segmentation and Landmark Detection Using Dense Image-to-Shape Representation	Ron Keuth et.al.	2405.19746	link
2024-05-30	Twin Deformable Point Convolutions for Point Cloud Semantic Segmentation in Remote Sensing Scenes	Yong-Qiang Mao et.al.	2405.19735	null
2024-05-30	CRIS: Collaborative Refinement Integrated with Segmentation for Polyp Segmentation	Ankush Gajanan Arudkar et.al.	2405.19672	null
2024-05-29	Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation	Lianlei Shan et.al.	2405.19568	null
2024-05-29	Enabling Visual Recognition at Radio Frequency	Haowen Lai et.al.	2405.19516	null
2024-05-29	Reasoning3D – Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models	Tianrun Chen et.al.	2405.19326	null
2024-05-29	A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation	Niclas Vödisch et.al.	2405.19035	link
2024-05-29	Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation	Zelin Peng et.al.	2405.18840	null
2024-05-28	Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation	JuneHyoung Kwon et.al.	2405.18148	null
2024-05-28	Edge-guided and Class-balanced Active Learning for Semantic Segmentation of Aerial Images	Lianlei Shan et.al.	2405.18078	null
2024-05-28	RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields	Mihnea-Bogdan Jurca et.al.	2405.18033	link
2024-05-28	DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture	Shentong Mo et.al.	2405.17995	link
2024-05-28	The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention	Xingyu Ding et.al.	2405.17776	null
2024-05-27	Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation	Steven Landgraf et.al.	2405.17097	null
2024-05-27	DSU-Net: Dynamic Snake U-Net for 2-D Seismic First Break Picking	Hongtao Wang et.al.	2405.16980	null
2024-05-27	Collective Perception Datasets for Autonomous Driving: A Comprehensive Review	Sven Teufel et.al.	2405.16973	null
2024-05-27	Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models	Qian Wang et.al.	2405.16947	link
2024-05-27	A re-calibration method for object detection with multi-modal alignment bias in autonomous driving	Zhihang Song et.al.	2405.16848	null
2024-05-25	BOLD: Boolean Logic Deep Learning	Van Minh Nguyen et.al.	2405.16339	null
2024-05-25	Improving 3D Occupancy Prediction through Class-balancing Loss and Multi-scale Representation	Huizhou Chen et.al.	2405.16099	null
2024-05-25	Intensity and Texture Correction of Omnidirectional Image Using Camera Images for Indirect Augmented Reality	Hakim Ikebayashi et.al.	2405.16008	null
2024-05-24	Visualize and Paint GAN Activations	Rudolf Herdt et.al.	2405.15636	null
2024-05-24	Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasets	Hoàng-Ân Lê et.al.	2405.15394	link
2024-05-24	U3M: Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation	Bingyu Li et.al.	2405.15365	link
2024-05-24	Cross-Domain Few-Shot Semantic Segmentation via Doubly Matching Transformation	Jiayi Chen et.al.	2405.15265	link
2024-05-23	Mamba-R: Vision Mamba ALSO Needs Registers	Feng Wang et.al.	2405.14858	null
2024-05-23	Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation	Daniel Kienzle et.al.	2405.14467	link
2024-05-23	MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models	Jiuming Liu et.al.	2405.14338	null
2024-05-23	Tuning-free Universally-Supervised Semantic Segmentation	Xiaobo Yang et.al.	2405.14294	null
2024-05-23	SCMix: Stochastic Compound Mixing for Open Compound Domain Adaptation in Semantic Segmentation	Kai Yao et.al.	2405.14278	null
2024-05-23	Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations	Mohammed Baharoon et.al.	2405.14239	link
2024-05-24	Leveraging Semantic Segmentation Masks with Embeddings for Fine-Grained Form Classification	Taylor Archibald et.al.	2405.14162	null
2024-05-23	Skip-SCAR: A Modular Approach to ObjectGoal Navigation with Sparsity and Adaptive Skips	Yaotian Liu et.al.	2405.14154	null
2024-05-22	TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System	Diogo Lavado et.al.	2405.13989	null
2024-05-22	Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer	Qihang Fan et.al.	2405.13337	link
2024-05-22	Vision Transformer with Sparse Scan Prior	Qihang Fan et.al.	2405.13335	link
2024-05-22	Deep Learning-Driven State Correction: A Hybrid Architecture for Radar-Based Dynamic Occupancy Grid Mapping	Max Peter Ronecker et.al.	2405.13307	null
2024-05-21	Transparency Distortion Robustness for SOTA Image Segmentation Tasks	Volker Knauthe et.al.	2405.12864	null
2024-05-20	A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation	Sushmita Sarker et.al.	2405.11903	null
2024-05-20	Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments	Jooyong Park et.al.	2405.11855	null
2024-05-20	Universal Organizer of SAM for Unsupervised Semantic Segmentation	Tingting Li et.al.	2405.11742	link
2024-05-19	Interpreting a Semantic Segmentation Model for Coastline Detection	Conor O’Sullivan et.al.	2405.11500	link
2024-05-17	CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation	Mushui Liu et.al.	2405.10530	link
2024-05-16	Towards Task-Compatible Compressible Representations	Anderson de Andrade et.al.	2405.10244	link
2024-05-16	A Preprocessing and Postprocessing Voxel-based Method for LiDAR Semantic Segmentation Improvement in Long Distance	Andrea Matteazzi et.al.	2405.10046	null
2024-05-16	Towards Realistic Incremental Scenario in Class Incremental Semantic Segmentation	Jihwan Kwak et.al.	2405.09858	link
2024-05-15	Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation	Guo Yachan et.al.	2405.09682	null
2024-05-14	CLIP with Quality Captions: A Strong Pretraining for Vision Tasks	Pavan Kumar Anasosalu Vasu et.al.	2405.08911	null
2024-05-14	Rethinking Scanning Strategies with Vision Mamba in Semantic Segmentation of Remote Sensing Imagery: An Experimental Study	Qinfeng Zhu et.al.	2405.08493	null
2024-05-14	TEDNet: Twin Encoder Decoder Neural Network for 2D Camera and LiDAR Road Detection	Martín Bayón-Gutiérrez et.al.	2405.08429	link
2024-05-13	IMAFD: An Interpretable Multi-stage Approach to Flood Detection from time series Multispectral Data	Ziyang Zhang et.al.	2405.07916	null
2024-05-12	Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception	Haoming Chen et.al.	2405.07201	link
2024-05-10	GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs	Mustafa Munir et.al.	2405.06849	link
2024-05-10	Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach	Elham Ravanbakhsh et.al.	2405.06586	null
2024-05-10	Semantic and Spatial Adaptive Pixel-level Classifier for Semantic Segmentation	Xiaowen Ma et.al.	2405.06525	link
2024-05-10	Multi-Target Unsupervised Domain Adaptation for Semantic Segmentation without External Data	Yonghao Xu et.al.	2405.06502	link
2024-05-10	Multi-level Personalized Federated Learning on Heterogeneous and Long-Tailed Data	Rongyu Zhang et.al.	2405.06413	null
2024-05-10	Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation	Zhenliang Ni et.al.	2405.06228	link
2024-05-10	Zero-shot Degree of Ill-posedness Estimation for Active Small Object Change Detection	Koji Takeda et.al.	2405.06185	null
2024-05-10	Prior-guided Diffusion Model for Cell Segmentation in Quantitative Phase Imaging	Zhuchen Shao et.al.	2405.06175	null
2024-05-09	Mask-TS Net: Mask Temperature Scaling Uncertainty Calibration for Polyp Segmentation	Yudian Zhang et.al.	2405.05830	null
2024-05-08	OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies	Lingdong Kong et.al.	2405.05259	link
2024-05-08	Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving	Lingdong Kong et.al.	2405.05258	link
2024-05-08	Weakly-supervised Semantic Segmentation via Dual-stream Contrastive Learning of Cross-image Contextual Information	Qi Lai et.al.	2405.04913	null
2024-05-08	DeepDamageNet: A two-step deep-learning model for multi-disaster building damage segmentation and classification using satellite imagery	Irene Alisjahbana et.al.	2405.04800	null
2024-05-13	FRACTAL: An Ultra-Large-Scale Aerial Lidar Dataset for 3D Semantic Segmentation of Diverse Landscapes	Charles Gaydon et.al.	2405.04634	link
2024-05-07	A New Dataset and Comparative Study for Aphid Cluster Detection and Segmentation in Sorghum Fields	Raiyan Rahman et.al.	2405.04305	null
2024-05-07	ELiTe: Efficient Image-to-LiDAR Knowledge Transfer for Semantic Segmentation	Zhibo Zhang et.al.	2405.04121	null
2024-05-06	PTQ4SAM: Post-Training Quantization for Segment Anything	Chengtao Lv et.al.	2405.03144	link
2024-05-04	MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning	Vishal Nedungadi et.al.	2405.02771	link
2024-05-04	Few-Shot Fruit Segmentation via Transfer Learning	Jordan A. James et.al.	2405.02556	link
2024-05-03	DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model	Peijin Jia et.al.	2405.02008	null
2024-05-02	Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey	Guoping Xu et.al.	2405.01725	link
2024-05-02	Explainable AI (XAI) in Image Segmentation in Medicine, Industry, and Beyond: A Survey	Rokas Gipiškis et.al.	2405.01636	null
2024-05-02	CromSS: Cross-modal pre-training with noisy labels for remote sensing image segmentation	Chenying Liu et.al.	2405.01217	null
2024-05-02	Uncertainty-aware self-training with expectation maximization basis transformation	Zijia Wang et.al.	2405.01175	null
2024-05-01	Exploring Self-Supervised Vision Transformers for Deepfake Detection: A Comparative Analysis	Huy H. Nguyen et.al.	2405.00355	link
2024-04-30	Masked Multi-Query Slot Attention for Unsupervised Object Discovery	Rishav Pramanik et.al.	2404.19654	link
2024-04-30	DELINE8K: A Synthetic Data Pipeline for the Semantic Segmentation of Historical Documents	Taylor Archibald et.al.	2404.19259	null
2024-04-29	Swin2-MoSE: A New Single Image Super-Resolution Model for Remote Sensing	Leonardo Rossi et.al.	2404.18924	link
2024-04-29	IPixMatch: Boost Semi-supervised Semantic Segmentation with Inter-Pixel Relation	Kebin Wu et.al.	2404.18891	null
2024-04-29	Towards Long-term Robotics in the Wild	Stephen Hausler et.al.	2404.18477	null
2024-04-27	Multi-Stream Cellular Test-Time Adaptation of Real-Time Models Evolving in Dynamic Environments	Benoît Gérin et.al.	2404.17930	link
2024-04-27	GLIMS: Attention-Guided Lightweight Multi-Scale Hybrid Network for Volumetric Semantic Segmentation	Ziya Ata Yazıcı et.al.	2404.17854	link
2024-04-27	CLFT: Camera-LiDAR Fusion Transformer for Semantic Segmentation in Autonomous Driving	Junyi Gu et.al.	2404.17793	link
2024-04-26	Optimizing Universal Lesion Segmentation: State Space Model-Guided Hierarchical Networks with Feature Importance Adjustment	Kazi Shahriar Sanjid et.al.	2404.17235	null
2024-04-25	Calculation of Femur Caput Collum Diaphyseal angle for X-Rays images using Semantic Segmentation	Deepak Bhatia et.al.	2404.17083	null
2024-04-25	Boosting Unsupervised Semantic Segmentation with Principal Mask Proposals	Oliver Hahn et.al.	2404.16818	link
2024-04-26	Multi-Scale Representations by Varying Window Attention for Semantic Segmentation	Haotian Yan et.al.	2404.16573	link
2024-04-25	360SFUDA++: Towards Source-free UDA for Panoramic Segmentation by Learning Reliable Category Prototypes	Xu Zheng et.al.	2404.16501	null
2024-04-25	Semantic Segmentation Refiner for Ultrasound Applications with Zero-Shot Foundation Models	Hedda Cohen Indelman et.al.	2404.16325	null
2024-04-25	Style Adaptation for Domain-adaptive Semantic Segmentation	Ting Li et.al.	2404.16301	null
2024-04-29	A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic Segmentation	Yifan Zhao et.al.	2404.16266	link
2024-04-24	3D Freehand Ultrasound using Visual Inertial and Deep Inertial Odometry for Measuring Patellar Tracking	Russell Buchanan et.al.	2404.15847	null
2024-04-24	Vision Transformer-based Adversarial Domain Adaptation	Yahan Li et.al.	2404.15817	link
2024-04-22	OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks	Sophia Sirko-Galouchenko et.al.	2404.14027	link
2024-04-21	Semantic-Rearrangement-Based Multi-Level Alignment for Domain Generalized Segmentation	Guanlong Jiao et.al.	2404.13701	null
2024-04-21	PV-S3: Advancing Automatic Photovoltaic Defect Detection using Semi-Supervised Semantic Segmentation of Electroluminescence Images	Abhishek Jha et.al.	2404.13693	link
2024-04-21	A Complete System for Automated 3D Semantic-Geometric Mapping of Corrosion in Industrial Environments	Rui Pimentel de Figueiredo et.al.	2404.13691	null
2024-04-21	LMFNet: An Efficient Multimodal Fusion Approach for Semantic Segmentation in High-Resolution Remote Sensing	Tong Wang et.al.	2404.13659	null
2024-04-21	Towards Unified Representation of Multi-Modal Pre-training for 3D Understanding via Differentiable Rendering	Ben Fei et.al.	2404.13619	null
2024-04-20	AMMUNet: Multi-Scale Attention Map Merging for Remote Sensing Image Segmentation	Yang Yang et.al.	2404.13408	link
2024-04-19	BACS: Background Aware Continual Semantic Segmentation	Mostafa ElAraby et.al.	2404.13148	link
2024-04-19	ToNNO: Tomographic Reconstruction of a Neural Network’s Output for Weakly Supervised Segmentation of 3D Medical Images	Marius Schmidt-Mengin et.al.	2404.13103	null
2024-04-19	Foundation Model assisted Weakly Supervised LiDAR Semantic Segmentation	Yilong Chen et.al.	2404.12861	null
2024-04-19	COIN: Counterfactual inpainting for weakly supervised semantic segmentation for medical images	Dmytro Shvetsov et.al.	2404.12832	link
2024-04-19	A Point-Based Approach to Efficient LiDAR Multi-Task Perception	Christopher Lang et.al.	2404.12798	null
2024-04-19	Generalized Few-Shot Meets Remote Sensing: Discovering Novel Classes in Land Cover Mapping via Hybrid Semantic Segmentation Framework	Zhuohong Li et.al.	2404.12721	link
2024-04-19	Improving Prediction Accuracy of Semantic Segmentation Methods Using Convolutional Autoencoder Based Pre-processing Layers	Hisashi Shimodaira et.al.	2404.12718	null
2024-04-19	Show and Grasp: Few-shot Semantic Segmentation for Robot Grasping through Zero-shot Foundation Models	Leonardo Barcellona et.al.	2404.12717	null
2024-04-18	A Perspective on Deep Vision Performance with Standard Image and Video Codecs	Christoph Reich et.al.	2404.12330	null
2024-04-18	Deep Gaussian mixture model for unsupervised image segmentation	Matthias Schwab et.al.	2404.12252	link
2024-04-18	Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training	Jin Gao et.al.	2404.12210	link
2024-04-18	How to Benchmark Vision Foundation Models for Semantic Segmentation?	Tommie Kerssies et.al.	2404.12172	link
2024-04-19	Tendency-driven Mutual Exclusivity for Weakly Supervised Incremental Semantic Segmentation	Chongjie Si et.al.	2404.11981	null
2024-04-18	Group-On: Boosting One-Shot Segmentation with Supportive Query	Hanjing Zhou et.al.	2404.11871	null
2024-04-17	Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach	Mir Rayat Imtiaz Hossain et.al.	2404.11732	null
2024-04-17	A Semantic Segmentation-guided Approach for Ground-to-Aerial Image Matching	Francesco Pro et.al.	2404.11302	link
2024-04-17	Learning from Unlabelled Data with Transformers: Domain Adaptation for Semantic Segmentation of High Resolution Aerial Images	Nikolaos Dionelis et.al.	2404.11299	link
2024-04-16	A Concise Tiling Strategy for Preserving Spatial Context in Earth Observation Imagery	Ellianna Abrahams et.al.	2404.10927	link
2024-04-16	Vocabulary-free Image Classification and Semantic Segmentation	Alessandro Conti et.al.	2404.10864	link
2024-04-16	Gasformer: A Transformer-based Architecture for Segmenting Methane Emissions from Livestock in Optical Gas Imaging	Toqi Tahamid Sarker et.al.	2404.10841	link
2024-04-16	Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark	Jiangning Zhang et.al.	2404.10760	link
2024-04-16	ECLAIR: A High-Fidelity Aerial LiDAR Dataset for Semantic Segmentation	Iaroslav Melekhov et.al.	2404.10699	link
2024-04-16	Contextrast: Contextual Contrastive Learning for Semantic Segmentation	Changki Sung et.al.	2404.10633	null
2024-04-16	Label merge-and-split: A graph-colouring approach for memory-efficient brain parcellation	Aaron Kujawa et.al.	2404.10572	null
2024-04-16	LAECIPS: Large Vision Model Assisted Adaptive Edge-Cloud Collaboration for IoT-based Perception System	Shijing Hu et.al.	2404.10498	null
2024-04-16	Adversarial Identity Injection for Semantic Face Image Synthesis	Giuseppe Tarollo et.al.	2404.10408	null
2024-04-16	Domain-Rectifying Adapter for Cross-Domain Few-Shot Segmentation	Jiapeng Su et.al.	2404.10322	link
2024-04-16	Learnable Prompt for Few-Shot Semantic Segmentation in Remote Sensing Domain	Steve Andreas Immanuel et.al.	2404.10307	link
2024-04-15	Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL	Fangwei Zhong et.al.	2404.09857	null
2024-04-15	In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation	Han Xue et.al.	2404.09633	null
2024-04-15	The revenge of BiSeNet: Efficient Multi-Task Image Segmentation	Gabriele Rosi et.al.	2404.09570	null
2024-04-16	Human-in-the-Loop Segmentation of Multi-species Coral Imagery	Scarlett Raine et.al.	2404.09406	link
2024-04-14	Bridging Data Islands: Geographic Heterogeneity-Aware Federated Learning for Collaborative Remote Sensing Semantic Segmentation	Jieyi Tan et.al.	2404.09292	null
2024-04-12	Analyzing Decades-Long Environmental Changes in Namibia Using Archival Aerial Photography and Deep Learning	Girmaw Abebe Tadesse et.al.	2404.08544	null
2024-04-12	LaSagnA: Language-based Segmentation Assistant for Complex Queries	Cong Wei et.al.	2404.08506	link
2024-04-12	Tackling Ambiguity from Perspective of Uncertainty Inference and Affinity Diversification for Weakly Supervised Semantic Segmentation	Zhiwei Yang et.al.	2404.08195	link
2024-04-12	Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation	Sina Hajimiri et.al.	2404.08181	link
2024-04-10	AI-Guided Feature Segmentation Techniques to Model Features from Single Crystal Diamond Growth	Rohan Reddy Mekala et.al.	2404.08017	null
2024-04-11	Exploiting Object-based and Segmentation-based Semantic Features for Deep Learning-based Indoor Scene Classification	Ricardo Pereira et.al.	2404.07739	null
2024-04-11	OpenTrench3D: A Photogrammetric 3D Point Cloud Dataset for Semantic Segmentation of Underground Utilities	Lasse H. Hansen et.al.	2404.07711	link
2024-04-11	Implicit and Explicit Language Guidance for Diffusion-based Visual Perception	Hefeng Wang et.al.	2404.07600	null
2024-04-11	Improving Shift Invariance in Convolutional Neural Networks with Translation Invariant Polyphase Sampling	Sourajit Saha et.al.	2404.07410	link
2024-04-10	AI-Guided Defect Detection Techniques to Model Single Crystal Diamond Growth	Rohan Reddy Mekala et.al.	2404.07306	null
2024-04-10	RESSCAL3D: Resolution Scalable 3D Semantic Segmentation of Point Clouds	Remco Royen et.al.	2404.06863	null
2024-04-10	O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation	Muer Tie et.al.	2404.06836	null
2024-04-10	Convolution-based Probability Gradient Loss for Semantic Segmentation	Guohang Shan et.al.	2404.06704	link
2024-04-09	Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation	Luca Barsellotti et.al.	2404.06542	null
2024-04-09	QueSTMaps: Queryable Semantic Topological Maps for 3D Scene Understanding	Yash Mehan et.al.	2404.06442	null
2024-04-09	DaF-BEVSeg: Distortion-aware Fisheye Camera based Bird’s Eye View Segmentation with Occlusion Reasoning	Senthil Yogamani et.al.	2404.06352	null
2024-04-09	Hierarchical Insights: Exploiting Structural Similarities for Reliable 3D Semantic Segmentation	Mariella Dreissig et.al.	2404.06124	null
2024-04-09	Improving Facial Landmark Detection Accuracy and Efficiency with Knowledge Distillation	Zong-Wei Hong et.al.	2404.06029	null
2024-04-08	Evaluating the Efficacy of Cut-and-Paste Data Augmentation in Semantic Segmentation for Satellite Imagery	Ionut M. Motoi et.al.	2404.05693	link
2024-04-08	AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation	Jiannan Ge et.al.	2404.05667	null
2024-04-08	Impact of LiDAR visualisations on semantic segmentation of archaeological objects	Raveerat Jaturapitpornchai et.al.	2404.05512	null
2024-04-08	Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance	Dazhong Shen et.al.	2404.05384	link
2024-04-08	GPS-free Autonomous Navigation in Cluttered Tree Rows with Deep Semantic Segmentation	Alessandro Navone et.al.	2404.05338	null
2024-04-08	Human Detection from 4D Radar Data in Low-Visibility Field Conditions	Mikael Skog et.al.	2404.05307	null
2024-04-08	iVPT: Improving Task-relevant Information Sharing in Visual Prompt Tuning by Cross-layer Dynamic Connection	Nan Zhou et.al.	2404.05207	null
2024-04-08	UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather	Haimei Zhao et.al.	2404.05145	null
2024-04-07	D2SL: Decouple Defogging and Semantic Learning for Foggy Domain-Adaptive Segmentation	Xuan Sun et.al.	2404.04807	null
2024-04-06	HawkDrive: A Transformer-driven Visual Perception System for Autonomous Driving in Night Scene	Ziang Guo et.al.	2404.04653	link
2024-04-06	Panoptic Perception: A Novel Task and Fine-grained Dataset for Universal Remote Sensing Image Interpretation	Danpei Zhao et.al.	2404.04608	null
2024-04-06	PIE: Physics-inspired Low-light Enhancement	Dong Liang et.al.	2404.04586	null
2024-04-06	Frequency Decomposition-Driven Unsupervised Domain Adaptation for Remote Sensing Image Semantic Segmentation	Xianping Ma et.al.	2404.04531	link
2024-04-05	Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation	Zifu Wan et.al.	2404.04256	link
2024-04-05	Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation	Ji-Jia Wu et.al.	2404.04231	link
2024-04-05	MarsSeg: Mars Surface Semantic Segmentation with Multi-level Extractor and Connector	Junbo Li et.al.	2404.04155	null
2024-04-04	Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation	Elham Amin Mansour et.al.	2404.03799	null
2024-04-04	Flattening the Parent Bias: Hierarchical Semantic Segmentation in the Poincaré Ball	Simon Weber et.al.	2404.03778	link
2024-04-09	Background Noise Reduction of Attention Map for Weakly Supervised Semantic Segmentation	Izumi Fujimori et.al.	2404.03394	null
2024-04-03	GPU-Accelerated RSF Level Set Evolution for Large-Scale Microvascular Segmentation	Meher Niger et.al.	2404.02813	null
2024-04-03	RS-Mamba for Large Remote Sensing Image Dense Prediction	Sijie Zhao et.al.	2404.02668	link
2024-04-03	A Satellite Band Selection Framework for Amazon Forest Deforestation Detection Task	Eduardo Neto et.al.	2404.02659	null
2024-04-03	SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation	Junyan Ye et.al.	2404.02638	link
2024-04-03	Active learning for efficient annotation in precision agriculture: a use-case on crop-weed semantic segmentation	Bart M. van Marrewijk et.al.	2404.02580	null
2024-04-03	HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras	Zhongyu Xia et.al.	2404.02517	link
2024-04-03	Optimizing traffic signs and lights visibility for the teleoperation of autonomous vehicles through ROI compression	I. Dror et.al.	2404.02481	null
2024-04-03	RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation	Xianping Ma et.al.	2404.02457	link
2024-04-02	Constrained Robotic Navigation on Preferred Terrains Using LLMs and Speech Instruction: Exploiting the Power of Adverbs	Faraz Lotfi et.al.	2404.02294	null
2024-04-01	Versatile Navigation under Partial Observability via Value-guided Diffusion Policy	Gengyu Zhang et.al.	2404.02176	null
2024-04-02	Multi-Level Label Correction by Distilling Proximate Patterns for Semi-supervised Semantic Segmentation	Hui Xiao et.al.	2404.02065	null
2024-04-02	Synthetic Data for Robust Stroke Segmentation	Liam Chalcroft et.al.	2404.01946	link
2024-04-02	Improving Bird’s Eye View Semantic Segmentation by Task Decomposition	Tianhao Zhao et.al.	2404.01925	link
2024-04-02	Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model	Qinfeng Zhu et.al.	2404.01705	link
2024-04-04	Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss	Jaeha Kim et.al.	2404.01692	link
2024-04-01	PDF: A Probability-Driven Framework for Open World 3D Point Cloud Semantic Segmentation	Jinfeng Xu et.al.	2404.00979	link
2024-04-01	GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields	Yunsong Wang et.al.	2404.00931	link
2024-04-02	Rethinking Saliency-Guided Weakly-Supervised Semantic Segmentation	Beomyoung Kim et.al.	2404.00918	link
2024-03-31	Training-Free Semantic Segmentation via LLM-Supervision	Wenfang Sun et.al.	2404.00701	null
2024-03-31	LAESI: Leaf Area Estimation with Synthetic Imagery	Jacek Kałużny et.al.	2404.00593	null
2024-03-29	Modeling Weather Uncertainty for Multi-weather Co-Presence Estimation	Qi Bi et.al.	2403.20092	null
2024-03-29	MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection	Ali Behrouz et.al.	2403.19888	null
2024-03-28	Segmentation Re-thinking Uncertainty Estimation Metrics for Semantic Segmentation	Qitian Ma et.al.	2403.19826	null
2024-03-28	ENet-21: An Optimized light CNN Structure for Lane Detection	Seyed Rasoul Hosseini et.al.	2403.19782	null
2024-03-29	Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers	Pingcheng Dong et.al.	2403.19591	link
2024-03-28	DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs	Donghyun Kim et.al.	2403.19588	link
2024-03-28	Learning Multiple Representations with Inconsistency-Guided Detail Regularization for Mask-Guided Matting	Weihao Jiang et.al.	2403.19213	null
2024-03-27	Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D	Mukund Varma T et.al.	2403.18922	null
2024-03-27	I2CKD : Intra- and Inter-Class Knowledge Distillation for Semantic Segmentation	Ayoub Karine et.al.	2403.18490	null
2024-03-28	ViTAR: Vision Transformer with Any Resolution	Qihang Fan et.al.	2403.18361	null
2024-03-27	Generating Diverse Agricultural Data for Vision-Based Farming Applications	Mikolaj Cieslak et.al.	2403.18351	null
2024-03-27	Road Obstacle Detection based on Unknown Objectness Scores	Chihiro Noguchi et.al.	2403.18207	null
2024-03-26	The Need for Speed: Pruning Transformers with One Recipe	Samir Khaki et.al.	2403.17921	link
2024-03-26	Compressed Multi-task embeddings for Data-Efficient Downstream training and inference in Earth Observation	Carlos Gomes et.al.	2403.17886	link
2024-03-26	PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition	Chenhongyi Yang et.al.	2403.17695	link
2024-03-26	Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Legion	Kazi Shahriar Sanjid et.al.	2403.17432	null
2024-03-25	Optimizing LiDAR Placements for Robust Driving Perception in Adverse Conditions	Ye Li et.al.	2403.17009	link
2024-03-25	DreamLIP: Language-Image Pre-training with Long Captions	Kecheng Zheng et.al.	2403.17007	link
2024-03-25	TwinLiteNetPlus: A Stronger Model for Real-time Drivable Area and Lane Segmentation	Quang-Huy Che et.al.	2403.16958	link
2024-03-25	HPL-ESS: Hybrid Pseudo-Labeling for Unsupervised Event-based Semantic Segmentation	Linglin Jing et.al.	2403.16788	null
2024-03-25	SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation	Aysim Toker et.al.	2403.16605	null
2024-03-25	Self-Supervised Learning for Medical Image Data with Anatomy-Oriented Imaging Planes	Tianwei Zhang et.al.	2403.16499	null
2024-03-25	GoodSAM: Bridging Domain and Capacity Gaps via Segment Anything Model for Distortion-aware Panoramic Semantic Segmentation	Weiming Zhang et.al.	2403.16370	null
2024-03-24	Dual-modal Prior Semantic Guided Infrared and Visible Image Fusion for Intelligent Transportation System	Jing Li et.al.	2403.16227	null
2024-03-24	Segment Anything Model for Road Network Graph Extraction	Congrui Hetang et.al.	2403.16051	link
2024-03-24	SM2C: Boost the Semi-supervised Segmentation for Medical Image by using Meta Pseudo Labels and Mixed Images	Yifei Wang et.al.	2403.16009	null
2024-03-22	Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting	Jun Guo et.al.	2403.15624	null
2024-03-22	A2DMN: Anatomy-Aware Dilated Multiscale Network for Breast Ultrasound Semantic Segmentation	Kyle Lucke et.al.	2403.15560	null
2024-03-22	InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding	Yi Wang et.al.	2403.15377	link
2024-03-22	Anytime, Anywhere, Anyone: Investigating the Feasibility of Segment Anything Model for Crowd-Sourcing Medical Image Annotations	Pranav Kulkarni et.al.	2403.15218	link
2024-03-22	Your Image is My Video: Reshaping the Receptive Field via Image-To-Video Differentiable AutoAugmentation and Fusion	Sofia Casarin et.al.	2403.15194	null
2024-03-22	Improve Cross-domain Mixed Sampling with Guidance Training for Adaptive Segmentation	Wenlve Zhou et.al.	2403.14995	link
2024-03-21	WeatherProof: Leveraging Language Guidance for Semantic Segmentation in Adverse Weather	Blake Gella et.al.	2403.14874	null
2024-03-21	Learning to Project for Cross-Task Knowledge Distillation	Dylan Auty et.al.	2403.14494	null
2024-03-21	OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation	Bohao Peng et.al.	2403.14418	link
2024-03-21	Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models	Pablo Marcos-Manchón et.al.	2403.14291	link
2024-03-21	OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation	Kwanyoung Kim et.al.	2403.14183	link
2024-03-21	Evidential Semantic Mapping in Off-road Environments with Uncertainty-aware Bayesian Kernel Inference	Junyoung Kim et.al.	2403.14138	null
2024-03-21	Soft Masked Transformer for Point Cloud Processing with Skip Attention-Based Upsampling	Yong He et.al.	2403.14124	null
2024-03-21	Semantics from Space: Satellite-Guided Thermal Semantic Segmentation Annotation for Aerial Field Robots	Connor Lee et.al.	2403.14056	null
2024-03-20	When Cars meet Drones: Hyperbolic Federated Learning for Source-Free Domain Adaptation in Adverse Weather	Giulia Rizzoli et.al.	2403.13762	link
2024-03-20	Next day fire prediction via semantic segmentation	Konstantinos Alexis et.al.	2403.13545	null
2024-03-20	MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining	Di Wang et.al.	2403.13430	link
2024-03-20	AMCO: Adaptive Multimodal Coupling of Vision and Proprioception for Quadruped Robot Navigation in Outdoor Environments	Mohamed Elnoor et.al.	2403.13235	null
2024-03-20	Modeling the Label Distributions for Weakly-Supervised Semantic Segmentation	Linshan Wu et.al.	2403.13225	link
2024-03-19	Reflectivity Is All You Need!: Advancing LiDAR Semantic Segmentation	Kasi Viswanath et.al.	2403.13188	link
2024-03-19	As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks?	Anjun Hu et.al.	2403.12693	null
2024-03-19	PCT: Perspective Cue Training Framework for Multi-Camera BEV Segmentation	Haruya Ishikawa et.al.	2403.12530	null
2024-03-19	Semantics, Distortion, and Style Matter: Towards Source-free UDA for Panoramic Segmentation	Xu Zheng et.al.	2403.12505	null
2024-03-18	Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation	Wangbo Zhao et.al.	2403.11808	link
2024-03-22	LSKNet: A Foundation Lightweight Backbone for Remote Sensing	Yuxuan Li et.al.	2403.11735	link
2024-03-18	TTT-KD: Test-Time Training for 3D Semantic Segmentation through Knowledge Distillation from Foundation Models	Lisa Weijler et.al.	2403.11691	null
2024-03-18	OurDB: Ouroboric Domain Bridging for Multi-Target Domain Adaptive Semantic Segmentation	Seungbeom Woo et.al.	2403.11582	null
2024-03-18	MCD: Diverse Large-Scale Multi-Campus Dataset for Robot Perception	Thien-Minh Nguyen et.al.	2403.11496	null
2024-03-18	Uncertainty-Calibrated Test-Time Model Adaptation without Forgetting	Mingkui Tan et.al.	2403.11491	null
2024-03-17	TAG: Guidance-free Open-Vocabulary Semantic Segmentation	Yasufumi Kawano et.al.	2403.11197	link
2024-03-17	MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation	Yasufumi Kawano et.al.	2403.11194	link
2024-03-17	DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation	Yuanchen Wu et.al.	2403.11184	link
2024-03-17	LERENet: Eliminating Intra-class Differences for Metal Surface Defect Few-shot Semantic Segmentation	Hanze Ding et.al.	2403.11122	null
2024-03-17	Adaptive Semantic-Enhanced Denoising Diffusion Probabilistic Model for Remote Sensing Image Super-Resolution	Jialu Sui et.al.	2403.11078	link
2024-03-17	Intelligent Railroad Grade Crossing: Leveraging Semantic Segmentation and Object Detection for Enhanced Safety	Al Amin et.al.	2403.11060	null
2024-03-16	Fuzzy Rank-based Late Fusion Technique for Cytology image Segmentation	Soumyajyoti Dey et.al.	2403.10884	null
2024-03-16	Active Label Correction for Semantic Segmentation with Foundation Models	Hoyoung Kim et.al.	2403.10820	link
2024-03-15	SwinMTL: A Shared Architecture for Simultaneous Depth Estimation and Semantic Segmentation from Monocular Camera Images	Pardis Taghavi et.al.	2403.10662	link
2024-03-15	FeatUp: A Model-Agnostic Framework for Features at Any Resolution	Stephanie Fu et.al.	2403.10516	link
2024-03-15	Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search	Hongyuan Yu et.al.	2403.10413	link
2024-03-15	Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning	Meixuan Li et.al.	2403.10252	null
2024-03-15	Exploring Optical Flow Inclusion into nnU-Net Framework for Surgical Instrument Segmentation	Marcos Fernández-Rodríguez et.al.	2403.10216	null
2024-03-15	TransLandSeg: A Transfer Learning Approach for Landslide Semantic Segmentation Based on Vision Foundation Model	Changhong Hou et.al.	2403.10127	null
2024-03-15	Visual Foundation Models Boost Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation	Jingyi Xu et.al.	2403.10001	link
2024-03-14	WeakSurg: Weakly supervised surgical instrument segmentation using temporal equivariance and semantic continuity	Qiyuan Wang et.al.	2403.09551	null
2024-03-14	Annotation Free Semantic Segmentation with Vision Foundation Models	Soroush Seifi et.al.	2403.09307	null
2024-03-14	When Semantic Segmentation Meets Frequency Aliasing	Linwei Chen et.al.	2403.09065	link
2024-03-13	CART: Caltech Aerial RGB-Thermal Dataset in the Wild	Connor Lee et.al.	2403.08997	link
2024-03-13	SLCF-Net: Sequential LiDAR-Camera Fusion for Semantic Scene Completion using a 3D Recurrent U-Net	Helin Cao et.al.	2403.08885	link
2024-03-13	Segmentation of Knee Bones for Osteoarthritis Assessment: A Comparative Analysis of Supervised, Few-Shot, and Zero-Shot Learning Approaches	Yun Xin Teoh et.al.	2403.08761	null
2024-03-13	Real-time 3D semantic occupancy prediction for autonomous vehicles using memory-efficient sparse convolution	Samuel Sze et.al.	2403.08748	null
2024-03-13	Semantic Segmentation of Solar Radio Spikes at Low Frequencies	Pearse C. Murphy et.al.	2403.08546	null
2024-03-13	Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation	Zicheng Zhang et.al.	2403.08426	null
2024-03-13	LIX: Implicitly Infusing Spatial Geometric Prior Knowledge into Visual Semantic Segmentation for Autonomous Driving	Sicen Guo et.al.	2403.08215	null
2024-03-13	Multiscale Low-Frequency Memory Network for Improved Feature Extraction in Convolutional Neural Networks	Fuzhi Wu et.al.	2403.08157	link
2024-03-12	Mitigating the Impact of Attribute Editing on Face Recognition	Sudipta Banerjee et.al.	2403.08092	null
2024-03-12	Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation	Feilong Tang et.al.	2403.07630	link
2024-03-12	PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution	Honghao Chen et.al.	2403.07589	null
2024-03-12	Open-World Semantic Segmentation Including Class Similarity	Matteo Sodano et.al.	2403.07532	link
2024-03-11	Average Calibration Error: A Differentiable Loss for Improved Reliability in Image Segmentation	Theodore Barfoot et.al.	2403.06759	link
2024-03-11	Forest Inspection Dataset for Aerial Semantic Segmentation and Depth Estimation	Bianca-Cerasela-Zelia Blaga et.al.	2403.06621	null
2024-03-11	OMH: Structured Sparsity via Optimally Matched Hierarchy for Unsupervised Semantic Segmentation	Baran Ozaydin et.al.	2403.06546	null
2024-03-11	3D Semantic Segmentation-Driven Representations for 3D Object Detection	Hayeon O et.al.	2403.06501	link
2024-03-11	Point Mamba: A Novel Point Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy	Jiuming Liu et.al.	2403.06467	link
2024-03-14	Towards the Uncharted: Density-Descending Feature Perturbation for Semi-supervised Semantic Segmentation	Xiaoyang Wang et.al.	2403.06462	link
2024-03-11	Refining Segmentation On-the-Fly: An Interactive Framework for Point Cloud Semantic Segmentation	Peng Zhang et.al.	2403.06401	null
2024-03-10	Style Blind Domain Generalized Semantic Segmentation via Covariance Alignment and Semantic Consistence Contrastive Learning	Woo-Jin Ahn et.al.	2403.06122	link
2024-03-09	Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation	Hairong Shi et.al.	2403.05912	link
2024-03-08	Attention-guided Feature Distillation for Semantic Segmentation	Amir M. Mansourian et.al.	2403.05451	link
2024-03-08	Generalized Correspondence Matching via Flexible Hierarchical Refinement and Patch Descriptor Distillation	Yu Han et.al.	2403.05388	null
2024-03-12	Frequency-Adaptive Dilated Convolution for Semantic Segmentation	Linwei Chen et.al.	2403.05369	link
2024-03-08	Embedded Deployment of Semantic Segmentation in Medicine through Low-Resolution Inputs	Erik Ostrowski et.al.	2403.05340	null
2024-03-08	LVIC: Multi-modality segmentation by Lifting Visual Info as Cue	Zichao Dong et.al.	2403.05159	null
2024-03-06	ECAP: Extensive Cut-and-Paste Augmentation for Unsupervised Domain Adaptive Semantic Segmentation	Erik Brorsson et.al.	2403.03854	link
2024-03-06	Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision	Yajie Liu et.al.	2403.03707	null
2024-03-06	Causal Prototype-inspired Contrast Adaptation for Unsupervised Domain Adaptive Semantic Segmentation of High-resolution Remote Sensing Imagery	Jingru Zhu et.al.	2403.03704	null
2024-03-06	GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding	Zi-Ting Chou et.al.	2403.03608	null
2024-03-06	Multi-task Learning for Real-time Autonomous Driving Leveraging Task-adaptive Attention Generator	Wonhyeok Choi et.al.	2403.03468	null
2024-03-05	Improved LiDAR Odometry and Mapping using Deep Semantic Segmentation and Novel Outliers Detection	Mohamed Afifi et.al.	2403.03111	null
2024-03-05	ActiveAD: Planning-Oriented Active Learning for End-to-End Autonomous Driving	Han Lu et.al.	2403.02877	null
2024-03-05	DDF: A Novel Dual-Domain Image Fusion Strategy for Remote Sensing Image Semantic Segmentation with Unsupervised Domain Adaptation	Lingyan Ran et.al.	2403.02784	null
2024-03-08	Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical Labels	Zhuohong Li et.al.	2403.02746	link
2024-03-05	FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird’s-Eye View and Perspective View	Jiawei Hou et.al.	2403.02710	null
2024-03-05	Deep Common Feature Mining for Efficient Video Semantic Segmentation	Yaoyan Zheng et.al.	2403.02689	link
2024-03-04	Self-Supervised Facial Representation Learning with Facial Region Awareness	Zheng Gao et.al.	2403.02138	null
2024-03-04	Semi-Supervised Semantic Segmentation Based on Pseudo-Labels: A Survey	Lingyan Ran et.al.	2403.01909	null
2024-03-04	Map-aided annotation for pole base detection	Benjamin Missaoui et.al.	2403.01868	null
2024-03-06	AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation	Haonan Wang et.al.	2403.01818	link
2024-03-03	EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation	Chanyoung Kim et.al.	2403.01482	link
2024-03-02	Benchmarking Segmentation Models with Mask-Preserved Attribute Editing	Zijin Yin et.al.	2403.01231	link
2024-03-02	Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation	Lian Xu et.al.	2403.01156	null
2024-03-01	Rethinking Few-shot 3D Point Cloud Semantic Segmentation	Zhaochong An et.al.	2403.00592	link
2024-03-01	Small, Versatile and Mighty: A Range-View Perception Framework	Qiang Meng et.al.	2403.00325	null
2024-03-01	YOLO-MED : Multi-Task Interaction Network for Biomedical Images	Suizhi Huang et.al.	2403.00245	null
2024-02-29	FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything	Safouane El Ghazouali et.al.	2403.00175	link
2024-02-29	RSAM-Seg: A SAM-based Approach with Prior Knowledge Integration for Remote Sensing Image Semantic Segmentation	Jie Zhang et.al.	2402.19004	null
2024-02-28	Spatial Coherence Loss for Salient and Camouflaged Object Detection and Beyond	Ziyun Yang et.al.	2402.18698	null
2024-02-29	Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation	Zhiwei Yang et.al.	2402.18467	link
2024-02-29	A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation	Francesco Barbato et.al.	2402.18402	link
2024-02-28	Enhancing Roadway Safety: LiDAR-based Tree Clearance Analysis	Miriam Louise Carnot et.al.	2402.18309	null
2024-02-28	Self-Supervised Learning in Electron Microscopy: Towards a Foundation Model for Advanced Image Analysis	Bashir Kazimi et.al.	2402.18286	null
2024-02-28	PRCL: Probabilistic Representation Contrastive Learning for Semi-Supervised Semantic Segmentation	Haoyu Xie et.al.	2402.18117	null
2024-02-28	Spannotation: Enhancing Semantic Segmentation for Autonomous Navigation with Efficient Image Annotation	Samuel O. Folorunsho et.al.	2402.18084	link
2024-02-27	Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation	Xinyu Yang et.al.	2402.17891	link
2024-02-27	Mitigating Distributional Shift in Semantic Segmentation via Uncertainty Estimation from Unlabelled Data	David S. W. Williams et.al.	2402.17653	null
2024-02-27	Masked Gamma-SSL: Learning Uncertainty Estimation via Masked Image Modeling	David S. W. Williams et.al.	2402.17622	null
2024-02-27	A Large-scale Evaluation of Pretraining Paradigms for the Detection of Defects in Electroluminescence Solar Cell Images	David Torpey et.al.	2402.17611	null
2024-02-27	Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation with Its Class Label	Xinliang Zhang et.al.	2402.17555	link
2024-02-26	ConSept: Continual Semantic Segmentation via Adapter-based Vision Transformer	Bowen Dong et.al.	2402.16674	null
2024-02-26	UN-SAM: Universal Prompt-Free Segmentation for Generalized Nuclei Images	Zhen Chen et.al.	2402.16663	link
2024-02-26	Placing Objects in Context via Inpainting for Out-of-distribution Segmentation	Pau de Jorge et.al.	2402.16392	link
2024-02-29	BLO-SAM: Bi-level Optimization Based Overfitting-Preventing Finetuning of SAM	Li Zhang et.al.	2402.16338	link
2024-02-23	Modified CycleGAN for the synthesization of samples for wheat head segmentation	Jaden Myers et.al.	2402.15135	null
2024-02-22	Semantic Image Synthesis with Unconditional Generator	Jungwoo Chae et.al.	2402.14395	null
2024-02-22	Think before You Leap: Content-Aware Low-Cost Edge-Assisted Video Semantic Segmentation	Mingxuan Yan et.al.	2402.14326	null
2024-02-21	Tumor segmentation on whole slide images: training or prompting?	Huaqian Wu et.al.	2402.13932	null
2024-02-26	BenchCloudVision: A Benchmark Analysis of Deep Learning Approaches for Cloud Detection and Segmentation in Remote Sensing Imagery	Loddo Fabio et.al.	2402.13918	link
2024-02-21	Zero-BEV: Zero-shot Projection of Any First-Person Modality to BEV Maps	Gianluca Monaci et.al.	2402.13848	null
2024-02-21	Generalizable Semantic Vision Query Generation for Zero-shot Panoptic and Semantic Segmentation	Jialei Chen et.al.	2402.13697	null
2024-02-20	Cross-Domain Transfer Learning with CoRTe: Consistent and Reliable Transfer from Black-Box to Lightweight Segmentation Model	Claudia Cuttano et.al.	2402.13122	null
2024-02-19	LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks	Truong Thanh Hung Nguyen et.al.	2402.12525	link
2024-02-19	Towards Explainable LiDAR Point Cloud Semantic Segmentation via Gradient Based Target Localization	Abhishek Kuriyal et.al.	2402.12098	link
2024-02-19	ISCUTE: Instance Segmentation of Cables Using Text Embedding	Shir Kozlovsky et.al.	2402.11996	null
2024-02-18	Key Patch Proposer: Key Patches Contain Rich Information	Jing Xu et.al.	2402.11458	link
2024-02-17	ChatEarthNet: A Global-Scale, High-Quality Image-Text Dataset for Remote Sensing	Zhenghang Yuan et.al.	2402.11325	link
2024-02-17	A Decoding Scheme with Successive Aggregation of Multi-Level Features for Light-Weight Semantic Segmentation	Jiwon Yoo et.al.	2402.11201	null
2024-02-16	HistoSegCap: Capsules for Weakly-Supervised Semantic Segmentation of Histological Tissue Type in Whole Slide Images	Mobina Mansoori et.al.	2402.10851	null
2024-02-16	Selective Prediction for Semantic Segmentation using Post-Hoc Confidence Estimation and Its Performance under Distribution Shift	Bruno Laboissiere Camargos Borges et.al.	2402.10665	null
2024-02-16	Efficient Multi-task Uncertainties for Joint Semantic Segmentation and Monocular Depth Estimation	Steven Landgraf et.al.	2402.10580	null
2024-02-15	Is Continual Learning Ready for Real-world Challenges?	Theodora Kontogianni et.al.	2402.10130	null
2024-02-15	Robust semi-automatic vessel tracing in the human retinal image by an instance segmentation neural network	Siyi Chen et.al.	2402.10055	null
2024-02-22	MM-Point: Multi-View Information-Enhanced Multi-Modal Self-Supervised 3D Point Cloud Understanding	Hai-Tao Yu et.al.	2402.10002	link
2024-02-14	Automated Plaque Detection and Agatston Score Estimation on Non-Contrast CT Scans: A Multicenter Study	Andrew M. Nguyen et.al.	2402.09569	null
2024-02-14	Reducing Texture Bias of Deep Neural Networks via Edge Enhancing Diffusion	Edgar Heinert et.al.	2402.09530	link
2024-02-13	Adaptive Hierarchical Certification for Segmentation using Randomized Smoothing	Alaa Anani et.al.	2402.08400	link
2024-02-13	Improving Image Coding for Machines through Optimizing Encoder via Auxiliary Loss	Kei Iino et.al.	2402.08267	null
2024-02-12	Semantic segmentation for recognition of epileptiform patterns recorded via Microelectrode Arrays in vitro	Gabriel Galeote-Checa et.al.	2402.08099	null
2024-02-11	Data Quality Aware Approaches for Addressing Model Drift of Semantic Segmentation Models	Samiha Mirza et.al.	2402.07258	null
2024-02-09	More than the Sum of Its Parts: Ensembling Backbone Networks for Few-Shot Segmentation	Nico Catalano et.al.	2402.06581	null
2024-02-09	Hybridnet for depth estimation and semantic segmentation	Dalila Sánchez-Escobedo et.al.	2402.06539	null
2024-02-09	Classifying point clouds at the facade-level using geometric features and deep learning networks	Yue Tan et.al.	2402.06506	link
2024-02-09	ControlUDA: Controllable Diffusion-assisted Unsupervised Domain Adaptation for Cross-Weather Semantic Segmentation	Fengyi Shen et.al.	2402.06446	null
2024-02-08	Early Fusion of Features for Semantic Segmentation	Anupam Gupta et.al.	2402.06091	null
2024-02-08	Privacy-Preserving Synthetic Continual Semantic Segmentation for Robotic Surgery	Mengya Xu et.al.	2402.05860	link
2024-02-08	On the Effect of Image Resolution on Semantic Segmentation	Ritambhara Singh et.al.	2402.05398	null
2024-02-07	Multi-Scale Semantic Segmentation with Modified MBConv Blocks	Xi Chen et.al.	2402.04618	null
2024-02-06	Energy-based Domain-Adaptive Segmentation with Depth Guidance	Jinjing Zhu et.al.	2402.03795	null
2024-02-05	SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM	Mingrui Li et.al.	2402.03246	link
2024-02-05	RRWNet: Recursive Refinement Network for Effective Retinal Artery/Vein Segmentation and Classification	José Morano et.al.	2402.03166	link
2024-02-05	Unsupervised semantic segmentation of high-resolution UAV imagery for road scene parsing	Zihan Ma et.al.	2402.02985	link
2024-02-04	M$^3$Face: A Unified Multi-Modal Multilingual Framework for Human Face Generation and Editing	Mohammadreza Mofayezi et.al.	2402.02369	null
2024-02-04	Exploring Intrinsic Properties of Medical Images for Self-Supervised Binary Semantic Segmentation	Pranav Singh et.al.	2402.02367	null
2024-02-04	Region-Based Representations Revisited	Michal Shlapentokh-Rothman et.al.	2402.02352	link
2024-02-03	Multi-Level Feature Aggregation and Recursive Alignment Network for Real-Time Semantic Segmentation	Yanhua Zhang et.al.	2402.02286	link
2024-02-03	Revisiting Generative Adversarial Networks for Binary Semantic Segmentation on Imbalanced Datasets	Lei Xu et.al.	2402.02245	link
2024-02-03	Evaluating the Robustness of Off-Road Autonomous Driving Segmentation against Adversarial Attacks: A Dataset-Centric analysis	Pankaj Deoli et.al.	2402.02154	link
2024-02-03	Decomposition-based and Interference Perception for Infrared and Visible Image Fusion in Complex Scenes	Xilai Li et.al.	2402.02096	null
2024-02-03	MLIP: Enhancing Medical Visual Representation with Divergence Encoder and Knowledge-guided Contrastive Learning	Zhe Li et.al.	2402.02045	null
2024-02-02	Convolution kernel adaptation to calibrated fisheye	Bruno Berenguel-Baeta et.al.	2402.01456	link
2024-02-02	Delving into Decision-based Black-box Attacks on Semantic Segmentation	Zhaoyu Chen et.al.	2402.01220	null
2024-02-02	Scale Equalization for Multi-Level Feature Fusion	Bum Jun Kim et.al.	2402.01149	link
2024-02-06	We’re Not Using Videos Effectively: An Updated Domain Adaptive Video Segmentation Baseline	Simar Kareer et.al.	2402.00868	link
2024-02-01	Automatic Segmentation of the Spinal Cord Nerve Rootlets	Jan Valosek et.al.	2402.00724	link
2024-02-01	A Framework for Building Point Cloud Cleaning, Plane Detection and Semantic Segmentation	Ilyass Abouelaziz et.al.	2402.00692	null
2024-01-31	Convolution Meets LoRA: Parameter Efficient Finetuning for Segment Anything Model	Zihan Zhong et.al.	2401.17868	link
2024-01-31	Leveraging Swin Transformer for Local-to-Global Weakly Supervised Semantic Segmentation	Rozhan Ahmadi et.al.	2401.17828	link
2024-02-01	Tiered approach for rapid damage characterisation of infrastructure enabled by remote sensing and deep learning technologies	Nadiia Kopiika et.al.	2401.17759	null
2024-01-31	Towards Image Semantics and Syntax Sequence Learning	Chun Tao et.al.	2401.17515	link
2024-01-30	Evaluation of Out-of-Distribution Detection Performance on Autonomous Driving Datasets	Jens Henriksson et.al.	2401.17013	null
2024-01-30	CAFCT: Contextual and Attentional Feature Fusions of Convolutional Neural Networks and Transformer for Liver Tumor Segmentation	Ming Kang et.al.	2401.16886	null
2024-01-29	Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors	Shiyin Dong et.al.	2401.16459	null
2024-01-28	SERNet-Former: Semantic Segmentation by Efficient Residual Network with Attention-Boosting Gates and Attention-Fusion Networks	Serdar Erisen et.al.	2401.15741	link
2024-01-28	UP-CrackNet: Unsupervised Pixel-Wise Road Crack Detection via Adversarial Image Restoration	Nachuan Ma et.al.	2401.15647	null
2024-01-27	Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes	Diandian Guo et.al.	2401.15261	link
2024-01-26	Biological Valuation Map of Flanders: A Sentinel-2 Imagery Analysis	Mingshi Li et.al.	2401.15223	null
2024-01-26	Kitchen Food Waste Image Segmentation and Classification for Compost Nutrients Estimation	Raiyan Rahman et.al.	2401.15175	null
2024-01-26	SSR: SAM is a Strong Regularizer for domain adaptive semantic segmentation	Yanqi Ge et.al.	2401.14686	null
2024-01-25	CloudTracks: A Dataset for Localizing Ship Tracks in Satellite Images of Clouds	Muhammad Ahmed Chaudhry et.al.	2401.14486	null
2024-01-25	Unlocking Past Information: Temporal Embeddings in Cooperative Bird’s Eye View Prediction	Dominik Rößle et.al.	2401.14325	null
2024-01-24	Segment Any Cell: A SAM-based Auto-prompting Fine-tuning Framework for Nuclei Segmentation	Saiyang Na et.al.	2401.13220	null
2024-01-24	Boundary and Relation Distillation for Semantic Segmentation	Dong Zhang et.al.	2401.13174	null
2024-01-23	DatUS^2: Data-driven Unsupervised Semantic Segmentation with Pre-trained Self-supervised Vision Transformer	Sonal Kumar et.al.	2401.12820	link
2024-01-23	Self-Supervised Vision Transformers Are Efficient Segmentation Learners for Imperfect Labels	Seungho Lee et.al.	2401.12535	null
2024-01-23	Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration	Yifan Zhang et.al.	2401.12452	link
2024-01-22	Scaling Up Quantization-Aware Neural Architecture Search for Efficient Deep Learning on the Edge	Yao Lu et.al.	2401.12350	null
2024-01-22	Exploring Simple Open-Vocabulary Semantic Segmentation	Zihang Lai et.al.	2401.12217	link
2024-01-22	Out-of-Distribution Detection & Applications With Ablated Learned Temperature Energy	Will LeVine et.al.	2401.12129	link
2024-01-22	HomeRobot Open Vocabulary Mobile Manipulation Challenge 2023 Participant Report (Team KuzHum)	Volodymyr Kuzma et.al.	2401.12048	null
2024-01-22	SemPLeS: Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation	Ci-Siang Lin et.al.	2401.11791	link
2024-01-22	EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models	Koichi Namekata et.al.	2401.11739	null
2024-01-22	MetaSeg: Content-Aware Meta-Net for Omni-Supervised Semantic Segmentation	Shenwang Jiang et.al.	2401.11738	null
2024-01-22	SFC: Shared Feature Calibration in Weakly Supervised Semantic Segmentation	Xinqiao Zhao et.al.	2401.11719	link
2024-01-21	A Survey on African Computer Vision Datasets, Topics and Researchers	Abdul-Hakeem Omotayo et.al.	2401.11617	link
2024-01-21	Embedded Hyperspectral Band Selection with Adaptive Optimization for Image Semantic Segmentation	Yaniv Zimmer et.al.	2401.11420	null
2024-01-21	S$^3$M-Net: Joint Learning of Semantic Segmentation and Stereo Matching for Autonomous Driving	Zhiyuan Wu et.al.	2401.11414	null
2024-01-21	ANNA: A Deep Learning Based Dataset in Heterogeneous Traffic for Autonomous Vehicles	Mahedi Kamal et.al.	2401.11358	link
2024-01-20	Weakly-Supervised Semantic Segmentation of Circular-Scan, Synthetic-Aperture-Sonar Imagery	Isaac J. Sledge et.al.	2401.11313	null
2024-01-20	A Novel Benchmark for Few-Shot Semantic Segmentation in the Era of Foundation Models	Reda Bensaid et.al.	2401.11311	link
2024-01-20	Spatial Structure Constraints for Weakly Supervised Semantic Segmentation	Tao Chen et.al.	2401.11122	link
2024-01-19	One Step Learning, One Step Review	Xiaolong Huang et.al.	2401.10962	link
2024-01-19	RAD-DINO: Exploring Scalable Medical Image Encoders Beyond Text Supervision	Fernando Pérez-García et.al.	2401.10815	null
2024-01-19	Exploring Color Invariance through Image-Level Ensemble Learning	Yunpeng Gong et.al.	2401.10512	link
2024-01-18	RAP-SAM: Towards Real-Time All-Purpose Segment Anything	Shilin Xu et.al.	2401.10228	link
2024-01-18	Ventricular Segmentation: A Brief Comparison of U-Net Derivatives	Ketan Suhaas Saichandran et.al.	2401.09980	null
2024-01-18	XAI-Enhanced Semantic Segmentation Models for Visual Quality Inspection	Tobias Clement et.al.	2401.09900	null
2024-01-18	Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation	Songhe Deng et.al.	2401.09883	link
2024-01-18	Boosting Few-Shot Semantic Segmentation Via Segment Anything Model	Chen-Bin Feng et.al.	2401.09826	null
2024-01-18	P2Seg: Pointly-supervised Segmentation via Mutual Distillation	Zipeng Wang et.al.	2401.09709	null
2024-01-17	Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model	Lianghui Zhu et.al.	2401.09417	link
2024-01-17	POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images	Antonin Vobecky et.al.	2401.09413	null
2024-01-17	PixelDINO: Semi-Supervised Semantic Segmentation for Detecting Permafrost Disturbances	Konrad Heidler et.al.	2401.09271	link
2024-01-17	Uncertainty estimates for semantic segmentation: providing enhanced reliability for automated motor claims handling	Jan Küchler et.al.	2401.09245	null
2024-01-17	Learning to detect cloud and snow in remote sensing images from noisy labels	Zili Liu et.al.	2401.08932	null
2024-01-16	Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive	Yumeng Li et.al.	2401.08815	link
2024-01-16	ValUES: A Framework for Systematic Validation of Uncertainty Estimation in Semantic Segmentation	Kim-Celine Kahl et.al.	2401.08501	link
2024-01-16	Faster ISNet for Background Bias Mitigation on Deep Neural Networks	Pedro R. A. S. Bassi et.al.	2401.08409	link
2024-01-17	Generative Denoise Distillation: Simple Stochastic Noises Induce Efficient Knowledge Transfer for Dense Prediction	Zhaoge Liu et.al.	2401.08332	link
2024-01-16	End-to-End Optimized Image Compression with the Frequency-Oriented Transform	Yuefeng Zhang et.al.	2401.08194	null
2024-01-16	S3M: Semantic Segmentation Sparse Mapping for UAVs with RGB-D Camera	Thanh Nguyen Canh et.al.	2401.08134	null
2024-01-16	UV-SAM: Adapting Segment Anything Model for Urban Village Identification	Xin Zhang et.al.	2401.08083	link
2024-01-15	Semantic Scene Segmentation for Robotics	Juana Valeria Hurtado et.al.	2401.07589	null
2024-01-15	Compositional Oil Spill Detection Based on Object Detector and Adapted Segment Anything Model from SAR Images	Wenhui Wu et.al.	2401.07502	null
2024-01-15	Semantic Segmentation in Multiple Adverse Weather Conditions with Domain Knowledge Retention	Xin Yang et.al.	2401.07459	null
2024-01-14	Semi-supervised Semantic Segmentation using Redesigned Self-Training for White Blood Cells	Vinh Quoc Luu et.al.	2401.07278	null
2024-01-13	Weak Labeling for Cropland Mapping in Africa	Gilles Quentin Hacheme et.al.	2401.07014	null
2024-01-13	Class-Imbalanced Semi-Supervised Learning for Large-Scale Point Cloud Semantic Segmentation via Decoupling Optimization	Mengtian Li et.al.	2401.06975	null
2024-01-12	Seeing the roads through the trees: A benchmark for modeling spatial dependencies with aerial imagery	Caleb Robinson et.al.	2401.06762	link
2024-01-12	UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding	Bowen Shi et.al.	2401.06397	link
2024-01-11	Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications	Yuwen Xiong et.al.	2401.06197	link
2024-01-09	Generic Knowledge Boosted Pre-training For Remote Sensing Images	Ziyue Huang et.al.	2401.04614	link
2024-01-08	Fully Attentional Networks with Self-emerging Token Labeling	Bingyin Zhao et.al.	2401.03844	link
2024-01-07	SeTformer is What You Need for Vision and Language	Pourya Shamsolmoali et.al.	2401.03540	null
2024-01-06	Multi-View 3D Instance Segmentation of Structural Anomalies for Enhanced Structural Inspection of Concrete Bridges	Christian Benz et.al.	2401.03298	link
2024-01-02	Unsupervised Federated Domain Adaptation for Segmentation of MRI Images	Navapat Nananukul et.al.	2401.02941	null
2024-01-04	ClassWise-SAM-Adapter: Parameter Efficient Fine-tuning Adapts Segment Anything to SAR Domain for Semantic Segmentation	Xinyang Pu et.al.	2401.02326	link
2024-01-04	Source-Free Online Domain Adaptive Semantic Segmentation of Satellite Images under Image Degradation	Fahim Faisal Niloy et.al.	2401.02113	null
2024-01-03	Towards Robust Semantic Segmentation against Patch-based Attack via Attention Refinement	Zheng Yuan et.al.	2401.01750	null
2024-01-03	S3Net: Innovating Stereo Matching and Semantic Segmentation with a Single-Branch Semantic Stereo Network in Satellite Epipolar Imagery	Qingyuan Yang et.al.	2401.01643	link
2024-01-03	Context-Aware Interaction Network for RGB-T Semantic Segmentation	Ying Lv et.al.	2401.01624	link
2024-01-02	Off-Road LiDAR Intensity Based Semantic Segmentation	Kasi Viswanath et.al.	2401.01439	link
2024-01-02	Integrating Edges into U-Net Models with Explainable Activation Maps for Brain Tumor Segmentation using MR Images	Subin Sahayam et.al.	2401.01303	null
2024-01-02	Physics-informed Generalizable Wireless Channel Modeling with Segmentation and Deep Learning: Fundamentals, Methodologies, and Challenges	Ethan Zhu et.al.	2401.01288	null
2024-01-02	GBSS: a global building semantic segmentation dataset for large-scale remote sensing building extraction	Yuping Hu et.al.	2401.01178	null
2024-01-02	DTBS: Dual-Teacher Bi-directional Self-training for Domain Adaptation in Nighttime Semantic Segmentation	Fanding Huang et.al.	2401.01066	link
2024-01-02	Online Continual Domain Adaptation for Semantic Image Segmentation Using Internal Representations	Serban Stan et.al.	2401.01035	link
2023-12-31	Analyzing Local Representations of Self-supervised Vision Transformers	Ani Vanyan et.al.	2401.00463	null
2023-12-28	Learning Vision from Models Rivals Learning Vision from Data	Yonglong Tian et.al.	2312.17742	link
2024-01-04	HEAP: Unsupervised Object Discovery and Localization with Contrastive Grouping	Xin Zhang et.al.	2312.17492	null
2023-12-28	Unsupervised Universal Image Segmentation	Dantong Niu et.al.	2312.17243	link
2024-01-03	An Improved Baseline for Reasoning Segmentation with Large Language Model	Senqiao Yang et.al.	2312.17240	null
2023-12-28	SCTNet: Single-Branch CNN with Transformer Semantic Information for Real-Time Segmentation	Zhengze Xu et.al.	2312.17071	link
2023-12-28	EvPlug: Learn a Plug-and-Play Module for Event and Image Fusion	Jianping Jiang et.al.	2312.16933	null
2023-12-29	Multi-modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation	Xiawei Li et.al.	2312.16578	link
2023-12-27	ConstScene: Dataset and Model for Advancing Robust Semantic Segmentation in Construction Environments	Maghsood Salimi et.al.	2312.16516	link
2023-12-26	VirtualPainting: Addressing Sparsity with Virtual Points and Distance-Aware Data Augmentation for 3D Object Detection	Sudip Dhakal et.al.	2312.16141	null
2023-12-26	LangSplat: 3D Language Gaussian Splatting	Minghan Qin et.al.	2312.16084	link
2023-12-23	WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Large-scale Natural Environments	Kavisha Vidanapathirana et.al.	2312.15364	link
2023-12-23	Make Me a BNN: A Simple Strategy for Estimating Bayesian Uncertainty from Pre-trained Models	Gianni Franchi et.al.	2312.15297	null
2023-12-22	Harnessing Diffusion Models for Visual Perception with Meta Prompts	Qiang Wan et.al.	2312.14733	link
2023-12-22	Variance-insensitive and Target-preserving Mask Refinement for Interactive Image Segmentation	Chaowei Fang et.al.	2312.14387	null
2023-12-26	TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification	Qinying Liu et.al.	2312.14149	link
2023-12-21	Dual Attention U-Net with Feature Infusion: Pushing the Boundaries of Multiclass Defect Segmentation	Rasha Alshawi et.al.	2312.14053	link
2023-12-21	Few Shot Part Segmentation Reveals Compositional Logic for Industrial Anomaly Detection	Soopil Kim et.al.	2312.13783	link
2023-12-22	Weakly Supervised Semantic Segmentation for Driving Scenes	Dongseob Kim et.al.	2312.13646	link
2023-12-20	DVIS++: Improved Decoupled Framework for Universal Video Segmentation	Tao Zhang et.al.	2312.13305	link
2023-12-20	BEVSeg2TP: Surround View Camera Bird’s-Eye-View Based Joint Vehicle Segmentation and Ego Vehicle Trajectory Prediction	Sushil Sharma et.al.	2312.13081	link
2023-12-20	Multi-task Learning To Improve Semantic Segmentation Of CBCT Scans Using Image Reconstruction	Maximilian Ernst Tschuchnig et.al.	2312.12990	null
2023-12-20	TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training	Yuqi Lin et.al.	2312.12828	link
2023-12-20	Spectral Prompt Tuning: Unveiling Unseen Classes for Zero-Shot Semantic Segmentation	Wenhao Xu et.al.	2312.12754	link
2023-12-20	MetaSegNet: Metadata-collaborative Vision-Language Representation Learning for Semantic Segmentation of Remote Sensing Images	Libo Wang et.al.	2312.12735	null
2023-12-20	Segment Anything Model Meets Image Harmonization	Haoxing Chen et.al.	2312.12729	null
2023-12-19	DDOS: The Drone Depth and Obstacle Segmentation Dataset	Benedikt Kolbeinsson et.al.	2312.12494	null
2023-12-19	SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process	Mengyu Wang et.al.	2312.12425	link
2023-12-19	CLIP-DINOiser: Teaching CLIP a few DINO tricks	Monika Wysoczańska et.al.	2312.12359	link
2023-12-19	All for One, and One for All: UrbanSyn Dataset, the third Musketeer of Synthetic Driving Scenes	Jose L. Gómez et.al.	2312.12176	null
2023-12-19	Domain Generalization in LiDAR Semantic Segmentation Leveraged by Density Discriminative Feature Embedding	Jaeyeul Kim et.al.	2312.12098	null
2023-12-18	Detecting the edges of galaxies with deep learning	Jesús Fernández et.al.	2312.11654	null
2023-12-18	PlaNet-S: Automatic Semantic Segmentation of Placenta	Shinnosuke Yamamoto et.al.	2312.11580	null
2023-12-18	Language-Assisted 3D Scene Understanding	Yanmin Wu et.al.	2312.11451	link
2023-12-18	Research on Multilingual Natural Scene Text Detection Algorithm	Tao Wang et.al.	2312.11153	null
2023-12-18	SeeBel: Seeing is Believing	Sourajit Saha et.al.	2312.10933	link
2023-12-17	Artificial intelligence optical hardware empowers high-resolution hyperspectral video understanding at 1.2 Tb/s	Maksim Makarenko et.al.	2312.10639	null
2023-12-16	Transformers in Unsupervised Structure-from-Motion	Hemang Chawla et.al.	2312.10529	link
2023-12-16	All Attention U-NET for Semantic Segmentation of Intracranial Hemorrhages In Head CT Images	Chia Shuo Chang et.al.	2312.10483	null
2023-12-16	Semantic-Aware Autoregressive Image Modeling for Visual Representation Learning	Kaiyou Song et.al.	2312.10457	link
2023-12-15	Forging Tokens for Improved Storage-efficient Training	Minhyun Lee et.al.	2312.10105	link
2023-12-15	Collaborating Foundation models for Domain Generalized Semantic Segmentation	Yasser Benigmim et.al.	2312.09788	link
2023-12-15	Density Matters: Improved Core-set for Active Domain Adaptive Segmentation	Shizhan Liu et.al.	2312.09595	null
2023-12-15	AEGIS-Net: Attention-guided Multi-Level Feature Aggregation for Indoor Place Recognition	Yuhang Ming et.al.	2312.09538	link
2023-12-15	WeatherProof: A Paired-Dataset Approach to Semantic Segmentation in Adverse Weather	Blake Gella et.al.	2312.09534	null
2023-12-14	LIME: Localized Image Editing via Attention Regularization in Diffusion Models	Enis Simsar et.al.	2312.09256	null
2023-12-14	Reliability in Semantic Segmentation: Can We Use Synthetic Data?	Thibaut Loiseau et.al.	2312.09231	link
2023-12-18	Progressive Feature Self-reinforcement for Weakly Supervised Semantic Segmentation	Jingxuan He et.al.	2312.08916	link
2023-12-14	Agent Attention: On the Integration of Softmax and Linear Attention	Dongchen Han et.al.	2312.08874	link
2023-12-14	Achelous++: Power-Oriented Water-Surface Panoptic Perception Framework on Edge Devices based on Vision-Radar Fusion and Pruning of Heterogeneous Modalities	Runwei Guan et.al.	2312.08851	link
2023-12-14	Offshore Wind Plant Instance Segmentation Using Sentinel-1 Time Series, GIS, and Semantic Segmentation Models	Osmar Luiz Ferreira de Carvalho et.al.	2312.08773	null
2023-12-14	Segment Beyond View: Handling Partially Missing Modality for Audio-Visual Semantic Segmentation	Renjie Wu et.al.	2312.08673	null
2023-12-14	Semi-supervised Semantic Segmentation Meets Masked Modeling: Fine-grained Locality Learning Matters in Consistency Regularization	Wentao Pan et.al.	2312.08631	null
2023-12-11	DFGET: Displacement-Field Assisted Graph Energy Transmitter for Gland Instance Segmentation	Caiqing Jian et.al.	2312.07584	null
2023-12-12	X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-modal Knowledge Transfer	Linglin Jing et.al.	2312.07378	link
2023-12-12	Adversarial Semi-Supervised Domain Adaptation for Semantic Segmentation: A New Role for Labeled Target Samples	Marwa Kechaou et.al.	2312.07370	null
2023-12-12	Expand-and-Quantize: Unsupervised Semantic Segmentation Using High-Dimensional Space and Product Quantization	Jiyoung Kim et.al.	2312.07342	null
2023-12-12	Transferring CLIP’s Knowledge into Zero-Shot Point Cloud Semantic Segmentation	Yuanbin Wang et.al.	2312.07221	null
2023-12-12	MCFNet: Multi-scale Covariance Feature Fusion Network for Real-time Semantic Segmentation	Xiaojie Fang et.al.	2312.07207	null
2023-12-11	Densify Your Labels: Unsupervised Clustering with Bipartite Matching for Weakly Supervised Point Cloud Segmentation	Shaobo Xia et.al.	2312.06799	null
2023-12-11	Deciphering ‘What’ and ‘Where’ Visual Pathways from Spectral Clustering of Layer-Distributed Neural Representations	Xiao Zhang et.al.	2312.06716	link
2023-12-10	AM-RADIO: Agglomerative Model – Reduce All Domains Into One	Mike Ranzinger et.al.	2312.06709	link
2023-12-11	Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic Segmentation	Xiaoyi Bao et.al.	2312.06474	null
2023-12-11	Semantic Connectivity-Driven Pseudo-labeling for Cross-domain Segmentation	Dong Zhao et.al.	2312.06331	link
2023-12-11	U-MixFormer: UNet-like Transformer with Mix-Attention for Efficient Semantic Segmentation	Seul-Ki Yeom et.al.	2312.06272	link
2023-12-11	Adaptive Annotation Distribution for Weakly Supervised Point Cloud Semantic Segmentation	Zhiyi Pan et.al.	2312.06259	link
2023-12-10	Deep-Learning-Assisted Analysis of Cataract Surgery Videos	Negin Ghamsarian et.al.	2312.05900	null
2023-12-09	CSL: Class-Agnostic Structure-Constrained Learning for Segmentation Including the Unseen	Hao Zhang et.al.	2312.05538	null
2023-12-08	Loss Functions in the Era of Semantic Segmentation: A Survey and Outlook	Reza Azad et.al.	2312.05391	link
2023-12-08	Lyrics: Boosting Fine-grained Language-Vision Alignment and Comprehension via Semantic-aware Visual Objects	Junyu Lu et.al.	2312.05278	null
2023-12-08	Datasets, Models, and Algorithms for Multi-Sensor, Multi-agent Autonomy Using AVstack	R. Spencer Hallyburton et.al.	2312.04970	null
2023-12-07	Point2CAD: Reverse Engineering CAD Models from 3D Point Clouds	Yujia Liu et.al.	2312.04962	null
2023-12-08	Segmentation of Kidney Tumors on Non-Contrast CT Images using Protuberance Detection Network	Taro Hatsutani et.al.	2312.04796	null
2023-12-07	gcDLSeg: Integrating Graph-cut into Deep Learning for Binary Semantic Segmentation	Hui Xie et.al.	2312.04713	null
2023-12-07	HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image	Tong Wu et.al.	2312.04543	null
2023-12-07	Self-Guided Open-Vocabulary Semantic Segmentation	Osman Ülger et.al.	2312.04539	link
2023-12-07	Semi-Supervised Active Learning for Semantic Segmentation in Unknown Environments Using Informative Path Planning	Julius Rückin et.al.	2312.04402	link
2023-12-07	Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation	Zhixiang Wei et.al.	2312.04265	link
2023-12-07	Fine-tune vision foundation model for crack segmentation in civil infrastructures	Kang Ge et.al.	2312.04233	null
2023-12-07	Augmentation-Free Dense Contrastive Knowledge Distillation for Efficient Semantic Segmentation	Jiawei Fan et.al.	2312.04168	link
2023-12-07	Residual Graph Convolutional Network for Bird’s-Eye-View Semantic Segmentation	Qiuxiao Chen et.al.	2312.04044	null
2023-12-06	Novel class discovery meets foundation models for 3D semantic segmentation	Luigi Riz et.al.	2312.03782	null
2023-12-10	Foundation Model Assisted Weakly Supervised Semantic Segmentation	Xiaobo Yang et.al.	2312.03585	link
2023-12-06	ShareCMP: Polarization-Aware RGB-P Semantic Segmentation	Zhuoyan Liu et.al.	2312.03430	link
2023-12-06	DeepPyramid+: Medical Image Segmentation using Pyramid View Fusion and Deformable Pyramid Reception	Negin Ghamsarian et.al.	2312.03409	null
2023-12-06	Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields	Shijie Zhou et.al.	2312.03203	link
2023-12-05	AI-SAM: Automatic and Interactive Segment Anything Model	Yimu Pan et.al.	2312.03119	link
2023-12-05	DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control	Yuru Jia et.al.	2312.03048	null
2023-12-05	Uni3DL: Unified Model for 3D and Language Understanding	Xiang Li et.al.	2312.03026	null
2023-12-05	6D Assembly Pose Estimation by Point Cloud Registration for Robot Manipulation	K. Samarawickrama et.al.	2312.02593	link
2023-12-05	Towards More Unified In-context Visual Understanding	Dianmo Sheng et.al.	2312.02520	null
2023-12-05	SAM-Assisted Remote Sensing Imagery Semantic Segmentation with Object and Boundary Constraints	Xianping Ma et.al.	2312.02464	link
2023-12-05	Towards Granularity-adjusted Pixel-level Semantic Annotation	Rohit Kundu et.al.	2312.02420	null
2023-12-04	Class-Discriminative Attention Maps for Vision Transformers	Lennart Brocki et.al.	2312.02364	link
2023-12-04	Geometrically-driven Aggregation for Zero-shot 3D Point Cloud Understanding	Guofeng Mei et.al.	2312.02244	link
2023-12-04	Contrastive Learning-Based Spectral Knowledge Distillation for Multi-Modality and Missing Modality Scenarios in Semantic Segmentation	Aniruddh Sikdar et.al.	2312.02240	null
2023-12-04	VLTSeg: Simple Transfer of CLIP-Based Vision-Language Representations for Domain Generalized Semantic Segmentation	Christoph Hümmer et.al.	2312.02021	null
2023-12-04	Generalization by Adaptation: Diffusion-Based Domain Extension for Domain-Generalized Semantic Segmentation	Joshua Niemeijer et.al.	2312.01850	link
2023-12-04	Few Clicks Suffice: Active Test-Time Adaptation for Semantic Segmentation	Longhui Yuan et.al.	2312.01835	null
2023-12-04	SE-LIO: Semantics-enhanced Solid-State-LiDAR-Inertial Odometry for Tree-rich Environments	Tisheng Zhang et.al.	2312.01809	null
2023-12-04	SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference	Feng Wang et.al.	2312.01597	link
2023-12-03	G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training	Che Liu et.al.	2312.01522	link
2023-12-03	A Review and A Robust Framework of Data-Efficient 3D Scene Parsing with Traditional/Learned 3D Descriptors	Kangcheng Liu et.al.	2312.01262	null
2023-12-02	Virtual Category Learning: A Semi-Supervised Learning Method for Dense Prediction with Extremely Limited Labels	Changrui Chen et.al.	2312.01169	link
2023-12-01	Improve Supervised Representation Learning with Masked Image Modeling	Kaifeng Chen et.al.	2312.00950	null
2023-12-01	Grounding Everything: Emerging Localization Properties in Vision-Language Transformers	Walid Bousselham et.al.	2312.00878	link
2023-12-01	Sequential Modeling Enables Scalable Learning for Large Vision Models	Yutong Bai et.al.	2312.00785	link
2023-12-01	GIFT: Generative Interpretable Fine-Tuning Transformers	Chinmay Savadikar et.al.	2312.00700	link
2023-12-01	CellMixer: Annotation-free Semantic Cell Segmentation of Heterogeneous Cell Populations	Mehdi Naouar et.al.	2312.00671	null
2023-12-01	SCHEME: Scalable Channel Mixer for Vision Transformers	Deepak Sridhar et.al.	2312.00412	null
2023-12-04	Efficient Multimodal Semantic Segmentation via Dual-Prompt Learning	Shaohua Dong et.al.	2312.00360	link
2023-12-01	Improving Normalization with the James-Stein Estimator	Seyedalireza Khoshsirat et.al.	2312.00313	null
2023-12-01	A knowledge-based data-driven (KBDD) framework for all-day identification of cloud types using satellite remote sensing	Longfeng Nie et.al.	2312.00308	null
2023-11-30	InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation	Rongyao Fang et.al.	2311.18835	link
2023-11-30	Exploiting Diffusion Prior for Generalizable Pixel-Level Semantic Prediction	Hsin-Ying Lee et.al.	2311.18832	link
2023-11-30	Semi-supervised Semantic Segmentation via Boosting Uncertainty on Unlabeled Data	Daoan Zhang et.al.	2311.18758	null
2023-11-30	Learning Part Segmentation from Synthetic Animals	Jiawei Peng et.al.	2311.18661	null
2023-11-30	A Lightweight Clustering Framework for Unsupervised Semantic Segmentation	Yau Shing Jonathan Cheung et.al.	2311.18628	null
2023-11-30	Each Test Image Deserves A Specific Prompt: Continual Test-Time Adaptation for 2D Medical Image Segmentation	Ziyang Chen et.al.	2311.18363	link
2023-11-30	MRFP: Learning Generalizable Semantic Segmentation from Sim-2-Real with Multi-Resolution Feature Perturbation	Sumanth Udupa et.al.	2311.18331	link
2023-11-30	Beyond Entropy: Style Transfer Guided Single Image Continual Test-Time Adaptation	Younggeol Cho et.al.	2311.18270	null
2023-11-29	ALSTER: A Local Spatio-Temporal Expert for Online 3D Semantic Reconstruction	Silvan Weder et.al.	2311.18068	null
2023-11-29	A Simple Recipe for Language-guided Domain Generalized Segmentation	Mohammad Fahes et.al.	2311.17922	link
2023-11-30	Do text-free diffusion models learn discriminative visual representations?	Soumik Mukhopadhyay et.al.	2311.17921	link
2023-11-29	Spherical Frustum Sparse Convolution Network for LiDAR Point Cloud Semantic Segmentation	Yu Zheng et.al.	2311.17491	link
2023-11-29	Continual Learning for Image Segmentation with Dynamic Query	Weijia Wu et.al.	2311.17450	link
2023-11-28	TransNeXt: Robust Foveal Visual Perception for Vision Transformers	Dai Shi et.al.	2311.17132	link
2023-11-28	Generative Data Augmentation Improves Scribble-supervised Semantic Segmentation	Jacob Schnell et.al.	2311.17121	null
2023-11-28	Plug-and-Play, Dense-Label-Free Extraction of Open-Vocabulary Semantic Segmentation from Vision-Language Models	Luo Jiayun et.al.	2311.17095	link
2023-11-28	ContextSeg: Sketch Semantic Segmentation by Querying the Context with Attention	Jiawei Wang et.al.	2311.16682	null
2023-11-27	Segment Every Out-of-Distribution Object	Wenjie Zhao et.al.	2311.16516	link
2023-11-27	SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance	Lukas Hoyer et.al.	2311.16241	link
2023-11-27	Seeing Beyond Cancer: Multi-Institutional Validation of Object Localization and 3D Semantic Segmentation using Deep Learning for Breast MRI	Arda Pekis et.al.	2311.16213	null
2023-11-27	Street TryOn: Learning In-the-Wild Virtual Try-On from Unpaired Person Images	Aiyu Cui et.al.	2311.16094	null
2023-11-27	FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding in Open World	Thanh-Dat Truong et.al.	2311.15965	null
2023-11-27	2D Feature Distillation for Weakly- and Semi-Supervised 3D Semantic Segmentation	Ozan Unal et.al.	2311.15605	null
2023-11-27	An Ensemble of 2.5D ResUnet Based Models for Segmentation for Kidney and Masses	Cancan Chen et.al.	2311.15586	null
2023-11-27	SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation	Bin Xie et.al.	2311.15537	link
2023-11-26	Advancing Vision Transformers with Group-Mix Attention	Chongjian Ge et.al.	2311.15157	link
2023-11-25	Can SAM recognize crops? Quantifying the zero-shot performance of a semantic segmentation foundation model on generating crop-type maps using satellite imagery for precision agriculture	Rutuja Gurav et.al.	2311.15138	null
2023-11-25	Adapter is All You Need for Tuning Visual Tasks	Dongshuo Yin et.al.	2311.15010	link
2023-11-28	Uncertainty Aware AI for 2D MRI Segmentation	Lohith Konathala et.al.	2311.14875	null
2023-11-24	Understanding Self-Supervised Features for Learning Unsupervised Instance Segmentation	Paul Engstler et.al.	2311.14665	null
2023-11-24	IDD-AW: A Benchmark for Safe and Robust Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather	Furqan Ahmed Shaik et.al.	2311.14459	null
2023-11-24	Segment (Almost) Nothing: Prompt-Agnostic Adversarial Attacks on Segmentation Models	Francesco Croce et.al.	2311.14450	null
2023-11-24	OneFormer3D: One Transformer for Unified Point Cloud Segmentation	Maxim Kolodiazhnyi et.al.	2311.14405	link
2023-11-23	Class Balanced Dynamic Acquisition for Domain Adaptive Semantic Segmentation using Active Learning	Marc Schachtsiek et.al.	2311.14146	null
2023-11-23	Language-guided Few-shot Semantic Segmentation	Jing Wang et.al.	2311.13865	null
2023-11-22	DiverseNet: Decision Diversified Semi-supervised Semantic Segmentation Networks for Remote Sensing Imagery	Wanli Ma et.al.	2311.13716	null
2023-11-22	BenthIQ: a Transformer-Based Benthic Classification Model for Coral Restoration	Rupa Kurinchi-Vendhan et.al.	2311.13661	null
2023-11-22	DA-STC: Domain Adaptive Video Semantic Segmentation via Spatio-Temporal Consistency	Zhe Zhang et.al.	2311.13254	link
2023-11-22	Self-guided Few-shot Semantic Segmentation for Remote Sensing Imagery Based on Large Vision Models	Xiyu Qi et.al.	2311.13200	null
2023-11-22	FuseNet: Self-Supervised Dual-Path Network for Medical Image Segmentation	Amirhossein Kazerouni et.al.	2311.13069	link
2023-11-21	AI for Agriculture: the Comparison of Semantic Segmentation Methods for Crop Mapping with Sentinel-2 Imagery	Irina Korotkova et.al.	2311.12993	null
2023-11-21	Mobile-Seed: Joint Semantic Segmentation and Boundary Detection for Mobile Robots	Youqi Liao et.al.	2311.12651	link
2023-11-21	Instance-aware 3D Semantic Segmentation powered by Shape Generators and Classifiers	Bo Sun et.al.	2311.12291	null
2023-11-20	Disentangling Structure and Appearance in ViT Feature Space	Narek Tumanyan et.al.	2311.12193	null
2023-11-20	Model-aware 3D Eye Gaze from Weak and Few-shot Supervisions	Nikola Popovic et.al.	2311.12157	link
2023-11-20	GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding	Hao Li et.al.	2311.11863	null
2023-11-20	Predicting urban tree cover from incomplete point labels and limited background information	Hui Zhang et.al.	2311.11592	null
2023-11-20	Generalized Category Discovery in Semantic Segmentation	Zhengyuan Peng et.al.	2311.11525	link
2023-11-19	SOccDPT: Semi-Supervised 3D Semantic Occupancy from Dense Prediction Transformers trained under memory constraints	Aditya Nalgunda Ganesh et.al.	2311.11371	null
2023-11-19	Optimizing RGB-D semantic segmentation through multi-modal interaction and pooling attention	Shuai Zhang et.al.	2311.11312	null
2023-11-18	Low-Precision Floating-Point for Efficient On-Board Deep Neural Network Processing	Cédric Gernigon et.al.	2311.11172	null
2023-11-18	SNI-SLAM: Semantic Neural Implicit SLAM	Siting Zhu et.al.	2311.11016	link
2023-11-17	Labeling Indoor Scenes with Fusion of Out-of-the-Box Perception Models	Yimeng Li et.al.	2311.10883	null
2023-11-17	Self-trained Panoptic Segmentation	Shourya Verma et.al.	2311.10648	null
2023-11-17	A Framework of Landsat-8 Band Selection based on UMDA for Deforestation Detection	Eduardo B. Neto et.al.	2311.10513	null
2023-11-15	NormNet: Scale Normalization for 6D Pose Estimation in Stacked Scenarios	En-Te Lin et.al.	2311.09269	link
2023-11-15	Correlation-aware active learning for surgery video segmentation	Fei Wu et.al.	2311.08811	null
2023-11-14	Efficient Rotation Invariance in Deep Neural Networks through Artificial Mental Rotation	Lukas Tuggener et.al.	2311.08525	null
2023-11-14	LocaliseBot: Multi-view 3D object localisation with differentiable rendering for robot grasping	Sujal Vijayaraghavan et.al.	2311.08438	null
2023-11-14	Test-Time Training for Semantic Segmentation with Output Contrastive Loss	Yunlong Zhang et.al.	2311.07877	link
2023-11-13	Temporal Performance Prediction for Deep Convolutional Long Short-Term Memory Networks	Laura Fieback et.al.	2311.07477	null
2023-11-14	Simultaneous Clutter Detection and Semantic Segmentation of Moving Objects for Automotive Radar Data	Johannes Kopp et.al.	2311.07247	null
2023-11-13	SpectralGPT: Spectral Foundation Model	Danfeng Hong et.al.	2311.07113	null
2023-11-11	Unsupervised and semi-supervised co-salient object detection via segmentation frequency statistics	Souradeep Chakraborty et.al.	2311.06654	null
2023-11-10	Lidar-based Norwegian tree species detection using deep learning	Martijn Vermeer et.al.	2311.06066	null
2023-11-09	PolyMaX: General Dense Prediction with Mask Transformer	Xuan Yang et.al.	2311.05770	link
2023-11-09	TLCFuse: Temporal Multi-Modality Fusion Towards Occlusion-Aware Semantic Segmentation-Aided Motion Planning	Gustavo Salazar-Gomez et.al.	2311.05319	null
2023-11-09	Reducing the Side-Effects of Oscillations in Training of Quantized YOLO Networks	Kartik Gupta et.al.	2311.05109	null
2023-11-07	Data exploitation: multi-task learning of object detection and semantic segmentation on partially annotated data	Hoàng-Ân Lê et.al.	2311.04040	link
2023-11-07	A Comparative Study of Knowledge Transfer Methods for Misaligned Urban Building Labels	Bipul Neupane et.al.	2311.03867	null
2023-11-07	Autonomous Exploration and General Visual Inspection of Ship Ballast Water Tanks using Aerial Robots	Mihir Dharmadhikari et.al.	2311.03838	null
2023-11-06	Leveraging point annotations in segmentation learning with boundary loss	Eva Breznik et.al.	2311.03537	null
2023-11-06	TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding	Shuo Wang et.al.	2311.03427	link
2023-11-06	SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis	Hanrong Ye et.al.	2311.03355	null
2023-11-06	Segmentation of Drone Collision Hazards in Airborne RADAR Point Clouds Using PointNet	Hector Arroyo et.al.	2311.03221	null
2023-11-06	Pelvic floor MRI segmentation based on semi-supervised deep learning	Jianwei Zuo et.al.	2311.03105	null
2023-11-06	COLA: COarse-LAbel multi-source LiDAR semantic segmentation for autonomous driving	Jules Sanchez et.al.	2311.03017	null
2023-11-08	Deep Image Semantic Communication Model for Artificial Intelligent Internet of Things	Li Ping Qian et.al.	2311.02926	link
2023-11-05	PotholeGuard: A Pothole Detection Approach by Point Cloud Semantic Segmentation	Sahil Nawale et.al.	2311.02641	null
2023-11-05	TFNet: Tuning Fork Network with Neighborhood Pixel Aggregation for Improved Building Footprint Extraction	Muhammad Ahmad Waseem et.al.	2311.02617	null
2023-11-03	Image Recognition of Oil Leakage Area Based on Logical Semantic Discrimination	Weiying Lin et.al.	2311.02256	null
2023-11-03	MineSegSAT: An automated system to evaluate mining disturbed area extents from Sentinel-2 imagery	Ezra MacDonald et.al.	2311.01676	link
2023-11-02	MemorySeg: Online LiDAR Semantic Segmentation with a Latent Memory	Enxu Li et.al.	2311.01556	null
2023-11-02	AiluRus: A Scalable ViT Framework for Dense Prediction	Jin Li et.al.	2311.01197	link
2023-11-02	A deep learning experiment for semantic segmentation of overlapping characters in palimpsests	Michela Perino et.al.	2311.01130	null
2023-11-02	Overhead Line Defect Recognition Based on Unsupervised Semantic Segmentation	Weixi Wang et.al.	2311.00979	null
2023-11-01	PAUMER: Patch Pausing Transformer for Semantic Segmentation	Evann Courdier et.al.	2311.00586	null
2023-10-31	Joint Depth Prediction and Semantic Segmentation with Multi-View SAM	Mykhailo Shvets et.al.	2311.00134	null
2023-10-31	Bilateral Network with Residual U-blocks and Dual-Guided Attention for Real-time Semantic Segmentation	Liang Liao et.al.	2310.20305	link
2023-10-31	Annotator: A Generic Active Learning Baseline for LiDAR Semantic Segmentation	Binhui Xie et.al.	2310.20293	null
2023-10-30	Dynamic Gaussian Splatting from Markerless Motion Capture can Reconstruct Infants Movements	R. James Cotton et.al.	2310.19441	null
2023-10-30	Resource Constrained Semantic Segmentation for Waste Sorting	Elisa Cascina et.al.	2310.19407	link
2023-10-30	L2T-DLN: Learning to Teach with Dynamic Loss Network	Zhoyang Hai et.al.	2310.19313	null
2023-10-30	Revisiting Evaluation Metrics for Semantic Segmentation: Optimization and Evaluation of Fine-grained Intersection over Union	Zifu Wang et.al.	2310.19252	link
2023-10-30	Modular Anti-noise Deep Learning Network for Robotic Grasp Detection Based on RGB Images	Zhaocong Li et.al.	2310.19223	link
2023-10-29	Dynamic Task and Weight Prioritization Curriculum Learning for Multimodal Imagery	Huseyin Fuat Alsan et.al.	2310.19109	link
2023-10-29	Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation	Fei Zhang et.al.	2310.19001	null
2023-10-29	Mask Propagation for Efficient Video Semantic Segmentation	Yuetian Weng et.al.	2310.18954	link
2023-10-28	Exploring Data Augmentations on Self-/Semi-/Fully- Supervised Pre-trained Models	Shentong Mo et.al.	2310.18850	null
2023-10-28	One-shot Localization and Segmentation of Medical Images with Foundation Models	Deepa Anand et.al.	2310.18642	null
2023-10-28	Switching Temporary Teachers for Semi-Supervised Semantic Segmentation	Jaemin Na et.al.	2310.18640	link
2023-10-27	A Self-Supervised Approach to Land Cover Segmentation	Charles Moore et.al.	2310.18251	null
2023-10-27	SmooSeg: Smoothness Prior for Unsupervised Semantic Segmentation	Mengcheng Lan et.al.	2310.17874	link
2023-10-26	Image Prior and Posterior Conditional Probability Representation for Efficient Damage Assessment	Jie Wei et.al.	2310.17801	null
2023-10-26	Revisiting the Distillation of Image Representations into Point Clouds for Autonomous Driving	Gilles Puy et.al.	2310.17504	link
2023-10-26	Uncertainty-weighted Loss Functions for Improved Adversarial Attacks on Semantic Segmentation	Kira Maag et.al.	2310.17436	link
2023-10-26	BEVContrast: Self-Supervision in BEV Space for Automotive Lidar Point Clouds	Corentin Sautier et.al.	2310.17281	link
2023-10-26	Virtual Accessory Try-On via Keypoint Hallucination	Junhong Gou et.al.	2310.17131	null
2023-10-26	Automating lichen monitoring in ecological studies using instance segmentation of time-lapse images	Safwen Naimi et.al.	2310.17080	null
2023-10-25	Unsupervised Domain Adaptation for Semantic Segmentation with Pseudo Label Self-Refinement	Xingchen Zhao et.al.	2310.16979	null
2023-10-25	4D-Editor: Interactive Object-level Editing in Dynamic Neural Radiance Fields via 4D Semantic Segmentation	Dadong Jiang et.al.	2310.16858	null
2023-10-25	Gramian Attention Heads are Strong yet Efficient Vision Learners	Jongbin Ryu et.al.	2310.16483	link
2023-10-24	Pixel-Level Clustering Network for Unsupervised Image Segmentation	Cuong Manh Hoang et.al.	2310.16234	null
2023-10-26	CPSeg: Finer-grained Image Semantic Segmentation via Chain-of-Thought Language Prompting	Lei Li et.al.	2310.16069	null
2023-10-26	ConvBKI: Real-Time Probabilistic Semantic Mapping Network with Quantifiable Uncertainty	Joey Wilson et.al.	2310.16020	null
2023-10-24	Semantic-preserving image coding based on Conditional Diffusion models	Francesco Pezone et.al.	2310.15737	link
2023-10-26	GNeSF: Generalizable Neural Semantic Fields	Hanlin Chen et.al.	2310.15712	null
2023-10-23	SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding	Haoxiang Wang et.al.	2310.15308	null
2023-10-23	FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models	Lihe Yang et.al.	2310.15160	link
2023-10-23	P2AT: Pyramid Pooling Axial Transformer for Real-time Semantic Segmentation	Mohammed A. M. Elhassan et.al.	2310.15025	link
2023-10-22	A Survey on Continual Semantic Segmentation: Theory, Challenge, Method and Application	Bo Yuan et.al.	2310.14277	link
2023-10-22	Partition Speeds Up Learning Implicit Neural Representations Based on Exponential-Increase Hypothesis	Ke Liu et.al.	2310.14184	link
2023-10-20	Longer-range Contextualized Masked Autoencoder	Taekyung Kim et.al.	2310.13593	link
2023-10-20	ROSS: Radar Off-road Semantic Segmentation	Peng Jiang et.al.	2310.13551	null
2023-10-20	Technical Report for ICCV 2023 Visual Continual Learning Challenge: Continuous Test-time Adaptation for Semantic Segmentation	Damian Sójka et.al.	2310.13533	null
2023-10-20	A review of individual tree crown detection and delineation from optical remote sensing images	Juepeng Zheng et.al.	2310.13481	null
2023-10-20	FLAIR: a Country-Scale Land Cover Semantic Segmentation Dataset From Multi-Source Optical Imagery	Anatol Garioud et.al.	2310.13336	link
2023-10-19	LeTFuser: Light-weight End-to-end Transformer-Based Sensor Fusion for Autonomous Driving with Multi-Task Learning	Pedram Agand et.al.	2310.13135	link
2023-10-19	Using Logic Programming and Kernel-Grouping for Improving Interpretability of Convolutional Neural Networks	Parth Padalkar et.al.	2310.13073	null
2023-10-19	Weakly-Supervised Semantic Segmentation with Image-Level Labels: from Traditional Models to Foundation Models	Zhaozheng Chen et.al.	2310.13026	link
2023-10-19	Minimalist and High-Performance Semantic Segmentation with Plain Vision Transformers	Yuanduo Hong et.al.	2310.12755	link
2023-10-19	Cross-attention Spatio-temporal Context Transformer for Semantic Segmentation of Historical Maps	Sidi Wu et.al.	2310.12616	link
2023-10-19	RecolorCloud: A Point Cloud Tool for Recoloring, Segmentation, and Conversion	Esteban Segarra Martinez et.al.	2310.12470	null
2023-10-19	Lidar Panoptic Segmentation and Tracking without Bells and Whistles	Abhinav Agarwalla et.al.	2310.12464	link
2023-10-18	SegmATRon: Embodied Adaptive Semantic Segmentation for Indoor Environment	Tatiana Zemskova et.al.	2310.12031	link
2023-10-16	IDRNet: Intervention-Driven Relation Network for Semantic Segmentation	Zhenchao Jin et.al.	2310.10755	link
2023-10-16	Motion2Language, Unsupervised learning of synchronized semantic motion segmentation	Karim Radouane et.al.	2310.10594	link
2023-10-16	RefConv: Re-parameterized Refocusing Convolution for Powerful ConvNets	Zhicheng Cai et.al.	2310.10563	link
2023-10-17	Label-efficient Segmentation via Affinity Propagation	Wentong Li et.al.	2310.10533	link
2023-10-16	On the Transferability of Learning Models for Semantic Segmentation for Remote Sensing Data	Rongjun Qin et.al.	2310.10490	link
2023-10-15	Top-K Pooling with Patch Contrastive Learning for Weakly-Supervised Semantic Segmentation	Wangyu Wu et.al.	2310.09828	null
2023-10-15	Image Augmentation with Controlled Diffusion for Weakly-Supervised Semantic Segmentation	Wangyu Wu et.al.	2310.09760	null
2023-10-13	Equirectangular image construction method for standard CNNs for Semantic Segmentation	Haoqian Chen et.al.	2310.09122	null
2023-10-13	Faster 3D cardiac CT segmentation with Vision Transformers	Lee Jollans et.al.	2310.09099	link
2023-10-13	Revisiting Multi-modal 3D Semantic Segmentation in Real-world Autonomous Driving	Feng Jiang et.al.	2310.08826	null
2023-10-12	SSG2: A new modelling paradigm for semantic segmentation	Foivos I. Diakogiannis et.al.	2310.08671	link
2023-10-16	SegLoc: Novel Visual Self-supervised Learning Scheme for Dense Prediction Tasks of Security Inspection X-ray Images	Shervin Halat et.al.	2310.08421	null
2023-10-12	UniPAD: A Universal Pre-training Paradigm for Autonomous Driving	Honghui Yang et.al.	2310.08370	link
2023-10-12	NSM4D: Neural Scene Model Based Online 4D Point Cloud Sequence Understanding	Yuhao Dong et.al.	2310.08326	null
2023-10-12	GraphAlign: Enhancing Accurate Feature Alignment by Graph matching for Multi-Modal 3D Object Detection	Ziying Song et.al.	2310.08261	null
2023-10-12	BaSAL: Size Balanced Warm Start Active Learning for LiDAR Semantic Segmentation	Jiarong Wei et.al.	2310.08035	null
2023-10-11	HaarNet: Large-scale Linear-Morphological Hybrid Network for RGB-D Semantic Segmentation	Rick Groenendijk et.al.	2310.07669	null
2023-10-11	Context-Enhanced Detector For Building Detection From Remote Sensing Images	Ziyue Huang et.al.	2310.07638	null
2023-10-11	PeP: a Point enhanced Painting method for unified point cloud tasks	Zichao Dong et.al.	2310.07591	null
2023-10-11	Heuristic Vision Pre-Training with Self-Supervised and Supervised Multi-Task Learning	Zhiming Qian et.al.	2310.07510	null
2023-10-11	CLIP for Lightweight Semantic Segmentation	Ke Jin et.al.	2310.07394	null
2023-10-11	Causal Unsupervised Semantic Segmentation	Junho Kim et.al.	2310.07379	link
2023-10-11	Distilling Efficient Vision Transformers from CNNs for Semantic Segmentation	Xu Zheng et.al.	2310.07265	null
2023-10-11	Robust Unsupervised Domain Adaptation by Retaining Confident Entropy via Edge Concatenation	Hye-Seong Hong et.al.	2310.07149	null
2023-10-10	Utilizing Synthetic Data for Medical Vision-Language Pre-training: Bypassing the Need for Real Images	Che Liu et.al.	2310.07027	link
2023-10-10	CoinSeg: Contrast Inter- and Intra- Class Representations for Incremental Segmentation	Zekang Zhang et.al.	2310.06368	link
2023-10-09	CoBEVFusion: Cooperative Perception with LiDAR-Camera Bird’s-Eye View Fusion	Donghao Qiao et.al.	2310.06008	null
2023-10-09	Unleashing the power of Neural Collapse for Transferability Estimation	Yuhe Ding et.al.	2310.05754	null
2023-10-10	Hierarchical Side-Tuning for Vision Transformers	Weifeng Lin et.al.	2310.05393	link
2023-10-11	A Critical Look at Classic Test-Time Adaptation Methods in Semantic Segmentation	Chang’an Yi et.al.	2310.05341	link
2023-10-08	Geometry Aware Field-to-field Transformations for 3D Semantic Segmentation	Dominik Hollidt et.al.	2310.05133	null
2023-10-08	Bidirectional Knowledge Reconfiguration for Lightweight Point Cloud Analysis	Peipei Li et.al.	2310.05125	null
2023-10-08	Enhancing Representations through Heterogeneous Self-Supervised Learning	Zhong-Yu Li et.al.	2310.05108	null
2023-10-08	OV-PARTS: Towards Open-Vocabulary Part Segmentation	Meng Wei et.al.	2310.05107	link
2023-10-08	Low-Resolution Self-Attention for Semantic Segmentation	Yu-Huan Wu et.al.	2310.05026	link
2023-10-08	Human-in-the-loop: The future of Machine Learning in Automated Electron Microscopy	Sergei V. Kalinin et.al.	2310.05018	null
2023-10-08	SemST: Semantically Consistent Multi-Scale Image Translation via Structure-Texture Alignment	Ganning Zhao et.al.	2310.04995	null
2023-10-07	Federated Self-Supervised Learning of Monocular Depth Estimators for Autonomous Vehicles	Elton F. de S. Soares et.al.	2310.04837	null
2023-10-07	Combining UPerNet and ConvNeXt for Contrails Identification to reduce Global Warming	Zhenkuan Wang et.al.	2310.04808	link
2023-10-07	Towards Dynamic and Small Objects Refinement for Unsupervised Domain Adaptative Nighttime Semantic Segmentation	Jingyi Pan et.al.	2310.04747	null
2023-10-07	Activate and Reject: Towards Safe Domain Generalization under Category Shift	Chaoqi Chen et.al.	2310.04724	null
2023-10-07	Memory-Constrained Semantic Segmentation for Ultra-High Resolution UAV Imagery	Qi Li et.al.	2310.04721	null
2023-10-06	VTON-IT: Virtual Try-On using Image Translation	Santosh Adhikari et.al.	2310.04558	link
2023-10-06	Semantic segmentation of longitudinal thermal images for identification of hot and cool spots in urban areas	Vasantha Ramani et.al.	2310.04247	null
2023-10-06	DiffPrompter: Differentiable Implicit Visual Prompts for Semantic-Segmentation in Adverse Conditions	Sanket Kalwar et.al.	2310.04181	null
2023-10-06	A Deeply Supervised Semantic Segmentation Method Based on GAN	Wei Zhao et.al.	2310.04081	null
2023-10-06	Robust Multimodal Learning with Missing Modalities via Parameter-Efficient Adaptation	Md Kaykobad Reza et.al.	2310.03986	null
2023-10-05	Ammonia-Net: A Multi-task Joint Learning Model for Multi-class Segmentation and Classification in Tooth-marked Tongue Diagnosis	Shunkai Shi et.al.	2310.03472	null
2023-10-03	CLIP Is Also a Good Teacher: A New Learning Framework for Inductive Zero-shot Semantic Segmentation	Jialei Chen et.al.	2310.02296	null
2023-10-03	TransRadar: Adaptive-Directional Transformer for Real-Time Multi-View Radar Semantic Segmentation	Yahia Dalbah et.al.	2310.02260	link
2023-10-03	Exploring Model Learning Heterogeneity for Boosting Ensemble Robustness	Yanzhao Wu et.al.	2310.02237	link
2023-10-03	TreeScope: An Agricultural Robotics Dataset for LiDAR-Based Mapping of Trees in Forests and Orchards	Derek Cheng et.al.	2310.02162	link
2023-10-03	Trainable Noise Model as an XAI evaluation method: application on Sobol for remote sensing image segmentation	Hossein Shreim et.al.	2310.01828	link
2023-10-03	Predicting Future Spatiotemporal Occupancy Grids with Semantics for Autonomous Driving	Maneekwan Toyungyernsub et.al.	2310.01723	null
2023-10-02	CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction	Size Wu et.al.	2310.01403	link
2023-10-02	Efficient Remote Sensing Segmentation With Generative Adversarial Transformer	Luyi Qiu et.al.	2310.01292	null
2023-10-02	LoCUS: Learning Multiscale 3D-consistent Features from Posed Images	Dominik A. Kloepfer et.al.	2310.01095	null
2023-10-02	Improved Crop and Weed Detection with Diverse Data Ensemble Learning in Agriculture	Muhammad Hamza Asad et.al.	2310.01055	null
2023-10-02	Multi-task Learning with 3D-Aware Regularization	Wei-Hong Li et.al.	2310.00986	link
2023-10-01	Propagating Semantic Labels in Video Data	David Balaban et.al.	2310.00783	null
2023-10-01	Counterfactual Image Generation for adversarially robust and interpretable Classifiers	Rafael Bischof et.al.	2310.00761	null
2023-10-01	Win-Win: Training High-Resolution Vision Transformers from Two Windows	Vincent Leroy et.al.	2310.00632	null
2023-09-30	Technical Report of 2023 ABO Fine-grained Semantic Segmentation Competition	Zeyu Dong et.al.	2310.00427	null
2023-09-30	An easy zero-shot learning combination: Texture Sensitive Semantic Segmentation IceHrNet and Advanced Style Transfer Learning Strategy	Zhiyong Yang et.al.	2310.00310	link
2023-09-30	Dual-Augmented Transformer Network for Weakly Supervised Semantic Segmentation	Jingliang Deng et.al.	2310.00307	null
2023-10-04	Text-image Alignment for Diffusion-based Perception	Neehar Kondapaneni et.al.	2310.00031	link
2023-09-29	APNet: Urban-level Scene Segmentation of Aerial Images and Point Clouds	Weijie Wei et.al.	2309.17162	link
2023-09-29	SegRCDB: Semantic Segmentation via Formula-Driven Supervised Learning	Risa Shinoda et.al.	2309.17083	link
2023-09-29	Synthetic Data Generation and Deep Learning for the Topological Analysis of 3D Data	Dylan Peek et.al.	2309.16968	null
2023-09-29	COMNet: Co-Occurrent Matching for Weakly Supervised Semantic Segmentation	Yukun Su et.al.	2309.16959	null
2023-09-29	Model2Scene: Learning 3D Scene Representation via Contrastive Language-CAD Models Pre-training	Runnan Chen et.al.	2309.16956	null
2023-09-29	YOLOR-Based Multi-Task Learning	Hung-Shuo Chang et.al.	2309.16921	link
2023-10-02	Superpixel Transformers for Efficient Semantic Segmentation	Alex Zihao Zhu et.al.	2309.16889	null
2023-10-03	Cross-City Matters: A Multimodal Remote Sensing Benchmark Dataset for Cross-City Semantic Segmentation using High-Resolution Domain Adaptation Networks	Danfeng Hong et.al.	2309.16499	null
2023-09-28	Open Compound Domain Adaptation with Object Style Compensation for Semantic Segmentation	Tingliang Feng et.al.	2309.16127	null
2023-09-27	Rapid Network Adaptation: Learning to Adapt Neural Networks Using Test-Time Feedback	Teresa Yeo et.al.	2309.15762	null
2023-09-27	CAIT: Triple-Win Compression towards High Accuracy, Fast Inference, and Favorable Transferability For ViTs	Ao Wang et.al.	2309.15755	null
2023-09-27	InfraParis: A multi-modal and multi-task autonomous driving dataset	Gianni Franchi et.al.	2309.15751	link
2023-09-27	Factorized Diffusion Architectures for Unsupervised Image Generation and Segmentation	Xin Yuan et.al.	2309.15726	null
2023-09-27	Learning from SAM: Harnessing a Segmentation Foundation Model for Sim2Real Domain Adaptation through Regularization	Mayara E. Bonani et.al.	2309.15562	null
2023-09-27	Investigating the changes in BOLD responses during viewing of images with varied complexity: An fMRI time-series based analysis on human vision	Naveen Kanigiri et.al.	2309.15495	link
2023-09-27	The Robust Semantic Segmentation UNCV2023 Challenge Results	Xuanlong Yu et.al.	2309.15478	null
2023-09-27	Inherit with Distillation and Evolve with Contrast: Exploring Class Incremental Semantic Segmentation Without Exemplar Memory	Danpei Zhao et.al.	2309.15413	null
2023-09-27	Seeing Beyond the Patch: Scale-Adaptive Semantic Segmentation of High-resolution Remote Sensing Imagery based on Reinforcement Learning	Yinhe Liu et.al.	2309.15372	null
2023-09-26	M$^{3}$3D: Learning 3D priors using Multi-Modal Masked Autoencoders for 2D image and video understanding	Muhammad Abdullah Jamal et.al.	2309.15313	null
2023-09-26	ZiCo-BC: A Bias Corrected Zero-Shot NAS for Vision Tasks	Kartikeya Bhardwaj et.al.	2309.14666	null
2023-09-25	Dynamic Scene Graph Representation for Surgical Video	Felix Holm et.al.	2309.14538	null
2023-09-29	Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic Segmentation	Quang Nguyen et.al.	2309.14303	link
2023-09-25	CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free	Monika Wysoczańska et.al.	2309.14289	link
2023-09-25	Calibration-based Dual Prototypical Contrastive Learning Approach for Domain Generalization Semantic Segmentation	Muxin Liao et.al.	2309.14282	link
2023-09-25	Informative Data Mining for One-Shot Cross-Domain Semantic Segmentation	Yuxi Wang et.al.	2309.14241	null
2023-09-25	Masked Image Residual Learning for Scaling Deeper Vision Transformers	Guoxi Huang et.al.	2309.14136	link
2023-09-25	Small Objects Matters in Weakly-supervised Semantic Segmentation	Cheolhyun Mun et.al.	2309.14117	null
2023-09-26	AsymFormer: Asymmetrical Cross-Modal Representation Learning for Mobile Platform Real-Time RGB-D Semantic Segmentation	Siqi Du et.al.	2309.14065	link
2023-09-25	Weakly Supervised Semantic Segmentation by Knowledge Graph Inference	Jia Zhang et.al.	2309.14057	link
2023-09-24	Distribution-Aware Continual Test Time Adaptation for Semantic Segmentation	Jiayi Ni et.al.	2309.13604	link
2023-09-24	LOGICSEG: Parsing Visual Semantics with Neural Logic Learning and Reasoning	Liulei Li et.al.	2309.13556	null
2023-09-24	Towards Robust Robot 3D Perception in Urban Environments: The UT Campus Object Dataset	Arthur Zhang et.al.	2309.13549	link
2023-09-24	Bridging Semantic Gaps for Language-Supervised Semantic Segmentation	Yun Xing et.al.	2309.13505	link
2023-09-23	A Unified Scheme of ResNet and Softmax	Zhao Song et.al.	2309.13482	null
2023-09-23	FedDrive v2: an Analysis of the Impact of Label Skewness in Federated Semantic Segmentation for Autonomous Driving	Eros Fanì et.al.	2309.13336	link
2023-09-23	Discwise Active Learning for LiDAR Semantic Segmentation	Ozan Unal et.al.	2309.13276	null
2023-09-22	ClusterFormer: Clustering As A Universal Visual Learner	James C. Liang et.al.	2309.13196	link
2023-09-22	Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation	Wei Zhai et.al.	2309.12943	link
2023-09-22	Scalable Semantic 3D Mapping of Coral Reefs with Deep Learning	Jonathan Sauder et.al.	2309.12804	null
2023-09-22	Triple-View Knowledge Distillation for Semi-Supervised Semantic Segmentation	Ping Li et.al.	2309.12557	null
2023-09-21	DualToken-ViT: Position-aware Efficient Vision Transformer with Dual Token Fusion	Zhenzhen Chu et.al.	2309.12424	null
2023-09-21	MoPA: Multi-Modal Prior Aided Domain Adaptation for 3D Semantic Segmentation	Haozhi Cao et.al.	2309.11839	link
2023-09-21	2DDATA: 2D Detection Annotations Transmittable Aggregation for Semantic Segmentation on Point Cloud	Guan-Cheng Lee et.al.	2309.11755	null
2023-09-21	MoDA: Leveraging Motion Priors from Videos for Advancing Unsupervised Domain Adaptation in Semantic Segmentation	Fei Pan et.al.	2309.11711	link
2023-09-20	EPTQ: Enhanced Post-Training Quantization via Label-Free Hessian	Ofir Gordon et.al.	2309.11531	link
2023-09-20	RMT: Retentive Networks Meet Vision Transformers	Qihang Fan et.al.	2309.11523	link
2023-09-20	Towards Robust Few-shot Point Cloud Semantic Segmentation	Yating Xu et.al.	2309.11228	link
2023-09-20	Dense 2D-3D Indoor Prediction with Sound via Aligned Cross-Modal Distillation	Heeseung Yun et.al.	2309.11081	link
2023-09-21	CaveSeg: Deep Semantic Segmentation and Scene Parsing for Autonomous Underwater Cave Exploration	A. Abdullah et.al.	2309.11038	null
2023-09-19	Change of Scenery: Unsupervised LiDAR Change Detection for Mobile Robots	Alexander Krawciw et.al.	2309.10924	null
2023-09-19	Few-Shot Panoptic Segmentation With Foundation Models	Markus Käppeler et.al.	2309.10726	link
2023-09-19	Cross-modal and Cross-domain Knowledge Transfer for Label-free 3D Segmentation	Jingyu Zhang et.al.	2309.10649	null
2023-09-19	Adversarial Attacks Against Uncertainty Quantification	Emanuele Ledda et.al.	2309.10586	null
2023-09-19	SPOT: Scalable 3D Pre-training via Occupancy Prediction for Autonomous Driving	Xiangchao Yan et.al.	2309.10527	link
2023-09-19	Spatial-Assistant Encoder-Decoder Network for Real Time Semantic Segmentation	Yalun Wang et.al.	2309.10519	link
2023-09-19	RECALL+: Adversarial Web-based Replay for Continual Learning in Semantic Segmentation	Chang Liu et.al.	2309.10479	null
2023-09-19	LineMarkNet: Line Landmark Detection for Valet Parking	Zizhang Wu et.al.	2309.10475	null
2023-09-19	An Empirical Study of Attention Networks for Semantic Segmentation	Hao Guo et.al.	2309.10217	null
2023-09-18	DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation	Bowen Yin et.al.	2309.09668	link
2023-09-18	Heterogeneous Generative Knowledge Distillation with Masked Image Modeling	Ziming Wang et.al.	2309.09571	null
2023-09-18	PanoMixSwap Panorama Mixing via Structural Swapping for Indoor Scene Understanding	Yu-Cheng Hsieh et.al.	2309.09514	null
2023-09-18	Target-aware Bi-Transformer for Few-shot Segmentation	Xianglin Wang et.al.	2309.09492	null
2023-09-17	Active Learning for Semantic Segmentation with Multi-class Label Query	Sehyun Hwang et.al.	2309.09319	null
2023-09-17	CLIPUNetr: Assisting Human-robot Interface for Uncalibrated Visual Servoing Control with CLIP-driven Referring Expression Segmentation	Chen Jiang et.al.	2309.09183	null
2023-09-15	T-UDA: Temporal Unsupervised Domain Adaptation in Sequential Point Clouds	Awet Haileslassie Gebrehiwot et.al.	2309.08302	link
2023-09-14	Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation	Zhaochong An et.al.	2309.08020	link
2023-09-17	TFNet: Exploiting Temporal Cues for Fast and Accurate LiDAR Semantic Segmentation	Rong Li et.al.	2309.07849	null
2023-09-14	Large-scale Weakly Supervised Learning for Road Extraction from Satellite Imagery	Shiqiao Meng et.al.	2309.07823	null
2023-09-14	Neural Field Representations of Articulated Objects for Robotic Manipulation Planning	Phillip Grote et.al.	2309.07620	null
2023-09-14	JSMNet Improving Indoor Point Cloud Semantic and Instance Segmentation through Self-Attention and Multiscale	Shuochen Xu et.al.	2309.07425	null
2023-09-13	Automated Assessment of Critical View of Safety in Laparoscopic Cholecystectomy	Yunfan Li et.al.	2309.07330	null
2023-09-13	Lavender Autonomous Navigation with Semantic Segmentation at the Edge	Alessandro Navone et.al.	2309.06863	null
2023-09-15	Dynamic Spectrum Mixer for Visual Recognition	Zhiqiang Hu et.al.	2309.06721	null
2023-09-12	Padding-free Convolution based on Preservation of Differential Characteristics of Kernels	Kuangdai Leng et.al.	2309.06370	null
2023-09-12	Exploring Flat Minima for Domain Generalization with Large Learning Rates	Jian Zhang et.al.	2309.06337	null
2023-09-12	IBAFormer: Intra-batch Attention Transformer for Domain Generalized Semantic Segmentation	Qiyu Sun et.al.	2309.06282	null
2023-09-12	Active Label Refinement for Semantic Segmentation of Satellite Images	Tuan Pham Minh et.al.	2309.06159	null
2023-09-12	A2V: A Semi-Supervised Domain Adaptation Framework for Brain Vessel Segmentation via Two-Phase Training Angiography-to-Venography Translation	Francesco Galati et.al.	2309.06075	null
2023-09-12	Real-Time Semantic Segmentation: A Brief Survey & Comparative Study in Remote Sensing	Clifford Broni-Bediako et.al.	2309.06047	null
2023-09-15	Self-Correlation and Cross-Correlation Learning for Few-Shot Remote Sensing Image Semantic Segmentation	Linhan Wang et.al.	2309.05840	link
2023-09-11	UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase	Youquan Liu et.al.	2309.05573	link
2023-09-11	Learning Semantic Segmentation with Query Points Supervision on Aerial Images	Santiago Rivier et.al.	2309.05490	link
2023-09-11	Panoptic Vision-Language Feature Fields	Haoran Chen et.al.	2309.05448	link
2023-09-11	Towards Content-based Pixel Retrieval in Revisited Oxford and Paris	Guoyuan An et.al.	2309.05438	link
2023-09-15	DeCUR: decoupling common & unique representations for multimodal self-supervision	Yi Wang et.al.	2309.05300	link
2023-09-12	MFPNet: Multi-scale Feature Propagation Network For Lightweight Semantic Segmentation	Guoan Xu et.al.	2309.04914	null
2023-09-12	Mask2Anomaly: Mask Transformer for Universal Open-set Segmentation	Shyam Nandan Rai et.al.	2309.04573	null
2023-09-08	Long-Range Correlation Supervision for Land-Cover Classification from Remote Sensing Images	Dawen Yu et.al.	2309.04225	null
2023-09-08	From Text to Mask: Localizing Entities Using the Attention of Text-to-Image Diffusion Models	Changming Xiao et.al.	2309.04109	link
2023-09-08	Weakly Supervised Point Clouds Transformer for 3D Object Detection	Zuojin Tang et.al.	2309.04105	null
2023-09-07	Towards Comparable Knowledge Distillation in Semantic Image Segmentation	Onno Niemann et.al.	2309.03659	null
2023-09-07	BroadCAM: Outcome-agnostic Class Activation Mapping for Small-scale Weakly Supervised Applications	Jiatai Lin et.al.	2309.03509	link
2023-09-06	EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation	Nikolai Körber et.al.	2309.03244	link
2023-09-11	Exploring Semantic Consistency in Unpaired Image Translation to Generate Data for Surgical Applications	Danush Kumar Venkatesh et.al.	2309.03048	link
2023-09-06	Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter	Jinglong Wang et.al.	2309.02773	link
2023-09-05	Compressing Vision Transformers for Low-Resource Visual Learning	Eric Youn et.al.	2309.02617	link
2023-09-05	Building a Winning Team: Selecting Source Model Ensembles using a Submodular Transferability Estimation Approach	Vimal K B et.al.	2309.02429	null
2023-09-05	DCP-Net: A Distributed Collaborative Perception Network for Remote Sensing Semantic Segmentation	Zhechao Wang et.al.	2309.02230	null
2023-09-06	Large Separable Kernel Attention: Rethinking the Large Kernel Attention Design in CNN	Kin Wai Lau et.al.	2309.01439	link
2023-09-04	DAT++: Spatially Dynamic Vision Transformer with Deformable Attention	Zhuofan Xia et.al.	2309.01430	link
2023-09-04	Attention as Annotation: Generating Images and Pseudo-masks for Weakly Supervised Semantic Segmentation with Diffusion	Ryota Yoshihashi et.al.	2309.01369	null
2023-09-03	FOR-instance: a UAV laser scanning benchmark dataset for semantic and instance segmentation of individual trees	Stefano Puliti et.al.	2309.01279	null
2023-09-02	RevColV2: Exploring Disentangled Representations in Masked Image Modeling	Qi Han et.al.	2309.01005	link
2023-09-07	Exploring the Robustness of Human Parsers Towards Common Corruptions	Sanyi Zhang et.al.	2309.00938	null
2023-09-02	Fearless Luminance Adaptation: A Macro-Micro-Hierarchical Transformer for Exposure Correction	Gehui Li et.al.	2309.00872	null
2023-09-02	Deep Learning and Inverse Problems	Ali Mohammad-Djafari et.al.	2309.00802	null
2023-09-01	dacl10k: Benchmark for Semantic Bridge Damage Segmentation	Johannes Flotzinger et.al.	2309.00460	null
2023-09-01	Dense Voxel 3D Reconstruction Using a Monocular Event Camera	Haodong Chen et.al.	2309.00385	null
2023-08-31	Self-supervised Semantic Segmentation: Consistency over Transformation	Sanaz Karimijafarbigloo et.al.	2309.00143	link
2023-08-31	Laplacian-Former: Overcoming the Limitations of Vision Transformers in Local Texture Detection	Reza Azad et.al.	2309.00108	link
2023-08-31	Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation	Chaofan Ma et.al.	2309.00096	link
2023-08-31	PointOcc: Cylindrical Tri-Perspective View for Point-based 3D Semantic Occupancy Prediction	Sicheng Zuo et.al.	2308.16896	link
2023-08-31	BTSeg: Barlow Twins Regularization for Domain Adaptation in Semantic Segmentation	Johannes Künzel et.al.	2308.16819	link
2023-08-31	Towards Optimal Patch Size in Vision Transformers for Tumor Segmentation	Ramtin Mojtahedi et.al.	2308.16598	link
2023-09-01	Self-Sampling Meta SAM: Enhancing Few-shot Medical Image Segmentation with Meta-Learning	Yiming Zhang et.al.	2308.16466	link
2023-09-04	Deep Video Codec Control	Christoph Reich et.al.	2308.16215	null
2023-08-30	Semi-supervised Domain Adaptation with Inter and Intra-domain Mixing for Semantic Segmentation	Weifu Fu et.al.	2308.15855	null
2023-08-31	CongNaMul: A Dataset for Advanced Image Processing of Soybean Sprouts	Byunghyun Ban et.al.	2308.15690	null
2023-08-29	3D Adversarial Augmentations for Robust Out-of-Domain Predictions	Alexander Lehner et.al.	2308.15479	null
2023-08-29	Complementing Onboard Sensors with Satellite Map: A New Perspective for HD Map Construction	Wenjie Gao et.al.	2308.15427	link
2023-08-29	Learning to Upsample by Learning to Sample	Wenze Liu et.al.	2308.15085	link
2023-08-28	Maturity-Aware Active Learning for Semantic Segmentation with Hierarchically-Adaptive Sample Assessment	Amirsaeed Yazdani et.al.	2308.14904	link
2023-08-29	Compositional Semantic Mix for Domain Adaptation in Point Cloud Segmentation	Cristiano Saltori et.al.	2308.14619	link
2023-08-28	Semi-Supervised Learning for Visual Bird’s Eye View Semantic Segmentation	Junyu Zhu et.al.	2308.14525	link
2023-08-28	Attention-Guided Lidar Segmentation and Odometry Using Image-to-Point Cloud Saliency Transfer	Guanqun Ding et.al.	2308.14332	null
2023-08-27	Rethinking Exemplars for Continual Semantic Segmentation in Endoscopy Scenes: Entropy-based Mini-Batch Pseudo-Replay	Guankun Wang et.al.	2308.14100	null
2023-08-26	Semi-Supervised Semantic Segmentation via Marginal Contextual Information	Moshe Kimhi et.al.	2308.13900	link
2023-08-26	ReFuSeg: Regularized Multi-Modal Fusion for Precise Brain Tumour Segmentation	Aditya Kasliwal et.al.	2308.13883	null
2023-08-25	RestNet: Boosting Cross-Domain Few-Shot Segmentation with Residual Transformation Network	Xinyang Huang et.al.	2308.13469	link
2023-08-25	A Re-Parameterized Vision Transformer (ReVT) for Domain-Generalized Semantic Segmentation	Jan-Aike Termöhlen et.al.	2308.13331	link
2023-08-25	SVQNet: Sparse Voxel-Adjacent Query Network for 4D Spatio-Temporal LiDAR Semantic Segmentation	Xuechao Chen et.al.	2308.13323	null
2023-08-25	Black-box Unsupervised Domain Adaptation with Bi-directional Atkinson-Shiffrin Memory	Jingyi Zhang et.al.	2308.13236	link
2023-08-24	Enhancing Perception and Immersion in Pre-Captured Environments through Learning-Based Eye Height Adaptation	Qi Feng et.al.	2308.13042	null
2023-08-24	Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks	Xiangyang Zhu et.al.	2308.12961	link
2023-08-25	Efficient assessment of window views in high-rise, high-density urban areas using 3D color City Information Models	Maosu Li et.al.	2308.12909	null
2023-08-24	Boosting Semantic Segmentation from the Perspective of Explicit Class Embeddings	Yuhe Liu et.al.	2308.12894	null
2023-08-24	Logic-induced Diagnostic Reasoning for Semi-supervised Semantic Segmentation	Chen Liang et.al.	2308.12595	null
2023-08-24	Channel and Spatial Relation-Propagation Network for RGB-Thermal Semantic Segmentation	Zikun Zhou et.al.	2308.12534	null
2023-08-23	A Spatiotemporal Correspondence Approach to Unsupervised LiDAR Segmentation with Traffic Applications	Xiao Li et.al.	2308.12433	null
2023-08-23	Diffusion-based Image Translation with Label Guidance for Domain Adaptive Semantic Segmentation	Duo Peng et.al.	2308.12350	null
2023-08-24	ACLS: Adaptive and Conditional Label Smoothing for Network Calibration	Hyekang Park et.al.	2308.11911	null
2023-08-23	SUMMIT: Source-Free Adaptation of Uni-Modal Models to Multi-Modal Targets	Cody Simons et.al.	2308.11880	link
2023-08-22	Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations	Mohammadreza Salehi et.al.	2308.11796	link
2023-08-22	G3Reg: Pyramid Graph-based Global Registration using Gaussian Ellipsoid Model	Zhijian Qiao et.al.	2308.11573	link
2023-08-22	Food Image Classification and Segmentation with Attention-based Multiple Instance Learning	Valasia Vlachopoulou et.al.	2308.11452	null
2023-08-22	Masked Momentum Contrastive Learning for Zero-shot Semantic Understanding	Jiantao Wu et.al.	2308.11448	null
2023-08-22	Semantic RGB-D Image Synthesis	Shijie Li et.al.	2308.11356	null
2023-08-22	DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment	Xujie Zhang et.al.	2308.11206	null
2023-08-22	A three in one bottom-up framework for simultaneous semantic segmentation, instance segmentation and classification of multi-organ nuclei in digital cancer histology	Ibtihaj Ahmad et.al.	2308.11179	null
2023-08-22	Hierarchical Point-based Active Learning for Semi-supervised Point Cloud Semantic Segmentation	Zongyi Xu et.al.	2308.11166	link
2023-08-21	Beyond Discriminative Regions: Saliency Maps as Alternatives to CAMs for Weakly Supervised Semantic Segmentation	M. Maruf et.al.	2308.11052	null
2023-08-21	Diffusion Model as Representation Learner	Xingyi Yang et.al.	2308.10916	link
2023-08-21	Dataset Quantization	Daquan Zhou et.al.	2308.10524	link
2023-08-21	PHE-SICH-CT-IDS: A Benchmark CT Image Dataset for Evaluation Semantic Segmentation, Object Detection and Radiomic Feature Extraction of Perihematomal Edema in Spontaneous Intracerebral Hemorrhage	Deguo Ma et.al.	2308.10521	null
2023-08-21	SynDrone – Multi-modal UAV Dataset for Urban Scenarios	Giulia Rizzoli et.al.	2308.10491	link
2023-08-21	CVFC: Attention-Based Cross-View Feature Consistency for Weakly Supervised Semantic Segmentation of Pathology Images	Liangrui Pan et.al.	2308.10449	null
2023-08-20	Hyper Association Graph Matching with Uncertainty Quantification for Coronary Artery Semantic Labeling	Chen Zhao et.al.	2308.10320	null
2023-08-20	Efficient-VRNet: An Exquisite Fusion Network for Riverway Panoptic Perception based on Asymmetric Fair Fusion of Vision and 4D mmWave Radar	Runwei Guan et.al.	2308.10287	link
2023-08-20	EDDense-Net: Fully Dense Encoder Decoder Network for Joint Segmentation of Optic Cup and Disc	Mehwish Mehmood et.al.	2308.10192	null
2023-08-19	Anomaly-Aware Semantic Segmentation via Style-Aligned OoD Augmentation	Dan Zhang et.al.	2308.09965	null
2023-08-19	Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos	Rui Qian et.al.	2308.09951	link
2023-08-18	ResQ: Residual Quantization for Video Perception	Davide Abati et.al.	2308.09511	null
2023-08-18	Metadata Improves Segmentation Through Multitasking Elicitation	Iaroslav Plutenko et.al.	2308.09411	link
2023-08-18	Single Frame Semantic Segmentation Using Multi-Modal Spherical Images	Suresh Guttikonda et.al.	2308.09369	link
2023-08-18	Retro-FPN: Retrospective Feature Pyramid Network for Point Cloud Semantic Segmentation	Peng Xiang et.al.	2308.09314	link
2023-08-18	A review of technical factors to consider when designing neural networks for semantic segmentation of Earth Observation imagery	Sam Khallaghi et.al.	2308.09221	null
2023-08-16	ECPC-IDS: A benchmark endometrial cancer PET/CT image dataset for evaluation of semantic segmentation and detection of hypermetabolic regions	Dechao Tang et.al.	2308.08313	null
2023-08-16	MEDOE: A Multi-Expert Decoder and Output Ensemble Framework for Long-tailed Semantic Segmentation	Junao Shen et.al.	2308.08213	null
2023-08-16	AATCT-IDS: A Benchmark Abdominal Adipose Tissue CT Image Dataset for Image Denoising, Semantic Segmentation, and Radiomics Evaluation	Zhiyu Ma et.al.	2308.08172	null
2023-08-15	Future Video Prediction from a Single Frame for Video Anomaly Detection	Mohammad Baradaran et.al.	2308.07783	null
2023-08-15	Graph-Segmenter: Graph Transformer with Boundary-aware Attention for Semantic Segmentation	Zizhang Wu et.al.	2308.07592	null
2023-08-15	Confidence Contours: Uncertainty-Aware Annotation for Medical Semantic Segmentation	Andre Ye et.al.	2308.07528	link
2023-08-14	SAM Meets Robotic Surgery: An Empirical Study on Generalization, Robustness and Adaptation	An Wang et.al.	2308.07156	null
2023-08-14	ICPC: Instance-Conditioned Prompting with Contrastive Learning for Semantic Segmentation	Chaohui Yu et.al.	2308.07078	null
2023-08-14	A One Stop 3D Target Reconstruction and multilevel Segmentation Method	Jiexiong Xu et.al.	2308.06974	link
2023-08-14	Towards Open-Set Test-Time Adaptation Utilizing the Wisdom of Crowds in Entropy Minimization	Jungsoo Lee et.al.	2308.06879	null
2023-08-12	LadleNet: Translating Thermal Infrared Images to Visible Light Images Using A Scalable Two-stage U-Net	Tonghui Zou et.al.	2308.06603	link
2023-08-12	BEV-DG: Cross-Modal Learning under Bird’s-Eye View for Domain Generalization of 3D Semantic Segmentation	Miaoyu Li et.al.	2308.06530	null
2023-08-12	Seed Feature Maps-based CNN Models for LEO Satellite Remote Sensing Services	Zhichao Lu et.al.	2308.06515	null
2023-08-11	R2S100K: Road-Region Segmentation Dataset For Semi-Supervised Autonomous Driving in the Wild	Muhammad Atif Butt et.al.	2308.06393	null
2023-08-11	Defensive Perception: Estimation and Monitoring of Neural Network Performance under Deployment	Hendrik Vogt et.al.	2308.06299	null
2023-08-11	Physical Adversarial Attacks For Camera-based Smart Systems: Current Trends, Categorization, Applications, Research Challenges, and Future Outlook	Amira Guesmi et.al.	2308.06173	null
2023-08-11	DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models	Weijia Wu et.al.	2308.06160	link
2023-08-11	Spatial-information Guided Adaptive Context-aware Network for Efficient RGB-D Semantic Segmentation	Yang Zhang et.al.	2308.06024	link
2023-08-11	FoodSAM: Any Food Segmentation	Xing Lan et.al.	2308.05938	link
2023-08-11	Semantic-embedded Similarity Prototype for Scene Recognition	Chuanxin Song et.al.	2308.05896	null
2023-08-10	SegDA: Maximum Separable Segment Mask with Pseudo Labels for Domain Adaptive Semantic Segmentation	Anant Khandelwal et.al.	2308.05851	null
2023-08-10	DiLogics: Creating Web Automation Programs With Diverse Logics	Kevin Pu et.al.	2308.05828	null
2023-08-10	Masked Diffusion as Self-supervised Representation Learner	Zixuan Pan et.al.	2308.05695	link
2023-08-10	Category Feature Transformer for Semantic Segmentation	Quan Tang et.al.	2308.05581	link
2023-08-10	Look at the Neighbor: Distortion-aware Unsupervised Domain Adaptation for Panoramic Semantic Segmentation	Xu Zheng et.al.	2308.05493	null
2023-08-10	Deep Semantic Graph Matching for Large-scale Outdoor Point Clouds Registration	Shaocong Liu et.al.	2308.05314	null
2023-08-09	SegMatch: A semi-supervised learning method for surgical instrument segmentation	Meng Wei et.al.	2308.05232	null
2023-08-10	Prototypical Kernel Learning and Open-set Foreground Perception for Generalized Few-shot Semantic Segmentation	Kai Huang et.al.	2308.04952	null
2023-08-09	Branches Mutual Promotion for End-to-End Weakly Supervised Semantic Segmentation	Lei Zhu et.al.	2308.04949	null
2023-08-09	MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation	Kaixin Cai et.al.	2308.04829	null
2023-08-09	Continual Road-Scene Semantic Segmentation via Feature-Aligned Symmetric Multi-Modal Network	Francesco Barbato et.al.	2308.04702	null
2023-08-08	Semi-Supervised Semantic Segmentation of Cell Nuclei via Diffusion-based Large-Scale Pre-Training and Collaborative Learning	Zhuchen Shao et.al.	2308.04578	null
2023-08-08	All-pairs Consistency Learning for Weakly Supervised Semantic Segmentation	Weixuan Sun et.al.	2308.04321	link
2023-08-08	AICSD: Adaptive Inter-Class Similarity Distillation for Semantic Segmentation	Amir M. Mansourian et.al.	2308.04243	link
2023-08-08	PAIF: Perception-Aware Infrared-Visible Image Fusion for Attack-Tolerant Semantic Segmentation	Zhu Liu et.al.	2308.03979	link
2023-08-07	FeatEnHancer: Enhancing Hierarchical Features for Object Detection and Beyond Under Low-Light Vision	Khurram Azeem Hashmi et.al.	2308.03594	link
2023-08-11	DiT: Efficient Vision Transformers with Dynamic Token Routing	Yuchen Ma et.al.	2308.03409	link
2023-08-06	Syn-Mediverse: A Multimodal Synthetic Dataset for Intelligent Scene Understanding of Healthcare Facilities	Rohit Mohan et.al.	2308.03193	null
2023-08-06	High-Resolution Vision Transformers for Pixel-Level Identification of Structural Components and Damage	Kareem Eltouny et.al.	2308.03006	null
2023-08-06	MCTformer+: Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation	Lian Xu et.al.	2308.03005	link
2023-08-06	Cal-SFDA: Source-Free Domain-adaptive Semantic Segmentation with Differentiable Expected Calibration Error	Zixin Wang et.al.	2308.03003	link
2023-08-05	Cross-modal & Cross-domain Learning for Unsupervised LiDAR Semantic Segmentation	Yiyang Chen et.al.	2308.02883	null
2023-08-05	NP-SemiSeg: When Neural Processes meet Semi-Supervised Semantic Segmentation	Jianfeng Wang et.al.	2308.02866	link
2023-08-05	Few-shot Class-Incremental Semantic Segmentation via Pseudo-Labeling and Knowledge Distillation	Chengjia Jiang et.al.	2308.02790	link
2023-08-04	Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP	Qihang Yu et.al.	2308.02487	link
2023-08-04	Frustratingly Easy Model Generalization by Dummy Risk Minimization	Juncheng Wang et.al.	2308.02287	null
2023-08-04	On the Calibration of Uncertainty Estimation in LiDAR-based Semantic Segmentation	Mariella Dreissig et.al.	2308.02248	null
2023-08-04	Deep Semantic Model Fusion for Ancient Agricultural Terrace Detection	Yi Wang et.al.	2308.02225	link
2023-08-04	ES-MVSNet: Efficient Framework for End-to-end Self-supervised Multi-View Stereo	Qiang Zhou et.al.	2308.02191	null
2023-08-04	Synthetic outlier generation for anomaly detection in autonomous driving	Martin Bikandi et.al.	2308.02184	null
2023-08-04	Semantics-guided Transformer-based Sensor Fusion for Improved Waypoint Prediction	Hwan-Soo Choi et.al.	2308.02126	link
2023-08-04	Rethinking Class Activation Maps for Segmentation: Revealing Semantic Information in Shallow Layers by Reducing Noise	Hang-Cheng Dong et.al.	2308.02118	null
2023-08-03	Dynamic Token-Pass Transformers for Semantic Segmentation	Yuang Liu et.al.	2308.01944	null
2023-08-03	LiDAR-Camera Panoptic Segmentation via Geometry-Consistent and Semantic-Aware Alignment	Zhiwei Zhang et.al.	2308.01686	link
2023-08-03	Assessing Systematic Weaknesses of DNNs using Counterfactuals	Sujan Sai Gannamaneni et.al.	2308.01614	null
2023-08-03	Target-point Attention Transformer: A novel trajectory predict network for end-to-end autonomous driving	Jingyu Du et.al.	2308.01496	null
2023-08-02	DiffusePast: Diffusion-based Generative Replay for Class Incremental Semantic Segmentation	Jingfan Chen et.al.	2308.01127	null
2023-08-02	Dynamic Token Pruning in Plain Vision Transformers for Semantic Segmentation	Quan Tang et.al.	2308.01045	null
2023-08-02	Training-Free Instance Segmentation from Semantic Image Segmentation Masks	Yuchen Shen et.al.	2308.00949	link
2023-08-01	MonoNext: A 3D Monocular Object Detection with ConvNext	Marcelo Eduardo Pederiva et.al.	2308.00596	null
2023-08-01	A Satellite Imagery Dataset for Long-Term Sustainable Development in United States Cities	Yanxin Xi et.al.	2308.00465	link
2023-08-01	Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding	Runyu Ding et.al.	2308.00353	null
2023-08-01	Improving Pixel-based MIM by Reducing Wasted Modeling Capability	Yuan Liu et.al.	2308.00261	link
2023-07-31	Multispectral Image Segmentation in Agriculture: A Comprehensive Study on Fusion Approaches	Nuno Cunha et.al.	2308.00159	link
2023-07-29	A 3D deep learning classifier and its explainability when assessing coronary artery disease	Wing Keung Cheung et.al.	2308.00009	null
2023-08-02	Universal Adversarial Defense in Remote Sensing Based on Pre-trained Denoising Diffusion Models	Weikang Yu et.al.	2307.16865	link
2023-07-31	Transferable Attack for Semantic Segmentation	Mengqi He et.al.	2307.16572	link
2023-07-29	CMDA: Cross-Modality Domain Adaptation for Nighttime Semantic Segmentation	Ruihao Xia et.al.	2307.15942	link
2023-07-28	OAFuser: Towards Omni-Aperture Fusion for Light Field Semantic Segmentation of Road Scenes	Fei Teng et.al.	2307.15588	link
2023-07-27	To Adapt or Not to Adapt? Real-Time Adaptation for Semantic Segmentation	Marc Botet Colomer et.al.	2307.15063	link
2023-07-31	pCTFusion: Point Convolution-Transformer Fusion with Semantic Aware Loss for Outdoor LiDAR Point Cloud Segmentation	Abhishek Kuriyal et.al.	2307.14777	link
2023-07-27	GenCo: An Auxiliary Generator from Contrastive Learning for Enhanced Few-Shot Learning in Remote Sensing	Jing Wu et.al.	2307.14612	null
2023-07-27	MCPA: Multi-scale Cross Perceptron Attention Network for 2D Medical Image Segmentation	Liang Xu et.al.	2307.14588	link
2023-07-26	Self-supervised Few-shot Learning for Semantic Segmentation: An Annotation-free Approach	Sanaz Karimijafarbigloo et.al.	2307.14446	link
2023-07-26	Fluorescent Neuronal Cells v2: Multi-Task, Multi-Format Annotations for Deep Learning in Microscopy	Luca Clissa et.al.	2307.14243	null
2023-07-26	Resolution-Aware Design of Atrous Rates for Semantic Segmentation Networks	Bum Jun Kim et.al.	2307.14179	null
2023-07-27	Pre-Training with Diffusion models for Dental Radiography segmentation	Jérémy Rousseau et.al.	2307.14066	null
2023-07-31	Causal reasoning in typical computer vision tasks	Kexuan Zhang et.al.	2307.13992	null
2023-07-26	Topology-aware Robust Optimization for Out-of-distribution Generalization	Fengchun Qiao et.al.	2307.13943	link
2023-07-26	Improving Semi-Supervised Semantic Segmentation with Dual-Level Siamese Structure Network	Zhibo Tain et.al.	2307.13938	link
2023-07-25	Optical Flow boosts Unsupervised Localization and Segmentation	Xinyu Zhang et.al.	2307.13640	link
2023-07-25	Fashion Matrix: Editing Photos by Just Talking	Zheng Chong et.al.	2307.13240	link
2023-07-25	Image Segmentation Keras : Implementation of Segnet, FCN, UNet, PSPNet and other models in Keras	Divam Gupta et.al.	2307.13215	link
2023-07-24	Compact & Capable: Harnessing Graph Neural Networks and Edge Convolution for Medical Image Classification	Aryan Singh et.al.	2307.12790	link
2023-07-24	CarPatch: A Synthetic Benchmark for Radiance Field Evaluation on Vehicle Components	Davide Di Nucci et.al.	2307.12718	null
2023-07-24	MC-JEPA: A Joint-Embedding Predictive Architecture for Self-Supervised Learning of Motion and Content Features	Adrien Bardes et.al.	2307.12698	null
2023-07-24	Damage Vision Mining Opportunity for Imbalanced Anomaly Detection	Takato Yasuno et.al.	2307.12676	null
2023-07-24	PRIOR: Prototype Representation Joint Learning from Medical Images and Reports	Pujin Cheng et.al.	2307.12577	link
2023-07-24	A Good Student is Cooperative and Reliable: CNN-Transformer Collaborative Learning for Semantic Segmentation	Jinjing Zhu et.al.	2307.12574	null
2023-07-23	EnTri: Ensemble Learning with Tri-level Representations for Explainable Scene Recognition	Amirhossein Aminimehr et.al.	2307.12442	null
2023-07-23	ComPtr: Towards Diverse Bi-source Dense Prediction Tasks via A Simple yet General Complementary Transformer	Youwei Pang et.al.	2307.12349	link
2023-07-22	Morphology-inspired Unsupervised Gland Segmentation via Selective Semantic Grouping	Qixiang Zhang et.al.	2307.11989	link
2023-07-25	CORE: Cooperative Reconstruction for Multi-Agent Perception	Binglu Wang et.al.	2307.11514	link
2023-07-21	SA-BEV: Generating Semantic-Aware Bird’s-Eye-View Feature for Multi-view 3D Object Detection	Jinqing Zhang et.al.	2307.11477	link
2023-07-20	Spinal nerve segmentation method and dataset construction in endoscopic surgical scenarios	Shaowu Peng et.al.	2307.10955	link
2023-07-20	Label Calibration for Semantic Segmentation Under Domain Shift	Ondrej Bohdal et.al.	2307.10842	null
2023-07-20	Gradient-Semantic Compensation for Incremental Semantic Segmentation	Wei Cong et.al.	2307.10822	null
2023-07-22	TwinLiteNet: An Efficient and Lightweight Model for Driveable Area and Lane Segmentation in Self-Driving Cars	Quang Huy Che et.al.	2307.10705	link
2023-07-19	CPCM: Contextual Point Cloud Modeling for Weakly-supervised Point Cloud Semantic Segmentation	Lizhao Liu et.al.	2307.10316	link
2023-07-18	Towards Automated Semantic Segmentation in Mammography Images	Cesar A. Sierra-Franco et.al.	2307.10296	null
2023-07-17	On the Real-Time Semantic Segmentation of Aphid Clusters in the Wild	Raiyan Rahman et.al.	2307.10267	null
2023-07-19	Boundary-Refined Prototype Generation: A General End-to-End Paradigm for Semi-Supervised Semantic Segmentation	Junhao Dong et.al.	2307.10097	link
2023-07-19	U-CE: Uncertainty-aware Cross-Entropy for Semantic Segmentation	Steven Landgraf et.al.	2307.09947	null
2023-07-19	Space Engage: Collaborative Space Supervision for Contrastive-based Semi-Supervised Semantic Segmentation	Changqi Wang et.al.	2307.09755	null
2023-07-19	ClickSeg: 3D Instance Segmentation with Click-Level Weak Annotations	Leyao Liu et.al.	2307.09732	null
2023-07-14	LEST: Large-scale LiDAR Semantic Segmentation with Transformer	Chuanyu Luo et.al.	2307.09367	null
2023-07-19	Disentangle then Parse: Night-time Semantic Segmentation with Illumination Disentanglement	Zhixiang Wei et.al.	2307.09362	link
2023-07-18	MarS3D: A Plug-and-Play Motion-Aware Model for Semantic Segmentation on Multi-Scan 3D Point Clouds	Jiahui Liu et.al.	2307.09316	link
2023-07-18	CG-fusion CAM: Online segmentation of laser-induced damage on large-aperture optics	Yueyue Han et.al.	2307.09161	null
2023-07-18	Mining of Single-Class by Active Learning for Semantic Segmentation	Hugues Lambert et.al.	2307.09109	null
2023-07-18	EgoVM: Achieving Precise Ego-Localization using Lightweight Vectorized Maps	Yuzhe He et.al.	2307.08991	null
2023-07-19	Similarity Min-Max: Zero-Shot Day-Night Domain Adaptation	Rundong Luo et.al.	2307.08779	null
2023-07-17	A Nested U-Structure for Instrument Segmentation in Robotic Surgery	Yanjie Xia et.al.	2307.08630	null
2023-07-17	Scale-Aware Modulation Meet Transformer	Weifeng Lin et.al.	2307.08579	link
2023-07-17	Variational Probabilistic Fusion Network for RGB-T Semantic Segmentation	Baihong Lin et.al.	2307.08536	null
2023-07-17	On Point Affiliation in Feature Upsampling	Wenze Liu et.al.	2307.08198	link
2023-07-16	HRHD-HK: A benchmark dataset of high-rise and high-density urban scenes for 3D semantic segmentation of photogrammetric point clouds	Maosu Li et.al.	2307.07976	link
2023-07-16	Dual-level Interaction for Domain Adaptive Semantic Segmentation	Dongyu Yao et.al.	2307.07972	link
2023-07-15	Improving Translation Invariance in Convolutional Neural Networks with Peripheral Prediction Padding	Kensuke Mukai et.al.	2307.07725	null
2023-07-15	PSGformer: Enhancing 3D Point Cloud Instance Segmentation via Precise Semantic Guidance	Lei Pan et.al.	2307.07708	null
2023-07-14	A scoping review on multimodal deep learning in biomedical images and texts	Zhaoyi Sun et.al.	2307.07362	null
2023-07-14	Inverse Evolution Layers: Physics-informed Regularizers for Deep Neural Networks	Chaoyu Liu et.al.	2307.07344	null
2023-07-14	HEAL-SWIN: A Vision Transformer On The Sphere	Oscar Carlsson et.al.	2307.07313	link
2023-07-14	Adaptive Region Selection for Active Learning in Whole Slide Image Semantic Segmentation	Jingna Qiu et.al.	2307.07168	link
2023-07-13	YOLIC: An Efficient Method for Object Localization and Classification on Edge Devices	Kai Su et.al.	2307.06689	link
2023-07-13	WaterScenes: A Multi-Task 4D Radar-Camera Fusion Dataset and Benchmark for Autonomous Driving on Water Surfaces	Shanliang Yao et.al.	2307.06505	link
2023-07-12	Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution	Mostafa Dehghani et.al.	2307.06304	null
2023-07-12	OG: Equip vision occupancy with instance segmentation and visual grounding	Zichao Dong et.al.	2307.05873	null
2023-07-11	Automatic Generation of Semantic Parts for Face Image Synthesis	Tomaso Fontanini et.al.	2307.05317	link
2023-07-11	Estimating label quality and errors in semantic segmentation data via any model	Vedang Lad et.al.	2307.05080	link
2023-07-10	Test-Time Adaptation for Nighttime Color-Thermal Semantic Segmentation	Yexin Liu et.al.	2307.04470	null
2023-07-10	Stroke Extraction of Chinese Character Based on Deep Structure Deformable Image Registration	Meng Li et.al.	2307.04341	link
2023-07-09	Mx2M: Masked Cross-Modality Modeling in Domain Adaptation for 3D Semantic Segmentation	Boxiang Zhang et.al.	2307.04231	null
2023-07-11	Parametric Depth Based Feature Representation Learning for Object Detection and Segmentation in Bird’s Eye View	Jiayu Yang et.al.	2307.04106	null
2023-07-09	Enhancing Building Semantic Segmentation Accuracy with Super Resolution and Deep Learning: Investigating the Impact of Spatial Resolution on Various Datasets	Zhiling Guo et.al.	2307.04101	null
2023-07-09	CMDFusion: Bidirectional Fusion Network with Cross-modality Knowledge Distillation for LIDAR Semantic Segmentation	Jun Cen et.al.	2307.04091	link
2023-07-08	Building and Road Segmentation Using EffUNet and Transfer Learning Approach	Sahil Gangurde et.al.	2307.03980	null
2023-07-07	Transfer Learning of Semantic Segmentation Methods for Identifying Buried Archaeological Structures on LiDAR Data	Paolo Soleni et.al.	2307.03512	null
2023-07-07	Large AI Model-Based Semantic Communications	Feibo Jiang et.al.	2307.03492	null
2023-07-07	A Deep Active Contour Model for Delineating Glacier Calving Fronts	Konrad Heidler et.al.	2307.03461	null
2023-07-07	General-Purpose Multimodal Transformer meets Remote Sensing Semantic Segmentation	Nhi Kieu et.al.	2307.03388	link
2023-07-06	To pretrain or not to pretrain? A case study of domain-specific pretraining for semantic segmentation in histopathology	Tushar Kataria et.al.	2307.03275	link
2023-07-10	Art Authentication with Vision Transformers	Ludovica Schaerf et.al.	2307.03039	null
2023-07-05	Spherical Feature Pyramid Networks For Semantic Segmentation	Thomas Walker et.al.	2307.02658	null
2023-07-05	AxonCallosumEM Dataset: Axon Semantic Segmentation of Whole Corpus Callosum cross section from EM Images	Ao Cheng et.al.	2307.02464	null
2023-07-05	RADiff: Controllable Diffusion Models for Radio Astronomical Maps Generation	Renato Sortino et.al.	2307.02392	null
2023-07-05	Prompting Diffusion Representations for Cross-Domain Semantic Segmentation	Rui Gong et.al.	2307.02138	null
2023-07-05	Line Graphics Digitization: A Step Towards Full Automation	Omar Moured et.al.	2307.02065	link
2023-07-05	Multi-Modal Prototypes for Open-Set Semantic Segmentation	Yuhuan Yang et.al.	2307.02003	null
2023-07-05	The KiTS21 Challenge: Automatic segmentation of kidneys, renal tumors, and renal cysts in corticomedullary-phase CT	Nicholas Heller et.al.	2307.01984	link
2023-07-04	Augment Features Beyond Color for Domain Generalized Segmentation	Qiyu Sun et.al.	2307.01703	null
2023-07-04	Exploiting Richness of Learned Compressed Representation of Images for Semantic Segmentation	Ravi Kakaiya et.al.	2307.01524	null
2023-07-04	Semantic Segmentation on 3D Point Clouds with High Density Variations	Ryan Faulkner et.al.	2307.01489	null
2023-07-03	MeT: A Graph Transformer for Semantic Segmentation of 3D Meshes	Giuseppe Vecchio et.al.	2307.01115	null
2023-07-03	TomatoDIFF: On-plant Tomato Segmentation with Denoising Diffusion Models	Marija Ivanovska et.al.	2307.01064	link
2023-07-03	DifFSS: Diffusion Model for Few-Shot Semantic Segmentation	Weimin Tan et.al.	2307.00773	link
2023-07-03	Hierarchical Open-vocabulary Universal Image Segmentation	Xudong Wang et.al.	2307.00764	link
2023-07-02	Intra- & Extra-Source Exemplar-Based Style Synthesis for Improved Domain Generalization	Yumeng Li et.al.	2307.00648	link
2023-07-01	Learning Content-enhanced Mask Transformer for Domain Generalized Urban-Scene Segmentation	Qi Bi et.al.	2307.00371	link
2023-07-01	SyMFM6D: Symmetry-aware Multi-directional Fusion for Multi-View 6D Object Pose Estimation	Fabian Duffhauss et.al.	2307.00306	link
2023-07-01	Efficient Subclass Segmentation in Medical Images	Linrui Dai et.al.	2307.00257	link
2023-07-01	Internal-External Boundary Attention Fusion for Glass Surface Segmentation	Dongshen Han et.al.	2307.00212	null
2023-06-30	Obscured Wildfire Flame Detection By Temporal Analysis of Smoke Patterns Captured by Unmanned Aerial Systems	Uma Meleti et.al.	2307.00104	null
2023-06-30	Prompting classes: Exploring the Power of Prompt Class Learning in Weakly Supervised Semantic Segmentation	Balamurali Murugesan et.al.	2307.00097	link
2023-06-30	Achieving RGB-D level Segmentation Performance from a Single ToF Camera	Pranav Sharma et.al.	2306.17636	null
2023-06-28	Analysis of LiDAR Configurations on Off-road Semantic Segmentation Performance	Jinhee Yu et.al.	2306.16551	null
2023-06-28	Land Cover Segmentation with Sparse Annotations from Sentinel-2 Imagery	Marco Galatola et.al.	2306.16252	link
2023-07-03	GraSS: Contrastive Learning with Gradient Guided Sampling Strategy for Remote Sensing Image Semantic Segmentation	Zhaoyang Zhang et.al.	2306.15868	link
2023-06-27	What a MESS: Multi-Domain Evaluation of Zero-Shot Semantic Segmentation	Benedikt Blumenstiel et.al.	2306.15521	link
2023-06-27	Enhancing Navigation Benchmarking and Perception Data Generation for Row-based Crops in Simulation	Mauro Martini et.al.	2306.15517	null
2023-06-27	SSC-RS: Elevate LiDAR Semantic Scene Completion with Representation Separation and BEV Fusion	Jianbiao Mei et.al.	2306.15349	link
2023-06-27	Hierarchical Dense Correlation Distillation for Few-Shot Segmentation-Extended Abstract	Bohao Peng et.al.	2306.15278	null
2023-06-27	Semantic Segmentation Using Super Resolution Technique as Pre-Processing	Chih-Chia Chen et.al.	2306.15218	null
2023-06-28	MIMIC: Masked Image Modeling with Image Correspondences	Kalyani Marathe et.al.	2306.15128	link
2023-06-26	Localized Text-to-Image Generation for Free via Cross Attention Control	Yutong He et.al.	2306.14636	null
2023-06-26	AME-CAM: Attentive Multiple-Exit CAM for Weakly Supervised Segmentation on MRI Brain Tumor	Yu-Jen Chen et.al.	2306.14505	link
2023-06-25	On Evaluating the Adversarial Robustness of Semantic Segmentation Models	Levente Halmosi et.al.	2306.14217	null
2023-06-25	The Second-place Solution for CVPR VISION 23 Challenge Track 1 – Data Efficient Defect Detection	Xian Tao et.al.	2306.14116	link
2023-06-25	When SAM Meets Sonar Images	Lin Wang et.al.	2306.14109	link
2023-06-24	Semantic Segmentation of Porosity in 4D Spatio-Temporal X-ray μCT of Titanium Coated Ni wires using Deep Learning	Pradyumna Elavarthi et.al.	2306.14039	null
2023-06-23	OpenMask3D: Open-Vocabulary 3D Instance Segmentation	Ayça Takmaz et.al.	2306.13631	link
2023-06-23	3DSAM-adapter: Holistic Adaptation of SAM from 2D to 3D for Promptable Medical Image Segmentation	Shizhan Gong et.al.	2306.13465	link
2023-06-22	Robust Semantic Segmentation: Strong Adversarial Attacks and Fast Training of Robust Models	Francesco Croce et.al.	2306.12941	link
2023-06-21	Multi-Task Consistency for Active Learning	Aral Hekimoglu et.al.	2306.12398	null
2023-06-20	No Wrong Turns: The Simple Geometry Of Neural Networks Optimization Paths	Charles Guille-Escuret et.al.	2306.11922	link
2023-06-20	Using super-resolution for enhancing visual perception and segmentation performance in veterinary cytology	Jakub Caputa et.al.	2306.11848	null
2023-06-26	Hyperbolic Active Learning for Semantic Segmentation under Domain Shift	Luca Franco et.al.	2306.11180	link
2023-06-19	Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation	Shuting He et.al.	2306.11087	link
2023-06-19	A spatio-temporal network for video semantic segmentation in surgical videos	Maria Grammatikopoulou et.al.	2306.11052	null
2023-06-18	Balanced Energy Regularization Loss for Out-of-distribution Detection	Hyunjun Choi et.al.	2306.10485	link
2023-06-17	Residual Spatial Fusion Network for RGB-Thermal Semantic Segmentation	Ping Li et.al.	2306.10364	null
2023-06-17	Benchmarking Deep Learning Architectures for Urban Vegetation Points Segmentation	Aditya et.al.	2306.10274	null
2023-06-16	ALP: Action-Aware Embodied Learning for Perception	Xinran Liang et.al.	2306.10190	null
2023-06-16	Enhancing Visual Domain Adaptation with Source Preparation	Anirudha Ramesh et.al.	2306.10142	null
2023-06-16	PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation	Yuqi Wang et.al.	2306.10013	link
2023-06-15	SSL4EO-L: Datasets and Foundation Models for Landsat Imagery	Adam J. Stewart et.al.	2306.09424	link
2023-06-15	Infinite Photorealistic Worlds using Procedural Generation	Alexander Raistrick et.al.	2306.09310	link
2023-06-15	Neural World Models for Computer Vision	Anthony Hu et.al.	2306.09179	null
2023-06-15	Contrast, Stylize and Adapt: Unsupervised Contrastive Learning Framework for Domain Adaptive Semantic Segmentation	Tianyu Li et.al.	2306.09098	link
2023-06-15	A Self-Supervised Miniature One-Shot Texture Segmentation (MOSTS) Model for Real-Time Robot Navigation and Embedded Applications	Yu Chen et.al.	2306.08814	link
2023-06-13	BPKD: Boundary Privileged Knowledge Distillation For Semantic Segmentation	Liyang Liu et.al.	2306.08075	link
2023-06-13	Efficient 3D Semantic Segmentation with Superpoint Transformer	Damien Robert et.al.	2306.08045	link
2023-06-13	Low-Resource White-Box Semantic Segmentation of Supporting Towers on 3D Point Clouds via Signature Shape Identification	Diogo Lavado et.al.	2306.07809	null
2023-06-12	Video-to-Music Recommendation using Temporal Alignment of Segments	Laure Prétet et.al.	2306.07187	null
2023-06-12	Volume-DROID: A Real-Time Implementation of Volumetric Mapping with DROID-SLAM	Peter Stratton et.al.	2306.06850	link
2023-06-12	AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation	Kashu Yamazaki et.al.	2306.06842	link
2023-06-11	3rd Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation	Jinming Su et.al.	2306.06753	null
2023-06-09	SegViTv2: Exploring Efficient and Continual Semantic Segmentation with Plain Vision Transformers	Bowen Zhang et.al.	2306.06289	link
2023-06-09	Data-Link: High Fidelity Manufacturing Datasets for Model2Real Transfer under Industrial Settings	Sunny Katyara et.al.	2306.05766	null
2023-06-09	Illumination Controllable Dehazing Network based on Unsupervised Retinex Embedding	Jie Gui et.al.	2306.05675	link
2023-06-08	A Novel Confidence Induced Class Activation Mapping for MRI Brain Tumor Segmentation	Yu-Jen Chen et.al.	2306.05476	link
2023-06-08	Mesh-MLP: An all-MLP Architecture for Mesh Classification and Semantic Segmentation	Qiujie Dong et.al.	2306.05246	null
2023-06-08	Unsupervised augmentation optimization for few-shot medical image segmentation	Quan Quan et.al.	2306.05107	null
2023-06-08	Improving Visual Prompt Tuning for Self-supervised Vision Transformers	Seungryong Yoo et.al.	2306.05067	link
2023-06-08	A Dynamic Feature Interaction Framework for Multi-task Visual Perception	Yuling Xi et.al.	2306.05061	null
2023-06-08	Neighborhood Attention Makes the Encoder of ResUNet Stronger for Accurate Road Extraction	Ali Jamali et.al.	2306.04947	link
2023-06-07	UniBoost: Unsupervised Unimodal Pre-training for Boosting Zero-shot Vision-Language Tasks	Yanan Sun et.al.	2306.04715	null
2023-06-06	DenseDINO: Boosting Dense Self-Supervised Learning with Token-Based Point-Level Consistency	Yike Yuan et.al.	2306.04654	null
2023-06-07	PhenoBench – A Large Dataset and Benchmarks for Semantic Image Interpretation in the Agricultural Domain	Jan Weyler et.al.	2306.04557	link
2023-06-14	CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation	Boyuan Sun et.al.	2306.04300	link
2023-06-07	Randomized 3D Scene Generation for Generalizable Self-supervised Pre-training	Lanxiao Li et.al.	2306.04237	null
2023-06-06	Accurate Fine-Grained Segmentation of Human Anatomy in Radiographs via Volumetric Pseudo-Labeling	Constantin Seibold et.al.	2306.03934	link
2023-06-06	Towards Label-free Scene Understanding by Vision Foundation Models	Runnan Chen et.al.	2306.03899	link
2023-06-06	Conditional Diffusion Models for Weakly Supervised Medical Image Segmentation	Xinrong Hu et.al.	2306.03878	link
2023-06-06	Single-Shot Global Localization via Graph-Theoretic Correspondence Matching	Shigemichi Matsuzaki et.al.	2306.03641	null
2023-06-06	Semantic Segmentation on VSPW Dataset through Contrastive Loss and Multi-dataset Training Approach	Min Yan et.al.	2306.03508	null
2023-06-08	DFormer: Diffusion-guided Transformer for Universal Image Segmentation	Hefeng Wang et.al.	2306.03437	link
2023-06-06	SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation	Xuewei Li et.al.	2306.03403	link
2023-06-05	Recyclable Semi-supervised Method Based on Multi-model Ensemble for Video Scene Parsing	Biao Wu et.al.	2306.02894	null
2023-06-05	Learning from Multi-View Representation for Point-Cloud Pre-Training	Siming Yan et.al.	2306.02558	null
2023-06-04	Using Unreliable Pseudo-Labels for Label-Efficient Semantic Segmentation	Haochen Wang et.al.	2306.02314	null
2023-06-04	Cross-CBAM: A Lightweight network for Scene Segmentation	Zhengbin Zhang et.al.	2306.02306	null
2023-06-06	3rd Place Solution for PVUW2023 VSS Track: A Large Model for Semantic Segmentation on VSPW	Shijie Chang et.al.	2306.02291	link
2023-06-03	Content-aware Token Sharing for Efficient Semantic Segmentation with Vision Transformers	Chenyang Lu et.al.	2306.02095	link
2023-06-03	Balancing Logit Variation for Long-tailed Semantic Segmentation	Yuchao Wang et.al.	2306.02061	link
2023-06-03	Efficient Multi-Grained Knowledge Reuse for Class Incremental Segmentation	Zhihe Lu et.al.	2306.02027	link
2023-06-02	Denoising Diffusion Semantic Segmentation with Mask Prior Modeling	Zeqiang Lai et.al.	2306.01721	link
2023-06-02	Towards In-context Scene Understanding	Ivana Balažević et.al.	2306.01667	null
2023-06-02	Towards Source-free Domain Adaptive Semantic Segmentation via Importance-aware and Prototype-contrast Learning	Yihong Cao et.al.	2306.01598	link
2023-06-05	Robust and Generalisable Segmentation of Subtle Epilepsy-causing Lesions: a Graph Convolutional Approach	Hannah Spitzer et.al.	2306.01375	link
2023-06-01	Geo-Tiles for Semantic Segmentation of Earth Observation Imagery	Sebastian Bullinger et.al.	2306.00823	link
2023-06-01	Exploring Open-Vocabulary Semantic Segmentation without Human Labels	Jun Chen et.al.	2306.00450	null
2023-05-31	Fast-SNN: Fast Spiking Neural Network by Converting Quantized ANN	Yangfan Hu et.al.	2305.19868	link
2023-06-01	Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards	Guian Fang et.al.	2305.19599	link
2023-05-30	TrueDeep: A systematic approach of crack detection with less data	Ram Krishna Pandey et.al.	2305.19088	null
2023-05-28	Semantic Segmentation with Bidirectional Language Models Improves Long-form ASR	W. Ronny Huang et.al.	2305.18419	null
2023-05-29	Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising	Fu-Yun Wang et.al.	2305.18264	link
2023-05-29	Contrastive Learning Based Recursive Dynamic Multi-Scale Network for Image Deraining	Zhiying Jiang et.al.	2305.18092	null
2023-05-29	CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion Models	Zhongxi Chen et.al.	2305.17932	link
2023-05-27	Condition-Invariant Semantic Segmentation	Christos Sakaridis et.al.	2305.17349	link
2023-05-26	SSSegmenation: An Open Source Supervised Semantic Segmentation Toolbox Based on PyTorch	Zhenchao Jin et.al.	2305.17091	link
2023-05-26	Maskomaly: Zero-Shot Mask Anomaly Segmentation	Jan Ackermann et.al.	2305.16972	null
2023-05-26	Semantic segmentation of sparse irregular point clouds for leaf/wood discrimination	Yuchen Bai et.al.	2305.16963	link
2023-05-26	Localization under consistent assumptions over dynamics	Matti Pekkanen et.al.	2305.16702	null
2023-05-25	GrowSP: Unsupervised Semantic Segmentation of 3D Point Clouds	Zihui Zhang et.al.	2305.16404	link
2023-05-25	Making Vision Transformers Truly Shift-Equivariant	Renan A. Rojas-Gomez et.al.	2305.16316	null
2023-05-25	Interactive Segment Anything NeRF with Feature Imitation	Xiaokang Chen et.al.	2305.16233	null
2023-05-26	Energy-based Detection of Adverse Weather Effects in LiDAR Data	Aldi Piroli et.al.	2305.16129	link
2023-05-25	DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification	Sitian Shen et.al.	2305.15957	null
2023-05-25	Knowledge Diffusion for Distillation	Tao Huang et.al.	2305.15712	link

image restoration

Publish Date	Title	Authors	PDF	Code
2025-07-20	Exploring Scalable Unified Modeling for General Low-Level Vision	Xiangyu Chen et.al.	2507.14801	null
2025-07-18	Global Modeling Matters: A Fast, Lightweight and Effective Baseline for Efficient Image Restoration	Xingyu Jiang et.al.	2507.13663	null
2025-07-16	Unsupervised Part Discovery via Descriptor-Based Masked Image Restoration with Optimized Constraints	Jiahao Xia et.al.	2507.11985	null
2025-07-14	Expert Operational GANS: Towards Real-Color Underwater Image Restoration	Ozer Can Devecioglu et.al.	2507.11562	null
2025-07-14	RefSTAR: Blind Facial Image Restoration with Reference Selection, Transfer, and Reconstruction	Zhicun Yin et.al.	2507.10470	null
2025-07-14	On a class of forward-backward reaction-diffusion systems with local and nonlocal coupling for image restoration	Yihui Tong et.al.	2507.10393	null
2025-07-11	Single-Step Latent Diffusion for Underwater Image Restoration	Jiayi Wu et.al.	2507.07878	null
2025-07-10	Degradation-Agnostic Statistical Facial Feature Transformation for Blind Face Restoration in Adverse Weather Conditions	Chang-Hwan Son et.al.	2507.07464	null
2025-07-09	Enhancing Diffusion Model Stability for Image Restoration via Gradient Management	Hongjie Wu et.al.	2507.06656	null
2025-07-08	Kernel Density Steering: Inference-Time Scaling via Mode Seeking for Image Restoration	Yuyang Hu et.al.	2507.05604	null
2025-07-07	Simulating Refractive Distortions and Weather-Induced Artifacts for Resource-Constrained Autonomous Perception	Moseli Mots’oehli et.al.	2507.05536	null
2025-07-06	Quick Bypass Mechanism of Zero-Shot Diffusion-Based Image Restoration	Yu-Shan Tai et.al.	2507.04207	null
2025-07-04	LD-RPS: Zero-Shot Unified Image Restoration via Latent Diffusion Recurrent Posterior Sampling	Huaqiu Li et.al.	2507.00790	null
2025-07-01	Laplace-Mamba: Laplace Frequency Prior-Guided Mamba-CNN Fusion Network for Image Dehazing	Yongzhen Wang et.al.	2507.00501	null
2025-06-29	Double-Diffusion: Diffusion Conditioned Diffusion Probabilistic Model For Air Quality Prediction	Hanlin Dong et.al.	2506.23053	null
2025-06-27	EAMamba: Efficient All-Around Vision State Space Model for Image Restoration	Yu-Cheng Lin et.al.	2506.22246	null
2025-06-26	Elucidating and Endowing the Diffusion Training Paradigm for General Image Restoration	Xin Lu et.al.	2506.21722	null
2025-07-08	Wild refitting for black box prediction	Martin J. Wainwright et.al.	2506.21460	null
2025-06-25	TDiR: Transformer based Diffusion for Image Restoration Tasks	Abbas Anwar et.al.	2506.20302	null
2025-06-24	A Comparative Study of NAFNet Baselines for Image Restoration	Vladislav Esaulov et.al.	2506.19845	null
2025-06-24	NAADA: A Noise-Aware Attention Denoising Autoencoder for Dental Panoramic Radiographs	Khuram Naveed et.al.	2506.19387	null
2025-06-23	Enhancing Image Restoration Transformer via Adaptive Translation Equivariance	JiaKui Hu et.al.	2506.18520	null
2025-06-23	BSMamba: Brightness and Semantic Modeling for Long-Range Interaction in Low-Light Image Enhancement	Tongshun Zhang et.al.	2506.18346	null
2025-06-20	Reversing Flow for Image Restoration	Haina Qin et.al.	2506.16961	null
2025-06-20	Visual-Instructed Degradation Diffusion for All-in-One Image Restoration	Wenyang Luo et.al.	2506.16960	link
2025-06-23	RealSR-R1: Reinforcement Learning for Real-World Image Super-Resolution with Vision-Language Chain-of-Thought	Junbo Qiao et.al.	2506.16796	link
2025-06-19	MoiréXNet: Adaptive Multi-Scale Demoiréing with Linear Attention Test-Time Training and Truncated Flow Matching Prior	Liangyan Li et.al.	2506.15929	null
2025-06-16	ADAM-Dehaze: Adaptive Density-Aware Multi-Stage Dehazing for Improved Object Detection in Foggy Conditions	Fatmah AlHindaassi et.al.	2506.15837	null
2025-06-17	Optimization-Based Image Restoration under Implementation Constraints in Optical Analog Circuits	Taisei Kato et.al.	2506.14624	null
2025-06-17	Unsupervised Imaging Inverse Problems with Diffusion Distribution Matching	Giacomo Meanti et.al.	2506.14605	link
2025-06-22	Exploring Diffusion with Test-Time Training on Efficient Image Restoration	Rongchang Lu et.al.	2506.14541	null
2025-06-16	Exploiting the Exact Denoising Posterior Score in Training-Free Guidance of Diffusion Models	Gregory Bellchambers et.al.	2506.13614	null
2025-06-15	Adaptive Dropout: Unleashing Dropout across Layers for Generalizable Image Super-Resolution	Hang Xu et.al.	2506.12738	null
2025-06-14	UniDet-D: A Unified Dynamic Spectral Attention Model for Object Detection under Adverse Weathers	Yuantao Wang et.al.	2506.12324	null
2025-06-10	Adaptive Object Detection with ESRGAN-Enhanced Resolution & Faster R-CNN	Divya Swetha K et.al.	2506.11122	null
2025-06-11	Text-Aware Image Restoration with Diffusion Models	Jaewon Min et.al.	2506.09993	null
2025-06-09	M2Restore: Mixture-of-Experts-based Mamba-CNN Fusion Framework for All-in-One Image Restoration	Yongzhen Wang et.al.	2506.07814	null
2025-06-08	Multi-Step Guided Diffusion for Image Restoration on Edge Devices: Toward Lightweight Perception in Embodied AI	Aditya Chakravarty et.al.	2506.07286	null
2025-06-08	A PDE-Based Image Restoration Method: Mathematical Analysis and Implementation	Dragos-Patru Covei et.al.	2506.07132	null
2025-06-06	NTIRE 2025 Challenge on HR Depth from Images of Specular and Transparent Surfaces	Pierluigi Zama Ramirez et.al.	2506.05815	null
2025-06-05	UniRes: Universal Image Restoration for Complex Degradations	Mo Zhou et.al.	2506.05599	null
2025-06-05	SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training	Jianyi Wang et.al.	2506.05301	null
2025-06-03	NTIRE 2025 XGC Quality Assessment Challenge: Methods and Results	Xiaohong Liu et.al.	2506.02875	null
2025-06-03	ControlMambaIR: Conditional Controls with State-Space Model for Image Restoration	Cheng Yang et.al.	2506.02633	null
2025-06-04	NTIRE 2025 Challenge on RAW Image Restoration and Super-Resolution	Marcos V. Conde et.al.	2506.02197	null
2025-06-02	RAW Image Reconstruction from RGB on Smartphones. NTIRE 2025 Challenge Report	Marcos V. Conde et.al.	2506.01947	null
2025-06-02	NTIRE 2025 the 2nd Restore Any Image Model (RAIM) in the Wild Challenge	Jie Liang et.al.	2506.01394	null
2025-05-31	Image Restoration Learning via Noisy Supervision in the Fourier Domain	Haosen Liu et.al.	2506.00564	null
2025-05-30	IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models	Hanting Wang et.al.	2505.24406	link
2025-05-30	Boosting All-in-One Image Restoration via Self-Improved Privilege Learning	Gang Wu et.al.	2505.24207	link
2025-05-29	Proximal Algorithm Unrolling: Flexible and Efficient Reconstruction Networks for Single-Pixel Imaging	Ping Wang et.al.	2505.23180	link
2025-05-29	URWKV: Unified RWKV Model with Multi-state Perspective for Low-light Image Restoration	Rui Xu et.al.	2505.23068	link
2025-05-29	EquiReg: Equivariance Regularized Diffusion for Inverse Problems	Bahareh Tolooshams et.al.	2505.22973	null
2025-05-28	From Controlled Scenarios to Real-World: Cross-Domain Degradation Pattern Matching for All-in-One Image Restoration	Junyu Fan et.al.	2505.22284	null
2025-05-28	Reference-Guided Identity Preserving Face Restoration	Mo Zhou et.al.	2505.21905	null
2025-05-27	BaryIR: Learning Multi-Source Unified Representation in Continuous Barycenter Space for Generalizable All-in-One Image Restoration	Xiaole Tang et.al.	2505.21637	null
2025-05-23	UniDB++: Fast Sampling of Unified Diffusion Bridge	Mokai Pan et.al.	2505.21528	null
2025-05-28	PreP-OCR: A Complete Pipeline for Document Image Restoration and Enhanced OCR Accuracy	Shuhao Guan et.al.	2505.20429	null
2025-05-26	A Regularization-Guided Equivariant Approach for Image Restoration	Yulu Bai et.al.	2505.19799	link
2025-05-25	Benchmarking Laparoscopic Surgical Image Restoration and Beyond	Jialun Pei et.al.	2505.19161	link
2025-05-25	Freqformer: Image-Demoiréing Transformer via Efficient Frequency Decomposition	Xiaoyang Liu et.al.	2505.19120	link
2025-05-24	Manifold-aware Representation Learning for Degradation-agnostic Image Restoration	Bin Ren et.al.	2505.18679	null
2025-05-23	RestoreVAR: Visual Autoregressive Generation for All-in-One Image Restoration	Sudarshan Rajagopalan et.al.	2505.18047	null
2025-05-23	MODEM: A Morton-Order Degradation Estimation Mechanism for Adverse Weather Image Recovery	Hainuo Wang et.al.	2505.17581	link
2025-05-23	Dual Ascent Diffusion for Inverse Problems	Minseo Kim et.al.	2505.17353	null
2025-05-22	Forward-only Diffusion Probabilistic Models	Ziwei Luo et.al.	2505.16733	link
2025-05-22	Clear Nights Ahead: Towards Multi-Weather Nighttime Image Restoration	Yuetong Liu et.al.	2505.16479	null
2025-05-22	NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment	Shuhao Han et.al.	2505.16314	null
2025-05-22	Deep Learning-Driven Ultra-High-Definition Image Restoration: A Survey	Liyan Wang et.al.	2505.16161	link
2025-05-22	Breaking Complexity Barriers: High-Resolution Image Restoration with Rank Enhanced Linear Attention	Yuang Ai et.al.	2505.16157	null
2025-05-22	Continuous Representation Methods, Theories, and Applications: An Overview and Perspectives	Yisi Luo et.al.	2505.15222	link
2025-05-20	UHD Image Dehazing via anDehazeFormer with Atmospheric-aware KV Cache	Pu Wang et.al.	2505.14010	null
2025-05-19	Adaptive Image Restoration for Video Surveillance: A Real-Time Approach	Muhammad Awais Amin et.al.	2505.13130	null
2025-05-19	LatentINDIGO: An INN-Guided Latent Diffusion Algorithm for Image Restoration	Di You et.al.	2505.12935	null
2025-05-19	Towards a Universal Image Degradation Model via Content-Degradation Disentanglement	Wenbo Yang et.al.	2505.12860	null
2025-05-19	Degradation-Aware Feature Perturbation for All-in-One Image Restoration	Xiangpeng Tian et.al.	2505.12630	link
2025-05-18	Trustworthy Image Super-Resolution via Generative Pseudoinverse	Andreas Floros et.al.	2505.12375	link
2025-05-20	Diff-Unfolding: A Model-Based Score Learning Framework for Inverse Problems	Yuanhao Wang et.al.	2505.11393	null
2025-05-15	torchmfbd: a flexible multi-object multi-frame blind deconvolution code	A. Asensio Ramos et.al.	2505.10639	link
2025-05-13	Behind the Noise: Conformal Quantile Regression Reveals Emergent Representations	Petrus H. Zwart et.al.	2505.08176	null
2025-05-12	Image Restoration via Integration of Optimal Control Techniques and the Hamilton-Jacobi-Bellman Equation	Dragos-Patru Covei et.al.	2505.07699	null
2025-05-12	Generalizable Pancreas Segmentation via a Dual Self-Supervised Learning Framework	Jun Li et.al.	2505.07165	null
2025-05-10	UnfoldIR: Rethinking Deep Unfolding Network in Illumination Degradation Image Restoration	Chunming He et.al.	2505.06683	null
2025-05-17	A Preliminary Study for GPT-4o on Image Restoration	Hao Yang et.al.	2505.05621	link
2025-05-07	Image Restoration via Multi-domain Learning	Xingyu Jiang et.al.	2505.05504	link
2025-05-08	SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation	Yonwoo Choi et.al.	2505.05475	link
2025-05-08	EAM: Enhancing Anything with Diffusion Transformers for Blind Super-Resolution	Haizhen Xie et.al.	2505.05209	null
2025-05-03	Multi-Scale Target-Aware Representation Learning for Fundus Image Enhancement	Haofan Wu et.al.	2505.01831	null
2025-05-02	Deblurring fission fragment mass distributions	Pierre Nzabahimana et.al.	2505.01294	null
2025-05-01	GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution	Aditya Arora et.al.	2505.00687	null
2025-05-08	DGSolver: Diffusion Generalist Solver with Universal Posterior Sampling for Image Restoration	Hebaixu Wang et.al.	2504.21487	link
2025-04-27	Marine Snow Removal Using Internally Generated Pseudo Ground Truth	Alexandra Malyugina et.al.	2504.19289	null
2025-04-27	Rendering Anywhere You See: Renderability Field-guided Gaussian Splatting	Xiaofeng Jin et.al.	2504.19261	null
2025-04-24	Dual Prompting Image Restoration with Diffusion Transformers	Dehong Kong et.al.	2504.17825	null
2025-04-24	DPMambaIR: All-in-One Image Restoration via Degradation-Aware Prompt State Space Model	Zhanwen Liu et.al.	2504.17732	null
2025-04-24	Inverse-Designed Metasurfaces for Wavefront Restoration in Under-Display Camera Systems	Jaegang Jo et.al.	2504.17368	null
2025-04-24	I-INR: Iterative Implicit Neural Representations	Ali Haider et.al.	2504.17364	null
2025-04-23	RouteWinFormer: A Route-Window Transformer for Middle-range Attention in Image Restoration	Qifan Li et.al.	2504.16637	null
2025-04-23	Cross Paradigm Representation and Alignment Transformer for Image Deraining	Shun Zou et.al.	2504.16455	null
2025-04-21	Acquire and then Adapt: Squeezing out Text-to-Image Model for Image Restoration	Junyuan Deng et.al.	2504.15159	null
2025-04-21	Distribution-aware Dataset Distillation for Efficient Image Restoration	Zhuoran Zheng et.al.	2504.14826	null
2025-04-19	Any Image Restoration via Efficient Spatial-Frequency Degradation Adaptation	Bin Ren et.al.	2504.14249	null
2025-04-21	Circular Image Deturbulence using Quasi-conformal Geometry	Chu Chen et.al.	2504.13432	null
2025-04-17	Saliency-Aware Diffusion Reconstruction for Effective Invisible Watermark Removal	Inzamamul Alam et.al.	2504.12809	link
2025-04-17	AdaQual-Diff: Diffusion-Based Image Restoration via Adaptive Quality Prompting	Xin Su et.al.	2504.12605	null
2025-04-16	Deep Generative Models for Bayesian Inference on High-Rate Sensor Data: Applications in Automotive Radar and Medical Imaging	Tristan S. W. Stevens et.al.	2504.12154	null
2025-04-16	HyperKING: Quantum-Classical Generative Adversarial Networks for Hyperspectral Image Restoration	Chia-Hsiang Lin et.al.	2504.11782	null
2025-04-15	Efficient Medical Image Restoration via Reliability Guided Learning in Frequency Domain	Pengcheng Zheng et.al.	2504.11286	null
2025-04-20	An Efficient and Mixed Heterogeneous Model for Image Restoration	Yubin Gu et.al.	2504.10967	link
2025-04-14	Enhancing Image Restoration through Learning Context-Rich and Detail-Accurate Features	Hu Gao et.al.	2504.10558	link
2025-04-14	PG-DPIR: An efficient plug-and-play method for high-count Poisson-Gaussian inverse problems	Maud Biquard et.al.	2504.10375	null
2025-04-14	VibrantLeaves: A principled parametric image generator for training deep restoration models	Raphael Achddou et.al.	2504.10201	link
2025-04-14	Progressive Transfer Learning for Multi-Pass Fundus Image Restoration	Uyen Phan et.al.	2504.10025	null
2025-04-14	Beyond Degradation Redundancy: Contrastive Prompt Learning for All-in-One Image Restoration	Gang Wu et.al.	2504.09973	link
2025-04-13	Computationally iterative methods for salt-and-pepper denoising	Jianwei Ke et.al.	2504.09408	null
2025-04-12	Beyond Degradation Conditions: All-in-One Image Restoration via HOG Transformers	Jiawei Wu et.al.	2504.09377	link
2025-04-11	ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration	Yongsheng Yu et.al.	2504.08591	null
2025-04-11	VL-UR: Vision-Language-guided Universal Restoration of Images Degraded by Adverse Weather Conditions	Ziyan Liu et.al.	2504.08219	null
2025-04-09	Q-Agent: Quality-Driven Chain-of-Thought Image Restoration Agent through Robust Multimodal Large Language Model	Yingjie Zhou et.al.	2504.07148	null
2025-04-09	Rethinking LayerNorm in Image Restoration Transformers	MinKyu Lee et.al.	2504.06629	null
2025-04-08	AstroClearNet: Deep image prior for multi-frame astronomical image restoration	Yashil Sukurdeep et.al.	2504.06463	null
2025-04-07	DA2Diff: Exploring Degradation-aware Adaptive Diffusion Priors for All-in-One Weather Restoration	Jiamei Xiong et.al.	2504.05135	null
2025-04-08	Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision	Yuandong Pu et.al.	2504.04903	null
2025-04-07	Content-Aware Transformer for All-in-one Image Restoration	Gang Wu et.al.	2504.04869	link
2025-04-05	JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration	Yunlong Lin et.al.	2504.04158	null
2025-04-04	Multimodal Diffusion Bridge with Attention-Based SAR Fusion for Satellite Image Cloud Removal	Yuyang Hu et.al.	2504.03607	null
2025-04-04	Finding the Reflection Point: Unpadding Images to Remove Data Augmentation Artifacts in Large Open Source Image Datasets for Machine Learning	Lucas Choi et.al.	2504.03168	null
2025-04-03	RoSMM: A Robust and Secure Multi-Modal Watermarking Framework for Diffusion Models	ZhongLi Fang et.al.	2504.02640	null
2025-04-02	Bridge the Gap between SNN and ANN for Image Restoration	Xin Su et.al.	2504.01755	null
2025-04-01	Deconver: A Deconvolutional Network for Medical Image Segmentation	Pooya Ashtari et.al.	2504.00302	link
2025-03-31	InstructRestore: Region-Customized Image Restoration with Human Instructions	Shuaizheng Liu et.al.	2503.24357	link
2025-03-29	indiSplit: Bringing Severity Cognizance to Image Decomposition in Fluorescence Microscopy	Ashesh Ashesh et.al.	2503.22983	null
2025-03-28	RELD: Regularization by Latent Diffusion Models for Image Restoration	Pasquale Cascarano et.al.	2503.22563	null
2025-04-02	Q-MambaIR: Accurate Quantized Mamba for Efficient Image Restoration	Yujie Chen et.al.	2503.21970	null
2025-03-27	Invert2Restore: Zero-Shot Degradation-Blind Image Restoration	Hamadi Chihaoui et.al.	2503.21486	null
2025-03-27	Diffusion Image Prior	Hamadi Chihaoui et.al.	2503.21410	null
2025-03-26	Devil is in the Uniformity: Exploring Diverse Learners within Transformer for Image Restoration	Shihao Zhou et.al.	2503.20174	null
2025-03-23	Cat-AIR: Content and Task-Aware All-in-One Image Restoration	Jiachen Jiang et.al.	2503.17915	null
2025-03-22	Fractal-IR: A Unified Framework for Efficient and Scalable Image Restoration	Yawei Li et.al.	2503.17825	null
2025-03-21	Vision-Language Gradient Descent-driven All-in-One Deep Unfolding Networks	Haijin Zeng et.al.	2503.16930	null
2025-03-20	Efficient Bayesian Computation Using Plug-and-Play Priors for Poisson Inverse Problems	Teresa Klatzer et.al.	2503.16222	null
2025-03-20	DIPLI: Deep Image Prior Lucky Imaging for Blind Astronomical Image Restoration	Suraj Singh et.al.	2503.15984	null
2025-03-21	UniCoRN: Latent Diffusion-based Unified Controllable Image Restoration Network across Multiple Degradations	Debabrata Mandal et.al.	2503.15868	null
2025-03-19	Image Restoration Models with Optimal Transport and Total Variation Regularization	Weijia Huang et.al.	2503.14947	null
2025-03-18	SIR-DIFF: Sparse Image Sets Restoration with Multi-View Diffusion Model	Yucheng Mao et.al.	2503.14463	null
2025-03-18	Towards properties of adversarial image perturbations	Egor Kuznetsov et.al.	2503.14111	null
2025-03-18	Intra and Inter Parser-Prompted Transformers for Effective Image Restoration	Cong Wang et.al.	2503.14037	link
2025-03-17	From Zero to Detail: Deconstructing Ultra-High-Definition Image Restoration from Progressive Spectral Perspective	Chen Zhao et.al.	2503.13165	null
2025-03-17	Decouple to Reconstruct: High Quality UHD Restoration via Active Feature Disentanglement and Reversible Fusion	Yidi Liu et.al.	2503.12764	null
2025-03-16	Pathology Image Restoration via Mixture of Prompts	Jiangdong Cai et.al.	2503.12399	link
2025-03-14	InverseBench: Benchmarking Plug-and-Play Diffusion Priors for Inverse Problems in Physical Sciences	Hongkai Zheng et.al.	2503.11043	null
2025-03-13	Hybrid Agents for Image Restoration	Bingchen Li et.al.	2503.10120	null
2025-03-13	Dream-IF: Dynamic Relative EnhAnceMent for Image Fusion	Xingxin Xu et.al.	2503.10109	null
2025-03-17	Multi-Agent Image Restoration	Xu Jiang et.al.	2503.09403	null
2025-03-12	MP-HSIR: A Multi-Prompt Framework for Universal Hyperspectral Image Restoration	Zhehui Wu et.al.	2503.09131	link
2025-03-12	Prompt to Restore, Restore to Prompt: Cyclic Prompting for Universal Adverse Weather Removal	Rongxin Liao et.al.	2503.09013	link
2025-03-11	QUIET-SR: Quantum Image Enhancement Transformer for Single Image Super-Resolution	Siddhant Dutta et.al.	2503.08759	null
2025-03-11	Boosting Diffusion-Based Text Image Super-Resolution Model Towards Generalized Real-World Scenarios	Chenglu Pan et.al.	2503.07232	null
2025-03-03	Hyperspectral Image Restoration and Super-resolution with Physics-Aware Deep Learning for Biomedical Applications	Yuchen Xiang et.al.	2503.02908	null
2025-03-04	ERetinex: Event Camera Meets Retinex Theory for Low-Light Image Enhancement	Xuejian Guo et.al.	2503.02484	link
2025-03-18	Semantic Prior Distillation with Vision Foundation Model for Enhanced Rapid Bone Scintigraphy Image Restoration	Pengchen Liang et.al.	2503.02321	null
2025-03-03	MRI super-resolution reconstruction using efficient diffusion probabilistic model with residual shifting	Mojtaba Safari et.al.	2503.01576	link
2025-03-03	Wavelet-Enhanced Desnowing: A Novel Single Image Restoration Approach for Traffic Surveillance under Adverse Weather Conditions	Zihan Shen et.al.	2503.01339	null
2025-03-03	Reconciling Stochastic and Deterministic Strategies for Zero-shot Image Restoration using Diffusion Model in Dual	Chong Wang et.al.	2503.01288	link
2025-02-28	Diffusion Restoration Adapter for Real-World Image Restoration	Hanbang Liang et.al.	2502.20679	null
2025-02-26	Self-supervised conformal prediction for uncertainty quantification in Poisson imaging problems	Bernardin Tamo Amougou et.al.	2502.19194	null
2025-02-26	Multi-level Attention-guided Graph Neural Network for Image Restoration	Jiatao Jiang et.al.	2502.19181	null
2025-02-27	RetinaRegen: A Hybrid Model for Readability and Detail Restoration in Fundus Images	Yuhan Tang et.al.	2502.19153	null
2025-03-08	Dynamic Degradation Decomposition Network for All-in-One Image Restoration	Huiqiang Wang et.al.	2502.19068	null
2025-02-24	Splitting Regularized Wasserstein Proximal Algorithms for Nonsmooth Sampling Problems	Fuqun Han et.al.	2502.16773	link
2025-02-19	RestoreGrad: Signal Restoration Using Conditional Denoising Diffusion Models with Jointly Learned Prior	Ching-Hua Lee et.al.	2502.13574	null
2025-02-19	Compression-Aware One-Step Diffusion Model for JPEG Artifact Removal	Jinpei Guo et.al.	2502.09873	link
2025-02-13	Source function from two-particle correlation function through entropy-regularized Richardson-Lucy deblurring	C. K. Tam et.al.	2502.09478	null
2025-02-19	MRS: A Fast Sampler for Mean Reverting Diffusion based on ODE and SDE Solvers	Ao Li et.al.	2502.07856	null
2025-02-10	UniDemoiré: Towards Universal Image Demoiréing with Data Generation and Synthesis	Zemin Yang et.al.	2502.06324	null
2025-02-21	UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal Control	Kaizhen Zhu et.al.	2502.05749	link
2025-02-07	Self-supervised Conformal Prediction for Uncertainty Quantification in Imaging Problems	Jasper M. Everink et.al.	2502.05127	null
2025-02-05	All-in-One Image Compression and Restoration	Huimin Zeng et.al.	2502.03649	link
2025-02-05	Efficient Image Restoration via Latent Consistency Flow Matching	Elad Cohen et.al.	2502.03500	null
2025-02-04	Blind Visible Watermark Removal with Morphological Dilation	Preston K. Robinette et.al.	2502.02676	null
2025-02-03	Human Body Restoration with One-Step Diffusion Model and A New Benchmark	Jue Gong et.al.	2502.01411	null
2025-02-10	Compressed Image Generation with Denoising Diffusion Codebook Models	Guy Ohayon et.al.	2502.01189	null
2025-02-01	Shape from Semantics: 3D Shape Generation from Multi-View Semantics	Liangchen Li et.al.	2502.00360	null
2025-01-30	Integrating Spatial and Frequency Information for Under-Display Camera Image Restoration	Kyusu Ahn et.al.	2501.18517	null
2025-01-31	MatIR: A Hybrid Mamba-Transformer Image Restoration Model	Juan Wen et.al.	2501.18401	link
2025-01-27	Directing Mamba to Complex Textures: An Efficient Texture-Aware State Space Model for Image Restoration	Long Peng et.al.	2501.16583	null
2025-01-27	CausalSR: Structural Causal Model-Driven Super-Resolution with Counterfactual Inference	Zhengyang Lu et.al.	2501.15852	link
2025-01-26	Universal Image Restoration Pre-training via Degradation Classification	JiaKui Hu et.al.	2501.15510	link
2025-01-24	CDI: Blind Image Restoration Fidelity Evaluation based on Consistency with Degraded Image	Xiaojun Tang et.al.	2501.14264	null
2025-01-23	INDIGO+: A Unified INN-Guided Probabilistic Diffusion Algorithm for Blind and Non-Blind Image Restoration	Di You et.al.	2501.14014	null
2025-01-23	Binary Diffusion Probabilistic Model	Vitaliy Kinakh et.al.	2501.13915	null
2025-01-22	UniRestore: Unified Perceptual and Task-Oriented Image Restoration Model Using Diffusion Prior	I-Hsiang Chen et.al.	2501.13134	null
2025-01-22	Deep Learning-Based Image Recovery and Pose Estimation for Resident Space Objects	Louis Aberdeen et.al.	2501.13009	null
2025-01-22	UniUIR: Considering Underwater Image Restoration as An All-in-One Learner	Xu Zhang et.al.	2501.12981	null
2025-01-22	FDG-Diff: Frequency-Domain-Guided Diffusion Framework for Compressed Hazy Image Restoration	Ruicheng Zhang et.al.	2501.12832	link
2025-01-21	Proxies for Distortion and Consistency with Applications for Real-World Image Restoration	Sean Man et.al.	2501.12102	null
2025-01-20	SILO: Solving Inverse Problems with Latent Operators	Ron Raphaeli et.al.	2501.11746	null
2025-01-17	DiffStereo: High-Frequency Aware Diffusion Model for Stereo Image Restoration	Huiyun Cao et.al.	2501.10325	null
2025-01-16	Soft Knowledge Distillation with Multi-Dimensional Cross-Net Attention for Image Restoration Models Compression	Yongheng Zhang et.al.	2501.09321	null
2025-01-16	Knowledge Distillation for Image Restoration: Simultaneous Learning from Degraded and Clean Images	Yongheng Zhang et.al.	2501.09268	null
2025-01-08	Color Correction Meets Cross-Spectral Refinement: A Distribution-Aware Diffusion for Underwater Image Restoration	Laibin Chang et.al.	2501.04740	null
2025-01-08	MB-TaylorFormer V2: Improved Multi-branch Linear Transformer Expanded by Taylor Formula for Image Restoration	Zhi Jin et.al.	2501.04486	link
2025-01-07	Fixed Points of Deep Neural Networks: Emergence, Stability, and Applications	L. Berlyand et.al.	2501.04182	null
2025-01-07	Convergent Primal-Dual Plug-and-Play Image Restoration: A General Algorithm and Applications	Yodai Suzuki et.al.	2501.03780	link
2025-01-06	ImageMM: Joint multi-frame image restoration and super-resolution	Yashil Sukurdeep et.al.	2501.03002	null
2025-01-06	Underwater Image Restoration Through a Prior Guided Hybrid Sense Approach and Extensive Benchmark Analysis	Xiaojiao Guo et.al.	2501.02701	link
2024-12-30	Varformer: Adapting VAR’s Generative Prior for Image Restoration	Siyang Wang et.al.	2412.21063	link
2024-12-29	Zero-Shot Image Restoration Using Few-Step Guidance of Consistency Models (and Beyond)	Tomer Garber et.al.	2412.20596	link
2024-12-28	UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper Granularity	Jingbo Lin et.al.	2412.20157	link
2024-12-28	MaIR: A Locality- and Continuity-Preserving Mamba for Image Restoration	Boyun Li et.al.	2412.20066	link
2024-12-28	An Ordinary Differential Equation Sampler with Stochastic Start for Diffusion Bridge Models	Yuang Wang et.al.	2412.19992	null
2024-12-27	Generative Adversarial Network on Motion-Blur Image Restoration	Zhengdong Li et.al.	2412.19479	null
2024-12-24	Underwater Image Restoration via Polymorphic Large Kernel CNNs	Xiaojiao Guo et.al.	2412.18459	link
2024-12-24	UNet--: Memory-Efficient and Feature-Enhanced Network Architecture based on U-Net with Reduced Skip-Connections	Lingxiao Yin et.al.	2412.18276	null
2024-12-21	Optoelectronic generative adversarial networks	Jumin Qiu et.al.	2412.16672	link
2025-01-11	NeuroPump: Simultaneous Geometric and Color Rectification for Underwater Images	Yue Guo et.al.	2412.15890	null
2024-12-20	Multi-dimensional Visual Prompt Enhanced Image Restoration via Mamba-Transformer Aggregation	Aiwen Jiang et.al.	2412.15845	link
2024-12-19	Unified Image Restoration and Enhancement: Degradation Calibrated Cycle Reconstruction Diffusion Model	Minglong Xue et.al.	2412.14630	link
2024-12-18	Personalized Generative Low-light Image Denoising and Enhancement	Xijun Wang et.al.	2412.14327	null
2024-12-18	Distilled Pooling Transformer Encoder for Efficient Realistic Image Dehazing	Le-Anh Tran et.al.	2412.14220	link
2024-12-18	DarkIR: Robust Low-Light Image Restoration	Daniel Feijoo et.al.	2412.13443	link
2024-12-17	Consistent Diffusion: Denoising Diffusion Model with Data-Consistent Training for Image Restoration	Xinlong Cheng et.al.	2412.12550	null
2024-12-15	Towards Context-aware Convolutional Network for Image Restoration	Fangwei Hao et.al.	2412.11008	null
2024-12-14	Boosting ViT-based MRI Reconstruction from the Perspectives of Frequency Modulation, Spatial Purification, and Scale Diversification	Yucong Meng et.al.	2412.10776	null
2024-12-16	Matrix Completion via Residual Spectral Matching	Ziyuan Chen et.al.	2412.10005	null
2024-12-12	OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs	Yuanzhi Zhu et.al.	2412.09465	link
2024-12-13	Are Conditional Latent Diffusion Models Effective for Image Restoration?	Yunchen Yuan et.al.	2412.09324	null
2024-12-12	ExpRDiff: Short-exposure Guided Diffusion Model for Realistic Local Motion Deblurring	Zhongbao Yang et.al.	2412.09193	null
2024-12-17	Dynamic Contrastive Knowledge Distillation for Efficient Image Restoration	Yunshuai Zhou et.al.	2412.08939	link
2024-12-11	Convergence Analysis of a Proximal Stochastic Denoising Regularization Algorithm	Marien Renaud et.al.	2412.08262	null
2024-12-10	Modeling Dual-Exposure Quad-Bayer Patterns for Joint Denoising and Deblurring	Yuzhi Zhao et.al.	2412.07256	link
2024-12-10	EchoIR: Advancing Image Restoration with Echo Upsampling and Bi-Level Optimization	Yuhan He et.al.	2412.07225	null
2024-12-10	A Progressive Image Restoration Network for High-order Degradation Imaging in Remote Sensing	Yujie Feng et.al.	2412.07195	null
2024-12-09	InstantRestore: Single-Step Personalized Face Restoration with Shared-Image Attention	Howard Zhang et.al.	2412.06753	null
2024-12-07	Enhancing Sample Generation of Diffusion Models using Noise Level Correction	Abulikemu Abuduweili et.al.	2412.05488	null
2024-12-06	Equivariant Denoisers for Image Restoration	Marien Renaud et.al.	2412.05343	null
2024-12-06	ReF-LDM: A Latent Diffusion Model for Reference-based Face Image Restoration	Chi-Wei Hsiao et.al.	2412.05043	null
2024-12-05	Generalized Recorrupted-to-Recorrupted: Self-Supervised Learning Beyond Gaussian Noise	Brayan Monroy et.al.	2412.04648	link
2024-12-05	MetaFormer: High-fidelity Metalens Imaging via Aberration Correcting Transformers	Byeonghyeon Lee et.al.	2412.04591	null
2024-12-05	Deep priors for satellite image restoration with accurate uncertainties	Biquard Maud et.al.	2412.04130	null
2024-12-05	Blind Underwater Image Restoration using Co-Operational Regressor Networks	Ozer Can Devecioglu et.al.	2412.03995	null
2024-12-05	LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model	Yuan Xue et.al.	2412.03841	null
2024-12-11	Exploring Real&Synthetic Dataset and Linear Attention in Image Restoration	Yuzhen Du et.al.	2412.03814	null
2024-12-04	Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution	Jiahua Xiao et.al.	2412.02960	null
2024-12-03	Relaxed and Inertial Nonlinear Forward-Backward with Momentum	Fernando Roldán et.al.	2412.02045	link
2024-12-02	Phaseformer: Phase-based Attention Mechanism for Underwater Image Restoration and Beyond	MD Raqib Khan et.al.	2412.01456	link
2024-12-02	FoundIR: Unleashing Million-scale Training Data to Advance Foundation Models for Image Restoration	Hao Li et.al.	2412.01427	null
2024-12-06	Beyond Pixels: Text Enhances Generalization in Real-World Image Restoration	Haoze Sun et.al.	2412.00878	null
2024-11-30	Blind Inverse Problem Solving Made Easy by Text-to-Image Latent Diffusion	Michail Dontas et.al.	2412.00557	null
2024-11-27	Hierarchical Information Flow for Generalized Efficient Image Restoration	Yawei Li et.al.	2411.18588	null
2024-11-27	Complexity Experts are Task-Discriminative Learners for Any Image Restoration	Eduard Zamfir et.al.	2411.18466	null
2024-11-27	Adaptive Blind All-in-One Image Restoration	David Serrano-Lozano et.al.	2411.18412	link
2024-11-27	TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution	Linwei Dong et.al.	2411.18263	link
2024-11-26	Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation	Sudarshan Rajagopalan et.al.	2411.17814	null
2024-11-26	GenDeg: Diffusion-Based Degradation Synthesis for Generalizable All-in-One Image Restoration	Sudarshan Rajagopalan et.al.	2411.17687	null
2024-11-26	Puzzle Similarity: A Perceptually-guided No-Reference Metric for Artifact Detection in 3D Scene Reconstructions	Nicolai Hermann et.al.	2411.17489	null
2024-11-26	MWFormer: Multi-Weather Image Restoration Using Degradation-Aware Transformers	Ruoxi Zhu et.al.	2411.17226	link
2024-11-23	Gradient-Guided Parameter Mask for Multi-Scenario Image Restoration Under Adverse Weather	Jilong Guo et.al.	2411.16739	link
2024-11-25	Mixed Degradation Image Restoration via Local Dynamic Optimization and Conditional Embedding	Yubin Gu et.al.	2411.16217	null
2024-11-25	U2NeRF: Unsupervised Underwater Image Restoration and Neural Radiance Fields	Vinayak Gupta et.al.	2411.16172	null
2024-11-29	PromptHSI: Universal Hyperspectral Image Restoration Framework for Composite Degradation	Chia-Ming Lee et.al.	2411.15922	link
2024-11-24	LTCF-Net: A Transformer-Enhanced Dual-Channel Fourier Framework for Low-Light Image Restoration	Gaojing Zhang et.al.	2411.15740	null
2024-11-22	Frequency-Guided Posterior Sampling for Diffusion-Based Image Restoration	Darshan Thaker et.al.	2411.15295	null
2024-11-22	MambaIRv2: Attentive State Space Restoration	Hang Guo et.al.	2411.15269	link
2024-11-20	Analysis and Synthesis Denoisers for Forward-Backward Plug-and-Play Algorithms	Matthieu Kowalski et.al.	2411.13276	null
2024-11-19	Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models	Jun Xiao et.al.	2411.12450	null
2024-11-19	Versatile Cataract Fundus Image Restoration Model Utilizing Unpaired Cataract and High-quality Images	Zheng Gong et.al.	2411.12278	null
2024-11-19	TSFormer: A Robust Framework for Efficient UHD Image Restoration	Xin Su et.al.	2411.10951	null
2024-11-16	AllRestorer: All-in-One Transformer for Image Restoration under Composite Degradations	Jiawei Mao et.al.	2411.10708	null
2024-11-15	Probabilistic Prior Driven Attention Mechanism Based on Diffusion Model for Imaging Through Atmospheric Turbulence	Guodong Sun et.al.	2411.10321	null
2024-11-12	Joint multi-dimensional dynamic attention and transformer for general image restoration	Huan Zhang et.al.	2411.07893	link
2024-11-12	All-in-one Weather-degraded Image Restoration via Adaptive Degradation-aware Self-prompting Model	Yuanbo Wen et.al.	2411.07445	null
2024-11-11	Multi-scale Frequency Enhancement Network for Blind Image Deblurring	Yawen Xiang et.al.	2411.06893	null
2024-11-10	Dropout the High-rate Downsampling: A Novel Design Paradigm for UHD Image Restoration	Chen Wu et.al.	2411.06456	null
2024-11-08	A Modular Conditional Diffusion Framework for Image Reconstruction	Magauiya Zhussip et.al.	2411.05993	null
2024-11-03	Degradation-Aware Residual-Conditioned Optimal Transport for Unified Image Restoration	Xiaole Tang et.al.	2411.01656	link
2024-10-31	Aquatic-GS: A Hybrid 3D Representation for Underwater Scenes	Shaohua Liu et.al.	2411.00239	null
2024-10-31	Chasing Better Deep Image Priors between Over- and Under-parameterization	Qiming Wu et.al.	2410.24187	link
2024-10-31	Cycle-Constrained Adversarial Denoising Convolutional Network for PET Image Denoising: Multi-Dimensional Validation on Large Datasets with Reader Study and Real Low-Dose Data	Yucun Hou et.al.	2410.23628	null
2024-10-31	MS-Glance: Non-semantic context vectors and the applications in supervising image reconstruction	Ziqi Gao et.al.	2410.23577	link
2024-10-30	EnsIR: An Ensemble Algorithm for Image Restoration via Gaussian Mixture Models	Shangquan Sun et.al.	2410.22959	link
2024-10-29	DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation	Yuang Ai et.al.	2410.18666	link
2024-10-23	DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection	Qingpeng Li et.al.	2410.17822	link
2024-10-23	An Intelligent Agentic System for Complex Image Restoration Problems	Kaiwen Zhu et.al.	2410.17809	link
2024-10-23	A variational approach to nonlocal image restoration flows	Harsh Prasad et.al.	2410.17649	null
2024-10-23	Diffusion Priors for Variational Likelihood Estimation and Image Denoising	Jun Cheng et.al.	2410.17521	link
2024-11-16	LoRA-IR: Taming Low-Rank Experts for Efficient All-in-One Image Restoration	Yuang Ai et.al.	2410.15385	link
2024-10-19	A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends	Junjun Jiang et.al.	2410.15067	link
2024-10-16	Fusion from Decomposition: A Self-Supervised Approach for Image Fusion and Beyond	Pengwei Liang et.al.	2410.12274	null
2024-10-15	Analysis and Benchmarking of Extending Blind Face Image Restoration to Videos	Zhouxia Wang et.al.	2410.11828	null
2024-10-11	Chain-of-Restoration: Multi-Task Image Restoration Models are Zero-Shot Step-by-Step Universal Image Restorers	Jin Cao et.al.	2410.08688	link
2024-10-10	TANet: Triplet Attention Network for All-In-One Adverse Weather Image Restoration	Hsing-Hua Wang et.al.	2410.08177	link
2024-10-09	InstantIR: Blind Image Restoration with Instant Generative Reference	Jen-Yuan Huang et.al.	2410.06551	null
2024-10-08	ReFIR: Grounding Large Restoration Models with Retrieval Augmentation	Hang Guo et.al.	2410.05601	link
2024-10-07	Learning Efficient and Effective Trajectories for Differential Equation-based Image Restoration	Zhiyu Zhu et.al.	2410.04811	link
2024-10-06	SITCOM: Step-wise Triple-Consistent Diffusion Sampling for Inverse Problems	Ismail Alkhouri et.al.	2410.04479	link
2024-10-05	Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model	Keda Tao et.al.	2410.04161	null
2024-10-04	Diffusion State-Guided Projected Gradient for Inverse Problems	Rayhan Zirvi et.al.	2410.03463	link
2024-10-03	PnP-Flow: Plug-and-Play Image Restoration with Flow Matching	Ségolène Martin et.al.	2410.02423	link
2024-10-02	Posterior sampling via Langevin dynamics based on generative priors	Vishal Purohit et.al.	2410.02078	null
2024-10-01	Three-Operator Splitting Method with Two-Step Inertial Extrapolation	Olaniyi S. Iyiola et.al.	2410.01099	null
2024-10-01	Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration	Guy Ohayon et.al.	2410.00418	link
2024-10-01	GLMHA: A Guided Low-rank Multi-Head Self-Attention for Efficient Image Restoration and Spectral Reconstruction	Zaid Ilyas et.al.	2410.00380	null
2024-09-30	A Survey on Diffusion Models for Inverse Problems	Giannis Daras et.al.	2410.00083	null
2024-09-30	UIR-LoRA: Achieving Universal Image Restoration through Multiple Low-Rank Adaptation	Cheng Zhang et.al.	2409.20197	link
2024-09-28	Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration	Chu-Jie Qin et.al.	2409.19403	link
2024-09-26	Toward Efficient Deep Blind RAW Image Restoration	Marcos V. Conde et.al.	2409.18204	link
2024-09-26	Taming Diffusion Prior for Image Super-Resolution with Domain Shift SDEs	Qinpeng Cui et.al.	2409.17778	link
2024-10-05	PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions	Weifeng Lin et.al.	2409.15278	link
2024-09-18	Denoising diffusion models for high-resolution microscopy image restoration	Pamela Osuna-Vargas et.al.	2409.12078	null
2024-09-16	Taming Diffusion Models for Image Restoration: A Review	Ziwei Luo et.al.	2409.10353	null
2024-09-12	Quaternion Nuclear Norm minus Frobenius Norm Minimization for color image reconstruction	Yu Guo et.al.	2409.07797	null
2024-09-11	PanAdapter: Two-Stage Fine-Tuning with Spatial-Spectral Priors Injecting for Pansharpening	RuoCheng Wu et.al.	2409.06980	null
2024-09-24	Lightweight single-image super-resolution network based on dual paths	Li Ke et.al.	2409.06590	null
2024-09-10	Multi-Weather Image Restoration via Histogram-Based Transformer Feature Enhancement	Yang Wen et.al.	2409.06334	null
2024-09-10	AgileIR: Memory-Efficient Group Shifted Windows Attention for Agile Image Restoration	Hongyi Cai et.al.	2409.06206	null
2024-09-07	Power Line Aerial Image Restoration under Adverse Weather: Datasets and Baselines	Sai Yang et.al.	2409.04812	link
2024-09-06	Empirical Bayesian image restoration by Langevin sampling with a denoising diffusion implicit prior	Charlesquin Kemajou Mbakam et.al.	2409.04384	null
2024-09-05	Data-free Distillation with Degradation-prompt Diffusion for Multi-weather Image Restoration	Pei Wang et.al.	2409.03455	null
2024-09-05	Multiple weather images restoration using the task transformer and adaptive mixup strategy	Yang Wen et.al.	2409.03249	null
2024-09-05	Perceptual-Distortion Balanced Image Super-Resolution is a Multi-Objective Optimization Problem	Qiwen Zhu et.al.	2409.03179	link
2024-09-03	Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language Models	Jiaqi Xu et.al.	2409.02101	link
2024-09-03	F2former: When Fractional Fourier Meets Deep Wiener Deconvolution and Selective Frequency Transformer for Image Deblurring	Subhajit Paul et.al.	2409.02056	null
2024-09-03	GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting	Zixuan Guo et.al.	2409.01581	null
2024-09-01	Accurate Forgetting for All-in-One Image Restoration Model	Xin Su et.al.	2409.00685	null
2024-08-30	AWRaCLe: All-Weather Image Restoration using Visual In-Context Learning	Sudarshan Rajagopalan et.al.	2409.00263	null
2024-08-30	Efficient Image Restoration through Low-Rank Adaptation and Stable Diffusion XL	Haiyang Zhao et.al.	2408.17060	null
2024-08-29	GameIR: A Large-Scale Synthesized Ground-Truth Dataset for Image Restoration over Gaming Content	Lebin Zhou et.al.	2408.16866	null
2024-08-29	Enhanced Control for Diffusion Bridge in Image Restoration	Conghan Yue et.al.	2408.16303	link
2024-08-28	Perceive-IR: Learning to Perceive Degradation Better for All-in-One Image Restoration	Xu Zhang et.al.	2408.15994	null
2024-08-27	A Preliminary Exploration Towards General Image Restoration	Xiangtao Kong et.al.	2408.15143	null
2024-08-22	CODE: Confident Ordinary Differential Editing	Bastien van Delft et.al.	2408.12418	link
2024-08-21	OAPT: Offset-Aware Partition Transformer for Double JPEG Artifacts Removal	Qiao Mo et.al.	2408.11480	link
2024-08-21	Taming Generative Diffusion for Universal Blind Image Restoration	Siwei Tu et.al.	2408.11287	null
2024-08-19	Multi-Scale Representation Learning for Image Restoration with State-Space Model	Yuhong He et.al.	2408.10145	null
2024-08-19	Harnessing Multi-resolution and Multi-scale Attention for Underwater Image Restoration	Alik Pramanick et.al.	2408.09912	link
2024-08-17	Re-boosting Self-Collaboration Parallel Prompt GAN for Unsupervised Image Restoration	Xin Lin et.al.	2408.09241	link
2024-08-15	Unsupervised Variational Translator for Bridging Image Restoration and High-Level Vision Tasks	Jiawei Wu et.al.	2408.08149	link
2024-08-28	HAIR: Hypernetworks-based All-in-One Image Restoration	Jin Cao et.al.	2408.08091	link
2024-08-13	Review Learning: Advancing All-in-One Ultra-High-Definition Image Restoration Training Method	Xin Su et.al.	2408.06709	null
2024-08-12	Wavelet based inpainting detection	Barglazan Adrian-Alin et.al.	2408.06429	null
2024-08-10	Greedy randomized block Kaczmarz method for matrix equation AXB=C and its applications in color image restoration	Wenli Wang et.al.	2408.05444	null
2024-08-08	Physical prior guided cooperative learning framework for joint turbulence degradation estimation and infrared video restoration	Ziran Zhang et.al.	2408.04227	null
2024-08-08	MultiColor: Image Colorization by Learning from Multiple Color Spaces	Xiangcheng Du et.al.	2408.04172	null
2024-08-28	Multi-weather Cross-view Geo-localization Using Denoising Diffusion Models	Tongtong Feng et.al.	2408.02408	null
2024-08-02	Contribution-based Low-Rank Adaptation with Pre-training Model for Real Image Restoration	Donwon Park et.al.	2408.01099	null
2024-08-01	A Prior Embedding-Driven Architecture for Long Distance Blind Iris Recognition	Qi Xiong et.al.	2408.00210	null
2024-07-30	UniProcessor: A Text-induced Unified Low-level Image Processor	Huiyu Duan et.al.	2407.20928	link
2024-07-27	Inverse Problems with Diffusion Models: A MAP Estimation Perspective	Sai bharath chandra Gutha et.al.	2407.20784	link
2024-07-27	Multi-Expert Adaptive Selection: Task-Balancing for All-in-One Image Restoration	Xiaoyan Yu et.al.	2407.19139	link
2024-07-19	GroupCDL: Interpretable Denoising and Compressed Sensing MRI via Learned Group-Sparsity and Circulant Attention	Nikola Janjusevic et.al.	2407.18967	null
2024-07-26	Dilated Strip Attention Network for Image Restoration	Fangwei Hao et.al.	2407.18613	null
2024-07-25	RestoreAgent: Autonomous Image Restoration Agent via Multimodal Large Language Models	Haoyu Chen et.al.	2407.18035	null
2024-07-23	CLII: Visual-Text Inpainting via Cross-Modal Predictive Interaction	Liang Zhao et.al.	2407.16204	null
2024-07-23	Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems	Sojin Lee et.al.	2407.16125	link
2024-07-20	Deep Learning CT Image Restoration using System Blur and Noise Models	Yijie Yuan et.al.	2407.14983	null
2024-07-20	Dual High-Order Total Variation Model for Underwater Image Restoration	Yuemei Li et.al.	2407.14868	link
2024-07-18	Any Image Restoration with Efficient Automatic Degradation Adaptation	Bin Ren et.al.	2407.13372	link
2024-07-18	Training-Free Large Model Priors for Multiple-in-One Image Restoration	Xuanhua He et.al.	2407.13181	null
2024-07-21	HPPP: Halpern-type Preconditioned Proximal Point Algorithms and Applications to Image Restoration	Shuchang Zhang et.al.	2407.13120	link
2024-07-17	GRIDS: Grouped Multiple-Degradation Restoration with Image Degradation Similarity	Shuo Cao et.al.	2407.12273	null
2024-07-16	Haze-Aware Attention Network for Single-Image Dehazing	Lihan Tong et.al.	2407.11505	null
2024-07-31	Restore-RWKV: Efficient and Effective Medical Image Restoration with RWKV	Zhiwen Yang et.al.	2407.11087	link
2024-07-15	In-Loop Filtering via Trained Look-Up Tables	Zhuoyuan Li et.al.	2407.10926	null
2024-07-15	MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration	Yulin Ren et.al.	2407.10833	null
2024-07-25	Restoring Images in Adverse Weather Conditions via Histogram Transformer	Shangquan Sun et.al.	2407.10172	link
2024-07-12	Region Attention Transformer for Medical Image Restoration	Zhiwen Yang et.al.	2407.09268	link
2024-07-12	Exploring Richer and More Accurate Information via Frequency Selection for Image Restoration	Hu Gao et.al.	2407.08950	link
2024-07-11	Single-Image Shadow Removal Using Deep Learning: A Comprehensive Survey	Lanqing Guo et.al.	2407.08865	link
2024-07-11	Haar Nuclear Norms with Applications to Remote Sensing Imagery Restoration	Shuang Xu et.al.	2407.08509	null
2024-07-10	Aging-Resistant Wideband Precoding in 5G and Beyond Using 3D Convolutional Neural Networks	Alejandro Villena-Rodriguez et.al.	2407.07434	null
2024-07-15	Asymmetric Mask Scheme for Self-Supervised Real Image Denoising	Xiangyu Liao et.al.	2407.06514	link
2024-07-07	Multi-scale Conditional Generative Modeling for Microscopic Image Restoration	Luzhe Huang et.al.	2407.05259	null
2024-07-06	Robust Skin Color Driven Privacy Preserving Face Recognition via Function Secret Sharing	Dong Han et.al.	2407.05045	null
2024-07-05	On a nonlinear nonlocal reaction-diffusion system applied to image restoration	Yuhang Li et.al.	2407.04347	null
2024-07-04	Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration	Yuhong Zhang et.al.	2407.03636	null
2024-07-04	MRIR: Integrating Multimodal Insights for Diffusion-based Realistic Image Restoration	Yuhong Zhang et.al.	2407.03635	null
2024-07-02	Zero-shot Video Restoration and Enhancement Using Pre-Trained Image Diffusion Model	Cong Cao et.al.	2407.01960	null
2024-06-30	Learning Frequency-Aware Dynamic Transformers for All-In-One Image Restoration	Zenglin Shi et.al.	2407.01636	null
2024-07-01	Improving Diffusion Inverse Problem Solving with Decoupled Noise Annealing	Bingliang Zhang et.al.	2407.01521	link
2024-07-01	DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models	Chang-Han Yeh et.al.	2407.01519	link
2024-07-01	Unrolling Plug-and-Play Gradient Graph Laplacian Regularizer for Image Restoration	Jianghe Cai et.al.	2407.01469	null
2024-07-01	Blind Inversion using Latent Diffusion Priors	Weimin Bai et.al.	2407.01027	null
2024-06-30	Instruct-IPT: All-in-One Image Processing Transformer via Weight Modulation	Yuchuan Tian et.al.	2407.00676	link
2024-06-27	Using diffusion model as constraint: Empower Image Restoration Network Training with Diffusion Model	Jiangtong Tan et.al.	2406.19030	link
2024-06-26	Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration	Kang Liao et.al.	2406.18516	link
2024-06-26	ConStyle v2: A Strong Prompter for All-in-One Image Restoration	Dongqi Fan et.al.	2406.18242	link
2024-06-26	MFDNet: Multi-Frequency Deflare Network for Efficient Nighttime Flare Removal	Yiguo Jiang et.al.	2406.18079	link
2024-06-24	DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution	Aiwen Jiang et.al.	2406.16477	link
2024-06-22	Ultra-High-Definition Restoration: New Benchmarks and A Dual Interaction Prior-Driven Solution	Liyan Wang et.al.	2406.13607	link
2024-06-19	Diffusion Model-based FOD Restoration from High Distortion in dMRI	Shuo Huang et.al.	2406.13209	null
2024-06-18	Restorer: Solving Multiple Image Restoration Tasks with One Set of Parameters	Jiawei Mao et.al.	2406.12587	link
2024-06-13	DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer	Wei-Ting Chen et.al.	2406.09622	null
2024-06-13	Blind Super-Resolution via Meta-learning and Markov Chain Monte Carlo Simulation	Jingyuan Xia et.al.	2406.08896	link
2024-06-12	LayeredDoc: Domain Adaptive Document Restoration with a Layer Separation Approach	Maria Pilligua et.al.	2406.08610	link
2024-06-12	DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor	Juncheng Wu et.al.	2406.08377	link
2024-06-14	One-Step Effective Diffusion Network for Real-World Image Super-Resolution	Rongyuan Wu et.al.	2406.08177	link
2024-06-12	3D CBCT Challenge 2024: Improved Cone Beam CT Reconstruction using SwinIR-Based Sinogram and Image Enhancement	Sasidhar Alavala et.al.	2406.08048	null
2024-06-12	DemosaicFormer: Coarse-to-Fine Demosaicing Network for HybridEVS Camera	Senyan Xu et.al.	2406.07951	link
2024-06-11	Beware of Aliases – Signal Preservation is Crucial for Robust Image Restoration	Shashank Agnihotri et.al.	2406.07435	null
2024-06-11	Unleashing the Denoising Capability of Diffusion Prior for Solving Inverse Problems	Jiawei Zhang et.al.	2406.06959	link
2024-06-07	Optimal Eye Surgeon: Finding Image Priors through Sparse Generators at Initialization	Avrajit Ghosh et.al.	2406.05288	link
2024-06-06	Diffusion-based image inpainting with internal learning	Nicolas Cherel et.al.	2406.04206	link
2024-06-04	Deep Block Proximal Linearised Minimisation Algorithm for Non-convex Inverse Problems	Chaoyan Huang et.al.	2406.02458	null
2024-06-02	Correlation Matching Transformation Transformers for UHD Image Restoration	Cong Wang et.al.	2406.00629	link
2024-05-30	Sharing Key Semantics in Transformer Makes Efficient Image Restoration	Bin Ren et.al.	2405.20008	link
2024-05-30	All-In-One Medical Image Restoration via Task-Adaptive Routing	Zhiwen Yang et.al.	2405.19769	link
2024-05-29	Blind Image Restoration via Fast Diffusion Inversion	Hamadi Chihaoui et.al.	2405.19572	link
2024-05-27	Fast Samplers for Inverse Problems in Iterative Refinement Models	Kushagra Pandey et.al.	2405.17673	link
2024-06-04	Looks Too Good To Be True: An Information-Theoretic Analysis of Hallucinations in Generative Restoration Models	Regev Cohen et.al.	2405.16475	null
2024-05-24	Hierarchical Uncertainty Exploration via Feedforward Posterior Trees	Elias Nehme et.al.	2405.15719	null
2024-06-01	Efficient Degradation-aware Any Image Restoration	Eduard Zamfir et.al.	2405.15475	null
2024-05-24	Blaze3DM: Marry Triplane Representation with Diffusion for 3D Medical Inverse Problem Solving	Jia He et.al.	2405.15241	null
2024-05-23	Efficient Visual State Space Model for Image Deblurring	Lingshun Kong et.al.	2405.14343	link
2024-05-22	Perceptual Fairness in Image Restoration	Guy Ohayon et.al.	2405.13805	null
2024-05-21	DARK: Denoising, Amplification, Restoration Kit	Zhuoheng Li et.al.	2405.12891	link
2024-05-21	Spatial-aware Attention Generative Adversarial Network for Semi-supervised Anomaly Detection in Medical Image	Zerui Zhang et.al.	2405.12872	link
2024-05-20	A New Cross-Space Total Variation Regularization Model for Color Image Restoration with Quaternion Blur Operator	Zhigang Jia et.al.	2405.12114	null
2024-05-19	Unsupervised Image Prior via Prompt Learning and CLIP Semantic Guidance for Low-Light Image Enhancement	Igor Morawski et.al.	2405.11478	null
2024-05-19	Emphasizing Crucial Features for Efficient Image Restoration	Hu Gao et.al.	2405.11468	link
2024-05-17	A Versatile Framework for Analyzing Galaxy Image Data by Implanting Human-in-the-loop on a Large Vision Model	Mingxiang Fu et.al.	2405.10890	null
2024-05-16	RSDehamba: Lightweight Vision Mamba for Remote Sensing Satellite Image Dehazing	Huiling Zhou et.al.	2405.10030	null
2024-05-16	NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge	Jie Liang et.al.	2405.09923	null
2024-05-15	Inference in higher-order undirected graphical models and binary polynomial optimization	Aida Khajavirad et.al.	2405.09727	null
2024-05-13	FRRffusion: Unveiling Authenticity with Diffusion-Based Face Retouching Reversal	Fengchuang Xing et.al.	2405.07582	link
2024-05-09	RPBG: Towards Robust Neural Point-based Graphics in the Wild	Qingtian Zhu et.al.	2405.05663	link
2024-05-07	DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks	Jiaxin Zhang et.al.	2405.04408	link
2024-05-11	Residual-Conditioned Optimal Transport: Towards Structure-Preserving Unpaired and Paired Image Restoration	Xiaole Tang et.al.	2405.02843	link
2024-05-04	Deep Image Restoration For Image Anti-Forensics	Eren Tahir et.al.	2405.02751	link
2024-05-23	SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising	Guanyiman Fu et.al.	2405.01726	link
2024-04-29	Reconstructing Satellites in 3D from Amateur Telescope Images	Zhiming Chang et.al.	2404.18394	null
2024-04-26	PromptCIR: Blind Compressed Image Restoration with Prompt Learning	Bingchen Li et.al.	2404.17433	link
2024-04-26	One-Shot Image Restoration	Deborah Pereg et.al.	2404.17426	null
2024-05-07	NTIRE 2024 Quality Assessment of AI-Generated Content Challenge	Xiaohong Liu et.al.	2404.16687	null
2024-04-26	A Survey on Visual Mamba	Hanwei Zhang et.al.	2404.15956	null
2024-04-26	A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-Resolution	Zhixiong Yang et.al.	2404.15620	link
2024-04-22	Face2Face: Label-driven Facial Retouching Restoration	Guanhua Zhao et.al.	2404.14177	null
2024-04-22	CRNet: A Detail-Preserving Network for Unified Image Restoration and Enhancement Task	Kangzhen Yang et.al.	2404.14132	link
2024-04-24	Bracketing Image Restoration and Enhancement with High-Low Frequency Decomposition	Genggeng Chen et.al.	2404.13537	link
2024-04-20	PCQA: A Strong Baseline for AIGC Quality Assessment Based on Prompt Condition	Xi Fang et.al.	2404.13299	null
2024-04-17	CU-Mamba: Selective State Space Models with Channel Learning for Image Restoration	Rui Deng et.al.	2404.11778	null
2024-04-17	AdaIR: Exploiting Underlying Similarities of Image Restoration Tasks with Adapters	Hao-Wei Chen et.al.	2404.11475	null
2024-04-16	Improving Bracket Image Restoration and Enhancement with Flow-guided Alignment and Enhanced Feature Aggregation	Wenjie Lin et.al.	2404.10358	null
2024-04-16	Referring Flexible Image Restoration	Runwei Guan et.al.	2404.10342	link
2024-04-17	OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model	Runyi Li et.al.	2404.10312	null
2024-04-15	The Problem Of Image Super-Resolution, Denoising And Some Image Restoration Methods In Deep Learning Models	Ngoc-Giau Pham et.al.	2404.09817	null
2024-04-15	Equipping Diffusion Models with Differentiable Spatial Entropy for Low-Light Image Enhancement	Wenyi Lian et.al.	2404.09735	link
2024-04-15	Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models	Ziwei Luo et.al.	2404.09732	link
2024-04-11	TBSN: Transformer-Based Blind-Spot Network for Self-Supervised Image Denoising	Junyi Li et.al.	2404.07846	link
2024-04-11	Joint Conditional Diffusion Model for Image Restoration with Mixed Degradations	Yufeng Yue et.al.	2404.07770	null
2024-04-10	Unfolding ADMM for Enhanced Subspace Clustering of Hyperspectral Images	Xianlu Li et.al.	2404.07112	link
2024-04-07	STAIC regularization for spatio-temporal image reconstruction	Deepak G Skariah et.al.	2404.05070	null
2024-04-09	Empowering Image Recovery: A Multi-Attention Approach	Juan Wen et.al.	2404.04617	null
2024-04-04	DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior	Yiming Zhang et.al.	2404.03642	null
2024-04-02	Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration	Akshay Dudhane et.al.	2404.02154	link
2024-03-31	GAMA-IR: Global Additive Multidimensional Averaging for Fast Image Restoration	Youssef Mansour et.al.	2404.00807	null
2024-03-31	IPT-V2: Efficient Image Processing Transformer using Hierarchical Attentions	Zhijun Tu et.al.	2404.00633	null
2024-03-30	Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration	Shihao Zhou et.al.	2404.00288	null
2024-03-30	Look-Around Before You Leap: High-Frequency Injected Transformer for Image Restoration	Shihao Zhou et.al.	2404.00279	null
2024-03-29	Deeper, Sharper, Faster: Application of Efficient Transformer to Galaxy Image Restoration	Hyosun Park et.al.	2404.00102	link
2024-03-27	Towards Image Ambient Lighting Normalization	Florin-Alexandru Vasluianu et.al.	2403.18730	link
2024-03-26	Serpent: Scalable and Efficient Image Restoration via Multi-scale Structured State Space Models	Mohammad Shahab Sepehri et.al.	2403.17902	null
2024-03-26	SeNM-VAE: Semi-Supervised Noise Modeling with Hierarchical Variational Autoencoder	Dihan Zheng et.al.	2403.17502	link
2024-03-26	Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance	Donghoon Ahn et.al.	2403.17377	link
2024-04-02	Distilling Semantic Priors from SAM to Efficient Image Restoration Models	Quan Zhang et.al.	2403.16368	null
2024-03-23	Graph Image Prior for Unsupervised Dynamic MRI Reconstruction	Zhongsen Li et.al.	2403.15770	link
2024-03-22	Latent Neural Cellular Automata for Resource-Efficient Image Restoration	Andrea Menta et.al.	2403.15525	null
2024-03-21	Osmosis: RGBD Diffusion Prior for Underwater Image Restoration	Opher Bar Nathan et.al.	2403.14837	null
2024-03-21	AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation	Yuning Cui et.al.	2403.14614	link
2024-03-26	Step-Calibrated Diffusion for Biomedical Optical Image Restoration	Yiwei Lyu et.al.	2403.13680	link
2024-03-20	A multilevel framework for accelerating uSARA in radio-interferometric imaging	Guillaume Lauga et.al.	2403.13385	null
2024-03-19	Multispectral Image Restoration by Generalized Opponent Transformation Total Variation	Zhantao Ma et.al.	2403.12770	null
2024-03-18	CasSR: Activating Image Power for Real-World Image Super-Resolution	Haolan Chen et.al.	2403.11451	null
2024-03-18	VmambaIR: Visual State Space Model for Image Restoration	Yuan Shi et.al.	2403.11423	link
2024-03-18	Divide-and-Conquer Posterior Sampling for Denoising Diffusion Priors	Yazid Janati et.al.	2403.11407	link
2024-03-17	Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model	Dian Zheng et.al.	2403.11157	link
2024-03-16	A Spectrum-based Image Denoising Method with Edge Feature Enhancement	Peter Luvton et.al.	2403.11036	null
2024-03-15	Solving General Noisy Inverse Problem via Posterior Sampling: A Policy Gradient Viewpoint	Haoyue Tang et.al.	2403.10585	null
2024-03-15	How Powerful Potential of Attention on Image Restoration?	Cong Wang et.al.	2403.10336	null
2024-03-15	BlindDiff: Empowering Degradation Modelling in Diffusion Models for Blind Image Super-Resolution	Feng Li et.al.	2403.10211	link
2024-03-20	D-YOLO a robust framework for object detection in adverse weather conditions	Zihan Chu et.al.	2403.09233	null
2024-03-13	Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data	Asad Aali et.al.	2403.08728	link
2024-03-12	Efficient Diffusion Model for Image Restoration by Residual Shifting	Zongsheng Yue et.al.	2403.07319	link
2024-03-12	Continual All-in-One Adverse Weather Removal with Knowledge Replay on a Unified Network Structure	De Cheng et.al.	2403.07292	link
2024-03-19	Boosting Image Restoration via Priors from Pre-trained Models	Xiaogang Xu et.al.	2403.06793	null
2024-03-10	Implicit Image-to-Image Schrodinger Bridge for CT Super-Resolution and Denoising	Yuang Wang et.al.	2403.06069	link
2024-03-12	Decoupled Data Consistency with Diffusion Purification for Image Restoration	Xiang Li et.al.	2403.06054	link
2024-03-09	Segmentation Guided Sparse Transformer for Under-Display Camera Image Restoration	Jingyun Xue et.al.	2403.05906	null
2024-03-09	Generalizing to Out-of-Sample Degradations via Model Reprogramming	Runhua Jiang et.al.	2403.05886	link
2024-03-08	Decoupling Degradations with Recurrent Network for Video Restoration in Under-Display Camera	Chengxu Liu et.al.	2403.05660	link
2024-03-07	FriendNet: Detection-Friendly Dehazing Network	Yihua Fan et.al.	2403.04443	link
2024-03-02	Extrapolated Plug-and-Play Three-Operator Splitting Methods for Nonconvex Optimization with Applications to Image Restoration	Zhongming Wu et.al.	2403.01144	link
2024-02-26	Randomized Algorithms for Solving Singular Value Decomposition Problems with Matlab Toolbox	Xiaowen Li et.al.	2402.17794	null
2024-02-25	Diffusion Posterior Proximal Sampling for Image Restoration	Hongjie Wu et.al.	2402.16907	link
2024-03-04	Learning to See Through Dazzle	Xiaopeng Peng et.al.	2402.15919	null
2024-02-24	HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models	Li Pang et.al.	2402.15865	link
2024-03-07	IRConStyle: Image Restoration Framework Using Contrastive Learning and Style Transfer	Dongqi Fan et.al.	2402.15784	link
2024-02-23	MambaIR: A Simple Baseline for Image Restoration with State-Space Model	Hang Guo et.al.	2402.15648	link
2024-02-21	Adversarial Purification and Fine-tuning for Robust UDC Image Restoration	Zhenbo Song et.al.	2402.13629	null
2024-02-14	DestripeCycleGAN: Stripe Simulation CycleGAN for Unsupervised Infrared Image Destriping	Shiqi Yang et.al.	2402.09101	null
2024-02-10	Gyroscope-Assisted Motion Deblurring Network	Simin Luan et.al.	2402.06854	link
2024-02-08	Descanning: From Scanned to the Original Images with a Color Correction Diffusion Model	Junghun Cha et.al.	2402.05350	null
2024-02-16	U-shaped Vision Mamba for Single Image Dehazing	Zhuoran Zheng et.al.	2402.04139	link
2024-02-08	Analysis of Deep Image Prior and Exploiting Self-Guidance for Image Reconstruction	Shijun Liang et.al.	2402.04097	null
2024-02-05	Rethinking RGB Color Representation for Image Restoration Models	Jaerin Lee et.al.	2402.03399	null
2024-02-05	Knowledge-driven deep learning for fast MR imaging: undersampled MR image reconstruction from supervised to un-supervised learning	Shanshan Wang et.al.	2402.02704	null
2024-02-04	Key-Graph Transformer for Image Restoration	Bin Ren et.al.	2402.02634	null
2024-03-04	RecNet: An Invertible Point Cloud Encoding through Range Image Embeddings for Multi-Robot Map Sharing and Reconstruction	Nikolaos Stathoulopoulos et.al.	2402.02192	null
2024-02-01	Plug-and-Play image restoration with Stochastic deNOising REgularization	Marien Renaud et.al.	2402.01779	link
2024-02-29	LIR: A Lightweight Baseline for Image Restoration	Dongqi Fan et.al.	2402.01368	link
2024-01-31	Spatial-and-Frequency-aware Restoration method for Images based on Diffusion Models	Kyungsung Lee et.al.	2401.17629	null
2024-01-31	Task-Oriented Diffusion Model Compression	Geonung Kim et.al.	2401.17547	null
2024-02-21	InstructIR: High-Quality Image Restoration Following Human Instructions	Marcos V. Conde et.al.	2401.16468	link
2024-01-28	UP-CrackNet: Unsupervised Pixel-Wise Road Crack Detection via Adversarial Image Restoration	Nachuan Ma et.al.	2401.15647	null
2024-01-26	CascadedGaze: Efficiency in Global Context Extraction for Image Restoration	Amirhosein Ghasemabadi et.al.	2401.15235	link
2024-01-24	Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild	Fanghua Yu et.al.	2401.13627	null
2024-01-24	Unified-Width Adaptive Dynamic Network for All-In-One Image Restoration	Yimin Xu et.al.	2401.13221	link
2024-01-21	LLMRA: Multi-modal Large Language Model based Restoration Assistant	Xiaoyu Jin et.al.	2401.11401	null
2024-01-19	MixNet: Towards Effective and Efficient UHD Low-Light Image Enhancement	Chen Wu et.al.	2401.10666	link
2024-01-03	Image Restoration: A Comparative Analysis of Image De noising Using Different Spatial Filtering Techniques	E. G. Onyedinma et.al.	2401.09460	null
2024-01-16	Deep Linear Array Pushbroom Image Restoration: A Degradation Pipeline and Jitter-Aware Restoration Network	Zida Chen et.al.	2401.08171	link
2024-01-12	LiDAR Depth Map Guided Image Compression Model	Alessandro Gnutti et.al.	2401.06517	null
2024-01-10	Content-Aware Depth-Adaptive Image Restoration	Tom Richard Vargis et.al.	2401.05049	null
2024-01-07	Towards Effective Multiple-in-One Image Restoration: A Sequential and Prompt Learning Strategy	Xiangtao Kong et.al.	2401.03379	link
2024-01-06	MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and Beyond	Yupei Lin et.al.	2401.03221	null
2024-01-05	Analysis of a wavelet frame based two-scale model for enhanced edges	Bin Dong et.al.	2401.02688	null
2024-01-04	Enhancing RAW-to-sRGB with Decoupled Style Structure in Fourier Domain	Xuanhua He et.al.	2401.02161	link
2024-01-01	Bracketing is All You Need: Unifying Image Restoration and Enhancement Tasks with Multi-Exposure Images	Zhilu Zhang et.al.	2401.00766	link
2023-12-31	UGPNet: Universal Generative Prior for Image Restoration	Hwayoon Lee et.al.	2401.00370	null
2023-12-28	Improving Image Restoration through Removing Degradations in Textual Representations	Jingbo Lin et.al.	2312.17334	link
2023-12-28	Personalized Restoration via Dual-Pivot Tuning	Pradyumna Chari et.al.	2312.17234	null
2023-12-28	Restoration by Generation with Constrained Priors	Zheng Ding et.al.	2312.17161	null
2024-01-10	DarkShot: Lighting Dark Images with Low-Compute and High-Quality	Jiazhang Zheng et.al.	2312.16805	null
2023-12-27	Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation	Rongyu Zhang et.al.	2312.16610	null
2023-12-27	Image Restoration by Denoising Diffusion Models with Iteratively Preconditioned Guidance	Tomer Garber et.al.	2312.16519	link
2023-12-25	Rotation Equivariant Proximal Operator for Deep Unfolding Methods in Image Restoration	Jiahong Fu et.al.	2312.15701	link
2023-12-25	MuLA-GAN: Multi-Level Attention GAN for Enhanced Underwater Visibility	Ahsan Baidar Bakht et.al.	2312.15633	null
2023-12-24	Perception-Distortion Balanced Super-Resolution: A Multi-Objective Optimization Perspective	Lingchen Sun et.al.	2312.15408	link
2023-12-19	Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion	Fan Zhang et.al.	2312.12471	link
2023-12-18	TIP: Text-Driven Image Processing with Semantic and Restoration Instructions	Chenyang Qi et.al.	2312.11595	null
2023-12-17	Bengali License Plate Recognition: Unveiling Clarity with CNN and GFP-GAN	Noushin Afrin et.al.	2312.10701	link
2023-12-16	Image Restoration Through Generalized Ornstein-Uhlenbeck Bridge	Conghan Yue et.al.	2312.10299	link
2023-12-15	Image Deblurring using GAN	Zhengdong Li et.al.	2312.09496	null
2023-12-12	AdaptIR: Parameter Efficient Multi-task Adaptation for Pre-trained Image Restoration Models	Hang Guo et.al.	2312.08881	link
2023-12-14	Guided Image Restoration via Simultaneous Feature and Image Guided Fusion	Xinyi Liu et.al.	2312.08853	null
2023-12-16	VQCNIR: Clearer Night Image Restoration with Vector-Quantized Codebook	Wenbin Zou et.al.	2312.08606	link
2023-12-12	Uncertainty Visualization via Low-Dimensional Posterior Projections	Omer Yair et.al.	2312.07804	link
2023-12-12	Hyper-Restormer: A General Hyperspectral Image Restoration Transformer for Remote Sensing Imaging	Yo-Yu Lai et.al.	2312.07016	null
2023-12-12	WaterHE-NeRF: Water-ray Tracing Neural Radiance Fields for Underwater Scene Reconstruction	Jingchun Zhou et.al.	2312.06946	null
2023-12-11	Textual Prompt Guided Image Restoration	Qiuhai Yan et.al.	2312.06162	link
2023-12-08	Fine Dense Alignment of Image Bursts through Camera Pose and Depth Estimation	Bruno Lecouat et.al.	2312.05190	null
2023-12-08	Prompt-In-Prompt Learning for Universal Image Restoration	Zilong Li et.al.	2312.05038	link
2023-12-08	Decoupling Degradation and Content Processing for Adverse Weather Image Restoration	Xi Wang et.al.	2312.05006	null
2023-12-06	Training Neural Networks on RAW and HDR Images for Restoration Tasks	Lei Luo et.al.	2312.03640	link
2023-12-05	Multimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and Fidelity for All-in-One Image Restoration	Yuang Ai et.al.	2312.02918	null
2023-12-05	Deep-learning-driven end-to-end metalens imaging	Joonhyuk Seo et.al.	2312.02669	link
2023-12-02	Exploiting Diffusion Priors for All-in-One Image Restoration	Yuanbiao Gou et.al.	2312.02197	link
2023-12-05	Multi-task Image Restoration Guided By Robust DINO Features	Xin Lin et.al.	2312.01677	null
2023-12-05	T3D: Towards 3D Medical Image Understanding through Vision-Language Pre-training	Che Liu et.al.	2312.01529	null
2023-12-03	An Augmented Lagrangian Primal-Dual Semismooth Newton Method for Multi-Block Composite Optimization	Zhanwang Deng et.al.	2312.01273	null
2023-12-01	Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution	Xi Yang et.al.	2312.00853	link
2023-11-30	A Novel Variational Approach for Multiphoton Microscopy Image Restoration: from PSF Estimation to 3D Deconvolution	Julien Ajdenbaum et.al.	2311.18386	null
2023-11-29	Variational Bayes image restoration with compressive autoencoders	Maud Biquard et.al.	2311.17744	null
2023-11-29	Improving Stability during Upsampling – on the Importance of Spatial Context	Shashank Agnihotri et.al.	2311.17524	null
2023-11-28	Wavelet-based Fourier Information Interaction with Frequency Diffusion Adjustment for Underwater Image Restoration	Chen Zhao et.al.	2311.16845	link
2023-11-28	Decomposer: Semi-supervised Learning of Image Restoration and Image Decomposition	Boris Meinardus et.al.	2311.16829	null
2023-11-28	Full-resolution MLPs Empower Medical Dense Prediction	Mingyuan Meng et.al.	2311.16707	link
2023-11-27	Joint Deep Image Restoration and Unsupervised Quality Assessment	Hakan Emre Gedik et.al.	2311.16372	null
2023-11-26	FLAIR: A Conditional Diffusion Framework with Applications to Face Video Restoration	Zihao Zou et.al.	2311.15445	null
2023-11-20	Clarity ChatGPT: An Interactive and Adaptive Processing System for Image Restoration and Enhancement	Yanyan Wei et.al.	2311.11695	null
2023-11-20	Reti-Diff: Illumination Degradation Image Restoration with Retinex-based Latent Diffusion Model	Chunming He et.al.	2311.11638	link
2023-11-20	Deep Equilibrium Diffusion Restoration with Parallel Sampling	Jiezhang Cao et.al.	2311.11600	link
2023-11-14	The Perception-Robustness Tradeoff in Deterministic Image Restoration	Guy Ohayon et.al.	2311.09253	null
2023-11-09	Dynamic Association Learning of Self-Attention and Convolution in Image Restoration	Kui Jiang et.al.	2311.05147	null
2023-11-08	LuminanceL1Loss: A loss function which measures percieved brightness and colour differences	Dominic De Jonge et.al.	2311.04614	null
2023-11-21	Energy-Calibrated VAE with Test Time Free Lunch	Yihong Luo et.al.	2311.04071	link
2023-11-07	Constrained Regularization by Denoising with Automatic Parameter Selection	Pasquale Cascarano et.al.	2311.03819	null
2023-11-22	Pelvic floor MRI segmentation based on semi-supervised deep learning	Jianwei Zuo et.al.	2311.03105	null
2023-11-06	A New Extrapolation Economy Cascadic Multigrid Method for Image Restoration Problems	Zhaoteng Chu et.al.	2311.03010	null
2023-11-08	Deep Image Semantic Communication Model for Artificial Intelligent Internet of Things	Li Ping Qian et.al.	2311.02926	link
2023-11-03	Cascadic Tensor Multigrid Method and Economic Cascadic Tensor Multigrid Method for Image Restoration Problems	Ziqi Yan et.al.	2311.01924	null
2023-11-02	Convergent plug-and-play with proximal denoiser and unconstrained regularization parameter	Samuel Hurault et.al.	2311.01216	null
2023-10-31	Image Restoration with Point Spread Function Regularization and Active Learning	Peng Jia et.al.	2311.00186	null
2023-10-27	Always Clear Days: Degradation Type and Severity Aware All-In-One Adverse Weather Removal	Yu-Wei Chen et.al.	2310.18293	link
2023-10-24	From Posterior Sampling to Meaningful Diversity in Image Restoration	Noa Cohen et.al.	2310.16047	null
2023-10-19	Neural Degradation Representation Learning for All-In-One Image Restoration	Mingde Yao et.al.	2310.12848	link
2023-10-18	A Comparative Study of Image Restoration Networks for General Backbone Network Design	Xiangyu Chen et.al.	2310.11881	link
2023-10-16	Unifying Image Processing as Visual Prompting Question Answering	Yihao Liu et.al.	2310.10513	null
2023-11-19	AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion	Yitong Jiang et.al.	2310.10123	null
2023-10-12	Frequency-Aware Re-Parameterization for Over-Fitting Based Image Compression	Yun Ye et.al.	2310.08068	null
2023-10-10	Tweedie Moment Projected Diffusions For Inverse Problems	Benjamin Boys et.al.	2310.06721	null
2023-10-06	Degradation-Aware Self-Attention Based Transformer for Blind Image Super-Resolution	Qingguo Liu et.al.	2310.04180	link
2023-11-07	Deformation-Invariant Neural Network and Its Applications in Distorted Image Restoration and Analysis	Han Zhang et.al.	2310.02641	null
2023-10-03	Leveraging Classic Deconvolution and Feature Extraction in Zero-Shot Image Restoration	Tomáš Chobola et.al.	2310.02097	link
2023-10-02	A Restoration Network as an Implicit Prior	Yuyang Hu et.al.	2310.01391	null
2023-10-02	Controlling Vision-Language Models for Universal Image Restoration	Ziwei Luo et.al.	2310.01018	link
2023-10-02	JPEG Information Regularized Deep Image Prior for Denoising	Tsukasa Takagi et.al.	2310.00894	null
2023-10-22	Guided Frequency Loss for Image Restoration	Bilel Benjdira et.al.	2309.15563	null
2023-09-27	Uncertainty Quantification via Neural Posterior Principal Components	Elias Nehme et.al.	2309.15533	null
2023-10-09	Survey on Deep Face Restoration: From Non-blind to Blind and Beyond	Wenjie Li et.al.	2309.15490	link
2023-09-21	License Plate Super-Resolution Using Diffusion Models	Sawsan AlHalawani et.al.	2309.12506	null
2023-09-21	Deshadow-Anything: When Segment Anything Model Meets Zero-shot shadow removal	Xiao Feng Zhang et.al.	2309.11715	null
2023-09-19	Local Lipschitz continuity for energy integrals with slow growth and lower order terms	Michela Eleuteri et.al.	2309.10727	null
2023-09-19	Reconstruct-and-Generate Diffusion Model for Detail-Preserving Image Denoising	Yujin Wang et.al.	2309.10714	null
2023-09-16	AOSR-Net: All-in-One Sandstorm Removal Network	Yazhong Si et.al.	2309.08838	null
2023-09-14	A Multi-scale Generalized Shrinkage Threshold Network for Image Blind Deblurring in Remote Sensing	Yujie Feng et.al.	2309.07524	null
2023-09-13	FAIR: Frequency-aware Image Restoration for Industrial Visual Anomaly Detection	Tongkun Liu et.al.	2309.07068	link
2023-09-12	Learning from History: Task-agnostic Model Contrastive Learning for Image Restoration	Gang Wu et.al.	2309.06023	link
2023-09-11	HAT: Hybrid Attention Transformer for Image Restoration	Xiangyu Chen et.al.	2309.05239	link
2023-10-10	Prompt-based Ingredient-Oriented All-in-One Image Restoration	Hu Gao et.al.	2309.03063	link
2023-09-05	SAM-Deblur: Let Segment Anything Boost Image Deblurring	Siwei Li et.al.	2309.02270	link
2023-09-05	Advanced Underwater Image Restoration in Complex Illumination Conditions	Yifan Song et.al.	2309.02217	null
2023-09-04	Memory augment is All You Need for image restoration	Xiao Feng Zhang et.al.	2309.01377	link
2023-09-04	Restoration Guarantee of Image Inpainting via Low Rank Patch Matrix Completion	Jian-Feng Cai et.al.	2309.01328	null
2023-09-03	Holistic Dynamic Frequency Transformer for Image Fusion and Exposure Correction	Xiaoke Shang et.al.	2309.01183	null
2023-08-29	DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior	Xinqi Lin et.al.	2308.15070	link
2023-09-05	MetaWeather: Few-Shot Weather-Degraded Image Restoration via Degradation Pattern Matching	Youngrae Kim et.al.	2308.14334	link
2023-08-27	Hierarchical Contrastive Learning for Pattern-Generalizable Image Corruption Detection	Xin Feng et.al.	2308.14061	link
2023-08-25	Residual Denoising Diffusion Models	Jiawei Liu et.al.	2308.13712	link
2023-08-24	MOFA: A Model Simplification Roadmap for Image Restoration on Mobile Devices	Xiangyu Chen et.al.	2308.12494	link
2023-08-23	Synergistic Multiscale Detail Refinement via Intrinsic Supervision for Underwater Image Enhancement	Dehuan Zhang et.al.	2308.11932	link
2023-08-20	Blind Face Restoration for Under-Display Camera via Dictionary Guided Transformer	Jingfan Tan et.al.	2308.10196	null
2023-08-22	WMFormer++: Nested Transformer for Visible Watermark Removal via Implict Joint Learning	Dongjian Huo et.al.	2308.10195	null
2023-08-18	Diffusion Models for Image Restoration and Enhancement – A Comprehensive Survey	Xin Li et.al.	2308.09388	link
2023-08-29	Learning A Coarse-to-Fine Diffusion Transformer for Image Restoration	Liyan Wang et.al.	2308.08730	link
2023-08-08	Under-Display Camera Image Restoration with Scattering Effect	Binbin Song et.al.	2308.04163	link
2023-08-06	Nest-DGIL: Nesterov-optimized Deep Geometric Incremental Learning for CS Image Reconstruction	Xiaohong Fan et.al.	2308.03807	link
2023-08-06	PNN: From proximal algorithms to robust unfolded image denoising networks and Plug-and-Play methods	Hoang Trieu Vy Le et.al.	2308.03139	null
2023-08-06	All-in-one Multi-degradation Image Restoration Network via Hierarchical Degradation Representation	Cheng Zhang et.al.	2308.03021	null
2023-08-06	Recurrent Spike-based Image Restoration under General Illumination	Lin Zhu et.al.	2308.03018	link
2023-08-01	Decomposition Ascribed Synergistic Learning for Unified Image Restoration	Jinghao Zhang et.al.	2308.00759	null
2023-07-27	The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation	Lingdong Kong et.al.	2307.15061	link
2023-07-26	SuperInpaint: Learning Detail-Enhanced Attentional Implicit Representation for Super-resolutional Image Inpainting	Canyu Zhang et.al.	2307.14489	null
2023-08-22	Phenotype-preserving metric design for high-content image reconstruction by generative inpainting	Vaibhav Sharma et.al.	2307.14436	link
2023-07-25	On the unreasonable vulnerability of transformers for image restoration – and an easy fix	Shashank Agnihotri et.al.	2307.13856	null
2023-07-24	A Theoretically Guaranteed Quaternion Weighted Schatten p-norm Minimization Method for Color Image Restoration	Qing-Hua Zhang et.al.	2307.12656	link
2023-07-20	Physics-Driven Turbulence Image Restoration with Stochastic Refinement	Ajay Jaiswal et.al.	2307.10603	link
2023-07-19	NTIRE 2023 Quality Assessment of Video Enhancement Challenge	Xiaohong Liu et.al.	2307.09729	null
2023-07-18	Unleashing the Imagination of Text: A Novel Framework for Text-to-image Person Retrieval via Exploring the Power of Words	Delong Liu et.al.	2307.09059	link
2023-07-18	Soft-IntroVAE for Continuous Latent space Image Super-Resolution	Zhi-Song Liu et.al.	2307.09008	null
2023-07-16	LUCYD: A Feature-Driven Richardson-Lucy Deconvolution Network	Tomáš Chobola et.al.	2307.07998	link
2023-07-15	DRM-IR: Task-Adaptive Deep Unfolding Network for All-In-One Image Restoration	Yuanshuo Cheng et.al.	2307.07688	null
2023-07-12	Latent Graph Attention for Enhanced Spatial Context	Ayush Singh et.al.	2307.04149	null
2023-06-29	FarSight: A Physics-Driven Whole-Body Biometric System at Large Distance and Altitude	Feng Liu et.al.	2306.17206	null
2023-06-27	Cutting-Edge Techniques for Depth Map Super-Resolution	Ryan Peterson et.al.	2306.15244	null
2023-06-23	ProRes: Exploring Degradation-aware Visual Prompt for Universal Image Restoration	Jiaqi Ma et.al.	2306.13653	link
2023-06-22	PromptIR: Prompting for All-in-One Blind Image Restoration	Vaishnav Potlapalli et.al.	2306.13090	link
2023-06-22	Restoration of the JPEG Maximum Lossy Compressed Face Images with Hourglass Block based on Early Stopping Discriminator	Jongwook Si et.al.	2306.12757	null
2023-06-21	Accelerating Multiframe Blind Deconvolution via Deep Learning	A. Asensio Ramos et.al.	2306.12078	link
2023-06-21	TransRef: Multi-Scale Reference Embedding Transformer for Reference-Guided Image Inpainting	Liang Liao et.al.	2306.11528	link
2023-07-31	Enlighten Anything: When Segment Anything Model Meets Low-Light Image Enhancement	Qihan Zhao et.al.	2306.10286	link
2023-06-15	Exploring the Application of Large-scale Pre-trained Models on Adverse Weather Removal	Zhentao Tan et.al.	2306.09008	null
2023-06-14	Investigation of the Challenges of Underwater-Visual-Monocular-SLAM	Michele Grimaldi et.al.	2306.08738	null
2023-06-13	Learning Image-Adaptive Codebooks for Class-Agnostic Image Restoration	Kechun Liu et.al.	2306.06513	null
2023-06-09	Illumination Controllable Dehazing Network based on Unsupervised Retinex Embedding	Jie Gui et.al.	2306.05675	link
2023-06-08	HQ-50K: A Large-scale, High-quality Dataset for Image Restoration	Qinhong Yang et.al.	2306.05390	link
2023-06-06	BokehOrNot: Transforming Bokeh Effect with Image Transformer and Lens Metadata Embedding	Zhihao Yang et.al.	2306.04032	link
2023-06-06	Convergent Bregman Plug-and-Play Image Restoration for Poisson Inverse Problems	Samuel Hurault et.al.	2306.03466	null
2023-06-05	Zero shot framework for satellite image restoration	Praveen Kandula et.al.	2306.02921	null
2023-06-04	ESTISR: Adapting Efficient Scene Text Image Super-resolution for Real-Scenes	Minghao Fu et.al.	2306.02443	link
2023-06-04	Deep Optimal Transport: A Practical Algorithm for Photo-realistic Image Restoration	Theo Adrai et.al.	2306.02342	link
2023-06-03	Unsupervised Low Light Image Enhancement Using SNR-Aware Swin Transformer	Zhijian Luo et.al.	2306.02082	null
2023-06-02	Fast and Interpretable Nonlocal Neural Networks for Image Denoising via Group-Sparse Convolutional Dictionary Learning	Nikola Janjušević et.al.	2306.01950	link
2023-06-02	Counting Crowds in Bad Weather	Zhi-Kai Huang et.al.	2306.01209	null
2023-06-01	Wavelet Image Restoration Using Multifractal Priors	Karl Young et.al.	2306.00309	null
2023-06-01	Low-Light Image Enhancement with Wavelet-based Diffusion Models	Hai Jiang et.al.	2306.00306	link
2023-05-31	A Unified Conditional Framework for Diffusion-based Image Restoration	Yi Zhang et.al.	2305.20049	null
2023-05-30	Wide & deep learning for spatial & intensity adaptive image restoration	Yadong Wang et.al.	2305.18708	link
2023-05-29	GridFormer: Residual Dense Transformer with Grid Structure for Image Restoration in Adverse Weather Conditions	Tao Wang et.al.	2305.17863	link
2023-05-28	PND-Net: Physics based Non-local Dual-domain Network for Metal Artifact Reduction	Jinqiu Xia et.al.	2305.17778	link
2023-05-27	Rethinking PRL: A Multiscale Progressively Residual Learning Network for Inverse Halftoning	Feiyu Li et.al.	2305.17355	link
2023-05-24	Learning INR for Event-guided Rolling Shutter Frame Correction, Deblur, and Interpolation	Yunfan Lu et.al.	2305.15078	link
2023-05-23	Generalized Expectation Maximization Framework for Blind Image Super Resolution	Yuxiao Li et.al.	2305.13880	null
2023-05-23	WaveDM: Wavelet-Based Diffusion Models for Image Restoration	Yi Huang et.al.	2305.13819	link
2023-05-23	A Dive into SAM Prior in Image Restoration	Zeyu Xiao et.al.	2305.13620	null
2023-05-22	Restore Anything Pipeline: Segment Anything Meets Image Restoration	Jiaxi Jiang et.al.	2305.13093	link
2023-05-19	SIDAR: Synthetic Image Dataset for Alignment & Restoration	Monika Kwiatkowski et.al.	2305.12036	link
2023-05-15	Neural information coding for efficient spike-based image denoising	Andrea Castagnetti et.al.	2305.11898	null
2023-05-22	RAMiT: Reciprocal Attention Mixing Transformer for Lightweight Image Restoration	Haram Choi et.al.	2305.11474	link
2023-05-17	Principal Uncertainty Quantification with Spatial Correlation for Image Restoration Problems	Omer Belhasin et.al.	2305.10124	link
2023-05-17	Restoring Images Captured in Arbitrary Hybrid Adverse Weather Conditions in One Go	Ye-Cong Wan et.al.	2305.09996	link
2023-05-15	Denoising Diffusion Models for Plug-and-Play Image Restoration	Yuanzhi Zhu et.al.	2305.08995	link
2023-05-15	Toward Moiré-Free and Detail-Preserving Demosaicking	Xuanchen Li et.al.	2305.08585	null
2023-05-13	A Two-Stage Real Image Deraining Method for GT-RAIN Challenge CVPR 2023 Workshop UG$^{2}$+ Track 3	Yun Guo et.al.	2305.07979	link

SAM

Publish Date	Title	Authors	PDF	Code
2025-07-21	ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction	Danhui Chen et.al.	2507.15803	null
2025-07-20	FastSmoothSAM: A Fast Smooth Method For Segment Anything Model	Jiasheng Xu et.al.	2507.15008	null
2025-07-19	Depthwise-Dilated Convolutional Adapters for Medical Object Tracking and Segmentation Using the Segment Anything Model 2	Guoping Xu et.al.	2507.14613	null
2025-07-16	RegCL: Continual Adaptation of Segment Anything Model via Model Merging	Yuan-Chen Shu et.al.	2507.12297	null
2025-07-16	SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation	Jun Yin et.al.	2507.11994	null
2025-07-15	Two intersecting radio shells: relics of galaxy merger shocks?	Bärbel S. Koribalski et.al.	2507.11781	null
2025-07-13	Landmark Detection for Medical Images using a General-purpose Segmentation Model	Ekaterina Stansfield et.al.	2507.11551	null
2025-07-13	Inter2Former: Dynamic Hybrid Attention for Efficient High-Precision Interactive	You Huang et.al.	2507.09612	null
2025-07-22	Memory-Augmented SAM2 for Training-Free Surgical Video Segmentation	Ming Yin et.al.	2507.09577	null
2025-07-13	Prompt Engineering in Segment Anything Model: Methodologies, Applications, and Emerging Challenges	Yidong Jiang et.al.	2507.09562	null
2025-07-11	Compress Any Segment Anything Model (SAM)	Juntong Fan et.al.	2507.08765	null
2025-07-11	SAM2RL: Towards Reinforcement Learning Memory Control in Segment Anything Model 2	Alen Adamyan et.al.	2507.08548	null
2025-07-10	RAPS-3D: Efficient interactive segmentation for 3D radiological imaging	Théo Danielou et.al.	2507.07730	null
2025-07-10	Semantic-guided Masked Mutual Learning for Multi-modal Brain Tumor Segmentation with Arbitrary Missing Modalities	Guoyan Liang et.al.	2507.07592	null
2025-07-09	A Neural Representation Framework with LLM-Driven Spatial Reasoning for Open-Vocabulary 3D Visual Grounding	Zhenyang Liu et.al.	2507.06719	null
2025-07-07	OpenWorldSAM: Extending SAM2 for Universal Image Segmentation with Language Prompts	Shiting Xiao et.al.	2507.05427	null
2025-07-04	SAMed-2: Selective Memory Enhanced Medical Segment Anything Model	Zhiling Yan et.al.	2507.03698	null
2025-07-04	Causal-SAM-LLM: Large Language Models as Causal Reasoners for Robust Medical Segmentation	Tao Tang et.al.	2507.03585	null
2025-07-05	No time to train! Training-Free Reference-Based Instance Segmentation	Miguel Espinosa et.al.	2507.02798	null
2025-07-03	Weakly-supervised Contrastive Learning with Quantity Prompts for Moving Infrared Small Target Detection	Weiwei Duan et.al.	2507.02454	null
2025-07-03	ViRefSAM: Visual Reference-Guided Segment Anything Model for Remote Sensing Segmentation	Hanbo Bi et.al.	2507.02294	null
2025-07-02	Autoadaptive Medical Segment Anything Model	Tyler Ward et.al.	2507.01828	null
2025-07-02	Mamba Guided Boundary Prior Matters: A New Perspective for Generalized Polyp Segmentation	Tapas K. Dutta et.al.	2507.01509	null
2025-06-30	Foundation Models for Zero-Shot Segmentation of Scientific Images without AI-Ready Data	Shubhabrata Mukherjee et.al.	2506.24039	null
2025-07-10	Diffusion Model-based Data Augmentation Method for Fetal Head Ultrasound Segmentation	Fangyijie Wang et.al.	2506.23664	null
2025-07-01	SurgTPGS: Semantic 3D Surgical Scene Understanding with Text Promptable Gaussian Splatting	Yiming Huang et.al.	2506.23309	null
2025-06-29	DC-TTA: Divide-and-Conquer Framework for Test-Time Adaptation of Interactive Segmentation	Jihun Kim et.al.	2506.23104	null
2025-06-28	VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding	Minchao Jiang et.al.	2506.22799	null
2025-06-26	Detection of Breast Cancer Lumpectomy Margin with SAM-incorporated Forward-Forward Contrastive Learning	Tyler Ward et.al.	2506.21006	null
2025-06-25	AI-Driven MRI-based Brain Tumour Segmentation Benchmarking	Connor Ludwig et.al.	2506.20786	null
2025-06-24	SAM2-SGP: Enhancing SAM2 for Medical Image Segmentation via Support-Set Guided Prompting	Yang Xing et.al.	2506.19658	null
2025-06-24	Open-Vocabulary Camouflaged Object Segmentation with Cascaded Vision Language Models	Kai Zhao et.al.	2506.19300	null
2025-06-24	PicoSAM2: Low-Latency Segmentation In-Sensor for Edge Vision Applications	Pietro Bonazzi et.al.	2506.18807	null
2025-06-23	MedSeg-R: Medical Image Segmentation with Clinical Reasoning	Hao Shao et.al.	2506.18669	null
2025-06-23	Segment Anything for Satellite Imagery: A Strong Baseline and a Regional Dataset for Automatic Field Delineation	Carmelo Scribano et.al.	2506.16318	link
2025-06-16	MorphSAM: Learning the Morphological Prompts from Atlases for Spine Image Segmentation	Dingwei Fan et.al.	2506.13094	null
2025-06-13	Prohibited Items Segmentation via Occlusion-aware Bilayer Modeling	Yunhan Ren et.al.	2506.11661	link
2025-06-12	Generalist Models in Medical Image Segmentation: A Survey and Performance Comparison with Task-Specific Approaches	Andrea Moglia et.al.	2506.10825	null
2025-06-12	Semantic Localization Guiding Segment Anything Model For Reference Remote Sensing Image Segmentation	Shuyang Li et.al.	2506.10503	null
2025-06-11	Q-SAM2: Accurate Quantization for Segment Anything Model 2	Nicola Farronato et.al.	2506.09782	null
2025-06-11	SRPL-SFDA: SAM-Guided Reliable Pseudo-Labels for Source-Free Domain Adaptation in Medical Image Segmentation	Xinya Liu et.al.	2506.09403	link
2025-06-10	SAMSelect: A Spectral Index Search for Marine Debris Visualization using Segment Anything	Joost van Dalen et.al.	2506.08613	link
2025-06-10	Discovery of Odd Radio Circles and Other Peculiars in the First Year of the EMU Survey using Object Detection	Nikhel Gupta et.al.	2506.08439	null
2025-06-09	Design and Evaluation of Deep Learning-Based Dual-Spectrum Image Fusion Methods	Beining Xu et.al.	2506.07779	null
2025-06-09	OpenSplat3D: Open-Vocabulary 3D Instance Segmentation using Gaussian Splatting	Jens Piekenbrinck et.al.	2506.07697	null
2025-06-06	Textile Analysis for Recycling Automation using Transfer Learning and Zero-Shot Foundation Models	Yannis Spyridis et.al.	2506.06569	null
2025-06-03	Talk2SAM: Text-Guided Semantic Enhancement for Complex-Shaped Object Segmentation	Luka Vetoshkin et.al.	2506.05396	null
2025-06-05	SAM-aware Test-time Adaptation for Universal Medical Image Segmentation	Jianghao Wu et.al.	2506.05221	null
2025-06-05	Bringing SAM to new heights: Leveraging elevation data for tree crown segmentation from drone imagery	Mélisande Teng et.al.	2506.04970	null
2025-06-03	Extremely large oblate deformation of the first excited state in $^{12}$C: a new challenge to modern nuclear theory	C. Ngwetsheni et.al.	2506.03236	null
2025-06-03	Zero-Shot Tree Detection and Segmentation from Aerial Forest Imagery	Michelle Chen et.al.	2506.03114	link
2025-06-05	GaRA-SAM: Robustifying Segment Anything Model with Gated-Rank Adaptation	Sohyun Lee et.al.	2506.02882	null
2025-06-03	Hierarchical Self-Prompting SAM: A Prompt-Free Medical Image Segmentation Framework	Mengmeng Zhang et.al.	2506.02854	null
2025-06-03	SAMJ: Fast Image Annotation on ImageJ/Fiji via Segment Anything Model	Carlos Garcia-Lopez-de-Haro et.al.	2506.02783	null
2025-06-02	SAM2-LOVE: Segment Anything Model 2 in Language-aided Audio-Visual Scenes	Yuji Wang et.al.	2506.01558	null
2025-06-02	Computing Diverse and Nice Triangulations	Waldo Gálvez et.al.	2506.01323	null
2025-06-02	SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training Cost	Haiyang Mei et.al.	2506.01304	link
2025-06-01	AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting	Yuyuan Liu et.al.	2506.01015	link
2025-05-30	KairosAD: A SAM-Based Model for Industrial Anomaly Detection on Embedded Devices	Uzair Khan et.al.	2505.24334	link
2025-05-28	SAM-R1: Leveraging SAM for Reward Feedback in Multimodal Segmentation via Reinforcement Learning	Jiaqi Huang et.al.	2505.22596	null
2025-05-28	Adapting Segment Anything Model for Power Transmission Corridor Hazard Segmentation	Hang Chen et.al.	2505.22105	link
2025-06-03	InfoSAM: Fine-Tuning the Segment Anything Model from An Information-Theoretic Perspective	Yuanhong Zhang et.al.	2505.21920	null
2025-05-27	Geometric Feature Prompting of Image Segmentation Models	Kenneth Ball et.al.	2505.21644	null
2025-05-29	Zero-Shot Pseudo Labels Generation Using SAM and CLIP for Semi-Supervised Semantic Segmentation	Nagito Saito et.al.	2505.19846	null
2025-05-25	Domain and Task-Focused Example Selection for Data-Efficient Contrastive Medical Image Segmentation	Tyler Ward et.al.	2505.19208	link
2025-05-24	SAMA: Towards Multi-Turn Referential Grounded Video Chat with Large Language Models	Ye Sun et.al.	2505.18812	null
2025-05-23	Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking	Cheng-Yen Yang et.al.	2505.18111	null
2025-05-22	Assessing the generalization performance of SAM for ureteroscopy scene understanding	Martin Villagrana et.al.	2505.17210	null
2025-05-22	TextureSAM: Towards a Texture Aware Foundation Model for Segmentation	Inbal Cohen et.al.	2505.16540	null
2025-05-21	VP Lab: a PEFT-Enabled Visual Prompting Laboratory for Semantic Segmentation	Niccolo Avogaro et.al.	2505.15592	null
2025-05-21	UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark Dataset	Hua Li et.al.	2505.15581	link
2025-05-21	Zero-Shot Gaze-based Volumetric Medical Image Segmentation	Tatyana Shmykova et.al.	2505.15256	null
2025-05-19	IPENS: Interactive Unsupervised Framework for Rapid Plant Phenotyping Extraction via NeRF-SAM2 Fusion	Wentao Song et.al.	2505.13633	null
2025-05-20	Industrial Synthetic Segment Pre-training	Shinichi Mae et.al.	2505.13099	null
2025-05-17	Beluga Whale Detection from Satellite Imagery with Point Labels	Yijie Zheng et.al.	2505.12066	link
2025-05-17	AoP-SAM: Automation of Prompts for Efficient Segmentation	Yi Chen et.al.	2505.11980	null
2025-05-16	SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision	Utsav Rai et.al.	2505.11439	null
2025-05-16	Unifying Segment Anything in Microscopy with Multimodal Large Language Model	Manyu Li et.al.	2505.10769	null
2025-05-14	Promoting SAM for Camouflaged Object Detection via Selective Key Point-based Guidance	Guoying Liang et.al.	2505.09123	null
2025-05-13	Parameter-Efficient Fine-Tuning of Vision Foundation Model for Forest Floor Segmentation from UAV Imagery	Mohammad Wasil et.al.	2505.08932	link
2025-05-13	ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking	Haofeng Liu et.al.	2505.08581	link
2025-05-14	Leveraging Segment Anything Model for Source-Free Domain Adaptation via Dual Feature Guided Auto-Prompting	Zheang Huai et.al.	2505.08527	link
2025-05-12	ABS-Mamba: SAM2-Driven Bidirectional Spiral Mamba Network for Medical Image Translation	Feng Yuan et.al.	2505.07687	null
2025-05-12	MAIS: Memory-Attention for Interactive Segmentation	Mauricio Orbes-Arteaga et.al.	2505.07511	null
2025-05-11	MarkMatch: Same-Hand Stuffing Detection	Fei Zhao et.al.	2505.07032	null
2025-05-10	Causal Prompt Calibration Guided Segment Anything Model for Open-Vocabulary Multi-Entity Segmentation	Jingyao Wang et.al.	2505.06524	link
2025-05-09	The 76Cu conundrum remains unsolved	B. Olaizola et.al.	2505.06400	null
2025-05-09	Adapting a Segmentation Foundation Model for Medical Image Classification	Pengfei Gu et.al.	2505.06217	null
2025-05-09	UncertainSAM: Fast and Efficient Uncertainty Quantification of the Segment Anything Model	Timo Kaiser et.al.	2505.05049	link
2025-05-08	Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization	Xi Yang et.al.	2505.04905	null
2025-05-08	Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model	Navin Ranjan et.al.	2505.04861	null
2025-05-07	Cross-organ all-in-one parallel compressed sensing magnetic resonance imaging	Baoshun Shi et.al.	2505.04658	link
2025-05-09	MAISY: Motion-Aware Image SYnthesis for Medical Image Motion Correction	Andrew Zhang et.al.	2505.04105	null
2025-05-06	CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting	Huawei Sun et.al.	2505.03679	null
2025-05-04	Segment Any RGB-Thermal Model with Language-aided Distillation	Dong Xing et.al.	2505.01950	null
2025-05-03	Accelerating Volumetric Medical Image Annotation via Short-Long Memory SAM 2	Yuwen Chen et.al.	2505.01854	link
2025-04-30	MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection	Qiushi Yang et.al.	2505.00739	null
2025-05-05	AI-Driven Segmentation and Analysis of Microbial Cells	Shuang Zhang et.al.	2505.00578	null
2025-04-30	SAM4EM: Efficient memory-based two stage prompt-free segment anything model adapter for complex 3D neuroscience electron microscopy stacks	Uzair Shah et.al.	2504.21544	link
2025-04-30	UniBiomed: A Universal Foundation Model for Grounded Biomedical Image Interpretation	Linshan Wu et.al.	2504.21336	link
2025-04-29	RadSAM: Segmenting 3D radiological images with a 2D promptable model	Julien Khlaut et.al.	2504.20837	null
2025-04-29	SAM-Guided Robust Representation Learning for One-Shot 3D Medical Image Segmentation	Jia Wang et.al.	2504.20501	null
2025-04-26	Reservoir-enhanced Segment Anything Model for Subsurface Diagnosis	Xiren Zhou et.al.	2504.18802	link
2025-04-25	RSFR: A Coarse-to-Fine Reconstruction Framework for Diffusion Tensor Cardiac MRI with Semantic-Aware Refinement	Jiahao Huang et.al.	2504.18520	null
2025-04-23	Prompt-Tuning SAM: From Generalist to Specialist with only 2048 Parameters and 16 Training Images	Tristan Piater et.al.	2504.16739	null
2025-04-23	RGB-D Video Object Segmentation via Enhanced Multi-store Feature Memory	Boyue Xu et.al.	2504.16471	null
2025-04-19	Segment Any Crack: Deep Semantic Segmentation Adaptation for Crack Detection	Ghodsiyeh Rostami et.al.	2504.14138	null
2025-04-18	HSACNet: Hierarchical Scale-Aware Consistency Regularized Semi-Supervised Change Detection	Qi’ao Xu et.al.	2504.13428	null
2025-04-24	Putting the Segment Anything Model to the Test with 3D Knee MRI - A Comparison with State-of-the-Art Performance	Oliver Mills et.al.	2504.13340	link
2025-04-17	SAM-Based Building Change Detection with Distribution-Aware Fourier Adaptation and Edge-Constrained Warping	Yun-Cheng Li et.al.	2504.12619	null
2025-04-17	Contour Field based Elliptical Shape Prior for the Segment Anything Model	Xinyu Zhao et.al.	2504.12556	null
2025-04-17	DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency	Mengshi Qi et.al.	2504.12080	link
2025-04-14	Zero-shot Autonomous Microscopy for Scalable and Intelligent Characterization of 2D Materials	Jingyun Yang et.al.	2504.10281	null
2025-04-13	Mixture-of-Shape-Experts (MoSE): End-to-End Shape Dictionary Framework to Prompt SAM for Generalizable Medical Segmentation	Jia Wei et.al.	2504.09601	null
2025-04-12	AerOSeg: Harnessing SAM for Open-Vocabulary Segmentation in Remote Sensing Images	Saikat Dutta et.al.	2504.09203	null
2025-04-11	Parameter-Free Fine-tuning via Redundancy Elimination for Vision Foundation Models	Jiahuan Long et.al.	2504.08915	null
2025-04-11	Robust SAM: On the Adversarial Robustness of Vision Foundation Models	Jiahuan Long et.al.	2504.08906	null
2025-04-11	FMLGS: Fast Multilevel Language Embedded Gaussians for Part-level Interactive Agents	Xin Tan et.al.	2504.08581	null
2025-04-11	SynthFM: Training Modality-agnostic Foundation Models for Medical Image Segmentation without Real Medical Data	Sourya Sengupta et.al.	2504.08177	null
2025-04-09	Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting	Daiwei Zhang et.al.	2504.06978	null
2025-04-09	A Comparison of Deep Learning Methods for Cell Detection in Digital Cytology	Marco Acerbis et.al.	2504.06957	link
2025-04-09	MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking	Chang Nie et.al.	2504.06863	null
2025-04-08	HRMedSeg: Unlocking High-resolution Medical Image segmentation via Memory-efficient Attention Modeling	Qing Xu et.al.	2504.06205	link
2025-04-08	KAN-SAM: Kolmogorov-Arnold Network Guided Segment Anything Model for RGB-T Salient Object Detection	Xingyuan Li et.al.	2504.05878	null
2025-04-07	S^4M: Boosting Semi-Supervised Instance Segmentation with SAM	Heeji Yoon et.al.	2504.05301	null
2025-04-07	CMaP-SAM: Contraction Mapping Prior for SAM-driven Few-shot Segmentation	Shuai Chen et.al.	2504.05049	null
2025-04-05	PIORF: Physics-Informed Ollivier-Ricci Flow for Long-Range Interactions in Mesh Graph Neural Networks	Youn-Yeol Yu et.al.	2504.04052	null
2025-04-05	UCS: A Universal Model for Curvilinear Structure Segmentation	Dianshuo Li et.al.	2504.04034	null
2025-04-04	MedSAM2: Segment Anything in 3D Medical Images and Videos	Jun Ma et.al.	2504.03600	link
2025-04-03	APSeg: Auto-Prompt Model with Acquired and Injected Knowledge for Nuclear Instance Segmentation and Classification	Liying Xu et.al.	2504.02222	null
2025-04-02	BiSeg-SAM: Weakly-Supervised Post-Processing Framework for Boosting Binary Segmentation in Segment Anything Models	Encheng Su et.al.	2504.01452	null
2025-04-01	CamoSAM2: Motion-Appearance Induced Auto-Refining Prompts for Video Camouflaged Object Detection	Xin Zhang et.al.	2504.00375	null
2025-04-01	Hybrid Global-Local Representation with Augmented Spatial Guidance for Zero-Shot Referring Image Segmentation	Ting Liu et.al.	2504.00356	link
2025-03-31	SmartScan: An AI-based Interactive Framework for Automated Region Extraction from Satellite Images	Savinay Nagendra et.al.	2504.00200	null
2025-04-03	IMPACT: A Generic Semantic Loss for Multimodal Medical Image Registration	Valentin Boussot et.al.	2503.24121	link
2025-03-31	MGD-SAM2: Multi-view Guided Detail-enhanced Segment Anything Model 2 for High-Resolution Class-agnostic Segmentation	Haoran Shen et.al.	2503.23786	link
2025-03-28	SCHNet: SAM Marries CLIP for Human Parsing	Kunliang Liu et.al.	2503.22237	null
2025-03-28	Synergistic Bleeding Region and Point Detection in Surgical Videos	Jialun Pei et.al.	2503.22174	null
2025-03-27	Semantic Consistent Language Gaussian Splatting for Point-Level Open-vocabulary Querying	Hairong Yin et.al.	2503.21767	null
2025-03-27	AMA-SAM: Adversarial Multi-Domain Alignment of Segment Anything Model for High-Fidelity Histology Nuclei Segmentation	Jiahe Qian et.al.	2503.21695	null
2025-03-31	Context-Aware Weakly Supervised Image Manipulation Localization with SAM Refinement	Xinghao Wang et.al.	2503.20294	null
2025-03-26	Assessing SAM for Tree Crown Instance Segmentation from Drone Imagery	Mélisande Teng et.al.	2503.20199	null
2025-03-25	BiPrompt-SAM: Enhancing Image Segmentation via Explicit Selection between Point and Text Prompts	Suzhe Xu et.al.	2503.19769	null
2025-03-24	Towards Human-Understandable Multi-Dimensional Concept Discovery	Arne Grobrügge et.al.	2503.18629	link
2025-03-26	PG-SAM: Prior-Guided SAM with Medical for Multi-organ Segmentation	Yiheng Zhong et.al.	2503.18227	link
2025-03-23	Cost-effective multi-fidelity strategy for the optimization of high-Reynolds number turbine flows guided by LES	Camille Matar et.al.	2503.17977	null
2025-03-18	Organ-aware Multi-scale Medical Image Segmentation Using Text Prompt Engineering	Wenjie Zhang et.al.	2503.13806	null
2025-03-17	Integrating AI for Human-Centric Breast Cancer Diagnostics: A Multi-Scale and Multi-View Swin Transformer Framework	Farnoush Bayatmakou et.al.	2503.13309	null
2025-03-17	3DAxisPrompt: Promoting the 3D Grounding and Reasoning in GPT-4o	Dingning Liu et.al.	2503.13185	null
2025-03-17	SAM2 for Image and Video Segmentation: A Comprehensive Survey	Zhang Jiaxing et.al.	2503.12781	null
2025-03-16	Segment Any-Quality Images with Generative Latent Space Enhancement	Guangqian Guo et.al.	2503.12507	null
2025-03-16	SAM2-ELNet: Label Enhancement and Automatic Annotation for Remote Sensing Segmentation	Jianhao Yang et.al.	2503.12404	null
2025-03-15	E-SAM: Training-Free Segment Every Entity Model	Weiming Zhang et.al.	2503.12094	null
2025-03-12	NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language Model	Yuzhi Lai et.al.	2503.09335	link
2025-03-10	Visual and Text Prompt Segmentation: A Novel Multi-Model Framework for Remote Sensing	Xing Zi et.al.	2503.07911	null
2025-03-10	Customized SAM 2 for Referring Remote Sensing Image Segmentation	Fu Rong et.al.	2503.07266	null
2025-03-10	Multi-Modal 3D Mesh Reconstruction from Images and Text	Melvin Reka et.al.	2503.07190	null
2025-03-10	OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation	Ding Zhong et.al.	2503.07098	null
2025-03-20	MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation	Chenfei Liao et.al.	2503.06700	null
2025-03-09	SAQ-SAM: Semantically-Aligned Quantization for Segment Anything Model	Jing Zhang et.al.	2503.06515	null
2025-03-08	Segment Anything, Even Occluded	Wei-En Tai et.al.	2503.06261	null
2025-03-08	Dynamically evolving segment anything model with continuous learning for medical image segmentation	Zhaori Liu et.al.	2503.06236	null
2025-03-08	Improving SAM for Camouflaged Object Detection via Dual Stream Adapters	Jiaming Liu et.al.	2503.06042	null
2025-03-08	Towards Universal Text-driven CT Image Segmentation	Yuheng Li et.al.	2503.06030	null
2025-03-07	S4M: Segment Anything with 4 Extreme Points	Adrien Meyer et.al.	2503.05534	null
2025-03-05	Rethinking Few-Shot Medical Image Segmentation by SAM2: A Training-Free Framework with Augmentative Prompting and Dynamic Matching	Haiyue Zu et.al.	2503.04826	null
2025-03-06	Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation	Aishik Konwer et.al.	2503.04639	null
2025-03-07	GBT-SAM: A Parameter-Efficient Depth-Aware Model for Generalizable Brain tumour Segmentation on mp-MRI	Cecilia Diana-Albelda et.al.	2503.04325	link
2025-03-06	WeakMedSAM: Weakly-Supervised Medical Image Segmentation via SAM with Sub-Class Exploration and Prompt Affinity Mining	Haoran Wang et.al.	2503.04106	link
2025-03-05	Tackling Few-Shot Segmentation in Remote Sensing via Inpainting Diffusion Model	Steve Andreas Immanuel et.al.	2503.03785	link
2025-03-05	AHCPTQ: Accurate and Hardware-Compatible Post-Training Quantization for Segment Anything Model	Wenlun Zhang et.al.	2503.03088	null
2025-03-04	Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance	Jiayi Zhao et.al.	2503.02581	link
2025-03-04	Semantic Prior Distillation with Vision Foundation Model for Enhanced Rapid Bone Scintigraphy Image Restoration	Pengchen Liang et.al.	2503.02321	null
2025-03-03	Every SAM Drop Counts: Embracing Semantic Priors for Multi-Modality Image Fusion and Beyond	Guanyao Wu et.al.	2503.01210	null
2025-02-25	An Analysis of Segment Anything 2	Clayton Bromley et.al.	2503.00042	null
2025-02-28	SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models	Yichi Zhang et.al.	2502.20749	link
2025-02-27	Energy-carbon comprehensive efficiency evaluation of hydrogen metallurgy system considering low-temperature waste heat recovery	Qiang Ji et.al.	2502.20131	null
2025-02-25	VesselSAM: Leveraging SAM for Aortic Vessel Segmentation with LoRA and Atrous Attention	Adnan Iltaf et.al.	2502.18185	link
2025-02-23	Lightweight Vision Model-based Multi-user Semantic Communication Systems	Feibo Jiang et.al.	2502.16424	null
2025-02-22	USegMix: Unsupervised Segment Mix for Efficient Data Augmentation in Pathology Images	Jiamu Wang et.al.	2502.16160	null
2025-02-21	UrbanSAM: Learning Invariance-Inspired Adapters for Segment Anything Models in Urban Construction	Chenyu Li et.al.	2502.15199	null
2025-02-16	Surgical Scene Understanding in the Era of Foundation AI Models: A Comprehensive Review	Ufaq Khan et.al.	2502.14886	null
2025-02-21	Vision Foundation Models in Medical Image Analysis: Advances and Challenges	Pengchen Liang et.al.	2502.14584	null
2025-02-19	MaizeEar-SAM: Zero-Shot Maize Ear Phenotyping	Hossein Zaremehrjerdi et.al.	2502.13399	link
2025-02-18	SpeHeatal: A Cluster-Enhanced Segmentation Method for Sperm Morphology Analysis	Yi Shi et.al.	2502.13192	link
2025-02-17	Medical Image Registration Meets Vision Foundation Model: Prototype Learning and Contour Awareness	Hao Xu et.al.	2502.11440	link
2025-02-17	WRT-SAM: Foundation Model-Driven Segmentation for Generalized Weld Radiographic Testing	Yunyi Zhou et.al.	2502.11338	null
2025-02-14	MITO: Enabling Non-Line-of-Sight Perception using Millimeter-waves through Real-World Datasets and Simulation Tools	Laura Dodds et.al.	2502.10259	link
2025-02-12	Towards Fine-grained Interactive Segmentation in Images and Videos	Yuan Yao et.al.	2502.09660	null
2025-02-10	SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement	Yuqi Lin et.al.	2502.06756	link
2025-02-10	FunduSAM: A Specialized Deep Learning Model for Enhanced Optic Disc and Cup Segmentation in Fundus Images	Jinchen Yu et.al.	2502.06220	null
2025-02-05	ZISVFM: Zero-Shot Object Instance Segmentation in Indoor Robotic Environments with Vision Foundation Models	Ying Zhang et.al.	2502.03266	link
2025-02-04	Rethinking Vision Transformer for Object Centric Foundation Models	Manuel Traub et.al.	2502.02763	null
2025-02-04	RFMedSAM 2: Automatic Prompt Refinement for Enhanced Volumetric Medical Image Segmentation with SAM 2	Bin Xie et.al.	2502.02741	null
2025-02-04	IMDPrompter: Adapting SAM to Image Manipulation Detection by Cross-View Automated Prompt Learning	Quan Zhang et.al.	2502.02454	null
2025-02-02	SAM-guided Pseudo Label Enhancement for Multi-modal 3D Semantic Segmentation	Mingyu Yang et.al.	2502.00960	null
2025-02-02	Vision and Language Reference Prompt into SAM for Few-shot Segmentation	Kosuke Sakurai et.al.	2502.00719	link
2025-02-02	Self-Prompt SAM: Medical Image Segmentation via Automatic Prompt SAM Adaptation	Bin Xie et.al.	2502.00630	null
2025-02-01	Parameter Efficient Fine-Tuning of Segment Anything Model	Carolin Teuber et.al.	2502.00418	link
2025-02-01	Segment Anything for Histopathology	Titus Griebel et.al.	2502.00408	link
2025-01-28	Efficient Knowledge Distillation of SAM for Medical Image Segmentation	Kunal Dasharath Patil et.al.	2501.16740	null
2025-01-27	CLISC: Bridging CLIP and SAM by enhanced CAM for unsupervised brain tumor segmentation	Xiaochuan Ma et.al.	2501.16246	null
2025-01-26	Marker Track: Accurate Fiducial Marker Tracking for Evaluation of Residual Motions During Breath-Hold Radiotherapy	Aimee Guo et.al.	2501.15660	null
2025-01-27	Gland Segmentation Using SAM With Cancer Grade as a Prompt	Yijie Zhu et.al.	2501.14718	null
2025-01-23	MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation	Fu Rong et.al.	2501.13667	null
2025-01-23	Auto-Prompting SAM for Weakly Supervised Landslide Extraction	Jian Wang et.al.	2501.13426	null
2025-01-21	fabSAM: A Farmland Boundary Delineation Method Based on the Segment Anything Model	Yufeng Xie et.al.	2501.12487	null
2025-01-17	Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks	Michael Schwingshackl et.al.	2501.10080	link
2025-01-15	Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures	Pengru Deng et.al.	2501.09203	null
2025-01-15	Few-Shot Adaptation of Training-Free Foundation Model for 3D Medical Image Segmentation	Xingxin He et.al.	2501.09138	null
2025-01-15	SuperSAM: Crafting a SAM Supernetwork via Structured Pruning and Unstructured Parameter Prioritization	Waqwoya Abebe et.al.	2501.08504	link
2025-01-13	Guided SAM: Label-Efficient Part Segmentation	S. B. van Rooij et.al.	2501.07434	null
2025-01-13	OCORD: Open-Campus Object Removal Dataset	Shuo Zhang et.al.	2501.07397	null
2025-01-13	EdgeTAM: On-Device Track Anything Model	Chong Zhou et.al.	2501.07256	link
2025-01-12	Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation	Zhenyang Feng et.al.	2501.06749	null
2025-01-12	PGP-SAM: Prototype-Guided Prompt Learning for Efficient Few-Shot Medical Image Segmentation	Zhonghao Yan et.al.	2501.06692	null
2025-01-10	Weakly Supervised Segmentation of Hyper-Reflective Foci with Compact Convolutional Transformers and SAM2	Olivier Morelle et.al.	2501.05933	null
2025-01-10	Zero-shot Shark Tracking and Biometrics from Aerial Imagery	Chinmay K Lalgudi et.al.	2501.05717	null
2025-01-07	MedFocusCLIP: Improving few shot classification in medical datasets using pixel wise attention	Aadya Arora et.al.	2501.03839	null
2025-01-07	AutoFish: Dataset and Benchmark for Fine-grained Analysis of Fish	Stefan Hein Bengtson et.al.	2501.03767	null
2025-01-06	Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron Microscopy	Risha Goel et.al.	2501.03153	link
2025-01-02	ScarNet: A Novel Foundation Model for Automated Myocardial Scar Quantification from LGE in Cardiac MRI	Neda Tavakoli et.al.	2501.01372	link
2025-01-02	Evidential Calibrated Uncertainty-Guided Interactive Segmentation paradigm for Ultrasound Images	Jiang Shang et.al.	2501.01072	null
2024-12-31	Advanced Lung Nodule Segmentation and Classification for Early Detection of Lung Cancer using SAM and Transfer Learning	Asha V et.al.	2501.00586	null
2024-12-31	Is Segment Anything Model 2 All You Need for Surgery Video Segmentation? A Systematic Evaluation	Cheng Yuan et.al.	2501.00525	null
2024-12-27	Char-SAM: Turning Segment Anything Model into Scene Text Segmentation Annotator with Character-level Visual Prompts	Enze Xie et.al.	2412.19917	null
2024-12-26	When SAM2 Meets Video Shadow and Mirror Detection	Leiping Jie et.al.	2412.19293	link
2024-12-28	Optimizing Prompt Strategies for SAM: Advancing lesion Segmentation Across Diverse Medical Imaging Modalities	Yuli Wang et.al.	2412.17943	null
2024-12-16	Machine Learning-Based Automated Assessment of Intracorporeal Suturing in Laparoscopic Fundoplication	Shekhar Madhav Khairnar et.al.	2412.16195	null
2024-12-18	Memorizing SAM: 3D Medical Segment Anything Model with Memorizing Transformer	Xinyuan Shao et.al.	2412.13908	link
2024-12-18	Learnable Prompting SAM-induced Knowledge Distillation for Semi-supervised Medical Image Segmentation	Kaiwen Huang et.al.	2412.13742	link
2024-12-17	Fruit Deformity Classification through Single-Input and Multi-Input Architectures based on CNN Models using Real and Synthetic Images	Tommy D. Beltran et.al.	2412.12966	null
2024-12-17	Synthetic Data Generation for Anomaly Detection on Table Grapes	Ionut Marian Motoi et.al.	2412.12949	link
2024-12-17	SAUGE: Taming SAM for Uncertainty-Aligned Multi-Granularity Edge Detection	Xing Liufu et.al.	2412.12892	link
2024-12-17	PolSAM: Polarimetric Scattering Mechanism Informed Segment Anything Model	Yuqing Wang et.al.	2412.12737	link
2024-12-17	SEG-SAM: Semantic-Guided SAM for Unified Medical Image Segmentation	Shuangping Huang et.al.	2412.12660	null
2024-12-17	SAModified: A Foundation Model-Based Zero-Shot Approach for Refining Noisy Land-Use Land-Cover Maps	Sparsh Pekhale et.al.	2412.12552	null
2024-12-16	Adapting Segment Anything Model (SAM) to Experimental Datasets via Fine-Tuning on GAN-based Simulation: A Case Study in Additive Manufacturing	Anika Tabassum et.al.	2412.11381	link
2024-12-15	Efficient Quantization-Aware Training on Segment Anything Model in Medical Images and Its Deployment	Haisheng Lu et.al.	2412.11186	link
2024-12-15	SAM-IF: Leveraging SAM for Incremental Few-Shot Instance Segmentation	Xudong Zhou et.al.	2412.11034	null
2024-12-13	TSGaussian: Semantic and Depth-Guided Target-Specific Gaussian Splatting from Sparse Views	Liang Zhao et.al.	2412.10051	link
2024-12-11	SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation	Tapas Kumar Dutta et.al.	2412.08482	link
2024-12-11	Lightweight Method for Interactive 3D Medical Image Segmentation with Multi-Round Result Fusion	Bingzhi Shen et.al.	2412.08315	null
2024-12-13	Crack-EdgeSAM Self-Prompting Crack Segmentation System for Edge Devices	Yingchu Wang et.al.	2412.07205	null
2024-12-17	Continual Learning for Segment Anything Model Adaptation	Jinglong Yang et.al.	2412.06418	link
2024-12-18	Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and Annotation Framework	Jiuyi Xu et.al.	2412.06268	null
2024-12-08	MCP-MedSAM: A Powerful Lightweight Medical Segment Anything Model Trained with a Single GPU in Just One Day	Donghang Lyu et.al.	2412.05888	link
2024-12-07	RefSAM3D: Adapting SAM with Cross-modal Reference for 3D Medical Image Segmentation	Xiang Gao et.al.	2412.05605	null
2024-12-06	SAMCL: Empowering SAM to Continually Learn from Dynamic Domains	Zeqing Wang et.al.	2412.05012	null
2024-12-06	HOLa: HoloLens Object Labeling	Michael Schwimmbeck et.al.	2412.04945	link
2024-12-05	Quantifying the Limits of Segment Anything Model: Analyzing Challenges in Segmenting Tree-Like and Low-Contrast Structures	Yixin Zhang et.al.	2412.04243	link
2024-12-05	Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts	Chenyang Zhu et.al.	2412.04220	null
2024-12-04	Automated galaxy sizes in Euclid images using the Segment Anything Model	J. Vega-Ferrero et.al.	2412.03642	link
2024-12-04	Measure Anything: Real-time, Multi-stage Vision-based Dimensional Measurement using Segment Anything	Yongkyu Lee et.al.	2412.03472	link
2024-12-04	MRNet: Multifaceted Resilient Networks for Medical Image-to-Image Translation	Hyojeong Lee et.al.	2412.03039	null
2024-12-02	CellSeg1: Robust Cell Segmentation with One Training Image	Peilin Zhou et.al.	2412.01410	link
2024-12-02	A Bottom-Up Approach to Optimizing the Solar Organic Rankine Cycle for Transactive Energy Trading	Silvia Anna Cordieri et.al.	2412.01359	null
2024-12-02	Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dependent Concepts under Different Scenes	Xiaoqi Zhao et.al.	2412.01240	null
2024-12-02	Referring Video Object Segmentation via Language-aligned Track Selection	Seongchan Kim et.al.	2412.01136	link
2024-11-27	In Search of Truth: In memory of Balraj Singh	José Nicolás Orce et.al.	2412.00097	null
2024-11-28	SADG: Segment Any Dynamic Gaussian Without Object Trackers	Yun-Jin Li et.al.	2411.19290	link
2024-12-02	Det-SAM2: Technical Report on the Self-Prompting Segmentation Framework Based on Segment Anything Model 2	Zhiting Wang et.al.	2411.18977	link
2024-11-28	Efficient Track Anything	Yunyang Xiong et.al.	2411.18933	null
2024-11-28	COMPrompter: reconceptualized segment anything model with multiprompt network for camouflaged object detection	Xiaoqin Zhang et.al.	2411.18858	link
2024-11-27	SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality	Chenyang Lei et.al.	2411.18669	link
2024-11-26	“Nuclear thermometers” reveal the origin of the universal r-process nucleosynthesis	José Nicolás Orce et.al.	2411.17852	null
2024-11-26	SAM-MPA: Applying SAM to Few-shot Medical Image Segmentation using Mask Propagation and Auto-prompting	Jie Xu et.al.	2411.17363	null
2024-11-26	MeerKAT discovery of a MIGHTEE Odd Radio Circle	Ray P. Norris et.al.	2411.17311	null
2024-11-29	Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning	Hui-Yue Yang et.al.	2411.17217	null
2024-11-25	UltraSam: A Foundation Model for Ultrasound using Large Open-Access Segmentation Datasets	Adrien Meyer et.al.	2411.16222	link
2024-11-25	Weakly supervised image segmentation for defect-based grading of fresh produce	Manuel Knott et.al.	2411.16219	link
2024-11-25	Med-PerSAM: One-Shot Visual Prompt Tuning for Personalized Segment Anything Model in Medical Domain	Hangyul Yoon et.al.	2411.16123	link
2024-11-22	There is no SAMantics! Exploring SAM as a Backbone for Visual Understanding Tasks	Miguel Espinosa et.al.	2411.15288	link
2024-11-22	Effective SAM Combination for Open-Vocabulary Semantic Segmentation	Minhyeok Lee et.al.	2411.14723	null
2024-11-21	Data Formats in Analytical DBMSs: Performance Trade-offs and Future Directions	Chunwei Liu et.al.	2411.14331	null
2024-11-21	Segment Anything in Light Fields for Real-Time Applications via Constrained Prompting	Nikolai Goncharov et.al.	2411.13840	link
2024-11-21	Segment Any Class (SAC): Multi-Class Few-Shot Semantic Segmentation via Class Region Proposals	Hussni Mohd Zakir et.al.	2411.13774	null
2024-11-24	ClickTrack: Towards Real-time Interactive Single Object Tracking	Kuiran Wang et.al.	2411.13183	null
2024-11-13	SAM-I2I: Unleash the Power of Segment Anything Model for Medical Image Translation	Jiayu Huo et.al.	2411.12755	null
2024-11-19	SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation	Ron Keuth et.al.	2411.12602	link
2024-11-30	SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory	Cheng-Yen Yang et.al.	2411.11922	link
2024-11-18	Zero-Shot Automatic Annotation and Instance Segmentation using LLM-Generated Datasets: Eliminating Field Imaging and Manual Annotation for Deep Learning Model Development	Ranjan Sapkota et.al.	2411.11285	null
2024-11-15	Large quadrupole deformation in $^{20}$Ne challenges rotor model and modern theory: urging for $\alpha$ clusters in nuclei	C. V. Mehl et.al.	2411.10598	null
2024-11-15	SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning	Zewen Chen et.al.	2411.10161	link
2024-11-15	CoSAM: Self-Correcting SAM for Domain Generalization in 2D Medical Image Segmentation	Yihang Fu et.al.	2411.10136	null
2024-11-15	CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation	Dengke Zhang et.al.	2411.10086	link
2024-11-14	Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation	Yuheng Shi et.al.	2411.09219	link
2024-11-13	Zero-shot capability of SAM-family models for bone segmentation in CT scans	Caroline Magg et.al.	2411.08629	null
2024-11-13	Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model	Jun Xie et.al.	2411.08592	null
2024-11-13	Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model	Yutao Shen et.al.	2411.08453	null
2024-11-12	Triaxial nuclear shapes from simple ratios of electric-quadrupole matrix elements	Elena Atanassova Lawrie et.al.	2411.08130	null
2024-11-12	INTRABENCH: Interactive Radiological Benchmark	Constantin Ulrich et.al.	2411.07885	null
2024-11-14	MSEG-VCUQ: Multimodal SEGmentation with Enhanced Vision Foundation Models, Convolutional Neural Networks, and Uncertainty Quantification for High-Speed Video Phase Detection Data	Chika Maduabuchi et.al.	2411.07463	link
2024-11-11	MapSAM: Adapting Segment Anything Model for Automated Feature Detection in Historical Maps	Xue Xia et.al.	2411.06971	link
2024-11-10	Superpixel Segmentation: A Long-Lasting Ill-Posed Problem	Rémi Giraud et.al.	2411.06478	null
2024-11-08	Assessing Foundational Medical ‘Segment Anything’ (Med-SAM1, Med-SAM2) Deep Learning Models for Left Atrial Segmentation in 3D LGE MRI	Mehri Mehrnia et.al.	2411.05963	null
2024-11-18	Joint-Optimized Unsupervised Adversarial Domain Adaptation in Remote Sensing Segmentation with Prompted Foundation Model	Shuchang Lyu et.al.	2411.05878	link
2024-11-07	UEVAVD: A Dataset for Developing UAV’s Eye View Active Object Detection	Xinhua Jiang et.al.	2411.04348	null
2024-11-06	SA3DIP: Segment Any 3D Instance with Potential 3D Priors	Xi Yang et.al.	2411.03819	link
2024-11-05	Exploiting the Segment Anything Model (SAM) for Lung Segmentation in Chest X-ray Images	Gabriel Bellon de Carvalho et.al.	2411.03064	null
2024-11-08	Region-Guided Attack on the Segment Anything Model (SAM)	Xiaoliang Liu et.al.	2411.02974	null
2024-11-05	Foundation AI Model for Medical Image Segmentation	Rina Bao et.al.	2411.02745	null
2024-11-04	UnSegMedGAT: Unsupervised Medical Image Segmentation using Graph Attention Networks Clustering	A. Mudit Adityaja et.al.	2411.01966	link
2024-11-01	ZIM: Zero-Shot Image Matting for Anything	Beomyoung Kim et.al.	2411.00626	link
2024-11-01	Generative AI-based Pipeline Architecture for Increasing Training Efficiency in Intelligent Weed Control Systems	Sourav Modak et.al.	2411.00548	null
2024-10-29	Performance of the Segment Anything Model in Various RFI/Events Detection in Radio Astronomy	Yanbin Yang et.al.	2410.22497	null
2024-10-30	Benchmarking Human and Automated Prompting in the Segment Anything Model	Jorge Quesada et.al.	2410.22048	link
2024-10-29	SAM-Swin: SAM-Driven Dual-Swin Transformers with Adaptive Lesion Enhancement for Laryngo-Pharyngeal Tumor Detection	Jia Wei et.al.	2410.21813	link
2024-11-03	VideoSAM: A Large Vision Foundation Model for High-Speed Video Segmentation	Chika Maduabuchi et.al.	2410.21304	link
2024-10-29	Transferable Adversarial Attacks on SAM and Its Downstream Models	Song Xia et.al.	2410.20197	link
2024-10-11	A SAM based Tool for Semi-Automatic Food Annotation	Lubnaa Abdur Rahman et.al.	2410.19756	null
2024-10-24	Segmentation-aware Prior Assisted Joint Global Information Aggregated 3D Building Reconstruction	Hongxin Peng et.al.	2410.18433	null
2024-10-23	Gaze-Assisted Medical Image Segmentation	Leila Khaertdinova et.al.	2410.17920	link
2024-10-22	Subshell gaps and onsets of collectivity from proton and neutron pairing gap correlations	José Nicolás Orce et.al.	2410.17436	null
2024-10-22	Multi Kernel Estimation based Object Segmentation	Haim Goldfisher et.al.	2410.17064	link
2024-10-21	PlaneSAM: Multimodal Plane Instance Segmentation Using the Segment Anything Model	Zhongchen Deng et.al.	2410.16545	null
2024-10-21	SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree	Shuangrui Ding et.al.	2410.16268	link
2024-10-17	SAMReg: SAM-enabled Image Registration with ROI-based Correspondence	Shiqi Huang et.al.	2410.14083	link
2024-10-22	EP-SAM: Weakly Supervised Histopathology Segmentation via Enhanced Prompt with Segment Anything	Joonhyeon Song et.al.	2410.13621	link
2024-10-16	Adaptive Prompt Learning with SAM for Few-shot Scanning Probe Microscope Image Segmentation	Yao Shen et.al.	2410.12562	null
2024-10-15	MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation	Xianping Ma et.al.	2410.11160	link
2024-10-13	UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation	Ye Sun et.al.	2410.09909	null
2024-10-13	AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model	Yuchen Li et.al.	2410.09714	null
2024-10-12	Distribution-aware Noisy-label Crack Segmentation	Xiaoyan Jiang et.al.	2410.09409	link
2024-10-11	VideoSAM: Open-World Video Segmentation	Pinxue Guo et.al.	2410.08781	null
2024-10-11	Bridge the Points: Graph-based Few-shot Segment Anything Semantically	Anqi Zhang et.al.	2410.06964	link
2024-10-08	Prompting DirectSAM for Semantic Contour Extraction in Remote Sensing Images	Shiyu Miao et.al.	2410.06194	link
2024-10-08	Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts	Zhiwei Lin et.al.	2410.05963	null
2024-10-18	On Efficient Variants of Segment Anything Model: A Survey	Xiaorui Sun et.al.	2410.04960	null
2024-10-07	Next Best Sense: Guiding Vision and Touch with FisherRF for 3D Gaussian Splatting	Matthew Strong et.al.	2410.04680	link
2024-10-05	DB-SAM: Delving into High Quality Universal Medical Image Segmentation	Chao Qin et.al.	2410.04172	link
2024-10-03	Adapting Segment Anything Model to Melanoma Segmentation in Microscopy Slide Images	Qingyuan Liu et.al.	2410.02207	null
2024-10-02	SinkSAM: A Monocular Depth-Guided SAM Framework for Automatic Sinkhole Segmentation	Osher Rafaeli et.al.	2410.01473	link
2024-10-02	Recovering Manifold Structure Using Ollivier-Ricci Curvature	Tristan Luca Saidi et.al.	2410.01149	link
2024-09-30	Automating MedSAM by Learning Prompts with Weak Few-Shot Supervision	Mélanie Gaillochet et.al.	2409.20293	link
2024-09-30	Medical Image Segmentation with SAM-generated Annotations	Iira Häkkinen et.al.	2409.20253	null
2024-09-29	One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos	Zechen Bai et.al.	2409.19603	link
2024-09-29	RoboNurse-VLA: Robotic Scrub Nurse System based on Vision-Language-Action Model	Shunlei Li et.al.	2409.19590	null
2024-10-10	MedCLIP-SAMv2: Towards Universal Text-Driven Medical Image Segmentation	Taha Koleilat et.al.	2409.19483	link
2024-09-27	When SAM2 Meets Video Camouflaged Object Segmentation: A Comprehensive Evaluation and Adaptation	Yuli Zhou et.al.	2409.18653	link
2024-09-26	AI-Powered Augmented Reality for Satellite Assembly, Integration and Test	Alvaro Patricio et.al.	2409.18101	null
2024-09-26	DarkSAM: Fooling Segment Anything Model to Segment Nothing	Ziqi Zhou et.al.	2409.17874	link
2024-09-26	Global-Local Medical SAM Adaptor Based on Full Adaption	Meng Wang et.al.	2409.17486	null
2024-09-25	Going Beyond U-Net: Assessing Vision Transformers for Semantic Segmentation in Microscopy Image Analysis	Illia Tsiporenko et.al.	2409.16940	null
2024-09-25	Towards Underwater Camouflaged Object Tracking: An Experimental Evaluation of SAM and SAM 2	Chunhui Zhang et.al.	2409.16902	link
2024-09-24	Articulated Object Manipulation using Online Axis Estimation with SAM2-Based Tracking	Xi Wang et.al.	2409.16287	null
2024-09-24	Open-World Object Detection with Instance Representation Learning	Sunoh Lee et.al.	2409.16073	null
2024-09-23	Adapting Segment Anything Model for Unseen Object Instance Segmentation	Rui Cao et.al.	2409.15481	null
2024-09-24	Towards Ground-truth-free Evaluation of Any Segmentation in Medical Images	Ahjol Senbi et.al.	2409.14874	link
2024-09-23	SAMEdge: An Edge-cloud Video Analytics Architecture for the Segment Anything Model	Rui Lu et.al.	2409.14784	null
2024-09-23	An Adverse Weather-Immune Scheme with Unfolded Regularization and Foundation Model Knowledge Distillation for Street Scene Understanding	Wei-Bin Kou et.al.	2409.14737	null
2024-09-23	Video-to-Audio Generation with Fine-grained Temporal Semantics	Yuchen Hu et.al.	2409.14709	null
2024-09-21	Foundation Models for Amodal Video Instance Segmentation in Automated Driving	Jasmin Breitenstein et.al.	2409.14095	link
2024-09-20	Deep learning for fast segmentation and critical dimension metrology & characterization enabling AR/VR design and fabrication	Kundan Chaudhary et.al.	2409.13951	null
2024-09-20	PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images	Nanqing Liu et.al.	2409.13401	link
2024-09-20	MCICSAM: Monte Carlo-guided Interpolation Consistency Segment Anything Model for Semi-Supervised Prostate Zone Segmentation	Guantian Huang et.al.	2409.13371	null
2024-09-19	Prompting Segment Anything Model with Domain-Adaptive Prototype for Generalizable Medical Image Segmentation	Zhikai Wei et.al.	2409.12522	link
2024-09-23	GraspSAM: When Segment Anything Model Meets Grasp Detection	Sangjun Noh et.al.	2409.12521	null
2024-09-19	Frequency-Guided Spatial Adaptation for Camouflaged Object Detection	Shizhou Zhang et.al.	2409.12421	null
2024-09-14	Target Speaker ASR with Whisper	Alexander Polok et.al.	2409.09543	link
2024-09-14	An Augmentation-based Model Re-adaptation Framework for Robust Image Segmentation	Zheming Zuo et.al.	2409.09530	null
2024-09-14	Enhancing Skin Disease Diagnosis: Interpretable Visual Concept Discovery with SAM Empowerment	Xin Hu et.al.	2409.09520	null
2024-09-14	Self-Prompting Polyp Segmentation in Colonoscopy using Hybrid Yolo-SAM 2 Model	Mobina Mansoori et.al.	2409.09484	null
2024-09-14	SAM-OCTA2: Layer Sequence OCTA Segmentation with Fine-tuned Segment Anything Model 2	Xinrun Chen et.al.	2409.09286	link
2024-09-13	Tri-Plane Mamba: Efficiently Adapting Segment Anything Model for 3D Medical Images	Hualiang Wang et.al.	2409.08492	null
2024-09-12	SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality	Chenyang Lei et.al.	2409.08083	link
2024-09-11	Swin-LiteMedSAM: A Lightweight Box-Based Segment Anything Model for Large-Scale Medical Image Datasets	Ruochen Gao et.al.	2409.07172	link
2024-09-10	Sam2Rad: A Segmentation Model for Medical Images with Learnable Prompts	Assefa Seyoum Wahd et.al.	2409.06821	link
2024-09-11	Segmenting sea ice floes in close-range optical imagery with active contour and foundation models	Giulio Passerotti et.al.	2409.06641	null
2024-09-10	Towards Generalizable Scene Change Detection	Jaewoo Kim et.al.	2409.06214	link
2024-09-09	AnomalyCD: A benchmark for Earth anomaly change detection with high-resolution and time-series observations	Jingtao Li et.al.	2409.05679	null
2024-09-09	TAVP: Task-Adaptive Visual Prompt for Cross-domain Few-shot Segmentation	Jiaqi Yang et.al.	2409.05393	null
2024-09-07	SSFam: Scribble Supervised Salient Object Detection Family	Zhengyi Liu et.al.	2409.04817	link
2024-09-07	Unleashing the Power of Generic Segmentation Models: A Simple Baseline for Infrared Small Target Detection	Mingjin Zhang et.al.	2409.04714	link
2024-09-06	FS-MedSAM2: Exploring the Potential of SAM2 for Few-Shot Medical Image Segmentation without Fine-tuning	Yunhao Bai et.al.	2409.04298	link
2024-09-06	Reprojection Errors as Prompts for Efficient Scene Coordinate Regression	Ting-Ru Liu et.al.	2409.04178	null
2024-09-04	Evaluation Study on SAM 2 for Class-agnostic Instance-level Segmentation	Tiantian Zhang et.al.	2409.02567	link
2024-09-03	When 3D Partial Points Meets SAM: Tooth Point Cloud Segmentation with Sparse Labels	Yifan Liu et.al.	2409.01691	null
2024-09-02	MedSAM-U: Uncertainty-Guided Auto Multi-Prompt Adaptation for Reliable MedSAM	Nan Zhou et.al.	2409.00924	null
2024-08-29	SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners	Ziyu Guo et.al.	2408.16768	link
2024-08-27	SAM & SAM 2 in 3D Slicer: SegmentWithSAM Extension for Annotating Medical Images	Zafer Yildiz et.al.	2408.15224	link
2024-09-02	Adapting Segment Anything Model to Multi-modal Salient Object Detection with Semantic Feature Fusion Guidance	Kunpeng Wang et.al.	2408.15063	link
2024-08-27	Intraoperative Glioma Segmentation with YOLO + SAM for Improved Accuracy in Tumor Resection	Samir Kassam et.al.	2408.14847	null
2024-08-26	FusionSAM: Latent Space driven Segment Anything Model for Multimodal Fusion and Segmentation	Daixun Li et.al.	2408.13980	null
2024-08-23	Unleashing the Potential of SAM2 for Biomedical Images and Videos: A Survey	Yichi Zhang et.al.	2408.12889	link
2024-08-23	S3Simulator: A benchmarking Side Scan Sonar Simulator dataset for Underwater Image Analysis	Kamal Basha S et.al.	2408.12833	link
2024-08-23	VALE: A Multimodal Visual and Language Explanation Framework for Image Classifiers using eXplainable AI and Language Models	Purushothaman Natarajan et.al.	2408.12808	link
2024-08-22	Segment Anything Model for Grain Characterization in Hard Drive Design	Kai Nichols et.al.	2408.12732	null
2024-08-22	The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation	Tuyen Tran et.al.	2408.12447	null
2024-08-22	Generalized SAM: Efficient Fine-Tuning of SAM for Variable Input Image Sizes	Sota Kato et.al.	2408.12406	link
2024-08-22	SAM-SP: Self-Prompting Makes SAM Great Again	Chunpeng Zhou et.al.	2408.12364	null
2024-08-21	EmbodiedSAM: Online Segment Any 3D Thing in Real Time	Xiuwei Xu et.al.	2408.11811	null
2024-08-25	NuSegDG: Integration of Heterogeneous Space and Gaussian Kernel for Domain-Generalized Nuclei Segmentation	Zhenye Lou et.al.	2408.11787	link
2024-08-22	SAM-REF: Rethinking Image-Prompt Synergy for Refinement in Segment Anything	Chongkai Yu et.al.	2408.11535	null
2024-08-20	SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection	Huafeng Chen et.al.	2408.10760	null
2024-08-24	Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track	Feiyu Pan et.al.	2408.10125	null
2024-08-19	LCE: A Framework for Explainability of DNNs for Ultrasound Image Based on Concept Discovery	Weiji Kong et.al.	2408.09899	null
2024-08-19	SAM-UNet: Enhancing Zero-Shot Segmentation of SAM for Universal Medical Images	Sihan Yang et.al.	2408.09886	link
2024-08-19	Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving	Jun Yan et.al.	2408.09839	link
2024-08-17	GoodSAM++: Bridging Domain and Capacity Gaps via Segment Anything Model for Panoramic Semantic Segmentation	Weiming Zhang et.al.	2408.09115	null
2024-08-17	Segment Anything with Multiple Modalities	Aoran Xiao et.al.	2408.09085	link
2024-08-16	SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation	Xinyu Xiong et.al.	2408.08870	link
2024-08-16	Retrieval-augmented Few-shot Medical Image Segmentation with Foundation Models	Lin Zhao et.al.	2408.08813	null
2024-08-16	Extracting polygonal footprints in off-nadir images with Segment Anything Model	Kai Li et.al.	2408.08645	link
2024-08-16	Tuning a SAM-Based Model with Multi-Cognitive Visual Adapter to Remote Sensing Instance Segmentation	Linghao Zheng et.al.	2408.08576	null
2024-08-15	Surgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame Pruning	Haofeng Liu et.al.	2408.07931	link
2024-08-14	MeerKAT reveals a ghostly thermal radio ring towards the Galactic Centre	C. Bordiu et.al.	2408.07727	null
2024-08-14	Domain-invariant Representation Learning via Segment Anything Model for Blood Cell Classification	Yongcheng Li et.al.	2408.07467	link
2024-08-15	Prompt-Based Segmentation at Multiple Resolutions and Lighting Conditions using Segment Anything Model 2	Osher Rafaeli et.al.	2408.06970	null
2024-08-13	Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Segment Anything Model	Yongcheng Li et.al.	2408.06716	link
2024-08-13	Specialized Change Detection using Segment Anything	Tahir Ahmad et.al.	2408.06644	null
2024-08-12	S-SAM: SVD-based Fine-Tuning of Segment Anything Model for Medical Image Segmentation	Jay N. Paranjape et.al.	2408.06447	link
2024-08-12	From SAM to SAM 2: Exploring Improvements in Meta’s Segment Anything Model	Athulya Sundaresan Geetha et.al.	2408.06305	null
2024-08-12	Zero-shot 3D Segmentation of Abdominal Organs in CT Scans Using Segment Anything Model 2: Adapting Video Tracking Capabilities for 3D Medical Imaging	Yosuke Yamagishi et.al.	2408.06170	null
2024-08-12	Multi-scale Contrastive Adaptor Learning for Segmenting Anything in Underperformed Scenes	Ke Zhou et.al.	2408.05936	null
2024-08-12	Polyp SAM 2: Advancing Zero shot Polyp Segmentation in Colorectal Cancer Detection	Mobina Mansoori et.al.	2408.05892	link
2024-08-15	SAM-FNet: SAM-Guided Fusion Network for Laryngo-Pharyngeal Tumor Detection	Jia Wei et.al.	2408.05426	link
2024-08-09	One Shot is Enough for Sequential Infrared Small Target Segmentation	Bingbing Dan et.al.	2408.04823	link
2024-08-08	Novel adaptation of video segmentation to 3D MRI: efficient zero-shot knee segmentation with SAM2	Andrew Seohwan Yu et.al.	2408.04762	null
2024-08-08	SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation	Jieming Yu et.al.	2408.04593	null
2024-08-08	Multi-Scale and Detail-Enhanced Segment Anything Model for Salient Object Detection	Shixuan Gao et.al.	2408.04326	link
2024-08-12	Is SAM 2 Better than SAM in Medical Image Segmentation?	Sourya Sengupta et.al.	2408.04212	null
2024-08-07	PaveCap: The First Multimodal Framework for Comprehensive Pavement Condition Assessment with Dense Captioning and PCI Estimation	Blessing Agyei Kyem et.al.	2408.04110	link
2024-08-16	Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation	Yiqing Shen et.al.	2408.04098	null
2024-08-07	SAM2-PATH: A better segment anything model for semantic segmentation in digital pathology	Mingya Zhang et.al.	2408.03651	link
2024-08-06	Segment Anything in Medical Images and Videos: Benchmark and Deployment	Jun Ma et.al.	2408.03322	link
2024-08-06	Biomedical SAM 2: Segment Anything in Biomedical Images and Videos	Zhiling Yan et.al.	2408.03286	link
2024-08-06	Evaluation of Segment Anything Model 2: The Role of SAM2 in the Underwater Environment	Shijie Lian et.al.	2408.02924	link
2024-08-05	Interactive 3D Medical Image Segmentation with SAM 2	Chuyun Shen et.al.	2408.02635	link
2024-08-04	PromptSAM+: Malware Detection based on Prompt Segment Anything Model	Xingyuan Wei et.al.	2408.02066	null
2024-08-04	PanicleNeRF: low-cost, high-precision in-field phenotyping of rice panicles with smartphone	Xin Yang et.al.	2408.02053	null
2024-08-03	TS-SAM: Fine-Tuning Segment-Anything Model for Downstream Tasks	Yang Yu et.al.	2408.01835	link
2024-08-03	Zero-Shot Surgical Tool Segmentation in Monocular Video Using Segment Anything Model 2	Ange Lou et.al.	2408.01648	link
2024-08-01	Medical SAM 2: Segment medical images as video via Segment Anything Model 2	Jiayuan Zhu et.al.	2408.00874	link
2024-08-06	Segment anything model 2: an application to 2D and 3D medical images	Haoyu Dong et.al.	2408.00756	link
2024-08-01	SAM 2: Segment Anything in Images and Videos	Nikhila Ravi et.al.	2408.00714	link
2024-08-01	Point-supervised Brain Tumor Segmentation with Box-prompted MedSAM	Xiaofeng Liu et.al.	2408.00706	null
2024-08-01	DMESA: Densely Matching Everything by Segmenting Anything	Yesheng Zhang et.al.	2408.00279	link
2024-07-31	CC-SAM: SAM with Cross-feature Attention and Context for Ultrasound Image Segmentation	Shreyank N Gowda et.al.	2408.00181	null
2024-07-31	A Federated Learning-Friendly Approach for Parameter-Efficient Fine-Tuning of SAM in 3D Segmentation	Mothilal Asokan et.al.	2407.21739	null
2024-07-31	Evaluating SAM2’s Role in Camouflaged Object Detection: From SAM to SAM2	Lv Tang et.al.	2407.21596	null
2024-07-31	Robust Box Prompt based SAM for Medical Image Segmentation	Yuhao Huang et.al.	2407.21284	null
2024-07-31	Weakly Supervised Intracranial Hemorrhage Segmentation with YOLO and an Uncertainty Rectified Segment Anything Model	Pascal Spiegler et.al.	2407.20461	null
2024-07-28	ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding	Zhen Chen et.al.	2407.19435	link
2024-07-25	SSTD: Stripe-Like Space Target Detection using Single-Point Supervision	Zijian Zhu et.al.	2407.18097	null
2024-07-25	Segmentation by registration-enabled SAM prompt engineering using five reference images	Yaxi Chen et.al.	2407.17933	link
2024-07-25	SAM-MIL: A Spatial Contextual Aware Multiple Instance Learning Approach for Whole Slide Image Classification	Heng Fang et.al.	2407.17689	link
2024-07-23	SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation	Pengfei Chen et.al.	2407.16682	null
2024-07-23	Integrating Meshes and 3D Gaussians for Indoor Scene Reconstruction with SAM Mask Guidance	Jiyeop Kim et.al.	2407.16173	null
2024-07-23	SAM2CLIP2SAM: Vision Language Model for Segmentation of 3D CT Scans for Covid-19 Detection	Dimitrios Kollias et.al.	2407.15728	null
2024-07-21	MedSAGa: Few-shot Memory Efficient Medical Image Segmentation using Gradient Low-Rank Projection in SAM	Navyansh Mahla et.al.	2407.15042	null
2024-07-19	ESP-MedSAM: Efficient Self-Prompting SAM for Universal Domain-Generalized Medical Image Segmentation	Qing Xu et.al.	2407.14153	link
2024-07-19	Seismic Fault SAM: Adapting SAM with Lightweight Modules and 2.5D Strategy for Fault Detection	Ran Chen et.al.	2407.14121	null
2024-07-25	MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis	Ziming Zhong et.al.	2407.13675	link
2024-07-18	Hybrid Deep Learning-Based for Enhanced Occlusion Segmentation in PICU Patient Monitoring	Mario Francisco Munoz et.al.	2407.13341	null
2024-07-17	OMG-Net: A Deep Learning Framework Deploying Segment Anything to Detect Pan-Cancer Mitotic Figures from Haematoxylin and Eosin-Stained Slides	Zhuoyan Shen et.al.	2407.12773	null
2024-07-17	FastSAM-3DSlicer: A 3D-Slicer Extension for 3D Volumetric Segment Anything Model with Uncertainty Quantification	Yiqing Shen et.al.	2407.12658	link
2024-07-17	Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection	Zhenni Yu et.al.	2407.12339	link
2024-07-19	Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes	Zhi Cai et.al.	2407.11464	link
2024-07-17	Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts	Jianhao Li et.al.	2407.11382	null
2024-07-16	Leveraging Segment Anything Model in Identifying Buildings within Refugee Camps (SAM4Refugee) from Satellite Imagery for Humanitarian Operations	Yunya Gao et.al.	2407.11381	link
2024-07-14	WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models	Xinjian Wu et.al.	2407.10131	link
2024-07-12	Region Attention Transformer for Medical Image Restoration	Zhiwen Yang et.al.	2407.09268	link
2024-07-11	Knowledge distillation to effectively attain both region-of-interest and global semantics from an image where multiple objects appear	Seonwhee Jin et.al.	2407.08257	link
2024-07-11	Enrich the content of the image Using Context-Aware Copy Paste	Qiushi Guo et.al.	2407.08151	null
2024-07-10	Interactive Segmentation Model for Placenta Segmentation from 3D Ultrasound images	Hao Li et.al.	2407.08020	link
2024-07-10	IRSAM: Advancing Segment Anything Model for Infrared Small Target Detection	Mingjin Zhang et.al.	2407.07520	link
2024-07-18	ProtoSAM: One-Shot Medical Image Segmentation With Foundational Models	Lev Ayzenberg et.al.	2407.07042	link
2024-07-09	CycleSAM: One-Shot Surgical Scene Segmentation using Cycle-Consistent Feature Matching to Prompt SAM	Aditya Murali et.al.	2407.06795	null
2024-07-08	Unsupervised Fault Detection using SAM with a Moving Window Approach	Ahmed Maged et.al.	2407.06303	null
2024-07-08	MBA-Net: SAM-driven Bidirectional Aggregation Network for Ovarian Tumor Segmentation	Yifan Gao et.al.	2407.05984	null
2024-07-07	Addressing single object tracking in satellite imagery through prompt-engineered solutions	Athena Psalta et.al.	2407.05518	null
2024-07-07	Cross Prompting Consistency with Segment Anything Model for Semi-supervised Medical Image Segmentation	Juzheng Miao et.al.	2407.05416	link
2024-07-06	SAM-Med3D-MoE: Towards a Non-Forgetting Segment Anything Model via Mixture of Experts for 3D Medical Image Segmentation	Guoan Wang et.al.	2407.04938	null
2024-07-06	Revolutionizing Alloy Microstructure Segmentation through SAM and Domain Knowledge without Extra Training	Xudong Ma et.al.	2407.04922	null
2024-07-05	Graph Pooling via Ricci Flow	Amy Feng et.al.	2407.04236	null
2024-07-09	CS3: Cascade SAM for Sperm Segmentation	Yi Shi et.al.	2407.03772	link
2024-07-02	Lung-CADex: Fully automatic Zero-Shot Detection and Classification of Lung Nodules in Thoracic CT Images	Furqan Shaukat et.al.	2407.02625	null
2024-07-02	Virtually Objective Quantification of in vitro Wound Healing Scratch Assays with the Segment Anything Model	Katja Löwenstein et.al.	2407.02187	null
2024-07-02	HRSAM: Efficiently Segment Anything in High-Resolution Images	You Huang et.al.	2407.02109	link
2024-07-03	SAVE: Segment Audio-Visual Easy way using Segment Anything Model	Khanh-Binh Nguyen et.al.	2407.02004	null
2024-07-01	Investigating the Segment Anything Foundation Model for Mapping Smallholder Agriculture Field Boundaries Without Training Labels	Pratyush Tripathy et.al.	2407.01846	null
2024-07-01	Efficient Cutting Tool Wear Segmentation Based on Segment Anything Model	Zongshuo Li et.al.	2407.01211	null
2024-06-30	ASPS: Augmented Segment Anything Model for Polyp Segmentation	Huiqian Li et.al.	2407.00718	link
2024-06-30	HATs: Hierarchical Adaptive Taxonomy Segmentation for Panoramic Pathology Image Analysis	Ruining Deng et.al.	2407.00596	link
2024-06-29	SolarSAM: Building-scale Photovoltaic Potential Assessment Based on Segment Anything Model (SAM) and Remote Sensing for Emerging City	Guohao Wang et.al.	2407.00296	link
2024-06-28	Segment Anything without Supervision	XuDong Wang et.al.	2406.20081	link
2024-07-03	EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model	Yuxuan Zhang et.al.	2406.20076	link
2024-06-28	Parallax-tolerant Image Stitching via Segmentation-guided Multi-homography Warping	Tianli Liao et.al.	2406.19922	link
2024-06-27	Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model	Haobo Yuan et.al.	2406.19369	link
2024-06-30	Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO	Fuseini Mumuni et.al.	2406.19057	null
2024-06-27	Structural Attention: Rethinking Transformer for Unpaired Medical Image Synthesis	Vu Minh Hieu Phan et.al.	2406.18967	link
2024-06-07	Composition Vision-Language Understanding via Segment and Depth Anything Model	Mingxiao Huo et.al.	2406.18591	link
2024-06-25	Point-SAM: Promptable 3D Segmentation Model for Point Clouds	Yuchen Zhou et.al.	2406.17741	link
2024-06-22	TP-DRSeg: Improving Diabetic Retinopathy Lesion Segmentation with Explicit Text-Prompts Assisted SAM	Wenxue Li et.al.	2406.15764	link
2024-06-21	TraceNet: Segment one thing efficiently	Mingyuan Wu et.al.	2406.14874	null
2024-06-21	SAM-EG: Segment Anything Model with Edge Guidance framework for efficient Polyp Segmentation	Quoc-Huy Trinh et.al.	2406.14819	null
2024-06-18	An Empirical Study on the Fairness of Foundation Models for Multi-Organ Image Segmentation	Qin Li et.al.	2406.12646	null
2024-06-16	Boosting Medical Image Classification with Segmentation Foundation Model	Pengfei Gu et.al.	2406.11026	null
2024-06-16	ALPS: An Auto-Labeling and Pre-training Scheme for Remote Sensing Segmentation With Segment Anything Model	Song Zhang et.al.	2406.10855	link
2024-06-13	RobustSAM: Segment Anything Robustly on Degraded Images	Wei-Ting Chen et.al.	2406.09627	link
2024-06-13	APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation	Weizhao He et.al.	2406.08372	null
2024-06-11	Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation	Jinyuan Li et.al.	2406.07268	link
2024-06-10	Extending Segment Anything Model into Auditory and Temporal Dimensions for Audio-Visual Segmentation	Juhyeong Seon et.al.	2406.06163	link
2024-06-10	Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset	Shijie Lian et.al.	2406.06039	link
2024-06-09	SAM-PM: Enhancing Video Camouflaged Object Detection using Spatio-Temporal Attention	Muhammad Nawfal Meeran et.al.	2406.05802	link
2024-06-08	Training-Free Robust Interactive Video Object Segmentation	Xiaoli Wei et.al.	2406.05485	null
2024-06-07	USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation	Xiaoqi Wang et.al.	2406.05271	null
2024-06-06	Matching Anything by Segmenting Anything	Siyuan Li et.al.	2406.04221	link
2024-06-03	Immunocto: a massive immune cell database auto-generated for histopathology	Mikaël Simard et.al.	2406.02618	null
2024-06-04	FastLGS: Speeding up Language Embedded Gaussians with Feature Grid Mapping	Yuzhou Ji et.al.	2406.01916	null
2024-06-03	SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation	Danni Yang et.al.	2406.01451	link
2024-06-03	Improving Segment Anything on the Fly: Auxiliary Online Learning and Adaptive Fusion for Medical Image Segmentation	Tianyu Huang et.al.	2406.00956	null
2024-06-02	SimSAM: Zero-shot Medical Image Segmentation via Simulated Interaction	Benjamin Towle et.al.	2406.00663	link
2024-06-05	SAM-LAD: Segment Anything Model Meets Zero-Shot Logic Anomaly Detection	Yun Peng et.al.	2406.00625	null
2024-06-12	Artificial General Intelligence (AGI) for the oil and gas industry: a review	Jimmy Xuekai Li et.al.	2406.00594	null
2024-06-01	AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning	Duojun Huang et.al.	2406.00480	link
2024-05-29	FocSAM: Delving Deeply into Focused Objects in Segmenting Anything	You Huang et.al.	2405.18706	link
2024-05-28	Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation	Yangxiao Lu et.al.	2405.17859	link
2024-05-27	Part123: Part-aware 3D Reconstruction from a Single-view Image	Anran Liu et.al.	2405.16888	null
2024-05-27	PP-SAM: Perturbed Prompts for Robust Adaptation of Segment Anything Model for Polyp Segmentation	Md Mostafijur Rahman et.al.	2405.16740	link
2024-05-24	Open-Vocabulary SAM3D: Understand Any 3D Scene	Hanchen Tai et.al.	2405.15580	null
2024-05-22	Accelerated Evaluation of Ollivier-Ricci Curvature Lower Bounds: Bridging Theory and Computation	Wonwoo Kang et.al.	2405.13302	null
2024-05-20	Improving the Explain-Any-Concept by Introducing Nonlinearity to the Trainable Surrogate Model	Mounes Zaval et.al.	2405.11837	null
2024-05-20	Universal Organizer of SAM for Unsupervised Semantic Segmentation	Tingting Li et.al.	2405.11742	link
2024-05-17	One registration is worth two segmentations	Shiqi Huang et.al.	2405.10879	link
2024-05-12	Zero Shot Context-Based Object Segmentation using SLIP (SAM+CLIP)	Saaketh Koundinya Gundavarapu et.al.	2405.07284	link
2024-05-10	SAM3D: Zero-Shot Semi-Automatic Segmentation in 3D Medical Images with the Segment Anything Model	Trevor J. Chan et.al.	2405.06786	null
2024-05-10	Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach	Elham Ravanbakhsh et.al.	2405.06586	null
2024-05-10	Automated Cell Structure Extraction for 3D Electron Microscopy by Deep Learning	Jin Kousaka et.al.	2405.06303	null
2024-05-07	ELiTe: Efficient Image-to-LiDAR Knowledge Transfer for Semantic Segmentation	Zhibo Zhang et.al.	2405.04121	null
2024-05-06	PTQ4SAM: Post-Training Quantization for Segment Anything	Chengtao Lv et.al.	2405.03144	link
2024-05-04	UnSAMFlow: Unsupervised Optical Flow Guided by Segment Anything Model	Shuai Yuan et.al.	2405.02608	link
2024-05-02	Active Learning Enabled Low-cost Cell Image Segmentation Using Bounding Box Annotation	Yu Zhu et.al.	2405.01701	null
2024-05-01	Beyond Human Vision: The Role of Large Vision Language Models in Microscope Image Analysis	Prateek Verma et.al.	2405.00876	null
2024-05-01	MoPEFT: A Mixture-of-PEFTs for the Segment Anything Model	Rajat Sahay et.al.	2405.00293	null
2024-05-01	ASAM: Boosting Segment Anything Model with Adversarial Tuning	Bo Li et.al.	2405.00256	link
2024-04-29	Innovative Integration of Visual Foundation Model with a Robotic Arm on a Mobile Platform	Shimian Zhang et.al.	2404.18720	null
2024-04-25	Auto-Generating Weak Labels for Real & Synthetic Data to Improve Label-Scarce Medical Image Segmentation	Tanvi Deshpande et.al.	2404.17033	link
2024-04-25	Dr-SAM: An End-to-End Framework for Vascular Segmentation, Diameter Estimation, and Anomaly Detection on Angiography Images	Vazgen Zohranyan et.al.	2404.17029	link
2024-04-25	OMEGAS: Object Mesh Extraction from Large Scenes Guided by Gaussian Segmentation	Lizhi Wang et.al.	2404.15891	link
2024-05-09	MAS-SAM: Segment Any Marine Animal with Aggregated Features	Tianyu Yan et.al.	2404.15700	link
2024-04-23	Ultrasound SAM Adapter: Adapting SAM for Breast Lesion Segmentation in Ultrasound Images	Zhengzheng Tu et.al.	2404.14837	link
2024-04-22	UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation	Siru Zhong et.al.	2404.14241	null
2024-04-22	Surgical-DeSAM: Decoupling SAM for Instrument Segmentation in Robotic Surgery	Yuyang Sheng et.al.	2404.14040	link
2024-04-22	PM-VIS: High-Performance Box-Supervised Video Instance Segmentation	Zhangjing Yang et.al.	2404.13863	null
2024-04-20	Beyond Pixel-Wise Supervision for Medical Image Segmentation: From Traditional Models to Foundation Models	Yuyan Shi et.al.	2404.13239	null
2024-04-19	ELEV-VISION-SAM: Integrated Vision Language and Foundation Model for Automated Estimation of Building Lowest Floor Elevation	Yu-Hsuan Ho et.al.	2404.12606	null
2024-04-18	Moving Object Segmentation: All You Need Is SAM (and Flow)	Junyu Xie et.al.	2404.12389	link
2024-04-18	SOHES: Self-supervised Open-world Hierarchical Entity Segmentation	Shengcao Cao et.al.	2404.12386	null
2024-04-18	Performance Evaluation of Segment Anything Model with Variational Prompting for Application to Non-Visible Spectrum Imagery	Yona Falinie A. Gaus et.al.	2404.12285	null
2024-04-17	When are Foundation Models Effective? Understanding the Suitability for Pixel-Level Classification Using Multispectral Imagery	Yiqun Xie et.al.	2404.11797	null
2024-04-15	How to build the best medical image segmentation algorithm using foundation models: a comprehensive empirical study with Segment Anything Model	Hanxue Gu et.al.	2404.09957	link
2024-04-15	The Physalis system: Discovery of ORC-like radio shells around a massive pair of interacting early-type galaxies with offset X-ray emission	Bärbel S. Koribalski et.al.	2404.09522	null
2024-04-15	VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection	Bonan Ding et.al.	2404.09431	null
2024-04-12	LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning	Junchi Wang et.al.	2404.08767	link
2024-04-12	Pathological Primitive Segmentation Based on Visual Foundation Model with Zero-Shot Mask Generation	Abu Bakor Hayat Arnob et.al.	2404.08584	link
2024-04-12	Adapting the Segment Anything Model During Usage in Novel Situations	Robin Schön et.al.	2404.08421	null
2024-04-12	Practical Region-level Attack against Segment Anything Models	Yifan Shen et.al.	2404.08255	link
2024-04-11	Streamlined Photoacoustic Image Processing with Foundation Models: A Training-Free Solution	Handi Deng et.al.	2404.07833	null
2024-04-09	SAM-I-Am: Semantic Boosting for Zero-shot Atomic-Scale Electron Micrograph Segmentation	Waqwoya Abebe et.al.	2404.06638	link
2024-04-09	Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero shot Medical Image Segmentation	Sidra Aleem et.al.	2404.06362	link
2024-04-08	Rendering-Enhanced Automatic Image-to-Point Cloud Registration for Roadside Scenes	Yu Sheng et.al.	2404.05164	null
2024-04-07	Fantastic Animals and Where to Find Them: Segment Any Marine Animal with Dual SAM	Pingping Zhang et.al.	2404.04996	link
2024-04-05	Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models	Sangwon Jang et.al.	2404.04243	null
2024-04-02	Red-Teaming Segment Anything Model	Krzysztof Jankowski et.al.	2404.02067	link
2024-04-01	Detect2Interact: Localizing Object Key Field in Visual Question Answering (VQA) with LLMs	Jialou Wang et.al.	2404.01151	null
2024-03-31	Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts	Qin Liu et.al.	2404.00741	link
2024-03-31	Deep Instruction Tuning for Segment Anything Model	Xiaorui Huang et.al.	2404.00650	link
2024-03-29	MedCLIP-SAM: Bridging Text and Image Towards Universal Medical Image Segmentation	Taha Koleilat et.al.	2403.20253	link
2024-03-29	Mixed-precision Supernet Training from Vision Foundation Models using Low Rank Adapter	Yuiko Sakuma et.al.	2403.20080	null
2024-03-30	Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction	Xiaoyang Lyu et.al.	2403.19314	link
2024-03-27	Unleashing the Potential of SAM for Medical Adaptation via Hierarchical Decoding	Zhiheng Cheng et.al.	2403.18271	link
2024-03-26	EgoLifter: Open-world 3D Segmentation for Egocentric Perception	Qiao Gu et.al.	2403.18118	link
2024-03-26	Segment Any Medical Model Extended	Yihao Liu et.al.	2403.18114	link
2024-03-25	GoodSAM: Bridging Domain and Capacity Gaps via Segment Anything Model for Distortion-aware Panoramic Semantic Segmentation	Weiming Zhang et.al.	2403.16370	null
2024-04-02	Distilling Semantic Priors from SAM to Efficient Image Restoration Models	Quan Zhang et.al.	2403.16368	null
2024-03-31	Segment Anything Model for Road Network Graph Extraction	Congrui Hetang et.al.	2403.16051	link
2024-03-22	Anytime, Anywhere, Anyone: Investigating the Feasibility of Segment Anything Model for Crowd-Sourcing Medical Image Annotations	Pranav Kulkarni et.al.	2403.15218	link
2024-03-22	Towards a Comprehensive, Efficient and Promptable Anatomic Structure Segmentation Model using 3D Whole-body CT Scans	Heng Guo et.al.	2403.15063	link
2024-03-21	Empowering Segmentation Ability to Multi-modal Large Language Models	Yuqi Yang et.al.	2403.14141	null
2024-03-21	MaskSAM: Towards Auto-prompt SAM with Mask Classification for Medical Image Segmentation	Bin Xie et.al.	2403.14103	null
2024-03-20	SAMCT: Segment Any CT Allowing Labor-Free Task-Indicator Prompts	Xian Lin et.al.	2403.13258	link
2024-03-19	Segment Anything for comprehensive analysis of grapevine cluster architecture and berry properties	Efrain Torres-Lomas et.al.	2403.12935	null
2024-03-27	LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything Model	Yuxin Cao et.al.	2403.11656	null
2024-03-18	CCC++: Optimized Color Classified Colorization with Segment Anything Model (SAM) Empowered Object Selective Color Harmonization	Mrityunjoy Gain et.al.	2403.11494	null
2024-03-17	Concatenate, Fine-tuning, Re-training: A SAM-enabled Framework for Semi-supervised 3D Medical Image Segmentation	Shumeng Li et.al.	2403.11229	link
2024-03-16	Task-Aware Low-Rank Adaptation of Segment Anything Model	Xuehao Wang et.al.	2403.10971	null
2024-03-19	Uncertainty-Aware Adapter: Adapting Segment Anything Model (SAM) for Ambiguous Medical Image Segmentation	Mingzhou Jiang et.al.	2403.10931	null
2024-03-16	Unsupervised Collaborative Metric Learning with Mixed-Scale Groups for General Object Retrieval	Shichao Kan et.al.	2403.10798	link
2024-03-16	Segment Any Object Model (SAOM): Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation	Mariia Khan et.al.	2403.10780	null
2024-03-15	Few-Shot Image Classification and Segmentation as Visual Question Answering Using Vision-Language Models	Tian Meng et.al.	2403.10287	null
2024-03-15	Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning	Meixuan Li et.al.	2403.10252	null
2024-03-15	Grasp Anything: Combining Teacher-Augmented Policy Gradient Learning with Instance Segmentation to Grasp Arbitrary Objects	Malte Mosbach et.al.	2403.10187	null
2024-03-15	TransLandSeg: A Transfer Learning Approach for Landslide Semantic Segmentation Based on Vision Foundation Model	Changhong Hou et.al.	2403.10127	null
2024-03-15	Group-Mix SAM: Lightweight Solution for Industrial Assembly Line Applications	Wu Liang et.al.	2403.10053	null
2024-03-15	Cardiac Magnetic Resonance 2D+T Short- and Long-axis Segmentation via Spatio-temporal SAM Adaptation	Zhennong Chen et.al.	2403.10009	null
2024-03-14	FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images	Yiqing Shen et.al.	2403.09827	link
2024-03-14	The galaxy group merger origin of the Cloverleaf odd radio circle system	E. Bulbul et.al.	2403.09808	null
2024-03-14	PosSAM: Panoptic Open-vocabulary Segment Anything	Vibashan VS et.al.	2403.09620	link
2024-03-14	DF4LCZ: A SAM-Empowered Data Fusion Framework for Scene-Level Local Climate Zone Classification	Qianqian Wu et.al.	2403.09367	link
2024-03-17	WSI-SAM: Multi-resolution Segment Anything Model (SAM) for histopathology whole-slide images	Hong Liu et.al.	2403.09257	link
2024-03-14	Customizing Segmentation Foundation Model via Prompt Learning for Instance Segmentation	Hyung-Il Kim et.al.	2403.09199	null
2024-03-18	SAM-Lightening: A Lightweight Segment Anything Model with Dilated Flash Attention to Achieve 30 times Acceleration	Yanfei Song et.al.	2403.09195	null
2024-03-12	FluoroSAM: A Language-aligned Foundation Model for X-ray Image Segmentation	Benjamin D. Killeen et.al.	2403.08059	link
2024-03-12	Real-time Surgical Instrument Segmentation in Video Using Point Tracking and Segment Anything	Zijian Wu et.al.	2403.08003	link
2024-03-12	SAMDA: Leveraging SAM on Few-Shot Domain Adaptation for Electronic Microscopy Segmentation	Yiran Wang et.al.	2403.07951	null
2024-03-09	Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation	Hairong Shi et.al.	2403.05912	link
2024-03-09	Large Generative Model Assisted 3D Semantic Communication	Feibo Jiang et.al.	2403.05783	null
2024-03-14	OmniCount: Multi-label Object Counting with Semantic-Geometric Priors	Anindya Mondal et.al.	2403.05435	null
2024-03-08	Part-aware Personalized Segment Anything Model for Patient-Specific Segmentation	Chenhui Zhao et.al.	2403.05433	link
2024-03-08	FedFMS: Exploring Federated Foundation Models for Medical Image Segmentation	Yuxi Liu et.al.	2403.05408	link
2024-03-07	SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising	Tao Zhou et.al.	2403.04194	link
2024-03-07	ProMISe: Promptable Medical Image Segmentation using SAM	Jinfeng Wang et.al.	2403.04164	link
2024-03-06	Popeye: A Unified Visual-Language Model for Multi-Source Ship Detection from Remote Sensing Imagery	Wei Zhang et.al.	2403.03790	null
2024-03-03	A Simple-but-effective Baseline for Training-free Class-Agnostic Counting	Yuhao Lin et.al.	2403.01418	null
2024-02-29	RSAM-Seg: A SAM-based Approach with Prior Knowledge Integration for Remote Sensing Image Semantic Segmentation	Jie Zhang et.al.	2402.19004	null
2024-02-28	From Generalization to Precision: Exploring SAM for Tool Segmentation in Surgical Environments	Kanyifeechukwu J. Oguine et.al.	2402.17972	null
2024-02-27	VRP-SAM: SAM with Visual Reference Prompt	Yanpeng Sun et.al.	2402.17726	link
2024-02-27	Robust Unsupervised Crowd Counting and Localization with Adaptive Resolution SAM	Jia Wan et.al.	2402.17514	null
2024-02-27	Segment anything model for head and neck tumor segmentation with CT, PET and MRI multi-modality images	Jintao Ren et.al.	2402.17454	link
2024-02-27	SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution	Chengcheng Wang et.al.	2402.17133	link
2024-02-26	UN-SAM: Universal Prompt-Free Segmentation for Generalized Nuclei Images	Zhen Chen et.al.	2402.16663	link
2024-03-11	BLO-SAM: Bi-level Optimization Based Overfitting-Preventing Finetuning of SAM	Li Zhang et.al.	2402.16338	link
2024-02-24	Increasing SAM Zero-Shot Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation	Zekun Jiang et.al.	2402.15759	link
2024-02-22	WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition	Lianghui Zhu et.al.	2402.14812	link
2024-02-22	Subobject-level Image Tokenization	Delong Chen et.al.	2402.14327	link
2024-02-20	Object-level Geometric Structure Preserving for Natural Image Stitching	Wenxiao Cai et.al.	2402.12677	link
2024-02-27	ISCUTE: Instance Segmentation of Cables Using Text Embedding	Shir Kozlovsky et.al.	2402.11996	null
2024-02-18	A Multispectral Automated Transfer Technique (MATT) for machine-driven image labeling utilizing the Segment Anything Model (SAM)	James E. Gallagher et.al.	2402.11413	null
2024-02-16	Dynamic Patch-aware Enrichment Transformer for Occluded Person Re-Identification	Xin Zhang et.al.	2402.10435	null
2024-02-15	LaserSAM: Zero-Shot Change Detection Using Visual Segmentation of Spinning LiDAR	Alexander Krawciw et.al.	2402.10321	null
2024-02-15	Lester: rotoscope animation through video object segmentation and tracking	Ruben Tous et.al.	2402.09883	link
2024-02-15	Are Odd Radio Circles phoenixes of powerful radio galaxies?	Stanislav Shabala et.al.	2402.09708	null
2024-02-10	Domain Adaptable Fine-Tune Distillation Framework For Advancing Farm Surveillance	Raza Imam et.al.	2402.07059	link
2024-02-09	Iris-SAM: Iris Segmentation Using a Foundational Model	Parisa Farmanifard et.al.	2402.06497	link
2024-02-25	ClickSAM: Fine-tuning Segment Anything Model using click prompts for ultrasound image segmentation	Aimee Guo et.al.	2402.05902	null
2024-02-07	EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss	Zhuoyang Zhang et.al.	2402.05008	link
2024-02-06	CAT-SAM: Conditional Tuning Network for Few-Shot Adaptation of Segmentation Anything Model	Aoran Xiao et.al.	2402.03631	link
2024-02-03	Polyp-DAM: Polyp segmentation via depth anything model	Zhuoran Zheng et.al.	2402.02298	null
2024-02-15	Segment Any Change	Zhuo Zheng et.al.	2402.01188	link
2024-02-01	Comparative Evaluation of Traditional and Deep Learning-Based Segmentation Methods for Spoil Pile Delineation Using UAV Images	Sureka Thiruchittampalam et.al.	2402.00295	null
2024-01-31	Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation	Maoyuan Ye et.al.	2401.17904	link
2024-01-31	Convolution Meets LoRA: Parameter Efficient Finetuning for Segment Anything Model	Zihan Zhong et.al.	2401.17868	link
2024-01-31	SimAda: A Simple Unified Framework for Adapting Segment Anything Model in Underperformed Scenes	Yiran Song et.al.	2401.17803	link
2024-01-29	MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection	Yuxue Yang et.al.	2401.16305	link
2024-01-27	GEM: Boost Simple Network for Glass Surface Segmentation via Segment Anything Model and Data Synthesis	Jing Hao et.al.	2401.15282	link
2024-01-30	SAM-based instance segmentation models for the automation of structural damage detection	Zehao Ye et.al.	2401.15266	null
2024-01-25	On generalisability of segment anything model for nuclear instance segmentation in histology images	Kesi Xu et.al.	2401.14248	null
2024-01-25	Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks	Tianhe Ren et.al.	2401.14159	link
2024-01-24	Segment Any Cell: A SAM-based Auto-prompting Fine-tuning Framework for Nuclei Segmentation	Saiyang Na et.al.	2401.13220	null
2024-01-23	PA-SAM: Prompt Adapter SAM for High-Quality Image Segmentation	Zhaozhi Xie et.al.	2401.13051	link
2024-01-23	SegmentAnyBone: A Universal Model that Segments Any Bone at Any Location on MRI	Hanxue Gu et.al.	2401.12974	link
2024-01-18	RAP-SAM: Towards Real-Time All-Purpose Segment Anything	Shilin Xu et.al.	2401.10228	link
2024-01-20	Boosting Few-Shot Semantic Segmentation Via Segment Anything Model	Chen-Bin Feng et.al.	2401.09826	null
2024-01-17	Change Detection Between Optical Remote Sensing Imagery and Map Data via Segment Anything Model (SAM)	Hongruixuan Chen et.al.	2401.09019	null
2024-01-16	Segment Anything Model Can Not Segment Anything: Assessing AI Foundation Model’s Generalizability in Permafrost Mapping	Wenwen Li et.al.	2401.08787	null
2024-01-16	AGN jet-inflated bubbles as possible origin of odd radio circles	Yen-Hsing Lin et.al.	2401.08207	null
2024-02-01	UV-SAM: Adapting Segment Anything Model for Urban Village Identification	Xin Zhang et.al.	2401.08083	link
2024-01-16	Achieve Fairness without Demographics for Dermatological Disease Diagnosis	Ching-Hao Chiu et.al.	2401.08066	link
2024-01-15	Foundation Models for Biomedical Image Segmentation: A Survey	Ho Hin Lee et.al.	2401.07654	null
2024-01-15	Compositional Oil Spill Detection Based on Object Detector and Adapted Segment Anything Model from SAR Images	Wenhui Wu et.al.	2401.07502	null
2024-01-12	SD-MVS: Segmentation-Driven Deformation Multi-View Stereo with Spherical Refinement and EM optimization	Zhenlong Yuan et.al.	2401.06385	null
2024-01-12	SamLP: A Customized Segment Anything Model for License Plate Detection	Haoxuan Ding et.al.	2401.06374	link
2024-01-11	MatSAM: Efficient Materials Microstructure Extraction via Visual Large Model	Changtai Li et.al.	2401.05638	link
2024-01-09	Skin Cancer Segmentation and Classification Using Vision Transformer for Automatic Analysis in Dermatoscopy-based Non-invasive Digital System	Galib Muhammad Shahriar Himel et.al.	2401.04746	null
2024-01-09	Segment anything model (SAM) for brain extraction in fMRI studies	Dwith Chenna et.al.	2401.04740	link
2024-01-09	Learning to Prompt Segment Anything Models	Jiaxing Huang et.al.	2401.04651	null
2024-01-07	Segment Anything Model for Medical Image Segmentation: Current Applications and Future Directions	Yichi Zhang et.al.	2401.03495	link
2024-01-05	Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively	Haobo Yuan et.al.	2401.02955	link
2024-01-04	ClassWise-SAM-Adapter: Parameter Efficient Fine-tuning Adapts Segment Anything to SAR Domain for Semantic Segmentation	Xinyang Pu et.al.	2401.02326	link
2024-01-08	BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model	Yiran Song et.al.	2401.02317	link
2024-01-04	Leveraging SAM for Single-Source Domain Generalization in Medical Image Segmentation	Hanhui Wang et.al.	2401.02076	link
2024-01-06	Discovery of a circularly symmetric extended diffuse radio emission around an elliptical galaxy with the VLA FIRST survey	Shobha Kumari et.al.	2401.01278	null
2024-01-02	Unsupervised Continual Anomaly Detection with Contrastively-learned Prompt	Jiaqi Liu et.al.	2401.01010	link
2023-12-30	Promoting Segment Anything Model towards Highly Accurate Dichotomous Image Segmentation	Xianjie Liu et.al.	2401.00248	null
2023-12-28	Generalizable Visual Reinforcement Learning with Segment Anything Model	Ziyu Wang et.al.	2312.17116	link
2023-12-27	Segment Change Model (SCM) for Unsupervised Change detection in VHR Remote Sensing Images: a Case Study of Buildings	Xiaoliang Tan et.al.	2312.16410	link
2023-12-24	Segment Any Events via Weighted Adaptation of Pivotal Tokens	Zhiwen Chen et.al.	2312.16222	link
2023-12-26	Medical Report Generation based on Segment-Enhanced Contrastive Representation Learning	Ruoqing Zhao et.al.	2312.15869	null
2023-12-26	Video Frame Interpolation with Region-Distinguishable Priors from SAM	Yan Han et.al.	2312.15868	null
2023-12-22	Part to Whole: Collaborative Prompting for Surgical Instrument Segmentation	Wenxi Yue et.al.	2312.14481	link
2023-12-22	FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection	Dongmei Zhang et.al.	2312.14465	null
2023-12-21	TinySAM: Pushing the Envelope for Efficient Segment Anything Model	Han Shu et.al.	2312.13789	link
2023-12-20	Testing the Segment Anything Model on radiology data	José Guilherme de Almeida et.al.	2312.12880	null
2023-12-20	Segment Anything Model Meets Image Harmonization	Haoxing Chen et.al.	2312.12729	null
2023-12-19	Weakly Supervised Open-Vocabulary Object Detection	Jianghang Lin et.al.	2312.12437	null
2023-12-19	Towards SAMBA: Segment Anything Model for Brain Tumor Segmentation in Sub-Sharan African Populations	Mohannad Barakat et.al.	2312.11775	null
2023-12-17	SAI3D: Segment Any Instance in 3D Scenes	Yingda Yin et.al.	2312.11557	null
2023-12-18	Appearance-based Refinement for Object-Centric Motion Segmentation	Junyu Xie et.al.	2312.11463	null
2023-12-20	How to Efficiently Annotate Images for Best-Performing Deep Learning Based Segmentation Models: An Empirical Study with Weak and Noisy Annotations and Segment Anything Model	Yixin Zhang et.al.	2312.10600	link
2023-12-16	Mapping Housing Stock Characteristics from Drone Images for Climate Resilience in the Caribbean	Isabelle Tingzon et.al.	2312.10306	null
2023-12-25	Osprey: Pixel Understanding with Visual Instruction Tuning	Yuqian Yuan et.al.	2312.10032	link
2023-12-15	SQA-SAM: Segmentation Quality Assessment for Medical Images Utilizing the Segment Anything Model	Yizhe Zhang et.al.	2312.09899	null
2023-12-15	Collaborating Foundation models for Domain Generalized Semantic Segmentation	Yasser Benigmim et.al.	2312.09788	link
2023-12-15	MobileSAMv2: Faster Segment Anything to Everything	Chaoning Zhang et.al.	2312.09579	link
2023-12-21	Enhancing Data Lakes with GraphAr: Efficient Graph Data Management with a Specialized Storage Scheme	Xue Li et.al.	2312.09577	link
2023-12-14	Influence of Prompting Strategies on Segment Anything Model (SAM) for Short-axis Cardiac MRI segmentation	Josh Stein et.al.	2312.08932	null
2023-12-13	ASLseg: Adapting SAM in the Loop for Semi-supervised Liver Tumor Segmentation	Shiyun Chen et.al.	2312.07969	null
2023-12-18	Relax Image-Specific Prompt Requirement in SAM: A Single Generic Prompt for Segmenting Camouflaged Objects	Jian Hu et.al.	2312.07374	link
2023-12-11	SqueezeSAM: User friendly mobile interactive segmentation	Balakrishnan Varadarajan et.al.	2312.06736	null
2023-12-11	EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM	Chong Zhou et.al.	2312.06660	link
2023-12-11	The Intrinsic Sizes of Odd Radio Circles	David Rupke et.al.	2312.06387	null
2023-12-11	Semantic Connectivity-Driven Pseudo-labeling for Cross-domain Segmentation	Dong Zhao et.al.	2312.06331	link
2023-12-11	SemiSAM: Exploring SAM for Enhancing Semi-Supervised Medical Image Segmentation with Extremely Limited Annotations	Yichi Zhang et.al.	2312.06316	link
2023-12-10	RepViT-SAM: Towards Real-Time Segmenting Anything	Ao Wang et.al.	2312.05760	link
2023-12-12	0.1% Data Makes Segment Anything Slim	Zigeng Chen et.al.	2312.05284	link
2023-12-15	Fine-tuning vision foundation model for crack segmentation in civil infrastructures	Kang Ge et.al.	2312.04233	null
2023-12-07	SAMBA: A Trainable Segmentation Web-App with Smart Labelling	Ronan Docherty et.al.	2312.04197	link
2023-12-07	An unsupervised approach towards promptable defect segmentation in laser-based additive manufacturing by Segment Anything	Israt Zarin Era et.al.	2312.04063	null
2023-12-06	Boosting Segment Anything Model Towards Open-Vocabulary Learning	Xumeng Han et.al.	2312.03628	link
2023-12-10	Foundation Model Assisted Weakly Supervised Semantic Segmentation	Xiaobo Yang et.al.	2312.03585	link
2023-12-05	AI-SAM: Automatic and Interactive Segment Anything Model	Yimu Pan et.al.	2312.03119	link
2023-12-05	SAM-Assisted Remote Sensing Imagery Semantic Segmentation with Object and Boundary Constraints	Xianping Ma et.al.	2312.02464	link
2023-12-05	Towards Granularity-adjusted Pixel-level Semantic Annotation	Rohit Kundu et.al.	2312.02420	null
2023-12-03	SANeRF-HQ: Segment Anything for NeRF in High Quality	Yichen Liu et.al.	2312.01531	null
2023-12-01	Segment and Caption Anything	Xiaoke Huang et.al.	2312.00869	link
2023-12-01	EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything	Yunyang Xiong et.al.	2312.00863	link
2023-12-01	Segment Anything Model-guided Collaborative Learning Network for Scribble-supervised Polyp Segmentation	Yiming Zhao et.al.	2312.00312	null
2023-11-29	SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Scene Segmentation	Mutian Xu et.al.	2311.17707	link
2023-11-28	Parameter Efficient Fine-tuning via Cross Block Orchestration for Segment Anything Model	Zelin Peng et.al.	2311.17112	null
2023-11-28	I-MedSAM: Implicit Medical Image Segmentation with Segment Anything	Xiaobao Wei et.al.	2311.17081	link
2023-12-01	Self-Supervised Learning of Whole and Component-Based Semantic Representations for Person Re-Identification	Siyuan Huang et.al.	2311.17074	null
2023-11-27	Unleashing the Power of Prompt-driven Nucleus Instance Segmentation	Zhongyi Shui et.al.	2311.15939	link
2023-12-05	Stable Segment Anything Model	Qi Fan et.al.	2311.15776	link
2023-11-27	MARIS: Referring Image Segmentation via Mutual-Aware Attention Features	Mengxi Zhang et.al.	2311.15727	null
2023-11-27	SAM-6D: Segment Anything Model Meets Zero-Shot 6D Object Pose Estimation	Jiehong Lin et.al.	2311.15707	link
2023-11-27	Where to Begin? From Random to Foundation Model Instructed Initialization in Federated Learning for Medical Image Segmentation	Ming Li et.al.	2311.15463	null
2023-11-26	Obj-NeRF: Extract Object NeRFs from Multi-view Images	Zhiyi Li et.al.	2311.15291	null
2023-12-04	Can SAM recognize crops? Quantifying the zero-shot performance of a semantic segmentation foundation model on generating crop-type maps using satellite imagery for precision agriculture	Rutuja Gurav et.al.	2311.15138	null
2023-11-22	Self-guided Few-shot Semantic Segmentation for Remote Sensing Imagery Based on Large Vision Models	Xiyu Qi et.al.	2311.13200	null
2023-11-21	Novel OCT mosaicking pipeline with Feature- and Pixel-based registration	Jiacheng Wang et.al.	2311.13052	link
2023-11-21	GMISeg: General Medical Image Segmentation without Re-Training	Jing Xu et.al.	2311.12539	null
2023-11-20	Broadband non-thermal emission of odd radio circles induced by galactic outflow remnants and their evolution	Yutaka Fujita et.al.	2311.12099	null
2023-11-19	Few-Shot Classification & Segmentation Using Large Language Models Agent	Tian Meng et.al.	2311.12065	null
2023-11-20	SA-Med2D-20M Dataset: Segment Anything in 2D Medical Imaging with 20 Million masks	Jin Ye et.al.	2311.11969	link
2023-11-19	GeoSAM: Fine-tuning SAM with Sparse and Dense Visual Prompting for Automated Segmentation of Mobility Infrastructure	Rafi Ibn Sultan et.al.	2311.11319	link
2023-11-18	A Foundation Model for Cell Segmentation	Uriah Israel et.al.	2311.11004	null
2023-11-17	Zero-Shot Digital Rock Image Segmentation with a Fine-Tuned Segment Anything Model	Zhaoyang Ma et.al.	2311.10865	null
2023-11-17	Segment Anything Model with Uncertainty Rectification for Auto-Prompting Medical Image Segmentation	Yichi Zhang et.al.	2311.10529	null
2023-11-16	Slide-SAM: Medical SAM Meets Sliding Window	Quan Quan et.al.	2311.10121	link
2023-11-15	AdapterShadow: Adapting Segment Anything Model for Shadow Detection	Leiping Jie et.al.	2311.08891	link
2023-11-15	Discovery of Diffuse Radio Source in Abell 1060	Kohei Kurahara et.al.	2311.08693	null
2023-11-14	Uni-COAL: A Unified Framework for Cross-Modality Synthesis and Super-Resolution of MR Images	Zhiyun Song et.al.	2311.08225	null
2023-11-14	SAMIHS: Adaptation of Segment Anything Model for Intracranial Hemorrhage Segmentation	Yinuo Wang et.al.	2311.08190	link
2023-11-14	Zero-Shot Segmentation of Eye Features Using the Segment Anything Model (SAM)	Virmarie Maquiling et.al.	2311.08077	link
2023-11-14	GlanceSeg: Real-time microaneurysm lesion segmentation with gaze-map-guided foundation model for early detection of diabetic retinopathy	Hongyang Jiang et.al.	2311.08075	null
2023-11-10	EviPrompt: A Training-Free Evidential Prompt Generation Method for Segment Anything Model in Medical Images	Yinsong Xu et.al.	2311.06400	null
2023-11-09	SAMVG: A Multi-stage Image Vectorization Model with the Segment-Anything Model	Haokun Zhu et.al.	2311.05276	null
2023-11-08	Are foundation models efficient for medical image segmentation?	Danielle Ferreira et.al.	2311.04847	null
2023-11-06	Masking Hyperspectral Imaging Data with Pretrained Models	Elias Arbash et.al.	2311.03053	link
2023-11-06	Leveraging Large-Scale Pretrained Vision Foundation Models for Label-Efficient 3D Point Cloud Segmentation	Shichao Dong et.al.	2311.01989	null
2023-11-02	Sam-Guided Enhanced Fine-Grained Encoding with Mixed Semantic Learning for Medical Image Captioning	Gaoang Wang et.al.	2311.01004	link
2023-10-31	Joint Depth Prediction and Semantic Segmentation with Multi-View SAM	Mykhailo Shvets et.al.	2311.00134	null
2023-10-31	Team I2R-VI-FF Technical Report on EPIC-KITCHENS VISOR Hand Object Segmentation Challenge 2023	Fen Fang et.al.	2310.20120	null
2023-11-13	Promise: Prompt-driven 3D Medical Image Segmentation Using Pretrained Image Foundation Models	Hao Li et.al.	2310.19721	link
2023-10-30	A High-Resolution Dataset for Instance Detection with Multi-View Instance Capture	Qianqian Shen et.al.	2310.19257	link
2023-10-28	Audio-Visual Instance Segmentation	Ruohao Guo et.al.	2310.18709	link
2023-10-26	Task-driven Prompt Evolution for Foundation Models	Rachana Sathish et.al.	2310.17128	null
2023-10-25	Open-NeRF: Towards Open Vocabulary NeRF Decomposition	Hao Zhang et.al.	2310.16383	null
2023-10-23	SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding	Haoxiang Wang et.al.	2310.15308	null
2023-10-23	Ionized Gas Extended Over 40 kpc in an Odd Radio Circle Host Galaxy	Alison L. Coil et.al.	2310.15162	null
2023-10-29	SAM-Med3D	Haoyu Wang et.al.	2310.15161	link
2023-10-19	Weakly-Supervised Semantic Segmentation with Image-Level Labels: from Traditional Models to Foundation Models	Zhaozheng Chen et.al.	2310.13026	link
2023-10-04	Comprehensive Multimodal Segmentation in Medical Imaging: Combining YOLOv8 with SAM and HQ-SAM Models	Sumit Pandey et.al.	2310.12995	null
2023-10-19	Segment Anything Meets Universal Adversarial Perturbation	Dongshen Han et.al.	2310.12431	null
2023-10-17	Towards Training-free Open-world Segmentation via Image Prompting Foundation Models	Lv Tang et.al.	2310.10912	link
2023-10-16	Electric dipole polarizability of low-lying excited states in atomic nuclei	José Nicolás Orce et.al.	2310.10775	null
2023-10-16	Evaluation and improvement of Segment Anything Model for interactive histopathology image segmentation	SeungKyu Kim et.al.	2310.10493	null
2023-11-07	Recursive Segmentation Living Image: An eXplainable AI (XAI) Approach for Computing Structural Beauty of Images or the Livingness of Space	Yao Qianxiang et.al.	2310.10149	null
2023-10-16	Black-box Targeted Adversarial Attack on Segment Anything (SAM)	Sheng Zheng et.al.	2310.10010	null
2023-10-24	Segment Anything Model for Pedestrian Infrastructure Inventory: Assessing Zero-Shot Segmentation on Multi-Mode Geospatial Data	Jiahao Xia et.al.	2310.09918	null
2023-10-17	Prototype-oriented Unsupervised Change Detection for Disaster Management	Youngtack Oh et.al.	2310.09759	null
2023-10-13	Generative AI-driven Semantic Communication Framework for NextG Wireless Network	Avi Deb Raha et.al.	2310.09021	null
2023-10-12	Virtual Augmented Reality for Atari Reinforcement Learning	Christian A. Schiller et.al.	2310.08683	link
2023-10-12	Fine-Grained Annotation for Face Anti-Spoofing	Xu Chen et.al.	2310.08142	null
2023-10-10	Machine Eye for Defects: Machine Learning-Based Solution to Identify and Characterize Topological Defects in Textured Images of Nematic Materials	Haijie Ren et.al.	2310.06406	null
2023-10-09	Empirical Evaluation of the Segment Anything Model (SAM) for Brain Tumor Segmentation	Mohammad Peivandi et.al.	2310.06162	null
2023-10-07	Tree-GPT: Modular Large Language Model Expert System for Forest Remote Sensing Image Understanding and Interactive Analysis	Siqi Du et.al.	2310.04698	null
2023-10-06	TiC: Exploring Vision Transformer in Convolution	Song Zhang et.al.	2310.04134	link
2023-10-03	Multi-Prompt Fine-Tuning of Foundation Models for Enhanced Medical Image Segmentation	Xiangru Li et.al.	2310.02381	null
2023-10-03	Zero-Shot Refinement of Buildings’ Segmentation Models using SAM	Ali Mayladan et.al.	2310.01845	link
2023-10-01	Propagating Semantic Labels in Video Data	David Balaban et.al.	2310.00783	null
2023-09-30	Exploring SAM Ablations for Enhancing Medical Segmentation in Radiology and Pathology	Amin Ranem et.al.	2310.00504	null
2023-09-29	Are Odd Radio Circles virial shocks around massive galaxies? Implications for cosmic-ray diffusion in the circumgalactic medium	Shotaro Yamasaki et.al.	2309.17451	null
2023-10-02	UniQuadric: A SLAM Backend for Unknown Rigid Object 3D Tracking and Light-Weight Modeling	Linghao Yang et.al.	2309.17036	null
2023-09-29	Segment Anything Model is a Good Teacher for Local Feature Learning	Jingqian Wu et.al.	2309.16992	link
2023-10-02	nnSAM: Plug-and-play Segment Anything Model Improves nnUNet Performance	Yunxiang Li et.al.	2309.16967	link
2023-09-28	Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization	Thilo von Neumann et.al.	2309.16482	null
2023-09-27	Learning from SAM: Harnessing a Segmentation Foundation Model for Sim2Real Domain Adaptation through Regularization	Mayara E. Bonani et.al.	2309.15562	null
2023-09-24	A SAM-based Solution for Hierarchical Panoptic Segmentation of Crops and Weeds Competition	Khoa Dang Nguyen et.al.	2309.13578	null
2023-09-24	MediViSTA-SAM: Zero-shot Medical Video Analysis with Spatio-temporal SAM Adaptation	Sekeun Kim et.al.	2309.13539	link
2023-09-22	NOC: High-Quality Neural Object Cloning with 3D Lifting of Segment Anything	Xiaobao Wei et.al.	2309.12790	link
2023-09-21	Deshadow-Anything: When Segment Anything Model Meets Zero-shot shadow removal	Xiao Feng Zhang et.al.	2309.11715	null
2023-09-18	An Accurate and Efficient Neural Network for OCTA Vessel Segmentation and a New Dataset	Haojian Ning et.al.	2309.09483	link
2023-09-16	MA-SAM: Modality-agnostic SAM Adaptation for 3D Medical Image Segmentation	Cheng Chen et.al.	2309.08842	link
2023-09-15	Global trends of the electric dipole polarizability from shell-model calculations	José Nicolás Orce et.al.	2309.08810	null
2023-09-15	Segment Anything Model for Brain Tumor Segmentation	Peng Zhang et.al.	2309.08434	null
2023-09-13	SAMUS: Adapting Segment Anything Model for Clinically-Friendly and Generalizable Ultrasound Image Segmentation	Xian Lin et.al.	2309.06824	link
2023-09-07	SAM3D: Segment Anything Model in Volumetric Medical Images	Nhat-Tan Bui et.al.	2309.03493	link
2023-09-05	Artificial General Intelligence for Radiation Oncology	Chenbin Liu et.al.	2309.02590	null
2023-09-05	SAM-Deblur: Let Segment Anything Boost Image Deblurring	Siwei Li et.al.	2309.02270	link
2023-09-04	Prompt me a Dataset: An investigation of text-image prompting for historical image dataset creation using foundation models	Hassan El-Hajj et.al.	2309.01674	link
2023-09-04	Adapting Segment Anything Model for Change Detection in HR Remote Sensing Images	Lei Ding et.al.	2309.01429	link
2023-09-01	Self-Sampling Meta SAM: Enhancing Few-shot Medical Image Segmentation with Meta-Learning	Yiming Zhang et.al.	2308.16466	link
2023-08-30	SAM-Med2D	Junlong Cheng et.al.	2308.16184	link
2023-08-28	Auto-Prompting SAM for Mobile Friendly 3D Medical Image Segmentation	Chengyin Li et.al.	2308.14936	link
2023-08-31	SAM-PARSER: Fine-tuning SAM Efficiently by Parameter Space Reconstruction	Zelin Peng et.al.	2308.14604	null
2023-08-27	Cheap Lunch for Medical Image Segmentation by Fine-tuning SAM on Few Exemplars	Weijia Feng et.al.	2308.14133	null
2023-08-27	Enhancing Bloodstain Analysis Through AI-Based Segmentation: Leveraging Segment Anything Model for Crime Scene Investigation	Zihan Dong et.al.	2308.13979	link
2023-08-26	Zero-Shot Edge Detection with SCESAME: Spectral Clustering-based Ensemble for Segment Anything Model Estimation	Hiroaki Yamagiwa et.al.	2308.13779	link
2023-08-26	SamDSK: Combining Segment Anything Model with Domain-Specific Knowledge for Semi-Supervised Learning in Medical Image Segmentation	Yizhe Zhang et.al.	2308.13759	link
2023-08-23	SPPNet: A Single-Point Prompt Network for Nuclei Image Segmentation	Qing Xu et.al.	2308.12231	link
2023-08-22	SAMSNeRF: Segment Anything Model (SAM) Guides Dynamic Surgical Scene Reconstruction by Neural Radiance Field (NeRF)	Ange Lou et.al.	2308.11774	null
2023-08-20	False Negative/Positive Control for SAM on Noisy Medical Images	Xing Yao et.al.	2308.10382	link
2023-08-31	SAMedOCT: Adapting Segment Anything Model (SAM) for Retinal OCT	Botond Fazekas et.al.	2308.09331	null
2023-08-17	SurgicalSAM: Efficient Class Promptable Surgical Instrument Segmentation	Wenxi Yue et.al.	2308.08746	link
2023-08-15	Self-Prompting Large Vision Models for Few-Shot Medical Image Segmentation	Qi Wu et.al.	2308.07624	link
2023-08-14	SAM Meets Robotic Surgery: An Empirical Study on Generalization, Robustness and Adaptation	An Wang et.al.	2308.07156	null
2023-08-14	A One Stop 3D Target Reconstruction and multilevel Segmentation Method	Jiexiong Xu et.al.	2308.06974	link
2023-08-14	CEmb-SAM: Segment Anything Model with Condition Embedding for Joint Learning from Heterogeneous Datasets	Dongik Shin et.al.	2308.06957	null
2023-08-28	CLE Diffusion: Controllable Light Enhancement Diffusion Model	Yuyang Yin et.al.	2308.06725	null
2023-08-12	Polyp-SAM++: Can A Text Guided SAM Perform Better for Polyp Segmentation?	Risab Biswas et.al.	2308.06623	link
2023-08-12	TongueSAM: An Universal Tongue Segmentation Model Based on SAM with Zero-Shot	Shan Cao et.al.	2308.06444	link
2023-08-11	FoodSAM: Any Food Segmentation	Xing Lan et.al.	2308.05938	link
2023-08-10	Leverage Weakly Annotation to Pixel-wise Annotation via Zero-shot Segment Anything Model for Molecular-empowered Learning	Xueyuan Li et.al.	2308.05785	null
2023-08-10	Adaptive Low Rank Adaptation of Segment Anything to Salient Object Detection	Ruikai Cui et.al.	2308.05426	link
2023-08-08	AquaSAM: Underwater Image Foreground Segmentation	Muduo Xu et.al.	2308.04218	link
2023-08-05	Surrogate Empowered Sim2Real Transfer of Deep Reinforcement Learning for ORC Superheat Control	Runze Lin et.al.	2308.02765	null
2023-08-02	Push the Boundary of SAM: A Pseudo-label Correction Framework for Medical Segmentation	Ziyi Huang et.al.	2308.00883	null
2023-08-16	SAMFlow: Eliminating Any Fragmentation in Optical Flow with Segment Anything Model	Shili Zhou et.al.	2307.16586	null
2023-07-26	Tracking Anything in High Quality	Jiawen Zhu et.al.	2307.13974	link
2023-07-21	MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems	Thilo von Neumann et.al.	2307.11394	link
2023-07-12	SAM-Path: A Segment Anything Model for Semantic Segmentation in Digital Pathology	Jingwei Zhang et.al.	2307.09570	null
2023-07-15	Open Scene Understanding: Grounded Situation Recognition Meets Segment Anything for Helping People with Visual Impairments	Ruiping Liu et.al.	2307.07757	link
2023-07-11	$\mathrm{SAM^{Med}}$: A medical image annotation framework based on large vision model	Chenglong Wang et.al.	2307.05617	null
2023-07-07	Large AI Model-Based Semantic Communications	Feibo Jiang et.al.	2307.03492	null
2023-07-10	ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: TREK-150 Single Object Tracking	Yuanyou Xu et.al.	2307.02508	null
2023-07-05	AxonCallosumEM Dataset: Axon Semantic Segmentation of Whole Corpus Callosum cross section from EM Images	Ao Cheng et.al.	2307.02464	null
2023-07-03	Segment Anything Meets Point Tracking	Frano Rajič et.al.	2307.01197	link
2023-07-03	SAMAug: Point Prompt Augmentation for Segment Anything Model	Haixing Dai et.al.	2307.01187	link
2023-07-03	SAM-DA: UAV Tracks Anything at Night with SAM-Powered Domain Adaptation	Liangliang Yao et.al.	2307.01024	link
2023-07-03	RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation	Yonglin Li et.al.	2307.00997	link
2023-07-01	All-in-SAM: from Weak Annotation to Pixel-wise Nuclei Segmentation with Prompt-based Finetuning	Can Cui et.al.	2307.00290	null
2023-06-30	Training-free Object Counting with Prompts	Zenglin Shi et.al.	2307.00038	link
2023-06-30	Topological Data Analysis Guided Segment Anything Model Prompt Optimization for Zero-Shot Segmentation in Biological Imaging	Ruben Glatt et.al.	2306.17400	null
2023-06-29	Detect Any Deepfakes: Segment Anything Meets Face Forgery Detection and Localization	Yingxin Lai et.al.	2306.17075	link
2023-06-29	The Segment Anything Model (SAM) for Remote Sensing Applications: From Zero to One Shot	Lucas Prado Osco et.al.	2306.16623	link
2023-06-28	RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation based on Visual Foundation Model	Keyan Chen et.al.	2306.16269	link
2023-06-28	Effective Transfer of Pretrained Large Visual Model for Fabric Defect Segmentation via Specific Knowledge Injection	Zhewei Chen et.al.	2306.16186	null
2023-06-24	Utilizing Segment Anything Model For Assessing Localization of GRAD-CAM in Medical Imaging	Evan Kellener et.al.	2306.15692	null
2023-06-27	CellViT: Vision Transformers for Precise Cell Segmentation and Classification	Fabian Hörst et.al.	2306.15350	link
2023-06-30	MedLSAM: Localize and Segment Anything Model for 3D Medical Images	Wenhui Lei et.al.	2306.14752	link
2023-07-01	Faster Segment Anything: Towards Lightweight SAM for Mobile Applications	Chaoning Zhang et.al.	2306.14289	link
2023-06-25	When SAM Meets Sonar Images	Lin Wang et.al.	2306.14109	link
2023-06-23	Curvature-enhanced Graph Convolutional Network for Biomolecular Interaction Prediction	Cong Shen et.al.	2306.13699	link
2023-06-23	3DSAM-adapter: Holistic Adaptation of SAM from 2D to 3D for Promptable Medical Image Segmentation	Shizhan Gong et.al.	2306.13465	link
2023-06-23	Robustness of Segment Anything Model (SAM) for Autonomous Driving in Adverse Weather Conditions	Xinru Shan et.al.	2306.13290	null
2023-06-22	Ladder Fine-tuning approach for SAM integrating complementary network	Shurong Chai et.al.	2306.12737	link
2023-06-21	Comparative Analysis of Segment Anything Model and U-Net for Breast Tumor Detection in Ultrasound and Mammography Images	Mohsen Ahmadi et.al.	2306.12510	null
2023-06-21	Fast Segment Anything	Xu Zhao et.al.	2306.12156	link
2023-06-20	Segment Anything Model (SAM) for Radiation Oncology	Lian Zhang et.al.	2306.11730	null
2023-06-22	Enlighten Anything: When Segment Anything Model Meets Low-Light Image Enhancement	Qihan Zhao et.al.	2306.10286	link
2023-06-15	Temporally-Extended Prompts Optimization for SAM in Interactive Medical Image Segmentation	Chuyun Shen et.al.	2306.08958	null
2023-06-14	TomoSAM: a 3D Slicer extension using SAM for tomography segmentation	Federico Semeraro et.al.	2306.08609	link
2023-06-13	Robustness of SAM: Segment Anything Under Corruptions and Beyond	Yu Qiao et.al.	2306.07713	null