[![Contributors][contributors-shield]][contributors-url] [![Forks][forks-shield]][forks-url] [![Stargazers][stars-shield]][stars-url] [![Issues][issues-shield]][issues-url]
Usage instructions: here
Table of Contents
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-09-12 | The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine | André F. R. Guarda et.al. | 2409.08130 | null |
2024-09-08 | GET-UP: GEomeTric-aware Depth Estimation with Radar Points UPsampling | Huawei Sun et.al. | 2409.02720 | link |
2024-09-03 | GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting | Zixuan Guo et.al. | 2409.01581 | null |
2024-08-20 | End-to-end learned Lossy Dynamic Point Cloud Attribute Compression | Dat Thanh Nguyen et.al. | 2408.10665 | null |
2024-08-20 | Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds | Kai Liu et.al. | 2408.10543 | null |
2024-08-16 | LLM-PCGC: Large Language Model-based Point Cloud Geometry Compression | Yuqi Ye et.al. | 2408.08682 | null |
2024-08-06 | Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement | Hao Xu et.al. | 2408.02966 | null |
2024-08-01 | Learned Compression of Point Cloud Geometry and Attributes in a Single Model through Multimodal Rate-Control | Michael Rudolph et.al. | 2408.00599 | null |
2024-07-22 | Double Deep Learning-based Event Data Coding and Classification | Abdelrahman Seleem et.al. | 2407.15531 | null |
2024-07-11 | Enhancing octree-based context models for point cloud geometry compression with attention-based child node number prediction | Chang Sun et.al. | 2407.08528 | null |
2024-07-11 | Enhancing context models for point cloud geometry compression with context feature residuals and multi-loss | Chang Sun et.al. | 2407.08520 | null |
2024-07-19 | PCAC-GAN: A Sparse-Tensor-Based Generative Adversarial Network for 3D Point Cloud Attribute Compression | Xiaolong Mao et.al. | 2407.05677 | null |
2024-07-05 | Rethinking Data Input for Point Cloud Upsampling | Tongxu Zhang et.al. | 2407.04476 | null |
2024-08-26 | TSC-PCAC: Voxel Transformer and Sparse Convolution Based Point Cloud Attribute Compression for 3D Broadcasting | Zixi Guo et.al. | 2407.04284 | link |
2024-06-15 | Full reference point cloud quality assessment using support vector regression | Ryosuke Watanabe et.al. | 2406.10520 | link |
2024-06-09 | Bits-to-Photon: End-to-End Learned Scalable Point Cloud Compression for Direct Rendering | Yueyu Hu et.al. | 2406.05915 | null |
2024-06-02 | Towards Point Cloud Compression for Machine Perception: A Simple and Strong Baseline by Learning the Octree Depth Level Predictor | Lei Liu et.al. | 2406.00791 | null |
2024-05-23 | NeuroGauss4D-PCI: 4D Neural Fields and Gaussian Deformation Fields for Point Cloud Interpolation | Chaokang Jiang et.al. | 2405.14241 | link |
2024-05-19 | Point Cloud Compression with Implicit Neural Representations: A Unified Framework | Hongning Ruan et.al. | 2405.11493 | null |
2024-05-02 | PointCompress3D -- A Point Cloud Compression Framework for Roadside LiDARs in Intelligent Transportation Systems | Walter Zimmer et.al. | 2405.01750 | null |
2024-04-21 | Pointsoup: High-Performance and Extremely Low-Decoding-Latency Learned Geometry Codec for Large-Scale Point Cloud Scenes | Kang You et.al. | 2404.13550 | link |
2024-04-16 | Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery | Zohre Karimi et.al. | 2404.07185 | null |
2024-04-10 | Efficient and Generic Point Model for Lossless Point Cloud Attribute Compression | Kang You et.al. | 2404.06936 | link |
2024-04-09 | Diffusion-Based Point Cloud Super-Resolution for mmWave Radar Data | Kai Luan et.al. | 2404.06012 | null |
2024-03-13 | Point Cloud Compression via Constrained Optimal Transport | Zezeng Li et.al. | 2403.08236 | link |
2024-03-08 | Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning | Hang Du et.al. | 2403.05117 | link |
2024-03-01 | Assessing objective quality metrics for JPEG and MPEG point cloud coding | Davi Lazzarotto et.al. | 2403.00410 | null |
2024-02-23 | Scalable Human-Machine Point Cloud Compression | Mateen Ulhaq et.al. | 2402.12532 | link |
2024-02-18 | 3D Point Cloud Compression with Recurrent Neural Network and Image Compression Methods | Till Beemelmanns et.al. | 2402.11680 | link |
2024-02-17 | Hierarchical Prior-based Super Resolution for Point Cloud Geometry Compression | Dingquan Li et.al. | 2402.11250 | link |
2024-02-11 | PIVOT-Net: Heterogeneous Point-Voxel-Tree-based Framework for Point Cloud Compression | Jiahao Pang et.al. | 2402.07243 | null |
2024-02-07 | Performance analysis of Deep Learning-based Lossy Point Cloud Geometry Compression Coding Solutions | Joao Prazeres et.al. | 2402.05192 | null |
2024-02-08 | Subjective performance evaluation of bitrate allocation strategies for MPEG and JPEG Pleno point cloud compression | Davi Lazzarotto et.al. | 2402.04760 | null |
2024-02-15 | LiDAR-Forest Dataset: LiDAR Point Cloud Simulation Dataset for Forestry Application | Yawen Lu et.al. | 2402.04546 | null |
2023-12-23 | Learning Continuous Implicit Field with Local Distance Indicator for Arbitrary-Scale Point Cloud Upsampling | Shujuan Li et.al. | 2312.15133 | null |
2024-03-13 | DiffPMAE: Diffusion Masked Autoencoders for Point Cloud Reconstruction | Yanlong Li et.al. | 2312.03298 | link |
2023-12-03 | A Conditional Denoising Diffusion Probabilistic Model for Point Cloud Upsampling | Wentao Qu et.al. | 2312.02719 | link |
2023-11-22 | Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression | Tam Thuc Do et.al. | 2311.13539 | null |
2023-11-22 | Volumetric 3D Point Cloud Attribute Compression: Learned polynomial bilateral filter for prediction | Tam Thuc Do et.al. | 2311.13533 | null |
2023-11-22 | Test-Time Augmentation for 3D Point Cloud Classification and Segmentation | Tuan-Anh Vu et.al. | 2311.13152 | null |
2023-11-03 | PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation | Yuhan Ding et.al. | 2311.01773 | null |
2023-11-02 | Lightweight super resolution network for point cloud geometry compression | Wei Zhang et.al. | 2311.00970 | link |
2023-11-17 | Deep Learning-based Compressed Domain Multimedia for Man and Machine: A Taxonomy and Application to Point Cloud Classification | Abdelrahman Seleem et.al. | 2310.18849 | null |
2023-10-13 | iPUNet:Iterative Cross Field Guided Point Cloud Upsampling | Guangshun Wei et.al. | 2310.09092 | link |
2024-03-15 | PU-Ray: Domain-Independent Point Cloud Upsampling via Ray Marching on Neural Implicit Surface | Sangwon Lim et.al. | 2310.08755 | link |
2024-02-16 | Quasi-Monte Carlo for 3D Sliced Wasserstein | Khai Nguyen et.al. | 2309.11713 | link |
2023-09-08 | Poster: Making Edge-assisted LiDAR Perceptions Robust to Lossy Point Cloud Compression | Jin Heo et.al. | 2309.04549 | null |
2023-09-01 | Test-Time Adaptation for Point Cloud Upsampling Using Meta-Learning | Ahmed Hatem et.al. | 2308.16484 | null |
2024-02-08 | SCP: Spherical-Coordinate-based Learned Point Cloud Compression | Ao Luo et.al. | 2308.12535 | null |
2023-08-22 | Learning a More Continuous Zero Level Set in Unsigned Distance Fields through Level Set Projection | Junsheng Zhou et.al. | 2308.11441 | link |
2023-08-11 | Learned Point Cloud Compression for Classification | Mateen Ulhaq et.al. | 2308.05959 | link |
2023-07-27 | FLiCR: A Fast and Lightweight LiDAR Point Cloud Compression Based on Lossy RI | Jin Heo et.al. | 2307.15005 | null |
2023-07-20 | Aggressive saliency-aware point cloud compression | Eleftheria Psatha et.al. | 2307.10741 | null |
2023-07-18 | Arbitrary point cloud upsampling via Dual Back-Projection Network | Zhi-Song Liu et.al. | 2307.08992 | null |
2023-06-01 | 4DSR-GCN: 4D Video Point Cloud Upsampling using Graph Convolutional Networks | Lorenzo Berlincioni et.al. | 2306.01081 | null |
2023-05-16 | Learning Dynamic Point Cloud Compression via Hierarchical Inter-frame Block Matching | Shuting Xia et.al. | 2305.05356 | null |
2023-05-02 | Geometric Prior Based Deep Human Point Cloud Geometry Compression | Xinju Wu et.al. | 2305.01309 | null |
2023-05-02 | PU-EdgeFormer: Edge Transformer for Dense Prediction in Point Cloud Upsampling | Dohoon Kim et.al. | 2305.01148 | link |
2023-04-24 | Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent with Learned Distance Functions | Yun He et.al. | 2304.11846 | link |
2023-04-01 | Volumetric Attribute Compression for 3D Point Clouds using Feedforward Network with Geometric Attention | Tam Thuc Do et.al. | 2304.00335 | null |
2023-03-27 | NeuralPCI: Spatio-temporal Neural Field for 3D Point Cloud Multi-frame Non-linear Interpolation | Zehan Zheng et.al. | 2303.15126 | link |
2023-11-07 | GQE-Net: A Graph-based Quality Enhancement Network for Point Cloud Color Attribute | Jinrui Xing et.al. | 2303.13764 | link |
2023-03-22 | Lossless Point Cloud Attribute Compression Using Cross-scale, Cross-group, and Cross-color Prediction | Jianqiang Wang et.al. | 2303.12917 | null |
2023-12-28 | Progressive Frame Patching for FoV-based Point Cloud Video Streaming | Tongyu Zong et.al. | 2303.08336 | null |
2023-12-03 | Parametric Surface Constrained Upsampler Network for Point Cloud | Pingping Cai et.al. | 2303.08240 | link |
2024-03-20 | Lossless Point Cloud Geometry and Attribute Compression Using a Learned Conditional Probability Model | Dat Thanh Nguyen et.al. | 2303.06519 | link |
2023-03-11 | Deep probabilistic model for lossless scalable point cloud attribute compression | Dat Thanh Nguyen et.al. | 2303.06517 | null |
2023-03-09 | BIRD-PCC: Bi-directional Range Image-based Deep LiDAR Point Cloud Compression | Chia-Sheng Liu et.al. | 2303.04027 | null |
2023-02-13 | gpcgc: a green point cloud geometry coding method | Qingyang Zhou et.al. | 2302.06062 | null |
2023-02-09 | BASICS: Broad quality Assessment of Static point clouds In Compression Scenarios | Ali Ak et.al. | 2302.04796 | null |
2023-04-27 | Linear Optimal Partial Transport Embedding | Yikun Bai et.al. | 2302.03232 | link |
2023-01-31 | Lidar Upsampling with Sliced Wasserstein Distance | Artem Savkin et.al. | 2301.13558 | null |
2023-01-28 | Dynamic Point Cloud Geometry Compression Using Multiscale Inter Conditional Coding | Jianqiang Wang et.al. | 2301.12165 | null |
2023-01-27 | Joint Geometry and Attribute Upsampling of Point Clouds Using Frequency-Selective Models with Overlapped Support | Viktoria Heimann et.al. | 2301.11630 | null |
2023-01-03 | Reduced Reference Quality Assessment for Point Cloud Compression | Yipeng Liu et.al. | 2301.01009 | null |
2023-04-06 | Neural Shape Compiler: A Unified Framework for Transforming between Text, Point Cloud, and Program | Tiange Luo et.al. | 2212.12952 | null |
2022-12-11 | Learning Neural Volumetric Field for Point Cloud Geometry Compression | Yueyu Hu et.al. | 2212.05589 | link |
2022-12-01 | Low-Rank Tensor Function Representation for Multi-Dimensional Data Recovery | Yisi Luo et.al. | 2212.00262 | null |
2023-12-09 | ECM-OPCC: Efficient Context Model for Octree-based Point Cloud Compression | Yiqi Jin et.al. | 2211.10916 | null |
2022-11-19 | Rate-Distortion Modeling for Bit Rate Constrained Point Cloud Compression | Pan Gao et.al. | 2211.10646 | null |
2022-10-21 | Motion Policy Networks | Adam Fishman et.al. | 2210.12209 | link |
2022-10-28 | Motion estimation and filtered prediction for dynamic point cloud attribute compression | Haoran Hong et.al. | 2210.08262 | null |
2022-10-08 | Point Cloud Upsampling via Cascaded Refinement Network | Hang Du et.al. | 2210.03942 | link |
2023-02-14 | Multiscale Latent-Guided Entropy Model for LiDAR Point Cloud Compression | Tingyu Fan et.al. | 2209.12512 | null |
2022-09-17 | CARNet:Compression Artifact Reduction for Point Cloud Attribute | Dandan Ding et.al. | 2209.08276 | null |
2022-11-16 | CU-Net: Real-Time High-Fidelity Color Upsampling for Point Clouds | Lingdong Wang et.al. | 2209.06112 | link |
2022-09-09 | GRASP-Net: Geometric Residual Analysis and Synthesis for Point Cloud Compression | Jiahao Pang et.al. | 2209.04401 | link |
2022-09-06 | Learning to Predict on Octree for Scalable Point Cloud Geometry Coding | Yixiang Mao et.al. | 2209.02226 | null |
2022-08-26 | Efficient LiDAR Point Cloud Geometry Compression Through Neighborhood Point Attention | Ruixiang Xue et.al. | 2208.12573 | null |
2022-08-17 | Efficient dynamic point cloud coding using Slice-Wise Segmentation | Faranak Tohidi et.al. | 2208.08061 | null |
2023-01-10 | Arbitrary Point Cloud Upsampling with Spherical Mixture of Gaussians | Anthony Dell'Eva et.al. | 2208.05274 | link |
2022-08-04 | IT/IST/IPLeiria Response to the Call for Proposals on JPEG Pleno Point Cloud Coding | André F. R. Guarda et.al. | 2208.02716 | null |
2022-08-04 | IPDAE: Improved Patch-Based Deep Autoencoder for Lossy Point Cloud Geometry Compression | Kang You et.al. | 2208.02519 | link |
2022-07-25 | Inter-Frame Compression for Dynamic Point Cloud Geometry Coding | Anique Akhtar et.al. | 2207.12554 | null |
2022-07-20 | GIPSO: Geometrically Informed Propagation for Online Adaptation in 3D LiDAR Segmentation | Cristiano Saltori et.al. | 2207.09763 | link |
2022-06-25 | BIMS-PU: Bi-Directional and Multi-Scale Point Cloud Upsampling | Yechao Bai et.al. | 2206.12648 | null |
2022-06-24 | Rate-Distortion Optimal Transform Coefficient Selection for Unoccupied Regions in Video-Based Point Cloud Compression | Christian Herglotz et.al. | 2206.12186 | null |
2022-05-24 | A Rate Control Algorithm for Video-based Point Cloud Compression | Fangyu Shen et.al. | 2205.11825 | null |
2022-05-19 | A Comparative Study of Feature Expansion Unit for 3D Point Cloud Upsampling | Qiang Li et.al. | 2205.09594 | null |
2022-05-02 | D-DPCC: Deep Dynamic Point Cloud Compression via 3D Motion Prediction | Tingyu Fan et.al. | 2205.01135 | link |
2022-05-02 | Point Cloud Compression with Sibling Context and Surface Priors | Zhili Chen et.al. | 2205.00760 | link |
2022-04-29 | Deep Geometry Post-Processing for Decompressed Point Clouds | Xiaoqing Fan et.al. | 2204.13952 | link |
2022-04-27 | Density-preserving Deep Point Cloud Compression | Yun He et.al. | 2204.12684 | null |
2022-04-25 | 4DAC: Learning Attribute Compression for Dynamic Point Clouds | Guangchi Fang et.al. | 2204.11723 | null |
2022-04-25 | Dynamic Point Cloud Compression with Cross-Sectional Approach | Faranak Tohidi et.al. | 2204.11409 | null |
2022-04-22 | PU-EVA: An Edge Vector based Approximation Solution for Flexible-scale Point Cloud Upsampling | Luqing Luo et.al. | 2204.10750 | null |
2022-04-18 | Self-Supervised Arbitrary-Scale Point Clouds Upsampling via Implicit Neural Representation | Wenbo Zhao et.al. | 2204.08196 | link |
2022-06-22 | Learning-based Lossless Point Cloud Geometry Coding using Sparse Tensors | Dat Thanh Nguyen et.al. | 2204.05043 | null |
2022-04-03 | Sparse Tensor-based Point Cloud Attribute Compression | Jianqiang Wang et.al. | 2204.01023 | link |
2022-03-22 | IDEA-Net: Dynamic 3D Point Cloud Interpolation via Deep Embedding Alignment | Yiming Zeng et.al. | 2203.11590 | link |
2022-03-21 | Upsampling Autoencoder for Self-Supervised Point Cloud Learning | Cheng Zhang et.al. | 2203.10768 | null |
2022-05-03 | Frequency-Selective Mesh-to-Mesh Resampling for Color Upsampling of Point Clouds | Viktoria Heimann et.al. | 2203.09224 | null |
2022-03-02 | PUFA-GAN: A Frequency-Aware Generative Adversarial Network for 3D Point Cloud Upsampling | Hao Liu et.al. | 2203.00914 | null |
2022-05-16 | Variable Rate Compression for Raw 3D Point Clouds | Md Ahmed Al Muzaddid et.al. | 2202.13862 | link |
2022-09-14 | Point cloud completion via structured feature maps using a feedback network | Zejia Su et.al. | 2202.08583 | null |
2022-05-08 | OctAttention: Octree-Based Large-Scale Contexts Model for Point Cloud Compression | Chunyang Fu et.al. | 2202.06028 | link |
2022-02-01 | Point Cloud Compression for Efficient Data Broadcasting: A Performance Comparison | Francesco Nardo et.al. | 2202.00719 | null |
2022-02-01 | Fractional Motion Estimation for Point Cloud Compression | Haoran Hong et.al. | 2202.00172 | null |
2022-01-17 | SimIPU: Simple 2D Image and 3D Point Cloud Unsupervised Pre-Training for Spatial-Aware Visual Representations | Zhenyu Li et.al. | 2112.04680 | link |
2022-03-31 | Neural Points: Point Cloud Representation with Neural Fields for Arbitrary Upsampling | Wanquan Feng et.al. | 2112.04148 | link |
2022-03-01 | Attribute Artifacts Removal for Geometry-based Point Cloud Compression | Xihua Sheng et.al. | 2112.00560 | null |
2022-10-03 | PU-Transformer: Point Cloud Upsampling Transformer | Shi Qiu et.al. | 2111.12242 | link |
2022-10-21 | Sparse Tensor-based Multiscale Representation for Point Cloud Geometry Compression | Jianqiang Wang et.al. | 2111.10633 | link |
2021-10-18 | Patch-Based Deep Autoencoder for Point Cloud Geometry Compression | Kang You et.al. | 2110.09109 | link |
2022-07-12 | PC |
Chen Long et.al. | 2109.09337 | link |
2021-09-16 | R-PCC: A Baseline for Range Image-based Point Cloud Compression | Sukai Wang et.al. | 2109.07717 | link |
2021-09-15 | Which One is Better: Assessing Objective Metrics for Point Cloud Compression | Yipeng Liu et.al. | 2109.07158 | null |
2021-08-05 | Joint Geometry and Color Projection-based Point Cloud Quality Metric | Alireza Javaheri et.al. | 2108.02481 | link |
2021-08-03 | SSPU-Net: Self-Supervised Point Cloud Upsampling via Differentiable Rendering | Yifan Zhao et.al. | 2108.00454 | link |
2021-07-29 | Video-based Point Cloud Compression Artifact Removal | Anique Akhtar et.al. | 2107.14179 | null |
2024-02-28 | Score-Based Point Cloud Denoising | Shitong Luo et.al. | 2107.10981 | link |
2022-06-08 | PU-Flow: a Point Cloud Upsampling Network with Normalizing Flows | Aihua Mao et.al. | 2107.05893 | link |
2022-04-18 | "Zero-Shot" Point Cloud Upsampling | Kaiyue Zhou et.al. | 2106.13765 | link |
2021-06-23 | Lossless Point Cloud Attribute Compression with Normal-based Intra Prediction | Qian Yin et.al. | 2106.12236 | null |
2021-06-21 | Cylindrical coordinates for LiDAR point cloud compression | Shashank N. Sridhara et.al. | 2106.11237 | null |
2021-10-11 | Neural Network Modeling of Probabilities for Coding the Octree Representation of Point Clouds | Emre Can Kaya et.al. | 2106.06482 | link |
2021-06-09 | Point Cloud Upsampling via Disentangled Refinement | Ruihui Li et.al. | 2106.04779 | link |
2021-06-02 | DeepCompress: Efficient Point Cloud Geometry Compression | Ryan Killea et.al. | 2106.01504 | null |
2021-06-01 | RAI-Net: Range-Adaptive LiDAR Point Cloud Frame Interpolation Network | Lili Zhao et.al. | 2106.00496 | null |
2021-05-28 | An Unsupervised Optical Flow Estimation For LiDAR Image Sequences | Xuezhou Guo et.al. | 2105.13879 | null |
2021-05-05 | VoxelContext-Net: An Octree based Framework for Point Cloud Compression | Zizheng Que et.al. | 2105.02158 | null |
2021-04-20 | Multiscale deep context modeling for lossless point cloud geometry compression | Dat Thanh Nguyen et.al. | 2104.09859 | link |
2021-04-12 | Towards Efficient Graph Convolutional Networks for Point Cloud Handling | Yawei Li et.al. | 2104.05706 | null |
2021-03-11 | Advanced Geometry Surface Coding for Dynamic Point Cloud Compression | Jian Xiong et.al. | 2103.06549 | null |
2021-03-05 | Hybrid Point Cloud Semantic Compression for Automotive Sensors: A Performance Evaluation | Andrea Varischio et.al. | 2103.03819 | null |
2021-02-26 | Point Cloud Upsampling and Normal Estimation using Deep Learning for Robust Surface Reconstruction | Rajat Sharma et.al. | 2102.13391 | link |
2021-02-25 | A deep perceptual metric for 3D point clouds | Maurice Quach et.al. | 2102.12839 | link |
2021-02-08 | Meta-PU: An Arbitrary-Scale Upsampling Network for Point Cloud | Shuquan Ye et.al. | 2102.04317 | null |
2020-12-15 | NeuralQAAD: An Efficient Differentiable Framework for High Resolution Point Cloud Compression | Nicolas Wagner et.al. | 2012.08143 | null |
2022-06-11 | SPU-Net: Self-Supervised Point Cloud Upsampling by Coarse-to-Fine Reconstruction with Self-Projection Optimization | Xinhai Liu et.al. | 2012.04439 | link |
2021-11-18 | Vehicular Cooperative Perception Through Action Branching and Federated Reinforcement Learning | Mohamed K. Abdel-Aziz et.al. | 2012.03414 | null |
2020-12-05 | ParaNet: Deep Regular Representation for 3D Point Clouds | Qijian Zhang et.al. | 2012.03028 | null |
2020-11-27 | Spherical Interpolated Convolutional Network with Distance-Feature Density for 3D Semantic Segmentation of Point Clouds | Guangming Wang et.al. | 2011.13784 | null |
2020-11-25 | Reduced Reference Perceptual Quality Model and Application to Rate Control for 3D Point Cloud Compression | Qi Liu et.al. | 2011.12688 | null |
2020-11-07 | Multiscale Point Cloud Geometry Compression | Jianqiang Wang et.al. | 2011.03799 | link |
2020-10-29 | Point Cloud Attribute Compression via Successive Subspace Graph Transform | Yueru Chen et.al. | 2010.15302 | null |
2020-08-16 | Real-Time Spatio-Temporal LiDAR Point Cloud Compression | Yu Feng et.al. | 2008.06972 | link |
2021-08-03 | Subjective Quality Database and Objective Study of Compressed Point Clouds With 6DoF Head-Mounted Display | Xinju Wu et.al. | 2008.02501 | null |
2020-06-20 | Pseudo-LiDAR Point Cloud Interpolation Based on 3D Motion Representation and Spatial Supervision | Haojie Liu et.al. | 2006.11481 | null |
2020-06-24 | Improved Deep Point Cloud Geometry Compression | Maurice Quach et.al. | 2006.09043 | link |
2020-04-03 | Intrinsic Point Cloud Interpolation via Dual Latent Space Navigation | Marie-Julie Rakotosaona et.al. | 2004.01661 | link |
2020-03-30 | A generalized Hausdorff distance based quality metric for point cloud geometry | Alireza Javaheri et.al. | 2003.13669 | null |
2020-03-30 | Optimizing Geometry Compression using Quantum Annealing | Sebastian Feld et.al. | 2003.13253 | null |
2020-03-27 | Model-based Joint Bit Allocation between Geometry and Color for Video-based 3D Point Cloud Compression | Qi Liu et.al. | 2002.10798 | null |
2020-03-07 | PUGeo-Net: A Geometry-centric Network for 3D Point Cloud Upsampling | Yue Qian et.al. | 2002.10277 | null |
2020-06-22 | Folding-based compression of point cloud attributes | Maurice Quach et.al. | 2002.04439 | null |
2020-01-13 | Efficient 3D Road Map Data Exchange for Intelligent Vehicles in Vehicular Fog Networks | Ivan Wang-Hei Ho et.al. | 2001.04057 | null |
2020-01-12 | Linear Model based Geometry Coding for Lidar Acquired Point Clouds | Xiang Zhang et.al. | 2001.03871 | null |
2021-04-09 | PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection | Shaoshuai Shi et.al. | 1912.13192 | link |
2019-12-20 | A Comprehensive Study and Comparison of Core Technologies for MPEG 3D Point Cloud Compression | Hao Liu et.al. | 1912.09674 | null |
2020-10-15 | Point Cloud Rendering after Coding: Impacts on Subjective and Objective Quality | Alireza Javaheri et.al. | 1912.09137 | null |
2021-03-29 | PU-GCN: Point Cloud Upsampling using Graph Convolutional Networks | Guocheng Qian et.al. | 1912.03264 | link |
2019-11-04 | Video-based compression for plenoptic point clouds | Li Li et.al. | 1911.01355 | null |
2019-09-26 | Learned Point Cloud Geometry Compression | Jianqiang Wang et.al. | 1909.12037 | link |
2019-09-16 | PLIN: A Network for Pseudo-LiDAR Point Cloud Interpolation | Haojie Liu et.al. | 1909.07137 | null |
2019-08-17 | 3D Point Cloud Super-Resolution via Graph Total Variation on Surface Normals | Chinthaka Dinesh et.al. | 1908.06261 | null |
2019-08-06 | Point Cloud Super Resolution with Adversarial Residual Graph Networks | Huikai Wu et.al. | 1908.02111 | link |
2020-08-10 | Predictive Generalized Graph Fourier Transform for Attribute Compression of Dynamic Point Clouds | Yiqun Xu et.al. | 1908.01970 | null |
2019-07-25 | PU-GAN: a Point Cloud Upsampling Adversarial Network | Ruihui Li et.al. | 1907.10844 | null |
2019-06-27 | A Convolutional Decoder for Point Clouds using Adaptive Instance Normalization | Isaak Lim et.al. | 1906.11478 | null |
2019-04-18 | Deep AutoEncoder-based Lossy Geometry Compression for Point Clouds | Wei Yan et.al. | 1905.03691 | null |
2019-05-22 | Learning Convolutional Transforms for Lossy Point Cloud Geometry Compression | Maurice Quach et.al. | 1903.08548 | link |
2019-09-30 | Variational Graph Methods for Efficient Point Cloud Sparsification | Daniel Tenbrinck et.al. | 1903.02858 | null |
2019-03-05 | Pose Estimation of Vehicles Over Uneven Terrain | Yingchong Ma et.al. | 1903.02052 | null |
2019-02-11 | Occupancy-map-based rate distortion optimization for video-based point cloud compression | Li Li et.al. | 1902.04169 | null |
2018-09-30 | A Volumetric Approach to Point Cloud Compression | Maja Krivokuća et.al. | 1810.00484 | null |
2018-05-29 | Surface Light Field Compression using a Point Cloud Codec | Xiang Zhang et.al. | 1805.11203 | null |
2018-05-23 | Comments on "Compression of 3D Point Clouds Using a Region-Adaptive Hierarchical Transform" | Gustavo Sandri et.al. | 1805.09146 | null |
2018-04-28 | Hybrid Point Cloud Attribute Compression Using Slice-based Layered Structure and Block-based Intra Prediction | Yiting Shao et.al. | 1804.10783 | null |
2018-03-26 | PU-Net: Point Cloud Upsampling Network | Lequan Yu et.al. | 1801.06761 | link |
2017-10-10 | Attribute Compression of 3D Point Clouds Using Laplacian Sparsity Optimized Graph Transform | Yiting Shao et.al. | 1710.03532 | null |
2017-03-08 | Dynamic Polygon Clouds: Representation and Compression for VR/AR | Philip A. Chou et.al. | 1610.00402 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-09-18 | One Map to Find Them All: Real-time Open-Vocabulary Mapping for Zero-shot Multi-Object Navigation | Finn Lukas Busch et.al. | 2409.11764 | null |
2024-09-18 | LFIC-DRASC: Deep Light Field Image Compression Using Disentangled Representation and Asymmetrical Strip Convolution | Shiyu Feng et.al. | 2409.11711 | null |
2024-09-18 | k-mer-based approaches to bridging pangenomics and population genetics | Miles D. Roberts et.al. | 2409.11683 | null |
2024-09-17 | Few-Shot Domain Adaptation for Learned Image Compression | Tianyu Zhang et.al. | 2409.11111 | null |
2024-09-17 | Edge-based Denoising Image Compression | Ryugo Morita et.al. | 2409.10978 | null |
2024-09-16 | Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning | Amin Karimi Monsefi et.al. | 2409.10362 | null |
2024-09-14 | Lossy Image Compression with Stochastic Quantization | Anton Kozyriev et.al. | 2409.09488 | null |
2024-09-13 | Fast DCT+: A Family of Fast Transforms Based on Rank-One Updates of the Path Graph | Samuel Fernández-Menduiña et.al. | 2409.08970 | null |
2024-09-13 | On the Computation of BD-Rate over a Set of Videos for Fair Assessment of Performance of Learned Video Codecs | M. Akin Yilmaz et.al. | 2409.08772 | null |
2024-09-13 | USTC-TD: A Test Dataset and Benchmark for Image and Video Coding in 2020s | Zhuoyuan Li et.al. | 2409.08481 | null |
2024-09-12 | Learned Compression for Images and Point Clouds | Mateen Ulhaq et.al. | 2409.08376 | link |
2024-09-11 | NVRC: Neural Video Representation Compression | Ho Man Kwan et.al. | 2409.07414 | null |
2024-09-11 | Dynamic Error-Bounded Hierarchical Matrices in Neural Network Compression | John Mango et.al. | 2409.07028 | null |
2024-09-10 | Universal End-to-End Neural Network for Lossy Image Compression | Bouzid Arezki et.al. | 2409.06586 | null |
2024-09-10 | Rate-Constrained Quantization for Communication-Efficient Federated Learning | Shayan Mohajer Hamidi et.al. | 2409.06319 | null |
2024-09-09 | Design and Implementation of TAO DAQ System | Shuihan Zhang et.al. | 2409.05522 | null |
2024-09-09 | A Taxonomy of Miscompressions: Preparing Image Forensics for Neural Compression | Nora Hofer et.al. | 2409.05490 | null |
2024-09-09 | Attention Based Machine Learning Methods for Data Reduction with Guaranteed Error Bounds | Xiao Li et.al. | 2409.05357 | null |
2024-09-06 | Convolutional Transformer-Based Image Compression | Bouzid Arezki et.al. | 2409.04118 | null |
2024-09-06 | 3D-GP-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors | Yujun Huang et.al. | 2409.04013 | link |
2024-09-05 | TropNNC: Structured Neural Network Compression Using Tropical Geometry | Konstantinos Fotopoulos et.al. | 2409.03945 | null |
2024-09-05 | Unified Framework for Neural Network Compression via Decomposition and Optimal Rank Selection | Ali Aghababaei-Harandi et.al. | 2409.03555 | null |
2024-09-05 | Efficient Image Compression Using Advanced State Space Models | Bouzid Arezki et.al. | 2409.02743 | null |
2024-09-10 | FrameCorr: Adaptive, Autoencoder-based Neural Compression for Video Reconstruction in Resource and Timing Constrained Network Settings | John Li et.al. | 2409.02453 | null |
2024-09-03 | Compressed learning based onboard semantic compression for remote sensing platforms | Protim Bhattacharjee et.al. | 2409.01988 | null |
2024-09-03 | Map-Assisted Remote-Sensing Image Compression at Extremely Low Bitrates | Yixuan Ye et.al. | 2409.01935 | link |
2024-09-03 | Privacy-Preserving Multimedia Mobile Cloud Computing Using Protective Perturbation | Zhongze Tang et.al. | 2409.01710 | null |
2024-09-02 | Multi-Reference Generative Face Video Compression with Contrastive Learning | Goluck Konuko et.al. | 2409.01029 | null |
2024-09-02 | Accelerating block-level rate control for learned image compression | Muchen Dong et.al. | 2409.01009 | null |
2024-09-02 | PNVC: Towards Practical INR-based Video Compression | Ge Gao et.al. | 2409.00953 | null |
2024-09-01 | BWT construction and search at the terabase scale | Heng Li et.al. | 2409.00613 | link |
2024-08-30 | Prioritized Information Bottleneck Theoretic Framework with Distributed Online Learning for Edge Video Analytics | Zhengru Fang et.al. | 2409.00146 | null |
2024-08-28 | Quantum Kernel Principal Components Analysis for Compact Readout of Chemiresistive Sensor Arrays | Zeheng Wang et.al. | 2409.00115 | null |
2024-08-30 | NDP: Next Distribution Prediction as a More Broad Target | Junhao Ruan et.al. | 2408.17377 | null |
2024-08-30 | Approximately Invertible Neural Network for Learned Image Compression | Yanbo Gao et.al. | 2408.17073 | null |
2024-08-29 | UAV-Based Human Body Detector Selection and Fusion for Geolocated Saliency Map Generation | Piotr Rudol et.al. | 2408.16501 | null |
2024-08-29 | Convolutional Neural Network Compression Based on Low-Rank Decomposition | Yaping He et.al. | 2408.16289 | null |
2024-08-27 | Bandwidth-Aware and Overlap-Weighted Compression for Communication-Efficient Federated Learning | Zichen Tang et.al. | 2408.14736 | null |
2024-08-25 | Condensed Sample-Guided Model Inversion for Knowledge Distillation | Kuluhan Binici et.al. | 2408.13850 | null |
2024-08-12 | Semantic Variational Bayes Based on a Semantic Information Theory for Solving Latent Variables | Chenguang Lu et.al. | 2408.13122 | null |
2024-08-22 | Quantization-free Lossy Image Compression Using Integer Matrix Factorization | Pooya Ashtari et.al. | 2408.12691 | link |
2024-08-22 | DeepHQ: Learned Hierarchical Quantizer for Progressive Deep Image Coding | Jooyoung Lee et.al. | 2408.12150 | null |
2024-08-28 | AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and Results | Maksim Smirnov et.al. | 2408.11982 | link |
2024-08-20 | Trustworthy Compression? Impact of AI-based Codecs on Biometrics for Law Enforcement | Sandra Bergmann et.al. | 2408.10823 | null |
2024-08-20 | Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds | Kai Liu et.al. | 2408.10543 | null |
2024-08-16 | LLM-PCGC: Large Language Model-based Point Cloud Geometry Compression | Yuqi Ye et.al. | 2408.08682 | null |
2024-08-16 | Bi-Directional Deep Contextual Video Compression | Xihua Sheng et.al. | 2408.08604 | null |
2024-08-16 | Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs | Jinming Liu et.al. | 2408.08575 | null |
2024-08-15 | Algebraic Vertex Ordering of a Sparse Graph for Adjacency Access Locality and Graph Compression | Dimitris Floros et.al. | 2408.08439 | null |
2024-08-15 | When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding | Pingping Zhang et.al. | 2408.08093 | null |
2024-08-15 | DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions | Ryosuke Korekata et.al. | 2408.07910 | null |
2024-08-14 | Towards Real-time Video Compressive Sensing on Mobile Devices | Miao Cao et.al. | 2408.07530 | link |
2024-08-14 | Encoding and Decoding Algorithms of ANS Variants and Evaluation of Their Average Code Lengths | Hirosuke Yamamoto et.al. | 2408.07322 | null |
2024-08-13 | Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality | Yu-Chih Chen et.al. | 2408.07041 | null |
2024-08-13 | Feature-Preserving Rate-Distortion Optimization in Image Coding for Machines | Samuel Fernández Menduiña et.al. | 2408.07028 | null |
2024-08-19 | Joint Source-Channel Optimization for UAV Video Coding and Transmission | Kesong Wu et.al. | 2408.06667 | null |
2024-08-08 | Flow-Lenia.png: Evolving Multi-Scale Complexity by Means of Compression | Tadashi Adachi et.al. | 2408.06374 | null |
2024-08-09 | Benchmarking Conventional and Learned Video Codecs with a Low-Delay Configuration | Siyue Teng et.al. | 2408.05042 | null |
2024-08-08 | SG-JND: Semantic-Guided Just Noticeable Distortion Predictor For Image Compression | Linhan Cao et.al. | 2408.04273 | null |
2024-08-07 | Bi-Level Spatial and Channel-aware Transformer for Learned Image Compression | Hamidreza Soltani et.al. | 2408.03842 | null |
2024-08-07 | BVI-AOM: A New Training Dataset for Deep Video Compression Optimization | Jakub Nawała et.al. | 2408.03265 | null |
2024-08-06 | Enabling High-Throughput Parallel I/O in Particle-in-Cell Monte Carlo Simulations with openPMD and Darshan I/O Monitoring | Jeremy J. Williams et.al. | 2408.02869 | null |
2024-08-05 | Dimensionality Reduction and Nearest Neighbors for Improving Out-of-Distribution Detection in Medical Image Segmentation | McKell Woodland et.al. | 2408.02761 | link |
2024-08-04 | CACE-Net: Co-guidance Attention and Contrastive Enhancement for Effective Audio-Visual Event Localization | Xiang He et.al. | 2408.01952 | link |
2024-08-03 | Channel-Aware Distributed Transmission Control and Video Streaming in UAV Networks | Masoud Ghazikor et.al. | 2408.01885 | null |
2024-08-02 | An Adaptive Tensor-Train Decomposition Approach for Efficient Deep Neural Network Compression | Shiyi Luo et.al. | 2408.01534 | null |
2024-07-31 | Exploiting Change Blindness for Video Coding: Perspectives from a Less Promising User Study | Mitra Amiri et.al. | 2408.00052 | null |
2024-07-31 | Tora: Trajectory-oriented Diffusion Transformer for Video Generation | Zhenghao Zhang et.al. | 2407.21705 | null |
2024-07-30 | Edge Learning Based Collaborative Automatic Modulation Classification for Hierarchical Cognitive Radio Networks | Peihao Dong et.al. | 2407.20772 | link |
2024-07-30 | Understanding the Impact of Synchronous, Asynchronous, and Hybrid In-Situ Techniques in Computational Fluid Dynamics Applications | Yi Ju et.al. | 2407.20717 | null |
2024-07-29 | Homomorphic data compression for real time photon correlation analysis | Sebastian Strempfer et.al. | 2407.20356 | null |
2024-07-24 | Accelerating the Low-Rank Decomposed Models | Habib Hajimolahoseini et.al. | 2407.20266 | null |
2024-07-29 | ComNeck: Bridging Compressed Image Latents and Multimodal LLMs via Universal Transform-Neck | Chia-Hao Kao et.al. | 2407.19651 | null |
2024-07-28 | NVC-1B: A Large Neural Video Coding Model | Xihua Sheng et.al. | 2407.19402 | null |
2024-07-18 | Generative AI Augmented Induction-based Formal Verification | Aman Kumar et.al. | 2407.18965 | null |
2024-07-25 | The seismic purifier: An unsupervised approach to seismic signal detection via representation learning | Onur Efe et.al. | 2407.18402 | link |
2024-07-25 | Adaptable Deep Joint Source-and-Channel Coding for Small Satellite Applications | Olga Kondrateva et.al. | 2407.18146 | null |
2024-07-25 | Scaling Training Data with Lossy Image Compression | Katherine L. Mentzer et.al. | 2407.17954 | link |
2024-07-25 | Towards the Spectral bias Alleviation by Normalizations in Coordinate Networks | Zhicheng Cai et.al. | 2407.17834 | link |
2024-07-24 | Lossy Data Compression By Adaptive Mesh Coarsening | N. Böing et.al. | 2407.17316 | null |
2024-07-24 | High Efficiency Image Compression for Large Visual-Language Models | Binzhe Li et.al. | 2407.17060 | null |
2024-07-23 | Accelerating Learned Video Compression via Low-Resolution Representation Learning | Zidian Qiu et.al. | 2407.16418 | null |
2024-07-24 | FCNR: Fast Compressive Neural Representation of Visualization Images | Yunfei Lu et.al. | 2407.16369 | link |
2024-07-19 | Shapley Pruning for Neural Network Compression | Kamil Adamczewski et.al. | 2407.15875 | null |
2024-07-18 | CIC: Circular Image Compression | Honggui Li et.al. | 2407.15870 | null |
2024-07-22 | Online String Attractors | Philip Whittington et.al. | 2407.15599 | null |
2024-07-22 | Spectral properties of bright deposits in permanently shadowed craters on Ceres | Stefan Schröder et.al. | 2407.15327 | null |
2024-07-21 | Lessons Learned on the Path to Guaranteeing the Error Bound in Lossy Quantizers | Alex Fallin et.al. | 2407.15037 | null |
2024-07-19 | A Benchmark for Gaussian Splatting Compression and Quality Assessment Study | Qi Yang et.al. | 2407.14197 | link |
2024-07-18 | Training Foundation Models as Data Compression: On Information, Model Weights and Copyright Law | Giorgio Franceschelli et.al. | 2407.13493 | null |
2024-07-18 | Learned HDR Image Compression for Perceptually Optimal Storage and Display | Peibei Cao et.al. | 2407.13179 | null |
2024-07-17 | High Frequency Matters: Uncertainty Guided Image Compression with Wavelet Diffusion | Juan Song et.al. | 2407.12538 | link |
2024-07-17 | Enhancing Film Grain Coding in VVC: Improving Encoding Quality and Efficiency | Vignesh V Menon et.al. | 2407.12465 | null |
2024-07-17 | Reliability Function of Classical-Quantum Channels | Ke Li et.al. | 2407.12403 | null |
2024-07-17 | Exploiting Inter-Image Similarity Prior for Low-Bitrate Remote Sensing Image Compression | Junhui Li et.al. | 2407.12295 | null |
2024-07-16 | Tiled Bit Networks: Sub-Bit Neural Network Compression Through Reuse of Learnable Binary Vectors | Matt Gorbett et.al. | 2407.12075 | null |
2024-07-17 | Rate-Distortion-Cognition Controllable Versatile Neural Image Compression | Jinming Liu et.al. | 2407.11700 | null |
2024-07-16 | MINI-LLM: Memory-Efficient Structured Pruning for Large Language Models | Hongrong Cheng et.al. | 2407.11681 | null |
2024-07-17 | Neural Compression of Atmospheric States | Piotr Mirowski et.al. | 2407.11666 | null |
2024-07-16 | Rethinking Learned Image Compression: Context is All You Need | Jixiang Luo et.al. | 2407.11590 | null |
2024-07-16 | The impact of lossy data compression on the power spectrum of the high redshift 21-cm signal with LOFAR | J. K. Chege et.al. | 2407.11557 | null |
2024-07-21 | Uniformly Accelerated Motion Model for Inter Prediction | Zhuoyuan Li et.al. | 2407.11541 | null |
2024-07-15 | M18K: A Comprehensive RGB-D Dataset and Benchmark for Mushroom Detection and Instance Segmentation | Abdollah Zakeri et.al. | 2407.11275 | link |
2024-07-15 | Enhancing Electrocardiogram Signal Analysis Using NLP-Inspired Techniques: A Novel Approach with Embedding and Self-Attention | Prapti Ganguly et.al. | 2407.11102 | null |
2024-07-15 | In-Loop Filtering via Trained Look-Up Tables | Zhuoyuan Li et.al. | 2407.10926 | null |
2024-07-15 | Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model | Zhening Liu et.al. | 2407.10632 | null |
2024-07-14 | UMI on Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers | Huy Ha et.al. | 2407.10353 | null |
2024-07-13 | WeConvene: Learned Image Compression with Wavelet-Domain Convolution and Entropy Model | Haisheng Fu et.al. | 2407.09983 | null |
2024-07-13 | Zero-Shot Image Compression with Diffusion-Based Posterior Sampling | Noam Elata et.al. | 2407.09896 | link |
2024-07-13 | Image Compression for Machine and Human Vision with Spatial-Frequency Adaptation | Han Li et.al. | 2407.09853 | link |
2024-07-13 | Infinite families of optimal and minimal codes over rings using simplicial complexes | Yanan Wu et.al. | 2407.09783 | null |
2024-07-12 | HPC: Hierarchical Progressive Coding Framework for Volumetric Video | Zihan Zheng et.al. | 2407.09026 | null |
2024-07-12 | Hybrid Temporal Computing for Lower Power Hardware Accelerators | Maliha Tasnim et.al. | 2407.08975 | null |
2024-07-11 | Manipulating a Tetris-Inspired 3D Video Representation | Mihir Godbole et.al. | 2407.08885 | null |
2024-07-11 | OMR-NET: a two-stage octave multi-scale residual network for screen content image compression | Shiqi Jiang et.al. | 2407.08545 | null |
2024-07-11 | CADC: Encoding User-Item Interactions for Compressing Recommendation Model Training Data | Hossein Entezari Zarch et.al. | 2407.08108 | null |
2024-07-10 | Using Low-Discrepancy Points for Data Compression in Machine Learning: An Experimental Comparison | Simone Göttlich et.al. | 2407.07450 | null |
2024-07-10 | Standard compliant video coding using low complexity, switchable neural wrappers | Yueyu Hu et.al. | 2407.07395 | null |
2024-07-10 | MNeRV: A Multilayer Neural Representation for Videos | Qingling Chang et.al. | 2407.07347 | link |
2024-07-11 | Entropy Law: The Story Behind Data Compression and LLM Performance | Mingjia Yin et.al. | 2407.06645 | link |
2024-07-08 | A Hybrid Algorithm for Computing a Partial Singular Value Decomposition Satisfying a Given Threshold | James Baglama et.al. | 2407.06306 | link |
2024-07-08 | TAPVid-3D: A Benchmark for Tracking Any Point in 3D | Skanda Koppula et.al. | 2407.05921 | link |
2024-07-05 | The Impact of Quantization and Pruning on Deep Reinforcement Learning Models | Heng Lu et.al. | 2407.04803 | null |
2024-07-05 | An autoencoder for compressing angle-resolved photoemission spectroscopy data | Steinn Ymir Agustsson et.al. | 2407.04631 | link |
2024-07-05 | Rethinking Image Compression on the Web with Generative AI | Shayan Ali Hassan et.al. | 2407.04542 | null |
2024-07-11 | A High-Quality Workflow for Multi-Resolution Scientific Data Reduction and Visualization | Daoce Wang et.al. | 2407.04267 | null |
2024-07-04 | Autoencoded Image Compression for Secure and Fast Transmission | Aryan Kashyap Naveen et.al. | 2407.03990 | link |
2024-07-03 | Value-Penalized Auxiliary Control from Examples for Learning without Rewards or Demonstrations | Trevor Ablett et.al. | 2407.03311 | link |
2024-07-03 | KeyVideoLLM: Towards Large-scale Video Keyframe Selection | Hao Liang et.al. | 2407.03104 | null |
2024-07-01 | Statistical Analysis of ZFP: Understanding Bias | Alyson Fox et.al. | 2407.01826 | null |
2024-07-01 | An AI-based, Error-bounded Compression Scheme for High-frequency Power Quality Disturbance Data | Markus Stroot et.al. | 2407.01112 | null |
2024-06-28 | Wavelets Are All You Need for Autoregressive Image Generation | Wael Mattar et.al. | 2406.19997 | null |
2024-06-28 | Optimal Video Compression using Pixel Shift Tracking | Hitesh Saai Mananchery Panneerselvam et.al. | 2406.19630 | link |
2024-06-27 | MCNC: Manifold Constrained Network Compression | Chayne Thrash et.al. | 2406.19301 | null |
2024-06-27 | Staggered Quantizers for Perfect Perceptual Quality: A Connection between Quantizers with Common Randomness and Without | Ruida Zhou et.al. | 2406.19248 | null |
2024-06-25 | Asymptotically Minimax Regret by Bayes Mixtures | Jun'ichi Takeuchi et.al. | 2406.17929 | null |
2024-06-24 | Hierarchical B-frame Video Coding for Long Group of Pictures | Ivan Kirillov et.al. | 2406.16544 | null |
2024-06-20 | Ranking LLMs by compression | Peijia Guo et.al. | 2406.14171 | null |
2024-06-21 | Measuring Sample Importance in Data Pruning for Training LLMs from a Data Compression Perspective | Minsang Kim et.al. | 2406.14124 | null |
2024-06-20 | Prediction and Reference Quality Adaptation for Learned Video Compression | Xihua Sheng et.al. | 2406.14118 | null |
2024-06-19 | Convex-hull Estimation using XPSNR for Versatile Video Coding | Vignesh V Menon et.al. | 2406.13712 | null |
2024-06-19 | A Study on the Effect of Color Spaces in Learned Image Compression | Srivatsa Prativadibhayankaram et.al. | 2406.13709 | null |
2024-06-19 | Stability and Generalizability in SDE Diffusion Models with Measure-Preserving Dynamics | Weitong Zhang et.al. | 2406.13652 | null |
2024-06-18 | Learned Image Compression for HE-stained Histopathological Images via Stain Deconvolution | Maximilian Fischer et.al. | 2406.12623 | null |
2024-06-18 | Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines | Honglei Zhang et.al. | 2406.12367 | null |
2024-06-15 | How Should We Extract Discrete Audio Tokens from Self-Supervised Models? | Pooneh Mousavi et.al. | 2406.10735 | null |
2024-06-15 | Object-Attribute-Relation Representation based Video Semantic Communication | Qiyuan Du et.al. | 2406.10469 | null |
2024-06-14 | On Efficient Neural Network Architectures for Image Compression | Yichi Zhang et.al. | 2406.10361 | link |
2024-06-14 | Information Compression in the AI Era: Recent Advances and Future Challenges | Jun Chen et.al. | 2406.10036 | null |
2024-06-13 | CMC-Bench: Towards a New Paradigm of Visual Signal Compression | Chunyi Li et.al. | 2406.09356 | link |
2024-06-13 | Neural NeRF Compression | Tuan Pham et.al. | 2406.08943 | null |
2024-06-14 | Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models | Yi-Fan Zhang et.al. | 2406.08487 | link |
2024-06-12 | On Annotation-free Optimization of Video Coding for Machines | Marc Windsheimer et.al. | 2406.07938 | null |
2024-06-11 | SSNVC: Single Stream Neural Video Compression with Implicit Temporal Information | Feng Wang et.al. | 2406.07645 | null |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548 | link |
2024-06-11 | Optimal Matrix-Mimetic Tensor Algebras via Variable Projection | Elizabeth Newman et.al. | 2406.06942 | link |
2024-06-10 | Deep Generative Modeling Reshapes Compression and Transmission: From Efficiency to Resiliency | Jincheng Dai et.al. | 2406.06446 | null |
2024-06-10 | Image Compression with Isotropic and Anisotropic Shepard Inpainting | Rahul Mohideen Kaja Mohideen et.al. | 2406.06247 | null |
2024-06-10 | Efficient Neural Compression with Inference-time Decoding | C. Metz et.al. | 2406.06237 | null |
2024-06-10 | Fiducial-Cosmology-dependent systematics for the DESI 2024 BAO Analysis | A. Pérez-Fernández et.al. | 2406.06085 | null |
2024-06-10 | Quantum Sparse Coding and Decoding Based on Quantum Network | Xun Ji et.al. | 2406.06012 | null |
2024-06-09 | Region of Interest Loss for Anonymizing Learned Image Compression | Christoph Liebender et.al. | 2406.05726 | link |
2024-06-08 | Regularized Training with Generated Datasets for Name-Only Transfer of Vision-Language Models | Minho Park et.al. | 2406.05432 | null |
2024-06-07 | PatchSVD: A Non-uniform SVD-based Image Compression Algorithm | Zahra Golpayegani et.al. | 2406.05129 | link |
2024-06-07 | SMC++: Masked Learning of Unsupervised Video Semantic Compression | Yuan Tian et.al. | 2406.04765 | link |
2024-06-06 | LDM-RSIC: Exploring Distortion Prior with Latent Diffusion Models for Remote Sensing Image Compression | Junhui Li et.al. | 2406.03961 | null |
2024-06-05 | Lossless Image Compression Using Multi-level Dictionaries: Binary Images | Samar Agnihotri et.al. | 2406.03087 | null |
2024-06-05 | On Jacob Ziv's Individual-Sequence Approach to Information Theory | Neri Merhav et.al. | 2406.02904 | null |
2024-06-04 | Towards AI-Assisted Sustainable Adaptive Video Streaming Systems: Tutorial and Survey | Reza Farahani et.al. | 2406.02302 | null |
2024-06-03 | Video Coding with Cross-Component Sample Offset | Han Gao et.al. | 2406.01795 | null |
2024-06-05 | Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaption | Anqi Li et.al. | 2406.00758 | link |
2024-06-01 | Efficient Massive Black Hole Binary parameter estimation for LISA using Sequential Neural Likelihood | Iván Martín Vílchez et.al. | 2406.00565 | null |
2024-06-01 | A Review of Pulse-Coupled Neural Network Applications in Computer Vision and Image Processing | Nurul Rafi et.al. | 2406.00239 | null |
2024-05-31 | ContextGS: Compact 3D Gaussian Splatting with Anchor Level Context Model | Yufei Wang et.al. | 2405.20721 | link |
2024-05-30 | Quantum encoder for fixed Hamming-weight subspaces | Renato M. S. Farias et.al. | 2405.20408 | null |
2024-05-29 | Implicit Neural Image Field for Biological Microscopy Image Compression | Gaole Dai et.al. | 2405.19012 | link |
2024-05-28 | Deep Network Pruning: A Comparative Study on CNNs in Face Recognition | Fernando Alonso-Fernandez et.al. | 2405.18302 | null |
2024-05-28 | Channel Reciprocity Based Attack Detection for Securing UWB Ranging by Autoencoder | Wenlong Gou et.al. | 2405.18255 | null |
2024-05-27 | Evaluation of Resource-Efficient Crater Detectors on Embedded Systems | Simon Vellas et.al. | 2405.16953 | link |
2024-05-27 | UniCompress: Enhancing Multi-Data Medical Image Compression with Knowledge Distillation | Runzhao Yang et.al. | 2405.16850 | null |
2024-05-27 | Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model | Shoma Iwai et.al. | 2405.16817 | link |
2024-05-25 | N-BVH: Neural ray queries with bounding volume hierarchies | Philippe Weier et.al. | 2405.16237 | link |
2024-05-25 | A 7K Parameter Model for Underwater Image Enhancement based on Transmission Map Prior | Fuheng Zhou et.al. | 2405.16197 | link |
2024-05-24 | Analytical proxy to families of numerical solutions: the case study of spherical mini-boson stars | Jianzhi Yang et.al. | 2405.15651 | null |
2024-05-24 | SATSense: Multi-Satellite Collaborative Framework for Spectrum Sensing | Haoxuan Yuan et.al. | 2405.15542 | null |
2024-05-24 | Meta-meshing and triangulating lattice structures at a large scale | Qiang Zou et.al. | 2405.15197 | null |
2024-05-23 | NeCGS: Neural Compression for 3D Geometry Sets | Siyu Ren et.al. | 2405.15034 | null |
2024-05-23 | An augmented Lagrangian trust-region method with inexact gradient evaluations to accelerate constrained optimization problems using model hyperreduction | Tianshu Wen et.al. | 2405.14827 | null |
2024-05-23 | Motion-based video compression for resource-constrained camera traps | Malika Nisal Ratnayake et.al. | 2405.14419 | null |
2024-06-01 | I |
Meiqin Liu et.al. | 2405.14336 | link |
2024-05-23 | Sparse |
Matthias Chung et.al. | 2405.14270 | null |
2024-05-22 | "Turing Tests" For An AI Scientist | Xiaoxin Yin et.al. | 2405.13352 | null |
2024-05-21 | Efficient Learned Wavelet Image and Video Coding | Anna Meyer et.al. | 2405.12631 | null |
2024-05-24 | Accelerating Relative Entropy Coding with Space Partitioning | Jiajun He et.al. | 2405.12203 | null |
2024-05-20 | Refining Coded Image in Human Vision Layer Using CNN-Based Post-Processing | Takahiro Shindo et.al. | 2405.11894 | null |
2024-05-19 | Effective In-Context Example Selection through Data Compression | Zhongxiang Sun et.al. | 2405.11465 | null |
2024-05-18 | InfRS: Incremental Few-Shot Object Detection in Remote Sensing Images | Wuzhou Li et.al. | 2405.11293 | link |
2024-05-17 | Dark Energy Survey Year 3 results: simulation-based cosmological inference with wavelet harmonics, scattering transforms, and moments of weak lensing mass maps II. Cosmological results | M. Gatti et.al. | 2405.10881 | null |
2024-05-17 | Reduced storage direct tensor ring decomposition for convolutional neural networks compression | Mateusz Gabor et.al. | 2405.10802 | link |
2024-05-17 | Enhancing Perception Quality in Remote Sensing Image Compression via Invertible Neural Network | Junhui Li et.al. | 2405.10518 | null |
2024-05-15 | Properties that allow or prohibit transferability of adversarial attacks among quantized networks | Abhishek Shrestha et.al. | 2405.09598 | link |
2024-05-15 | Sensitivity Decouple Learning for Image Compression Artifacts Reduction | Li Ma et.al. | 2405.09291 | null |
2024-05-18 | Scalable Image Coding for Humans and Machines Using Feature Fusion Network | Takahiro Shindo et.al. | 2405.09152 | link |
2024-05-14 | Parameter-Efficient Instance-Adaptive Neural Video Compression | Hyunmo Yang et.al. | 2405.08530 | link |
2024-05-13 | Goal-oriented compression for |
Yifei Sun et.al. | 2405.07808 | null |
2024-05-13 | Neural Network Compression for Reinforcement Learning Tasks | Dmitry A. Ivanov et.al. | 2405.07748 | null |
2024-05-13 | On the Adversarial Robustness of Learning-based Image Compression Against Rate-Distortion Attacks | Chenhao Wu et.al. | 2405.07717 | null |
2024-05-21 | An Efficient Compression Method for Sign Information of DCT Coefficients via Sign Retrieval | Chihiro Tsutake et.al. | 2405.07487 | link |
2024-05-10 | Time-of-arrival Estimation and Phase Unwrapping of Head-related Transfer Functions With Integer Linear Programming | Chin-Yun Yu et.al. | 2405.06804 | link |
2024-05-08 | Urban Boundary Delineation from Commuting Data with Bayesian Stochastic Blockmodeling: Scale, Contiguity, and Hierarchy | Sebastian Morel-Balbi et.al. | 2405.04911 | link |
2024-05-14 | Some Notes on the Sample Complexity of Approximate Channel Simulation | Gergely Flamich et.al. | 2405.04363 | null |
2024-05-07 | Group-aware Parameter-efficient Updating for Content-Adaptive Neural Video Compression | Zhenghao Chen et.al. | 2405.04274 | null |
2024-05-08 | Verified Neural Compressed Sensing | Rudy Bunel et.al. | 2405.04260 | null |
2024-05-15 | Lossy Compression with Data, Perception, and Classification Constraints | Yuhan Wang et.al. | 2405.04144 | null |
2024-05-07 | DMOFC: Discrimination Metric-Optimized Feature Compression | Changsheng Gao et.al. | 2405.04044 | null |
2024-05-06 | Computational ghost imaging with hybrid transforms by integrating Hadamard, discrete cosine, and Haar matrices | Yi-Ning Zhao et.al. | 2405.03729 | null |
2024-05-06 | A Rate-Distortion-Classification Approach for Lossy Image Compression | Yuefeng Zhang et.al. | 2405.03500 | null |
2024-05-06 | Structure-Preserving Network Compression Via Low-Rank Induced Training Through Linear Layers Composition | Xitong Zhang et.al. | 2405.03089 | null |
2024-05-04 | Deep Pulse-Signal Magnification for remote Heart Rate Estimation in Compressed Videos | Joaquim Comas et.al. | 2405.02652 | null |
2024-05-06 | Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator Design | Jian Meng et.al. | 2405.01775 | link |
2024-05-02 | PointCompress3D -- A Point Cloud Compression Framework for Roadside LiDARs in Intelligent Transportation Systems | Walter Zimmer et.al. | 2405.01750 | null |
2024-04-28 | Lightweight Conceptual Dictionary Learning for Text Classification Using Information Compression | Li Wan et.al. | 2405.01584 | null |
2024-05-02 | GroupedMixer: An Entropy Model with Group-wise Token-Mixers for Learned Image Compression | Daxin Li et.al. | 2405.01170 | null |
2024-04-30 | Analysis and Enhancement of Lossless Image Compression in JPEG-XL | Rustam Mamedov et.al. | 2404.19755 | null |
2024-04-30 | EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization | Jianzong Wang et.al. | 2404.19214 | null |
2024-04-29 | Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior | Zhiyuan Li et.al. | 2404.18820 | link |
2024-04-28 | Joint Reference Frame Synthesis and Post Filter Enhancement for Versatile Video Coding | Weijie Bao et.al. | 2404.18058 | null |
2024-04-25 | Learning Visuotactile Skills with Two Multifingered Hands | Toru Lin et.al. | 2404.16823 | link |
2024-04-24 | Domain Adaptation for Learned Image Compression with Supervised Adapters | Alberto Presta et.al. | 2404.15591 | link |
2024-04-23 | One-Pass Randomized Algorithm with Practical Rangefinder for Low-Rank Approximation to Quaternion Matrices | Chao Chang et.al. | 2404.14783 | link |
2024-04-22 | Neural Compress-and-Forward for the Relay Channel | Ezgi Ozyilkan et.al. | 2404.14594 | null |
2024-04-22 | Taming Server Memory TCO with Multiple Software-Defined Compressed Tiers | Sandeep Kumar et.al. | 2404.13886 | null |
2024-04-20 | HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression | Lei Lu et.al. | 2404.13372 | null |
2024-04-18 | Image Compression and Reconstruction Based on Quantum Network | Xun Ji et.al. | 2404.11994 | null |
2024-04-17 | Spatio-Temporal Motion Retargeting for Quadruped Robots | Taerim Yoon et.al. | 2404.11557 | null |
2024-04-17 | Multi-resolution Rescored ByteTrack for Video Object Detection on Ultra-low-power Embedded Systems | Luca Bompani et.al. | 2404.11488 | link |
2024-04-17 | Image Generative Semantic Communication with Multi-Modal Similarity Estimation for Resource-Limited Networks | Eri Hosonuma et.al. | 2404.11280 | null |
2024-04-16 | Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning | Kyle Hsu et.al. | 2404.10282 | link |
2024-04-16 | Compressible and Searchable: AI-native Multi-Modal Retrieval System with Learned Image Compression | Jixiang Luo et.al. | 2404.10234 | null |
2024-04-15 | One-Click Upgrade from 2D to 3D: Sandwiched RGB-D Video Compression for Stereoscopic Teleconferencing | Yueyu Hu et.al. | 2404.09979 | null |
2024-04-15 | Quantization of Large Language Models with an Overdetermined Basis | Daniil Merkulov et.al. | 2404.09737 | null |
2024-04-18 | Post-Training Network Compression for 3D Medical Image Segmentation: Reducing Computational Efforts via Tucker Decomposition | Tobias Weber et.al. | 2404.09683 | link |
2024-04-15 | MarsQE: Semantic-Informed Quality Enhancement for Compressed Martian Image | Chengfeng Liu et.al. | 2404.09433 | null |
2024-04-17 | Incremental data compression for PDE-constrained optimization with a data assimilation application | Xuejian Li et.al. | 2404.09323 | null |
2024-04-14 | A Joint Data Compression and Time-Delay Estimation Method For Distributed Systems via Extremum Encoding | Amir Weiss et.al. | 2404.09244 | null |
2024-04-12 | Lossy Image Compression with Foundation Diffusion Models | Lucas Relic et.al. | 2404.08580 | null |
2024-04-12 | Mitigating Challenges of the Space Environment for Onboard Artificial Intelligence: Design Overview of the Imaging Payload on SpIRIT | Miguel Ortiz del Castillo et.al. | 2404.08399 | null |
2024-04-11 | Video Compression Beyond VVC: Quantitative Analysis of Intra Coding Tools in Enhanced Compression Model (ECM) | Mohsen Abdoli et.al. | 2404.07872 | null |
2024-04-11 | Learning to Classify New Foods Incrementally Via Compressed Exemplars | Justin Yang et.al. | 2404.07507 | null |
2024-04-14 | A comparison between Shapefit compression and Full-Modelling method with PyBird for DESI 2024 and beyond | Y. Lai et.al. | 2404.07283 | link |
2024-04-10 | Exploring Repetitiveness Measures for Two-Dimensional Strings | Giuseppe Romana et.al. | 2404.07030 | null |
2024-04-10 | Fine color guidance in diffusion models and its application to image compression at extremely low bitrates | Tom Bordin et.al. | 2404.06865 | null |
2024-04-09 | Encoder-Quantization-Motion-based Video Quality Metrics | Yixu Chen et.al. | 2404.06620 | null |
2024-04-09 | DiffHarmony: Latent Diffusion Model Meets Image Harmonization | Pengfei Zhou et.al. | 2404.06139 | link |
2024-04-09 | Communication-Efficient Large-Scale Distributed Deep Learning: A Comprehensive Survey | Feng Liang et.al. | 2404.06114 | null |
2024-04-09 | Image and Video Compression using Generative Sparse Representation with Fidelity Controls | Wei Jiang et.al. | 2404.06076 | null |
2024-04-07 | Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder | Yiyang Ma et.al. | 2404.04916 | null |
2024-04-07 | Task-Aware Encoder Control for Deep Video Compression | Xingtong Ge et.al. | 2404.04848 | null |
2024-04-06 | Power-Efficient Image Storage: Leveraging Super Resolution Generative Adversarial Network for Sustainable Compression and Reduced Carbon Footprint | Ashok Mondal et.al. | 2404.04642 | null |
2024-04-05 | ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing | Alec Helbling et.al. | 2404.04376 | link |
2024-04-03 | Convolutional variational autoencoders for secure lossy image compression in remote sensing | Alessandro Giuliano et.al. | 2404.03696 | null |
2024-03-25 | RL for Consistency Models: Faster Reward Guided Text-to-Image Generation | Owen Oertell et.al. | 2404.03673 | link |
2024-04-04 | Training LLMs over Neurally Compressed Text | Brian Lester et.al. | 2404.03626 | null |
2024-04-04 | Leveraging Interpolation Models and Error Bounds for Verifiable Scientific Machine Learning | Tyler Chang et.al. | 2404.03586 | link |
2024-04-04 | Semantic Compression with Information Lattice Learning | Haizi Yu et.al. | 2404.03131 | null |
2024-04-01 | Accounting for contact network uncertainty in epidemic inferences with Approximate Bayesian Computation | Maxwell H. Wang et.al. | 2404.02924 | null |
2024-04-03 | Building test batteries based on analysing random number generator tests within the framework of algorithmic information theory | Boris Ryabko et.al. | 2404.02708 | null |
2024-04-03 | Optimizing traffic signs and lights visibility for the teleoperation of autonomous vehicles through ROI compression | I. Dror et.al. | 2404.02481 | null |
2024-04-03 | MOPAR: A Model Partitioning Framework for Deep Learning Inference Services on Serverless Platforms | Jiaang Duan et.al. | 2404.02445 | null |
2024-04-02 | NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation | Sicheng Li et.al. | 2404.02185 | null |
2024-04-01 | The Rate-Distortion-Perception Trade-off: The Role of Private Randomness | Yassine Hamdi et.al. | 2404.01111 | null |
2024-03-31 | Metric dimensions of generalized Sierpiński graphs over squares | Savari Prabhu et.al. | 2404.00771 | null |
2024-03-27 | Computationally and Memory-Efficient Robust Predictive Analytics Using Big Data | Daniel Menges et.al. | 2403.19721 | null |
2024-03-28 | RootInteractive tool for multidimensional statistical analysis, machine learning and analytical model validation | Marian Invanov et.al. | 2403.19330 | null |
2024-03-28 | Uncertainty-Aware Deep Video Compression with Ensembles | Wufei Ma et.al. | 2403.19158 | null |
2024-04-08 | Neural Embedding Compression For Efficient Multi-Task Earth Observation Modelling | Carlos Gomes et.al. | 2403.17886 | link |
2024-03-26 | Low-Latency Neural Stereo Streaming | Qiqi Hou et.al. | 2403.17879 | null |
2024-03-26 | Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUs | Kai Yuan et.al. | 2403.17607 | link |
2024-03-25 | Neural Image Compression with Quantization Rectifier | Wei Luo et.al. | 2403.17236 | null |
2024-03-25 | Invertible Diffusion Models for Compressed Sensing | Bin Chen et.al. | 2403.17006 | null |
2024-03-25 | Virtual Cylindrical PET for Efficient DOI Image Reconstruction with Sub-millimetre Resolution | Francisco E Enríquez-Mier-y-Terán et.al. | 2403.16465 | null |
2024-03-25 | Impact of Video Compression Artifacts on Fisheye Camera Visual Perception Tasks | Madhumitha Sakthi et.al. | 2403.16338 | null |
2024-03-24 | Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis | Atefeh Khoshkhahtinat et.al. | 2403.16258 | null |
2024-03-23 | Understanding The Effectiveness of Lossy Compression in Machine Learning Training Sets | Robert Underwood et.al. | 2403.15953 | null |
2024-03-23 | Droplet shape representation using Fourier series and autoencoders | Mihir Durve et.al. | 2403.15797 | null |
2024-03-21 | S2LIC: Learned Image Compression with the SwinV2 Block, Adaptive Channel-wise and Global-inter Attention Context | Yongqiang Wang et.al. | 2403.14471 | link |
2024-03-21 | Tensor network compressibility of convolutional models | Sukhbinder Singh et.al. | 2403.14379 | null |
2024-03-26 | Powerful Lossy Compression for Noisy Images | Shilv Cai et.al. | 2403.14135 | null |
2024-03-20 | String attractors and bi-infinite words | Pierre Béaur et.al. | 2403.13449 | null |
2024-03-19 | Super-High-Fidelity Image Compression via Hierarchical-ROI and Adaptive Quantization | Jixiang Luo et.al. | 2403.13030 | null |
2024-03-19 | Privacy-Preserving Face Recognition Using Trainable Feature Subtraction | Yuxi Mi et.al. | 2403.12457 | link |
2024-03-19 | VQ-NeRV: A Vector Quantized Neural Representation for Videos | Yunjie Xu et.al. | 2403.12401 | link |
2024-03-18 | Encoding of linear kinetic plasma problems in quantum circuits via data compression | Ivan Novikau et.al. | 2403.11989 | null |
2024-03-18 | Object Segmentation-Assisted Inter Prediction for Versatile Video Coding | Zhuoyuan Li et.al. | 2403.11694 | null |
2024-03-18 | Overfitted image coding at reduced complexity | Théophile Blard et.al. | 2403.11651 | link |
2024-03-18 | Hierarchical Frequency-based Upsampling and Refining for Compressed Video Quality Enhancement | Qianyu Zhang et.al. | 2403.11556 | null |
2024-03-18 | Earth+: on-board satellite imagery compression leveraging historical earth observations | Kuntai Du et.al. | 2403.11434 | null |
2024-03-17 | Fidelity-preserving Learning-Based Image Compression: Loss Function and Subjective Evaluation Methodology | Shima Mohammadi et.al. | 2403.11241 | link |
2024-03-16 | Channel-wise Feature Decorrelation for Enhanced Learned Image Compression | Farhad Pakdaman et.al. | 2403.10936 | null |
2024-03-16 | NARRATE: Versatile Language Architecture for Optimal Control in Robotics | Seif Ismail et.al. | 2403.10762 | link |
2024-03-15 | Process-and-Forward: Deep Joint Source-Channel Coding Over Cooperative Relay Networks | Chenghong Bian et.al. | 2403.10613 | null |
2024-03-15 | CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement | Qiang Zhu et.al. | 2403.10362 | null |
2024-03-15 | Interactive Distance Field Mapping and Planning to Enable Human-Robot Collaboration | Usama Ali et.al. | 2403.09988 | link |
2024-03-14 | SketchINR: A First Look into Sketches as Implicit Neural Representations | Hmrishav Bandyopadhyay et.al. | 2403.09344 | link |
2024-03-14 | Noise Dimension of GAN: An Image Compression Perspective | Ziran Zhu et.al. | 2403.09196 | null |
2024-03-20 | Content-aware Masked Image Modeling Transformer for Stereo Image Compression | Xinjie Zhang et.al. | 2403.08505 | null |
2024-03-12 | Approaching Rate-Distortion Limits in Neural Compression with Lattice Transform Coding | Eric Lei et.al. | 2403.07320 | null |
2024-03-11 | Grid Monitoring and Protection with Continuous Point-on-Wave Measurements and Generative AI | Lang Tong et.al. | 2403.06942 | null |
2024-03-16 | Enhancing Adversarial Training with Prior Knowledge Distillation for Robust Image Compression | Zhi Cao et.al. | 2403.06700 | null |
2024-03-13 | FSViewFusion: Few-Shots View Generation of Novel Objects | Rukhshanda Hussain et.al. | 2403.06394 | null |
2024-03-10 | Probing Image Compression For Class-Incremental Learning | Justin Yang et.al. | 2403.06288 | null |
2024-03-10 | Blockchain-Enabled Variational Information Bottleneck for IoT Networks | Qiong Wu et.al. | 2403.06129 | link |
2024-03-09 | Wavelet-Like Transform-Based Technology in Response to the Call for Proposals on Neural Network-Based Image Coding | Cunhui Dong et.al. | 2403.05937 | null |
2024-03-07 | Complexity-constrained quantum thermodynamics | Anthony Munson et.al. | 2403.04828 | null |
2024-03-07 | Image Coding for Machines with Edge Information Learning Using Segment Anything | Takahiro Shindo et.al. | 2403.04173 | link |
2024-03-06 | 3D Diffusion Policy | Yanjie Ze et.al. | 2403.03954 | link |
2024-03-06 | Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer | Naifu Xue et.al. | 2403.03736 | null |
2024-03-06 | ZF Beamforming Tensor Compression for Massive MIMO Fronthaul | Libin Zheng et.al. | 2403.03675 | null |
2024-03-06 | Space Complexity of Euclidean Clustering | Xiaoyi Zhu et.al. | 2403.02971 | null |
2024-03-05 | Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity | Hagyeong Lee et.al. | 2403.02944 | link |
2024-03-05 | Enhancing the Rate-Distortion-Perception Flexibility of Learned Image Codecs with Conditional Diffusion Decoders | Daniele Mari et.al. | 2403.02887 | null |
2024-03-04 | Dark Energy Survey Year 3 results: likelihood-free, simulation-based |
N. Jeffrey et.al. | 2403.02314 | null |
2024-03-04 | Neural Network Assisted Lifting Steps For Improved Fully Scalable Lossy Image Compression in JPEG 2000 | Xinyue Li et.al. | 2403.01647 | link |
2024-03-03 | On the Compressibility of Quantized Large Language Models | Yu Mao et.al. | 2403.01384 | null |
2024-03-02 | Towards Accurate Lip-to-Speech Synthesis in-the-Wild | Sindhu Hegde et.al. | 2403.01087 | null |
2024-03-01 | Region-Adaptive Transform with Segmentation Prior for Image Compression | Yuxi Liu et.al. | 2403.00628 | link |
2024-03-07 | ODVista: An Omnidirectional Video Dataset for super-resolution and Quality Enhancement Tasks | Ahmed Telili et.al. | 2403.00604 | link |
2024-02-29 | Towards Explaining Deep Neural Network Compression Through a Probabilistic Latent Space | Mahsa Mozafari-Nia et.al. | 2403.00155 | null |
2024-02-29 | Deep Network for Image Compressed Sensing Coding Using Local Structural Sampling | Wenxue Cui et.al. | 2402.19111 | null |
2024-02-29 | Variable-Rate Learned Image Compression with Multi-Objective Optimization and Quantization-Reconstruction Offsets | Fatih Kamisli et.al. | 2402.18930 | link |
2024-02-29 | Towards Backward-Compatible Continual Learning of Image Compression | Zhihao Duan et.al. | 2402.18862 | link |
2024-02-29 | Exploration of Learned Lifting-Based Transform Structures for Fully Scalable and Accessible Wavelet-Like Image Compression | Xinyue Li et.al. | 2402.18761 | null |
2024-01-10 | Motion Guided Token Compression for Efficient Masked Video Modeling | Yukun Feng et.al. | 2402.18577 | null |
2024-02-28 | Tokenization Is More Than Compression | Craig W. Schmidt et.al. | 2402.18376 | null |
2024-02-28 | NERV++: An Enhanced Implicit Neural Video Representation | Ahmed Ghorbel et.al. | 2402.18305 | null |
2024-02-28 | Computing Minimal Absent Words and Extended Bispecial Factors with CDAWG Space | Shunsuke Inenaga et.al. | 2402.18090 | null |
2024-03-03 | Towards Optimal Learning of Language Models | Yuxian Gu et.al. | 2402.17759 | null |
2024-02-27 | Gaoyuan Wang et.al. | 2402.17749 | null | |
2024-02-27 | Bit Rate Matching Algorithm Optimization in JPEG-AI Verification Model | Panqi Jia et.al. | 2402.17487 | null |
2024-02-27 | Bit Distribution Study and Implementation of Spatial Quality Map in the JPEG-AI Standardization | Panqi Jia et.al. | 2402.17470 | null |
2024-02-29 | Neural Video Compression with Feature Modulation | Jiahao Li et.al. | 2402.17414 | link |
2024-01-19 | MB-RACS: Measurement-Bounds-based Rate-Adaptive Image Compressed Sensing Network | Yujun Huang et.al. | 2402.16855 | null |
2024-02-29 | MISC: Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model | Chunyi Li et.al. | 2402.16749 | link |
2024-02-26 | Enabling robust sensor network design with data processing and optimization making use of local beehive image and video files | Ephrance Eunice Namugenyi et.al. | 2402.16655 | null |
2024-02-26 | Resolution-Agnostic Neural Compression for High-Fidelity Portrait Video Conferencing via Implicit Radiance Fields | Yifei Li et.al. | 2402.16599 | null |
2024-02-26 | Distortion-Controlled Dithering with Reduced Recompression Rate | Morriel Kasher et.al. | 2402.16447 | null |
2024-02-26 | Adaptive Online Learning of Separable Path Graph Transforms for Intra-prediction | Wen-Yang Lu et.al. | 2402.16371 | null |
2024-02-26 | SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field | Zetian Song et.al. | 2402.16366 | null |
2024-02-24 | Traditional Transformation Theory Guided Model for Learned Image Compression | Zhiyuan Li et.al. | 2402.15744 | null |
2024-02-22 | Distributed Radiance Fields for Edge Video Compression and Metaverse Integration in Autonomous Driving | Eugen Šlapak et.al. | 2402.14642 | null |
2024-02-21 | Exploring the Limits of Semantic Image Compression at Micro-bits per Pixel | Jordan Dotzel et.al. | 2402.13536 | null |
2024-02-20 | Compressing the two-particle Green's function using wavelets: Theory and application to the Hubbard atom | Emin Moghadas et.al. | 2402.13030 | null |
2024-02-20 | RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models | Xinchen Zhang et.al. | 2402.12908 | link |
2024-02-20 | Transformer-based Learned Image Compression for Joint Decoding and Denoising | Yi-Hsin Chen et.al. | 2402.12888 | null |
2024-02-19 | Weakly Supervised Object Detection in Chest X-Rays with Differentiable ROI Proposal Networks and Soft ROI Pooling | Philip Müller et.al. | 2402.11985 | link |
2024-02-18 | 3D Point Cloud Compression with Recurrent Neural Network and Image Compression Methods | Till Beemelmanns et.al. | 2402.11680 | link |
2024-02-18 | Learning to Learn Faster from Human Feedback with Language Model Predictive Control | Jacky Liang et.al. | 2402.11450 | null |
2024-02-17 | TinyLIC-High efficiency lossy image compression method | Gaocheng Ma et.al. | 2402.11164 | null |
2024-02-15 | Analysis of Neural Video Compression Networks for 360-Degree Video Coding | Andy Regensky et.al. | 2402.10257 | null |
2024-02-14 | Reducing Texture Bias of Deep Neural Networks via Edge Enhancing Diffusion | Edgar Heinert et.al. | 2402.09530 | null |
2024-02-14 | A Comprehensive Review of Software and Hardware Energy Efficiency of Video Decoders | Matthias Kränzler et.al. | 2402.09001 | null |
2024-02-14 | Extreme Video Compression with Pre-trained Diffusion Models | Bohan Li et.al. | 2402.08934 | link |
2024-02-14 | Saliency-aware End-to-end Learned Variable-Bitrate 360-degree Image Compression | Oguzhan Gungordu et.al. | 2402.08862 | null |
2024-02-13 | Learned Image Compression with Text Quality Enhancement | Chih-Yu Lai et.al. | 2402.08643 | null |
2024-02-13 | Motion-Adaptive Inference for Flexible Learned B-Frame Compression | M. Akin Yilmaz et.al. | 2402.08550 | null |
2024-02-21 | A Neural-network Enhanced Video Coding Framework beyond ECM | Yanchen Zhao et.al. | 2402.08397 | null |
2024-02-13 | Improving Image Coding for Machines through Optimizing Encoder via Auxiliary Loss | Kei Iino et.al. | 2402.08267 | null |
2024-02-12 | Distributed Compression in the Era of Machine Learning: A Review of Recent Advances | Ezgi Ozyilkan et.al. | 2402.07997 | null |
2024-02-13 | Towards Meta-Pruning via Optimal Transport | Alexander Theus et.al. | 2402.07839 | link |
2024-02-09 | Parameter estimation for quantum jump unraveling | Marco Radaelli et.al. | 2402.06556 | link |
2024-02-07 | RAGE for the Machine: Image Compression with Low-Cost Random Access for Embedded Applications | Christian D. Rask et.al. | 2402.05974 | null |
2024-02-08 | Sandwiched Compression: Repurposing Standard Codecs with Neural Network Wrappers | Onur G. Guleryuz et.al. | 2402.05887 | link |
2024-02-08 | Joint End-to-End Image Compression and Denoising: Leveraging Contrastive Learning and Multi-Scale Self-ONNs | Yuxin Xie et.al. | 2402.05582 | null |
2024-02-05 | TexShape: Information Theoretic Sentence Embedding for Language Models | H. Kaan Kale et.al. | 2402.05132 | link |
2024-02-07 | Compression of Structured Data with Autoencoders: Provable Benefit of Nonlinearities and Depth | Kevin Kögler et.al. | 2402.05013 | null |
2024-02-06 | A Novel Local and Hyper-Local Multicast Services Transmission Scheme for Beyond 5G Networks | Sweta Singh et.al. | 2402.03963 | null |
2024-02-06 | Cool-chic video: Learned video coding with 800 parameters | Thomas Leguay et.al. | 2402.03179 | link |
2024-02-05 | Perceptual Learned Image Compression via End-to-End JND-Based Optimization | Farhad Pakdaman et.al. | 2402.02836 | null |
2024-02-04 | Discovering More Effective Tensor Network Structure Search Algorithms via Large Language Models (LLMs) | Junhua Zeng et.al. | 2402.02456 | link |
2024-03-04 | RecNet: An Invertible Point Cloud Encoding through Range Image Embeddings for Multi-Robot Map Sharing and Reconstruction | Nikolaos Stathoulopoulos et.al. | 2402.02192 | null |
2024-02-03 | Generative Visual Compression: A Review | Bolin Chen et.al. | 2402.02140 | null |
2024-02-23 | Immersive Video Compression using Implicit Neural Representations | Ho Man Kwan et.al. | 2402.01596 | link |
2024-02-02 | Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization | Zhiyu Zhang et.al. | 2402.01380 | null |
2024-02-02 | UCVC: A Unified Contextual Video Compression Framework with Joint P-frame and B-frame Coding | Jiayu Yang et.al. | 2402.01289 | null |
2024-02-02 | Flexible Variational Information Bottleneck: Achieving Diverse Compression with a Single Training | Sota Kudo et.al. | 2402.01238 | link |
2024-02-02 | The O2 software framework and GPU usage in ALICE online and offline reconstruction in Run 3 | Giulio Eulisse et.al. | 2402.01205 | null |
2024-02-01 | Compressed image quality assessment using stacking | S. Farhad Hosseini-Benvidi et.al. | 2402.00993 | null |
2024-02-04 | Evaluating Large Language Models for Generalization and Robustness via Data Compression | Yucheng Li et.al. | 2402.00861 | link |
2024-03-11 | LVC-LGMC: Joint Local and Global Motion Compensation for Learned Video Compression | Wei Jiang et.al. | 2402.00680 | null |
2024-02-01 | Gain of Grain: A Film Grain Handling Toolchain for VVC-based Open Implementations | Vignesh V Menon et.al. | 2402.00622 | null |
2024-01-31 | EPSD: Early Pruning with Self-Distillation for Efficient Model Compression | Dong Chen et.al. | 2402.00084 | null |
2024-01-31 | A Neural Enhancement Post-Processor with a Dynamic AV1 Encoder Configuration Strategy for CLIC 2024 | Darren Ramsook et.al. | 2401.18021 | null |
2024-01-31 | Robustly overfitting latents for flexible neural image compression | Yura Perugachi-Diaz et.al. | 2401.17789 | null |
2024-01-30 | A Group Theoretic Metric for Robot State Estimation Leveraging Chebyshev Interpolation | Varun Agrawal et.al. | 2401.17463 | null |
2024-01-30 | SLIC: A Learned Image Codec Using Structure and Color | Srivatsa Prativadibhayankaram et.al. | 2401.17246 | link |
2024-01-30 | Large Language Model Evaluation via Matrix Entropy | Lai Wei et.al. | 2401.17139 | link |
2024-01-30 | Local integrals of motion in dipole-conserving models with Hilbert space fragmentation | Patrycja Łydżba et.al. | 2401.17097 | null |
2024-01-29 | On Channel Simulation with Causal Rejection Samplers | Daniel Goc et.al. | 2401.16579 | null |
2024-01-29 | Spatial Decomposition and Temporal Fusion based Inter Prediction for Learned Video Compression | Xihua Sheng et.al. | 2401.15864 | null |
2024-01-29 | Bayesian one- and two-sided inference on the local effective dimension | Eduard Belitser et.al. | 2401.15816 | null |
2024-01-28 | Towards Arbitrary-Scale Histopathology Image Super-resolution: An Efficient Dual-branch Framework via Implicit Self-texture Enhancement | Minghong Duan et.al. | 2401.15613 | null |
2024-01-26 | Shadow simulation of quantum processes | Xuanqiang Zhao et.al. | 2401.14934 | null |
2024-01-26 | Study of the gOMP Algorithm for Recovery of Compressed Sensed Hyperspectral Images | Jon Alvarez Justo et.al. | 2401.14786 | null |
2024-01-26 | A Comparative Study of Compressive Sensing Algorithms for Hyperspectral Imaging Reconstruction | Jon Alvarez Justo et.al. | 2401.14762 | null |
2024-01-26 | Residual Quantization with Implicit Neural Codebooks | Iris Huijben et.al. | 2401.14732 | link |
2024-01-25 | Semantic Ensemble Loss and Latent Refinement for High-Fidelity Neural Image Compression | Daxin Li et.al. | 2401.14007 | null |
2024-02-07 | Perceptual-oriented Learned Image Compression with Dynamic Kernel | Nianxiang Fu et.al. | 2401.13967 | null |
2024-01-25 | Conditional Neural Video Coding with Spatial-Temporal Super-Resolution | Henan Wang et.al. | 2401.13959 | null |
2024-01-24 | FLLIC: Functionally Lossless Image Compression | Xi Zhang et.al. | 2401.13616 | null |
2024-01-23 | Fast Implicit Neural Representation Image Codec in Resource-limited Devices | Xiang Liu et.al. | 2401.12587 | null |
2024-01-22 | PairwiseHist: Fast, Accurate and Space-Efficient Approximate Query Processing with Data Compression | Aaron Hurst et.al. | 2401.12018 | null |
2024-01-22 | A Training-Free Defense Framework for Robust Learned Image Compression | Myungseo Song et.al. | 2401.11902 | null |
2024-01-21 | Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding | Yichi Zhang et.al. | 2401.11615 | null |
2024-01-21 | ColorVideoVDP: A visual difference predictor for image, video and display distortions | Rafal K. Mantiuk et.al. | 2401.11485 | link |
2024-01-21 | Data-driven compression of electron-phonon interactions | Yao Luo et.al. | 2401.11393 | null |
2024-01-20 | Learned Image Compression with Dual-Branch Encoder and Conditional Information Coding | Haisheng Fu et.al. | 2401.11093 | null |
2024-01-19 | NN-VVC: Versatile Video Coding boosted by self-supervisedly learned image coding for machines | Jukka I. Ahonen et.al. | 2401.10761 | null |
2024-01-19 | Bridging the gap between image coding for machines and humans | Nam Le et.al. | 2401.10732 | null |
2024-01-18 | Attack and Defense Analysis of Learned Image Compression | Tianyu Zhu et.al. | 2401.10345 | null |
2024-01-18 | Explaining the Implicit Neural Canvas: Connecting Pixels to Neurons by Tracing their Contributions | Namitha Padmanabhan et.al. | 2401.10217 | null |
2024-01-18 | Depth Over RGB: Automatic Evaluation of Open Surgery Skills Using Depth Camera | Ido Zuckerman et.al. | 2401.10037 | null |
2024-01-18 | Memory Efficient Corner Detection for Event-driven Dynamic Vision Sensors | Pao-Sheng Vincent Sun et.al. | 2401.09797 | null |
2024-01-18 | Compressing MIMO Channel Submatrices with Tucker Decomposition: Enabling Efficient Storage and Reducing SINR Computation Overhead | Yuanwei Zhang et.al. | 2401.09792 | null |
2024-01-17 | Idempotence and Perceptual Image Compression | Tongda Xu et.al. | 2401.08920 | link |
2024-01-16 | End-to-End Optimized Image Compression with the Frequency-Oriented Transform | Yuefeng Zhang et.al. | 2401.08194 | null |
2024-01-17 | Learned Image Compression with ROI-Weighted Distortion and Bit Allocation | Wei Jiang et.al. | 2401.08154 | null |
2024-01-15 | Convolutional Neural Network Compression via Dynamic Parameter Rank Pruning | Manish Sharma et.al. | 2401.08014 | null |
2024-01-15 | Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image Models | Dan Jacobellis et.al. | 2401.07957 | link |
2024-01-14 | Exploring Compressed Image Representation as a Perceptual Proxy: A Study | Chen-Hsiu Huang et.al. | 2401.07200 | link |
2024-01-13 | Progressive Feature Fusion Network for Enhancing Image Quality Assessment | Kaiqun Wu et.al. | 2401.06992 | null |
2024-01-12 | Efficient Parallel Algorithms for Inpainting-Based Representations of 4K Images -- Part II: Spatial and Tonal Data Optimization | Niklas Kämper et.al. | 2401.06747 | null |
2024-03-18 | LiDAR Depth Map Guided Image Compression Model | Alessandro Gnutti et.al. | 2401.06517 | null |
2024-01-11 | Transformer Masked Autoencoders for Next-Generation Wireless Communications: Architecture and Opportunities | Abdullah Zayat et.al. | 2401.06274 | null |
2024-01-11 | MGARD: A multigrid framework for high-performance, error-controlled data compression and refactoring | Qian Gong et.al. | 2401.05994 | null |
2024-01-10 | SnapCap: Efficient Snapshot Compressive Video Captioning | Jianqiao Sun et.al. | 2401.04903 | null |
2024-01-09 | Modified Levenberg-Marquardt Algorithm For Tensor CP Decomposition in Image Compression | Ramin Goudarzi Karim et.al. | 2401.04670 | null |
2024-01-09 | Optimal Transcoding Resolution Prediction for Efficient Per-Title Bitrate Ladder Estimation | Jinhai Yang et.al. | 2401.04405 | null |
2024-01-08 | Low-light Image Enhancement via CLIP-Fourier Guided Wavelet Diffusion | Minglong Xue et.al. | 2401.03788 | link |
2024-01-08 | A Video Coding Method Based on Neural Network for CLIC2024 | Zhengang Li et.al. | 2401.03623 | null |
2024-01-06 | Spatiotemporally adaptive compression for scientific dataset with feature preservation -- a case study on simulation data with extreme climate events analysis | Qian Gong et.al. | 2401.03317 | null |
2024-01-06 | Comparison of spectrum models as applied to single-particle |
Thomas A. Trainor et.al. | 2401.03290 | null |
2024-01-06 | Transferable Learned Image Compression-Resistant Adversarial Perturbations | Yang Sui et.al. | 2401.03115 | null |
2024-01-05 | MsDC-DEQ-Net: Deep Equilibrium Model (DEQ) with Multi-scale Dilated Convolution for Image Compressive Sensing (CS) | Youhao Yu et.al. | 2401.02884 | null |
2024-03-08 | Importance Matching Lemma for Lossy Compression with Side Information | Buu Phan et.al. | 2401.02609 | null |
2024-01-04 | Cool-Chic: Perceptually Tuned Low Complexity Overfitted Image Coder | Théo Ladune et.al. | 2401.02156 | link |
2024-01-04 | ED: Perceptually tuned Enhanced Compression Model | Pierrick Philippe et.al. | 2401.02145 | null |
2024-01-02 | NU-Class Net: A Novel Deep Learning-based Approach for Video Quality Enhancement | Parham Zilouchian Moghaddam et.al. | 2401.01163 | null |
2024-01-28 | Higher-Order Cellular Automata Generated Symmetry-Protected Topological Phases and Detection Through Multi-Point Strange Correlators | Jie-Yu Zhang et.al. | 2401.00505 | null |
2023-12-28 | Selective Run-Length Encoding | Xutan Peng et.al. | 2312.17024 | null |
2023-12-29 | FFCA-Net: Stereo Image Compression via Fast Cascade Alignment of Side Information | Yichong Xia et.al. | 2312.16963 | null |
2023-12-26 | Range Entropy Queries and Partitioning | Sanjay Krishnan et.al. | 2312.15959 | null |
2023-12-25 | MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression | Yi-Hsin Chen et.al. | 2312.15829 | null |
2023-12-25 | On Robust Wasserstein Barycenter: The Model and Algorithm | Xu Wang et.al. | 2312.15762 | null |
2023-12-25 | Scalable Face Image Coding via StyleGAN Prior: Towards Compression for Human-Machine Collaborative Vision | Qi Mao et.al. | 2312.15622 | null |
2023-12-22 | The Rate-Distortion-Perception-Classification Tradeoff: Joint Source Coding and Modulation via Inverse-Domain GANs | Junli Fang et.al. | 2312.14792 | null |
2024-01-09 | Enhanced Color Palette Modeling for Lossless Screen Content Compression | Hannah Och et.al. | 2312.14491 | null |
2023-12-30 | Efficient Communication in Federated Learning Using Floating-Point Lossy Compression | Grant Wilkins et.al. | 2312.13461 | null |
2023-12-19 | A Huffman based short message service compression technique using adjacent distance array | Pranta Sarker et.al. | 2312.12495 | null |
2023-12-19 | Full-reference Video Quality Assessment for User Generated Content Transcoding | Zihao Qi et.al. | 2312.12317 | null |
2023-12-19 | Low-Consumption Partial Transcoding by HEVC | Mohsen Abdoli et.al. | 2312.12174 | link |
2023-12-19 | Comparative Study of Hardware and Software Power Measurements in Video Compression | Angeliki Katsenou et.al. | 2312.12150 | null |
2023-12-18 | Blind-Touch: Homomorphic Encryption-Based Distributed Neural Network Inference for Privacy-Preserving Fingerprint Authentication | Hyunmin Choi et.al. | 2312.11575 | link |
2024-01-11 | Quantized Decoder in Learned Image Compression for Deterministic Reconstruction | Esin Koyuncu et.al. | 2312.11209 | null |
2023-12-19 | A Computationally Efficient Neural Video Compression Accelerator Based on a Sparse CNN-Transformer Hybrid Network | Siyu Zhang et.al. | 2312.10716 | null |
2023-12-17 | IntraSeismic: a coordinate-based learning approach to seismic inversion | Juan Romero et.al. | 2312.10568 | null |
2023-12-17 | Light-weight CNN-based VVC Inter Partitioning Acceleration | Yiqun Liu et.al. | 2312.10567 | null |
2023-12-16 | Statistical Analysis of Inter Coding in VVC Test Model (VTM) | Yiqun Liu et.al. | 2312.10406 | null |
2023-12-15 | IQNet: Image Quality Assessment Guided Just Noticeable Difference Prefiltering For Versatile Video Coding | Yu-Han Sun et.al. | 2312.09799 | null |
2023-12-15 | Towards Neuromorphic Compression based Neural Sensing for Next-Generation Wireless Implantable Brain Machine Interface | Vivek Mohan et.al. | 2312.09503 | null |
2023-12-14 | Geometry-Corrected Geodesic Motion Modeling with Per-Frame Camera Motion for 360-Degree Video Compression | Andy Regensky et.al. | 2312.09266 | link |
2023-12-14 | Efficient Online Learning of Contact Force Models for Connector Insertion | Kevin Tracy et.al. | 2312.09190 | null |
2023-12-13 | Balanced and Deterministic Weight-sharing Helps Network Performance | Oscar Chang et.al. | 2312.08401 | null |
2023-12-13 | Preparing VVC for Streaming: A Fast Multi-Rate Encoding Approach | Yiqun Liu et.al. | 2312.08330 | null |
2023-12-13 | CenterGrasp: Object-Aware Implicit Representation Learning for Simultaneous Shape Reconstruction and 6-DoF Grasp Estimation | Eugenio Chisari et.al. | 2312.08240 | null |
2023-12-13 | Explainable Trajectory Representation through Dictionary Learning | Yuanbo Tang et.al. | 2312.08052 | null |
2023-12-12 | Deep Hierarchical Video Compression | Ming Lu et.al. | 2312.07126 | null |
2023-12-12 | Communication Cost Reduction for Subgraph Counting under Local Differential Privacy via Hash Functions | Quentin Hillebrand et.al. | 2312.07055 | link |
2023-12-11 | RAFIC: Retrieval-Augmented Few-shot Image Classification | Hangfei Lin et.al. | 2312.06868 | link |
2023-12-11 | A New Projection Pursuit Index for Big Data | Yajie Duan et.al. | 2312.06465 | null |
2023-12-11 | Variational Auto-Encoder Based Deep Learning Technique For Filling Gaps in Reacting PIV Data | Shashank Yellapantula et.al. | 2312.06461 | null |
2023-12-07 | Analysis of Coding Gain Due to In-Loop Reshaping | Chau-Wai Wong et.al. | 2312.04022 | null |
2023-12-05 | C3: High-performance and low-complexity neural compression from a single image or video | Hyunjik Kim et.al. | 2312.02753 | null |
2023-12-05 | Unified learning-based lossy and lossless JPEG recompression | Jianghui Zhang et.al. | 2312.02705 | null |
2023-12-05 | Accelerating Learnt Video Codecs with Gradient Decay and Layer-wise Distillation | Tianhao Peng et.al. | 2312.02605 | null |
2023-12-04 | Hyperspectral Image Compression Using Sampling and Implicit Neural Representations | Shima Rezasoltani et.al. | 2312.01558 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-09-18 | An Efficient Projection-Based Next-best-view Planning Framework for Reconstruction of Unknown Objects | Zhizhou Jia et.al. | 2409.12096 | null |
2024-09-18 | Dense-TSNet: Dense Connected Two-Stage Structure for Ultra-Lightweight Speech Enhancement | Zizhen Lin et.al. | 2409.11725 | null |
2024-09-18 | DAF-Net: A Dual-Branch Feature Decomposition Fusion Network with Domain Adaptive for Infrared and Visible Image Fusion | Jian Xu et.al. | 2409.11642 | link |
2024-09-17 | Noise-aware Dynamic Image Denoising and Positron Range Correction for Rubidium-82 Cardiac PET Imaging via Self-supervision | Huidong Xie et.al. | 2409.11543 | null |
2024-09-17 | Online 4D Ultrasound-Guided Robotic Tracking Enables 3D Ultrasound Localisation Microscopy with Large Tissue Displacements | Jipeng Yan et.al. | 2409.11391 | null |
2024-09-17 | Ultrasound Image Enhancement with the Variance of Diffusion Models | Yuxin Zhang et.al. | 2409.11380 | link |
2024-09-17 | Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks | Edgar Heinert et.al. | 2409.11373 | null |
2024-09-17 | Edge-based Denoising Image Compression | Ryugo Morita et.al. | 2409.10978 | null |
2024-09-17 | CUNSB-RFIE: Context-aware Unpaired Neural Schrödinger Bridge in Retinal Fundus Image Enhancement | Xuanzhao Dong et.al. | 2409.10966 | null |
2024-09-17 | Towards Effective User Attribution for Latent Diffusion Models via Watermark-Informed Blending | Yongyang Pan et.al. | 2409.10958 | null |
2024-09-17 | Neural Fields for Adaptive Photoacoustic Computed Tomography | Tianao Li et.al. | 2409.10876 | null |
2024-09-16 | Investigating Training Objectives for Generative Speech Enhancement | Julius Richter et.al. | 2409.10753 | null |
2024-09-16 | Taming Diffusion Models for Image Restoration: A Review | Ziwei Luo et.al. | 2409.10353 | null |
2024-09-16 | FGR-Net:Interpretable fundus imagegradeability classification based on deepreconstruction learning | Saif Khalid et.al. | 2409.10246 | null |
2024-09-16 | RF-GML: Reference-Free Generative Machine Listener | Arijit Biswas et.al. | 2409.10210 | null |
2024-09-16 | Towards Explainable Automated Data Quality Enhancement without Domain Knowledge | Djibril Sarr et.al. | 2409.10139 | null |
2024-09-16 | 2S-ODIS: Two-Stage Omni-Directional Image Synthesis by Geometric Distortion Correction | Atsuya Nakata et.al. | 2409.09969 | link |
2024-09-15 | A Global Perspective on the Past, Present, and Future of Video Streaming over Starlink | Liz Izhikevich et.al. | 2409.09846 | null |
2024-09-15 | Underwater Image Enhancement via Dehazing and Color Restoration | Chengqin Wu et.al. | 2409.09779 | null |
2024-09-15 | High Definition Map Mapping and Update: A General Overview and Future Directions | Benny Wijaya et.al. | 2409.09726 | null |
2024-09-15 | Superconducting and low temperature RF Coils for Ultra-Low-Field MRI: A Study on SNR Performance | Aditya A Bhosale et.al. | 2409.09608 | null |
2024-09-14 | Estimating Neural Orientation Distribution Fields on High Resolution Diffusion MRI Scans | Mohammed Munzer Dwedari et.al. | 2409.09387 | link |
2024-09-13 | Emerging Reliance Behaviors in Human-AI Text Generation: Hallucinations, Data Quality Assessment, and Cognitive Forcing Functions | Zahra Ashktorab et.al. | 2409.08937 | null |
2024-09-13 | Confocal Raman Microscopy with Adaptive Optics | J. D. Munoz-Bolanos et.al. | 2409.08725 | null |
2024-09-13 | Joint image reconstruction and segmentation of real-time cardiac MRI in free-breathing using a model based on disentangled representation learning | Tobias Wech et.al. | 2409.08619 | null |
2024-09-13 | DiffFAS: Face Anti-Spoofing via Generative Diffusion Models | Xinxu Ge et.al. | 2409.08572 | link |
2024-09-13 | CasDyF-Net: Image Dehazing via Cascaded Dynamic Filters | Wang Yinglong et.al. | 2409.08510 | link |
2024-09-12 | OpenACE: An Open Benchmark for Evaluating Audio Coding Performance | Jozef Coldenhoff et.al. | 2409.08374 | null |
2024-09-12 | Expansive Supervision for Neural Radiance Field | Weixiang Zhang et.al. | 2409.08056 | null |
2024-09-12 | OCTAMamba: A State-Space Model Approach for Precision OCTA Vasculature Segmentation | Shun Zou et.al. | 2409.08000 | link |
2024-09-14 | Exploring Kolmogorov-Arnold networks for realistic image sharpness assessment | Shaode Yu et.al. | 2409.07762 | null |
2024-09-11 | Foundation Models Boost Low-Level Perceptual Similarity Metrics | Abhijay Ghildyal et.al. | 2409.07650 | null |
2024-09-11 | Machine Learning and Constraint Programming for Efficient Healthcare Scheduling | Aymen Ben Said et.al. | 2409.07547 | null |
2024-09-11 | FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process | Yang Luo et.al. | 2409.07451 | null |
2024-09-11 | EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion | Jian Zhang et.al. | 2409.07255 | null |
2024-09-12 | 3DGCQA: A Quality Assessment Database for 3D AI-Generated Contents | Yingjie Zhou et.al. | 2409.07236 | null |
2024-09-11 | Phantom-based gradient waveform measurements with compensated variable-prephasing: Description and application to EPI at 7T | Hannah Scholten et.al. | 2409.07203 | null |
2024-09-11 | Attention Down-Sampling Transformer, Relative Ranking and Self-Consistency for Blind Image Quality Assessment | Mohammed Alsaafin et.al. | 2409.07115 | link |
2024-09-11 | CPSample: Classifier Protected Sampling for Guarding Training Data During Diffusion | Joshua Kazdan et.al. | 2409.07025 | null |
2024-09-11 | AdvLogo: Adversarial Patch Attack against Object Detectors based on Diffusion Models | Boming Miao et.al. | 2409.07002 | null |
2024-09-10 | ExIQA: Explainable Image Quality Assessment Using Distortion Attributes | Sepehr Kazemi Ranjbar et.al. | 2409.06853 | null |
2024-09-10 | Universal End-to-End Neural Network for Lossy Image Compression | Bouzid Arezki et.al. | 2409.06586 | null |
2024-09-10 | Three-dimensional generative adversarial networks for turbulent flow estimation from wall measurements | Antonio Cuéllar et.al. | 2409.06548 | null |
2024-09-11 | AMNS: Attention-Weighted Selective Mask and Noise Label Suppression for Text-to-Image Person Retrieval | Runqing Zhang et.al. | 2409.06385 | null |
2024-09-10 | Multi-Weather Image Restoration via Histogram-Based Transformer Feature Enhancement | Yang Wen et.al. | 2409.06334 | null |
2024-09-10 | DeWinder: Single-Channel Wind Noise Reduction using Ultrasound Sensing | Kuang Yuan et.al. | 2409.06137 | null |
2024-09-09 | Enhancing Cross-Modality Synthesis: Subvolume Merging for MRI-to-CT Conversion | Fuxin Fan et.al. | 2409.05982 | null |
2024-09-09 | SynMorph: Generating Synthetic Face Morphing Dataset with Mated Samples | Haoyu Zhang et.al. | 2409.05595 | null |
2024-09-09 | Efficient Quality Estimation of True Random Bit-streams | Cesare Caratozzolo et.al. | 2409.05543 | null |
2024-09-09 | Exploring Rich Subjective Quality Information for Image Quality Assessment in the Wild | Xiongkuo Min et.al. | 2409.05540 | null |
2024-09-09 | A Taxonomy of Miscompressions: Preparing Image Forensics for Neural Compression | Nora Hofer et.al. | 2409.05490 | null |
2024-09-09 | Boosting CLIP Adaptation for Image Quality Assessment via Meta-Prompt Learning and Gradient Regularization | Xudong Li et.al. | 2409.05381 | null |
2024-09-09 | PersonaTalk: Bring Attention to Your Persona in Visual Dubbing | Longhao Zhang et.al. | 2409.05379 | null |
2024-09-09 | BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec | Detai Xin et.al. | 2409.05377 | null |
2024-09-09 | Adaptive Offloading and Enhancement for Low-Light Video Analytics on Mobile Devices | Yuanyi He et.al. | 2409.05297 | null |
2024-09-08 | Ultron: Enabling Temporal Geometry Compression of 3D Mesh Sequences using Temporal Correspondence and Mesh Deformation | Haichao Zhu et.al. | 2409.05151 | null |
2024-09-07 | Plug-and-Hide: Provable and Adjustable Diffusion Generative Steganography | Jiahao Zhu et.al. | 2409.04878 | null |
2024-09-07 | Metadata augmented deep neural networks for wild animal classification | Aslak Tøn et.al. | 2409.04825 | link |
2024-09-11 | Fisheye-GS: Lightweight and Extensible Gaussian Splatting Module for Fisheye Cameras | Zimu Liao et.al. | 2409.04751 | link |
2024-09-06 | Whole Heart Perfusion with High-Multiband Simultaneous Multislice Imaging via Linear Phase Modulated Extended Field of View (SMILE) | Shen Zhao et.al. | 2409.04353 | null |
2024-09-06 | Design and Characterization of MRI-compatible Plastic Ultrasonic Motor | Zhanyue Zhao et.al. | 2409.04006 | null |
2024-09-06 | Bi-modality Images Transfer with a Discrete Process Matching Method | Zhe Xiong et.al. | 2409.03977 | null |
2024-09-03 | Applications and Advances of Artificial Intelligence in Music Generation:A Review | Yanxu Chen et.al. | 2409.03715 | null |
2024-09-05 | Enabling Practical and Privacy-Preserving Image Processing | Chao Wang et.al. | 2409.03568 | null |
2024-09-05 | Use of triplet loss for facial restoration in low-resolution images | Sebastian Pulgar et.al. | 2409.03530 | null |
2024-09-05 | Improving Uncertainty-Error Correspondence in Deep Bayesian Medical Image Segmentation | Prerak Mody et.al. | 2409.03470 | link |
2024-09-05 | Multiple weather images restoration using the task transformer and adaptive mixup strategy | Yang Wen et.al. | 2409.03249 | null |
2024-09-05 | Perceptual-Distortion Balanced Image Super-Resolution is a Multi-Objective Optimization Problem | Qiwen Zhu et.al. | 2409.03179 | link |
2024-09-05 | Large Étendue 3D Holographic Display with Content-adpative Dynamic Fourier Modulation | Brian Chao et.al. | 2409.03143 | null |
2024-09-04 | Incorporating dense metric depth into neural 3D representations for view synthesis and relighting | Arkadeep Narayan Chaudhury et.al. | 2409.03061 | null |
2024-09-04 | Rate-Adaptive Generative Semantic Communication Using Conditional Diffusion Models | Pujing Yang et.al. | 2409.02597 | null |
2024-09-04 | Coral Model Generation from Single Images for Virtual Reality Applications | Jie Fu et.al. | 2409.02376 | null |
2024-09-04 | Image Registration with Averaging Network and Edge-Based Loss for Low-SNR Cardiac MRI | Xuan Lei et.al. | 2409.02348 | null |
2024-09-03 | Coaching a Robotic Sonographer: Learning Robotic Ultrasound with Sparse Expert's Feedback | Deepak Raina et.al. | 2409.02337 | null |
2024-09-03 | Unveiling Deep Shadows: A Survey on Image and Video Shadow Detection, Removal, and Generation in the Era of Deep Learning | Xiaowei Hu et.al. | 2409.02108 | link |
2024-09-03 | AllWeatherNet:Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions | Chenghao Qian et.al. | 2409.02045 | null |
2024-09-03 | Map-Assisted Remote-Sensing Image Compression at Extremely Low Bitrates | Yixuan Ye et.al. | 2409.01935 | link |
2024-09-03 | UWStereo: A Large Synthetic Dataset for Underwater Stereo Matching | Qingxuan Lv et.al. | 2409.01782 | null |
2024-09-03 | Boron Isotope Effects on Raman Scattering in Bulk BN, BP, and BAs: A Density-Functional Theory Study | Nima Ghafari Cherati et.al. | 2409.01671 | null |
2024-09-03 | GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting | Zixuan Guo et.al. | 2409.01581 | null |
2024-09-03 | Learning Task-Specific Sampling Strategy for Sparse-View CT Reconstruction | Liutao Yang et.al. | 2409.01544 | null |
2024-09-03 | Long-Range Biometric Identification in Real World Scenarios: A Comprehensive Evaluation Framework Based on Missions | Deniz Aykac et.al. | 2409.01540 | null |
2024-09-02 | Real-Time Multi-Scene Visibility Enhancement for Promoting Navigational Safety of Vessels Under Complex Weather Conditions | Ryan Wen Liu et.al. | 2409.01500 | link |
2024-09-02 | Spectron: Target Speaker Extraction using Conditional Transformer with Adversarial Refinement | Tathagata Bandyopadhyay et.al. | 2409.01352 | null |
2024-09-02 | A Roadmap to Holographic Focused Ultrasound Approaches to Generate Thermal Patterns | Ceren Cengiz et.al. | 2409.01323 | null |
2024-09-02 | Investigation of the spatial resolution of PET imaging system measuring polarization-correlated Compton events | Ana Marija Kožuljević et.al. | 2409.01238 | null |
2024-09-02 | MobileIQA: Exploiting Mobile-level Diverse Opinion Network For No-Reference Image Quality Assessment Using Knowledge Distillation | Zewen Chen et.al. | 2409.01212 | link |
2024-09-02 | Generating Synthetic Satellite Imagery for Rare Objects: An Empirical Comparison of Models and Metrics | Tuong Vy Nguyen et.al. | 2409.01138 | null |
2024-09-02 | Rapid GPU-Based Pangenome Graph Layout | Jiajie Li et.al. | 2409.00876 | null |
2024-09-01 | An Optimized Binning and Probabilistic Slice Sharing Algorithm for Motion Correction in Abdominal DW-MRI | Michelle Su et.al. | 2409.00798 | null |
2024-08-30 | Subspace Diffusion Posterior Sampling for Travel-Time Tomography | Xiang Cao et.al. | 2408.17333 | null |
2024-08-30 | Image-Perfect Imperfections: Safety, Bias, and Authenticity in the Shadow of Text-To-Image Model Evolution | Yixin Wu et.al. | 2408.17285 | null |
2024-08-30 | LAR-IQA: A Lightweight, Accurate, and Robust No-Reference Image Quality Assessment Model | Nasim Jamshidi Avanaki et.al. | 2408.17057 | link |
2024-08-30 | Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning | Shuyang Zhang et.al. | 2408.17005 | link |
2024-08-29 | Legacy Learning Using Few-Shot Font Generation Models for Automatic Text Design in Metaverse Content: Cases Studies in Korean and Chinese | Younghwi Kim et.al. | 2408.16900 | null |
2024-08-29 | The Continuous Electron Beam Accelerator Facility at 12 GeV | P. A. Adderley et.al. | 2408.16880 | null |
2024-08-29 | MSLIQA: Enhancing Learning Representations for Image Quality Assessment through Multi-Scale Learning | Nasim Jamshidi Avanaki et.al. | 2408.16879 | null |
2024-09-04 | Auto-resolving atomic structure at van der Waal interfaces using a generative model | Wenqiang Huang et.al. | 2408.16802 | link |
2024-09-02 | RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model | Zhuan Shi et.al. | 2408.16634 | null |
2024-09-02 | A Deep-Learning-Based Label-free No-Reference Image Quality Assessment Metric: Application in Sodium MRI Denoising | Shuaiyu Yuan et.al. | 2408.16481 | null |
2024-08-29 | LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-light Image Enhancement | Ye Yu et.al. | 2408.16235 | link |
2024-08-28 | TEDRA: Text-based Editing of Dynamic and Photoreal Actors | Basavaraj Sunagad et.al. | 2408.15995 | null |
2024-08-28 | Segmentation-guided Layer-wise Image Vectorization with Gradient Fills | Hengyu Zhou et.al. | 2408.15741 | link |
2024-08-28 | Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas | Fabio Quattrini et.al. | 2408.15660 | link |
2024-08-28 | Avoiding Generative Model Writer's Block With Embedding Nudging | Ali Zand et.al. | 2408.15450 | null |
2024-09-02 | Pitfalls and Outlooks in Using COMET | Vilém Zouhar et.al. | 2408.15366 | link |
2024-08-27 | Histo-Diffusion: A Diffusion Super-Resolution Method for Digital Pathology with Comprehensive Quality Assessment | Xuan Xu et.al. | 2408.15218 | null |
2024-08-27 | CLIP-AGIQA: Boosting the Performance of AI-Generated Image Quality Assessment with CLIP | Zhenchen Tang et.al. | 2408.15098 | null |
2024-08-27 | Towards Real-world Event-guided Low-light Video Enhancement and Deblurring | Taewoo Kim et.al. | 2408.14916 | link |
2024-08-27 | Alfie: Democratising RGBA Image Generation With No $$$ | Fabio Quattrini et.al. | 2408.14826 | link |
2024-08-27 | Sequential-Scanning Dual-Energy CT Imaging Using High Temporal Resolution Image Reconstruction and Error-Compensated Material Basis Image Generation | Qiaoxin Li et.al. | 2408.14754 | null |
2024-08-26 | Gallery-Aware Uncertainty Estimation For Open-Set Face Recognition | Leonid Erlygin et.al. | 2408.14229 | null |
2024-08-27 | SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher | Trung Dao et.al. | 2408.14176 | null |
2024-08-27 | Improving Water Quality Time-Series Prediction in Hong Kong using Sentinel-2 MSI Data and Google Earth Engine Cloud Computing | Rohin Sood et.al. | 2408.14010 | null |
2024-08-26 | LMM-VQA: Advancing Video Quality Assessment with Large Multimodal Models | Qihang Ge et.al. | 2408.14008 | null |
2024-08-25 | Draw Like an Artist: Complex Scene Generation with Diffusion Model via Composition, Painting, and Retouching | Minghao Liu et.al. | 2408.13858 | null |
2024-08-25 | Guardians of the Machine Translation Meta-Evaluation: Sentinel Metrics Fall In! | Stefano Perrella et.al. | 2408.13831 | link |
2024-08-24 | G3DST: Generalizing 3D Style Transfer with Neural Radiance Fields across Scenes and Styles | Adil Meric et.al. | 2408.13508 | null |
2024-08-23 | ReCon: Reconfiguring Analog Rydberg Atom Quantum Computers for Quantum Generative Adversarial Networks | Nicholas S. DiBrita et.al. | 2408.13389 | link |
2024-08-23 | Re-evaluation of Face Anti-spoofing Algorithm in Post COVID-19 Era Using Mask Based Occlusion Attack | Vaibhav Sundharam et.al. | 2408.13251 | null |
2024-08-23 | ResSR: A Residual Approach to Super-Resolving Multispectral Images | Haley Duba-Sullivan et.al. | 2408.13225 | null |
2024-08-23 | A density ratio framework for evaluating the utility of synthetic data | Thom Benjamin Volker et.al. | 2408.13167 | null |
2024-08-23 | When Diffusion MRI Meets Diffusion Model: A Novel Deep Generative Model for Diffusion MRI Generation | Xi Zhu et.al. | 2408.12897 | null |
2024-08-22 | Variable Stars in M31 Stellar Clusters from the Panchromatic Hubble Andromeda Treasury | Richard Smith et.al. | 2408.12765 | null |
2024-08-22 | Visual Verity in AI-Generated Imagery: Computational Metrics and Human-Centric Analysis | Memoona Aziz et.al. | 2408.12762 | null |
2024-08-22 | Unlocking Intrinsic Fairness in Stable Diffusion | Eunji Kim et.al. | 2408.12692 | null |
2024-08-22 | Developing vocal system impaired patient-aimed voice quality assessment approach using ASR representation-included multiple features | Shaoxiang Dang et.al. | 2408.12279 | null |
2024-08-21 | MBSS-T1: Model-Based Self-Supervised Motion Correction for Robust Cardiac T1 Mapping | Eyal Hanania et.al. | 2408.11992 | null |
2024-08-21 | AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and Results | Maksim Smirnov et.al. | 2408.11982 | link |
2024-08-21 | Estimating Contribution Quality in Online Deliberations Using a Large Language Model | Lodewijk Gelauff et.al. | 2408.11936 | null |
2024-08-21 | FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting | Liyao Jiang et.al. | 2408.11706 | null |
2024-08-21 | Interpretable Long-term Action Quality Assessment | Xu Dong et.al. | 2408.11687 | link |
2024-08-21 | E-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment | Shangkun Sun et.al. | 2408.11481 | link |
2024-08-21 | Fairness measures for biometric quality assessment | André Dörsch et.al. | 2408.11392 | null |
2024-08-21 | Gender Bias Evaluation in Text-to-image Generation: A Survey | Yankun Wu et.al. | 2408.11358 | null |
2024-08-21 | Image Score: Learning and Evaluating Human Preferences for Mercari Search | Chingis Oinar et.al. | 2408.11349 | null |
2024-08-21 | High-quality imaging of large areas through path-difference ptychography | Jizhe Cui et.al. | 2408.11332 | null |
2024-08-21 | Optimizing Transmit Field Inhomogeneity of Parallel RF Transmit Design in 7T MRI using Deep Learning | Zhengyi Lu et.al. | 2408.11323 | null |
2024-08-21 | Transfer Learning and the Early Estimation of Single-Photon Source Quality using Machine Learning Methods | David Jacob Kedziora et.al. | 2408.11322 | link |
2024-08-20 | Compress Guidance in Conditional Diffusion Sampling | Anh-Dung Dinh et.al. | 2408.11194 | null |
2024-08-20 | Prompt-Guided Image-Adaptive Neural Implicit Lookup Tables for Interpretable Image Enhancement | Satoshi Kosugi et.al. | 2408.11055 | link |
2024-08-20 | Denoising Plane Wave Ultrasound Images Using Diffusion Probabilistic Models | Hojat Asgariandehkordi et.al. | 2408.10987 | null |
2024-08-20 | Influence of Medical Foreign Bodies on Dark-Field Chest Radiographs: First experiences | Lennard Kaster et.al. | 2408.10855 | null |
2024-08-19 | Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation | Liu He et.al. | 2408.10453 | null |
2024-08-19 | Perceptual Depth Quality Assessment of Stereoscopic Omnidirectional Images | Wei Zhou et.al. | 2408.10134 | null |
2024-08-19 | Sliced Maximal Information Coefficient: A Training-Free Approach for Image Quality Assessment Enhancement | Kang Xiao et.al. | 2408.09920 | link |
2024-08-19 | Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation | Yunxin Li et.al. | 2408.09787 | link |
2024-08-21 | Reconstruct Spine CT from Biplanar X-Rays via Diffusion Learning | Zhi Qiao et.al. | 2408.09731 | null |
2024-08-18 | FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model | Ziyu Yao et.al. | 2408.09384 | null |
2024-08-17 | Optimal Strip Attitude Command of Earth Observation Satellite using Differential Dynamic Programming | Seungyeop Han et.al. | 2408.09244 | null |
2024-08-16 | Explore Cross-Codec Quality-Rate Convex Hulls Relation for Adaptive Streaming | Masoumeh Farhadi Nia et.al. | 2408.09044 | null |
2024-08-16 | Evaluating the Evaluator: Measuring LLMs' Adherence to Task Evaluation Instructions | Bhuvanashree Murugadoss et.al. | 2408.08781 | null |
2024-08-16 | Speckle Noise Analysis for Synthetic Aperture Radar (SAR) Space Data | Sanjjushri Varshini R et.al. | 2408.08774 | null |
2024-08-16 | Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs | Jinming Liu et.al. | 2408.08575 | null |
2024-08-16 | Visual-Friendly Concept Protection via Selective Adversarial Perturbations | Xiaoyue Mi et.al. | 2408.08518 | link |
2024-08-16 | Achieving Complex Image Edits via Function Aggregation with Diffusion Models | Mohammadreza Samadi et.al. | 2408.08495 | null |
2024-08-15 | Level Up Your Tutorials: VLMs for Game Tutorials Quality Assessment | Daniele Rege Cambrin et.al. | 2408.08396 | link |
2024-08-15 | METR: Image Watermarking with Large Number of Unique Messages | Alexander Varlamov et.al. | 2408.08340 | link |
2024-08-15 | Accelerated Image-Aware Generative Diffusion Modeling | Tanmay Asthana et.al. | 2408.08306 | null |
2024-08-15 | Rethinking Medical Anomaly Detection in Brain MRI: An Image Quality Assessment Perspective | Zixuan Pan et.al. | 2408.08228 | link |
2024-08-15 | When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding | Pingping Zhang et.al. | 2408.08093 | null |
2024-08-15 | KGV: Integrating Large Language Models with Knowledge Graphs for Cyber Threat Intelligence Credibility Assessment | Zongzong Wu et.al. | 2408.08088 | null |
2024-08-15 | Conditional Brownian Bridge Diffusion Model for VHR SAR to Optical Image Translation | Seon-Hoon Kim et.al. | 2408.07947 | link |
2024-08-15 | MobileMEF: Fast and Efficient Method for Multi-Exposure Fusion | Lucas Nedel Kirsten et.al. | 2408.07932 | link |
2024-08-14 | New Curriculum, New Chance -- Retrieval Augmented Generation for Lesson Planning in Ugandan Secondary Schools. Prototype Quality Evaluation | Simon Kloker et.al. | 2408.07542 | null |
2024-08-14 | Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models | Jean-Marie Lemercier et.al. | 2408.07472 | null |
2024-08-14 | DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement | Tao Sun et.al. | 2408.07388 | null |
2024-08-13 | Direction of Arrival Correction through Speech Quality Feedback | Caleb Rascon et.al. | 2408.07234 | link |
2024-08-13 | SeLoRA: Self-Expanding Low-Rank Adaptation of Latent Diffusion Model for Medical Image Synthesis | Yuchen Mao et.al. | 2408.07196 | null |
2024-08-13 | BVI-UGC: A Video Quality Database for User-Generated Content Transcoding | Zihao Qi et.al. | 2408.07171 | null |
2024-08-13 | Efficient Deep Model-Based Optoacoustic Image Reconstruction | Christoph Dehner et.al. | 2408.07109 | null |
2024-08-13 | Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality | Yu-Chih Chen et.al. | 2408.07041 | null |
2024-08-13 | Feature-Preserving Rate-Distortion Optimization in Image Coding for Machines | Samuel Fernández Menduiña et.al. | 2408.07028 | null |
2024-08-13 | Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models | Cheng Chen et.al. | 2408.06995 | null |
2024-08-13 | Evaluating Research Quality with Large Language Models: An Analysis of ChatGPT's Effectiveness with Different Settings and Inputs | Mike Thelwall et.al. | 2408.06752 | null |
2024-08-13 | Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models | Chenqian Yan et.al. | 2408.06646 | null |
2024-08-13 | Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture | Yu Feng et.al. | 2408.06608 | null |
2024-08-13 | HDRGS: High Dynamic Range Gaussian Splatting | Jiahao Wu et.al. | 2408.06543 | link |
2024-08-12 | FoVNet: Configurable Field-of-View Speech Enhancement with Low Computation and Distortion for Smart Glasses | Zhongweiyang Xu et.al. | 2408.06468 | null |
2024-08-12 | Palantir: Towards Efficient Super Resolution for Ultra-high-definition Live Streaming | Xinqi Jin et.al. | 2408.06152 | link |
2024-08-12 | A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting | Felix Assion et.al. | 2408.06071 | null |
2024-08-12 | DiagESC: Dialogue Synthesis for Integrating Depression Diagnosis into Emotional Support Conversation | Seungyeon Seo et.al. | 2408.06044 | null |
2024-08-12 | A Sharpness Based Loss Function for Removing Out-of-Focus Blur | Uditangshu Aurangabadkar et.al. | 2408.06014 | link |
2024-08-12 | A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models | Taehong Moon et.al. | 2408.05927 | link |
2024-08-12 | Creating Arabic LLM Prompts at Scale | Abdelrahman El-Sheikh et.al. | 2408.05882 | null |
2024-08-11 | LaWa: Using Latent Space for In-Generation Image Watermarking | Ahmad Rezaei et.al. | 2408.05868 | null |
2024-08-14 | Iterative Improvement of an Additively Regularized Topic Model | Alex Gorbulev et.al. | 2408.05840 | null |
2024-08-11 | SSL: A Self-similarity Loss for Improving Generative Image Super-resolution | Du Chen et.al. | 2408.05713 | link |
2024-08-11 | Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators | Yifan Pu et.al. | 2408.05710 | link |
2024-08-11 | Evaluating BM3D and NBNet: A Comprehensive Study of Image Denoising Across Multiple Datasets | Ghazal Kaviani et.al. | 2408.05697 | null |
2024-08-09 | CBCT scatter correction with dual-layer flat-panel detector | Xin Zhang et.al. | 2408.04943 | null |
2024-08-09 | Self-augmented Gaussian Splatting with Structure-aware Masks for Sparse-view 3D Reconstruction | Lingbei Meng et.al. | 2408.04831 | null |
2024-08-08 | DaedalusData: Exploration, Knowledge Externalization and Labeling of Particles in Medical Manufacturing -- A Design Study | Alexander Wyss et.al. | 2408.04749 | null |
2024-08-08 | Sampling for View Synthesis: From Local Light Field Fusion to Neural Radiance Fields and Beyond | Ravi Ramamoorthi et.al. | 2408.04586 | null |
2024-08-11 | Synchronous Multi-modal Semantic Communication System with Packet-level Coding | Yun Tian et.al. | 2408.04535 | null |
2024-08-08 | Robustness investigation of quality measures for the assessment of machine learning models | Thomas Most et.al. | 2408.04391 | null |
2024-08-08 | SG-JND: Semantic-Guided Just Noticeable Distortion Predictor For Image Compression | Linhan Cao et.al. | 2408.04273 | null |
2024-08-08 | LLDif: Diffusion Models for Low-light Emotion Recognition | Zhifeng Wang et.al. | 2408.04235 | null |
2024-08-07 | Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation | Yiqing Shen et.al. | 2408.04098 | null |
2024-08-07 | Machine Learning-Based Reward-Driven Tuning of Scanning Probe Microscopy: Towards Fully Automated Microscopy | Yu Liu et.al. | 2408.04055 | null |
2024-08-07 | Global-Local Progressive Integration Network for Blind Image Quality Assessment | Xiaoqi Wang et.al. | 2408.03885 | null |
2024-08-07 | Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields | Joo Chan Lee et.al. | 2408.03822 | null |
2024-08-07 | Soft-Hard Attention U-Net Model and Benchmark Dataset for Multiscale Image Shadow Removal | Eirini Cholopoulou et.al. | 2408.03734 | null |
2024-08-07 | Monitoring of Hermit Crabs Using drone-captured imagery and Deep Learning based Super-Resolution Reconstruction and Improved YOLOv8 | Fan Zhao et.al. | 2408.03559 | null |
2024-08-07 | D2Styler: Advancing Arbitrary Style Transfer with Discrete Diffusion Methods | Onkar Susladkar et.al. | 2408.03558 | link |
2024-08-07 | PRTGS: Precomputed Radiance Transfer of Gaussian Splats for Real-Time High-Quality Relighting | Yijia Guo et.al. | 2408.03538 | null |
2024-08-06 | Image Quality Transfer of Diffusion MRI Guided By High-Resolution Structural MRI | Alp G. Cicimen et.al. | 2408.03216 | null |
2024-08-06 | Iterative CT Reconstruction via Latent Variable Optimization of Shallow Diffusion Models | Sho Ozaki et.al. | 2408.03156 | null |
2024-08-05 | VidGen-1M: A Large-Scale Dataset for Text-to-video Generation | Zhiyu Tan et.al. | 2408.02629 | null |
2024-08-05 | Cascading Refinement Video Denoising with Uncertainty Adaptivity | Xinyuan Yu et.al. | 2408.02284 | null |
2024-08-04 | PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance | Aoming Liu et.al. | 2408.02157 | null |
2024-08-06 | RICA2: Rubric-Informed, Calibrated Assessment of Actions | Abrar Majeedi et.al. | 2408.02138 | link |
2024-08-04 | View-consistent Object Removal in Radiance Fields | Yiren Lu et.al. | 2408.02100 | null |
2024-08-04 | Constructing Per-Shot Bitrate Ladders using Visual Information Fidelity | Krishna Srikar Durbha et.al. | 2408.01932 | null |
2024-08-03 | Landmark-guided Diffusion Model for High-fidelity and Temporally Coherent Talking Head Generation | Jintao Tan et.al. | 2408.01732 | null |
2024-08-03 | JambaTalk: Speech-Driven 3D Talking Head Generation Based on Hybrid Transformer-Mamba Model | Farzaneh Jafari et.al. | 2408.01627 | null |
2024-08-02 | Guardians of Image Quality: Benchmarking Defenses Against Adversarial Attacks on Image Quality Metrics | Alexander Gushchin et.al. | 2408.01541 | link |
2024-08-02 | Underwater Object Detection Enhancement via Channel Stabilization | Muhammad Ali et.al. | 2408.01293 | link |
2024-08-02 | Wave-Mamba: Wavelet State Space Model for Ultra-High-Definition Low-Light Image Enhancement | Wenbin Zou et.al. | 2408.01276 | link |
2024-08-02 | Reality Fusion: Robust Real-time Immersive Mobile Robot Teleoperation with Volumetric Visual Data Fusion | Ke Li et.al. | 2408.01225 | link |
2024-08-02 | Validation of an Analysability Model in Hybrid Quantum Software | Díaz-Muñoz Ana et.al. | 2408.01105 | null |
2024-08-06 | FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features for Highly Controllable Text-Driven Image Translation | Xiang Gao et.al. | 2408.00998 | link |
2024-08-01 | SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement | Mark Boss et.al. | 2408.00653 | null |
2024-08-01 | Regional quality estimation for echocardiography using deep learning | Gilles Van De Vyver et.al. | 2408.00591 | null |
2024-08-01 | Image Super-Resolution with Taylor Expansion Approximation and Large Field Reception | Jiancong Feng et.al. | 2408.00470 | null |
2024-08-01 | RDP: Ranked Differential Privacy for Facial Feature Protection in Multiscale Sparsified Subspace | Lu Ou et.al. | 2408.00294 | null |
2024-07-31 | Generative Diffusion Model for Seismic Imaging Improvement of Sparsely Acquired Data and Uncertainty Quantification | Xingchen Shi et.al. | 2407.21683 | null |
2024-07-31 | Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model | Zhichao Zhang et.al. | 2407.21408 | null |
2024-07-31 | An all-sky catalogue of stellar reddening values | E. Paunzen et.al. | 2407.21373 | null |
2024-07-31 | ESIQA: Perceptual Quality Assessment of Vision-Pro-based Egocentric Spatial Images | Xilei Zhu et.al. | 2407.21363 | null |
2024-08-01 | Outlier Detection in Large Radiological Datasets using UMAP | Mohammad Tariqul Islam et.al. | 2407.21263 | link |
2024-07-30 | MP-You: A Web-based MPI Simulation Tool | The-Vinh Tran-Luu et.al. | 2407.21155 | null |
2024-07-30 | Simultaneous Multi-Slice Diffusion Imaging using Navigator-free Multishot Spiral Acquisition | Yuancheng Jiang et.al. | 2407.20904 | null |
2024-07-30 | Highly Efficient No-reference 4K Video Quality Assessment with Full-Pixel Covering Sampling and Training Strategy | Xiaoheng Tan et.al. | 2407.20766 | null |
2024-07-30 | Questionnaires for Everyone: Streamlining Cross-Cultural Questionnaire Adaptation with GPT-Based Translation Quality Evaluation | Otso Haavisto et.al. | 2407.20608 | link |
2024-07-29 | Mean Opinion Score as a New Metric for User-Evaluation of XAI Methods | Hyeon Yu et.al. | 2407.20427 | null |
2024-07-29 | Sun Off, Lights On: Photorealistic Monocular Nighttime Simulation for Robust Semantic Perception | Konstantinos Tzevelekakis et.al. | 2407.20336 | null |
2024-07-29 | DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion Models | Jing Yang et.al. | 2407.20141 | null |
2024-07-29 | HeadsetOff: Enabling Photorealistic Video Conferencing on Economical VR Headsets | Yili Jin et.al. | 2407.19988 | null |
2024-07-29 | Noise-Resilient Unsupervised Graph Representation Learning via Multi-Hop Feature Quality Estimation | Shiyuan Li et.al. | 2407.19944 | null |
2024-07-29 | FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention | Yu Lu et.al. | 2407.19918 | null |
2024-07-29 | ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement | Ezequiel Perez-Zarate et.al. | 2407.19708 | link |
2024-07-29 | UNQA: Unified No-Reference Quality Assessment for Audio, Image, Video, and Audio-Visual Content | Yuqin Cao et.al. | 2407.19704 | null |
2024-07-29 | Semi-Supervised Teacher-Reference-Student Architecture for Action Quality Assessment | Wulian Yun et.al. | 2407.19675 | null |
2024-07-28 | X-Fake: Juggling Utility Evaluation and Explanation of Simulated SAR Images | Zhongling Huang et.al. | 2407.19436 | null |
2024-07-27 | Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network | Gang Pan et.al. | 2407.19271 | null |
2024-07-27 | Towards Clean-Label Backdoor Attacks in the Physical World | Thinh Dao et.al. | 2407.19203 | null |
2024-07-26 | Regularized Multi-Decoder Ensemble for an Error-Aware Scene Representation Network | Tianyu Xiong et.al. | 2407.19082 | null |
2024-07-26 | Correcting for objective sample refractive index mismatch in extended field of view selective plane illumination microscopy | Steven J. Sheppard et.al. | 2407.18862 | null |
2024-07-25 | Joint RGB-Spectral Decomposition Model Guided Image Enhancement in Mobile Photography | Kailai Zhou et.al. | 2407.17996 | link |
2024-07-29 | Invariance of deep image quality metrics to affine transformations | Nuria Alabau-Bosque et.al. | 2407.17927 | link |
2024-07-25 | Artificial Immunofluorescence in a Flash: Rapid Synthetic Imaging from Brightfield Through Residual Diffusion | Xiaodan Xing et.al. | 2407.17882 | null |
2024-07-24 | Final Alignment and Image Quality Test for the Acquisition and Guiding System of SOXS | J. A. Araiza-Duran et.al. | 2407.17382 | null |
2024-07-24 | SOXS NIR: Optomechanical integration and alignment, optical performance verification before full instrument assembly | M. Genoni et.al. | 2407.17244 | null |
2024-07-24 | Q-Ground: Image Quality Grounding with Large Multi-modality Models | Chaofeng Chen et.al. | 2407.17035 | link |
2024-07-24 | 3DAttGAN: A 3D Attention-based Generative Adversarial Network for Joint Space-Time Video Super-Resolution | Congrui Fu et.al. | 2407.16965 | link |
2024-07-24 | SAR to Optical Image Translation with Color Supervised Diffusion Model | Xinyu Bai et.al. | 2407.16921 | null |
2024-07-23 | QPT V2: Masked Image Modeling Advances Visual Scoring | Qizhi Xie et.al. | 2407.16541 | link |
2024-07-23 | ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation | Zhenhua Wu et.al. | 2407.16508 | null |
2024-07-23 | On Differentially Private 3D Medical Image Synthesis with Controllable Latent Diffusion Models | Deniz Daum et.al. | 2407.16405 | link |
2024-07-23 | Improving multidimensional projection quality with user-specific metrics and optimal scaling | Maniru Ibrahim et.al. | 2407.16328 | null |
2024-07-23 | A new visual quality metric for Evaluating the performance of multidimensional projections | Maniru Ibrahim et.al. | 2407.16309 | null |
2024-07-23 | Integrating Meshes and 3D Gaussians for Indoor Scene Reconstruction with SAM Mask Guidance | Jiyeop Kim et.al. | 2407.16173 | null |
2024-07-23 | Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos | Jiahe Liu et.al. | 2407.16124 | link |
2024-07-22 | Enhancing Cell Instance Segmentation in Scanning Electron Microscopy Images via a Deep Contour Closing Operator | Florian Robert et.al. | 2407.15817 | null |
2024-07-22 | SS-SFR: Synthetic Scenes Spatial Frequency Response on Virtual KITTI and Degraded Automotive Simulations for Object Detection | Daniel Jakab et.al. | 2407.15646 | null |
2024-07-22 | Experimenting with Adaptive Bitrate Algorithms for Virtual Reality Streaming over Wi-Fi | Ferran Maura et.al. | 2407.15614 | link |
2024-07-22 | SpotDiffusion: A Fast Approach For Seamless Panorama Generation Over Time | Stanislav Frolov et.al. | 2407.15507 | null |
2024-07-22 | Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures | Ruizhe Wang et.al. | 2407.15435 | null |
2024-07-21 | Assessing Sample Quality via the Latent Space of Generative Models | Jingyi Xu et.al. | 2407.15171 | link |
2024-07-20 | Non-Reference Quality Assessment for Medical Imaging: Application to Synthetic Brain MRIs | Karl Van Eeden Risager et.al. | 2407.14994 | null |
2024-07-20 | Deep Learning CT Image Restoration using System Blur and Noise Models | Yijie Yuan et.al. | 2407.14983 | null |
2024-07-20 | GreenStableYolo: Optimizing Inference Time and Image Quality of Text-to-Image Generation | Jingzhi Gong et.al. | 2407.14982 | link |
2024-07-20 | Dual High-Order Total Variation Model for Underwater Image Restoration | Yuemei Li et.al. | 2407.14868 | link |
2024-07-20 | CBCTLiTS: A Synthetic, Paired CBCT/CT Dataset For Segmentation And Style Transfer | Maximilian E. Tschuchnig et.al. | 2407.14853 | null |
2024-07-20 | Realistic Surgical Image Dataset Generation Based On 3D Gaussian Splatting | Tianle Zeng et.al. | 2407.14846 | null |
2024-07-20 | Difflare: Removing Image Lens Flare with Latent Diffusion Model | Tianwen Zhou et.al. | 2407.14746 | link |
2024-07-20 | Polarimetric compressed sensing with hollow, self-assembled diffractive films | Ji Feng et.al. | 2407.14722 | null |
2024-07-19 | A Minibatch Alternating Projections Algorithm for Robust and Efficient Magnitude Least-Squares RF Pulse Design in MRI | Jonathan B. Martin et.al. | 2407.14696 | null |
2024-07-19 | A Benchmark for Gaussian Splatting Compression and Quality Assessment Study | Qi Yang et.al. | 2407.14197 | link |
2024-07-19 | Shape and Style GAN-based Multispectral Data Augmentation for Crop/Weed Segmentation in Precision Farming | Mulham Fawakherji et.al. | 2407.14119 | null |
2024-07-19 | DirectL: Efficient Radiance Fields Rendering for 3D Light Field Displays | Zongyuan Yang et.al. | 2407.14053 | null |
2024-07-19 | Personalized Privacy Protection Mask Against Unauthorized Facial Recognition | Ka-Ho Chow et.al. | 2407.13975 | link |
2024-07-18 | Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion | Boyang Deng et.al. | 2407.13759 | null |
2024-07-18 | A Novel Freeform Slicer IFU for the Magellan InfraRed Multi-Object Spectrograph (MIRMOS) | Maren Cosens et.al. | 2407.13747 | null |
2024-07-18 | HazeCLIP: Towards Language Guided Real-World Image Dehazing | Ruiyi Wang et.al. | 2407.13719 | link |
2024-07-18 | Removing cloud shadows from ground-based solar imagery | Amal Chaoui et.al. | 2407.13379 | null |
2024-07-18 | Any Image Restoration with Efficient Automatic Degradation Adaptation | Bin Ren et.al. | 2407.13372 | link |
2024-07-18 | Heterogeneous Clinical Trial Outcomes via Multi-Output Gaussian Processes | Owen Thomas et.al. | 2407.13283 | null |
2024-07-18 | Research on Image Super-Resolution Reconstruction Mechanism based on Convolutional Neural Network | Hao Yan et.al. | 2407.13211 | null |
2024-07-18 | Learned HDR Image Compression for Perceptually Optimal Storage and Display | Peibei Cao et.al. | 2407.13179 | null |
2024-07-18 | Image Inpainting Models are Effective Tools for Instruction-guided Image Editing | Xuan Ju et.al. | 2407.13139 | null |
2024-07-18 | Enhanced Denoising of OCT Images Using Residual U-Net: A Cross-Modality Approach on PSOCT and ASOCT for Clinical Diagnostics | Akkidas Noel Prakasha et.al. | 2407.13090 | null |
2024-07-17 | Hallucination Index: An Image Quality Metric for Generative Reconstruction Models | Matthew Tivnan et.al. | 2407.12780 | null |
2024-07-17 | CoSIGN: Few-Step Guidance of ConSIstency Model to Solve General INverse Problems | Jiankun Zhao et.al. | 2407.12676 | link |
2024-07-17 | High Frequency Matters: Uncertainty Guided Image Compression with Wavelet Diffusion | Juan Song et.al. | 2407.12538 | link |
2024-07-17 | Fast Context-Based Low-Light Image Enhancement via Neural Implicit Representations | Tomáš Chobola et.al. | 2407.12511 | link |
2024-07-17 | Enhancing Film Grain Coding in VVC: Improving Encoding Quality and Efficiency | Vignesh V Menon et.al. | 2407.12465 | null |
2024-07-17 | Voltage-Controlled Magnetoelectric Devices for Neuromorphic Diffusion Process | Yang Cheng et.al. | 2407.12261 | null |
2024-07-16 | Semantic Communication for the Internet of Sounds: Architecture, Design Principles, and Challenges | Chengsi Liang et.al. | 2407.12203 | null |
2024-07-16 | Neural Passage Quality Estimation for Static Pruning | Xuejun Chang et.al. | 2407.12170 | link |
2024-07-16 | MVG-Splatting: Multi-View Guided Gaussian Splatting with Adaptive Quantile-Based Geometric Consistency Densification | Zhuoxiao Li et.al. | 2407.11840 | null |
2024-07-16 | LoFTI: Localization and Factuality Transfer to Indian Locales | Sona Elza Simon et.al. | 2407.11833 | link |
2024-07-16 | XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach | Truong Thanh Hung Nguyen et.al. | 2407.11771 | null |
2024-07-16 | ITI-IQA: a Toolbox for Heterogeneous Univariate and Multivariate Missing Data Imputation Quality Assessment | Pedro Pons-Suñer et.al. | 2407.11767 | null |
2024-07-16 | Magnetogram-to-Magnetogram: Generative Forecasting of Solar Evolution | Francesco Pio Ramunno et.al. | 2407.11659 | link |
2024-07-16 | ReLaX-VQA: Residual Fragment and Layer Stack Extraction for Enhancing Video Quality Assessment | Xinyi Wang et.al. | 2407.11496 | link |
2024-07-16 | Cover-separable Fixed Neural Network Steganography via Deep Generative Models | Guobiao Li et.al. | 2407.11405 | link |
2024-07-16 | Ev-GS: Event-based Gaussian splatting for Efficient and Accurate Radiance Field Rendering | Jingqian Wu et.al. | 2407.11343 | null |
2024-07-15 | UFQA: Utility guided Fingerphoto Quality Assessment | Amol S. Joshi et.al. | 2407.11141 | null |
2024-07-15 | Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation | Tu Vu et.al. | 2407.10817 | null |
2024-07-15 | Melon Fruit Detection and Quality Assessment Using Generative AI-Based Image Data Augmentation | Seungri Yoon et.al. | 2407.10413 | null |
2024-07-15 | Exploring the Impact of Moire Pattern on Deepfake Detectors | Razaib Tariq et.al. | 2407.10399 | null |
2024-07-14 | Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models | Qinyu Yang et.al. | 2407.10285 | link |
2024-07-14 | Low Sensitivity Hopsets | Vikrant Ashvinkumar et.al. | 2407.10249 | null |
2024-07-14 | A Novel Approach to Ultrasound Beamforming using Synthetic Transmit Aperture with Low Complexity and High SNR for Medical Imaging | Thenmozhi Elango et.al. | 2407.10242 | null |
2024-07-13 | Asynchronous Feedback Network for Perceptual Point Cloud Quality Assessment | Yujie Zhang et.al. | 2407.09806 | null |
2024-07-12 | Quantum-dot-based Kitaev chains: Majorana quality measures and scaling with increasing chain length | Viktor Svensson et.al. | 2407.09211 | null |
2024-07-12 | HPC: Hierarchical Progressive Coding Framework for Volumetric Video | Zihan Zheng et.al. | 2407.09026 | null |
2024-07-12 | Task-driven single-image super-resolution reconstruction of document scans | Maciej Zyrek et.al. | 2407.08993 | null |
2024-07-12 | LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models | Hai Jiang et.al. | 2407.08939 | link |
2024-07-12 | 15M Multimodal Facial Image-Text Dataset | Dawei Dai et.al. | 2407.08515 | null |
2024-07-11 | Imitation Learning for Robotic Assisted Ultrasound Examination of Deep Venous Thrombosis using Kernelized Movement Primitives | Diego Dall'Alba et.al. | 2407.08506 | null |
2024-07-11 | E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors | Jinxiu Liang et.al. | 2407.08231 | null |
2024-07-11 | Explicit_NeRF_QA: A Quality Assessment Database for Explicit NeRF Model Compression | Yuke Xing et.al. | 2407.08165 | null |
2024-07-10 | Coherent and Multi-modality Image Inpainting via Latent Space Optimization | Lingzhi Pan et.al. | 2407.08019 | null |
2024-07-10 | Intensity-sensitive quality assessment of extended sources in astronomical images | X. Li et.al. | 2407.07863 | link |
2024-07-12 | Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization | Feixiang Zhou et.al. | 2407.07673 | null |
2024-07-10 | Video In-context Learning | Wentao Zhang et.al. | 2407.07356 | null |
2024-07-10 | Pairwise Distance Distillation for Unsupervised Real-World Image Super-Resolution | Yuehan Zhang et.al. | 2407.07302 | link |
2024-07-09 | HAMIL-QA: Hierarchical Approach to Multiple Instance Learning for Atrial LGE MRI Quality Assessment | K M Arefeen Sultan et.al. | 2407.07254 | null |
2024-07-09 | Scaling Up Personalized Aesthetic Assessment via Task Vector Customization | Jooyeol Yun et.al. | 2407.07176 | null |
2024-07-09 | Microsoft Cloud-based Digitization Workflow with Rich Metadata Acquisition for Cultural Heritage Objects | Krzysztof Kutt et.al. | 2407.06972 | null |
2024-07-09 | CoLA: Conditional Dropout and Language-driven Robust Dual-modal Salient Object Detection | Shuang Hao et.al. | 2407.06780 | link |
2024-07-09 | Masked Video and Body-worn IMU Autoencoder for Egocentric Action Recognition | Mingfang Zhang et.al. | 2407.06628 | null |
2024-07-09 | Sparse-DeRF: Deblurred Neural Radiance Fields from Sparse View | Dogyoon Lee et.al. | 2407.06613 | null |
2024-07-09 | Low-dose, high-resolution CT of infant-sized lungs via propagation-based phase contrast | James A. Pollock et.al. | 2407.06527 | null |
2024-07-08 | MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions | Xuan Ju et.al. | 2407.06358 | null |
2024-07-08 | Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision | Orr Zohar et.al. | 2407.06189 | link |
2024-07-08 | PanDORA: Casual HDR Radiance Acquisition for Indoor Scenes | Mohammad Reza Karimi Dastjerdi et.al. | 2407.06150 | null |
2024-07-08 | Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation | Xinyu Bai et.al. | 2407.06095 | null |
2024-07-08 | Pan-denoising: Guided Hyperspectral Image Denoising via Weighted Represent Coefficient Total Variation | Shuang Xu et.al. | 2407.06064 | link |
2024-07-08 | MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices | Jianwen Jiang et.al. | 2407.05712 | null |
2024-07-09 | PCAC-GAN:ASparse-Tensor-Based Generative Adversarial Network for 3D Point Cloud Attribute Compression | Xiaolong Mao et.al. | 2407.05677 | null |
2024-07-08 | GSBIQA: Green Saliency-guided Blind Image Quality Assessment Method | Zhanxuan Mei et.al. | 2407.05590 | null |
2024-07-08 | Audio-driven High-resolution Seamless Talking Head Video Editing via StyleGAN | Jiacheng Su et.al. | 2407.05577 | null |
2024-07-06 | Panopticon: a telescope for our times | Will Saunders et.al. | 2407.05103 | null |
2024-07-06 | CLIPVQA:Video Quality Assessment via CLIP | Fengchuang Xing et.al. | 2407.04928 | link |
2024-07-06 | OmChat: A Recipe to Train Multimodal Language Models with Strong Long Context and Video Understanding | Tiancheng Zhao et.al. | 2407.04923 | null |
2024-07-05 | MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? | Zhaorun Chen et.al. | 2407.04842 | link |
2024-07-05 | Resource-Efficient Speech Quality Prediction through Quantization Aware Training and Binary Activation Maps | Mattias Nilsson et.al. | 2407.04578 | link |
2024-07-05 | Rethinking Image Compression on the Web with Generative AI | Shayan Ali Hassan et.al. | 2407.04542 | null |
2024-07-05 | Optimizing the image correction pipeline for pedestrian detection in the thermal-infrared domain | Christophe Karam et.al. | 2407.04484 | null |
2024-07-05 | Unsupervised Video Summarization via Reinforcement Learning and a Trained Evaluator | Mehryar Abbasi et.al. | 2407.04258 | null |
2024-07-05 | HCS-TNAS: Hybrid Constraint-driven Semi-supervised Transformer-NAS for Ultrasound Image Segmentation | Renqi Chen et.al. | 2407.04203 | null |
2024-07-04 | Performance of Medical Image Fusion in High-level Analysis Tasks: A Mutual Enhancement Framework for Unaligned PAT and MRI Image Fusion | Yutian Zhong et.al. | 2407.03992 | link |
2024-07-04 | DSMix: Distortion-Induced Sensitivity Map Based Pre-training for No-Reference Image Quality Assessment | Jinsong Shi et.al. | 2407.03886 | link |
2024-07-04 | Perception-Guided Quality Metric of 3D Point Clouds Using Hybrid Strategy | Yujie Zhang et.al. | 2407.03885 | null |
2024-07-04 | DiffRetouch: Using Diffusion to Retouch on the Shoulder of Experts | Zheng-Peng Duan et.al. | 2407.03757 | null |
2024-07-04 | Adaptive sampling strategy for tolerance analysis of freeform optical surfaces based on critical ray aiming | Rundong Fan et.al. | 2407.03688 | null |
2024-07-04 | Pathological Semantics-Preserving Learning for H&E-to-IHC Virtual Staining | Fuqiang Chen et.al. | 2407.03655 | link |
2024-07-04 | Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration | Yuhong Zhang et.al. | 2407.03636 | null |
2024-07-04 | Orthogonal Constrained Minimization with Tensor |
Xiaoxia Liu et.al. | 2407.03605 | null |
2024-07-03 | Semantic-Aware Power Allocation for Generative Semantic Communications with Foundation Models | Chunmei Xu et.al. | 2407.03050 | null |
2024-07-03 | Single Image Rolling Shutter Removal with Diffusion Models | Zhanglei Yang et.al. | 2407.02906 | null |
2024-07-03 | FedPot: A Quality-Aware Collaborative and Incentivized Honeypot-Based Detector for Smart Grid Networks | Abdullatif Albaseer et.al. | 2407.02845 | null |
2024-07-03 | Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm and Compiler Co-Design | Gen Li et.al. | 2407.02813 | link |
2024-07-03 | SF-GNN: Self Filter for Message Lossless Propagation in Deep Graph Neural Network | Yushan Zhu et.al. | 2407.02762 | null |
2024-07-03 | MentalAgora: A Gateway to Advanced Personalized Care in Mental Health through Multi-Agent Debating and Attribute Control | Yeonji Lee et.al. | 2407.02736 | null |
2024-07-02 | Meta 3D Gen | Raphael Bensadoun et.al. | 2407.02599 | null |
2024-07-02 | Off-Grid Ultrasound Imaging by Stochastic Optimization | Vincent van de Schaft et.al. | 2407.02285 | link |
2024-07-02 | SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules | Suyi Li et.al. | 2407.02031 | null |
2024-07-01 | Free-text Rationale Generation under Readability Level Control | Yi-Sheng Hsu et.al. | 2407.01384 | null |
2024-07-01 | GaussianStego: A Generalizable Stenography Pipeline for Generative 3D Gaussians Splatting | Chenxin Li et.al. | 2407.01301 | null |
2024-07-01 | Optical turbulence vertical distribution at the Peak Terskol Observatory and Mt. Kurapdag | A. Y. Shikhovtsev et.al. | 2407.00960 | null |
2024-07-01 | Diffusion Transformer Model With Compact Prior for Low-dose PET Reconstruction | Bin Huang et.al. | 2407.00944 | null |
2024-06-30 | A Comparative Study of Quality Evaluation Methods for Text Summarization | Huyen Nguyen et.al. | 2407.00747 | null |
2024-06-30 | DCI: An Accurate Quality Assessment Criteria for Protein Complex Structure Models | Wenda Wang et.al. | 2407.00560 | null |
2024-06-29 | Dynamic Optimization of Video Streaming Quality Using Network Digital Twin Technology | Zurh Farus et.al. | 2407.00513 | null |
2024-07-02 | RTGS: Enabling Real-Time Gaussian Splatting on Mobile Devices Using Efficiency-Guided Pruning and Foveated Rendering | Weikai Lin et.al. | 2407.00435 | link |
2024-06-29 | Benchmark Evaluation of Image Fusion algorithms for Smartphone Camera Capture | Lucas N. Kirsten et.al. | 2407.00301 | null |
2024-06-28 | PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration | Yuxuan Sun et.al. | 2407.00203 | null |
2024-06-28 | Quantitative Methods in Research Evaluation Citation Indicators, Altmetrics, and Artificial Intelligence | Mike Thelwall et.al. | 2407.00135 | null |
2024-06-28 | MR-zero meets FLASH -- Controlling the transient signal decay in gradient- and rf-spoiled gradient echo sequences | Simon Weinmüller et.al. | 2406.19877 | null |
2024-06-28 | Deep Fusion Model for Brain Tumor Classification Using Fine-Grained Gradient Preservation | Niful Islam et.al. | 2406.19690 | null |
2024-06-28 | UltraGelBot: Autonomous Gel Dispenser for Robotic Ultrasound | Deepak Raina et.al. | 2406.19678 | null |
2024-06-28 | PopAlign: Population-Level Alignment for Fair Text-to-Image Generation | Shufan Li et.al. | 2406.19668 | link |
2024-06-27 | Robustness Testing of Black-Box Models Against CT Degradation Through Test-Time Augmentation | Jack Highton et.al. | 2406.19557 | null |
2024-06-27 | Lightweight Predictive 3D Gaussian Splats | Junli Cao et.al. | 2406.19434 | link |
2024-06-27 | Looking 3D: Anomaly Detection with 2D-3D Alignment | Ankan Bhunia et.al. | 2406.19393 | link |
2024-06-27 | AI Data Readiness Inspector (AIDRIN) for Quantitative Assessment of Data Readiness for AI | Kaveen Hiniduma et.al. | 2406.19256 | null |
2024-06-27 | Staggered Quantizers for Perfect Perceptual Quality: A Connection between Quantizers with Common Randomness and Without | Ruida Zhou et.al. | 2406.19248 | null |
2024-06-27 | Local Manifold Learning for No-Reference Image Quality Assessment | Timin Gao et.al. | 2406.19247 | null |
2024-06-27 | Complex-valued scatter compensation in nonlinear microscopy | Maximilian Sohmen et.al. | 2406.19031 | null |
2024-06-27 | Using diffusion model as constraint: Empower Image Restoration Network Training with Diffusion Model | Jiangtong Tan et.al. | 2406.19030 | link |
2024-06-26 | IDA-UIE: An Iterative Framework for Deep Network-based Degradation Aware Underwater Image Enhancement | Pranjali Singh et.al. | 2406.18628 | null |
2024-06-26 | On Scaling Up 3D Gaussian Splatting Training | Hexu Zhao et.al. | 2406.18533 | link |
2024-06-26 | MultiDiff: Consistent Novel View Synthesis from a Single Image | Norman Müller et.al. | 2406.18524 | null |
2024-06-26 | ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation | Shenghai Yuan et.al. | 2406.18522 | link |
2024-06-26 | MFDNet: Multi-Frequency Deflare Network for Efficient Nighttime Flare Removal | Yiguo Jiang et.al. | 2406.18079 | link |
2024-06-26 | Leveraging Pre-trained Models for FF-to-FFPE Histopathological Image Translation | Qilai Zhang et.al. | 2406.18054 | link |
2024-06-25 | Burst Image Super-Resolution with Base Frame Selection | Sanghyun Kim et.al. | 2406.17869 | null |
2024-06-25 | Sparse-view Signal-domain Photoacoustic Tomography Reconstruction Method Based on Neural Representation | Bowei Yao et.al. | 2406.17578 | null |
2024-06-25 | UHD-IQA Benchmark Database: Pushing the Boundaries of Blind Photo Quality Assessment | Vlad Hosu et.al. | 2406.17472 | null |
2024-06-25 | Leveraging LLMs for Dialogue Quality Measurement | Jinghan Jia et.al. | 2406.17304 | null |
2024-06-25 | HD snapshot diffractive spectral imaging and inferencing | Apratim Majumder et.al. | 2406.17302 | null |
2024-06-25 | Image-Guided Outdoor LiDAR Perception Quality Assessment for Autonomous Driving | Ce Zhang et.al. | 2406.17265 | null |
2024-06-25 | Disentangled Motion Modeling for Video Frame Interpolation | Jaihyun Lew et.al. | 2406.17256 | link |
2024-06-24 | Evaluating the Quality of Hallucination Benchmarks for Large Vision-Language Models | Bei Yan et.al. | 2406.17115 | link |
2024-06-24 | Fine-tuning Diffusion Models for Enhancing Face Quality in Text-to-image Generation | Zhenyi Liao et.al. | 2406.17100 | link |
2024-06-24 | Reducing the Memory Footprint of 3D Gaussian Splatting | Panagiotis Papantonakis et.al. | 2406.17074 | null |
2024-06-24 | 3D distortion-free, reduced field of view diffusion-prepared GRE at 3T | Sarah McElroy et.al. | 2406.16809 | null |
2024-06-24 | Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation | Katherine M. Collins et.al. | 2406.16807 | null |
2024-06-24 | Vision-Language Consistency Guided Multi-modal Prompt Learning for Blind AI Generated Image Quality Assessment | Jun Fu et.al. | 2406.16641 | link |
2024-06-24 | DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution | Aiwen Jiang et.al. | 2406.16477 | link |
2024-06-24 | Approximate DCT and Quantization Techniques for Energy-Constrained Image Sensors | Ming-Che Li et.al. | 2406.16358 | null |
2024-06-24 | Priorformer: A UGC-VQA Method with content and distortion priors | Yajing Pei et.al. | 2406.16297 | null |
2024-06-23 | Listen and Move: Improving GANs Coherency in Agnostic Sound-to-Video Generation | Rafael Redondo et.al. | 2406.16155 | null |
2024-06-23 | LGS: A Light-weight 4D Gaussian Splatting for Efficient Surgical Scene Reconstruction | Hengyu Liu et.al. | 2406.16073 | link |
2024-06-22 | Quality-guided Skin Tone Enhancement for Portrait Photography | Shiqi Gao et.al. | 2406.15848 | null |
2024-06-21 | Adaptive Self-Supervised Consistency-Guided Diffusion Model for Accelerated MRI Reconstruction | Mojtaba Safari et.al. | 2406.15656 | null |
2024-06-21 | Contrastive Entity Coreference and Disambiguation for Historical Texts | Abhishek Arora et.al. | 2406.15576 | null |
2024-06-21 | Masked Extended Attention for Zero-Shot Virtual Try-On In The Wild | Nadav Orzech et.al. | 2406.15331 | null |
2024-06-21 | Towards Robust Training Datasets for Machine Learning with Ontologies: A Case Study for Emergency Road Vehicle Detection | Lynn Vonderhaar et.al. | 2406.15268 | null |
2024-06-24 | VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation | Xuan He et.al. | 2406.15252 | null |
2024-06-21 | Unsupervised Bayesian Generation of Synthetic CT from CBCT Using Patient-Specific Score-Based Prior | Junbo Peng et.al. | 2406.15219 | null |
2024-06-21 | Benchmarking Retinal Blood Vessel Segmentation Models for Cross-Dataset and Cross-Disease Generalization | Jeremiah Fadugba et.al. | 2406.14994 | link |
2024-06-21 | Fair Text to Medical Image Diffusion Model with Subgroup Distribution Aligned Tuning | Xu Han et.al. | 2406.14847 | null |
2024-06-21 | Is this a bad table? A Closer Look at the Evaluation of Table Generation from Text | Pritika Ramu et.al. | 2406.14829 | null |
2024-06-20 | Holistic Evaluation for Interleaved Text-and-Image Generation | Minqian Liu et.al. | 2406.14643 | null |
2024-06-20 | A Fuzzy Logic-Based Quality Model For Identifying Microservices With Low Maintainability | Rahime Yilmaz et.al. | 2406.14489 | null |
2024-06-20 | Enhancing multivariate post-processed visibility predictions utilizing CAMS forecasts | Mária Lakatos et.al. | 2406.14159 | null |
2024-06-20 | EnTruth: Enhancing the Traceability of Unauthorized Dataset Usage in Text-to-image Diffusion Models with Minimal and Robust Alterations | Jie Ren et.al. | 2406.13933 | null |
2024-06-19 | IG-CFAT: An Improved GAN-Based Framework for Effectively Exploiting Transformers in Real-World Image Super-Resolution | Alireza Aghelan et.al. | 2406.13815 | link |
2024-06-19 | Convex-hull Estimation using XPSNR for Versatile Video Coding | Vignesh V Menon et.al. | 2406.13712 | null |
2024-06-19 | Assessing the 3D resolution of refocused correlation plenoptic images using a general-purpose image quality estimator | Gianlorenzo Massaro et.al. | 2406.13501 | null |
2024-06-19 | ALiiCE: Evaluating Positional Fine-grained Citation Generation | Yilong Xu et.al. | 2406.13375 | link |
2024-06-19 | AniFaceDiff: High-Fidelity Face Reenactment via Facial Parametric Conditioned Diffusion Models | Ken Chen et.al. | 2406.13272 | null |
2024-06-19 | New methods for ALMA angular-scale based observation scheduling, quality assessment, and beam shaping II: refinements | Dirk Petry et.al. | 2406.13199 | null |
2024-06-18 | NTIRE 2024 Challenge on Night Photography Rendering | Egor Ershov et.al. | 2406.13007 | null |
2024-06-18 | Pattern or Artifact? Interactively Exploring Embedding Quality with TRACE | Edith Heiter et.al. | 2406.12953 | link |
2024-06-18 | Automatic generation of insights from workers' actions in industrial workflows with explainable Machine Learning | Francisco de Arriba-Pérez et.al. | 2406.12732 | null |
2024-06-18 | Learned Image Compression for HE-stained Histopathological Images via Stain Deconvolution | Maximilian Fischer et.al. | 2406.12623 | null |
2024-06-18 | Training Diffusion Models with Federated Learning | Matthijs de Goede et.al. | 2406.12575 | null |
2024-06-18 | Automated MRI Quality Assessment of Brain T1-weighted MRI in Clinical Data Warehouses: A Transfer Learning Approach Relying on Artefact Simulation | Sophie Loizillon et.al. | 2406.12448 | link |
2024-06-18 | AI-Assisted Human Evaluation of Machine Translation | Vilém Zouhar et.al. | 2406.12419 | null |
2024-06-18 | SDNIA-YOLO: A Robust Object Detection Model for Extreme Weather Conditions | Yuexiong Ding et.al. | 2406.12395 | null |
2024-06-17 | A Hierarchical 3D Gaussian Representation for Real-Time Rendering of Very Large Datasets | Bernhard Kerbl et.al. | 2406.12080 | null |
2024-06-17 | FinTruthQA: A Benchmark Dataset for Evaluating the Quality of Financial Information Disclosure | Ziyue Xu et.al. | 2406.12009 | link |
2024-06-17 | RetinaGS: Scalable Training for Dense Scene Rendering with Billion-Scale 3D Gaussians | Bingling Li et.al. | 2406.11836 | null |
2024-06-17 | Latent Denoising Diffusion GAN: Faster sampling, Higher image quality | Luan Thanh Trinh et.al. | 2406.11713 | link |
2024-06-17 | Multimodal Learning To Improve Segmentation With Intraoperative CBCT & Preoperative CT | Maximilian E. Tschuchnig et.al. | 2406.11650 | null |
2024-06-17 | Unveiling the Power of Source: Source-based Minimum Bayes Risk Decoding for Neural Machine Translation | Boxuan Lyu et.al. | 2406.11632 | null |
2024-06-17 | Compressed Skinning for Facial Blendshapes | Ladislav Kavan et.al. | 2406.11597 | null |
2024-06-17 | Energy Reduction Opportunities in HDR Video Encoding | Christian Herglotz et.al. | 2406.11492 | null |
2024-06-17 | A Dictionary Based Approach for Removing Out-of-Focus Blur | Uditangshu Aurangabadkar et.al. | 2406.11330 | link |
2024-06-17 | NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation | Niu Guanchen et.al. | 2406.11259 | null |
2024-06-17 | Incentivizing Quality Text Generation via Statistical Contracts | Eden Saig et.al. | 2406.11118 | link |
2024-06-16 | Parameter Blending for Multi-Camera Harmonization for Automotive Surround View Systems | Yuzhuo Ren et.al. | 2406.11066 | null |
2024-06-16 | SingMOS: An extensive Open-Source Singing Voice Dataset for MOS Prediction | Yuxun Tang et.al. | 2406.10911 | null |
2024-06-15 | MDeRainNet: An Efficient Neural Network for Rain Streak Removal from Macro-pixel Images | Tao Yan et.al. | 2406.10652 | null |
2024-06-15 | Exploring the Impact of AI-generated Image Tools on Professional and Non-professional Users in the Art and Design Fields | Yuying Tang et.al. | 2406.10640 | null |
2024-06-15 | Full reference point cloud quality assessment using support vector regression | Ryosuke Watanabe et.al. | 2406.10520 | link |
2024-06-15 | CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation | Wei Chen et.al. | 2406.10462 | null |
2024-06-14 | Consistency-diversity-realism Pareto fronts of conditional image generative models | Pietro Astolfi et.al. | 2406.10429 | null |
2024-06-14 | PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting | Alex Hanson et.al. | 2406.10219 | null |
2024-06-14 | AlignNet: Learning dataset score alignment functions to enable better training of speech quality estimators | Jaden Pieper et.al. | 2406.10205 | null |
2024-06-14 | D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video | Moritz Kappel et.al. | 2406.10078 | null |
2024-06-14 | Vision Language Modeling of Content, Distortion and Appearance for Image Quality Assessment | Fei Zhou et.al. | 2406.09858 | null |
2024-06-14 | Full-reference Point Cloud Quality Assessment Using Spectral Graph Wavelets | Ryosuke Watanabe et.al. | 2406.09762 | null |
2024-06-14 | Compressed Video Quality Enhancement with Temporal Group Alignment and Fusion | Qiang Zhu et.al. | 2406.09693 | null |
2024-06-13 | DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer | Wei-Ting Chen et.al. | 2406.09622 | null |
2024-06-13 | Q-Mamba: On First Exploration of Vision Mamba for Image Quality Assessment | Fengbin Guan et.al. | 2406.09546 | null |
2024-06-13 | Modeling Ambient Scene Dynamics for Free-view Synthesis | Meng-Li Shih et.al. | 2406.09395 | null |
2024-06-14 | WonderWorld: Interactive 3D Scene Generation from a Single Image | Hong-Xing Yu et.al. | 2406.09394 | null |
2024-06-13 | LRM-Zero: Training Large Reconstruction Models with Synthesized Data | Desai Xie et.al. | 2406.09371 | link |
2024-06-13 | CMC-Bench: Towards a New Paradigm of Visual Signal Compression | Chunyi Li et.al. | 2406.09356 | link |
2024-06-13 | StableMaterials: Enhancing Diversity in Material Generation via Semi-Supervised Learning | Giuseppe Vecchio et.al. | 2406.09293 | null |
2024-06-13 | SR-CACO-2: A Dataset for Confocal Fluorescence Microscopy Image Super-Resolution | Soufiane Belharbi et.al. | 2406.09168 | link |
2024-06-13 | Adaptive Cooperative Streaming of Holographic Video Over Wireless Networks: A Proximal Policy Optimization Solution | Wanli Wen et.al. | 2406.08806 | null |
2024-06-13 | Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation | Mingwang Xu et.al. | 2406.08801 | null |
2024-06-13 | FouRA: Fourier Low Rank Adaptation | Shubhankar Borse et.al. | 2406.08798 | null |
2024-06-12 | Exploring Fact Memorization and Style Imitation in LLMs Using QLoRA: An Experimental Study and Quality Assessment Methods | Eugene Vyborov et.al. | 2406.08582 | null |
2024-06-12 | IMFL-AIGC: Incentive Mechanism Design for Federated Learning Empowered by Artificial Intelligence Generated Content | Guangjing Huang et.al. | 2406.08526 | null |
2024-06-12 | DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor | Juncheng Wu et.al. | 2406.08377 | null |
2024-06-12 | WMAdapter: Adding WaterMark Control to Latent Diffusion Models | Hai Ci et.al. | 2406.08337 | null |
2024-06-12 | Guiding In-Context Learning of LLMs through Quality Estimation for Machine Translation | Javad Pourmostafa Roshan Sharami et.al. | 2406.07970 | link |
2024-06-12 | DemosaicFormer: Coarse-to-Fine Demosaicing Network for HybridEVS Camera | Senyan Xu et.al. | 2406.07951 | link |
2024-06-12 | Emotional Conversation: Empowering Talking Faces with Cohesive Expression, Gaze and Pose Generation | Jiadong Liang et.al. | 2406.07895 | null |
2024-06-11 | A PRISMA Driven Systematic Review of Publicly Available Datasets for Benchmark and Model Developments for Industrial Defect Detection | Can Akbas et.al. | 2406.07694 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546 | null |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540 | null |
2024-06-11 | Trim 3D Gaussian Splatting for Accurate Geometry Representation | Lue Fan et.al. | 2406.07499 | null |
2024-06-11 | Textual Similarity as a Key Metric in Machine Translation Quality Estimation | Kun Sun et.al. | 2406.07440 | null |
2024-06-11 | Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance | Ruxin Zheng et.al. | 2406.07399 | null |
2024-06-11 | DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling | Sixian Wang et.al. | 2406.07390 | null |
2024-06-11 | Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment | Takuto Igarashi et.al. | 2406.07280 | null |
2024-06-11 | Accurate estimate of the ESPRESSO fiber-injection losses inferred from integrated field-stabilization images | Tobias M. Schmidt et.al. | 2406.07193 | null |
2024-06-11 | Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation | Yuanhao Zhai et.al. | 2406.06890 | link |
2024-06-11 | A Subjective Quality Evaluation of 3D Mesh with Dynamic Level of Detail in Virtual Reality | Duc Nguyen et.al. | 2406.06888 | null |
2024-06-09 | Latent Diffusion Model-Enabled Real-Time Semantic Communication Considering Semantic Ambiguities and Channel Noises | Jianhua Pei et.al. | 2406.06644 | null |
2024-06-10 | Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation | Peize Sun et.al. | 2406.06525 | link |
2024-06-10 | MVGamba: Unify 3D Content Generation as State Space Sequence Modeling | Xuanyu Yi et.al. | 2406.06367 | null |
2024-06-10 | Federated learning in food research | Zuzanna Fendor et.al. | 2406.06202 | null |
2024-06-10 | Generalized Nested Latent Variable Models for Lossy Coding applied to Wind Turbine Scenarios | Raül Pérez-Gonzalo et.al. | 2406.06165 | null |
2024-06-10 | JenGAN: Stacked Shifted Filters in GAN-Based Speech Synthesis | Hyunjae Cho et.al. | 2406.06111 | null |
2024-06-10 | GAIA: Rethinking Action Quality Assessment for AI-Generated Videos | Zijian Chen et.al. | 2406.06087 | link |
2024-06-10 | FRAG: Frequency Adapting Group for Diffusion Video Editing | Sunjae Yoon et.al. | 2406.06044 | link |
2024-06-12 | MLCM: Multistep Consistency Distillation of Latent Diffusion Model | Qingsong Xie et.al. | 2406.05768 | null |
2024-06-08 | Energy-Efficient Approximate Full Adders Applying Memristive Serial IMPLY Logic For Image Processing | Seyed Erfan Fatemieh et.al. | 2406.05525 | null |
2024-06-08 | Blurry-Consistency Segmentation Framework with Selective Stacking on Differential Interference Contrast 3D Breast Cancer Spheroid | Thanh-Huy Nguyen et.al. | 2406.05349 | null |
2024-06-08 | Deep convolutional demosaicking network for multispectral polarization filter array | Tomoharu Ishiuchi et.al. | 2406.05312 | null |
2024-06-08 | YouTube SFV+HDR Quality Dataset | Yilin Wang et.al. | 2406.05305 | null |
2024-06-07 | Spectral Codecs: Spectrogram-Based Audio Codecs for High Quality Speech Synthesis | Ryan Langman et.al. | 2406.05298 | null |
2024-06-07 | GANetic Loss for Generative Adversarial Networks with a Focus on Medical Applications | Shakhnaz Akhmedova et.al. | 2406.05023 | link |
2024-06-07 | Ada-VE: Training-Free Consistent Video Editing Using Adaptive Motion Prior | Tanvir Mahmud et.al. | 2406.04873 | null |
2024-06-07 | SMC++: Masked Learning of Unsupervised Video Semantic Compression | Yuan Tian et.al. | 2406.04765 | link |
2024-06-07 | The Active Optics System on the Vera C. Rubin Observatory: Optimal Control of Degeneracy Among the Large Number of Degrees of Freedom | Guillem Megias Homar et.al. | 2406.04656 | null |
2024-06-07 | GenzIQA: Generalized Image Quality Assessment using Prompt-Guided Latent Diffusion Models | Diptanu De et.al. | 2406.04654 | null |
2024-06-07 | StreamOptix: A Cross-layer Adaptive Video Delivery Scheme | Mufan Liu et.al. | 2406.04632 | link |
2024-06-07 | Attention Fusion Reverse Distillation for Multi-Lighting Image Anomaly Detection | Yiheng Zhang et.al. | 2406.04573 | null |
2024-06-06 | Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance | Reyhane Askari Hemmat et.al. | 2406.04551 | null |
2024-06-06 | A Versatile Collage Visualization Technique | Zhenyu Wang et.al. | 2406.04008 | null |
2024-06-06 | JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against Diffusion Model Edits | Minzhou Pan et.al. | 2406.03720 | link |
2024-06-06 | Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene Reconstruction | Diwen Wan et.al. | 2406.03697 | link |
2024-06-05 | Anatomy-based quality metric of diffusion-weighted MRI data for accurate derivation of muscle fiber orientation | Nadya Shusharina et.al. | 2406.03560 | null |
2024-06-05 | Globally and Locally Optimized Pannini Projection for High FoV Rendering of 360-degree Images | Falah Jabar et.al. | 2406.03282 | null |
2024-06-05 | FAPNet: An Effective Frequency Adaptive Point-based Eye Tracker | Xiaopeng Lin et.al. | 2406.03177 | null |
2024-06-05 | Dynamic 3D Gaussian Fields for Urban Areas | Tobias Fischer et.al. | 2406.03175 | null |
2024-06-05 | The new Herschel/PACS Point Source Catalogue | Gábor Marton et.al. | 2406.03116 | null |
2024-06-05 | A-Bench: Are LMMs Masters at Evaluating AI-generated Images? | Zicheng Zhang et.al. | 2406.03070 | link |
2024-06-05 | DifAttack++: Query-Efficient Black-Box Adversarial Attack via Hierarchical Disentangled Feature Space in Cross Domain | Jun Liu et.al. | 2406.03017 | link |
2024-06-05 | Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms | Firas Trabelsi et.al. | 2406.02832 | null |
2024-06-04 | ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation | Tianchen Zhao et.al. | 2406.02540 | link |
2024-06-04 | Guiding a Diffusion Model with a Bad Version of Itself | Tero Karras et.al. | 2406.02507 | null |
2024-06-04 | Towards AI-Assisted Sustainable Adaptive Video Streaming Systems: Tutorial and Survey | Reza Farahani et.al. | 2406.02302 | null |
2024-06-04 | I4VGen: Image as Stepping Stone for Text-to-Video Generation | Xiefan Guo et.al. | 2406.02230 | null |
2024-06-04 | OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection | Chenyang Huang et.al. | 2406.01919 | null |
2024-06-04 | Rank-based No-reference Quality Assessment for Face Swapping | Xinghui Zhou et.al. | 2406.01884 | null |
2024-06-03 | Video Coding with Cross-Component Sample Offset | Han Gao et.al. | 2406.01795 | null |
2024-06-03 | DEFT: Efficient Finetuning of Conditional Diffusion Models by Learning the Generalised |
Alexander Denker et.al. | 2406.01781 | null |
2024-06-03 | Enhancing Dynamic CT Image Reconstruction with Neural Fields Through Explicit Motion Regularizers | Pablo Arratia et.al. | 2406.01299 | null |
2024-06-03 | Capsule Enhanced Variational AutoEncoder for Underwater Image Reconstruction | Rita Pucci et.al. | 2406.01294 | null |
2024-06-03 | Dimba: Transformer-Mamba Diffusion Models | Zhengcong Fei et.al. | 2406.01159 | null |
2024-06-03 | Visual Car Brand Classification by Implementing a Synthetic Image Dataset Creation Pipeline | Jan Lippemeier et.al. | 2406.01071 | null |
2024-06-03 | UniQA: Unified Vision-Language Pre-training for Image Quality and Aesthetic Assessment | Hantao Zhou et.al. | 2406.01069 | link |
2024-06-03 | CLIP-Guided Attribute Aware Pretraining for Generalizable Image Quality Assessment | Daekyu Kwon et.al. | 2406.01020 | null |
2024-06-02 | EchoNet-Synthetic: Privacy-preserving Video Generation for Safe Medical Data Sharing | Hadrien Reynaud et.al. | 2406.00808 | link |
2024-06-04 | Unsupervised Contrastive Analysis for Salient Pattern Detection using Conditional Diffusion Models | Cristiano Patrício et.al. | 2406.00772 | link |
2024-06-02 | W-Net: A Facial Feature-Guided Face Super-Resolution Network | Hao Liu et.al. | 2406.00676 | null |
2024-06-01 | Bilateral Guided Radiance Field Processing | Yuehao Wang et.al. | 2406.00448 | null |
2024-06-01 | Localize, Understand, Collaborate: Semantic-Aware Dragging via Intention Reasoner | Xing Cui et.al. | 2406.00432 | null |
2024-06-01 | Hybrid attention structure preserving network for reconstruction of under-sampled OCT images | Zezhao Guo et.al. | 2406.00279 | null |
2024-05-31 | Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis | Chaoyou Fu et.al. | 2405.21075 | null |
2024-05-31 | Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling | Jiatao Gu et.al. | 2405.21048 | null |
2024-05-31 | Tsang's resolution enhancement method for imaging with focused illumination | Alexander Duplinskiy et.al. | 2405.20979 | null |
2024-05-31 | Fourier123: One Image to High-Quality 3D Object Generation with Hybrid Fourier Score Distillation | Shuzhou Yang et.al. | 2405.20669 | link |
2024-05-30 | An Automatic Question Usability Evaluation Toolkit | Steven Moore et.al. | 2405.20529 | link |
2024-05-30 | Can No-Reference Quality-Assessment Methods Serve as Perceptual Losses for Super-Resolution? | Egor Kashkarov et.al. | 2405.20392 | null |
2024-05-30 | CoSy: Evaluating Textual Explanations of Neurons | Laura Kopf et.al. | 2405.20331 | link |
2024-05-31 | NeRF View Synthesis: Subjective Quality Assessment and Objective Metrics Evaluation | Pedro Martin et.al. | 2405.20078 | null |
2024-05-30 | Promptus: Can Prompts Streaming Replace Video Streaming with Stable Diffusion | Jiangkai Wu et.al. | 2405.20032 | null |
2024-06-03 | DP-IQA: Utilizing Diffusion Prior for Blind Image Quality Assessment in the Wild | Honghao Fu et.al. | 2405.19996 | link |
2024-05-29 | CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning | Yiping Wang et.al. | 2405.19547 | null |
2024-05-29 | A Full-duplex Speech Dialogue Scheme Based On Large Language Models | Peng Wang et.al. | 2405.19487 | null |
2024-05-29 | VisTA-SR: Improving the Accuracy and Resolution of Low-Cost Thermal Imaging Cameras for Agriculture | Heesup Yun et.al. | 2405.19413 | null |
2024-05-29 | Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare | Hanwei Zhu et.al. | 2405.19298 | link |
2024-05-29 | A study on the adequacy of common IQA measures for medical images | Anna Breger et.al. | 2405.19224 | link |
2024-05-29 | A study of why we need to reassess full reference image quality assessment with medical images | Anna Breger et.al. | 2405.19097 | null |
2024-05-31 | Benchmarking and Improving Detail Image Caption | Hongyuan Dong et.al. | 2405.19092 | link |
2024-05-29 | Tuning-Free Alignment of Diffusion Models with Direct Noise Optimization | Zhiwei Tang et.al. | 2405.18881 | link |
2024-05-29 | Descriptive Image Quality Assessment in the Wild | Zhiyuan You et.al. | 2405.18842 | null |
2024-05-29 | Opinion-Unaware Blind Image Quality Assessment using Multi-Scale Deep Feature Statistics | Zhangkai Ni et.al. | 2405.18790 | link |
2024-05-28 | Are Image Distributions Indistinguishable to Humans Indistinguishable to Classifiers? | Zebin You et.al. | 2405.18029 | null |
2024-05-30 | Proof of Quality: A Costless Paradigm for Trustless Generative AI Model Inference on Blockchains | Zhenjie Zhang et.al. | 2405.17934 | null |
2024-05-30 | MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization | Tianchen Zhao et.al. | 2405.17873 | null |
2024-05-28 | PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild | Kun Yuan et.al. | 2405.17765 | null |
2024-05-28 | AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval | Sihe Zhang et.al. | 2405.17718 | null |
2024-05-27 | Enhancing Global Sensitivity and Uncertainty Quantification in Medical Image Reconstruction with Monte Carlo Arbitrary-Masked Mamba | Jiahao Huang et.al. | 2405.17659 | null |
2024-05-27 | Partitioned Hankel-based Diffusion Models for Few-shot Low-dose CT Reconstruction | Wenhao Zhang et.al. | 2405.17167 | null |
2024-05-28 | F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting | Xiangyu Sun et.al. | 2405.17083 | null |
2024-05-29 | The Multi-Range Theory of Translation Quality Measurement: MQM scoring models and Statistical Quality Control | Arle Lommel et.al. | 2405.16969 | null |
2024-05-27 | EM Distillation for One-step Diffusion Models | Sirui Xie et.al. | 2405.16852 | null |
2024-05-27 | Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model | Shoma Iwai et.al. | 2405.16817 | link |
2024-05-26 | Coil Reweighting to Suppress Motion Artifacts in Real-Time Exercise Cine Imaging | Chong Chen et.al. | 2405.16715 | null |
2024-05-26 | Deep learning improved autofocus for motion artifact reduction and its application in quantitative susceptibility mapping | Chao Li et.al. | 2405.16664 | null |
2024-05-26 | Looks Too Good To Be True: An Information-Theoretic Analysis of Hallucinations in Generative Restoration Models | Regev Cohen et.al. | 2405.16475 | null |
2024-05-25 | Enhancing Consistency-Based Image Generation via Adversarialy-Trained Classification and Energy-Based Discrimination | Shelly Golan et.al. | 2405.16260 | null |
2024-05-25 | Maintaining and Managing Road Quality:Using MLP and DNN | Makgotso Jacqueline Maotwana et.al. | 2405.16196 | null |
2024-05-25 | Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection | Yun Zhu et.al. | 2405.16178 | null |
2024-05-24 | Diff-DTI: Fast Diffusion Tensor Imaging Using A Feature-Enhanced Joint Diffusion Model | Lang Zhang et.al. | 2405.15830 | null |
2024-05-24 | Erase to Enhance: Data-Efficient Machine Unlearning in MRI Reconstruction | Yuyang Xue et.al. | 2405.15517 | link |
2024-05-24 | Benchmarking Pre-trained Large Language Models' Potential Across Urdu NLP tasks | Munief Hassan Tahir et.al. | 2405.15453 | null |
2024-05-24 | Fieldscale: Locality-Aware Field-based Adaptive Rescaling for Thermal Infrared Image | Hyeonjae Gil et.al. | 2405.15395 | link |
2024-05-24 | CPT-Interp: Continuous sPatial and Temporal Motion Modeling for 4D Medical Image Interpolation | Xia Li et.al. | 2405.15385 | null |
2024-05-24 | Seeing the World through an Antenna's Eye: Reception Quality Visualization Using Incomplete Technical Signal Information | Leif Bergerhoff et.al. | 2405.15253 | null |
2024-05-24 | Improved Distribution Matching Distillation for Fast Image Synthesis | Tianwei Yin et.al. | 2405.14867 | link |
2024-05-23 | Physics-informed Score-based Diffusion Model for Limited-angle Reconstruction of Cardiac Computed Tomography | Shuo Han et.al. | 2405.14770 | null |
2024-05-23 | Convolutional Neural Network Model Observers Discount Signal-like Anatomical Structures During Search in Virtual Digital Breast Tomosynthesis Phantoms | Aditya Jonnalagadda et.al. | 2405.14720 | null |
2024-05-23 | OpFlowTalker: Realistic and Natural Talking Face Generation via Optical Flow Guidance | Shuheng Ge et.al. | 2405.14709 | null |
2024-05-24 | Autoregressive Image Diffusion: Generation of Image Sequence and Application in MRI | Guanxiong Luo et.al. | 2405.14327 | null |
2024-05-23 | Survey on Visual Signal Coding and Processing with Generative Models: Technologies, Standards and Optimization | Zhibo Chen et.al. | 2405.14221 | null |
2024-05-22 | Uncertainty-aware Evaluation of Auxiliary Anomalies with the Expected Anomaly Posterior | Lorenzo Perini et.al. | 2405.13699 | null |
2024-05-22 | Euclid: Early Release Observations -- Programme overview and pipeline for compact- and diffuse-emission photometry | J. -C. Cuillandre et.al. | 2405.13496 | null |
2024-05-25 | Class-Conditional self-reward mechanism for improved Text-to-Image models | Safouane El Ghazouali et.al. | 2405.13473 | link |
2024-05-22 | Comparative Analysis of Hyperspectral Image Reconstruction Using Deep Learning for Agricultural and Biological Applications | Md. Toukir Ahmed et.al. | 2405.13331 | null |
2024-05-21 | Geometric Transformation Uncertainty for Improving 3D Fetal Brain Pose Prediction from Freehand 2D Ultrasound Videos | Jayroop Ramesh et.al. | 2405.13235 | link |
2024-05-24 | Computational Tradeoffs in Image Synthesis: Diffusion, Masked-Token, and Next-Token Prediction | Maciej Kilian et.al. | 2405.13218 | null |
2024-05-21 | NieR: Normal-Based Lighting Scene Rendering | Hongsheng Wang et.al. | 2405.13097 | null |
2024-05-21 | MOSS: Motion-based 3D Clothed Human Synthesis from Monocular Video | Hongsheng Wang et.al. | 2405.12806 | null |
2024-05-21 | Is Dataset Quality Still a Concern in Diagnosis Using Large Foundation Model? | Ziqin Lin et.al. | 2405.12584 | null |
2024-05-20 | Paired Conditional Generative Adversarial Network for Highly Accelerated Liver 4D MRI | Di Xu et.al. | 2405.12357 | null |
2024-05-20 | Deep learning-based hyperspectral image reconstruction for quality assessment of agro-product | Md. Toukir Ahmed et.al. | 2405.12313 | null |
2024-05-20 | GGAvatar: Geometric Adjustment of Gaussian Head Avatar | Xinyang Li et.al. | 2405.11993 | null |
2024-05-20 | On Efficient and Statistical Quality Estimation for Data Annotation | Jan-Christoph Klie et.al. | 2405.11919 | null |
2024-05-20 | ViViD: Video Virtual Try-on using Diffusion Models | Zixun Fang et.al. | 2405.11794 | null |
2024-05-19 | Solar image quality assessment: a proof of concept using Variance of Laplacian method and its application to optical atmospheric condition monitoring | Chu Wing So et.al. | 2405.11490 | null |
2024-05-18 | Sampling Strategies for Mitigating Bias in Face Synthesis Methods | Emmanouil Maragkoudakis et.al. | 2405.11320 | null |
2024-05-18 | Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching | Xingyu Miao et.al. | 2405.11252 | link |
2024-05-18 | Testing the Performance of Face Recognition for People with Down Syndrome | Christian Rathgeb et.al. | 2405.11240 | null |
2024-05-21 | SPOR: A Comprehensive and Practical Evaluation Method for Compositional Generalization in Data-to-Text Generation | Ziyao Xu et.al. | 2405.10650 | link |
2024-05-17 | Simultaneous Deep Learning of Myocardium Segmentation and T2 Quantification for Acute Myocardial Infarction MRI | Yirong Zhou et.al. | 2405.10570 | null |
2024-05-17 | Enhancing Perception Quality in Remote Sensing Image Compression via Invertible Neural Network | Junhui Li et.al. | 2405.10518 | null |
2024-05-16 | Diversity-Aware Sign Language Production through a Pose Encoding Variational Autoencoder | Mohamed Ilyes Lakhal et.al. | 2405.10423 | null |
2024-05-16 | GS-Planner: A Gaussian-Splatting-based Planning Framework for Active High-Fidelity Reconstruction | Rui Jin et.al. | 2405.10142 | null |
2024-05-16 | Semantic Communication via Rate Distortion Perception Bottleneck | Zihe Zhao et.al. | 2405.09995 | null |
2024-05-16 | VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce Marketing | Binghui Chen et.al. | 2405.09985 | null |
2024-05-16 | NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge | Jie Liang et.al. | 2405.09923 | null |
2024-05-16 | DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy Protection | Yuhao Sun et.al. | 2405.09882 | link |
2024-05-15 | Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality Assessment | Xinying Lin et.al. | 2405.09472 | null |
2024-05-16 | Global-Local Image Perceptual Score (GLIPS): Evaluating Photorealistic Quality of AI-Generated Images | Memoona Aziz et.al. | 2405.09426 | null |
2024-05-15 | Application of Gated Recurrent Units for CT Trajectory Optimization | Yuedong Yuan et.al. | 2405.09333 | null |
2024-05-21 | Deep Blur Multi-Model (DeepBlurMM) - a strategy to mitigate the impact of image blur on deep learning model performance in histopathology image analysis | Yujie Xiang et.al. | 2405.09298 | null |
2024-05-15 | Sensitivity Decouple Learning for Image Compression Artifacts Reduction | Li Ma et.al. | 2405.09291 | null |
2024-05-15 | Shacl4Bib: custom validation of library data | Péter Király et.al. | 2405.09177 | null |
2024-05-18 | Scalable Image Coding for Humans and Machines Using Feature Fusion Network | Takahiro Shindo et.al. | 2405.09152 | link |
2024-05-15 | RSHazeDiff: A Unified Fourier-aware Diffusion Model for Remote Sensing Image Dehazing | Jiamei Xiong et.al. | 2405.09083 | link |
2024-05-14 | Chemically peculiar stars on the pre-main sequence | L. Kueß et.al. | 2405.08946 | null |
2024-05-14 | Enhancing Blind Video Quality Assessment with Rich Quality-aware Features | Wei Sun et.al. | 2405.08745 | link |
2024-05-13 | The Lost Melody: Empirical Observations on Text-to-Video Generation From A Storytelling Perspective | Andrew Shin et.al. | 2405.08720 | null |
2024-05-14 | Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs | P. Mas-Buitrago et.al. | 2405.08703 | link |
2024-05-15 | RMT-BVQA: Recurrent Memory Transformer-based Blind Video Quality Assessment for Enhanced Video Content | Tianhao Peng et.al. | 2405.08621 | null |
2024-05-14 | Dual-Branch Network for Portrait Image Quality Assessment | Wei Sun et.al. | 2405.08555 | link |
2024-05-14 | WaterMamba: Visual State Space Model for Underwater Image Enhancement | Meisheng Guan et.al. | 2405.08419 | null |
2024-05-14 | Perivascular space Identification Nnunet for Generalised Usage (PINGU) | Benjamin Sinclair et.al. | 2405.08337 | null |
2024-05-14 | Progressive enhancement and restoration for mural images under low-light and defected conditions based on multi-receptive field strategy | Xiameng Wei et.al. | 2405.08245 | link |
2024-05-13 | Quality of Experience Optimization for Real-time XR Video Transmission with Energy Constraints | Guangjin Pan et.al. | 2405.07689 | null |
2024-05-15 | PRANK: a singular value based noise filtering approach | Francesco Trainotti et.al. | 2405.07578 | null |
2024-05-13 | Dehazing Remote Sensing and UAV Imagery: A Review of Deep Learning, Prior-based, and Hybrid Approaches | Gao Yu Lee et.al. | 2405.07520 | null |
2024-05-12 | Understanding and Evaluating Human Preferences for AI Generated Images with Instruction Tuning | Jiarui Wang et.al. | 2405.07346 | link |
2024-05-12 | PotatoGANs: Utilizing Generative Adversarial Networks, Instance Segmentation, and Explainable AI for Enhanced Potato Disease Identification and Classification | Mohammad Shafiul Alam et.al. | 2405.07332 | link |
2024-05-12 | Stable Signature is Unstable: Removing Image Watermark from Diffusion Models | Yuepeng Hu et.al. | 2405.07145 | null |
2024-05-11 | Large Language Model-aided Edge Learning in Distribution System State Estimation | Renyou Xie et.al. | 2405.06999 | null |
2024-05-15 | Generation of Granular-Balls for Clustering Based on the Principle of Justifiable Granularity | Zihang Jia et.al. | 2405.06904 | null |
2024-05-11 | FineParser: A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment | Jinglin Xu et.al. | 2405.06887 | link |
2024-05-10 | Multi-Object Tracking in the Dark | Xinzhe Wang et.al. | 2405.06600 | link |
2024-05-10 | Compression-Realized Deep Structural Network for Video Quality Enhancement | Hanchi Sun et.al. | 2405.06342 | null |
2024-05-09 | Perceptual Crack Detection for Rendered 3D Textured Meshes | Armin Shafiee Sarvestani et.al. | 2405.06143 | link |
2024-05-09 | Distilling Diffusion Models into Conditional GANs | Minguk Kang et.al. | 2405.05967 | null |
2024-05-09 | How Quality Affects Deep Neural Networks in Fine-Grained Image Classification | Joseph Smith et.al. | 2405.05742 | null |
2024-05-09 | LatentColorization: Latent Diffusion-Based Speaker Video Colorization | Rory Ward et.al. | 2405.05707 | null |
2024-05-09 | SwapTalk: Audio-Driven Talking Face Generation with One-Shot Customization in Latent Space | Zeren Zhang et.al. | 2405.05636 | null |
2024-05-09 | Array SAR 3D Sparse Imaging Based on Regularization by Denoising Under Few Observed Data | Yangyang Wang et.al. | 2405.05565 | null |
2024-05-08 | Imagine Flash: Accelerating Emu Diffusion Models with Backward Distillation | Jonas Kohler et.al. | 2405.05224 | null |
2024-05-08 | Bridging the Gap Between Saliency Prediction and Image Quality Assessment | Kirillov Alexey et.al. | 2405.04997 | link |
2024-05-07 | Remote Diffusion | Kunal Sunil Kasodekar et.al. | 2405.04717 | null |
2024-05-07 | Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications | Markus Hillemann et.al. | 2405.04345 | null |
2024-05-07 | Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation | Dogucan Yaman et.al. | 2405.04327 | null |
2024-05-07 | Cross-IQA: Unsupervised Learning for Image Quality Assessment | Zhen Zhang et.al. | 2405.04311 | null |
2024-05-07 | Sora Detector: A Unified Hallucination Detection for Large Text-to-Video Models | Zhixuan Chu et.al. | 2405.04180 | link |
2024-05-07 | Bridging the Synthetic-to-Authentic Gap: Distortion-Guided Unsupervised Domain Adaptation for Blind Image Quality Assessment | Aobo Li et.al. | 2405.04167 | null |
2024-05-07 | Lossy Compression with Data, Perception, and Classification Constraints | Yuhan Wang et.al. | 2405.04144 | null |
2024-05-07 | Joint Estimation of Identity Verification and Relative Pose for Partial Fingerprints | Xiongjun Guan et.al. | 2405.03959 | link |
2024-05-06 | AI-Driven Frameworks for Enhancing Data Quality in Big Data Ecosystems: Error_Detection, Correction, and Metadata Integration | Widad Elouataoui et.al. | 2405.03870 | null |
2024-05-06 | Accelerated MR Cholangiopancreatography with Deep Learning-based Reconstruction | Jinho Kim et.al. | 2405.03732 | null |
2024-05-06 | All-in-One Deep Learning Framework for MR Image Reconstruction | Geunu Jeong et.al. | 2405.03684 | null |
2024-05-06 | An Image Quality Evaluation and Masking Algorithm Based On Pre-trained Deep Neural Networks | Peng Jia et.al. | 2405.03408 | null |
2024-05-06 | Retinexmamba: Retinex-based Mamba for Low-light Image Enhancement | Jiesong Bai et.al. | 2405.03349 | link |
2024-05-06 | Light-VQA+: A Video Quality Assessment Model for Exposure Correction with Vision-Language Guidance | Xunchu Zhou et.al. | 2405.03333 | link |
2024-05-06 | Multi-Modality Spatio-Temporal Forecasting via Self-Supervised Learning | Jiewen Deng et.al. | 2405.03255 | link |
2024-05-05 | Matten: Video Generation with Mamba-Attention | Yu Gao et.al. | 2405.03025 | null |
2024-05-05 | Design, analysis, and manufacturing of a glass-plastic hybrid minimalist aspheric panoramic annular lens | Shaohua Gao et.al. | 2405.02942 | null |
2024-05-05 | Residual-Conditioned Optimal Transport: Towards Structure-preserving Unpaired and Paired Image Restoration | Xiaole Tang et.al. | 2405.02843 | link |
2024-05-04 | Deep Image Restoration For Image Anti-Forensics | Eren Tahir et.al. | 2405.02751 | link |
2024-05-04 | DiffuseTrace: A Transparent and Flexible Watermarking Scheme for Latent Diffusion Model | Liangqi Lei et.al. | 2405.02696 | null |
2024-05-03 | On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning? | Maxime Zanella et.al. | 2405.02266 | link |
2024-05-01 | Reference-Free Image Quality Metric for Degradation and Reconstruction Artifacts | Han Cui et.al. | 2405.02208 | null |
2024-05-03 | HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2 | Miriam Jäger et.al. | 2405.02005 | null |
2024-05-03 | Report on the AAPM Grand Challenge on deep generative modeling for learning medical image statistics | Rucha Deshpande et.al. | 2405.01822 | null |
2024-05-07 | Towards Inclusive Face Recognition Through Synthetic Ethnicity Alteration | Praveen Kumar Chandaliya et.al. | 2405.01273 | null |
2024-05-02 | Singular Value and Frame Decomposition-based Reconstruction for Atmospheric Tomography | Lukas Weissinger et.al. | 2405.01079 | null |
2024-05-01 | Brighteye: Glaucoma Screening with Color Fundus Photographs based on Vision Transformer | Hui Lin et.al. | 2405.00857 | link |
2024-05-01 | Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models | Xiaoshi Wu et.al. | 2405.00760 | null |
2024-05-01 | Adapting Pretrained Networks for Image Quality Assessment on High Dynamic Range Displays | Andrei Chubarau et.al. | 2405.00670 | link |
2024-05-01 | Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning | Yuxi Xie et.al. | 2405.00451 | link |
2024-04-30 | Fast MRI Reconstruction Using Deep Learning-based Compressed Sensing: A Systematic Review | Mojtaba Safari et.al. | 2405.00241 | link |
2024-04-30 | Charting the Path Forward: CT Image Quality Assessment -- An In-Depth Review | Siyi Xun et.al. | 2405.00075 | null |
2024-04-30 | Beyond MOS: Subjective Image Quality Score Preprocessing Method Based on Perceptual Similarity | Lei Wang et.al. | 2404.19666 | null |
2024-04-30 | Perceptual Constancy Constrained Single Opinion Score Calibration for Image Quality Assessment | Lei Wang et.al. | 2404.19595 | null |
2024-04-30 | Causal Perception Inspired Representation Learning for Trustworthy Image Quality Assessment | Lei Wang et.al. | 2404.19567 | null |
2024-05-04 | Towards Real-world Video Face Restoration: A New Benchmark | Ziyan Chen et.al. | 2404.19500 | null |
2024-04-30 | NeRF-Insert: 3D Local Editing with Multimodal Control Signals | Benet Oriol Sabat et.al. | 2404.19204 | null |
2024-04-30 | Global Search Optics: Automatically Exploring Optimal Solutions to Compact Computational Imaging Systems | Yao Gao et.al. | 2404.19201 | null |
2024-04-30 | Advancing low-field MRI with a universal denoising imaging transformer: Towards fast and high-quality imaging | Zheren Zhu et.al. | 2404.19167 | link |
2024-04-29 | A Comprehensive Rubric for Annotating Pathological Speech | Mario Corrales-Astorgano et.al. | 2404.18851 | null |
2024-04-29 | Autonomous Quality and Hallucination Assessment for Virtual Tissue Staining and Digital Pathology | Luzhe Huang et.al. | 2404.18458 | null |
2024-04-29 | PKU-AIGIQA-4K: A Perceptual Quality Assessment Database for Both Text-to-Image and Image-to-Image AI-Generated Images | Jiquan Yuan et.al. | 2404.18409 | link |
2024-04-29 | G-Refine: A General Quality Refiner for Text-to-Image Generation | Chunyi Li et.al. | 2404.18343 | link |
2024-04-28 | An automated pipeline for computation and analysis of functional ventilation and perfusion lung MRI with matrix pencil decomposition: TrueLung | Orso Pusterla et.al. | 2404.18275 | null |
2024-04-28 | LMM-PCQA: Assisting Point Cloud Quality Assessment with LMM | Zicheng Zhang et.al. | 2404.18203 | link |
2024-04-28 | Assessing Image Quality Using a Simple Generative Representation | Simon Raviv et.al. | 2404.18178 | link |
2024-04-28 | fMRI Exploration of Visual Quality Assessment | Yiming Zhang et.al. | 2404.18162 | null |
2024-04-27 | Quality Estimation with |
Tu Anh Dinh et.al. | 2404.18031 | null |
2024-04-27 | LpQcM: Adaptable Lesion-Quantification-Consistent Modulation for Deep Learning Low-Count PET Image Denoising | Menghua Xia et.al. | 2404.17994 | null |
2024-04-27 | From Optimization to Generalization: Fair Federated Learning against Quality Shift via Inter-Client Sharpness Matching | Nannan Wu et.al. | 2404.17805 | link |
2024-04-27 | Large Multi-modality Model Assisted AI-Generated Image Quality Assessment | Puyi Wang et.al. | 2404.17762 | link |
2024-04-27 | Segmentation Quality and Volumetric Accuracy in Medical Imaging | Zheyuan Zhang et.al. | 2404.17742 | null |
2024-04-27 | Diffusion-Aided Joint Source Channel Coding For High Realism Wireless Image Transmission | Mingyu Yang et.al. | 2404.17736 | link |
2024-04-26 | Attention-aware non-rigid image registration for accelerated MR imaging | Aya Ghoul et.al. | 2404.17621 | link |
2024-04-26 | Low Cost Machine Vision for Insect Classification | Danja Brandt et.al. | 2404.17488 | null |
2024-04-26 | S-IQA Image Quality Assessment With Compressive Sampling | Ronghua Liao et.al. | 2404.17170 | null |
2024-04-25 | ResVR: Joint Rescaling and Viewport Rendering of Omnidirectional Images | Weiqi Li et.al. | 2404.16825 | null |
2024-04-25 | NTIRE 2024 Quality Assessment of AI-Generated Content Challenge | Xiaohong Liu et.al. | 2404.16687 | null |
2024-04-25 | Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior | Han Wang et.al. | 2404.16678 | null |
2024-04-25 | Application of RESNET50 Convolution Neural Network for the Extraction of Optical Parameters in Scattering Media | Bowen Deng et.al. | 2404.16647 | null |
2024-04-25 | COBRA -- COnfidence score Based on shape Regression Analysis for method-independent quality assessment of object pose estimation from single images | Panagiotis Sapoutzoglou et.al. | 2404.16471 | link |
2024-04-25 | PAD: Patch-Agnostic Defense against Adversarial Patch Attacks | Lihua Jing et.al. | 2404.16452 | link |
2024-04-25 | Cross-sensor super-resolution of irregularly sampled Sentinel-2 time series | Aimi Okabayashi et.al. | 2404.16409 | link |
2024-04-24 | AIS 2024 Challenge on Video Quality Assessment of User-Generated Content: Methods and Results | Marcos V. Conde et.al. | 2404.16205 | link |
2024-04-24 | Quantitative Characterization of Retinal Features in Translated OCTA | Rashadul Hasan Badhon et.al. | 2404.16133 | null |
2024-04-24 | Assessment of the quality of a prediction | Roger Sewell et.al. | 2404.15764 | null |
2024-04-24 | A stochastic approach to estimate distribution grid state with confidence regions | Rasmus L. Olsen et.al. | 2404.15722 | null |
2024-04-24 | Deep Learning for Accelerated and Robust MRI Reconstruction: a Review | Reinhard Heckel et.al. | 2404.15692 | null |
2024-04-24 | Neural network-based recognition of multiple nanobubbles in graphene | Subin Kim et.al. | 2404.15658 | null |
2024-04-24 | PriorNet: A Novel Lightweight Network with Multidimensional Interactive Attention for Efficient Image Dehazing | Yutong Chen et.al. | 2404.15638 | null |
2024-04-24 | Direct Zernike Coefficient Prediction from Point Spread Functions and Extended Images using Deep Learning | Yong En Kok et.al. | 2404.15231 | null |
2024-04-23 | Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment | Tianwei Zhou et.al. | 2404.15163 | null |
2024-04-23 | Multi-Modal Prompt Learning on Blind Image Quality Assessment | Wensheng Pan et.al. | 2404.14949 | link |
2024-04-23 | Novel Topological Machine Learning Methodology for Stream-of-Quality Modeling in Smart Manufacturing | Jay Lee et.al. | 2404.14728 | null |
2024-04-22 | Tile-Weighted Rate-Distortion Optimized Packet Scheduling for 360 |
Haopeng Wang et.al. | 2404.14573 | null |
2024-04-25 | Adaptive Local Binary Pattern: A Novel Feature Descriptor for Enhanced Analysis of Kidney Abnormalities in CT Scan Images using ensemble based Machine Learning Approach | Tahmim Hossain et.al. | 2404.14560 | null |
2024-04-22 | Narrative Action Evaluation with Prompt-Guided Multimodal Interaction | Shiyi Zhang et.al. | 2404.14471 | link |
2024-04-22 | CrossScore: Towards Multi-View Image Evaluation and Scoring | Zirui Wang et.al. | 2404.14409 | null |
2024-04-22 | Experimental Validation of Ultrasound Beamforming with End-to-End Deep Learning for Single Plane Wave Imaging | Ryan A. L. Schoop et.al. | 2404.14188 | link |
2024-04-22 | Text in the Dark: Extremely Low-Light Text Image Enhancement | Che-Tsung Lin et.al. | 2404.14135 | null |
2024-04-22 | CRNet: A Detail-Preserving Network for Unified Image Restoration and Enhancement Task | Kangzhen Yang et.al. | 2404.14132 | link |
2024-04-22 | GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting | Hongyun Yu et.al. | 2404.14037 | null |
2024-04-22 | CoFInAl: Enhancing Action Quality Assessment with Coarse-to-Fine Instruction Alignment | Kanglei Zhou et.al. | 2404.13999 | link |
2024-04-22 | SI-FID: Only One Objective Indicator for Evaluating Stitched Images | Xinrui Zhang et.al. | 2404.13905 | null |
2024-04-21 | ArtNeRF: A Stylized Neural Field for 3D-Aware Cartoonized Face Synthesis | Zichen Tang et.al. | 2404.13711 | link |
2024-04-21 | Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribution Gap | Bowen Qu et.al. | 2404.13573 | link |
2024-04-21 | Cell Phone Image-Based Persian Rice Detection and Classification Using Deep Learning Techniques | Mahmood Saeedi kelishami et.al. | 2404.13555 | null |
2024-04-20 | Joint Quality Assessment and Example-Guided Image Processing by Disentangling Picture Appearance from Content | Abhinau K. Venkataramanan et.al. | 2404.13484 | null |
2024-04-20 | Cut-FUNQUE: An Objective Quality Model for Compressed Tone-Mapped High Dynamic Range Videos | Abhinau K. Venkataramanan et.al. | 2404.13452 | null |
2024-04-20 | HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression | Lei Lu et.al. | 2404.13372 | null |
2024-04-20 | PCQA: A Strong Baseline for AIGC Quality Assessment Based on Prompt Condition | Xi Fang et.al. | 2404.13299 | null |
2024-04-20 | Beyond Score Changes: Adversarial Attack on No-Reference Image Quality Assessment from Two Perspectives | Chenxi Yang et.al. | 2404.13277 | null |
2024-04-19 | A New Multi-Picture Architecture for Learned Video Deinterlacing and Demosaicing with Parallel Deformable Convolution and Self-Attention Blocks | Ronglei Ji et.al. | 2404.13018 | link |
2024-04-19 | RadRotator: 3D Rotation of Radiographs with Diffusion Models | Pouria Rouzrokh et.al. | 2404.13000 | null |
2024-04-19 | Nuclei Instance Segmentation of Cryosectioned H&E Stained Histological Images using Triple U-Net Architecture | Zarif Ahmed et.al. | 2404.12986 | null |
2024-04-19 | FlyNeRF: NeRF-Based Aerial Mapping for High-Quality 3D Scene Reconstruction | Maria Dronova et.al. | 2404.12970 | null |
2024-04-19 | 3D Multi-frame Fusion for Video Stabilization | Zhan Peng et.al. | 2404.12887 | null |
2024-04-19 | ELEV-VISION-SAM: Integrated Vision Language and Foundation Model for Automated Estimation of Building Lowest Floor Elevation | Yu-Hsuan Ho et.al. | 2404.12606 | null |
2024-04-18 | Plane-wave compounding with adaptive joint coherence factor weighting | Nikunj Khetan et.al. | 2404.12533 | link |
2024-04-18 | Advancing Applications of Satellite Photogrammetry: Novel Approaches for Built-up Area Modeling and Natural Environment Monitoring using Stereo/Multi-view Satellite Image-derived 3D Data | Shengxi Gui et.al. | 2404.12487 | null |
2024-04-18 | On the Content Bias in Fréchet Video Distance | Songwei Ge et.al. | 2404.12391 | null |
2024-04-18 | Learning the Domain Specific Inverse NUFFT for Accelerated Spiral MRI using Diffusion Models | Trevor J. Chan et.al. | 2404.12361 | null |
2024-04-18 | GraFIQs: Face Image Quality Assessment Using Gradient Magnitudes | Jan Niklas Kolf et.al. | 2404.12203 | link |
2024-04-18 | Ethical-Lens: Curbing Malicious Usages of Open-Source Text-to-Image Models | Yuzhu Cai et.al. | 2404.12104 | null |
2024-04-18 | Seeing Motion at Nighttime with an Event Camera | Haoyue Liu et.al. | 2404.11884 | link |
2024-04-18 | Automated tomographic assessment of structural defects of freeze-dried pharmaceuticals | Patric Müller et.al. | 2404.11867 | null |
2024-04-18 | Multiphoton super-resolution imaging via virtual structured illumination | Sumin Lim et.al. | 2404.11849 | null |
2024-04-17 | Analysis of blurring due to short T2 decay at different resolutions in 23Na MRI | Olga Dergachyova et.al. | 2404.11774 | null |
2024-04-17 | CarcassFormer: An End-to-end Transformer-based Framework for Simultaneous Localization, Segmentation and Classification of Poultry Carcass Defect | Minh Tran et.al. | 2404.11429 | null |
2024-04-17 | Achromatic Full Stokes Polarimetry Metasurface for Full-color Polarization Imaging in the Visible | Yueqiang Hu et.al. | 2404.11415 | null |
2024-04-17 | Toward Understanding the Disagreement Problem in Neural Network Feature Attribution | Niklas Koenen et.al. | 2404.11330 | link |
2024-04-17 | NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results | Xin Li et.al. | 2404.11313 | link |
2024-04-18 | Study on the static detection of ICF target based on muonic X-ray sphere encoded imaging | Dikai Li et.al. | 2404.11278 | null |
2024-04-17 | Optical Image-to-Image Translation Using Denoising Diffusion Models: Heterogeneous Change Detection as a Use Case | João Gabriel Vinholi et.al. | 2404.11243 | null |
2024-04-17 | ONOT: a High-Quality ICAO-compliant Synthetic Mugshot Dataset | Nicolò Di Domenico et.al. | 2404.11236 | null |
2024-04-17 | Deep Portrait Quality Assessment. A NTIRE 2024 Challenge Survey | Nicolas Chahine et.al. | 2404.11159 | link |
2024-04-17 | MaeFuse: Transferring Omni Features with Pretrained Masked Autoencoders for Infrared and Visible Image Fusion via Guided Training | Jiayang Li et.al. | 2404.11016 | null |
2024-04-16 | Efficient Conditional Diffusion Model with Probability Flow Sampling for Image Super-resolution | Yutao Yuan et.al. | 2404.10688 | link |
2024-04-16 | VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time | Sicheng Xu et.al. | 2404.10667 | null |
2024-04-16 | A Computer Vision-Based Quality Assessment Technique for the automatic control of consumables for analytical laboratories | Meriam Zribi et.al. | 2404.10454 | null |
2024-04-16 | OneActor: Consistent Character Generation via Cluster-Conditioned Guidance | Jiahao Wang et.al. | 2404.10267 | null |
2024-04-16 | Diffusion assisted image reconstruction in optoacoustic tomography | M. G. González et.al. | 2404.10239 | null |
2024-04-16 | Novel Method to Estimate Kinetic Microparameters from Dynamic Whole-Body Imaging in Regular-Axial Field-of-View PET Scanners | Kyung-Nam Lee et.al. | 2404.10197 | null |
2024-04-15 | Quality Assessment of Prompts Used in Code Generation | Mohammed Latif Siddiq et.al. | 2404.10155 | null |
2024-04-15 | ANCHOR: LLM-driven News Subject Conditioning for Text-to-Image Synthesis | Aashish Anantha Ramakrishnan et.al. | 2404.10141 | link |
2024-04-15 | Ti-Patch: Tiled Physical Adversarial Patch for no-reference video quality metrics | Victoria Leonenkova et.al. | 2404.09961 | link |
2024-04-15 | The Problem Of Image Super-Resolution, Denoising And Some Image Restoration Methods In Deep Learning Models | Ngoc-Giau Pham et.al. | 2404.09817 | null |
2024-04-15 | Language-Agnostic Modeling of Wikipedia Articles for Content Quality Assessment across Languages | Paramita Das et.al. | 2404.09764 | null |
2024-04-15 | Equipping Diffusion Models with Differentiable Spatial Entropy for Low-Light Image Enhancement | Wenyi Lian et.al. | 2404.09735 | link |
2024-04-15 | AI Competitions and Benchmarks: Dataset Development | Romain Egele et.al. | 2404.09703 | null |
2024-04-15 | Are Large Language Models Reliable Argument Quality Annotators? | Nailia Mirzakhmedova et.al. | 2404.09696 | link |
2024-04-15 | Real-world Instance-specific Image Goal Navigation for Service Robots: Bridging the Domain Gap with Contrastive Learning | Taichi Sakaguchi et.al. | 2404.09645 | null |
2024-04-15 | AI-KD: Towards Alignment Invariant Face Image Quality Assessment Using Knowledge Distillation | Žiga Babnik et.al. | 2404.09555 | link |
2024-04-15 | WiTUnet: A U-Shaped Architecture Integrating CNN and Transformer for Improved Feature Alignment and Local Information Fusion | Bin Wang et.al. | 2404.09533 | link |
2024-04-15 | MarsQE: Semantic-Informed Quality Enhancement for Compressed Martian Image | Chengfeng Liu et.al. | 2404.09433 | null |
2024-04-14 | Exploring Generative AI for Sim2Real in Driving Data Synthesis | Haonan Zhao et.al. | 2404.09111 | null |
2024-04-13 | A Parametric Rate-Distortion Model for Video Transcoding | Maedeh Jamali et.al. | 2404.09029 | null |
2024-04-13 | THQA: A Perceptual Quality Assessment Database for Talking Heads | Yingjie Zhou et.al. | 2404.09003 | link |
2024-04-13 | PNeRV: Enhancing Spatial Consistency via Pyramidal Neural Representation for Videos | Qi Zhao et.al. | 2404.08921 | null |
2024-04-12 | Multi-Branch Generative Models for Multichannel Imaging with an Application to PET/CT Joint Reconstruction | Noel Jeffrey Pinton et.al. | 2404.08748 | null |
2024-04-12 | Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation | Yanhao Zheng et.al. | 2404.08603 | link |
2024-04-12 | Self-Supervised k-Space Regularization for Motion-Resolved Abdominal MRI Using Neural Implicit k-Space Representation | Veronika Spieker et.al. | 2404.08350 | link |
2024-04-11 | Model-based Cleaning of the QUILT-1M Pathology Dataset for Text-Conditional Image Synthesis | Marc Aubreville et.al. | 2404.07676 | link |
2024-04-10 | GoodDrag: Towards Good Practices for Drag Editing with Diffusion Models | Zewei Zhang et.al. | 2404.07206 | null |
2024-04-10 | Adversarial purification for no-reference image-quality metrics: applicability study and new methods | Aleksandr Gushchin et.al. | 2404.06957 | null |
2024-04-10 | Perception-Oriented Video Frame Interpolation via Asymmetric Blending | Guangyang Wu et.al. | 2404.06692 | link |
2024-04-10 | CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge | Yu Ying Chiu et.al. | 2404.06664 | null |
2024-04-09 | Encoder-Quantization-Motion-based Video Quality Metrics | Yixu Chen et.al. | 2404.06620 | null |
2024-04-09 | Low-Cost Generation and Evaluation of Dictionary Example Sentences | Bill Cai et.al. | 2404.06224 | null |
2024-04-09 | Image and Video Compression using Generative Sparse Representation with Fidelity Controls | Wei Jiang et.al. | 2404.06076 | null |
2024-04-09 | Prompt-driven Universal Model for View-Agnostic Echocardiography Analysis | Sekeun Kim et.al. | 2404.05916 | null |
2024-04-06 | Study of the effect of Sharpness on Blind Video Quality Assessment | Anantha Prabhu et.al. | 2404.05764 | null |
2024-04-08 | A Training-Free Plug-and-Play Watermark Framework for Stable Diffusion | Guokai Zhang et.al. | 2404.05607 | null |
2024-04-08 | UniFL: Improve Stable Diffusion via Unified Feedback Learning | Jiacheng Zhang et.al. | 2404.05595 | null |
2024-04-08 | Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance | Dazhong Shen et.al. | 2404.05384 | link |
2024-04-08 | Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt | Zhiqi Huang et.al. | 2404.05331 | null |
2024-04-08 | Progressive Alignment with VLM-LLM Feature to Augment Defect Classification for the ASE Dataset | Chih-Chung Hsu et.al. | 2404.05183 | null |
2024-04-08 | QMix: Quality-aware Learning with Mixed Noise for Robust Retinal Disease Diagnosis | Junlin Hou et.al. | 2404.05169 | null |
2024-04-07 | Data Conditioning for Subsurface Models with Single-Image Generative Adversarial Network (SinGAN) | Lei Liu et.al. | 2404.05068 | null |
2024-04-07 | LOGO: A Long-Form Video Dataset for Group Action Quality Assessment | Shiyi Zhang et.al. | 2404.05029 | link |
2024-04-07 | Dual-Scale Transformer for Large-Scale Single-Pixel Imaging | Gang Qu et.al. | 2404.05001 | link |
2024-04-07 | Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder | Yiyang Ma et.al. | 2404.04916 | null |
2024-04-07 | CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis | Gyeongjin Kang et.al. | 2404.04913 | null |
2024-04-07 | CycleINR: Cycle Implicit Neural Representation for Arbitrary-Scale Volumetric Super-Resolution of Medical Data | Wei Fang et.al. | 2404.04878 | null |
2024-04-07 | Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving | Jinlong Li et.al. | 2404.04804 | null |
2024-04-06 | Convolutional Neural Network Transformer (CNNT) for Fluorescence Microscopy image Denoising with Improved Generalization and Fast Adaptation | Azaan Rehman et.al. | 2404.04726 | null |
2024-04-09 | Computation and Critical Transitions of Rate-Distortion-Perception Functions With Wasserstein Barycenter | Chunhui Chen et.al. | 2404.04681 | null |
2024-04-06 | FastHDRNet: A new efficient method for SDR-to-HDR Translation | Siyuan Tian et.al. | 2404.04483 | null |
2024-04-06 | RoNet: Rotation-oriented Continuous Image Translation | Yi Li et.al. | 2404.04474 | null |
2024-04-05 | Physics-Inspired Synthesized Underwater Image Dataset | Reina Kaneko et.al. | 2404.03998 | null |
2024-04-05 | Towards introspective loop closure in 4D radar SLAM | Maximilian Hilger et.al. | 2404.03940 | null |
2024-04-07 | RaFE: Generative Radiance Fields Restoration | Zhongkai Wu et.al. | 2404.03654 | null |
2024-04-04 | AIGIQA-20K: A Large Database for AI-Generated Image Quality Assessment | Chunyi Li et.al. | 2404.03407 | null |
2024-04-04 | DI-Retinex: Digital-Imaging Retinex Theory for Low-Light Image Enhancement | Shangquan Sun et.al. | 2404.03327 | null |
2024-04-04 | CSR-dMRI: Continuous Super-Resolution of Diffusion MRI with Anatomical Structure-assisted Implicit Neural Representation Learning | Ruoyou Wu et.al. | 2404.03209 | null |
2024-04-02 | Jailbreaking Prompt Attack: A Controllable Adversarial Attack against Diffusion Models | Jiachen Ma et.al. | 2404.02928 | null |
2024-04-03 | Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction | Keyu Tian et.al. | 2404.02905 | link |
2024-04-03 | Optimizing traffic signs and lights visibility for the teleoperation of autonomous vehicles through ROI compression | I. Dror et.al. | 2404.02481 | null |
2024-04-03 | Imaging transformer for MRI denoising with the SNR unit training: enabling generalization across field-strengths, imaging contrasts, and anatomy | Hui Xue et.al. | 2404.02382 | null |
2024-04-02 | DSGNN: A Dual-View Supergrid-Aware Graph Neural Network for Regional Air Quality Estimation | Xin Zhang et.al. | 2404.01975 | null |
2024-04-02 | Event-assisted Low-Light Video Object Segmentation | Hebei Li et.al. | 2404.01945 | link |
2024-04-02 | PATCH -- Psychometrics-AssisTed benCHmarking of Large Language Models: A Case Study of Mathematics Proficiency | Qixiang Fang et.al. | 2404.01799 | link |
2024-04-02 | Super-Resolution Analysis for Landfill Waste Classification | Matias Molina et.al. | 2404.01790 | null |
2024-04-02 | Upsample Guidance: Scale Up Diffusion Models without Training | Juno Hwang et.al. | 2404.01709 | null |
2024-04-02 | Boosting Visual Recognition for Autonomous Driving in Real-world Degradations with Deep Channel Prior | Zhanwen Liu et.al. | 2404.01703 | link |
2024-04-02 | A CT Image Denoising Method with Residual Encoder-Decoder Network | Helena Shawn et.al. | 2404.01553 | null |
2024-04-01 | MagicMirror: Fast and High-Quality Avatar Generation with a Constrained Search Space | Armand Comas-Massagué et.al. | 2404.01296 | null |
2024-04-01 | New infrared camera of the Caucasian Mountain Observatory of the SAI MSU: design, main parameters, and first light | S. G. Zheltoukhov et.al. | 2404.01246 | null |
2024-04-01 | The Rate-Distortion-Perception Trade-off: The Role of Private Randomness | Yassine Hamdi et.al. | 2404.01111 | null |
2024-04-01 | AIGCOIQA2024: Perceptual Quality Assessment of AI Generated Omnidirectional Images | Liu Yang et.al. | 2404.01024 | link |
2024-04-01 | Digital Twins for Supporting AI Research with Autonomous Vehicle Networks | Anıl Gürses et.al. | 2404.00954 | null |
2024-04-01 | Towards Memorization-Free Diffusion Models | Chen Chen et.al. | 2404.00922 | null |
2024-04-01 | Model-Agnostic Human Preference Inversion in Diffusion Models | Jeeyung Kim et.al. | 2404.00879 | null |
2024-03-31 | GAMA-IR: Global Additive Multidimensional Averaging for Fast Image Restoration | Youssef Mansour et.al. | 2404.00807 | null |
2024-03-31 | Personalized Neural Speech Codec | Inseon Jang et.al. | 2404.00791 | null |
2024-04-02 | DRCT: Saving Image Super-resolution away from Information Bottleneck | Chih-Chung Hsu et.al. | 2404.00722 | link |
2024-03-30 | Classification of Short Segment Pediatric Heart Sounds Based on a Transformer-Based Convolutional Neural Network | Md Hassanuzzaman et.al. | 2404.00470 | null |
2024-03-30 | Harmonizing Light and Darkness: A Symphony of Prior-guided Data Synthesis and Adaptive Focus for Nighttime Flare Removal | Lishen Qu et.al. | 2404.00313 | null |
2024-03-30 | Learned Scanpaths Aid Blind Panoramic Video Quality Assessment | Kanglong Fan et.al. | 2404.00252 | link |
2024-03-29 | Evolving Semantic Communication with Generative Model | Shunpu Tang et.al. | 2403.20237 | link |
2024-03-29 | Exploring Pathological Speech Quality Assessment with ASR-Powered Wav2Vec2 in Data-Scarce Context | Tuan Nguyen et.al. | 2403.20184 | null |
2024-03-29 | Unsupervised Tumor-Aware Distillation for Multi-Modal Brain Image Translation | Chuan Huang et.al. | 2403.20168 | link |
2024-03-29 | DerainNeRF: 3D Scene Estimation with Adhesive Waterdrop Removal | Yunhao Li et.al. | 2403.20013 | link |
2024-03-28 | Burst Super-Resolution with Diffusion Models for Improving Perceptual Quality | Kyotaro Tokoro et.al. | 2403.19428 | link |
2024-03-28 | Imperceptible Protection against Style Imitation from Diffusion Models | Namhyuk Ahn et.al. | 2403.19254 | null |
2024-03-28 | DreamSalon: A Staged Diffusion Framework for Preserving Identity-Context in Editable Face Generation | Haonan Lin et.al. | 2403.19235 | null |
2024-03-28 | AAPMT: AGI Assessment Through Prompt and Metric Transformer | Benhao Huang et.al. | 2403.19101 | link |
2024-03-27 | TextCraftor: Your Text Encoder Can be Image Quality Controller | Yanyu Li et.al. | 2403.18978 | null |
2024-03-27 | Breaking the Limitations with Sparse Inputs by Variational Frameworks (BLIss) in Terahertz Super-Resolution 3D Reconstruction | Yiyao Zhang et.al. | 2403.18776 | link |
2024-03-27 | Bringing Textual Prompt to AI-Generated Image Quality Assessment | Bowen Qu et.al. | 2403.18714 | link |
2024-03-27 | qIoV: A Quantum-Driven Internet-of-Vehicles-Based Approach for Environmental Monitoring and Rapid Response Systems | Ankur Nahar et.al. | 2403.18622 | null |
2024-03-27 | Artifact Reduction in 3D and 4D Cone-beam Computed Tomography Images with Deep Learning -- A Review | Mohammadreza Amirian et.al. | 2403.18565 | null |
2024-03-27 | Don't Look into the Dark: Latent Codes for Pluralistic Image Inpainting | Haiwei Chen et.al. | 2403.18186 | null |
2024-03-26 | Pseudo-MRI-Guided PET Image Reconstruction Method Based on a Diffusion Probabilistic Model | Weijie Gan et.al. | 2403.18139 | null |
2024-03-26 | TDIP: Tunable Deep Image Processing, a Real Time Melt Pool Monitoring Solution | Javid Akhavan et.al. | 2403.18117 | null |
2024-03-26 | Cross-system biological image quality enhancement based on the generative adversarial network as a foundation for establishing a multi-institute microscopy cooperative network | Dominik Panek et.al. | 2403.18026 | null |
2024-03-26 | Improving Text-to-Image Consistency via Automatic Prompt Optimization | Oscar Mañas et.al. | 2403.17804 | null |
2024-03-26 | Can patient-specific acquisition protocol improve performance on defect detection task in myocardial perfusion SPECT? | Nu Ri Choi et.al. | 2403.17764 | null |
2024-03-26 | Panonut360: A Head and Eye Tracking Dataset for Panoramic Video | Yutong Xu et.al. | 2403.17708 | null |
2024-03-26 | AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation | Huawei Wei et.al. | 2403.17694 | link |
2024-03-26 | ExpressEdit: Video Editing with Natural Language and Sketching | Bekzat Tilekbay et.al. | 2403.17693 | null |
2024-03-26 | Practical Applications of Advanced Cloud Services and Generative AI Systems in Medical Image Analysis | Jingyu Xu et.al. | 2403.17549 | null |
2024-03-26 | ChatGPT Rates Natural Language Explanation Quality Like Humans: But on Which Scales? | Fan Huang et.al. | 2403.17368 | link |
2024-03-26 | AutoMRISimQA: an automated system for daily quality control of a 3T MRI simulator | Aitang Xing et.al. | 2403.17365 | null |
2024-03-25 | Latency-Aware Generative Semantic Communications with Pre-Trained Diffusion Models | Li Qiao et.al. | 2403.17256 | null |
2024-03-25 | PROSPECT: Precision Robot Spectroscopy Exploration and Characterization Tool | Nathaniel Hanson et.al. | 2403.17232 | null |
2024-03-25 | Comp4D: LLM-Guided Compositional 4D Scene Generation | Dejia Xu et.al. | 2403.16993 | null |
2024-03-25 | Towards Low-Latency and Energy-Efficient Hybrid P2P-CDN Live Video Streaming | Reza Farahani et.al. | 2403.16985 | null |
2024-03-25 | INPC: Implicit Neural Point Clouds for Radiance Field Rendering | Florian Hahlbohm et.al. | 2403.16862 | null |
2024-03-25 | C-arm inverse geometry CT for 3D cardiac chamber mapping | Jordan M. Slagowski et.al. | 2403.16779 | null |
2024-03-25 | FOOL: Addressing the Downlink Bottleneck in Satellite Computing with Neural Feature Compression | Alireza Furutanpey et.al. | 2403.16677 | link |
2024-03-25 | Enhancing Cross-Dataset EEG Emotion Recognition: A Novel Approach with Emotional EEG Style Transfer Network | Yijin Zhou et.al. | 2403.16540 | null |
2024-03-25 | Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework | Ziyao Huang et.al. | 2403.16510 | link |
2024-03-25 | Plaintext-Free Deep Learning for Privacy-Preserving Medical Image Analysis via Frequency Information Embedding | Mengyu Sun et.al. | 2403.16473 | null |
2024-03-25 | Residual Dense Swin Transformer for Continuous Depth-Independent Ultrasound Imaging | Jintong Hu et.al. | 2403.16384 | link |
2024-03-24 | Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis | Atefeh Khoshkhahtinat et.al. | 2403.16258 | null |
2024-03-24 | Passive Screen-to-Camera Communication | Seyed Keyarash Ghiasi et.al. | 2403.16185 | null |
2024-03-24 | Argument Quality Assessment in the Age of Instruction-Following Large Language Models | Henning Wachsmuth et.al. | 2403.16084 | null |
2024-03-23 | An edge detection-based deep learning approach for tear meniscus height measurement | Kesheng Wang et.al. | 2403.15853 | null |
2024-03-22 | Medical Image Data Provenance for Medical Cyber-Physical System | Vijay Kumar et.al. | 2403.15522 | null |
2024-03-22 | Time-efficient, high-resolution 3T whole-brain relaxometry using Cartesian 3D MR-STAT with CSF suppression | Hongyan Liu et.al. | 2403.15379 | link |
2024-03-22 | Ultrasound Imaging based on the Variance of a Diffusion Restoration Model | Yuxin Zhang et.al. | 2403.15316 | link |
2024-03-22 | Subjective Quality Assessment of Compressed Tone-Mapped High Dynamic Range Videos | Abhinau K. Venkataramanan et.al. | 2403.15061 | null |
2024-03-21 | On the exploitation of DCT statistics for cropping detectors | Claudio Vittorio Ragaglia et.al. | 2403.14789 | null |
2024-03-21 | From Handcrafted Features to LLMs: A Brief Survey for Machine Translation Quality Estimation | Haofei Zhao et.al. | 2403.14118 | null |
2024-03-20 | Multi-criteria approach for selecting an explanation from the set of counterfactuals produced by an ensemble of explainers | Ignacy Stępka et.al. | 2403.13940 | link |
2024-03-20 | Towards Learning Contrast Kinetics with Multi-Condition Latent Diffusion Models | Richard Osuala et.al. | 2403.13890 | link |
2024-03-20 | Hierarchical NeuroSymbolic Approach for Action Quality Assessment | Lauren Okamoto et.al. | 2403.13798 | link |
2024-03-20 | Step-Calibrated Diffusion for Biomedical Optical Image Restoration | Yiwei Lyu et.al. | 2403.13680 | link |
2024-03-20 | Defining metric-aware size-shape measures to validate and optimize curved high-order meshes | Guillermo Aparicio-Estrems et.al. | 2403.13528 | null |
2024-03-20 | AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation | Jingkun An et.al. | 2403.13352 | null |
2024-03-20 | Learning Novel View Synthesis from Heterogeneous Low-light Captures | Quan Zheng et.al. | 2403.13337 | null |
2024-03-19 | Super-High-Fidelity Image Compression via Hierarchical-ROI and Adaptive Quantization | Jixiang Luo et.al. | 2403.13030 | null |
2024-03-18 | Invisible Backdoor Attack Through Singular Value Decomposition | Wenmin Chen et.al. | 2403.13018 | null |
2024-03-19 | Toward Sustainable GenAI using Generation Directives for Carbon-Friendly Large Language Model Inference | Baolin Li et.al. | 2403.12900 | null |
2024-03-19 | VisualCritic: Making LMMs Perceive Visual Quality Like Humans | Zhipeng Huang et.al. | 2403.12806 | null |
2024-03-19 | Multi-Dimensional Machine Translation Evaluation: Model Evaluation and Resource for Korean | Dojun Park et.al. | 2403.12666 | link |
2024-03-19 | GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation | Quankai Gao et.al. | 2403.12365 | null |
2024-03-19 | Deep Few-view High-resolution Photon-counting Extremity CT at Halved Dose for a Clinical Trial | Mengzhou Li et.al. | 2403.12331 | null |
2024-03-18 | Motion and temporal B0 shift corrections for quantitative susceptibility mapping (QSM) and R2 mapping using dual-echo spiral navigators and conjugate-phase reconstruction* | Yuguang Meng et.al. | 2403.12230 | null |
2024-03-19 | Generic 3D Diffusion Adapter Using Controlled Multi-View Editing | Hansheng Chen et.al. | 2403.12032 | link |
2024-03-18 | Enhancing Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems | Bo-Han Lu et.al. | 2403.12024 | link |
2024-03-18 | VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model | Qi Zuo et.al. | 2403.12010 | null |
2024-03-19 | Subjective-Aligned Dateset and Metric for Text-to-Video Quality Assessment | Tengchuan Kou et.al. | 2403.11956 | link |
2024-03-18 | HyperColorization: Propagating spatially sparse noisy spectral clues for reconstructing hyperspectral images | M. Kerem Aydin et.al. | 2403.11935 | link |
2024-03-18 | Evaluating Text to Image Synthesis: Survey and Taxonomy of Image Quality Metrics | Sebastian Hartwig et.al. | 2403.11821 | null |
2024-03-18 | Hallucination in Perceptual Metric-Driven Speech Enhancement Networks | George Close et.al. | 2403.11732 | null |
2024-03-18 | FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events | Xiangyuan Wang et.al. | 2403.11662 | link |
2024-03-18 | CRS-Diff: Controllable Generative Remote Sensing Foundation Model | Datao Tang et.al. | 2403.11614 | link |
2024-03-18 | Hierarchical Frequency-based Upsampling and Refining for Compressed Video Quality Enhancement | Qianyu Zhang et.al. | 2403.11556 | null |
2024-03-18 | Fed3DGS: Scalable 3D Gaussian Splatting with Federated Learning | Teppei Suzuki et.al. | 2403.11460 | link |
2024-03-18 | Earth+: on-board satellite imagery compression leveraging historical earth observations | Kuntai Du et.al. | 2403.11434 | null |
2024-03-18 | Defense Against Adversarial Attacks on No-Reference Image Quality Models with Gradient Norm Regularization | Yujia Liu et.al. | 2403.11397 | link |
2024-03-18 | Simulating Wearable Urban Augmented Reality Experiences in VR: Lessons Learnt from Designing Two Future Urban Interfaces | Tram Thi Minh Tran et.al. | 2403.11377 | null |
2024-03-17 | Enhancing Bandwidth Efficiency for Video Motion Transfer Applications using Deep Learning Based Keypoint Prediction | Xue Bai et.al. | 2403.11337 | null |
2024-03-17 | Fidelity-preserving Learning-Based Image Compression: Loss Function and Subjective Evaluation Methodology | Shima Mohammadi et.al. | 2403.11241 | link |
2024-03-17 | Quality-Aware Image-Text Alignment for Real-World Image Quality Assessment | Lorenzo Agnolucci et.al. | 2403.11176 | link |
2024-03-17 | Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model | Dian Zheng et.al. | 2403.11157 | link |
2024-03-17 | Interactive |
Yixiang Mao et.al. | 2403.11155 | null |
2024-03-17 | Hierarchical Generative Network for Face Morphing Attacks | Zuyuan He et.al. | 2403.11101 | null |
2024-03-17 | Endora: Video Generation Models as Endoscopy Simulators | Chenxin Li et.al. | 2403.11050 | null |
2024-03-16 | A Spectrum-based Image Denoising Method with Edge Feature Enhancement | Peter Luvton et.al. | 2403.11036 | null |
2024-03-16 | Quality-Aware Dynamic Resolution Adaptation Framework for Adaptive Video Streaming | Amritha Premkumar et.al. | 2403.10976 | link |
2024-03-16 | A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment | Tianhe Wu et.al. | 2403.10854 | link |
2024-03-16 | MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections | Mude Hui et.al. | 2403.10815 | link |
2024-03-16 | ContourDiff: Unpaired Image Translation with Contour-Guided Diffusion Models | Yuwen Chen et.al. | 2403.10786 | null |
2024-03-15 | Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generation | Anton Pelykh et.al. | 2403.10731 | link |
2024-03-15 | EAGLE: An Edge-Aware Gradient Localization Enhanced Loss for CT Image Reconstruction | Yipeng Sun et.al. | 2403.10695 | link |
2024-03-15 | A General Method to Incorporate Spatial Information into Loss Functions for GAN-based Super-resolution Models | Xijun Wang et.al. | 2403.10589 | null |
2024-03-21 | Deep Bi-directional Attention Network for Image Super-Resolution Quality Assessment | Yixiao Li et.al. | 2403.10406 | null |
2024-03-15 | PASTA: Towards Flexible and Efficient HDR Imaging Via Progressively Aggregated Spatio-Temporal Aligment | Xiaoning Liu et.al. | 2403.10376 | null |
2024-03-15 | CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement | Qiang Zhu et.al. | 2403.10362 | null |
2024-03-15 | Context-Semantic Quality Awareness Network for Fine-Grained Visual Categorization | Qin Xu et.al. | 2403.10298 | null |
2024-03-15 | Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder | Jinseok Kim et.al. | 2403.10255 | null |
2024-03-15 | Perceptual Quality-based Model Training under Annotator Label Uncertainty | Chen Zhou et.al. | 2403.10190 | null |
2024-03-15 | Animate Your Motion: Turning Still Images into Dynamic Videos | Mingxiao Li et.al. | 2403.10179 | null |
2024-03-15 | PQDynamicISP: Dynamically Controlled Image Signal Processor for Any Image Sensors Pursuing Perceptual Quality | Masakazu Yoshimura et.al. | 2403.10091 | null |
2024-03-15 | Learning Physical Dynamics for Object-centric Visual Prediction | Huilin Xu et.al. | 2403.10079 | null |
2024-03-15 | Contrastive Pre-Training with Multi-View Fusion for No-Reference Point Cloud Quality Assessment | Ziyu Shan et.al. | 2403.10066 | null |
2024-03-15 | PAME: Self-Supervised Masked Autoencoder for No-Reference Point Cloud Quality Assessment | Ziyu Shan et.al. | 2403.10061 | null |
2024-03-14 | ProMark: Proactive Diffusion Watermarking for Causal Attribution | Vishal Asnani et.al. | 2403.09914 | null |
2024-03-14 | MultiGripperGrasp: A Dataset for Robotic Grasping from Parallel Jaw Grippers to Dexterous Hands | Luis Felipe Casas Murrilo et.al. | 2403.09841 | null |
2024-03-13 | PICNIQ: Pairwise Comparisons for Natural Image Quality Assessment | Nicolas Chahine et.al. | 2403.09746 | link |
2024-03-14 | Renovating Names in Open-Vocabulary Segmentation Benchmarks | Haiwen Huang et.al. | 2403.09593 | null |
2024-03-14 | 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation | Frank Zhang et.al. | 2403.09439 | null |
2024-03-14 | StainFuser: Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images | Robert Jewsbury et.al. | 2403.09302 | link |
2024-03-20 | D-YOLO a robust framework for object detection in adverse weather conditions | Zihan Chu et.al. | 2403.09233 | null |
2024-03-14 | Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts | Byeongjun Park et.al. | 2403.09176 | link |
2024-03-14 | Dial-insight: Fine-tuning Large Language Models with High-Quality Domain-Specific Data Preventing Capability Collapse | Jianwei Sun et.al. | 2403.09167 | null |
2024-03-15 | NTIRE 2023 Image Shadow Removal Challenge Technical Report: Team IIM_TTI | Yuki Kondo et.al. | 2403.08995 | link |
2024-03-13 | Structural Positional Encoding for knowledge integration in transformer-based medical process monitoring | Christopher Irwin et.al. | 2403.08836 | link |
2024-03-13 | VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis | Enric Corona et.al. | 2403.08764 | null |
2024-03-13 | Clinically Feasible Diffusion Reconstruction for Highly-Accelerated Cardiac Cine MRI | Shihan Qiu et.al. | 2403.08749 | null |
2024-03-14 | GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing | Jing Wu et.al. | 2403.08733 | link |
2024-03-13 | Diffusion-based Iterative Counterfactual Explanations for Fetal Ultrasound Image Quality Assessment | Paraskevas Pegios et.al. | 2403.08700 | null |
2024-03-13 | Do Language Models Care About Text Quality? Evaluating Web-Crawled Corpora Across 11 Languages | Rik van Noord et.al. | 2403.08693 | null |
2024-03-13 | Physics-Guided Inverse Regression for Crop Quality Assessment | David Shulman et.al. | 2403.08653 | null |
2024-03-14 | GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting | Xinjie Zhang et.al. | 2403.08551 | link |
2024-03-13 | Masked Generative Story Transformer with Character Guidance and Caption Augmentation | Christos Papadimitriou et.al. | 2403.08502 | link |
2024-03-13 | Gaussian Splatting in Style | Abhishek Saroha et.al. | 2403.08498 | null |
2024-03-13 | Protocol Optimization for Functional Cardiac CT Imaging Using Noise Emulation in the Raw Data Domain | Zhye Yin et.al. | 2403.08486 | null |
2024-03-13 | PFStorer: Personalized Face Restoration and Super-Resolution | Tuomas Varanka et.al. | 2403.08436 | null |
2024-03-13 | AADNet: Attention aware Demoiréing Network | M Rakesh Reddy et.al. | 2403.08384 | null |
2024-03-13 | StyleDyRF: Zero-shot 4D Style Transfer for Dynamic Neural Radiance Fields | Hongbin Xu et.al. | 2403.08310 | link |
2024-03-13 | IG-FIQA: Improving Face Image Quality Assessment through Intra-class Variance Guidance robust to Inaccurate Pseudo-Labels | Minsoo Kim et.al. | 2403.08256 | null |
2024-03-13 | PNeSM: Arbitrary 3D Scene Stylization via Prompt-Based Neural Style Mapping | Jiafu Chen et.al. | 2403.08252 | null |
2024-03-15 | A Dual-domain Regularization Method for Ring Artifact Removal of X-ray CT | Hongyang Zhu et.al. | 2403.08247 | null |
2024-03-12 | Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation | Shihao Zhao et.al. | 2403.07860 | link |
2024-03-18 | BraSyn 2023 challenge: Missing MRI synthesis and the effect of different learning objectives | Ivo M. Baltruschat et.al. | 2403.07800 | null |
2024-03-12 | Optimizing Negative Prompts for Enhanced Aesthetics and Fidelity in Text-To-Image Generation | Michael Ogezi et.al. | 2403.07605 | null |
2024-03-12 | Learning Correction Errors via Frequency-Self Attention for Blind Image Super-Resolution | Haochen Sun et.al. | 2403.07390 | null |
2024-03-12 | Time-Efficient Light-Field Acquisition Using Coded Aperture and Events | Shuji Habuchi et.al. | 2403.07244 | null |
2024-03-10 | Propensity-score matching analysis in COVID-19-related studies: a method and quality systematic review | Chunhui Gu et.al. | 2403.07023 | null |
2024-03-11 | BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion | Xuan Ju et.al. | 2403.06976 | link |
2024-03-11 | Applicability of oculomics for individual risk prediction: Repeatability and robustness of retinal Fractal Dimension using DART and AutoMorph | Justin Engelmann et.al. | 2403.06950 | null |
2024-03-11 | Monitoring the Venice Lagoon: an IoT Cloud-Based Sensor Nerwork Approach | Filippo Campagnaro et.al. | 2403.06915 | null |
2024-03-11 | COOD: Combined out-of-distribution detection using multiple measures for anomaly & novel class detection in large-scale hierarchical classification | L. E. Hogeweg et.al. | 2403.06874 | null |
2024-03-20 | QUASAR: QUality and Aesthetics Scoring with Advanced Representations | Sergey Kastryulin et.al. | 2403.06866 | null |
2024-03-11 | A Comparative Study of Perceptual Quality Metrics for Audio-driven Talking Head Videos | Weixia Zhang et.al. | 2403.06421 | link |
2024-03-11 | Comparison of No-Reference Image Quality Models via MAP Estimation in Diffusion Latents | Weixia Zhang et.al. | 2403.06406 | null |
2024-03-11 | Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models | Yang Zhang et.al. | 2403.06381 | link |
2024-03-15 | ACM MMSys 2024 Bandwidth Estimation in Real Time Communications Challenge | Sami Khairy et.al. | 2403.06324 | link |
2024-03-10 | Implicit Image-to-Image Schrodinger Bridge for CT Super-Resolution and Denoising | Yuang Wang et.al. | 2403.06069 | null |
2024-03-09 | IOI: Invisible One-Iteration Adversarial Attack on No-Reference Image- and Video-Quality Metrics | Ekaterina Shumitskaya et.al. | 2403.05955 | link |
2024-03-09 | Wavelet-Like Transform-Based Technology in Response to the Call for Proposals on Neural Network-Based Image Coding | Cunhui Dong et.al. | 2403.05937 | null |
2024-03-08 | Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis | Muxi Chen et.al. | 2403.05125 | null |
2024-03-08 | CLIP-Gaze: Towards General Gaze Estimation via Visual-Linguistic Model | Pengwei Yin et.al. | 2403.05124 | null |
2024-03-08 | Spectrum Translation for Refinement of Image Generation (STIG) Based on Contrastive Learning and Spectral Filter Profile | Seokjun Lee et.al. | 2403.05093 | link |
2024-03-08 | Improving Diffusion-Based Generative Models via Approximated Optimal Transport | Daegyu Kim et.al. | 2403.05069 | link |
2024-03-08 | PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts | Zewen Chen et.al. | 2403.04993 | null |
2024-03-08 | StereoDiffusion: Training-Free Stereo Image Generation Using Latent Diffusion Models | Lezhong Wang et.al. | 2403.04965 | link |
2024-03-07 | BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling | Cheng Peng et.al. | 2403.04926 | link |
2024-03-17 | PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation | Junsong Chen et.al. | 2403.04692 | null |
2024-03-07 | A Domain Translation Framework with an Adversarial Denoising Diffusion Model to Generate Synthetic Datasets of Echocardiography Images | Cristiana Tiago et.al. | 2403.04612 | null |
2024-03-08 | Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces | Evangelos Skartados et.al. | 2403.04508 | null |
2024-03-07 | FriendNet: Detection-Friendly Dehazing Network | Yihua Fan et.al. | 2403.04443 | link |
2024-03-07 | MAGR: Manifold-Aligned Graph Regularization for Continual Action Quality Assessment | Kanglei Zhou et.al. | 2403.04398 | null |
2024-03-07 | Self-Evaluation of Large Language Model based on Glass-box Features | Hui Huang et.al. | 2403.04222 | null |
2024-03-06 | Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer | Naifu Xue et.al. | 2403.03736 | null |
2024-03-06 | Development and evaluation of Artificial Intelligence techniques for IoT data quality assessment and curation | Laura Martín et.al. | 2403.03661 | null |
2024-03-06 | A Connector for Integrating NGSI-LD Data into Open Data Portals | Laura Martín et.al. | 2403.03648 | null |
2024-03-06 | Low-Dose CT Image Reconstruction by Fine-Tuning a UNet Pretrained for Gaussian Denoising for the Downstream Task of Image Enhancement | Tim Selig et.al. | 2403.03551 | null |
2024-03-06 | Combined optimization ghost imaging based on random speckle field | Zhiqing Yang et.al. | 2403.03426 | null |
2024-03-06 | DaISy: Diffuser-aided Sub-THz Imaging System | Shao-Hsuan Wu et.al. | 2403.03383 | null |
2024-03-05 | Imaging the event horizon of M87 from space on different timescales* | Anastasia Shlentsova et.al. | 2403.03327 | null |
2024-03-05 | MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets | Hossein Aboutalebi et.al. | 2403.03194 | null |
2024-03-05 | Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity | Hagyeong Lee et.al. | 2403.02944 | link |
2024-03-05 | DIFNet: SAR RFI suppression based on domain invariant features | Fuping Fang et.al. | 2403.02894 | null |
2024-03-05 | Rehabilitation Exercise Quality Assessment through Supervised Contrastive Learning with Hard and Soft Negatives | Mark Karlov et.al. | 2403.02772 | null |
2024-03-04 | Leveraging Anchor-based LiDAR 3D Object Detection via Point Assisted Sample Selection | Shitao Chen et.al. | 2403.01978 | link |
2024-03-04 | Revisiting the dust torus size-luminosity relation based on a uniform reverberation mapping analysis | Amit Kumar Mandal et.al. | 2403.01885 | null |
2024-03-04 | PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis | Zhengyao Lv et.al. | 2403.01852 | link |
2024-03-04 | ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models | Lukas Höllein et.al. | 2403.01807 | link |
2024-03-04 | Development of a near-infrared wide-field integral field unit by ultra-precision diamond cutting | Kosuke Kushibiki et.al. | 2403.01668 | null |
2024-03-04 | Neural Network Assisted Lifting Steps For Improved Fully Scalable Lossy Image Compression in JPEG 2000 | Xinyue Li et.al. | 2403.01647 | link |
2024-03-05 | 3DGStream: On-the-Fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos | Jiakai Sun et.al. | 2403.01444 | link |
2024-03-02 | NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning | Linsheng Chen et.al. | 2403.01325 | link |
2024-03-02 | Beyond Night Visibility: Adaptive Multi-Scale Fusion of Infrared and Visible Images | Shufan Pei et.al. | 2403.01083 | null |
2024-03-02 | LLMCRIT: Teaching Large Language Models to Use Criteria | Weizhe Yuan et.al. | 2403.01069 | link |
2024-03-01 | Near-Real-Time Mueller Polarimetric Image Processing for Neurosurgical Intervention | Stefano Moriconi et.al. | 2403.00893 | null |
2024-03-01 | Gate-set evaluation metrics for closed-loop optimal control on nitrogen-vacancy center ensembles in diamond | Philipp J. Vetter et.al. | 2403.00616 | null |
2024-03-01 | Equilibrium Model with Anisotropy for Model-Based Reconstruction in Magnetic Particle Imaging | Marco Maass et.al. | 2403.00602 | link |
2024-03-01 | Data Quality Assessment: Challenges and Opportunities | Sedir Mohammed et.al. | 2403.00526 | null |
2024-03-01 | Phase retrieval beyond the homogeneous object assumption for X-ray in-line holographic imaging | Jens Lucht et.al. | 2403.00461 | null |
2024-03-01 | An Ordinal Diffusion Model for Generating Medical Images with Different Severity Levels | Shumpei Takezaki et.al. | 2403.00452 | null |
2024-03-01 | Assessing objective quality metrics for JPEG and MPEG point cloud coding | Davi Lazzarotto et.al. | 2403.00410 | null |
2024-03-01 | List-Mode PET Image Reconstruction Using Dykstra-Like Splitting | Kibo Ote et.al. | 2403.00394 | null |
2024-03-01 | Optimization of Array Encoding for Ultrasound Imaging | Jacob Spainhour et.al. | 2403.00289 | link |
2024-03-01 | Deep-learning-based Magnetic Resonance Simultaneous Multislice Imaging Using Holographic Image Decoding | Satoshi Ito et.al. | 2403.00220 | null |
2024-03-03 | RoadRunner - Learning Traversability Estimation for Autonomous Off-road Driving | Jonas Frey et.al. | 2402.19341 | null |
2024-02-29 | Integral field spectroscopy supports atmospheric optics to reveal the finite outer scale of the turbulence | Begoña García-Lorenzo et.al. | 2402.19337 | null |
2024-03-13 | Modular Blind Video Quality Assessment | Wen Wen et.al. | 2402.19276 | link |
2024-02-29 | Training Generative Image Super-Resolution Models by Wavelet-Domain Losses Enables Better Control of Artifacts | Cansu Korkmaz et.al. | 2402.19215 | link |
2024-02-29 | Disentangling representations of retinal images with generative models | Sarah Müller et.al. | 2402.19186 | null |
2024-02-29 | Trajectory Consistency Distillation | Jianbin Zheng et.al. | 2402.19159 | link |
2024-02-29 | Atmospheric Turbulence Removal with Video Sequence Deep Visual Priors | P. Hill et.al. | 2402.19041 | null |
2024-02-28 | Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation | Yuan Ge et.al. | 2402.18191 | link |
2024-02-28 | NiteDR: Nighttime Image De-Raining with Cross-View Sensor Cooperative Learning for Dynamic Driving Scenes | Cidan Shi et.al. | 2402.18172 | link |
2024-03-02 | G4G:A Generic Framework for High Fidelity Talking Face Generation with Fine-grained Intra-modal Alignment | Juan Zhang et.al. | 2402.18122 | null |
2024-02-28 | Improvement Of Audiovisual Quality Estimation Using A Nonlinear Autoregressive Exogenous Neural Network And Bitstream Parameters | Koffi Kossi et.al. | 2402.18056 | null |
2024-02-28 | PolyOculus: Simultaneous Multi-view Image-based Novel View Synthesis | Jason J. Yu et.al. | 2402.17986 | null |
2024-02-28 | Rapid hyperspectral photothermal mid-infrared spectroscopic imaging from sparse data for gynecologic cancer tissue subtyping | Reza Reihanisaransari et.al. | 2402.17960 | null |
2024-02-29 | QN-Mixer: A Quasi-Newton MLP-Mixer Model for Sparse-View CT Reconstruction | Ishak Ayad et.al. | 2402.17951 | null |
2024-02-27 | Accelerated Real-time Cine and Flow under In-magnet Staged Exercise | Preethi Chandrasekaran et.al. | 2402.17877 | null |
2024-02-27 | A Performance Evaluation of Filtered Delay Multiply and Sum Beamforming for Ultrasound Localization Microscopy: Preliminary Results | A. N. Madhavanunni et.al. | 2402.17643 | null |
2024-02-28 | Black-box Adversarial Attacks Against Image Quality Assessment Models | Yu Ran et.al. | 2402.17533 | null |
2024-02-27 | Bit Distribution Study and Implementation of Spatial Quality Map in the JPEG-AI Standardization | Panqi Jia et.al. | 2402.17470 | null |
2024-02-27 | VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction | Jiaqi Lin et.al. | 2402.17427 | null |
2024-02-27 | Sora Generates Videos with Stunning Geometrical Consistency | Xuanyi Li et.al. | 2402.17403 | null |
2024-03-10 | Learning Exposure Correction in Dynamic Scenes | Jin Liu et.al. | 2402.17296 | link |
2024-02-27 | DivAvatar: Diverse 3D Avatar Generation with a Single Prompt | Weijing Tao et.al. | 2402.17292 | null |
2024-03-01 | Advancing Generative Model Evaluation: A Novel Algorithm for Realistic Image Synthesis and Comparison in OCR System | Majid Memari et.al. | 2402.17204 | null |
2024-03-19 | Enhancing Quality of Compressed Images by Mitigating Enhancement Bias Towards Compression Domain | Qunliang Xing et.al. | 2402.17200 | null |
2024-02-27 | SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution | Chengcheng Wang et.al. | 2402.17133 | link |
2024-02-27 | T-HITL Effectively Addresses Problematic Associations in Image Generation and Maintains Overall Visual Quality | Susan Epstein et.al. | 2402.17101 | null |
2024-02-26 | Towards Environmental Preference Based Speech Enhancement For Individualised Multi-Modal Hearing Aids | Jasper Kirton-Wingate et.al. | 2402.16757 | null |
2024-02-29 | MISC: Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model | Chunyi Li et.al. | 2402.16749 | link |
2024-03-04 | Towards Open-ended Visual Quality Comparison | Haoning Wu et.al. | 2402.16641 | null |
2024-02-26 | Distortion-Controlled Dithering with Reduced Recompression Rate | Morriel Kasher et.al. | 2402.16447 | null |
2024-02-26 | Audio-Visual Speech Enhancement in Noisy Environments via Emotion-Based Contextual Cues | Tassadaq Hussain et.al. | 2402.16394 | null |
2024-02-26 | Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech | Szu-Wei Fu et.al. | 2402.16321 | link |
2024-02-24 | Design, Implementation and Analysis of a Compressed Sensing Photoacoustic Projection Imaging System | Markus Haltmeier et.al. | 2402.15750 | null |
2024-02-23 | Benchmarking the Robustness of Panoptic Segmentation for Automated Driving | Yiting Wang et.al. | 2402.15469 | null |
2024-02-23 | Ten computational challenges in human virome studies | Yifan Wu et.al. | 2402.15186 | null |
2024-02-23 | The Surprising Effectiveness of Skip-Tuning in Diffusion Sampling | Jiajun Ma et.al. | 2402.15170 | null |
2024-02-22 | Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis | Willi Menapace et.al. | 2402.14797 | null |
2024-02-25 | Two-stage Cytopathological Image Synthesis for Augmenting Cervical Abnormality Screening | Zhenrong Shen et.al. | 2402.14707 | null |
2024-02-22 | Diffusion Model Based Visual Compensation Guidance and Visual Difference Analysis for No-Reference Image Quality Assessment | Zhaoyang Wang et.al. | 2402.14401 | null |
2024-02-21 | Identifying Unnecessary 3D Gaussians using Clustering for Fast Rendering of 3D Gaussian Splatting | Joongho Jo et.al. | 2402.13827 | null |
2024-02-20 | Layout-to-Image Generation with Localized Descriptions using ControlNet with Cross-Attention Control | Denis Lukovnikov et.al. | 2402.13404 | null |
2024-02-24 | Denoising OCT Images Using Steered Mixture of Experts with Multi-Model Inference | Aytaç Özkan et.al. | 2402.12735 | null |
2024-02-20 | Simpson's Paradox and the Accuracy-Fluency Tradeoff in Translation | Zheng Wei Lim et.al. | 2402.12690 | null |
2024-02-21 | Robust-Wide: Robust Watermarking against Instruction-driven Image Editing | Runyi Hu et.al. | 2402.12688 | link |
2024-02-20 | X-ray multibeam ptychography at up to 20 keV: nano-lithography enhances X-ray nano-imaging | Tang Li et.al. | 2402.12082 | null |
2024-02-19 | A Lightweight Parallel Framework for Blind Image Quality Assessment | Qunyue Huang et.al. | 2402.12043 | null |
2024-02-18 | Self-seeding and Multi-intent Self-instructing LLMs for Generating Intent-aware Information-Seeking dialogs | Arian Askari et.al. | 2402.11633 | link |
2024-02-16 | Path Loss Modeling for RIS-Assisted Wireless System with Direct Link and Elevation Factors | Vinay Kumar Chapala et.al. | 2402.10419 | null |
2024-02-15 | Deep Spectral Meshes: Multi-Frequency Facial Mesh Processing with Graph Neural Networks | Robert Kosk et.al. | 2402.10365 | null |
2024-02-15 | Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community | Arman Isajanyan et.al. | 2402.09872 | link |
2024-02-15 | How to Train Data-Efficient LLMs | Noveen Sachdeva et.al. | 2402.09668 | null |
2024-02-14 | TAI-GAN: A Temporally and Anatomically Informed Generative Adversarial Network for early-to-late frame conversion in dynamic cardiac PET inter-frame motion correction | Xueqi Guo et.al. | 2402.09567 | null |
2024-02-14 | Assessing test artifact quality -- A tertiary study | Huynh Khanh Vi Tran et.al. | 2402.09541 | null |
2024-02-14 | LL-GABR: Energy Efficient Live Video Streaming Using Reinforcement Learning | Adithya Raman et.al. | 2402.09392 | null |
2024-02-14 | Generalized Portrait Quality Assessment | Nicolas Chahine et.al. | 2402.09178 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-09-17 | NSSR-DIL: Null-Shot Image Super-Resolution Using Deep Identity Learning | Sree Rama Vamsidhar S et.al. | 2409.12165 | null |
2024-09-18 | Quantum-like nonlinear interferometry with frequency-engineered classical light | Romain Dalidet et.al. | 2409.12049 | null |
2024-09-19 | Adaptive Selection of Sampling-Reconstruction in Fourier Compressed Sensing | Seongmin Hong et.al. | 2409.11738 | null |
2024-09-17 | Enhancing the Reliability of LiDAR Point Cloud Sampling: A Colorization and Super-Resolution Approach Based on LiDAR-Generated Images | Sier Ha et.al. | 2409.11532 | null |
2024-09-19 | Super Resolution On Global Weather Forecasts | Lawrence Zhang et.al. | 2409.11502 | null |
2024-09-17 | Online 4D Ultrasound-Guided Robotic Tracking Enables 3D Ultrasound Localisation Microscopy with Large Tissue Displacements | Jipeng Yan et.al. | 2409.11391 | null |
2024-09-18 | Single-Layer Learnable Activation for Implicit Neural Representation (SL |
Moein Heidari et.al. | 2409.10836 | null |
2024-09-16 | WaveMixSR-V2: Enhancing Super-resolution with Higher Efficiency | Pranav Jeevan et.al. | 2409.10582 | link |
2024-09-16 | Adaptive Segmentation-Based Initialization for Steered Mixture of Experts Image Regression | Yi-Hsin Li et.al. | 2409.10101 | null |
2024-09-15 | Learning Two-factor Representation for Magnetic Resonance Image Super-resolution | Weifeng Wei et.al. | 2409.09731 | null |
2024-09-14 | Adversarial Deep-Unfolding Network for MA-XRF Super-Resolution on Old Master Paintings Using Minimal Training Data | Herman Verinaz-Jadan et.al. | 2409.09483 | null |
2024-09-17 | Wave-U-Mamba: An End-To-End Framework For High-Quality And Efficient Speech Super Resolution | Yongjoon Lee et.al. | 2409.09337 | null |
2024-09-13 | FB-HyDON: Parameter-Efficient Physics-Informed Operator Learning of Complex PDEs via Hypernetwork and Finite Basis Domain Decomposition | Milad Ramezankhani et.al. | 2409.09207 | null |
2024-09-13 | Optically-Validated Microvascular Phantom for Super-Resolution Ultrasound Imaging | Jaime Parra Raad et.al. | 2409.09031 | null |
2024-09-13 | Test-time Training for Hyperspectral Image Super-resolution | Ke Li et.al. | 2409.08667 | null |
2024-09-13 | Low Complexity DoA-ToA Signature Estimation for Multi-Antenna Multi-Carrier Systems | Chandrashekhar Rai et.al. | 2409.08650 | null |
2024-09-13 | Think Twice Before You Act: Improving Inverse Problem Solving With MCMC | Yaxuan Zhu et.al. | 2409.08551 | null |
2024-09-12 | Learned Compression for Images and Point Clouds | Mateen Ulhaq et.al. | 2409.08376 | link |
2024-09-12 | Mapping the nanoscale optical topological textures with a fiber-integrated plasmonic probe | Yunkun Wu et.al. | 2409.07894 | null |
2024-09-17 | Mesh-based Super-Resolution of Fluid Flows with Multiscale Graph Neural Networks | Shivam Barwey et.al. | 2409.07769 | null |
2024-09-11 | Dual scale Residual-Network for turbulent flow sub grid scale resolving: A prior analysis | Omar Sallam et.al. | 2409.07605 | null |
2024-09-11 | Three-Dimensional, Multimodal Synchrotron Data for Machine Learning Applications | Calum Green et.al. | 2409.07322 | null |
2024-09-11 | CWT-Net: Super-resolution of Histopathology Images Using a Cross-scale Wavelet-based Transformer | Feiyang Jia et.al. | 2409.07092 | null |
2024-09-10 | Lightweight Multiscale Feature Fusion Super-Resolution Network Based on Two-branch Convolution and Transformer | Li Ke et.al. | 2409.06590 | null |
2024-09-10 | Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition | Junzheng Zhang et.al. | 2409.06371 | null |
2024-09-10 | EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation | Nischal Khanal et.al. | 2409.06183 | link |
2024-09-07 | Single-snapshot machine learning for turbulence super resolution | Kai Fukami et.al. | 2409.04923 | null |
2024-09-06 | Empirical Bayesian image restoration by Langevin sampling with a denoising diffusion implicit prior | Charlesquin Kemajou Mbakam et.al. | 2409.04384 | null |
2024-09-06 | Adaptive Super-Resolution Imaging Without Prior Knowledge Using a Programmable Spatial-Mode Sorter | Itay Ozer et.al. | 2409.04323 | null |
2024-09-06 | EigenSR: Eigenimage-Bridged Pre-Trained RGB Learners for Single Hyperspectral Image Super-Resolution | Xi Su et.al. | 2409.04050 | null |
2024-09-05 | Use of triplet loss for facial restoration in low-resolution images | Sebastian Pulgar et.al. | 2409.03530 | null |
2024-09-05 | LMLT: Low-to-high Multi-Level Vision Transformer for Image Super-Resolution | Jeongsoo Kim et.al. | 2409.03516 | link |
2024-09-07 | Real-time Speech Enhancement on Raw Signals with Deep State-space Modeling | Yan Ru Pei et.al. | 2409.03377 | link |
2024-09-05 | Enhancing digital core image resolution using optimal upscaling algorithm: with application to paired SEM images | Shaohua You et.al. | 2409.03265 | null |
2024-09-05 | Perceptual-Distortion Balanced Image Super-Resolution is a Multi-Objective Optimization Problem | Qiwen Zhu et.al. | 2409.03179 | link |
2024-09-04 | Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models | Zhibin Liu et.al. | 2409.02851 | link |
2024-09-04 | Solving Video Inverse Problems Using Image Diffusion Models | Taesung Kwon et.al. | 2409.02574 | null |
2024-09-07 | EarthGen: Generating the World from Top-Down Views | Ansh Sharma et.al. | 2409.01491 | link |
2024-09-02 | DiffEyeSyn: Diffusion-based User-specific Eye Movement Synthesis | Chuhan Jiao et.al. | 2409.01240 | null |
2024-09-02 | Single-photon super-resolved spectroscopy from spatial-mode demultiplexing | Luigi Santamaria Amato et.al. | 2409.01190 | null |
2024-09-02 | SeCo-INR: Semantically Conditioned Implicit Neural Representations for Improved Medical Image Super-Resolution | Mevan Ekanayake et.al. | 2409.01013 | null |
2024-09-01 | DMRA: An Adaptive Line Spectrum Estimation Method through Dynamical Multi-Resolution of Atoms | Mingguang Han et.al. | 2409.00799 | null |
2024-09-01 | Rethinking Image Super-Resolution from Training Data Perspectives | Go Ohtani et.al. | 2409.00768 | link |
2024-09-01 | Attention-Guided Multi-scale Interaction Network for Face Super-Resolution | Xujie Wan et.al. | 2409.00591 | null |
2024-08-30 | HiTSR: A Hierarchical Transformer for Reference-based Super-Resolution | Masoomeh Aslahishahri et.al. | 2408.16959 | link |
2024-08-29 | GameIR: A Large-Scale Synthesized Ground-Truth Dataset for Image Restoration over Gaming Content | Lebin Zhou et.al. | 2408.16866 | null |
2024-08-30 | Beyond MR Image Harmonization: Resolution Matters Too | Savannah P. Hays et.al. | 2408.16562 | null |
2024-08-29 | Super-Resolution works for coastal simulations | Zhi-Song Liu et.al. | 2408.16553 | null |
2024-08-29 | Enhanced Control for Diffusion Bridge in Image Restoration | Conghan Yue et.al. | 2408.16303 | link |
2024-08-28 | ChartEye: A Deep Learning Framework for Chart Information Extraction | Osama Mustafa et.al. | 2408.16123 | null |
2024-08-27 | Multi-Feature Aggregation in Diffusion Models for Enhanced Face Super-Resolution | Marcelo dos Santos et.al. | 2408.15386 | link |
2024-08-22 | 3D Photon Counting CT Image Super-Resolution Using Conditional Diffusion Model | Chuang Niu et.al. | 2408.15283 | null |
2024-08-27 | Histo-Diffusion: A Diffusion Super-Resolution Method for Digital Pathology with Comprehensive Quality Assessment | Xuan Xu et.al. | 2408.15218 | null |
2024-08-27 | A Preliminary Exploration Towards General Image Restoration | Xiangtao Kong et.al. | 2408.15143 | null |
2024-08-27 | Enhancing License Plate Super-Resolution: A Layout-Aware and Character-Driven Approach | Valfride Nascimento et.al. | 2408.15103 | link |
2024-08-26 | Cascaded Temporal Updating Network for Efficient Video Super-Resolution | Hao Li et.al. | 2408.14244 | null |
2024-08-26 | Efficient Active Flow Control Strategy for Confined Square Cylinder Wake Using Deep Learning-Based Surrogate Model and Reinforcement Learning | Meng Zhang et.al. | 2408.14232 | null |
2024-08-25 | Particle-Filtering-based Latent Diffusion for Inverse Problems | Amir Nazemi et.al. | 2408.13868 | null |
2024-08-25 | FreqINR: Frequency Consistency for Implicit Neural Representation with Adaptive DCT Frequency Loss | Meiyi Wei et.al. | 2408.13716 | null |
2024-08-23 | ResSR: A Residual Approach to Super-Resolving Multispectral Images | Haley Duba-Sullivan et.al. | 2408.13225 | null |
2024-08-23 | SIMPLE: Simultaneous Multi-Plane Self-Supervised Learning for Isotropic MRI Restoration from Anisotropic Data | Rotem Benisty et.al. | 2408.13065 | null |
2024-08-22 | A Unified Plug-and-Play Algorithm with Projected Landweber Operator for Split Convex Feasibility Problems | Shuchang Zhang et.al. | 2408.12100 | null |
2024-08-21 | MambaCSR: Dual-Interleaved Scanning for Compressed Image Super-Resolution With SSMs | Yulin Ren et.al. | 2408.11758 | link |
2024-08-21 | Quantum super-resolution microscopy by photon statistics and structured light | Fabio Picariello et.al. | 2408.11654 | null |
2024-08-22 | Phase-Based Approaches for Rapid Construction of Magnetic Fields in NV Magnetometry | Prabhat Anand et.al. | 2408.11069 | null |
2024-08-20 | MambaDS: Near-Surface Meteorological Field Downscaling with Topography Constrained Selective State Space Modeling | Zili Liu et.al. | 2408.10854 | null |
2024-08-19 | Webcam-based Pupil Diameter Prediction Benefits from Upscaling | Vijul Shah et.al. | 2408.10397 | null |
2024-08-19 | ML-CrAIST: Multi-scale Low-high Frequency Information-based Cross black Attention with Image Super-resolving Transformer | Alik Pramanick et.al. | 2408.09940 | link |
2024-08-19 | Harnessing Multi-resolution and Multi-scale Attention for Underwater Image Restoration | Alik Pramanick et.al. | 2408.09912 | link |
2024-08-19 | Predicting Long-term Dynamics of Complex Networks via Identifying Skeleton in Hyperbolic Space | Ruikun Li et.al. | 2408.09845 | link |
2024-08-19 | Implicit Grid Convolution for Multi-Scale Image Super-Resolution | Dongheon Lee et.al. | 2408.09674 | link |
2024-08-18 | Angle of Arrival Estimation with Transformer: A Sparse and Gridless Method with Zero-Shot Capability | Zhaoxuan Zhu et.al. | 2408.09362 | null |
2024-08-17 | Discovery of Limb-Brightening in the Parsec-Scale Jet of NGC 315 through Global VLBI Observations and Its Implications for Jet Models | Jongho Park et.al. | 2408.09069 | null |
2024-08-16 | AI-assisted super-resolution cosmological simulations IV: An emulator for deterministic realizations | Xiaowen Zhang et.al. | 2408.09051 | link |
2024-08-25 | Task-Aware Dynamic Transformer for Efficient Arbitrary-Scale Image Super-Resolution | Tianyi Xu et.al. | 2408.08736 | link |
2024-08-16 | QMambaBSR: Burst Image Super-Resolution with Query State Space Model | Xin Di et.al. | 2408.08665 | null |
2024-08-16 | Reference-free Axial Super-resolution of 3D Microscopy Images using Implicit Neural Representation with a 2D Diffusion Prior | Kyungryun Lee et.al. | 2408.08616 | link |
2024-08-16 | Enhancing Events in Neutrino Telescopes through Deep Learning-Driven Super-Resolution | Felix J. Yu et.al. | 2408.08474 | null |
2024-08-15 | SuperNANO: Enabling Nano-Scale Laser an-ti-counterfeiting Marking and Precision Cutting with Super-Resolution Imaging | Yiduo Chen et.al. | 2408.08455 | null |
2024-08-14 | Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving | Yuqing Wen et.al. | 2408.07605 | null |
2024-08-15 | DIffSteISR: Harnessing Diffusion Prior for Superior Real-world Stereo Image Super-Resolution | Yuanbo Zhou et.al. | 2408.07516 | null |
2024-08-14 | GRFormer: Grouped Residual Self-Attention for Lightweight Single Image Super-Resolution | Yuzhen Li et.al. | 2408.07484 | link |
2024-08-14 | One Step Diffusion-based Super-Resolution with Time-Aware Distillation | Xiao He et.al. | 2408.07476 | link |
2024-08-17 | Deep-sub-cycle attosecond optical pulses | Hongliang Dang et.al. | 2408.07306 | null |
2024-08-13 | Event-Stream Super Resolution using Sigma-Delta Neural Network | Waseem Shariff et.al. | 2408.06968 | null |
2024-08-12 | Palantir: Towards Efficient Super Resolution for Ultra-high-definition Live Streaming | Xinqi Jin et.al. | 2408.06152 | link |
2024-08-12 | Efficient and Scalable Point Cloud Generation with Sparse Point-Voxel Diffusion Models | Ioannis Romanelis et.al. | 2408.06145 | link |
2024-08-11 | SSL: A Self-similarity Loss for Improving Generative Image Super-resolution | Du Chen et.al. | 2408.05713 | link |
2024-08-10 | Content-decoupled Contrastive Learning-based Implicit Degradation Modeling for Blind Image Super-Resolution | Jiang Yuan et.al. | 2408.05440 | null |
2024-08-09 | Kalman-Inspired Feature Propagation for Video Face Super-Resolution | Ruicheng Feng et.al. | 2408.05205 | null |
2024-08-08 | Efficient Single Image Super-Resolution with Entropy Attention and Receptive Field Augmentation | Xiaole Zhao et.al. | 2408.04158 | null |
2024-08-07 | Underwater litter monitoring using consumer-grade aerial-aquatic speedy scanner (AASS) and deep learning based super-resolution reconstruction and detection network | Fan Zhao et.al. | 2408.03564 | null |
2024-08-07 | Monitoring of Hermit Crabs Using drone-captured imagery and Deep Learning based Super-Resolution Reconstruction and Improved YOLOv8 | Fan Zhao et.al. | 2408.03559 | null |
2024-08-06 | SGSR: Structure-Guided Multi-Contrast MRI Super-Resolution via Spatio-Frequency Co-Query Attention | Shaoming Zheng et.al. | 2408.03194 | null |
2024-08-03 | Supervised Image Translation from Visible to Infrared Domain for Object Detection | Prahlad Anand et.al. | 2408.01843 | null |
2024-08-03 | Transformer for seismic image super-resolution | Shiqi Dong et.al. | 2408.01695 | null |
2024-08-03 | Flow Reconstruction Using Spatially Restricted Domains Based on Enhanced Super-Resolution Generative Adversarial Networks | Mustafa Z. Yousif et.al. | 2408.01658 | null |
2024-08-02 | PINNs for Medical Image Analysis: A Survey | Chayan Banerjee et.al. | 2408.01026 | null |
2024-08-01 | Stop-and-go waves reconstruction via iterative refinement | Junyi Ji et.al. | 2408.00941 | null |
2024-08-01 | Exceptional points in SSH-like models with hopping amplitude gradient | David S. Simon et.al. | 2408.00879 | null |
2024-08-01 | Image Super-Resolution with Taylor Expansion Approximation and Large Field Reception | Jiancong Feng et.al. | 2408.00470 | null |
2024-07-31 | Accelerating Image Super-Resolution Networks with Pixel-Level Classification | Jinho Jeong et.al. | 2407.21448 | null |
2024-07-27 | Inverse Problems with Diffusion Models: A MAP Estimation Perspective | Sai bharath chandra Gutha et.al. | 2407.20784 | null |
2024-08-01 | What makes for good morphology representations for spatial omics? | Eduard Chelebian et.al. | 2407.20660 | null |
2024-07-30 | Efficient Channel Estimation for Millimeter Wave and Terahertz Systems Enabled by Integrated Super-resolution Sensing and Communication | Jingran Xu et.al. | 2407.20607 | null |
2024-07-29 | Spatial sub-Rayleigh imaging via structured speckle illumination | Liming Li et.al. | 2407.20460 | null |
2024-08-02 | Deep Learning for Super-resolution Ultrasound Imaging with Spatiotemporal Data | Arthur David Redfern et.al. | 2407.20407 | null |
2024-07-30 | Efficient Face Super-Resolution via Wavelet-based Feature Enhancement Network | Wenjie Li et.al. | 2407.19768 | link |
2024-07-28 | Giant Purcell broadening and Lamb shift for DNA-assembled near-infrared quantum emitters | Sachin Verlekar et.al. | 2407.19513 | null |
2024-07-28 | Perfect Hyperlens | Tao Hou et.al. | 2407.19506 | null |
2024-07-28 | Model-based Super-resolution: Towards a Unified Framework for Super-resolution | Zetao Fei et.al. | 2407.19480 | null |
2024-07-28 | Competition-based Adaptive ReLU for Deep Neural Networks | Junjia Chen et.al. | 2407.19441 | null |
2024-07-27 | Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network | Gang Pan et.al. | 2407.19271 | null |
2024-07-26 | Super Resolution for Renewable Energy Resource Data With Wind From Reanalysis Data (Sup3rWind) and Application to Ukraine | Brandon N. Benton et.al. | 2407.19086 | null |
2024-07-25 | GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution | Jintong Hu et.al. | 2407.18046 | null |
2024-07-24 | Cuboid-Net: A Multi-Branch Convolutional Neural Network for Joint Space-Time Video Super Resolution | Congrui Fu et.al. | 2407.16986 | null |
2024-07-24 | 3DAttGAN: A 3D Attention-based Generative Adversarial Network for Joint Space-Time Video Super-Resolution | Congrui Fu et.al. | 2407.16965 | link |
2024-07-23 | Channel-Partitioned Windowed Attention And Frequency Learning for Single Image Super-Resolution | Dinh Phu Tran et.al. | 2407.16232 | null |
2024-07-23 | Topological Dark Spots of Electric Near Field in Metal Structures | Tong Fu et.al. | 2407.16213 | null |
2024-07-23 | Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems | Sojin Lee et.al. | 2407.16125 | link |
2024-07-22 | High-flexibility reconstruction of small-scale motions in wall turbulence using a generalized zero-shot learning | Haokai Wu et.al. | 2407.15604 | null |
2024-07-22 | Attention Beats Linear for Fast Implicit Neural Representation Generation | Shuyi Zhang et.al. | 2407.15355 | link |
2024-07-22 | ThermalNeRF: Thermal Radiance Fields | Yvette Y. Lin et.al. | 2407.15337 | null |
2024-07-22 | Efficient Multi-disparity Transformer for Light Field Image Super-resolution | Zeke Zexi Hu et.al. | 2407.15329 | null |
2024-07-20 | A New Dataset and Framework for Real-World Blurred Images Super-Resolution | Rui Qin et.al. | 2407.14880 | link |
2024-07-19 | Large Kernel Distillation Network for Efficient Single Image Super-Resolution | Chengxing Xie et.al. | 2407.14340 | link |
2024-07-19 | RealViformer: Investigating Attention for Real-World Video Super-Resolution | Yuehan Zhang et.al. | 2407.13987 | link |
2024-07-18 | MaRINeR: Enhancing Novel Views by Matching Rendered Images with Nearby References | Lukas Bösiger et.al. | 2407.13745 | link |
2024-07-18 | Research on Image Super-Resolution Reconstruction Mechanism based on Convolutional Neural Network | Hao Yan et.al. | 2407.13211 | null |
2024-07-18 | UCIP: A Universal Framework for Compressed Image Super-Resolution using Dynamic Prompt | Xin Li et.al. | 2407.13108 | null |
2024-07-17 | Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients | Dohyung Kim et.al. | 2407.12637 | null |
2024-07-16 | Speckle-based 3D sub-diffraction imaging through a multimode fiber | Zhouping Lyu et.al. | 2407.11796 | null |
2024-07-16 | Deconvolution with a Box | Pedro Felzenszwalb et.al. | 2407.11685 | null |
2024-07-16 | Leveraging Segment Anything Model in Identifying Buildings within Refugee Camps (SAM4Refugee) from Satellite Imagery for Humanitarian Operations | Yunya Gao et.al. | 2407.11381 | link |
2024-07-16 | Zero-Shot Adaptation for Approximate Posterior Sampling of Diffusion Models in Inverse Problems | Yaşar Utku Alçalar et.al. | 2407.11288 | null |
2024-07-14 | Restore-RWKV: Efficient and Effective Medical Image Restoration with RWKV | Zhiwen Yang et.al. | 2407.11087 | link |
2024-07-15 | Spectral Properties of Infinitely Smooth Kernel Matrices in the Single Cluster Limit, with Applications to Multivariate Super-Resolution | Nuha Diab et.al. | 2407.10600 | null |
2024-07-15 | Backdoor Attacks against Image-to-Image Networks | Wenbo Jiang et.al. | 2407.10445 | null |
2024-07-13 | Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors | Wei Shang et.al. | 2407.09919 | link |
2024-07-13 | Fast and Provable Simultaneous Blind Super-Resolution and Demixing for Point Source Signals: Scaled Gradient Descent without Regularization | Jinchi Chen et.al. | 2407.09900 | link |
2024-07-12 | Region Attention Transformer for Medical Image Restoration | Zhiwen Yang et.al. | 2407.09268 | link |
2024-07-12 | Task-driven single-image super-resolution reconstruction of document scans | Maciej Zyrek et.al. | 2407.08993 | null |
2024-07-11 | Global Spatial-Temporal Information-based Residual ConvLSTM for Video Space-Time Super-Resolution | Congrui Fu et.al. | 2407.08466 | null |
2024-07-11 | Wind Power Assessment based on Super-Resolution and Downscaling -- A Comparison of Deep Learning Methods | Luca Schmidt et.al. | 2407.08259 | null |
2024-07-11 | Spatially-Variant Degradation Model for Dataset-free Super-resolution | Shaojie Guo et.al. | 2407.08252 | null |
2024-07-10 | VEnhancer: Generative Space-Time Enhancement for Video Generation | Jingwen He et.al. | 2407.07667 | null |
2024-07-10 | Aging-Resistant Wideband Precoding in 5G and Beyond Using 3D Convolutional Neural Networks | Alejandro Villena-Rodriguez et.al. | 2407.07434 | null |
2024-07-10 | Pairwise Distance Distillation for Unsupervised Real-World Image Super-Resolution | Yuehan Zhang et.al. | 2407.07302 | link |
2024-07-09 | UnmixingSR: Material-aware Network with Unsupervised Unmixing as Auxiliary Task for Hyperspectral Image Super-resolution | Yang Yu et.al. | 2407.06525 | null |
2024-07-08 | Enhancing super-resolution ultrasound localisation through multi-frame deconvolution exploiting spatiotemporal coherence | Su Yan et.al. | 2407.06373 | null |
2024-07-08 | Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis | Emaad Khwaja et.al. | 2407.06079 | null |
2024-07-08 | Self-Prior Guided Mamba-UNet Networks for Medical Image Super-Resolution | Zexin Ji et.al. | 2407.05993 | null |
2024-07-08 | Deform-Mamba Network for MRI Super-Resolution | Zexin Ji et.al. | 2407.05969 | null |
2024-07-08 | HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution | Xiang Zhang et.al. | 2407.05878 | null |
2024-07-08 | Neuromorphic Imaging with Super-Resolution | Pei Zhang et.al. | 2407.05764 | null |
2024-07-07 | Edge-guided and Cross-scale Feature Fusion Network for Efficient Multi-contrast MRI Super-Resolution | Zhiyuan Yang et.al. | 2407.05307 | link |
2024-07-07 | A Hybrid Registration and Fusion Method for Hyperspectral Super-resolution | Kunjing Yang et.al. | 2407.05279 | null |
2024-07-07 | RIS-assisted Coverage Enhancement in mmWave Integrated Sensing and Communication Networks | Xu Gan et.al. | 2407.05249 | null |
2024-07-05 | NSD-DIL: Null-Shot Deblurring Using Deep Identity Learning | Sree Rama Vamsidhar S et.al. | 2407.04815 | null |
2024-07-08 | Super-resolution imaging of nanoscale inhomogeneities in hBN-covered and encapsulated few-layer graphene | Lina Jäckering et.al. | 2407.04565 | null |
2024-07-05 | AnySR: Realizing Image Super-Resolution as Any-Scale, Any-Resource | Wengyi Zhan et.al. | 2407.04241 | link |
2024-07-04 | M^3:Manipulation Mask Manufacturer for Arbitrary-Scale Super-Resolution Mask | Xinyu Yang et.al. | 2407.03695 | null |
2024-07-04 | ASteISR: Adapting Single Image Super-resolution Pre-trained Model for Efficient Stereo Image Super-resolution | Yuanbo Zhou et.al. | 2407.03598 | null |
2024-07-04 | Spatio-Temporal Adaptive Diffusion Models for EEG Super-Resolution in Epilepsy Diagnosis | Tong Zhou et.al. | 2407.03089 | null |
2024-07-03 | Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm and Compiler Co-Design | Gen Li et.al. | 2407.02813 | link |
2024-07-02 | Adversarial Magnification to Deceive Deepfake Detection through Super Resolution | Davide Alessandro Coccomini et.al. | 2407.02670 | link |
2024-07-01 | Broadband planar electromagnetic hyper-lens with uniform magnification in air | Ran Sun et.al. | 2407.02532 | null |
2024-07-04 | Real HSI-MSI-PAN image dataset for the hyperspectral/multi-spectral/panchromatic image fusion and super-resolution fields | Shuangliang Li et.al. | 2407.02387 | link |
2024-07-02 | Efficient Stochastic Differential Equation for DEM Super Resolution with Void Filling | Tongtong Zhang et.al. | 2407.01908 | null |
2024-07-01 | DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models | Chang-Han Yeh et.al. | 2407.01519 | link |
2024-07-02 | Preserving Full Degradation Details for Blind Image Super-Resolution | Hongda Liu et.al. | 2407.01299 | link |
2024-07-01 | DaBiT: Depth and Blur informed Transformer for Joint Refocusing and Super-Resolution | Crispian Morris et.al. | 2407.01230 | null |
2024-06-28 | ASSR-NeRF: Arbitrary-Scale Super-Resolution on Voxel Grid for High-Quality Radiance Fields Reconstruction | Ding-Jiun Huang et.al. | 2406.20066 | null |
2024-06-28 | Neural Differentiable Modeling with Diffusion-Based Super-resolution for Two-Dimensional Spatiotemporal Turbulence | Xiantao Fan et.al. | 2406.20047 | null |
2024-06-28 | CSAKD: Knowledge Distillation with Cross Self-Attention for Hyperspectral and Multispectral Image Fusion | Chih-Chung Hsu et.al. | 2406.19666 | link |
2024-06-28 | Efficient Event Stream Super-Resolution with Recursive Multi-Branch Fusion | Quanmin Liang et.al. | 2406.19640 | link |
2024-06-27 | Shoulder of Dust Rings Formed by Planet-disk Interactions | Jiaqing Bi et.al. | 2406.19438 | null |
2024-06-27 | Super-resolution imaging using super-oscillatory diffractive neural networks | Hang Chen et.al. | 2406.19126 | null |
2024-06-26 | Spatial-temporal Hierarchical Reinforcement Learning for Interpretable Pathology Image Super-Resolution | Wenting Chen et.al. | 2406.18310 | link |
2024-06-30 | V2X Sidelink Positioning in FR1: From Ray-Tracing and Channel Estimation to Bayesian Tracking | Yu Ge et.al. | 2406.17950 | null |
2024-06-25 | Burst Image Super-Resolution with Base Frame Selection | Sanghyun Kim et.al. | 2406.17869 | null |
2024-06-25 | A Near-Field Super-Resolution Network for Accelerating Antenna Characterization | Yuchen Gu et.al. | 2406.17244 | null |
2024-06-24 | DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution | Aiwen Jiang et.al. | 2406.16477 | link |
2024-06-24 | Suppressing Uncertainties in Degradation Estimation for Blind Super-Resolution | Junxiong Lin et.al. | 2406.16459 | null |
2024-06-24 | Improving Generative Adversarial Networks for Video Super-Resolution | Daniel Wen et.al. | 2406.16359 | null |
2024-06-23 | Mamba-based Light Field Super-Resolution with Efficient Subspace Scanning | Ruisheng Gao et.al. | 2406.16083 | null |
2024-06-23 | Gridless Parameter Estimation in Partly Calibrated Rectangular Arrays | Tianyi Liu et.al. | 2406.16041 | null |
2024-06-23 | Learning Accurate and Enriched Features for Stereo Image Super-Resolution | Hu Gao et.al. | 2406.16001 | link |
2024-06-21 | A Generative Machine Learning Approach for Improving Precipitation from Earth System Models | Philipp Hess et.al. | 2406.15026 | null |
2024-06-20 | Zero-Shot Image Denoising for High-Resolution Electron Microscopy | Xuanyu Tian et.al. | 2406.14264 | link |
2024-06-19 | IG-CFAT: An Improved GAN-Based Framework for Effectively Exploiting Transformers in Real-World Image Super-Resolution | Alireza Aghelan et.al. | 2406.13815 | link |
2024-06-19 | Enhance the Image: Super Resolution using Artificial Intelligence in MRI | Ziyu Li et.al. | 2406.13625 | null |
2024-06-19 | EvTexture: Event-driven Texture Enhancement for Video Super-Resolution | Dachun Kai et.al. | 2406.13457 | link |
2024-06-19 | Super-resolution 3D tomography of vector near-fields in dielectric resonators | Bingbing Zhu et.al. | 2406.13171 | null |
2024-06-18 | Structured Detection for Simultaneous Super-Resolution and Optical Sectioning in Laser Scanning Microscopy | Alessandro Zunino et.al. | 2406.12542 | link |
2024-06-18 | LFMamba: Light Field Image Super-Resolution with State Space Model | Wang xia et.al. | 2406.12463 | null |
2024-06-17 | A Dictionary Based Approach for Removing Out-of-Focus Blur | Uditangshu Aurangabadkar et.al. | 2406.11330 | link |
2024-06-16 | Geometric Distortion Guided Transformer for Omnidirectional Image Super-Resolution | Cuixin Yang et.al. | 2406.10869 | null |
2024-06-14 | SatDiffMoE: A Mixture of Estimation Method for Satellite Image Super-resolution with Latent Diffusion Models | Zhaoxu Luo et.al. | 2406.10225 | null |
2024-06-14 | GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors | Xiqian Yu et.al. | 2406.10111 | null |
2024-06-14 | Exact Sparse Representation Recovery in Signal Demixing and Group BLASSO | Marcello Carioni et.al. | 2406.09922 | null |
2024-06-14 | Bayesian Conditioned Diffusion Models for Inverse Problems | Alper Güngör et.al. | 2406.09768 | null |
2024-06-13 | Near-Field Multiuser Communications based on Sparse Arrays | Kangjian Chen et.al. | 2406.09238 | null |
2024-06-13 | SR-CACO-2: A Dataset for Confocal Fluorescence Microscopy Image Super-Resolution | Soufiane Belharbi et.al. | 2406.09168 | link |
2024-06-13 | Microparticle-assisted 2D super resolution virtual image modeling | Arlen Bekirov et.al. | 2406.09060 | null |
2024-06-13 | Blind Super-Resolution via Meta-learning and Markov Chain Monte Carlo Simulation | Jingyuan Xia et.al. | 2406.08896 | link |
2024-06-12 | Pranath Reddy et.al. | 2406.08442 | null | |
2024-06-12 | DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor | Juncheng Wu et.al. | 2406.08377 | null |
2024-06-14 | One-Step Effective Diffusion Network for Real-World Image Super-Resolution | Rongyuan Wu et.al. | 2406.08177 | link |
2024-06-11 | Image Neural Field Diffusion Models | Yinbo Chen et.al. | 2406.07480 | null |
2024-06-11 | Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance | Ruxin Zheng et.al. | 2406.07399 | null |
2024-06-12 | Towards Realistic Data Generation for Real-World Super-Resolution | Long Peng et.al. | 2406.07255 | null |
2024-06-10 | 2DQuant: Low-bit Post-Training Quantization for Image Super-Resolution | Kai Liu et.al. | 2406.06649 | link |
2024-06-10 | Inter-slice Super-resolution of Magnetic Resonance Images by Pre-training and Self-supervised Fine-tuning | Xin Wang et.al. | 2406.05974 | null |
2024-06-09 | Binarized Diffusion Model for Image Super-Resolution | Zheng Chen et.al. | 2406.05723 | link |
2024-06-07 | M2NO: Multiresolution Operator Learning with Multiwavelet-based Algebraic Multigrid Method | Zhihao Li et.al. | 2406.04822 | null |
2024-06-06 | M&M VTO: Multi-Garment Virtual Try-On and Editing | Luyang Zhu et.al. | 2406.04542 | link |
2024-06-06 | Enhancing Weather Predictions: Super-Resolution via Deep Diffusion Models | Jan Martinů et.al. | 2406.04099 | null |
2024-06-06 | Vectorized Conditional Neural Fields: A Framework for Solving Time-dependent Parametric Partial Differential Equations | Jan Hagnberger et.al. | 2406.03919 | link |
2024-06-07 | Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge | Nan Zhang et.al. | 2406.03799 | link |
2024-06-05 | SuperFormer: Volumetric Transformer Architectures for MRI Super-Resolution | Cristhian Forigua et.al. | 2406.03359 | link |
2024-06-01 | CoNO: Complex Neural Operator for Continous Dynamical Physical Systems | Karn Tiwari et.al. | 2406.02597 | null |
2024-06-04 | ReLUs Are Sufficient for Learning Implicit Neural Representations | Joseph Shenouda et.al. | 2406.02529 | link |
2024-06-05 | Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation | Clement Chadebec et.al. | 2406.02347 | link |
2024-06-03 | L-MAGIC: Language Model Assisted Generation of Images with Coherence | Zhipeng Cai et.al. | 2406.01843 | link |
2024-06-03 | PolyCLEAN: When Högbom meets Bayes -- Fast Super-Resolution Imaging with Bayesian MAP Estimation | Adrian Jarret et.al. | 2406.01342 | link |
2024-06-03 | Arctic Sea Ice Image Super-Resolution Based on Multi-Scale Convolution and Dual-Gating Mechanism | Zhaomin Fang et.al. | 2406.01240 | null |
2024-06-02 | Stealing Image-to-Image Translation Models With a Single Query | Nurit Spingarn-Eliezer et.al. | 2406.00828 | null |
2024-06-02 | Multidimensional optical singularities and their applications | Soon Wei Daniel Lim et.al. | 2406.00784 | null |
2024-06-02 | W-Net: A Facial Feature-Guided Face Super-Resolution Network | Hao Liu et.al. | 2406.00676 | null |
2024-06-04 | SuperGaussian: Repurposing Video Models for 3D Super Resolution | Yuan Shen et.al. | 2406.00609 | null |
2024-06-01 | GLCAN: Global-Local Collaborative Auxiliary Network for Local Learning | Feiyu Zhu et.al. | 2406.00446 | null |
2024-06-01 | SpikeMM: Flexi-Magnification of High-Speed Micro-Motions | Baoyue Zhang et.al. | 2406.00383 | null |
2024-06-01 | Hybrid attention structure preserving network for reconstruction of under-sampled OCT images | Zezhao Guo et.al. | 2406.00279 | null |
2024-05-31 | Climate Variable Downscaling with Conditional Normalizing Flows | Christina Winkler et.al. | 2405.20719 | null |
2024-05-30 | Can No-Reference Quality-Assessment Methods Serve as Perceptual Losses for Super-Resolution? | Egor Kashkarov et.al. | 2405.20392 | null |
2024-05-30 | All-In-One Medical Image Restoration via Task-Adaptive Routing | Zhiwen Yang et.al. | 2405.19769 | link |
2024-05-30 | MAE-GAN: A Novel Strategy for Simultaneous Super-resolution Reconstruction and Denoising of Post-stack Seismic Profile | Wenshuo Yu et.al. | 2405.19767 | null |
2024-05-29 | Reconstructing Interpretable Features in Computational Super-Resolution microscopy via Regularized Latent Search | Marzieh Gheisari et.al. | 2405.19112 | null |
2024-05-29 | Single image super-resolution based on trainable feature matching attention network | Qizhou Chen et.al. | 2405.18872 | link |
2024-05-29 | Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching | Yasi Zhang et.al. | 2405.18816 | null |
2024-05-28 | Towards a Sampling Theory for Implicit Neural Representations | Mahrokh Najaf et.al. | 2405.18410 | null |
2024-05-28 | Hyperspectral and multispectral image fusion with arbitrary resolution through self-supervised representations | Ting Wang et.al. | 2405.17818 | null |
2024-05-27 | Fast Samplers for Inverse Problems in Iterative Refinement Models | Kushagra Pandey et.al. | 2405.17673 | null |
2024-05-27 | Does Diffusion Beat GAN in Image Super Resolution? | Denis Kuznedelev et.al. | 2405.17261 | link |
2024-06-02 | PatchScaler: An Efficient Patch-Independent Diffusion Model for Super-Resolution | Yong Liu et.al. | 2405.17158 | link |
2024-05-27 | Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models | Cristina N. Vasconcelos et.al. | 2405.16759 | null |
2024-05-26 | Looks Too Good To Be True: An Information-Theoretic Analysis of Hallucinations in Generative Restoration Models | Regev Cohen et.al. | 2405.16475 | null |
2024-05-25 | BOLD: Boolean Logic Deep Learning | Van Minh Nguyen et.al. | 2405.16339 | null |
2024-05-24 | Visible-frequency hyperbolic plasmon polaritons in a natural van der Waals crystal | Giacomo Venturi et.al. | 2405.15420 | null |
2024-05-29 | Stochastic super-resolution for Gaussian microtextures | Emile Pierret et.al. | 2405.15399 | null |
2024-05-24 | Blaze3DM: Marry Triplane Representation with Diffusion for 3D Medical Inverse Problem Solving | Jia He et.al. | 2405.15241 | null |
2024-05-23 | Universal Robustness via Median Randomized Smoothing for Real-World Super-Resolution | Zakariya Chaouai et.al. | 2405.14934 | null |
2024-05-24 | Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation | Hongxu Jiang et.al. | 2405.14802 | link |
2024-05-23 | Stimulated Raman-induced Beam Focusing | Minhaeng Cho et.al. | 2405.14240 | null |
2024-05-22 | Perceptual Fairness in Image Restoration | Guy Ohayon et.al. | 2405.13805 | null |
2024-05-22 | HR-INR: Continuous Space-Time Video Super-Resolution via Event Camera | Yunfan Lu et.al. | 2405.13389 | null |
2024-05-20 | Hierarchical Neural Operator Transformer with Learnable Frequency-aware Loss Prior for Arbitrary-scale Super-resolution | Xihaier Luo et.al. | 2405.12202 | null |
2024-05-18 | HR Human: Modeling Human Avatars with Triangular Mesh and High-Resolution Textures from Videos | Qifeng Chen et.al. | 2405.11270 | null |
2024-05-17 | AdaWaveNet: Adaptive Wavelet Network for Time Series Analysis | Han Yu et.al. | 2405.11124 | null |
2024-05-27 | Infrared Image Super-Resolution via Lightweight Information Split Network | Shijie Liu et.al. | 2405.10561 | null |
2024-05-16 | RGB Guided ToF Imaging System: A Survey of Deep Learning-based Methods | Xin Qiao et.al. | 2405.10357 | null |
2024-05-16 | Bilateral Event Mining and Complementary for Event Stream Super-Resolution | Zhilin Huang et.al. | 2405.10037 | link |
2024-05-16 | Frequency-Domain Refinement with Multiscale Diffusion for Super Resolution | Xingjian Wang et.al. | 2405.10014 | null |
2024-05-16 | IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation Model | Yongsong Huang et.al. | 2405.09873 | link |
2024-05-15 | Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality Assessment | Xinying Lin et.al. | 2405.09472 | null |
2024-05-15 | Low-Complexity Joint Azimuth-Range-Velocity Estimation for Integrated Sensing and Communication with OFDM Waveform | Jun Zhang et.al. | 2405.09443 | null |
2024-05-15 | Large coordinate kernel attention network for lightweight image super-resolution | Fangwei Hao et.al. | 2405.09353 | null |
2024-05-14 | NAFRSSR: a Lightweight Recursive Network for Efficient Stereo Image Super-Resolution | Yihong Chen et.al. | 2405.08423 | link |
2024-05-23 | Exploring the Low-Pass Filtering Behavior in Image Super-Resolution | Haoyu Deng et.al. | 2405.07919 | link |
2024-05-13 | CDFormer:When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution | Qingguo Liu et.al. | 2405.07648 | link |
2024-05-11 | Semantic Guided Large Scale Factor Remote Sensing Image Super-resolution with Generative Diffusion Prior | Ce Wang et.al. | 2405.07044 | link |
2024-05-11 | Efficient Real-world Image Super-Resolution Via Adaptive Directional Gradient Convolution | Long Peng et.al. | 2405.07023 | link |
2024-05-11 | Incorporating Degradation Estimation in Light Field Spatial Super-Resolution | Zeyu Xiao et.al. | 2405.07012 | null |
2024-05-11 | Super-Resolving Blurry Images with Events | Chi Zhang et.al. | 2405.06918 | null |
2024-05-10 | Machine learning for reconstruction of polarity inversion lines from solar filaments | V. Kisielius et.al. | 2405.06293 | link |
2024-05-07 | Single-antenna 3D localization with nonseparable toroidal pulses | Ren Wang et.al. | 2405.05979 | null |
2024-05-17 | Diag2Diag: Multimodal super-resolution diagnostics for physics discovery with application to fusion | Azarakhsh Jalalvand et.al. | 2405.05908 | null |
2024-05-09 | Multi-Level Feature Fusion Network for Lightweight Stereo Image Super-Resolution | Yunxiang Li et.al. | 2405.05497 | link |
2024-05-08 | HMANet: Hybrid Multi-Axis Aggregation Network for Image Super-Resolution | Shu-Chuan Chu et.al. | 2405.05001 | link |
2024-05-08 | Frequency-Assisted Mamba for Remote Sensing Image Super-Resolution | Yi Xiao et.al. | 2405.04964 | link |
2024-05-08 | Teacher-Student Network for Real-World Face Super-Resolution with Progressive Embedding of Edge Information | Zhilei Liu et.al. | 2405.04778 | null |
2024-05-07 | An Advanced Features Extraction Module for Remote Sensing Image Super-Resolution | Naveed Sultan et.al. | 2405.04595 | null |
2024-05-07 | CloudDiff: Super-resolution ensemble retrieval of cloud properties for all day using the generative diffusion model | Haixia Xiao et.al. | 2405.04483 | null |
2024-05-08 | Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer | Zhuoyi Yang et.al. | 2405.04312 | link |
2024-05-06 | All-in-One Deep Learning Framework for MR Image Reconstruction | Geunu Jeong et.al. | 2405.03684 | null |
2024-05-11 | DVMSR: Distillated Vision Mamba for Efficient Super-Resolution | Xiaoyan Lei et.al. | 2405.03008 | link |
2024-05-05 | I |
Haofei Song et.al. | 2405.02857 | null |
2024-05-05 | Antenna Failure Resilience: Deep Learning-Enabled Robust DOA Estimation with Single Snapshot Sparse Arrays | Ruxin Zheng et.al. | 2405.02788 | link |
2024-05-01 | Reference-Free Image Quality Metric for Degradation and Reconstruction Artifacts | Han Cui et.al. | 2405.02208 | null |
2024-05-03 | Self-Supervised Learning for Real-World Super-Resolution from Dual and Multiple Zoomed Observations | Zhilu Zhang et.al. | 2405.02171 | link |
2024-05-03 | Optical skyrmions from metafibers | Tiantian He et.al. | 2405.01962 | null |
2024-05-05 | TRAMBA: A Hybrid Transformer and Mamba Architecture for Practical Audio and Bone Conduction Speech Super Resolution and Enhancement on Mobile and Wearable Platforms | Yueyuan Sui et.al. | 2405.01242 | null |
2024-05-02 | Single Image Super-Resolution Based on Global-Local Information Synergy | Nianzu Qiao et.al. | 2405.01085 | null |
2024-05-01 | Detail-Enhancing Framework for Reference-Based Image Super-Resolution | Zihan Wang et.al. | 2405.00431 | null |
2024-04-30 | Replica-assisted super-resolution fluorescence imaging in scattering media | Tengfei Wu et.al. | 2404.19734 | null |
2024-05-04 | Towards Real-world Video Face Restoration: A New Benchmark | Ziyan Chen et.al. | 2404.19500 | null |
2024-04-30 | Super-resolution by converting evanescent waves in microsphere to propagating and transfer function from its surface to nano-jet | Y. Ben-Aryeh et.al. | 2404.19333 | null |
2024-04-29 | Swin2-MoSE: A New Single Image Super-Resolution Model for Remote Sensing | Leonardo Rossi et.al. | 2404.18924 | link |
2024-04-27 | Generative Diffusion-based Downscaling for Climate | Robbie A. Watt et.al. | 2404.17752 | link |
2024-04-26 | Federated Learning for Blind Image Super-Resolution | Brian B. Moser et.al. | 2404.17670 | null |
2024-04-26 | One-Shot Image Restoration | Deborah Pereg et.al. | 2404.17426 | null |
2024-04-26 | Simultaneous Tri-Modal Medical Image Fusion and Super-Resolution using Conditional Diffusion Model | Yushen Xu et.al. | 2404.17357 | null |
2024-04-25 | Deep learning-based blind image super-resolution with iterative kernel reconstruction and noise estimation | Hasan F. Ates et.al. | 2404.16564 | link |
2024-04-25 | Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey | Marcos V. Conde et.al. | 2404.16484 | link |
2024-04-25 | Latent Modulated Function for Computational Optimal Continuous Image Representation | Zongyao He et.al. | 2404.16451 | link |
2024-04-25 | Cross-sensor super-resolution of irregularly sampled Sentinel-2 time series | Aimi Okabayashi et.al. | 2404.16409 | link |
2024-04-24 | Deep RAW Image Super-Resolution. A NTIRE 2024 Challenge Survey | Marcos V. Conde et.al. | 2404.16223 | link |
2024-04-26 | A Survey on Visual Mamba | Hanwei Zhang et.al. | 2404.15956 | null |
2024-04-24 | Super-resolution imaging based on active optical intensity interferometry | Lu-Chuan Liu et.al. | 2404.15685 | null |
2024-04-26 | A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-Resolution | Zhixiong Yang et.al. | 2404.15620 | link |
2024-04-23 | Super-resolved CARS by coherent image scanning | Anna Zhitnitsky et.al. | 2404.15094 | null |
2024-04-23 | Canalization-based super-resolution imaging using a single van der Waals layer | Jiahua Duan et.al. | 2404.14876 | null |
2024-04-22 | SwinFuSR: an image fusion-inspired model for RGB-guided thermal image super-resolution | Cyprien Arnold et.al. | 2404.14533 | link |
2024-04-29 | ALMA 2D Super-resolution Imaging of Taurus-Auriga Protoplanetary Disks: Probing Statistical Properties of Disk Substructures | Masayuki Yamaguchi et.al. | 2404.13570 | null |
2024-04-26 | SEGSRNet for Stereo-Endoscopic Image Super-Resolution and Surgical Instrument Segmentation | Mansoor Hayat et.al. | 2404.13330 | null |
2024-04-19 | Single-sample image-fusion upsampling of fluorescence lifetime images | Valentin Kapitány et.al. | 2404.13102 | null |
2024-04-19 | A New Multi-Picture Architecture for Learned Video Deinterlacing and Demosaicing with Parallel Deformable Convolution and Self-Attention Blocks | Ronglei Ji et.al. | 2404.13018 | link |
2024-04-19 | Cross-modal Diffusion Modelling for Super-resolved Spatial Transcriptomics | Xiaofei Wang et.al. | 2404.12973 | null |
2024-04-18 | VideoGigaGAN: Towards Detail-rich Video Super-Resolution | Yiran Xu et.al. | 2404.12388 | null |
2024-04-19 | Multichannel-GaAsP-photomultiplier-based fiber bundle ISM-STED microscope | Marcus Babin et.al. | 2404.12370 | null |
2024-04-18 | Multiphoton super-resolution imaging via virtual structured illumination | Sumin Lim et.al. | 2404.11849 | null |
2024-04-18 | Partial Large Kernel CNNs for Efficient Super-Resolution | Dongheon Lee et.al. | 2404.11848 | link |
2024-04-17 | Training Transformer Models by Wavelet Losses Improves Quantitative and Visual Performance in Single Image Super-Resolution | Cansu Korkmaz et.al. | 2404.11273 | link |
2024-04-16 | Uncertainty Quantification of Super-Resolution Flow Mapping in Liquid Metals using Ultrasound Localization Microscopy | David Weik et.al. | 2404.10840 | null |
2024-04-16 | Efficient Conditional Diffusion Model with Probability Flow Sampling for Image Super-resolution | Yutao Yuan et.al. | 2404.10688 | link |
2024-04-16 | The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report | Bin Ren et.al. | 2404.10343 | link |
2024-04-16 | SRGS: Super-Resolution 3D Gaussian Splatting | Xiang Feng et.al. | 2404.10318 | link |
2024-04-17 | OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model | Runyi Li et.al. | 2404.10312 | null |
2024-04-16 | Little Pilot is Needed for Channel Estimation with Integrated Super-Resolution Sensing and Communication | Jingran Xu et.al. | 2404.10233 | null |
2024-04-15 | The Problem Of Image Super-Resolution, Denoising And Some Image Restoration Methods In Deep Learning Models | Ngoc-Giau Pham et.al. | 2404.09817 | null |
2024-04-15 | NTIRE 2024 Challenge on Image Super-Resolution ( |
Zheng Chen et.al. | 2404.09790 | link |
2024-04-15 | MTKD: Multi-Teacher Knowledge Distillation for Image Super-Resolution | Yuxuan Jiang et.al. | 2404.09571 | null |
2024-04-15 | Super-resolution of biomedical volumes with 2D supervision | Cheng Jiang et.al. | 2404.09425 | null |
2024-04-15 | Differentiable Search for Finding Optimal Quantization Strategy | Lianqiang Li et.al. | 2404.08010 | null |
2024-04-11 | Terahertz imaging super-resolution for documental heritage diagnostics | Danae Antunez Vazquez et.al. | 2404.07798 | null |
2024-04-11 | Near-field reconstruction of periodic structures with superimposed illumination | Jue Wang et.al. | 2404.07763 | null |
2024-04-11 | Deep learning-driven pulmonary arteries and veins segmentation reveals demography-associated pulmonary vasculature anatomy | Yuetan Chu et.al. | 2404.07671 | link |
2024-04-10 | Unfolding ADMM for Enhanced Subspace Clustering of Hyperspectral Images | Xianlu Li et.al. | 2404.07112 | link |
2024-04-09 | Dynamic Deep Learning Based Super-Resolution For The Shallow Water Equations | Maximilian Witte et.al. | 2404.06400 | null |
2024-04-09 | Fortifying Fully Convolutional Generative Adversarial Networks for Image Super-Resolution Using Divergence Measures | Arkaprabha Basu et.al. | 2404.06294 | null |
2024-04-09 | LIPT: Latency-aware Image Processing Transformer | Junbo Qiao et.al. | 2404.06075 | null |
2024-04-09 | Space-Time Video Super-resolution with Neural Operator | Yuantong Zhang et.al. | 2404.06036 | null |
2024-04-09 | Diffusion-Based Point Cloud Super-Resolution for mmWave Radar Data | Kai Luan et.al. | 2404.06012 | null |
2024-04-09 | Resolution enhancement of SOHO/MDI Magnetograms | Ying Qin et.al. | 2404.05968 | null |
2024-04-08 | Nanomolecular OLED Pixelization Enabling Electroluminescent Metasurfaces | Tommaso Marcato et.al. | 2404.05336 | null |
2024-04-07 | Gull: A Generative Multifunctional Audio Codec | Yi Luo et.al. | 2404.04947 | null |
2024-04-07 | Efficient Learnable Collaborative Attention for Single Image Super-Resolution | Yigang Zhao Chaowei Zheng et.al. | 2404.04922 | null |
2024-04-07 | CycleINR: Cycle Implicit Neural Representation for Arbitrary-Scale Volumetric Super-Resolution of Medical Data | Wei Fang et.al. | 2404.04878 | null |
2024-04-07 | Effect of active loop extrusion on the two-contact correlations in the interphase chromosome | Dmitry Starkov et.al. | 2404.04853 | null |
2024-04-07 | Rethinking Diffusion Model for Multi-Contrast MRI Super-Resolution | Guangyuan Li et.al. | 2404.04785 | link |
2024-04-06 | Collaborative Feedback Discriminative Propagation for Video Super-Resolution | Hao Li et.al. | 2404.04745 | link |
2024-04-06 | Power-Efficient Image Storage: Leveraging Super Resolution Generative Adversarial Network for Sustainable Compression and Reduced Carbon Footprint | Ashok Mondal et.al. | 2404.04642 | null |
2024-04-06 | PointSAGE: Mesh-independent superresolution approach to fluid flow predictions | Rajat Sarkar et.al. | 2404.04615 | null |
2024-04-03 | Translation-based Video-to-Video Synthesis | Pratim Saha et.al. | 2404.04283 | null |
2024-04-05 | Real-GDSR: Real-World Guided DSM Super-Resolution via Edge-Enhancing Residual Network | Daniel Panangian et.al. | 2404.03930 | null |
2024-04-05 | The ESPRIT algorithm under high noise: Optimal error scaling and noisy super-resolution | Zhiyan Ding et.al. | 2404.03885 | null |
2024-04-04 | AdaBM: On-the-Fly Adaptive Bit Mapping for Image Super-Resolution | Cheeun Hong et.al. | 2404.03296 | link |
2024-04-04 | CSR-dMRI: Continuous Super-Resolution of Diffusion MRI with Anatomical Structure-assisted Implicit Neural Representation Learning | Ruoyou Wu et.al. | 2404.03209 | null |
2024-04-04 | Quantum enhanced mechanical rotation sensing using wavefront photonic gears | Ofir Yesharim et.al. | 2404.02797 | null |
2024-04-03 | GenN2N: Generative NeRF2NeRF Translation | Xiangyue Liu et.al. | 2404.02788 | null |
2024-04-03 | Two-Stage Super-Resolution Simulation Method for Three-Dimensional Flow Fields Around Buildings for Real-Time Prediction of Urban Micrometeorology | Yuki Yasuda et.al. | 2404.02631 | link |
2024-04-03 | Knowledge Distillation with Multi-granularity Mixture of Priors for Image Super-Resolution | Simiao Li et.al. | 2404.02573 | null |
2024-04-02 | Super-Resolution Analysis for Landfill Waste Classification | Matias Molina et.al. | 2404.01790 | null |
2024-04-03 | AddSR: Accelerating Diffusion-based Blind Super-Resolution with Adversarial Diffusion Distillation | Rui Xie et.al. | 2404.01717 | null |
2024-04-04 | Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss | Jaeha Kim et.al. | 2404.01692 | link |
2024-04-02 | RefQSR: Reference-based Quantization for Image Super-Resolution Networks | Hongjae Lee et.al. | 2404.01690 | null |
2024-04-01 | Video Interpolation with Diffusion Models | Siddhant Jain et.al. | 2404.01203 | null |
2024-04-01 | DiSR-NeRF: Diffusion-Guided View-Consistent Super-Resolution NeRF | Jie Long Lee et.al. | 2404.00874 | link |
2024-04-02 | DRCT: Saving Image Super-resolution away from Information Bottleneck | Chih-Chung Hsu et.al. | 2404.00722 | link |
2024-03-31 | DeeDSR: Towards Real-World Image Super-Resolution via Degradation-Aware Stable Diffusion | Chunyang Bi et.al. | 2404.00661 | null |
2024-03-30 | SGDFormer: One-stage Transformer-based Architecture for Cross-Spectral Stereo Image Guided Denoising | Runmin Zhang et.al. | 2404.00349 | null |
2024-03-30 | Exploiting Self-Supervised Constraints in Image Super-Resolution | Gang Wu et.al. | 2404.00260 | link |
2024-04-03 | Burst Super-Resolution with Diffusion Models for Improving Perceptual Quality | Kyotaro Tokoro et.al. | 2403.19428 | link |
2024-03-28 | Structured illumination microscopy with extreme ultraviolet pulses | R. Mincigrucci et.al. | 2403.19382 | null |
2024-03-27 | Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D | Mukund Varma T et.al. | 2403.18922 | null |
2024-03-27 | Breaking the Limitations with Sparse Inputs by Variational Frameworks (BLIss) in Terahertz Super-Resolution 3D Reconstruction | Yiyao Zhang et.al. | 2403.18776 | link |
2024-03-27 | Ship in Sight: Diffusion Models for Ship-Image Super Resolution | Luigi Sigillo et.al. | 2403.18370 | link |
2024-03-27 | Super-Resolution of SOHO/MDI Magnetograms of Solar Active Regions Using SDO/HMI Data and an Attention-Aided Convolutional Neural Network | Chunhui Xu et.al. | 2403.18302 | null |
2024-03-26 | Climate Downscaling: A Deep-Learning Based Super-resolution Model of Precipitation Data with Attention Block and Skip Connections | Chia-Hao Chiang et.al. | 2403.17847 | null |
2024-03-26 | Algorithmic unfolding for image reconstruction and localization problems in fluorescence microscopy | Silvia Bonettini et.al. | 2403.17506 | link |
2024-03-26 | SeNM-VAE: Semi-Supervised Noise Modeling with Hierarchical Variational Autoencoder | Dihan Zheng et.al. | 2403.17502 | link |
2024-03-26 | Building Bridges across Spatial and Temporal Resolutions: Reference-Based Super-Resolution via Change Priors and Conditional Diffusion Model | Runmin Dong et.al. | 2403.17460 | link |
2024-03-25 | A Study in Dataset Pruning for Image Super-Resolution | Brian B. Moser et.al. | 2403.17083 | null |
2024-03-25 | Learning Spatial Adaptation and Temporal Coherence in Diffusion Models for Video Super-Resolution | Zhikai Chen et.al. | 2403.17000 | null |
2024-03-25 | Self-STORM: Deep Unrolled Self-Supervised Learning for Super-Resolution Microscopy | Yair Ben Sahel et.al. | 2403.16974 | link |
2024-03-25 | Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution | Qingping Zheng et.al. | 2403.16643 | null |
2024-03-25 | Residual Dense Swin Transformer for Continuous Depth-Independent Ultrasound Imaging | Jintong Hu et.al. | 2403.16384 | link |
2024-03-24 | CFAT: Unleashing TriangularWindows for Image Super-resolution | Abhisek Ray et.al. | 2403.16143 | link |
2024-03-23 | Adaptive Super Resolution For One-Shot Talking-Head Generation | Luchuan Song et.al. | 2403.15944 | link |
2024-03-23 | Time-series Initialization and Conditioning for Video-agnostic Stabilization of Video Super-Resolution using Recurrent Networks | Hiroshi Mori et.al. | 2403.15832 | null |
2024-03-20 | Using Super-Resolution Imaging for Recognition of Low-Resolution Blurred License Plates: A Comparative Study of Real-ESRGAN, A-ESRGAN, and StarSRGAN | Ching-Hsiang Wang et.al. | 2403.15466 | null |
2024-03-22 | Deep Generative Model based Rate-Distortion for Image Downscaling Assessment | Yuanbang Liang et.al. | 2403.15139 | link |
2024-03-21 | Hyperspectral Neural Radiance Fields | Gerry Chen et.al. | 2403.14839 | null |
2024-03-21 | QSMDiff: Unsupervised 3D Diffusion Models for Quantitative Susceptibility Mapping | Zhuang Xiong et.al. | 2403.14070 | null |
2024-03-20 | Multi-photon super-linear image scanning microscopy using upconversion nanoparticles | Yao Wang et.al. | 2403.13436 | null |
2024-03-20 | Efficient scene text image super-resolution with semantic guidance | LeoWu TomyEnrique et.al. | 2403.13330 | link |
2024-03-18 | Super-resolution of ultrafast pulses via spectral inversion | Michał Lipka et.al. | 2403.12746 | null |
2024-03-18 | A Wideband Distributed Massive MIMO Channel Sounder for Communication and Sensing | Michiel Sandra et.al. | 2403.11856 | null |
2024-03-18 | PAON: A New Neuron Model using Padé Approximants | Onur Keleş et.al. | 2403.11791 | null |
2024-03-18 | CasSR: Activating Image Power for Real-World Image Super-Resolution | Haolan Chen et.al. | 2403.11451 | null |
2024-03-18 | VmambaIR: Visual State Space Model for Image Restoration | Yuan Shi et.al. | 2403.11423 | link |
2024-03-17 | Adaptive Semantic-Enhanced Denoising Diffusion Probabilistic Model for Remote Sensing Image Super-Resolution | Jialu Sui et.al. | 2403.11078 | link |
2024-03-16 | Boosting Flow-based Generative Super-Resolution Models via Learned Prior | Li-Yuan Tsao et.al. | 2403.10988 | link |
2024-03-16 | Learning Dual-Level Deformable Implicit Representation for Real-World Scale Arbitrary Super-Resolution | Zhiheng Li et.al. | 2403.10925 | null |
2024-03-15 | A General Method to Incorporate Spatial Information into Loss Functions for GAN-based Super-resolution Models | Xijun Wang et.al. | 2403.10589 | null |
2024-03-15 | Solving General Noisy Inverse Problem via Posterior Sampling: A Policy Gradient Viewpoint | Haoyue Tang et.al. | 2403.10585 | null |
2024-03-15 | FeatUp: A Model-Agnostic Framework for Features at Any Resolution | Stephanie Fu et.al. | 2403.10516 | link |
2024-03-21 | Deep Bi-directional Attention Network for Image Super-Resolution Quality Assessment | Yixiao Li et.al. | 2403.10406 | null |
2024-03-15 | Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder | Jinseok Kim et.al. | 2403.10255 | null |
2024-03-15 | BlindDiff: Empowering Degradation Modelling in Diffusion Models for Blind Image Super-Resolution | Feng Li et.al. | 2403.10211 | link |
2024-03-15 | SemanticHuman-HD: High-Resolution Semantic Disentangled 3D Human Generation | Peng Zheng et.al. | 2403.10166 | null |
2024-03-14 | Deep unfolding Network for Hyperspectral Image Super-Resolution with Automatic Exposure Correction | Yuan Fang et.al. | 2403.09096 | null |
2024-03-13 | PFStorer: Personalized Face Restoration and Super-Resolution | Tuomas Varanka et.al. | 2403.08436 | null |
2024-03-13 | Activating Wider Areas in Image Super-Resolution | Cheng Cheng et.al. | 2403.08330 | null |
2024-03-07 | Accelerating multigrid solver with generative super-resolution | Francisco Holguin et.al. | 2403.07936 | null |
2024-03-19 | Towards Model Extraction Attacks in GAN-Based Image Translation via Domain Shift Mitigation | Di Mi et.al. | 2403.07673 | null |
2024-03-12 | Learning Correction Errors via Frequency-Self Attention for Blind Image Super-Resolution | Haochen Sun et.al. | 2403.07390 | null |
2024-03-12 | Efficient Diffusion Model for Image Restoration by Residual Shifting | Zongsheng Yue et.al. | 2403.07319 | link |
2024-03-12 | Learning Hierarchical Color Guidance for Depth Map Super-Resolution | Runmin Cong et.al. | 2403.07290 | null |
2024-03-11 | Galaxy Morphologies Revealed with Subaru HSC and Super-Resolution Techniques II: Environmental Dependence of Galaxy Mergers at z~2-5 | Takatoshi Shibuya et.al. | 2403.06729 | null |
2024-03-11 | Breaking Abbe's diffraction limit with harmonic deactivation microscopy | Kevin Murzyn et.al. | 2403.06617 | null |
2024-03-11 | Multi-Scale Implicit Transformer with Re-parameterize for Arbitrary-Scale Super-Resolution | Jinchen Zhu et.al. | 2403.06536 | null |
2024-03-10 | Implicit Image-to-Image Schrodinger Bridge for CT Super-Resolution and Denoising | Yuang Wang et.al. | 2403.06069 | null |
2024-03-12 | Decoupled Data Consistency with Diffusion Purification for Image Restoration | Xiang Li et.al. | 2403.06054 | link |
2024-03-15 | CoNFiLD: Conditional Neural Field Latent Diffusion Model Generating Spatiotemporal Turbulence | Pan Du et.al. | 2403.05940 | null |
2024-03-09 | Adaptive Multi-modal Fusion of Spatially Variant Kernel Refinement with Diffusion Model for Blind Image Super-Resolution | Junxiong Lin et.al. | 2403.05808 | null |
2024-03-08 | An End-to-End Pipeline Perspective on Video Streaming in Best-Effort Networks: A Survey and Tutorial | Leonardo Peroni et.al. | 2403.05192 | null |
2024-03-08 | CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion | Wendi Zheng et.al. | 2403.05121 | null |
2024-03-08 | XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution | Yunpeng Qu et.al. | 2403.05049 | link |
2024-03-07 | Super-resolution on network telemetry time series | Fengchen Gong et.al. | 2403.04165 | null |
2024-03-11 | Identifying Black Holes Through Space Telescopes and Deep Learning | Yeqi Fang et.al. | 2403.03821 | null |
2024-03-05 | Low-Res Leads the Way: Improving Generalization for Super-Resolution by Self-Supervised Learning | Haoyu Chen et.al. | 2403.02601 | null |
2024-03-04 | UB-FineNet: Urban Building Fine-grained Classification Network for Open-access Satellite Images | Zhiyi He et.al. | 2403.02132 | null |
2024-03-03 | APISR: Anime Production Inspired Real-World Anime Super-Resolution | Boyang Wang et.al. | 2403.01598 | link |
2024-03-02 | Extrapolated Plug-and-Play Three-Operator Splitting Methods for Nonconvex Optimization with Applications to Image Restoration | Zhongming Wu et.al. | 2403.01144 | link |
2024-03-02 | Text-guided Explorable Image Super-resolution | Kanchana Vaishnavi Gandikota et.al. | 2403.01124 | null |
2024-03-07 | ODVista: An Omnidirectional Video Dataset for super-resolution and Quality Enhancement Tasks | Ahmed Telili et.al. | 2403.00604 | link |
2024-02-29 | SeD: Semantic-Aware Discriminator for Image Super-Resolution | Bingchen Li et.al. | 2402.19387 | link |
2024-02-29 | 3D Super-resolution Optical Fluctuation Imaging with Temporal Focusing two-photon excitation | Pawel Szczypkowski et.al. | 2402.19338 | null |
2024-03-15 | CAMixerSR: Only Details Need More "Attention" | Yan Wang et.al. | 2402.19289 | link |
2024-02-29 | Training Generative Image Super-Resolution Models by Wavelet-Domain Losses Enables Better Control of Artifacts | Cansu Korkmaz et.al. | 2402.19215 | link |
2024-02-29 | Unsupervised Learning of High-resolution Light Field Imaging via Beam Splitter-based Hybrid Lenses | Jianxin Lei et.al. | 2402.19020 | null |
2024-03-01 | Navigating Beyond Dropout: An Intriguing Solution Towards Generalizable Image Super Resolution | Hongjun Wang et.al. | 2402.18929 | link |
2024-02-29 | LoLiSRFlow: Joint Single Image Low-light Enhancement and Super-resolution via Cross-scale Transformer-based Conditional Flow | Ziyu Yue et.al. | 2402.18871 | null |
2024-02-28 | Self-Supervised Learning in Electron Microscopy: Towards a Foundation Model for Advanced Image Analysis | Bashir Kazimi et.al. | 2402.18286 | null |
2024-02-28 | Misalignment-Robust Frequency Distribution Loss for Image Transformation | Zhangkai Ni et.al. | 2402.18192 | link |
2024-03-01 | Data-driven nonlinear turbulent flow scaling with Buckingham Pi variables | Kai Fukami et.al. | 2402.17990 | null |
2024-02-27 | Thermodynamics-informed super-resolution of scarce temporal dynamics data | Carlos Bermejo-Barbanoj et.al. | 2402.17506 | null |
2024-02-27 | Spatial super-resolution in nanosensing with blinking emitters | Alexander Mikhalychev et.al. | 2402.17391 | null |
2024-02-27 | Enhancing Hyperspectral Images via Diffusion Model and Group-Autoencoder Super-resolution Network | Zhaoyang Wang et.al. | 2402.17285 | link |
2024-02-27 | SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution | Chengcheng Wang et.al. | 2402.17133 | link |
2024-02-26 | Generative AI in Vision: A Survey on Models, Metrics and Applications | Gaurav Raut et.al. | 2402.16369 | null |
2024-02-25 | Photon-counting CT using a Conditional Diffusion Model for Super-resolution and Texture-preservation | Christopher Wiedeman et.al. | 2402.16212 | null |
2024-02-25 | ARIN: Adaptive Resampling and Instance Normalization for Robust Blind Inpainting of Dunhuang Cave Paintings | Alexander Schmidt et.al. | 2402.16188 | null |
2024-02-25 | XAI-based gait analysis of patients walking with Knee-Ankle-Foot orthosis using video cameras | Arnav Mishra et.al. | 2402.16175 | null |
2024-02-24 | HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models | Li Pang et.al. | 2402.15865 | link |
2024-02-24 | A Heterogeneous Dynamic Convolutional Neural Network for Image Super-resolution | Chunwei Tian et.al. | 2402.15704 | link |
2024-02-24 | DeepLight: Reconstructing High-Resolution Observations of Nighttime Light With Multi-Modal Remote Sensing Data | Lixian Zhang et.al. | 2402.15659 | link |
2024-02-23 | Towards complete all-optical emission control of high-harmonic generation from solids | Pieter J. van Essen et.al. | 2402.15375 | null |
2024-02-21 | Generative Adversarial Models for Extreme Downscaling of Climate Datasets | Guiye Li et.al. | 2402.14049 | null |
2024-02-23 | Scene Prior Filtering for Depth Map Super-Resolution | Zhengxue Wang et.al. | 2402.13876 | null |
2024-02-21 | Cas-DiffCom: Cascaded diffusion model for infant longitudinal super-resolution 3D medical image completion | Lianghu Guo et.al. | 2402.13776 | null |
2024-02-20 | Diffusion Posterior Sampling is Computationally Intractable | Shivam Gupta et.al. | 2402.12727 | null |
2024-02-19 | Image Super-resolution Inspired Electron Density Prediction | Chenghan Li et.al. | 2402.12335 | link |
2024-02-19 | Regularization by denoising: Bayesian model and Langevin-within-split Gibbs sampling | Elhadji C. Faye et.al. | 2402.12292 | null |
2024-02-19 | FOD-Swin-Net: angular super resolution of fiber orientation distribution using a transformer-based deep model | Mateus Oliveira da Silva et.al. | 2402.11775 | link |
2024-02-25 | Low-power SNN-based audio source localisation using a Hilbert Transform spike encoding scheme | Saeid Haghighatshoar et.al. | 2402.11748 | link |
2024-02-17 | Hierarchical Prior-based Super Resolution for Point Cloud Geometry Compression | Dingquan Li et.al. | 2402.11250 | link |
2024-02-16 | Optimizing Skin Lesion Classification via Multimodal Data and Auxiliary Task Integration | Mahapara Khurshid et.al. | 2402.10454 | null |
2024-02-08 | Multi-Center Fetal Brain Tissue Annotation (FeTA) Challenge 2022 Results | Kelly Payette et.al. | 2402.09463 | null |
2024-02-14 | Neural Operators Meet Energy-based Theory: Operator Learning for Hamiltonian and Dissipative PDEs | Yusuke Tanaka et.al. | 2402.09018 | null |
2024-02-12 | Cosmology at the Field Level with Probabilistic Machine Learning | Adam Rouhiainen et.al. | 2402.07694 | null |
2024-02-12 | Trustworthy SR: Resolving Ambiguity in Image Super-resolution via Diffusion Models and Human Feedback | Cansu Korkmaz et.al. | 2402.07597 | null |
2024-02-12 | High-resolution Cryogenic Spectroscopy of Single Molecules in Nanoprinted Crystals | Mohammad Musavinezhad et.al. | 2402.07474 | null |
2024-02-09 | Copper phosphate micro-flowers coated with indocyanine green and iron oxide nanoparticles for in vivo localization optoacoustic tomography and magnetic actuation | Daniil Nozdriukhin et.al. | 2402.06749 | null |
2024-02-05 | Hybrid Neural Representations for Spherical Data | Hyomin Kim et.al. | 2402.05965 | null |
2024-02-07 | Arbitrary Scale Super-Resolution Assisted Lunar Crater Detection in Satellite Images | Atal Tewari et.al. | 2402.05068 | null |
2024-02-07 | Device Activity Detection and Channel Estimation for Millimeter-Wave Massive MIMO | Yinchuan Li et.al. | 2402.04704 | null |
2024-02-06 | Elastic wave imaging with Maxwell's fish-eye lens | Liuxian Zhao et.al. | 2402.04285 | null |
2024-02-06 | 3D Volumetric Super-Resolution in Radiology Using 3D RRDB-GAN | Juhyung Ha et.al. | 2402.04171 | null |
2024-02-05 | Video Super-Resolution for Optimized Bitrate and Green Online Streaming | Vignesh V Menon et.al. | 2402.03513 | null |
2024-02-05 | See More Details: Efficient Image Super-Resolution by Experts Mining | Eduard Zamfir et.al. | 2402.03412 | link |
2024-01-25 | When Geoscience Meets Generative AI and Large Language Models: Foundations, Trends, and Future Challenges | Abdenour Hadid et.al. | 2402.03349 | null |
2024-02-05 | Instant square lattice structured illumination microscopy: an optimal strategy towards photon-saving and real-time super-resolution observation | Tianyu Zhao et.al. | 2402.02775 | null |
2024-02-02 | A Robust Super-resolution Gridless Imaging Framework for UAV-borne SAR Tomography | Silin Gao et.al. | 2402.01194 | null |
2024-02-01 | Diffusion-based Light Field Synthesis | Ruisheng Gao et.al. | 2402.00575 | null |
2024-01-31 | Improving Object Detection Quality in Football Through Super-Resolution Techniques | Karolina Seweryn et.al. | 2402.00163 | null |
2024-01-31 | Fully Data-Driven Model for Increasing Sampling Rate Frequency of Seismic Data using Super-Resolution Generative Adversarial Networks | Navid Gholizadeh et.al. | 2402.00153 | null |
2024-01-31 | Spatial-and-Frequency-aware Restoration method for Images based on Diffusion Models | Kyungsung Lee et.al. | 2401.17629 | null |
2024-01-30 | You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation | Mehdi Noroozi et.al. | 2401.17258 | null |
2024-01-30 | Ptycho-endoscopy on a lensless ultrathin fiber bundle tip | Pengming Song et.al. | 2401.17213 | null |
2024-01-30 | Deep 3D World Models for Multi-Image Super-Resolution Beyond Optical Flow | Luca Savant Aira et.al. | 2401.16972 | null |
2024-01-29 | Reconfigurable AI Modules Aided Channel Estimation and MIMO Detection | Xiangzhao Qin et.al. | 2401.16141 | null |
2024-01-29 | Bridging the Domain Gap: A Simple Domain Matching Method for Reference-based Image Super-Resolution in Remote Sensing | Jeongho Min et.al. | 2401.15944 | null |
2024-01-29 | Vision-Informed Flow Image Super-Resolution with Quaternion Spatial Modeling and Dynamic Flow Convolution | Qinglong Cao et.al. | 2401.15913 | null |
2024-01-28 | Towards Arbitrary-Scale Histopathology Image Super-resolution: An Efficient Dual-branch Framework via Implicit Self-texture Enhancement | Minghong Duan et.al. | 2401.15613 | null |
2024-01-31 | Wind speed super-resolution and validation: from ERA5 to CERRA via diffusion models | Fabio Merizzi et.al. | 2401.15469 | link |
2024-01-27 | Face to Cartoon Incremental Super-Resolution using Knowledge Distillation | Trinetra Devkatte et.al. | 2401.15366 | null |
2024-01-26 | From Blurry to Brilliant Detection: YOLOv5-Based Aerial Object Detection with Super Resolution | Ragib Amin Nihal et.al. | 2401.14661 | null |
2024-01-26 | Super Efficient Neural Network for Compression Artifacts Reduction and Super Resolution | Wen Ma et.al. | 2401.14641 | null |
2024-01-25 | Combined Generative and Predictive Modeling for Speech Super-resolution | Heming Wang et.al. | 2401.14269 | null |
2024-01-25 | Conditional Neural Video Coding with Spatial-Temporal Super-Resolution | Henan Wang et.al. | 2401.13959 | null |
2024-02-05 | Lumiere: A Space-Time Diffusion Model for Video Generation | Omer Bar-Tal et.al. | 2401.12945 | null |
2024-01-22 | Observation-Guided Meteorological Field Downscaling at Station Scale: A Benchmark and a New Method | Zili Liu et.al. | 2401.11960 | link |
2024-01-24 | LKFormer: Large Kernel Transformer for Infrared Image Super-Resolution | Feiwei Qin et.al. | 2401.11859 | link |
2024-01-22 | Simultaneous Blind Demixing and Super-resolution via Vectorized Hankel Lift | Haifeng Wang et.al. | 2401.11805 | null |
2024-01-18 | Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution | Xin Yuan et.al. | 2401.10404 | null |
2024-01-22 | 3D orientation super-resolution spatial-frequency-shift microscopy | Xiaowei Liu et.al. | 2401.09085 | null |
2024-01-17 | Efficient Image Super-Resolution via Symmetric Visual Attention Network | Chengxu Wu et.al. | 2401.08913 | null |
2024-01-16 | Robust DOA estimation using deep acoustic imaging | Adrian S. Roman et.al. | 2401.08717 | link |
2024-01-20 | Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis | Zhenhui Ye et.al. | 2401.08503 | link |
2024-01-16 | Physics-informed Meta-instrument for eXperiments (PiMiX) with applications to fusion energy | Zhehui Wang et.al. | 2401.08390 | null |
2024-01-18 | Transcending the Limit of Local Window: Advanced Super-Resolution Transformer with Adaptive Token Dictionary | Leheng Zhang et.al. | 2401.08209 | link |
2024-01-16 | The Devil is in the Details: Boosting Guided Depth Super-Resolution via Rethinking Cross-Modal Alignment and Aggregation | Xinni Jiang et.al. | 2401.08123 | link |
2024-01-26 | No-Clean-Reference Image Super-Resolution: Application to Electron Microscopy | Mohammad Khateri et.al. | 2401.08115 | null |
2024-01-15 | Sparsity-based background removal for STORM super-resolution images | Patris Valera et.al. | 2401.07746 | link |
2024-01-15 | Time-varying k-domain modulation around a point sink in time reversal cavity | Xin Liu et.al. | 2401.07535 | null |
2024-01-14 | City Scene Super-Resolution via Geometric Error Minimization | Zhengyang Lu et.al. | 2401.07272 | link |
2024-01-13 | Deep Blind Super-Resolution for Satellite Video | Yi Xiao et.al. | 2401.07139 | link |
2024-01-12 | Broad Yet Narrow: Super-resolution techniques to simulate electronic spectra of large molecular systems | Matthias Kick et.al. | 2401.06929 | null |
2024-01-15 | Video Super-Resolution Transformer with Masked Inter&Intra-Frame Attention | Xingyu Zhou et.al. | 2401.06312 | link |
2024-01-11 | Frequency-Time Diffusion with Neural Cellular Automata | John Kalkhof et.al. | 2401.06291 | null |
2024-01-11 | TriNeRFLet: A Wavelet Based Multiscale Triplane NeRF Representation | Rajaei Khatib et.al. | 2401.06191 | null |
2024-01-11 | Transforming Image Super-Resolution: A ConvFormer-based Efficient Approach | Gang Wu et.al. | 2401.05633 | link |
2024-01-10 | Quantum Inspired Microwave Phase Super-Resolution at Room Temperature | Leonid Vidro et.al. | 2401.05026 | null |
2024-01-08 | AGG: Amortized Generative 3D Gaussians for Single Image to 3D | Dejia Xu et.al. | 2401.04099 | null |
2024-01-08 | Sub-Rayleigh ghost imaging via structured illumination | Liming Li et.al. | 2401.03829 | null |
2024-01-08 | FMA-Net: Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring | Geunhyuk Youk et.al. | 2401.03707 | null |
2024-01-07 | Nanofabrication beyond optical diffraction limit: Optical driven assembly enabled by superlubricity | Liu Jiang-tao et.al. | 2401.03486 | null |
2024-01-05 | Super-Resolution Multi-Contrast Unbiased Eye Atlases With Deep Probabilistic Refinement | Ho Hin Lee et.al. | 2401.03060 | null |
2024-01-04 | Predicting Future States with Spatial Point Processes in Single Molecule Resolution Spatial Transcriptomics | Parisa Boodaghi Malidarreh et.al. | 2401.02564 | null |
2024-01-04 | What You See is What You GAN: Rendering Every Pixel for High-Fidelity Geometry in 3D GANs | Alex Trevithick et.al. | 2401.02411 | null |
2024-01-02 | Efficient Hybrid Zoom using Camera Fusion on Mobile Phones | Xiaotong Wu et.al. | 2401.01461 | null |
2024-01-02 | Noise-NeRF: Hide Information in Neural Radiance Fields using Trainable Noise | Qinglong Huang et.al. | 2401.01216 | null |
2023-12-30 | Improving the Stability of Diffusion Models for Content Consistent Super-Resolution | Lingchen Sun et.al. | 2401.00877 | link |
2024-03-18 | Exposure Bracketing is All You Need for Unifying Image Restoration and Enhancement Tasks | Zhilu Zhang et.al. | 2401.00766 | link |
2024-01-01 | Beyond Subspace Isolation: Many-to-Many Transformer for Light Field Image Super-resolution | Zeke Zexi Hu et.al. | 2401.00740 | null |
2024-02-06 | Diffusion Models, Image Super-Resolution And Everything: A Survey | Brian B. Moser et.al. | 2401.00736 | null |
2024-02-21 | Compressing Deep Image Super-resolution Models | Yuxuan Jiang et.al. | 2401.00523 | null |
2023-12-31 | UGPNet: Universal Generative Prior for Image Restoration | Hwayoon Lee et.al. | 2401.00370 | null |
2023-12-30 | Robust fluctuation-based super-resolution microscopy in a confocal architecture | Alexander Krupinski-Ptaszek et.al. | 2401.00261 | null |
2024-03-13 | Image Super-resolution Reconstruction Network based on Enhanced Swin Transformer via Alternating Aggregation of Local-Global Features | Yuming Huang et.al. | 2401.00241 | null |
2023-12-29 | Noise-free Optimization in Early Training Steps for Image Super-Resolution | MinKyu Lee et.al. | 2312.17526 | link |
2023-12-28 | Single particle algorithms to reveal cellular nanodomain organization | Pierre Parutto et.al. | 2312.17191 | null |
2024-01-02 | KeDuSR: Real-World Dual-Lens Super-Resolution via Kernel-Free Matching | Huanjing Yue et.al. | 2312.17050 | link |
2023-12-27 | Learning from small data sets: Patch-based regularizers in inverse problems for image reconstruction | Moritz Piening et.al. | 2312.16611 | null |
2023-12-27 | Image Restoration by Denoising Diffusion Models with Iteratively Preconditioned Guidance | Tomer Garber et.al. | 2312.16519 | link |
2023-12-30 | A Survey on Super Resolution for video Enhancement Using GAN | Ankush Maity et.al. | 2312.16471 | null |
2023-12-27 | Learn From Orientation Prior for Radiograph Super-Resolution: Orientation Operator Transformer | Yongsong Huang et.al. | 2312.16455 | null |
2023-12-24 | BSRAW: Improving Blind RAW Image Super-Resolution | Marcos V. Conde et.al. | 2312.15487 | link |
2023-12-24 | Perception-Distortion Balanced Super-Resolution: A Multi-Objective Optimization Perspective | Lingchen Sun et.al. | 2312.15408 | link |
2023-12-22 | Spectrally Decomposed Diffusion Models for Generative Turbulence Recovery | Mohammed Sardar et.al. | 2312.15029 | null |
2023-12-22 | DDistill-SR: Reparameterized Dynamic Distillation Network for Lightweight Image Super-Resolution | Yan Wang et.al. | 2312.14551 | link |
2024-03-18 | HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models | Hayk Manukyan et.al. | 2312.14091 | link |
2023-12-21 | Super-resolution of THz time-domain images based on low-rank representation | Marina Ljubenovic et.al. | 2312.13820 | null |
2023-12-21 | BAE-Net: A Low complexity and high fidelity Bandwidth-Adaptive neural network for speech super-resolution | Guochen Yu et.al. | 2312.13722 | link |
2023-12-21 | A Comprehensive End-to-End Computer Vision Framework for Restoration and Recognition of Low-Quality Engineering Drawings | Lvyang Yang et.al. | 2312.13620 | link |
2023-12-20 | EPNet: An Efficient Pyramid Network for Enhanced Single-Image Super-Resolution with Reduced Computational Requirements | Xin Xu et.al. | 2312.13396 | null |
2024-03-19 | ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training | Rongsheng Wang et.al. | 2312.13316 | link |
2023-12-20 | A 3D super-resolution of wind fields via physics-informed pixel-wise self-attention generative adversarial network | Takuya Kurihana et.al. | 2312.13212 | null |
2023-12-20 | Joint Range-Velocity-Azimuth Estimation for OFDM-Based Integrated Sensing and Communication | Zelin Hu et.al. | 2312.13154 | null |
2024-03-18 | Learning Exhaustive Correlation for Spectral Super-Resolution: Where Spatial-Spectral Attention Meets Linear Dependence | Hongyuan Wang et.al. | 2312.12833 | null |
2023-12-20 | How Good Are Deep Generative Models for Solving Inverse Problems? | Shichong Peng et.al. | 2312.12691 | null |
2023-12-19 | ZS-SRT: An Efficient Zero-Shot Super-Resolution Training Method for Neural Radiance Fields | Xiang Feng et.al. | 2312.12122 | null |
2023-12-19 | Neural operator-based super-fidelity: A warm-start approach for accelerating steady-state simulations | Xu-Hui Zhou et.al. | 2312.11842 | null |
2023-12-18 | TIP: Text-Driven Image Processing with Semantic and Restoration Instructions | Chenyang Qi et.al. | 2312.11595 | null |
2023-12-20 | FastSR-NeRF: Improving NeRF Efficiency on Consumer Devices with A Simple Super-Resolution Pipeline | Chien-Yu Lin et.al. | 2312.11537 | null |
2023-12-18 | Disentangling photon rings beyond General Relativity with future radio-telescope arrays | Raúl Carballo-Rubio et.al. | 2312.11351 | null |
2024-03-19 | Self-Supervised Learning for Image Super-Resolution and Deblurring | Jérémy Scanvic et.al. | 2312.11232 | link |
2023-12-18 | Experimental 3D super-localization with Laguerre-Gaussian modes | Chenyu Hu et.al. | 2312.11044 | null |
2023-12-16 | Image Restoration Through Generalized Ornstein-Uhlenbeck Bridge | Conghan Yue et.al. | 2312.10299 | link |
2023-12-18 | TMP: Temporal Motion Propagation for Online Video Super-Resolution | Zhengqiang Zhang et.al. | 2312.09909 | link |
2024-03-03 | Diffusion-based Blind Text Image Super-Resolution | Yuzhe Zhang et.al. | 2312.08886 | link |
2023-12-14 | Guided Image Restoration via Simultaneous Feature and Image Guided Fusion | Xinyi Liu et.al. | 2312.08853 | null |
2023-12-14 | CartoMark: a benchmark dataset for map pattern recognition and 1 map content retrieval with machine intelligence | Xiran Zhou et.al. | 2312.08600 | null |
2023-12-13 | EventAid: Benchmarking Event-aided Image/Video Enhancement Algorithms with Real-captured Hybrid Dataset | Peiqi Duan et.al. | 2312.08220 | null |
2023-12-13 | Toward Real World Stereo Image Super-Resolution via Hybrid Degradation Model and Discriminator for Implied Stereo Image Information | Yuanbo Zhou et.al. | 2312.07934 | link |
2023-12-20 | CoIE: Chain-of-Instruct Editing for Multi-Attribute Face Manipulation | Zhenduo Zhang et.al. | 2312.07879 | null |
2023-12-13 | Video Dynamics Prior: An Internal Learning Approach for Robust Video Enhancements | Gaurav Shrivastava et.al. | 2312.07835 | null |
2024-01-19 | Semantic Lens: Instance-Centric Semantic Alignment for Video Super-Resolution | Qi Tang et.al. | 2312.07823 | link |
2023-12-12 | Super-Resolution on Rotationally Scanned Photoacoustic Microscopy Images Incorporating Scanning Prior | Kai Pan et.al. | 2312.07226 | link |
2023-12-12 | Hyper-Restormer: A General Hyperspectral Image Restoration Transformer for Remote Sensing Imaging | Yo-Yu Lai et.al. | 2312.07016 | null |
2023-12-14 | TULIP: Transformer for Upsampling of LiDAR Point Cloud | Bin Yang et.al. | 2312.06733 | link |
2023-12-11 | Photorealistic Video Generation with Diffusion Models | Agrim Gupta et.al. | 2312.06662 | null |
2023-12-11 | Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution | Shangchen Zhou et.al. | 2312.06640 | null |
2023-12-11 | Non-iterative Methods in Inhomogeneous Background Inverse Scattering Imaging Problem Assisted by Swin Transformer Network | Naike Du et.al. | 2312.06302 | null |
2023-12-11 | Hundred-Kilobyte Lookup Tables for Efficient Single-Image Super-Resolution | Binxiao Huang et.al. | 2312.06101 | link |
2024-03-20 | Precipitation Downscaling with Spatiotemporal Video Diffusion | Prakhar Srivastava et.al. | 2312.06071 | null |
2023-12-10 | Study of Multiuser Multiple-Antenna Wireless Communications Systems Based on Super-Resolution Arrays | S. Pinto et.al. | 2312.06033 | null |
2023-12-10 | Transformer-based Selective Super-Resolution for Efficient Image Refinement | Tianyi Zhang et.al. | 2312.05803 | link |
2023-12-13 | SGNet: Structure Guided Network via Gradient-Frequency Awareness for Depth Map Super-Resolution | Zhengxue Wang et.al. | 2312.05799 | link |
2023-12-09 | Iterative Token Evaluation and Refinement for Real-World Super-Resolution | Chaofeng Chen et.al. | 2312.05616 | link |
2023-12-07 | AniRes2D: Anisotropic Residual-enhanced Diffusion for 2D MR Super-Resolution | Zejun Wu et.al. | 2312.04385 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-09-18 | Applications of Knowledge Distillation in Remote Sensing: A Survey | Yassine Himeur et.al. | 2409.12111 | null |
2024-09-18 | Multi-Sensor Deep Learning for Glacier Mapping | Codruţ-Andrei Diaconu et.al. | 2409.12034 | null |
2024-09-18 | BRDF-NeRF: Neural Radiance Fields with Optical Satellite Images and BRDF Modelling | Lulin Zhang et.al. | 2409.12014 | link |
2024-09-18 | Photothermal Spectroscopy for Planetary Sciences: Mid-IR Absorption Made Easy | Christopher Cox et.al. | 2409.11626 | null |
2024-09-17 | Generalized Few-Shot Semantic Segmentation in Remote Sensing: Challenge and Benchmark | Clifford Broni-Bediako et.al. | 2409.11227 | null |
2024-09-17 | On-policy Actor-Critic Reinforcement Learning for Multi-UAV Exploration | Ali Moltajaei Farid et.al. | 2409.11058 | null |
2024-09-16 | Prompt-and-Transfer: Dynamic Class-aware Enhancement for Few-shot Segmentation | Hanbo Bi et.al. | 2409.10389 | null |
2024-09-16 | Performance of Human Annotators in Object Detection and Segmentation of Remotely Sensed Data | Roni Blushtein-Livnon et.al. | 2409.10272 | null |
2024-09-16 | BAFNet: Bilateral Attention Fusion Network for Lightweight Semantic Segmentation of Urban Remote Sensing Images | Wentao Wang et.al. | 2409.10269 | null |
2024-09-15 | Fuzzy logic for reconstructing arbitrary moments of multiplicity distributions | Anar Rustamov et.al. | 2409.09814 | null |
2024-09-15 | SITSMamba for Crop Classification based on Satellite Image Time Series | Xiaolei Qin et.al. | 2409.09673 | link |
2024-09-19 | Unsupervised Hyperspectral and Multispectral Image Blind Fusion Based on Deep Tucker Decomposition Network with Spatial-Spectral Manifold Learning | He Wang et.al. | 2409.09670 | link |
2024-09-14 | Detecting Looted Archaeological Sites from Satellite Image Time Series | Elliot Vincent et.al. | 2409.09432 | link |
2024-09-14 | NBBOX: Noisy Bounding Box Improves Remote Sensing Object Detection | Yechan Kim et.al. | 2409.09424 | null |
2024-09-14 | Investigation of Hierarchical Spectral Vision Transformer Architecture for Classification of Hyperspectral Imagery | Wei Liu et.al. | 2409.09244 | null |
2024-09-13 | Interactive Masked Image Modeling for Multimodal Object Detection in Remote Sensing | Minh-Duc Vu et.al. | 2409.08885 | null |
2024-09-13 | ChangeChat: An Interactive Model for Remote Sensing Change Analysis via Multimodal Instruction Tuning | Pei Deng et.al. | 2409.08582 | null |
2024-09-13 | VistaFormer: Scalable Vision Transformers for Satellite Image Time Series Segmentation | Ezra MacDonald et.al. | 2409.08461 | link |
2024-09-12 | Ultra-wideband integrated microwave photonic multi-parameter measurement system on thin-film lithium niobate | Yong Zheng et.al. | 2409.07817 | null |
2024-09-12 | Open-Vocabulary Remote Sensing Image Semantic Segmentation | Qinglong Cao et.al. | 2409.07683 | null |
2024-09-11 | The Mismeasure of Weather: Using Remotely Sensed Earth Observation Data in Economic Context | Anna Josephson et.al. | 2409.07506 | null |
2024-09-11 | Pushing the Limits of Vision-Language Models in Remote Sensing without Human Annotations | Keumgang Cha et.al. | 2409.07048 | null |
2024-09-11 | Insight Any Instance: Promptable Instance Segmentation for Remote Sensing Images | Xuexue Li et.al. | 2409.07022 | null |
2024-09-10 | PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation | Yin Hu et.al. | 2409.06309 | null |
2024-09-09 | Real-time optical gas sensing with two-dimensional materials | Gia Quyet Ngo et.al. | 2409.05693 | null |
2024-09-09 | AnomalyCD: A benchmark for Earth anomaly change detection with high-resolution and time-series observations | Jingtao Li et.al. | 2409.05679 | null |
2024-09-09 | Renormalized Connection for Scale-preferred Object Detection in Satellite Imagery | Fan Zhang et.al. | 2409.05624 | null |
2024-09-09 | Localization of macroscopic sources of magnetic field using optical fibers doped with NV-rich sub-micron diamonds and zero-field resonance | Mariusz Mrózek et.al. | 2409.05452 | null |
2024-09-06 | Ab initio quantum dynamics as a scalable solution to the exoplanet opacity challenge: A case study of CO |
Laurent Wiesenfeld et.al. | 2409.04439 | null |
2024-09-06 | How to Identify Good Superpixels for Deforestation Detection on Tropical Rainforests | Isabela Borlido et.al. | 2409.04330 | null |
2024-09-06 | An OpenMetBuoy dataset of Marginal Ice Zone dynamics collected around Svalbard in 2022 and 2023 | Jean Rabault et.al. | 2409.04151 | null |
2024-09-05 | Deep Clustering of Remote Sensing Scenes through Heterogeneous Transfer Learning | Isaac Ray et.al. | 2409.03938 | null |
2024-09-05 | On-board Satellite Image Classification for Earth Observation: A Comparative Study of Pre-Trained Vision Transformer Models | Thanh-Dung Le et.al. | 2409.03901 | null |
2024-09-09 | UV-Mamba: A DCN-Enhanced State Space Model for Urban Village Boundary Identification in High-Resolution Remote Sensing Images | Lulin Li et.al. | 2409.03431 | link |
2024-09-04 | A General Albedo Recovery Approach for Aerial Photogrammetric Images through Inverse Rendering | Shuang Song et.al. | 2409.03032 | null |
2024-09-03 | Impact Evaluations in Data Poor Settings: The Case of Stress-Tolerant Rice Varieties in Bangladesh | Jeffrey D. Michler et.al. | 2409.02201 | null |
2024-09-03 | Brain-Inspired Online Adaptation for Remote Sensing with Spiking Neural Network | Dexin Duan et.al. | 2409.02146 | null |
2024-09-03 | Compressed learning based onboard semantic compression for remote sensing platforms | Protim Bhattacharjee et.al. | 2409.01988 | null |
2024-09-03 | Map-Assisted Remote-Sensing Image Compression at Extremely Low Bitrates | Yixuan Ye et.al. | 2409.01935 | link |
2024-09-01 | Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene Classification | Karim El Khoury et.al. | 2409.00698 | link |
2024-08-31 | Incremental Open-set Domain Adaptation | Sayan Rakshit et.al. | 2409.00530 | null |
2024-08-31 | Mapping earth mounds from space | Baki Uzun et.al. | 2409.00518 | null |
2024-08-31 | Plant detection from ultra high resolution remote sensing images: A Semantic Segmentation approach based on fuzzy loss | Shivam Pande et.al. | 2409.00513 | null |
2024-08-31 | Geospatial foundation models for image analysis: evaluating and enhancing NASA-IBM Prithvi's domain adaptability | Chia-Yu Hsu et.al. | 2409.00489 | null |
2024-08-31 | Self-supervised Fusarium Head Blight Detection with Hyperspectral Image and Feature Mining | Yu-Fan Lin et.al. | 2409.00395 | null |
2024-08-30 | FissionVAE: Federated Non-IID Image Generation with Latent Space and Decoder Decomposition | Chen Hu et.al. | 2408.17090 | link |
2024-08-29 | Low Saturation Confidence Distribution-based Test-Time Adaptation for Cross-Domain Remote Sensing Image Classification | Yu Liang et.al. | 2408.16265 | null |
2024-08-28 | A Survey on Evaluation of Multimodal Large Language Models | Jiaxing Huang et.al. | 2408.15769 | null |
2024-08-28 | Can SAR improve RSVQA performance? | Lucrezia Tosato et.al. | 2408.15642 | null |
2024-08-27 | RSTeller: Scaling Up Visual Language Modeling in Remote Sensing with Rich Linguistic Semantics from Openly Available Data and Large Language Models | Junyao Ge et.al. | 2408.14744 | link |
2024-08-26 | MSFMamba: Multi-Scale Feature Fusion State Space Model for Multi-Source Remote Sensing Image Classification | Feng Gao et.al. | 2408.14255 | link |
2024-08-27 | Improving Water Quality Time-Series Prediction in Hong Kong using Sentinel-2 MSI Data and Google Earth Engine Cloud Computing | Rohin Sood et.al. | 2408.14010 | null |
2024-08-25 | GeoPlant: Spatial Plant Species Prediction Dataset | Lukas Picek et.al. | 2408.13928 | null |
2024-08-23 | ResSR: A Residual Approach to Super-Resolving Multispectral Images | Haley Duba-Sullivan et.al. | 2408.13225 | null |
2024-08-23 | A plug-and-play framework for curvilinear structure segmentation based on a learned reconnecting regularization | Sophie Carneiro-Esteves et.al. | 2408.12943 | null |
2024-08-22 | Hierarchical Attention and Parallel Filter Fusion Network for Multi-Source Data Classification | Han Luo et.al. | 2408.12760 | null |
2024-08-22 | Research on Improved U-net Based Remote Sensing Image Segmentation Algorithm | Qiming Yang et.al. | 2408.12672 | null |
2024-08-26 | UNetMamba: An Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images | Enze Zhu et.al. | 2408.11545 | link |
2024-08-21 | High Performance Simulation of Spaceborne Radar for Remote-Sensing Oceanography: Application to an Altimetry Scenario | Goulven Monnier et.al. | 2408.11472 | null |
2024-08-21 | Near-Field Signal Processing: Unleashing the Power of Proximity | Ahmet M. Elbir et.al. | 2408.11434 | null |
2024-08-20 | Unified Deep Learning Model for Global Prediction of Aboveground Biomass, Canopy Height and Cover from High-Resolution, Multi-Sensor Satellite Imagery | Manuel Weber et.al. | 2408.11234 | null |
2024-08-20 | Reactive molecular dynamics simulations of micrometeoroid bombardment for space weathering of asteroid (162173) Ryugu | Daigo Shoji et.al. | 2408.10959 | null |
2024-08-20 | Novel Change Detection Framework in Remote Sensing Imagery Using Diffusion Models and Structural Similarity Index (SSIM) | Andrew Kiruluta et.al. | 2408.10619 | null |
2024-08-19 | Assessment of Spectral based Solutions for the Detection of Floating Marine Debris | Muhammad Alì et.al. | 2408.10187 | null |
2024-08-17 | Pursuing Truth: Improving Retrievals on Mid-Infrared Exo-Earth Spectra with Physically Motivated Water Abundance Profiles and Cloud Models | Björn S. Konrad et.al. | 2408.09129 | null |
2024-08-17 | Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community | Jiancheng Pan et.al. | 2408.09110 | link |
2024-08-16 | Speckle Noise Analysis for Synthetic Aperture Radar (SAR) Space Data | Sanjjushri Varshini R et.al. | 2408.08774 | null |
2024-08-16 | Tuning a SAM-Based Model with Multi-Cognitive Visual Adapter to Remote Sensing Instance Segmentation | Linghao Zheng et.al. | 2408.08576 | null |
2024-08-16 | Improving the measurement of air-water flow properties using remote distance sensing technology | Matthias Kramer et.al. | 2408.08466 | null |
2024-08-15 | SpectralEarth: Training Hyperspectral Foundation Models at Scale | Nassim Ait Ali Braham et.al. | 2408.08447 | null |
2024-08-15 | The Dawn of KAN in Image-to-Image (I2I) Translation: Integrating Kolmogorov-Arnold Networks with GANs for Unpaired I2I Translation | Arpan Mahara et.al. | 2408.08216 | link |
2024-08-15 | The Effect of Horizontal Shear on Extracting Water Currents From Surface Wave Data | Stefan Weichert et.al. | 2408.08197 | null |
2024-08-15 | Treat Stillness with Movement: Remote Sensing Change Detection via Coarse-grained Temporal Foregrounds Mining | Xixi Wang et.al. | 2408.08078 | link |
2024-08-14 | Rethinking the Key Factors for the Generalization of Remote Sensing Stereo Matching Networks | Liting Jiang et.al. | 2408.07613 | null |
2024-08-14 | Unsupervised Stereo Matching Network For VHR Remote Sensing Images Based On Error Prediction | Liting Jiang et.al. | 2408.07419 | link |
2024-08-15 | Prompt-Based Segmentation at Multiple Resolutions and Lighting Conditions using Segment Anything Model 2 | Osher Rafaeli et.al. | 2408.06970 | null |
2024-08-14 | A Comprehensive Survey on Synthetic Infrared Image synthesis | Avinash Upadhyay et.al. | 2408.06868 | null |
2024-08-13 | IFShip: A Large Vision-Language Model for Interpretable Fine-grained Ship Classification via Domain Knowledge-Enhanced Instruction Tuning | Mingning Guo et.al. | 2408.06631 | null |
2024-08-12 | On the Peril of Inferring Phytoplankton Properties from Remote-Sensing Observations | J. Xavier Prochaska et.al. | 2408.06149 | null |
2024-08-11 | Seg-CycleGAN : SAR-to-optical image translation guided by a downstream task | Hannuo Zhang et.al. | 2408.05777 | null |
2024-08-09 | Modeling and Analysis of Downlink Communications in a Heterogeneous LEO Satellite Network | Chang-Sik Choi et.al. | 2408.05070 | null |
2024-08-08 | AI for operational methane emitter monitoring from space | Anna Vaughan et.al. | 2408.04745 | null |
2024-08-08 | Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height Estimation | Daniele Rege Cambrin et.al. | 2408.04523 | link |
2024-08-08 | Dual-branch PolSAR Image Classification Based on GraphMAE and Local Feature Extraction | Yuchen Wang et.al. | 2408.04294 | null |
2024-08-08 | Quantum-Enhanced Polarimetric Imaging | Meng-Yu Xie et.al. | 2408.04183 | null |
2024-08-08 | Integrated Dynamic Phenological Feature for Remote Sensing Image Land Cover Change Detection | Yi Liu et.al. | 2408.04144 | null |
2024-08-07 | Prospects for using drones to test formation-flying CubeSat concepts, and other astronomical applications | John D. Monnier et.al. | 2408.03911 | null |
2024-08-07 | Monitoring of Hermit Crabs Using drone-captured imagery and Deep Learning based Super-Resolution Reconstruction and Improved YOLOv8 | Fan Zhao et.al. | 2408.03559 | null |
2024-08-06 | AI Foundation Models in Remote Sensing: A Survey | Siqi Lu et.al. | 2408.03464 | null |
2024-08-04 | Masked Angle-Aware Autoencoder for Remote Sensing Images | Zhihao Li et.al. | 2408.01946 | link |
2024-08-03 | Quantum Lotka-Volterra dynamics | Yuechun Jiao et.al. | 2408.01726 | null |
2024-08-02 | Deep Learning Meets OBIA: Tasks, Challenges, Strategies, and Perspectives | Lei Ma et.al. | 2408.01607 | null |
2024-07-30 | SSPA: Split-and-Synthesize Prompting with Gated Alignments for Multi-Label Image Recognition | Hao Tan et.al. | 2407.20920 | null |
2024-07-29 | Urban Traffic Accident Risk Prediction Revisited: Regionality, Proximity, Similarity and Sparsity | Minxiao Chen et.al. | 2407.19668 | link |
2024-07-29 | Towards a Knowledge guided Multimodal Foundation Model for Spatio-Temporal Remote Sensing Applications | Praveen Ravirathinam et.al. | 2407.19660 | null |
2024-07-25 | HAMSTER: Hyperspectral Albedo Maps dataset with high Spatial and TEmporal Resolution | Giulia Roccetti et.al. | 2407.18030 | null |
2024-07-24 | An Energy-Efficient Artefact Detection Accelerator on FPGAs for Hyper-Spectral Satellite Imagery | Cornell Castelino et.al. | 2407.17647 | null |
2024-07-24 | EuroCropsML: A Time Series Benchmark Dataset For Few-Shot Crop Type Classification | Joana Reuss et.al. | 2407.17458 | null |
2024-07-24 | CDDIP: Constrained Diffusion-Driven Deep Image Prior for Seismic Image Reconstruction | Paul Goyes-Peñafiel et.al. | 2407.17402 | link |
2024-07-24 | Quanv4EO: Empowering Earth Observation by means of Quanvolutional Neural Networks | Alessandro Sebastianelli et.al. | 2407.17108 | null |
2024-07-23 | Integrating Biological Data into Autonomous Remote Sensing Systems for In Situ Imageomics: A Case Study for Kenyan Animal Behavior Sensing with Unmanned Aerial Vehicles (UAVs) | Jenna M. Kline et.al. | 2407.16864 | null |
2024-07-23 | A Multitask Deep Learning Model for Classification and Regression of Hyperspectral Images: Application to the large-scale dataset | Koushikey Chhapariya et.al. | 2407.16384 | null |
2024-07-23 | Sizey: Memory-Efficient Execution of Scientific Workflow Tasks | Jonathan Bader et.al. | 2407.16353 | null |
2024-07-23 | HyTAS: A Hyperspectral Image Transformer Architecture Search Benchmark and Analysis | Fangqin Zhou et.al. | 2407.16269 | link |
2024-07-23 | Cross-Domain Separable Translation Network for Multimodal Image Change Detection | Tao Zhan et.al. | 2407.16158 | link |
2024-07-24 | Self-driving lab discovers principles for steering spontaneous emission | Saaketh Desai et.al. | 2407.16083 | null |
2024-07-22 | EfficientCD: A New Strategy For Change Detection Based With Bi-temporal Layers Exchanged | Sijun Dong et.al. | 2407.15999 | link |
2024-07-22 | PRIME: Blind Multispectral Unmixing Using Virtual Quantum Prism and Convex Geometry | Chia-Hsiang Lin et.al. | 2407.15358 | null |
2024-07-22 | Fever Detection with Infrared Thermography: Enhancing Accuracy through Machine Learning Techniques | Parsa Razmara et.al. | 2407.15302 | null |
2024-07-21 | Rethinking Feature Backbone Fine-tuning for Remote Sensing Object Detection | Yechan Kim et.al. | 2407.15143 | null |
2024-07-20 | PolyR-CNN: R-CNN for end-to-end polygonal building outline extraction | Weiqin Jiao et.al. | 2407.14912 | null |
2024-07-20 | CrossDehaze: Scaling Up Image Dehazing with Cross-Data Vision Alignment and Augmentation | Yukai Shi et.al. | 2407.14823 | link |
2024-07-20 | Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures | Jiaxing Huang et.al. | 2407.14754 | link |
2024-07-20 | Minh-Quan Le et.al. | 2407.14709 | null | |
2024-07-25 | Continual Panoptic Perception: Towards Multi-modal Incremental Interpretation of Remote Sensing Images | Bo Yuan et.al. | 2407.14242 | link |
2024-07-19 | The Cardinality of Identifying Code Sets for Soccer Ball Graph with Application to Remote Sensing | Anna L. D. Latour et.al. | 2407.14120 | link |
2024-07-19 | Semantic-CC: Boosting Remote Sensing Image Change Captioning via Foundational Knowledge and Semantic Guidance | Yongshuo Zhu et.al. | 2407.14032 | null |
2024-07-18 | Quantifying uncertainty in area and regression coefficient estimation from remote sensing maps | Kerri Lu et.al. | 2407.13659 | null |
2024-07-20 | EarthMarker: Visual Prompt Learning for Region-level and Point-level Remote Sensing Imagery Comprehension | Wei Zhang et.al. | 2407.13596 | link |
2024-07-18 | Wavelet-based Bi-dimensional Aggregation Network for SAR Image Change Detection | Jiangwei Xie et.al. | 2407.13151 | link |
2024-07-17 | Exploiting Inter-Image Similarity Prior for Low-Bitrate Remote Sensing Image Compression | Junhui Li et.al. | 2407.12295 | null |
2024-07-17 | UP-Diff: Latent Diffusion Model for Remote Sensing Urban Prediction | Zeyu Wang et.al. | 2407.11578 | null |
2024-07-17 | RIMformer: An End-to-End Transformer for FMCW Radar Interference Mitigation | Ziang Zhang et.al. | 2407.11459 | null |
2024-07-16 | Mapping savannah woody vegetation at the species level with multispecral drone and hyperspectral EnMAP data | Christina Karakizi et.al. | 2407.11404 | null |
2024-07-14 | Harnessing Feature Clustering For Enhanced Anomaly Detection With Variational Autoencoder And Dynamic Threshold | Tolulope Ale et.al. | 2407.10042 | null |
2024-07-13 | MutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection | Ziyue Huang et.al. | 2407.09920 | link |
2024-07-11 | Segmentation-guided Attention for Visual Question Answering from Remote Sensing Images | Lucrezia Tosato et.al. | 2407.08669 | null |
2024-07-11 | Haar Nuclear Norms with Applications to Remote Sensing Imagery Restoration | Shuang Xu et.al. | 2407.08509 | null |
2024-07-11 | Paving the way toward foundation models for irregular and unaligned Satellite Image Time Series | Iris Dumeur et.al. | 2407.08448 | null |
2024-07-11 | XAI-Guided Enhancement of Vegetation Indices for Crop Mapping | Hiba Najjar et.al. | 2407.08298 | null |
2024-07-11 | Explainability of Sub-Field Level Crop Yield Prediction using Remote Sensing | Hiba Najjar et.al. | 2407.08274 | null |
2024-07-11 | DMM: Disparity-guided Multispectral Mamba for Oriented Object Detection in Remote Sensing | Minghang Zhou et.al. | 2407.08132 | null |
2024-07-10 | PaliGemma: A versatile 3B VLM for transfer | Lucas Beyer et.al. | 2407.07726 | link |
2024-07-10 | The deep oxygen abundance in Solar System Giant Planets, with a new derivation for Saturn | Thibault Cavalié et.al. | 2407.07515 | null |
2024-07-10 | Bayesian weighted time-lapse full-waveform inversion using a receiver-extension strategy | Sergio Luiz E. F. da Silva et.al. | 2407.07467 | null |
2024-07-13 | Dual-stage Hyperspectral Image Classification Model with Spectral Supertoken | Peifu Liu et.al. | 2407.07307 | link |
2024-07-10 | Identity-enabled CDMA LiDAR for massively parallel ranging with a single-element receiver | Yixiu Shen et.al. | 2407.06918 | null |
2024-07-08 | A Mamba-based Siamese Network for Remote Sensing Change Detection | Jay N. Paranjape et.al. | 2407.06839 | link |
2024-07-08 | Tile Compression and Embeddings for Multi-Label Classification in GeoLifeCLEF 2024 | Anthony Miyaguchi et.al. | 2407.06326 | link |
2024-07-07 | Addressing single object tracking in satellite imagery through prompt-engineered solutions | Athena Psalta et.al. | 2407.05518 | null |
2024-07-07 | HyperKAN: Kolmogorov-Arnold Networks make Hyperspectral Image Classificators Smarter | Valeriy Lobanov et.al. | 2407.05278 | link |
2024-07-07 | Estimation of the Area and Precipitation Associated with a Tropical Cyclone Biparjoy by using Image Processing | Shikha Verma et.al. | 2407.05255 | null |
2024-07-06 | BlessemFlood21: Advancing Flood Analysis with a High-Resolution Georeferenced Dataset for Humanitarian Aid Support | Vladyslav Polushko et.al. | 2407.05007 | null |
2024-07-04 | MineNetCD: A Benchmark for Global Mining Change Detection on Remote Sensing Imagery | Weikang Yu et.al. | 2407.03971 | null |
2024-07-04 | High-Frequency Radar observation of strong and contrasted currents: the Alderney race paradigm | Dylan Dumas et.al. | 2407.03827 | null |
2024-07-04 | reBEN: Refined BigEarthNet Dataset for Remote Sensing Image Analysis | Kai Norman Clasen et.al. | 2407.03653 | link |
2024-07-03 | Relating CNN-Transformer Fusion Network for Change Detection | Yuhao Gao et.al. | 2407.03178 | link |
2024-07-03 | ISWSST: Index-space-wave State Superposition Transformers for Multispectral Remotely Sensed Imagery Semantic Segmentation | Chang Li et.al. | 2407.03033 | null |
2024-07-03 | Style Alignment based Dynamic Observation Method for UAV-View Geo-localization | Jie Shao et.al. | 2407.02832 | null |
2024-07-08 | Holistically-Nested Structure-Aware Graph Neural Network for Road Extraction | Tinghuai Wang et.al. | 2407.02639 | null |
2024-07-02 | Efficient Stochastic Differential Equation for DEM Super Resolution with Void Filling | Tongtong Zhang et.al. | 2407.01908 | null |
2024-06-26 | Graph Neural Network as Computationally Efficient Emulator of Ice-sheet and Sea-level System Model (ISSM) | Younghyun Koo et.al. | 2407.01464 | null |
2024-07-01 | Hyperspectral Pansharpening: Critical Review, Tools and Future Perspectives | Matteo Ciotola et.al. | 2407.01355 | link |
2024-07-01 | Small Aerial Target Detection for Airborne Infrared Detection Systems using LightGBM and Trajectory Constraints | Xiaoliang Sun et.al. | 2407.01278 | null |
2024-07-01 | FALCON: Frequency Adjoint Link with CONtinuous Density Mask for Fast Single Image Dehazing | Donghyun Kim et.al. | 2407.00972 | null |
2024-07-01 | Optical turbulence vertical distribution at the Peak Terskol Observatory and Mt. Kurapdag | A. Y. Shikhovtsev et.al. | 2407.00960 | null |
2024-06-30 | Prediction of Sentinel-2 multi-band imagery with attention BiLSTM for continuous earth surface monitoring | Weiying Zhao et.al. | 2407.00834 | null |
2024-06-30 | Fully invertible hyperbolic neural networks for segmenting large-scale surface and sub-surface data | Bas Peters et.al. | 2407.00595 | null |
2024-06-29 | SolarSAM: Building-scale Photovoltaic Potential Assessment Based on Segment Anything Model (SAM) and Remote Sensing for Emerging City | Guohao Wang et.al. | 2407.00296 | link |
2024-06-28 | Monolithic lithium niobate photonic chip for efficient terahertz-optic modulation and terahertz generation | Yiwen Zhang et.al. | 2406.19620 | null |
2024-06-27 | Cost-efficient Active Illumination Camera For Hyper-spectral Reconstruction | Yuxuan Zhang et.al. | 2406.19560 | null |
2024-06-27 | Secure quantum-enhanced measurements on a network of sensors | Sean William Moore et.al. | 2406.19285 | null |
2024-06-27 | Simultaneous determination of the dielectric relaxation behavior and soilwater characteristic curve of undisturbed soil samples | Norman Wagner et.al. | 2406.18909 | null |
2024-06-26 | Evaluating and Benchmarking Foundation Models for Earth Observation and Geospatial AI | Nikolaos Dionelis et.al. | 2406.18295 | null |
2024-06-26 | CAS: Confidence Assessments of classification algorithms for Semantic segmentation of EO data | Nikolaos Dionelis et.al. | 2406.18279 | null |
2024-06-26 | SynRS3D: A Synthetic Dataset for Global 3D Semantic Understanding from Monocular Remote Sensing Imagery | Jian Song et.al. | 2406.18151 | null |
2024-06-26 | Changen2: Multi-Temporal Remote Sensing Generative Change Foundation Model | Zhuo Zheng et.al. | 2406.17998 | link |
2024-06-25 | Cross-Modal Spherical Aggregation for Weakly Supervised Remote Sensing Shadow Removal | Kaichen Chi et.al. | 2406.17469 | null |
2024-06-25 | Continuous Urban Change Detection from Satellite Image Time Series with Temporal Feature Refinement and Multi-Task Integration | Sebastian Hafner et.al. | 2406.17458 | link |
2024-06-24 | Quantifying Heterogeneous Ecosystem Services With Multi-Label Soft Classification | Zhihui Tian et.al. | 2406.17147 | null |
2024-06-19 | Generative Data Assimilation of Sparse Weather Station Observations at Kilometer Scales | Peter Manshausen et.al. | 2406.16947 | null |
2024-06-24 | Multi-Modal Vision Transformers for Crop Mapping from Satellite Image Time Series | Theresa Follath et.al. | 2406.16513 | null |
2024-07-02 | LOGCAN++: Adaptive Local-global class-aware network for semantic segmentation of remote sensing imagery | Xiaowen Ma et.al. | 2406.16502 | link |
2024-06-23 | UDHF2-Net: An Uncertainty-diffusion-model-based High-Frequency TransFormer Network for High-accuracy Interpretation of Remotely Sensed Imagery | Pengfei Zhang et.al. | 2406.16129 | null |
2024-06-22 | Single-Temporal Supervised Learning for Universal Remote Sensing Change Detection | Zhuo Zheng et.al. | 2406.15694 | link |
2024-06-21 | Miniature fluorescence sensor for quantitative detection of brain tumour | Jean Pierre Ndabakuranye et.al. | 2406.15520 | null |
2024-06-21 | Rethinking Remote Sensing Change Detection With A Mask View | Xiaowen Ma et.al. | 2406.15320 | link |
2024-06-21 | Understanding the variability of helium abundance in the solar corona using three-fluid modeling and UV observations | Leon Ofman et.al. | 2406.14897 | null |
2024-07-01 | Evaluation of Deep Learning Semantic Segmentation for Land Cover Mapping on Multispectral, Hyperspectral and High Spatial Aerial Imagery | Ilham Adi Panuntun et.al. | 2406.14220 | null |
2024-06-20 | Semi Supervised Heterogeneous Domain Adaptation via Disentanglement and Pseudo-Labelling | Cassio F. Dantas et.al. | 2406.14087 | link |
2024-06-20 | Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images | Qinfeng Zhu et.al. | 2406.14086 | link |
2024-06-21 | CMTNet: Convolutional Meets Transformer Network for Hyperspectral Images Classification | Faxu Guo et.al. | 2406.14080 | null |
2024-06-19 | Locating and measuring marine aquaculture production from space: a computer vision approach in the French Mediterranean | Sebastian Quaade et.al. | 2406.13847 | null |
2024-06-22 | Velocity Analysis of Moving Objects in Earth Observation Satellite Images Using Multi-Spectral Push Broom Scanning | Eric Keto et.al. | 2406.13710 | null |
2024-06-19 | DDLNet: Boosting Remote Sensing Change Detection with Dual-Domain Learning | Xiaowen Ma et.al. | 2406.13606 | link |
2024-06-19 | Formation of a Magnetic Cloud from the Merging of Two Successive Coronal Mass Ejections | Chong Chen et.al. | 2406.13603 | null |
2024-06-19 | Towards a multimodal framework for remote sensing image change retrieval and captioning | Roger Ferrod et.al. | 2406.13424 | link |
2024-06-19 | Multi-scale Restoration of Missing Data in Optical Time-series Images with Masked Spatial-Temporal Attention Network | Zaiyan Zhang et.al. | 2406.13358 | link |
2024-06-18 | Scale-Translation Equivariant Network for Oceanic Internal Solitary Wave Localization | Zhang Wan et.al. | 2406.13060 | link |
2024-06-18 | ChangeViT: Unleashing Plain Vision Transformers for Change Detection | Duowang Zhu et.al. | 2406.12847 | link |
2024-06-21 | Windows Into Other Worlds: Pitfalls in the physical interpretation of exoplanet atmospheric spectroscopy | Darius Modirrousta-Galian et.al. | 2406.12765 | null |
2024-06-18 | RS-GPT4V: A Unified Multimodal Instruction-Following Dataset for Remote Sensing Image Understanding | Linrui Xu et.al. | 2406.12479 | link |
2024-06-18 | VRSBench: A Versatile Vision-Language Benchmark Dataset for Remote Sensing Image Understanding | Xiang Li et.al. | 2406.12384 | link |
2024-06-17 | Scaling Efficient Masked Autoencoder Learning on Large Remote Sensing Dataset | Fengxiang Wang et.al. | 2406.11933 | link |
2024-06-17 | HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model | Di Wang et.al. | 2406.11519 | link |
2024-06-17 | Diffusion Models in Low-Level Vision: A Survey | Chunming He et.al. | 2406.11138 | link |
2024-06-16 | ALPS: An Auto-Labeling and Pre-training Scheme for Remote Sensing Segmentation With Segment Anything Model | Song Zhang et.al. | 2406.10855 | link |
2024-06-16 | PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery | Libo Wang et.al. | 2406.10828 | link |
2024-06-15 | Beyond the Visible: Jointly Attending to Spectral and Spatial Dimensions with HSI-Diffusion for the FINCH Spacecraft | Ian Vyse et.al. | 2406.10724 | link |
2024-06-14 | Annotation Cost-Efficient Active Learning for Deep Metric Learning Driven Remote Sensing Image Retrieval | Genc Hoxha et.al. | 2406.10107 | null |
2024-06-14 | SkySenseGPT: A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding | Junwei Luo et.al. | 2406.10100 | link |
2024-06-14 | Soil nitrogen forecasting from environmental variables provided by multisensor remote sensing images | Weiying Zhao et.al. | 2406.09812 | null |
2024-06-13 | Modelling the magnetic vectors of ICMEs at different heliocentric distances with INFROS | Ranadeep Sarkar et.al. | 2406.09247 | null |
2024-06-16 | A |
Lixian Zhang et.al. | 2406.08079 | null |
2024-06-12 | Deep Learning for Slum Mapping in Remote Sensing Images: A Meta-analysis and Review | Anjali Raj et.al. | 2406.08031 | null |
2024-06-12 | Real-time, chirped-pulse heterodyne detection at room-temperature with 100GHz 3dB-bandwidth mid-infrared quantum-well photodetectors | Quyang Lin et.al. | 2406.08027 | null |
2024-06-11 | Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery | Biplov Bhandari et.al. | 2406.07482 | link |
2024-06-11 | Characterizing GPROF Regional Bias Using Radar-Derived Hydrometeor Information | Eric Goldenstern et.al. | 2406.07344 | null |
2024-06-11 | Grapevine Disease Prediction Using Climate Variables from Multi-Sensor Remote Sensing Imagery via a Transformer Model | Weiying Zhao et.al. | 2406.07094 | null |
2024-06-11 | RS-Agent: Automating Remote Sensing Tasks through Intelligent Agents | Wenjia Xu et.al. | 2406.07089 | null |
2024-06-11 | RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks | Zhechao Wang et.al. | 2406.07032 | null |
2024-06-10 | An Elliptic Kernel Unsupervised Autoencoder-Graph Convolutional Network Ensemble Model for Hyperspectral Unmixing | Estefania Alfaro-Mejia et.al. | 2406.06742 | null |
2024-06-10 | ReCon1M:A Large-scale Benchmark Dataset for Relation Comprehension in Remote Sensing Imagery | Xian Sun et.al. | 2406.06028 | null |
2024-06-09 | BOSC: A toolbox for aerial imagery mapping | Ricard Durall et.al. | 2406.05833 | link |
2024-06-09 | Gentle-CLIP: Exploring Aligned Semantic In Low-Quality Multimodal Data With Soft Alignment | Zijia Song et.al. | 2406.05766 | null |
2024-06-15 | A DeNoising FPN With Transformer R-CNN for Tiny Object Detection | Hou-I Liu et.al. | 2406.05755 | link |
2024-06-09 | HDMba: Hyperspectral Remote Sensing Imagery Dehazing with State Space Model | Hang Fu et.al. | 2406.05700 | link |
2024-06-09 | SRC-Net: Bi-Temporal Spatial Relationship Concerned Network for Change Detection | Hongjia Chen et.al. | 2406.05668 | link |
2024-06-09 | Which Backbone to Use: A Resource-efficient Domain Specific Comparison for Computer Vision | Pranav Jeevan et.al. | 2406.05612 | link |
2024-06-08 | A Deep Learning-Augmented Stand-off Radar Scheme for Rapidly Detecting Tree Defects | Jiwei Qian et.al. | 2406.05389 | null |
2024-06-07 | Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment | Venkanna Babu Guthula et.al. | 2406.04949 | null |
2024-06-07 | MGIMM: Multi-Granularity Instruction Multimodal Model for Attribute-Guided Remote Sensing Image Detailed Description | Cong Yang et.al. | 2406.04716 | link |
2024-06-07 | UCDNet: Multi-UAV Collaborative 3D Object Detection Network by Reliable Feature Mapping | Pengju Tian et.al. | 2406.04648 | null |
2024-06-06 | SpectralZoom: Efficient Segmentation with an Adaptive Hyperspectral Camera | Jackson Arnold et.al. | 2406.04287 | null |
2024-06-06 | M3LEO: A Multi-Modal, Multi-Label Earth Observation Dataset Integrating Interferometric SAR and RGB Data | Matthew J Allen et.al. | 2406.04230 | link |
2024-06-06 | CDMamba: Remote Sensing Image Change Detection with Mamba | Haotian Zhang et.al. | 2406.04207 | link |
2024-06-06 | LDM-RSIC: Exploring Distortion Prior with Latent Diffusion Models for Remote Sensing Image Compression | Junhui Li et.al. | 2406.03961 | null |
2024-06-09 | Partial Label Learning with Focal Loss for Sea Ice Classification Based on Ice Charts | Behzad Vahedi et.al. | 2406.03645 | null |
2024-06-05 | Foundation Models for Geophysics: Reviews and Perspectives | Qi Liu et.al. | 2406.03163 | null |
2024-06-05 | P2PFormer: A Primitive-to-polygon Method for Regular Building Contour Extraction from Remote Sensing Images | Tao Zhang et.al. | 2406.02930 | null |
2024-06-04 | Low-Rank Adaption on Transformer-based Oriented Object Detector for Satellite Onboard Processing of Remote Sensing Images | Xinyang Pu et.al. | 2406.02385 | link |
2024-06-03 | Sparse Focus Network for Multi-Source Remote Sensing Data Classification | Xuepeng Jin et.al. | 2406.01245 | null |
2024-06-03 | LSKSANet: A Novel Architecture for Remote Sensing Image Semantic Segmentation Leveraging Large Selective Kernel and Sparse Attention Mechanism | Miao Fu et.al. | 2406.01228 | null |
2024-06-02 | Kolmogorov-Arnold Network for Satellite Image Classification in Remote Sensing | Minjong Cheon et.al. | 2406.00600 | link |
2024-06-04 | Analyzing trends for agricultural decision support system using twitter data | Sneha Jha et.al. | 2406.00577 | null |
2024-06-01 | A Review of Pulse-Coupled Neural Network Applications in Computer Vision and Image Processing | Nurul Rafi et.al. | 2406.00239 | null |
2024-05-31 | ImplicitTerrain: a Continuous Surface Model for Terrain Data Analysis | Haoan Feng et.al. | 2406.00227 | null |
2024-05-31 | Responsible AI for Earth Observation | Pedram Ghamisi et.al. | 2405.20868 | null |
2024-05-31 | Maximum Temperature Prediction Using Remote Sensing Data Via Convolutional Neural Network | Lorenzo Innocenti et.al. | 2405.20731 | null |
2024-05-30 | P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation | Qi Zhang et.al. | 2405.20443 | link |
2024-05-30 | FMARS: Annotating Remote Sensing Images for Disaster Management using Foundation Models | Edoardo Arnaudo et.al. | 2405.20109 | link |
2024-05-30 | Rapid Wildfire Hotspot Detection Using Self-Supervised Learning on Temporal Remote Sensing Data | Luca Barco et.al. | 2405.20093 | link |
2024-05-30 | Recipes for forming a carbon-rich giant planet | Olivier Mousis et.al. | 2405.19748 | null |
2024-05-30 | Twin Deformable Point Convolutions for Point Cloud Semantic Segmentation in Remote Sensing Scenes | Yong-Qiang Mao et.al. | 2405.19735 | null |
2024-05-30 | Research on Foundation Model for Spatial Data Intelligence: China's 2024 White Paper on Strategic Development of Spatial Data Intelligence | Shaohua Wang et.al. | 2405.19730 | null |
2024-06-02 | Large-scale DSM registration via motion averaging | Ningli Xu et.al. | 2405.19442 | null |
2024-06-05 | FUSU: A Multi-temporal-source Land Use Change Segmentation Dataset for Fine-grained Urban Semantic Understanding | Shuai Yuan et.al. | 2405.19055 | link |
2024-05-29 | Transcending Fusion: A Multi-Scale Alignment Method for Remote Sensing Image-Text Retrieval | Rui Yang et.al. | 2405.18959 | link |
2024-05-29 | MAGIC: Modular Auto-encoder for Generalisable Model Inversion with Bias Corrections | Yihang She et.al. | 2405.18953 | link |
2024-05-29 | Spectral Fidelity and Spatial Enhancement: An Assessment and Cascading of Pan-Sharpening Techniques for Satellite Imagery | Abdul Aziz A. B et.al. | 2405.18900 | null |
2024-05-29 | Refinement of global coronal and interplanetary magnetic field extrapolations constrained by remote-sensing and in-situ observations at the solar minimum | Guanglu Shi et.al. | 2405.18665 | null |
2024-05-28 | Probing the Information Theoretical Roots of Spatial Dependence Measures | Zhangyu Wang et.al. | 2405.18459 | link |
2024-05-28 | SSLChange: A Self-supervised Change Detection Framework Based on Domain Adaptation | Yitao Zhao et.al. | 2405.18224 | link |
2024-05-28 | Near-Infrared and Low-Rank Adaptation of Vision Transformers in Remote Sensing | Irem Ulku et.al. | 2405.17901 | null |
2024-05-28 | Towards Efficient Disaster Response via Cost-effective Unbiased Class Rate Estimation through Neyman Allocation Stratified Sampling Active Learning | Yanbing Bai et.al. | 2405.17734 | null |
2024-05-27 | Robust Perception and Navigation of Autonomous Surface Vehicles in Challenging Environments | Mingi Jeong et.al. | 2405.17657 | null |
2024-05-27 | Refraction FWI of a circular shot OBN acquisition in the Brazilian pre-salt region | Sérgio Luiz E. F. da Silva et.al. | 2405.17330 | null |
2024-05-27 | Deep Feature Gaussian Processes for Single-Scene Aerosol Optical Depth Reconstruction | Shengjie Liu et.al. | 2405.17262 | null |
2024-05-27 | SDL-MVS: View Space and Depth Deformable Learning Paradigm for Multi-View Stereo Reconstruction in Remote Sensing | Yong-Qiang Mao et.al. | 2405.17140 | null |
2024-05-27 | Superpixelwise Low-rank Approximation based Partial Label Learning for Hyperspectral Image Classification | Shujun Yang et.al. | 2405.17110 | link |
2024-05-27 | Evaluation of Resource-Efficient Crater Detectors on Embedded Systems | Simon Vellas et.al. | 2405.16953 | link |
2024-05-24 | Multimodal Object Detection via Probabilistic a priori Information Integration | Hafsa El Hafyani et.al. | 2405.15596 | link |
2024-05-29 | Composed Image Retrieval for Remote Sensing | Bill Psomas et.al. | 2405.15587 | link |
2024-05-24 | MagicBathyNet: A Multimodal Remote Sensing Dataset for Bathymetry Prediction and Pixel-based Classification in Shallow Waters | Panagiotis Agrafiotis et.al. | 2405.15477 | link |
2024-05-24 | Comparing remote sensing-based forest biomass mapping approaches using new forest inventory plots in contrasting forests in northeastern and southwestern China | Wenquan Dong et.al. | 2405.15438 | null |
2024-05-24 | Transformer-based Federated Learning for Multi-Label Remote Sensing Image Classification | Barış Büyüktaş et.al. | 2405.15405 | null |
2024-05-24 | Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasets | Hoàng-Ân Lê et.al. | 2405.15394 | link |
2024-05-23 | Dual-comb correlation spectroscopy of thermal light | Eugene J. Tsao et.al. | 2405.14842 | null |
2024-05-23 | Multi-view Remote Sensing Image Segmentation With SAM priors | Zipeng Qi et.al. | 2405.14171 | null |
2024-05-23 | Hyperspectral Image Dataset for Individual Penguin Identification | Youta Noboru et.al. | 2405.14146 | null |
2024-05-22 | AutoLCZ: Towards Automatized Local Climate Zone Mapping from Rule-Based Remote Sensing | Chenying Liu et.al. | 2405.13993 | null |
2024-05-22 | Embedding Generalized Semantic Knowledge into Few-Shot Remote Sensing Segmentation | Yuyu Jia et.al. | 2405.13686 | null |
2024-05-22 | MetaEarth: A Generative Foundation Model for Global-Scale Remote Sensing Image Generation | Zhiping Yu et.al. | 2405.13570 | null |
2024-05-22 | Euclid. I. Overview of the Euclid mission | Euclid Collaboration et.al. | 2405.13491 | null |
2024-05-22 | A Label Propagation Strategy for CutMix in Multi-Label Remote Sensing Image Classification | Tom Burgert et.al. | 2405.13451 | null |
2024-05-21 | Global-Local Detail Guided Transformer for Sea Ice Recognition in Optical Remote Sensing Images | Zhanchao Huang et.al. | 2405.13197 | null |
2024-05-21 | Diffusion-RSCC: Diffusion Probabilistic Model for Change Captioning in Remote Sensing Images | Xiaofei Yu et.al. | 2405.12875 | link |
2024-05-21 | 3DSS-Mamba: 3D-Spectral-Spatial Mamba for Hyperspectral Image Classification | Yan He et.al. | 2405.12487 | null |
2024-05-25 | Mamba-in-Mamba: Centralized Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image Classification | Weilian Zhou et.al. | 2405.12003 | link |
2024-05-20 | Climatic & Anthropogenic Hazards to the Nasca World Heritage: Application of Remote Sensing, AI, and Flood Modelling | Masato Sakai et.al. | 2405.11814 | null |
2024-05-18 | InfRS: Incremental Few-Shot Object Detection in Remote Sensing Images | Wuzhou Li et.al. | 2405.11293 | link |
2024-05-17 | Ptychographic non-line-of-sight imaging for depth-resolved visualization of hidden objects | Pengming Song et.al. | 2405.11115 | null |
2024-05-17 | Photorealistic 3D Urban Scene Reconstruction and Point Cloud Extraction using Google Earth Imagery and Gaussian Splatting | Kyle Gao et.al. | 2405.11021 | null |
2024-05-17 | CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation | Mushui Liu et.al. | 2405.10530 | link |
2024-05-17 | Enhancing Perception Quality in Remote Sensing Image Compression via Invertible Neural Network | Junhui Li et.al. | 2405.10518 | null |
2024-05-16 | Region-level labels in ice charts can produce pixel-level segmentation for Sea Ice types | Muhammed Patel et.al. | 2405.10456 | null |
2024-05-16 | PIR: Remote Sensing Image-Text Retrieval with Prior Instruction Representation Learning | Jiancheng Pan et.al. | 2405.10160 | link |
2024-05-16 | RSDehamba: Lightweight Vision Mamba for Remote Sensing Satellite Image Dehazing | Huiling Zhou et.al. | 2405.10030 | null |
2024-05-16 | Cross-sensor self-supervised training and alignment for remote sensing | Valerio Marsocci et.al. | 2405.09922 | null |
2024-05-16 | Many-Shot In-Context Learning in Multimodal Foundation Models | Yixing Jiang et.al. | 2405.09798 | link |
2024-05-16 | LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretation | Wentao Jiang et.al. | 2405.09789 | link |
2024-05-15 | SARATR-X: A Foundation Model for Synthetic Aperture Radar Images Target Recognition | Weijie L et.al. | 2405.09365 | null |
2024-05-15 | RSHazeDiff: A Unified Fourier-aware Diffusion Model for Remote Sensing Image Dehazing | Jiamei Xiong et.al. | 2405.09083 | link |
2024-05-15 | Dim Small Target Detection and Tracking: A Novel Method Based on Temporal Energy Selective Scaling and Trajectory Association | Weihua Gao et.al. | 2405.09054 | null |
2024-05-15 | Dynamic Loss Decay based Robust Oriented Object Detection on Remote Sensing Images with Noisy Labels | Guozhang Liu et.al. | 2405.09024 | null |
2024-05-14 | Promoting AI Equity in Science: Generalized Domain Prompt Learning for Accessible VLM Research | Qinglong Cao et.al. | 2405.08668 | link |
2024-05-14 | Rethinking Scanning Strategies with Vision Mamba in Semantic Segmentation of Remote Sensing Imagery: An Experimental Study | Qinfeng Zhu et.al. | 2405.08493 | null |
2024-05-13 | IMAFD: An Interpretable Multi-stage Approach to Flood Detection from time series Multispectral Data | Ziyang Zhang et.al. | 2405.07916 | null |
2024-05-13 | Sub-percent Characterization and Polarimetric Performance Analysis of Commercial Micro-polarizer Array Detectors | Thijs Stockmans et.al. | 2405.07864 | null |
2024-05-13 | Dehazing Remote Sensing and UAV Imagery: A Review of Deep Learning, Prior-based, and Hybrid Approaches | Gao Yu Lee et.al. | 2405.07520 | null |
2024-05-11 | Semantic Guided Large Scale Factor Remote Sensing Image Super-resolution with Generative Diffusion Prior | Ce Wang et.al. | 2405.07044 | link |
2024-05-10 | Ocean-DC: An analysis ready data cube framework for environmental and climate change monitoring over the port areas | Ioannis Kavouras et.al. | 2405.06730 | null |
2024-05-10 | A Lightweight Transformer for Remote Sensing Image Change Captioning | Dongwei Sun et.al. | 2405.06598 | null |
2024-05-10 | Comparative Analysis of Advanced Feature Matching Algorithms in Challenging High Spatial Resolution Optical Satellite Stereo Scenarios | Qiyan Luo et.al. | 2405.06246 | null |
2024-05-09 | UnSegGNet: Unsupervised Image Segmentation using Graph Neural Networks | Kovvuri Sai Gopal Reddy et.al. | 2405.06057 | link |
2024-05-09 | Exploring Text-Guided Single Image Editing for Remote Sensing Images | Fangzhou Han et.al. | 2405.05769 | null |
2024-05-08 | EarthMatch: Iterative Coregistration for Fine-grained Localization of Astronaut Photography | Gabriele Berton et.al. | 2405.05422 | **[link](https://github.com/gmberton/Earth |