Mine-Arxiv

Contributors Forks Stargazers Issues

Updated on 2024.04.04

Table of Contents
  1. <a href=#diffusion>diffusion</a>
  2. <a href=#sketch>sketch</a>
  3. <a href=#3D-reconstruction>3D reconstruction</a>
  4. <a href=#generate>generate</a>
  5. <a href=#generation>generation</a>

diffusion

Publish Date Title Authors PDF Code
2024-04-03 Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction Keyu Tian et.al. 2404.02905v1 link
2024-04-03 LidarDM: Generative LiDAR Simulation in a Generated World Vlas Zyrianov et.al. 2404.02903v1 null
2024-04-03 MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment Duygu Ceylan et.al. 2404.02899v1 null
2024-04-03 On the Scalability of Diffusion-based Text-to-Image Generation Hao Li et.al. 2404.02883v1 null
2024-04-03 Fast Diffusion Model For Seismic Data Noise Attenuation Junheng Peng et.al. 2404.02767v1 null
2024-04-03 Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models Wentian Zhang et.al. 2404.02747v1 link
2024-04-03 InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation Haofan Wang et.al. 2404.02733v1 link
2024-04-03 Harnessing the Power of Large Vision Language Models for Synthetic Image Detection Mamadou Keita et.al. 2404.02726v1 null
2024-04-02 Diffusion $^2$ : Dynamic 3D Content Generation via Score Composition of Orthogonal Diffusion Models Zeyu Yang et.al. 2404.02148v1 link
2024-04-02 WcDT: World-centric Diffusion Transformer for Traffic Scene Generation Chen Yang et.al. 2404.02082v1 link
2024-04-02 AUTODIFF: Autoregressive Diffusion Modeling for Structure-based Drug Design Xinze Li et.al. 2404.02003v1 null
2024-04-02 Bi-LORA: A Vision-Language Approach for Synthetic Image Detection Mamadou Keita et.al. 2404.01959v1 null
2024-03-29 Relation Rectification in Diffusion Model Yinwei Wu et.al. 2403.20249v1 null
2024-03-29 Graph Neural Aggregation-diffusion with Metastability Kaiyuan Cui et.al. 2403.20221v1 null
2024-03-29 Motion Inversion for Video Customization Luozhou Wang et.al. 2403.20193v1 null
2024-03-29 FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models Barbara Toniella Corradini et.al. 2403.20105v1 null
2024-03-29 SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior Zhongrui Yu et.al. 2403.20079v1 null
2024-03-29 Optimal s-boxes against alternative operations Marco Calderini et.al. 2403.20059v1 null
2024-03-28 GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling Bowen Zhang et.al. 2403.19655v1 null
2024-03-28 Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond Katherine Xu et.al. 2403.19653v1 link
2024-03-28 InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction Sirui Xu et.al. 2403.19652v1 null
2024-03-28 GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models Yusuf Dalva et.al. 2403.19645v1 null
2024-03-28 Generalisation of the Spectral Difference scheme for the diffused-interface five equation model Niccolò Tonicello et.al. 2403.19623v1 null
2024-03-28 Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model Zhicai Wang et.al. 2403.19600v1 link
2024-03-28 Frame by Familiar Frame: Understanding Replication in Video Diffusion Models Aimon Rahman et.al. 2403.19593v1 null
2024-03-28 Keypoint Action Tokens Enable In-Context Imitation Learning in Robotics Norman Di Palo et.al. 2403.19578v1 null
2024-03-27 ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion Daniel Winter et.al. 2403.18818v1 null
2024-03-27 Garment3DGen: 3D Garment Stylization and Texture Generation Nikolaos Sarafianos et.al. 2403.18816v1 null
2024-03-28 ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation Suraj Patni et.al. 2403.18807v2 link
2024-03-27 Object Pose Estimation via the Aggregation of Diffusion Features Tianfu Wang et.al. 2403.18791v1 link
2024-03-27 ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object Chenshuang Zhang et.al. 2403.18775v1 link
2024-03-28 FlexEdit: Flexible and Controllable Diffusion-based Object-centric Image Editing Trong-Tung Nguyen et.al. 2403.18605v2 null
2024-03-27 HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions Hao Xu et.al. 2403.18575v1 link
2024-03-26 ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis Muhammad Hamza Mughal et.al. 2403.17936v1 null
2024-03-26 SLEDGE: Synthesizing Simulation Environments for Driving Agents with Generative Models Kashyap Chitta et.al. 2403.17933v1 null
2024-03-26 AID: Attention Interpolation of Text-to-Image Diffusion Qiyuan He et.al. 2403.17924v1 link
2024-03-26 Boosting Diffusion Models with Moving Average Sampling in Frequency Domain Yurui Qian et.al. 2403.17870v1 null
2024-03-26 The memory of Rayleigh-Taylor turbulence S. Thévenin et.al. 2403.17832v1 null
2024-03-26 DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions Sammy Christen et.al. 2403.17827v1 null
2024-03-25 Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning Sicong Pan et.al. 2403.16803v1 null
2024-03-25 Iso-Diffusion: Improving Diffusion Probabilistic Models Using the Isotropy of the Additive Gaussian Noise Dilum Fernando et.al. 2403.16790v1 null
2024-03-25 Multilevel Modeling as a Methodology for the Simulation of Human Mobility Luca Serena et.al. 2403.16745v1 null
2024-03-25 A Robotic Skill Learning System Built Upon Diffusion Policies and Foundation Models Nils Ingelhag et.al. 2403.16730v1 null
2024-03-25 Improving Diffusion Models’s Data-Corruption Resistance using Scheduled Pseudo-Huber Loss Artem Khrapov et.al. 2403.16728v1 link
2024-03-25 The effect of inter-track coupling on H $_2$O$_2$ productions Ramin Abolfath et.al. 2403.16722v1 null
2024-03-25 The Directionality of Gravitational and Thermal Diffusive Transport in Geologic Fluid Storage Anna Herring et.al. 2403.16659v1 null
2024-03-25 SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions Yuda Song et.al. 2403.16627v1 link
2024-03-25 SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation Aysim Toker et.al. 2403.16605v1 null
2024-03-22 DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data Hanrong Ye et.al. 2403.15389v1 null
2024-03-22 LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis Kevin Xie et.al. 2403.15385v1 null
2024-03-22 Controlled Training Data Generation with Diffusion Models Teresa Yeo et.al. 2403.15309v1 null
2024-03-22 Parametric PDE Control with Deep Reinforcement Learning and Differentiable L0-Sparse Polynomial Policies Nicolò Botteghi et.al. 2403.15267v1 null
2024-03-22 Spectral Motion Alignment for Video Motion Transfer using Diffusion Models Geon Yeong Park et.al. 2403.15249v1 null
2024-03-22 Shadow Generation for Composite Image Using Diffusion model Qingyang Liu et.al. 2403.15234v1 link
2024-03-22 Broad Instantaneous Bandwidth Microwave Spectrum Analyzer with a Microfabricated Atomic Vapor Cell Yongqi Shi et.al. 2403.15155v1 null
2024-03-22 Oxygenation of CO and NO on Amorphous Solid Water Meenu Upadhyay et.al. 2403.15141v1 null
2024-03-21 Simplified Diffusion Schrödinger Bridge Zhicong Tang et.al. 2403.14623v1 link
2024-03-21 GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation Yinghao Xu et.al. 2403.14621v1 link
2024-03-21 Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion Xiang Fan et.al. 2403.14617v1 null
2024-03-21 DreamReward: Text-to-3D Generation with Human Preference Junliang Ye et.al. 2403.14613v1 null
2024-03-21 ReNoise: Real Image Inversion Through Iterative Noising Daniel Garibi et.al. 2403.14602v1 null
2024-03-21 Click to Grasp: Zero-Shot Precise Manipulation via Visual Diffusion Descriptors Nikolaos Tsagkas et.al. 2403.14526v1 null
2024-03-21 Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation Mathias Öttl et.al. 2403.14429v1 null
2024-03-20 On Pretraining Data Diversity for Self-Supervised Learning Hasan Abed Al Kader Hammoud et.al. 2403.13808v1 link
2024-03-20 Editing Massive Concepts in Text-to-Image Diffusion Models Tianwei Xiong et.al. 2403.13807v1 link
2024-03-20 ZigMa: Zigzag Mamba Diffusion Model Vincent Tao Hu et.al. 2403.13802v1 link
2024-03-20 TimeRewind: Rewinding Time with Image-and-Events Video Diffusion Jingxi Chen et.al. 2403.13800v1 null
2024-03-20 DepthFM: Fast Monocular Depth Estimation with Flow Matching Ming Gui et.al. 2403.13788v1 null
2024-03-20 Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation Fu-Yun Wang et.al. 2403.13745v1 link
2024-03-20 Probabilistic Forecasting with Stochastic Interpolants and Föllmer Processes Yifan Chen et.al. 2403.13724v1 null
2024-03-19 FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis Linjiang Huang et.al. 2403.12963v1 link
2024-03-19 FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation Shuai Yang et.al. 2403.12962v1 link
2024-03-19 TexTile: A Differentiable Metric for Texture Tileability Carlos Rodriguez-Pardo et.al. 2403.12961v1 null
2024-03-19 GVGEN: Text-to-3D Generation with Volumetric Representation Xianglong He et.al. 2403.12957v1 null
2024-03-19 Zero-Reference Low-Light Enhancement via Physical Quadruple Priors Wenjing Wang et.al. 2403.12933v1 null
2024-03-19 You Only Sample Once: Taming One-Step Text-To-Image Synthesis by Self-Cooperative Diffusion GANs Yihong Luo et.al. 2403.12931v1 link
2024-03-19 Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model Jiajie Yang et.al. 2403.12915v1 link
2024-03-19 D-Cubed: Latent Diffusion Trajectory Optimisation for Dexterous Deformable Manipulation Jun Yamada et.al. 2403.12861v1 null
2024-03-18 Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models Emilian Postolache et.al. 2403.11706v1 link
2024-03-19 Urban Scene Diffusion through Semantic Occupancy Map Junge Zhang et.al. 2403.11697v2 null
2024-03-18 Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection Julia Wolleb et.al. 2403.11667v1 null
2024-03-18 Diffusion-Based Environment-Aware Trajectory Prediction Theodor Westny et.al. 2403.11643v1 null
2024-03-18 Arc2Face: A Foundation Model of Human Faces Foivos Paraperas Papantoniou et.al. 2403.11641v1 link
2024-03-18 LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models Yang Yang et.al. 2403.11627v1 link
2024-03-18 CRS-Diff: Controllable Generative Remote Sensing Foundation Model Datao Tang et.al. 2403.11614v1 link
2024-03-15 Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives Ronghui Li et.al. 2403.10518v1 link
2024-03-15 MusicHiFi: Fast High-Fidelity Stereo Vocoding Ge Zhu et.al. 2403.10493v1 null
2024-03-15 SculptDiff: Learning Robotic Clay Sculpting from Humans with Goal Conditioned Diffusion Policy Alison Bartsch et.al. 2403.10401v1 null
2024-03-15 Isotropic3D: Image-to-3D Generation Based on a Single CLIP Embedding Pengkun Liu et.al. 2403.10395v1 link
2024-03-15 Denoising Task Difficulty-based Curriculum for Training Diffusion Models Jin-Young Kim et.al. 2403.10348v1 null
2024-03-15 Towards Generalizable Deepfake Video Detection with Thumbnail Layout and Graph Reasoning Yuting Xu et.al. 2403.10261v1 link
2024-03-14 SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior Huan-ang Gao et.al. 2403.09638v1 null
2024-03-14 3D-VLA: A 3D Vision-Language-Action Generative World Model Haoyu Zhen et.al. 2403.09631v1 null
2024-03-14 Generalized Predictive Model for Autonomous Driving Jiazhi Yang et.al. 2403.09630v1 link
2024-03-14 Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation Fangfu Liu et.al. 2403.09625v1 null
2024-03-14 Score-Guided Diffusion for 3D Human Recovery Anastasis Stathopoulos et.al. 2403.09623v1 link
2024-03-14 Explore In-Context Segmentation via Latent Diffusion Models Chaoyang Wang et.al. 2403.09616v1 null
2024-03-14 The effect of spatially-varying collision frequency on the development of the Rayleigh-Taylor instability John Rodman et.al. 2403.09591v1 null
2024-03-14 MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models Zunnan Xu et.al. 2403.09471v1 null
2024-03-14 Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing Wonjun Kang et.al. 2403.09468v1 link
2024-03-13 VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Enric Corona et.al. 2403.08764v1 null
2024-03-14 GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing Jing Wu et.al. 2403.08733v2 null
2024-03-13 Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data Asad Aali et.al. 2403.08728v1 link
2024-03-13 Historical Astronomical Diagrams Decomposition in Geometric Primitives Syrine Kalleli et.al. 2403.08721v1 null
2024-03-12 Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation Shihao Zhao et.al. 2403.07860v1 link
2024-03-12 Quantifying and Mitigating Privacy Risks for Tabular Generative Models Chaoyi Zhu et.al. 2403.07842v1 null
2024-03-12 MPCPA: Multi-Center Privacy Computing with Predictions Aggregation based on Denoising Diffusion Probabilistic Model Guibo Luo et.al. 2403.07838v1 null
2024-03-13 SemCity: Semantic Scene Generation with Triplane Diffusion Jumin Lee et.al. 2403.07773v2 link
2024-03-12 Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model Yuxuan Zhang et.al. 2403.07764v1 null
2024-03-13 Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion Dongyang Li et.al. 2403.07721v2 link
2024-03-12 SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces Yuta Oshima et.al. 2403.07711v1 link
2024-03-12 Genuine Knowledge from Practice: Diffusion Test-Time Adaptation for Video Adverse Weather Removal Yijun Yang et.al. 2403.07684v1 null
2024-03-11 BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion Xuan Ju et.al. 2403.06976v1 link
2024-03-11 Bayesian Diffusion Models for 3D Shape Reconstruction Haiyang Xu et.al. 2403.06973v1 null
2024-03-11 SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data Jialu Li et.al. 2403.06952v1 null
2024-03-12 DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations Tianhao Qi et.al. 2403.06951v2 link
2024-03-08 VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models Yabo Zhang et.al. 2403.05438v1 link
2024-03-08 DiffSF: Diffusion Models for Scene Flow Estimation Yushan Zhang et.al. 2403.05327v1 link
2024-03-07 ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes Hashmat Shadab Malik et.al. 2403.04701v1 link
2024-03-07 Delving into the Trajectory Long-tail Distribution for Muti-object Tracking Sijia Chen et.al. 2403.04700v1 link
2024-03-07 PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation Junsong Chen et.al. 2403.04692v1 null
2024-03-07 Pix2Gif: Motion-Guided Diffusion for GIF Generation Hitesh Kandala et.al. 2403.04634v1 null
2024-03-06 3D Diffusion Policy Yanjie Ze et.al. 2403.03954v1 link
2024-03-06 GUIDE: Guidance-based Incremental Learning with Diffusion Models Bartosz Cywiński et.al. 2403.03938v1 link
2024-03-06 Hierarchical Diffusion Policy for Kinematics-Aware Multi-Task Robotic Manipulation Xiao Ma et.al. 2403.03890v1 null
2024-03-06 Latent Dataset Distillation with Diffusion Models Brian B. Moser et.al. 2403.03881v1 null
2024-03-06 Accelerating Convergence of Score-Based Diffusion Models, Provably Gen Li et.al. 2403.03852v1 null
2024-03-06 Diffusion on language model embeddings for protein sequence generation Viacheslav Meshchaninov et.al. 2403.03726v1 null
2024-03-05 Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Patrick Esser et.al. 2403.03206v1 null
2024-03-05 MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets Hossein Aboutalebi et.al. 2403.03194v1 null
2024-03-05 Behavior Generation with Latent Actions Seungjae Lee et.al. 2403.03181v1 link
2024-03-05 Enhanced beam-beam modeling to include longitudinal variation during weak-strong simulation Derong Xu et.al. 2403.03137v1 null
2024-03-02 Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models Neta Shaul et.al. 2403.01329v1 null
2024-03-02 Anomalous mass dependency in Hydra endoderm cell cluster diffusion Aline Lütz et.al. 2403.01294v1 null
2024-03-02 DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction Junwen Xiong et.al. 2403.01226v1 null
2024-03-02 TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through Diffusion Salaheldin Mohamed et.al. 2403.01212v1 null
2024-02-29 DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models Muyang Li et.al. 2402.19481v1 link
2024-02-29 Structure Preserving Diffusion Models Haoye Lu et.al. 2402.19369v1 null
2024-02-29 A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation Hanxi Li et.al. 2402.19330v1 link
2024-02-29 DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly Gianluca Scarpellini et.al. 2402.19302v1 link
2024-02-29 Generative models struggle with kirigami metamaterials Gerrit Felsch et.al. 2402.19196v1 null
2024-02-28 Diffusion Language Models Are Versatile Protein Learners Xinyou Wang et.al. 2402.18567v1 null
2024-02-28 Photon statistics of resonantly driven spectrally diffusive quantum emitters Aymeric Delteil et.al. 2402.18542v1 null
2024-02-28 Dynamical Regimes of Diffusion Models Giulio Biroli et.al. 2402.18491v1 null
2024-02-28 Objective and Interpretable Breast Cosmesis Evaluation with Attention Guided Denoising Diffusion Anomaly Detection Model Sangjoon Park et.al. 2402.18362v1 null
2024-02-27 Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning Xiaoyu Zhang et.al. 2402.17768v1 null
2024-02-27 Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners Yazhou Xing et.al. 2402.17723v1 null
2024-02-27 Structure-Guided Adversarial Training of Diffusion Models Ling Yang et.al. 2402.17563v1 null
2024-02-27 Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation with Its Class Label Xinliang Zhang et.al. 2402.17555v1 link
2024-02-27 Diffusion Model-Based Image Editing: A Survey Yi Huang et.al. 2402.17525v1 link
2024-02-27 Label-Noise Robust Diffusion Models Byeonghu Na et.al. 2402.17517v1 link
2024-02-27 The Unwanted Dissemination of Science: The Usage of Academic Articles as Ammunition in Contested Discursive Arenas on Twitter Richard Zhang et.al. 2402.17495v1 null
2024-02-27 EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions Linrui Tian et.al. 2402.17485v1 null
2024-02-26 Stochastic Conditional Diffusion Models for Semantic Image Synthesis Juyeon Ko et.al. 2402.16506v1 null
2024-02-26 Outline-Guided Object Inpainting with Diffusion Models Markus Pobitzer et.al. 2402.16421v1 null
2024-02-26 Placing Objects in Context via Inpainting for Out-of-distribution Segmentation Pau de Jorge et.al. 2402.16392v1 link
2024-02-26 Generative AI in Vision: A Survey on Models, Metrics and Applications Gaurav Raut et.al. 2402.16369v1 null
2024-02-26 Feedback Efficient Online Fine-Tuning of Diffusion Models Masatoshi Uehara et.al. 2402.16359v1 null
2024-02-26 Referee Can Play: An Alternative Approach to Conditional Generation via Model Inversion Xuantong Liu et.al. 2402.16305v1 null
2024-02-26 Graph Diffusion Policy Optimization Yijing Liu et.al. 2402.16302v1 link
2024-02-23 Seamless Human Motion Composition with Blended Positional Encodings German Barquero et.al. 2402.15509v1 link
2024-02-23 Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition Chun-Hsiao Yeh et.al. 2402.15504v1 link
2024-02-23 Solute transport due to periodic loading in a soft porous material Matilde Fiori et.al. 2402.15451v1 null
2024-02-23 ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation Yi Zhang et.al. 2402.15429v1 link
2024-02-23 Understanding Oversmoothing in Diffusion-Based GNNs From the Perspective of Operator Semigroup Theory Weichen Zhao et.al. 2402.15326v1 null
2024-02-23 Let’s Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models Shunyu Liu et.al. 2402.15289v1 link
2024-02-22 Cameras as Rays: Pose Estimation via Ray Diffusion Jason Y. Zhang et.al. 2402.14817v1 null
2024-02-22 GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion Xueyi Liu et.al. 2402.14810v1 link
2024-02-22 Consolidating Attention Features for Multi-view Image Editing Or Patashnik et.al. 2402.14792v1 null
2024-02-22 Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models Yixuan Ren et.al. 2402.14780v1 null
2024-02-22 Two-stage Cytopathological Image Synthesis for Augmenting Cervical Abnormality Screening Zhenrong Shen et.al. 2402.14707v1 null
2024-02-22 Debiasing Text-to-Image Diffusion Models Ruifei He et.al. 2402.14577v1 null
2024-02-22 DynGMA: a robust approach for learning stochastic differential equations from data Aiqing Zhu et.al. 2402.14475v1 link
2024-02-21 D-Flow: Differentiating through Flows for Controlled Generation Heli Ben-Hamu et.al. 2402.14017v1 null
2024-02-21 SDXL-Lightning: Progressive Adversarial Diffusion Distillation Shanchuan Lin et.al. 2402.13929v1 null
2024-02-21 Non-asymptotic Convergence of Discrete-time Diffusion Models: New Approach and Improved Rate Yuchen Liang et.al. 2402.13901v1 null
2024-02-21 NeuralDiffuser: Controllable fMRI Reconstruction with Primary Visual Feature Guided Diffusion Haoyu Li et.al. 2402.13809v1 null
2024-02-21 The Geography of Information Diffusion in Online Discourse on Europe and Migration Elisa Leonardelli et.al. 2402.13800v1 null
2024-02-21 Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions Jiayu Chen et.al. 2402.13777v1 link
2024-02-21 Music Style Transfer with Time-Varying Inversion of Diffusion Models Sifei Li et.al. 2402.13763v1 null
2024-02-20 Neural Network Diffusion Kai Wang et.al. 2402.13144v1 link
2024-02-20 Excited state-specific CASSCF theory for the torsion of ethylene Sandra Saade et.al. 2402.13046v1 null
2024-02-20 Text-Guided Molecule Generation with Diffusion Language Model Haisong Gong et.al. 2402.13040v1 link
2024-02-20 Visual Style Prompting with Swapping Self-Attention Jaeseok Jeong et.al. 2402.12974v1 link
2024-02-20 CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection Sohail Ahmed Khan et.al. 2402.12927v1 null
2024-02-20 RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models Xinchen Zhang et.al. 2402.12908v1 link
2024-02-19 FiT: Flexible Vision Transformer for Diffusion Model Zeyu Lu et.al. 2402.12376v1 link
2024-02-19 Analysis of Persian News Agencies on Instagram, A Words Co-occurrence Graph-based Approach Mohammad Heydari et.al. 2402.12272v1 null
2024-02-19 Synthetic location trajectory generation using categorical diffusion models Simon Dirmeier et.al. 2402.12242v1 link
2024-02-19 Diffusion Tempering Improves Parameter Estimation with Probabilistic Integrators for Ordinary Differential Equations Jonas Beck et.al. 2402.12231v1 link
2024-02-19 Adversarial Feature Alignment: Balancing Robustness and Accuracy in Deep Learning via Adversarial Training Leo Hyun Park et.al. 2402.12187v1 null
2024-02-19 Human Video Translation via Query Warping Haiming Zhu et.al. 2402.12099v1 null
2024-02-16 Fusion of Diffusion Weighted MRI and Clinical Data for Predicting Functional Outcome after Acute Ischemic Stroke with Deep Contrastive Learning Chia-Ling Tsai et.al. 2402.10894v1 null
2024-02-16 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations Tsung-Wei Ke et.al. 2402.10885v1 null
2024-02-16 Control Color: Multimodal Diffusion-based Interactive Image Colorization Zhexin Liang et.al. 2402.10855v1 null
2024-02-16 Training Class-Imbalanced Diffusion Model Via Overlap Optimization Divin Yan et.al. 2402.10821v1 link
2024-02-16 VATr++: Choose Your Words Wisely for Handwritten Text Generation Bram Vanherle et.al. 2402.10798v1 null
2024-02-16 Rethinking Human-like Translation Strategy: Integrating Drift-Diffusion Model with Large Language Models for Machine Translation Hongbin Na et.al. 2402.10699v1 null
2024-02-16 Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow Paradigm Yuanzhen Xie et.al. 2402.10671v1 link
2024-02-15 Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation Huizhuo Yuan et.al. 2402.10210v1 null
2024-02-15 Recovering the Pre-Fine-Tuning Weights of Generative Models Eliahu Horwitz et.al. 2402.10208v1 link
2024-02-15 Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment Rui Yang et.al. 2402.10207v1 link
2024-02-15 Energy Flux Decomposition in Magnetohydrodynamic Turbulence D. Capocci et.al. 2402.10125v1 null
2024-02-15 Collision efficiency of droplets across diffusive, electrostatic and inertial regimes Florian Poydenot et.al. 2402.10117v1 null
2024-02-15 Quantized Embedding Vectors for Controllable Diffusion Language Models Cheng Kang et.al. 2402.10107v1 null
2024-02-15 Classification Diffusion Models Shahar Yadin et.al. 2402.10095v1 null
2024-02-14 Magic-Me: Identity-Specific Video Customized Diffusion Ze Ma et.al. 2402.09368v1 link
2024-02-14 Leveraging Pre-Trained Autoencoders for Interpretable Prototype Learning of Music Audio Pablo Alonso-Jiménez et.al. 2402.09318v1 null
2024-02-14 Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection Pengfei Zhou et.al. 2402.09242v1 link
2024-02-13 IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation Luke Melas-Kyriazi et.al. 2402.08682v1 null
2024-02-13 Target Score Matching Valentin De Bortoli et.al. 2402.08667v1 null
2024-02-13 Learning Continuous 3D Words for Text-to-Image Generation Ta-Ying Cheng et.al. 2402.08654v1 null
2024-02-13 Latent Inversion with Timestep-aware Sampling for Training-free Non-rigid Editing Yunji Jung et.al. 2402.08601v1 null
2024-02-13 Denoising Diffusion Restoration Tackles Forward and Inverse Problems for the Laplace Operator Amartya Mukherjee et.al. 2402.08563v1 null
2024-02-13 Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases Ziyi Zhang et.al. 2402.08552v1 null
2024-02-13 Hyperballistic transport in dense ionized matter under external AC electric fields Daniele Gamba et.al. 2402.08519v1 null
2024-02-12 Label-Efficient Model Selection for Text Generation Shir Ashury-Tahan et.al. 2402.07891v1 null
2024-02-12 High-order harmonic generation in 2D Transition Metal Disulphides Jose Manuel Iglesias et.al. 2402.07850v1 null
2024-02-12 Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models Jiacheng Ye et.al. 2402.07754v1 link
2024-02-12 Topological Edge States in Reconfigurable Multi-stable Mechanical Metamaterials Zhen Wang et.al. 2402.07707v1 null
2024-02-12 Higher-order Connection Laplacians for Directed Simplicial Complexes Xue Gong et.al. 2402.07631v1 null
2024-02-09 Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following Brian Yang et.al. 2402.06559v1 null
2024-02-09 Sequential Flow Matching for Generative Modeling Jongmin Yoon et.al. 2402.06461v1 null
2024-02-09 ControlUDA: Controllable Diffusion-assisted Unsupervised Domain Adaptation for Cross-Weather Semantic Segmentation Fengyi Shen et.al. 2402.06446v1 null
2024-02-09 Improving 2D-3D Dense Correspondences with Diffusion Models for 6D Object Pose Estimation Peter Hönig et.al. 2402.06436v1 null
2024-02-09 Enhanced bubble growth near an advancing solidification front Jochem G. Meijer et.al. 2402.06409v1 null
2024-02-08 InstaGen: Enhancing Object Detection by Training on Synthetic Dataset Chengjian Feng et.al. 2402.05937v1 null
2024-02-08 Time Series Diffusion in the Frequency Domain Jonathan Crabbé et.al. 2402.05933v1 link
2024-02-08 AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning Wamiq Reyaz Para et.al. 2402.05803v1 null
2024-02-08 Determining the significance and relative importance of parameters of a simulated quenching algorithm using statistical tools Pedro A. Castillo et.al. 2402.05791v1 null
2024-02-08 DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer Zhiyuan Ma et.al. 2402.05712v1 link
2024-02-08 Scalable Diffusion Models with State Space Backbone Zhengcong Fei et.al. 2402.05608v1 link
2024-02-07 On diffusion models for amortized inference: Benchmarking and improving stochastic control and sampling Marcin Sendera et.al. 2402.05098v1 link
2024-02-07 NITO: Neural Implicit Fields for Resolution-free Topology Optimization Amin Heyrani Nobari et.al. 2402.05073v1 null
2024-02-07 LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation Jiaxiang Tang et.al. 2402.05054v1 null
2024-02-06 SHIELD : An Evaluation Benchmark for Face Spoofing and Forgery Detection with Multimodal Large Language Models Yichen Shi et.al. 2402.04178v1 link
2024-02-06 Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning Ruoqi Zhang et.al. 2402.04080v1 link
2024-02-06 Generative Modeling of Graphs via Joint Diffusion of Node and Edge Attributes Nimrod Berman et.al. 2402.04046v1 null
2024-02-06 Polyp-DDPM: Diffusion-Based Semantic Polyp Synthesis for Enhanced Segmentation Zolnamar Dorjsembe et.al. 2402.04031v1 link
2024-02-06 Space Group Constrained Crystal Generation Rui Jiao et.al. 2402.03992v1 null
2024-02-06 Controllable Diverse Sampling for Diffusion Based Motion Behavior Forecasting Yiming Xu et.al. 2402.03981v1 null
2024-02-06 Weibel- and non-resonant Whistler wave growth in an expanding plasma in a 1D simulation geometry M E Dieckmann et.al. 2402.03925v1 null
2024-02-05 Do Diffusion Models Learn Semantically Meaningful and Efficient Representations? Qiyao Liang et.al. 2402.03305v1 null
2024-02-05 Zero-shot Object-Level OOD Detection with Context-Aware Inpainting Quang-Huy Nguyen et.al. 2402.03292v1 null
2024-02-05 InstanceDiffusion: Instance-level Control for Image Generation Xudong Wang et.al. 2402.03290v1 link
2024-02-05 Organic or Diffused: Can We Distinguish Human Art from AI-generated Images? Anna Yoo Jeong Ha et.al. 2402.03214v1 null
2024-02-05 Light and Optimal Schrödinger Bridge Matching Nikita Gushchin et.al. 2402.03207v1 link
2024-02-05 Guidance with Spherical Gaussian Constraint for Conditional Diffusion Lingxiao Yang et.al. 2402.03201v1 null
2024-02-05 Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion Shiyuan Yang et.al. 2402.03162v1 null
2024-02-05 DARTS: Diffusion Approximated Residual Time Sampling for Low Variance Time-of-flight Rendering in Homogeneous Scattering Medium Qianyue He et.al. 2402.03106v1 null
2024-02-02 NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties Jingyuan Sun et.al. 2402.01590v1 null
2024-02-02 Boximator: Generating Rich and Controllable Motions for Video Synthesis Jiawei Wang et.al. 2402.01566v1 null
2024-02-02 Low-Resource Cross-Domain Singing Voice Synthesis via Reduced Self-Supervised Speech Representations Panos Kakoulidis et.al. 2402.01520v1 null
2024-02-02 Cross-view Masked Diffusion Transformers for Person Image Synthesis Trung X. Pham et.al. 2402.01516v1 null
2024-02-01 AToM: Amortized Text-to-Mesh using 2D Diffusion Guocheng Qian et.al. 2402.00867v1 null
2024-02-01 ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields Jiahua Dong et.al. 2402.00864v1 link
2024-02-01 Distilling Conditional Diffusion Models for Offline Reinforcement Learning through Trajectory Stitching Shangzhe Li et.al. 2402.00807v1 null
2024-02-01 AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning Fu-Yun Wang et.al. 2402.00769v1 link
2024-02-01 CapHuman: Capture Your Moments in Parallel Universes Chao Liang et.al. 2402.00627v1 link
2024-02-01 Diffusion-based Light Field Synthesis Ruisheng Gao et.al. 2402.00575v1 null
2024-01-31 Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators Daniel Geng et.al. 2401.18085v1 null
2024-01-31 An electrodynamic wave model for the action potential Vitaly L. Galinsky et.al. 2401.18051v1 null
2024-01-31 Investigation of Microstructure and Corrosion Resistance of Ti-Al-V Titanium Alloys Obtained by Spark Plasma Sintering Aleksey Nokhrin et.al. 2401.17941v1 null
2024-01-31 AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error Jonas Ricker et.al. 2401.17879v1 link
2024-01-30 You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation Mehdi Noroozi et.al. 2401.17258v1 null
2024-01-30 ContactGen: Contact-Guided Interactive 3D Human Generation for Partners Dongjun Gu et.al. 2401.17212v1 null
2024-01-30 Transfer Learning for Text Diffusion Models Kehang Han et.al. 2401.17181v1 null
2024-01-29 Diffutoon: High-Resolution Editable Toon Shading via Diffusion Models Zhongjie Duan et.al. 2401.16224v1 null
2024-01-29 Rapidly rotating radiatively driven convection: experimental and numerical validation of the `geostrophic turbulence’ scaling predictions Gabriel Hadjerci et.al. 2401.16200v1 null
2024-01-29 Spatial-Aware Latent Initialization for Controllable Image Generation Wenqiang Sun et.al. 2401.16157v1 null
2024-01-29 Acoustic Screens based on Sonic Crystals with high Diffusion properties M. P. Peiró-Torres et.al. 2401.16074v1 null
2024-01-26 Annotated Hands for Generative Models Yue Yang et.al. 2401.15075v1 link
2024-01-26 Emulating Complex Synapses Using Interlinked Proton Conductors Lifu Zhang et.al. 2401.15045v1 null
2024-01-26 DAM: Diffusion Activation Maximization for 3D Global Explanations Hanxiao Tan et.al. 2401.14938v1 link
2024-01-26 Social norms and cooperation in higher-order networks Yin-Jie Ma et.al. 2401.14905v1 null
2024-01-25 Deconstructing Denoising Diffusion Models for Self-Supervised Learning Xinlei Chen et.al. 2401.14404v1 null
2024-01-25 pix2gestalt: Amodal Segmentation by Synthesizing Wholes Ege Ozguroglu et.al. 2401.14398v1 link
2024-01-25 Manifold GCN: Diffusion-based Convolutional Neural Network for Manifold-valued Graphs Martin Hanik et.al. 2401.14381v1 null
2024-01-25 UrbanGenAI: Reconstructing Urban Landscapes using Panoptic Segmentation and Diffusion Models Timo Kapsalis et.al. 2401.14379v1 null
2024-01-25 Modeling Global Surface Dust Deposition Using Physics-Informed Neural Networks Constanza A. Molina Catricheo et.al. 2401.14372v1 link
2024-01-25 Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation Minglin Chen et.al. 2401.14257v1 null
2024-01-24 Bi-Hamiltonian in Semiflexible Polymer as Strongly Coupled System Heeyuen Koh et.al. 2401.13655v1 null
2024-01-24 On the self-similarity of unbounded viscous Marangoni flows Fernando Temprano-Coleto et.al. 2401.13647v1 null
2024-01-24 Winding Clearness for Differentiable Point Cloud Optimization Dong Xiao et.al. 2401.13639v1 null
2024-01-24 Guided Diffusion for Fast Inverse Design of Density-based Mechanical Metamaterials Yanyan Yang et.al. 2401.13570v1 null
2024-01-24 Expressive Acoustic Guitar Sound Synthesis with an Instrument-Specific Input Representation and Diffusion Outpainting Hounsu Kim et.al. 2401.13498v1 null
2024-01-23 GALA: Generating Animatable Layered Assets from a Single Scan Taeksoo Kim et.al. 2401.12979v1 null
2024-01-23 Zero-Shot Learning for the Primitives of 3D Affordance in General Objects Hyeonwoo Kim et.al. 2401.12978v1 null
2024-01-23 Lumiere: A Space-Time Diffusion Model for Video Generation Omer Bar-Tal et.al. 2401.12945v1 null
2024-01-23 Long-range three-dimensional tracking of nanoparticles using interferometric scattering (iSCAT) microscopy Kiarash Kasaian et.al. 2401.12939v1 null
2024-01-22 DITTO: Diffusion Inference-Time T-Optimization for Music Generation Zachary Novack et.al. 2401.12179v1 null
2024-01-22 Single-View 3D Human Digitalization with Large Reconstruction Models Zhenzhen Weng et.al. 2401.12175v1 null
2024-01-22 Improved accuracy of continuum surface flux models for metal additive manufacturing melt pool simulations Nils Much et.al. 2401.12114v1 null
2024-01-22 Experimental investigation and scale analysis on melting of salty ice in a 3D-printed cavity filled with porous media Xiaotian Liand Yuming Wang et.al. 2401.12009v1 null
2024-01-22 Claim Detection for Automated Fact-checking: A Survey on Monolingual, Multilingual and Cross-Lingual Research Rrubaa Panchendrarajan et.al. 2401.11969v1 null
2024-01-22 Feature Denoising Diffusion Model for Blind Image Quality Assessment Xudong Li et.al. 2401.11949v1 null
2024-01-19 Synthesizing Moving People with 3D Control Boyi Li et.al. 2401.10889v1 null
2024-01-19 ActAnywhere: Subject-Aware Video Background Generation Boxiao Pan et.al. 2401.10822v1 null
2024-01-19 Sat2Scene: 3D Urban Scene Generation from Satellite Images with Diffusion Zuoyue Li et.al. 2401.10786v1 null
2024-01-19 Signatures of s-wave scattering in bound electronic states Robin E. Moorby et.al. 2401.10714v1 null
2024-01-19 Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model Yinan Zheng et.al. 2401.10700v1 link
2024-01-19 Refractive index measurement of pharmaceutical powders in the short-wave infrared range using index matching assisted with phase imaging Cory Juntunen et.al. 2401.10667v1 null
2024-01-19 Analysis of the Patent of a Protective Cover for Vertical-Axis Wind Turbines (VAWTs): Simulations of Wind Flow JA Moleón Baca et.al. 2401.10656v1 null
2024-01-18 A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting Wouter Van Gansbeke et.al. 2401.10227v1 link
2024-01-18 Towards Language-Driven Video Inpainting via Multimodal Large Language Models Jianzong Wu et.al. 2401.10226v1 null
2024-01-18 Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation Changgu Chen et.al. 2401.10150v1 null
2024-01-18 DiffusionGPT: LLM-Driven Text-to-Image Generation System Jie Qin et.al. 2401.10061v1 null
2024-01-18 CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects Zhao Wang et.al. 2401.09962v1 null
2024-01-17 TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion Yu-Ying Yeh et.al. 2401.09416v1 null
2024-01-17 Vlogger: Make Your Dream A Vlog Shaobin Zhuang et.al. 2401.09414v1 link
2024-01-17 Siamese Meets Diffusion Network: SMDNet for Enhanced Change Detection in High-Resolution RS Imagery Jia Jia et.al. 2401.09325v1 null
2024-01-17 Tailoring chaotic motion of microcavity photons in ray and wave dynamics by tuning the curvature of space Wei Lin et.al. 2401.09303v1 null
2024-01-17 T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis Yoonjin Chung et.al. 2401.09294v1 null
2024-01-16 Robotic Imitation of Human Actions Josua Spisak et.al. 2401.08381v1 null
2024-01-16 Optimization of the plasmonic properties of titanium nitride films sputtered at room temperature through microstructure and thickness control Mateusz Nieborek et.al. 2401.08353v1 null
2024-01-16 Modeling Spoof Noise by De-spoofing Diffusion and its Application in Face Anti-spoofing Bin Zhang et.al. 2401.08275v1 null
2024-01-16 Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video Localization Chongzhi Zhang et.al. 2401.08232v1 null
2024-01-12 Decoupling Pixel Flipping and Occlusion Strategy for Consistent XAI Benchmarks Stefan Blücher et.al. 2401.06654v1 link
2024-01-12 Adversarial Examples are Misaligned in Diffusion Model Manifolds Peter Lorenz et.al. 2401.06637v1 null
2024-01-12 Motion2VecSets: 4D Latent Vector Set Diffusion for Non-rigid Shape Reconstruction and Tracking Wei Cao et.al. 2401.06614v1 null
2024-01-12 360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model Qian Wang et.al. 2401.06578v1 null
2024-01-11 E $^{2}$ GAN: Efficient Training of Efficient GANs for Image-to-Image Translation Yifan Gong et.al. 2401.06127v1 null
2024-01-11 Numerical thermalization in 2D PIC simulations: Practical estimates for low temperature plasma simulations Sierra Jubin et.al. 2401.06057v1 null
2024-01-11 DiffDA: a diffusion model for weather-scale data assimilation Langwen Huang et.al. 2401.05932v1 null
2024-01-11 Efficient Image Deblurring Networks based on Diffusion Models Kang Chen et.al. 2401.05907v1 link
2024-01-10 InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes Mohamad Shahbazi et.al. 2401.05335v1 null
2024-01-10 Score Distillation Sampling with Learned Manifold Corrective Thiemo Alldieck et.al. 2401.05293v1 null
2024-01-10 PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models Junsong Chen et.al. 2401.05252v1 link
2024-01-10 Derm-T2IM: Harnessing Synthetic Skin Lesion Data via Stable Diffusion Models for Enhanced Skin Disease Classification using ViT and CNN Muhammad Ali Farooq et.al. 2401.05159v1 null
2024-01-10 CrossDiff: Exploring Self-Supervised Representation of Pansharpening via Cross-Predictive Diffusion Model Yinghui Xing et.al. 2401.05153v1 null
2024-01-09 Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation Xiyi Chen et.al. 2401.04728v1 null
2024-01-09 EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models Jingyuan Yang et.al. 2401.04608v1 null
2024-01-09 Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models Xuewen Liu et.al. 2401.04585v1 link
2024-01-09 MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation Weimin Wang et.al. 2401.04468v1 null
2024-01-09 D3AD: Dynamic Denoising Diffusion Probabilistic Model for Anomaly Detection Justin Tebbe et.al. 2401.04463v1 link
2024-01-08 D3PRefiner: A Diffusion-based Denoise Method for 3D Human Pose Refinement Danqi Yan et.al. 2401.03914v1 null
2024-01-05 Uncovering the human motion pattern: Pattern Memory-based Diffusion Model for Trajectory Prediction Yuxin Yang et.al. 2401.02916v1 null
2024-01-05 Plug-in Diffusion Model for Sequential Recommendation Haokai Ma et.al. 2401.02913v1 link
2024-01-05 Generating Non-Stationary Textures using Self-Rectification Yang Zhou et.al. 2401.02847v1 link
2024-01-05 Diffbody: Diffusion-based Pose and Shape Editing of Human Images Yuta Okuyama et.al. 2401.02804v1 link
2024-01-05 Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors Top Piriyakulkij et.al. 2401.02739v1 null
2024-01-05 Geometric-Facilitated Denoising Diffusion Model for 3D Molecule Generation Can Xu et.al. 2401.02683v1 link
2024-01-04 Bring Metric Functions into Diffusion Models Jie An et.al. 2401.02414v1 null
2024-01-04 Image denoising and model-independent parameterization for improving IVIM MRI Caleb Sample et.al. 2401.02394v1 null
2024-01-04 Integration of physics-informed operator learning and finite element method for parametric learning of partial differential equations Shahed Rezaei et.al. 2401.02363v1 null
2024-01-04 Robust Physics Informed Neural Networks Marcin Łoś et.al. 2401.02300v1 null
2024-01-03 From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations Evonne Ng et.al. 2401.01885v1 link
2024-01-03 DGDNN: Decoupled Graph Diffusion Neural Network for Stock Movement Prediction Zinuo You et.al. 2401.01846v1 link
2024-01-03 Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions David Junhao Zhang et.al. 2401.01827v1 link
2024-01-03 aMUSEd: An Open MUSE Reproduction Suraj Patil et.al. 2401.01808v1 link
2024-01-03 Short-time expansion of one-dimensional Fokker-Planck equations with heterogeneous diffusion Tom Dupont et.al. 2401.01765v1 null
2024-01-02 Influence of scanning plane on Human Spinal Cord functional Magnetic Resonance echo planar imaging Marta Moraschi et.al. 2401.01281v1 null
2024-01-02 Fairness Certification for Natural Language Processing and Large Language Models Vincent Freiberger et.al. 2401.01262v1 null
2024-01-02 VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM Fuchen Long et.al. 2401.01256v1 null
2024-01-02 Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation Renshuai Liu et.al. 2401.01207v1 null
2024-01-02 Learning Surface Scattering Parameters From SAR Images Using Differentiable Ray Tracing Jiangtao Wei et.al. 2401.01175v1 null
2024-01-02 Joint Generative Modeling of Scene Graphs and Images via Diffusion Models Bicheng Xu et.al. 2401.01130v1 null
2023-12-29 FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis Feng Liang et.al. 2312.17681v1 null
2023-12-29 Data Augmentation for Supervised Graph Outlier Detection with Latent Diffusion Models Kay Liu et.al. 2312.17679v1 link
2023-12-29 Leveraging Open-Vocabulary Diffusion to Camouflaged Instance Segmentation Tuan-Anh Vu et.al. 2312.17505v1 null
2023-12-28 iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views Chin-Hsuan Wu et.al. 2312.17250v1 link
2023-12-28 Amodal Ground Truth and Completion in the Wild Guanqi Zhan et.al. 2312.17247v1 link
2023-12-28 Personalized Restoration via Dual-Pivot Tuning Pradyumna Chari et.al. 2312.17234v1 null
2023-12-28 4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency Yuyang Yin et.al. 2312.17225v1 null
2023-12-28 EFHQ: Multi-purpose ExtremePose-Face-HQ dataset Trung Tuan Dao et.al. 2312.17205v1 null
2023-12-28 Restoration by Generation with Constrained Priors Zheng Ding et.al. 2312.17161v1 null
2023-12-28 InsActor: Instruction-driven Physics-based Characters Jiawei Ren et.al. 2312.17135v1 null
2023-12-28 100-fold improvement in relaxed eddy accumulation flux estimates through error diffusion Anas Emad et.al. 2312.17027v1 link
2023-12-26 One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications Mengyao Lyu et.al. 2312.16145v1 null
2023-12-26 HarmonyView: Harmonizing Consistency and Diversity in One-Image-to-3D Sangmin Woo et.al. 2312.15980v1 link
2023-12-26 Semantic Guidance Tuning for Text-To-Image Diffusion Models Hyun Kang et.al. 2312.15964v1 null
2023-12-26 EnchantDance: Unveiling the Potential of Music-Driven Dance Movement Bo Han et.al. 2312.15946v1 link
2023-12-22 MACS: Mass Conditioned 3D Hand and Object Motion Synthesis Soshi Shimada et.al. 2312.14929v1 null
2023-12-22 BrainVis: Exploring the Bridge between Brain and Visual Signals via Image Reconstruction Honghao Fu et.al. 2312.14871v1 null
2023-12-22 Dreaming of Electrical Waves: Generative Modeling of Cardiac Excitation Waves using Diffusion Models Tanish Baranwal et.al. 2312.14830v1 null
2023-12-22 Neural network models for preferential concentration of particles in two-dimensional turbulence Thibault Maurel-Oujia et.al. 2312.14829v1 null
2023-12-22 Plan, Posture and Go: Towards Open-World Text-to-Motion Generation Jinpeng Liu et.al. 2312.14828v1 null
2023-12-22 Disorder-induced non-linear growth of viscously-unstable immiscible two-phase flow fingers in porous media Santanu Sinha et.al. 2312.14799v1 null
2023-12-22 Diffusion Maps for Signal Filtering in Graph Learning Todd Hildebrant et.al. 2312.14758v1 null
2023-12-21 Diffusion Reward: Learning Rewards via Conditional Video Diffusion Tao Huang et.al. 2312.14134v1 null
2023-12-21 Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation Philipp Schröppel et.al. 2312.14124v1 link
2023-12-21 HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models Hayk Manukyan et.al. 2312.14091v1 link
2023-12-21 Designing Artificial Intelligence Equipped Social Decentralized Autonomous Organizations for Tackling Sextortion Cases Version 0.7 Norta Alex et.al. 2312.14090v1 null
2023-12-21 The influence of controlled vibration effects on fluid flow Alexey Fedyushkin et.al. 2312.14079v1 null
2023-12-21 Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning Desai Xie et.al. 2312.13980v1 null
2023-12-21 Controllable 3D Face Generation with Conditional Style Code Diffusion Xiaolong Shen et.al. 2312.13941v1 link
2023-12-20 Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting Junwu Zhang et.al. 2312.13271v1 link
2023-12-20 Conditional Image Generation with Pretrained Generative Model Rajesh Shrestha et.al. 2312.13253v1 null
2023-12-20 Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model Saurabh Saxena et.al. 2312.13252v1 null
2023-12-20 Diffusion Models With Learned Adaptive Noise Subham Sekhar Sahoo et.al. 2312.13236v1 link
2023-12-20 MoSAR: Monocular Semi-Supervised Model for Avatar Reconstruction using Differentiable Shading Abdallah Dib et.al. 2312.13091v1 null
2023-12-20 DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis Yuming Gu et.al. 2312.13016v1 link
2023-12-20 A comparative study of analytical models of diffuse reflectance in homogeneous biological tissues: Gelatin based phantoms and Monte Carlo experiments Anisha Bahl et.al. 2312.12935v1 null
2023-12-19 On Inference Stability for Diffusion Models Viet Nguyen et.al. 2312.12431v1 link
2023-12-19 SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process Mengyu Wang et.al. 2312.12425v1 link
2023-12-19 Scene-Conditional 3D Object Stylization and Composition Jinghao Zhou et.al. 2312.12419v1 null
2023-12-19 LASA: Instance Reconstruction from Real Scans using A Large-scale Aligned Shape Annotation Dataset Haolin Liu et.al. 2312.12418v1 null
2023-12-19 Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models Shweta Mahajan et.al. 2312.12416v1 null
2023-12-19 Intrinsic Image Diffusion for Single-view Material Estimation Peter Kocsis et.al. 2312.12274v1 link
2023-12-19 Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model Lingjun Zhang et.al. 2312.12232v1 link
2023-12-18 A novel diffusion recommendation algorithm based on multi-scale cnn and residual lstm Yong Niu et.al. 2312.10885v1 null
2023-12-17 Your Student is Better Than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models Nikita Starodubcev et.al. 2312.10835v1 link
2023-12-17 From mixing to displacement of miscible phases in porous media: The role of heterogeneity and inlet pressure Yahel Eliyahu-Yakir et.al. 2312.10722v1 null
2023-12-17 CogCartoon: Towards Practical Story Visualization Zhongyang Zhu et.al. 2312.10718v1 null
2023-12-17 A Framework of Full-Process Generation Design for Park Green Spaces Based on Remote Sensing Segmentation-GAN-Diffusion Ran Chen et.al. 2312.10674v1 null
2023-12-15 Movement Primitive Diffusion: Learning Gentle Robotic Manipulation of Deformable Objects Paul Maria Scheikl et.al. 2312.10008v1 null
2023-12-15 Contributions to the geomagnetic secular variation from a reanalysis of core surface dynamics Olivier Barrois et.al. 2312.09942v1 null
2023-12-15 Assimilation of ground and satellite magnetic measurements: inference of core surface magnetic and velocity field changes Olivier Barrois et.al. 2312.09878v1 null
2023-12-15 Integrating New Technologies into Science: The case of AI Stefano Bianchini et.al. 2312.09843v1 null
2023-12-15 Socio-Economic Deprivation Analysis: Diffusion Maps June Moh Goo et.al. 2312.09830v1 null
2023-12-15 Comparison of Quasi-Geostrophic, Hybrid and 3D models of planetary core convection Olivier Barrois et.al. 2312.09826v1 null
2023-12-15 Neural networks for turbulent transport prediction in a simplified model of tokamak plasmas L. M. Pomârjanschi et.al. 2312.09807v1 null
2023-12-14 LIME: Localized Image Editing via Attention Regularization in Diffusion Models Enis Simsar et.al. 2312.09256v1 null
2023-12-14 FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Injection Hongsuk Choi et.al. 2312.09252v1 null
2023-12-14 Single Mesh Diffusion Models with Field Latents for Texture Generation Thomas W. Mitchel et.al. 2312.09250v1 null
2023-12-14 Text2Immersion: Generative Immersive Scene with 3D Gaussians Hao Ouyang et.al. 2312.09242v1 null
2023-12-14 A framework for conditional diffusion modelling with applications in motif scaffolding for protein design Kieran Didi et.al. 2312.09236v1 null
2023-12-14 Reliability in Semantic Segmentation: Can We Use Synthetic Data? Thibaut Loiseau et.al. 2312.09231v1 null
2023-12-14 Mosaic-SDF for 3D Generative Models Lior Yariv et.al. 2312.09222v1 null
2023-12-14 Measurement in the Age of LLMs: An Application to Ideological Scaling Sean O’Hagan et.al. 2312.09203v1 null
2023-12-14 Fast Sampling via De-randomization for Discrete Diffusion Models Zixiang Chen et.al. 2312.09193v1 null
2023-12-13 PnPNet: Pull-and-Push Networks for Volumetric Segmentation with Boundary Confusion Xin You et.al. 2312.08323v1 link
2023-12-13 Black-box Membership Inference Attacks against Fine-tuned Diffusion Models Yan Pang et.al. 2312.08207v1 link
2023-12-13 SPD-DDPM: Denoising Diffusion Probabilistic Models in the Symmetric Positive Definite Space Yunchen Li et.al. 2312.08200v1 link
2023-12-13 Concept-centric Personalization with Large-scale Diffusion Priors Pu Cao et.al. 2312.08195v1 link
2023-12-13 $ρ$ -Diffusion: A diffusion-based density estimation framework for computational physics Maxwell X. Cai et.al. 2312.08153v1 link
2023-12-13 Clockwork Diffusion: Efficient Generation With Model-Step Distillation Amirhossein Habibian et.al. 2312.08128v1 link
2023-12-12 FreeInit: Bridging Initialization Gap in Video Diffusion Models Tianxing Wu et.al. 2312.07537v1 link
2023-12-12 FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition Sicheng Mo et.al. 2312.07536v1 null
2023-12-12 PEEKABOO: Interactive Video Generation via Masked-Diffusion Yash Jain et.al. 2312.07509v1 null
2023-12-12 MinD-3D: Reconstruct High-quality 3D objects in Human Brain Jianxiong Gao et.al. 2312.07485v1 null
2023-12-12 DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing Kaiwen Zhang et.al. 2312.07409v1 null
2023-12-12 Boosting Latent Diffusion with Flow Matching Johannes S. Fischer et.al. 2312.07360v1 link
2023-12-12 Momentum Particle Maximum Likelihood Jen Ning Lim et.al. 2312.07335v1 null
2023-12-11 CAD: Photorealistic 3D Generation via Adversarial Distillation Ziyu Wan et.al. 2312.06663v1 null
2023-12-11 Photorealistic Video Generation with Diffusion Models Agrim Gupta et.al. 2312.06662v1 null
2023-12-11 UpFusion: Novel View Diffusion from Unposed Sparse View Observations Bharath Raj Nagoor Kani et.al. 2312.06661v1 null
2023-12-11 Sherpa3D: Boosting High-Fidelity Text-to-3D Generation via Coarse 3D Prior Fangfu Liu et.al. 2312.06655v1 link
2023-12-11 Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution Shangchen Zhou et.al. 2312.06640v1 null
2023-12-11 DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection Haoyang He et.al. 2312.06607v1 link
2023-12-11 ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models Denis Zavadski et.al. 2312.06573v1 link
2023-12-11 HOI-Diff: Text-Driven Synthesis of 3D Human-Object Interactions using Diffusion Models Xiaogang Peng et.al. 2312.06553v1 null
2023-12-11 In-situ Synchrotron X-Ray Photoelectron Spectroscopy Study of Medium-Temperature Baking of Niobium for SRF Application Alena Prudnikava et.al. 2312.06529v1 null
2023-12-08 KBFormer: A Diffusion Model for Structured Entity Completion Ouail Kitouni et.al. 2312.05253v1 null
2023-12-08 SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation Thuan Hoang Nguyen et.al. 2312.05239v1 null
2023-12-08 Stoichiometry preservation and generalization of Bilger mixture fraction for non-premixed combustion with differential molecular diffusion Haifeng Wang et.al. 2312.05204v1 null
2023-12-08 Membership Inference Attacks on Diffusion Models via Quantile Regression Shuai Tang et.al. 2312.05140v1 null
2023-12-08 DreaMoving: A Human Dance Video Generation Framework based on Diffusion Models Mengyang Feng et.al. 2312.05107v1 null
2023-12-08 Application of deep learning to the estimation of normalization coefficients in diffusion-based covariance models Folke K Skrunes et.al. 2312.05068v1 link
2023-12-08 SmartMask: Context Aware High-Fidelity Mask Generation for Fine-grained Object Insertion and Layout Control Jaskirat Singh et.al. 2312.05039v1 null
2023-12-08 Numerical determination of iron dust laminar flame speeds with the counterflow twin-flame technique C. E. A. G. van Gool et.al. 2312.04994v1 null
2023-12-07 Gen2Det: Generate to Detect Saksham Suri et.al. 2312.04566v1 null
2023-12-07 NeRFiller: Completing Scenes via Generative 3D Inpainting Ethan Weber et.al. 2312.04560v1 null
2023-12-07 PrimDiffusion: Volumetric Primitives Diffusion for 3D Human Generation Zhaoxi Chen et.al. 2312.04559v1 link
2023-12-07 GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation Shoufa Chen et.al. 2312.04557v1 null
2023-12-07 SPIDeRS: Structured Polarization for Invisible Depth and Reflectance Sensing Tomoki Ichikawa et.al. 2312.04553v1 null
2023-12-07 Generating Illustrated Instructions Sachit Menon et.al. 2312.04552v1 null
2023-12-07 PlayFusion: Skill Acquisition via Diffusion from Language-Annotated Play Lili Chen et.al. 2312.04549v1 null
2023-12-07 HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image Tong Wu et.al. 2312.04543v1 null
2023-12-07 Diffusion Reflectance Map: Single-Image Stochastic Inverse Rendering of Illumination and Reflectance Yuto Enyo et.al. 2312.04529v1 null
2023-12-07 RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models Ozgur Kara et.al. 2312.04524v1 link
2023-12-06 Relightable Gaussian Codec Avatars Shunsuke Saito et.al. 2312.03704v1 null
2023-12-06 Self-conditioned Image Generation via Generating Representations Tianhong Li et.al. 2312.03701v1 link
2023-12-06 Memory Triggers: Unveiling Memorization in Text-To-Image Generative Models through Word-Level Duplication Ali Naseh et.al. 2312.03692v1 null
2023-12-06 WarpDiffusion: Efficient Diffusion Model for High-Fidelity Virtual Try-on xujie zhang et.al. 2312.03667v1 null
2023-12-06 TokenCompose: Grounding Diffusion with Token-level Supervision Zirui Wang et.al. 2312.03626v1 link
2023-12-06 DreamComposer: Controllable 3D Object Generation via Multi-View Conditions Yunhan Yang et.al. 2312.03611v1 null
2023-12-06 DiffusionSat: A Generative Foundation Model for Satellite Imagery Samar Khanna et.al. 2312.03606v1 null
2023-12-06 MMM: Generative Masked Motion Model Ekkasit Pinyoanuntapong et.al. 2312.03596v1 link
2023-12-05 ReconFusion: 3D Reconstruction with Diffusion Priors Rundi Wu et.al. 2312.02981v1 null
2023-12-05 Alchemist: Parametric Control of Material Properties with Diffusion Models Prafull Sharma et.al. 2312.02970v1 null
2023-12-05 AmbiGen: Generating Ambigrams from Pre-trained Diffusion Model Boheng Zhao et.al. 2312.02967v1 null
2023-12-05 Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection Cheng-Ju Ho et.al. 2312.02966v1 link
2023-12-05 Drag-A-Video: Non-rigid Video Editing with Point-based Interaction Yao Teng et.al. 2312.02936v1 null
2023-12-05 WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation Jiachen Lu et.al. 2312.02934v1 link
2023-12-05 LivePhoto: Real Image Animation with Text-guided Motion Control Xi Chen et.al. 2312.02928v1 null
2023-12-05 Multimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and Fidelity for All-in-One Image Restoration Yuang Ai et.al. 2312.02918v1 null
2023-12-04 Latent Feature-Guided Diffusion Models for Shadow Removal Kangfu Mei et.al. 2312.02156v1 null
2023-12-04 Readout Guidance: Learning Control from Diffusion Features Grace Luo et.al. 2312.02150v1 null
2023-12-04 Generative Powers of Ten Xiaojuan Wang et.al. 2312.02149v1 null
2023-12-04 Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation Bingxin Ke et.al. 2312.02145v1 link
2023-12-04 DiffiT: Diffusion Vision Transformers for Image Generation Ali Hatamizadeh et.al. 2312.02139v1 link
2023-12-04 Style Aligned Image Generation via Shared Attention Amir Hertz et.al. 2312.02133v1 link
2023-12-04 VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence Yuchao Gu et.al. 2312.02087v1 null
2023-12-04 Computational Investigation on Collective Dynamical Behaviors of Flickering Laminar Buoyant Diffusion Flames in Circular Arrays Tao Yang et.al. 2312.02018v1 null
2023-12-01 MorpheuS: Neural Dynamic 360° Surface Reconstruction from Monocular RGB-D Video Hengyi Wang et.al. 2312.00778v1 null
2023-12-01 VideoBooth: Diffusion-based Video Generation with Image Prompts Yuming Jiang et.al. 2312.00777v1 null
2023-12-01 CompuCell3D Model of Cell Migration Reproduces Chemotaxis Pedro C. Dal-Castel et.al. 2312.00776v1 link
2023-12-01 Effects of three-dimensional slit geometry on flashback of premixed hydrogen flames in perforated burners Filippo Fruzza et.al. 2312.00744v1 null
2023-12-01 Resource-constrained knowledge diffusion processes inspired by human peer learning Ehsan Beikihassan et.al. 2312.00660v1 null
2023-12-01 TrackDiffusion: Multi-object Tracking Data Generation via Diffusion Models Pengxiang Li et.al. 2312.00651v1 null
2023-12-01 How the zebra got its stripes: Curvature-dependent diffusion orients Turing patterns on 3D surfaces Michael F. Staddon et.al. 2312.00637v1 null
2023-11-30 VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models Zhen Xing et.al. 2311.18837v1 null
2023-11-30 ART $\boldsymbol{\cdot}$ V: Auto-Regressive Text-to-Video Generation with Diffusion Models Wenming Weng et.al. 2311.18834v1 null
2023-11-30 Exploiting Diffusion Prior for Generalizable Pixel-Level Semantic Prediction Hsin-Ying Lee et.al. 2311.18832v1 link
2023-11-30 MotionEditor: Editing Video Motion via Content-Aware Diffusion Shuyuan Tu et.al. 2311.18830v1 link
2023-11-30 MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation Yanhui Wang et.al. 2311.18829v1 null
2023-11-30 One-step Diffusion with Distribution Matching Distillation Tianwei Yin et.al. 2311.18828v1 null
2023-11-30 ElasticDiffusion: Training-free Arbitrary Size Image Generation Moayed Haji-Ali et.al. 2311.18822v1 link
2023-11-30 Continual Diffusion with STAMINA: STack-And-Mask INcremental Adapters James Seale Smith et.al. 2311.18763v1 null
2023-11-29 Do text-free diffusion models learn discriminative visual representations? Soumik Mukhopadhyay et.al. 2311.17921v1 link
2023-11-29 Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models Daniel Geng et.al. 2311.17919v1 null
2023-11-29 AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text Jianfeng Zhang et.al. 2311.17917v1 null
2023-11-29 CG3D: Compositional Generation for Text-to-3D via Gaussian Splatting Alexander Vilesov et.al. 2311.17907v1 null
2023-11-29 SODA: Bottleneck Diffusion Models for Representation Learning Drew A. Hudson et.al. 2311.17901v1 null
2023-11-29 Leveraging Graph Diffusion Models for Network Refinement Tasks Puja Trivedi et.al. 2311.17856v1 null
2023-11-29 SPiC-E : Structural Priors in 3D Diffusion Models using Cross Entity Attention Etai Sella et.al. 2311.17834v1 null
2023-11-29 Analyzing and Explaining Image Classifiers via Diffusion Guidance Maximilian Augustin et.al. 2311.17833v1 null
2023-11-28 Material Palette: Extraction of Materials from a Single Image Ivan Lopes et.al. 2311.17060v1 null
2023-11-28 ReMoS: Reactive 3D Motion Synthesis for Two-Person Interactions Anindita Ghosh et.al. 2311.17057v1 null
2023-11-28 DiffuseBot: Breeding Soft Robots With Physics-Augmented Generative Diffusion Models Tsun-Hsuan Wang et.al. 2311.17053v1 null
2023-11-28 Surf-D: High-Quality Surface Generation for Arbitrary Topologies using Diffusion Models Zhengming Yu et.al. 2311.17050v1 null
2023-11-28 Adversarial Diffusion Distillation Axel Sauer et.al. 2311.17042v1 link
2023-11-28 Rumors with Changing Credibility Charlotte Out et.al. 2311.17040v1 null
2023-11-28 Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features Niladri Shekhar Dutt et.al. 2311.17024v1 link
2023-11-28 Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer Danah Yatim et.al. 2311.17009v1 null
2023-11-28 Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following Yutong Feng et.al. 2311.17002v1 null
2023-11-27 Test-time Adaptation of Discriminative Models via Diffusion Generative Feedback Mihir Prabhudesai et.al. 2311.16102v1 null
2023-11-27 CG-HOI: Contact-Guided 3D Human-Object Interaction Generation Christian Diller et.al. 2311.16097v1 null
2023-11-27 Street TryOn: Learning In-the-Wild Virtual Try-On from Unpaired Person Images Aiyu Cui et.al. 2311.16094v1 null
2023-11-27 Self-correcting LLM-controlled Diffusion Models Tsung-Han Wu et.al. 2311.16090v1 null
2023-11-27 DiffSLVA: Harnessing Diffusion Models for Sign Language Video Anonymization Zhaoyang Xia et.al. 2311.16060v1 link
2023-11-27 Exploring Attribute Variations in Style-based GANs using Diffusion Models Rishubh Parihar et.al. 2311.16052v1 null
2023-11-27 GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions Jiemin Fang et.al. 2311.16037v1 null
2023-11-27 Closing the ODE-SDE gap in score-based diffusion models through the Fokker-Planck equation Teo Deveney et.al. 2311.15996v1 null
2023-11-27 DiffAnt: Diffusion Models for Action Anticipation Zeyun Zhong et.al. 2311.15991v1 null
2023-11-24 CatVersion: Concatenating Embeddings for Diffusion-Based Text-to-Image Personalization Ruoyu Zhao et.al. 2311.14631v1 null
2023-11-24 Received Signal and Channel Parameter Estimation in Molecular Communications O. Tansel Baydas et.al. 2311.14621v1 null
2023-11-24 Animate124: Animating One Image to 4D Dynamic Scene Yuyang Zhao et.al. 2311.14603v1 null
2023-11-24 On the thermodynamic invariance of fine-grain and coarse-grain fluid models Thomas Dubos et.al. 2311.14564v1 null
2023-11-24 ToddlerDiffusion: Flash Interpretable Controllable Diffusion Model Eslam Mohamed Bakr et.al. 2311.14542v1 null
2023-11-24 GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting Yiwen Chen et.al. 2311.14521v1 link
2023-11-24 MVControl: Adding Conditional Control to Multi-view Diffusion for Controllable Text-to-3D Generation Zhiqi Li et.al. 2311.14494v1 link
2023-11-22 On diffusion-based generative models and their error bounds: The log-concave case with full convergence estimates Stefano Bruno et.al. 2311.13584v1 null
2023-11-22 WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space Katja Schwarz et.al. 2311.13570v1 null
2023-11-22 ADriver-I: A General World Model for Autonomous Driving Fan Jia et.al. 2311.13549v1 null
2023-11-22 DiffusionMat: Alpha Matting as Sequential Refinement Learning Yangyang Xu et.al. 2311.13535v1 null
2023-11-22 Guided Flows for Generative Modeling and Decision Making Qinqing Zheng et.al. 2311.13443v1 null
2023-11-22 LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes Jaeyoung Chung et.al. 2311.13384v1 null
2023-11-21 Bubble departure and sliding in high-pressure flow boiling of water Artyom Kossolapov et.al. 2311.12749v1 null
2023-11-21 GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning Jiaxi Lv et.al. 2311.12631v1 null
2023-11-21 HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis Sang-Hoon Lee et.al. 2311.12454v1 link
2023-11-21 Stable Diffusion For Aerial Object Detection Yanan Jian et.al. 2311.12345v1 null
2023-11-21 LoCo: Locally Constrained Training-Free Layout-to-Image Synthesis Peiang Zhao et.al. 2311.12342v1 null
2023-11-21 Overcoming Pathology Image Data Deficiency: Generating Images from Pathological Transformation Process Zeyu Liu et.al. 2311.12316v1 link
2023-11-20 Macroscopic description of a heavy particle immersed within a flow of light particles Radek Erban et.al. 2311.12021v1 null
2023-11-20 An Image is Worth Multiple Words: Multi-attribute Inversion for Constrained Text-to-Image Synthesis Aishwarya Agarwal et.al. 2311.11919v1 null
2023-11-20 Evolution of internal gravity waves in meso-scale eddies Pablo Sebastia Saez et.al. 2311.11916v1 null
2023-11-20 Log-periodic oscillations as real-time signatures of hierarchical dynamics in proteins Emanuel Dorbath et.al. 2311.11839v1 null
2023-11-20 Holistic Inverse Rendering of Complex Facade via Aerial 3D Scanning Zixuan Xie et.al. 2311.11825v1 null
2023-11-17 Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning Rohit Girdhar et.al. 2311.10709v1 null
2023-11-17 SelfEval: Leveraging the discriminative nature of generative models for evaluation Sai Saketh Rambhatla et.al. 2311.10708v1 null
2023-11-17 Enhancing Object Coherence in Layout-to-Image Synthesis Yibin Wang et.al. 2311.10522v1 link
2023-11-16 The Chosen One: Consistent Characters in Text-to-Image Diffusion Models Omri Avrahami et.al. 2311.10093v1 null
2023-11-16 Spontaneous Opinion Swings in the Voter Model with Latency Giovanni Palermo et.al. 2311.10045v1 null
2023-11-16 TransFusion – A Transparency-Based Diffusion Model for Anomaly Detection Matic Fučka et.al. 2311.09999v1 null
2023-11-16 The divergence-free velocity formulation of the consistent Navier-Stokes Cahn-Hilliard model with non-matching densities, divergence-conforming discretization, and benchmarks M. ten Eikelder et.al. 2311.09966v1 null
2023-11-16 DSR-Diff: Depth Map Super-Resolution with Diffusion Model Yuan Shi et.al. 2311.09919v1 null
2023-11-15 Single-Image 3D Human Digitization with Shape-Guided Diffusion Badour AlBahar et.al. 2311.09221v1 null
2023-11-15 DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model Yinghao Xu et.al. 2311.09217v1 null
2023-11-15 Finding polarised communities and tracking information diffusion on Twitter: The Irish Abortion Referendum Caroline Pena et.al. 2311.09196v1 null
2023-11-15 Fast Detection of Phase Transitions with Multi-Task Learning-by-Confusion Julian Arnold et.al. 2311.09128v1 null
2023-11-15 Contrastive Transformer Learning with Proximity Data Generation for Text-Based Person Search Hefeng Wu et.al. 2311.09084v1 link
2023-11-15 A Spectral Diffusion Prior for Hyperspectral Image Super-Resolution Jianjun Liu et.al. 2311.08955v1 null
2023-11-13 Fast and Space-Efficient Parallel Algorithms for Influence Maximization Letong Wang et.al. 2311.07554v1 link
2023-11-13 Harnessing elastic instabilities for enhanced mixing and reaction kinetics in porous media Christopher A. Browne et.al. 2311.07431v1 link
2023-11-13 Robust semi-supervised segmentation with timestep ensembling diffusion models Margherita Rosnati et.al. 2311.07421v1 null
2023-11-10 Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization Weiyang Liu et.al. 2311.06243v1 null
2023-11-10 Diffusion Models for Earth Observation Use-cases: from cloud removal to urban change detection Fulvio Sanguigni et.al. 2311.06222v1 null
2023-11-10 Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model Jiahao Li et.al. 2311.06214v1 null
2023-11-10 Turbulence Scaling from Deep Learning Diffusion Generative Models Tim Whittaker et.al. 2311.06112v1 null
2023-11-09 Diffusion-Generative Multi-Fidelity Learning for Physical Simulation Zheng Wang et.al. 2311.05606v1 null
2023-11-09 Bayesian Methods for Media Mix Modelling with shape and funnel effects Javier Marin et.al. 2311.05587v1 null
2023-11-09 LCM-LoRA: A Universal Stable-Diffusion Acceleration Module Simian Luo et.al. 2311.05556v1 link
2023-11-09 From Stability to Change: The Potential Application of Bifurcation Theory to Opinion Dynamics Considerations Yasuko Kawahata et.al. 2311.05488v1 null
2023-11-09 Lithium-ion battery performance model including solvent segregation effects Ruihe Li et.al. 2311.05467v1 null
2023-11-09 3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Models Haibo Yang et.al. 2311.05464v1 link
2023-11-09 ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors Jingwen Chen et.al. 2311.05463v1 null
2023-11-08 Transferability of atomic energies from alchemical decomposition Michael J. Sahre et.al. 2311.04784v1 link
2023-11-08 Weakly-supervised deepfake localization in diffusion-generated images Dragos Tantaru et.al. 2311.04584v1 link
2023-11-07 I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models Shiwei Zhang et.al. 2311.04145v1 link
2023-11-07 Simple Bundles of Complex Networks Alexandre Benatti et.al. 2311.04133v1 null
2023-11-07 Generative Structural Design Integrating BIM and Diffusion Model Zhili He et.al. 2311.04052v1 link
2023-11-07 A Method to Improve the Performance of Reinforcement Learning Based on the Y Operator for a Class of Stochastic Differential Equation-Based Child-Mother Systems Cheng Yin et.al. 2311.04014v1 null
2023-11-06 TS-Diffusion: Generating Highly Complex Time Series with Diffusion Models Yangming Li et.al. 2311.03303v1 null
2023-11-06 LDM3D-VR: Latent Diffusion Model for 3D VR Gabriela Ben Melech Stan et.al. 2311.03226v1 null
2023-11-06 Persistent homology for high-dimensional data based on spectral methods Sebastian Damrich et.al. 2311.03087v1 link
2023-11-06 AnyText: Multilingual Visual Text Generation And Editing Yuxiang Tuo et.al. 2311.03054v1 link
2023-11-03 Latent Diffusion Model for Conditional Reservoir Facies Generation Daesoo Lee et.al. 2311.01968v1 null
2023-11-03 DiffDub: Person-generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-encoder Tao Liu et.al. 2311.01811v1 null
2023-11-03 On the Generalization Properties of Diffusion Models Puheng Li et.al. 2311.01797v1 link
2023-11-03 PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation Yuhan Ding et.al. 2311.01773v1 null
2023-11-03 CDGraph: Dual Conditional Social Graph Synthesizing via Diffusion Model Jui-Yi Tsai et.al. 2311.01729v1 null
2023-11-02 Time Series Anomaly Detection using Diffusion-based Models Ioana Pintilie et.al. 2311.01452v1 link
2023-11-02 Constrained-Context Conditional Diffusion Models for Imitation Learning Vaibhav Saxena et.al. 2311.01419v1 null
2023-11-02 The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing Shen Nie et.al. 2311.01410v1 null
2023-11-02 Sim2Real Bilevel Adaptation for Object Surface Classification using Vision-Based Tactile Sensors Gabriele M. Caddeo et.al. 2311.01380v1 link
2023-11-02 DP-Mix: Mixup-based Data Augmentation for Differentially Private Learning Wenxuan Bao et.al. 2311.01295v1 link
2023-11-02 Unraveling Diffusion in Fusion Plasma: A Case Study of In Situ Processing and Particle Sorting Junmin Gu et.al. 2311.01288v1 null
2023-11-01 De-Diffusion Makes Text a Strong Cross-Modal Interface Chen Wei et.al. 2311.00618v1 null
2023-11-01 Controllable Music Production with Diffusion Models and Guidance Gradients Mark Levy et.al. 2311.00613v1 null
2023-11-01 Intriguing Properties of Data Attribution on Diffusion Models Xiaosen Zheng et.al. 2311.00500v1 link
2023-11-01 Diffusion models for probabilistic programming Simon Dirmeier et.al. 2311.00474v1 link
2023-11-01 Dual Conditioned Diffusion Models for Out-Of-Distribution Detection: Application to Fetal Ultrasound Videos Divyanshu Mishra et.al. 2311.00469v1 null
2023-10-31 SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction Xinyuan Chen et.al. 2310.20700v1 null
2023-10-31 Diffusion Reconstruction of Ultrasound Images with Informative Uncertainty Yuxin Zhang et.al. 2310.20618v1 null
2023-10-29 JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation Yao Yao et.al. 2310.19180v1 null
2023-10-29 Learning to Follow Object-Centric Image Editing Instructions Faithfully Tuhin Chakrabarty et.al. 2310.19145v1 link
2023-10-29 Backward and Forward Inference in Interacting Independent-Cascade Processes: A Scalable and Convergent Message-Passing Approach Nouman Khan et.al. 2310.19138v1 null
2023-10-29 Bespoke Solvers for Generative Flow Models Neta Shaul et.al. 2310.19075v1 null
2023-10-29 Controllable Group Choreography using Contrastive Diffusion Nhat Le et.al. 2310.18986v1 null
2023-10-29 Adversarial Examples Are Not Real Features Ang Li et.al. 2310.18936v1 link
2023-10-27 Gen2Sim: Scaling up Robot Learning in Simulation with Generative Models Pushkal Katara et.al. 2310.18308v1 null
2023-10-27 Unsteady evolution of slip and drag in surfactant-contaminated superhydrophobic channels Samuel D. Tomlinson et.al. 2310.18184v1 null
2023-10-27 Style Description based Text-to-Speech with Conditional Prosodic Layer Normalization based Diffusion GAN Neeraj Kumar et.al. 2310.18169v1 null
2023-10-27 Lost in Translation – Multilingual Misinformation and its Evolution Dorian Quelle et.al. 2310.18089v1 null
2023-10-27 ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image Kyle Sargent et.al. 2310.17994v1 null
2023-10-26 6-DoF Stability Field via Diffusion Models Takuma Yoneda et.al. 2310.17649v1 null
2023-10-26 Generative Fractional Diffusion Models Gabriel Nobis et.al. 2310.17638v1 null
2023-10-26 Orbital-optimized Density Functional Calculations of Molecular Rydberg Excited States with Real Space Grid Representation and Self-Interaction Correction Alec E. Sigurðarson et.al. 2310.17605v1 null
2023-10-26 Noise-Free Score Distillation Oren Katzir et.al. 2310.17590v1 null
2023-10-27 Global Structure-Aware Diffusion Process for Low-Light Image Enhancement Jinhui Hou et.al. 2310.17577v2 link
2023-10-26 DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation Yongxin Zhu et.al. 2310.17570v1 null
2023-10-26 SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching Xinghui Li et.al. 2310.17569v1 null
2023-10-27 The Expressive Power of Low-Rank Adaptation Yuchen Zeng et.al. 2310.17513v2 link
2023-10-25 PERF: Panoramic Neural Radiance Field from a Single Panorama Guangcong Wang et.al. 2310.16831v1 link
2023-10-25 CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images Aaron Gokaslan et.al. 2310.16825v1 link
2023-10-26 DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior Jingxiang Sun et.al. 2310.16818v2 link
2023-10-25 Optical Kinetic Theory of Nonlinear Multi-mode Photonic Networks Arkady Kurnosov et.al. 2310.16784v1 null
2023-10-25 Kiki or Bouba? Sound Symbolism in Vision-and-Language Models Morris Alper et.al. 2310.16781v1 null
2023-10-25 Multi-scale Diffusion Denoised Smoothing Jongheon Jeong et.al. 2310.16779v1 link
2023-10-25 Discrete variance decay analysis of spurious mixing Tridib Banerjee et.al. 2310.16768v1 null
2023-10-25 Scalar mass conservation in turbulent mixture fraction based combustion models through consistent local flow parameters Marco Davidovic et.al. 2310.16743v1 null
2023-10-24 From Posterior Sampling to Meaningful Diversity in Image Restoration Noa Cohen et.al. 2310.16047v1 null
2023-10-24 CVPR 2023 Text Guided Video Editing Competition Jay Zhangjie Wu et.al. 2310.16003v1 link
2023-10-24 Classical wave-particle localization in disordered landscapes Abel J. Abraham et.al. 2310.16000v1 null
2023-10-25 Improving Robustness and Reliability in Medical Image Classification with Latent-Guided Diffusion and Nested-Ensembles Xing Shen et.al. 2310.15952v2 null
2023-10-24 Language-driven Scene Synthesis using Multi-conditional Diffusion Model An Vuong et.al. 2310.15948v1 link
2023-10-23 FreeNoise: Tuning-Free Longer Video Diffusion Via Noise Rescheduling Haonan Qiu et.al. 2310.15169v1 link
2023-10-23 Matryoshka Diffusion Models Jiatao Gu et.al. 2310.15111v1 null
2023-10-23 Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model Ruoxi Shi et.al. 2310.15110v1 link
2023-10-24 Wonder3D: Single Image to 3D using Cross-Domain Diffusion Xiaoxiao Long et.al. 2310.15008v2 null
2023-10-23 Orientation-Aware Leg Movement Learning for Action-Driven Human Motion Prediction Chunzhi Gu et.al. 2310.14907v1 null
2023-10-20 Achieving Single-Electron Sensitivity at Enhanced Speed in Fully-Depleted CCDs with Double-Gate MOSFETs Miguel Sofo-Haro et.al. 2310.13644v1 null
2023-10-20 ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection Zhongzhan Huang et.al. 2310.13545v1 link
2023-10-20 A Critical Insight into Pretransitional Behavior and Dielectric Tunability of Relaxor Ceramics Sylwester J. Rzoska et.al. 2310.13326v1 null
2023-10-19 Variational Inference for SDEs Driven by Fractional Noise Rembert Daems et.al. 2310.12975v1 null
2023-10-19 A Markovian dynamics for $C. elegans$ behavior across scales Antonio C. Costa et.al. 2310.12883v1 link
2023-10-19 EMIT-Diff: Enhancing Medical Image Segmentation via Text-Guided Diffusion Model Zheyuan Zhang et.al. 2310.12868v1 null
2023-10-19 An effective theory of collective deep learning Lluís Arola-Fernández et.al. 2310.12802v1 link
2023-10-19 Energy-Based Models For Speech Synthesis Wanli Sun et.al. 2310.12765v1 null
2023-10-18 Object-aware Inversion and Reassembly for Image Editing Zhen Yang et.al. 2310.12149v1 null
2023-10-18 Quality Diversity through Human Feedback Li Ding et.al. 2310.12103v1 link
2023-10-18 Image Super-resolution Via Latent Diffusion: A Sampling-space Mixture Of Experts And Frequency-augmented Decoder Approach Feng Luo et.al. 2310.12004v1 link
2023-10-18 Bayesian Flow Networks in Continual Learning Mateusz Pyla et.al. 2310.12001v1 null
2023-10-18 InfoDiffusion: Information Entropy Aware Diffusion Process for Non-Autoregressive Text Generation Renzhi Wang et.al. 2310.11976v1 link
2023-10-17 Elucidating The Design Space of Classifier-Guided Diffusion Generation Jiajun Ma et.al. 2310.11311v1 link
2023-10-17 Favorable and unfavorable many-body interactions for near-field radiative heat transfer in nanoparticle networks Minggang Luo et.al. 2310.11273v1 null
2023-10-17 A diffusive wetting model for water entry/exit based on the weakly-compressible SPH method Shuoguo Zhang et.al. 2310.11179v1 null
2023-10-17 Leveraging Content-based Features from Multiple Acoustic Models for Singing Voice Conversion Xueyao Zhang et.al. 2310.11160v1 link
2023-10-17 BayesDiff: Estimating Pixel-wise Uncertainty in Diffusion via Bayesian Inference Siqi Kou et.al. 2310.11142v1 link
2023-10-17 3D Structure-guided Network for Tooth Alignment in 2D Photograph Yulong Dou et.al. 2310.11106v1 link
2023-10-16 A Survey on Video Diffusion Models Zhen Xing et.al. 2310.10647v1 link
2023-10-16 TOSS:High-quality Text-guided Novel View Synthesis from a Single Image Yukai Shi et.al. 2310.10644v1 null
2023-10-16 LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts Hanan Gani et.al. 2310.10640v1 link
2023-10-16 Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion Models Kevin Black et.al. 2310.10639v1 link
2023-10-16 DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing Jia-Wei Liu et.al. 2310.10624v1 null
2023-10-16 ViPE: Visualise Pretty-much Everything Hassan Shahmohammadi et.al. 2310.10543v1 link
2023-10-13 Hypernymy Understanding Evaluation of Text-to-Image Models via WordNet Hierarchy Anton Baryshnikov et.al. 2310.09247v1 link
2023-10-13 Unseen Image Synthesis with Diffusion Models Ye Zhu et.al. 2310.09213v1 null
2023-10-13 The effect of solar wind on the charged particles’ diffusion coefficients J. F. Wang et.al. 2310.09211v1 null
2023-10-12 OmniControl: Control Any Joint at Any Time for Human Motion Generation Yiming Xie et.al. 2310.08580v1 link
2023-10-12 HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion Xian Liu et.al. 2310.08579v1 null
2023-10-12 NetDiffusion: Network Data Augmentation Through Protocol-Constrained Traffic Generation Xi Jiang et.al. 2310.08543v1 null
2023-10-12 GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors Taoran Yi et.al. 2310.08529v1 link
2023-10-12 MotionDirector: Motion Customization of Text-to-Video Diffusion Models Rui Zhao et.al. 2310.08465v1 link
2023-10-12 Debias the Training of Diffusion Models Hu Yu et.al. 2310.08442v1 null
2023-10-12 Neural Diffusion Models Grigory Bartosh et.al. 2310.08337v1 null
2023-10-11 ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models Yingqing He et.al. 2310.07702v1 link
2023-10-11 ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation Bo Peng et.al. 2310.07697v1 link
2023-10-11 Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models Lai Zeqiang et.al. 2310.07653v1 link
2023-10-11 Flux gradient relations and their dependence on turbulence anisotropy Samuele Mosso et.al. 2310.07503v1 null
2023-10-11 Boosting Black-box Attack to Deep Neural Networks with Conditional Diffusion Models Renyang Liu et.al. 2310.07492v1 null
2023-10-11 Multi-Concept T2I-Zero: Tweaking Only The Text Embeddings and Nothing Else Hazarapet Tunanyan et.al. 2310.07419v1 null
2023-10-10 What Does Stable Diffusion Know about the 3D Scene? Guanqi Zhan et.al. 2310.06836v1 link
2023-10-10 Impact of grain boundary and surface diffusion on predicted fission gas bubble behavior and release in UO $_2$ fuel Md Ali Muntaha et.al. 2310.06795v1 null
2023-10-10 HiFi-123: Towards High-fidelity One Image to 3D Content Generation Wangbo Yu et.al. 2310.06744v1 null
2023-10-10 Latent Diffusion Counterfactual Explanations Karim Farid et.al. 2310.06668v1 null
2023-10-10 Tertiary Lymphoid Structures Generation through Graph-based Diffusion Manuel Madeira et.al. 2310.06661v1 null
2023-10-09 FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing Yuren Cong et.al. 2310.05922v1 null
2023-10-10 Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models Zhili Liu et.al. 2310.05873v2 null
2023-10-09 A Bias-Variance-Covariance Decomposition of Kernel Scores for Generative Models Sebastian G. Gruber et.al. 2310.05833v1 null
2023-10-09 DiffuSeq-v2: Bridging Discrete and Continuous Text Spaces for Accelerated Seq2Seq Diffusion Models Shansan Gong et.al. 2310.05793v1 link
2023-10-09 Language Model Beats Diffusion – Tokenizer is Key to Visual Generation Lijun Yu et.al. 2310.05737v1 link
2023-10-09 CIFAR-10-Warehouse: Broad and More Realistic Testbeds in Model Generalization Analysis Xiaoxiao Sun et.al. 2310.04414v2 null
2023-10-06 Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference Simian Luo et.al. 2310.04378v1 link
2023-10-05 Aligning Text-to-Image Diffusion Models with Reward Backpropagation Mihir Prabhudesai et.al. 2310.03739v1 link
2023-10-05 Stochastic interpolants with data-dependent couplings Michael S. Albergo et.al. 2310.03725v1 null
2023-10-05 Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints Chuan Fang et.al. 2310.03602v1 null
2023-10-05 Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion Anton Razzhigaev et.al. 2310.03502v1 link
2023-10-05 Deep Generative Models of Music Expectation Ninon Lizé Masclef et.al. 2310.03500v1 null
2023-10-05 An Extended Phase Graph-based framework for DANTE-SPACE simulations including physiological, temporal, and spatial variations Matthijs H. S. de Buck et.al. 2310.03429v1 null
2023-10-04 Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models Jianglong Ye et.al. 2310.03020v1 null
2023-10-04 Efficient-3DiM: Learning a Generalizable Single-image Novel-view Synthesizer in One Day Yifan Jiang et.al. 2310.03015v1 null
2023-10-04 Probing Intersectional Biases in Vision-Language Models with Counterfactual Examples Phillip Howard et.al. 2310.02988v1 null
2023-10-04 T $^3$ Bench: Benchmarking Current Progress in Text-to-3D Generation Yuze He et.al. 2310.02977v1 link
2023-10-04 Fast, Expressive SE $(n)$ Equivariant Networks through Weight-Sharing in Position-Orientation Space Erik J Bekkers et.al. 2310.02970v1 link
2023-10-04 Optimal Transport with Adaptive Regularisation Hugues Van Assel et.al. 2310.02925v1 null
2023-10-04 Boosting Dermatoscopic Lesion Segmentation via Diffusion Models with Visual and Textual Prompts Shiyi Du et.al. 2310.02906v1 null
2023-10-03 Hierarchical Generation of Human-Object Interactions with Diffusion Probabilistic Models Huaijin Pi et.al. 2310.02242v1 null
2023-10-03 Leveraging Diffusion Disentangled Representations to Mitigate Shortcuts in Underspecified Visual Tasks Luca Scimeca et.al. 2310.02230v1 null
2023-10-03 Global Attractor for a Reaction-Diffusion Model Arising in Biological Dynamic in 3D Soil Structure Mohamed Elghandouri et.al. 2310.02060v1 null
2023-10-03 AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model Zibin Dong et.al. 2310.02054v1 null
2023-10-03 Spectral operator learning for parametric PDEs without data reliance Junho Choi et.al. 2310.02013v1 null
2023-10-03 Optimizing microlens arrays for incoherent HiLo microscopy Ziao Jiao et.al. 2310.01939v1 null
2023-10-02 LLM-grounded Video Diffusion Models Long Lian et.al. 2309.17444v2 null
2023-09-29 Directly Fine-Tuning Diffusion Models on Differentiable Rewards Kevin Clark et.al. 2309.17400v1 null
2023-09-29 Physics-Informed Neural Network for the Transient Diffusivity Equation in Reservoir Engineering Daniel Badawi et.al. 2309.17345v1 null
2023-09-28 KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing Jiancheng Huang et.al. 2309.16608v1 null
2023-09-28 CCEdit: Creative and Controllable Video Editing via Diffusion Models Ruoyu Feng et.al. 2309.16496v1 null
2023-09-28 Distilling ODE Solvers of Diffusion Models into Smaller Steps Sanghwan Kim et.al. 2309.16421v1 null
2023-09-27 Exploiting the Signal-Leak Bias in Diffusion Models Martin Nicolas Everaert et.al. 2309.15842v1 null
2023-09-27 Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation David Junhao Zhang et.al. 2309.15818v1 link
2023-09-27 Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack Xiaoliang Dai et.al. 2309.15807v1 null
2023-09-27 Factorized Diffusion Architectures for Unsupervised Image Generation and Segmentation Xin Yuan et.al. 2309.15726v1 null
2023-09-27 Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing Kai Wang et.al. 2309.15664v1 link
2023-09-27 Direct Sensing of Remote Nuclei: Expanding the Reach of Cross-Effect Dynamic Nuclear Polarization Amaria Javed et.al. 2309.15653v1 null
2023-09-26 Generating Visual Scenes from Touch Fengyu Yang et.al. 2309.15117v1 null
2023-09-27 LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models Yaohui Wang et.al. 2309.15103v2 link
2023-09-26 FEC: Three Finetuning-free Methods to Enhance Consistency for Real Image Editing Songyan Chen et.al. 2309.14934v1 null
2023-09-27 ITEM3D: Illumination-Aware Directional Texture Editing for 3D Models Shengqi Liu et.al. 2309.14872v2 null
2023-09-26 Navigating Text-To-Image Customization:From LyCORIS Fine-Tuning to Model Evaluation Shin-Ying Yeh et.al. 2309.14859v1 link
2023-09-25 Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic Segmentation Quang Nguyen et.al. 2309.14303v1 link
2023-09-25 Soft Mixture Denoising: Beyond the Expressive Bottleneck of Diffusion Models Yangming Li et.al. 2309.14068v1 null
2023-09-25 Mixing as a correlated aggregation process Joris Heyman et.al. 2309.14040v1 link
2023-09-22 MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation Jiahao Xie et.al. 2309.13042v1 link
2023-09-22 Diffusion Augmentation for Sequential Recommendation Qidong Liu et.al. 2309.12858v1 link
2023-09-22 Accuracy and stability analysis of horizontal discretizations used in unstructured grid ocean models Fabricio Rodrigues Lapolli et.al. 2309.12832v1 null
2023-09-22 Synthetic Boost: Leveraging Synthetic Data for Enhanced Vision-Language Segmentation in Echocardiography Rabin Adhikari et.al. 2309.12829v1 link
2023-09-22 Semantic Change Driven Generative Semantic Communication Framework Wanting Yang et.al. 2309.12775v1 link
2023-09-21 A Diffusion-Model of Joint Interactive Navigation Matthew Niedoba et.al. 2309.12508v1 null
2023-09-21 License Plate Super-Resolution Using Diffusion Models Sawsan AlHalawani et.al. 2309.12506v1 null
2023-09-21 Performance Conditioning for Diffusion-Based Multi-Instrument Music Synthesis Ben Maman et.al. 2309.12283v1 null
2023-09-20 FreeU: Free Lunch in Diffusion U-Net Chenyang Si et.al. 2309.11497v1 link
2023-09-20 Generative Agent-Based Modeling: Unveiling Social System Dynamics through Coupling Mechanistic Models with Generative Artificial Intelligence Navid Ghaffarzadegan et.al. 2309.11456v1 null
2023-09-20 Deep Networks as Denoising Algorithms: Sample-Efficient Learning of Diffusion Models in High-Dimensional Graphical Models Song Mei et.al. 2309.11420v1 null
2023-09-20 EDMP: Ensemble-of-costs-guided Diffusion for Motion Planning Kallol Saha et.al. 2309.11414v1 link
2023-09-20 Face Aging via Diffusion-based Editing Xiangyi Chen et.al. 2309.11321v1 link
2023-09-20 FaceDiffuser: Speech-Driven 3D Facial Animation Synthesis Using Diffusion Stefan Stan et.al. 2309.11306v1 link
2023-09-19 PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance Peiqing Yang et.al. 2309.10810v1 link
2023-09-19 Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation Yatong Bai et.al. 2309.10740v1 link
2023-09-19 Reconstruct-and-Generate Diffusion Model for Detail-Preserving Image Denoising Yujin Wang et.al. 2309.10714v1 null
2023-09-18 Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees Alexia Jolicoeur-Martineau et.al. 2309.09968v1 link
2023-09-18 What is a Fair Diffusion Model? Designing Generative Text-To-Image Models to Incorporate Various Worldviews Zoe De Simone et.al. 2309.09944v1 link
2023-09-18 DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving Xiaofeng Wang et.al. 2309.09777v1 null
2023-09-18 Application-driven Validation of Posteriors in Inverse Problems Tim J. Adler et.al. 2309.09764v1 null
2023-09-19 Non-Hermitian physics and topological phenomena in convective thermal metamaterials Zhoufei Liu et.al. 2309.09681v2 null
2023-09-18 Anomalous Diffusion of Lithium-Anion Clusters in Ionic Liquids YeongKyu Lee et.al. 2309.09674v1 null
2023-09-15 Compositional Foundation Models for Hierarchical Planning Anurag Ajay et.al. 2309.08587v1 null
2023-09-15 Denoising Diffusion Probabilistic Models for Hardware-Impaired Communications Mehdi Letafati et.al. 2309.08568v1 null
2023-09-15 Breathing New Life into 3D Assets with Generative Repainting Tianfu Wang et.al. 2309.08523v1 link
2023-09-15 Diffuse-illumination holographic optical coherence tomography Léo Puyo et.al. 2309.08486v1 null
2023-09-15 Large-Vocabulary 3D Diffusion Model with Transformer Ziang Cao et.al. 2309.07920v2 null
2023-09-14 Generative Image Dynamics Zhengqi Li et.al. 2309.07906v1 null
2023-09-14 Beta Diffusion Mingyuan Zhou et.al. 2309.07867v1 link
2023-09-14 Study and evaluation of the Ronen Method accuracy at material interfaces Johan Cufe et.al. 2309.07756v1 null
2023-09-14 Dual-angle interferometric scattering microscopy for optical multiparametric particle characterization Erik Olsén et.al. 2309.07572v1 null
2023-09-13 UnifiedGesture: A Unified Gesture Synthesis Model for Multiple Skeletons Sicheng Yang et.al. 2309.07051v1 link
2023-09-13 Experimental Study on the Detection of Frozen Diffused Ammonia Blockage in the Inactive Section of a Variable Conductance Heat Pipe F. K. Miranda et.al. 2309.06936v1 null
2023-09-13 DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models Namhyuk Ahn et.al. 2309.06933v1 null
2023-09-13 MagiCapture: High-Resolution Multi-Concept Portrait Customization Junha Hyung et.al. 2309.06895v1 null
2023-09-13 DCTTS: Discrete Diffusion Model with Contrastive Learning for Text-to-speech Generation Zhichao Wu et.al. 2309.06787v1 null
2023-09-13 High throughput sampling of phase space with deep learning potentials: $δ$ -AlOOH at geophysical conditions Chenxing Luo et.al. 2309.06712v1 null
2023-09-13 Generalizable improvement of the Spalart-Allmaras model through assimilation of experimental data Deepinder Jot Singh Aulakh et.al. 2309.06679v1 null
2023-09-12 InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation Xingchao Liu et.al. 2309.06380v1 link
2023-09-12 Dispersion versus diffusion in mixing fronts Gauthier Rousseau et.al. 2309.06347v1 null
2023-09-12 Unraveling biochemical spatial patterns: machine learning approaches to the inverse problem of Turing patterns Antonio Matas-Gil et.al. 2309.06339v1 link
2023-09-12 Fg-T2M: Fine-Grained Text-Driven Human Motion Generation via Diffusion Model Yin Wang et.al. 2309.06284v1 null
2023-09-11 Diffusion-Guided Reconstruction of Everyday Hand-Object Interaction Clips Yufei Ye et.al. 2309.05663v1 null
2023-09-11 PAI-Diffusion: Constructing and Serving a Family of Open Chinese Diffusion Models for Text-to-image Synthesis on the Cloud Chengyu Wang et.al. 2309.05534v1 null
2023-09-11 NExT-GPT: Any-to-Any Multimodal LLM Shengqiong Wu et.al. 2309.05519v1 link
2023-09-08 Variations and Relaxations of Normalizing Flows Keegan Kelly et.al. 2309.04433v1 null
2023-09-08 Create Your World: Lifelong Text-to-Image Diffusion Gan Sun et.al. 2309.04430v1 null
2023-09-08 MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask Yupeng Zhou et.al. 2309.04399v1 null
2023-09-08 MoEController: Instruction-based Arbitrary Image Manipulation with Mixture-of-Expert Controllers Sijia Li et.al. 2309.04372v1 null
2023-09-08 The role of tumbling in bacterial scattering at convex obstacles Theresa Jakuszeit et.al. 2309.04326v1 null
2023-09-07 InstructDiffusion: A Generalist Modeling Interface for Vision Tasks Zigang Geng et.al. 2309.03895v1 null
2023-09-07 DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection Manlin Zhang et.al. 2309.03893v1 null
2023-09-07 Text-to-feature diffusion for audio-visual few-shot learning Otniel-Bogdan Mercea et.al. 2309.03869v1 link
2023-09-07 Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption Teng Hu et.al. 2309.03729v1 link
2023-09-07 DiffDefense: Defending against Adversarial Attacks via Diffusion Models Hondamunige Prasanna Silva et.al. 2309.03702v1 link
2023-09-07 Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model Sungwon Hwang et.al. 2309.03550v1 null
2023-09-06 My Art My Choice: Adversarial Protection Against Unruly AI Anthony Rhodes et.al. 2309.03198v1 null
2023-09-06 SLiMe: Segment Like Me Aliasghar Khani et.al. 2309.03179v1 link
2023-09-06 MCM: Multi-condition Motion Synthesis Framework for Multi-scenario Zeyu Ling et.al. 2309.03031v1 null
2023-09-05 Generating Realistic Images from In-the-wild Sounds Taegyeong Lee et.al. 2309.02405v1 null
2023-09-05 A Diffusion Quantum Monte Carlo Approach to the Polaritonic Ground State Braden M. Weight et.al. 2309.02349v1 link
2023-09-05 Robust frequency-dependent diffusion kurtosis computation using an efficient direction scheme, axisymmetric modelling, and spatial regularization J. Hamilton et.al. 2309.02319v1 null
2023-09-05 Neuromorphic nanocluster networks: Critical role of the substrate in nano-link formation Wenkai Wu et.al. 2309.02299v1 null
2023-09-05 Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models Haixu Song et.al. 2309.02218v1 link
2023-09-01 Iterative Multi-granular Image Editing using Diffusion Models K J Joseph et.al. 2309.00613v1 null
2023-09-01 VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation Xin Li et.al. 2309.00398v1 null
2023-09-01 Fast Diffusion EM: a diffusion model for blind inverse problems with application to deconvolution Charles Laroche et.al. 2309.00287v1 link
2023-09-01 Data-driven Topology Optimization of Channel Flow Problems Ce Guan et.al. 2309.00278v1 null
2023-08-31 InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion Sirui Xu et.al. 2308.16905v1 link
2023-09-01 GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields Yanjie Ze et.al. 2308.16891v2 link
2023-08-31 Prediction of Diblock Copolymer Morphology via Machine Learning Hyun Park et.al. 2308.16886v1 null
2023-08-31 Diffusion Models for Interferometric Satellite Aperture Radar Alexandre Tuel et.al. 2308.16847v1 link
2023-09-01 Irregular Traffic Time Series Forecasting Based on Asynchronous Spatio-Temporal Graph Convolutional Network Weijia Zhang et.al. 2308.16818v2 null
2023-09-01 Ref-Diff: Zero-shot Referring Image Segmentation with Generative Models Minheng Ni et.al. 2308.16777v2 null
2023-08-31 Terrain Diffusion Network: Climatic-Aware Terrain Generation with Geological Sketch Guidance Zexin Hu et.al. 2308.16725v1 null
2023-08-30 SignDiff: Learning Diffusion Models for American Sign Language Production Sen Fang et.al. 2308.16082v1 null
2023-08-30 Click Metamaterials: Fast Acquisition of Thermal Conductivity and Functionality Diversities Chengmeng Wang et.al. 2308.16057v1 null
2023-08-30 DiffuVolume: Diffusion Model for Volume based Stereo Matching Dian Zheng et.al. 2308.15989v1 null
2023-08-30 Physics-Informed DeepMRI: Bridging the Gap from Heat Diffusion to k-Space Interpolation Zhuo-Xu Cui et.al. 2308.15918v1 null
2023-08-29 ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer Zachary Horvitz et.al. 2308.15459v1 link
2023-08-29 Vortex core radius in baroclinic turbulence: Implications for scaling predictions Gabriel Hadjerci et.al. 2308.15398v1 null
2023-08-29 Rayleigh-Bénard instability in a horizontal porous layer with anomalous diffusion Antonio Barletta et.al. 2308.15359v1 null
2023-08-30 Elucidating the Exposure Bias in Diffusion Models Mang Ning et.al. 2308.15321v2 link
2023-08-28 Total Selfie: Generating Full-Body Selfies Bowei Chen et.al. 2308.14740v1 null
2023-08-28 Oscillating reaction in porous media under saddle flow Satoshi Izumoto et.al. 2308.14723v1 null
2023-08-28 360-Degree Panorama Generation from Few Unregistered NFoV Images Jionghao Wang et.al. 2308.14686v1 link
2023-08-28 Effect of gas diffusion layer fiber shape on cathode two-phase dynamics in proton exchange membrane fuel cells Danan Yang et.al. 2308.14539v1 null
2023-08-28 Priority-Centric Human Motion Generation in Discrete Latent Space Hanyang Kong et.al. 2308.14480v1 null
2023-08-25 Distribution-Aligned Diffusion for Human Mesh Recovery Lin Geng Foo et.al. 2308.13369v1 null
2023-08-25 Age of Information Diffusion on Social Networks: Optimizing Multi-Stage Seeding Strategies Songhua Li et.al. 2308.13303v1 null
2023-08-25 EfficientDreamer: High-Fidelity and Robust 3D Creation via Orthogonal-view Diffusion Prior Minda Zhao et.al. 2308.13223v1 link
2023-08-25 Diff-Retinex: Rethinking Low-light Image Enhancement with A Generative Diffusion Model Xunpeng Yi et.al. 2308.13164v1 null
2023-08-25 A Survey of Diffusion Based Image Generation Models: Issues and Their Solutions Tianyi Zhang et.al. 2308.13142v1 null
2023-08-24 Dense Text-to-Image Generation with Attention Modulation Yunji Kim et.al. 2308.12964v1 link
2023-08-24 Language as Reality: A Co-Creative Storytelling Game Experience in 1001 Nights using Generative AI Yuqian Sun et.al. 2308.12915v1 null
2023-08-24 Hydrogen jet diffusion modeling by using physics-informed graph neural network and sparsely-distributed sensor data Xinqi Zhang et.al. 2308.12621v1 null
2023-08-24 APLA: Additional Perturbation for Latent Noise with Adversarial Training Enables Consistency Yupu Yao et.al. 2308.12605v1 null
2023-08-23 On-Manifold Projected Gradient Descent Aaron Mahler et.al. 2308.12279v1 null
2023-08-23 Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning Jiasheng Ye et.al. 2308.12219v1 link
2023-08-23 Pulse shape discrimination for the CONUS experiment in the keV and sub-keV regime H. Bonet et.al. 2308.12105v1 null
2023-08-22 Theory of Transverse Mode Instability in Fiber Amplifiers with Multimode Excitations Kabish Wisal et.al. 2308.11599v1 null
2023-08-22 NIPG-DG schemes for transformed master equations modeling open quantum systems Jose A. Morales Escalante et.al. 2308.11580v1 null
2023-08-22 IT3D: Improved Text-to-3D Generation with Explicit View Synthesis Yiwen Chen et.al. 2308.11473v1 link
2023-08-22 SDeMorph: Towards Better Facial De-morphing from Single Morph Nitish Shukla et.al. 2308.11442v1 null
2023-08-21 TADA! Text to Animatable Digital Avatars Tingting Liao et.al. 2308.10899v1 null
2023-08-21 Election Manipulation in Social Networks with Single-Peaked Agents Vincenzo Auletta et.al. 2308.10845v1 null
2023-08-21 Backdooring Textual Inversion for Concept Censorship Yutong wu et.al. 2308.10718v1 null
2023-08-21 EVE: Efficient zero-shot text-based Video Editing with Depth Map Guidance and Temporal Consistency Constraints Yutao Chen et.al. 2308.10648v1 null
2023-08-21 Frequency Compensated Diffusion Model for Real-scene Dehazing Jing Wang et.al. 2308.10510v1 link
2023-08-21 Texture Generation on 3D Meshes with Point-UV Diffusion Xin Yu et.al. 2308.10490v1 null
2023-08-18 Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization Soumik Mukhopadhyay et.al. 2308.09716v1 link
2023-08-18 HumanLiff: Layer-wise 3D Human Generation with Diffusion Model Shoukang Hu et.al. 2308.09712v1 null
2023-08-18 SimDA: Simple Diffusion Adapter for Efficient Video Generation Zhen Xing et.al. 2308.09710v1 null
2023-08-18 Guide3D: Create 3D Avatars from Text and Image Guidance Yukang Cao et.al. 2308.09705v1 null
2023-08-18 PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation Hanbing Liu et.al. 2308.09678v1 link
2023-08-18 Constrained Bayesian Optimization Using a Lagrange Multiplier Applied to Power Transistor Design Ping-Ju Chuang et.al. 2308.09612v1 null
2023-08-18 Language-Guided Diffusion Model for Visual Grounding Sijia Chen et.al. 2308.09599v1 null
2023-08-18 StableVideo: Text-driven Consistency-aware Diffusion Video Editing Wenhao Chai et.al. 2308.09592v1 link
2023-08-18 O^2-Recon: Completing 3D Reconstruction of Occluded Objects in the Scene with a Pre-trained 2D Diffusion Model Yubin Hu et.al. 2308.09591v1 link
2023-08-16 TeCH: Text-guided Reconstruction of Lifelike Clothed Humans Yangyi Huang et.al. 2308.08545v1 link
2023-08-16 Voxlines: Streamline Transparency through Voxelization and View-Dependent Line Orders Besm Osman et.al. 2308.08436v1 null
2023-08-16 Diff-CAPTCHA: An Image-based CAPTCHA with Security Enhanced by Denoising Diffusion Model Ran Jiang et.al. 2308.08367v1 null
2023-08-18 Dual-Stream Diffusion Net for Text-to-Video Generation Binhui Liu et.al. 2308.08316v2 null
2023-08-16 Electron transfer efficiency in liquid xenon across THGEM holes G. Martínez-Lema et.al. 2308.08314v1 null
2023-08-15 StyleDiffusion: Controllable Disentangled Style Transfer via Diffusion Models Zhizhong Wang et.al. 2308.07863v1 null
2023-08-15 CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction Yan Di et.al. 2308.07837v1 null
2023-08-15 DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker Embedding Jeongsoo Choi et.al. 2308.07787v1 link
2023-08-15 Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with Image Diffusion Model Bosheng Qin et.al. 2308.07749v1 null
2023-08-14 Jurassic World Remake: Bringing Ancient Fossils Back to Life via Zero-Shot Long Image-to-Image Translation Alexander Martin et.al. 2308.07316v1 link
2023-08-14 DiffSED: Sound Event Detection with Denoising Diffusion Swapnil Bhosale et.al. 2308.07293v1 null
2023-08-14 Diffusion Based Augmentation for Captioning and Retrieval in Cultural Heritage Dario Cioni et.al. 2308.07151v1 link
2023-08-14 Temporal clustering of social interactions trades-off disease spreading and knowledge diffusion Giulia Cencetti et.al. 2308.07058v1 link
2023-08-14 Bayesian Flow Networks Alex Graves et.al. 2308.07037v1 link
2023-08-14 An efficient topology optimization method for steady gas flows in all flow regimes Ruifeng Yuan et.al. 2308.07018v1 null
2023-08-14 Discrete Conditional Diffusion for Reranking in Recommendation Xiao Lin et.al. 2308.06982v1 null
2023-08-11 Acoustofluidic Engineering Functional Vessel-on-a-Chip Yue Wu et.al. 2308.06219v1 null
2023-08-11 DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models Weijia Wu et.al. 2308.06160v1 link
2023-08-11 Taming the Power of Diffusion Models for High-Quality Virtual Try-On with Appearance Flow Junhong Gou et.al. 2308.06101v1 link
2023-08-11 Diffusion-based Visual Counterfactual Explanations – Towards Systematic Quantitative Evaluation Philipp Vaeth et.al. 2308.06100v1 link
2023-08-11 Head Rotation in Denoising Diffusion Models Andrea Asperti et.al. 2308.06057v1 link
2023-08-11 Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning Chun-Mei Feng et.al. 2308.06038v1 link
2023-08-11 Masked-Attention Diffusion Guidance for Spatially Controlling Text-to-Image Generation Yuki Endo et.al. 2308.06027v1 link
2023-08-14 Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model Fan Zhang et.al. 2308.05995v2 null
2023-08-10 AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining Haohe Liu et.al. 2308.05734v1 link
2023-08-10 PDE-Refiner: Achieving Accurate Long Rollouts with Neural PDE Solvers Phillip Lippe et.al. 2308.05732v1 null
2023-08-10 Masked Diffusion as Self-supervised Representation Learner Zixuan Pan et.al. 2308.05695v1 null
2023-08-10 Generative Diffusion Models for Radio Wireless Channel Modelling and Sampling Ushnish Sengupta et.al. 2308.05583v1 null
2023-08-10 Fokker-Planck-Poisson kinetics: Multi-phase flow beyond equilibrium Mohsen Sadr et.al. 2308.05580v1 null
2023-08-09 LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation Leigang Qu et.al. 2308.05095v1 null
2023-08-09 Do Diffusion Models Suffer Error Propagation? Theoretical Analysis and Consistency Regularization Yangming Li et.al. 2308.05021v1 null
2023-08-10 IDiff-Face: Synthetic-based Face Recognition through Fizzy Identity-Conditioned Diffusion Models Fadi Boutros et.al. 2308.04995v2 link
2023-08-09 CasCIFF: A Cross-Domain Information Fusion Framework Tailored for Cascade Prediction in Social Networks Hongjun Zhu et.al. 2308.04961v1 link
2023-08-09 Interaction-induced directional transport on periodically driven chains Helena Drüeke et.al. 2308.04845v1 null
2023-08-08 DiffCR: A Fast Conditional Diffusion Framework for Cloud Removal from Optical Satellite Images Xuechao Zou et.al. 2308.04417v1 link
2023-08-08 Cloth2Tex: A Customized Cloth Texture Generation Pipeline for 3D Virtual Try-On Daiheng Gao et.al. 2308.04288v1 null
2023-08-08 MCDAN: a Multi-scale Context-enhanced Dynamic Attention Network for Diffusion Prediction Xiaowen Wang et.al. 2308.04266v1 null
2023-08-08 FLIRT: Feedback Loop In-context Red Teaming Ninareh Mehrabi et.al. 2308.04265v1 null
2023-08-08 MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion Yizhuo Lu et.al. 2308.04249v1 link
2023-08-08 Synthetic Augmentation with Large-scale Unconditional Pre-training Jiarong Ye et.al. 2308.04020v1 link
2023-08-07 Diffusion Model in Causal Inference with Unmeasured Confounders Tatsuhiro Shimizu et.al. 2308.03669v1 link
2023-08-07 AvatarVerse: High-quality & Stable 3D Avatar Creation from Text and Pose Huichao Zhang et.al. 2308.03610v1 link
2023-08-08 DiffSynth: Latent In-Iteration Deflickering for Realistic Video Synthesis Zhongjie Duan et.al. 2308.03463v2 link
2023-08-04 Quantum Dynamical Approach to Predicting the Optical Pumping Threshold for Lasing in Organic Materials Bin Zhang et.al. 2308.02447v1 null
2023-08-04 Diffusion-Augmented Depth Prediction with Sparse Annotations Jiaqi Li et.al. 2308.02283v1 null
2023-08-04 Painterly Image Harmonization using Diffusion Model Lingxiao Lu et.al. 2308.02228v1 link
2023-08-03 Synthesizing Long-Term Human Motions with Diffusion Models via Coherent Sampling Zhao Yang et.al. 2308.01850v1 link
2023-08-03 DiffColor: Toward High Fidelity Text-Guided Image Colorization with Diffusion Models Jianxin Lin et.al. 2308.01655v1 null
2023-08-03 Reference-Free Isotropic 3D EM Reconstruction using Diffusion Models Kyungryun Lee et.al. 2308.01594v1 null
2023-08-03 Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS Myeongjin Ko et.al. 2308.01573v1 link
2023-08-02 Patched Denoising Diffusion Models For High-Resolution Image Synthesis Zheng Ding et.al. 2308.01316v1 link
2023-08-02 Contrast-augmented Diffusion Model with Fine-grained Sequence Alignment for Markup-to-Image Generation Guojin Zhong et.al. 2308.01147v1 link
2023-08-02 DiffusePast: Diffusion-based Generative Replay for Class Incremental Semantic Segmentation Jingfan Chen et.al. 2308.01127v1 null
2023-08-01 Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models Cheng-Yu Hsieh et.al. 2308.00675v1 null
2023-08-01 Diffusion Model for Camouflaged Object Detection Zhennan Chen et.al. 2308.00303v1 null
2023-07-31 Universal Adversarial Defense in Remote Sensing Based on Pre-trained Denoising Diffusion Models Weikang Yu et.al. 2307.16865v1 null
2023-07-31 From Generation to Suppression: Towards Effective Irregular Glow Removal for Nighttime Visibility Enhancement Wanyu Wu et.al. 2307.16783v1 null
2023-07-31 Understanding Dynamics in Coarse-Grained Models: III. Roles of Rotational Motion and Translation-Rotation Coupling in Coarse-Grained Dynamics Jaehyeok Jin et.al. 2307.16747v1 null
2023-07-31 DiffPose: SpatioTemporal Diffusion Model for Video-Based Human Pose Estimation Runyang Feng et.al. 2307.16687v1 null
2023-07-31 On the Trustworthiness Landscape of State-of-the-art Generative Models: A Comprehensive Survey Mingyuan Fan et.al. 2307.16680v1 null
2023-07-28 Understanding the Anomalous Diffusion of Water in Aqueous Electrolytes Using Machine Learned Potentials Nikhil V. S. Avula et.al. 2307.15576v1 null
2023-07-28 Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding Chunyu Qiang et.al. 2307.15484v1 null
2023-07-27 The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation Lingdong Kong et.al. 2307.15061v1 link
2023-07-27 TEDi: Temporally-Entangled Diffusion for Long-Term Motion Synthesis Zihan Zhang et.al. 2307.15042v1 null
2023-07-27 Generative convective parametrization of dry atmospheric boundary layer Florian Heyder et.al. 2307.14857v1 null
2023-07-27 Empirical analysis of congestion spreading in Seoul traffic network Jung-Hoon Jung et.al. 2307.14800v1 null
2023-07-26 Virtual Mirrors: Non-Line-of-Sight Imaging Beyond the Third Bounce Diego Royo et.al. 2307.14341v1 null
2023-07-26 Visual Instruction Inversion: Image Editing via Visual Prompting Thao Nguyen et.al. 2307.14331v1 link
2023-07-26 Founding a mathematical diffusion model in linguistics. The case study of German syntactic features in the North-Eastern Italian dialects I. Lazzizzera et.al. 2307.14291v1 null
2023-07-26 VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by Using Diffusion Model with ControlNet Zhihao Hu et.al. 2307.14073v1 null
2023-07-25 Comparing phase-space and phenomenological modeling approaches for Lagrangian particles settling in a turbulent boundary layer Andrew P. Grace et.al. 2307.13659v1 null
2023-07-25 Fake It Without Making It: Conditioned Face Generation for Accurate 3D Face Shape Estimation Will Rowan et.al. 2307.13639v1 null
2023-07-25 XDLM: Cross-lingual Diffusion Language Model for Machine Translation Linyao Chen et.al. 2307.13560v1 null
2023-07-25 Not with my name! Inferring artists’ names of input strings employed by Diffusion Models Roberto Leotta et.al. 2307.13527v1 link
2023-07-24 A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models Jindong Gu et.al. 2307.12980v1 link
2023-07-24 Data-free Black-box Attack based on Diffusion Model Mingwen Shao et.al. 2307.12872v1 null
2023-07-24 Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry Yong-Hyun Park et.al. 2307.12868v1 link
2023-07-24 The ro-vibrational $ν_2$ mode spectrum of methane investigated by ultrabroadband coherent Raman spectroscopy Francesco Mazza et.al. 2307.12740v1 null
2023-07-21 FEDD – Fair, Efficient, and Diverse Diffusion-based Lesion Segmentation and Malignancy Classification Héctor Carrión et.al. 2307.11654v1 link
2023-07-21 Mixbiotic society measures: Assessment of community well-going as living system Takeshi Kato et.al. 2307.11594v1 null
2023-07-21 Predict, Refine, Synthesize: Self-Guiding Diffusion Models for Probabilistic Time Series Forecasting Marcel Kollovieh et.al. 2307.11494v1 link
2023-07-20 Hypergraph Diffusions and Resolvents for Norm-Based Hypergraph Laplacians Konstantinos Ameranis et.al. 2307.11042v1 null
2023-07-20 Progressive distillation diffusion for raw music generation Svetlana Pavlova et.al. 2307.10994v1 null
2023-07-20 Energy-consistent discretization of viscous dissipation with application to natural convection flow Benjamin Sanderse et.al. 2307.10874v1 null
2023-07-19 FABRIC: Personalizing Diffusion Models with Iterative Feedback Dimitri von Rütte et.al. 2307.10159v1 link
2023-07-19 XSkill: Cross Embodiment Skill Discovery Mengda Xu et.al. 2307.09955v1 link
2023-07-19 Visual Representation for Patterned Proliferation of Social Media Addiction: Quantitative Model and Network Analysis Dibyajyoti Mallick et.al. 2307.09902v1 null
2023-07-19 BSDM: Background Suppression Diffusion Model for Hyperspectral Anomaly Detection Jitao Ma et.al. 2307.09861v1 link
2023-07-19 A Siamese-based Verification System for Open-set Architecture Attribution of Synthetic Images Lydia Abady et.al. 2307.09822v1 link
2023-07-18 AnyDoor: Zero-shot Object-level Image Customization Xi Chen et.al. 2307.09481v1 link
2023-07-18 Augmenting CLIP with Improved Visio-Linguistic Reasoning Samyadeep Basu et.al. 2307.09233v1 null
2023-07-17 Diffusion Models Beat GANs on Image Classification Soumik Mukhopadhyay et.al. 2307.08702v1 null
2023-07-17 Flow Matching in Latent Space Quan Dao et.al. 2307.08698v1 link
2023-07-17 SEMI-DiffusionInst: A Diffusion Model Based Approach for Semiconductor Defect Classification and Segmentation Vic De Ridder et.al. 2307.08693v1 null
2023-07-17 Multimodal Diffusion Segmentation Model for Object Segmentation from Manipulation Instructions Yui Iioka et.al. 2307.08597v1 null
2023-07-17 Identity-Preserving Aging of Face Images via Latent Diffusion Models Sudipta Banerjee et.al. 2307.08585v1 link
2023-07-17 Synthetic Lagrangian Turbulence by Generative Diffusion Models Tianyi Li et.al. 2307.08529v1 link
2023-07-17 How far does turbulence spread? Alexandros Alexakis et.al. 2307.08469v1 null
2023-07-17 Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation Luozhou Wang et.al. 2307.08448v1 link
2023-07-18 Unstoppable Attack: Label-Only Model Inversion via Conditional Diffusion Model Rongke Liu et.al. 2307.08424v2 null
2023-07-14 NIFTY: Neural Object Interaction Fields for Guided Human Motion Synthesis Nilesh Kulkarni et.al. 2307.07511v1 null
2023-07-14 DreamTeacher: Pretraining Image Backbones with Deep Generative Models Daiqing Li et.al. 2307.07487v1 null
2023-07-14 Inverse Evolution Layers: Physics-informed Regularizers for Deep Neural Networks Chaoyu Liu et.al. 2307.07344v1 null
2023-07-14 High-density single-molecule maps reveal transient membrane receptor interactions within a dynamically varying environment Nicolas Mateos et.al. 2307.07334v1 null
2023-07-14 Multimodal Motion Conditioned Diffusion Model for Skeleton-based Video Anomaly Detection Alessandro Flaborea et.al. 2307.07205v1 link
2023-07-14 Federated Learning-Empowered AI-Generated Content in Wireless Networks Xumin Huang et.al. 2307.07146v1 null
2023-07-13 HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models Nataniel Ruiz et.al. 2307.06949v1 null
2023-07-12 Exposing the Fake: Effective Diffusion-Generated Images Detection Ruipeng Ma et.al. 2307.06272v1 null
2023-07-12 Diffusion Based Multi-Agent Adversarial Tracking Sean Ye et.al. 2307.06244v1 null
2023-07-12 Functional light diffusers based on hybrid CsPbBr $_3$/SiO$_2$ aero-framework structures for laser light illumination and conversion Lena M. Saure et.al. 2307.06197v1 null
2023-07-12 Biofilm.jl: a fast solver for one-dimensional biofilm chemistry and ecology Mark Owkes et.al. 2307.06153v1 link
2023-07-11 Metropolis Sampling for Constrained Diffusion Models Nic Fishman et.al. 2307.05439v1 null
2023-07-11 On the Vulnerability of DeepFake Detectors to Attacks Generated by Denoising Diffusion Models Marija Ivanovska et.al. 2307.05397v1 null
2023-07-10 Shelving, Stacking, Hanging: Relational Pose Diffusion for Multi-modal Rearrangement Anthony Simeonov et.al. 2307.04751v1 null
2023-07-10 Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback Jaskirat Singh et.al. 2307.04749v1 null
2023-07-10 Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning Suzan Ece Ada et.al. 2307.04726v1 null
2023-07-10 AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning Yuwei Guo et.al. 2307.04725v1 link
2023-07-10 Machine learning potentials with Iterative Boltzmann Inversion: training to experiment Sakib Matin et.al. 2307.04712v1 null
2023-07-10 Encapsulation Structure and Dynamics in Hypergraphs Timothy LaRock et.al. 2307.04613v1 link
2023-07-07 Three-dimensional Vorticity Effects on Extinction Behavior of Laminar Flamelets Wes Hellwig et.al. 2307.03695v1 null
2023-07-07 Simulation-free Schrödinger bridges via score and flow matching Alexander Tong et.al. 2307.03672v1 link
2023-07-07 IPO-LDM: Depth-aided 360-degree Indoor RGB Panorama Outpainting via Latent Diffusion Model Tianhao Wu et.al. 2307.03177v2 null
2023-07-06 How to Detect Unauthorized Data Usages in Text-to-image Diffusion Models Zhenting Wang et.al. 2307.03108v1 link
2023-07-06 Origin-Destination Travel Time Oracle for Map-based Services Yan Lin et.al. 2307.03048v1 null
2023-07-06 Multi-modal multi-class Parkinson disease classification using CNN and decision level fusion Sushanta Kumar Sahu et.al. 2307.02978v1 null
2023-07-06 On the Cultural Gap in Text-to-Image Generation Bingshuai Liu et.al. 2307.02971v1 null
2023-07-05 DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models Chong Mou et.al. 2307.02421v1 link
2023-07-05 RADiff: Controllable Diffusion Models for Radio Astronomical Maps Generation Renato Sortino et.al. 2307.02392v1 null
2023-07-06 Error Approximation and Bias Correction in Dynamic Problems using a Recurrent Neural Network/Finite Element Hybrid Model Moritz von Tresckow et.al. 2307.02349v2 null
2023-07-05 Detecting Images Generated by Deep Diffusion Models using their Local Intrinsic Dimensionality Peter Lorenz et.al. 2307.02347v1 link
2023-07-05 SVDM: Single-View Diffusion Model for Pseudo-Stereo 3D Object Detection Yuguang Shi et.al. 2307.02270v1 null
2023-07-03 Improved sampling via learned diffusions Lorenz Richter et.al. 2307.01198v1 null
2023-07-03 Squeezing Large-Scale Diffusion Models for Mobile Jiwoong Choi et.al. 2307.01193v1 null
2023-07-03 Learning Mixtures of Gaussians Using the DDPM Objective Kulin Shah et.al. 2307.01178v1 null
2023-07-03 Investigating Data Memorization in 3D Latent Diffusion Models for Medical Image Synthesis Salman Ul Hassan Dar et.al. 2307.01148v1 null
2023-07-03 A phase field-based framework for electro-chemo-mechanical fracture: crack-contained electrolytes, chemical reactions and stabilisation T. Hageman et.al. 2307.01105v1 null
2023-07-03 MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion Shitao Tang et.al. 2307.01097v1 link
2023-07-03 TomatoDIFF: On-plant Tomato Segmentation with Denoising Diffusion Models Marija Ivanovska et.al. 2307.01064v1 link
2023-06-30 Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors Guocheng Qian et.al. 2306.17843v1 link
2023-06-30 Content-Preserving Diffusion Model for Unsupervised AS-OCT image Despeckling Li Sanqian et.al. 2306.17717v1 null
2023-06-29 Generate Anything Anywhere in Any Scene Yuheng Li et.al. 2306.17154v1 null
2023-06-29 Filtered-Guided Diffusion: Fast Filter Guidance for Black-Box Diffusion Models Zeqi Gu et.al. 2306.17141v1 link
2023-06-29 ID-Pose: Sparse-view Camera Pose Estimation by Inverting Diffusion Models Weihao Cheng et.al. 2306.17140v1 null
2023-06-29 Learning Nuclei Representations with Masked Image Modelling Piotr Wójcik et.al. 2306.17116v1 null
2023-06-29 Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation Zibo Zhao et.al. 2306.17115v1 link
2023-06-29 Towards rapid extracellular vesicles colorimetric detection using optofluidics-enhanced color-changing optical metasurface Chuchuan Hong et.al. 2306.17102v1 null
2023-06-28 DiffComplete: Diffusion-based Generative 3D Shape Completion Ruihang Chu et.al. 2306.16329v1 null
2023-06-28 UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data Heeseung Kim et.al. 2306.16083v1 link
2023-06-28 PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment Jianyuan Wang et.al. 2306.15667v2 null
2023-06-27 Stabilizing ultrathin Silver (Ag) films on different substrates Allamula Ashok et.al. 2306.15575v1 null
2023-06-27 Trajectory Generation, Control, and Safety with Denoising Diffusion Probabilistic Models Nicolò Botteghi et.al. 2306.15512v1 link
2023-06-27 Miniaturized gas-solid fluidized beds Fernando David Cúñez Benalcázar et.al. 2306.15463v1 null
2023-06-27 Adversarial Training for Graph Neural Networks Lukas Gosch et.al. 2306.15427v1 null
2023-06-26 Fuzzy-Conditioned Diffusion and Diffusion Projection Attention Applied to Facial Image Correction Majed El Helou et.al. 2306.14891v1 link
2023-06-26 Restart Sampling for Improving Generative Processes Yilun Xu et.al. 2306.14878v1 link
2023-06-26 ViNT: A Foundation Model for Visual Navigation Dhruv Shah et.al. 2306.14846v1 null
2023-06-26 ProtoDiff: Learning to Learn Prototypical Networks by Task-Guided Diffusion Yingjun Du et.al. 2306.14770v1 link
2023-06-23 Fast Macroscopic Forcing Method Spencer H. Bryngelson et.al. 2306.13625v1 link
2023-06-23 DreamEditor: Text-Driven 3D Scene Editing with Neural Fields Jingyu Zhuang et.al. 2306.13455v1 link
2023-06-22 Continuous Layout Editing of Single Images with Diffusion Models Zhiyuan Zhang et.al. 2306.13078v1 null
2023-06-22 Towards More Realistic Membership Inference Attacks on Large Diffusion Models Jan Dubiński et.al. 2306.12983v1 null
2023-06-22 On the nature of the two-positron bond: Evidence for a novel bond type Mohammad Goli et.al. 2306.12899v1 null
2023-06-22 Stress-induced Artificial neuron spiking in Diffusive memristors Debi Pattnaik et.al. 2306.12853v1 null
2023-06-21 DreamTime: An Improved Optimization Strategy for Text-to-3D Content Creation Yukun Huang et.al. 2306.12422v1 null
2023-06-21 HumanDiffusion: diffusion model using perceptual gradients Yota Ueda et.al. 2306.12169v1 null
2023-06-20 Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning Huiguo He et.al. 2306.11731v1 null
2023-06-20 Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision Ayush Tewari et.al. 2306.11719v1 null
2023-06-20 Align, Adapt and Inject: Sound-guided Unified Image Generation Yue Yang et.al. 2306.11504v1 null
2023-06-20 EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model Lianying Yin et.al. 2306.11496v1 null
2023-06-16 Group Orthogonalization Regularization For Vision Models Adaptation and Robustness Yoav Kurtz et.al. 2306.10001v1 link
2023-06-16 Towards Better Certified Segmentation via Diffusion Models Othmane Laousy et.al. 2306.09949v1 null
2023-06-16 Unique information from common diffusion MRI models about white-matter differences across the human adult lifespan Rafael Neto Henriques1 et.al. 2306.09942v1 link
2023-06-16 Drag-guided diffusion models for vehicle image generation Nikos Arechiga et.al. 2306.09935v1 null
2023-06-16 Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models Geon Yeong Park et.al. 2306.09869v1 link
2023-06-16 AvatarBooth: High-Quality and Customizable 3D Human Avatar Generation Yifei Zeng et.al. 2306.09864v1 null
2023-06-15 Generative Proxemics: A Prior for 3D Social Interaction from Images Lea Müller et.al. 2306.09337v1 link
2023-06-15 ArtFusion: Arbitrary Style Transfer using Dual Conditional Latent Diffusion Models Dar-Yen Chen et.al. 2306.09330v1 link
2023-06-15 Diffusion Models for Zero-Shot Open-Vocabulary Segmentation Laurynas Karazija et.al. 2306.09316v1 null
2023-06-15 Fast Training of Diffusion Models with Masked Transformers Hongkai Zheng et.al. 2306.09305v1 link
2023-06-15 Conditional Human Sketch Synthesis with Explicit Abstraction Control Dar-Yen Chen et.al. 2306.09274v1 null
2023-06-15 Training Diffusion Classifiers with Denoising Assistance Chandramouli Sastry et.al. 2306.09192v1 null
2023-06-13 Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation Shuai Yang et.al. 2306.07954v1 null
2023-06-13 Viewset Diffusion: (0-)Image-Conditioned 3D Generative Models from 2D Data Stanislaw Szymanowicz et.al. 2306.07881v1 null
2023-06-13 Diffusive and convective dissolution of carbon dioxide in a vertical cylindrical cell Daniël P. Faasen et.al. 2306.07721v1 null
2023-06-12 Controlling Text-to-Image Diffusion by Orthogonal Finetuning Zeju Qiu et.al. 2306.07280v1 null
2023-06-12 MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images Junchen Zhu et.al. 2306.07257v1 null
2023-06-12 Diffusion Models for Black-Box Optimization Siddarth Krishnamoorthy et.al. 2306.07180v1 link
2023-06-12 InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions Jiale Xu et.al. 2306.07154v1 null
2023-06-12 Latent Dynamical Implicit Diffusion Processes Mohammad R. Rezaei et.al. 2306.07077v1 null
2023-06-09 Bridging Scales: a Hybrid Model to Simulate Vascular Tumor Growth and Treatment Response Tobias Duswald et.al. 2306.05994v1 link
2023-06-09 DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles Tal Daniel et.al. 2306.05957v1 link
2023-06-09 Beyond Surface Statistics: Scene Representations in a Latent Diffusion Model Yida Chen et.al. 2306.05720v1 link
2023-06-12 Boosting Fast and High-Quality Speech Synthesis with Linear Diffusion Haogeng Liu et.al. 2306.05708v2 null
2023-06-09 RePaint-NeRF: NeRF Editting via Semantic Masks and Diffusion Models Xingchen Zhou et.al. 2306.05668v1 null
2023-06-08 Grounded Text-to-Image Synthesis with Attention Refocusing Quynh Phung et.al. 2306.05427v1 null
2023-06-08 ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process Changyao Tian et.al. 2306.05423v1 null
2023-06-08 Stochastic Multi-Person 3D Motion Forecasting Sirui Xu et.al. 2306.05421v1 link
2023-06-08 Improving Negative-Prompt Inversion via Proximal Guidance Ligong Han et.al. 2306.05414v1 link
2023-06-08 PriSampler: Mitigating Property Inference of Diffusion Models Hailong Hu et.al. 2306.05208v1 null
2023-06-08 SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions Yuseung Lee et.al. 2306.05178v1 null
2023-06-07 Designing a Better Asymmetric VQGAN for StableDiffusion Zixin Zhu et.al. 2306.04632v1 link
2023-06-07 ARTIC3D: Learning Robust Articulated 3D Shapes from Noisy Web Image Collections Chun-Han Yao et.al. 2306.04619v1 null
2023-06-08 Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt Kai Chen et.al. 2306.04607v2 null
2023-06-07 On the Design Fundamentals of Diffusion Models: A Survey Ziyi Chang et.al. 2306.04542v1 null
2023-06-07 Multi-modal Latent Diffusion Mustapha Bounoua et.al. 2306.04445v1 null
2023-06-07 Synthesizing realistic sand assemblies with denoising diffusion in latent space Nikolaos N. Vlassis et.al. 2306.04411v1 null
2023-06-07 Improving Diffusion-based Image Translation using Asymmetric Gradient Guidance Gihyun Kwon et.al. 2306.04396v1 link
2023-06-06 Emergent Correspondence from Image Diffusion Luming Tang et.al. 2306.03881v1 link
2023-06-06 Conditional Diffusion Models for Weakly Supervised Medical Image Segmentation Xinrong Hu et.al. 2306.03878v1 link
2023-06-06 Newly Formed Cities: an AI Curation Dario Negueruela del Castillo et.al. 2306.03753v1 null
2023-06-06 Towards Visual Foundational Models of Physical Scenes Chethan Parameshwara et.al. 2306.03727v1 null
2023-06-06 Diffusional exchange versus microscopic kurtosis from CTI: two conflicting interpretations of the same data Arthur Chakwizira et.al. 2306.03661v1 null
2023-06-05 Brain Diffusion for Visual Exploration: Cortical Discovery using Large Scale Generative Models Andrew F. Luo et.al. 2306.03089v1 null
2023-06-05 MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion Chiyu Max Jiang et.al. 2306.03083v1 null
2023-06-05 Influence of the finite transverse size of the accelerating region on the relativistic feedback Alexander Sedelnikov et.al. 2306.03059v1 null
2023-06-05 HeadSculpt: Crafting 3D Head Avatars with Text Xiao Han et.al. 2306.03038v1 null
2023-06-05 Interpretable Alzheimer’s Disease Classification Via a Contrastive Diffusion Autoencoder Ayodeji Ijishakin et.al. 2306.03022v1 link
2023-06-05 Complex Preferences for Different Convergent Priors in Discrete Graph Diffusion Alex M. Tseng et.al. 2306.02957v1 null
2023-06-05 INDigo: An INN-Guided Probabilistic Diffusion Algorithm for Inverse Problems Di You et.al. 2306.02949v1 null
2023-06-05 Instruct-Video2Avatar: Video-to-Avatar Generation with Instructions Shaoxu Li et.al. 2306.02903v1 link
2023-06-02 Video Colorization with Pre-trained Text-to-Image Diffusion Models Hanyuan Liu et.al. 2306.01732v1 null
2023-06-02 Denoising Diffusion Semantic Segmentation with Mask Prior Modeling Zeqiang Lai et.al. 2306.01721v1 link
2023-06-02 DiffusEmp: A Diffusion Model-Based Framework with Multi-Grained Control for Empathetic Response Generation Guanqun Bi et.al. 2306.01657v1 null
2023-06-02 Influence Maximization with Fairness at Scale (Extended Version) Yuting Feng et.al. 2306.01587v1 null
2023-06-02 PolyDiffuse: Polygonal Shape Reconstruction via Guided Set Diffusion Models Jiacheng Chen et.al. 2306.01461v1 link
2023-06-02 Diffusion Self-Guidance for Controllable Image Generation Dave Epstein et.al. 2306.00986v2 null
2023-06-01 StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners Yonglong Tian et.al. 2306.00984v1 link
2023-06-01 StyleDrop: Text-to-Image Generation in Any Style Kihyuk Sohn et.al. 2306.00983v1 null
2023-06-01 SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds Yanyu Li et.al. 2306.00980v1 link
2023-06-01 Intriguing Properties of Text-guided Diffusion Models Qihao Liu et.al. 2306.00974v1 link
2023-06-01 Intelligent Grimm – Open-ended Visual Storytelling via Latent Diffusion Models Chang Liu et.al. 2306.00973v1 link
2023-06-01 ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation Shaozhe Hao et.al. 2306.00971v1 link
2023-06-01 The Hidden Language of Diffusion Models Hila Chefer et.al. 2306.00966v1 link
2023-06-01 Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image Generation Minghui Hu et.al. 2306.00964v1 null
2023-06-01 Differential Diffusion: Giving Each Pixel Its Strength Eran Levin et.al. 2306.00950v1 link
2023-05-31 Learning Explicit Contact for Implicit Reconstruction of Hand-held Objects from Monocular Images Junxing Hu et.al. 2305.20089v1 null
2023-05-31 Understanding and Mitigating Copying in Diffusion Models Gowthami Somepalli et.al. 2305.20086v1 link
2023-05-31 Control4D: Dynamic Portrait Editing by Learning 4D GAN from 2D Diffusion-based Editor Ruizhi Shao et.al. 2305.20082v1 null
2023-05-31 Efficient Diffusion Policies for Offline Reinforcement Learning Bingyi Kang et.al. 2305.20081v1 link
2023-05-31 A Unified Conditional Framework for Diffusion-based Image Restoration Yi Zhang et.al. 2305.20049v1 link
2023-06-01 Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust Yuxin Wen et.al. 2305.20030v2 link
2023-05-31 Protein Design with Guided Discrete Diffusion Nate Gruver et.al. 2305.20009v1 link
2023-05-31 GANDiffFace: Controllable Generation of Synthetic Datasets for Face Recognition with Realistic Variations Pietro Melzi et.al. 2305.19962v1 null
2023-05-31 A Geometric Perspective on Diffusion Models Defang Chen et.al. 2305.19947v1 null
2023-05-30 Ambient Diffusion: Learning Clean Distributions from Corrupted Data Giannis Daras et.al. 2305.19256v1 link
2023-05-30 PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation Jialu Li et.al. 2305.19195v1 null
2023-05-30 Video ControlNet: Towards Temporally Consistent Synthetic-to-Real Video Translation Using Conditional Image Diffusion Models Ernie Chu et.al. 2305.19193v1 null
2023-05-30 Calliffusion: Chinese Calligraphy Generation and Style Transfer with Diffusion Modeling Qisheng Liao et.al. 2305.19124v1 null
2023-05-30 DiffMatch: Diffusion Model for Dense Matching Jisu Nam et.al. 2305.19094v1 link
2023-05-30 Likelihood-Based Diffusion Language Models Ishaan Gulrajani et.al. 2305.18619v1 link
2023-05-29 RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths Zeyue Xue et.al. 2305.18295v1 null
2023-05-29 Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models Yuchao Gu et.al. 2305.18292v1 link
2023-05-29 Photoswap: Personalized Subject Swapping in Images Jing Gu et.al. 2305.18286v1 null
2023-05-29 Reconstructing the Mind’s Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors Paul S. Scotti et.al. 2305.18274v1 link
2023-05-29 Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising Fu-Yun Wang et.al. 2305.18264v1 link
2023-05-29 GlyphControl: Glyph Conditional Control for Visual Text Generation Yukang Yang et.al. 2305.18259v1 link
2023-05-26 Improving accuracy of GPT-3/4 results on biomedical data using a retrieval-augmented language model David Soong et.al. 2305.17116v1 null
2023-05-26 ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing Min Zhao et.al. 2305.17098v1 link
2023-05-26 The reaction-diffusion basis of animated patterns in eukaryotic flagella James F. Cass et.al. 2305.17032v1 link
2023-05-26 Accelerating Diffusion Models for Inverse Problems through Shortcut Sampling Gongye Liu et.al. 2305.16965v1 link
2023-05-26 Learning to Imagine: Visually-Augmented Natural Language Generation Tianyi Tang et.al. 2305.16944v1 link
2023-05-26 DiffusionNAG: Task-guided Neural Architecture Generation with Diffusion Models Sohyun An et.al. 2305.16943v1 link
2023-05-26 CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganography Jiwen Yu et.al. 2305.16936v1 link
2023-05-26 Turbulence calculation based on the extended Naiver-Stokes equations Shanwen Tan et.al. 2305.16923v1 null
2023-05-25 Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models Shihao Zhao et.al. 2305.16322v1 link
2023-05-25 Eclipse: Disambiguating Illumination and Materials using Unintended Shadows Dor Verbin et.al. 2305.16321v1 null
2023-05-25 Parallel Sampling of Diffusion Models Andy Shih et.al. 2305.16317v1 link
2023-05-25 NAP: Neural 3D Articulation Prior Jiahui Lei et.al. 2305.16315v1 null
2023-05-25 UMat: Uncertainty-Aware Single Image High Resolution Material Capture Carlos Rodriguez-Pardo et.al. 2305.16312v1 null
2023-05-25 Break-A-Scene: Extracting Multiple Concepts from a Single Image Omri Avrahami et.al. 2305.16311v1 link
2023-05-25 Look Ma, No Hands! Agent-Environment Factorization of Egocentric Videos Matthew Chang et.al. 2305.16301v1 null
2023-05-25 Diversify Your Vision Datasets with Automatic Diffusion-Based Augmentation Lisa Dunlap et.al. 2305.16289v1 link
2023-05-25 CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graphs Guangyao Zhai et.al. 2305.16283v1 link
2023-05-25 UDPM: Upsampling Diffusion Probabilistic Models Shady Abu-Hussein et.al. 2305.16269v1 link
2023-05-24 Sin3DM: Learning a Diffusion Model from a Single 3D Textured Shape Rundi Wu et.al. 2305.15399v1 link
2023-05-24 A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence Junyi Zhang et.al. 2305.15347v1 link
2023-05-24 Training on Thin Air: Improve Image Classification with Generated Data Yongchao Zhou et.al. 2305.15316v1 link
2023-05-24 MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation Marco Bellagente et.al. 2305.15296v1 null
2023-05-23 Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence Grace Luo et.al. 2305.14334v1 null
2023-05-23 SEEDS: Exponential SDE Solvers for Fast High-Quality Sampling from Diffusion Models Martin Gonzalez et.al. 2305.14267v1 link
2023-05-23 Improved Convergence of Score-Based Diffusion Models via Prediction-Correction Francesco Pedrotti et.al. 2305.14164v1 null
2023-05-23 Realistic Noise Synthesis with Diffusion Models Qi Wu et.al. 2305.14022v1 null
2023-05-23 Lightweight Channel Codes for ISI Mitigation in Molecular Communication between Bionanosensors Dongliang Jing et.al. 2305.14001v1 null
2023-05-23 Node-wise Diffusion for Scalable Graph Learning Keke Huang et.al. 2305.14000v1 link
2023-05-22 VDT: An Empirical Study on Video Diffusion with Transformers Haoyu Lu et.al. 2305.13311v1 link
2023-05-22 If at First You Don’t Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection Shyamgopal Karthik et.al. 2305.13308v1 link
2023-05-23 Training Diffusion Models with Reinforcement Learning Kevin Black et.al. 2305.13301v2 link
2023-05-22 DiffusionNER: Boundary Diffusion for Named Entity Recognition Yongliang Shen et.al. 2305.13298v1 link
2023-05-22 U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech Xin Jing et.al. 2305.13195v1 null
2023-05-22 Policy Representation via Diffusion Probability Model for Reinforcement Learning Long Yang et.al. 2305.13122v1 link
2023-05-22 Energy cascade in the Garrett-Munk spectrum of internal gravity waves Yue Wu et.al. 2305.13110v1 null
2023-05-19 Chupa: Carving 3D Clothed Humans from Skinned Shape Priors using 2D Diffusion Probabilistic Models Byungjun Kim et.al. 2305.11870v1 link
2023-05-19 Any-to-Any Generation via Composable Diffusion Zineng Tang et.al. 2305.11846v1 link
2023-05-19 The probability flow ODE is provably fast Sitan Chen et.al. 2305.11798v1 null
2023-05-19 Cinematic Mindscapes: High-quality Video Reconstruction from Brain Activity Zijiao Chen et.al. 2305.11675v1 null
2023-05-19 Few-shot 3D Shape Generation Jingyuan Zhu et.al. 2305.11664v1 null
2023-05-19 Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields Jingbo Zhang et.al. 2305.11588v1 link
2023-05-19 Brain Captioning: Decoding human brain activity into images and text Matteo Ferrante et.al. 2305.11560v1 null
2023-05-19 Efficient Cross-Lingual Transfer for Chinese Stable Diffusion with Images as Pivots Jinyi Hu et.al. 2305.11540v1 null
2023-05-19 Late-Constraint Diffusion Guidance for Controllable Image Synthesis Chang Liu et.al. 2305.11520v1 link
2023-05-19 DiffuSIA: A Spiral Interaction Architecture for Encoder-Decoder Text Diffusion Chao-Hong Tan et.al. 2305.11517v1 null
2023-05-18 UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild Can Qin et.al. 2305.11147v1 link
2023-05-18 Blackout Diffusion: Generative Diffusion Models in Discrete-State Spaces Javier E Santos et.al. 2305.11089v1 link
2023-05-18 Inspecting the Geographical Representativeness of Images from Text-to-Image Models Abhipsa Basu et.al. 2305.11080v1 null
2023-05-18 Unsupervised Pansharpening via Low-rank Diffusion Model Xiangyu Rui et.al. 2305.10925v1 link
2023-05-18 Structural Pruning for Diffusion Models Gongfan Fang et.al. 2305.10924v1 link
2023-05-18 VideoFactory: Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation Wenjing Wang et.al. 2305.10874v1 null
2023-05-17 FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention Guangxuan Xiao et.al. 2305.10431v1 link
2023-05-17 Raising the Bar for Certified Adversarial Robustness with Diffusion Models Thomas Altstidl et.al. 2305.10388v1 null
2023-05-17 A phase field model for droplets suspended in viscous liquids under the influence of electric fields Yuzhe Qin et.al. 2305.10296v1 null
2023-05-17 Provably Correct Physics-Informed Neural Networks Francisco Eiras et.al. 2305.10157v1 null
2023-05-18 Controllable Mind Visual Diffusion Model Bohan Zeng et.al. 2305.10135v2 link
2023-05-16 Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation Samaneh Azadi et.al. 2305.09662v1 null
2023-05-16 FitMe: Deep Photorealistic 3D Morphable Model Avatars Alexandros Lattas et.al. 2305.09641v1 null
2023-05-16 AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation Tong Wu et.al. 2305.09515v1 link
2023-05-16 Discrete Diffusion Probabilistic Models for Symbolic Music Generation Matthias Plasser et.al. 2305.09489v1 link
2023-05-17 Multi-Level Global Context Cross Consistency Model for Semi-Supervised Ultrasound Image Segmentation with Diffusion Model Fenghe Tang et.al. 2305.09447v2 link
2023-05-16 Diffusion Dataset Generation: Towards Closing the Sim2Real Gap for Pedestrian Detection Andrew Farley et.al. 2305.09401v1 null
2023-05-17 AMD: Autoregressive Motion Diffusion Bo Han et.al. 2305.09381v2 null
2023-05-15 Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models Antoni Bigata Casademunt et.al. 2305.08854v1 link
2023-05-15 Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts Yuyang Zhao et.al. 2305.08850v1 null
2023-05-15 The role of magnetic helicity when it is absent on average Axel Brandenburg et.al. 2305.08769v1 null
2023-05-15 Diffusion-weighted SPECIAL improves the detection of J-coupled metabolites at ultra-high magnetic field Jessie Mosso et.al. 2305.08708v1 null
2023-05-15 A Reproducible Extraction of Training Images from Diffusion Models Ryan Webster et.al. 2305.08694v1 link
2023-05-12 Sound waves, diffusive transport, and wall slip in nanoconfined compressible fluids Hannes Holey et.al. 2305.07501v1 null
2023-05-12 On a Voter Model with Context-Dependent Opinion Adoption Luca Becchetti et.al. 2305.07377v1 null
2023-05-12 Experimental optimization of lensless digital holographic microscopy with rotating diffuser-based coherent noise reduction Piotr Arcab et.al. 2305.07373v1 null
2023-05-12 Penguin huddling: a continuum model Samuel J. Harris et.al. 2305.07324v1 link
2023-05-15 Phosphorus-Controlled Nanoepitaxy in the Asymmetric Growth of GaAs-InP Core-Shell Bent Nanowires Spencer McDermott et.al. 2305.07252v2 null
2023-05-12 Optimal calibration of optical tweezers with arbitrary integration time and sampling frequencies – A general framework Laura Pérez-Garcéa et.al. 2305.07245v1 null
2023-05-15 Fully quantum algorithm for lattice Boltzmann methods with application to partial differential equations Fatima Ezahra Chrit et.al. 2305.07148v2 link
2023-05-11 Exploiting Diffusion Prior for Real-World Image Super-Resolution Jianyi Wang et.al. 2305.07015v1 link
2023-05-11 A method for automated regression test in scientific computing libraries: illustration with SPHinXsys Bo Zhang et.al. 2305.06970v1 link
2023-05-11 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model Zhen Ye et.al. 2305.06908v1 link
2023-05-11 Null-text Guidance in Diffusion Models is Secretly a Cartoon-style Creator Jing Zhao et.al. 2305.06710v1 null
2023-05-11 Evaluating Twitter’s Algorithmic Amplification of Low-Trust Content: An Observational Study Giulio Corsi et.al. 2305.06125v2 link
2023-05-10 Relightify: Relightable 3D Faces from a Single Image via Diffusion Models Foivos Paraperas Papantoniou et.al. 2305.06077v1 null
2023-05-10 iEdit: Localised Text-guided Image Editing with Weak Supervision Rumeysa Bodur et.al. 2305.05947v1 null
2023-05-09 Large Language Models Humanize Technology Pratyush Kumar et.al. 2305.05576v1 null
2023-05-09 Style-A-Video: Agile Diffusion for Arbitrary Text-based Video Style Transfer Nisha Huang et.al. 2305.05464v1 link
2023-05-10 Large Language Models Need Holistically Thought in Medical Conversational QA Yixuan Weng et.al. 2305.05410v2 link
2023-05-09 The Multi-cluster Two-Wave Fading Model Juan P. Pena-Martin et.al. 2305.05342v1 null
2023-05-08 DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models Sicheng Yang et.al. 2305.04919v1 link
2023-05-08 CaloClouds: Fast Geometry-Independent Highly-Granular Calorimeter Simulation Erik Buhmann et.al. 2305.04847v1 link
2023-05-08 A Drop of Ink may Make a Million Think: The Spread of False Information in Large Language Models Ning Bian et.al. 2305.04812v1 null
2023-05-08 Controllable Light Diffusion for Portraits David Futschik et.al. 2305.04745v1 null
2023-05-08 A Closest Point Method for Surface PDEs with Interior Boundary Conditions for Geometry Processing Nathan King et.al. 2305.04711v1 null
2023-05-08 ReGeneration Learning of Diffusion Models with Rich Prompts for Zero-Shot Image Translation Yupei Lin et.al. 2305.04651v1 null
2023-05-05 Reflection of a Diffuser in a Liquid Interface C. Silva et.al. 2305.03682v1 null
2023-05-05 Conditional Diffusion Feature Refinement for Continuous Sign Language Recognition Leming Guo et.al. 2305.03614v1 null
2023-05-05 Data Curation for Image Captioning with Text-to-Image Generative Models Wenyan Li et.al. 2305.03610v1 link
2023-05-04 Personalize Segment Anything Model with One Shot Renrui Zhang et.al. 2305.03048v1 link
2023-05-05 Capacity Bounds for Vertically-Drifted First Arrival Position Channels under a Second-Moment Constraint Yun-Feng Lo et.al. 2305.02706v2 null
2023-05-03 Nonlocal gravity wave turbulence in presence of condensate A. O. Korotkevich et.al. 2305.01930v1 null
2023-05-04 DiffFacto: Controllable Part-Based 3D Point Cloud Generation with Cross Diffusion Kiyohiro Nakayama et.al. 2305.01921v2 null
2023-05-04 The Impacts of Dimensionality, Diffusion, and Directedness on Intrinsic Cross-Model Simulation in Tile-Based Self-Assembly Daniel Hader et.al. 2305.01877v2 null
2023-05-03 Multimodal Data Augmentation for Image Captioning using Diffusion Models Changrong Xiao et.al. 2305.01855v1 link
2023-05-02 Unpaired Downscaling of Fluid Flows with Diffusion Bridges Tobias Bischoff et.al. 2305.01822v1 link
2023-05-02 Multimodal Procedural Planning via Dual Text-Image Prompting Yujie Lu et.al. 2305.01795v1 link
2023-05-02 DiffuSum: Generation Enhanced Extractive Summarization with Diffusion Haopeng Zhang et.al. 2305.01735v1 link
2023-05-02 ContactArt: Learning 3D Interaction Priors for Category-level Articulated Object and Hand Poses Estimation Zehao Zhu et.al. 2305.01618v1 null
2023-05-02 Adopting AI: How Familiarity Breeds Both Trust and Contempt Michael C. Horowitz et.al. 2305.01405v1 null
2023-05-02 Long-Term Rhythmic Video Soundtracker Jiashuo Yu et.al. 2305.01319v1 link
2023-05-02 DreamPaint: Few-Shot Inpainting of E-Commerce Items for Virtual Try-On without 3D Modeling Mehmet Saygin Seyfioglu et.al. 2305.01257v1 null
2023-05-02 Solving Inverse Problems with Score-Based Generative Priors learned from Noisy Data Asad Aali et.al. 2305.01166v1 null
2023-05-02 Geometric Latent Diffusion Models for 3D Molecule Generation Minkai Xu et.al. 2305.01140v1 link
2023-05-01 Fractional and tempered fractional models for Reynolds-averaged Navier-Stokes equations Pavan Pranjivan Mehta et.al. 2305.00770v1 null
2023-05-01 Diffusion Models for Time Series Applications: A Survey Lequan Lin et.al. 2305.00624v1 null
2023-04-30 Class-Balancing Diffusion Models Yiming Qin et.al. 2305.00562v1 link
2023-04-30 Towards Computational Architecture of Liberty: A Comprehensive Survey on Deep Learning for Generating Virtual Architecture in the Metaverse Anqi Wang et.al. 2305.00510v1 null
2023-04-28 Scaling regimes in rapidly rotating thermal convection at extreme Rayleigh numbers Jiaxing Song et.al. 2304.14854v1 null
2023-04-28 Simplified models of diffusion in radially-symmetric geometries Luke P. Filippini et.al. 2304.14632v1 link
2023-04-28 MUDiff: Unified Diffusion for Complete Molecule Generation Chenqing Hua et.al. 2304.14621v1 null
2023-04-28 Robust Gaussian Process Regression method for efficient reaction pathway optimization: application to surface processes Wei Fang et.al. 2304.14596v1 null
2023-04-28 SceneGenie: Scene Graph Guided Diffusion Models for Image Synthesis Azade Farshad et.al. 2304.14573v1 null
2023-04-27 It is all about where you start: Text-to-image generation with seed selection Dvir Samuel et.al. 2304.14530v1 link
2023-04-27 Putting People in Their Place: Affordance-Aware Human Insertion into Scenes Sumith Kulal et.al. 2304.14406v1 link
2023-04-27 Motion-Conditioned Diffusion Model for Controllable Video Synthesis Tsai-Shien Chen et.al. 2304.14404v1 null
2023-04-27 Maximizing Model Generalization for Manufacturing with Self-Supervised Learning and Federated Learning Matthew Russell et.al. 2304.14398v1 null
2023-04-27 Functional Diffusion Maps María Barroso et.al. 2304.14378v1 link
2023-04-27 LDPC Decoders Prefer More Reliable Parity Bits: Unequal Data Protection Over BSC Beyza Dabak et.al. 2304.14278v1 null
2023-04-27 DataComp: In search of the next generation of multimodal datasets Samir Yitzhak Gadre et.al. 2304.14108v1 link
2023-04-26 Heuristic Barycenter Modeling of Fully Absorbing Receivers in Diffusive Molecular Communication Channels Fardad Vakilipoor et.al. 2304.13640v1 null
2023-04-26 Identifying the structure patterns to govern the performance of localization in regulating innovation diffusion Leyang Xue et.al. 2304.13608v1 null
2023-04-26 Bifractality of fractal scale-free networks Jun Yamamoto et.al. 2304.13438v1 null
2023-04-26 Training-Free Location-Aware Text-to-Image Synthesis Jiafeng Mao et.al. 2304.13427v1 null
2023-04-25 The Score-Difference Flow for Implicit Generative Modeling Romann M. Weber et.al. 2304.12906v1 null
2023-04-25 Latent diffusion models for generative precipitation nowcasting with accurate uncertainty quantification Jussi Leinonen et.al. 2304.12891v1 link
2023-04-25 Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning Cheng Lu et.al. 2304.12824v1 link
2023-04-25 A Binary Annular Phase Mask to Regulate Spherical Aberration and Allow Super-Localization in Single-Particle Tracking over Extended Depth-of-Focus Quentin Gresil et.al. 2304.12774v1 null
2023-04-25 Effect of trap states, ion migration and interfaces on carrier transport in single crystal, polycrystalline and thick film devices of halide perovskites CH $_3$NH$_3$PbX$_3$ (X= I, Br, Cl) Mohd Warish et.al. 2304.12701v1 null
2023-04-24 Analyzing the neutron and $γ$ -ray emission properties of an americium-beryllium tagged neutron source Hiroshi Ito et.al. 2304.12153v1 null
2023-04-24 Efficient Halftoning via Deep Reinforcement Learning Haitian Jiang et.al. 2304.12152v1 null
2023-04-24 Variational Diffusion Auto-encoder: Deep Latent Variable Model with Unconditional Diffusion Prior Georgios Batzolis et.al. 2304.12141v1 null
2023-04-24 Customized Load Profiles Synthesis for Electricity Customers Based on Conditional Diffusion Models Zhenyi Wang et.al. 2304.12076v1 null
2023-04-24 Improving Synthetically Generated Image Detection in Cross-Concept Settings Pantelis Dogoulis et.al. 2304.12053v1 link
2023-04-21 BoDiffusion: Diffusing Sparse Observations for Full-Body Human Motion Synthesis Angela Castillo et.al. 2304.11118v1 null
2023-04-21 Improved Diffusion-based Image Colorization via Piggybacked Models Hanyuan Liu et.al. 2304.11105v1 null
2023-04-21 Perturbatively corrected ring-polymer instanton theory for accurate tunneling splittings Joseph E. Lawrence et.al. 2304.10963v1 null
2023-04-20 Farm3D: Learning Articulated 3D Animals by Distilling 2D Diffusion Tomas Jakab et.al. 2304.10535v1 null
2023-04-20 Nerfbusters: Removing Ghostly Artifacts from Casually Captured NeRFs Frederik Warburg et.al. 2304.10532v1 link
2023-04-20 Collaborative Diffusion for Multi-Modal Face Generation and Editing Ziqi Huang et.al. 2304.10530v1 link
2023-04-20 Prediction of the evolution of the nuclear reactor core parameters using artificial neural network Krzysztof Palmi et.al. 2304.10337v1 null
2023-04-20 Avoiding methane emission rate underestimates when using the divergence method Clayton Roberts et.al. 2304.10303v1 null
2023-04-20 Not Only Generative Art: Stable Diffusion for Content-Style Disentanglement in Art Analysis Yankun Wu et.al. 2304.10278v1 link
2023-04-19 Irregular dependence on Stokes number and non-ergodic transport of heavy inertial particles in steady laminar flows Anu V. S. Nath et.al. 2304.09804v1 null
2023-04-19 NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models Seung Wook Kim et.al. 2304.09787v1 null
2023-04-19 Signatures of heterogeneity in the statistical structure of target state aligned ensembles Nicolas Lenner et.al. 2304.09719v1 null
2023-04-18 Monte-Carlo method for incompressible fluid flows past obstacles Vladislav Cherepanov et.al. 2304.09152v1 null
2023-04-18 On the seed population of solar energetic particles in the inner heliosphere Nicolas Wijsen et.al. 2304.09098v1 null
2023-04-18 Construction of coarse-grained molecular dynamics with many-body non-Markovian memory Liyao Lyu et.al. 2304.09044v1 null
2023-04-18 Look ATME: The Discriminator Mean Entropy Needs Attention Edgardo Solano-Carrillo et.al. 2304.09024v1 link
2023-04-18 UPGPT: Universal Diffusion Model for Person Image Generation, Editing and Pose Transfer Soon Yau Cheong et.al. 2304.08870v1 link
2023-04-17 Text2Performer: Text-Driven Human Video Generation Yuming Jiang et.al. 2304.08483v1 link
2023-04-18 Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation Jie An et.al. 2304.08477v2 null
2023-04-17 Synthetic Data from Diffusion Models Improves ImageNet Classification Shekoofeh Azizi et.al. 2304.08466v1 null
2023-04-17 MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing Mingdeng Cao et.al. 2304.08465v1 link
2023-04-17 OVTrack: Open-Vocabulary Multiple Object Tracking Siyuan Li et.al. 2304.08408v1 null
2023-04-17 Refusion: Enabling Large-Size Realistic Image Restoration with Latent-Space Diffusion Models Ziwei Luo et.al. 2304.08291v1 link
2023-04-17 Solving stiff ordinary differential equations using physics informed neural networks (PINNs): simple recipes to improve training of vanilla-PINNs Hubert Baty et.al. 2304.08289v1 link
2023-04-14 A Comparative Study on Generative Models for High Resolution Solar Observation Imaging Mehdi Cherti et.al. 2304.07169v1 link
2023-04-14 Towards Controllable Diffusion Models via Reward-Guided Exploration Hengtong Zhang et.al. 2304.07132v1 null
2023-04-14 Delta Denoising Score Amir Hertz et.al. 2304.07090v1 null
2023-04-14 Memory Efficient Diffusion Probabilistic Models via Patch-based Generation Shinei Arakawa et.al. 2304.07087v1 null
2023-04-14 DCFace: Synthetic Face Generation with Dual Condition Diffusion Model Minchul Kim et.al. 2304.07060v1 link
2023-04-14 A Diffusion model for POI recommendation Yifang Qin et.al. 2304.07041v1 link
2023-04-13 Expressive Text-to-Image Generation with Rich Text Songwei Ge et.al. 2304.06720v1 null
2023-04-13 Single-Stage Diffusion NeRF: A Unified Approach to 3D Generation and Reconstruction Hansheng Chen et.al. 2304.06714v1 link
2023-04-13 DiffusionRig: Learning Personalized Priors for Facial Appearance Editing Zheng Ding et.al. 2304.06711v1 link
2023-04-13 Learning Controllable 3D Diffusion Models from Single-view Images Jiatao Gu et.al. 2304.06700v1 null
2023-04-13 DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning Enze Xie et.al. 2304.06648v1 null
2023-04-12 Continual Diffusion: Continual Customization of Text-to-Image Diffusion with C-LoRA James Seale Smith et.al. 2304.06027v1 null
2023-04-12 DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion Johanna Karras et.al. 2304.06025v1 null
2023-04-12 Probabilistic Human Mesh Recovery in 3D Scenes from Egocentric Views Siwei Zhang et.al. 2304.06024v1 link
2023-04-12 SpectralDiff: Hyperspectral Image Classification with Spectral-Spatial Diffusion Models Ning Chen et.al. 2304.05961v1 link
2023-04-12 Diffusion models with location-scale noise Alexia Jolicoeur-Martineau et.al. 2304.05907v1 null
2023-04-12 Cancer-Net BCa-S: Breast Cancer Grade Prediction using Volumetric Deep Radiomic Features from Synthetic Correlated Diffusion Imaging Chi-en Amy Tai et.al. 2304.05899v1 link
2023-04-11 HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models Eslam Mohamed Bakr et.al. 2304.05390v1 link
2023-04-11 Diffusion Models for Constrained Domains Nic Fishman et.al. 2304.05364v1 link
2023-04-11 Multi-scale Fusion Fault Diagnosis Method Based on Two-Dimensionaliztion Sequence in Complex Scenarios Weiyang Jin et.al. 2304.05198v1 null
2023-04-10 A Cheaper and Better Diffusion Language Model with Soft-Masked Noise Jiaao Chen et.al. 2304.04746v1 link
2023-04-10 Ambiguous Medical Image Segmentation using Diffusion Models Aimon Rahman et.al. 2304.04745v1 link
2023-04-10 Sequential Recommendation with Diffusion Models Hanwen Du et.al. 2304.04541v1 null
2023-04-07 Compressed Regression over Adaptive Networks Marco Carpentiero et.al. 2304.03638v1 null
2023-04-07 Exploring Collaborative Distributed Diffusion-Based AI-Generated Content (AIGC) in Wireless Networks Hongyang Du et.al. 2304.03446v1 link
2023-04-06 RoSteALS: Robust Steganography using Autoencoder Latent Space Tu Bui et.al. 2304.03400v1 link
2023-04-06 Diffusion Models as Masked Autoencoders Chen Wei et.al. 2304.03283v1 null
2023-04-06 Inst-Inpaint: Instructing to Remove Objects with Diffusion Models Ahmet Burak Yildirim et.al. 2304.03246v1 link
2023-04-06 Face Animation with an Attribute-Guided Diffusion Model Bohan Zeng et.al. 2304.03199v1 link
2023-04-06 SketchFFusion: Sketch-guided image editing with diffusion model Weihang Mao et.al. 2304.03174v1 null
2023-04-05 Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models Xuhui Jia et.al. 2304.02642v1 null
2023-04-05 GenPhys: From Physical Processes to Generative Models Ziming Liu et.al. 2304.02637v1 null
2023-04-05 An atlas of the heterogeneous viscoelastic brain with local power-law attenuation synthesised using Prony-series Oisin Morrison et.al. 2304.02610v1 null
2023-04-05 Generative Novel View Synthesis with 3D-Aware Diffusion Models Eric R. Chan et.al. 2304.02602v1 null
2023-04-05 Diffusion across a concentration step: Strongly nonmonotonic evolution into thermodynamic equilibrium Hans R. Moser et.al. 2304.02557v1 null
2023-04-04 viz2viz: Prompt-driven stylized visualization generation using a diffusion model Jiaqi Wu et.al. 2304.01919v1 null
2023-04-04 PODIA-3D: Domain Adaptation of 3D Generative Model Across Large Domain Gap Using Pose-Preserved Text-to-Image Diffusion Gwanghyun Kim et.al. 2304.01900v1 null
2023-04-04 Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion Davis Rempe et.al. 2304.01893v1 null
2023-04-04 Quantitative perfusion and water transport time model from multi b-value diffusion magnetic resonance imaging validated against neutron capture microspheres M. Liu et.al. 2304.01888v1 null
2023-04-04 Adaptive learning of effective dynamics: Adaptive real-time, online modeling for complex systems Ivica Kičić et.al. 2304.01732v1 link
2023-04-03 Learning to Read Braille: Bridging the Tactile Reality Gap with Diffusion Models Carolina Higuera et.al. 2304.01182v1 link
2023-04-03 ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model Mingyuan Zhang et.al. 2304.01116v1 link
2023-04-03 ViT-DAE: Transformer-driven Diffusion Autoencoder for Histopathology Image Analysis Xuan Xu et.al. 2304.01053v1 null
2023-04-03 DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models Yukang Cao et.al. 2304.00916v1 link
2023-03-31 $\infty$ -Diff: Infinite Resolution Diffusion with Subsampled Mollified States Sam Bond-Taylor et.al. 2303.18242v1 link
2023-03-31 A Closer Look at Parameter-Efficient Tuning in Diffusion Models Chendong Xiang et.al. 2303.18181v1 link
2023-03-31 One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models Yasser Benigmim et.al. 2303.18080v1 link
2023-03-30 AvatarCraft: Transforming Text into Neural Human Avatars with Parameterized Shape and Pose Control Ruixiang Jiang et.al. 2303.17606v1 link
2023-03-30 Token Merging for Fast Stable Diffusion Daniel Bolya et.al. 2303.17604v1 link
2023-03-30 Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models Wen Wang et.al. 2303.17599v1 link
2023-03-30 Consistent View Synthesis with Pose-Guided Diffusion Models Hung-Yu Tseng et.al. 2303.17598v1 null
2023-03-30 Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models Eric Zhang et.al. 2303.17591v1 link
2023-03-30 DDP: Diffusion Model for Dense Visual Prediction Yuanfeng Ji et.al. 2303.17559v1 link
2023-03-30 DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder Chenpng Du et.al. 2303.17550v1 null
2023-03-30 PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models Vidit Goel et.al. 2303.17546v1 link
2023-03-29 Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos Kun Su et.al. 2303.16897v1 null
2023-03-30 MDP: A Generalized Framework for Text-Guided Image Editing by Manipulating the Diffusion Path Qian Wang et.al. 2303.16765v2 link
2023-03-29 4D Facial Expression Diffusion Model Kaifeng Zou et.al. 2303.16611v1 link
2023-03-29 WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models Konstantina Nikolaidou et.al. 2303.16576v1 link
2023-03-29 Your Diffusion Model is Secretly a Zero-Shot Classifier Alexander C. Li et.al. 2303.16203v2 link
2023-03-28 Visual Chain-of-Thought Diffusion Models William Harvey et.al. 2303.16187v1 link
2023-03-28 Diffusion Maps for Group-Invariant Manifolds Paulina Hoyos et.al. 2303.16169v1 null
2023-03-28 Novel View Synthesis of Humans using Differentiable Rendering Guillaume Rochette et.al. 2303.15880v1 link
2023-03-27 The Stable Signature: Rooting Watermarks in Latent Diffusion Models Pierre Fernandez et.al. 2303.15435v1 link
2023-03-27 Anti-DreamBooth: Protecting users from personalized text-to-image synthesis Thanh Van Le et.al. 2303.15433v1 link
2023-03-27 Debiasing Scores and Prompts of 2D Diffusion for Robust Text-to-3D Generation Susung Hong et.al. 2303.15413v1 link
2023-03-27 Training-free Style Transfer Emerges from h-space in Diffusion models Jaeseok Jeong et.al. 2303.15403v1 null
2023-03-27 Exploring Continual Learning of Diffusion Models Michał Zając et.al. 2303.15342v1 null
2023-03-27 Diffusion Models for Memory-efficient Processing of 3D Medical Images Florentin Bieder et.al. 2303.15288v1 link
2023-03-27 Text-to-Image Diffusion Models are Zero-Shot Classifiers Kevin Clark et.al. 2303.15233v1 null
2023-03-24 Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior Junshu Tang et.al. 2303.14184v1 link
2023-03-24 MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion Yizhuo Lu et.al. 2303.14139v1 null
2023-03-24 CIFAKE: Image Classification and Explainable Identification of AI-Generated Synthetic Images Jordan J. Bird et.al. 2303.14126v1 null
2023-03-24 Electron transport measurements in liquid xenon with Xenoscope, a large-scale DARWIN demonstrator L. Baudis et.al. 2303.13963v1 null
2023-03-23 Ablating Concepts in Text-to-Image Diffusion Models Nupur Kumari et.al. 2303.13516v1 link
2023-03-23 ReVersion: Diffusion-Based Relation Inversion from Images Ziqi Huang et.al. 2303.13495v1 link
2023-03-23 Scaling laws of two-dimensional incompressible turbulent transport D. I. Palade et.al. 2303.13457v1 null
2023-03-23 Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators Levon Khachatryan et.al. 2303.13439v1 link
2023-03-23 Medical diffusion on a budget: textual inversion for medical image generation Bram de Wilde et.al. 2303.13430v1 null
2023-03-23 DDT: A Diffusion-Driven Transformer-based Framework for Human Mesh Recovery from a Video Ce Zheng et.al. 2303.13397v1 null
2023-03-23 Audio Diffusion Model for Speech Synthesis: A Survey on Text To Speech and Speech Enhancement in Generative AI Chenshuang Zhang et.al. 2303.13336v1 null
2023-03-23 Decentralized Adversarial Training over Graphs Ying Cao et.al. 2303.13326v1 null
2023-03-23 Fourier Diffusion Models: A Method to Control MTF and NPS in Score-Based Stochastic Image Generation Matthew Tivnan et.al. 2303.13285v1 null
2023-03-22 Diffuse-Denoise-Count: Accurate Crowd-Counting with Diffusion Models Yasiru Ranasinghe et.al. 2303.12790v1 link
2023-03-22 Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions Ayaan Haque et.al. 2303.12789v1 null
2023-03-22 FeatureNeRF: Learning Generalizable NeRFs by Distilling Foundation Models Jianglong Ye et.al. 2303.12786v1 null
2023-03-22 Effect of gamma radiation on electrical properties of diffusive memristor devices D. P. Pattnaik et.al. 2303.12762v1 null
2023-03-22 Pix2Video: Video Editing using Image Diffusion Duygu Ceylan et.al. 2303.12688v1 link
2023-03-23 Feature-Conditioned Cascaded Video Diffusion Models for Precise Echocardiogram Synthesis Hadrien Reynaud et.al. 2303.12644v2 link
2023-03-22 A Perceptual Quality Assessment Exploration for AIGC Images Zicheng Zhang et.al. 2303.12618v1 null
2023-03-21 Vox-E: Text-guided Voxel Editing of 3D Objects Etai Sella et.al. 2303.12048v1 link
2023-03-21 Semantic Latent Space Regression of Diffusion Autoencoders for Vertebral Fracture Grading Matthias Keicher et.al. 2303.12031v1 null
2023-03-21 Numerical simulation of self-oscillating catalytic reaction in a plug-flow reactor N. V. Peskov et.al. 2303.12022v1 null
2023-03-21 3D-CLFusion: Fast Text-to-3D Rendering with Contrastive Latent Diffusion Yu-Jhe Li et.al. 2303.11938v1 null
2023-03-21 CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion Geonmo Gu et.al. 2303.11916v1 link
2023-03-21 Projections of Model Spaces for Latent Graph Inference Haitz Sáez de Ocáriz Borde et.al. 2303.11754v1 null
2023-03-20 Zero-1-to-3: Zero-shot One Image to 3D Object Ruoshi Liu et.al. 2303.11328v1 link
2023-03-20 Localizing Object-level Shape Variations with Text-to-Image Diffusion Models Or Patashnik et.al. 2303.11306v1 null
2023-03-20 SVDiff: Compact Parameter Space for Diffusion Fine-Tuning Ligong Han et.al. 2303.11305v1 link
2023-03-20 AnimeDiffusion: Anime Face Line Drawing Colorization via Diffusion Models Yu Cao et.al. 2303.11137v1 link
2023-03-17 A Recipe for Watermarking Diffusion Models Yunqing Zhao et.al. 2303.10137v1 link
2023-03-17 Data-Centric Learning from Unlabeled Graphs with Diffusion Model Gang Liu et.al. 2303.10108v1 link
2023-03-17 DialogPaint: A Dialog-based Image Editing Model Jingxuan Wei et.al. 2303.10073v1 null
2023-03-17 GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation Can Qin et.al. 2303.10056v1 link
2023-03-17 On the momentum diffusion over multiphase surfaces with meshless methods Johannes C. Joubert et.al. 2303.09978v1 null
2023-03-17 Adversarial Counterfactual Visual Explanations Guillaume Jeanneret et.al. 2303.09962v1 link
2023-03-17 Discovering mesoscopic descriptions of collective movement with neural stochastic modelling Utkarsh Pratiush et.al. 2303.09906v1 link
2023-03-16 Efficient Diffusion Training via Min-SNR Weighting Strategy Tiankai Hang et.al. 2303.09556v1 link
2023-03-16 Diffusion-HPC: Generating Synthetic Images with Realistic Humans Zhenzhen Weng et.al. 2303.09541v1 link
2023-03-17 FateZero: Fusing Attentions for Zero-shot Text-based Video Editing Chenyang Qi et.al. 2303.09535v2 link
2023-03-16 $P+$ : Extended Textual Conditioning in Text-to-Image Generation Andrey Voynov et.al. 2303.09522v1 null
2023-03-16 DiffIR: Efficient Diffusion Model for Image Restoration Bin Xia et.al. 2303.09472v1 link
2023-03-16 Unwrapping NPT simulations to calculate diffusion coefficients Jakob Tómas Bullerjahn et.al. 2303.09418v1 null
2023-03-17 DINAR: Diffusion Inpainting of Neural Textures for One-Shot Human Avatars David Svitov et.al. 2303.09375v2 link
2023-03-15 Stochastic Interpolants: A Unifying Framework for Flows and Diffusions Michael S. Albergo et.al. 2303.08797v1 null
2023-03-15 Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion Inhwa Han et.al. 2303.08767v1 null
2023-03-15 Advanced Analysis of Radar Cross-Section Measurements in Reverberation Environment Corentin Charlo et.al. 2303.08751v1 null
2023-03-15 DiffusionAD: Denoising Diffusion for Anomaly Detection Hui Zhang et.al. 2303.08730v1 link
2023-03-16 ResDiff: Combining CNN and Diffusion Model for Image Super-Resolution Shuyao Shang et.al. 2303.08714v2 null
2023-03-15 Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style Transfer Serin Yang et.al. 2303.08622v1 link
2023-03-14 LayoutDM: Discrete Diffusion Model for Controllable Layout Generation Naoto Inoue et.al. 2303.08137v1 link
2023-03-14 MeshDiffusion: Score-based Generative 3D Mesh Modeling Zhen Liu et.al. 2303.08133v1 link
2023-03-14 Editing Implicit Assumptions in Text-to-Image Diffusion Models Hadas Orgad et.al. 2303.08084v1 link
2023-03-15 Interpretable ODE-style Generative Diffusion Model via Force Field Construction Weiyang Jin et.al. 2303.08063v2 null
2023-03-14 Edit-A-Video: Single Video Editing with Object-Aware Consistency Chaehun Shin et.al. 2303.07945v1 null
2023-03-15 Controllable Mesh Generation Through Sparse Latent Point Diffusion Models Zhaoyang Lyu et.al. 2303.07938v2 null
2023-03-15 Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation Junyoung Seo et.al. 2303.07937v2 link
2023-03-13 Erasing Concepts from Diffusion Models Rohit Gandikota et.al. 2303.07345v1 link
2023-03-14 Parallel Vertex Diffusion for Unified Visual Grounding Zesen Cheng et.al. 2303.07216v2 null
2023-03-10 GECCO: Geometrically-Conditioned Point Diffusion Models Michał J. Tyszkiewicz et.al. 2303.05916v1 null
2023-03-10 Photon Diffusion in Microscale Solids Avijit Das et.al. 2303.05776v1 null
2023-03-10 TrojDiff: Trojan Attacks on Diffusion Models with Diverse Targets Weixin Chen et.al. 2303.05762v1 link
2023-03-10 Fast Diffusion Sampler for Inverse Problems by Geometric Decomposition Hyungjin Chung et.al. 2303.05754v1 link
2023-03-09 Scaling up GANs for Text-to-Image Synthesis Minguk Kang et.al. 2303.05511v1 null
2023-03-09 Resolving quantitative MRI model degeneracy with machine learning via training data distribution design Michele Guerreri et.al. 2303.05464v1 null
2023-03-09 3DGen: Triplane Latent Diffusion for Textured Mesh Generation Anchit Gupta et.al. 2303.05371v1 null
2023-03-09 TGDataset: a Collection of Over One Hundred Thousand Telegram Channels Massimo La Morgia et.al. 2303.05345v1 link
2023-03-09 Brain-Diffuser: Natural scene reconstruction from fMRI signals using generative latent diffusion Furkan Ozcelik et.al. 2303.05334v1 link
2023-03-08 Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models Jiarui Xu et.al. 2303.04803v1 link
2023-03-08 Multilevel Diffusion: Infinite Dimensional Score-Based Diffusion Models for Image Generation Paul Hagemann et.al. 2303.04772v1 link
2023-03-08 Video-P2P: Video Editing with Cross-attention Control Shaoteng Liu et.al. 2303.04761v1 null
2023-03-08 Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models Chenfei Wu et.al. 2303.04671v1 link
2023-03-08 Diffusing Gaussian Mixtures for Generating Categorical Data Florence Regol et.al. 2303.04635v1 link
2023-03-08 Connecting finite-time Lyapunov exponents with supersaturation and droplet dynamics in the bulk of a turbulent cloud Vladyslav Pushenko et.al. 2303.04632v1 null
2023-03-08 Maritime transportation and people mobility in the early diffusion of COVID-19 in Croatia Corentin Cot et.al. 2303.04617v1 null
2023-03-07 Diffusion Policy: Visuomotor Policy Learning via Action Diffusion Cheng Chi et.al. 2303.04137v1 null
2023-03-06 Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-Type Samplers Sitan Chen et.al. 2303.03384v1 null
2023-03-06 StyO: Stylize Your Face in Only One-Shot Bonan Li et.al. 2303.03231v1 null
2023-03-03 Unleashing Text-to-Image Diffusion Models for Visual Perception Wenliang Zhao et.al. 2303.02153v1 link
2023-03-03 Multi-Agent Adversarial Training Using Diffusion Learning Ying Cao et.al. 2303.01936v1 null
2023-03-03 CONTAIN: A Community-based Algorithm for Network Immunization Özgur Coban et.al. 2303.01934v1 link
2023-03-02 Consistency Models Yang Song et.al. 2303.01469v1 link
2023-03-02 Human Motion Diffusion as a Generative Prior Yonatan Shafir et.al. 2303.01418v1 link
2023-03-02 Why (and When) does Local SGD Generalize Better than SGD? Xinran Gu et.al. 2303.01215v1 link
2023-03-01 StraIT: Non-autoregressive Generation with Stratified Image Transformer Shengju Qian et.al. 2303.00750v1 null
2023-03-01 Diffusing Graph Attention Daniel Glickman et.al. 2303.00613v1 null
2023-03-01 Level Up the Deepfake Detection: a Method to Effectively Discriminate Images Generated by GAN Architectures and Diffusion Models Luca Guarnera et.al. 2303.00608v1 null
2023-03-01 Unlimited-Size Diffusion Restoration Yinhuai Wang et.al. 2303.00354v1 link
2023-03-01 Collage Diffusion Vishnu Sarukkai et.al. 2303.00262v1 null
2023-03-01 Diffusion Probabilistic Fields Peiye Zhuang et.al. 2303.00165v1 null
2023-02-28 Phase Field Modeling of Dictyostelium Discoideum Chemotaxis Yunsong Zhang et.al. 2302.14854v1 null
2023-02-28 Monocular Depth Estimation using Diffusion Models Saurabh Saxena et.al. 2302.14816v1 null
2023-02-28 Dissolving Is Amplifying: Towards Fine-Grained Anomaly Detection Jian Shi et.al. 2302.14696v1 link
2023-02-28 Synthesizing Mixed-type Electronic Health Records using Diffusion Models Taha Ceritli et.al. 2302.14679v1 null
2023-02-28 Detecting and Optimising Team Interactions in Software Development Christian Zingg et.al. 2302.14609v1 null
2023-02-28 Can We Use Diffusion Probabilistic Models for 3D Motion Prediction? Hyemin Ahn et.al. 2302.14503v1 null
2023-02-27 Buoyancy-driven attraction of active droplets Yibo Chen et.al. 2302.14008v1 null
2023-02-27 Impact of reconstruction schemes on interpreting lattice Boltzmann results – A study using the Taylor-Green vortex problem Jianping Meng et.al. 2302.13910v1 null
2023-02-27 Differentially Private Diffusion Models Generate Useful Synthetic Images Sahra Ghalebikesabi et.al. 2302.13861v1 null
2023-02-27 Denoising Diffusion Samplers Francisco Vargas et.al. 2302.13834v1 null
2023-02-24 Modulating Pretrained Diffusion Models for Multimodal Image Synthesis Cusuh Ham et.al. 2302.12764v1 null
2023-02-24 Physical interactions promote Turing patterns Lucas Menou et.al. 2302.12521v1 null
2023-02-24 Flow instability and momentum exchange in separation control by a synthetic jet Yoshiaki Abe et.al. 2302.12496v1 null
2023-02-24 Unsupervised Discovery of Semantic Latent Directions in Diffusion Models Yong-Hyun Park et.al. 2302.12469v1 null
2023-02-23 To the Noise and Back: Diffusion for Shared Autonomy Takuma Yoneda et.al. 2302.12244v1 null
2023-02-23 DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models Jamie Wynn et.al. 2302.12231v1 link
2023-02-23 Designing an Encoder for Fast Personalization of Text-to-Image Models Rinon Gal et.al. 2302.12228v1 null
2023-02-23 Metric-oriented Speech Enhancement using Diffusion Probabilistic Model Chen Chen et.al. 2302.11989v1 null
2023-02-22 Uncovering Bias in Face Generation Models Cristian Muñoz et.al. 2302.11562v1 null
2023-02-22 Reduce, Reuse, Recycle: Compositional Generation with Energy-Based Diffusion Models and MCMC Yilun Du et.al. 2302.11552v1 link
2023-02-22 Scaling Robot Learning with Semantically Imagined Experience Tianhe Yu et.al. 2302.11550v1 null
2023-02-22 Aligned Diffusion Schrödinger Bridges Vignesh Ram Somnath et.al. 2302.11419v1 link
2023-02-22 Entity-Level Text-Guided Image Manipulation Yikai Wang et.al. 2302.11383v1 link
2023-02-22 An agent-based model of the 2020 international policy diffusion in response to the COVID-19 pandemic with particle filter Yannick Oswald et.al. 2302.11277v1 link
2023-02-21 Provable Copyright Protection for Generative Models Nikhil Vyas et.al. 2302.10870v1 null
2023-02-21 Learning 3D Photography Videos via Self-supervised Diffusion on Single Images Xiaodong Wang et.al. 2302.10781v1 null
2023-02-21 On Calibrating Diffusion Probabilistic Models Tianyu Pang et.al. 2302.10688v1 link
2023-02-21 $PC^2$ : Projection-Conditioned Point Cloud Diffusion for Single-Image 3D Reconstruction Luke Melas-Kyriazi et.al. 2302.10668v1 link
2023-02-21 RealFusion: 360° Reconstruction of Any Object from a Single Image Luke Melas-Kyriazi et.al. 2302.10663v1 null
2023-02-21 Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels Zebin You et.al. 2302.10586v1 link
2023-02-20 Towards Universal Fake Image Detectors that Generalize Across Generative Models Utkarsh Ojha et.al. 2302.10174v1 link
2023-02-20 Cross-domain Compositing with Pretrained Diffusion Models Roy Hachnochi et.al. 2302.10167v1 link
2023-02-20 NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion Jiatao Gu et.al. 2302.10109v1 null
2023-02-20 DINOISER: Diffused Conditional Sequence Learning by Manipulating Noises Jiasheng Ye et.al. 2302.10025v1 link
2023-02-17 Consistent Diffusion Models: Mitigating Sampling Drift by Learning to be Consistent Giannis Daras et.al. 2302.09057v1 link
2023-02-17 MiDi: Mixed Graph and 3D Denoising Diffusion for Molecule Generation Clement Vignac et.al. 2302.09048v1 link
2023-02-17 LDFA: Latent Diffusion Face Anonymization for Self-driving Applications Marvin Klemp et.al. 2302.08931v1 null
2023-02-17 Multi-unit Auction over a Social Network Yuan Fang et.al. 2302.08924v1 null
2023-02-17 Unraveling the Variations of the Society of England and Wales through Diffusion Maps Analysis on Census 2011 Gezhi Xiu et.al. 2302.08701v1 null
2023-02-16 Text-driven Visual Synthesis with Latent Diffusion Prior Ting-Hsuan Liao et.al. 2302.08510v1 null
2023-02-16 T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models Chong Mou et.al. 2302.08453v1 link
2023-02-16 Explicit Diffusion of Gaussian Mixture Model Based Image Priors Martin Zach et.al. 2302.08411v1 null
2023-02-16 Boundary Guided Mixing Trajectory for Semantic Control with Diffusion Models Ye Zhu et.al. 2302.08357v1 link
2023-02-15 Dataset Interfaces: Diagnosing Model Failures Using Controllable Counterfactual Generation Joshua Vendrow et.al. 2302.07865v1 link
2023-02-15 Denoising Diffusion Probabilistic Models for Robust Image Super-Resolution in the Wild Hshmat Sahak et.al. 2302.07864v1 null
2023-02-15 Data Forensics in Diffusion Models: A Systematic Analysis of Membership Privacy Derui Zhu et.al. 2302.07801v1 null
2023-02-15 Video Probabilistic Diffusion Models in Projected Latent Space Sihyun Yu et.al. 2302.07685v1 null
2023-02-14 Where to Diffuse, How to Diffuse, and How to Get Back: Automated Learning for Multivariate Diffusions Raghav Singhal et.al. 2302.07261v1 null
2023-02-14 Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data Minshuo Chen et.al. 2302.07194v1 null
2023-02-14 Universal Guidance for Diffusion Models Arpit Bansal et.al. 2302.07121v1 link
2023-02-14 Differential privacy diffusion auction of homogeneous items Fengjuan Jia et.al. 2302.07072v1 null
2023-02-14 Direct numerical simulations of the Taylor-Green Vortex interacting with a hydrogen diffusion flame: Reynolds number and non-unity Lewis number effects Yifan Xu et.al. 2302.07006v1 null
2023-02-13 Raising the Cost of Malicious AI-Powered Image Editing Hadi Salman et.al. 2302.06588v1 link
2023-02-13 Preconditioned Score-based Generative Models Li Zhang et.al. 2302.06504v1 link
2023-02-13 Technical Note: PDE-constrained Optimization Formulation for Tumor Growth Model Calibration Baoshan Liang et.al. 2302.06445v1 null
2023-02-13 ContrasInver: Voxel-wise Contrastive Semi-supervised Learning for Seismic Inversion Yimin Dou et.al. 2302.06441v1 null
2023-02-13 Interplay between advective, diffusive, and active barriers in Rayleigh-Bénard flow Nikolas Aksamit et.al. 2302.06319v1 null
2023-02-10 Example-Based Sampling with Diffusion Models Bastien Doignies et.al. 2302.05116v1 null
2023-02-09 UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models Wenliang Zhao et.al. 2302.04867v1 link
2023-02-09 RelightableHands: Efficient Neural Relighting of Articulated Hand Models Shun Iwase et.al. 2302.04866v1 null
2023-02-09 Is This Loss Informative? Speeding Up Textual Inversion with Deterministic Objective Evaluation Anton Voronov et.al. 2302.04841v1 link
2023-02-09 Better Diffusion Models Further Improve Adversarial Training Zekai Wang et.al. 2302.04638v1 link
2023-02-09 Adversarial Example Does Good: Preventing Painting Imitation from Diffusion Models via Adversarial Examples Chumeng Liang et.al. 2302.04578v1 link
2023-02-08 PFGM++: Unlocking the Potential of Physics-Inspired Generative Models Yilun Xu et.al. 2302.04265v1 link
2023-02-08 GLAZE: Protecting Artists from Style Mimicry by Text-to-Image Models Shawn Shan et.al. 2302.04222v1 null
2023-02-08 Policy Evaluation in Decentralized POMDPs with Belief Sharing Mert Kayaalp et.al. 2302.04151v1 link
2023-02-08 Dimensional lattice Boltzmann method for transport phenomena simulation without conversion to lattice units Ivan Talão Martins et.al. 2302.04120v1 null
2023-02-07 Long Horizon Temperature Scaling Andy Shih et.al. 2302.03686v1 link
2023-02-07 Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery Yuxin Wen et.al. 2302.03668v1 link
2023-02-07 HumanMAC: Masked Motion Completion for Human Motion Prediction Ling-Hao Chen et.al. 2302.03665v1 link
2023-02-07 Graph Generation with Destination-Driven Diffusion Mixture Jaehyeong Jo et.al. 2302.03596v1 link
2023-02-06 Zero-shot Image-to-Image Translation Gaurav Parmar et.al. 2302.03027v1 link
2023-02-06 Structure and Content-Guided Video Synthesis with Diffusion Models Patrick Esser et.al. 2302.03011v1 null
2023-02-03 AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners Zhixuan Liang et.al. 2302.01877v1 link
2023-02-03 TEXTure: Text-Guided Texturing of 3D Shapes Elad Richardson et.al. 2302.01721v1 link
2023-02-03 Learning End-to-End Channel Coding with Diffusion Models Muah Kim et.al. 2302.01714v1 null
2023-02-03 A Lipschitz Bandits Approach for Continuous Hyperparameter Optimization Yasong Feng et.al. 2302.01539v1 null
2023-02-02 Dreamix: Video Diffusion Models are General Video Editors Eyal Molad et.al. 2302.01329v1 null
2023-02-02 Are Diffusion Models Vulnerable to Membership Inference Attacks? Jinhao Duan et.al. 2302.01316v1 link
2023-02-01 Stable Target Field for Reduced Variance Score Estimation in Diffusion Models Yilun Xu et.al. 2302.00670v1 link
2023-01-31 Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models Hila Chefer et.al. 2301.13826v1 link
2023-01-30 Extracting Training Data from Diffusion Models Nicholas Carlini et.al. 2301.13188v1 null
2023-01-30 Shape-aware Text-driven Layered Video Editing Yao-Chih Lee et.al. 2301.13173v1 null
2023-01-30 GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis Ming Tao et.al. 2301.12959v1 link
2023-01-30 ERA-Solver: Error-Robust Adams Solver for Fast Sampling of Diffusion Probabilistic Models Shengmeng Li et.al. 2301.12935v1 null
2023-01-30 PromptMix: Text-to-image diffusion models enhance the performance of lightweight networks Arian Bakhtiarnia et.al. 2301.12914v1 null
2023-01-27 Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion Flavio Schneider et.al. 2301.11757v1 link
2023-01-27 Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines? Victor Boutin et.al. 2301.11722v1 link
2023-01-26 simple diffusion: End-to-end diffusion for high resolution images Emiel Hoogeboom et.al. 2301.11093v1 null
2023-01-26 On the Importance of Noise Scheduling for Diffusion Models Ting Chen et.al. 2301.10972v1 null
2023-01-25 Imitating Human Behaviour with Diffusion Models Tim Pearce et.al. 2301.10677v1 link
2023-01-24 Bipartite Graph Diffusion Model for Human Interaction Generation Baptiste Chopin et.al. 2301.10134v1 link
2023-01-24 DiffMotion: Speech-Driven Gesture Synthesis Using Denoising Diffusion Model Fan Zhang et.al. 2301.10047v1 link
2023-01-24 Membership Inference of Diffusion Models Hailong Hu et.al. 2301.09956v1 link
2023-01-23 LEGO-Net: Learning Regular Rearrangements of Objects in Rooms Qiuhong Anna Wei et.al. 2301.09629v1 null
2023-01-23 Evaluation of Light Collection from Highly Scattering Media using Wavelength-Shifting Fibers Andrew Wilhelm et.al. 2301.09608v1 null
2023-01-23 StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis Axel Sauer et.al. 2301.09515v1 link
2023-01-23 DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion Qitian Wu et.al. 2301.09474v1 link
2023-01-19 Dif-Fusion: Towards High Color Fidelity in Infrared and Visible Image Fusion with Diffusion Models Jun Yue et.al. 2301.08072v1 null
2023-01-18 Targeted Image Reconstruction by Sampling Pre-trained Diffusion Model Jiageng Zheng et.al. 2301.07557v1 null
2023-01-17 GLIGEN: Open-Set Grounded Text-to-Image Generation Yuheng Li et.al. 2301.07093v1 link
2023-01-13 In BLOOM: Creativity and Affinity in Artificial Lyrics and Art Evan Crothers et.al. 2301.05402v1 link
2023-01-12 Guiding Text-to-Image Diffusion Model Towards Grounded Generation Ziyi Li et.al. 2301.05221v1 null
2023-01-12 Thompson Sampling with Diffusion Generative Prior Yu-Guan Hsieh et.al. 2301.05182v1 null

(<a href=#Updated-on-20240404>back to top</a>)

sketch

Publish Date Title Authors PDF Code
2024-04-02 Sketch3D: Style-Consistent Guidance for Sketch-to-3D Generation Wangguandong Zheng et.al. 2404.01843v1 null
2024-04-02 FashionEngine: Interactive Generation and Editing of 3D Clothed Humans Tao Hu et.al. 2404.01655v1 null
2024-04-01 Categorical semiotics: Foundations for Knowledge Integration Carlos Leandro et.al. 2404.01526v1 null
2024-04-01 Can Biases in ImageNet Models Explain Generalization? Paul Gavrikov et.al. 2404.01509v1 link
2024-04-02 GDA: Generalized Diffusion for Robust Test-time Adaptation Yun-Yun Tsai et.al. 2404.00095v2 null
2024-03-29 Optimal Communication for Classic Functions in the Coordinator Model and Beyond Hossein Esfandiari et.al. 2403.20307v1 null
2024-03-29 Sketch-to-Architecture: Generative AI-aided Architectural Design Pengzhi Li et.al. 2403.20186v1 null
2024-03-28 Dealing with Missing Modalities in Multimodal Recommendation: a Feature Propagation-based Approach Daniele Malitesta et.al. 2403.19841v1 null
2024-03-28 TASR: A Novel Trust-Aware Stackelberg Routing Algorithm to Mitigate Traffic Congestion Doris E. M. Brown et.al. 2403.19831v1 null
2024-03-26 Neural Attributed Community Search at Billion Scale Jianwei Wang et.al. 2403.18874v1 null
2024-03-27 A Path Towards Legal Autonomy: An interoperable and explainable approach to extracting, transforming, loading and computing legal information using large language models, expert systems and Bayesian networks Axel Constant et.al. 2403.18537v1 null
2024-03-27 U-Sketch: An Efficient Approach for Sketch to Image Diffusion Models Ilias Mitsouras et.al. 2403.18425v1 null
2024-03-27 ECNet: Effective Controllable Text-to-Image Diffusion Models Sicheng Li et.al. 2403.18417v1 null
2024-03-26 Search and Society: Reimagining Information Access for Radical Futures Bhaskar Mitra et.al. 2403.17901v1 null
2024-03-26 ExpressEdit: Video Editing with Natural Language and Sketching Bekzat Tilekbay et.al. 2403.17693v1 null
2024-03-26 Equipping Sketch Patches with Context-Aware Positional Encoding for Graphic Sketch Representation Sicong Zang et.al. 2403.17525v1 null
2024-03-25 On Policy Reuse: An Expressive Language for Representing and Executing General Policies that Call Other Policies Blai Bonet et.al. 2403.16824v1 null
2024-03-25 CodeS: Natural Language to Code Repository via Multi-Layer Sketch Daoguang Zan et.al. 2403.16443v1 link
2024-03-24 Combined Task and Motion Planning Via Sketch Decompositions (Extended Version with Supplementary Material) Magí Dalmau-Moreno et.al. 2403.16277v1 null
2024-03-22 Efficiently Estimating Mutual Information Between Attributes Across Tables Aécio Santos et.al. 2403.15553v1 null
2024-03-22 Fourier Transform-based Estimators for Data Sketches Seth Pettie et.al. 2403.15366v1 null
2024-03-25 Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing Alberto Baldrati et.al. 2403.14828v2 link
2024-03-21 Object-Centric Domain Randomization for 3D Shape Reconstruction in the Wild Junhyeong Cho et.al. 2403.14539v1 null
2024-03-21 External Knowledge Enhanced 3D Scene Generation from Sketch Zijie Wu et.al. 2403.14121v1 null
2024-03-20 Towards an extension of Fault Trees in the Predictive Maintenance Scenario Roberta De Fazio et.al. 2403.13785v1 null
2024-03-25 Diagrammatic Instructions to Specify Spatial Objectives and Constraints with Applications to Mobile Base Placement Qilin Sun et.al. 2403.12465v2 null
2024-03-18 Towards a Theory of Pragmatic Information Edward D. Weinberger et.al. 2403.12324v1 null
2024-03-17 Stylized Face Sketch Extraction via Generative Prior with Limited Data Kwan Yun et.al. 2403.11263v1 link
2024-03-16 RETINAQA : A Knowledge Base Question Answering Model Robust to both Answerable and Unanswerable Questions Prayushi Faldu et.al. 2403.10849v1 null
2024-03-15 Animate Your Motion: Turning Still Images into Dynamic Videos Mingxiao Li et.al. 2403.10179v1 null
2024-03-14 What Sketch Explainability Really Means for Downstream Tasks Hmrishav Bandyopadhyay et.al. 2403.09480v1 null
2024-03-14 SketchINR: A First Look into Sketches as Implicit Neural Representations Hmrishav Bandyopadhyay et.al. 2403.09344v1 null
2024-03-14 Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset Hugo Laurençon et.al. 2403.09029v1 null
2024-03-13 ARtVista: Gateway To Empower Anyone Into Artist Trong-Vu Hoang et.al. 2403.08876v1 null
2024-03-13 HAIFIT: Human-Centered AI for Fashion Image Translation Jianan Jiang et.al. 2403.08651v1 link
2024-03-13 Sketch2Manga: Shaded Manga Screening from Sketch with Diffusion Models Jian Lin et.al. 2403.08266v1 null
2024-03-12 It’s All About Your Sketch: Democratising Sketch Control in Diffusion Models Subhadeep Koley et.al. 2403.07234v1 link
2024-03-12 You’ll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval Subhadeep Koley et.al. 2403.07222v1 null
2024-03-12 Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers Subhadeep Koley et.al. 2403.07214v1 null
2024-03-11 How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval? Subhadeep Koley et.al. 2403.07203v1 null
2024-03-11 Enhancing Image Caption Generation Using Reinforcement Learning with Human Feedback Adarsh N L et.al. 2403.06735v1 null
2024-03-08 Data-Dependent LSH for the Earth Mover’s Distance Rajesh Jayaram et.al. 2403.05041v1 null
2024-03-07 A challenge in A(G)I, cybernetics revived in the Ouroboros Model as one algorithm for all thinking Knud Thomsen et.al. 2403.04292v1 null
2024-03-06 NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging Takahiro Shirakawa et.al. 2403.03485v1 link
2024-03-07 DLP-GAN: learning to draw modern Chinese landscape photos with generative adversarial network Xiangquan Gui et.al. 2403.03456v2 null
2024-03-05 SmartSantander: IoT Experimentation over a Smart City Testbed Luis Sanchez et.al. 2403.03196v1 null
2024-03-05 CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following Kaiyan Zhang et.al. 2403.03129v1 null
2024-03-05 RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches Priya Sundaresan et.al. 2403.02709v1 null
2024-03-02 Euclidean distance compression via deep random features Brett Leroux et.al. 2403.01327v1 null
2024-02-29 CoMeT: Count-Min-Sketch-based Row Tracking to Mitigate RowHammer at Low Cost F. Nisa Bostanci et.al. 2402.18769v1 link
2024-02-28 DynaWarp – Efficient, large-scale log storage and retrieval Julian Reichinger et.al. 2402.18355v1 null
2024-02-28 Block and Detail: Scaffolding Sketch-to-Image Generation Vishnu Sarukkai et.al. 2402.18116v1 null
2024-02-27 Decremental $(1+ε)$ -Approximate Maximum Eigenvector: Dynamic Power Method Deeksha Adil et.al. 2402.17929v1 null
2024-02-27 Surgment: Segmentation-enabled Semantic Search and Creation of Visual Question and Feedback to Support Video-Based Surgery Learning Jingying Wang et.al. 2402.17903v1 null
2024-02-27 CAD-SIGNet: CAD Language Inference from Point Clouds using Layer-wise Sketch Instance Guided Attention Mohammad Sadil Khan et.al. 2402.17678v1 null
2024-02-27 CustomSketching: Sketch Concept Extraction for Sketch-based Image Synthesis and Editing Chufeng Xiao et.al. 2402.17624v1 null
2024-02-27 Equivariant ideals of polynomials Arka Ghosh et.al. 2402.17604v1 null
2024-02-25 Convolution and Cross-Correlation of Count Sketches Enables Fast Cardinality Estimation of Multi-Join Queries Mike Heddes et.al. 2402.15953v1 link
2024-02-23 Genie: Generative Interactive Environments Jake Bruce et.al. 2402.15391v1 null
2024-02-22 Semantic Image Synthesis with Unconditional Generator Jungwoo Chae et.al. 2402.14395v1 null
2024-02-21 Sketching AI Concepts with Capabilities and Examples: AI Innovation in the Intensive Care Unit Nur Yildirim et.al. 2402.13437v1 null
2024-02-20 Quantitative causality, causality-guided scientific discovery, and causal machine learning X. San Liang et.al. 2402.13427v1 null
2024-02-20 Almost-Tight Bounds on Preserving Cuts in Classes of Submodular Hypergraphs Sanjeev Khanna et.al. 2402.13151v1 null
2024-02-17 Be Persistent: Towards a Unified Solution for Mitigating Shortcuts in Deep Learning Hadi M. Dolatabadi et.al. 2402.11237v1 null
2024-02-17 Automated Optimization of Parameterized Data-Plane Programs with Parasol Mary Hogan et.al. 2402.11155v1 null
2024-02-13 Sampling Space-Saving Set Sketches Homin K. Lee et.al. 2402.08604v1 link
2024-02-13 One-to-many Reconstruction of 3D Geometry of cultural Artifacts using a synthetically trained Generative Model Thomas Pöllabauer et.al. 2402.08310v1 null
2024-02-13 Epistemic Power, Objectivity and Gender in AI Ethics Labor: Legitimizing Located Complaints David Gray Widder et.al. 2402.08171v1 null
2024-02-13 Randomized Algorithms for Symmetric Nonnegative Matrix Factorization Koby Hayashi et.al. 2402.08134v1 null
2024-02-10 Guided Sketch-Based Program Induction by Search Gradients Ahmad Ayaz Amin et.al. 2402.06990v1 null
2024-02-09 Squidgets: Sketch-based Widget Design and Direct Manipulation of 3D Scene Joonho Kim et.al. 2402.06795v1 null
2024-02-08 InkSight: Offline-to-Online Handwriting Conversion by Learning to Read and Write Blagoj Mitrevski et.al. 2402.05804v1 null
2024-02-08 A Concept for Reconstructing Stucco Statues from historic Sketches using synthetic Data only Thomas Pöllabauer et.al. 2402.05593v1 null
2024-02-06 Gradient Sketches for Training Data Attribution and Studying the Loss Landscape Andrea Schioppa et.al. 2402.03994v1 null
2024-02-06 3Doodle: Compact Abstraction of Objects with 3D Strokes Changwoon Choi et.al. 2402.03690v1 null
2024-02-05 Computing Generic Fibres of Polynomial Ideals with FGLM and Hensel Lifting Jérémy Berthomieu et.al. 2402.03144v1 null
2024-02-03 Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization Bo Yang et.al. 2402.02141v1 link
2024-02-02 Solitons, dispersive shock waves and Noel Fredrick Smyth Saleh Baqer et.al. 2402.01332v1 null
2024-02-01 Deep Robot Sketching: An application of Deep Q-Learning Networks for human-like sketching Raul Fernandez-Fernandez et.al. 2402.00676v1 null
2024-02-01 High-Quality Medical Image Generation from Free-hand Sketch Quan Huu Cap et.al. 2402.00353v1 null
2024-01-31 On The Power of Subtle Expressive Cues in the Perception of Human Affects Ezgi Dede et.al. 2401.18013v1 null
2024-02-04 Fine-Grained Zero-Shot Learning: Advances, Challenges, and Prospects Jingcai Guo et.al. 2401.17766v2 link
2024-01-31 Estimating Diffusion Degree on Graph Streams Vinit Ramesh Gore et.al. 2401.17611v1 null
2024-01-31 Topology-Aware Latent Diffusion for 3D Shape Generation Jiangbei Hu et.al. 2401.17603v1 null
2024-01-29 FPGA Technology Mapping Using Sketch-Guided Program Synthesis Gus Henry Smith et.al. 2401.16526v1 null
2024-01-29 Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors Shiyin Dong et.al. 2401.16459v1 null
2024-01-25 Incremental Proof Development in Dafny with Module-Based Induction Son Ho et.al. 2401.16233v1 null
2024-01-26 Sketch and Refine: Towards Fast and Accurate Lane Detection Chao Chen et.al. 2401.14729v1 link
2024-01-27 Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation Minglin Chen et.al. 2401.14257v2 null
2024-01-22 PatternPortrait: Draw Me Like One of Your Scribbles Sabine Wieluch et.al. 2401.13001v1 null
2024-01-22 Automated Completion of Statements and Proofs in Synthetic Geometry: an Approach based on Constraint Solving Salwa Tabet Gonzalez et.al. 2401.11898v1 null
2024-01-18 Sketch-Guided Constrained Decoding for Boosting Blackbox Large Language Models without Logit Access Saibo Geng et.al. 2401.09967v1 null
2024-01-21 Towards Identifiable Unsupervised Domain Translation: A Diversified Distribution Matching Approach Sagar Shrestha et.al. 2401.09671v2 null
2024-01-12 Masked Attribute Description Embedding for Cloth-Changing Person Re-identification Chunlei Peng et.al. 2401.05646v2 link
2024-01-11 DrawTalking: Building Interactive Worlds by Sketching and Speaking Karl Toby Rosenberg et.al. 2401.05631v1 null
2024-01-10 Modality-Aware Representation Learning for Zero-shot Sketch-based Image Retrieval Eunyi Lyou et.al. 2401.04860v1 null
2024-01-09 Content-Conditioned Generation of Stylized Free hand Sketches Jiajun Liu et.al. 2401.04739v1 null
2024-01-09 Representative Feature Extraction During Diffusion Process for Sketch Extraction with One Example Kwan Yun et.al. 2401.04362v1 null
2024-01-08 Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach Huanyu Liu et.al. 2401.03742v1 link
2024-01-05 FedNS: A Fast Sketching Newton-Type Algorithm for Federated Learning Jian Li et.al. 2401.02734v1 link
2024-01-02 ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and Text Dingkun Yan et.al. 2401.01456v1 link
2024-01-01 Free-form Shape Modeling in XR: A Systematic Review Shounak Chatterjee et.al. 2401.00924v1 null
2024-01-01 DiffMorph: Text-less Image Morphing with Diffusion Models Shounak Chatterjee et.al. 2401.00739v1 null
2023-12-31 SynCDR : Training Cross Domain Retrieval Models with Synthetic Data Samarth Mishra et.al. 2401.00420v1 link
2023-12-31 Multi-Granularity Representation Learning for Sketch-based Dynamic Face Image Retrieval Liang Wang et.al. 2401.00371v1 link
2023-12-28 A randomized algorithm to solve reduced rank operator regression Giacomo Turri et.al. 2312.17348v1 link
2024-01-03 SVGDreamer: Text Guided SVG Generation with Diffusion Model Ximing Xing et.al. 2312.16476v2 link
2023-12-22 Generative AI and the History of Architecture Joern Ploennigs et.al. 2312.15106v1 null
2023-12-22 A Modular Approach to Metatheoretic Reasoning for Extensible Languages Dawn Michaelson et.al. 2312.14374v1 null
2023-12-21 On the Hardness of Analyzing Quantum Programs Quantitatively Martin Avanzini et.al. 2312.13657v1 null
2023-12-18 Open Vocabulary Semantic Scene Sketch Understanding Ahmed Bourouis et.al. 2312.12463v1 null
2023-12-19 Sketch Vision: Artificial Intelligence with Sight for Imagination Demircan Tas et.al. 2312.12270v1 null
2023-12-19 Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model Lingjun Zhang et.al. 2312.12232v1 link
2023-12-19 CreativeConnect: Supporting Reference Recombination for Graphic Design Ideation with Generative AI DaEun Choi et.al. 2312.11949v1 null
2023-12-16 Symmetrical Bidirectional Knowledge Alignment for Zero-Shot Sketch-Based Image Retrieval Decheng Liu et.al. 2312.10320v1 link
2023-12-15 Sketch and shift: a robust decoder for compressive clustering Ayoub Belhadji et.al. 2312.09940v1 null
2023-12-15 Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception Xiao Wang et.al. 2312.09812v1 link
2023-12-14 Matching Noisy Keys for Obfuscation Charlie Dickens et.al. 2312.08981v1 null
2023-12-14 Solving Dense Linear Systems Faster than via Preconditioning Michał Dereziński et.al. 2312.08893v1 null
2023-12-13 Enhance Sketch Recognition’s Explainability via Semantic Component-Level Parsing Guangming Zhu et.al. 2312.07875v1 link
2023-12-12 Improved Frequency Estimation Algorithms with and without Predictions Anders Aamand et.al. 2312.07535v1 null
2023-12-09 BARET : Balanced Attention based Real image Editing driven by Target-text Inversion Yuming Qiao et.al. 2312.05482v1 null
2023-12-07 Optimal Multi-Pass Lower Bounds for MST in Dynamic Streams Sepehr Assadi et.al. 2312.04674v1 null
2023-12-07 Deep3DSketch: 3D modeling from Free-hand Sketches with View- and Structural-Aware Adversarial Training Tianrun Chen et.al. 2312.04435v1 null
2023-12-07 DemoCaricature: Democratising Caricature Generation with a Rough Sketch Dar-Yen Chen et.al. 2312.04364v1 null
2023-12-07 Doodle Your 3D: From Abstract Freehand Sketches to Precise 3D Shapes Hmrishav Bandyopadhyay et.al. 2312.04043v1 null
2023-12-06 CAFE: Towards Compact, Adaptive, and Fast Embedding for Large-scale Recommendation Models Hailin Zhang et.al. 2312.03256v1 link
2023-12-05 SEVA: Leveraging sketches to evaluate alignment between human and machine visual abstraction Kushin Mukherjee et.al. 2312.03035v1 link
2023-12-08 FreestyleRet: Retrieving Images from Style-Diversified Queries Hao Li et.al. 2312.02428v2 link
2023-12-04 CLIPDrawX: Primitive-based Explanations for Text Guided Sketch Synthesis Nityanand Mathur et.al. 2312.02345v1 null
2023-12-03 Uncertainty-biased molecular dynamics for learning uniformly accurate interatomic potentials Viktor Zaverkin et.al. 2312.01416v1 null
2023-11-30 Sketch Input Method Editor: A Comprehensive Dataset and Methodology for Systematic Input Recognition Guangming Zhu et.al. 2311.18254v1 link
2023-11-29 Analyzing Query Optimizer Performance in the Presence and Absence of Cardinality Estimates Asoke Datta et.al. 2311.17293v1 null
2023-11-28 Time- and Communication-Efficient Overlay Network Construction via Gossip Fabien Dufoulon et.al. 2311.17115v1 null
2023-11-28 SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models Yuwei Guo et.al. 2311.16933v1 null
2023-11-28 ContextSeg: Sketch Semantic Segmentation by Querying the Context with Attention Jiawei Wang et.al. 2311.16682v1 null
2023-11-28 Text-Driven Image Editing via Learnable Regions Yuanze Lin et.al. 2311.16432v1 link
2023-11-27 MAST: Model-Agnostic Sparsified Training Yury Demidovich et.al. 2311.16086v1 link
2023-11-26 Sketch Video Synthesis Yudian Zheng et.al. 2311.15306v1 link
2023-11-25 A unified framework for learning with nonlinear model classes from arbitrary linear samples Ben Adcock et.al. 2311.14886v1 null
2023-11-24 Data-to-Text Bilingual Generation Guy Lapalme et.al. 2311.14808v1 link
2023-11-24 One Pass Streaming Algorithm for Super Long Token Attention Approximation in Sublinear Space Raghav Addanki et.al. 2311.14652v1 null
2023-11-21 Breathing Life Into Sketches Using Text-to-Video Priors Rinon Gal et.al. 2311.13608v1 null
2023-11-22 Adaptive Sampling for Deep Learning via Efficient Nonparametric Proxies Shabnam Daghaghi et.al. 2311.13583v1 null
2023-11-21 From Concept to Manufacturing: Evaluating Vision-Language Models for Engineering Design Cyril Picard et.al. 2311.12668v1 null
2023-11-19 AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort Wen Wang et.al. 2311.11243v1 null
2023-11-17 Scaling TabPFN: Sketching and Feature Selection for Tabular Prior-Data Fitted Networks Benjamin Feuer et.al. 2311.10609v1 null
2023-11-09 Chain of Images for Intuitively Reasoning Fanxu Meng et.al. 2311.09241v1 link
2023-11-14 Language and Sketching: An LLM-driven Interactive Multimodal Multitask Robot Navigation Framework Weiqin Zu et.al. 2311.08244v1 null
2023-11-13 Fast and Space-Efficient Parallel Algorithms for Influence Maximization Letong Wang et.al. 2311.07554v1 link
2023-11-13 Sketch-based Video Object Segmentation: Benchmark and Analysis Ruolin Yang et.al. 2311.07261v1 null
2023-11-09 General Policies, Subgoal Structure, and Planning Width Blai Bonet et.al. 2311.05490v1 null
2023-11-09 Control3D: Towards Controllable Text-to-3D Generation Yang Chen et.al. 2311.05461v1 null
2023-11-08 Prompt Sketching for Large Language Models Luca Beurer-Kellner et.al. 2311.04954v1 null
2023-11-07 DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding Kehinde Ajayi et.al. 2311.04098v1 link
2023-11-06 Sketching methods with small window guarantee using minimum decycling sets Guillaume Marçais et.al. 2311.03592v1 link
2023-11-05 Sketching Multidimensional Time Series for Fast Discord Mining Chin-Chia Michael Yeh et.al. 2311.03393v1 null
2023-11-03 Neural Collage Transfer: Artistic Reconstruction via Material Manipulation Ganghun Lee et.al. 2311.02202v1 link
2023-11-06 RT-Trajectory: Robotic Task Generalization via Hindsight Trajectory Sketches Jiayuan Gu et.al. 2311.01977v2 null
2023-11-03 Hardness of Low Rank Approximation of Entrywise Transformed Matrix Products Tamas Sarlos et.al. 2311.01960v1 null
2023-11-03 Towards Concept-Aware Large Language Models Chen Shani et.al. 2311.01866v1 link
2023-11-07 inkn’hue: Enhancing Manga Colorization from Multiple Priors with Alignment Multi-Encoder VAE Tawin Jiramahapokee et.al. 2311.01804v2 link
2023-10-31 Progress and outlook on advanced fly scans based on Mamba Peng-Cheng Li et.al. 2310.20106v1 link
2023-10-30 The Expressibility of Polynomial based Attention Scheme Zhao Song et.al. 2310.20051v1 null
2023-10-29 Sketching Algorithms for Sparse Dictionary Learning: PTAS and Turnstile Streaming Gregory Dexter et.al. 2310.19068v1 null
2023-10-29 Customize StyleGAN with One Hand Sketch Shaocong Zhang et.al. 2310.18949v1 null
2023-10-28 Deep3DSketch+: Obtaining Customized 3D Model by Single Free-Hand Sketch through Deep Learning Ying Zang et.al. 2310.18609v1 null
2023-10-27 Deep3DSketch++: High-Fidelity 3D Modeling from Single Free-hand Sketches Ying Zang et.al. 2310.18178v1 null
2023-10-27 Reality3DSketch: Rapid 3D Modeling of Objects from Single Freehand Sketches Tianrun Chen et.al. 2310.18148v1 null
2023-10-27 On General Language Understanding David Schlangen et.al. 2310.18038v1 null
2023-10-27 Sketching and Streaming for Dictionary Compression Ruben Becker et.al. 2310.17980v1 null
2023-10-26 Skill-Mix: a Flexible and Expandable Family of Evaluations for AI models Dingli Yu et.al. 2310.17567v1 null
2023-10-24 Emergent Communication in Interactive Sketch Question Answering Zixing Lei et.al. 2310.15597v1 link
2023-10-24 Fast multiplication of random dense matrices with fixed sparse matrices Tianyu Liang et.al. 2310.15419v1 link
2023-10-18 A Comprehensive Survey on Vector Database: Storage and Retrieval Technique, Challenge Yikun Han et.al. 2310.11703v1 null
2023-10-17 Matrix Compression via Randomized Low Rank and Low Precision Factorization Rajarshi Saha et.al. 2310.11028v1 link
2023-10-16 HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending Tianyi Wei et.al. 2310.10651v1 link
2023-10-16 Visual Data-Type Understanding does not emerge from Scaling Vision-Language Models Vishaal Udandarao et.al. 2310.08577v2 link
2023-10-12 Visualizing a Nondeterministic to Deterministic Finite-State Machine Transformation Tijana Minic et.al. 2310.08248v1 link
2023-10-11 On $(1+\varepsilon)$ -Approximate Flow Sparsifiers Yu Chen et.al. 2310.07857v1 null
2023-10-10 SketchBodyNet: A Sketch-Driven Multi-faceted Decoder Network for 3D Human Reconstruction Fei Wang et.al. 2310.06577v1 link
2023-10-15 HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation Yaosen Chen et.al. 2310.05720v3 link
2023-10-09 Logic-guided Deep Reinforcement Learning for Stock Trading Zhiming Li et.al. 2310.05551v1 null
2023-10-08 Transforming Pixels into a Masterpiece: AI-Powered Art Restoration using a Novel Distributed Denoising CNN (DDCNN) Sankar B. et.al. 2310.05270v1 null
2023-10-06 Hanging in there: Prenatal origins of antigravity homeostasis in humans Nicholas M. Wilkinson et.al. 2310.04168v1 null
2023-10-06 Deterministic Clustering in High Dimensional Spaces: Sketches and Approximation Vincent Cohen-Addad et.al. 2310.04076v1 null
2023-10-05 Matrix Completion from One-Bit Dither Samples Arian Eamaz et.al. 2310.03224v1 null
2023-10-04 Streaming Euclidean $k$-median and $k$-means with $o(\log n)$ Space Vincent Cohen-Addad et.al. 2310.02882v1 null
2023-10-04 On the tilt of the Earth’s polar axis ( $κλιμα$ ): Some ‘impressionist’ remarks V. Courtillot et.al. 2310.02768v1 null
2023-10-03 View-Independent Adjoint Light Tracing for Lighting Design Optimization Lukas Lipp et.al. 2310.02043v1 null
2023-10-03 Randomized Dimension Reduction with Statistical Guarantees Yijun Dong et.al. 2310.01739v1 null
2023-10-02 PolySketchFormer: Fast Transformers via Sketches for Polynomial Kernels Praneeth Kacham et.al. 2310.01655v1 null
2023-09-29 Toward Operationalizing Pipeline-aware ML Fairness: A Research Agenda for Developing Practical Guidelines and Tools Emily Black et.al. 2309.17337v1 null
2023-09-28 Sketch2CADScript: 3D Scene Reconstruction from 2D Sketch using Visual Transformer and Rhino Grasshopper Hong-Bin Yang et.al. 2309.16850v1 null
2023-09-28 Multi-Modal Financial Time-Series Retrieval Through Latent Space Projections Tom Bamford et.al. 2309.16741v1 null
2023-09-28 Language models in molecular discovery Nikita Janakarajan et.al. 2309.16235v1 null
2023-10-01 Sampling Methods for Inner Product Sketching Majid Daliri et.al. 2309.16157v2 link
2023-09-27 Fast Locality Sensitive Hashing with Theoretical Guarantee Zongyuan Tan et.al. 2309.15479v1 null
2023-09-25 Guess & Sketch: Language Model Guided Transpilation Celine Lee et.al. 2309.14396v1 null
2023-09-22 Deep3DSketch+: Rapid 3D Modeling from Single Free-hand Sketches Tianrun Chen et.al. 2309.13006v1 null
2023-09-22 Visualization According to Statisticians: An Interview Study on the Role of Visualization for Inferential Statistics Eric Newburger et.al. 2309.12684v1 null
2023-09-22 Towards medhub: A Self-Service Platform for Analysts and Physicians Markus Höhn et.al. 2309.11234v2 null
2023-09-20 An Empirical Study of Malicious Code In PyPI Ecosystem Wenbo Guo et.al. 2309.11021v1 link
2023-09-19 An overview of some mathematical techniques and problems linking 3D vision to 3D printing Emiliano Cristiani et.al. 2309.10549v1 null
2023-09-19 Learning Orbitally Stable Systems for Diagrammatically Teaching Weiming Zhi et.al. 2309.10298v1 null
2023-09-18 Completeness Thresholds for Memory Safety: Unbounded Guarantees via Bounded Proofs (Extended Abstract) Tobias Reinhard et.al. 2309.09731v1 null
2023-09-18 Applying Security Testing Techniques to Automotive Engineering Irdin Pekaric et.al. 2309.09647v1 null
2023-09-15 Active Learning for Fine-Grained Sketch-Based Image Retrieval Himanshu Thakur et.al. 2309.08743v1 null
2023-09-15 Beyond Domain Gap: Exploiting Subjectivity in Sketch-Based Person Retrieval Kejun Lin et.al. 2309.08372v1 link
2023-09-14 Landscape-Sketch-Step: An AI/ML-Based Metaheuristic for Surrogate Optimization Problems Rafael Monteiro et.al. 2309.07936v1 link
2023-09-12 Grounded Language Acquisition From Object and Action Imagery James Robert Kubricht et.al. 2309.06335v1 null
2023-09-12 OmniSketch: Efficient Multi-Dimensional High-Velocity Stream Analytics with Arbitrary Predicates Wieger R. Punter et.al. 2309.06051v1 null
2023-09-12 GA-Sketching: Shape Modeling from Multi-View Sketching with Geometry-Aligned Deep Implicit Functions Jie Zhou et.al. 2309.05946v1 link
2023-09-11 Photodetachment dynamics using nonlocal dicrete-state-in-continuum model Martin Čížek et.al. 2309.05830v1 null
2023-09-10 Streaming Semidefinite Programs: $O(\sqrt{n})$ Passes, Small Space and Fast Runtime Zhao Song et.al. 2309.05135v1 null
2023-09-08 Receiving an algorithmic recommendation based on documentary filmmaking techniques Samuel Gantier et.al. 2309.04184v1 null
2023-09-07 Learning from Demonstration via Probabilistic Diagrammatic Teaching Weiming Zhi et.al. 2309.03835v1 null
2023-09-07 Adjacency Sketches in Adversarial Environments Moni Naor et.al. 2309.03728v1 null
2023-09-06 An Evaluation of Software Sketches Roy Friedman et.al. 2309.03045v1 null
2023-09-03 Business Process Text Sketch Automation Generation Using Large Language Model Rui Zhu et.al. 2309.01071v1 null
2023-09-02 Online Adaptive Mahalanobis Distance Estimation Lianke Qin et.al. 2309.01030v1 null
2023-09-01 Randomized Polar Codes for Anytime Distributed Machine Learning Burak Bartan et.al. 2309.00682v1 null
2023-09-01 Human-Inspired Facial Sketch Synthesis with Dynamic Adaptation Fei Gao et.al. 2309.00216v1 link
2023-08-31 Terrain Diffusion Network: Climatic-Aware Terrain Generation with Geological Sketch Guidance Zexin Hu et.al. 2308.16725v1 null
2023-08-30 Surrogate-based Autotuning for Randomized Sketching Algorithms in Regression Problems Younghyun Cho et.al. 2308.15720v1 null
2023-08-27 SketchDreamer: Interactive Text-Augmented Creative Sketch Ideation Zhiyu Qu et.al. 2308.14191v1 link
2023-08-25 WorldSmith: Iterative and Expressive Prompting for World Building with a Generative AI Hai Dang et.al. 2308.13355v1 null
2023-08-25 Bridging the Gap: Fine-to-Coarse Sketch Interpolation Network for High-Quality Animation Sketch Inbetweening Jiaming Shen et.al. 2308.13273v1 null
2023-08-21 Geo-Sketcher: Rapid 3D Geological Modeling using Geological and Topographic Map Sketches Ronan Amorim et.al. 2308.12152v1 null
2023-08-24 Bayesian Learning for Dynamic Target Localization with Human-provided Spatial Information Min-Won Seo et.al. 2308.11839v2 null
2023-08-22 MatFuse: Controllable Material Generation with Diffusion Models Giuseppe Vecchio et.al. 2308.11408v1 link
2023-08-22 Minwise-Independent Permutations with Insertion and Deletion of Features Rameshwar Pratap et.al. 2308.11240v1 null
2023-08-28 Large Language Models for Software Engineering: A Systematic Literature Review Xinyi Hou et.al. 2308.10620v2 null
2023-08-16 Freedom of Speech and AI Output Eugene Volokh et.al. 2308.08673v1 null
2023-08-16 Painter: Teaching Auto-regressive Language Models to Draw Sketches Reza Pourreza et.al. 2308.08520v1 null
2023-08-15 Inversion-by-Inversion: Exemplar-based Sketch-to-Photo Synthesis via Stochastic Differential Equations without Training Ximing Xing et.al. 2308.07665v1 link
2023-08-11 Masked-Attention Diffusion Guidance for Spatially Controlling Text-to-Image Generation Yuki Endo et.al. 2308.06027v1 link
2023-08-11 Uncertainty-Aware Cross-Modal Transfer Network for Sketch-Based 3D Shape Retrieval Yiyang Cai et.al. 2308.05948v1 null
2023-08-20 The Fast and the Private: Task-based Dataset Search Zezhou Huang et.al. 2308.05637v2 null
2023-08-12 LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation Leigang Qu et.al. 2308.05095v2 null
2023-08-10 Apple Vision Pro for Healthcare: “The Ultimate Display”? – Entering the Wonderland of Precision Jan Egger et.al. 2308.04313v3 null
2023-08-08 Iterative Sketching for Secure Coded Regression Neophytos Charalambides et.al. 2308.04185v1 null
2023-08-06 Gradient Coding through Iterative Block Leverage Score Sampling Neophytos Charalambides et.al. 2308.03096v1 null
2023-08-05 Sketch and Text Guided Diffusion Model for Colored Point Cloud Generation Zijie Wu et.al. 2308.02874v1 null
2023-08-07 SoK: The Ghost Trilemma S. Mukherjee et.al. 2308.02202v2 null
2023-08-07 BEVControl: Accurately Controlling Street-view Elements with Multi-perspective Consistency via BEV Sketch Layout Kairui Yang et.al. 2308.01661v3 null
2023-08-03 PPI-NET: End-to-End Parametric Primitive Inference Liang Wang et.al. 2308.01521v1 null
2023-08-01 Neural approximation of Wasserstein distance via a universal architecture for symmetric and factorwise group invariant functions Samantha Chen et.al. 2308.00273v1 null
2023-08-01 CONSTRUCT: A Program Synthesis Approach for Reconstructing Control Algorithms from Embedded System Binaries in Cyber-Physical Systems Ali Shokri et.al. 2308.00250v1 null
2023-07-30 RealityCanvas: Augmented Reality Sketching for Embedded and Responsive Scribble Animation Effects Zhijie Xia et.al. 2307.16116v1 link
2023-07-25 Federated Heavy Hitter Recovery under Linear Sketching Adria Gascon et.al. 2307.13347v1 null
2023-07-24 Learning Dense Correspondences between Photos and Sketches Xuanchen Lu et.al. 2307.12967v1 null
2023-07-18 Semi-supervised Cycle-GAN for face photo-sketch translation in the wild Chaofeng Chen et.al. 2307.10281v1 null
2023-07-14 Volumetric Wireframe Parsing from Neural Attraction Fields Nan Xue et.al. 2307.10206v1 link
2023-07-17 Multi-Domain Learning with Modulation Adapters Ekaterina Iakovleva et.al. 2307.08528v1 null
2023-07-16 InkSight: Leveraging Sketch Interaction for Documenting Chart Findings in Computational Notebooks Yanna Lin et.al. 2307.07922v1 null
2023-07-13 Connectivity Labeling for Multiple Vertex Failures Merav Parter et.al. 2307.06276v2 null
2023-07-10 Some Preliminary Steps Towards Metaverse Logic Antonio L. Furtado et.al. 2307.05574v1 null
2023-07-11 A “Game of Like” : Online Social Network Sharing As Strategic Interaction Emmanuel J. Genot et.al. 2307.05063v1 null
2023-07-11 Diffusion idea exploration for art generation Nikhil Verma et.al. 2307.04978v1 null
2023-07-08 Sketch-A-Shape: Zero-Shot Sketch-to-3D Shape Generation Aditya Sanghi et.al. 2307.03869v1 null
2023-07-06 Wireless Multi-Agent Generative AI: From Connected Intelligence to Collective Intelligence Hang Zou et.al. 2307.02757v1 null
2023-07-04 Text + Sketch: Image Compression at Ultra Low Rates Eric Lei et.al. 2307.01944v1 link
2023-07-03 Digital Twin-Empowered Communications: A New Frontier of Wireless Networks Lina Bariah et.al. 2307.00973v1 null
2023-07-04 SketchMetaFace: A Learning-based Sketching Interface for High-fidelity 3D Character Face Modeling Zhongjin Luo et.al. 2307.00804v2 null
2023-06-27 Cartesian institutions with evidence: Data and system modelling with diagrammatic constraints and generalized sketches Zinovy Diskin et.al. 2306.16284v1 null
2023-06-26 Towards Optimal Effective Resistance Estimation Rajat Vadiraj Dwaraknath et.al. 2306.14820v1 null
2023-06-26 DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models Ximing Xing et.al. 2306.14685v1 link
2023-06-25 ALBUS: a Probabilistic Monitoring Algorithm to Counter Burst-Flood Attacks Simon Scherrer et.al. 2306.14328v1 null
2023-06-24 Full Automation of Goal-driven LLM Dialog Threads with And-Or Recursors and Refiner Oracles Paul Tarau et.al. 2306.14077v1 link
2023-06-21 PrivSketch: A Private Sketch-based Frequency Estimation Protocol for Data Streams Ying Li et.al. 2306.12144v1 null
2023-06-20 Computing a human-like reaction time metric from stable recurrent vision models Lore Goetschalckx et.al. 2306.11582v1 null
2023-06-23 3D VR Sketch Guided 3D Shape Prototyping and Exploration Ling Luo et.al. 2306.10830v2 link
2023-06-19 Shape Guided Gradient Voting for Domain Generalization Jiaqi Xu et.al. 2306.10809v1 null
2023-06-15 Private Federated Frequency Estimation: Adapting to the Hardness of the Instance Jingfeng Wu et.al. 2306.09396v1 null
2023-06-15 Conditional Human Sketch Synthesis with Explicit Abstraction Control Dar-Yen Chen et.al. 2306.09274v1 null
2023-06-15 Behaviorally Typed State Machines in TypeScript for Heterogeneous Swarms Roland Kuhn et.al. 2306.09068v1 link
2023-06-15 Interleaving Pre-Trained Language Models and Large Language Models for Zero-Shot NL2SQL Generation Zihui Gu et.al. 2306.08891v1 link
2023-06-14 Zero-Shot 3D Shape Sketch View Similarity and Retrieval Gianluca Berardi et.al. 2306.08541v1 null
2023-06-14 Probing the unfolded configurations of a $β$ -hairpin using sketch-map Albert Ardevol et.al. 2306.08429v1 null
2023-06-14 CLIPXPlore: Coupled CLIP and Shape Spaces for 3D Shape Exploration Jingyu Hu et.al. 2306.08226v1 null
2023-06-13 AniFaceDrawing: Anime Portrait Exploration during Your Sketching Zhengyu Huang et.al. 2306.07476v1 null
2023-06-15 Strokes2Surface: Recovering Curve Networks From 4D Architectural Design Sketches S. Rasoulzadeh et.al. 2306.07220v2 link
2023-06-11 Learning the Positions in CountSketch Yi Li et.al. 2306.06611v1 null
2023-06-09 SENS: Sketch-based Implicit Neural Shape Modeling Alexandre Binninger et.al. 2306.06088v1 null
2023-06-09 Sketch2Stress: Sketching with Structural Stress Awareness Deng Yu et.al. 2306.05911v1 null
2023-06-09 Sketch Beautification: Learning Part Beautification and Structure Refinement for Sketches of Man-made Objects Deng Yu et.al. 2306.05832v1 null
2023-06-05 Tracking Evolving labels using Cone based Oracles Aditya Acharya et.al. 2306.03306v1 null
2023-06-09 Explicit Construction of q-ary 2-deletion Correcting Codes with Low Redundancy Shu Liu et.al. 2306.02868v2 null
2023-06-06 VideoComposer: Compositional Video Synthesis with Motion Controllability Xiang Wang et.al. 2306.02018v2 null
2023-06-07 Cross Modal Data Discovery over Structured and Unstructured Data Lakes Mohamed Y. Eltabakh et.al. 2306.00932v2 link
2023-06-01 Towards Interactive Image Inpainting via Sketch Refinement Chang Liu et.al. 2306.00407v1 link
2023-06-01 Faster Robust Tensor Power Method for Arbitrary Order Yichuan Deng et.al. 2306.00406v1 null
2023-05-31 Knowledge Base Question Answering for Space Debris Queries Paul Darm et.al. 2305.19734v1 link
2023-05-30 A Recipe for Efficient SBIR Models: Combining Relative Triplet Loss with Batch Normalization and Knowledge Distillation Omar Seddati et.al. 2305.18988v1 null
2023-05-30 DiffSketching: Sketch Control Image Synthesis with Diffusion Models Qiang Wang et.al. 2305.18812v1 link
2023-05-30 Generalization Bounds for Magnitude-Based Pruning via Sparse Matrix Sketching Etash Kumar Guha et.al. 2305.18789v1 null
2023-05-29 Controllable Text-to-Image Generation with GPT-4 Tianjun Zhang et.al. 2305.18583v1 null
2023-05-29 ANPL: Compiling Natural Programs with Interactive Decomposition Di Huang et.al. 2305.18498v1 link
2023-05-30 TaleCrafter: Interactive Story Visualization with Multiple Characters Yuan Gong et.al. 2305.18247v2 link
2023-05-27 Pruning at Initialization – A Sketching Perspective Noga Bar et.al. 2305.17559v1 null
2023-05-27 On the Noise Sensitivity of the Randomized SVD Elad Romanov et.al. 2305.17435v1 link
2023-05-26 BIG-C: a Multimodal Multi-Purpose Dataset for Bemba Claytone Sikasote et.al. 2305.17202v1 link
2023-05-26 CARAMEL: A Succinct Read-Only Lookup Table via Compressed Static Functions Benjamin Coleman et.al. 2305.16545v1 null
2023-05-25 SketchOGD: Memory-Efficient Continual Learning Benjamin Wright et.al. 2305.16424v1 link
2023-05-24 DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models Sungnyun Kim et.al. 2305.15194v1 link
2023-05-23 Distributed CONGEST Algorithms against Mobile Adversaries Orr Fischer et.al. 2305.14300v1 null
2023-05-19 MaGIC: Multi-modality Guided Image Completion Yongsheng Yu et.al. 2305.11818v1 null
2023-05-19 MIDI-Draw: Sketching to Control Melody Generation Tashi Namgyal et.al. 2305.11605v1 null
2023-05-17 Data Extraction via Semantic Regular Expression Synthesis Qiaochu Chen et.al. 2305.10401v1 null
2023-05-15 Scalable and Robust Tensor Ring Decomposition for Large-scale Data Yicong He et.al. 2305.09044v1 null
2023-05-15 Validity Constraints for Data Analysis Workflows Florian Schintke et.al. 2305.08409v1 null
2023-05-15 Fast and Efficient Matching Algorithm with Deadline Instances Zhao Song et.al. 2305.08353v1 null
2023-05-15 Approximation and Progressive Display of Multiverse Analyses Yang Liu et.al. 2305.08323v1 null
2023-05-11 Enabling Programming Thinking in Large Language Models Toward Code Generation Jia Li et.al. 2305.06599v1 null
2023-05-12 Searching Mobile App Screens via Text + Doodle Soumik Mohian et.al. 2305.06165v2 link
2023-05-10 Sketching the Future (STF): Applying Conditional Control Techniques to Text-to-Video Models Rohan Dhesikan et.al. 2305.05845v1 link
2023-05-09 Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval Shiyin Dong et.al. 2305.05144v1 null
2023-05-08 Behavioural Types for Local-First Software Roland Kuhn et.al. 2305.04848v1 null
2023-05-09 Locally Attentional SDF Diffusion for Controllable 3D Shape Generation Xin-Yang Zheng et.al. 2305.04461v2 null
2023-05-08 Oblivious algorithms for the Max- $k$ AND Problem Noah G. Singer et.al. 2305.04438v1 null
2023-05-05 Towards Feminist Intersectional XAI: From Explainability to Response-Ability Goda Klumbyte et.al. 2305.03375v1 null
2023-05-04 Program Synthesis for Robot Learning from Demonstrations Noah Patton et.al. 2305.03129v1 null
2023-05-04 HAISTA-NET: Human Assisted Instance Segmentation Through Attention Muhammed Korkmaz et.al. 2305.03105v1 null
2023-05-04 Controllable Visual-Tactile Synthesis Ruihan Gao et.al. 2305.03051v1 link
2023-05-02 A Survey of Methods for Converting Unstructured Data to CSG Models Pierre-Alain Fayolle et.al. 2305.01220v1 null
2023-05-01 IndoorSim-to-OutdoorReal: Learning to Navigate Outdoors without any Outdoor Experience Joanne Truong et.al. 2305.01098v1 null
2023-05-01 Design and Evaluation of a Bioinspired Tendon-Driven 3D-Printed Robotic Eye with Active Vision Capabilities Hamid Osooli et.al. 2305.01076v1 link
2023-05-01 semantic neural model approach for face recognition from sketch Chandana Navuluri et.al. 2305.01058v1 null
2023-04-25 Bridging graph data models: RDF, RDF-star, and property graphs as directed acyclic graphs Ewout Gelling et.al. 2304.13097v1 link
2023-04-25 DualSlide: Global-to-Local Sketching Interface for Slide Content and Layout Design Jiahao Weng et.al. 2304.12506v1 null
2023-04-23 SketchXAI: A First Look at Explainability for Human Sketches Zhiyu Qu et.al. 2304.11744v1 null
2023-04-22 (Vector) Space is Not the Final Frontier: Product Search as Program Synthesis Jacopo Tagliabue et.al. 2304.11473v1 null
2023-04-21 The centaur programmer – How Kasparov’s Advanced Chess spans over to the software development of the future Pedro Alves et.al. 2304.11172v1 null
2023-04-19 StyleDEM: a Versatile Model for Authoring Terrains Simon Perche et.al. 2304.09626v1 null
2023-04-19 Sensitivity estimation for differentially private query processing Meifan Zhang et.al. 2304.09546v1 null
2023-04-19 A Protocol for Cast-as-Intended Verifiability with a Second Device Johannes Müller et.al. 2304.09456v1 null
2023-04-18 Optimal Eigenvalue Approximation via Sketching William Swartworth et.al. 2304.09281v1 null
2023-04-18 GUILGET: GUI Layout GEneration with Transformer Andrey Sobolevsky et.al. 2304.09012v1 link
2023-04-18 Coefficient Synthesis for Threshold Automata A. R. Balasubramanian et.al. 2304.08917v1 null
2023-04-18 Online fair division with arbitrary entitlements Kushagra Chatterjee et.al. 2304.08864v1 null
2023-04-17 Learning Geometry-aware Representations by Sketching Hyundo Lee et.al. 2304.08204v1 null
2023-04-15 Learned Interpolation for Better Streaming Quantile Approximation with Worst-Case Guarantees Nicholas Schiefer et.al. 2304.07652v1 null
2023-04-15 Remembering Ludwig Dmitrievich Faddeev, our lifelong partner in mathematical physics Daniel Sternheimer et.al. 2304.07577v1 null
2023-04-14 Pool Inference Attacks on Local Differential Privacy: Quantifying the Privacy Guarantees of Apple’s Count Mean Sketch in Practice Andrea Gadotti et.al. 2304.07134v1 null
2023-04-14 On deterministic, constant memory triangular searches on the integer lattice J. Alfredo Cruz-Carlon et.al. 2304.07033v1 null
2023-04-13 Learning Controllable 3D Diffusion Models from Single-view Images Jiatao Gu et.al. 2304.06700v1 null
2023-04-13 On streaming approximation algorithms for constraint satisfaction problems Noah G. Singer et.al. 2304.06664v1 null
2023-04-13 Solving Tensor Low Cycle Rank Approximation Yichuan Deng et.al. 2304.06594v1 null
2023-04-12 TextANIMAR: Text-based 3D Animal Fine-Grained Retrieval Trung-Nghia Le et.al. 2304.06053v1 null
2023-04-12 SketchANIMAR: Sketch-based 3D Animal Fine-Grained Retrieval Trung-Nghia Le et.al. 2304.05731v1 null
2023-04-10 Identity-Guided Collaborative Learning for Cloth-Changing Person Reidentification Zan Gao et.al. 2304.04400v1 null
2023-04-09 On Extend-Only Directed Posets and Derived Byzantine-Tolerant Replicated Data Types (Extended Version) Florian Jacob et.al. 2304.04318v1 null
2023-04-07 ChiroDiff: Modelling chirographic data with Diffusion Models Ayan Das et.al. 2304.03785v1 null
2023-04-06 SketchFFusion: Sketch-guided image editing with diffusion model Weihang Mao et.al. 2304.03174v1 null
2023-04-06 LSketch: A Label-Enabled Graph Stream Sketch Toward Time-Sensitive Queries Yiling Zeng et.al. 2304.02897v1 null
2023-04-05 Tracing and Visualizing Human-ML/AI Collaborative Processes through Artifacts of Data Work Jennifer Rogers and et.al. 2304.02699v1 null
2023-04-05 Beyond Summarization: Designing AI Support for Real-World Expository Writing Tasks Zejiang Shen et.al. 2304.02623v1 null
2023-04-05 Optimal Sketching Bounds for Sparse Linear Regression Tung Mai et.al. 2304.02261v1 null
2023-04-05 LogoNet: a fine-grained network for instance-level logo sketch retrieval Binbin Feng et.al. 2304.02214v1 link
2023-04-04 Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing Alberto Baldrati et.al. 2304.02051v1 link
2023-04-02 Sketch-based Video Object Localization Sangmin Woo et.al. 2304.00450v1 link
2023-03-31 Almost Linear Constant-Factor Sketching for $\ell_1$ and Logistic Regression Alexander Munteanu et.al. 2304.00051v1 link
2023-03-30 If At First You Don’t Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval Finlay G. C. Hudson et.al. 2303.17703v1 null
2023-03-30 Methods and advancement of content-based fashion image retrieval: A Review Amin Muhammad Shoib et.al. 2303.17371v1 null
2023-03-29 Sketch-an-Anchor: Sub-epoch Fast Model Adaptation for Zero-shot Sketch-based Image Retrieval Leo Sampaio Ferraz Ribeiro et.al. 2303.16769v1 null
2023-03-28 Visual Chain-of-Thought Diffusion Models William Harvey et.al. 2303.16187v1 link
2023-03-27 What Can Human Sketches Do for Object Detection? Pinaki Nath Chowdhury et.al. 2303.15149v1 null
2023-03-25 Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style Fengyin Lin et.al. 2303.14348v1 link
2023-03-24 Feature Space Sketching for Logistic Regression Gregory Dexter et.al. 2303.14284v1 null
2023-03-24 Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR Aneeshan Sain et.al. 2303.13779v1 null
2023-03-24 The First Computer Program Raúl Rojas et.al. 2303.13740v1 null
2023-03-28 CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not Aneeshan Sain et.al. 2303.13440v3 null
2023-03-23 Defining Quality Requirements for a Trustworthy AI Wildflower Monitoring Platform Petra Heck et.al. 2303.13151v1 null
2023-03-22 Evaluation of Sketch-Based and Semantic-Based Modalities for Mockup Generation Tommaso Calò et.al. 2303.12709v1 null
2023-03-22 An Extended Study of Human-like Behavior under Adversarial Training Paul Gavrikov et.al. 2303.12669v1 link
2023-03-24 RaBit: Parametric Modeling of 3D Biped Cartoon Characters with a Topological-consistent Dataset Zhongjin Luo et.al. 2303.12564v2 null
2023-03-21 Roots and Requirements for Collaborative AI Mark Stefik et.al. 2303.12040v1 null
2023-03-23 Sketch2Saliency: Learning to Detect Salient Objects from Human Drawings Ayan Kumar Bhunia et.al. 2303.11502v2 null
2023-03-20 Automatic Measures for Evaluating Generative Design Methods for Architects Eric Yeh et.al. 2303.11483v1 null
2023-03-20 Picture that Sketch: Photorealistic Image Generation from Abstract Sketches Subhadeep Koley et.al. 2303.11162v1 null
2023-03-20 On the Maximal Independent Sets of $k$ -mers with the Edit Distance Leran Ma et.al. 2303.10926v1 link
2023-03-19 SKED: Sketch-guided Text-based 3D Editing Aryan Mikaeili et.al. 2303.10735v1 null
2023-03-19 Trainable Projected Gradient Method for Robust Fine-tuning Junjiao Tian et.al. 2303.10720v1 link
2023-03-19 EduVis: Workshop on Visualization Education, Literacy, and Activities Mandy Keck et.al. 2303.10708v1 null
2023-03-19 SECAD-Net: Self-Supervised CAD Reconstruction by Learning Sketch-Extrude Operations Pu Li et.al. 2303.10613v1 link
2023-03-17 PersonalTailor: Personalizing 2D Pattern Design from 3D Garment Point Clouds Anran Qi et.al. 2303.09695v1 null
2023-03-15 Query-guided Attention in Vision Transformers for Localizing Objects Using a Single Sketch Aditay Tripathi et.al. 2303.08784v1 null
2023-03-15 RIS-Enabled Smart Wireless Environments: Deployment Scenarios, Network Architecture, Bandwidth and Area of Influence George C. Alexandropoulos et.al. 2303.08505v1 null
2023-03-14 Data-Free Sketch-Based Image Retrieval Abhra Chaudhuri et.al. 2303.07775v1 link
2023-03-13 Can Workers Meaningfully Consent to Workplace Wellbeing Technologies? Shreya Chowdhary et.al. 2303.07242v1 null
2023-03-13 An Improved Sample Complexity for Rank-1 Matrix Sensing Yichuan Deng et.al. 2303.06895v1 null
2023-03-10 StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces Shuai Yang et.al. 2303.06146v1 link
2023-03-08 Sketching with Spherical Designs for Noisy Data Fitting on Spheres Shao-Bo Lin et.al. 2303.04550v1 null
2023-03-08 Models of symbol emergence in communication: a conceptual review and a guide for avoiding local minima Julian Zubek et.al. 2303.04544v1 null
2023-03-07 Introspective Cross-Attention Probing for Lightweight Transfer of Pre-trained Models Yonatan Dukler et.al. 2303.04105v1 null
2023-03-06 Data Portraits: Recording Foundation Model Training Data Marc Marone et.al. 2303.03919v1 null
2023-03-07 Sketch-based Medical Image Retrieval Kazuma Kobayashi et.al. 2303.03633v1 null
2023-03-06 Model Sketching: Centering Concepts in Early-Stage Machine Learning Model Design Michelle S. Lam et.al. 2303.02884v1 link
2023-03-05 Text2Face: A Multi-Modal 3D Face Model Will Rowan et.al. 2303.02688v1 null
2023-03-03 Graph-based Extreme Feature Selection for Multi-class Classification Tasks Shir Friedman et.al. 2303.01792v1 null
2023-03-02 Coresets for Clustering in Geometric Intersection Graphs Sayan Bandyapadhyay et.al. 2303.01400v1 null
2023-03-01 Sketch2Cloth: Sketch-based 3D Garment Generation with Unsigned Distance Fields Yi He et.al. 2303.00167v1 null
2023-02-26 Towards Human-Bot Collaborative Software Architecting with ChatGPT Aakash Ahmad et.al. 2302.14600v1 link
2023-02-28 On-the-Fly Communication-and-Computing for Distributed Tensor Decomposition Over MIMO Channels Xu Chen et.al. 2302.14297v1 null
2023-02-27 Capstone: A Capability-based Foundation for Trustless Secure Memory Access (Extended Version) Jason Zhijingcheng Yu et.al. 2302.13863v1 null
2023-02-27 Evaluation of Automatically Constructed Word Meaning Explanations Marie Stará et.al. 2302.13625v1 null
2023-02-26 Scalable Weight Reparametrization for Efficient Transfer Learning Byeonggeun Kim et.al. 2302.13435v1 null
2023-02-24 Modulating Pretrained Diffusion Models for Multimodal Image Synthesis Cusuh Ham et.al. 2302.12764v1 null
2023-02-23 Using Colors and Sketches to Count Subgraphs in a Streaming Graph Shirin Handjani et.al. 2302.12210v1 null
2023-02-24 A Scalable Space-efficient In-database Interpretability Framework for Embedding-based Semantic SQL Queries Prabhakar Kudva et.al. 2302.12178v2 null
2023-02-22 A Reference Architecture for Observability and Compliance of Cloud Native Applications William Pourmajidi et.al. 2302.11617v1 null
2023-02-20 Ontology-aware Network for Zero-shot Sketch-based Image Retrieval Haoxiang Zhang et.al. 2302.10040v1 null
2023-02-22 Composer: Creative and Controllable Image Synthesis with Composable Conditions Lianghua Huang et.al. 2302.09778v2 link
2023-02-16 Rejecting Cognitivism: Computational Phenomenology for Deep Learning Pierre Beckmann et.al. 2302.09071v1 null
2023-02-14 DiffFaceSketch: High-Fidelity Face Image Synthesis with Sketch-Guided Latent Diffusion Model Yichen Peng et.al. 2302.06908v1 link
2023-02-14 Text-Guided Scene Sketch-to-Photo Synthesis AprilPyone MaungMaung et.al. 2302.06883v1 null
2023-02-14 Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation Yasheng Sun et.al. 2302.06857v1 null
2023-02-13 SkCoder: A Sketch-based Approach for Automatic Code Generation Jia Li et.al. 2302.06144v1 link
2023-02-13 Learning to Scale Temperature in Masked Self-Attention for Image Inpainting Xiang Zhou et.al. 2302.06130v1 null
2023-02-11 An Evaluation Algorithm for Datalog with Equality Martin E. Bidlingmaier et.al. 2302.05792v1 link
2023-02-11 Sketch Less Face Image Retrieval: A New Challenge Dawei Dai et.al. 2302.05576v1 link
2023-02-10 MaskSketch: Unpaired Structure-guided Masked Image Generation Dina Bashkirova et.al. 2302.05496v1 link
2023-02-10 Count-min sketch with variable number of hash functions: an experimental study Éric Fusy et.al. 2302.05245v1 null
2023-02-10 Fast Gumbel-Max Sketch and its Applications Yuanming Zhang et.al. 2302.05176v1 null
2023-02-09 Projection-free Online Exp-concave Optimization Dan Garber et.al. 2302.04859v1 null
2023-02-09 Locally consistent decomposition of strings with applications to edit distance sketching Sudatta Bhattacharya et.al. 2302.04475v1 null
2023-02-06 Sketching Robot Programs On the Fly David Porfirio et.al. 2302.03088v1 null
2023-02-05 Leaving Reality to Imagination: Robust Classification via Generated Datasets Hritik Bansal et.al. 2302.02503v1 link
2023-02-04 An Effective and Differentially Private Protocol for Secure Distributed Cardinality Estimation Pinghui Wang et.al. 2302.02158v1 null
2023-02-04 Sketch-Flip-Merge: Mergeable Sketches for Private Distinct Counting Jonathan Hehir et.al. 2302.02056v1 null
2023-02-01 A Nearly-Optimal Bound for Fast Regression with $\ell_\infty$ Guarantee Zhao Song et.al. 2302.00248v1 null
2023-01-31 FLAME: A small language model for spreadsheet formulas Harshit Joshi et.al. 2301.13779v1 null
2023-01-30 Streaming Anomaly Detection Siddharth Bhatia et.al. 2301.13199v1 link
2023-01-29 BERT-based Authorship Attribution on the Romanian Dataset called ROST Sanda-Maria Avram et.al. 2301.12500v1 null
2023-01-26 Synesthetic Dice: Sensors, Actuators, And Mappings Albrecht Kurze et.al. 2301.11436v1 null
2023-01-26 Cut and Learn for Unsupervised Object Detection and Instance Segmentation Xudong Wang et.al. 2301.11320v1 link
2023-01-25 Reflective Artificial Intelligence Peter R. Lewis et.al. 2301.10823v1 null
2023-01-25 Distilling Text into Circuits Vincent Wang-Mascianica et.al. 2301.10595v1 null
2023-01-24 Capacity Analysis of Vector Symbolic Architectures Kenneth L. Clarkson et.al. 2301.10352v1 null
2023-01-20 Improving Sketch Colorization using Adversarial Segmentation Consistency Samet Hicsonmez et.al. 2301.08590v1 link
2023-01-19 On Finite Blocklength Lossy Source Coding Lin Zhou et.al. 2301.07871v1 null
2023-01-17 Vision Based Machine Learning Algorithms for Out-of-Distribution Generalisation Hamza Riaz et.al. 2301.06975v1 null
2023-01-17 Distribution Aligned Feature Clustering for Zero-Shot Sketch-Based Image Retrieval Yuchen Wu et.al. 2301.06685v1 null
2023-01-16 A Distributed Palette Sparsification Theorem Maxime Flin et.al. 2301.06457v1 null
2023-01-14 Weighted Minwise Hashing Beats Linear Sketching for Inner Product Estimation Aline Bessa et.al. 2301.05811v1 null
2023-01-06 Better Differentially Private Approximate Histograms and Heavy Hitters using the Misra-Gries Sketch Christian Janos Lebeda et.al. 2301.02457v1 null
2023-01-03 EQUI-VOCAL: Synthesizing Queries for Compositional Video Events from Limited User Interactions [Technical Report] Enhao Zhang et.al. 2301.00929v1 link
2023-01-17 Algorithms for Massive Data – Lecture Notes Nicola Prezza et.al. 2301.00754v2 null
2022-12-28 Modular termination verification with a higher-order concurrent separation logic (Intermediate report) Justus Fasse et.al. 2212.14126v1 null
2022-12-22 A Domain-Extensible Compiler with Controllable Automation of Optimisations Thomas Koehler et.al. 2212.12035v1 null

(<a href=#Updated-on-20240404>back to top</a>)

3D reconstruction

Publish Date Title Authors PDF Code
2024-04-03 Neural Radiance Fields with Torch Units Bingnan Ni et.al. 2404.02617v1 null
2024-04-03 TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Surrounding Autonomous Driving Scenes Cheng Zhao et.al. 2404.02410v1 null
2024-04-03 APC2Mesh: Bridging the gap from occluded building façades to full 3D models Perpetual Hope Akwensi et.al. 2404.02391v1 null
2024-04-01 Neural Implicit Representation for Building Digital Twins of Unknown Articulated Objects Yijia Weng et.al. 2404.01440v1 link
2024-04-01 NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification Juyeop Han et.al. 2404.01400v1 null
2024-04-01 FPGA-Accelerated Correspondence-free Point Cloud Registration with PointNet Features Keisuke Sugiura et.al. 2404.01237v1 null
2024-04-02 Few-shot point cloud reconstruction and denoising via learned Guassian splats renderings and fine-tuned diffusion features Pietro Bonazzi et.al. 2404.01112v2 null
2024-03-30 DiffHuman: Probabilistic Photorealistic 3D Reconstruction of Humans Akash Sengupta et.al. 2404.00485v1 null
2024-03-30 3DGSR: Implicit Surface Reconstruction with 3D Gaussian Splatting Xiaoyang Lyu et.al. 2404.00409v1 null
2024-03-29 Sparse Views, Near Light: A Practical Paradigm for Uncalibrated Point-light Photometric Stereo Mohammed Brahimi et.al. 2404.00098v1 null
2024-03-29 NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising Tianchen Deng et.al. 2403.20034v1 link
2024-03-28 CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians Avinash Paliwal et.al. 2403.19495v1 null
2024-03-30 Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction Xiaoyang Lyu et.al. 2403.19314v2 link
2024-03-28 Neural Fields for 3D Tracking of Anatomy and Surgical Instruments in Monocular Laparoscopic Video Clips Beerend G. A. Gerats et.al. 2403.19265v1 null
2024-04-01 WALT3D: Generating Realistic Training Data from Time-Lapse Imagery for Reconstructing Dynamic Objects under Occlusion Khiem Vuong et.al. 2403.19022v2 null
2024-03-29 Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction Qiuhong Shen et.al. 2403.18795v2 null
2024-03-29 SplatFace: Gaussian Splat Face Reconstruction Leveraging an Optimizable Surface Jiahao Luo et.al. 2403.18784v2 null
2024-03-27 Breaking the Limitations with Sparse Inputs by Variational Frameworks (BLIss) in Terahertz Super-Resolution 3D Reconstruction Yiyao Zhang et.al. 2403.18776v1 link
2024-03-27 SAT-NGP : Unleashing Neural Graphics Primitives for Fast Relightable Transient-Free 3D reconstruction from Satellite Imagery Camille Billouard et.al. 2403.18711v1 null
2024-03-26 EgoLifter: Open-world 3D Segmentation for Egocentric Perception Qiao Gu et.al. 2403.18118v1 null
2024-03-25 Creating a Digital Twin of Spinal Surgery: A Proof of Concept Jonas Hein et.al. 2403.16736v1 null
2024-03-25 Spike-NeRF: Neural Radiance Field Based On Spike Camera Yijia Guo et.al. 2403.16410v1 null
2024-03-25 Elite360D: Towards Efficient 360 Depth Estimation via Semantic- and Distance-Aware Bi-Projection Fusion Hao Ai et.al. 2403.16376v1 null
2024-03-24 latentSplat: Autoencoding Variational Gaussians for Fast Generalizable 3D Reconstruction Christopher Wewer et.al. 2403.16292v1 null
2024-03-23 UPNeRF: A Unified Framework for Monocular 3D Object Reconstruction and Pose Estimation Yuliang Guo et.al. 2403.15705v1 null
2024-03-22 FastCAD: Real-Time CAD Retrieval and Alignment from Scans and Videos Florian Langer et.al. 2403.15161v1 null
2024-03-22 Recent Trends in 3D Reconstruction of General Non-Rigid Scenes Raza Yunus et.al. 2403.15064v1 null
2024-03-21 Hyperspectral Neural Radiance Fields Gerry Chen et.al. 2403.14839v1 null
2024-03-21 GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation Yinghao Xu et.al. 2403.14621v1 link
2024-03-21 Isotropic Gaussian Splatting for Real-Time Radiance Field Rendering Yuanhao Gong et.al. 2403.14244v1 null
2024-03-21 Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions Jiacong Xu et.al. 2403.14053v1 null
2024-03-20 T-Pixel2Mesh: Combining Global and Local Transformer for 3D Mesh Generation from a Single Image Shijie Zhang et.al. 2403.13663v1 null
2024-03-20 MULAN-WC: Multi-Robot Localization Uncertainty-aware Active NeRF with Wireless Coordination Weiying Wang et.al. 2403.13348v1 null
2024-03-19 GVGEN: Text-to-3D Generation with Volumetric Representation Xianglong He et.al. 2403.12957v1 null
2024-03-19 PostoMETRO: Pose Token Enhanced Mesh Transformer for Robust 3D Human Mesh Recovery Wendi Yang et.al. 2403.12473v1 null
2024-03-18 LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation Yushi Lan et.al. 2403.12019v1 null
2024-03-18 GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image Xiao Fu et.al. 2403.12013v1 null
2024-03-18 SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion Vikram Voleti et.al. 2403.12008v1 null
2024-03-18 GNeRP: Gaussian-guided Neural Reconstruction of Reflective Objects with Noisy Polarization Priors LI Yang et.al. 2403.11899v1 null
2024-03-18 OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation Haochen Jiang et.al. 2403.11796v1 null
2024-03-18 Fed3DGS: Scalable 3D Gaussian Splatting with Federated Learning Teppei Suzuki et.al. 2403.11460v1 link
2024-03-18 BAGS: Building Animatable Gaussian Splatting from a Monocular Video with Diffusion Priors Tingyang Zhang et.al. 2403.11427v1 null
2024-03-17 Creating Seamless 3D Maps Using Radiance Fields Sai Tarun Sathyan et.al. 2403.11364v1 null
2024-03-17 Recent Advances in 3D Gaussian Splatting Tong Wu et.al. 2403.11134v1 null
2024-03-17 Omni-Recon: Towards General-Purpose Neural Radiance Fields for Versatile 3D Applications Yonggan Fu et.al. 2403.11131v1 null
2024-03-16 Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription Hongxiang Zhao et.al. 2403.10953v1 null
2024-03-15 SCILLA: SurfaCe Implicit Learning for Large Urban Area, a volumetric hybrid solution Hala Djeghim et.al. 2403.10344v1 null
2024-03-15 FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model Qijun Feng et.al. 2403.10242v1 null
2024-03-15 Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience Xiaohang Yu et.al. 2403.09973v1 null
2024-03-14 MARVIS: Motion & Geometry Aware Real and Virtual Image Segmentation Jiayi Wu et.al. 2403.09850v1 link
2024-03-14 Relaxing Accurate Initialization Constraint for 3D Gaussian Splatting Jaewoo Jung et.al. 2403.09413v1 link
2024-03-13 3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surface Linyi Jin et.al. 2403.08768v1 null
2024-03-13 Refractive COLMAP: Refractive Structure-from-Motion Revisited Mengkun She et.al. 2403.08640v1 null
2024-03-12 Q-SLAM: Quadric Representations for Monocular SLAM Chensheng Peng et.al. 2403.08125v1 null
2024-03-11 Bayesian Diffusion Models for 3D Shape Reconstruction Haiyang Xu et.al. 2403.06973v1 null
2024-03-08 DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction Jaehyeok Shim et.al. 2403.05005v1 null
2024-03-11 Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed Yifan Wang et.al. 2403.04765v2 null
2024-03-08 Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces Evangelos Skartados et.al. 2403.04508v2 null
2024-03-07 CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images Guanlin Shen et.al. 2403.04198v1 link
2024-03-05 Pooling Image Datasets With Multiple Covariate Shift and Imbalance Sotirios Panagiotis Chytas et.al. 2403.02598v1 null
2024-03-04 TripoSR: Fast 3D Object Reconstruction from a Single Image Dmitry Tochilkin et.al. 2403.02151v1 link
2024-03-03 A Novel Dynamic Light-Section 3D Reconstruction Method for Wide-Range Sensing Mengjuan Chen et.al. 2403.01374v1 null
2024-03-08 G3DR: Generative 3D Reconstruction in ImageNet Pradyumna Reddy et.al. 2403.00939v2 link
2024-03-01 DISORF: A Distributed Online NeRF Training and Rendering Framework for Mobile Robots Chunlin Li et.al. 2403.00228v1 null
2024-03-05 VEnvision3D: A Synthetic Perception Dataset for 3D Multi-Task Model Research Jiahao Zhou et.al. 2402.19059v2 null
2024-02-27 Sora Generates Videos with Stunning Geometrical Consistency Xuanyi Li et.al. 2402.17403v1 null
2024-02-27 CharNeRF: 3D Character Generation from Concept Art Eddy Chu et.al. 2402.17115v1 null
2024-02-26 DreamUp3D: Object-Centric Generative Models for Single-View 3D Scene Understanding and Real-to-Sim Transfer Yizhe Wu et.al. 2402.16308v1 null
2024-02-25 GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction Xiao Chen et.al. 2402.16174v1 null
2024-02-24 A Generative Machine Learning Model for Material Microstructure 3D Reconstruction and Performance Evaluation Yilin Zheng et.al. 2402.15815v1 null
2024-02-22 Cameras as Rays: Pose Estimation via Ray Diffusion Jason Y. Zhang et.al. 2402.14817v1 null
2024-02-22 Workspace Analysis for Laparoscopic Rectal Surgery : A Preliminary Study Alexandra Thomieres et.al. 2402.14386v1 null
2024-02-22 MVD $^2$ : Efficient Multiview 3D Reconstruction for Multiview Diffusion Xin-Yang Zheng et.al. 2402.14253v1 null
2024-02-20 MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction Shitao Tang et.al. 2402.12712v1 null
2024-02-25 A Robust Error-Resistant View Selection Method for 3D Reconstruction Shaojie Zhang et.al. 2402.11431v2 null
2024-02-17 Dense Matchers for Dense Tracking Tomáš Jelínek et.al. 2402.11287v1 null
2024-02-17 DiffPoint: Single and Multi-view Point Cloud Reconstruction with ViT Based Diffusion Model Yu Feng et.al. 2402.11241v1 null
2024-02-15 Evaluating NeRFs for 3D Plant Geometry Reconstruction in Field Conditions Muhammad Arbab Arshad et.al. 2402.10344v1 null
2024-02-15 GES: Generalized Exponential Splatting for Efficient Radiance Field Rendering Abdullah Hamdi et.al. 2402.10128v1 link
2024-02-14 PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments Xiuzhong Hu et.al. 2402.09325v1 link
2024-02-14 DUDF: Differentiable Unsigned Distance Fields with Hyperbolic Scaling Miguel Fainstein et.al. 2402.08876v1 null
2024-02-13 IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation Luke Melas-Kyriazi et.al. 2402.08682v1 null
2024-02-20 Camera Calibration through Geometric Constraints from Rotation and Projection Matrices Muhammad Waleed et.al. 2402.08437v2 link
2024-02-09 Neural Rendering based Urban Scene Reconstruction for Autonomous Driving Shihao Shen et.al. 2402.06826v1 null
2024-02-07 Carousel phase retrieval algorithm for 3D coherent X-ray diffraction imaging Fangzhou Ai et.al. 2402.05283v1 link
2024-02-06 EscherNet: A Generative Model for Scalable View Synthesis Xin Kong et.al. 2402.03908v1 link
2024-02-09 MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction Heng Zhou et.al. 2402.03762v2 null
2024-02-05 Denoising Diffusion via Image-Based Rendering Titas Anciukevicius et.al. 2402.03445v1 null
2024-02-02 Di-NeRF: Distributed NeRF for Collaborative Learning with Unknown Relative Poses Mahboubeh Asadi et.al. 2402.01485v1 null
2024-02-02 DeepAAT: Deep Automated Aerial Triangulation for Fast UAV-based Mapping Zequan Chen et.al. 2402.01134v1 link
2024-02-01 Enhanced fringe-to-phase framework using deep learning Won-Hoe Kim et.al. 2402.00977v1 null
2024-02-01 Diffusion-based Light Field Synthesis Ruisheng Gao et.al. 2402.00575v1 null
2024-01-31 Local Feature Matching Using Deep Learning: A Survey Shibiao Xu et.al. 2401.17592v1 link
2024-01-30 Self-Supervised Representation Learning for Nerve Fiber Distribution Patterns in 3D-PLI Alexander Oberstrass et.al. 2401.17207v1 null
2024-01-30 Physical Priors Augmented Event-Based 3D Reconstruction Jiaxu Wang et.al. 2401.17121v1 link
2024-01-30 OmniSCV: An Omnidirectional Synthetic Image Generator for Computer Vision Bruno Berenguel-Baeta et.al. 2401.17061v1 link
2024-01-29 Domain adaptation strategies for 3D reconstruction of the lumbar spine using real fluoroscopy data Sascha Jecklin et.al. 2401.16027v1 null
2024-01-29 2L3: Lifting Imperfect Generated 2D Images into Accurate 3D Yizheng Chen et.al. 2401.15841v1 null
2024-01-28 Multi-Person 3D Pose Estimation from Multi-View Uncalibrated Depth Cameras Yu-Jhe Li et.al. 2401.15616v1 null
2024-01-26 3D Reconstruction and New View Synthesis of Indoor Environments based on a Dual Neural Radiance Field Zhenyu Bao et.al. 2401.14726v1 link
2024-01-25 TIFu: Tri-directional Implicit Function for High-Fidelity 3D Character Reconstruction Byoungsung Lim et.al. 2401.14565v1 null
2024-01-25 Range-Agnostic Multi-View Depth Estimation With Keyframe Selection Andrea Conti et.al. 2401.14401v1 link
2024-01-25 pix2gestalt: Amodal Segmentation by Synthesizing Wholes Ege Ozguroglu et.al. 2401.14398v1 link
2024-01-25 GauU-Scene: A Scene Reconstruction Benchmark on Large Scale 3D Reconstruction Dataset Using Gaussian Splatting Butian Xiong et.al. 2401.14032v1 null
2024-01-24 EndoGaussians: Single View Dynamic Gaussian Splatting for Deformable Endoscopic Tissues Reconstruction Yangsen Chen et.al. 2401.13352v1 null
2024-01-23 IRIS: Inverse Rendering of Indoor Scenes from Low Dynamic Range Images Zhi-Hao Lin et.al. 2401.12977v1 null
2024-01-21 A Survey on African Computer Vision Datasets, Topics and Researchers Abdul-Hakeem Omotayo et.al. 2401.11617v1 null
2024-01-21 Multi-View Neural 3D Reconstruction of Micro-/Nanostructures with Atomic Force Microscopy Shuo Chen et.al. 2401.11541v1 link
2024-01-21 Deformable Endoscopic Tissues Reconstruction with Gaussian Splatting Lingting Zhu et.al. 2401.11535v1 link
2024-01-17 POE: Acoustic Soft Robotic Proprioception for Omnidirectional End-effectors Uksang Yoo et.al. 2401.09382v1 null
2024-01-16 Learning Implicit Representation for Reconstructing Articulated Objects Hao Zhang et.al. 2401.08809v1 null
2024-01-20 Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis Zhenhui Ye et.al. 2401.08503v2 null
2024-01-16 S3M: Semantic Segmentation Sparse Mapping for UAVs with RGB-D Camera Thanh Nguyen Canh et.al. 2401.08134v1 null
2024-01-12 3D Reconstruction of Interacting Multi-Person in Clothing from a Single Image Junuk Cha et.al. 2401.06415v1 null
2024-01-12 SD-MVS: Segmentation-Driven Deformation Multi-View Stereo with Spherical Refinement and EM optimization Zhenlong Yuan et.al. 2401.06385v1 null
2024-01-12 Surgical-DINO: Adapter Learning of Foundation Models for Depth Estimation in Endoscopic Surgery Beilei Cui et.al. 2401.06013v2 link
2024-01-10 Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects Tianhang Cheng et.al. 2401.05236v1 link
2024-01-07 RHOBIN Challenge: Reconstruction of Human Object Interaction Xianghui Xie et.al. 2401.04143v1 null
2024-01-08 AGG: Amortized Generative 3D Gaussians for Single Image to 3D Dejia Xu et.al. 2401.04099v1 null
2024-01-08 A Survey on 3D Gaussian Splatting Guikun Chen et.al. 2401.03890v1 null
2024-01-03 S3Net: Innovating Stereo Matching and Semantic Segmentation with a Single-Branch Semantic Stereo Network in Satellite Epipolar Imagery Qingyuan Yang et.al. 2401.01643v1 link
2023-12-29 Informative Rays Selection for Few-Shot Neural Radiance Fields Marco Orsingher et.al. 2312.17561v1 null
2023-12-28 Toward Semantic Scene Understanding for Fine-Grained 3D Modeling of Plants Mohamad Qadri et.al. 2312.17110v1 null
2023-12-28 Learning Spatially Collaged Fourier Bases for Implicit Neural Representation Jason Chun Lok Li et.al. 2312.17018v1 null
2023-12-27 In-Hand 3D Object Reconstruction from a Monocular RGB Video Shijian Jiang et.al. 2312.16425v1 null
2023-12-24 SUNDIAL: 3D Satellite Understanding through Direct, Ambient, and Complex Lighting Decomposition Nikhil Behari et.al. 2312.16215v1 null
2023-12-24 A theory of volumetric representations for opaque solids Bailey Miller et.al. 2312.15406v1 null
2023-12-22 Pola4All: survey of polarimetric applications and an open-source toolkit to analyze polarization Joaquin Rodriguez et.al. 2312.14697v1 link
2023-12-22 Scalable 3D Reconstruction From Single Particle X-Ray Diffraction Images Based on Online Machine Learning Jay Shenoy et.al. 2312.14432v1 null
2023-12-21 PlatoNeRF: 3D Reconstruction in Plato’s Cave via Single-View Two-Bounce Lidar Tzofi Klinghoffer et.al. 2312.14239v1 null
2023-12-21 3D Pose Estimation of Two Interacting Hands from a Monocular Event Camera Christen Millerdurai et.al. 2312.14157v1 null
2023-12-21 DUSt3R: Geometric 3D Vision Made Easy Shuzhe Wang et.al. 2312.14132v1 link
2023-12-21 Anatomical basis of sex differences in human post-myocardial infarction ECG phenotypes identified by novel automated torso-cardiac 3D reconstruction Hannah J. Smith et.al. 2312.13976v1 null
2023-12-21 SyncDreamer for 3D Reconstruction of Endangered Animal Species with NeRF and NeuS Ahmet Haydar Ornek et.al. 2312.13832v1 null
2023-12-21 Visual Tomography: Physically Faithful Volumetric Models of Partially Translucent Objects David Nakath et.al. 2312.13494v1 null
2023-12-20 UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections Fangjinhua Wang et.al. 2312.13285v1 null
2023-12-20 Splatter Image: Ultra-Fast Single-View 3D Reconstruction Stanislaw Szymanowicz et.al. 2312.13150v1 link
2023-12-21 pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction David Charatan et.al. 2312.12337v2 link
2023-12-19 EVI-SAM: Robust, Real-time, Tightly-coupled Event-Visual-Inertial State Estimation and 3D Dense Mapping Weipeng Guan et.al. 2312.11911v1 link
2023-12-17 Primitive-based 3D Human-Object Interaction Modelling and Programming Siqi Liu et.al. 2312.10714v1 null
2023-12-16 Triplane Meets Gaussian Splatting: Fast and Generalizable Single-View 3D Reconstruction with Transformers Zi-Xin Zou et.al. 2312.09147v2 null
2023-12-14 Living Scenes: Multi-object Relocalization and Reconstruction in Changing 3D Environments Liyuan Zhu et.al. 2312.09138v1 null
2023-12-14 Scene 3-D Reconstruction System in Scattering Medium Zhuoyifan Zhang et.al. 2312.09005v1 null
2023-12-11 Gaussian Splatting SLAM Hidenobu Matsuki et.al. 2312.06741v1 null
2023-12-10 UNeR3D: Versatile and Scalable 3D RGB Point Cloud Generation from 2D Images in Unsupervised Reconstruction Hongbin Lin et.al. 2312.06706v1 null
2023-12-10 SuperPrimitive: Scene Reconstruction at a Primitive Level Kirill Mazur et.al. 2312.05889v1 null
2023-12-11 Nuvo: Neural UV Mapping for Unruly 3D Representations Pratul P. Srinivasan et.al. 2312.05283v1 null
2023-12-08 Fine Dense Alignment of Image Bursts through Camera Pose and Depth Estimation Bruno Lecouat et.al. 2312.05190v1 null
2023-12-08 SuperNormal: Neural Surface Reconstruction via Multi-View Normal Integration Xu Cao et.al. 2312.04803v1 null
2023-12-07 FitDiff: Robust monocular 3D facial shape and reflectance estimation using Diffusion Models Stathis Galanakis et.al. 2312.04465v1 null
2023-12-06 Gaussian-Flow: 4D Reconstruction with Dynamic 3D Gaussian Particle Youtian Lin et.al. 2312.03431v1 null
2023-12-06 Evaluating the point cloud of individual trees generated from images based on Neural Radiance fields (NeRF) method Hongyu Huang et.al. 2312.03372v1 null
2023-12-06 RING-NeRF: A Versatile Architecture based on Residual Implicit Neural Grids Doriand Petit et.al. 2312.03357v1 null
2023-12-05 ReconFusion: 3D Reconstruction with Diffusion Priors Rundi Wu et.al. 2312.02981v1 null
2023-12-05 R3D-SWIN:Use Shifted Window Attention for Single-View 3D Reconstruction Chenhuan Li et.al. 2312.02725v1 null
2023-12-05 DreaMo: Articulated 3D Reconstruction From A Single Casual Video Tao Tu et.al. 2312.02617v1 null
2023-12-05 Prompt2NeRF-PIL: Fast NeRF Generation via Pretrained Implicit Latent Jianmeng Liu et.al. 2312.02568v1 null
2023-12-03 Slice3D: Multi-Slice, Occlusion-Revealing, Single View 3D Reconstruction Yizhi Wang et.al. 2312.02221v1 null
2023-12-04 Steerers: A framework for rotation equivariant keypoint descriptors Georg Bökman et.al. 2312.02152v1 link
2023-12-04 iMatching: Imperative Correspondence Learning Zitong Zhan et.al. 2312.02141v1 null
2023-12-04 Light Field Imaging in the Restrictive Object Space based on Flexible Angular Plane Ping Zhou et.al. 2312.01761v1 null
2023-12-02 RNb-NeuS: Reflectance and Normal-based Multi-View 3D Reconstruction Baptiste Brument et.al. 2312.01215v1 link
2023-12-05 Self-Evolving Neural Radiance Fields Jaewoo Jung et.al. 2312.01003v2 link
2023-12-01 NeuSG: Neural Implicit Surface Reconstruction with 3D Gaussian Splatting Guidance Hanlin Chen et.al. 2312.00846v1 null
2023-12-01 UAVs and Birds: Enhancing Short-Range Navigation through Budgerigar Flight Studies Md. Mahmudur Rahman et.al. 2312.00597v1 null
2023-11-30 Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data Yu Deng et.al. 2311.18729v1 null
2023-11-30 Multi-task learning with cross-task consistency for improved depth estimation in colonoscopy Pedro Esteban Chavarrias Solano et.al. 2311.18664v1 null
2023-11-30 HOLD: Category-agnostic 3D Reconstruction of Interacting Hands and Objects from Video Zicong Fan et.al. 2311.18448v1 link
2023-11-29 Volumetric Cloud Field Reconstruction Jacob Lin et.al. 2311.17657v1 null
2023-11-30 REF $^2$ -NeRF: Reflection and Refraction aware Neural Radiance Field Wooseok Kim et.al. 2311.17116v2 link
2023-11-28 Multi-Scale 3D Gaussian Splatting for Anti-Aliased Rendering Zhiwen Yan et.al. 2311.17089v1 null
2023-11-28 Surf-D: High-Quality Surface Generation for Arbitrary Topologies using Diffusion Models Zhengming Yu et.al. 2311.17050v1 null
2023-11-28 Gradient-based Local Next-best-view Planning for Improved Perception of Targeted Plant Nodes Akshay K. Burusa et.al. 2311.16759v1 null
2023-11-28 RGBGrasp: Image-based Object Grasping by Capturing Multiple Views during Robot Arm Movement with Neural Radiance Fields Chang Liu et.al. 2311.16592v1 null
2023-11-28 Rethinking Directional Integration in Neural Radiance Fields Congyue Deng et.al. 2311.16504v1 null
2023-11-27 Weakly-Supervised 3D Reconstruction of Clothed Humans via Normal Maps Jane Wu et.al. 2311.16042v1 null
2023-11-27 SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion Hsuan-I Ho et.al. 2311.15855v1 null
2023-11-26 Obj-NeRF: Extract Object NeRFs from Multi-view Images Zhiyi Li et.al. 2311.15291v1 null
2023-11-25 Multi-task Planar Reconstruction with Feature Warping Guidance Luan Wei et.al. 2311.14981v1 link
2023-11-24 RSB-Pose: Robust Short-Baseline Binocular 3D Human Pose Estimation with Occlusion Handling Xiaoyue Wan et.al. 2311.14242v1 null
2023-11-23 GigaPose: Fast and Robust Novel Object Pose Estimation via One Correspondence Van Nguyen Nguyen et.al. 2311.14155v1 link
2023-11-23 MonoNav: MAV Navigation via Monocular Depth Estimation and Reconstruction Nathaniel Simon et.al. 2311.14100v1 null
2023-11-23 DRIFu: Differentiable Rendering and Implicit Function-based Single-View 3D Reconstruction Zijian Kuang et.al. 2311.13199v2 link
2023-11-22 Differentiable Radio Frequency Ray Tracing for Millimeter-Wave Sensing Xingyu Chen et.al. 2311.13182v1 null
2023-11-21 Physics-guided Shape-from-Template: Monocular Video Perception through Neural Surrogate Models David Stotko et.al. 2311.12796v1 link
2023-11-20 Mixing-Denoising Generalizable Occupancy Networks Amine Ouasfi et.al. 2311.12125v1 null
2023-11-23 PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction Peng Wang et.al. 2311.12024v2 null
2023-11-19 GaussianDiffusion: 3D Gaussian Splatting for Denoising Diffusion Probabilistic Models with Structured Noise Xinhai Li et.al. 2311.11221v1 null
2023-11-18 LOSTU: Fast, Scalable, and Uncertainty-Aware Triangulation Sébastien Henry et.al. 2311.11171v1 null
2023-11-18 Invariant-based Mapping of Space During General Motion of an Observer Juan D. Yepes et.al. 2311.11130v1 null
2023-11-16 DSR-Diff: Depth Map Super-Resolution with Diffusion Model Yuan Shi et.al. 2311.09919v1 null
2023-11-18 EvaSurf: Efficient View-Aware Implicit Textured Surface Reconstruction on Mobile Devices Jingnan Gao et.al. 2311.09806v2 null
2023-11-14 DynamicSurf: Dynamic Neural RGB-D Surface Reconstruction with an Optimizable Feature Grid Mirgahney Mohamed et.al. 2311.08159v1 null
2023-11-13 $L_0$-Sampler: An $L_{0}$ Model Guided Volume Sampling for NeRF Liangchen Li et.al. 2311.07044v1 null
2023-11-11 3DFusion, A real-time 3D object reconstruction pipeline based on streamed instance segmented data Xi Sun et.al. 2311.06659v1 null
2023-11-09 ConRad: Image Constrained Radiance Fields for 3D Generation from a Single Image Senthil Purushwalkam et.al. 2311.05230v1 null
2023-11-08 Implicit Neural Representations for Breathing-compensated Volume Reconstruction in Robotic Ultrasound Aorta Screening Yordanka Velikova et.al. 2311.04999v1 null
2023-11-08 LRM: Large Reconstruction Model for Single Image to 3D Yicong Hong et.al. 2311.04400v1 null
2023-11-07 High-fidelity 3D Reconstruction of Plants using Neural Radiance Field Kewei Hu et.al. 2311.04154v1 null
2023-11-07 DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding Kehinde Ajayi et.al. 2311.04098v1 link
2023-11-05 MuSHRoom: Multi-Sensor Hybrid Room Dataset for Joint 3D Reconstruction and Novel View Synthesis Xuqian Ren et.al. 2311.02778v1 null
2023-11-05 Fast Point-cloud to Mesh Reconstruction for Deformable Object Tracking Elham Amin Mansour et.al. 2311.02749v1 null
2023-11-05 IPVNet: Learning Implicit Point-Voxel Features for Open-Surface 3D Reconstruction Mohammad Samiul Arshad et.al. 2311.02552v1 link
2023-11-02 CADSim: Robust and Scalable in-the-wild 3D Reconstruction for Controllable Sensor Simulation Jingkang Wang et.al. 2311.01447v1 null
2023-11-02 Look at Robot Base Once: Hand-Eye Calibration with Point Clouds of Robot Base Leveraging Learning-Based 3D Vision Leihui Li et.al. 2311.01335v1 link
2023-11-02 Joint 3D Shape and Motion Estimation from Rolling Shutter Light-Field Images Hermes McGriff et.al. 2311.01292v1 link
2023-11-01 Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture Yixin Chen et.al. 2311.00457v1 null
2023-10-31 Deep Compressed Learning for 3D Seismic Inversion Maayan Gelboim et.al. 2311.00107v1 null
2023-10-31 Refined Equivalent Pinhole Model for Large-scale 3D Reconstruction from Spaceborne CCD Imagery Hong Danyang et.al. 2310.20117v1 null
2023-10-29 3DMiner: Discovering Shapes from Large-Scale Unannotated Image Datasets Ta-Ying Cheng et.al. 2310.19188v1 null
2023-10-25 Open-NeRF: Towards Open Vocabulary NeRF Decomposition Hao Zhang et.al. 2310.16383v1 null
2023-10-23 Novel-View Acoustic Synthesis from 3D Reconstructed Rooms Byeongjoo Ahn et.al. 2310.15130v1 link
2023-10-23 Interaction-Driven Active 3D Reconstruction with Object Interiors Zihao Yan et.al. 2310.14700v1 null
2023-10-23 VQ-NeRF: Vector Quantization Enhances Implicit Neural Representations Yiying Yang et.al. 2310.14487v1 null
2023-10-22 A Quantitative Evaluation of Dense 3D Reconstruction of Sinus Anatomy from Monocular Endoscopic Video Jan Emily Mangulabnan et.al. 2310.14364v1 null
2023-10-20 Single-view 3D reconstruction via inverse procedural modeling Albert Garifullin et.al. 2310.13373v1 null
2023-10-20 UE4-NeRF:Neural Radiance Field for Real-Time Rendering of Large-Scale Scene Jiaming Gu et.al. 2310.13263v1 null
2023-10-19 Real space iterative reconstruction for vector tomography (RESIRE-V) Minh Pham et.al. 2310.12513v1 link
2023-10-18 ShapeGraFormer: GraFormer-Based Network for Hand-Object Reconstruction from a Single Depth Map Ahmed Tawfik Aboukhadra et.al. 2310.11811v1 null
2023-10-17 Learning Neural Implicit through Volume Rendering with Attentive Depth Fusion Priors Pengchong Hu et.al. 2310.11598v1 null
2023-10-17 Field Robot for High-throughput and High-resolution 3D Plant Phenotyping Felix Esser et.al. 2310.11516v1 null
2023-10-16 In-Situ Single Particle Reconstruction Reveals 3D Evolution of PtNi Nanocatalysts During Heating Yi-Chi Wang et.al. 2310.10253v1 null
2023-10-15 Tabletop Transparent Scene Reconstruction via Epipolar-Guided Optical Flow with Monocular Depth Completion Prior Xiaotong Chen et.al. 2310.09956v1 null
2023-10-15 CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields from Imperfect Camera Poses Hongyu Fu et.al. 2310.09776v1 null
2023-10-12 Implicit Shape and Appearance Priors for Few-Shot Full Head Reconstruction Pol Caselles et.al. 2310.08784v1 null
2023-10-13 PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm Haoyi Zhu et.al. 2310.08586v2 link
2023-10-12 Consistent123: Improve Consistency for One Image to 3D Object Synthesis Haohan Weng et.al. 2310.08092v1 null
2023-10-10 SketchBodyNet: A Sketch-Driven Multi-faceted Decoder Network for 3D Human Reconstruction Fei Wang et.al. 2310.06577v1 link
2023-10-08 Experiences with CAMRE: Single-Device Collaborative Adaptive Mixed Reality Environment Hung-Jui Guo et.al. 2310.04996v1 null
2023-10-02 PC-NeRF: Parent-Child Neural Radiance Fields under Partial Sensor Data Loss in Autonomous Driving Environments Xiuzhong Hu et.al. 2310.00874v1 link
2023-10-01 Enabling Neural Radiance Fields (NeRF) for Large-scale Aerial Images – A Multi-tiling Approaching and the Geometry Assessment of NeRF Ningli Xu et.al. 2310.00530v1 null
2023-09-29 3D Reconstruction in Noisy Agricultural Environments: A Bayesian Optimization Perspective for View Planning Athanasios Bacharis et.al. 2310.00145v1 null
2023-09-29 Effect of structure-based training on 3D localization precision and quality Armin Abdehkakha et.al. 2309.17265v1 null
2023-09-28 Sketch2CADScript: 3D Scene Reconstruction from 2D Sketch using Visual Transformer and Rhino Grasshopper Hong-Bin Yang et.al. 2309.16850v1 null
2023-09-29 3D Reconstruction with Generalizable Neural Fields using Scene Priors Yang Fu et.al. 2309.15164v2 null
2023-09-26 Combining optical diffraction tomography with imaging flow cytometry for characterizing morphology, hemoglobin content, and membrane deformability of live red blood cells Yu-Hsiang Chang et.al. 2309.15131v1 null
2023-09-26 PHRIT: Parametric Hand Representation with Implicit Template Zhisheng Huang et.al. 2309.14916v1 null
2023-09-26 Unsupervised Reconstruction of 3D Human Pose Interactions From 2D Poses Alone Peter Hardy et.al. 2309.14865v1 null
2023-09-26 3D Density-Gradient based Edge Detection on Neural Radiance Fields (NeRFs) for Geometric Reconstruction Miriam Jäger et.al. 2309.14800v1 null
2023-09-23 MP-MVS: Multi-Scale Windows PatchMatch and Planar Prior Multi-View Stereo Rongxuan Tan et.al. 2309.13294v1 link
2023-09-22 NeRRF: 3D Reconstruction and View Synthesis for Transparent and Specular Objects with Neural Refractive-Reflective Fields Xiaoxue Chen et.al. 2309.13039v1 link
2023-09-25 Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates Ka Chun Shum et.al. 2309.11281v2 link
2023-09-19 PLVS: A SLAM System with Points, Lines, Volumetric Mapping, and 3D Incremental Segmentation Luigi Freda et.al. 2309.10896v1 link
2023-09-19 SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction Anilkumar Swamy et.al. 2309.10748v1 null
2023-09-18 Improving Neural Indoor Surface Reconstruction with Mask-Guided Adaptive Consistency Constraints Xinyi Yu et.al. 2309.09739v1 null
2023-09-18 Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering Chi Zhang et.al. 2309.09724v1 null
2023-09-17 Uncertainty-aware 3D Object-Level Mapping with Deep Shape Priors Ziwei Liao et.al. 2309.09118v1 null
2023-09-13 Exploiting Multiple Priors for Neural 3D Indoor Reconstruction Federico Lincetto et.al. 2309.07021v1 null
2023-09-12 Semantic and Articulated Pedestrian Sensing Onboard a Moving Vehicle Maria Priisalu et.al. 2309.06313v1 null
2023-09-11 A survey on real-time 3D scene reconstruction with SLAM methods in embedded systems Quentin Picard et.al. 2309.05349v1 null
2023-09-07 A Food Package Recognition and Sorting System Based on Structured Light and Deep Learning Xuanzhi Liu et.al. 2309.03704v1 null
2023-09-06 SADIR: Shape-Aware Diffusion Models for 3D Image Reconstruction Nivetha Jayakumar et.al. 2309.03335v1 null
2023-09-06 Sparse 3D Reconstruction via Object-Centric Ray Sampling Llukman Cerkezi et.al. 2309.03008v1 link
2023-09-06 Multi-log grasping using reinforcement learning and virtual visual servoing Erik Wallin et.al. 2309.02997v1 null
2023-09-06 LightNeuS: Neural Surface Reconstruction in Endoscopy using Illumination Decline Víctor M. Batlle et.al. 2309.02777v1 null
2023-09-05 GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction Youmin Zhang et.al. 2309.02436v1 link
2023-09-05 Doppelgangers: Learning to Disambiguate Images of Similar Structures Ruojin Cai et.al. 2309.02420v1 link
2023-09-05 TiAVox: Time-aware Attenuation Voxels for Sparse-view 4D DSA Reconstruction Zhenghong Zhou et.al. 2309.02318v1 null
2023-09-05 Iterative Superquadric Recomposition of 3D Objects from Multiple Views Stephan Alaniz et.al. 2309.02102v1 link
2023-09-01 Dense Voxel 3D Reconstruction Using a Monocular Event Camera Haodong Chen et.al. 2309.00385v1 null
2023-08-24 Improving NeRF Quality by Progressive Camera Placement for Unrestricted Navigation in Complex Environments Georgios Kopanas et.al. 2309.00014v1 null
2023-08-29 Intensity correlation holography for remote phase sensing and 3D imaging Guillaume Thekkadath et.al. 2308.15619v1 null
2023-08-28 R3D3: Dense 3D Reconstruction of Dynamic Scenes from Multiple Cameras Aron Schmied et.al. 2308.14713v1 null
2023-08-27 Sparse3D: Distilling Multiview-Consistent Diffusion for Object Reconstruction from Sparse Views Zi-Xin Zou et.al. 2308.14078v1 null
2023-08-26 HoloPOCUS: Portable Mixed-Reality 3D Ultrasound Tracking, Reconstruction and Overlay Kian Wei Ng et.al. 2308.13823v1 null
2023-08-25 Textureless Deformable Surface Reconstruction with Invisible Markers Xinyuan Li et.al. 2308.13678v1 null
2023-08-23 ARF-Plus: Controlling Perceptual Factors in Artistic Radiance Fields for 3D Scene Stylization Wenzhao Li et.al. 2308.12452v1 null
2023-08-21 Coordinate Quantized Neural Implicit Representations for Multi-view Reconstruction Sijia Jiang et.al. 2308.11025v1 link
2023-08-19 Root Pose Decomposition Towards Generic Non-rigid 3D Reconstruction with Monocular Videos Yikai Wang et.al. 2308.10089v1 null
2023-08-19 TSAR-MVS: Textureless-aware Segmentation and Correlative Refinement Guided Multi-View Stereo Zhenlong Yuan et.al. 2308.09990v1 null
2023-08-19 A Theory of Topological Derivatives for Inverse Rendering of Geometry Ishit Mehta et.al. 2308.09865v1 null
2023-08-18 O^2-Recon: Completing 3D Reconstruction of Occluded Objects in the Scene with a Pre-trained 2D Diffusion Model Yubin Hu et.al. 2308.09591v1 link
2023-08-17 A Fusion of Variational Distribution Priors and Saliency Map Replay for Continual 3D Reconstruction Sanchar Palit et.al. 2308.08812v1 null
2023-08-17 Long-Range Grouping Transformer for Multi-View 3D Reconstruction Liying Yang et.al. 2308.08724v1 link
2023-08-16 DeDoDe: Detect, Don’t Describe – Describe, Don’t Detect for Local Feature Matching Johan Edstedt et.al. 2308.08479v1 link
2023-08-17 ObjectSDF++: Improved Object-Compositional Neural Implicit Surfaces Qianyi Wu et.al. 2308.07868v2 link
2023-08-15 CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction Yan Di et.al. 2308.07837v1 null
2023-08-15 Multi-view 3D Face Reconstruction Based on Flame Wenzhuo Zheng et.al. 2308.07551v1 null
2023-08-14 A One Stop 3D Target Reconstruction and multilevel Segmentation Method Jiexiong Xu et.al. 2308.06974v1 link
2023-08-11 Efficient Large-scale AUV-based Visual Seafloor Mapping Mengkun She et.al. 2308.06147v1 null
2023-08-10 PlankAssembly: Robust 3D Reconstruction from Three Orthographic Views with Learnt Shape Programs Wentao Hu et.al. 2308.05744v1 link
2023-08-10 HGDNet: A Height-Hierarchy Guided Dual-Decoder Network for Single View Building Extraction and Height Estimation Chaoran Lu et.al. 2308.05387v1 null
2023-08-07 Learning Photometric Feature Transform for Free-form Object Scan Xiang Feng et.al. 2308.03492v1 null
2023-08-04 Reconstructing Three-Dimensional Models of Interacting Humans Mihai Fieraru et.al. 2308.01854v2 link
2023-08-02 HANDAL: A Dataset of Real-World Manipulable Object Categories with Pose Annotations, Affordances, and Reconstructions Andrew Guo et.al. 2308.01477v1 null
2023-08-15 Tirtha – An Automated Platform to Crowdsource Images and Create 3D Models of Heritage Sites Jyotirmaya Shivottam et.al. 2308.01246v2 link
2023-08-02 Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network Shenbagaraj Kannapiran et.al. 2308.01125v1 null
2023-08-01 Body Knowledge and Uncertainty Modeling for Monocular 3D Human Body Reconstruction Yufei Zhang et.al. 2308.00799v1 null
2023-07-31 Onboard View Planning of a Flying Camera for High Fidelity 3D Reconstruction of a Moving Actor Qingyuan Jiang et.al. 2308.00134v1 link
2023-07-21 Autonomous Electron Tomography Reconstruction with Machine Learning William Millsaps et.al. 2308.00099v1 null
2023-07-31 Towards Head Computed Tomography Image Reconstruction Standardization with Deep Learning Assisted Automatic Detection Bowen Zheng et.al. 2307.16440v1 null
2023-07-27 FS-Depth: Focal-and-Scale Depth Estimation from a Single Image in Unseen Indoor Scene Chengrui Wei et.al. 2307.14624v1 null
2023-07-27 Physically Plausible 3D Human-Scene Reconstruction from Monocular RGB Image using an Adversarial Learning Approach Sandika Biswas et.al. 2307.14570v1 null
2023-07-27 Creative Birds: Self-Supervised Single-View 3D Style Transfer Renke Wang et.al. 2307.14127v2 link
2023-07-24 CarPatch: A Synthetic Benchmark for Radiance Field Evaluation on Vehicle Components Davide Di Nucci et.al. 2307.12718v1 null
2023-07-24 VIRD: Immersive Match Video Analysis for High-Performance Badminton Coaching Tica Lin et.al. 2307.12539v1 link
2023-07-23 LIST: Learning Implicitly from Spatial Transformers for Single-View 3D Reconstruction Mohammad Samiul Arshad et.al. 2307.12194v1 link
2023-07-22 Replay: Multi-modal Multi-view Acted Videos for Casual Holography Roman Shapovalov et.al. 2307.12067v1 link
2023-07-20 SimCol3D – 3D Reconstruction during Colonoscopy Challenge Anita Rau et.al. 2307.11261v1 link
2023-07-14 Transient Neural Radiance Fields for Lidar View Synthesis and 3D Reconstruction Anagh Malik et.al. 2307.09555v1 null
2023-07-18 NU-MCC: Multiview Compressive Coding with Neighborhood Decoder and Repulsive UDF Stefan Lionar et.al. 2307.09112v1 link
2023-07-16 Enforcing Topological Interaction between Implicit Surfaces via Uniform Sampling Hieu Le et.al. 2307.08716v1 null
2023-07-13 Bag of Views: An Appearance-based Approach to Next-Best-View Planning for 3D Reconstruction Sara Hatami Gazani et.al. 2307.05832v2 link
2023-07-11 3D detection of roof sections from a single satellite image and application to LOD2-building reconstruction Johann Lussange et.al. 2307.05409v1 null
2023-07-08 MAP-NBV: Multi-agent Prediction-guided Next-Best-View Planning for Active 3D Object Reconstruction Harnaik Dhami et.al. 2307.04004v1 null
2023-07-07 Depth Estimation Analysis of Orthogonally Divergent Fisheye Cameras with Distortion Removal Matvei Panteleev et.al. 2307.03602v1 null
2023-07-07 RGB-D Mapping and Tracking in a Plenoxel Radiance Field Andreas L. Teigen et.al. 2307.03404v1 link
2023-07-04 User-Friendly Safety Monitoring System for Manufacturing Cobots Ye-Ji Mun et.al. 2307.01886v1 null
2023-06-29 One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization Minghua Liu et.al. 2306.16928v1 link
2023-06-23 LightGlue: Local Feature Matching at Light Speed Philipp Lindenberger et.al. 2306.13643v1 link
2023-06-24 3D Reconstruction of Spherical Images based on Incremental Structure from Motion San Jiang et.al. 2306.12770v2 link
2023-06-26 Infinite Photorealistic Worlds using Procedural Generation Alexander Raistrick et.al. 2306.09310v2 null
2023-06-15 NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations Varun Jampani et.al. 2306.09109v1 link
2023-06-15 Enhancing Neural Rendering Methods with Image Augmentations Juan C. Pérez et.al. 2306.08904v1 null
2023-06-14 Learning to Predict Scene-Level Implicit 3D from Posed RGBD Data Nilesh Kulkarni et.al. 2306.08671v1 null
2023-06-13 Viewset Diffusion: (0-)Image-Conditioned 3D Generative Models from 2D Data Stanislaw Szymanowicz et.al. 2306.07881v1 null
2023-06-12 Reconstructing Heterogeneous Cryo-EM Molecular Structures by Decomposing Them into Polymer Chains Bongjin Koo et.al. 2306.07274v1 null
2023-06-10 3D reconstruction using Structure for Motion Kshitij Karnawat et.al. 2306.06360v1 link
2023-06-15 NERFBK: A High-Quality Benchmark for NERF-Based 3D Reconstruction Ali Karami et.al. 2306.06300v2 link
2023-06-12 Neural Haircut: Prior-Guided Strand-Based Hair Reconstruction Vanessa Sklyarova et.al. 2306.05872v2 link
2023-06-08 2D Supervised Monocular 3D Object Detection by Global-to-Local 3D Reconstruction Jiawei He et.al. 2306.05418v1 null
2023-06-08 Enhance-NeRF: Multiple Performance Evaluation for Neural Radiance Fields Qianqiu Tan et.al. 2306.05303v1 link
2023-06-07 BU-CVKit: Extendable Computer Vision Framework for Species Independent Tracking and Analysis Mahir Patel et.al. 2306.04736v1 null
2023-06-09 DiViNeT: 3D Reconstruction from Disparate Views via Neural Template Regularization Aditya Vora et.al. 2306.04699v2 null
2023-06-05 BeyondPixels: A Comprehensive Review of the Evolution of Neural Radiance Fields AKM Shahariar Azad Rabby et.al. 2306.03000v1 null
2023-06-05 Single-Stage 3D Geometry-Preserving Depth Estimation Model Training on Dataset Mixtures with Uncalibrated Stereo Data Nikolay Patakin et.al. 2306.02878v1 null
2023-06-05 Computational 3D topographic microscopy from terabytes of data per sample Kevin C. Zhou et.al. 2306.02634v1 null
2023-06-08 Adaptive Robotic Information Gathering via Non-Stationary Gaussian Processes Weizhe Chen et.al. 2306.01263v2 link
2023-06-01 BUOL: A Bottom-Up Framework with Occupancy-aware Lifting for Panoptic 3D Scene Reconstruction From A Single Image Tao Chu et.al. 2306.00965v1 link
2023-05-31 Humans in 4D: Reconstructing and Tracking Humans with Transformers Shubham Goel et.al. 2305.20091v1 link
2023-05-30 Template-free Articulated Neural Point Clouds for Reposable View Synthesis Lukas Uzolas et.al. 2305.19065v1 link
2023-05-29 Synfeal: A Data-Driven Simulator for End-to-End Camera Localization Daniel Coelho et.al. 2305.18260v1 link
2023-06-04 VoxDet: Voxel Learning for Novel Instance Detection Bowen Li et.al. 2305.17220v3 link
2023-05-25 Look Ma, No Hands! Agent-Environment Factorization of Egocentric Videos Matthew Chang et.al. 2305.16301v1 null
2023-05-25 Domain-Adaptive Full-Face Gaze Estimation via Novel-View-Synthesis and Feature Disentanglement Jiawei Qin et.al. 2305.16140v1 null
2023-05-25 Robust Category-Level 3D Pose Estimation from Synthetic Data Jiahao Yang et.al. 2305.16124v1 null
2023-05-25 T2TD: Text-3D Generation Model based on Prior Knowledge Guidance Weizhi Nie et.al. 2305.15753v1 null
2023-05-23 Cross3DVG: Baseline and Dataset for Cross-Dataset 3D Visual Grounding on Different RGB-D Scans Taiki Miyanishi et.al. 2305.13876v1 link
2023-05-22 A three-dimensional MR-STAT protocol for high-resolution multi-parametric quantitative MRI Hongyan Liu et.al. 2305.13022v1 null
2023-05-29 Chupa: Carving 3D Clothed Humans from Skinned Shape Priors using 2D Diffusion Probabilistic Models Byungjun Kim et.al. 2305.11870v2 link
2023-05-19 Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields Jingbo Zhang et.al. 2305.11588v1 link
2023-05-19 RGB-D And Thermal Sensor Fusion: A Systematic Literature Review Martin Brenner et.al. 2305.11427v1 null
2023-05-18 Progressive Learning of 3D Reconstruction Network from 2D GAN Data Aysegul Dundar et.al. 2305.11102v1 null
2023-05-18 ConsistentNeRF: Enhancing Neural Radiance Fields with 3D Consistency for Sparse View Synthesis Shoukang Hu et.al. 2305.11031v1 link
2023-05-17 Colonoscopy Coverage Revisited: Identifying Scanning Gaps in Real-Time G. Leifman et.al. 2305.10026v1 null
2023-05-15 AutoRecon: Automated 3D Object Discovery and Reconstruction Yuang Wang et.al. 2305.08810v1 null
2023-05-11 Towards a Better Understanding of the Computer Vision Research Community in Africa Abdul-Hakeem Omotayo et.al. 2305.06773v1 null
2023-05-10 Scan2LoD3: Reconstructing semantic 3D building models at LoD3 using ray casting and Bayesian networks Olaf Wysocki et.al. 2305.06314v1 null
2023-05-08 RelPose++: Recovering 6D Poses from Sparse-view Observations Amy Lin et.al. 2305.04926v1 link
2023-05-04 UrbanBIS: a Large-scale Benchmark for Fine-grained Urban Building Instance Segmentation Guoqing Yang et.al. 2305.02627v1 null
2023-05-03 Biological Hotspot Mapping in Coral Reefs with Robotic Visual Surveys Daniel Yang et.al. 2305.02330v1 link
2023-04-30 Second-order Anisotropic Gaussian Directional Derivative Filters for Blob Detection Jie Ren et.al. 2305.00435v1 null
2023-04-29 NSLF-OL: Online Learning of Neural Surface Light Fields alongside Real-time Incremental 3D Reconstruction Yijun Yuan et.al. 2305.00282v1 null
2023-04-23 UHRNet: A Deep Learning-Based Method for Accurate 3D Reconstruction from a Single Fringe-Pattern Yixiao Wang et.al. 2304.14503v1 link
2023-04-27 Learning Articulated Shape with Keypoint Pseudo-labels from Web Images Anastasis Stathopoulos et.al. 2304.14396v1 null
2023-05-03 Combining HoloLens with Instant-NeRFs: Advanced Real-Time 3D Mobile Mapping Dennis Haitz et.al. 2304.14301v2 null
2023-04-25 Shape-Net: Room Layout Estimation from Panoramic Images Robust to Occlusion using Knowledge Distillation with 3D Shapes as Additional Inputs Mizuki Tabata et.al. 2304.12624v1 null
2023-04-24 Instant-3D: Instant Neural Radiance Field Training Towards On-Device AR/VR 3D Reconstruction Sixu Li et.al. 2304.12467v1 null
2023-04-24 Unsupervised Style-based Explicit 3D Face Reconstruction from Single Image Heng Yu et.al. 2304.12455v1 null
2023-04-24 gSDF: Geometry-Driven Signed Distance Functions for 3D Hand-Object Reconstruction Zerui Chen et.al. 2304.11970v1 null
2023-04-24 Learning Visibility Field for Detailed 3D Human Reconstruction and Relighting Ruichen Zheng et.al. 2304.11900v1 null
2023-04-24 NoiseTrans: Point Cloud Denoising with Transformers Guangzhe Hou et.al. 2304.11812v1 null
2023-04-20 A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion Miriam Jäger et.al. 2304.10664v1 null
2023-04-20 Reconstructing Signing Avatars From Video Using Linguistic Priors Maria-Paola Forte et.al. 2304.10482v1 null
2023-04-19 Anything-3D: Towards Single-view Anything Reconstruction in the Wild Qiuhong Shen et.al. 2304.10261v1 link
2023-04-20 A geometry-aware deep network for depth estimation in monocular endoscopy Yongming Yang et.al. 2304.10241v1 link
2023-04-19 Tetra-NeRF: Representing Neural Radiance Fields Using Tetrahedra Jonas Kulhanek et.al. 2304.09987v1 link
2023-04-20 Single-View View Synthesis with Self-Rectified Pseudo-Stereo Yang Zhou et.al. 2304.09527v2 null
2023-04-19 3 Dimensional Dense Reconstruction: A Review of Algorithms and Dataset Yangming Li et.al. 2304.09371v1 null
2023-04-18 SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes Yiming Gao et.al. 2304.08971v1 null
2023-04-17 Learning How To Robustly Estimate Camera Pose in Endoscopic Videos Michel Hayoz et.al. 2304.08023v1 link
2023-04-15 Temporally Consistent Online Depth Estimation Using Point-Based Fusion Numair Khan et.al. 2304.07435v1 link
2023-04-17 Single-Stage Diffusion NeRF: A Unified Approach to 3D Generation and Reconstruction Hansheng Chen et.al. 2304.06714v2 link
2023-04-12 SiLK – Simple Learned Keypoints Pierre Gleize et.al. 2304.06194v1 link
2023-04-12 Dynamic Voxel Grid Optimization for High-Fidelity RGB-D Supervised Surface Reconstruction Xiangyu Xu et.al. 2304.06178v1 null
2023-04-11 EvAC3D: From Event-based Apparent Contours to 3D Models via Continuous Visual Hulls Ziyun Wang et.al. 2304.05296v1 link
2023-04-10 Neural Lens Modeling Wenqi Xian et.al. 2304.04848v1 null
2023-04-10 Evaluate Geometry of Radiance Field with Low-frequency Color Prior Qihang Fang et.al. 2304.04351v1 link
2023-04-11 Analysis of Sampling Strategies for Implicit 3D Reconstruction Q. Liu et.al. 2304.03999v2 null
2023-04-08 3D GANs and Latent Space: A comprehensive survey Satya Pratheek Tata et.al. 2304.03932v1 null
2023-04-08 Photometric Correction for Infrared Sensors Jincheng Zhang et.al. 2304.03930v1 null
2023-04-07 ALIKED: A Lighter Keypoint and Descriptor Extraction Network via Deformable Transformation Xiaoming Zhao et.al. 2304.03608v1 link
2023-04-06 Neural Fields meet Explicit Geometric Representation for Inverse Rendering of Urban Scenes Zian Wang et.al. 2304.03266v1 null
2023-04-06 DeLiRa: Self-Supervised Depth, Light, and Radiance Fields Vitor Guizilini et.al. 2304.02797v1 null
2023-04-05 Image Stabilization for Hololens Camera in Remote Collaboration Gowtham Senthil et.al. 2304.02736v1 null
2023-04-05 Real-Time Dense 3D Mapping of Underwater Environments Weihan Wang et.al. 2304.02704v1 link
2023-04-04 USTC FLICAR: A Multisensor Fusion Dataset of LiDAR-Inertial-Camera for Heavy-duty Autonomous Aerial Work Robots Ziming Wang et.al. 2304.01986v1 null
2023-04-04 End-to-End Latency Optimization of Multi-view 3D Reconstruction for Disaster Response Xiaojie Zhang et.al. 2304.01488v1 null
2023-04-04 FineRecon: Depth-aware Feed-forward Network for Detailed 3D Reconstruction Noah Stier et.al. 2304.01480v1 link
2023-04-03 One-Shot View Planning for Fast and Complete Unknown Object Reconstruction Sicong Pan et.al. 2304.00910v1 link
2023-03-31 LivePose: Online 3D Reconstruction from Monocular Video with Dynamic Camera Poses Noah Stier et.al. 2304.00054v1 link
2023-04-03 Three-dimensional coherent diffraction snapshot imaging using extreme ultraviolet radiation from a free electron laser Danny Fainozzi et.al. 2303.18166v2 null
2023-03-30 Enhanced Stable View Synthesis Nishant Jain et.al. 2303.17094v1 null
2023-03-29 AirLine: Efficient Learnable Line Detection with Local Edge Voting Xiao Lin et.al. 2303.16500v1 link
2023-03-29 Multi-View Azimuth Stereo via Tangent Space Consistency Xu Cao et.al. 2303.16447v1 link
2023-03-27 NeUDF: Learning Unsigned Distance Fields from Multi-view Images for Reconstructing Non-watertight Models Fei Hou et.al. 2303.15368v1 null
2023-03-27 TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering Jaehoon Choi et.al. 2303.15060v1 null
2023-03-26 Clean-NeRF: Reformulating NeRF to account for View-Dependent Observations Xinhang Liu et.al. 2303.14707v1 null
2023-03-25 PAniC-3D: Stylized Single-view 3D Reconstruction from Portraits of Anime Characters Shuhong Chen et.al. 2303.14587v1 link
2023-03-25 LPFF: A Portrait Dataset for Face Generators Across Large Poses Yiqian Wu et.al. 2303.14407v1 null
2023-03-24 BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects Bowen Wen et.al. 2303.14158v1 link
2023-03-24 Deformable Model Driven Neural Rendering for High-fidelity 3D Reconstruction of Human Heads Under Low-View Settings Baixin Xu et.al. 2303.13855v1 link
2023-03-24 Seeing Through the Glass: Neural 3D Reconstruction of Object Inside a Transparent Container Jinguang Tong et.al. 2303.13805v1 link
2023-03-23 SCADE: NeRFs from Space Carving with Ambiguity-Aware Depth Estimates Mikaela Angelina Uy et.al. 2303.13582v1 null
2023-03-21 Real-time volumetric rendering of dynamic humans Ignacio Rocco et.al. 2303.11898v1 null
2023-03-20 Zero-1-to-3: Zero-shot One Image to 3D Object Ruoshi Liu et.al. 2303.11328v1 link
2023-03-20 DIME-Net: Neural Network-Based Dynamic Intrinsic Parameter Rectification for Cameras with Optical Image Stabilization System Shu-Hao Yeh et.al. 2303.11307v1 null
2023-03-20 Ref-NeuS: Ambiguity-Reduced Neural Implicit Surface Learning for Multi-View Reconstruction with Reflection Wenhang Ge et.al. 2303.10840v1 link
2023-03-14 FingerSLAM: Closed-loop Unknown Object Localization and Reconstruction from Visuo-tactile Feedback Jialiang Zhao et.al. 2303.07997v1 null
2023-03-11 Normal-guided Garment UV Prediction for Human Re-texturing Yasamin Jafarian et.al. 2303.06504v1 null
2023-03-11 Just Flip: Flipped Observation Generation and Optimization for Neural Radiance Fields to Cover Unobserved View Minjae Lee et.al. 2303.06335v1 link
2023-03-10 ACR: Attention Collaboration-based Regressor for Arbitrary Two-Hand Reconstruction Zhengdi Yu et.al. 2303.05938v1 link
2023-03-10 Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction Mingfang Zhang et.al. 2303.05937v1 null
2023-03-08 FastSurf: Fast Neural RGB-D Surface Reconstruction using Per-Frame Intrinsic Refinement and TSDF Fusion Prior Learning Seunghwan Lee et.al. 2303.04508v1 link
2023-03-08 Corner Detection Based on Multi-directional Gabor Filters with Multi-scales Huaqing Wang et.al. 2303.04334v1 null
2023-03-08 DroNeRF: Real-time Multi-agent Drone Pose Optimization for Computing Neural Radiance Fields Dipam Patel et.al. 2303.04322v1 null
2023-03-07 Proactive Multi-Camera Collaboration For 3D Human Pose Estimation Hai Ci et.al. 2303.03767v1 null
2023-03-06 System for 3D Acquisition and 3D Reconstruction using Structured Light for Sewer Line Inspection Johannes Künzel et.al. 2303.02978v1 null
2023-03-03 Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement Jiaxiang Tang et.al. 2303.02091v1 link
2023-03-09 MobileBrick: Building LEGO for 3D Reconstruction on Mobile Devices Kejie Li et.al. 2303.01932v2 link
2023-03-01 Motion Compensation via Epipolar Consistency for In-Vivo X-Ray Microscopy Mareike Thies et.al. 2303.00449v1 null
2023-02-28 3D Coronary Vessel Reconstruction from Bi-Plane Angiography using Graph Convolutional Networks Kit Mills Bransby et.al. 2302.14795v1 null
2023-02-28 Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors Ji Hou et.al. 2302.14746v1 null
2023-02-27 UMIFormer: Mining the Correlations between Similar Tokens for Multi-View 3D Reconstruction Zhenwei Zhu et.al. 2302.13987v1 link
2023-02-26 Perceiving Unseen 3D Objects by Poking the Objects Linghao Chen et.al. 2302.13375v1 null
2023-02-25 SUPS: A Simulated Underground Parking Scenario Dataset for Autonomous Driving Jiawei Hou et.al. 2302.12966v1 link
2023-02-24 3D Surface Reconstruction in the Wild by Deforming Shape Priors from Synthetic Data Nicolai Häni et.al. 2302.12883v1 null
2023-02-23 View Consistency Aware Holistic Triangulation for 3D Human Pose Estimation Xiaoyue Wan et.al. 2302.11301v2 null
2023-02-23 $PC^2$ : Projection-Conditioned Point Cloud Diffusion for Single-Image 3D Reconstruction Luke Melas-Kyriazi et.al. 2302.10668v2 link
2023-02-23 RealFusion: 360° Reconstruction of Any Object from a Single Image Luke Melas-Kyriazi et.al. 2302.10663v2 null
2023-02-20 UAVStereo: A Multiple Resolution Dataset for Stereo Matching in UAV Scenarios Zhang Xiaoyi et.al. 2302.10082v1 link
2023-02-14 HR-NeuS: Recovering High-Frequency Surface Geometry via Neural Implicit Surfaces Erich Liang et.al. 2302.06793v1 null
2023-02-14 Boosted ab initio Cryo-EM 3D Reconstruction with ACE-EM Lin Yao et.al. 2302.06091v2 null
2023-02-11 3D Colored Shape Reconstruction from a Single RGB Image through Diffusion Bo Li et.al. 2302.05573v1 null
2023-02-09 3D reconstruction of spherical images: A review of techniques, applications, and prospects San Jiang et.al. 2302.04495v1 null
2023-02-09 PredRecon: A Prediction-boosted Planning Framework for Fast and High-quality Autonomous Aerial Reconstruction Chen Feng et.al. 2302.04488v1 link
2023-02-07 S4R: Self-Supervised Semantic Scene Reconstruction from RGB-D Scans Junwen Huang et.al. 2302.03640v1 null
2023-01-30 Mono-STAR: Mono-camera Scene-level Tracking and Reconstruction Haonan Chang et.al. 2301.13244v1 link
2023-01-27 A Comparison of Tiny-nerf versus Spatial Representations for 3d Reconstruction Saulo Abraham Gante et.al. 2301.11522v1 null
2023-01-25 Local Feature Extraction from Salient Regions by Feature Map Transformation Yerim Jung et.al. 2301.10413v1 null
2023-02-02 3D Reconstruction of Non-cooperative Resident Space Objects using Instant NGP-accelerated NeRF and D-NeRF Trupti Mahendrakar et.al. 2301.09060v2 null
2023-01-19 Parallelized computational 3D video microscopy of freely moving organisms at multiple gigapixels per second Kevin C. Zhou et.al. 2301.08351v1 link
2023-01-19 Multiview Compressive Coding for 3D Reconstruction Chao-Yuan Wu et.al. 2301.08247v1 link
2023-01-19 Regularizing disparity estimation via multi task learning with structured light reconstruction Alistair Weld et.al. 2301.08140v1 null
2023-01-12 Edge Preserving Implicit Surface Representation of Point Clouds Xiaogang Wang et.al. 2301.04860v1 null
2023-01-11 Elevation Estimation-Driven Building 3D Reconstruction from Single-View Remote Sensing Imagery Yongqiang Mao et.al. 2301.04581v1 null
2023-01-11 First 3D reconstruction of a blast furnace using muography Amélie Cohu et.al. 2301.04354v1 null
2023-01-04 Towards a Pipeline for Real-Time Visualization of Faces for VR-based Telepresence and Live Broadcasting Utilizing Neural Rendering Philipp Ladwig et.al. 2301.01490v1 link
2023-01-03 BS3D: Building-scale 3D Reconstruction from RGB-D Images Janne Mustaniemi et.al. 2301.01057v1 null
2022-12-31 Ponder: Point Cloud Pre-training via Neural Rendering Di Huang et.al. 2301.00157v1 null
2022-12-28 NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action Kuan-Chieh Wang et.al. 2212.13660v1 link
2022-12-24 Polarimetric Multi-View Inverse Rendering Jinyu Zhao et.al. 2212.12721v1 null

(<a href=#Updated-on-20240404>back to top</a>)

generate

Publish Date Title Authors PDF Code
2024-04-03 Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction Keyu Tian et.al. 2404.02905v1 link
2024-04-03 LidarDM: Generative LiDAR Simulation in a Generated World Vlas Zyrianov et.al. 2404.02903v1 null
2024-04-03 DeiT-LT Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets Harsh Rangwani et.al. 2404.02900v1 link
2024-04-03 MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment Duygu Ceylan et.al. 2404.02899v1 null
2024-04-03 A Mean Field Game Model for Timely Computation in Edge Computing Systems Shubham Aggarwal et.al. 2404.02898v1 null
2024-04-03 Deep Image Composition Meets Image Forgery Eren Tahir et.al. 2404.02897v1 link
2024-04-03 ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline Yifan Xu et.al. 2404.02893v1 null
2024-04-03 PoCo: Point Context Cluster for RGBD Indoor Place Recognition Jing Liang et.al. 2404.02885v1 null
2024-04-02 Segment Any 3D Object with Language Seungjun Lee et.al. 2404.02157v1 null
2024-04-02 Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration Akshay Dudhane et.al. 2404.02154v1 null
2024-04-02 GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image Chong Bao et.al. 2404.02152v1 null
2024-04-02 Diffusion $^2$ : Dynamic 3D Content Generation via Score Composition of Orthogonal Diffusion Models Zeyu Yang et.al. 2404.02148v1 link
2024-04-02 Harder, Better, Faster, Stronger: Interactive Visualization for Human-Centered AI Tools Md Naimul Hoque et.al. 2404.02147v1 null
2024-04-02 Iterated Learning Improves Compositionality in Large Vision-Language Models Chenhao Zheng et.al. 2404.02145v1 null
2024-04-02 Multiparametric quantification and visualization of liver fat using ultrasound Jihye Baek et.al. 2404.02143v1 null
2024-03-29 Gecko: Versatile Text Embeddings Distilled from Large Language Models Jinhyuk Lee et.al. 2403.20327v1 null
2024-03-29 Shaving Logs via Large Sieve Inequality: Faster Algorithms for Sparse Convolution and More Ce Jin et.al. 2403.20326v1 null
2024-03-29 Structure and Dynamics of Magneto-Inertial, Differentially Rotating Laboratory Plasmas V. Valenzuela-Villaseca et.al. 2403.20321v1 null
2024-03-29 SeaBird: Segmentation in Bird’s View with Dice Loss Improves Monocular 3D Detection of Large Objects Abhinav Kumar et.al. 2403.20318v1 link
2024-03-29 Convolutional Prompting meets Language Models for Continual Learning Anurag Roy et.al. 2403.20317v1 null
2024-03-29 Optimal Communication for Classic Functions in the Coordinator Model and Beyond Hossein Esfandiari et.al. 2403.20307v1 null
2024-03-28 GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling Bowen Zhang et.al. 2403.19655v1 null
2024-03-28 Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond Katherine Xu et.al. 2403.19653v1 link
2024-03-28 InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction Sirui Xu et.al. 2403.19652v1 null
2024-03-28 GraspXL: Generating Grasping Motions for Diverse Objects at Scale Hui Zhang et.al. 2403.19649v1 null
2024-03-28 Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models Samuel Marks et.al. 2403.19647v1 link
2024-03-28 GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models Yusuf Dalva et.al. 2403.19645v1 null
2024-03-27 Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark Ziyang Chen et.al. 2403.18821v1 null
2024-03-27 MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering Guoxing Sun et.al. 2403.18820v1 null
2024-03-27 ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion Daniel Winter et.al. 2403.18818v1 null
2024-03-27 Garment3DGen: 3D Garment Stylization and Texture Generation Nikolaos Sarafianos et.al. 2403.18816v1 null
2024-03-27 Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models Yanwei Li et.al. 2403.18814v1 link
2024-03-27 Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment Li Siyao et.al. 2403.18811v1 null
2024-03-28 ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation Suraj Patni et.al. 2403.18807v2 link
2024-03-26 ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis Muhammad Hamza Mughal et.al. 2403.17936v1 null
2024-03-26 OmniVid: A Generative Framework for Universal Video Understanding Junke Wang et.al. 2403.17935v1 link
2024-03-26 SLEDGE: Synthesizing Simulation Environments for Driving Agents with Generative Models Kashyap Chitta et.al. 2403.17933v1 null
2024-03-26 MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution Wei Tao et.al. 2403.17927v1 null
2024-03-26 AID: Attention Interpolation of Text-to-Image Diffusion Qiyuan He et.al. 2403.17924v1 link
2024-03-26 The Need for Speed: Pruning Transformers with One Recipe Samir Khaki et.al. 2403.17921v1 link
2024-03-26 TC4D: Trajectory-Conditioned Text-to-4D Generation Sherwin Bahmani et.al. 2403.17920v1 null
2024-03-26 AgentStudio: A Toolkit for Building General Virtual Agents Longtao Zheng et.al. 2403.17918v1 null
2024-03-25 Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning Sicong Pan et.al. 2403.16803v1 null
2024-03-25 Iterative Refinement of Project-Level Code Context for Precise Code Generation with Compiler Feedback Zhangqian Bi et.al. 2403.16792v1 null
2024-03-25 Iso-Diffusion: Improving Diffusion Probabilistic Models Using the Isotropy of the Additive Gaussian Noise Dilum Fernando et.al. 2403.16790v1 null
2024-03-25 HPL-ESS: Hybrid Pseudo-Labeling for Unsupervised Event-based Semantic Segmentation Linglin Jing et.al. 2403.16788v1 null
2024-03-25 Creating a Digital Twin of Spinal Surgery: A Proof of Concept Jonas Hein et.al. 2403.16736v1 null
2024-03-25 Improving Diffusion Models’s Data-Corruption Resistance using Scheduled Pseudo-Huber Loss Artem Khrapov et.al. 2403.16728v1 link
2024-03-22 DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data Hanrong Ye et.al. 2403.15389v1 null
2024-03-22 LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis Kevin Xie et.al. 2403.15385v1 null
2024-03-22 ThemeStation: Generating Theme-Aware 3D Assets from Few Exemplars Zhenwei Wang et.al. 2403.15383v1 null
2024-03-22 DragAPart: Learning a Part-Level Motion Prior for Articulated Objects Ruining Li et.al. 2403.15382v1 null
2024-03-22 Long-CLIP: Unlocking the Long-Text Capability of CLIP Beichen Zhang et.al. 2403.15378v1 link
2024-03-22 InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding Yi Wang et.al. 2403.15377v1 link
2024-03-22 A Modular, End-to-End Next-Generation Network Testbed: Towards a Fully Automated Network Management Platform Ali Chouman et.al. 2403.15376v1 null
2024-03-21 Zero-Shot Multi-Object Shape Completion Shun Iwase et.al. 2403.14628v1 null
2024-03-21 MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images Yuedong Chen et.al. 2403.14627v1 link
2024-03-21 Simplified Diffusion Schrödinger Bridge Zhicong Tang et.al. 2403.14623v1 link
2024-03-21 GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation Yinghao Xu et.al. 2403.14621v1 link
2024-03-21 ClusteringSDF: Self-Organized Neural Implicit Surfaces for 3D Decomposition Tianhao Wu et.al. 2403.14619v1 null
2024-03-21 Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion Xiang Fan et.al. 2403.14617v1 null
2024-03-21 Hierarchical Text-to-Vision Self Supervised Alignment for Improved Histopathology Representation Learning Hasindri Watawana et.al. 2403.14616v1 link
2024-03-21 DreamReward: Text-to-3D Generation with Human Preference Junliang Ye et.al. 2403.14613v1 null
2024-03-21 Explorative Inbetweening of Time and Space Haiwen Feng et.al. 2403.14611v1 null
2024-03-20 On Pretraining Data Diversity for Self-Supervised Learning Hasan Abed Al Kader Hammoud et.al. 2403.13808v1 link
2024-03-20 Editing Massive Concepts in Text-to-Image Diffusion Models Tianwei Xiong et.al. 2403.13807v1 link
2024-03-20 Learning from Models and Data for Visual Grounding Ruozhen He et.al. 2403.13804v1 null
2024-03-20 Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments Yang Yang et.al. 2403.13803v1 link
2024-03-20 ZigMa: Zigzag Mamba Diffusion Model Vincent Tao Hu et.al. 2403.13802v1 link
2024-03-20 Natural Language as Polices: Reasoning for Coordinate-Level Embodied Control with LLMs Yusuke Mikami et.al. 2403.13801v1 link
2024-03-20 TimeRewind: Rewinding Time with Image-and-Events Video Diffusion Jingxi Chen et.al. 2403.13800v1 null
2024-03-20 Reverse Training to Nurse the Reversal Curse Olga Golovneva et.al. 2403.13799v1 null
2024-03-20 Hierarchical NeuroSymbolic Approach for Action Quality Assessment Lauren Okamoto et.al. 2403.13798v1 null
2024-03-20 Bridge the Modality and Capacity Gaps in Vision-Language Model Selection Chao Yi et.al. 2403.13797v1 null
2024-03-19 LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression Zhuoshi Pan et.al. 2403.12968v1 link
2024-03-19 Wear-Any-Way: Manipulable Virtual Try-on via Sparse Correspondence Alignment Mengting Chen et.al. 2403.12965v1 null
2024-03-19 Negative Yields Positive: Unified Dual-Path Adapter for Vision-Language Models Ce Zhang et.al. 2403.12964v1 link
2024-03-19 FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis Linjiang Huang et.al. 2403.12963v1 link
2024-03-19 TexTile: A Differentiable Metric for Texture Tileability Carlos Rodriguez-Pardo et.al. 2403.12961v1 null
2024-03-19 FaceXFormer: A Unified Transformer for Facial Analysis Kartik Narayan et.al. 2403.12960v1 link
2024-03-19 GVGEN: Text-to-3D Generation with Volumetric Representation Xianglong He et.al. 2403.12957v1 null
2024-03-19 Abiogenesis: a possible quantum interpretation of the telepoietic conjecture Vittorio Cocchi et.al. 2403.12955v1 null
2024-03-19 Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models Elaine Sui et.al. 2403.12952v1 link
2024-03-18 RIS-aided Single-frequency 3D Imaging by Exploiting Multi-view Image Correlations Yixuan Huang et.al. 2403.11764v1 null
2024-03-19 Full-Duplex MU-MIMO Systems with Coarse Quantization: How Many Bits Do We Need? Seunghyeong Yoo et.al. 2403.11762v2 null
2024-03-18 Why E.T. Can’t Phone Home: A Global View on IP-based Geoblocking at VoWiFi Gabriel Karl Gegenhuber et.al. 2403.11759v1 null
2024-03-18 Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs M. Jehanzeb Mirza et.al. 2403.11755v1 link
2024-03-18 Asymptotically Optimal Codes for $(t,s)$ -Burst Error Yubo Sun et.al. 2403.11750v1 null
2024-03-18 Embedded Named Entity Recognition using Probing Classifiers Nicholas Popovič et.al. 2403.11747v1 null
2024-03-18 Revisiting Tensor Basis Neural Networks for Reynolds stress modeling: application to plane channel and square duct flows Jiayi Cai et.al. 2403.11746v1 null
2024-03-18 Matter and cosmogenesis in Kant’s Theory of the Heavens Garance Benoit et.al. 2403.11710v1 null
2024-03-18 Significant impact of light-matter strong coupling on chiral nonlinear optical effect Daichi Okada et.al. 2403.11709v1 null
2024-03-18 Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models Emilian Postolache et.al. 2403.11706v1 link
2024-03-18 Virbo: Multimodal Multilingual Avatar Video Generation in Digital Marketing Juan Zhang et.al. 2403.11700v1 null
2024-03-18 Urban Scene Diffusion through Semantic Occupancy Map Junge Zhang et.al. 2403.11697v1 null
2024-03-18 Generalization error of spectral algorithms Maksim Velikanov et.al. 2403.11696v1 null
2024-03-18 Beamforming Design for Semantic-Bit Coexisting Communication System Maojun Zhang et.al. 2403.11693v1 null
2024-03-15 P-MapNet: Far-seeing Map Generator Enhanced by both SDMap and HDMap Priors Zhou Jiang et.al. 2403.10521v1 null
2024-03-15 Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives Ronghui Li et.al. 2403.10518v1 link
2024-03-15 FeatUp: A Model-Agnostic Framework for Features at Any Resolution Stephanie Fu et.al. 2403.10516v1 link
2024-03-15 A Novel Framework for Multi-Person Temporal Gaze Following and Social Gaze Prediction Anshul Gupta et.al. 2403.10511v1 null
2024-03-15 Demystifying Faulty Code with LLM: Step-by-Step Reasoning for Explainable Fault Localization Ratnadira Widyasari et.al. 2403.10507v1 null
2024-03-15 Belief Change based on Knowledge Measures Umberto Straccia et.al. 2403.10502v1 null
2024-03-14 SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior Huan-ang Gao et.al. 2403.09638v1 null
2024-03-14 GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping Yuhang Zheng et.al. 2403.09637v1 link
2024-03-14 Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference Piotr Nawrot et.al. 2403.09636v1 null
2024-03-14 OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning Lingyi Hong et.al. 2403.09634v1 null
2024-03-14 Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image Yiqun Mei et.al. 2403.09632v1 null
2024-03-14 3D-VLA: A 3D Vision-Language-Action Generative World Model Haoyu Zhen et.al. 2403.09631v1 null
2024-03-14 Generalized Predictive Model for Autonomous Driving Jiazhi Yang et.al. 2403.09630v1 link
2024-03-14 Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking Eric Zelikman et.al. 2403.09629v1 link
2024-03-14 Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation Fangfu Liu et.al. 2403.09625v1 null
2024-03-14 Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering Zeyu Liu et.al. 2403.09622v1 null
2024-03-13 FastMAC: Stochastic Spectral Sampling of Correspondence Graph Yifei Zhang et.al. 2403.08770v1 link
2024-03-13 VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Enric Corona et.al. 2403.08764v1 null
2024-03-13 A local model for the optical energy and momentum transfer in dielectric media and the microscopic origin of Abraham’s force density B. Anghinoni et.al. 2403.08752v1 null
2024-03-13 iCONTRA: Toward Thematic Collection Design Via Interactive Concept Transfer Dinh-Khoi Vo et.al. 2403.08746v1 link
2024-03-12 Rethinking Generative Large Language Model Evaluation for Semantic Comprehension Fangyun Wei et.al. 2403.07872v1 null
2024-03-12 TeleMoMa: A Modular and Versatile Teleoperation System for Mobile Manipulation Shivin Dass et.al. 2403.07869v1 null
2024-03-12 Exploring Safety Generalization Challenges of Large Language Models via Code Qibing Ren et.al. 2403.07865v1 null
2024-03-12 Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation Shihao Zhao et.al. 2403.07860v1 link
2024-03-12 Fairness Feedback Loops: Training on Synthetic Data Amplifies Bias Sierra Wyllie et.al. 2403.07857v1 null
2024-03-12 Quantifying and Mitigating Privacy Risks for Tabular Generative Models Chaoyi Zhu et.al. 2403.07842v1 null
2024-03-11 A representation-learning game for classes of prediction tasks Neria Uzan et.al. 2403.06971v1 null
2024-03-11 The pitfalls of next-token prediction Gregor Bachmann et.al. 2403.06963v1 link
2024-03-11 Optimizing Latent Graph Representations of Surgical Scenes for Zero-Shot Domain Transfer Siddhant Satyanaik et.al. 2403.06953v1 null
2024-03-11 SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data Jialu Li et.al. 2403.06952v1 null
2024-03-08 Tell, Don’t Show!: Language Guidance Eases Transfer Across Domains in Images and Videos Tarun Kalluri et.al. 2403.05535v1 null
2024-03-08 Tune without Validation: Searching for Learning Rate and Weight Decay on Training Sets Lorenzo Brigato et.al. 2403.05532v1 null
2024-03-08 Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context Machel Reid et.al. 2403.05530v1 null
2024-03-08 The Computational Complexity of Learning Gaussian Single-Index Models Alex Damian et.al. 2403.05529v1 null
2024-03-08 GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM Hao Kang et.al. 2403.05527v1 link
2024-03-08 Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapola Yijiang Li et.al. 2403.05523v1 null
2024-03-08 Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought James Chua et.al. 2403.05518v1 link
2024-03-07 BloomGML: Graph Machine Learning through the Lens of Bilevel Optimization Amber Yijia Zheng et.al. 2403.04763v1 link
2024-03-07 Lifelong Intelligence Beyond the Edge using Hyperdimensional Computing Xiaofan Yu et.al. 2403.04759v1 link
2024-03-07 KnowledgeVIS: Interpreting Language Models by Comparing Fill-in-the-Blank Prompts Adam Coscia et.al. 2403.04758v1 link
2024-03-07 Preliminary Guidelines For Combining Data Integration and Visual Data Analysis Adam Coscia et.al. 2403.04757v1 link
2024-03-07 Mechanism for Decision-aware Collaborative Federated Learning: A Pitfall of Shapley Values Meng Qi et.al. 2403.04753v1 null
2024-03-07 JAX-SPH: A Differentiable Smoothed Particle Hydrodynamics Framework Artur P. Toshev et.al. 2403.04750v1 link
2024-03-07 A General Calibrated Regret Metric for Detecting and Mitigating Human-Robot Interaction Failures Kensuke Nakamura et.al. 2403.04745v1 null
2024-03-06 Backtracing: Retrieving the Cause of the Query Rose E. Wang et.al. 2403.03956v1 link
2024-03-06 3D Diffusion Policy Yanjie Ze et.al. 2403.03954v1 link
2024-03-06 Bridging Language and Items for Retrieval and Recommendation Yupeng Hou et.al. 2403.03952v1 link
2024-03-06 Can Audio Reveal Music Performance Difficulty? Insights from the Piano Syllabus Dataset Pedro Ramoneda et.al. 2403.03947v1 null
2024-03-06 Separate and Detailed Treatment of Absolute Signal and Noise Enables NMR Under Adverse Circumstances A Guinness et.al. 2403.03943v1 null
2024-03-06 The Heuristic Core: Understanding Subnetwork Generalization in Pretrained Language Models Adithya Bhaskar et.al. 2403.03942v1 link
2024-03-06 GUIDE: Guidance-based Incremental Learning with Diffusion Models Bartosz Cywiński et.al. 2403.03938v1 link
2024-03-05 LC-Tsalis-INF: Generalized Best-of-Both-Worlds Linear Contextual Bandits Masahiro Kato et.al. 2403.03219v1 null
2024-03-05 The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning Nathaniel Li et.al. 2403.03218v1 null
2024-03-05 Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion Meng Zheng et.al. 2403.03217v1 null
2024-03-05 A Safety-Critical Framework for UGVs in Complex Environments: A Data-Driven Discrepancy-Aware Approach Skylar X. Wei et.al. 2403.03215v1 null
2024-03-05 Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Patrick Esser et.al. 2403.03206v1 null
2024-03-05 CLEVR-POC: Reasoning-Intensive Visual Question Answering in Partially Observable Environments Savitha Sam Abraham et.al. 2403.03203v1 null
2024-03-03 Bandit Profit-maximization for Targeted Marketing Joon Suk Huh et.al. 2403.01361v1 null
2024-03-03 ModelWriter: Text & Model-Synchronized Document Engineering Platform Ferhat Erata et.al. 2403.01359v1 null
2024-03-03 Improving Uncertainty Sampling with Bell Curve Weight Function Zan-Kai Chong et.al. 2403.01352v1 null
2024-03-03 Efficient FIR filtering with Bit Layer Multiply Accumulator Vincenzo Liguori et.al. 2403.01351v1 null
2024-03-02 ShapeBoost: Boosting Human Shape Estimation with Part-Based Parameterization and Clothing-Preserving Augmentation Siyuan Bian et.al. 2403.01345v1 null
2024-02-29 DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models Muyang Li et.al. 2402.19481v1 link
2024-02-29 Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers Tsai-Shien Chen et.al. 2402.19479v1 null
2024-02-29 Learning a Generalized Physical Face Model From Data Lingchen Yang et.al. 2402.19477v1 null
2024-02-29 The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations? Alex Gu et.al. 2402.19475v1 null
2024-02-29 The All-Seeing Project V2: Towards General Relation Comprehension of the Open World Weiyun Wang et.al. 2402.19474v1 link
2024-02-29 Retrieval-Augmented Generation for AI-Generated Content: A Survey Penghao Zhao et.al. 2402.19473v1 link
2024-02-29 Loose LIPS Sink Ships: Asking Questions in Battleship with Language-Informed Program Sampling Gabriel Grand et.al. 2402.19471v1 null
2024-02-29 Humanoid Locomotion as Next Token Prediction Ilija Radosavovic et.al. 2402.19469v1 null
2024-02-28 UniMODE: Unified Monocular 3D Object Detection Zhuoling Li et.al. 2402.18573v1 null
2024-02-28 Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards Haoxiang Wang et.al. 2402.18571v1 link
2024-02-28 Diffusion Language Models Are Versatile Protein Learners Xinyou Wang et.al. 2402.18567v1 null
2024-02-28 Approaching Human-Level Forecasting with Language Models Danny Halawi et.al. 2402.18563v1 null
2024-02-27 The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Shuming Ma et.al. 2402.17764v1 null
2024-02-27 Reducing Unnecessary Alerts in Pedestrian Protection Systems Based on P2V Communications Ignacio Soto et.al. 2402.17763v1 null
2024-02-27 Towards Optimal Learning of Language Models Yuxian Gu et.al. 2402.17759v1 null
2024-02-27 ADL4D: Towards A Contextually Rich Dataset for 4D Activities of Daily Living Marsil Zakour et.al. 2402.17758v1 null
2024-02-27 Evaluating Very Long-Term Conversational Memory of LLM Agents Adyasha Maharana et.al. 2402.17753v1 null
2024-02-26 Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic Supervision Fan Jiang et.al. 2402.16508v1 link
2024-02-26 Stochastic Conditional Diffusion Models for Semantic Image Synthesis Juyeon Ko et.al. 2402.16506v1 null
2024-02-26 SAND: Decoupling Sanitization from Fuzzing for Low Overhead Ziqiao Kong et.al. 2402.16497v1 null
2024-02-26 Intelligent Known and Novel Aircraft Recognition – A Shift from Classification to Similarity Learning for Combat Identification Ahmad Saeed et.al. 2402.16486v1 null
2024-02-23 Seamless Human Motion Composition with Blended Positional Encodings German Barquero et.al. 2402.15509v1 link
2024-02-23 AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning Jianguo Zhang et.al. 2402.15506v1 link
2024-02-23 Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts Yuejiang Liu et.al. 2402.15505v1 null
2024-02-23 Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition Chun-Hsiao Yeh et.al. 2402.15504v1 link
2024-02-23 API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs Kinjal Basu et.al. 2402.15491v1 null
2024-02-22 PALO: A Polyglot Large Multimodal Model for 5B People Muhammad Maaz et.al. 2402.14818v1 link
2024-02-22 Cameras as Rays: Pose Estimation via Ray Diffusion Jason Y. Zhang et.al. 2402.14817v1 null
2024-02-22 WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition Lianghui Zhu et.al. 2402.14812v1 link
2024-02-22 Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking Nikhil Prakash et.al. 2402.14811v1 null
2024-02-22 GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion Xueyi Liu et.al. 2402.14810v1 link
2024-02-22 CriticBench: Benchmarking LLMs for Critique-Correct Reasoning Zicheng Lin et.al. 2402.14809v1 link
2024-02-22 RelayAttention for Efficient Large Language Model Serving with Long System Prompts Lei Zhu et.al. 2402.14808v1 link
2024-02-22 A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health Nikhil Behari et.al. 2402.14807v1 null
2024-02-22 Identifying Multiple Personalities in Large Language Models with External Evaluation Xiaoyang Song et.al. 2402.14805v1 null
2024-02-21 D-Flow: Differentiating through Flows for Controlled Generation Heli Ben-Hamu et.al. 2402.14017v1 null
2024-02-21 Corrective Machine Unlearning Shashwat Goel et.al. 2402.14015v1 link
2024-02-21 Geometry-Informed Neural Networks Arturs Berzins et.al. 2402.14009v1 null
2024-02-21 OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems Chaoqun He et.al. 2402.14008v1 link
2024-02-21 Hallucinations or Attention Misdirection? The Path to Strategic Value Extraction in Business Using Large Language Models Aline Ioste et.al. 2402.14002v1 null
2024-02-21 Real-time 3D-aware Portrait Editing from a Single Image Qingyan Bai et.al. 2402.14000v1 null
2024-02-20 CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples Jianrui Zhang et.al. 2402.13254v1 link
2024-02-20 BiMediX: Bilingual Medical Mixture of Experts LLM Sara Pieri et.al. 2402.13253v1 link
2024-02-20 Video ReCap: Recursive Captioning of Hour-Long Videos Md Mohaiminul Islam et.al. 2402.13250v1 null
2024-02-20 TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization Liyan Tang et.al. 2402.13249v1 link
2024-02-20 Are Fact-Checking Tools Reliable? An Evaluation of Google Fact Check Qiangeng Yang et.al. 2402.13244v1 null
2024-02-20 Unlocking Insights: Semantic Search in Jupyter Notebooks Lan Li et.al. 2402.13234v1 null
2024-02-20 A Touch, Vision, and Language Dataset for Multimodal Alignment Letian Fu et.al. 2402.13232v1 link
2024-02-19 FiT: Flexible Vision Transformer for Diffusion Model Zeyu Lu et.al. 2402.12376v1 link
2024-02-19 A synthetic data approach for domain generalization of NLI models Mohammad Javad Hosseini et.al. 2402.12368v1 null
2024-02-19 A Critical Evaluation of AI Feedback for Aligning Large Language Models Archit Sharma et.al. 2402.12366v1 link
2024-02-19 Almost-linear time parameterized algorithm for rankwidth via dynamic rankwidth Tuukka Korhonen et.al. 2402.12364v1 null
2024-02-19 Flip Graphs of Pseudo-Triangulations With Face Degree at Most 4 Maarten Löffler et.al. 2402.12357v1 null
2024-02-19 Graph-Based Retriever Captures the Long Tail of Biomedical Knowledge Julien Delile et.al. 2402.12352v1 null
2024-02-16 Fusion of Diffusion Weighted MRI and Clinical Data for Predicting Functional Outcome after Acute Ischemic Stroke with Deep Contrastive Learning Chia-Ling Tsai et.al. 2402.10894v1 null
2024-02-16 RLVF: Learning from Verbal Feedback without Overgeneralization Moritz Stephan et.al. 2402.10893v1 link
2024-02-16 Instruction Diversity Drives Generalization To Unseen Tasks Dylan Zhang et.al. 2402.10891v1 null
2024-02-16 When is Tree Search Useful for LLM Planning? It Depends on the Discriminator Ziru Chen et.al. 2402.10890v1 link
2024-02-16 Evaluation of EAP Usage for Authenticating Eduroam Users in 5G Networks Leonardo Azalim de Oliveira et.al. 2402.10889v1 null
2024-02-16 Explainability for Machine Learning Models: From Data Adaptability to User Perception julien Delaunay et.al. 2402.10888v1 null
2024-02-16 Reviewer2: Optimizing Review Generation Through Prompt Generation Zhaolin Gao et.al. 2402.10886v1 null
2024-02-16 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations Tsung-Wei Ke et.al. 2402.10885v1 null
2024-02-15 Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation Huizhuo Yuan et.al. 2402.10210v1 null
2024-02-15 Recovering the Pre-Fine-Tuning Weights of Generative Models Eliahu Horwitz et.al. 2402.10208v1 link
2024-02-15 Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment Rui Yang et.al. 2402.10207v1 link
2024-02-15 Unlocking the Potential of Transformers in Time Series Forecasting with Sharpness-Aware Minimization and Channel-Wise Attention Romain Ilbert et.al. 2402.10198v1 link
2024-02-15 BitDelta: Your Fine-Tune May Only Be Worth One Bit James Liu et.al. 2402.10193v1 link
2024-02-15 Multi-Excitation Projective Simulation with a Many-Body Physics Inspired Inductive Bias Philip A. LeMaitre et.al. 2402.10192v1 link
2024-02-15 FedAnchor: Enhancing Federated Semi-Supervised Learning with Label Contrastive Loss for Unlabeled Clients Xinchi Qiu et.al. 2402.10191v1 null
2024-02-14 AQA-Bench: An Interactive Benchmark for Evaluating LLMs’ Sequential Reasoning Ability Siwei Yang et.al. 2402.09404v1 link
2024-02-14 Reinforcement Learning from Human Feedback with Active Queries Kaixuan Ji et.al. 2402.09401v1 null
2024-02-14 Long-form evaluation of model editing Domenic Rosati et.al. 2402.09394v1 null
2024-02-14 Introduction to Physically Unclonable Fuctions: Properties and Applications M. Garcia-Bosque et.al. 2402.09386v1 null
2024-02-14 GraSSRep: Graph-Based Self-Supervised Learning for Repeat Detection in Metagenomic Assembly Ali Azizpour et.al. 2402.09381v1 link
2024-02-13 IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation Luke Melas-Kyriazi et.al. 2402.08682v1 null
2024-02-13 Mitigating Object Hallucination in Large Vision-Language Models via Classifier-Free Guidance Linxi Zhao et.al. 2402.08680v1 null
2024-02-13 COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability Xingang Guo et.al. 2402.08679v1 link
2024-02-13 Graph Mamba: Towards Learning on Graphs with State Space Models Ali Behrouz et.al. 2402.08678v1 link
2024-02-13 Model Assessment and Selection under Temporal Distribution Shift Elise Han et.al. 2402.08672v1 link
2024-02-13 Rec-GPT4V: Multimodal Recommendation with Large Vision-Language Models Yuqing Liu et.al. 2402.08670v1 null
2024-02-13 Improving Generalization in Semantic Parsing by Increasing Natural Language Variation Irina Saparina et.al. 2402.08666v1 link
2024-02-12 A systematic investigation of learnability from single child linguistic input Yulu Qin et.al. 2402.07899v1 null
2024-02-12 Label-Efficient Model Selection for Text Generation Shir Ashury-Tahan et.al. 2402.07891v1 null
2024-02-12 Toward an Android Static Analysis Approach for Data Protection Mugdha Khedkar et.al. 2402.07889v1 null
2024-02-12 WildfireGPT: Tailored Large Language Model for Wildfire Analysis Yangxinyu Xie et.al. 2402.07877v1 null
2024-02-12 Policy Improvement using Language Feedback Models Victor Zhong et.al. 2402.07876v1 null
2024-02-09 Feedback Loops With Language Models Drive In-Context Reward Hacking Alexander Pan et.al. 2402.06627v1 link
2024-02-09 Understanding the Effects of Iterative Prompting on Truthfulness Satyapriya Krishna et.al. 2402.06625v1 null
2024-02-09 A two-stage algorithm in evolutionary product unit neural networks for classification Antonio J. Tallón-Ballesteros et.al. 2402.06622v1 null
2024-02-09 TIC: Translate-Infer-Compile for accurate ‘text to plan’ using LLMs and logical intermediate representations Sudhir Agarwal et.al. 2402.06608v1 null
2024-02-09 On the Out-Of-Distribution Generalization of Multimodal Large Language Models Xingxuan Zhang et.al. 2402.06599v1 null
2024-02-09 CigaR: Cost-efficient Program Repair with LLMs Dávid Hidvégi et.al. 2402.06598v1 link
2024-02-09 Understanding the Weakness of Large Language Model Agents within a Complex Android Environment Mingzhe Xing et.al. 2402.06596v1 link
2024-02-08 InstaGen: Enhancing Object Detection by Training on Synthetic Dataset Chengjian Feng et.al. 2402.05937v1 null
2024-02-08 SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models Peng Gao et.al. 2402.05935v1 link
2024-02-08 Time Series Diffusion in the Frequency Domain Jonathan Crabbé et.al. 2402.05933v1 link
2024-02-08 WebLINX: Real-World Website Navigation with Multi-Turn Dialogue Xing Han Lù et.al. 2402.05930v1 link
2024-02-08 An Interactive Agent Foundation Model Zane Durante et.al. 2402.05929v1 null
2024-02-08 Sharp Rates in Dependent Learning Theory: Avoiding Sample Size Deflation for the Square Loss Ingvar Ziemann et.al. 2402.05928v1 null
2024-02-07 Image captioning for Brazilian Portuguese using GRIT model Rafael Silva de Alencar et.al. 2402.05106v1 null
2024-02-07 You Can REST Now: Automated Specification Inference and Black-Box Testing of RESTful APIs with Large Language Models Alix Decrop et.al. 2402.05102v1 null
2024-02-07 Hydragen: High-Throughput LLM Inference with Shared Prefixes Jordan Juravsky et.al. 2402.05099v1 null
2024-02-07 On diffusion models for amortized inference: Benchmarking and improving stochastic control and sampling Marcin Sendera et.al. 2402.05098v1 link
2024-02-07 Language-Based Augmentation to Address Shortcut Learning in Object Goal Navigation Dennis Hoftijzer et.al. 2402.05090v1 null
2024-02-07 Hyperspectral acquisition with ScanImage at the single pixel level: Application to time domain coherent Raman imaging Samuel Metais et.al. 2402.05086v1 null
2024-02-06 Linear-time Minimum Bayes Risk Decoding with Reference Aggregation Jannis Vamvas et.al. 2402.04251v1 link
2024-02-06 CAST: Clustering Self-Attention using Surrogate Tokens for Efficient Transformers Adjorn van Engelenhoven et.al. 2402.04239v1 null
2024-02-06 CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations Ji Qi et.al. 2402.04236v1 link
2024-02-06 Role of spontaneously generated coherence (SGC) in laser cooling of atoms Rajnandan Choudhury Das et.al. 2402.04234v1 null
2024-02-06 Can Generative Agents Predict Emotion? Ciaran Regan et.al. 2402.04232v1 null
2024-02-06 Further Constructions of AMUBs for Non-prime power Composite Dimensions Ajeet Kumar et.al. 2402.04231v1 null
2024-02-05 Do Diffusion Models Learn Semantically Meaningful and Efficient Representations? Qiyao Liang et.al. 2402.03305v1 null
2024-02-05 GUARD: Role-playing to Generate Natural-language Jailbreakings to Test Guideline Adherence of Large Language Models Haibo Jin et.al. 2402.03299v1 null
2024-02-05 Ginger: An Efficient Curvature Approximation with Linear Complexity for General Neural Networks Yongchang Hao et.al. 2402.03295v1 null
2024-02-05 InstanceDiffusion: Instance-level Control for Image Generation Xudong Wang et.al. 2402.03290v1 link
2024-02-05 Make Every Move Count: LLM-based High-Quality RTL Code Generation Using MCTS Matthew DeLorenzo et.al. 2402.03289v1 null
2024-02-05 A Lennard-Jones Layer for Distribution Normalization Mulun Na et.al. 2402.03287v1 null
2024-02-05 Training-Free Consistent Text-to-Image Generation Yoad Tewel et.al. 2402.03286v1 null
2024-02-05 Towards a Flexible Scale-out Framework for Efficient Visual Data Query Processing Rohit Verma et.al. 2402.03283v1 null
2024-02-02 Position Paper: Generalized grammar rules and structure-based generalization beyond classical equivariance for lexical tasks and transduction Mircea Petrache et.al. 2402.01629v1 null
2024-02-02 Stochastic Two Points Method for Deep Model Zeroth-order Optimization Yijiang Pang et.al. 2402.01621v1 null
2024-02-02 MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models Justin Chih-Yao Chen et.al. 2402.01620v1 link
2024-02-02 Style Vectors for Steering Generative Large Language Model Kai Konen et.al. 2402.01618v1 link
2024-02-02 A GP-based Robust Motion Planning Framework for Agile Autonomous Robot Navigation and Recovery in Unknown Environments Nicholas Mohammad et.al. 2402.01617v1 null
2024-02-01 AToM: Amortized Text-to-Mesh using 2D Diffusion Guocheng Qian et.al. 2402.00867v1 null
2024-02-01 Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection Qinyu Zhao et.al. 2402.00865v1 link
2024-02-01 Evaluating Large Language Models for Generalization and Robustness via Data Compression Yucheng Li et.al. 2402.00861v1 link
2024-02-01 Can Large Language Models Understand Context? Yilun Zhu et.al. 2402.00858v1 null
2024-02-01 SymbolicAI: A framework for logic-based approaches combining generative models and solvers Marius-Constantin Dinu et.al. 2402.00854v1 link
2024-02-01 LTAU-FF: Loss Trajectory Analysis for Uncertainty in Atomistic Force Fields Joshua A. Vita et.al. 2402.00853v1 null
2024-01-31 Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators Daniel Geng et.al. 2401.18085v1 null
2024-01-31 Improved Scene Landmark Detection for Camera Localization Tien Do et.al. 2401.18083v1 link
2024-01-31 Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners? Andreas Opedal et.al. 2401.18070v1 null
2024-01-30 A simple, strong baseline for building damage detection on the xBD dataset Sebastian Gerard et.al. 2401.17271v1 link
2024-01-30 Weaver: Foundation Models for Creative Writing Tiannan Wang et.al. 2401.17268v1 null
2024-01-30 Proactive Detection of Voice Cloning with Localized Watermarking Robin San Roman et.al. 2401.17264v1 link
2024-01-30 Weak-to-Strong Jailbreaking on Large Language Models Xuandong Zhao et.al. 2401.17256v1 link
2024-01-29 Endo-4DGS: Distilling Depth Ranking for Endoscopic Monocular Scene Reconstruction with 4D Gaussian Splatting Yiming Huang et.al. 2401.16416v1 null
2024-01-29 A Survey on Visual Anomaly Detection: Challenge, Approach, and Prospect Yunkang Cao et.al. 2401.16402v1 null
2024-01-29 Amazon’s 2023 Drought: Sentinel-1 Reveals Extreme Rio Negro River Contraction Fabien H Wagner et.al. 2401.16393v1 null
2024-01-26 EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty Yuhui Li et.al. 2401.15077v1 link
2024-01-26 Annotated Hands for Generative Models Yue Yang et.al. 2401.15075v1 link
2024-01-26 From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities Chaochao Lu et.al. 2401.15071v1 null
2024-01-26 Pairing Orthographically Variant Literary Words to Standard Equivalents Using Neural Edit Distance Models Craig Messner et.al. 2401.15068v1 null
2024-01-26 Asymmetric Influence of the Amplitude-Dependent Tune Shift on the Transverse Mode-Coupling Instability Miriam Brosi et.al. 2401.15065v1 null
2024-01-26 Expert with Clustering: Hierarchical Online Preference Learning Framework Tianyue Zhou et.al. 2401.15062v1 null
2024-01-25 Deconstructing Denoising Diffusion Models for Self-Supervised Learning Xinlei Chen et.al. 2401.14404v1 null
2024-01-25 O(1) Insertion for Random Walk d-ary Cuckoo Hashing up to the Load Threshold Tolson Bell et.al. 2401.14394v1 null
2024-01-25 Inconsistency Masks: Removing the Uncertainty from Input-Pseudo-Label Pairs Michael R. H. Vorndran et.al. 2401.14387v1 link
2024-01-25 Manifold GCN: Diffusion-based Convolutional Neural Network for Manifold-valued Graphs Martin Hanik et.al. 2401.14381v1 null
2024-01-25 UrbanGenAI: Reconstructing Urban Landscapes using Panoptic Segmentation and Diffusion Models Timo Kapsalis et.al. 2401.14379v1 null
2024-01-24 Graph-Informed Neural Networks for Sparse Grid-Based Discontinuity Detectors Francesco Della Santa et.al. 2401.13652v1 link
2024-01-24 Employing polyhedral methods to optimize stencils on FPGAs with stencil-specific caches, data reuse, and wide data bursts Florian Mayer et.al. 2401.13645v1 null
2024-01-24 Unveiling homophily beyond the pool of opportunities Sina Sajjadi et.al. 2401.13642v1 null
2024-01-23 GALA: Generating Animatable Layered Assets from a Single Scan Taeksoo Kim et.al. 2401.12979v1 null
2024-01-23 Zero-Shot Learning for the Primitives of 3D Affordance in General Objects Hyeonwoo Kim et.al. 2401.12978v1 null
2024-01-23 In-Context Language Learning: Arhitectures and Algorithms Ekin Akyürek et.al. 2401.12973v1 link
2024-01-23 Raidar: geneRative AI Detection viA Rewriting Chengzhi Mao et.al. 2401.12970v1 link
2024-01-23 Minimizing the Age of Two Heterogeneous Sources With Packet Drops Via Cyclic Schedulers Sahan Liyanaarachchi et.al. 2401.12962v1 null
2024-01-23 Chatterbox: Robust Transport for LLM Token Streaming under Unstable Network Hanchen Li et.al. 2401.12961v1 null
2024-01-22 Exploring Simple Open-Vocabulary Semantic Segmentation Zihang Lai et.al. 2401.12217v1 link
2024-01-22 Genericity Through Stratification Victor Arrial et.al. 2401.12212v1 null
2024-01-22 OK-Robot: What Really Matters in Integrating Open-Knowledge Models for Robotics Peiqi Liu et.al. 2401.12202v1 link
2024-01-22 APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference Bowen Zhao et.al. 2401.12200v1 null
2024-01-22 Learning Dynamics from Multicellular Graphs with Deep Neural Networks Haiqian Yang et.al. 2401.12196v1 null
2024-01-22 Text Embedding Inversion Attacks on Multilingual Language Models Yiyi Chen et.al. 2401.12192v1 null
2024-01-19 Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data Lihe Yang et.al. 2401.10891v1 link
2024-01-19 Event detection from novel data sources: Leveraging satellite imagery alongside GPS traces Ekin Ugurel et.al. 2401.10890v1 link
2024-01-19 Synthesizing Moving People with 3D Control Boyi Li et.al. 2401.10889v1 null
2024-01-19 Pruning for Protection: Increasing Jailbreak Resistance in Aligned LLMs Without Fine-Tuning Adib Hasan et.al. 2401.10862v1 link
2024-01-18 ParaHome: Parameterizing Everyday Home Activities Towards 3D Generative Modeling of Human-Object Interactions Jeonghwan Kim et.al. 2401.10232v1 null
2024-01-18 Simultaneous Tactile Estimation and Control for Extrinsic Dexterity Antonia Bronars et.al. 2401.10230v1 null
2024-01-18 RAP-SAM: Towards Real-Time All-Purpose Segment Anything Shilin Xu et.al. 2401.10228v1 link
2024-01-18 A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting Wouter Van Gansbeke et.al. 2401.10227v1 link
2024-01-18 The Manga Whisperer: Automatically Generating Transcriptions for Comics Ragav Sachdeva et.al. 2401.10224v1 link
2024-01-18 Supervised Fine-tuning in turn Improves Visual Foundation Models Xiaohu Jiang et.al. 2401.10222v1 link
2024-01-18 AutoFT: Robust Fine-Tuning by Optimizing Hyperparameters on OOD Data Caroline Choi et.al. 2401.10220v1 null
2024-01-18 Explaining the Implicit Neural Canvas: Connecting Pixels to Neurons by Tracing their Contributions Namitha Padmanabhan et.al. 2401.10217v1 null
2024-01-18 GPAvatar: Generalizable and Precise Head Avatar from Image(s) Xuangeng Chu et.al. 2401.10215v1 link
2024-01-17 Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model Lianghui Zhu et.al. 2401.09417v1 link
2024-01-17 Vlogger: Make Your Dream A Vlog Shaobin Zhuang et.al. 2401.09414v1 link
2024-01-17 Deciphering Textual Authenticity: A Generalized Strategy through the Lens of Large Language Semantics for Detecting Human vs. Machine-Generated Text Mazal Bethany et.al. 2401.09407v1 null
2024-01-16 Machine Translation with Large Language Models: Prompt Engineering for Persian, English, and Russian Directions Nooshin Pourkamali et.al. 2401.08429v1 null
2024-01-16 Three ways that non-differentiability affects neural network training Siddharth Krishna Kumar et.al. 2401.08426v1 null
2024-01-16 U-DIADS-Bib: a full and few-shot pixel-precise dataset for document layout analysis of ancient manuscripts Silvia Zottin et.al. 2401.08425v1 null
2024-01-16 Ask the experts: sourcing high-quality datasets for nutritional counselling through Human-AI collaboration Simone Balloccu et.al. 2401.08420v1 link
2024-01-16 Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation Haoran Xu et.al. 2401.08417v1 link
2024-01-12 Automated Test Case Repair Using Language Models Ahmadreza Saboor Yaraghi et.al. 2401.06765v1 null
2024-01-12 APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding Mingdao Liu et.al. 2401.06761v1 null
2024-01-12 Synthetic Data Generation Framework, Dataset, and Efficient Deep Model for Pedestrian Intention Prediction Muhammad Naveed Riaz et.al. 2401.06757v1 null
2024-01-12 Stylometry Analysis of Multi-authored Documents for Authorship and Author Style Change Detection Muhammad Tayyab Zamir et.al. 2401.06752v1 null
2024-01-12 The Unreasonable Effectiveness of Easy Training Data for Hard Tasks Peter Hase et.al. 2401.06751v1 link
2024-01-12 Measure Theoretic Reeb Graphs and Reeb Spaces Qingsong Wang et.al. 2401.06748v1 null
2024-01-11 Distilling Vision-Language Models on Millions of Videos Yue Zhao et.al. 2401.06129v1 null
2024-01-11 E $^{2}$ GAN: Efficient Training of Efficient GANs for Image-to-Image Translation Yifan Gong et.al. 2401.06127v1 null
2024-01-11 Dubbing for Everyone: Data-Efficient Visual Dubbing using Neural Rendering Priors Jack Saunders et.al. 2401.06126v1 null
2024-01-11 Manipulating Feature Visualizations with Gradient Slingshots Dilyara Bareeva et.al. 2401.06122v1 link
2024-01-11 Gaussian Shadow Casting for Neural Characters Luis Bolanos et.al. 2401.06116v1 null
2024-01-11 Jupyter widgets and extensions for education and research in computational physics and chemistry Dou Du et.al. 2401.06113v1 null
2024-01-10 InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes Mohamad Shahbazi et.al. 2401.05335v1 null
2024-01-10 URHand: Universal Relightable Hands Zhaoxi Chen et.al. 2401.05334v1 null
2024-01-10 \textit{SmartMME}: Implementation of Base Station Switching Off Strategy in ns-3 Argha Sen et.al. 2401.05329v1 null
2024-01-10 Leveraging Print Debugging to Improve Code Generation in Large Language Models Xueyu Hu et.al. 2401.05319v1 null
2024-01-10 Can Probabilistic Feedback Drive User Impacts in Online Platforms? Jessica Dai et.al. 2401.05304v1 null
2024-01-09 Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation Xiyi Chen et.al. 2401.04728v1 null
2024-01-09 Low-Resource Vision Challenges for Foundation Models Yunhua Zhang et.al. 2401.04716v1 null
2024-01-09 Bin Packing under Random-Order: Breaking the Barrier of 3/2 Anish Hebbar et.al. 2401.04714v1 link
2024-01-09 RNA-TransCrypt: Image Encryption Using Chaotic RNA Encoding, Novel Transformative Substitution, and Tailored Cryptographic Operations Muhammad Shahbaz Khan et.al. 2401.04707v1 null
2024-01-08 AGG: Amortized Generative 3D Gaussians for Single Image to 3D Dejia Xu et.al. 2401.04099v1 null
2024-01-08 Modeling AoII in Push- and Pull-Based Sampling of Continuous Time Markov Chains Ismail Cosandal et.al. 2401.04098v1 null
2024-01-08 GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation Tong Wu et.al. 2401.04092v1 link
2024-01-08 Mixtral of Experts Albert Q. Jiang et.al. 2401.04088v1 null
2024-01-05 Denoising Vision Transformers Jiawei Yang et.al. 2401.02957v1 link
2024-01-05 Locally Adaptive Neural 3D Morphable Models Michail Tarasiou et.al. 2401.02937v1 link
2024-01-05 Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks Kevin Everson et.al. 2401.02921v1 null
2024-01-04 Learning to Prompt with Text Only Supervision for Vision-Language Models Muhammad Uzair Khattak et.al. 2401.02418v1 link
2024-01-04 LLaMA Pro: Progressive LLaMA with Block Expansion Chengyue Wu et.al. 2401.02415v1 link
2024-01-04 LLM Augmented LLMs: Expanding Capabilities through Composition Rachit Bansal et.al. 2401.02412v1 null
2024-01-04 What You See is What You GAN: Rendering Every Pixel for High-Fidelity Geometry in 3D GANs Alex Trevithick et.al. 2401.02411v1 null
2024-01-04 Correctness Comparison of ChatGPT-4, Bard, Claude-2, and Copilot for Spatial Tasks Hartwig H. Hochmair et.al. 2401.02404v1 null
2024-01-04 3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation Zihao Xiao et.al. 2401.02402v1 null
2024-01-04 Learning the 3D Fauna of the Web Zizhang Li et.al. 2401.02400v1 null
2024-01-03 From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations Evonne Ng et.al. 2401.01885v1 link
2024-01-03 A rewriting-logic-with-SMT-based formal analysis and parameter synthesis framework for parametric time Petri nets Jaime Arias et.al. 2401.01884v1 null
2024-01-03 Theoretical guarantees on the best-of-n alignment policy Ahmad Beirami et.al. 2401.01879v1 null
2024-01-03 Graph Neural Networks for Surfactant Multi-Property Prediction Christoforos Brozos et.al. 2401.01874v1 link
2024-01-03 Dataset Difficulty and the Role of Inductive Bias Devin Kwok et.al. 2401.01867v1 null
2024-01-02 Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models Zixiang Chen et.al. 2401.01335v1 link
2024-01-02 An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extraction Zaratiana Urchade et.al. 2401.01326v1 link
2024-01-02 A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models S. M Towhidul Islam Tonmoy et.al. 2401.01313v1 null
2024-01-02 On the uniqueness and computation of commuting extensions Pascal Koiran et.al. 2401.01302v1 null
2023-12-29 K-PERM: Personalized Response Generation Using Dynamic Knowledge Retrieval and Persona-Adaptive Queries Kanak Raj et.al. 2312.17748v1 link
2023-12-28 Do Androids Know They’re Only Dreaming of Electric Sheep? Sky CH-Wang et.al. 2312.17249v1 null
2023-12-28 Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity Guhao Feng et.al. 2312.17248v1 null
2023-12-28 The LLM Surgeon Tycho F. A. van der Ouderaa et.al. 2312.17244v1 link
2023-12-28 Unsupervised Universal Image Segmentation Dantong Niu et.al. 2312.17243v1 link
2023-12-28 Learning to Generate Text in Arbitrary Writing Styles Aleem Khan et.al. 2312.17242v1 null
2023-12-28 An Improved Baseline for Reasoning Segmentation with Large Language Model Senqiao Yang et.al. 2312.17240v1 null
2023-12-28 Fast Inference of Mixture-of-Experts Language Models with Offloading Artyom Eliseev et.al. 2312.17238v1 link
2023-12-28 A Simple LLM Framework for Long-Range Video Question-Answering Ce Zhang et.al. 2312.17235v1 link
2023-12-28 Personalized Restoration via Dual-Pivot Tuning Pradyumna Chari et.al. 2312.17234v1 null
2023-12-26 Social-Transmotion: Promptable Human Trajectory Prediction Saeed Saadatnejad et.al. 2312.16168v1 link
2023-12-26 Age of Information in Gossip Networks: A Friendly Introduction and Literature Survey Priyanka Kaswan et.al. 2312.16163v1 null
2023-12-26 Zero-Shot Cross-Lingual Reranking with Large Language Models for Low-Resource Languages Mofetoluwa Adeyemi et.al. 2312.16159v1 null
2023-12-26 From Text to Multimodal: A Comprehensive Survey of Adversarial Example Generation in Question Answering Systems Gulsum Yigit et.al. 2312.16156v1 null
2023-12-26 Validating Light Phenomena Conceptual Assessment Through The Lens of CTT and IRT Frameworks Purwoko Haryadi Santoso et.al. 2312.16153v1 null
2023-12-26 SoundCount: Sound Counting from Raw Audio with Dyadic Decomposition Neural Network Yuhang He et.al. 2312.16149v1 null
2023-12-22 MACS: Mass Conditioned 3D Hand and Object Motion Synthesis Soshi Shimada et.al. 2312.14929v1 null
2023-12-22 PoseGen: Learning to Generate 3D Human Pose Dataset with NeRF Mohsen Gholami et.al. 2312.14915v1 link
2023-12-21 Virtual Pets: Animatable Animal Generation in 3D Scenes Yen-Chi Cheng et.al. 2312.14154v1 null
2023-12-21 DriveLM: Driving with Graph Visual Question Answering Chonghao Sima et.al. 2312.14150v1 link
2023-12-21 HeadCraft: Modeling High-Detail Shape Variations for Animated 3DMMs Artem Sevastopolsky et.al. 2312.14140v1 null
2023-12-21 Diffusion Reward: Learning Rewards via Conditional Video Diffusion Tao Huang et.al. 2312.14134v1 null
2023-12-20 Generative Multimodal Models are In-Context Learners Quan Sun et.al. 2312.13286v1 link
2023-12-20 UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections Fangjinhua Wang et.al. 2312.13285v1 null
2023-12-20 Deep Learning on 3D Neural Fields Pierluigi Zama Ramirez et.al. 2312.13277v1 null
2023-12-20 Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting Junwu Zhang et.al. 2312.13271v1 link
2023-12-19 Weakly Supervised Open-Vocabulary Object Detection Jianghang Lin et.al. 2312.12437v1 null
2023-12-19 A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise Chaoyou Fu et.al. 2312.12436v1 link
2023-12-19 On Inference Stability for Diffusion Models Viet Nguyen et.al. 2312.12431v1 link
2023-12-19 ROSE: A reduced-order scattering emulator for optical models Daniel Odell et.al. 2312.12426v1 null
2023-12-19 SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process Mengyu Wang et.al. 2312.12425v1 link
2023-12-19 Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model Shraman Pramanick et.al. 2312.12423v1 null
2023-12-19 Scene-Conditional 3D Object Stylization and Composition Jinghao Zhou et.al. 2312.12419v1 null
2023-12-18 On Computing Makespan-Optimal Solutions for Generalized Sliding-Tile Puzzles Marcus Gozon et.al. 2312.10887v1 null
2023-12-18 A novel diffusion recommendation algorithm based on multi-scale cnn and residual lstm Yong Niu et.al. 2312.10885v1 null
2023-12-18 Sharable Clothoid-based Continuous Motion Planning for Connected Automated Vehicles Sanghoon Oh et.al. 2312.10880v1 null
2023-12-18 Country-Scale Cropland Mapping in Data-Scarce Settings Using Deep Learning: A Case Study of Nigeria Joaquin Gajardo et.al. 2312.10872v1 link
2023-12-18 From Google Gemini to OpenAI Q* (Q-Star): A Survey of Reshaping the Generative Artificial Intelligence (AI) Research Landscape Timothy R. McIntosh et.al. 2312.10868v1 null
2023-12-15 Osprey: Pixel Understanding with Visual Instruction Tuning Yuqian Yuan et.al. 2312.10032v1 link
2023-12-15 Wearable Coaxially-shielded Metamaterial for Magnetic Resonance Imaging Xia Zhu et.al. 2312.10018v1 null
2023-12-15 Movement Primitive Diffusion: Learning Gentle Robotic Manipulation of Deformable Objects Paul Maria Scheikl et.al. 2312.10008v1 null
2023-12-15 Faithful Persona-based Conversational Dataset Generation with Large Language Models Pegah Jandaghi et.al. 2312.10007v1 link
2023-12-14 LIME: Localized Image Editing via Attention Regularization in Diffusion Models Enis Simsar et.al. 2312.09256v1 null
2023-12-14 Revisiting Depth Completion from a Stereo Matching Perspective for Cross-domain Generalization Luca Bartolomei et.al. 2312.09254v1 link
2023-12-14 FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Injection Hongsuk Choi et.al. 2312.09252v1 null
2023-12-14 VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation Jinguo Zhu et.al. 2312.09251v1 link
2023-12-14 Single Mesh Diffusion Models with Field Latents for Texture Generation Thomas W. Mitchel et.al. 2312.09250v1 null
2023-12-14 ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining Ruoxi Shi et.al. 2312.09249v1 null
2023-12-14 Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking Jacob Eisenstein et.al. 2312.09244v1 null
2023-12-14 OccNeRF: Self-Supervised Multi-Camera Occupancy Prediction with Neural Radiance Fields Chubin Zhang et.al. 2312.09243v1 link
2023-12-14 Text2Immersion: Generative Immersive Scene with 3D Gaussians Hao Ouyang et.al. 2312.09242v1 null
2023-12-13 SAM-guided Graph Cut for 3D Instance Segmentation Haoyu Guo et.al. 2312.08372v1 null
2023-12-13 PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection Kuan-Chih Huang et.al. 2312.08371v1 link
2023-12-13 An Invitation to Deep Reinforcement Learning Bernhard Jaeger et.al. 2312.08365v1 null
2023-12-13 View-Dependent Octree-based Mesh Extraction in Unbounded Scenes for Procedural Synthetic Data Zeyu Ma et.al. 2312.08364v1 link
2023-12-13 On the Computational Hardness of Quantum One-Wayness Bruno Cavalar et.al. 2312.08363v1 null
2023-12-13 Distributed Inference and Fine-tuning of Large Language Models Over The Internet Alexander Borzunov et.al. 2312.08361v1 null
2023-12-12 diff History for Long-Context Language Agents Ulyana Piterbarg et.al. 2312.07540v1 link
2023-12-12 HeadArtist: Text-conditioned 3D Head Generation with Self Score Distillation Hongyu Liu et.al. 2312.07539v1 null
2023-12-12 FreeInit: Bridging Initialization Gap in Video Diffusion Models Tianxing Wu et.al. 2312.07537v1 link
2023-12-12 FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition Sicheng Mo et.al. 2312.07536v1 null
2023-12-12 Interfacing Foundation Models’ Embeddings Xueyan Zou et.al. 2312.07532v1 link
2023-12-12 Topological Obstructions and How to Avoid Them Babak Esmaeili et.al. 2312.07529v1 null
2023-12-11 CAD: Photorealistic 3D Generation via Adversarial Distillation Ziyu Wan et.al. 2312.06663v1 null
2023-12-11 Photorealistic Video Generation with Diffusion Models Agrim Gupta et.al. 2312.06662v1 null
2023-12-11 UpFusion: Novel View Diffusion from Unposed Sparse View Observations Bharath Raj Nagoor Kani et.al. 2312.06661v1 null
2023-12-11 EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM Chong Zhou et.al. 2312.06660v1 link
2023-12-11 Sherpa3D: Boosting High-Fidelity Text-to-3D Generation via Coarse 3D Prior Fangfu Liu et.al. 2312.06655v1 link
2023-12-11 LightSim: Neural Lighting Simulation for Urban Scenes Ava Pun et.al. 2312.06654v1 null
2023-12-11 Adaptive Human Trajectory Prediction via Latent Corridors Neerja Thakkar et.al. 2312.06653v1 null
2023-12-11 Nuvo: Neural UV Mapping for Unruly 3D Representations Pratul P. Srinivasan et.al. 2312.05283v1 null
2023-12-08 KBFormer: A Diffusion Model for Structured Entity Completion Ouail Kitouni et.al. 2312.05253v1 null
2023-12-08 Laboratory realization of relativistic pair-plasma beams C. D. Arrowsmith et.al. 2312.05244v1 null
2023-12-08 Contra generative AI detection in higher education assessments Cesare G. Ardito et.al. 2312.05241v1 null
2023-12-08 SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation Thuan Hoang Nguyen et.al. 2312.05239v1 null
2023-12-08 Seeing ChatGPT Through Universities’ Policies, Resources and Guidelines Hui Wang et.al. 2312.05235v1 null
2023-12-07 Scaling Laws of Synthetic Images for Model Training … for Now Lijie Fan et.al. 2312.04567v1 link
2023-12-07 Gen2Det: Generate to Detect Saksham Suri et.al. 2312.04566v1 null
2023-12-07 MuRF: Multi-Baseline Radiance Fields Haofei Xu et.al. 2312.04565v1 link
2023-12-07 GenDeF: Learning Generative Deformation Field for Video Generation Wen Wang et.al. 2312.04561v1 null
2023-12-07 NeRFiller: Completing Scenes via Generative 3D Inpainting Ethan Weber et.al. 2312.04560v1 null
2023-12-07 PrimDiffusion: Volumetric Primitives Diffusion for 3D Human Generation Zhaoxi Chen et.al. 2312.04559v1 link
2023-12-07 GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation Shoufa Chen et.al. 2312.04557v1 null
2023-12-07 Large Language Models for Mathematicians Simon Frieder et.al. 2312.04556v1 null
2023-12-07 Improved Visual Grounding through Self-Consistent Explanations Ruozhen He et.al. 2312.04554v1 null
2023-12-07 Generating Illustrated Instructions Sachit Menon et.al. 2312.04552v1 null
2023-12-06 Relightable Gaussian Codec Avatars Shunsuke Saito et.al. 2312.03704v1 null
2023-12-06 Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning Xinshun Wang et.al. 2312.03703v1 link
2023-12-06 Self-conditioned Image Generation via Generating Representations Tianhong Li et.al. 2312.03701v1 link
2023-12-06 Intrinsic Harmonization for Illumination-Aware Compositing Chris Careaga et.al. 2312.03698v1 link
2023-12-06 Efficient Learning in Polyhedral Games via Best Response Oracles Darshan Chakrabarti et.al. 2312.03696v1 null
2023-12-06 Memory Triggers: Unveiling Memorization in Text-To-Image Generative Models through Word-Level Duplication Ali Naseh et.al. 2312.03692v1 null
2023-12-06 On the Role of Edge Dependency in Graph Generative Models Sudhanshu Chanpuriya et.al. 2312.03691v1 null
2023-12-06 Evaluating and Mitigating Discrimination in Language Model Decisions Alex Tamkin et.al. 2312.03689v1 null
2023-12-05 GPT4Point: A Unified Framework for Point-Language Understanding and Generation Zhangyang Qi et.al. 2312.02980v1 null
2023-12-05 Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World Kiana Ehsani et.al. 2312.02976v1 null
2023-12-05 Describing Differences in Image Sets with Natural Language Lisa Dunlap et.al. 2312.02974v1 link
2023-12-05 Alchemist: Parametric Control of Material Properties with Diffusion Models Prafull Sharma et.al. 2312.02970v1 null
2023-12-05 Rank-without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models Xinyu Zhang et.al. 2312.02969v1 null
2023-12-05 AmbiGen: Generating Ambigrams from Pre-trained Diffusion Model Boheng Zhao et.al. 2312.02967v1 null
2023-12-05 Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection Cheng-Ju Ho et.al. 2312.02966v1 link
2023-12-05 MVHumanNet: A Large-scale Dataset of Multi-view Daily Dressing Human Captures Zhangyang Xiong et.al. 2312.02963v1 null
2023-12-04 Aligning and Prompting Everything All at Once for Universal Visual Perception Yunhang Shen et.al. 2312.02153v1 link
2023-12-04 Readout Guidance: Learning Control from Diffusion Features Grace Luo et.al. 2312.02150v1 null
2023-12-04 Generative Powers of Ten Xiaojuan Wang et.al. 2312.02149v1 null
2023-12-04 Rejuvenating image-GPT as Strong Visual Representation Learners Sucheng Ren et.al. 2312.02147v1 link
2023-12-04 Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation Bingxin Ke et.al. 2312.02145v1 link
2023-12-04 Optimizing Camera Configurations for Multi-View Pedestrian Detection Yunzhong Hou et.al. 2312.02144v1 null
2023-12-04 Competition-Level Problems Are Effective Evaluators of LLMs Yiming Huang et.al. 2312.02143v1 null
2023-12-04 Object Recognition as Next Token Prediction Kaiyu Yue et.al. 2312.02142v1 link
2023-12-01 VideoBooth: Diffusion-based Video Generation with Image Prompts Yuming Jiang et.al. 2312.00777v1 null
2023-12-01 Towards Generalizable Zero-Shot Manipulation via Translating Human Interaction Plans Homanga Bharadhwaj et.al. 2312.00775v1 null
2023-12-01 Beyond ChatBots: ExploreLLM for Structured Thoughts and Personalized Model Responses Xiao Ma et.al. 2312.00763v1 null
2023-12-01 Mamba: Linear-Time Sequence Modeling with Selective State Spaces Albert Gu et.al. 2312.00752v1 link
2023-12-01 Reduction from sparse LPN to LPN, Dual Attack 3.0 Kévin Carrier et.al. 2312.00747v1 null
2023-12-01 Adversarial Score Distillation: When score distillation meets GAN Min Wei et.al. 2312.00739v1 link
2023-11-30 Dataset Distillation in Large Data Era Zeyuan Yin et.al. 2311.18838v1 link
2023-11-30 VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models Zhen Xing et.al. 2311.18837v1 null
2023-11-30 PoseGPT: Chatting about 3D Human Pose Yao Feng et.al. 2311.18836v1 null
2023-11-30 InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation Rongyao Fang et.al. 2311.18835v1 link
2023-11-30 ART $\boldsymbol{\cdot}$ V: Auto-Regressive Text-to-Video Generation with Diffusion Models Wenming Weng et.al. 2311.18834v1 null
2023-11-30 Exploiting Diffusion Prior for Generalizable Pixel-Level Semantic Prediction Hsin-Ying Lee et.al. 2311.18832v1 link
2023-11-30 MotionEditor: Editing Video Motion via Content-Aware Diffusion Shuyuan Tu et.al. 2311.18830v1 link
2023-11-30 MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation Yanhui Wang et.al. 2311.18829v1 null
2023-11-30 One-step Diffusion with Distribution Matching Distillation Tianwei Yin et.al. 2311.18828v1 null
2023-11-30 An Adaptive Framework for Generalizing Network Traffic Prediction towards Uncertain Environments Alexander Downey et.al. 2311.18824v1 null
2023-11-29 A Simple Recipe for Language-guided Domain Generalized Segmentation Mohammad Fahes et.al. 2311.17922v1 null
2023-11-29 Do text-free diffusion models learn discriminative visual representations? Soumik Mukhopadhyay et.al. 2311.17921v1 link
2023-11-29 Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models Daniel Geng et.al. 2311.17919v1 null
2023-11-29 Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving Yuqi Wang et.al. 2311.17918v1 link
2023-11-29 AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text Jianfeng Zhang et.al. 2311.17917v1 null
2023-11-29 OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation Qidong Huang et.al. 2311.17911v1 link
2023-11-29 HUGS: Human Gaussian Splats Muhammed Kocabas et.al. 2311.17910v1 null
2023-11-29 CG3D: Compositional Generation for Text-to-3D via Gaussian Splatting Alexander Vilesov et.al. 2311.17907v1 null
2023-11-28 HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting Xian Liu et.al. 2311.17061v1 null
2023-11-28 Material Palette: Extraction of Materials from a Single Image Ivan Lopes et.al. 2311.17060v1 null
2023-11-28 Panoptic Video Scene Graph Generation Jingkang Yang et.al. 2311.17058v1 link
2023-11-28 ReMoS: Reactive 3D Motion Synthesis for Two-Person Interactions Anindita Ghosh et.al. 2311.17057v1 null
2023-11-28 Self-Supervised Motion Magnification by Backpropagating Through Optical Flow Zhaoying Pan et.al. 2311.17056v1 null
2023-11-28 No Representation Rules Them All in Category Discovery Sagar Vaze et.al. 2311.17055v1 null
2023-11-28 DiffuseBot: Breeding Soft Robots With Physics-Augmented Generative Diffusion Models Tsun-Hsuan Wang et.al. 2311.17053v1 null
2023-11-28 Surf-D: High-Quality Surface Generation for Arbitrary Topologies using Diffusion Models Zhengming Yu et.al. 2311.17050v1 null
2023-11-27 Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models Munan Ning et.al. 2311.16103v1 link
2023-11-27 Test-time Adaptation of Discriminative Models via Diffusion Generative Feedback Mihir Prabhudesai et.al. 2311.16102v1 null
2023-11-27 How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs Haoqin Tu et.al. 2311.16101v1 link
2023-11-27 GART: Gaussian Articulated Template Models Jiahui Lei et.al. 2311.16099v1 null
2023-11-27 On Bringing Robots Home Nur Muhammad Mahi Shafiullah et.al. 2311.16098v1 link
2023-11-27 CG-HOI: Contact-Guided 3D Human-Object Interaction Generation Christian Diller et.al. 2311.16097v1 null
2023-11-27 Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling Zhe Li et.al. 2311.16096v1 link
2023-11-27 Self-correcting LLM-controlled Diffusion Models Tsung-Han Wu et.al. 2311.16090v1 null
2023-11-27 DUnE: Dataset for Unified Editing Afra Feyza Akyürek et.al. 2311.16087v1 link
2023-11-24 SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation Lingchen Meng et.al. 2311.14671v1 link
2023-11-24 Data-driven Prior Learning for Bayesian Optimisation Sigrid Passano Hellan et.al. 2311.14653v1 link
2023-11-24 One Pass Streaming Algorithm for Super Long Token Attention Approximation in Sublinear Space Raghav Addanki et.al. 2311.14652v1 null
2023-11-24 History Filtering in Imperfect Information Games: Algorithms and Complexity Christopher Solinas et.al. 2311.14651v1 null
2023-11-22 Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation Daichi Horita et.al. 2311.13602v1 null
2023-11-22 Visual In-Context Prompting Feng Li et.al. 2311.13601v1 link
2023-11-22 ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs Viraj Shah et.al. 2311.13600v1 null
2023-11-22 Risk-sensitive Markov Decision Process and Learning under General Utility Functions Zhengqi Wu et.al. 2311.13589v1 null
2023-11-22 A Survey of Serverless Machine Learning Model Inference Kamil Kojs et.al. 2311.13587v1 null
2023-11-22 On diffusion-based generative models and their error bounds: The log-concave case with full convergence estimates Stefano Bruno et.al. 2311.13584v1 null
2023-11-22 PaSS: Parallel Speculative Sampling Giovanni Monea et.al. 2311.13581v1 null
2023-11-22 Aufbau Suppressed Coupled Cluster Theory for Electronically Excited States Harrison Tuckman et.al. 2311.13576v1 null
2023-11-21 Intrinsic Image Decomposition via Ordinal Shading Chris Careaga et.al. 2311.12792v1 link
2023-11-21 Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks Samyak Jain et.al. 2311.12786v1 null
2023-11-20 Rate-Independent Gradient Crystal Plasticity Theory – Robust Algorithmic Formulations based on Incremental Energy Minimization Volker Fohrmeister et.al. 2311.12026v1 null
2023-11-20 The allosteric lever: towards a principle of specific allosteric response Maximilian Vossel et.al. 2311.12025v1 null
2023-11-20 PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction Peng Wang et.al. 2311.12024v1 null
2023-11-20 Macroscopic description of a heavy particle immersed within a flow of light particles Radek Erban et.al. 2311.12021v1 null
2023-11-20 An Empirical Study of Self-Admitted Technical Debt in Machine Learning Software Aaditya Bhatia et.al. 2311.12019v1 null
2023-11-20 GPT-4V(ision) for Robotics: Multimodal Task Planning from Human Demonstration Naoki Wake et.al. 2311.12015v1 null
2023-11-17 Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning Rohit Girdhar et.al. 2311.10709v1 null
2023-11-17 SelfEval: Leveraging the discriminative nature of generative models for evaluation Sai Saketh Rambhatla et.al. 2311.10708v1 null
2023-11-17 Cactus Representations in Polylogarithmic Max-flow via Maximal Isolating Mincuts Zhongtian He et.al. 2311.10706v1 null
2023-11-16 The Chosen One: Consistent Characters in Text-to-Image Diffusion Models Omri Avrahami et.al. 2311.10093v1 null
2023-11-16 Traffic Video Object Detection using Motion Prior Lihao Liu et.al. 2311.10092v1 null
2023-11-16 Adaptive Shells for Efficient Neural Radiance Field Rendering Zian Wang et.al. 2311.10091v1 null
2023-11-16 Emu Edit: Precise Image Editing via Recognition and Generation Tasks Shelly Sheynin et.al. 2311.10089v1 null
2023-11-16 DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback Yangyi Chen et.al. 2311.10081v1 null
2023-11-16 Improving 3D Synthetic Jet Modeling in a Crossflow Howard Ho et.al. 2311.10072v1 null
2023-11-15 Single-Image 3D Human Digitization with Shape-Guided Diffusion Badour AlBahar et.al. 2311.09221v1 null
2023-11-15 DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model Yinghao Xu et.al. 2311.09217v1 null
2023-11-15 Assessing Translation capabilities of Large Language Models involving English and Indian Languages Vandan Mujadia et.al. 2311.09216v1 null
2023-11-15 GRIM: GRaph-based Interactive narrative visualization for gaMes Jorge Leandro et.al. 2311.09213v1 null
2023-11-15 Controllable Text Summarization: Unraveling Challenges, Approaches, and Prospects – A Survey Ashok Urlana et.al. 2311.09212v1 link
2023-11-15 Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models Wenhao Yu et.al. 2311.09210v1 null
2023-11-15 A Unified Approach to Learning Ising Models: Beyond Independence and Bounded Width Jason Gaitonde et.al. 2311.09197v1 null
2023-11-15 Self-Supervised Curriculum Generation for Autonomous Reinforcement Learning without Task-Specific Knowledge Sang-Hyun Lee et.al. 2311.09195v1 null
2023-11-15 Structural Priming Demonstrates Abstract Grammatical Representations in Multilingual Language Models James A. Michaelov et.al. 2311.09194v1 null
2023-11-14 Instant3D: Instant Text-to-3D Generation Ming Li et.al. 2311.08403v1 null
2023-11-14 Fine-tuning Language Models for Factuality Katherine Tian et.al. 2311.08401v1 null
2023-11-14 Towards Open-Ended Visual Recognition with Large Language Model Qihang Yu et.al. 2311.08400v1 link
2023-11-14 Are Large Language Models Temporally Grounded? Yifu Qiu et.al. 2311.08398v1 link
2023-11-14 MVSA-Net: Multi-View State-Action Recognition for Robust and Deployable Trajectory Generation Ehsan Asali et.al. 2311.08393v1 null
2023-11-14 On What Basis? Predicting Text Preference Via Structured Comparative Reasoning Jing Nathan Yan et.al. 2311.08390v1 null
2023-11-14 TSST: A Benchmark and Evaluation Models for Text Speech-Style Transfer Huashan Sun et.al. 2311.08389v1 null
2023-11-13 To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning Junke Wang et.al. 2311.07574v1 link
2023-11-13 Realizability of Free Spaces of Curves Hugo A. Akitaya et.al. 2311.07573v1 null
2023-11-13 Feature emergence via margin maximization: case studies in algebraic tasks Depen Morwani et.al. 2311.07568v1 null
2023-11-13 GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation An Yan et.al. 2311.07562v1 link
2023-11-13 Fast Normalized Cross-Correlation for Template Matching with Rotations José María Almira et.al. 2311.07561v1 null
2023-11-13 Sound Gradual Verification with Symbolic Execution Conrad Zimmerman et.al. 2311.07559v1 null
2023-11-13 Data-Efficient Task Generalization via Probabilistic Model-based Meta Reinforcement Learning Arjun Bhardwaj et.al. 2311.07558v1 null
2023-11-10 Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization Weiyang Liu et.al. 2311.06243v1 null
2023-11-10 Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Bin Xiao et.al. 2311.06242v1 null
2023-11-10 Nonnegativity Problems for Matrix Semigroups Julian D’Costa et.al. 2311.06241v1 null
2023-11-10 Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming in the Wild Nanna Inie et.al. 2311.06237v1 null
2023-11-10 Deep Learning meets Blockchain for Automated and Secure Access Control Asma Jodeiri Akbarfam et.al. 2311.06236v1 null
2023-11-10 Learning material synthesis-structure-property relationship by data fusion: Bayesian Co-regionalization N-Dimensional Piecewise Function Learning A. Gilad Kusne et.al. 2311.06228v1 null
2023-11-10 Does Differential Privacy Prevent Backdoor Attacks in Practice? Fereshteh Razmi et.al. 2311.06227v1 null
2023-11-09 What Do I Hear? Generating Sounds for Visuals with ChatGPT David Chuan-En Lin et.al. 2311.05609v1 null
2023-11-09 Real-Time Neural Rasterization for Large Scenes Jeffrey Yunfan Liu et.al. 2311.05607v1 null
2023-11-09 Diffusion-Generative Multi-Fidelity Learning for Physical Simulation Zheng Wang et.al. 2311.05606v1 null
2023-11-09 3D-QAE: Fully Quantum Auto-Encoding of 3D Point Clouds Lakshika Rathi et.al. 2311.05604v1 null
2023-11-09 Reconstructing Objects in-the-wild for Realistic Sensor Simulation Ze Yang et.al. 2311.05602v1 null
2023-11-09 SynH2R: Synthesizing Hand-Object Motions for Learning Human-to-Robot Handovers Sammy Christen et.al. 2311.05599v1 null
2023-11-09 LLM Augmented Hierarchical Agents Bharat Prakash et.al. 2311.05596v1 null
2023-11-08 GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs Zhenfang Chen et.al. 2311.04901v1 null
2023-11-08 How Abstract Is Linguistic Generalization in Large Language Models? Experiments with Argument Structure Michael Wilson et.al. 2311.04900v1 link
2023-11-08 Optimized measurements of chaotic dynamical systems via the information bottleneck Kieran A. Murphy et.al. 2311.04896v1 null
2023-11-08 The Monadic Theory of Toric Words Valérie Berthé et.al. 2311.04895v1 null
2023-11-08 Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs Shashank Gupta et.al. 2311.04892v1 link
2023-11-08 AutoChip: Automating HDL Generation Using LLM Feedback Shailja Thakur et.al. 2311.04887v1 link
2023-11-08 SEMQA: Semi-Extractive Multi-Source Question Answering Tal Schuster et.al. 2311.04886v1 link
2023-11-07 Towards Garment Sewing Pattern Reconstruction from a Single Image Lijuan Liu et.al. 2311.04218v1 link
2023-11-07 Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves Yihe Deng et.al. 2311.04205v1 link
2023-11-07 Sharp Thresholds Imply Circuit Lower Bounds: from random 2-SAT to Planted Clique David Gamarnik et.al. 2311.04204v1 null
2023-11-07 Exploring Recommendation Capabilities of GPT-4V(ision): A Preliminary Case Study Peilin Zhou et.al. 2311.04199v1 null
2023-11-07 JPAVE: A Generation and Classification-based Model for Joint Product Attribute Prediction and Value Extraction Zhongfen Deng et.al. 2311.04196v1 link
2023-11-06 GLaMM: Pixel Grounding Large Multimodal Model Hanoona Rasheed et.al. 2311.03356v1 null
2023-11-06 SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis Hanrong Ye et.al. 2311.03355v1 null
2023-11-06 CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding Junyan Li et.al. 2311.03354v1 null
2023-11-06 Scalable and Transferable Black-Box Jailbreaks for Language Models via Persona Modulation Rusheb Shah et.al. 2311.03348v1 null
2023-11-06 Decomposing Probability Marginals Beyond Affine Requirements Jannik Matuschke et.al. 2311.03346v1 null
2023-11-06 Long-Term Invariant Local Features via Implicit Cross-Domain Correspondences Zador Pataki et.al. 2311.03345v1 null
2023-11-06 Embedding First Order Logic into Kernel Machines Michelangelo Diligenti et.al. 2311.03340v1 null
2023-11-03 EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision Jiawei Yang et.al. 2311.02077v1 null
2023-11-03 Universal Sharpness Dynamics in Neural Network Training: Fixed Point Analysis, Edge of Stability, and Route to Chaos Dayal Singh Kalra et.al. 2311.02076v1 null
2023-11-03 Envy-Free Cake-Cutting for Four Agents Alexandros Hollender et.al. 2311.02075v1 null
2023-11-03 Learning Historical Status Prompt for Accurate and Robust Visual Tracking Wenrui Cai et.al. 2311.02072v1 null
2023-11-03 Grounded Intuition of GPT-Vision’s Abilities with Scientific Images Alyssa Hwang et.al. 2311.02069v1 link
2023-11-03 GroomGen: A High-Quality Generative Hair Model Using Hierarchical Latent Representations Yuxiao Zhou et.al. 2311.02062v1 null
2023-11-03 Active Learning-Based Species Range Estimation Christian Lange et.al. 2311.02061v1 link
2023-11-02 Idempotent Generative Network Assaf Shocher et.al. 2311.01462v1 null
2023-11-02 Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization Jameel Hassan et.al. 2311.01459v1 null
2023-11-02 Detecting Deepfakes Without Seeing Any Tal Reiss et.al. 2311.01458v1 link
2023-11-02 RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation Yufei Wang et.al. 2311.01455v1 null
2023-11-02 NOIR: Neural Signal Operated Intelligent Robots for Everyday Activities Ruohan Zhang et.al. 2311.01454v1 null
2023-11-02 DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing Vint Lee et.al. 2311.01450v1 null
2023-11-02 UltraLiDAR: Learning Compact Representations for LiDAR Completion and Generation Yuwen Xiong et.al. 2311.01448v1 null
2023-11-02 CADSim: Robust and Scalable in-the-wild 3D Reconstruction for Controllable Sensor Simulation Jingkang Wang et.al. 2311.01447v1 null
2023-11-02 Adv3D: Generating Safety-Critical 3D Objects through Closed-Loop Simulation Jay Sarva et.al. 2311.01446v1 null
2023-11-01 End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation Juan Zuluaga-Gomez et.al. 2311.00697v1 link
2023-11-01 Unleashing the Creative Mind: Language Model As Hierarchical Policy For Improved Exploration on Challenging Problem Solving Zhan Ling et.al. 2311.00694v1 link
2023-11-01 Improving Interpersonal Communication by Simulating Audiences with Language Models Ryan Liu et.al. 2311.00687v1 link
2023-11-01 Deep Learning-Based Classification of Gamma Photon Interactions in Room-Temperature Semiconductor Radiation Detectors Sandeep K. Chaudhuri et.al. 2311.00682v1 null
2023-11-01 Are Large Language Models Reliable Judges? A Study on the Factuality Evaluation Capabilities of LLMs Xue-Yong Fu et.al. 2311.00681v1 null
2023-10-31 Unexpected Improvements to Expected Improvement for Bayesian Optimization Sebastian Ament et.al. 2310.20708v1 null
2023-10-31 What’s In My Big Data? Yanai Elazar et.al. 2310.20707v1 link
2023-10-31 DDAM-PS: Diligent Domain Adaptive Mixer for Person Search Mohammed Khaleed Almansoori et.al. 2310.20706v1 link
2023-10-31 SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction Xinyuan Chen et.al. 2310.20700v1 null
2023-11-01 Bayesian Multistate Bennett Acceptance Ratio Methods Xinqiang Ding et.al. 2310.20699v2 link
2023-10-31 Learning From Mistakes Makes LLM Better Reasoner Shengnan An et.al. 2310.20689v1 link
2023-10-31 Compression with Exact Error Distribution for Federated Learning Mahmoud Hegazy et.al. 2310.20682v1 null
2023-10-30 Variational principles for the hydrodynamics of the classical one-component plasma Daniels Krimans et.al. 2310.19239v1 null
2023-10-30 Building Real-World Meeting Summarization Systems using Large Language Models: A Practical Perspective Md Tahmid Rahman Laskar et.al. 2310.19233v1 null
2023-10-30 Stochastic Configuration Machines: FPGA Implementation Matthew J. Felicetti et.al. 2310.19225v1 null
2023-10-30 CHAMMI: A benchmark for channel-adaptive models in microscopy imaging Zitong Chen et.al. 2310.19224v1 link
2023-10-27 FP8-LM: Training FP8 Large Language Models Houwen Peng et.al. 2310.18313v1 link
2023-10-27 Gen2Sim: Scaling up Robot Learning in Simulation with Generative Models Pushkal Katara et.al. 2310.18308v1 null
2023-10-27 Interactive Motion Planning for Autonomous Vehicles with Joint Optimization Yuxiao Chen et.al. 2310.18301v1 null
2023-10-27 Enhancing the Performance of a Biomimetic Robotic Elbow-and-Forearm System Through Bionics-Inspired Optimization Haosen Yang et.al. 2310.18299v1 null
2023-10-27 Sharp-Edge Diffraction of Laguerre-Gauss Vortex Beams by Elliptic Apertures Riccardo Borghi et.al. 2310.18298v1 null
2023-10-27 Addressing GAN Training Instabilities via Tunable Classification Losses Monica Welfert et.al. 2310.18291v1 null
2023-10-26 Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model Karsten Roth et.al. 2310.17653v1 link
2023-10-26 A Coarse-to-Fine Pseudo-Labeling (C2FPL) Framework for Unsupervised Video Anomaly Detection Anas Al-lahham et.al. 2310.17650v1 link
2023-10-26 6-DoF Stability Field via Diffusion Models Takuma Yoneda et.al. 2310.17649v1 null
2023-10-26 In-Context Learning Dynamics with Random Binary Sequences Eric J. Bigelow et.al. 2310.17639v1 null
2023-10-26 Generative Fractional Diffusion Models Gabriel Nobis et.al. 2310.17638v1 null
2023-10-26 JudgeLM: Fine-tuned Large Language Models are Scalable Judges Lianghui Zhu et.al. 2310.17631v1 link
2023-10-25 SparseDFF: Sparse-View Feature Distillation for One-Shot Dexterous Manipulation Qianxu Wang et.al. 2310.16838v1 null
2023-10-25 Proposal-Contrastive Pretraining for Object Detection from Fewer Data Quentin Bouniot et.al. 2310.16835v1 null
2023-10-25 CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images Aaron Gokaslan et.al. 2310.16825v1 link
2023-10-26 DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior Jingxiang Sun et.al. 2310.16818v2 link
2023-10-25 The intelligent agent model – a fully two-dimensional microscopic traffic flow model Martin Treiber et.al. 2310.16816v1 null
2023-10-24 Synthetic Data as Validation Qixin Hu et.al. 2310.16052v1 null
2023-10-24 EquivAct: SIM(3)-Equivariant Visuomotor Policies beyond Rigid Object Manipulation Jingyun Yang et.al. 2310.16050v1 null
2023-10-24 MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning Zayne Sprague et.al. 2310.16049v1 link
2023-10-24 From Posterior Sampling to Meaningful Diversity in Image Restoration Noa Cohen et.al. 2310.16047v1 null
2023-10-24 Woodpecker: Hallucination Correction for Multimodal Large Language Models Shukang Yin et.al. 2310.16045v1 link
2023-10-25 Stanford-ORB: A Real-World 3D Object Inverse Rendering Benchmark Zhengfei Kuang et.al. 2310.16044v2 link
2023-10-25 WebWISE: Web Interface Control and Sequential Exploration with Large Language Models Heyi Tao et.al. 2310.16042v2 null
2023-10-24 Instruct and Extract: Instruction Tuning for On-Demand Information Extraction Yizhu Jiao et.al. 2310.16040v1 link
2023-10-23 FreeNoise: Tuning-Free Longer Video Diffusion Via Noise Rescheduling Haonan Qiu et.al. 2310.15169v1 link
2023-10-24 Ghost on the Shell: An Expressive Representation of General 3D Shapes Zhen Liu et.al. 2310.15168v2 null
2023-10-23 SAM-Med3D Haoyu Wang et.al. 2310.15161v1 link
2023-10-23 FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models Lihe Yang et.al. 2310.15160v1 link
2023-10-23 Online Detection of AI-Generated Images David C. Epstein et.al. 2310.15150v1 null
2023-10-23 DEsignBench: Exploring and Benchmarking DALL-E 3 for Imagining Visual Design Kevin Lin et.al. 2310.15144v1 link
2023-10-23 SpecTr: Fast Speculative Decoding via Optimal Transport Ziteng Sun et.al. 2310.15141v1 null
2023-10-20 Neural-Base Music Generation for Intelligence Duplication Jacob Galajda et.al. 2310.13691v1 null
2023-10-20 Exploring Linguistic Probes for Morphological Generalization Jordan Kodner et.al. 2310.13686v1 null
2023-10-20 CAPIVARA: Cost-Efficient Approach for Improving Multilingual CLIP Performance on Low-Resource Languages Gabriel Oliveira dos Santos et.al. 2310.13683v1 link
2023-10-20 Optimizing Retrieval-augmented Reader Models via Token Elimination Moshe Berchansky et.al. 2310.13682v1 link
2023-10-20 Information Value: Measuring Utterance Predictability as Distance from Plausible Alternatives Mario Giulianelli et.al. 2310.13676v1 link
2023-10-20 On Synthetic Data for Back Translation Jiahao Xu et.al. 2310.13675v1 link
2023-10-19 HumanTOMATO: Text-aligned Whole-body Motion Generation Shunlin Lu et.al. 2310.12978v1 null
2023-10-19 Training Dynamics of Deep Network Linear Regions Ahmed Imtiaz Humayun et.al. 2310.12977v1 null
2023-10-19 Frozen Transformers in Language Models Are Effective Visual Encoder Layers Ziqi Pang et.al. 2310.12973v1 link
2023-10-19 CCIL: Continuity-based Data Augmentation for Corrective Imitation Learning Liyiming Ke et.al. 2310.12972v1 null
2023-10-19 CLAIR: Evaluating Image Captions with Large Language Models David Chan et.al. 2310.12971v1 null
2023-10-19 Does Your Model Think Like an Engineer? Explainable AI for Bearing Fault Detection with Deep Learning Thomas Decker et.al. 2310.12967v1 null
2023-10-18 Understanding Retrieval Augmentation for Long-Form Question Answering Hung-Ting Chen et.al. 2310.12150v1 null
2023-10-18 Object-aware Inversion and Reassembly for Image Editing Zhen Yang et.al. 2310.12149v1 null
2023-10-18 Simple Mechanisms for Representing, Indexing and Manipulating Concepts Yuanzhi Li et.al. 2310.12143v1 null
2023-10-17 DELIFFAS: Deformable Light Fields for Fast Avatar Synthesis Youngjoong Kwon et.al. 2310.11449v1 null
2023-10-17 Functional Invariants to Watermark Large Transformers Fernandez Pierre et.al. 2310.11446v1 null
2023-10-18 EvalCrafter: Benchmarking and Evaluating Large Video Generation Models Yaofang Liu et.al. 2310.11440v2 link
2023-10-17 Sadness, Anger, or Anxiety: Twitter Users’ Emotional Responses to Toxicity in Public Conversations Ana Aleksandric et.al. 2310.11436v1 null
2023-10-17 An Empirical Study of Translation Hypothesis Ensembling with Large Language Models António Farinhas et.al. 2310.11430v1 link
2023-10-17 Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression Adam Block et.al. 2310.11428v1 null
2023-10-17 A Computational Framework for Solving Wasserstein Lagrangian Flows Kirill Neklyudov et.al. 2310.10649v2 link
2023-10-16 Step-by-Step Remediation of Students’ Mathematical Mistakes Rose E. Wang et.al. 2310.10648v1 link
2023-10-16 A Survey on Video Diffusion Models Zhen Xing et.al. 2310.10647v1 link
2023-10-16 Interactive Task Planning with Language Models Boyi Li et.al. 2310.10645v1 null
2023-10-16 TOSS:High-quality Text-guided Novel View Synthesis from a Single Image Yukai Shi et.al. 2310.10644v1 null
2023-10-16 Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting Zeyu Yang et.al. 2310.10642v1 link
2023-10-16 LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts Hanan Gani et.al. 2310.10640v1 link
2023-10-16 Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion Models Kevin Black et.al. 2310.10639v1 link
2023-10-13 Vision-by-Language for Training-Free Compositional Image Retrieval Shyamgopal Karthik et.al. 2310.09291v1 link
2023-10-13 Disentangled Latent Spaces Facilitate Data-Driven Auxiliary Learning Geri Skenderi et.al. 2310.09278v1 null
2023-10-13 Retro-fallback: retrosynthetic planning in an uncertain world Austin Tripp et.al. 2310.09270v1 null
2023-10-13 Genetic algorithms are strong baselines for molecule generation Austin Tripp et.al. 2310.09267v1 null
2023-10-13 Towards End-to-end 4-Bit Inference on Generative Large Language Models Saleh Ashkboos et.al. 2310.09259v1 link
2023-10-12 Octopus: Embodied Vision-Language Programmer from Environmental Feedback Jingkang Yang et.al. 2310.08588v1 link
2023-10-12 Is Generalized Dynamic Novel View Synthesis from Monocular Videos Possible Today? Xiaoming Zhao et.al. 2310.08587v1 null
2023-10-12 PonderV2: Pave the Way for 3D Foundataion Model with A Universal Pre-training Paradigm Haoyi Zhu et.al. 2310.08586v1 link
2023-10-12 Discovering Fatigued Movements for Virtual Character Animation Noshaba Cheema et.al. 2310.08583v1 null
2023-10-12 Tree-Planner: Efficient Close-loop Task Planning with Large Language Models Mengkang Hu et.al. 2310.08582v1 null
2023-10-12 Universal Visual Decomposer: Long-Horizon Manipulation Made Easy Zichen Zhang et.al. 2310.08581v1 null
2023-10-12 OmniControl: Control Any Joint at Any Time for Human Motion Generation Yiming Xie et.al. 2310.08580v1 link
2023-10-12 HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion Xian Liu et.al. 2310.08579v1 null
2023-10-12 Learning to Act from Actionless Videos through Dense Correspondences Po-Chen Ko et.al. 2310.08576v1 null
2023-10-12 Jigsaw: Supporting Designers in Prototyping Multimodal Applications by Assembling AI Foundation Models David Chuan-En Lin et.al. 2310.08574v1 null
2023-10-11 InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining Boxin Wang et.al. 2310.07713v1 link
2023-10-11 ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models Yingqing He et.al. 2310.07702v1 link
2023-10-11 Knowledge-enhanced Memory Model for Emotional Support Conversation Mengzhao Jia et.al. 2310.07700v1 null
2023-10-11 From Scarcity to Efficiency: Improving CLIP Training via Visual-enriched Captions Zhengfeng Lai et.al. 2310.07699v1 link
2023-10-11 SurroCBM: Concept Bottleneck Surrogate Models for Generative Post-hoc Explanation Bo Pan et.al. 2310.07698v1 null
2023-10-11 ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation Bo Peng et.al. 2310.07697v1 link
2023-10-11 Large-scale photonic computing with nonlinear disordered media Hao Wang et.al. 2310.07690v1 null
2023-10-10 AutoAD II: The Sequel – Who, When, and What in Movie Audio Description Tengda Han et.al. 2310.06838v1 null
2023-10-10 Generating and Evaluating Tests for K-12 Students with Language Model Simulations: A Case Study on Sentence Reading Efficiency Eric Zelikman et.al. 2310.06837v1 null
2023-10-10 What Does Stable Diffusion Know about the 3D Scene? Guanqi Zhan et.al. 2310.06836v1 link
2023-10-10 Teaching Language Models to Hallucinate Less with Synthetic Tasks Erik Jones et.al. 2310.06827v1 null
2023-10-10 Mistral 7B Albert Q. Jiang et.al. 2310.06825v1 link
2023-10-10 The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets Samuel Marks et.al. 2310.06824v1 link
2023-10-09 Grokking as Compression: A Nonlinear Complexity Perspective Ziming Liu et.al. 2310.05918v1 null
2023-10-09 Drivable Avatar Clothing: Faithful Full-Body Telepresence with Dynamic Clothing Driven by Sparse RGB-D Input Donglai Xiang et.al. 2310.05917v1 null
2023-10-09 FireAct: Toward Language Agent Fine-tuning Baian Chen et.al. 2310.05915v1 null
2023-10-09 SALMON: Self-Alignment with Principle-Following Reward Models Zhiqing Sun et.al. 2310.05910v1 link
2023-10-09 Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts Lizhang Chen et.al. 2310.05898v1 null
2023-10-06 BrainSCUBA: Fine-Grained Natural Language Captions of Visual Cortex Selectivity Andrew F. Luo et.al. 2310.04420v1 null
2023-10-06 Functional Interpolation for Relative Positions Improves Long Context Transformers Shanda Li et.al. 2310.04418v1 null
2023-10-09 CIFAR-10-Warehouse: Broad and More Realistic Testbeds in Model Generalization Analysis Xiaoxiao Sun et.al. 2310.04414v2 null
2023-10-06 FedConv: Enhancing Convolutional Neural Networks for Handling Data Heterogeneity in Federated Learning Peiran Xu et.al. 2310.04412v1 link
2023-10-06 RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation Fangyuan Xu et.al. 2310.04408v1 link
2023-10-06 Policy-Gradient Training of Language Models for Ranking Ge Gao et.al. 2310.04407v1 null
2023-10-06 Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models Andy Zhou et.al. 2310.04406v1 link
2023-10-05 ContactGen: Generative Contact Modeling for Grasp Generation Shaowei Liu et.al. 2310.03740v1 null
2023-10-05 Aligning Text-to-Image Diffusion Models with Reward Backpropagation Mihir Prabhudesai et.al. 2310.03739v1 link
2023-10-05 Stylist: Style-Driven Feature Ranking for Robust Novelty Detection Stefan Smeu et.al. 2310.03738v1 link
2023-10-05 Leveraging Unpaired Data for Vision-Language Generative Models via Cycle Consistency Tianhong Li et.al. 2310.03734v1 null
2023-10-05 MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning Ke Wang et.al. 2310.03731v1 link
2023-10-05 Stochastic interpolants with data-dependent couplings Michael S. Albergo et.al. 2310.03725v1 null
2023-10-04 LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving Hao Sha et.al. 2310.03026v1 null
2023-10-04 Retrieval meets Long Context Large Language Models Peng Xu et.al. 2310.03025v1 null
2023-10-04 Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making Jeonghye Kim et.al. 2310.03022v1 null
2023-10-04 Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models Jianglong Ye et.al. 2310.03020v1 null
2023-10-04 Multimodal Question Answering for Unified Information Extraction Yuxuan Sun et.al. 2310.03017v1 link
2023-10-04 Efficient-3DiM: Learning a Generalizable Single-image Novel-view Synthesizer in One Day Yifan Jiang et.al. 2310.03015v1 null
2023-10-04 SemiReward: A General Reward Model for Semi-supervised Learning Siyuan Li et.al. 2310.03013v1 link
2023-10-04 Towards Domain-Specific Features Disentanglement for Domain Generalization Hao Chen et.al. 2310.03007v1 null
2023-10-05 COOLer: Class-Incremental Learning for Appearance-Based Multiple Object Tracking Zhizheng Liu et.al. 2310.03006v2 link
2023-10-03 Generalizable Long-Horizon Manipulations with Large Language Models Haoyu Zhou et.al. 2310.02264v1 null
2023-10-03 MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts Pan Lu et.al. 2310.02255v1 null
2023-10-03 Talk2BEV: Language-enhanced Bird’s-eye View Maps for Autonomous Driving Vikrant Dewangan et.al. 2310.02251v1 null
2023-10-03 Hierarchical Generation of Human-Object Interactions with Diffusion Probabilistic Models Huaijin Pi et.al. 2310.02242v1 null
2023-10-03 MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens Kaizhi Zheng et.al. 2310.02239v1 link
2023-09-29 Efficient Streaming Language Models with Attention Sinks Guangxuan Xiao et.al. 2309.17453v1 link
2023-10-02 L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models Ansong Ni et.al. 2309.17446v2 null
2023-10-02 LLM-grounded Video Diffusion Models Long Lian et.al. 2309.17444v2 null
2023-09-29 CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets Lifan Yuan et.al. 2309.17428v1 link
2023-09-28 Learning to Transform for Generalizable Instance-wise Invariance Utkarsh Singhal et.al. 2309.16672v1 link
2023-09-29 Demystifying CLIP Data Hu Xu et.al. 2309.16671v2 link
2023-09-28 RealFill: Reference-Driven Generation for Authentic Image Completion Luming Tang et.al. 2309.16668v1 null
2023-09-28 DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation Jiaxiang Tang et.al. 2309.16653v1 link
2023-09-27 Exploiting the Signal-Leak Bias in Diffusion Models Martin Nicolas Everaert et.al. 2309.15842v1 null
2023-09-27 OrthoPlanes: A Novel Representation for Better 3D-Awareness of GANs Honglin He et.al. 2309.15830v1 null
2023-09-27 LGMCTS: Language-Guided Monte-Carlo Tree Search for Executable Semantic Object Rearrangement Haonan Chang et.al. 2309.15821v1 null
2023-09-27 Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation David Junhao Zhang et.al. 2309.15818v1 link
2023-09-26 Generating Visual Scenes from Touch Fengyu Yang et.al. 2309.15117v1 null
2023-09-27 InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition Pan Zhang et.al. 2309.15112v2 link
2023-09-26 Doduo: Learning Dense Visual Correspondence from Unsupervised Semantic-Aware Flow Zhenyu Jiang et.al. 2309.15110v1 null
2023-09-26 DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation Zeyu Wang et.al. 2309.15109v1 link
2023-09-26 New solution to Airy’s equation for modeling beams near turning points N. A. Lopez et.al. 2309.15108v1 null
2023-09-25 Extreme Parkour with Legged Robots Xuxin Cheng et.al. 2309.14341v1 null
2023-09-25 Chop & Learn: Recognizing and Generating Object-State Compositions Nirat Saini et.al. 2309.14339v1 null
2023-09-25 UnitedHuman: Harnessing Multi-Source Data for High-Resolution Human Generation Jianglin Fu et.al. 2309.14335v1 link
2023-09-25 Tasks Makyth Models: Machine Learning Assisted Surrogates for Tipping Points Gianluca Fabiani et.al. 2309.14334v1 null
2023-09-25 Innovative Digital Storytelling with AIGC: Exploration and Discussion of Recent Advances Rongzhang Gu et.al. 2309.14329v1 null
2023-09-25 pyParaOcean: A System for Visual Analysis of Ocean Data Toshit Jain et.al. 2309.14328v1 null
2023-09-22 E(2)-Equivariant Graph Planning for Navigation Linfeng Zhao et.al. 2309.13043v1 null
2023-09-22 MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation Jiahao Xie et.al. 2309.13042v1 link
2023-09-22 Robotic Offline RL from Internet Videos via Value-Function Pre-Training Chethan Bhateja et.al. 2309.13041v1 null
2023-09-22 Privacy Assessment on Reconstructed Images: Are Existing Evaluation Metrics Faithful to Human Perception? Xiaoxiao Sun et.al. 2309.13038v1 null
2023-09-22 GELLO: A General, Low-Cost, and Intuitive Teleoperation Framework for Robot Manipulators Philipp Wu et.al. 2309.13037v1 null
2023-09-22 A numerical framework for simulating progressive failure in composite laminates under high-cycle fatigue loading Pieter Hofman et.al. 2309.13030v1 null
2023-09-21 LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent Jianing Yang et.al. 2309.12311v1 null
2023-09-21 Rehearsal: Simulating Conflict to Teach Conflict Resolution Omar Shaikh et.al. 2309.12309v1 null
2023-09-21 Text-Guided Vector Graphics Customization Peiying Zhang et.al. 2309.12302v1 null
2023-09-21 Environment-biased Feature Ranking for Novelty Detection Robustness Stefan Smeu et.al. 2309.12301v1 null
2023-09-21 Reranking for Natural Language Generation from Logical Forms: A Study based on Large Language Models Levon Haroutunian et.al. 2309.12294v1 null
2023-09-20 A Large-scale Dataset for Audio-Language Representation Learning Luoyi Sun et.al. 2309.11500v1 null
2023-09-20 DreamLLM: Synergistic Multimodal Comprehension and Creation Runpei Dong et.al. 2309.11499v1 link
2023-09-20 FreeU: Free Lunch in Diffusion U-Net Chenyang Si et.al. 2309.11497v1 link
2023-09-20 Chain-of-Verification Reduces Hallucination in Large Language Models Shehzaad Dhuliawala et.al. 2309.11495v1 null
2023-09-21 Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning Tianbao Xie et.al. 2309.11489v2 link
2023-09-19 PanopticNeRF-360: Panoramic 3D-to-2D Label Transfer in Urban Scenes Xiao Fu et.al. 2309.10815v1 link
2023-09-19 Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning Tianhua Zhang et.al. 2309.10814v1 link
2023-09-19 PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance Peiqing Yang et.al. 2309.10810v1 link
2023-09-20 AI Foundation Models for Weather and Climate: Applications, Design, and Implementation S. Karthik Mukkavilli et.al. 2309.10808v2 null
2023-09-19 Heuristic Search for Path Finding with Refuelling Anushtup Nandy et.al. 2309.10796v1 null
2023-09-19 Guide Your Agent with Adaptive Multimodal Rewards Changyeon Kim et.al. 2309.10790v1 link
2023-09-18 General In-Hand Object Rotation with Vision and Touch Haozhi Qi et.al. 2309.09979v1 null
2023-09-18 GEDepth: Ground Embedding for Monocular Depth Estimation Xiaodong Yang et.al. 2309.09975v1 link
2023-09-19 MindAgent: Emergent Gaming Interaction Ran Gong et.al. 2309.09971v2 null
2023-09-18 Empirical Study of Mix-based Data Augmentation Methods in Physiological Time Series Data Peikun Guo et.al. 2309.09970v1 link
2023-09-18 Prompt a Robot to Walk with Large Language Models Yen-Jen Wang et.al. 2309.09969v1 link
2023-09-18 Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees Alexia Jolicoeur-Martineau et.al. 2309.09968v1 link
2023-09-15 Robust e-NeRF: NeRF from Sparse & Noisy Events under Non-Uniform Motion Weng Fei Low et.al. 2309.08596v1 link
2023-09-15 Chain-of-Thought Reasoning is a Policy Improvement Operator Hugh Zhang et.al. 2309.08589v1 null
2023-09-15 Robust Frame-to-Frame Camera Rotation Estimation in Crowded Scenes Fabien Delattre et.al. 2309.08588v1 null
2023-09-15 Compositional Foundation Models for Hierarchical Planning Anurag Ajay et.al. 2309.08587v1 null
2023-09-15 Viewpoint Integration and Registration with Vision Language Foundation Model for Image Change Understanding Xiaonan Lu et.al. 2309.08585v1 null
2023-09-15 ICLEF: In-Context Learning with Expert Feedback for Explainable Style Transfer Arkadiy Saakyan et.al. 2309.08583v1 link
2023-09-15 Large-Vocabulary 3D Diffusion Model with Transformer Ziang Cao et.al. 2309.07920v2 null
2023-09-14 Unified Human-Scene Interaction via Prompted Chain-of-Contacts Zeqi Xiao et.al. 2309.07918v1 link
2023-09-14 Looking at words and points with attention: a benchmark for text-to-shape coherence Andrea Amaduzzi et.al. 2309.07917v1 null
2023-09-14 MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning Haozhe Zhao et.al. 2309.07915v1 link
2023-09-14 ALWOD: Active Learning for Weakly-Supervised Object Detection Yuting Wang et.al. 2309.07914v1 link
2023-09-14 Why would you put a flashlight in a dark matter detector? R. Gibbons et.al. 2309.07913v1 null
2023-09-14 TEMPO: Efficient Multi-View Pose Estimation, Tracking, and Forecasting Rohan Choudhury et.al. 2309.07910v1 null
2023-09-14 Physically Plausible Full-Body Hand-Object Interaction Synthesis Jona Braun et.al. 2309.07907v1 null
2023-09-14 Generative Image Dynamics Zhengqi Li et.al. 2309.07906v1 null
2023-09-13 Text-Guided Generation and Editing of Compositional 3D Avatars Hao Zhang et.al. 2309.07125v1 null
2023-09-13 RAIN: Your Language Models Can Align Themselves without Finetuning Yuhui Li et.al. 2309.07124v1 link
2023-09-13 Tree-Structured Shading Decomposition Chen Geng et.al. 2309.07122v1 null
2023-09-13 Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics Haoqin Tu et.al. 2309.07120v1 link
2023-09-13 Weakly-Supervised Multi-Task Learning for Audio-Visual Speaker Verification Anith Selvakumar et.al. 2309.07115v1 null
2023-09-13 Contrastive Deep Encoding Enables Uncertainty-aware Machine-learning-assisted Histopathology Nirhoshan Sivaroopan et.al. 2309.07113v1 null
2023-09-13 Hardening RGB-D Object Recognition Systems against Adversarial Patch Attacks Yang Zheng et.al. 2309.07106v1 null
2023-09-12 Learning Disentangled Avatars with Hybrid 3D Representations Yao Feng et.al. 2309.06441v1 null
2023-09-12 Unveiling the potential of large language models in generating semantic and cross-language clones Palash R. Roy et.al. 2309.06424v1 null
2023-09-12 C4CAM: A Compiler for CAM-based In-memory Accelerators Hamid Farzaneh et.al. 2309.06418v1 null
2023-09-12 Robot Parkour Learning Ziwen Zhuang et.al. 2309.05665v2 null
2023-09-11 Diffusion-Guided Reconstruction of Everyday Hand-Object Interaction Clips Yufei Ye et.al. 2309.05663v1 null
2023-09-11 ViHOPE: Visuotactile In-Hand Object 6D Pose Estimation with Shape Completion Hongyu Li et.al. 2309.05662v1 null
2023-09-11 Hypothesis Search: Inductive Reasoning with Language Models Ruocheng Wang et.al. 2309.05660v1 null
2023-09-11 From Capture to Display: A Survey on Volumetric Video Yili Jin et.al. 2309.05658v1 null
2023-09-11 MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning Xiang Yue et.al. 2309.05653v1 null
2023-09-11 Data efficiency, dimensionality reduction, and the generalized symmetric information bottleneck K. Michael Martini et.al. 2309.05649v1 null
2023-09-08 On the Actionability of Outcome Prediction Lydia T. Liu et.al. 2309.04470v1 null
2023-09-08 Generalized Cross-domain Multi-label Few-shot Learning for Chest X-rays Aroof Aimen et.al. 2309.04462v1 null
2023-09-08 Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models Yangyi Chen et.al. 2309.04461v1 link
2023-09-08 Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning David Yunis et.al. 2309.04459v1 null
2023-09-08 Effect of Electron-Phonon Interactions on Three-Level QD-based Spaser: Linear and Quadratic Potentials Ankit Purohit et.al. 2309.04448v1 null
2023-09-07 ImageBind-LLM: Multi-modality Instruction Tuning Jiaming Han et.al. 2309.03905v1 link
2023-09-07 Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis Jiapeng Zhu et.al. 2309.03904v1 link
2023-09-07 Tracking Anything with Decoupled Video Segmentation Ho Kei Cheng et.al. 2309.03903v1 link
2023-09-07 The Making and Breaking of Camouflage Hala Lamdouar et.al. 2309.03899v1 null
2023-09-07 InstructDiffusion: A Generalist Modeling Interface for Vision Tasks Zigang Geng et.al. 2309.03895v1 null
2023-09-07 DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection Manlin Zhang et.al. 2309.03893v1 null
2023-09-07 ArtiGrasp: Physically Plausible Synthesis of Bi-Manual Dexterous Grasping and Articulation Hui Zhang et.al. 2309.03891v1 null
2023-09-06 My Art My Choice: Adversarial Protection Against Unruly AI Anthony Rhodes et.al. 2309.03198v1 null
2023-09-06 Electrocaloric Response of the Dense Ferroelectric Nanocomposites Anna N. Morozovska et.al. 2309.03187v1 null
2023-09-06 SLiMe: Segment Like Me Aliasghar Khani et.al. 2309.03179v1 link
2023-09-05 ReliTalk: Relightable Talking Portrait Generation from a Single Video Haonan Qiu et.al. 2309.02434v1 link
2023-09-05 Generating Realistic Images from In-the-wild Sounds Taegyeong Lee et.al. 2309.02405v1 null
2023-09-01 OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation Zhening Huang et.al. 2309.00616v1 link
2023-09-01 Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following Ziyu Guo et.al. 2309.00615v1 link
2023-09-01 Iterative Multi-granular Image Editing using Diffusion Models K J Joseph et.al. 2309.00613v1 null
2023-09-01 CityDreamer: Compositional Generative Model of Unbounded 3D Cities Haozhe Xie et.al. 2309.00610v1 link
2023-09-01 Copiloting the Copilots: Fusing Large Language Models with Completion Engines for Automated Program Repair Yuxiang Wei et.al. 2309.00608v1 link
2023-08-31 PointLLM: Empowering Large Language Models to Understand Point Clouds Runsen Xu et.al. 2308.16911v1 link
2023-08-31 StyleInV: A Temporal Style Modulated Inversion Network for Unconditional Video Generation Yuhan Wang et.al. 2308.16909v1 link
2023-08-31 Fine-Grained Cross-View Geo-Localization Using a Correlation-Aware Homography Estimator Xiaolong Wang et.al. 2308.16906v1 link
2023-08-31 InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion Sirui Xu et.al. 2308.16905v1 link
2023-08-31 Transformers as Support Vector Machines Davoud Ataee Tarzanagh et.al. 2308.16898v1 link
2023-09-01 GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields Yanjie Ze et.al. 2308.16891v2 link
2023-08-31 Prediction of Diblock Copolymer Morphology via Machine Learning Hyun Park et.al. 2308.16886v1 null
2023-08-30 Learning Vision-based Pursuit-Evasion Robot Policies Andrea Bajcsy et.al. 2308.16185v1 null
2023-08-30 SAM-Med2D Junlong Cheng et.al. 2308.16184v1 link
2023-08-30 GREC: Generalized Referring Expression Comprehension Shuting He et.al. 2308.16182v1 link
2023-08-30 Framework and Methodology for Verification of a Complex Scientific Simulation Software, Flash-X Akash Dhruv et.al. 2308.16180v1 null
2023-08-30 General Purpose Audio Effect Removal Matthew Rice et.al. 2308.16177v1 link
2023-08-30 Quantifying Uncertainty in Answers from any Language Model via Intrinsic and Extrinsic Confidence Assessment Jiuhai Chen et.al. 2308.16175v1 null
2023-08-29 3D Adversarial Augmentations for Robust Out-of-Domain Predictions Alexander Lehner et.al. 2308.15479v1 null
2023-08-29 A General-Purpose Self-Supervised Model for Computational Pathology Richard J. Chen et.al. 2308.15474v1 null
2023-08-29 Learning Modulated Transformation in GANs Ceyuan Yang et.al. 2308.15472v1 link
2023-08-29 Input margins can predict generalization too Coenraad Mouton et.al. 2308.15466v1 null
2023-08-30 Sharing proofs with predicative theories through universe polymorphic elaboration Thiago Felicissimo et.al. 2308.15465v2 link
2023-08-29 ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer Zachary Horvitz et.al. 2308.15459v1 link
2023-08-29 From SMOTE to Mixup for Deep Imbalanced Classification Wei-Chao Cheng et.al. 2308.15457v1 link
2023-08-28 AI Deception: A Survey of Examples, Risks, and Potential Solutions Peter S. Park et.al. 2308.14752v1 null
2023-08-28 MagicAvatar: Multimodal Avatar Generation and Animation Jianfeng Zhang et.al. 2308.14748v1 null
2023-08-28 CoVR: Learning Composed Video Retrieval from Web Video Captions Lucas Ventura et.al. 2308.14746v1 link
2023-08-28 Advancement on Security Applications of Private Intersection Sum Protocol Yuvaray Athur Raghuvir et.al. 2308.14741v1 null
2023-08-28 Total Selfie: Generating Full-Body Selfies Bowei Chen et.al. 2308.14740v1 null
2023-08-28 Bayesian artificial brain with ChatGPT Renato A. Krohling et.al. 2308.14732v1 null
2023-08-28 Distilled GPT for Source Code Summarization Chia-Yi Su et.al. 2308.14731v1 link
2023-08-25 ChatGPT as Data Augmentation for Compositional Generalization: A Case Study in Open Intent Detection Yihao Fang et.al. 2308.13517v1 link
2023-08-25 Does Asking Clarifying Questions Increases Confidence in Generated Code? On the Communication Skills of Large Language Models Jie JW Wu et.al. 2308.13507v1 null
2023-08-25 A2Q: Accumulator-Aware Quantization with Guaranteed Overflow Avoidance Ian Colbert et.al. 2308.13504v1 null
2023-08-25 Attending Generalizability in Course of Deep Fake Detection by Exploring Multi-task Learning Pranav Balaji et.al. 2308.13503v1 null
2023-08-24 ROAM: Robust and Object-aware Motion Generation using Neural Pose Descriptors Wanyue Zhang et.al. 2308.12969v1 null
2023-08-24 Dense Text-to-Image Generation with Attention Modulation Yunji Kim et.al. 2308.12964v1 link
2023-08-24 MapPrior: Bird’s-Eye View Map Layout Estimation with Generative Models Xiyue Zhu et.al. 2308.12963v1 null
2023-08-24 Motion-Guided Masking for Spatiotemporal Representation Learning David Fan et.al. 2308.12962v1 null
2023-08-24 Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks Xiangyang Zhu et.al. 2308.12961v1 link
2023-08-24 Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment Sheng Zhang et.al. 2308.12960v1 link
2023-08-24 Semi-analytical Framework for Modeling Strong Coupling of Quantum Emitters in Electromagnetic Resonators Mohammad Abutoama et.al. 2308.12957v1 null
2023-08-24 A new framework for global data regulation Ellie Graeden et.al. 2308.12955v1 null
2023-08-24 BridgeData V2: A Dataset for Robot Learning at Scale Homer Walke et.al. 2308.12952v1 link
2023-08-24 Label Budget Allocation in Multi-Task Learning Ximeng Sun et.al. 2308.12949v1 null
2023-08-23 CHORUS: Learning Canonicalized 3D Human-Object Spatial Relations from Unbounded Synthesized Images Sookwan Han et.al. 2308.12288v1 null
2023-08-23 Devising and Detecting Phishing: large language models vs. Smaller Human Models Fredrik Heiding et.al. 2308.12287v1 null
2023-08-23 On-Manifold Projected Gradient Descent Aaron Mahler et.al. 2308.12279v1 null
2023-08-24 A Model for Integrating Generative AI into Course Content Development Ethan Dickey et.al. 2308.12276v2 null
2023-08-23 Spatial clustering of temporal energy profiles with empirical orthogonal functions and max-p regionalization Claire Halloran et.al. 2308.12274v1 null
2023-08-23 Simple is Better and Large is Not Enough: Towards Ensembling of Foundational Language Models Nancy Tyagi et.al. 2308.12272v1 null
2023-08-23 A Generative Approach for Image Registration of Visible-Thermal (VT) Cancer Faces Catherine Ordun et.al. 2308.12271v1 null
2023-08-23 Language Reward Modulation for Pretraining Reinforcement Learning Ademi Adeniji et.al. 2308.12270v1 link
2023-08-22 GRIP: Generating Interaction Poses Using Latent Consistency and Spatial Cues Omid Taheri et.al. 2308.11617v1 null
2023-08-22 StoryBench: A Multifaceted Benchmark for Continuous Story Visualization Emanuele Bugliarello et.al. 2308.11606v1 link
2023-08-22 GOPro: Generate and Optimize Prompts in CLIP using Self-Supervised Learning Mainak Singha et.al. 2308.11605v1 null
2023-08-22 Towards Universal Interaction for Extended Reality Pascal Knierim et.al. 2308.11600v1 null
2023-08-22 Theory of Transverse Mode Instability in Fiber Amplifiers with Multimode Excitations Kabish Wisal et.al. 2308.11599v1 null
2023-08-22 Vision-Based Intelligent Robot Grasping Using Sparse Neural Network Priya Shukla et.al. 2308.11590v1 null
2023-08-21 Structured World Models from Human Videos Russell Mendonca et.al. 2308.10901v1 null
2023-08-21 TADA! Text to Animatable Digital Avatars Tingting Liao et.al. 2308.10899v1 null
2023-08-21 Few-Shot Physically-Aware Articulated Mesh Generation via Hierarchical Deformation Xueyi Liu et.al. 2308.10898v1 link
2023-08-21 Can Language Models Learn to Listen? Evonne Ng et.al. 2308.10897v1 null
2023-08-21 Differentiable Shadow Mapping for Efficient Inverse Graphics Markus Worchel et.al. 2308.10896v1 link
2023-08-21 Proton-Boron Fusion Yield Increased by Orders of Magnitude with Foam Targets Wen-Qing Wei et.al. 2308.10878v1 null
2023-08-21 Analyzing Transformer Dynamics as Movement through Embedding Space Sumeet S. Singh et.al. 2308.10874v1 null
2023-08-18 HumanLiff: Layer-wise 3D Human Generation with Diffusion Model Shoukang Hu et.al. 2308.09712v1 null
2023-08-18 Robust Monocular Depth Estimation under Challenging Conditions Stefano Gasperini et.al. 2308.09711v1 null
2023-08-18 SimDA: Simple Diffusion Adapter for Efficient Video Generation Zhen Xing et.al. 2308.09710v1 null
2023-08-18 Training with Product Digital Twins for AutoRetail Checkout Yue Yao et.al. 2308.09708v1 link
2023-08-18 Guide3D: Create 3D Avatars from Text and Image Guidance Yukang Cao et.al. 2308.09705v1 null
2023-08-18 Counting and Sampling Labeled Chordal Graphs in Polynomial Time Ursula Hebert-Johnson et.al. 2308.09703v1 null
2023-08-16 TeCH: Text-guided Reconstruction of Lifelike Clothed Humans Yangyi Huang et.al. 2308.08545v1 link
2023-08-16 InsightMapper: A Closer Look at Inner-instance Information for Vectorized High-Definition Mapping Zhenhua Xu et.al. 2308.08543v1 null
2023-08-15 Enumerating Tarski fixed points on lattices of binary relations Julian Müller et.al. 2308.07923v1 null
2023-08-15 Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification Aojun Zhou et.al. 2308.07921v1 null
2023-08-15 The Regular Expression Inference Challenge Mojtaba Valizadeh et.al. 2308.07899v1 null
2023-08-15 A Foundation LAnguage-Image model of the Retina (FLAIR): Encoding expert knowledge in text supervision Julio Silva-Rodriguez et.al. 2308.07898v1 link
2023-08-14 Jurassic World Remake: Bringing Ancient Fossils Back to Life via Zero-Shot Long Image-to-Image Translation Alexander Martin et.al. 2308.07316v1 link
2023-08-14 Reinforcing Security and Usability of Crypto-Wallet with Post-Quantum Cryptography and Zero-Knowledge Proof Yathin Kethepalli et.al. 2308.07309v1 null
2023-08-15 LLM Self Defense: By Self Examination, LLMs Know They Are Being Tricked Alec Helbling et.al. 2308.07308v2 null
2023-08-14 Extend Wave Function Collapse to Large-Scale Content Generation Yuhe Nie et.al. 2308.07307v1 null
2023-08-14 Neural Authorship Attribution: Stylometric Analysis on Large Language Models Tharindu Kumarage et.al. 2308.07305v1 link
2023-08-14 DiffSED: Sound Event Detection with Denoising Diffusion Swapnil Bhosale et.al. 2308.07293v1 null
2023-08-11 Foundation Model is Efficient Multimodal Multitask Model Selector Fanqing Meng et.al. 2308.06262v1 link
2023-08-11 Enhancing Network Management Using Code Generated by Large Language Models Sathiya Kumaran Mani et.al. 2308.06261v1 link
2023-08-11 Self-Alignment with Instruction Backtranslation Xian Li et.al. 2308.06259v1 null
2023-08-11 NEMA NU 2-2018 performance evaluation of a new generation digital 32-cm axial field-of-view Omni Legend PET-CT Rhodri Lyn Smith et.al. 2308.06255v1 null
2023-08-11 Fundamental Limits on Subwavelength Range Resolution Andrew N. Jordan et.al. 2308.06252v1 null
2023-08-11 ARGUS: Visualization of AI-Assisted Task Guidance in AR Sonia Castelo et.al. 2308.06246v1 null
2023-08-10 PlankAssembly: Robust 3D Reconstruction from Three Orthographic Views with Learnt Shape Programs Wentao Hu et.al. 2308.05744v1 link
2023-08-10 Neural Progressive Meshes Yun-Chun Chen et.al. 2308.05741v1 null
2023-08-10 AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining Haohe Liu et.al. 2308.05734v1 link
2023-08-10 FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models Guangkai Xu et.al. 2308.05733v1 null
2023-08-09 Scene-Generalizable Interactive Segmentation of Radiance Fields Songlin Tang et.al. 2308.05104v1 null
2023-08-09 LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation Leigang Qu et.al. 2308.05095v1 null
2023-08-08 SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore Sewon Min et.al. 2308.04430v1 link
2023-08-08 A Deep-Learning Method Using Auto-encoder and Generative Adversarial Network for Anomaly Detection on Ancient Stone Stele Surfaces Yikun Liu et.al. 2308.04426v1 null
2023-08-08 Density-contrast induced inertial forces on particles in oscillatory flows Siddhansh Agarwal et.al. 2308.04423v1 null
2023-08-08 Near-field 6G Networks: Why Mobile Terahertz Communications MUST Operate in the Near Field Vitaly Petrov et.al. 2308.04418v1 null
2023-08-08 DiffCR: A Fast Conditional Diffusion Framework for Cloud Removal from Optical Satellite Images Xuechao Zou et.al. 2308.04417v1 link
2023-08-07 FSD V2: Improving Fully Sparse 3D Object Detection with Virtual Voxels Lue Fan et.al. 2308.03755v1 link
2023-08-07 Mask Frozen-DETR: High Quality Instance Segmentation with One GPU Zhanhao Liang et.al. 2308.03747v1 null
2023-08-07 A Cost Analysis of Generative Language Models and Influence Operations Micah Musser et.al. 2308.03740v1 link
2023-08-07 Labeling without Seeing? Blind Annotation for Privacy-Preserving Entity Resolution Yixiang Yao et.al. 2308.03734v1 null
2023-08-07 SurvBeX: An explanation method of the machine learning survival models based on the Beran estimator Lev V. Utkin et.al. 2308.03730v1 link
2023-08-04 Recovering non-Maxwellian particle velocity distribution functions from collective Thomson-scattered spectra Bryan C. Foo et.al. 2308.02488v1 null
2023-08-04 Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP Qihang Yu et.al. 2308.02487v1 link
2023-08-04 On the Inherent Anonymity of Gossiping Rachid Guerraoui et.al. 2308.02477v1 null
2023-08-04 Towards Generalist Foundation Model for Radiology Chaoyi Wu et.al. 2308.02463v1 link
2023-08-04 Getting the Ball Rolling: Learning a Dexterous Policy for a Biomimetic Tendon-Driven Hand with Rolling Contact Joints Yasunori Toshimitsu et.al. 2308.02453v1 link
2023-08-03 The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World Weiyun Wang et.al. 2308.01907v1 link
2023-08-03 Revisiting Deformable Convolution for Depth Completion Xinglong Sun et.al. 2308.01905v1 link
2023-08-03 UniSim: A Neural Closed-Loop Sensor Simulator Ze Yang et.al. 2308.01898v1 null
2023-08-03 Strategies for optimizing plasmonic grating couplers with topology-based inverse design Michael Efseaff et.al. 2308.01893v1 null
2023-08-02 ELIXR: Towards a general purpose X-ray artificial intelligence system through alignment of large language models and radiology vision encoders Shawn Xu et.al. 2308.01317v1 null
2023-08-02 Patched Denoising Diffusion Models For High-Resolution Image Synthesis Zheng Ding et.al. 2308.01316v1 link
2023-08-02 More Context, Less Distraction: Visual Classification by Inferring and Conditioning on Contextual Attributes Bang An et.al. 2308.01313v1 link
2023-08-02 TEASMA: A Practical Approach for the Test Assessment of Deep Neural Networks using Mutation Analysis Amin Abbasishahkoo et.al. 2308.01311v1 null
2023-08-02 Revisiting DETR Pre-training for Object Detection Yan Ma et.al. 2308.01300v1 null
2023-08-01 LISA: Reasoning Segmentation via Large Language Model Xin Lai et.al. 2308.00692v1 link
2023-08-01 AnyLoc: Towards Universal Visual Place Recognition Nikhil Keetha et.al. 2308.00688v1 link
2023-08-01 Learning from Hypervectors: A Survey on Hypervector Encoding Sercan Aygun et.al. 2308.00685v1 null
2023-07-31 Conformal PID Control for Time Series Prediction Anastasios N. Angelopoulos et.al. 2307.16895v1 link
2023-07-31 A reduced order model for geometrically parameterized two-scale simulations of elasto-plastic microstructures under large deformations Theron Guo et.al. 2307.16894v1 null
2023-07-31 LEONARDO: A Pan-European Pre-Exascale Supercomputer for HPC and AI Applications Matteo Turisini et.al. 2307.16885v1 null
2023-07-31 HAGRID: A Human-LLM Collaborative Dataset for Generative Information-Seeking with Attribution Ehsan Kamalloo et.al. 2307.16883v1 link
2023-07-31 Image Synthesis under Limited Data: A Survey and Taxonomy Mengping Yang et.al. 2307.16879v1 link
2023-07-31 Revisiting the Parameter Efficiency of Adapters from the Perspective of Precision Redundancy Shibo Jie et.al. 2307.16867v1 link
2023-07-28 Uncertainty in Natural Language Generation: From Theory to Applications Joris Baan et.al. 2307.15703v1 null
2023-07-28 The Strong Maximum Circulation Algorithm: A New Method for Aggregating Preference Rankings Nathan Atkinson et.al. 2307.15702v1 null
2023-07-31 MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking Ruopeng Gao et.al. 2307.15700v2 link
2023-07-28 PatchMixer: Rethinking network design to boost generalization for 3D point cloud understanding Davide Boscaini et.al. 2307.15692v1 link
2023-07-28 Benchmarking Offline Reinforcement Learning on Real-Robot Hardware Nico Gürtler et.al. 2307.15690v1 link
2023-07-27 PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking Yang Zheng et.al. 2307.15055v1 link
2023-07-27 A Geometric Notion of Causal Probing Clément Guerner et.al. 2307.15054v1 null
2023-07-27 A Transformer-based Approach for Arabic Offline Handwritten Text Recognition Saleh Momeni et.al. 2307.15045v1 null
2023-07-27 Universal and Transferable Adversarial Attacks on Aligned Language Models Andy Zou et.al. 2307.15043v1 link
2023-07-27 3-Coloring $C_4$ or $C_3$ -free Diameter Two Graphs Tereza Klimošová et.al. 2307.15036v1 null
2023-07-26 WavJourney: Compositional Audio Creation with Large Language Models Xubo Liu et.al. 2307.14335v1 link
2023-07-26 Towards Generalist Biomedical AI Tao Tu et.al. 2307.14334v1 null
2023-07-26 Waypoint-Based Imitation Learning for Robotic Manipulation Lucy Xiaoyang Shi et.al. 2307.14326v1 null
2023-07-25 Benchmarking and Analyzing Generative Data for Visual Recognition Bo Li et.al. 2307.13697v1 null
2023-07-25 A Compact DAG for Storing and Searching Maximal Common Subsequences Alessio Conte et.al. 2307.13695v1 null
2023-07-25 A Comprehensive Review of Recent Research Trends on UAVs Kaled Telli et.al. 2307.13691v1 null
2023-07-25 Single reference treatment of strongly correlated H $4$ and H${10}$ isomers with Richardson-Gaudin states Paul A. Johnson et.al. 2307.13690v1 null
2023-07-25 All-optical GeV electron bunch generation in a laser-plasma accelerator via truncated-channel injection A. Picksley et.al. 2307.13689v1 null
2023-07-25 The Visual Language of Fabrics Valentin Deschaintre et.al. 2307.13681v1 null
2023-07-25 High Probability Analysis for Non-Convex Stochastic Optimization with Clipping Shaojie Li et.al. 2307.13680v1 null
2023-07-24 A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models Jindong Gu et.al. 2307.12980v1 link
2023-07-24 Evaluating the Ripple Effects of Knowledge Editing in Language Models Roi Cohen et.al. 2307.12976v1 link
2023-07-24 Volcanic ash delimitation using Artificial Intelligence based on Pix2Pix Christian Carrillo et.al. 2307.12970v1 null
2023-07-24 Aligning Large Language Models with Human: A Survey Yufei Wang et.al. 2307.12966v1 link
2023-07-24 RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment Kevin Yang et.al. 2307.12950v1 link
2023-07-24 Boosting Punctuation Restoration with Data Generation and Reinforcement Learning Viet Dac Lai et.al. 2307.12949v1 link
2023-07-21 Advancing Ad Auction Realism: Practical Insights & Modeling Implications Ming Chen et.al. 2307.11732v1 null
2023-07-21 OUTFOX: LLM-generated Essay Detection through In-context Learning with Adversarially Generated Examples Ryuto Koike et.al. 2307.11729v1 link
2023-07-21 Benchmark datasets for biomedical knowledge graphs with negative statements Rita T. Sousa et.al. 2307.11719v1 null
2023-07-20 L-Eval: Instituting Standardized Evaluation for Long Context Language Models Chenxin An et.al. 2307.11088v1 link
2023-07-20 AlignDet: Aligning Pre-training and Fine-tuning in Object Detection Ming Li et.al. 2307.11077v1 link
2023-07-20 OBJECT 3DIT: Language-guided 3D-aware Image Editing Oscar Michel et.al. 2307.11073v1 null
2023-07-19 Adversarial Latent Autoencoder with Self-Attention for Structural Image Synthesis Jiajie Fan et.al. 2307.10166v1 null
2023-07-19 Rethinking Backdoor Attacks Alaa Khaddaj et.al. 2307.10163v1 null
2023-07-19 Robust Driving Policy Learning with Guided Meta Reinforcement Learning Kanghoon Lee et.al. 2307.10160v1 null
2023-07-19 FABRIC: Personalizing Diffusion Models with Iterative Feedback Dimitri von Rütte et.al. 2307.10159v1 link
2023-07-19 Contact-aware Shaping and Maintenance of Deformable Linear Objects With Fixtures Kejia Chen et.al. 2307.10153v1 null
2023-07-18 Forecasting the steam mass flow in a powerplant using the parallel hybrid network Andrii Kurkin et.al. 2307.09483v1 null
2023-07-18 AnyDoor: Zero-shot Object-level Image Customization Xi Chen et.al. 2307.09481v1 link
2023-07-18 ChatSpot: Bootstrapping Multimodal LLMs via Precise Referring Instruction Tuning Liang Zhao et.al. 2307.09474v1 null
2023-07-18 Optimal Vehicle Trajectory Planning for Static Obstacle Avoidance using Nonlinear Optimization Yajia Zhang et.al. 2307.09466v1 null
2023-07-19 Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla Tom Lieberum et.al. 2307.09458v2 null
2023-07-19 A comparative analysis of SRGAN models Fatemeh Rezapoor Nikroo et.al. 2307.09456v2 null
2023-07-18 Solving Knapsack with Small Items via L0-Proximity Ce Jin et.al. 2307.09454v1 null
2023-07-17 Diffusion Models Beat GANs on Image Classification Soumik Mukhopadhyay et.al. 2307.08702v1 null
2023-07-17 AlpaGasus: Training A Better Alpaca with Fewer Data Lichang Chen et.al. 2307.08701v1 link
2023-07-17 Fast model inference and training on-board of Satellites Vít Růžička et.al. 2307.08700v1 link
2023-07-17 Pair then Relation: Pair-Net for Panoptic Scene Graph Generation Jinghao Wang et.al. 2307.08699v1 link
2023-07-17 Flow Matching in Latent Space Quan Dao et.al. 2307.08698v1 link
2023-07-17 FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning Tri Dao et.al. 2307.08691v1 link
2023-07-17 COLLIE: Systematic Construction of Constrained Text Generation Tasks Shunyu Yao et.al. 2307.08689v1 link
2023-07-14 NIFTY: Neural Object Interaction Fields for Guided Human Motion Synthesis Nilesh Kulkarni et.al. 2307.07511v1 null
2023-07-14 A Poisson Decomposition for Information and the Information-Event Diagram Cheuk Ting Li et.al. 2307.07506v1 null
2023-07-14 Exhaustive Generation of Linear Orthogonal Cellular Automata Enrico Formenti et.al. 2307.07505v1 null
2023-07-14 TALL: Thumbnail Layout for Deepfake Video Detection Yuting Xu et.al. 2307.07494v1 link
2023-07-14 BehAVExplor: Behavior Diversity Guided Testing for Autonomous Driving Systems Mingfei Cheng et.al. 2307.07493v1 null
2023-07-14 PseudoCal: A Source-Free Approach to Unsupervised Uncertainty Calibration in Domain Adaptation Dapeng Hu et.al. 2307.07489v1 null
2023-07-13 HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models Nataniel Ruiz et.al. 2307.06949v1 null
2023-07-13 Self-regulating Prompts: Foundational Model Adaptation without Forgetting Muhammad Uzair Khattak et.al. 2307.06948v1 link
2023-07-13 In-context Autoencoder for Context Compression in a Large Language Model Tao Ge et.al. 2307.06945v1 link
2023-07-13 InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation Yi Wang et.al. 2307.06942v1 link
2023-07-13 Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation Yingqing He et.al. 2307.06940v1 link
2023-07-12 Diagnosis, Feedback, Adaptation: A Human-in-the-Loop Framework for Test-Time Policy Adaptation Andi Peng et.al. 2307.06333v1 null
2023-07-12 Deep Learning of Crystalline Defects from TEM images: A Solution for the Problem of “Never Enough Training Data” Kishan Govind et.al. 2307.06322v1 null
2023-07-12 Facial Reenactment Through a Personalized Generator Ariel Elazary et.al. 2307.06307v1 null
2023-07-12 Locally Adaptive Federated Learning via Stochastic Polyak Stepsizes Sohom Mukherjee et.al. 2307.06306v1 link
2023-07-11 Scale Alone Does not Improve Mechanistic Interpretability in Vision Models Roland S. Zimmermann et.al. 2307.05471v1 null
2023-07-12 My3DGen: Building Lightweight Personalized 3D Generative Model Luchao Qi et.al. 2307.05468v2 null
2023-07-11 EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone Shraman Pramanick et.al. 2307.05463v1 link
2023-07-11 Efficient 3D Articulated Human Generation with Layered Surface Volumes Yinghao Xu et.al. 2307.05462v1 null
2023-07-10 Semantic-SAM: Segment and Recognize Anything at Any Granularity Feng Li et.al. 2307.04767v1 link
2023-07-10 Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos Sagnik Majumder et.al. 2307.04760v1 null
2023-07-10 Information decomposition to identify relevant variation in complex systems with machine learning Kieran A. Murphy et.al. 2307.04755v1 link
2023-07-10 Shelving, Stacking, Hanging: Relational Pose Diffusion for Multi-modal Rearrangement Anthony Simeonov et.al. 2307.04751v1 null
2023-07-10 Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback Jaskirat Singh et.al. 2307.04749v1 null
2023-07-07 On the Efficacy of Sampling Adapters Clara Meister et.al. 2307.03749v1 link
2023-07-07 Comparing Traditional and LLM-based Search for Consumer Choice: A Randomized Experiment Sofia Eleni Spatharioti et.al. 2307.03744v1 null
2023-07-07 QIGen: Generating Efficient Kernels for Quantized Inference on Large Language Models Tommaso Pegolotti et.al. 2307.03738v1 link
2023-07-06 Simulating Nelsonian Quantum Field Theory Andrea Carosso et.al. 2307.03188v1 null
2023-07-06 Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers Yuan Gong et.al. 2307.03183v1 link
2023-07-06 Markov Persuasion Processes with Endogenous Agent Beliefs Krishnamurthy Iyer et.al. 2307.03181v1 null
2023-07-07 IPO-LDM: Depth-aided 360-degree Indoor RGB Panorama Outpainting via Latent Diffusion Model Tianhao Wu et.al. 2307.03177v2 null
2023-07-06 Push Past Green: Learning to Look Behind Plant Foliage by Moving It Xiaoyu Zhang et.al. 2307.03175v1 null
2023-07-06 Risk-Averse Trajectory Optimization via Sample Average Approximation Thomas Lew et.al. 2307.03167v1 link
2023-07-06 VideoGLUE: Video General Understanding Evaluation of Foundation Models Liangzhe Yuan et.al. 2307.03166v1 link
2023-07-05 LongNet: Scaling Transformers to 1,000,000,000 Tokens Jiayu Ding et.al. 2307.02486v1 link
2023-07-05 Elastic Decision Transformer Yueh-Hua Wu et.al. 2307.02484v1 link
2023-07-05 Jailbroken: How Does LLM Safety Training Fail? Alexander Wei et.al. 2307.02483v1 null
2023-07-05 Reasoning or Reciting? Exploring the Capabilities and Limitations of Language Models Through Counterfactual Tasks Zhaofeng Wu et.al. 2307.02477v1 link
2023-07-05 The Calissons Puzzle Jean-Marie Favreau et.al. 2307.02475v1 null
2023-07-06 Deductive Additivity for Planning of Natural Language Proofs Zayne Sprague et.al. 2307.02472v2 link
2023-07-05 What Matters in Training a GPT4-Style Language Model with Multimodal Inputs? Yan Zeng et.al. 2307.02469v1 null
2023-07-03 Real-time Monocular Full-body Capture in World Space via Sequential Proxy-to-Motion Learning Yuxiang Zhang et.al. 2307.01200v1 null
2023-07-03 NeuBTF: Neural fields for BTF encoding and transfer Carlos Rodriguez-Pardo et.al. 2307.01199v1 null
2023-07-03 Improved sampling via learned diffusions Lorenz Richter et.al. 2307.01198v1 null
2023-07-03 Segment Anything Meets Point Tracking Frano Rajič et.al. 2307.01197v1 link
2023-07-03 Squeezing Large-Scale Diffusion Models for Mobile Jiwoong Choi et.al. 2307.01193v1 null
2023-07-03 SAMAug: Point Prompt Augmentation for Segment Anything Model Haixing Dai et.al. 2307.01187v1 link
2023-07-03 Continuously Red-Shift and Blue-Shift Wavelength-Tuneable, Narrowband, High Harmonics in the EUV - X-ray Regime for Resonance Imaging and Spectroscopies Dimitar Popmintchev et.al. 2307.01182v1 null
2023-06-30 Hardwiring ViT Patch Selectivity into CNNs using Patch Mixing Ariel N. Lee et.al. 2306.17848v1 null
2023-06-30 Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors Guocheng Qian et.al. 2306.17843v1 link
2023-07-03 SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs Lijun Yu et.al. 2306.17842v2 link
2023-07-03 Statler: State-Maintaining Language Models for Embodied Reasoning Takuma Yoneda et.al. 2306.17840v2 null
2023-06-30 Federated Ensemble YOLOv5 - A Better Generalized Object Detection Algorithm Vinit Hegiste et.al. 2306.17829v1 null
2023-06-30 Understanding Unfairness via Training Concept Influence Yuanshun Yao et.al. 2306.17828v1 null
2023-06-29 An Efficient General-Purpose Modular Vision Model via Multi-Task Heterogeneous Training Zitian Chen et.al. 2306.17165v1 null
2023-06-30 Generative AI for Programming Education: Benchmarking ChatGPT, GPT-4, and Human Tutors Tung Phung et.al. 2306.17156v2 null
2023-06-29 Generate Anything Anywhere in Any Scene Yuheng Li et.al. 2306.17154v1 null
2023-06-28 MultiZoo & MultiBench: A Standardized Toolkit for Multimodal Deep Learning Paul Pu Liang et.al. 2306.16413v1 link
2023-06-29 Even order contributions to relative energies vanish for antisymmetric perturbations O. Anatole von Lilienfeld et.al. 2306.16409v2 null
2023-06-27 Physion++: Evaluating Physical Scene Understanding that Requires Online Inference of Different Physical Properties Hsiao-Yu Tung et.al. 2306.15668v1 null
2023-06-28 PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment Jianyuan Wang et.al. 2306.15667v2 null
2023-06-27 SparseOptimizer: Sparsify Language Models through Moreau-Yosida Regularization and Accelerate through Compiler Co-design Fu-Ming Guo et.al. 2306.15656v1 null
2023-06-27 Optimal Area-Sensitive Bounds for Polytope Approximation Sunil Arya et.al. 2306.15648v1 null
2023-06-26 FunQA: Towards Surprising Video Comprehension Binzhu Xie et.al. 2306.14899v1 link
2023-06-27 InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback John Yang et.al. 2306.14898v2 link
2023-06-26 Supervised Pretraining Can Learn In-Context Reinforcement Learning Jonathan N. Lee et.al. 2306.14892v1 null
2023-06-26 Value of Information in Games with Multiple Strategic Information Providers Raj Kiriti Velicheti et.al. 2306.14886v1 null
2023-06-26 Restart Sampling for Improving Generative Processes Yilun Xu et.al. 2306.14878v1 link
2023-06-26 Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits Yuwei Luo et.al. 2306.14872v1 null
2023-06-26 Composing Parameter-Efficient Modules with Arithmetic Operations Jinghan Zhang et.al. 2306.14870v1 link
2023-06-23 GKD: Generalized Knowledge Distillation for Auto-regressive Sequence Models Rishabh Agarwal et.al. 2306.13649v1 null
2023-06-23 Offline Skill Graph (OSG): A Framework for Learning and Planning using Offline Reinforcement Learning Skills Ben-ya Halevy et.al. 2306.13630v1 null
2023-06-22 Evading Forensic Classifiers with Attribute-Conditioned Adversarial Faces Fahad Shamshad et.al. 2306.13091v1 link
2023-06-22 PromptIR: Prompting for All-in-One Blind Image Restoration Vaishnav Potlapalli et.al. 2306.13090v1 link
2023-06-22 Improved Signal Detection for Ambient Backscatter Communications S. Zargari et.al. 2306.13083v1 null
2023-06-21 VisoGender: A dataset for benchmarking gender bias in image-text pronoun resolution Siobhan Mackenzie Hall et.al. 2306.12424v1 link
2023-06-21 Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase Qiuyu Wang et.al. 2306.12423v1 link
2023-06-21 LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models Shizhe Diao et.al. 2306.12420v1 link
2023-06-21 Coqlex: Generating Formally Verified Lexers Wendlasida Ouedraogo et.al. 2306.12411v1 null
2023-06-20 Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning Huiguo He et.al. 2306.11731v1 null
2023-06-20 Dense Video Object Captioning from Disjoint Supervision Xingyi Zhou et.al. 2306.11729v1 link
2023-06-20 Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision Ayush Tewari et.al. 2306.11719v1 null
2023-06-20 Multi-Fidelity Active Learning with GFlowNets Alex Hernandez-Garcia et.al. 2306.11715v1 link
2023-06-20 Data-Driven but Privacy-Conscious: Pedestrian Dataset De-identification via Full-Body Person Synthesis Maxim Maximov et.al. 2306.11710v1 null
2023-06-16 Just One Byte (per gradient): A Note on Low-Bandwidth Decentralized Language Model Finetuning Using Shared Randomness Eric Zelikman et.al. 2306.10015v1 link
2023-06-20 CLIP2Protect: Protecting Facial Privacy using Text-Guided Makeup via Adversarial Latent Search Fahad Shamshad et.al. 2306.10008v2 link
2023-06-16 C2F2NeUS: Cascade Cost Frustum Fusion for High Fidelity and Generalizable Neural Surface Reconstruction Luoyuan Xu et.al. 2306.10003v1 null
2023-06-16 SLACK: Stable Learning of Augmentations with Cold-start and KL regularization Juliette Marrie et.al. 2306.09998v1 null
2023-06-16 Fairness in Preference-based Reinforcement Learning Umer Siddique et.al. 2306.09995v1 null
2023-06-16 Rosetta Neurons: Mining the Common Units in a Model Zoo Amil Dravid et.al. 2306.09346v2 null
2023-06-15 Evaluating Data Attribution for Text-to-Image Models Sheng-Yu Wang et.al. 2306.09345v1 link
2023-06-15 DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data Stephanie Fu et.al. 2306.09344v1 link
2023-06-15 Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis Xiaoshi Wu et.al. 2306.09341v1 link
2023-06-15 Span-Selective Linear Attention Transformers for Effective and Robust Schema-Guided Dialogue State Tracking Björn Bebensee et.al. 2306.09340v1 null
2023-06-15 From BERT to GPT-3 Codex: Harnessing the Potential of Very Large Language Models for Data Management Immanuel Trummer et.al. 2306.09339v1 null
2023-06-15 Generative Proxemics: A Prior for 3D Social Interaction from Images Lea Müller et.al. 2306.09337v1 link
2023-06-15 Fit Like You Sample: Sample-Efficient Generalized Score Matching from Fast Mixing Markov Chains Yilong Qin et.al. 2306.09332v1 null
2023-06-15 ArtFusion: Arbitrary Style Transfer using Dual Conditional Latent Diffusion Models Dar-Yen Chen et.al. 2306.09330v1 link
2023-06-13 XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models Omkar Thawkar et.al. 2306.07971v1 link
2023-06-13 GeneCIS: A Benchmark for General Conditional Image Similarity Sagar Vaze et.al. 2306.07969v1 null
2023-06-13 One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning Arnav Chavan et.al. 2306.07967v1 link
2023-06-13 Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation Shuai Yang et.al. 2306.07954v1 null
2023-06-12 Waffling around for Performance: Visual Classification with Random Words and Broad Concepts Karsten Roth et.al. 2306.07282v1 link
2023-06-12 Controlling Text-to-Image Diffusion by Orthogonal Finetuning Zeju Qiu et.al. 2306.07280v1 null
2023-06-12 Scalable 3D Captioning with Pretrained Models Tiange Luo et.al. 2306.07279v1 link
2023-06-12 Mathematical conjecture generation using machine intelligence Challenger Mishra et.al. 2306.07277v1 null
2023-06-12 Operator Learning with Neural Fields: Tackling PDEs on General Geometries Louis Serrano et.al. 2306.07266v1 link
2023-06-12 On the Collocated Form with Input Decoupling of Lagrangian Systems Pietro Pustina et.al. 2306.07258v1 null
2023-06-09 Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding Mu Cai et.al. 2306.06094v1 null
2023-06-09 HyP-NeRF: Learning Improved NeRF Priors using a HyperNetwork Bipasha Sen et.al. 2306.06093v1 null
2023-06-09 Computational Flash Photography through Intrinsics Sepideh Sarajian Maralan et.al. 2306.06089v1 null
2023-06-09 SENS: Sketch-based Implicit Neural Shape Modeling Alexandre Binninger et.al. 2306.06088v1 null
2023-06-09 Learning Not to Spoof David Byrd et.al. 2306.06087v1 null
2023-06-09 Developing Speech Processing Pipelines for Police Accountability Anjalie Field et.al. 2306.06086v1 null
2023-06-08 Background Prompting for Improved Object Depth Manel Baradad et.al. 2306.05428v1 null
2023-06-08 Grounded Text-to-Image Synthesis with Attention Refocusing Quynh Phung et.al. 2306.05427v1 null
2023-06-08 SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking Chris Cundy et.al. 2306.05426v1 null
2023-06-08 MIMIC-IT: Multi-Modal In-Context Instruction Tuning Bo Li et.al. 2306.05425v1 link
2023-06-08 Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models Muhammad Maaz et.al. 2306.05424v1 link
2023-06-08 ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process Changyao Tian et.al. 2306.05423v1 null
2023-06-08 Stochastic Multi-Person 3D Motion Forecasting Sirui Xu et.al. 2306.05421v1 link
2023-06-08 Scaling Spherical CNNs Carlos Esteves et.al. 2306.05420v1 link
2023-06-08 2D Supervised Monocular 3D Object Detection by Global-to-Local 3D Reconstruction Jiawei He et.al. 2306.05418v1 null
2023-06-07 Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection Yu Bai et.al. 2306.04637v1 link
2023-06-07 GP-UNIT: Generative Prior for Versatile Unsupervised Image-to-Image Translation Shuai Yang et.al. 2306.04636v1 link
2023-06-07 On the Reliability of Watermarks for Large Language Models John Kirchenbauer et.al. 2306.04634v1 link
2023-06-07 Designing a Better Asymmetric VQGAN for StableDiffusion Zixin Zhu et.al. 2306.04632v1 link
2023-06-07 Goal-conditioned GFlowNets for Controllable Multi-Objective Molecular Design Julien Roy et.al. 2306.04620v1 null
2023-06-07 Helicity-dependent optical control of the magnetization state emerging from the Landau-Lifshitz-Gilbert equation Benjamin Assouline et.al. 2306.04617v1 null
2023-06-07 ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory Chenxu Hu et.al. 2306.03901v2 null
2023-06-06 Model Spider: Learning to Rank Pre-Trained Models Efficiently Yi-Kai Zhang et.al. 2306.03900v1 null
2023-06-06 Towards Label-free Scene Understanding by Vision Foundation Models Runnan Chen et.al. 2306.03899v1 link
2023-06-05 Is ChatGPT a Good Teacher Coach? Measuring Zero-Shot Performance For Scoring and Providing Actionable Insights on Classroom Instruction Rose E. Wang et.al. 2306.03090v1 link
2023-06-05 Brain Diffusion for Visual Exploration: Cortical Discovery using Large Scale Generative Models Andrew F. Luo et.al. 2306.03089v1 null
2023-06-05 DeepGraphDMD: Interpretable Spatio-Temporal Decomposition of Non-linear Functional Brain Network Dynamics Md Asadullah Turja et.al. 2306.03088v1 link
2023-06-05 MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion Chiyu Max Jiang et.al. 2306.03083v1 null
2023-06-05 InstructZero: Efficient Instruction Optimization for Black-Box Large Language Models Lichang Chen et.al. 2306.03082v1 link
2023-06-05 Sequential Monte Carlo Steering of Large Language Models using Probabilistic Programs Alexander K. Lew et.al. 2306.03081v1 link
2023-06-05 A General Perspective on Objectives of Reinforcement Learning Long Yang et.al. 2306.03074v1 null
2023-06-05 Explore to Generalize in Zero-Shot RL Ev Zisselman et.al. 2306.03072v1 link
2023-06-02 Multilingual Conceptual Coverage in Text-to-Image Models Michael Saxon et.al. 2306.01735v1 link
2023-06-02 DocFormerv2: Local Features for Document Understanding Srikar Appalaraju et.al. 2306.01733v1 null
2023-06-02 Video Colorization with Pre-trained Text-to-Image Diffusion Models Hanyuan Liu et.al. 2306.01732v1 null
2023-06-02 Improving Generalization in Task-oriented Dialogues with Workflows and Action Plans Stefania Raimondo et.al. 2306.01729v1 null
2023-06-02 Denoising Diffusion Semantic Segmentation with Mask Prior Modeling Zeqiang Lai et.al. 2306.01721v1 link
2023-06-02 Fresh Content Needs More Attention: Multi-funnel Fresh Content Recommendation Jianling Wang et.al. 2306.01720v1 null
2023-06-02 Discreteness of asymptotic tensor ranks Jop Briët et.al. 2306.01718v1 null
2023-06-01 StyleGAN knows Normal, Depth, Albedo, and More Anand Bhattad et.al. 2306.00987v1 null
2023-06-02 Diffusion Self-Guidance for Controllable Image Generation Dave Epstein et.al. 2306.00986v2 null
2023-06-01 StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners Yonglong Tian et.al. 2306.00984v1 link
2023-06-01 StyleDrop: Text-to-Image Generation in Any Style Kihyuk Sohn et.al. 2306.00983v1 null
2023-06-01 SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds Yanyu Li et.al. 2306.00980v1 link
2023-06-01 AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration Ji Lin et.al. 2306.00978v1 link
2023-06-01 Intriguing Properties of Text-guided Diffusion Models Qihao Liu et.al. 2306.00974v1 link
2023-06-01 Intelligent Grimm – Open-ended Visual Storytelling via Latent Diffusion Models Chang Liu et.al. 2306.00973v1 link
2023-06-01 Too Large; Data Reduction for Vision-Language Pre-Training Alex Jinpeng Wang et.al. 2305.20087v2 link
2023-05-31 Understanding and Mitigating Copying in Diffusion Models Gowthami Somepalli et.al. 2305.20086v1 link
2023-05-31 Control4D: Dynamic Portrait Editing by Learning 4D GAN from 2D Diffusion-based Editor Ruizhi Shao et.al. 2305.20082v1 null
2023-05-31 On the Capacity of Secure $K$ -user Product Computation over a Quantum MAC Yuxiang Lu et.al. 2305.20073v1 null
2023-05-31 Latent Exploration for Reinforcement Learning Alberto Silvio Chiappa et.al. 2305.20065v1 link
2023-05-31 Chatting Makes Perfect – Chat-based Image Retrieval Matan Levy et.al. 2305.20062v1 link
2023-05-30 Concise Answers to Complex Questions: Summarization of Long-form Answers Abhilash Potluri et.al. 2305.19271v1 link
2023-05-30 Microfluidics Generation of Millimeter-sized Matrigel Droplets Cory Arnold et.al. 2305.19261v1 null
2023-05-30 Shuffle SGD is Always Better than SGD: Improved Analysis of SGD with Arbitrary Data Orders Anastasia Koloskova et.al. 2305.19259v1 null
2023-05-30 Ambient Diffusion: Learning Clean Distributions from Corrupted Data Giannis Daras et.al. 2305.19256v1 link
2023-05-30 What Can We Learn from Unlearnable Datasets? Pedro Sandoval-Segura et.al. 2305.19254v1 link
2023-05-29 RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths Zeyue Xue et.al. 2305.18295v1 null
2023-05-29 Transformer Language Models Handle Word Frequency in Prediction Head Goro Kobayashi et.al. 2305.18294v1 null
2023-05-29 Direct Preference Optimization: Your Language Model is Secretly a Reward Model Rafael Rafailov et.al. 2305.18290v1 link
2023-05-29 LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections M. Jehanzeb Mirza et.al. 2305.18287v1 null
2023-05-29 Characterization and evasion of backscattered light in the squeezed-light enhanced gravitational wave interferometer GEO 600 Fabio Bergamin et.al. 2305.18284v1 null
2023-05-29 Contextual Object Detection with Multimodal Large Language Models Yuhang Zang et.al. 2305.18279v1 link
2023-05-26 NeuManifold: Neural Watertight Manifold Reconstruction with Efficient and High-Quality Rendering Support Xinyue Wei et.al. 2305.17134v1 null
2023-05-26 RAMP: Retrieval and Attribute-Marking Enhanced Prompting for Attribute-Controlled Translation Gabriele Sarti et.al. 2305.17131v1 null
2023-05-26 Characterizing and Measuring Linguistic Dataset Drift Tyler A. Chang et.al. 2305.17127v1 link
2023-05-26 Large Language Models as Tool Makers Tianle Cai et.al. 2305.17126v1 link
2023-05-26 Manifold Regularization for Memory-Efficient Training of Deep Neural Networks Shadi Sartipi et.al. 2305.17119v1 null
2023-05-26 Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time Zichang Liu et.al. 2305.17118v1 null
2023-05-26 Improving accuracy of GPT-3/4 results on biomedical data using a retrieval-augmented language model David Soong et.al. 2305.17116v1 null
2023-05-25 Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models Shihao Zhao et.al. 2305.16322v1 link
2023-05-25 Parallel Sampling of Diffusion Models Andy Shih et.al. 2305.16317v1 link
2023-05-25 NAP: Neural 3D Articulation Prior Jiahui Lei et.al. 2305.16315v1 null
2023-05-26 Banana: Banach Fixed-Point Network for Pointcloud Segmentation with Inter-Part Equivariance Congyue Deng et.al. 2305.16314v2 null
2023-05-25 UMat: Uncertainty-Aware Single Image High Resolution Material Capture Carlos Rodriguez-Pardo et.al. 2305.16312v1 null
2023-05-25 Break-A-Scene: Extracting Multiple Concepts from a Single Image Omri Avrahami et.al. 2305.16311v1 link
2023-05-25 Securing Deep Generative Models with Universal Adversarial Signature Yu Zeng et.al. 2305.16310v1 link
2023-05-25 Imitating Task and Motion Planning with Visuomotor Transformers Murtaza Dalal et.al. 2305.16309v1 null
2023-05-25 Fine-Grained Complexity Analysis of Multi-Agent Path Finding on 2D Grids Tzvika Geft et.al. 2305.16303v1 null
2023-05-24 Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective Guhao Feng et.al. 2305.15408v1 link
2023-05-24 Balancing the Picture: Debiasing Vision-Language Datasets with Synthetic Contrast Sets Brandon Smith et.al. 2305.15407v1 link
2023-05-24 Sin3DM: Learning a Diffusion Model from a Single 3D Textured Shape Rundi Wu et.al. 2305.15399v1 link
2023-05-24 LayoutGPT: Compositional Visual Planning and Generation with Large Language Models Weixi Feng et.al. 2305.15393v1 link
2023-05-24 A Neural Space-Time Representation for Text-to-Image Personalization Yuval Alaluf et.al. 2305.15391v1 link
2023-05-24 Peek Across: Improving Multi-Document Modeling via Cross-Document Question-Answering Avi Caciularu et.al. 2305.15387v1 link
2023-05-23 NCHO: Unsupervised Learning for Neural 3D Composition of Humans and Objects Taeksoo Kim et.al. 2305.14345v1 link
2023-05-23 Video Prediction Models as Rewards for Reinforcement Learning Alejandro Escontrela et.al. 2305.14343v1 null
2023-05-23 APPLS: A Meta-evaluation Testbed for Plain Language Summarization Yue Guo et.al. 2305.14341v1 link
2023-05-23 Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence Grace Luo et.al. 2305.14334v1 null
2023-05-23 Evaluating and Modeling Attribution for Cross-Lingual Question Answering Benjamin Muller et.al. 2305.14332v1 null
2023-05-23 Large Language Models are Frame-level Directors for Zero-shot Text-to-Video Generation Susung Hong et.al. 2305.14330v1 link
2023-05-23 Zero-sum Polymatrix Markov Games: Equilibrium Collapse and Efficient Computation of Nash Equilibria Fivos Kalogiannis et.al. 2305.14329v1 null
2023-05-23 Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation Da Yin et.al. 2305.14327v1 link
2023-05-22 Contextualising Implicit Representations for Semantic Tasks Theo W. Costain et.al. 2305.13312v1 null
2023-05-22 VDT: An Empirical Study on Video Diffusion with Transformers Haoyu Lu et.al. 2305.13311v1 link
2023-05-22 Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching Yang Liu et.al. 2305.13310v1 link
2023-05-22 Evaluating Factual Consistency of Texts with Semantic Role Labeling Jing Fan et.al. 2305.13309v1 link
2023-05-22 If at First You Don’t Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection Shyamgopal Karthik et.al. 2305.13308v1 link
2023-05-22 NeRFuser: Large-Scale Scene Representation by NeRF Fusion Jiading Fang et.al. 2305.13307v1 link
2023-05-22 Growth of ultrawide-bandgap BN/diamond heterostructures by pulsed laser deposition Abhijit Biswas et.al. 2305.13306v1 null
2023-05-22 RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text Wangchunshu Zhou et.al. 2305.13304v1 link
2023-05-23 Training Diffusion Models with Reinforcement Learning Kevin Black et.al. 2305.13301v2 link
2023-05-22 Measuring Inductive Biases of In-Context Learning with Underspecified Demonstrations Chenglei Si et.al. 2305.13299v1 link
2023-05-19 Chupa: Carving 3D Clothed Humans from Skinned Shape Priors using 2D Diffusion Probabilistic Models Byungjun Kim et.al. 2305.11870v1 link
2023-05-19 Reducing Sequence Length by Predicting Edit Operations with Large Language Models Masahiro Kaneko et.al. 2305.11862v1 null
2023-05-19 Video Killed the HD-Map: Predicting Driving Behavior Directly From Drone Images Yunpeng Liu et.al. 2305.11856v1 null
2023-05-19 Multimodal Web Navigation with Instruction-Finetuned Foundation Models Hiroki Furuta et.al. 2305.11854v1 null
2023-05-19 Poincare and Einstein on Mass-Energy Equivalence: A Modern Perspective on their 1900 and 1905 Papers Patrick Moylan et.al. 2305.11852v1 null
2023-05-19 Any-to-Any Generation via Composable Diffusion Zineng Tang et.al. 2305.11846v1 link
2023-05-18 Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model Siyuan Huang et.al. 2305.11176v1 link
2023-05-18 VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks Wenhai Wang et.al. 2305.11175v1 link
2023-05-18 Going Denser with Open-Vocabulary Part Segmentation Peize Sun et.al. 2305.11173v1 link
2023-05-18 ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities Peng Wang et.al. 2305.11172v1 link
2023-05-18 TrueTeacher: Learning Factual Consistency Evaluation with Large Language Models Zorik Gekhman et.al. 2305.11171v1 link
2023-05-18 Efficient Prompting via Dynamic In-Context Learning Wangchunshu Zhou et.al. 2305.11170v1 null
2023-05-18 Evidence of Meaning in Language Models Trained on Programs Charles Jin et.al. 2305.11169v1 null
2023-05-17 FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention Guangxuan Xiao et.al. 2305.10431v1 link
2023-05-17 CLIP-GCD: Simple Language Guided Generalized Category Discovery Rabah Ouldnoughi et.al. 2305.10420v1 null
2023-05-17 Towards Multi-Layered 3D Garments Animation Yidi Shao et.al. 2305.10418v1 null
2023-05-17 Scratch Copilot Evaluation: Assessing AI-Assisted Creative Coding for Families Stefania Druga et.al. 2305.10417v1 null
2023-05-18 PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering Xiaoman Zhang et.al. 2305.10415v2 link
2023-05-17 AI Friends: A Design Framework for AI-Powered Creative Programming for Youth Stefania Druga et.al. 2305.10412v1 null
2023-05-17 Data Extraction via Semantic Regular Expression Synthesis Qiaochu Chen et.al. 2305.10401v1 null
2023-05-16 Understanding 3D Object Interaction from a Single Image Shengyi Qian et.al. 2305.09664v1 link
2023-05-16 Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation Samaneh Azadi et.al. 2305.09662v1 null
2023-05-16 Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage Jose Blanchet et.al. 2305.09659v1 null
2023-05-16 Newad: A register map automation tool for Verilog Vamsi K Vytla et.al. 2305.09657v1 null
2023-05-17 Satisfiability-Aided Language Models Using Declarative Prompting Xi Ye et.al. 2305.09656v2 link
2023-05-16 Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation Yuxin Ren et.al. 2305.09651v1 link
2023-05-16 Wavelet-based Unsupervised Label-to-Image Translation George Eskandar et.al. 2305.09647v1 link
2023-05-15 Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models Antoni Bigata Casademunt et.al. 2305.08854v1 link
2023-05-15 CQE: A Comprehensive Quantity Extractor Satya Almasian et.al. 2305.08853v1 link
2023-05-15 MV-Map: Offboard HD-Map Generation with Multi-view Consistency Ziyang Xie et.al. 2305.08851v1 link
2023-05-15 Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts Yuyang Zhao et.al. 2305.08850v1 null
2023-05-15 Privacy Auditing with One (1) Training Run Thomas Steinke et.al. 2305.08846v1 null
2023-05-15 Large Language Models are Zero-Shot Rankers for Recommender Systems Yupeng Hou et.al. 2305.08845v1 link
2023-05-15 RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs Afra Feyza Akyürek et.al. 2305.08844v1 link
2023-05-15 Straightening Out the Straight-Through Estimator: Overcoming Optimization Challenges in Vector Quantized Networks Minyoung Huh et.al. 2305.08842v1 null
2023-05-15 Attacking Perceptual Similarity Metrics Abhijay Ghildyal et.al. 2305.08840v1 null
2023-05-12 Text2Cohort: Democratizing the NCI Imaging Data Commons with Natural Language Cohort Discovery Pranav Kulkarni et.al. 2305.07637v1 link
2023-05-12 Development of MC/DC: a performant, scalable, and portable Python-based Monte Carlo neutron transport code Ilham Variansyah et.al. 2305.07636v1 link
2023-05-12 Zero-shot Item-based Recommendation via Multi-task Product Knowledge Graph Pre-Training Ziwei Fan et.al. 2305.07633v1 null
2023-05-12 Design, Development, and Evaluation of an Interactive Personalized Social Robot to Monitor and Coach Post-Stroke Rehabilitation Exercises Min Hun Lee et.al. 2305.07632v1 null
2023-05-11 SparseGNV: Generating Novel Views of Indoor Scenes with Sparse Input Views Weihao Cheng et.al. 2305.07024v1 link
2023-05-11 Simple Token-Level Confidence Improves Caption Correctness Suzanne Petryk et.al. 2305.07021v1 null
2023-05-11 A General-Purpose Multilingual Document Encoder Onur Galoğlu et.al. 2305.07016v1 link
2023-05-11 Exploiting Diffusion Prior for Real-World Image Super-Resolution Jianyi Wang et.al. 2305.07015v1 link
2023-05-11 Occam’s razor for AI: Coarse-graining Hammett Inspired Product Ansatz in Chemical Space Marco Bragato et.al. 2305.07010v1 null
2023-05-11 Fair Price Discrimination Siddhartha Banerjee et.al. 2305.07006v1 null
2023-05-11 Subword Segmental Machine Translation: Unifying Segmentation and Target Sentence Generation Francois Meyer et.al. 2305.07005v1 link
2023-05-11 Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting Haoyang Huang et.al. 2305.07004v1 null
2023-05-11 Real-time Manipulation of Liquid Droplets using Photo-responsive Surfactant Xichen Liang et.al. 2305.07002v1 null
2023-05-10 Generalizations and Extensions to Lifting Constructions for Coded Caching V. R. Aravind et.al. 2305.06352v1 null
2023-05-10 RECKONING: Reasoning through Dynamic Knowledge Encoding Zeming Chen et.al. 2305.06349v1 link
2023-05-10 Frequency-Supported Neural Networks for Nonlinear Dynamical System Identification Krzysztof Zając et.al. 2305.06344v1 link
2023-05-10 Incorporating Structured Representations into Pretrained Vision & Language Models Using Scene Graphs Roei Herzig et.al. 2305.06343v1 null
2023-05-10 Generalized Stratified Sampling for Efficient Reliability Assessment of Structures Against Natural Hazards Srinivasan Arunachalam et.al. 2305.06338v1 null
2023-05-10 K-UniMorph: Korean Universal Morphology and its Feature Schema Eunkyul Leah Jo et.al. 2305.06335v1 link
2023-05-10 Direct-Laser-Written Polymer Nanowire Waveguides for Broadband Single Photon Collection from Epitaxial Quantum Dots into a Gaussian-like Mode Edgar Perez et.al. 2305.06333v1 null
2023-05-09 Policy Gradient Methods in the Presence of Symmetries and State Abstractions Prakash Panangaden et.al. 2305.05666v1 link
2023-05-09 ImageBind: One Embedding Space To Bind Them All Rohit Girdhar et.al. 2305.05665v1 link
2023-05-10 InternChat: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language Zhaoyang Liu et.al. 2305.05662v2 link
2023-05-09 TidyBot: Personalized Robot Assistance with Large Language Models Jimmy Wu et.al. 2305.05658v1 link
2023-05-09 Using Knowledge Units of Programming Languages to Recommend Reviewers for Pull Requests: An Empirical Study Md Ahasanuzzaman et.al. 2305.05654v1 null
2023-05-09 Asymmetric $X$-Secure $T$ -Private Information Retrieval: More Databases is Not Always Better Mohamed Nomeir et.al. 2305.05649v1 null
2023-05-08 Learning to Evaluate the Artness of AI-generated Images Junyu Chen et.al. 2305.04923v1 null
2023-05-08 DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models Sicheng Yang et.al. 2305.04919v1 link
2023-05-08 What Do Patients Say About Their Disease Symptoms? Deep Multilabel Text Classification With Human-in-the-Loop Curation for Automatic Labeling of Patient Self Reports of Problems Lakshmi Arbatti et.al. 2305.04905v1 null
2023-05-08 Robust Positivity Problems for low-order Linear Recurrence Sequences Mihir Vahanwala et.al. 2305.04870v1 null
2023-05-05 On the Benefits of Semi-Supervised Test Case Generation for Cyber-Physical Systems Xiao Ling et.al. 2305.03714v1 null
2023-05-05 Avatar Fingerprinting for Authorized Use of Synthetic Talking-Head Videos Ekta Prashnani et.al. 2305.03713v1 null
2023-05-08 On the characterization of the convective heat flux in turbulent Rayleigh-Bénard convection Bérengère Podvin et.al. 2305.03708v2 null
2023-05-05 LMEye: An Interactive Perception Network for Large Language Models Yunxin Li et.al. 2305.03701v1 link
2023-05-05 Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements Jiacheng Liu et.al. 2305.03695v1 link
2023-05-05 Mining bias-target Alignment from Voronoi Cells Rémi Nahon et.al. 2305.03691v1 link
2023-05-05 COLA: How to adapt vision-language models to Compose Objects Localized with Attributes? Arijit Ray et.al. 2305.03689v1 link
2023-05-04 ZipIt! Merging Models from Different Tasks without Training George Stoica et.al. 2305.03053v1 link
2023-05-04 Controllable Visual-Tactile Synthesis Ruihan Gao et.al. 2305.03051v1 link
2023-05-04 NeuralEditor: Editing Neural Radiance Fields via Manipulating Point Clouds Jun-Kun Chen et.al. 2305.03049v1 null
2023-05-04 Personalize Segment Anything Model with One Shot Renrui Zhang et.al. 2305.03048v1 link
2023-05-04 Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision Zhiqing Sun et.al. 2305.03047v1 link
2023-05-04 OctFormer: Octree-based Transformers for 3D Point Clouds Peng-Shuai Wang et.al. 2305.03045v1 link
2023-05-04 Single-Shot Implicit Morphable Faces with Consistent Texture Parameterization Connor Z. Lin et.al. 2305.03043v1 null
2023-05-04 Are VAEs Bad at Reconstructing Molecular Graphs? Hagen Muenkler et.al. 2305.03041v1 null
2023-05-04 TUVF: Learning Generalizable Texture UV Radiance Fields An-Chieh Cheng et.al. 2305.03040v1 null
2023-05-03 Characterizing Political Bias in Automatic Summaries: A Case Study of Trump and Biden Karen Zhou et.al. 2305.02321v1 link
2023-05-03 Generating Synthetic Documents for Cross-Encoder Re-Rankers: A Comparative Study of ChatGPT and Human Experts Arian Askari et.al. 2305.02320v1 link
2023-05-03 Visual Chain of Thought: Bridging Logical Gaps with Multimodal Infillings Daniel Rose et.al. 2305.02317v1 null
2023-05-03 AG3D: Learning to Generate 3D Avatars from 2D Image Collections Zijian Dong et.al. 2305.02312v1 null
2023-05-03 Real-Time Radiance Fields for Single-Image Portrait View Synthesis Alex Trevithick et.al. 2305.02310v1 null
2023-05-03 Calibrated Explanations: with Uncertainty Information and Counterfactuals Helena Lofstrom et.al. 2305.02305v1 link
2023-05-02 Humans as Light Bulbs: 3D Human Reconstruction from Thermal Reflection Ruoshi Liu et.al. 2305.01652v1 null
2023-05-02 Generalizing Dataset Distillation via Deep Generative Prior George Cazenavette et.al. 2305.01649v1 link
2023-05-02 Sequence Modeling with Multiresolution Convolutional Memory Jiaxin Shi et.al. 2305.01638v1 link
2023-05-02 The Benefits of Bad Advice: Autocontrastive Decoding across Model Layers Ariel Gera et.al. 2305.01628v1 link
2023-05-02 Basic syntax from speech: Spontaneous concatenation in unsupervised deep neural networks Gašper Beguš et.al. 2305.01626v1 null
2023-05-02 TMR: Text-to-Motion Retrieval Using Contrastive 3D Human Motion Synthesis Mathis Petrovich et.al. 2305.00976v1 null
2023-05-01 ArK: Augmented Reality with Knowledge Interactive Emergent Ability Qiuyuan Huang et.al. 2305.00970v1 null
2023-05-01 PMDG: Privacy for Multi-Perspective Process Mining through Data Generalization Ryan Hildebrant et.al. 2305.00960v1 null
2023-05-01 Non-Binary LDPC Code Design for Energy-Time Entanglement Quantum Key Distribution Debarnab Mitra et.al. 2305.00956v1 null
2023-05-01 Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation Patrick Fernandes et.al. 2305.00955v1 null
2023-04-28 LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model Peng Gao et.al. 2304.15010v1 link
2023-04-28 Empirical Analysis of the Strengths and Weaknesses of PEFT Techniques for LLMs George Pu et.al. 2304.14999v1 null
2023-04-28 ChatGPT – a Blessing or a Curse for Undergraduate Computer Science Students and Instructors? Ishika Joshi et.al. 2304.14993v1 null
2023-04-28 Robust Stackelberg Equilibria Jiarui Gan et.al. 2304.14990v1 null
2023-04-28 Interpreting Vision and Language Generative Models with Semantic Visual Priors Michele Cafagna et.al. 2304.14986v1 null
2023-04-28 Optimal majority rules and quantitative Condorcet properties of setwise Kemeny voting schemes Xuan Kien Phung et.al. 2304.14980v1 null
2023-04-28 MLCopilot: Unleashing the Power of Large Language Models in Solving Machine Learning Tasks Lei Zhang et.al. 2304.14979v1 link
2023-04-27 ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System Junke Wang et.al. 2304.14407v1 null
2023-04-27 Motion-Conditioned Diffusion Model for Controllable Video Synthesis Tsai-Shien Chen et.al. 2304.14404v1 null
2023-04-27 LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions Minghao Wu et.al. 2304.14402v1 link
2023-04-27 ActorsNeRF: Animatable Few-shot Human Rendering with Generalizable NeRFs Jiteng Mu et.al. 2304.14401v1 null
2023-04-27 IconShop: Text-Based Vector Icon Synthesis with Autoregressive Transformers Ronghuan Wu et.al. 2304.14400v1 null
2023-04-27 We’re Afraid Language Models Aren’t Modeling Ambiguity Alisa Liu et.al. 2304.14399v1 link
2023-04-27 Maximizing Model Generalization for Manufacturing with Self-Supervised Learning and Federated Learning Matthew Russell et.al. 2304.14398v1 null
2023-04-27 Learning Articulated Shape with Keypoint Pseudo-labels from Web Images Anastasis Stathopoulos et.al. 2304.14396v1 null
2023-04-27 SeqTrack: Sequence to Sequence Learning for Visual Object Tracking Xin Chen et.al. 2304.14394v1 link
2023-04-26 Controllable Image Generation via Collage Representations Arantxa Casanova et.al. 2304.13722v1 null
2023-04-26 Evaluation of GPT-3.5 and GPT-4 for supporting real-world information needs in healthcare delivery Debadutta Dash et.al. 2304.13714v1 null
2023-04-27 Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond Jingfeng Yang et.al. 2304.13712v2 link
2023-04-26 UniNeXt: Exploring A Unified Architecture for Vision Recognition Fangjian Lin et.al. 2304.13700v1 link
2023-04-26 Hitting Subgraphs in Sparse Graphs and Geometric Intersection Graphs Daniel Lokshtanov et.al. 2304.13695v1 null
2023-04-26 HeySQuAD: A Spoken Question Answering Dataset Yijing Wu et.al. 2304.13689v1 link
2023-04-25 DQS3D: Densely-matched Quantization-aware Semi-supervised 3D Detection Huan-ang Gao et.al. 2304.13031v1 link
2023-04-25 On the mechanism of polaritonic rate suppression from quantum transition paths Michelle C. Anderson et.al. 2304.13024v1 null
2023-04-25 Seeing is not always believing: A Quantitative Study on Human Perception of AI-Generated Images Zeyu Lu et.al. 2304.13023v1 link
2023-04-25 Certifying Ensembles: A General Certification Theory with S-Lipschitzness Aleksandar Petrov et.al. 2304.13019v1 null
2023-04-25 Bibliometric Data Fusion for Biomedical Information Retrieval Timo Breuer et.al. 2304.13012v1 null
2023-04-25 The Potential of Visual ChatGPT For Remote Sensing Lucas Prado Osco et.al. 2304.13009v1 null
2023-04-25 Answering Questions by Meta-Reasoning over Multiple Chains of Thought Ori Yoran et.al. 2304.13007v1 link
2023-04-24 Explicit Correspondence Matching for Generalizable Neural Radiance Fields Yuedong Chen et.al. 2304.12294v1 link
2023-04-24 Synthpop++: A Hybrid Framework for Generating A Country-scale Synthetic Population Bhavesh Neekhra et.al. 2304.12284v1 link
2023-04-21 Deep-Learning-based Fast and Accurate 3D CT Deformable Image Registration in Lung Cancer Yuzhen Ding et.al. 2304.11135v1 null
2023-04-20 Learning Sparse and Low-Rank Priors for Image Recovery via Iterative Reweighted Least Squares Minimization Stamatios Lefkimmiatis et.al. 2304.10536v1 null
2023-04-20 Farm3D: Learning Articulated 3D Animals by Distilling 2D Diffusion Tomas Jakab et.al. 2304.10535v1 null
2023-04-20 Collaborative Diffusion for Multi-Modal Face Generation and Editing Ziqi Huang et.al. 2304.10530v1 link
2023-04-20 Generalizing Neural Human Fitting to Unseen Poses With Articulated SE(3) Equivariance Haiwen Feng et.al. 2304.10528v1 null
2023-04-20 Multidimensional Uncertainty Quantification for Deep Neural Networks Xujiang Zhao et.al. 2304.10527v1 null
2023-04-20 GenCorres: Consistent Shape Matching via Coupled Implicit-Explicit Shape Generative Models Haitao Yang et.al. 2304.10523v1 link
2023-04-20 Contrastive Tuning: A Little Help to Make Masked Autoencoders Forget Johannes Lehner et.al. 2304.10520v1 link
2023-04-19 LipsFormer: Introducing Lipschitz Continuity to Vision Transformers Xianbiao Qi et.al. 2304.09856v1 link
2023-04-19 Bridging RL Theory and Practice with the Effective Horizon Cassidy Laidlaw et.al. 2304.09853v1 link
2023-04-19 Evaluating Verifiability in Generative Search Engines Nelson F. Liu et.al. 2304.09848v1 link
2023-04-19 Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models Pan Lu et.al. 2304.09842v1 link
2023-04-19 Points of non-linearity of functions generated by random neural networks David Holmes et.al. 2304.09837v1 null
2023-04-18 Optimal PAC Bounds Without Uniform Convergence Ishaq Aden-Ali et.al. 2304.09167v1 null
2023-04-18 Exploring the Trade-Offs: Unified Large Language Models vs Local Fine-Tuned Models for Highly-Specific Radiology NLI Task Zihao Wu et.al. 2304.09138v1 null
2023-04-17 Conditional Generation of Audio from Video via Foley Analogies Yuexi Du et.al. 2304.08490v1 link
2023-04-17 Hyper-Decision Transformer for Efficient Online Policy Adaptation Mengdi Xu et.al. 2304.08487v1 null
2023-04-17 Visual Instruction Tuning Haotian Liu et.al. 2304.08485v1 link
2023-04-17 Text2Performer: Text-Driven Human Video Generation Yuming Jiang et.al. 2304.08483v1 link
2023-04-17 Towards Robust Prompts on Vision-Language Models Jindong Gu et.al. 2304.08479v1 null
2023-04-18 Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation Jie An et.al. 2304.08477v2 null
2023-04-14 Cross-Entropy Loss Functions: Theoretical Analysis and Applications Anqi Mao et.al. 2304.07288v1 null
2023-04-14 Solving Unique Games over Globally Hypercontractive Graphs Mitali Bafna et.al. 2304.07284v1 null
2023-04-14 Synthetically Generating Human-like Data for Sequential Decision Making Tasks via Reward-Shaped Imitation Learning Bryan Brandt et.al. 2304.07280v1 null
2023-04-17 Identifying Cluttering Edges in Near-Planar Graphs Simon van Wageningen et.al. 2304.07274v2 link
2023-04-13 Expressive Text-to-Image Generation with Rich Text Songwei Ge et.al. 2304.06720v1 null
2023-04-13 Single-Stage Diffusion NeRF: A Unified Approach to 3D Generation and Reconstruction Hansheng Chen et.al. 2304.06714v1 link
2023-04-13 What does CLIP know about a red circle? Visual prompt engineering for VLMs Aleksandar Shtedritski et.al. 2304.06712v1 null
2023-04-13 DiffusionRig: Learning Personalized Priors for Facial Appearance Editing Zheng Ding et.al. 2304.06711v1 link
2023-04-13 How Will It Drape Like? Capturing Fabric Mechanics from Depth Images Carlos Rodriguez-Pardo et.al. 2304.06704v1 null
2023-04-13 Learning Controllable 3D Diffusion Models from Single-view Images Jiatao Gu et.al. 2304.06700v1 null
2023-04-13 Improving novelty detection with generative adversarial networks on hand gesture data Miguel Simão et.al. 2304.06696v1 null
2023-04-12 Continual Diffusion: Continual Customization of Text-to-Image Diffusion with C-LoRA James Seale Smith et.al. 2304.06027v1 null
2023-04-12 DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion Johanna Karras et.al. 2304.06025v1 null
2023-04-12 Probabilistic Human Mesh Recovery in 3D Scenes from Egocentric Views Siwei Zhang et.al. 2304.06024v1 link
2023-04-12 SAM Struggles in Concealed Scenes – Empirical Study on “Segment Anything” Ge-Peng Ji et.al. 2304.06022v1 null
2023-04-12 Crowd Counting with Sparse Annotation Shiwei Zhang et.al. 2304.06021v1 null
2023-04-12 VidStyleODE: Disentangled Video Editing via StyleGAN and NeuralODEs Moayed Haji Ali et.al. 2304.06020v1 null
2023-04-12 Generating Aligned Pseudo-Supervision from Non-Aligned Data for Image Restoration in Under-Display Camera Ruicheng Feng et.al. 2304.06019v1 link
2023-04-12 Bi-level Latent Variable Model for Sample-Efficient Multi-Agent Reinforcement Learning Aravind Venugopal et.al. 2304.06011v1 null
2023-04-11 HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models Eslam Mohamed Bakr et.al. 2304.05390v1 link
2023-04-11 Human-AI Co-Creation Approach to Find Forever Chemicals Replacements Juliana Jansen Ferreira et.al. 2304.05389v1 null
2023-04-11 MOST: Multiple Object localization with Self-supervised Transformers for object discovery Sai Saketh Rambhatla et.al. 2304.05387v1 null
2023-04-11 Bloom filters for molecules Jorge Medina et.al. 2304.05386v1 link
2023-04-10 A Cheaper and Better Diffusion Language Model with Soft-Masked Noise Jiaao Chen et.al. 2304.04746v1 link
2023-04-10 Ambiguous Medical Image Segmentation using Diffusion Models Aimon Rahman et.al. 2304.04745v1 link
2023-04-10 On the Possibilities of AI-Generated Text Detection Souradip Chakraborty et.al. 2304.04736v1 null
2023-04-07 Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following Mingyu Ding et.al. 2304.03767v1 null
2023-04-07 Language Models are Causal Knowledge Extractors for Zero-shot Video Question Answering Hung-Ting Su et.al. 2304.03754v1 null
2023-04-07 V3Det: Vast Vocabulary Visual Detection Dataset Jiaqi Wang et.al. 2304.03752v1 null
2023-04-07 Perspectives on AI Architectures and Co-design for Earth System Predictability Maruti K. Mudunuru et.al. 2304.03748v1 null
2023-04-07 Assessing Perceived Fairness from Machine Learning Developer’s Perspective Anoop Mishra et.al. 2304.03745v1 null
2023-04-06 Diffusion Models as Masked Autoencoders Chen Wei et.al. 2304.03283v1 null
2023-04-06 Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark Alexander Pan et.al. 2304.03279v1 link
2023-04-06 How Do US Congress Members Advertise Climate Change: An Analysis Of Ads Run On Meta’s Platforms Laurenz Aisenpreis et.al. 2304.03278v1 null
2023-04-06 Instruction Tuning with GPT-4 Baolin Peng et.al. 2304.03277v1 link
2023-04-06 That’s What I Said: Fully-Controllable Talking Face Generation Youngjoon Jang et.al. 2304.03275v1 null
2023-04-06 Towards self-driving laboratories in chemistry and materials sciences: The central role of DFT in the era of AI Bing Huang et.al. 2304.03272v1 null
2023-04-06 Causal Discovery with Score Matching on Additive Models with Arbitrary Noise Francesco Montagna et.al. 2304.03265v1 null
2023-04-05 Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models Xuhui Jia et.al. 2304.02642v1 null
2023-04-05 ENTL: Embodied Navigation Trajectory Learner Klemen Kotar et.al. 2304.02639v1 null
2023-04-05 GenPhys: From Physical Processes to Generative Models Ziming Liu et.al. 2304.02637v1 null
2023-04-05 HNeRV: A Hybrid Neural Representation for Videos Hao Chen et.al. 2304.02633v1 link
2023-04-05 Towards Explainable AI Writing Assistants for Non-native English Speakers Yewon Kim et.al. 2304.02625v1 null
2023-04-05 High-fidelity Pseudo-labels for Boosting Weakly-Supervised Segmentation Arvi Jonnarth et.al. 2304.02621v1 link
2023-04-04 Large Language Models are Edge-Case Fuzzers: Testing Deep Learning Libraries via FuzzGPT Yinlin Deng et.al. 2304.02014v1 null
2023-04-04 NPC: Neural Point Characters from Video Shih-Yang Su et.al. 2304.02013v1 null
2023-04-04 EGC: Image Generation and Classification via a Single Energy-Based Model Qiushan Guo et.al. 2304.02012v1 link
2023-04-04 FakET: Simulating Cryo-Electron Tomograms with Neural Style Transfer Pavol Harar et.al. 2304.02011v1 link
2023-04-04 OrienterNet: Visual Localization in 2D Public Maps with Neural Matching Paul-Edouard Sarlin et.al. 2304.02009v1 null
2023-04-04 MonoHuman: Animatable Human Neural Field from Monocular Video Zhengming Yu et.al. 2304.02001v1 null
2023-04-04 Revisiting the Evaluation of Image Synthesis with GANs Mengping Yang et.al. 2304.01999v1 link
2023-04-03 Video Instance Segmentation in an Open-World Omkar Thawakar et.al. 2304.01200v1 link
2023-04-03 Zero-Shot Semantic Segmentation with Decoupled One-Pass Network Cong Han et.al. 2304.01198v1 link
2023-04-03 Bringing Telepresence to Every Desk Shengze Wang et.al. 2304.01197v1 null
2023-04-04 Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data Canwen Xu et.al. 2304.01196v2 link
2023-04-03 Burstormer: Burst Image Restoration and Enhancement Transformer Akshay Dudhane et.al. 2304.01194v1 link
2023-04-03 Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos Yue Ma et.al. 2304.01186v1 link
2023-04-03 Whistler Wave Observations by \textit{Parker Solar Probe} During Encounter $1$ : Counter-Propagating Whistlers Collocated with Magnetic Field Inhomogeneities and their Application to Electric Field Measurement Calibration S. Karbashewski et.al. 2304.01185v1 null
2023-03-31 Towards Flexible Multi-modal Document Models Naoto Inoue et.al. 2303.18248v1 link
2023-03-31 Speeding up Madgraph5 aMC@NLO through CPU vectorization and GPU offloading: towards a first alpha release Andrea Valassi et.al. 2303.18244v1 null
2023-03-31 $\infty$ -Diff: Infinite Resolution Diffusion with Subsampled Mollified States Sam Bond-Taylor et.al. 2303.18242v1 link
2023-03-31 Procedure-Aware Pretraining for Instructional Video Understanding Honglu Zhou et.al. 2303.18230v1 link
2023-03-31 A Survey of Large Language Models Wayne Xin Zhao et.al. 2303.18223v1 link
2023-03-31 SemHint-MD: Learning from Noisy Semantic Labels for Self-Supervised Monocular Depth Estimation Shan Lin et.al. 2303.18219v1 null
2023-03-31 A Closer Look at Few-Shot 3D Point Cloud Classification Chuangguan Ye et.al. 2303.18210v1 link
2023-03-30 AvatarCraft: Transforming Text into Neural Human Avatars with Parameterized Shape and Pose Control Ruixiang Jiang et.al. 2303.17606v1 link
2023-03-30 Token Merging for Fast Stable Diffusion Daniel Bolya et.al. 2303.17604v1 link
2023-03-30 NeRF-Supervised Deep Stereo Fabio Tosi et.al. 2303.17603v1 link
2023-03-30 Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks Weihua Chen et.al. 2303.17602v1 link
2023-03-30 When Learning Is Out of Reach, Reset: Generalization in Autonomous Visuomotor Reinforcement Learning Zichen Zhang et.al. 2303.17600v1 null
2023-03-30 Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models Wen Wang et.al. 2303.17599v1 link
2023-03-30 Consistent View Synthesis with Pose-Guided Diffusion Models Hung-Yu Tseng et.al. 2303.17598v1 null
2023-03-30 MobileInst: Video Instance Segmentation on the Mobile Renhong Zhang et.al. 2303.17594v1 null
2023-03-29 AutoAD: Movie Description in Context Tengda Han et.al. 2303.16899v1 link
2023-03-29 Bagging by Learning to Singulate Layers Using Interactive Perception Lawrence Yunliang Chen et.al. 2303.16898v1 null
2023-03-29 Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos Kun Su et.al. 2303.16897v1 null
2023-03-29 Multi-scale Hierarchical Vision Transformer with Cascaded Attention Decoding for Medical Image Segmentation Md Mostafijur Rahman et.al. 2303.16892v1 link
2023-03-29 Mask-free OVIS: Open-Vocabulary Instance Segmentation without Manual Mask Annotations Vibashan VS et.al. 2303.16891v1 null
2023-03-29 DPF: Learning Dense Prediction Fields with Weak Supervision Xiaoxue Chen et.al. 2303.16890v1 link
2023-03-29 Towards Understanding the Effect of Pretraining Label Granularity Guan Zhe Hong et.al. 2303.16887v1 null
2023-03-29 End-to-End $n$ -ary Relation Extraction for Combination Drug Therapies Yuhang Jiang et.al. 2303.16886v1 link
2023-03-29 Instant Neural Radiance Fields Stylization Shaoxu Li et.al. 2303.16884v1 link
2023-03-29 Your Diffusion Model is Secretly a Zero-Shot Classifier Alexander C. Li et.al. 2303.16203v2 link
2023-03-28 LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention Renrui Zhang et.al. 2303.16199v1 link
2023-03-28 BC-IRL: Learning Generalizable Reward Functions from Demonstrations Andrew Szot et.al. 2303.16194v1 null
2023-03-28 Planning with Sequence Models through Iterative Energy Minimization Hongyi Chen et.al. 2303.16189v1 null
2023-03-28 Visual Chain-of-Thought Diffusion Models William Harvey et.al. 2303.16187v1 link
2023-03-28 Label Smoothing Improves Neural Source Code Summarization Sakib Haque et.al. 2303.16178v1 null
2023-03-27 IRFL: Image Recognition of Figurative Language Ron Yosef et.al. 2303.15445v1 link
2023-03-27 Zero-shot Model Diagnosis Jinqi Luo et.al. 2303.15441v1 null
2023-03-27 FaceLit: Neural 3D Relightable Faces Anurag Ranjan et.al. 2303.15437v1 null
2023-03-27 The Stable Signature: Rooting Watermarks in Latent Diffusion Models Pierre Fernandez et.al. 2303.15435v1 link
2023-03-27 Anti-DreamBooth: Protecting users from personalized text-to-image synthesis Thanh Van Le et.al. 2303.15433v1 link
2023-03-27 TextMI: Textualize Multimodal Information for Integrating Non-verbal Cues in Pre-trained Language Models Md Kamrul Hasan et.al. 2303.15430v1 null
2023-03-27 JAWS: Just A Wild Shot for Cinematic Transfer in Neural Radiance Fields Xi Wang et.al. 2303.15427v1 link
2023-03-24 Masked Scene Contrast: A Scalable Framework for Unsupervised 3D Representation Learning Xiaoyang Wu et.al. 2303.14191v1 link
2023-03-24 Learning from Few Demonstrations with Frame-Weighted Motion Generation Jianyong Sun et.al. 2303.14188v1 null
2023-03-24 Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior Junshu Tang et.al. 2303.14184v1 link
2023-03-24 Scaling Expert Language Models with Unsupervised Domain Discovery Suchin Gururangan et.al. 2303.14177v1 link
2023-03-24 A Hybrid ANN-SNN Architecture for Low-Power and Low-Latency Visual Perception Asude Aydin et.al. 2303.14176v1 null
2023-03-24 UrbanGIRAFFE: Representing Urban Scenes as Compositional Generative Neural Feature Fields Yuanbo Yang et.al. 2303.14167v1 null
2023-03-23 Ablating Concepts in Text-to-Image Diffusion Models Nupur Kumari et.al. 2303.13516v1 link
2023-03-23 Persistent Nature: A Generative Model of Unbounded 3D Worlds Lucy Chai et.al. 2303.13515v1 link
2023-03-23 DreamBooth3D: Subject-Driven Text-to-3D Generation Amit Raj et.al. 2303.13508v1 null
2023-03-23 A Large-scale Study of Spatiotemporal Representation Learning with a New Benchmark on Action Recognition Andong Deng et.al. 2303.13505v1 link
2023-03-23 Chordal Averaging on Flag Manifolds and Its Applications Nathan Mankovich et.al. 2303.13501v1 link
2023-03-23 A Closer Look at Model Adaptation using Feature Distortion and Simplicity Bias Puja Trivedi et.al. 2303.13500v1 null
2023-03-23 TriPlaneNet: An Encoder for EG3D Inversion Ananta R. Bhattarai et.al. 2303.13497v1 null
2023-03-22 Diffuse-Denoise-Count: Accurate Crowd-Counting with Diffusion Models Yasiru Ranasinghe et.al. 2303.12790v1 link
2023-03-22 EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation Hansheng Chen et.al. 2303.12787v1 link
2023-03-22 Localization-based OFDM framework for RIS-aided systems Fabio Saggese et.al. 2303.12763v1 link
2023-03-22 MaskCon: Masked Contrastive Learning for Coarse-Labelled Dataset Chen Feng et.al. 2303.12756v1 link
2023-03-22 Invariants for time-dependent Hamiltonian systems Jürgen Struckmeier et.al. 2303.12746v1 null
2023-03-22 Comment on the elastica section in Thorne and Blandford “Modern Classical Physics”, the shape of things, and the aspect ratio of reality J. A. Hanna et.al. 2303.12729v1 null
2023-03-21 Natural Language-Assisted Sign Language Recognition Ronglai Zuo et.al. 2303.12080v1 link
2023-03-21 Two-shot Video Object Segmentation Kun Yan et.al. 2303.12078v1 link
2023-03-21 CC3D: Layout-Conditioned Generation of Compositional 3D Scenes Sherwin Bahmani et.al. 2303.12074v1 null
2023-03-21 ProphNet: Efficient Agent-Centric Motion Forecasting with Anchor-Informed Proposals Xishun Wang et.al. 2303.12071v1 null
2023-03-21 Machine Learning for Brain Disorders: Transformers and Visual Transformers Robin Courant et.al. 2303.12068v1 null
2023-03-20 EVA-02: A Visual Representation for Neon Genesis Yuxin Fang et.al. 2303.11331v1 link
2023-03-20 Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation Ziyang Chen et.al. 2303.11329v1 link
2023-03-20 Zero-1-to-3: Zero-shot One Image to 3D Object Ruoshi Liu et.al. 2303.11328v1 link
2023-03-20 Open-vocabulary Panoptic Segmentation with Embedding Modulation Xi Chen et.al. 2303.11324v1 null
2023-03-20 ScribbleSeg: Scribble-based Interactive Image Segmentation Xi Chen et.al. 2303.11320v1 null
2023-03-20 Generative Semantic Segmentation Jiaqi Chen et.al. 2303.11316v1 link
2023-03-20 waywiser: Ergonomic Methods for Assessing Spatial Models Michael J Mahoney et.al. 2303.11312v1 link
2023-03-17 Data-centric Artificial Intelligence: A Survey Daochen Zha et.al. 2303.10158v1 link
2023-03-17 CoVIO: Online Continual Learning for Visual-Inertial Odometry Niclas Vödisch et.al. 2303.10149v1 link
2023-03-17 CoDEPS: Online Continual Learning for Depth Estimation and Panoptic Segmentation Niclas Vödisch et.al. 2303.10147v1 link
2023-03-17 Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting Nicolai Dorka et.al. 2303.10144v1 link
2023-03-16 Efficient Diffusion Training via Min-SNR Weighting Strategy Tiankai Hang et.al. 2303.09556v1 link
2023-03-16 PartNeRF: Generating Part-Aware Editable 3D Shapes without 3D Supervision Konstantinos Tertikas et.al. 2303.09554v1 null
2023-03-16 SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving Yi Wei et.al. 2303.09551v1 link
2023-03-16 Diffusion-HPC: Generating Synthetic Images with Realistic Humans Zhenzhen Weng et.al. 2303.09541v1 link
2023-03-16 Deep Metric Learning for Unsupervised Remote Sensing Change Detection Wele Gedara Chaminda Bandara et.al. 2303.09536v1 link
2023-03-17 FateZero: Fusing Attentions for Zero-shot Text-based Video Editing Chenyang Qi et.al. 2303.09535v2 link
2023-03-16 Tackling Clutter in Radar Data – Label Generation and Detection Using PointNet++ Johannes Kopp et.al. 2303.09530v1 link
2023-03-15 Borda Regret Minimization for Generalized Linear Dueling Bandits Yue Wu et.al. 2303.08816v1 null
2023-03-15 BiFormer: Vision Transformer with Bi-Level Routing Attention Lei Zhu et.al. 2303.08810v1 link
2023-03-15 Stochastic Interpolants: A Unifying Framework for Flows and Diffusions Michael S. Albergo et.al. 2303.08797v1 null
2023-03-15 PLEX: Making the Most of the Available Data for Robotic Manipulation Pretraining Garrett Thomas et.al. 2303.08789v1 null
2023-03-14 Diversity-Aware Meta Visual Prompting Qidong Huang et.al. 2303.08138v1 link
2023-03-14 LayoutDM: Discrete Diffusion Model for Controllable Layout Generation Naoto Inoue et.al. 2303.08137v1 link
2023-03-15 Manipulate by Seeing: Creating Manipulation Controllers from Pre-Trained Representations Jianren Wang et.al. 2303.08135v2 null
2023-03-14 MeshDiffusion: Score-based Generative 3D Mesh Modeling Zhen Liu et.al. 2303.08133v1 link
2023-03-15 A Simple Framework for Open-Vocabulary Segmentation and Detection Hao Zhang et.al. 2303.08131v2 link
2023-03-14 ViperGPT: Visual Inference via Python Execution for Reasoning Dídac Surís et.al. 2303.08128v1 link
2023-03-14 Blind Video Deflickering by Neural Filtering with a Flawed Atlas Chenyang Lei et.al. 2303.08120v1 link
2023-03-14 Parameterised Approximation of the Fixation Probability of the Dominant Mutation in the Multi-Type Moran Process Leslie Ann Goldberg et.al. 2303.08118v1 null
2023-03-13 Revisiting Class-Incremental Learning with Pre-Trained Models: Generalizability and Adaptivity are All You Need Da-Wei Zhou et.al. 2303.07338v1 link
2023-03-13 Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR Feng Li et.al. 2303.07335v1 link
2023-03-13 A Smoothing Algorithm for Minimum Sensing Path Plans in Gaussian Belief Space Ali Reza Pedram et.al. 2303.07326v1 null
2023-03-13 Collision Cross-entropy and EM Algorithm for Self-labeled Classification Zhongwen Zhang et.al. 2303.07321v1 null
2023-03-13 Linear regularized 13-moment equations with Onsager boundary conditions for general gas molecules Zhenning Cai et.al. 2303.07314v1 null
2023-03-13 An efficient phase-field model of shear fractures using deviatoric stress split Ehsan Haghighat et.al. 2303.07309v1 link
2023-03-10 Multiple Hands Make Light Work: Enhancing Quality and Diversity using MAP-Elites with Multiple Parallel Evolution Strategies Manon Flageat et.al. 2303.06137v1 null
2023-03-10 Rewarding Chatbots for Real-World Engagement with Millions of Users Robert Irvine et.al. 2303.06135v1 null
2023-03-10 Imaging the crustal and upper mantle structure of the North Anatolian Fault: A Transmission Matrix Framework for Local Adaptive Focusing Rita Touma et.al. 2303.06123v1 null
2023-03-10 Ignorance is Bliss: Robust Control via Information Gating Manan Tomar et.al. 2303.06121v1 null
2023-03-11 Wave-function parametrization of a probability measure Leonardo Pedro et.al. 2303.06069v1 null
2023-03-09 Scaling up GANs for Text-to-Image Synthesis Minguk Kang et.al. 2303.05511v1 null
2023-03-09 Planning with Large Language Models for Code Generation Shun Zhang et.al. 2303.05510v1 null
2023-03-09 Cherry-Picking with Reinforcement Learning Yunchu Zhang et.al. 2303.05508v1 null
2023-03-09 TANGOS: Regularizing Tabular Neural Networks through Gradient Orthogonalization and Specialization Alan Jeffares et.al. 2303.05506v1 link
2023-03-09 Open-world Instance Segmentation: Top-down Learning with Bottom-up Supervision Tarun Kalluri et.al. 2303.05503v1 null
2023-03-09 PDSketch: Integrated Planning Domain Programming and Learning Jiayuan Mao et.al. 2303.05501v1 null
2023-03-10 Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection Shilong Liu et.al. 2303.05499v2 link
2023-03-09 Learning Stationary Markov Processes with Contrastive Adjustment Ludvig Bergenstråhle et.al. 2303.05497v1 link
2023-03-09 Sparse and Local Networks for Hypergraph Reasoning Guangxuan Xiao et.al. 2303.05496v1 null
2023-03-08 Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models Jiarui Xu et.al. 2303.04803v1 link
2023-03-08 Stabilized profunctors and stable species of structures Marcelo Fiore et.al. 2303.04795v1 null
2023-03-08 Multilevel Diffusion: Infinite Dimensional Score-Based Diffusion Models for Image Generation Paul Hagemann et.al. 2303.04772v1 link
2023-03-08 SMaLL: A Software Framework for portable Machine Learning Libraries Upasana Sridhar et.al. 2303.04769v1 null
2023-03-07 Benign Overfitting for Two-layer ReLU Networks Yiwen Kou et.al. 2303.04145v1 link
2023-03-07 Toward Defining a Domain Complexity Measure Across Domains Katarina Doctor et.al. 2303.04141v1 null
2023-03-07 Diffusion Policy: Visuomotor Policy Learning via Action Diffusion Cheng Chi et.al. 2303.04137v1 null
2023-03-07 Inadequacy of equivalent circuits in nonlinear systems with inherent memory V. Lopez-Richard et.al. 2303.04135v1 null
2023-03-07 Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction Martin Josifoski et.al. 2303.04132v1 link
2023-03-07 Foundation Models for Decision Making: Problems, Methods, and Opportunities Sherry Yang et.al. 2303.04129v1 null
2023-03-07 Private Read-Update-Write with Controllable Information Leakage for Storage-Efficient Federated Learning with Top $r$ Sparsification Sajani Vithana et.al. 2303.04123v1 null
2023-03-06 Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-Type Samplers Sitan Chen et.al. 2303.03384v1 null
2023-03-06 SUREL+: Moving from Walks to Sets for Scalable Subgraph-based Graph Representation Learning Haoteng Yin et.al. 2303.03379v1 link
2023-03-06 PaLM-E: An Embodied Multimodal Language Model Danny Driess et.al. 2303.03378v1 null
2023-03-06 MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning Mikayel Samvelyan et.al. 2303.03376v1 null
2023-03-06 Detecting Human-Object Contact in Images Yixin Chen et.al. 2303.03373v1 link
2023-03-06 ALMOST: Adversarial Learning to Mitigate Oracle-less ML Attacks via Synthesis Tuning Animesh Basak Chowdhury et.al. 2303.03372v1 null
2023-03-06 Complex Systems of Secrecy: The Offshore Networks of Oligarchs Ho-Chun Herbert Chang et.al. 2303.03371v1 null
2023-03-06 Multimodal Prompting with Missing Modalities for Visual Recognition Yi-Lun Lee et.al. 2303.03369v1 link
2023-03-06 Referring Multi-Object Tracking Dongming Wu et.al. 2303.03366v1 link
2023-03-06 Efficient Skill Acquisition for Complex Manipulation Tasks in Obstructed Environments Jun Yamada et.al. 2303.03365v1 null
2023-03-03 Unleashing Text-to-Image Diffusion Models for Visual Perception Wenliang Zhao et.al. 2303.02153v1 link
2023-03-03 Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners Renrui Zhang et.al. 2303.02151v1 link
2023-03-03 Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together! Shiwei Liu et.al. 2303.02141v1 link
2023-03-03 Eventual Discounting Temporal Logic Counterfactual Experience Replay Cameron Voloshin et.al. 2303.02135v1 null
2023-03-02 Dropout Reduces Underfitting Zhuang Liu et.al. 2303.01500v1 link
2023-03-02 Predicting Motion Plans for Articulating Everyday Objects Arjun Gupta et.al. 2303.01484v1 null
2023-03-02 Faster exact and approximation algorithms for packing and covering matroids via push-relabel Kent Quanrud et.al. 2303.01478v1 null
2023-03-01 StraIT: Non-autoregressive Generation with Stratified Image Transformer Shengju Qian et.al. 2303.00750v1 null
2023-03-01 Coordination of Multiple Robots along Given Paths with Bounded Junction Complexity Mikkel Abrahamsen et.al. 2303.00745v1 null
2023-03-01 READ Avatars: Realistic Emotion-controllable Audio Driven Avatars Jack Saunders et.al. 2303.00744v1 null
2023-03-01 R-U-SURE? Uncertainty-Aware Code Suggestions By Maximizing Utility Across Random User Intents Daniel D. Johnson et.al. 2303.00732v1 link
2023-03-01 A Systematic Analysis of Vocabulary and BPE Settings for Optimal Fine-tuning of NMT: A Case Study of In-domain Translation J. Pourmostafa Roshan Sharami et.al. 2303.00722v1 null
2023-02-28 An Efficient Tester-Learner for Halfspaces Aravind Gollakota et.al. 2302.14853v1 null
2023-02-27 Internet Explorer: Targeted Representation Learning on the Open Web Alexander C. Li et.al. 2302.14051v1 link
2023-02-27 Language Is Not All You Need: Aligning Perception with Language Models Shaohan Huang et.al. 2302.14045v1 link
2023-02-27 Permutation Equivariant Neural Functionals Allan Zhou et.al. 2302.14040v1 link
2023-02-27 Measurement of Orbital Angular Momentum of Light using Stokes Parameters and Barnett’s Formalism Anirban Debnath et.al. 2302.14025v1 null
2023-02-27 Diacritic Recognition Performance in Arabic ASR Hanan Aldarmaki et.al. 2302.14022v1 null
2023-02-27 Full Stack Optimization of Transformer Inference: a Survey Sehoon Kim et.al. 2302.14017v1 null
2023-02-24 SplineCam: Exact Visualization and Characterization of Deep Network Geometry and Decision Boundaries Ahmed Imtiaz Humayun et.al. 2302.12828v1 link
2023-02-24 Generative Models of Huge Objects Lunjia Hu et.al. 2302.12823v1 null
2023-02-24 Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data KaShun Shum et.al. 2302.12822v1 link
2023-02-24 GraphSR: A Data Augmentation Algorithm for Imbalanced Node Classification Mengting Zhou et.al. 2302.12814v1 null
2023-02-24 Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback Baolin Peng et.al. 2302.12813v1 null
2023-02-23 Change is Hard: A Closer Look at Subpopulation Shift Yuzhe Yang et.al. 2302.12254v1 link
2023-02-23 Boosting Adversarial Transferability using Dynamic Cues Muzammal Naseer et.al. 2302.12252v1 null
2023-02-23 VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion Yiming Li et.al. 2302.12251v1 link
2023-02-23 Sequence-Based Incremental Concolic Testing of RTL Models Hasini Witharana et.al. 2302.12241v1 null
2023-02-23 What makes a language easy to deep-learn? Lukas Galke et.al. 2302.12239v1 link
2023-02-23 Improving Adaptive Conformal Prediction Using Self-Supervised Learning Nabeel Seedat et.al. 2302.12238v1 link
2023-02-23 Learning Neural Volumetric Representations of Dynamic Humans in Minutes Chen Geng et.al. 2302.12237v1 link
2023-02-23 DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models Jamie Wynn et.al. 2302.12231v1 link
2023-02-22 Beyond optimal disturbances: a statistical framework for transient growth Peter Frame et.al. 2302.11564v1 null
2023-02-22 Uncovering Bias in Face Generation Models Cristian Muñoz et.al. 2302.11562v1 null
2023-02-22 Equivariant Polynomials for Graph Neural Networks Omri Puny et.al. 2302.11556v1 null
2023-02-22 RoboNinja: Learning an Adaptive Cutting Policy for Multi-Material Objects Zhenjia Xu et.al. 2302.11553v1 null
2023-02-22 Reduce, Reuse, Recycle: Compositional Generation with Energy-Based Diffusion Models and MCMC Yilun Du et.al. 2302.11552v1 link
2023-02-22 Scaling Robot Learning with Semantically Imagined Experience Tianhe Yu et.al. 2302.11550v1 null
2023-02-21 Some Fundamental Aspects about Lipschitz Continuity of Neural Network Functions Grigory Khromov et.al. 2302.10886v1 null
2023-02-21 Context-Aware Timewise VAEs for Real-Time Vehicle Trajectory Prediction Pei Xu et.al. 2302.10873v1 link
2023-02-21 Efficient CTC Regularization via Coarse Labels for End-to-End Speech Translation Biao Zhang et.al. 2302.10871v1 link
2023-02-21 Provable Copyright Protection for Generative Models Nikhil Vyas et.al. 2302.10870v1 null
2023-02-21 A Unifying Perspective on Multi-Calibration: Unleashing Game Dynamics for Multi-Objective Learning Nika Haghtalab et.al. 2302.10863v1 null
2023-02-20 Towards Universal Fake Image Detectors that Generalize Across Generative Models Utkarsh Ojha et.al. 2302.10174v1 link
2023-02-20 Identity-Based Attribute Prototypes Distinguish Communities on Twitter Thomas Magelinski et.al. 2302.10172v1 null
2023-02-20 Compressed Error HARQ: Feedback Communication on Noise-Asymmetric Channels Sravan Kumar Ankireddy et.al. 2302.10170v1 link
2023-02-20 Learning Deep Semantics for Test Completion Pengyu Nie et.al. 2302.10166v1 link
2023-02-20 Sparse PCA Beyond Covariance Thresholding Gleb Novikov et.al. 2302.10158v1 null
2023-02-17 Consistent Diffusion Models: Mitigating Sampling Drift by Learning to be Consistent Giannis Daras et.al. 2302.09057v1 link
2023-02-17 Geometric description of clustering in directed networks Antoine Allard et.al. 2302.09055v1 link
2023-02-17 MiDi: Mixed Graph and 3D Denoising Diffusion for Molecule Generation Clement Vignac et.al. 2302.09048v1 link
2023-02-17 From User Perceptions to Technical Improvement: Enabling People Who Stuter to Beter Use Speech Recognition Colin Lea et.al. 2302.09044v1 null
2023-02-17 Privately Customizing Prefinetuning to Better Match User Data in Federated Learning Charlie Hou et.al. 2302.09042v1 null
2023-02-16 Text-driven Visual Synthesis with Latent Diffusion Prior Ting-Hsuan Liao et.al. 2302.08510v1 null
2023-02-16 3D-aware Conditional Image Synthesis Kangle Deng et.al. 2302.08509v1 link
2023-02-16 The Scope of Multicalibration: Characterizing Multicalibration via Property Elicitation Georgy Noarov et.al. 2302.08507v1 null
2023-02-15 Target Specific De Novo Design of Drug Candidate Molecules with Graph Transformer-based Generative Adversarial Networks Atabey Ünlü et.al. 2302.07868v1 link
2023-02-15 Learning Performance-Improving Code Edits Aman Madaan et.al. 2302.07867v1 link
2023-02-15 Dataset Interfaces: Diagnosing Model Failures Using Controllable Counterfactual Generation Joshua Vendrow et.al. 2302.07865v1 link
2023-02-15 Big Little Transformer Decoder Sehoon Kim et.al. 2302.07863v1 link
2023-02-15 One-Shot Face Video Re-enactment using Hybrid Latent Spaces of StyleGAN2 Trevine Oorloff et.al. 2302.07848v1 null
2023-02-15 NL2CMD: An Updated Workflow for Natural Language to Bash Commands Translation Quchen Fu et.al. 2302.07845v1 link
2023-02-14 Where to Diffuse, How to Diffuse, and How to Get Back: Automated Learning for Multivariate Diffusions Raghav Singhal et.al. 2302.07261v1 null
2023-02-14 ChatCAD: Interactive Computer-Aided Diagnosis on Medical Image using Large Language Models Sheng Wang et.al. 2302.07257v1 link
2023-02-14 Energy Transformer Benjamin Hoover et.al. 2302.07253v1 link
2023-02-14 Generation Probabilities Are Not Enough: Exploring the Effectiveness of Uncertainty Highlighting in AI-Powered Code Completions Helena Vasconcelos et.al. 2302.07248v1 null
2023-02-14 A Deep Probabilistic Spatiotemporal Framework for Dynamic Graph Representation Learning with Application to Brain Disorder Identification Junn Yong Loo et.al. 2302.07243v1 null
2023-02-14 Parker Solar Probe Observations of High Plasma Beta Solar Wind from Streamer Belt Jia Huang et.al. 2302.07230v1 null
2023-02-13 3D-aware Blending with Generative NeRFs Hyunsu Kim et.al. 2302.06608v1 link
2023-02-13 Generative Adversarial Equilibrium Solvers Denizalp Goktas et.al. 2302.06607v1 null
2023-02-13 Breaking the Curse of Multiagency: Provably Efficient Decentralized Multi-Agent RL with Function Approximation Yuanhao Wang et.al. 2302.06606v1 null
2023-02-13 FilFL: Accelerating Federated Learning via Client Filtering Fares Fourati et.al. 2302.06599v1 null
2023-02-13 The Impact of AI on Developer Productivity: Evidence from GitHub Copilot Sida Peng et.al. 2302.06590v1 null
2023-02-13 Improving Out-of-Distribution Generalization of Neural Rerankers with Contextualized Late Interaction Xinyu Zhang et.al. 2302.06589v1 null
2023-02-13 Raising the Cost of Malicious AI-Powered Image Editing Hadi Salman et.al. 2302.06588v1 link
2023-02-13 AbLit: A Resource for Analyzing and Generating Abridged Versions of English Literature Melissa Roemmele et.al. 2302.06579v1 link
2023-02-10 Project and Probe: Sample-Efficient Domain Adaptation by Interpolating Orthogonal Features Annie S. Chen et.al. 2302.05441v1 null
2023-02-09 RelightableHands: Efficient Neural Relighting of Articulated Hand Models Shun Iwase et.al. 2302.04866v1 null
2023-02-09 Polynomial Neural Fields for Subband Decomposition and Manipulation Guandao Yang et.al. 2302.04862v1 link
2023-02-09 Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning Zhuolin Yang et.al. 2302.04858v1 null
2023-02-09 One-shot Visual Imitation via Attributed Waypoints and Demonstration Augmentation Matthew Chang et.al. 2302.04856v1 null
2023-02-09 SparseProp: Efficient Sparse Backpropagation for Faster Training of Neural Networks Mahdi Nikdan et.al. 2302.04852v1 link
2023-02-09 Robot Synesthesia: A Sound and Emotion Guided AI Painter Vihaan Misra et.al. 2302.04850v1 link
2023-02-09 Accurate and Interpretable Solution of the Inverse Rig for Realistic Blendshape Models with Quadratic Corrective Terms Stevo Racković et.al. 2302.04843v1 null
2023-02-09 Is This Loss Informative? Speeding Up Textual Inversion with Deterministic Objective Evaluation Anton Voronov et.al. 2302.04841v1 link
2023-02-08 PFGM++: Unlocking the Potential of Physics-Inspired Generative Models Yilun Xu et.al. 2302.04265v1 link
2023-02-08 Learning How to Infer Partial MDPs for In-Context Adaptation and Exploration Chentian Jiang et.al. 2302.04250v1 null
2023-02-08 Federated Minimax Optimization with Client Heterogeneity Pranay Sharma et.al. 2302.04249v1 null
2023-02-08 Shortcut Detection with Variational Autoencoders Nicolas M. Müller et.al. 2302.04246v1 link
2023-02-07 Long Horizon Temperature Scaling Andy Shih et.al. 2302.03686v1 link
2023-02-07 Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications Johannes Kirschner et.al. 2302.03683v1 null
2023-02-07 Auditing Gender Presentation Differences in Text-to-Image Models Yanzhe Zhang et.al. 2302.03675v1 link
2023-02-07 Proportionality in Approval-Based Participatory Budgeting Markus Brill et.al. 2302.03672v1 null
2023-02-07 Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery Yuxin Wen et.al. 2302.03668v1 link
2023-02-07 HumanMAC: Masked Motion Completion for Human Motion Prediction Ling-Hao Chen et.al. 2302.03665v1 link
2023-02-07 SDYN-GANs: Adversarial Learning Methods for Multistep Generative Models for General Order Stochastic Dynamics Panos Stinis et.al. 2302.03663v1 null
2023-02-06 Zero-shot Image-to-Image Translation Gaurav Parmar et.al. 2302.03027v1 link
2023-02-06 AIM: Adapting Image Models for Efficient Video Action Recognition Taojiannan Yang et.al. 2302.03024v1 null
2023-02-06 Geometry of contact: contact planning for multi-legged robots via spin models duality Baxi Chong et.al. 2302.03019v1 null
2023-02-06 Structure and Content-Guided Video Synthesis with Diffusion Models Patrick Esser et.al. 2302.03011v1 null
2023-02-06 A novel Doppler backscattering (DBS) system to simultaneously monitor radio frequency plasma fluctuations and low frequency turbulence S. Chowdhury et.al. 2302.03009v1 null
2023-02-03 Understanding the Issues, Their Causes and Solutions in Microservices Systems: An Empirical Study Muhammad Waseem et.al. 2302.01894v1 null
2023-02-03 Enhancing Once-For-All: A Study on Parallel Blocks, Skip Connections and Early Exits Simone Sarti et.al. 2302.01888v1 null
2023-02-03 Analyzing the impact of climate change on critical infrastructure from the scientific literature: A weakly supervised NLP approach Tanwi Mallick et.al. 2302.01887v1 null
2023-02-03 LIDAR-based Stabilization, Navigation and Localization for UAVs Operating in Dark Indoor Environments Matěj Petrl' ik et.al. 2302.01883v1 null
2023-02-03 IKEA-Manual: Seeing Shape Assembly Step by Step Ruocheng Wang et.al. 2302.01881v1 null
2023-02-02 STEPS: Joint Self-supervised Nighttime Image Enhancement and Depth Estimation Yupeng Zheng et.al. 2302.01334v1 link
2023-02-02 Bayesian Metric Learning for Uncertainty Quantification in Image Retrieval Frederik Warburg et.al. 2302.01332v1 link
2023-02-02 SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections Zhaoxi Chen et.al. 2302.01330v1 link
2023-02-02 Dreamix: Video Diffusion Models are General Video Editors Eyal Molad et.al. 2302.01329v1 null
2023-02-02 $IC^3$ : Image Captioning by Committee Consensus David M. Chan et.al. 2302.01328v1 link
2023-02-02 Randomized Greedy Learning for Non-monotone Stochastic Submodular Maximization Under Full-bandit Feedback Fares Fourati et.al. 2302.01324v1 null
2023-02-02 Signatures for strong-field QED physics in the quantum limit of beamstrahlung W. L. Zhang et.al. 2302.01321v1 null
2023-02-01 Improving Few-Shot Generalization by Exploring and Exploiting Auxiliary Data Alon Albalak et.al. 2302.00674v1 link
2023-02-01 ‘Generative CI’ through Collective Response Systems Aviv Ovadya et.al. 2302.00672v1 null
2023-02-01 Efficient Multi-Task Reinforcement Learning via Selective Behavior Sharing Grace Zhang et.al. 2302.00671v1 null
2023-02-01 Stable Target Field for Reduced Variance Score Estimation in Diffusion Models Yilun Xu et.al. 2302.00670v1 link
2023-02-01 Does Vision Accelerate Hierarchical Generalization of Neural Language Learners? Tatsuki Kuribayashi et.al. 2302.00667v1 null
2023-02-01 Extrinsic Calibration of 2D mm-Wavelength Radar Pairs Using Ego-Velocity Estimates Qilong Cheng et.al. 2302.00660v1 null
2023-02-01 Graph Neural Operators for Classification of Spatial Transcriptomics Data Junaid Ahmed et.al. 2302.00658v1 null
2023-01-31 Reverse engineering adversarial attacks with fingerprints from adversarial examples David Aaron Nicholson et.al. 2301.13869v1 null
2023-01-31 PADL: Language-Directed Physics-Based Character Control Jordan Juravsky et.al. 2301.13868v1 link
2023-01-31 Zero-Memory Graph Exploration with Unknown Inports Hans-Joachim Böckenhauer et.al. 2301.13860v1 null
2023-01-31 Interpreting Robustness Proofs of Deep Neural Networks Debangshu Banerjee et.al. 2301.13845v1 null
2023-01-31 Do Multi-Document Summarization Models Synthesize? Jay DeYoung et.al. 2301.13844v1 null
2023-01-31 RIS-Assisted Interference Mitigation for Uplink NOMA Azadeh Tabeshnezhad et.al. 2301.13841v1 null
2023-01-30 Looped Transformers as Programmable Computers Angeliki Giannou et.al. 2301.13196v1 null
2023-01-30 Adaptive Computation with Elastic Input Sequence Fuzhao Xue et.al. 2301.13195v1 link
2023-01-30 Audio-Visual Segmentation with Semantics Jinxing Zhou et.al. 2301.13190v1 link
2023-01-30 Extracting Training Data from Diffusion Models Nicholas Carlini et.al. 2301.13188v1 null
2023-01-30 Weighted flow diffusion for local graph clustering with node attributes: an algorithm and statistical guarantees Shenghao Yang et.al. 2301.13187v1 link
2023-01-30 Optimal Decision Tree Policies for Markov Decision Processes Daniël Vos et.al. 2301.13185v1 link
2023-01-27 Incorporating Background Knowledge in Symbolic Regression using a Computer Algebra System Charles Fox et.al. 2301.11919v1 null
2023-01-27 OccRob: Efficient SMT-Based Occlusion Robustness Verification of Deep Neural Networks Xingwu Guo et.al. 2301.11912v1 null
2023-01-27 Multi-dimensional concept discovery (MCD): A unifying framework with completeness guarantees Johanna Vielhaben et.al. 2301.11911v1 link
2023-01-27 Tree-structured Policy Planning with Learned Behavior Models Yuxiao Chen et.al. 2301.11902v1 null
2023-01-26 Conservative Safety Monitors of Stochastic Dynamical Systems Matthew Cleaveland et.al. 2301.11330v1 null
2023-01-26 MusicLM: Generating Music From Text Andrea Agostinelli et.al. 2301.11325v1 null
2023-01-26 Joint Training of Deep Ensembles Fails Due to Learner Collusion Alan Jeffares et.al. 2301.11323v1 null
2023-01-26 Cut and Learn for Unsupervised Object Detection and Instance Segmentation Xudong Wang et.al. 2301.11320v1 link
2023-01-26 Learning Good Features to Transfer Across Tasks and Domains Pierluigi Zama Ramirez et.al. 2301.11310v1 null
2023-01-26 SemSup-XC: Semantic Supervision for Zero and Few-shot Extreme Classification Pranjal Aggarwal et.al. 2301.11309v1 link
2023-01-26 Neural Continuous-Discrete State Space Models for Irregularly-Sampled Time Series Abdul Fatir Ansari et.al. 2301.11308v1 link
2023-01-26 DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature Eric Mitchell et.al. 2301.11305v1 link
2023-01-25 Fillers in Spoken Language Understanding: Computational and Psycholinguistic Perspectives Tanvi Dinkar et.al. 2301.10761v1 null
2023-01-25 Efficient Flow-Guided Multi-frame De-fencing Stavros Tsogkas et.al. 2301.10759v1 null
2023-01-25 Room-Temperature Sputtered Ultralow-loss Silicon Nitride for Hybrid Photonic Integration Shuangyou Zhang et.al. 2301.10758v1 null
2023-01-25 Generating large-scale network analyses of scientific landscapes in seconds using Dimensions on Google BigQuery Michele Pasin et.al. 2301.10736v1 null
2023-01-25 The Synchronic Web Thien-Nam Dinh et.al. 2301.10733v1 null
2023-01-24 A Watermark for Large Language Models John Kirchenbauer et.al. 2301.10226v1 link
2023-01-24 Evolution of cooperation under a generalized death-birth process Chaoqian Wang et.al. 2301.10205v1 null
2023-01-24 A general epidemic model and its application to mask design considering different preferences towards masks Chaoqian Wang et.al. 2301.10202v1 null
2023-01-23 InfiniCity: Infinite-Scale City Synthesis Chieh Hubert Lin et.al. 2301.09637v1 null
2023-01-23 Feature construction using explanations of individual predictions Boštjan Vouk et.al. 2301.09631v1 null
2023-01-23 Tracking the industrial growth of modern China with high-resolution panchromatic imagery: A sequential convolutional approach Ethan Brewer et.al. 2301.09620v1 null
2023-01-23 Asymptotic Convergence and Performance of Multi-Agent Q-Learning Dynamics Aamal Abbas Hussain et.al. 2301.09619v1 null
2023-01-20 The stochastic digital human is now enrolling for in silico imaging trials – Methods and tools for generating digital cohorts A Badano et.al. 2301.08719v1 null
2023-01-20 Massively Parallel Genetic Optimization through Asynchronous Propagation of Populations Oskar Taubert et.al. 2301.08713v1 link
2023-01-19 Multiview Compressive Coding for 3D Reconstruction Chao-Yuan Wu et.al. 2301.08247v1 link
2023-01-19 Booster: a Benchmark for Depth from Images of Specular and Transparent Surfaces Pierluigi Zama Ramirez et.al. 2301.08245v1 null
2023-01-19 Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture Mahmoud Assran et.al. 2301.08243v1 link
2023-01-19 Radiation-induced secondary emissions in solid-state devices as a possible contribution to quasiparticle poisoning of superconducting circuits Francisco Ponce et.al. 2301.08239v1 null
2023-01-18 Robust Zero-crossings Detection in Noisy Signals using Topological Signal Processing Sunia Tanweer et.al. 2301.07703v1 null
2023-01-18 Learning 3D-aware Image Synthesis with Unknown Pose Distribution Zifan Shi et.al. 2301.07702v1 null
2023-01-18 Prony-Based Super-Resolution Phase Retrieval of Sparse, Multivariate Signals Robert Beinert et.al. 2301.07696v1 null
2023-01-18 Private Federated Submodel Learning via Private Set Union Zhusheng Wang et.al. 2301.07686v1 null
2023-01-18 SFQEDtoolkit: a high-performance library for the accurate modeling of strong-field QED processes in PIC and Monte Carlo codes Samuele Montefiori et.al. 2301.07684v1 link
2023-01-18 OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation Tong Wu et.al. 2301.07525v1 null
2023-01-17 Three Dimensional Odd Viscosity in Ferrofluids with Vorticity-Magnetization Coupling Dylan Reynolds et.al. 2301.07096v1 null
2023-01-17 On the State of German (Abstractive) Text Summarization Dennis Aumiller et.al. 2301.07095v1 link
2023-01-17 Learning Customized Visual Models with Retrieval-Augmented Knowledge Haotian Liu et.al. 2301.07094v1 link
2023-01-17 GLIGEN: Open-Set Grounded Text-to-Image Generation Yuheng Li et.al. 2301.07093v1 link
2023-01-17 Vision Learners Meet Web Image-Text Pairs Bingchen Zhao et.al. 2301.07088v1 null
2023-01-17 MooseNet: A trainable metric for synthesized speech with plda backend Ondřej Plátek et.al. 2301.07087v1 link
2023-01-17 Transformers as Algorithms: Generalization and Implicit Model Selection in In-context Learning Yingcong Li et.al. 2301.07067v1 link
2023-01-13 Non-Stochastic CDF Estimation Using Threshold Queries Princewill Okoroafor et.al. 2301.05682v1 null
2023-01-12 See, Think, Confirm: Interactive Prompting Between Vision and Language Models for Knowledge-based Visual Reasoning Zhenfang Chen et.al. 2301.05226v1 null
2023-01-12 Domain Expansion of Image Generators Yotam Nitzan et.al. 2301.05225v1 null
2023-01-12 Guiding Text-to-Image Diffusion Model Towards Grounded Generation Ziyi Li et.al. 2301.05221v1 null
2023-01-12 Adversarial Adaptation for French Named Entity Recognition Arjun Choudhry et.al. 2301.05220v1 link
2023-01-12 NDNSD: Service Publishing and Discovery in NDN Saurab Dulal et.al. 2301.05218v1 null

(<a href=#Updated-on-20240404>back to top</a>)

generation

Publish Date Title Authors PDF Code
2024-04-03 Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction Keyu Tian et.al. 2404.02905v1 link
2024-04-03 LidarDM: Generative LiDAR Simulation in a Generated World Vlas Zyrianov et.al. 2404.02903v1 null
2024-04-03 DeiT-LT Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets Harsh Rangwani et.al. 2404.02900v1 link
2024-04-03 MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment Duygu Ceylan et.al. 2404.02899v1 null
2024-04-03 A Mean Field Game Model for Timely Computation in Edge Computing Systems Shubham Aggarwal et.al. 2404.02898v1 null
2024-04-03 Deep Image Composition Meets Image Forgery Eren Tahir et.al. 2404.02897v1 link
2024-04-03 ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline Yifan Xu et.al. 2404.02893v1 null
2024-04-03 PoCo: Point Context Cluster for RGBD Indoor Place Recognition Jing Liang et.al. 2404.02885v1 null
2024-04-02 Segment Any 3D Object with Language Seungjun Lee et.al. 2404.02157v1 null
2024-04-02 Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration Akshay Dudhane et.al. 2404.02154v1 null
2024-04-02 GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image Chong Bao et.al. 2404.02152v1 null
2024-04-02 Diffusion $^2$ : Dynamic 3D Content Generation via Score Composition of Orthogonal Diffusion Models Zeyu Yang et.al. 2404.02148v1 link
2024-04-02 Harder, Better, Faster, Stronger: Interactive Visualization for Human-Centered AI Tools Md Naimul Hoque et.al. 2404.02147v1 null
2024-04-02 Iterated Learning Improves Compositionality in Large Vision-Language Models Chenhao Zheng et.al. 2404.02145v1 null
2024-04-02 Multiparametric quantification and visualization of liver fat using ultrasound Jihye Baek et.al. 2404.02143v1 null
2024-03-29 Gecko: Versatile Text Embeddings Distilled from Large Language Models Jinhyuk Lee et.al. 2403.20327v1 null
2024-03-29 Shaving Logs via Large Sieve Inequality: Faster Algorithms for Sparse Convolution and More Ce Jin et.al. 2403.20326v1 null
2024-03-29 Structure and Dynamics of Magneto-Inertial, Differentially Rotating Laboratory Plasmas V. Valenzuela-Villaseca et.al. 2403.20321v1 null
2024-03-29 SeaBird: Segmentation in Bird’s View with Dice Loss Improves Monocular 3D Detection of Large Objects Abhinav Kumar et.al. 2403.20318v1 link
2024-03-29 Convolutional Prompting meets Language Models for Continual Learning Anurag Roy et.al. 2403.20317v1 null
2024-03-29 Optimal Communication for Classic Functions in the Coordinator Model and Beyond Hossein Esfandiari et.al. 2403.20307v1 null
2024-03-28 GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling Bowen Zhang et.al. 2403.19655v1 null
2024-03-28 Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond Katherine Xu et.al. 2403.19653v1 link
2024-03-28 InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction Sirui Xu et.al. 2403.19652v1 null
2024-03-28 GraspXL: Generating Grasping Motions for Diverse Objects at Scale Hui Zhang et.al. 2403.19649v1 null
2024-03-28 Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models Samuel Marks et.al. 2403.19647v1 link
2024-03-28 GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models Yusuf Dalva et.al. 2403.19645v1 null
2024-03-27 Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark Ziyang Chen et.al. 2403.18821v1 null
2024-03-27 MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering Guoxing Sun et.al. 2403.18820v1 null
2024-03-27 ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion Daniel Winter et.al. 2403.18818v1 null
2024-03-27 Garment3DGen: 3D Garment Stylization and Texture Generation Nikolaos Sarafianos et.al. 2403.18816v1 null
2024-03-27 Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models Yanwei Li et.al. 2403.18814v1 link
2024-03-27 Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment Li Siyao et.al. 2403.18811v1 null
2024-03-28 ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation Suraj Patni et.al. 2403.18807v2 link
2024-03-26 ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis Muhammad Hamza Mughal et.al. 2403.17936v1 null
2024-03-26 OmniVid: A Generative Framework for Universal Video Understanding Junke Wang et.al. 2403.17935v1 link
2024-03-26 SLEDGE: Synthesizing Simulation Environments for Driving Agents with Generative Models Kashyap Chitta et.al. 2403.17933v1 null
2024-03-26 MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution Wei Tao et.al. 2403.17927v1 null
2024-03-26 AID: Attention Interpolation of Text-to-Image Diffusion Qiyuan He et.al. 2403.17924v1 link
2024-03-26 The Need for Speed: Pruning Transformers with One Recipe Samir Khaki et.al. 2403.17921v1 link
2024-03-26 TC4D: Trajectory-Conditioned Text-to-4D Generation Sherwin Bahmani et.al. 2403.17920v1 null
2024-03-26 AgentStudio: A Toolkit for Building General Virtual Agents Longtao Zheng et.al. 2403.17918v1 null
2024-03-25 Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning Sicong Pan et.al. 2403.16803v1 null
2024-03-25 Iterative Refinement of Project-Level Code Context for Precise Code Generation with Compiler Feedback Zhangqian Bi et.al. 2403.16792v1 null
2024-03-25 Iso-Diffusion: Improving Diffusion Probabilistic Models Using the Isotropy of the Additive Gaussian Noise Dilum Fernando et.al. 2403.16790v1 null
2024-03-25 HPL-ESS: Hybrid Pseudo-Labeling for Unsupervised Event-based Semantic Segmentation Linglin Jing et.al. 2403.16788v1 null
2024-03-25 Creating a Digital Twin of Spinal Surgery: A Proof of Concept Jonas Hein et.al. 2403.16736v1 null
2024-03-25 Improving Diffusion Models’s Data-Corruption Resistance using Scheduled Pseudo-Huber Loss Artem Khrapov et.al. 2403.16728v1 link
2024-03-22 DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data Hanrong Ye et.al. 2403.15389v1 null
2024-03-22 LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis Kevin Xie et.al. 2403.15385v1 null
2024-03-22 ThemeStation: Generating Theme-Aware 3D Assets from Few Exemplars Zhenwei Wang et.al. 2403.15383v1 null
2024-03-22 DragAPart: Learning a Part-Level Motion Prior for Articulated Objects Ruining Li et.al. 2403.15382v1 null
2024-03-22 Long-CLIP: Unlocking the Long-Text Capability of CLIP Beichen Zhang et.al. 2403.15378v1 link
2024-03-22 InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding Yi Wang et.al. 2403.15377v1 link
2024-03-22 A Modular, End-to-End Next-Generation Network Testbed: Towards a Fully Automated Network Management Platform Ali Chouman et.al. 2403.15376v1 null
2024-03-21 Zero-Shot Multi-Object Shape Completion Shun Iwase et.al. 2403.14628v1 null
2024-03-21 MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images Yuedong Chen et.al. 2403.14627v1 link
2024-03-21 Simplified Diffusion Schrödinger Bridge Zhicong Tang et.al. 2403.14623v1 link
2024-03-21 GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation Yinghao Xu et.al. 2403.14621v1 link
2024-03-21 ClusteringSDF: Self-Organized Neural Implicit Surfaces for 3D Decomposition Tianhao Wu et.al. 2403.14619v1 null
2024-03-21 Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion Xiang Fan et.al. 2403.14617v1 null
2024-03-21 Hierarchical Text-to-Vision Self Supervised Alignment for Improved Histopathology Representation Learning Hasindri Watawana et.al. 2403.14616v1 link
2024-03-21 DreamReward: Text-to-3D Generation with Human Preference Junliang Ye et.al. 2403.14613v1 null
2024-03-21 Explorative Inbetweening of Time and Space Haiwen Feng et.al. 2403.14611v1 null
2024-03-20 On Pretraining Data Diversity for Self-Supervised Learning Hasan Abed Al Kader Hammoud et.al. 2403.13808v1 link
2024-03-20 Editing Massive Concepts in Text-to-Image Diffusion Models Tianwei Xiong et.al. 2403.13807v1 link
2024-03-20 Learning from Models and Data for Visual Grounding Ruozhen He et.al. 2403.13804v1 null
2024-03-20 Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments Yang Yang et.al. 2403.13803v1 link
2024-03-20 ZigMa: Zigzag Mamba Diffusion Model Vincent Tao Hu et.al. 2403.13802v1 link
2024-03-20 Natural Language as Polices: Reasoning for Coordinate-Level Embodied Control with LLMs Yusuke Mikami et.al. 2403.13801v1 link
2024-03-20 TimeRewind: Rewinding Time with Image-and-Events Video Diffusion Jingxi Chen et.al. 2403.13800v1 null
2024-03-20 Reverse Training to Nurse the Reversal Curse Olga Golovneva et.al. 2403.13799v1 null
2024-03-20 Hierarchical NeuroSymbolic Approach for Action Quality Assessment Lauren Okamoto et.al. 2403.13798v1 null
2024-03-20 Bridge the Modality and Capacity Gaps in Vision-Language Model Selection Chao Yi et.al. 2403.13797v1 null
2024-03-19 LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression Zhuoshi Pan et.al. 2403.12968v1 link
2024-03-19 Wear-Any-Way: Manipulable Virtual Try-on via Sparse Correspondence Alignment Mengting Chen et.al. 2403.12965v1 null
2024-03-19 Negative Yields Positive: Unified Dual-Path Adapter for Vision-Language Models Ce Zhang et.al. 2403.12964v1 link
2024-03-19 FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis Linjiang Huang et.al. 2403.12963v1 link
2024-03-19 TexTile: A Differentiable Metric for Texture Tileability Carlos Rodriguez-Pardo et.al. 2403.12961v1 null
2024-03-19 FaceXFormer: A Unified Transformer for Facial Analysis Kartik Narayan et.al. 2403.12960v1 link
2024-03-19 GVGEN: Text-to-3D Generation with Volumetric Representation Xianglong He et.al. 2403.12957v1 null
2024-03-19 Abiogenesis: a possible quantum interpretation of the telepoietic conjecture Vittorio Cocchi et.al. 2403.12955v1 null
2024-03-19 Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models Elaine Sui et.al. 2403.12952v1 link
2024-03-18 RIS-aided Single-frequency 3D Imaging by Exploiting Multi-view Image Correlations Yixuan Huang et.al. 2403.11764v1 null
2024-03-19 Full-Duplex MU-MIMO Systems with Coarse Quantization: How Many Bits Do We Need? Seunghyeong Yoo et.al. 2403.11762v2 null
2024-03-18 Why E.T. Can’t Phone Home: A Global View on IP-based Geoblocking at VoWiFi Gabriel Karl Gegenhuber et.al. 2403.11759v1 null
2024-03-18 Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs M. Jehanzeb Mirza et.al. 2403.11755v1 link
2024-03-18 Asymptotically Optimal Codes for $(t,s)$ -Burst Error Yubo Sun et.al. 2403.11750v1 null
2024-03-18 Embedded Named Entity Recognition using Probing Classifiers Nicholas Popovič et.al. 2403.11747v1 null
2024-03-18 Revisiting Tensor Basis Neural Networks for Reynolds stress modeling: application to plane channel and square duct flows Jiayi Cai et.al. 2403.11746v1 null
2024-03-18 Matter and cosmogenesis in Kant’s Theory of the Heavens Garance Benoit et.al. 2403.11710v1 null
2024-03-18 Significant impact of light-matter strong coupling on chiral nonlinear optical effect Daichi Okada et.al. 2403.11709v1 null
2024-03-18 Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models Emilian Postolache et.al. 2403.11706v1 link
2024-03-18 Virbo: Multimodal Multilingual Avatar Video Generation in Digital Marketing Juan Zhang et.al. 2403.11700v1 null
2024-03-18 Urban Scene Diffusion through Semantic Occupancy Map Junge Zhang et.al. 2403.11697v1 null
2024-03-18 Generalization error of spectral algorithms Maksim Velikanov et.al. 2403.11696v1 null
2024-03-18 Beamforming Design for Semantic-Bit Coexisting Communication System Maojun Zhang et.al. 2403.11693v1 null
2024-03-15 P-MapNet: Far-seeing Map Generator Enhanced by both SDMap and HDMap Priors Zhou Jiang et.al. 2403.10521v1 null
2024-03-15 Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives Ronghui Li et.al. 2403.10518v1 link
2024-03-15 FeatUp: A Model-Agnostic Framework for Features at Any Resolution Stephanie Fu et.al. 2403.10516v1 link
2024-03-15 A Novel Framework for Multi-Person Temporal Gaze Following and Social Gaze Prediction Anshul Gupta et.al. 2403.10511v1 null
2024-03-15 Demystifying Faulty Code with LLM: Step-by-Step Reasoning for Explainable Fault Localization Ratnadira Widyasari et.al. 2403.10507v1 null
2024-03-15 Belief Change based on Knowledge Measures Umberto Straccia et.al. 2403.10502v1 null
2024-03-14 SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior Huan-ang Gao et.al. 2403.09638v1 null
2024-03-14 GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping Yuhang Zheng et.al. 2403.09637v1 link
2024-03-14 Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference Piotr Nawrot et.al. 2403.09636v1 null
2024-03-14 OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning Lingyi Hong et.al. 2403.09634v1 null
2024-03-14 Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image Yiqun Mei et.al. 2403.09632v1 null
2024-03-14 3D-VLA: A 3D Vision-Language-Action Generative World Model Haoyu Zhen et.al. 2403.09631v1 null
2024-03-14 Generalized Predictive Model for Autonomous Driving Jiazhi Yang et.al. 2403.09630v1 link
2024-03-14 Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking Eric Zelikman et.al. 2403.09629v1 link
2024-03-14 Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation Fangfu Liu et.al. 2403.09625v1 null
2024-03-14 Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering Zeyu Liu et.al. 2403.09622v1 null
2024-03-13 FastMAC: Stochastic Spectral Sampling of Correspondence Graph Yifei Zhang et.al. 2403.08770v1 link
2024-03-13 VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Enric Corona et.al. 2403.08764v1 null
2024-03-13 A local model for the optical energy and momentum transfer in dielectric media and the microscopic origin of Abraham’s force density B. Anghinoni et.al. 2403.08752v1 null
2024-03-13 iCONTRA: Toward Thematic Collection Design Via Interactive Concept Transfer Dinh-Khoi Vo et.al. 2403.08746v1 link
2024-03-12 Rethinking Generative Large Language Model Evaluation for Semantic Comprehension Fangyun Wei et.al. 2403.07872v1 null
2024-03-12 TeleMoMa: A Modular and Versatile Teleoperation System for Mobile Manipulation Shivin Dass et.al. 2403.07869v1 null
2024-03-12 Exploring Safety Generalization Challenges of Large Language Models via Code Qibing Ren et.al. 2403.07865v1 null
2024-03-12 Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation Shihao Zhao et.al. 2403.07860v1 link
2024-03-12 Fairness Feedback Loops: Training on Synthetic Data Amplifies Bias Sierra Wyllie et.al. 2403.07857v1 null
2024-03-12 Quantifying and Mitigating Privacy Risks for Tabular Generative Models Chaoyi Zhu et.al. 2403.07842v1 null
2024-03-11 A representation-learning game for classes of prediction tasks Neria Uzan et.al. 2403.06971v1 null
2024-03-11 The pitfalls of next-token prediction Gregor Bachmann et.al. 2403.06963v1 link
2024-03-11 Optimizing Latent Graph Representations of Surgical Scenes for Zero-Shot Domain Transfer Siddhant Satyanaik et.al. 2403.06953v1 null
2024-03-11 SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data Jialu Li et.al. 2403.06952v1 null
2024-03-08 Tell, Don’t Show!: Language Guidance Eases Transfer Across Domains in Images and Videos Tarun Kalluri et.al. 2403.05535v1 null
2024-03-08 Tune without Validation: Searching for Learning Rate and Weight Decay on Training Sets Lorenzo Brigato et.al. 2403.05532v1 null
2024-03-08 Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context Machel Reid et.al. 2403.05530v1 null
2024-03-08 The Computational Complexity of Learning Gaussian Single-Index Models Alex Damian et.al. 2403.05529v1 null
2024-03-08 GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM Hao Kang et.al. 2403.05527v1 link
2024-03-08 Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapola Yijiang Li et.al. 2403.05523v1 null
2024-03-08 Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought James Chua et.al. 2403.05518v1 link
2024-03-07 BloomGML: Graph Machine Learning through the Lens of Bilevel Optimization Amber Yijia Zheng et.al. 2403.04763v1 link
2024-03-07 Lifelong Intelligence Beyond the Edge using Hyperdimensional Computing Xiaofan Yu et.al. 2403.04759v1 link
2024-03-07 KnowledgeVIS: Interpreting Language Models by Comparing Fill-in-the-Blank Prompts Adam Coscia et.al. 2403.04758v1 link
2024-03-07 Preliminary Guidelines For Combining Data Integration and Visual Data Analysis Adam Coscia et.al. 2403.04757v1 link
2024-03-07 Mechanism for Decision-aware Collaborative Federated Learning: A Pitfall of Shapley Values Meng Qi et.al. 2403.04753v1 null
2024-03-07 JAX-SPH: A Differentiable Smoothed Particle Hydrodynamics Framework Artur P. Toshev et.al. 2403.04750v1 link
2024-03-07 A General Calibrated Regret Metric for Detecting and Mitigating Human-Robot Interaction Failures Kensuke Nakamura et.al. 2403.04745v1 null
2024-03-06 Backtracing: Retrieving the Cause of the Query Rose E. Wang et.al. 2403.03956v1 link
2024-03-06 3D Diffusion Policy Yanjie Ze et.al. 2403.03954v1 link
2024-03-06 Bridging Language and Items for Retrieval and Recommendation Yupeng Hou et.al. 2403.03952v1 link
2024-03-06 Can Audio Reveal Music Performance Difficulty? Insights from the Piano Syllabus Dataset Pedro Ramoneda et.al. 2403.03947v1 null
2024-03-06 Separate and Detailed Treatment of Absolute Signal and Noise Enables NMR Under Adverse Circumstances A Guinness et.al. 2403.03943v1 null
2024-03-06 The Heuristic Core: Understanding Subnetwork Generalization in Pretrained Language Models Adithya Bhaskar et.al. 2403.03942v1 link
2024-03-06 GUIDE: Guidance-based Incremental Learning with Diffusion Models Bartosz Cywiński et.al. 2403.03938v1 link
2024-03-05 LC-Tsalis-INF: Generalized Best-of-Both-Worlds Linear Contextual Bandits Masahiro Kato et.al. 2403.03219v1 null
2024-03-05 The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning Nathaniel Li et.al. 2403.03218v1 null
2024-03-05 Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion Meng Zheng et.al. 2403.03217v1 null
2024-03-05 A Safety-Critical Framework for UGVs in Complex Environments: A Data-Driven Discrepancy-Aware Approach Skylar X. Wei et.al. 2403.03215v1 null
2024-03-05 Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Patrick Esser et.al. 2403.03206v1 null
2024-03-05 CLEVR-POC: Reasoning-Intensive Visual Question Answering in Partially Observable Environments Savitha Sam Abraham et.al. 2403.03203v1 null
2024-03-03 Bandit Profit-maximization for Targeted Marketing Joon Suk Huh et.al. 2403.01361v1 null
2024-03-03 ModelWriter: Text & Model-Synchronized Document Engineering Platform Ferhat Erata et.al. 2403.01359v1 null
2024-03-03 Improving Uncertainty Sampling with Bell Curve Weight Function Zan-Kai Chong et.al. 2403.01352v1 null
2024-03-03 Efficient FIR filtering with Bit Layer Multiply Accumulator Vincenzo Liguori et.al. 2403.01351v1 null
2024-03-02 ShapeBoost: Boosting Human Shape Estimation with Part-Based Parameterization and Clothing-Preserving Augmentation Siyuan Bian et.al. 2403.01345v1 null
2024-02-29 DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models Muyang Li et.al. 2402.19481v1 link
2024-02-29 Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers Tsai-Shien Chen et.al. 2402.19479v1 null
2024-02-29 Learning a Generalized Physical Face Model From Data Lingchen Yang et.al. 2402.19477v1 null
2024-02-29 The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations? Alex Gu et.al. 2402.19475v1 null
2024-02-29 The All-Seeing Project V2: Towards General Relation Comprehension of the Open World Weiyun Wang et.al. 2402.19474v1 link
2024-02-29 Retrieval-Augmented Generation for AI-Generated Content: A Survey Penghao Zhao et.al. 2402.19473v1 link
2024-02-29 Loose LIPS Sink Ships: Asking Questions in Battleship with Language-Informed Program Sampling Gabriel Grand et.al. 2402.19471v1 null
2024-02-29 Humanoid Locomotion as Next Token Prediction Ilija Radosavovic et.al. 2402.19469v1 null
2024-02-28 UniMODE: Unified Monocular 3D Object Detection Zhuoling Li et.al. 2402.18573v1 null
2024-02-28 Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards Haoxiang Wang et.al. 2402.18571v1 link
2024-02-28 Diffusion Language Models Are Versatile Protein Learners Xinyou Wang et.al. 2402.18567v1 null
2024-02-28 Approaching Human-Level Forecasting with Language Models Danny Halawi et.al. 2402.18563v1 null
2024-02-27 The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Shuming Ma et.al. 2402.17764v1 null
2024-02-27 Reducing Unnecessary Alerts in Pedestrian Protection Systems Based on P2V Communications Ignacio Soto et.al. 2402.17763v1 null
2024-02-27 Towards Optimal Learning of Language Models Yuxian Gu et.al. 2402.17759v1 null
2024-02-27 ADL4D: Towards A Contextually Rich Dataset for 4D Activities of Daily Living Marsil Zakour et.al. 2402.17758v1 null
2024-02-27 Evaluating Very Long-Term Conversational Memory of LLM Agents Adyasha Maharana et.al. 2402.17753v1 null
2024-02-26 Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic Supervision Fan Jiang et.al. 2402.16508v1 link
2024-02-26 Stochastic Conditional Diffusion Models for Semantic Image Synthesis Juyeon Ko et.al. 2402.16506v1 null
2024-02-26 SAND: Decoupling Sanitization from Fuzzing for Low Overhead Ziqiao Kong et.al. 2402.16497v1 null
2024-02-26 Intelligent Known and Novel Aircraft Recognition – A Shift from Classification to Similarity Learning for Combat Identification Ahmad Saeed et.al. 2402.16486v1 null
2024-02-23 Seamless Human Motion Composition with Blended Positional Encodings German Barquero et.al. 2402.15509v1 link
2024-02-23 AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning Jianguo Zhang et.al. 2402.15506v1 link
2024-02-23 Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts Yuejiang Liu et.al. 2402.15505v1 null
2024-02-23 Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition Chun-Hsiao Yeh et.al. 2402.15504v1 link
2024-02-23 API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs Kinjal Basu et.al. 2402.15491v1 null
2024-02-22 PALO: A Polyglot Large Multimodal Model for 5B People Muhammad Maaz et.al. 2402.14818v1 link
2024-02-22 Cameras as Rays: Pose Estimation via Ray Diffusion Jason Y. Zhang et.al. 2402.14817v1 null
2024-02-22 WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition Lianghui Zhu et.al. 2402.14812v1 link
2024-02-22 Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking Nikhil Prakash et.al. 2402.14811v1 null
2024-02-22 GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion Xueyi Liu et.al. 2402.14810v1 link
2024-02-22 CriticBench: Benchmarking LLMs for Critique-Correct Reasoning Zicheng Lin et.al. 2402.14809v1 link
2024-02-22 RelayAttention for Efficient Large Language Model Serving with Long System Prompts Lei Zhu et.al. 2402.14808v1 link
2024-02-22 A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health Nikhil Behari et.al. 2402.14807v1 null
2024-02-22 Identifying Multiple Personalities in Large Language Models with External Evaluation Xiaoyang Song et.al. 2402.14805v1 null
2024-02-21 D-Flow: Differentiating through Flows for Controlled Generation Heli Ben-Hamu et.al. 2402.14017v1 null
2024-02-21 Corrective Machine Unlearning Shashwat Goel et.al. 2402.14015v1 link
2024-02-21 Geometry-Informed Neural Networks Arturs Berzins et.al. 2402.14009v1 null
2024-02-21 OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems Chaoqun He et.al. 2402.14008v1 link
2024-02-21 Hallucinations or Attention Misdirection? The Path to Strategic Value Extraction in Business Using Large Language Models Aline Ioste et.al. 2402.14002v1 null
2024-02-21 Real-time 3D-aware Portrait Editing from a Single Image Qingyan Bai et.al. 2402.14000v1 null
2024-02-20 CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples Jianrui Zhang et.al. 2402.13254v1 link
2024-02-20 BiMediX: Bilingual Medical Mixture of Experts LLM Sara Pieri et.al. 2402.13253v1 link
2024-02-20 Video ReCap: Recursive Captioning of Hour-Long Videos Md Mohaiminul Islam et.al. 2402.13250v1 null
2024-02-20 TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization Liyan Tang et.al. 2402.13249v1 link
2024-02-20 Are Fact-Checking Tools Reliable? An Evaluation of Google Fact Check Qiangeng Yang et.al. 2402.13244v1 null
2024-02-20 Unlocking Insights: Semantic Search in Jupyter Notebooks Lan Li et.al. 2402.13234v1 null
2024-02-20 A Touch, Vision, and Language Dataset for Multimodal Alignment Letian Fu et.al. 2402.13232v1 link
2024-02-19 FiT: Flexible Vision Transformer for Diffusion Model Zeyu Lu et.al. 2402.12376v1 link
2024-02-19 A synthetic data approach for domain generalization of NLI models Mohammad Javad Hosseini et.al. 2402.12368v1 null
2024-02-19 A Critical Evaluation of AI Feedback for Aligning Large Language Models Archit Sharma et.al. 2402.12366v1 link
2024-02-19 Almost-linear time parameterized algorithm for rankwidth via dynamic rankwidth Tuukka Korhonen et.al. 2402.12364v1 null
2024-02-19 Flip Graphs of Pseudo-Triangulations With Face Degree at Most 4 Maarten Löffler et.al. 2402.12357v1 null
2024-02-19 Graph-Based Retriever Captures the Long Tail of Biomedical Knowledge Julien Delile et.al. 2402.12352v1 null
2024-02-16 Fusion of Diffusion Weighted MRI and Clinical Data for Predicting Functional Outcome after Acute Ischemic Stroke with Deep Contrastive Learning Chia-Ling Tsai et.al. 2402.10894v1 null
2024-02-16 RLVF: Learning from Verbal Feedback without Overgeneralization Moritz Stephan et.al. 2402.10893v1 link
2024-02-16 Instruction Diversity Drives Generalization To Unseen Tasks Dylan Zhang et.al. 2402.10891v1 null
2024-02-16 When is Tree Search Useful for LLM Planning? It Depends on the Discriminator Ziru Chen et.al. 2402.10890v1 link
2024-02-16 Evaluation of EAP Usage for Authenticating Eduroam Users in 5G Networks Leonardo Azalim de Oliveira et.al. 2402.10889v1 null
2024-02-16 Explainability for Machine Learning Models: From Data Adaptability to User Perception julien Delaunay et.al. 2402.10888v1 null
2024-02-16 Reviewer2: Optimizing Review Generation Through Prompt Generation Zhaolin Gao et.al. 2402.10886v1 null
2024-02-16 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations Tsung-Wei Ke et.al. 2402.10885v1 null
2024-02-15 Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation Huizhuo Yuan et.al. 2402.10210v1 null
2024-02-15 Recovering the Pre-Fine-Tuning Weights of Generative Models Eliahu Horwitz et.al. 2402.10208v1 link
2024-02-15 Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment Rui Yang et.al. 2402.10207v1 link
2024-02-15 Unlocking the Potential of Transformers in Time Series Forecasting with Sharpness-Aware Minimization and Channel-Wise Attention Romain Ilbert et.al. 2402.10198v1 link
2024-02-15 BitDelta: Your Fine-Tune May Only Be Worth One Bit James Liu et.al. 2402.10193v1 link
2024-02-15 Multi-Excitation Projective Simulation with a Many-Body Physics Inspired Inductive Bias Philip A. LeMaitre et.al. 2402.10192v1 link
2024-02-15 FedAnchor: Enhancing Federated Semi-Supervised Learning with Label Contrastive Loss for Unlabeled Clients Xinchi Qiu et.al. 2402.10191v1 null
2024-02-14 AQA-Bench: An Interactive Benchmark for Evaluating LLMs’ Sequential Reasoning Ability Siwei Yang et.al. 2402.09404v1 link
2024-02-14 Reinforcement Learning from Human Feedback with Active Queries Kaixuan Ji et.al. 2402.09401v1 null
2024-02-14 Long-form evaluation of model editing Domenic Rosati et.al. 2402.09394v1 null
2024-02-14 Introduction to Physically Unclonable Fuctions: Properties and Applications M. Garcia-Bosque et.al. 2402.09386v1 null
2024-02-14 GraSSRep: Graph-Based Self-Supervised Learning for Repeat Detection in Metagenomic Assembly Ali Azizpour et.al. 2402.09381v1 link
2024-02-13 IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation Luke Melas-Kyriazi et.al. 2402.08682v1 null
2024-02-13 Mitigating Object Hallucination in Large Vision-Language Models via Classifier-Free Guidance Linxi Zhao et.al. 2402.08680v1 null
2024-02-13 COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability Xingang Guo et.al. 2402.08679v1 link
2024-02-13 Graph Mamba: Towards Learning on Graphs with State Space Models Ali Behrouz et.al. 2402.08678v1 link
2024-02-13 Model Assessment and Selection under Temporal Distribution Shift Elise Han et.al. 2402.08672v1 link
2024-02-13 Rec-GPT4V: Multimodal Recommendation with Large Vision-Language Models Yuqing Liu et.al. 2402.08670v1 null
2024-02-13 Improving Generalization in Semantic Parsing by Increasing Natural Language Variation Irina Saparina et.al. 2402.08666v1 link
2024-02-12 A systematic investigation of learnability from single child linguistic input Yulu Qin et.al. 2402.07899v1 null
2024-02-12 Label-Efficient Model Selection for Text Generation Shir Ashury-Tahan et.al. 2402.07891v1 null
2024-02-12 Toward an Android Static Analysis Approach for Data Protection Mugdha Khedkar et.al. 2402.07889v1 null
2024-02-12 WildfireGPT: Tailored Large Language Model for Wildfire Analysis Yangxinyu Xie et.al. 2402.07877v1 null
2024-02-12 Policy Improvement using Language Feedback Models Victor Zhong et.al. 2402.07876v1 null
2024-02-09 Feedback Loops With Language Models Drive In-Context Reward Hacking Alexander Pan et.al. 2402.06627v1 link
2024-02-09 Understanding the Effects of Iterative Prompting on Truthfulness Satyapriya Krishna et.al. 2402.06625v1 null
2024-02-09 A two-stage algorithm in evolutionary product unit neural networks for classification Antonio J. Tallón-Ballesteros et.al. 2402.06622v1 null
2024-02-09 TIC: Translate-Infer-Compile for accurate ‘text to plan’ using LLMs and logical intermediate representations Sudhir Agarwal et.al. 2402.06608v1 null
2024-02-09 On the Out-Of-Distribution Generalization of Multimodal Large Language Models Xingxuan Zhang et.al. 2402.06599v1 null
2024-02-09 CigaR: Cost-efficient Program Repair with LLMs Dávid Hidvégi et.al. 2402.06598v1 link
2024-02-09 Understanding the Weakness of Large Language Model Agents within a Complex Android Environment Mingzhe Xing et.al. 2402.06596v1 link
2024-02-08 InstaGen: Enhancing Object Detection by Training on Synthetic Dataset Chengjian Feng et.al. 2402.05937v1 null
2024-02-08 SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models Peng Gao et.al. 2402.05935v1 link
2024-02-08 Time Series Diffusion in the Frequency Domain Jonathan Crabbé et.al. 2402.05933v1 link
2024-02-08 WebLINX: Real-World Website Navigation with Multi-Turn Dialogue Xing Han Lù et.al. 2402.05930v1 link
2024-02-08 An Interactive Agent Foundation Model Zane Durante et.al. 2402.05929v1 null
2024-02-08 Sharp Rates in Dependent Learning Theory: Avoiding Sample Size Deflation for the Square Loss Ingvar Ziemann et.al. 2402.05928v1 null
2024-02-07 Image captioning for Brazilian Portuguese using GRIT model Rafael Silva de Alencar et.al. 2402.05106v1 null
2024-02-07 You Can REST Now: Automated Specification Inference and Black-Box Testing of RESTful APIs with Large Language Models Alix Decrop et.al. 2402.05102v1 null
2024-02-07 Hydragen: High-Throughput LLM Inference with Shared Prefixes Jordan Juravsky et.al. 2402.05099v1 null
2024-02-07 On diffusion models for amortized inference: Benchmarking and improving stochastic control and sampling Marcin Sendera et.al. 2402.05098v1 link
2024-02-07 Language-Based Augmentation to Address Shortcut Learning in Object Goal Navigation Dennis Hoftijzer et.al. 2402.05090v1 null
2024-02-07 Hyperspectral acquisition with ScanImage at the single pixel level: Application to time domain coherent Raman imaging Samuel Metais et.al. 2402.05086v1 null
2024-02-06 Linear-time Minimum Bayes Risk Decoding with Reference Aggregation Jannis Vamvas et.al. 2402.04251v1 link
2024-02-06 CAST: Clustering Self-Attention using Surrogate Tokens for Efficient Transformers Adjorn van Engelenhoven et.al. 2402.04239v1 null
2024-02-06 CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations Ji Qi et.al. 2402.04236v1 link
2024-02-06 Role of spontaneously generated coherence (SGC) in laser cooling of atoms Rajnandan Choudhury Das et.al. 2402.04234v1 null
2024-02-06 Can Generative Agents Predict Emotion? Ciaran Regan et.al. 2402.04232v1 null
2024-02-06 Further Constructions of AMUBs for Non-prime power Composite Dimensions Ajeet Kumar et.al. 2402.04231v1 null
2024-02-05 Do Diffusion Models Learn Semantically Meaningful and Efficient Representations? Qiyao Liang et.al. 2402.03305v1 null
2024-02-05 GUARD: Role-playing to Generate Natural-language Jailbreakings to Test Guideline Adherence of Large Language Models Haibo Jin et.al. 2402.03299v1 null
2024-02-05 Ginger: An Efficient Curvature Approximation with Linear Complexity for General Neural Networks Yongchang Hao et.al. 2402.03295v1 null
2024-02-05 InstanceDiffusion: Instance-level Control for Image Generation Xudong Wang et.al. 2402.03290v1 link
2024-02-05 Make Every Move Count: LLM-based High-Quality RTL Code Generation Using MCTS Matthew DeLorenzo et.al. 2402.03289v1 null
2024-02-05 A Lennard-Jones Layer for Distribution Normalization Mulun Na et.al. 2402.03287v1 null
2024-02-05 Training-Free Consistent Text-to-Image Generation Yoad Tewel et.al. 2402.03286v1 null
2024-02-05 Towards a Flexible Scale-out Framework for Efficient Visual Data Query Processing Rohit Verma et.al. 2402.03283v1 null
2024-02-02 Position Paper: Generalized grammar rules and structure-based generalization beyond classical equivariance for lexical tasks and transduction Mircea Petrache et.al. 2402.01629v1 null
2024-02-02 Stochastic Two Points Method for Deep Model Zeroth-order Optimization Yijiang Pang et.al. 2402.01621v1 null
2024-02-02 MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models Justin Chih-Yao Chen et.al. 2402.01620v1 link
2024-02-02 Style Vectors for Steering Generative Large Language Model Kai Konen et.al. 2402.01618v1 link
2024-02-02 A GP-based Robust Motion Planning Framework for Agile Autonomous Robot Navigation and Recovery in Unknown Environments Nicholas Mohammad et.al. 2402.01617v1 null
2024-02-01 AToM: Amortized Text-to-Mesh using 2D Diffusion Guocheng Qian et.al. 2402.00867v1 null
2024-02-01 Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection Qinyu Zhao et.al. 2402.00865v1 link
2024-02-01 Evaluating Large Language Models for Generalization and Robustness via Data Compression Yucheng Li et.al. 2402.00861v1 link
2024-02-01 Can Large Language Models Understand Context? Yilun Zhu et.al. 2402.00858v1 null
2024-02-01 SymbolicAI: A framework for logic-based approaches combining generative models and solvers Marius-Constantin Dinu et.al. 2402.00854v1 link
2024-02-01 LTAU-FF: Loss Trajectory Analysis for Uncertainty in Atomistic Force Fields Joshua A. Vita et.al. 2402.00853v1 null
2024-01-31 Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators Daniel Geng et.al. 2401.18085v1 null
2024-01-31 Improved Scene Landmark Detection for Camera Localization Tien Do et.al. 2401.18083v1 link
2024-01-31 Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners? Andreas Opedal et.al. 2401.18070v1 null
2024-01-30 A simple, strong baseline for building damage detection on the xBD dataset Sebastian Gerard et.al. 2401.17271v1 link
2024-01-30 Weaver: Foundation Models for Creative Writing Tiannan Wang et.al. 2401.17268v1 null
2024-01-30 Proactive Detection of Voice Cloning with Localized Watermarking Robin San Roman et.al. 2401.17264v1 link
2024-01-30 Weak-to-Strong Jailbreaking on Large Language Models Xuandong Zhao et.al. 2401.17256v1 link
2024-01-29 Endo-4DGS: Distilling Depth Ranking for Endoscopic Monocular Scene Reconstruction with 4D Gaussian Splatting Yiming Huang et.al. 2401.16416v1 null
2024-01-29 A Survey on Visual Anomaly Detection: Challenge, Approach, and Prospect Yunkang Cao et.al. 2401.16402v1 null
2024-01-29 Amazon’s 2023 Drought: Sentinel-1 Reveals Extreme Rio Negro River Contraction Fabien H Wagner et.al. 2401.16393v1 null
2024-01-26 EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty Yuhui Li et.al. 2401.15077v1 link
2024-01-26 Annotated Hands for Generative Models Yue Yang et.al. 2401.15075v1 link
2024-01-26 From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities Chaochao Lu et.al. 2401.15071v1 null
2024-01-26 Pairing Orthographically Variant Literary Words to Standard Equivalents Using Neural Edit Distance Models Craig Messner et.al. 2401.15068v1 null
2024-01-26 Asymmetric Influence of the Amplitude-Dependent Tune Shift on the Transverse Mode-Coupling Instability Miriam Brosi et.al. 2401.15065v1 null
2024-01-26 Expert with Clustering: Hierarchical Online Preference Learning Framework Tianyue Zhou et.al. 2401.15062v1 null
2024-01-25 Deconstructing Denoising Diffusion Models for Self-Supervised Learning Xinlei Chen et.al. 2401.14404v1 null
2024-01-25 O(1) Insertion for Random Walk d-ary Cuckoo Hashing up to the Load Threshold Tolson Bell et.al. 2401.14394v1 null
2024-01-25 Inconsistency Masks: Removing the Uncertainty from Input-Pseudo-Label Pairs Michael R. H. Vorndran et.al. 2401.14387v1 link
2024-01-25 Manifold GCN: Diffusion-based Convolutional Neural Network for Manifold-valued Graphs Martin Hanik et.al. 2401.14381v1 null
2024-01-25 UrbanGenAI: Reconstructing Urban Landscapes using Panoptic Segmentation and Diffusion Models Timo Kapsalis et.al. 2401.14379v1 null
2024-01-24 Graph-Informed Neural Networks for Sparse Grid-Based Discontinuity Detectors Francesco Della Santa et.al. 2401.13652v1 link
2024-01-24 Employing polyhedral methods to optimize stencils on FPGAs with stencil-specific caches, data reuse, and wide data bursts Florian Mayer et.al. 2401.13645v1 null
2024-01-24 Unveiling homophily beyond the pool of opportunities Sina Sajjadi et.al. 2401.13642v1 null
2024-01-23 GALA: Generating Animatable Layered Assets from a Single Scan Taeksoo Kim et.al. 2401.12979v1 null
2024-01-23 Zero-Shot Learning for the Primitives of 3D Affordance in General Objects Hyeonwoo Kim et.al. 2401.12978v1 null
2024-01-23 In-Context Language Learning: Arhitectures and Algorithms Ekin Akyürek et.al. 2401.12973v1 link
2024-01-23 Raidar: geneRative AI Detection viA Rewriting Chengzhi Mao et.al. 2401.12970v1 link
2024-01-23 Minimizing the Age of Two Heterogeneous Sources With Packet Drops Via Cyclic Schedulers Sahan Liyanaarachchi et.al. 2401.12962v1 null
2024-01-23 Chatterbox: Robust Transport for LLM Token Streaming under Unstable Network Hanchen Li et.al. 2401.12961v1 null
2024-01-22 Exploring Simple Open-Vocabulary Semantic Segmentation Zihang Lai et.al. 2401.12217v1 link
2024-01-22 Genericity Through Stratification Victor Arrial et.al. 2401.12212v1 null
2024-01-22 OK-Robot: What Really Matters in Integrating Open-Knowledge Models for Robotics Peiqi Liu et.al. 2401.12202v1 link
2024-01-22 APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference Bowen Zhao et.al. 2401.12200v1 null
2024-01-22 Learning Dynamics from Multicellular Graphs with Deep Neural Networks Haiqian Yang et.al. 2401.12196v1 null
2024-01-22 Text Embedding Inversion Attacks on Multilingual Language Models Yiyi Chen et.al. 2401.12192v1 null
2024-01-19 Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data Lihe Yang et.al. 2401.10891v1 link
2024-01-19 Event detection from novel data sources: Leveraging satellite imagery alongside GPS traces Ekin Ugurel et.al. 2401.10890v1 link
2024-01-19 Synthesizing Moving People with 3D Control Boyi Li et.al. 2401.10889v1 null
2024-01-19 Pruning for Protection: Increasing Jailbreak Resistance in Aligned LLMs Without Fine-Tuning Adib Hasan et.al. 2401.10862v1 link
2024-01-18 ParaHome: Parameterizing Everyday Home Activities Towards 3D Generative Modeling of Human-Object Interactions Jeonghwan Kim et.al. 2401.10232v1 null
2024-01-18 Simultaneous Tactile Estimation and Control for Extrinsic Dexterity Antonia Bronars et.al. 2401.10230v1 null
2024-01-18 RAP-SAM: Towards Real-Time All-Purpose Segment Anything Shilin Xu et.al. 2401.10228v1 link
2024-01-18 A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting Wouter Van Gansbeke et.al. 2401.10227v1 link
2024-01-18 The Manga Whisperer: Automatically Generating Transcriptions for Comics Ragav Sachdeva et.al. 2401.10224v1 link
2024-01-18 Supervised Fine-tuning in turn Improves Visual Foundation Models Xiaohu Jiang et.al. 2401.10222v1 link
2024-01-18 AutoFT: Robust Fine-Tuning by Optimizing Hyperparameters on OOD Data Caroline Choi et.al. 2401.10220v1 null
2024-01-18 Explaining the Implicit Neural Canvas: Connecting Pixels to Neurons by Tracing their Contributions Namitha Padmanabhan et.al. 2401.10217v1 null
2024-01-18 GPAvatar: Generalizable and Precise Head Avatar from Image(s) Xuangeng Chu et.al. 2401.10215v1 link
2024-01-17 Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model Lianghui Zhu et.al. 2401.09417v1 link
2024-01-17 Vlogger: Make Your Dream A Vlog Shaobin Zhuang et.al. 2401.09414v1 link
2024-01-17 Deciphering Textual Authenticity: A Generalized Strategy through the Lens of Large Language Semantics for Detecting Human vs. Machine-Generated Text Mazal Bethany et.al. 2401.09407v1 null
2024-01-16 Machine Translation with Large Language Models: Prompt Engineering for Persian, English, and Russian Directions Nooshin Pourkamali et.al. 2401.08429v1 null
2024-01-16 Three ways that non-differentiability affects neural network training Siddharth Krishna Kumar et.al. 2401.08426v1 null
2024-01-16 U-DIADS-Bib: a full and few-shot pixel-precise dataset for document layout analysis of ancient manuscripts Silvia Zottin et.al. 2401.08425v1 null
2024-01-16 Ask the experts: sourcing high-quality datasets for nutritional counselling through Human-AI collaboration Simone Balloccu et.al. 2401.08420v1 link
2024-01-16 Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation Haoran Xu et.al. 2401.08417v1 link
2024-01-12 Automated Test Case Repair Using Language Models Ahmadreza Saboor Yaraghi et.al. 2401.06765v1 null
2024-01-12 APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding Mingdao Liu et.al. 2401.06761v1 null
2024-01-12 Synthetic Data Generation Framework, Dataset, and Efficient Deep Model for Pedestrian Intention Prediction Muhammad Naveed Riaz et.al. 2401.06757v1 null
2024-01-12 Stylometry Analysis of Multi-authored Documents for Authorship and Author Style Change Detection Muhammad Tayyab Zamir et.al. 2401.06752v1 null
2024-01-12 The Unreasonable Effectiveness of Easy Training Data for Hard Tasks Peter Hase et.al. 2401.06751v1 link
2024-01-12 Measure Theoretic Reeb Graphs and Reeb Spaces Qingsong Wang et.al. 2401.06748v1 null
2024-01-11 Distilling Vision-Language Models on Millions of Videos Yue Zhao et.al. 2401.06129v1 null
2024-01-11 E $^{2}$ GAN: Efficient Training of Efficient GANs for Image-to-Image Translation Yifan Gong et.al. 2401.06127v1 null
2024-01-11 Dubbing for Everyone: Data-Efficient Visual Dubbing using Neural Rendering Priors Jack Saunders et.al. 2401.06126v1 null
2024-01-11 Manipulating Feature Visualizations with Gradient Slingshots Dilyara Bareeva et.al. 2401.06122v1 link
2024-01-11 Gaussian Shadow Casting for Neural Characters Luis Bolanos et.al. 2401.06116v1 null
2024-01-11 Jupyter widgets and extensions for education and research in computational physics and chemistry Dou Du et.al. 2401.06113v1 null
2024-01-10 InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes Mohamad Shahbazi et.al. 2401.05335v1 null
2024-01-10 URHand: Universal Relightable Hands Zhaoxi Chen et.al. 2401.05334v1 null
2024-01-10 \textit{SmartMME}: Implementation of Base Station Switching Off Strategy in ns-3 Argha Sen et.al. 2401.05329v1 null
2024-01-10 Leveraging Print Debugging to Improve Code Generation in Large Language Models Xueyu Hu et.al. 2401.05319v1 null
2024-01-10 Can Probabilistic Feedback Drive User Impacts in Online Platforms? Jessica Dai et.al. 2401.05304v1 null
2024-01-09 Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation Xiyi Chen et.al. 2401.04728v1 null
2024-01-09 Low-Resource Vision Challenges for Foundation Models Yunhua Zhang et.al. 2401.04716v1 null
2024-01-09 Bin Packing under Random-Order: Breaking the Barrier of 3/2 Anish Hebbar et.al. 2401.04714v1 link
2024-01-09 RNA-TransCrypt: Image Encryption Using Chaotic RNA Encoding, Novel Transformative Substitution, and Tailored Cryptographic Operations Muhammad Shahbaz Khan et.al. 2401.04707v1 null
2024-01-08 AGG: Amortized Generative 3D Gaussians for Single Image to 3D Dejia Xu et.al. 2401.04099v1 null
2024-01-08 Modeling AoII in Push- and Pull-Based Sampling of Continuous Time Markov Chains Ismail Cosandal et.al. 2401.04098v1 null
2024-01-08 GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation Tong Wu et.al. 2401.04092v1 link
2024-01-08 Mixtral of Experts Albert Q. Jiang et.al. 2401.04088v1 null
2024-01-05 Denoising Vision Transformers Jiawei Yang et.al. 2401.02957v1 link
2024-01-05 Locally Adaptive Neural 3D Morphable Models Michail Tarasiou et.al. 2401.02937v1 link
2024-01-05 Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks Kevin Everson et.al. 2401.02921v1 null
2024-01-04 Learning to Prompt with Text Only Supervision for Vision-Language Models Muhammad Uzair Khattak et.al. 2401.02418v1 link
2024-01-04 LLaMA Pro: Progressive LLaMA with Block Expansion Chengyue Wu et.al. 2401.02415v1 link
2024-01-04 LLM Augmented LLMs: Expanding Capabilities through Composition Rachit Bansal et.al. 2401.02412v1 null
2024-01-04 What You See is What You GAN: Rendering Every Pixel for High-Fidelity Geometry in 3D GANs Alex Trevithick et.al. 2401.02411v1 null
2024-01-04 Correctness Comparison of ChatGPT-4, Bard, Claude-2, and Copilot for Spatial Tasks Hartwig H. Hochmair et.al. 2401.02404v1 null
2024-01-04 3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation Zihao Xiao et.al. 2401.02402v1 null
2024-01-04 Learning the 3D Fauna of the Web Zizhang Li et.al. 2401.02400v1 null
2024-01-03 From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations Evonne Ng et.al. 2401.01885v1 link
2024-01-03 A rewriting-logic-with-SMT-based formal analysis and parameter synthesis framework for parametric time Petri nets Jaime Arias et.al. 2401.01884v1 null
2024-01-03 Theoretical guarantees on the best-of-n alignment policy Ahmad Beirami et.al. 2401.01879v1 null
2024-01-03 Graph Neural Networks for Surfactant Multi-Property Prediction Christoforos Brozos et.al. 2401.01874v1 link
2024-01-03 Dataset Difficulty and the Role of Inductive Bias Devin Kwok et.al. 2401.01867v1 null
2024-01-02 Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models Zixiang Chen et.al. 2401.01335v1 link
2024-01-02 An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extraction Zaratiana Urchade et.al. 2401.01326v1 link
2024-01-02 A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models S. M Towhidul Islam Tonmoy et.al. 2401.01313v1 null
2024-01-02 On the uniqueness and computation of commuting extensions Pascal Koiran et.al. 2401.01302v1 null
2023-12-29 K-PERM: Personalized Response Generation Using Dynamic Knowledge Retrieval and Persona-Adaptive Queries Kanak Raj et.al. 2312.17748v1 link
2023-12-28 Do Androids Know They’re Only Dreaming of Electric Sheep? Sky CH-Wang et.al. 2312.17249v1 null
2023-12-28 Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity Guhao Feng et.al. 2312.17248v1 null
2023-12-28 The LLM Surgeon Tycho F. A. van der Ouderaa et.al. 2312.17244v1 link
2023-12-28 Unsupervised Universal Image Segmentation Dantong Niu et.al. 2312.17243v1 link
2023-12-28 Learning to Generate Text in Arbitrary Writing Styles Aleem Khan et.al. 2312.17242v1 null
2023-12-28 An Improved Baseline for Reasoning Segmentation with Large Language Model Senqiao Yang et.al. 2312.17240v1 null
2023-12-28 Fast Inference of Mixture-of-Experts Language Models with Offloading Artyom Eliseev et.al. 2312.17238v1 link
2023-12-28 A Simple LLM Framework for Long-Range Video Question-Answering Ce Zhang et.al. 2312.17235v1 link
2023-12-28 Personalized Restoration via Dual-Pivot Tuning Pradyumna Chari et.al. 2312.17234v1 null
2023-12-26 Social-Transmotion: Promptable Human Trajectory Prediction Saeed Saadatnejad et.al. 2312.16168v1 link
2023-12-26 Age of Information in Gossip Networks: A Friendly Introduction and Literature Survey Priyanka Kaswan et.al. 2312.16163v1 null
2023-12-26 Zero-Shot Cross-Lingual Reranking with Large Language Models for Low-Resource Languages Mofetoluwa Adeyemi et.al. 2312.16159v1 null
2023-12-26 From Text to Multimodal: A Comprehensive Survey of Adversarial Example Generation in Question Answering Systems Gulsum Yigit et.al. 2312.16156v1 null
2023-12-26 Validating Light Phenomena Conceptual Assessment Through The Lens of CTT and IRT Frameworks Purwoko Haryadi Santoso et.al. 2312.16153v1 null
2023-12-26 SoundCount: Sound Counting from Raw Audio with Dyadic Decomposition Neural Network Yuhang He et.al. 2312.16149v1 null
2023-12-22 MACS: Mass Conditioned 3D Hand and Object Motion Synthesis Soshi Shimada et.al. 2312.14929v1 null
2023-12-22 PoseGen: Learning to Generate 3D Human Pose Dataset with NeRF Mohsen Gholami et.al. 2312.14915v1 link
2023-12-21 Virtual Pets: Animatable Animal Generation in 3D Scenes Yen-Chi Cheng et.al. 2312.14154v1 null
2023-12-21 DriveLM: Driving with Graph Visual Question Answering Chonghao Sima et.al. 2312.14150v1 link
2023-12-21 HeadCraft: Modeling High-Detail Shape Variations for Animated 3DMMs Artem Sevastopolsky et.al. 2312.14140v1 null
2023-12-21 Diffusion Reward: Learning Rewards via Conditional Video Diffusion Tao Huang et.al. 2312.14134v1 null
2023-12-20 Generative Multimodal Models are In-Context Learners Quan Sun et.al. 2312.13286v1 link
2023-12-20 UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections Fangjinhua Wang et.al. 2312.13285v1 null
2023-12-20 Deep Learning on 3D Neural Fields Pierluigi Zama Ramirez et.al. 2312.13277v1 null
2023-12-20 Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting Junwu Zhang et.al. 2312.13271v1 link
2023-12-19 Weakly Supervised Open-Vocabulary Object Detection Jianghang Lin et.al. 2312.12437v1 null
2023-12-19 A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise Chaoyou Fu et.al. 2312.12436v1 link
2023-12-19 On Inference Stability for Diffusion Models Viet Nguyen et.al. 2312.12431v1 link
2023-12-19 ROSE: A reduced-order scattering emulator for optical models Daniel Odell et.al. 2312.12426v1 null
2023-12-19 SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process Mengyu Wang et.al. 2312.12425v1 link
2023-12-19 Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model Shraman Pramanick et.al. 2312.12423v1 null
2023-12-19 Scene-Conditional 3D Object Stylization and Composition Jinghao Zhou et.al. 2312.12419v1 null
2023-12-18 On Computing Makespan-Optimal Solutions for Generalized Sliding-Tile Puzzles Marcus Gozon et.al. 2312.10887v1 null
2023-12-18 A novel diffusion recommendation algorithm based on multi-scale cnn and residual lstm Yong Niu et.al. 2312.10885v1 null
2023-12-18 Sharable Clothoid-based Continuous Motion Planning for Connected Automated Vehicles Sanghoon Oh et.al. 2312.10880v1 null
2023-12-18 Country-Scale Cropland Mapping in Data-Scarce Settings Using Deep Learning: A Case Study of Nigeria Joaquin Gajardo et.al. 2312.10872v1 link
2023-12-18 From Google Gemini to OpenAI Q* (Q-Star): A Survey of Reshaping the Generative Artificial Intelligence (AI) Research Landscape Timothy R. McIntosh et.al. 2312.10868v1 null
2023-12-15 Osprey: Pixel Understanding with Visual Instruction Tuning Yuqian Yuan et.al. 2312.10032v1 link
2023-12-15 Wearable Coaxially-shielded Metamaterial for Magnetic Resonance Imaging Xia Zhu et.al. 2312.10018v1 null
2023-12-15 Movement Primitive Diffusion: Learning Gentle Robotic Manipulation of Deformable Objects Paul Maria Scheikl et.al. 2312.10008v1 null
2023-12-15 Faithful Persona-based Conversational Dataset Generation with Large Language Models Pegah Jandaghi et.al. 2312.10007v1 link
2023-12-14 LIME: Localized Image Editing via Attention Regularization in Diffusion Models Enis Simsar et.al. 2312.09256v1 null
2023-12-14 Revisiting Depth Completion from a Stereo Matching Perspective for Cross-domain Generalization Luca Bartolomei et.al. 2312.09254v1 link
2023-12-14 FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Injection Hongsuk Choi et.al. 2312.09252v1 null
2023-12-14 VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation Jinguo Zhu et.al. 2312.09251v1 link
2023-12-14 Single Mesh Diffusion Models with Field Latents for Texture Generation Thomas W. Mitchel et.al. 2312.09250v1 null
2023-12-14 ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining Ruoxi Shi et.al. 2312.09249v1 null
2023-12-14 Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking Jacob Eisenstein et.al. 2312.09244v1 null
2023-12-14 OccNeRF: Self-Supervised Multi-Camera Occupancy Prediction with Neural Radiance Fields Chubin Zhang et.al. 2312.09243v1 link
2023-12-14 Text2Immersion: Generative Immersive Scene with 3D Gaussians Hao Ouyang et.al. 2312.09242v1 null
2023-12-13 SAM-guided Graph Cut for 3D Instance Segmentation Haoyu Guo et.al. 2312.08372v1 null
2023-12-13 PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection Kuan-Chih Huang et.al. 2312.08371v1 link
2023-12-13 An Invitation to Deep Reinforcement Learning Bernhard Jaeger et.al. 2312.08365v1 null
2023-12-13 View-Dependent Octree-based Mesh Extraction in Unbounded Scenes for Procedural Synthetic Data Zeyu Ma et.al. 2312.08364v1 link
2023-12-13 On the Computational Hardness of Quantum One-Wayness Bruno Cavalar et.al. 2312.08363v1 null
2023-12-13 Distributed Inference and Fine-tuning of Large Language Models Over The Internet Alexander Borzunov et.al. 2312.08361v1 null
2023-12-12 diff History for Long-Context Language Agents Ulyana Piterbarg et.al. 2312.07540v1 link
2023-12-12 HeadArtist: Text-conditioned 3D Head Generation with Self Score Distillation Hongyu Liu et.al. 2312.07539v1 null
2023-12-12 FreeInit: Bridging Initialization Gap in Video Diffusion Models Tianxing Wu et.al. 2312.07537v1 link
2023-12-12 FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition Sicheng Mo et.al. 2312.07536v1 null
2023-12-12 Interfacing Foundation Models’ Embeddings Xueyan Zou et.al. 2312.07532v1 link
2023-12-12 Topological Obstructions and How to Avoid Them Babak Esmaeili et.al. 2312.07529v1 null
2023-12-11 CAD: Photorealistic 3D Generation via Adversarial Distillation Ziyu Wan et.al. 2312.06663v1 null
2023-12-11 Photorealistic Video Generation with Diffusion Models Agrim Gupta et.al. 2312.06662v1 null
2023-12-11 UpFusion: Novel View Diffusion from Unposed Sparse View Observations Bharath Raj Nagoor Kani et.al. 2312.06661v1 null
2023-12-11 EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM Chong Zhou et.al. 2312.06660v1 link
2023-12-11 Sherpa3D: Boosting High-Fidelity Text-to-3D Generation via Coarse 3D Prior Fangfu Liu et.al. 2312.06655v1 link
2023-12-11 LightSim: Neural Lighting Simulation for Urban Scenes Ava Pun et.al. 2312.06654v1 null
2023-12-11 Adaptive Human Trajectory Prediction via Latent Corridors Neerja Thakkar et.al. 2312.06653v1 null
2023-12-11 Nuvo: Neural UV Mapping for Unruly 3D Representations Pratul P. Srinivasan et.al. 2312.05283v1 null
2023-12-08 KBFormer: A Diffusion Model for Structured Entity Completion Ouail Kitouni et.al. 2312.05253v1 null
2023-12-08 Laboratory realization of relativistic pair-plasma beams C. D. Arrowsmith et.al. 2312.05244v1 null
2023-12-08 Contra generative AI detection in higher education assessments Cesare G. Ardito et.al. 2312.05241v1 null
2023-12-08 SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation Thuan Hoang Nguyen et.al. 2312.05239v1 null
2023-12-08 Seeing ChatGPT Through Universities’ Policies, Resources and Guidelines Hui Wang et.al. 2312.05235v1 null
2023-12-07 Scaling Laws of Synthetic Images for Model Training … for Now Lijie Fan et.al. 2312.04567v1 link
2023-12-07 Gen2Det: Generate to Detect Saksham Suri et.al. 2312.04566v1 null
2023-12-07 MuRF: Multi-Baseline Radiance Fields Haofei Xu et.al. 2312.04565v1 link
2023-12-07 GenDeF: Learning Generative Deformation Field for Video Generation Wen Wang et.al. 2312.04561v1 null
2023-12-07 NeRFiller: Completing Scenes via Generative 3D Inpainting Ethan Weber et.al. 2312.04560v1 null
2023-12-07 PrimDiffusion: Volumetric Primitives Diffusion for 3D Human Generation Zhaoxi Chen et.al. 2312.04559v1 link
2023-12-07 GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation Shoufa Chen et.al. 2312.04557v1 null
2023-12-07 Large Language Models for Mathematicians Simon Frieder et.al. 2312.04556v1 null
2023-12-07 Improved Visual Grounding through Self-Consistent Explanations Ruozhen He et.al. 2312.04554v1 null
2023-12-07 Generating Illustrated Instructions Sachit Menon et.al. 2312.04552v1 null
2023-12-06 Relightable Gaussian Codec Avatars Shunsuke Saito et.al. 2312.03704v1 null
2023-12-06 Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning Xinshun Wang et.al. 2312.03703v1 link
2023-12-06 Self-conditioned Image Generation via Generating Representations Tianhong Li et.al. 2312.03701v1 link
2023-12-06 Intrinsic Harmonization for Illumination-Aware Compositing Chris Careaga et.al. 2312.03698v1 link
2023-12-06 Efficient Learning in Polyhedral Games via Best Response Oracles Darshan Chakrabarti et.al. 2312.03696v1 null
2023-12-06 Memory Triggers: Unveiling Memorization in Text-To-Image Generative Models through Word-Level Duplication Ali Naseh et.al. 2312.03692v1 null
2023-12-06 On the Role of Edge Dependency in Graph Generative Models Sudhanshu Chanpuriya et.al. 2312.03691v1 null
2023-12-06 Evaluating and Mitigating Discrimination in Language Model Decisions Alex Tamkin et.al. 2312.03689v1 null
2023-12-05 GPT4Point: A Unified Framework for Point-Language Understanding and Generation Zhangyang Qi et.al. 2312.02980v1 null
2023-12-05 Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World Kiana Ehsani et.al. 2312.02976v1 null
2023-12-05 Describing Differences in Image Sets with Natural Language Lisa Dunlap et.al. 2312.02974v1 link
2023-12-05 Alchemist: Parametric Control of Material Properties with Diffusion Models Prafull Sharma et.al. 2312.02970v1 null
2023-12-05 Rank-without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models Xinyu Zhang et.al. 2312.02969v1 null
2023-12-05 AmbiGen: Generating Ambigrams from Pre-trained Diffusion Model Boheng Zhao et.al. 2312.02967v1 null
2023-12-05 Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection Cheng-Ju Ho et.al. 2312.02966v1 link
2023-12-05 MVHumanNet: A Large-scale Dataset of Multi-view Daily Dressing Human Captures Zhangyang Xiong et.al. 2312.02963v1 null
2023-12-04 Aligning and Prompting Everything All at Once for Universal Visual Perception Yunhang Shen et.al. 2312.02153v1 link
2023-12-04 Readout Guidance: Learning Control from Diffusion Features Grace Luo et.al. 2312.02150v1 null
2023-12-04 Generative Powers of Ten Xiaojuan Wang et.al. 2312.02149v1 null
2023-12-04 Rejuvenating image-GPT as Strong Visual Representation Learners Sucheng Ren et.al. 2312.02147v1 link
2023-12-04 Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation Bingxin Ke et.al. 2312.02145v1 link
2023-12-04 Optimizing Camera Configurations for Multi-View Pedestrian Detection Yunzhong Hou et.al. 2312.02144v1 null
2023-12-04 Competition-Level Problems Are Effective Evaluators of LLMs Yiming Huang et.al. 2312.02143v1 null
2023-12-04 Object Recognition as Next Token Prediction Kaiyu Yue et.al. 2312.02142v1 link
2023-12-01 VideoBooth: Diffusion-based Video Generation with Image Prompts Yuming Jiang et.al. 2312.00777v1 null
2023-12-01 Towards Generalizable Zero-Shot Manipulation via Translating Human Interaction Plans Homanga Bharadhwaj et.al. 2312.00775v1 null
2023-12-01 Beyond ChatBots: ExploreLLM for Structured Thoughts and Personalized Model Responses Xiao Ma et.al. 2312.00763v1 null
2023-12-01 Mamba: Linear-Time Sequence Modeling with Selective State Spaces Albert Gu et.al. 2312.00752v1 link
2023-12-01 Reduction from sparse LPN to LPN, Dual Attack 3.0 Kévin Carrier et.al. 2312.00747v1 null
2023-12-01 Adversarial Score Distillation: When score distillation meets GAN Min Wei et.al. 2312.00739v1 link
2023-11-30 Dataset Distillation in Large Data Era Zeyuan Yin et.al. 2311.18838v1 link
2023-11-30 VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models Zhen Xing et.al. 2311.18837v1 null
2023-11-30 PoseGPT: Chatting about 3D Human Pose Yao Feng et.al. 2311.18836v1 null
2023-11-30 InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation Rongyao Fang et.al. 2311.18835v1 link
2023-11-30 ART $\boldsymbol{\cdot}$ V: Auto-Regressive Text-to-Video Generation with Diffusion Models Wenming Weng et.al. 2311.18834v1 null
2023-11-30 Exploiting Diffusion Prior for Generalizable Pixel-Level Semantic Prediction Hsin-Ying Lee et.al. 2311.18832v1 link
2023-11-30 MotionEditor: Editing Video Motion via Content-Aware Diffusion Shuyuan Tu et.al. 2311.18830v1 link
2023-11-30 MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation Yanhui Wang et.al. 2311.18829v1 null
2023-11-30 One-step Diffusion with Distribution Matching Distillation Tianwei Yin et.al. 2311.18828v1 null
2023-11-30 An Adaptive Framework for Generalizing Network Traffic Prediction towards Uncertain Environments Alexander Downey et.al. 2311.18824v1 null
2023-11-29 A Simple Recipe for Language-guided Domain Generalized Segmentation Mohammad Fahes et.al. 2311.17922v1 null
2023-11-29 Do text-free diffusion models learn discriminative visual representations? Soumik Mukhopadhyay et.al. 2311.17921v1 link
2023-11-29 Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models Daniel Geng et.al. 2311.17919v1 null
2023-11-29 Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving Yuqi Wang et.al. 2311.17918v1 link
2023-11-29 AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text Jianfeng Zhang et.al. 2311.17917v1 null
2023-11-29 OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation Qidong Huang et.al. 2311.17911v1 link
2023-11-29 HUGS: Human Gaussian Splats Muhammed Kocabas et.al. 2311.17910v1 null
2023-11-29 CG3D: Compositional Generation for Text-to-3D via Gaussian Splatting Alexander Vilesov et.al. 2311.17907v1 null
2023-11-28 HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting Xian Liu et.al. 2311.17061v1 null
2023-11-28 Material Palette: Extraction of Materials from a Single Image Ivan Lopes et.al. 2311.17060v1 null
2023-11-28 Panoptic Video Scene Graph Generation Jingkang Yang et.al. 2311.17058v1 link
2023-11-28 ReMoS: Reactive 3D Motion Synthesis for Two-Person Interactions Anindita Ghosh et.al. 2311.17057v1 null
2023-11-28 Self-Supervised Motion Magnification by Backpropagating Through Optical Flow Zhaoying Pan et.al. 2311.17056v1 null
2023-11-28 No Representation Rules Them All in Category Discovery Sagar Vaze et.al. 2311.17055v1 null
2023-11-28 DiffuseBot: Breeding Soft Robots With Physics-Augmented Generative Diffusion Models Tsun-Hsuan Wang et.al. 2311.17053v1 null
2023-11-28 Surf-D: High-Quality Surface Generation for Arbitrary Topologies using Diffusion Models Zhengming Yu et.al. 2311.17050v1 null
2023-11-27 Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models Munan Ning et.al. 2311.16103v1 link
2023-11-27 Test-time Adaptation of Discriminative Models via Diffusion Generative Feedback Mihir Prabhudesai et.al. 2311.16102v1 null
2023-11-27 How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs Haoqin Tu et.al. 2311.16101v1 link
2023-11-27 GART: Gaussian Articulated Template Models Jiahui Lei et.al. 2311.16099v1 null
2023-11-27 On Bringing Robots Home Nur Muhammad Mahi Shafiullah et.al. 2311.16098v1 link
2023-11-27 CG-HOI: Contact-Guided 3D Human-Object Interaction Generation Christian Diller et.al. 2311.16097v1 null
2023-11-27 Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling Zhe Li et.al. 2311.16096v1 link
2023-11-27 Self-correcting LLM-controlled Diffusion Models Tsung-Han Wu et.al. 2311.16090v1 null
2023-11-27 DUnE: Dataset for Unified Editing Afra Feyza Akyürek et.al. 2311.16087v1 link
2023-11-24 SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation Lingchen Meng et.al. 2311.14671v1 link
2023-11-24 Data-driven Prior Learning for Bayesian Optimisation Sigrid Passano Hellan et.al. 2311.14653v1 link
2023-11-24 One Pass Streaming Algorithm for Super Long Token Attention Approximation in Sublinear Space Raghav Addanki et.al. 2311.14652v1 null
2023-11-24 History Filtering in Imperfect Information Games: Algorithms and Complexity Christopher Solinas et.al. 2311.14651v1 null
2023-11-22 Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation Daichi Horita et.al. 2311.13602v1 null
2023-11-22 Visual In-Context Prompting Feng Li et.al. 2311.13601v1 link
2023-11-22 ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs Viraj Shah et.al. 2311.13600v1 null
2023-11-22 Risk-sensitive Markov Decision Process and Learning under General Utility Functions Zhengqi Wu et.al. 2311.13589v1 null
2023-11-22 A Survey of Serverless Machine Learning Model Inference Kamil Kojs et.al. 2311.13587v1 null
2023-11-22 On diffusion-based generative models and their error bounds: The log-concave case with full convergence estimates Stefano Bruno et.al. 2311.13584v1 null
2023-11-22 PaSS: Parallel Speculative Sampling Giovanni Monea et.al. 2311.13581v1 null
2023-11-22 Aufbau Suppressed Coupled Cluster Theory for Electronically Excited States Harrison Tuckman et.al. 2311.13576v1 null
2023-11-21 Intrinsic Image Decomposition via Ordinal Shading Chris Careaga et.al. 2311.12792v1 link
2023-11-21 Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks Samyak Jain et.al. 2311.12786v1 null
2023-11-20 Rate-Independent Gradient Crystal Plasticity Theory – Robust Algorithmic Formulations based on Incremental Energy Minimization Volker Fohrmeister et.al. 2311.12026v1 null
2023-11-20 The allosteric lever: towards a principle of specific allosteric response Maximilian Vossel et.al. 2311.12025v1 null
2023-11-20 PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction Peng Wang et.al. 2311.12024v1 null
2023-11-20 Macroscopic description of a heavy particle immersed within a flow of light particles Radek Erban et.al. 2311.12021v1 null
2023-11-20 An Empirical Study of Self-Admitted Technical Debt in Machine Learning Software Aaditya Bhatia et.al. 2311.12019v1 null
2023-11-20 GPT-4V(ision) for Robotics: Multimodal Task Planning from Human Demonstration Naoki Wake et.al. 2311.12015v1 null
2023-11-17 Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning Rohit Girdhar et.al. 2311.10709v1 null
2023-11-17 SelfEval: Leveraging the discriminative nature of generative models for evaluation Sai Saketh Rambhatla et.al. 2311.10708v1 null
2023-11-17 Cactus Representations in Polylogarithmic Max-flow via Maximal Isolating Mincuts Zhongtian He et.al. 2311.10706v1 null
2023-11-16 The Chosen One: Consistent Characters in Text-to-Image Diffusion Models Omri Avrahami et.al. 2311.10093v1 null
2023-11-16 Traffic Video Object Detection using Motion Prior Lihao Liu et.al. 2311.10092v1 null
2023-11-16 Adaptive Shells for Efficient Neural Radiance Field Rendering Zian Wang et.al. 2311.10091v1 null
2023-11-16 Emu Edit: Precise Image Editing via Recognition and Generation Tasks Shelly Sheynin et.al. 2311.10089v1 null
2023-11-16 DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback Yangyi Chen et.al. 2311.10081v1 null
2023-11-16 Improving 3D Synthetic Jet Modeling in a Crossflow Howard Ho et.al. 2311.10072v1 null
2023-11-15 Single-Image 3D Human Digitization with Shape-Guided Diffusion Badour AlBahar et.al. 2311.09221v1 null
2023-11-15 DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model Yinghao Xu et.al. 2311.09217v1 null
2023-11-15 Assessing Translation capabilities of Large Language Models involving English and Indian Languages Vandan Mujadia et.al. 2311.09216v1 null
2023-11-15 GRIM: GRaph-based Interactive narrative visualization for gaMes Jorge Leandro et.al. 2311.09213v1 null
2023-11-15 Controllable Text Summarization: Unraveling Challenges, Approaches, and Prospects – A Survey Ashok Urlana et.al. 2311.09212v1 link
2023-11-15 Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models Wenhao Yu et.al. 2311.09210v1 null
2023-11-15 A Unified Approach to Learning Ising Models: Beyond Independence and Bounded Width Jason Gaitonde et.al. 2311.09197v1 null
2023-11-15 Self-Supervised Curriculum Generation for Autonomous Reinforcement Learning without Task-Specific Knowledge Sang-Hyun Lee et.al. 2311.09195v1 null
2023-11-15 Structural Priming Demonstrates Abstract Grammatical Representations in Multilingual Language Models James A. Michaelov et.al. 2311.09194v1 null
2023-11-14 Instant3D: Instant Text-to-3D Generation Ming Li et.al. 2311.08403v1 null
2023-11-14 Fine-tuning Language Models for Factuality Katherine Tian et.al. 2311.08401v1 null
2023-11-14 Towards Open-Ended Visual Recognition with Large Language Model Qihang Yu et.al. 2311.08400v1 link
2023-11-14 Are Large Language Models Temporally Grounded? Yifu Qiu et.al. 2311.08398v1 link
2023-11-14 MVSA-Net: Multi-View State-Action Recognition for Robust and Deployable Trajectory Generation Ehsan Asali et.al. 2311.08393v1 null
2023-11-14 On What Basis? Predicting Text Preference Via Structured Comparative Reasoning Jing Nathan Yan et.al. 2311.08390v1 null
2023-11-14 TSST: A Benchmark and Evaluation Models for Text Speech-Style Transfer Huashan Sun et.al. 2311.08389v1 null
2023-11-13 To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning Junke Wang et.al. 2311.07574v1 link
2023-11-13 Realizability of Free Spaces of Curves Hugo A. Akitaya et.al. 2311.07573v1 null
2023-11-13 Feature emergence via margin maximization: case studies in algebraic tasks Depen Morwani et.al. 2311.07568v1 null
2023-11-13 GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation An Yan et.al. 2311.07562v1 link
2023-11-13 Fast Normalized Cross-Correlation for Template Matching with Rotations José María Almira et.al. 2311.07561v1 null
2023-11-13 Sound Gradual Verification with Symbolic Execution Conrad Zimmerman et.al. 2311.07559v1 null
2023-11-13 Data-Efficient Task Generalization via Probabilistic Model-based Meta Reinforcement Learning Arjun Bhardwaj et.al. 2311.07558v1 null
2023-11-10 Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization Weiyang Liu et.al. 2311.06243v1 null
2023-11-10 Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Bin Xiao et.al. 2311.06242v1 null
2023-11-10 Nonnegativity Problems for Matrix Semigroups Julian D’Costa et.al. 2311.06241v1 null
2023-11-10 Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming in the Wild Nanna Inie et.al. 2311.06237v1 null
2023-11-10 Deep Learning meets Blockchain for Automated and Secure Access Control Asma Jodeiri Akbarfam et.al. 2311.06236v1 null
2023-11-10 Learning material synthesis-structure-property relationship by data fusion: Bayesian Co-regionalization N-Dimensional Piecewise Function Learning A. Gilad Kusne et.al. 2311.06228v1 null
2023-11-10 Does Differential Privacy Prevent Backdoor Attacks in Practice? Fereshteh Razmi et.al. 2311.06227v1 null
2023-11-09 What Do I Hear? Generating Sounds for Visuals with ChatGPT David Chuan-En Lin et.al. 2311.05609v1 null
2023-11-09 Real-Time Neural Rasterization for Large Scenes Jeffrey Yunfan Liu et.al. 2311.05607v1 null
2023-11-09 Diffusion-Generative Multi-Fidelity Learning for Physical Simulation Zheng Wang et.al. 2311.05606v1 null
2023-11-09 3D-QAE: Fully Quantum Auto-Encoding of 3D Point Clouds Lakshika Rathi et.al. 2311.05604v1 null
2023-11-09 Reconstructing Objects in-the-wild for Realistic Sensor Simulation Ze Yang et.al. 2311.05602v1 null
2023-11-09 SynH2R: Synthesizing Hand-Object Motions for Learning Human-to-Robot Handovers Sammy Christen et.al. 2311.05599v1 null
2023-11-09 LLM Augmented Hierarchical Agents Bharat Prakash et.al. 2311.05596v1 null
2023-11-08 GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs Zhenfang Chen et.al. 2311.04901v1 null
2023-11-08 How Abstract Is Linguistic Generalization in Large Language Models? Experiments with Argument Structure Michael Wilson et.al. 2311.04900v1 link
2023-11-08 Optimized measurements of chaotic dynamical systems via the information bottleneck Kieran A. Murphy et.al. 2311.04896v1 null
2023-11-08 The Monadic Theory of Toric Words Valérie Berthé et.al. 2311.04895v1 null
2023-11-08 Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs Shashank Gupta et.al. 2311.04892v1 link
2023-11-08 AutoChip: Automating HDL Generation Using LLM Feedback Shailja Thakur et.al. 2311.04887v1 link
2023-11-08 SEMQA: Semi-Extractive Multi-Source Question Answering Tal Schuster et.al. 2311.04886v1 link
2023-11-07 Towards Garment Sewing Pattern Reconstruction from a Single Image Lijuan Liu et.al. 2311.04218v1 link
2023-11-07 Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves Yihe Deng et.al. 2311.04205v1 link
2023-11-07 Sharp Thresholds Imply Circuit Lower Bounds: from random 2-SAT to Planted Clique David Gamarnik et.al. 2311.04204v1 null
2023-11-07 Exploring Recommendation Capabilities of GPT-4V(ision): A Preliminary Case Study Peilin Zhou et.al. 2311.04199v1 null
2023-11-07 JPAVE: A Generation and Classification-based Model for Joint Product Attribute Prediction and Value Extraction Zhongfen Deng et.al. 2311.04196v1 link
2023-11-06 GLaMM: Pixel Grounding Large Multimodal Model Hanoona Rasheed et.al. 2311.03356v1 null
2023-11-06 SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis Hanrong Ye et.al. 2311.03355v1 null
2023-11-06 CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding Junyan Li et.al. 2311.03354v1 null
2023-11-06 Scalable and Transferable Black-Box Jailbreaks for Language Models via Persona Modulation Rusheb Shah et.al. 2311.03348v1 null
2023-11-06 Decomposing Probability Marginals Beyond Affine Requirements Jannik Matuschke et.al. 2311.03346v1 null
2023-11-06 Long-Term Invariant Local Features via Implicit Cross-Domain Correspondences Zador Pataki et.al. 2311.03345v1 null
2023-11-06 Embedding First Order Logic into Kernel Machines Michelangelo Diligenti et.al. 2311.03340v1 null
2023-11-03 EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision Jiawei Yang et.al. 2311.02077v1 null
2023-11-03 Universal Sharpness Dynamics in Neural Network Training: Fixed Point Analysis, Edge of Stability, and Route to Chaos Dayal Singh Kalra et.al. 2311.02076v1 null
2023-11-03 Envy-Free Cake-Cutting for Four Agents Alexandros Hollender et.al. 2311.02075v1 null
2023-11-03 Learning Historical Status Prompt for Accurate and Robust Visual Tracking Wenrui Cai et.al. 2311.02072v1 null
2023-11-03 Grounded Intuition of GPT-Vision’s Abilities with Scientific Images Alyssa Hwang et.al. 2311.02069v1 link
2023-11-03 GroomGen: A High-Quality Generative Hair Model Using Hierarchical Latent Representations Yuxiao Zhou et.al. 2311.02062v1 null
2023-11-03 Active Learning-Based Species Range Estimation Christian Lange et.al. 2311.02061v1 link
2023-11-02 Idempotent Generative Network Assaf Shocher et.al. 2311.01462v1 null
2023-11-02 Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization Jameel Hassan et.al. 2311.01459v1 null
2023-11-02 Detecting Deepfakes Without Seeing Any Tal Reiss et.al. 2311.01458v1 link
2023-11-02 RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation Yufei Wang et.al. 2311.01455v1 null
2023-11-02 NOIR: Neural Signal Operated Intelligent Robots for Everyday Activities Ruohan Zhang et.al. 2311.01454v1 null
2023-11-02 DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing Vint Lee et.al. 2311.01450v1 null
2023-11-02 UltraLiDAR: Learning Compact Representations for LiDAR Completion and Generation Yuwen Xiong et.al. 2311.01448v1 null
2023-11-02 CADSim: Robust and Scalable in-the-wild 3D Reconstruction for Controllable Sensor Simulation Jingkang Wang et.al. 2311.01447v1 null
2023-11-02 Adv3D: Generating Safety-Critical 3D Objects through Closed-Loop Simulation Jay Sarva et.al. 2311.01446v1 null
2023-11-01 End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation Juan Zuluaga-Gomez et.al. 2311.00697v1 link
2023-11-01 Unleashing the Creative Mind: Language Model As Hierarchical Policy For Improved Exploration on Challenging Problem Solving Zhan Ling et.al. 2311.00694v1 link
2023-11-01 Improving Interpersonal Communication by Simulating Audiences with Language Models Ryan Liu et.al. 2311.00687v1 link
2023-11-01 Deep Learning-Based Classification of Gamma Photon Interactions in Room-Temperature Semiconductor Radiation Detectors Sandeep K. Chaudhuri et.al. 2311.00682v1 null
2023-11-01 Are Large Language Models Reliable Judges? A Study on the Factuality Evaluation Capabilities of LLMs Xue-Yong Fu et.al. 2311.00681v1 null
2023-10-31 Unexpected Improvements to Expected Improvement for Bayesian Optimization Sebastian Ament et.al. 2310.20708v1 null
2023-10-31 What’s In My Big Data? Yanai Elazar et.al. 2310.20707v1 link
2023-10-31 DDAM-PS: Diligent Domain Adaptive Mixer for Person Search Mohammed Khaleed Almansoori et.al. 2310.20706v1 link
2023-10-31 SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction Xinyuan Chen et.al. 2310.20700v1 null
2023-11-01 Bayesian Multistate Bennett Acceptance Ratio Methods Xinqiang Ding et.al. 2310.20699v2 link
2023-10-31 Learning From Mistakes Makes LLM Better Reasoner Shengnan An et.al. 2310.20689v1 link
2023-10-31 Compression with Exact Error Distribution for Federated Learning Mahmoud Hegazy et.al. 2310.20682v1 null
2023-10-30 Variational principles for the hydrodynamics of the classical one-component plasma Daniels Krimans et.al. 2310.19239v1 null
2023-10-30 Building Real-World Meeting Summarization Systems using Large Language Models: A Practical Perspective Md Tahmid Rahman Laskar et.al. 2310.19233v1 null
2023-10-30 Stochastic Configuration Machines: FPGA Implementation Matthew J. Felicetti et.al. 2310.19225v1 null
2023-10-30 CHAMMI: A benchmark for channel-adaptive models in microscopy imaging Zitong Chen et.al. 2310.19224v1 link
2023-10-27 FP8-LM: Training FP8 Large Language Models Houwen Peng et.al. 2310.18313v1 link
2023-10-27 Gen2Sim: Scaling up Robot Learning in Simulation with Generative Models Pushkal Katara et.al. 2310.18308v1 null
2023-10-27 Interactive Motion Planning for Autonomous Vehicles with Joint Optimization Yuxiao Chen et.al. 2310.18301v1 null
2023-10-27 Enhancing the Performance of a Biomimetic Robotic Elbow-and-Forearm System Through Bionics-Inspired Optimization Haosen Yang et.al. 2310.18299v1 null
2023-10-27 Sharp-Edge Diffraction of Laguerre-Gauss Vortex Beams by Elliptic Apertures Riccardo Borghi et.al. 2310.18298v1 null
2023-10-27 Addressing GAN Training Instabilities via Tunable Classification Losses Monica Welfert et.al. 2310.18291v1 null
2023-10-26 Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model Karsten Roth et.al. 2310.17653v1 link
2023-10-26 A Coarse-to-Fine Pseudo-Labeling (C2FPL) Framework for Unsupervised Video Anomaly Detection Anas Al-lahham et.al. 2310.17650v1 link
2023-10-26 6-DoF Stability Field via Diffusion Models Takuma Yoneda et.al. 2310.17649v1 null
2023-10-26 In-Context Learning Dynamics with Random Binary Sequences Eric J. Bigelow et.al. 2310.17639v1 null
2023-10-26 Generative Fractional Diffusion Models Gabriel Nobis et.al. 2310.17638v1 null
2023-10-26 JudgeLM: Fine-tuned Large Language Models are Scalable Judges Lianghui Zhu et.al. 2310.17631v1 link
2023-10-25 SparseDFF: Sparse-View Feature Distillation for One-Shot Dexterous Manipulation Qianxu Wang et.al. 2310.16838v1 null
2023-10-25 Proposal-Contrastive Pretraining for Object Detection from Fewer Data Quentin Bouniot et.al. 2310.16835v1 null
2023-10-25 CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images Aaron Gokaslan et.al. 2310.16825v1 link
2023-10-26 DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior Jingxiang Sun et.al. 2310.16818v2 link
2023-10-25 The intelligent agent model – a fully two-dimensional microscopic traffic flow model Martin Treiber et.al. 2310.16816v1 null
2023-10-24 Synthetic Data as Validation Qixin Hu et.al. 2310.16052v1 null
2023-10-24 EquivAct: SIM(3)-Equivariant Visuomotor Policies beyond Rigid Object Manipulation Jingyun Yang et.al. 2310.16050v1 null
2023-10-24 MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning Zayne Sprague et.al. 2310.16049v1 link
2023-10-24 From Posterior Sampling to Meaningful Diversity in Image Restoration Noa Cohen et.al. 2310.16047v1 null
2023-10-24 Woodpecker: Hallucination Correction for Multimodal Large Language Models Shukang Yin et.al. 2310.16045v1 link
2023-10-25 Stanford-ORB: A Real-World 3D Object Inverse Rendering Benchmark Zhengfei Kuang et.al. 2310.16044v2 link
2023-10-25 WebWISE: Web Interface Control and Sequential Exploration with Large Language Models Heyi Tao et.al. 2310.16042v2 null
2023-10-24 Instruct and Extract: Instruction Tuning for On-Demand Information Extraction Yizhu Jiao et.al. 2310.16040v1 link
2023-10-23 FreeNoise: Tuning-Free Longer Video Diffusion Via Noise Rescheduling Haonan Qiu et.al. 2310.15169v1 link
2023-10-24 Ghost on the Shell: An Expressive Representation of General 3D Shapes Zhen Liu et.al. 2310.15168v2 null
2023-10-23 SAM-Med3D Haoyu Wang et.al. 2310.15161v1 link
2023-10-23 FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models Lihe Yang et.al. 2310.15160v1 link
2023-10-23 Online Detection of AI-Generated Images David C. Epstein et.al. 2310.15150v1 null
2023-10-23 DEsignBench: Exploring and Benchmarking DALL-E 3 for Imagining Visual Design Kevin Lin et.al. 2310.15144v1 link
2023-10-23 SpecTr: Fast Speculative Decoding via Optimal Transport Ziteng Sun et.al. 2310.15141v1 null
2023-10-20 Neural-Base Music Generation for Intelligence Duplication Jacob Galajda et.al. 2310.13691v1 null
2023-10-20 Exploring Linguistic Probes for Morphological Generalization Jordan Kodner et.al. 2310.13686v1 null
2023-10-20 CAPIVARA: Cost-Efficient Approach for Improving Multilingual CLIP Performance on Low-Resource Languages Gabriel Oliveira dos Santos et.al. 2310.13683v1 link
2023-10-20 Optimizing Retrieval-augmented Reader Models via Token Elimination Moshe Berchansky et.al. 2310.13682v1 link
2023-10-20 Information Value: Measuring Utterance Predictability as Distance from Plausible Alternatives Mario Giulianelli et.al. 2310.13676v1 link
2023-10-20 On Synthetic Data for Back Translation Jiahao Xu et.al. 2310.13675v1 link
2023-10-19 HumanTOMATO: Text-aligned Whole-body Motion Generation Shunlin Lu et.al. 2310.12978v1 null
2023-10-19 Training Dynamics of Deep Network Linear Regions Ahmed Imtiaz Humayun et.al. 2310.12977v1 null
2023-10-19 Frozen Transformers in Language Models Are Effective Visual Encoder Layers Ziqi Pang et.al. 2310.12973v1 link
2023-10-19 CCIL: Continuity-based Data Augmentation for Corrective Imitation Learning Liyiming Ke et.al. 2310.12972v1 null
2023-10-19 CLAIR: Evaluating Image Captions with Large Language Models David Chan et.al. 2310.12971v1 null
2023-10-19 Does Your Model Think Like an Engineer? Explainable AI for Bearing Fault Detection with Deep Learning Thomas Decker et.al. 2310.12967v1 null
2023-10-18 Understanding Retrieval Augmentation for Long-Form Question Answering Hung-Ting Chen et.al. 2310.12150v1 null
2023-10-18 Object-aware Inversion and Reassembly for Image Editing Zhen Yang et.al. 2310.12149v1 null
2023-10-18 Simple Mechanisms for Representing, Indexing and Manipulating Concepts Yuanzhi Li et.al. 2310.12143v1 null
2023-10-17 DELIFFAS: Deformable Light Fields for Fast Avatar Synthesis Youngjoong Kwon et.al. 2310.11449v1 null
2023-10-17 Functional Invariants to Watermark Large Transformers Fernandez Pierre et.al. 2310.11446v1 null
2023-10-18 EvalCrafter: Benchmarking and Evaluating Large Video Generation Models Yaofang Liu et.al. 2310.11440v2 link
2023-10-17 Sadness, Anger, or Anxiety: Twitter Users’ Emotional Responses to Toxicity in Public Conversations Ana Aleksandric et.al. 2310.11436v1 null
2023-10-17 An Empirical Study of Translation Hypothesis Ensembling with Large Language Models António Farinhas et.al. 2310.11430v1 link
2023-10-17 Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression Adam Block et.al. 2310.11428v1 null
2023-10-17 A Computational Framework for Solving Wasserstein Lagrangian Flows Kirill Neklyudov et.al. 2310.10649v2 link
2023-10-16 Step-by-Step Remediation of Students’ Mathematical Mistakes Rose E. Wang et.al. 2310.10648v1 link
2023-10-16 A Survey on Video Diffusion Models Zhen Xing et.al. 2310.10647v1 link
2023-10-16 Interactive Task Planning with Language Models Boyi Li et.al. 2310.10645v1 null
2023-10-16 TOSS:High-quality Text-guided Novel View Synthesis from a Single Image Yukai Shi et.al. 2310.10644v1 null
2023-10-16 Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting Zeyu Yang et.al. 2310.10642v1 link
2023-10-16 LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts Hanan Gani et.al. 2310.10640v1 link
2023-10-16 Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion Models Kevin Black et.al. 2310.10639v1 link
2023-10-13 Vision-by-Language for Training-Free Compositional Image Retrieval Shyamgopal Karthik et.al. 2310.09291v1 link
2023-10-13 Disentangled Latent Spaces Facilitate Data-Driven Auxiliary Learning Geri Skenderi et.al. 2310.09278v1 null
2023-10-13 Retro-fallback: retrosynthetic planning in an uncertain world Austin Tripp et.al. 2310.09270v1 null
2023-10-13 Genetic algorithms are strong baselines for molecule generation Austin Tripp et.al. 2310.09267v1 null
2023-10-13 Towards End-to-end 4-Bit Inference on Generative Large Language Models Saleh Ashkboos et.al. 2310.09259v1 link
2023-10-12 Octopus: Embodied Vision-Language Programmer from Environmental Feedback Jingkang Yang et.al. 2310.08588v1 link
2023-10-12 Is Generalized Dynamic Novel View Synthesis from Monocular Videos Possible Today? Xiaoming Zhao et.al. 2310.08587v1 null
2023-10-12 PonderV2: Pave the Way for 3D Foundataion Model with A Universal Pre-training Paradigm Haoyi Zhu et.al. 2310.08586v1 link
2023-10-12 Discovering Fatigued Movements for Virtual Character Animation Noshaba Cheema et.al. 2310.08583v1 null
2023-10-12 Tree-Planner: Efficient Close-loop Task Planning with Large Language Models Mengkang Hu et.al. 2310.08582v1 null
2023-10-12 Universal Visual Decomposer: Long-Horizon Manipulation Made Easy Zichen Zhang et.al. 2310.08581v1 null
2023-10-12 OmniControl: Control Any Joint at Any Time for Human Motion Generation Yiming Xie et.al. 2310.08580v1 link
2023-10-12 HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion Xian Liu et.al. 2310.08579v1 null
2023-10-12 Learning to Act from Actionless Videos through Dense Correspondences Po-Chen Ko et.al. 2310.08576v1 null
2023-10-12 Jigsaw: Supporting Designers in Prototyping Multimodal Applications by Assembling AI Foundation Models David Chuan-En Lin et.al. 2310.08574v1 null
2023-10-11 InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining Boxin Wang et.al. 2310.07713v1 link
2023-10-11 ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models Yingqing He et.al. 2310.07702v1 link
2023-10-11 Knowledge-enhanced Memory Model for Emotional Support Conversation Mengzhao Jia et.al. 2310.07700v1 null
2023-10-11 From Scarcity to Efficiency: Improving CLIP Training via Visual-enriched Captions Zhengfeng Lai et.al. 2310.07699v1 link
2023-10-11 SurroCBM: Concept Bottleneck Surrogate Models for Generative Post-hoc Explanation Bo Pan et.al. 2310.07698v1 null
2023-10-11 ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation Bo Peng et.al. 2310.07697v1 link
2023-10-11 Large-scale photonic computing with nonlinear disordered media Hao Wang et.al. 2310.07690v1 null
2023-10-10 AutoAD II: The Sequel – Who, When, and What in Movie Audio Description Tengda Han et.al. 2310.06838v1 null
2023-10-10 Generating and Evaluating Tests for K-12 Students with Language Model Simulations: A Case Study on Sentence Reading Efficiency Eric Zelikman et.al. 2310.06837v1 null
2023-10-10 What Does Stable Diffusion Know about the 3D Scene? Guanqi Zhan et.al. 2310.06836v1 link
2023-10-10 Teaching Language Models to Hallucinate Less with Synthetic Tasks Erik Jones et.al. 2310.06827v1 null
2023-10-10 Mistral 7B Albert Q. Jiang et.al. 2310.06825v1 link
2023-10-10 The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets Samuel Marks et.al. 2310.06824v1 link
2023-10-09 Grokking as Compression: A Nonlinear Complexity Perspective Ziming Liu et.al. 2310.05918v1 null
2023-10-09 Drivable Avatar Clothing: Faithful Full-Body Telepresence with Dynamic Clothing Driven by Sparse RGB-D Input Donglai Xiang et.al. 2310.05917v1 null
2023-10-09 FireAct: Toward Language Agent Fine-tuning Baian Chen et.al. 2310.05915v1 null
2023-10-09 SALMON: Self-Alignment with Principle-Following Reward Models Zhiqing Sun et.al. 2310.05910v1 link
2023-10-09 Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts Lizhang Chen et.al. 2310.05898v1 null
2023-10-06 BrainSCUBA: Fine-Grained Natural Language Captions of Visual Cortex Selectivity Andrew F. Luo et.al. 2310.04420v1 null
2023-10-06 Functional Interpolation for Relative Positions Improves Long Context Transformers Shanda Li et.al. 2310.04418v1 null
2023-10-09 CIFAR-10-Warehouse: Broad and More Realistic Testbeds in Model Generalization Analysis Xiaoxiao Sun et.al. 2310.04414v2 null
2023-10-06 FedConv: Enhancing Convolutional Neural Networks for Handling Data Heterogeneity in Federated Learning Peiran Xu et.al. 2310.04412v1 link
2023-10-06 RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation Fangyuan Xu et.al. 2310.04408v1 link
2023-10-06 Policy-Gradient Training of Language Models for Ranking Ge Gao et.al. 2310.04407v1 null
2023-10-06 Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models Andy Zhou et.al. 2310.04406v1 link
2023-10-05 ContactGen: Generative Contact Modeling for Grasp Generation Shaowei Liu et.al. 2310.03740v1 null
2023-10-05 Aligning Text-to-Image Diffusion Models with Reward Backpropagation Mihir Prabhudesai et.al. 2310.03739v1 link
2023-10-05 Stylist: Style-Driven Feature Ranking for Robust Novelty Detection Stefan Smeu et.al. 2310.03738v1 link
2023-10-05 Leveraging Unpaired Data for Vision-Language Generative Models via Cycle Consistency Tianhong Li et.al. 2310.03734v1 null
2023-10-05 MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning Ke Wang et.al. 2310.03731v1 link
2023-10-05 Stochastic interpolants with data-dependent couplings Michael S. Albergo et.al. 2310.03725v1 null
2023-10-04 LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving Hao Sha et.al. 2310.03026v1 null
2023-10-04 Retrieval meets Long Context Large Language Models Peng Xu et.al. 2310.03025v1 null
2023-10-04 Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making Jeonghye Kim et.al. 2310.03022v1 null
2023-10-04 Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models Jianglong Ye et.al. 2310.03020v1 null
2023-10-04 Multimodal Question Answering for Unified Information Extraction Yuxuan Sun et.al. 2310.03017v1 link
2023-10-04 Efficient-3DiM: Learning a Generalizable Single-image Novel-view Synthesizer in One Day Yifan Jiang et.al. 2310.03015v1 null
2023-10-04 SemiReward: A General Reward Model for Semi-supervised Learning Siyuan Li et.al. 2310.03013v1 link
2023-10-04 Towards Domain-Specific Features Disentanglement for Domain Generalization Hao Chen et.al. 2310.03007v1 null
2023-10-05 COOLer: Class-Incremental Learning for Appearance-Based Multiple Object Tracking Zhizheng Liu et.al. 2310.03006v2 link
2023-10-03 Generalizable Long-Horizon Manipulations with Large Language Models Haoyu Zhou et.al. 2310.02264v1 null
2023-10-03 MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts Pan Lu et.al. 2310.02255v1 null
2023-10-03 Talk2BEV: Language-enhanced Bird’s-eye View Maps for Autonomous Driving Vikrant Dewangan et.al. 2310.02251v1 null
2023-10-03 Hierarchical Generation of Human-Object Interactions with Diffusion Probabilistic Models Huaijin Pi et.al. 2310.02242v1 null
2023-10-03 MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens Kaizhi Zheng et.al. 2310.02239v1 link
2023-09-29 Efficient Streaming Language Models with Attention Sinks Guangxuan Xiao et.al. 2309.17453v1 link
2023-10-02 L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models Ansong Ni et.al. 2309.17446v2 null
2023-10-02 LLM-grounded Video Diffusion Models Long Lian et.al. 2309.17444v2 null
2023-09-29 CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets Lifan Yuan et.al. 2309.17428v1 link
2023-09-28 Learning to Transform for Generalizable Instance-wise Invariance Utkarsh Singhal et.al. 2309.16672v1 link
2023-09-29 Demystifying CLIP Data Hu Xu et.al. 2309.16671v2 link
2023-09-28 RealFill: Reference-Driven Generation for Authentic Image Completion Luming Tang et.al. 2309.16668v1 null
2023-09-28 DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation Jiaxiang Tang et.al. 2309.16653v1 link
2023-09-27 Exploiting the Signal-Leak Bias in Diffusion Models Martin Nicolas Everaert et.al. 2309.15842v1 null
2023-09-27 OrthoPlanes: A Novel Representation for Better 3D-Awareness of GANs Honglin He et.al. 2309.15830v1 null
2023-09-27 LGMCTS: Language-Guided Monte-Carlo Tree Search for Executable Semantic Object Rearrangement Haonan Chang et.al. 2309.15821v1 null
2023-09-27 Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation David Junhao Zhang et.al. 2309.15818v1 link
2023-09-26 Generating Visual Scenes from Touch Fengyu Yang et.al. 2309.15117v1 null
2023-09-27 InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition Pan Zhang et.al. 2309.15112v2 link
2023-09-26 Doduo: Learning Dense Visual Correspondence from Unsupervised Semantic-Aware Flow Zhenyu Jiang et.al. 2309.15110v1 null
2023-09-26 DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation Zeyu Wang et.al. 2309.15109v1 link
2023-09-26 New solution to Airy’s equation for modeling beams near turning points N. A. Lopez et.al. 2309.15108v1 null
2023-09-25 Extreme Parkour with Legged Robots Xuxin Cheng et.al. 2309.14341v1 null
2023-09-25 Chop & Learn: Recognizing and Generating Object-State Compositions Nirat Saini et.al. 2309.14339v1 null
2023-09-25 UnitedHuman: Harnessing Multi-Source Data for High-Resolution Human Generation Jianglin Fu et.al. 2309.14335v1 link
2023-09-25 Tasks Makyth Models: Machine Learning Assisted Surrogates for Tipping Points Gianluca Fabiani et.al. 2309.14334v1 null
2023-09-25 Innovative Digital Storytelling with AIGC: Exploration and Discussion of Recent Advances Rongzhang Gu et.al. 2309.14329v1 null
2023-09-25 pyParaOcean: A System for Visual Analysis of Ocean Data Toshit Jain et.al. 2309.14328v1 null
2023-09-22 E(2)-Equivariant Graph Planning for Navigation Linfeng Zhao et.al. 2309.13043v1 null
2023-09-22 MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation Jiahao Xie et.al. 2309.13042v1 link
2023-09-22 Robotic Offline RL from Internet Videos via Value-Function Pre-Training Chethan Bhateja et.al. 2309.13041v1 null
2023-09-22 Privacy Assessment on Reconstructed Images: Are Existing Evaluation Metrics Faithful to Human Perception? Xiaoxiao Sun et.al. 2309.13038v1 null
2023-09-22 GELLO: A General, Low-Cost, and Intuitive Teleoperation Framework for Robot Manipulators Philipp Wu et.al. 2309.13037v1 null
2023-09-22 A numerical framework for simulating progressive failure in composite laminates under high-cycle fatigue loading Pieter Hofman et.al. 2309.13030v1 null
2023-09-21 LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent Jianing Yang et.al. 2309.12311v1 null
2023-09-21 Rehearsal: Simulating Conflict to Teach Conflict Resolution Omar Shaikh et.al. 2309.12309v1 null
2023-09-21 Text-Guided Vector Graphics Customization Peiying Zhang et.al. 2309.12302v1 null
2023-09-21 Environment-biased Feature Ranking for Novelty Detection Robustness Stefan Smeu et.al. 2309.12301v1 null
2023-09-21 Reranking for Natural Language Generation from Logical Forms: A Study based on Large Language Models Levon Haroutunian et.al. 2309.12294v1 null
2023-09-20 A Large-scale Dataset for Audio-Language Representation Learning Luoyi Sun et.al. 2309.11500v1 null
2023-09-20 DreamLLM: Synergistic Multimodal Comprehension and Creation Runpei Dong et.al. 2309.11499v1 link
2023-09-20 FreeU: Free Lunch in Diffusion U-Net Chenyang Si et.al. 2309.11497v1 link
2023-09-20 Chain-of-Verification Reduces Hallucination in Large Language Models Shehzaad Dhuliawala et.al. 2309.11495v1 null
2023-09-21 Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning Tianbao Xie et.al. 2309.11489v2 link
2023-09-19 PanopticNeRF-360: Panoramic 3D-to-2D Label Transfer in Urban Scenes Xiao Fu et.al. 2309.10815v1 link
2023-09-19 Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning Tianhua Zhang et.al. 2309.10814v1 link
2023-09-19 PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance Peiqing Yang et.al. 2309.10810v1 link
2023-09-20 AI Foundation Models for Weather and Climate: Applications, Design, and Implementation S. Karthik Mukkavilli et.al. 2309.10808v2 null
2023-09-19 Heuristic Search for Path Finding with Refuelling Anushtup Nandy et.al. 2309.10796v1 null
2023-09-19 Guide Your Agent with Adaptive Multimodal Rewards Changyeon Kim et.al. 2309.10790v1 link
2023-09-18 General In-Hand Object Rotation with Vision and Touch Haozhi Qi et.al. 2309.09979v1 null
2023-09-18 GEDepth: Ground Embedding for Monocular Depth Estimation Xiaodong Yang et.al. 2309.09975v1 link
2023-09-19 MindAgent: Emergent Gaming Interaction Ran Gong et.al. 2309.09971v2 null
2023-09-18 Empirical Study of Mix-based Data Augmentation Methods in Physiological Time Series Data Peikun Guo et.al. 2309.09970v1 link
2023-09-18 Prompt a Robot to Walk with Large Language Models Yen-Jen Wang et.al. 2309.09969v1 link
2023-09-18 Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees Alexia Jolicoeur-Martineau et.al. 2309.09968v1 link
2023-09-15 Robust e-NeRF: NeRF from Sparse & Noisy Events under Non-Uniform Motion Weng Fei Low et.al. 2309.08596v1 link
2023-09-15 Chain-of-Thought Reasoning is a Policy Improvement Operator Hugh Zhang et.al. 2309.08589v1 null
2023-09-15 Robust Frame-to-Frame Camera Rotation Estimation in Crowded Scenes Fabien Delattre et.al. 2309.08588v1 null
2023-09-15 Compositional Foundation Models for Hierarchical Planning Anurag Ajay et.al. 2309.08587v1 null
2023-09-15 Viewpoint Integration and Registration with Vision Language Foundation Model for Image Change Understanding Xiaonan Lu et.al. 2309.08585v1 null
2023-09-15 ICLEF: In-Context Learning with Expert Feedback for Explainable Style Transfer Arkadiy Saakyan et.al. 2309.08583v1 link
2023-09-15 Large-Vocabulary 3D Diffusion Model with Transformer Ziang Cao et.al. 2309.07920v2 null
2023-09-14 Unified Human-Scene Interaction via Prompted Chain-of-Contacts Zeqi Xiao et.al. 2309.07918v1 link
2023-09-14 Looking at words and points with attention: a benchmark for text-to-shape coherence Andrea Amaduzzi et.al. 2309.07917v1 null
2023-09-14 MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning Haozhe Zhao et.al. 2309.07915v1 link
2023-09-14 ALWOD: Active Learning for Weakly-Supervised Object Detection Yuting Wang et.al. 2309.07914v1 link
2023-09-14 Why would you put a flashlight in a dark matter detector? R. Gibbons et.al. 2309.07913v1 null
2023-09-14 TEMPO: Efficient Multi-View Pose Estimation, Tracking, and Forecasting Rohan Choudhury et.al. 2309.07910v1 null
2023-09-14 Physically Plausible Full-Body Hand-Object Interaction Synthesis Jona Braun et.al. 2309.07907v1 null
2023-09-14 Generative Image Dynamics Zhengqi Li et.al. 2309.07906v1 null
2023-09-13 Text-Guided Generation and Editing of Compositional 3D Avatars Hao Zhang et.al. 2309.07125v1 null
2023-09-13 RAIN: Your Language Models Can Align Themselves without Finetuning Yuhui Li et.al. 2309.07124v1 link
2023-09-13 Tree-Structured Shading Decomposition Chen Geng et.al. 2309.07122v1 null
2023-09-13 Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics Haoqin Tu et.al. 2309.07120v1 link
2023-09-13 Weakly-Supervised Multi-Task Learning for Audio-Visual Speaker Verification Anith Selvakumar et.al. 2309.07115v1 null
2023-09-13 Contrastive Deep Encoding Enables Uncertainty-aware Machine-learning-assisted Histopathology Nirhoshan Sivaroopan et.al. 2309.07113v1 null
2023-09-13 Hardening RGB-D Object Recognition Systems against Adversarial Patch Attacks Yang Zheng et.al. 2309.07106v1 null
2023-09-12 Learning Disentangled Avatars with Hybrid 3D Representations Yao Feng et.al. 2309.06441v1 null
2023-09-12 Unveiling the potential of large language models in generating semantic and cross-language clones Palash R. Roy et.al. 2309.06424v1 null
2023-09-12 C4CAM: A Compiler for CAM-based In-memory Accelerators Hamid Farzaneh et.al. 2309.06418v1 null
2023-09-12 Robot Parkour Learning Ziwen Zhuang et.al. 2309.05665v2 null
2023-09-11 Diffusion-Guided Reconstruction of Everyday Hand-Object Interaction Clips Yufei Ye et.al. 2309.05663v1 null
2023-09-11 ViHOPE: Visuotactile In-Hand Object 6D Pose Estimation with Shape Completion Hongyu Li et.al. 2309.05662v1 null
2023-09-11 Hypothesis Search: Inductive Reasoning with Language Models Ruocheng Wang et.al. 2309.05660v1 null
2023-09-11 From Capture to Display: A Survey on Volumetric Video Yili Jin et.al. 2309.05658v1 null
2023-09-11 MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning Xiang Yue et.al. 2309.05653v1 null
2023-09-11 Data efficiency, dimensionality reduction, and the generalized symmetric information bottleneck K. Michael Martini et.al. 2309.05649v1 null
2023-09-08 On the Actionability of Outcome Prediction Lydia T. Liu et.al. 2309.04470v1 null
2023-09-08 Generalized Cross-domain Multi-label Few-shot Learning for Chest X-rays Aroof Aimen et.al. 2309.04462v1 null
2023-09-08 Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models Yangyi Chen et.al. 2309.04461v1 link
2023-09-08 Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning David Yunis et.al. 2309.04459v1 null
2023-09-08 Effect of Electron-Phonon Interactions on Three-Level QD-based Spaser: Linear and Quadratic Potentials Ankit Purohit et.al. 2309.04448v1 null
2023-09-07 ImageBind-LLM: Multi-modality Instruction Tuning Jiaming Han et.al. 2309.03905v1 link
2023-09-07 Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis Jiapeng Zhu et.al. 2309.03904v1 link
2023-09-07 Tracking Anything with Decoupled Video Segmentation Ho Kei Cheng et.al. 2309.03903v1 link
2023-09-07 The Making and Breaking of Camouflage Hala Lamdouar et.al. 2309.03899v1 null
2023-09-07 InstructDiffusion: A Generalist Modeling Interface for Vision Tasks Zigang Geng et.al. 2309.03895v1 null
2023-09-07 DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection Manlin Zhang et.al. 2309.03893v1 null
2023-09-07 ArtiGrasp: Physically Plausible Synthesis of Bi-Manual Dexterous Grasping and Articulation Hui Zhang et.al. 2309.03891v1 null
2023-09-06 My Art My Choice: Adversarial Protection Against Unruly AI Anthony Rhodes et.al. 2309.03198v1 null
2023-09-06 Electrocaloric Response of the Dense Ferroelectric Nanocomposites Anna N. Morozovska et.al. 2309.03187v1 null
2023-09-06 SLiMe: Segment Like Me Aliasghar Khani et.al. 2309.03179v1 link
2023-09-05 ReliTalk: Relightable Talking Portrait Generation from a Single Video Haonan Qiu et.al. 2309.02434v1 link
2023-09-05 Generating Realistic Images from In-the-wild Sounds Taegyeong Lee et.al. 2309.02405v1 null
2023-09-01 OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation Zhening Huang et.al. 2309.00616v1 link
2023-09-01 Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following Ziyu Guo et.al. 2309.00615v1 link
2023-09-01 Iterative Multi-granular Image Editing using Diffusion Models K J Joseph et.al. 2309.00613v1 null
2023-09-01 CityDreamer: Compositional Generative Model of Unbounded 3D Cities Haozhe Xie et.al. 2309.00610v1 link
2023-09-01 Copiloting the Copilots: Fusing Large Language Models with Completion Engines for Automated Program Repair Yuxiang Wei et.al. 2309.00608v1 link
2023-08-31 PointLLM: Empowering Large Language Models to Understand Point Clouds Runsen Xu et.al. 2308.16911v1 link
2023-08-31 StyleInV: A Temporal Style Modulated Inversion Network for Unconditional Video Generation Yuhan Wang et.al. 2308.16909v1 link
2023-08-31 Fine-Grained Cross-View Geo-Localization Using a Correlation-Aware Homography Estimator Xiaolong Wang et.al. 2308.16906v1 link
2023-08-31 InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion Sirui Xu et.al. 2308.16905v1 link
2023-08-31 Transformers as Support Vector Machines Davoud Ataee Tarzanagh et.al. 2308.16898v1 link
2023-09-01 GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields Yanjie Ze et.al. 2308.16891v2 link
2023-08-31 Prediction of Diblock Copolymer Morphology via Machine Learning Hyun Park et.al. 2308.16886v1 null
2023-08-30 Learning Vision-based Pursuit-Evasion Robot Policies Andrea Bajcsy et.al. 2308.16185v1 null
2023-08-30 SAM-Med2D Junlong Cheng et.al. 2308.16184v1 link
2023-08-30 GREC: Generalized Referring Expression Comprehension Shuting He et.al. 2308.16182v1 link
2023-08-30 Framework and Methodology for Verification of a Complex Scientific Simulation Software, Flash-X Akash Dhruv et.al. 2308.16180v1 null
2023-08-30 General Purpose Audio Effect Removal Matthew Rice et.al. 2308.16177v1 link
2023-08-30 Quantifying Uncertainty in Answers from any Language Model via Intrinsic and Extrinsic Confidence Assessment Jiuhai Chen et.al. 2308.16175v1 null
2023-08-29 3D Adversarial Augmentations for Robust Out-of-Domain Predictions Alexander Lehner et.al. 2308.15479v1 null
2023-08-29 A General-Purpose Self-Supervised Model for Computational Pathology Richard J. Chen et.al. 2308.15474v1 null
2023-08-29 Learning Modulated Transformation in GANs Ceyuan Yang et.al. 2308.15472v1 link
2023-08-29 Input margins can predict generalization too Coenraad Mouton et.al. 2308.15466v1 null
2023-08-30 Sharing proofs with predicative theories through universe polymorphic elaboration Thiago Felicissimo et.al. 2308.15465v2 link
2023-08-29 ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer Zachary Horvitz et.al. 2308.15459v1 link
2023-08-29 From SMOTE to Mixup for Deep Imbalanced Classification Wei-Chao Cheng et.al. 2308.15457v1 link
2023-08-28 AI Deception: A Survey of Examples, Risks, and Potential Solutions Peter S. Park et.al. 2308.14752v1 null
2023-08-28 MagicAvatar: Multimodal Avatar Generation and Animation Jianfeng Zhang et.al. 2308.14748v1 null
2023-08-28 CoVR: Learning Composed Video Retrieval from Web Video Captions Lucas Ventura et.al. 2308.14746v1 link
2023-08-28 Advancement on Security Applications of Private Intersection Sum Protocol Yuvaray Athur Raghuvir et.al. 2308.14741v1 null
2023-08-28 Total Selfie: Generating Full-Body Selfies Bowei Chen et.al. 2308.14740v1 null
2023-08-28 Bayesian artificial brain with ChatGPT Renato A. Krohling et.al. 2308.14732v1 null
2023-08-28 Distilled GPT for Source Code Summarization Chia-Yi Su et.al. 2308.14731v1 link
2023-08-25 ChatGPT as Data Augmentation for Compositional Generalization: A Case Study in Open Intent Detection Yihao Fang et.al. 2308.13517v1 link
2023-08-25 Does Asking Clarifying Questions Increases Confidence in Generated Code? On the Communication Skills of Large Language Models Jie JW Wu et.al. 2308.13507v1 null
2023-08-25 A2Q: Accumulator-Aware Quantization with Guaranteed Overflow Avoidance Ian Colbert et.al. 2308.13504v1 null
2023-08-25 Attending Generalizability in Course of Deep Fake Detection by Exploring Multi-task Learning Pranav Balaji et.al. 2308.13503v1 null
2023-08-24 ROAM: Robust and Object-aware Motion Generation using Neural Pose Descriptors Wanyue Zhang et.al. 2308.12969v1 null
2023-08-24 Dense Text-to-Image Generation with Attention Modulation Yunji Kim et.al. 2308.12964v1 link
2023-08-24 MapPrior: Bird’s-Eye View Map Layout Estimation with Generative Models Xiyue Zhu et.al. 2308.12963v1 null
2023-08-24 Motion-Guided Masking for Spatiotemporal Representation Learning David Fan et.al. 2308.12962v1 null
2023-08-24 Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks Xiangyang Zhu et.al. 2308.12961v1 link
2023-08-24 Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment Sheng Zhang et.al. 2308.12960v1 link
2023-08-24 Semi-analytical Framework for Modeling Strong Coupling of Quantum Emitters in Electromagnetic Resonators Mohammad Abutoama et.al. 2308.12957v1 null
2023-08-24 A new framework for global data regulation Ellie Graeden et.al. 2308.12955v1 null
2023-08-24 BridgeData V2: A Dataset for Robot Learning at Scale Homer Walke et.al. 2308.12952v1 link
2023-08-24 Label Budget Allocation in Multi-Task Learning Ximeng Sun et.al. 2308.12949v1 null
2023-08-23 CHORUS: Learning Canonicalized 3D Human-Object Spatial Relations from Unbounded Synthesized Images Sookwan Han et.al. 2308.12288v1 null
2023-08-23 Devising and Detecting Phishing: large language models vs. Smaller Human Models Fredrik Heiding et.al. 2308.12287v1 null
2023-08-23 On-Manifold Projected Gradient Descent Aaron Mahler et.al. 2308.12279v1 null
2023-08-24 A Model for Integrating Generative AI into Course Content Development Ethan Dickey et.al. 2308.12276v2 null
2023-08-23 Spatial clustering of temporal energy profiles with empirical orthogonal functions and max-p regionalization Claire Halloran et.al. 2308.12274v1 null
2023-08-23 Simple is Better and Large is Not Enough: Towards Ensembling of Foundational Language Models Nancy Tyagi et.al. 2308.12272v1 null
2023-08-23 A Generative Approach for Image Registration of Visible-Thermal (VT) Cancer Faces Catherine Ordun et.al. 2308.12271v1 null
2023-08-23 Language Reward Modulation for Pretraining Reinforcement Learning Ademi Adeniji et.al. 2308.12270v1 link
2023-08-22 GRIP: Generating Interaction Poses Using Latent Consistency and Spatial Cues Omid Taheri et.al. 2308.11617v1 null
2023-08-22 StoryBench: A Multifaceted Benchmark for Continuous Story Visualization Emanuele Bugliarello et.al. 2308.11606v1 link
2023-08-22 GOPro: Generate and Optimize Prompts in CLIP using Self-Supervised Learning Mainak Singha et.al. 2308.11605v1 null
2023-08-22 Towards Universal Interaction for Extended Reality Pascal Knierim et.al. 2308.11600v1 null
2023-08-22 Theory of Transverse Mode Instability in Fiber Amplifiers with Multimode Excitations Kabish Wisal et.al. 2308.11599v1 null
2023-08-22 Vision-Based Intelligent Robot Grasping Using Sparse Neural Network Priya Shukla et.al. 2308.11590v1 null
2023-08-21 Structured World Models from Human Videos Russell Mendonca et.al. 2308.10901v1 null
2023-08-21 TADA! Text to Animatable Digital Avatars Tingting Liao et.al. 2308.10899v1 null
2023-08-21 Few-Shot Physically-Aware Articulated Mesh Generation via Hierarchical Deformation Xueyi Liu et.al. 2308.10898v1 link
2023-08-21 Can Language Models Learn to Listen? Evonne Ng et.al. 2308.10897v1 null
2023-08-21 Differentiable Shadow Mapping for Efficient Inverse Graphics Markus Worchel et.al. 2308.10896v1 link
2023-08-21 Proton-Boron Fusion Yield Increased by Orders of Magnitude with Foam Targets Wen-Qing Wei et.al. 2308.10878v1 null
2023-08-21 Analyzing Transformer Dynamics as Movement through Embedding Space Sumeet S. Singh et.al. 2308.10874v1 null
2023-08-18 HumanLiff: Layer-wise 3D Human Generation with Diffusion Model Shoukang Hu et.al. 2308.09712v1 null
2023-08-18 Robust Monocular Depth Estimation under Challenging Conditions Stefano Gasperini et.al. 2308.09711v1 null
2023-08-18 SimDA: Simple Diffusion Adapter for Efficient Video Generation Zhen Xing et.al. 2308.09710v1 null
2023-08-18 Training with Product Digital Twins for AutoRetail Checkout Yue Yao et.al. 2308.09708v1 link
2023-08-18 Guide3D: Create 3D Avatars from Text and Image Guidance Yukang Cao et.al. 2308.09705v1 null
2023-08-18 Counting and Sampling Labeled Chordal Graphs in Polynomial Time Ursula Hebert-Johnson et.al. 2308.09703v1 null
2023-08-16 TeCH: Text-guided Reconstruction of Lifelike Clothed Humans Yangyi Huang et.al. 2308.08545v1 link
2023-08-16 InsightMapper: A Closer Look at Inner-instance Information for Vectorized High-Definition Mapping Zhenhua Xu et.al. 2308.08543v1 null
2023-08-15 Enumerating Tarski fixed points on lattices of binary relations Julian Müller et.al. 2308.07923v1 null
2023-08-15 Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification Aojun Zhou et.al. 2308.07921v1 null
2023-08-15 The Regular Expression Inference Challenge Mojtaba Valizadeh et.al. 2308.07899v1 null
2023-08-15 A Foundation LAnguage-Image model of the Retina (FLAIR): Encoding expert knowledge in text supervision Julio Silva-Rodriguez et.al. 2308.07898v1 link
2023-08-14 Jurassic World Remake: Bringing Ancient Fossils Back to Life via Zero-Shot Long Image-to-Image Translation Alexander Martin et.al. 2308.07316v1 link
2023-08-14 Reinforcing Security and Usability of Crypto-Wallet with Post-Quantum Cryptography and Zero-Knowledge Proof Yathin Kethepalli et.al. 2308.07309v1 null
2023-08-15 LLM Self Defense: By Self Examination, LLMs Know They Are Being Tricked Alec Helbling et.al. 2308.07308v2 null
2023-08-14 Extend Wave Function Collapse to Large-Scale Content Generation Yuhe Nie et.al. 2308.07307v1 null
2023-08-14 Neural Authorship Attribution: Stylometric Analysis on Large Language Models Tharindu Kumarage et.al. 2308.07305v1 link
2023-08-14 DiffSED: Sound Event Detection with Denoising Diffusion Swapnil Bhosale et.al. 2308.07293v1 null
2023-08-11 Foundation Model is Efficient Multimodal Multitask Model Selector Fanqing Meng et.al. 2308.06262v1 link
2023-08-11 Enhancing Network Management Using Code Generated by Large Language Models Sathiya Kumaran Mani et.al. 2308.06261v1 link
2023-08-11 Self-Alignment with Instruction Backtranslation Xian Li et.al. 2308.06259v1 null
2023-08-11 NEMA NU 2-2018 performance evaluation of a new generation digital 32-cm axial field-of-view Omni Legend PET-CT Rhodri Lyn Smith et.al. 2308.06255v1 null
2023-08-11 Fundamental Limits on Subwavelength Range Resolution Andrew N. Jordan et.al. 2308.06252v1 null
2023-08-11 ARGUS: Visualization of AI-Assisted Task Guidance in AR Sonia Castelo et.al. 2308.06246v1 null
2023-08-10 PlankAssembly: Robust 3D Reconstruction from Three Orthographic Views with Learnt Shape Programs Wentao Hu et.al. 2308.05744v1 link
2023-08-10 Neural Progressive Meshes Yun-Chun Chen et.al. 2308.05741v1 null
2023-08-10 AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining Haohe Liu et.al. 2308.05734v1 link
2023-08-10 FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models Guangkai Xu et.al. 2308.05733v1 null
2023-08-09 Scene-Generalizable Interactive Segmentation of Radiance Fields Songlin Tang et.al. 2308.05104v1 null
2023-08-09 LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation Leigang Qu et.al. 2308.05095v1 null
2023-08-08 SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore Sewon Min et.al. 2308.04430v1 link
2023-08-08 A Deep-Learning Method Using Auto-encoder and Generative Adversarial Network for Anomaly Detection on Ancient Stone Stele Surfaces Yikun Liu et.al. 2308.04426v1 null
2023-08-08 Density-contrast induced inertial forces on particles in oscillatory flows Siddhansh Agarwal et.al. 2308.04423v1 null
2023-08-08 Near-field 6G Networks: Why Mobile Terahertz Communications MUST Operate in the Near Field Vitaly Petrov et.al. 2308.04418v1 null
2023-08-08 DiffCR: A Fast Conditional Diffusion Framework for Cloud Removal from Optical Satellite Images Xuechao Zou et.al. 2308.04417v1 link
2023-08-07 FSD V2: Improving Fully Sparse 3D Object Detection with Virtual Voxels Lue Fan et.al. 2308.03755v1 link
2023-08-07 Mask Frozen-DETR: High Quality Instance Segmentation with One GPU Zhanhao Liang et.al. 2308.03747v1 null
2023-08-07 A Cost Analysis of Generative Language Models and Influence Operations Micah Musser et.al. 2308.03740v1 link
2023-08-07 Labeling without Seeing? Blind Annotation for Privacy-Preserving Entity Resolution Yixiang Yao et.al. 2308.03734v1 null
2023-08-07 SurvBeX: An explanation method of the machine learning survival models based on the Beran estimator Lev V. Utkin et.al. 2308.03730v1 link
2023-08-04 Recovering non-Maxwellian particle velocity distribution functions from collective Thomson-scattered spectra Bryan C. Foo et.al. 2308.02488v1 null
2023-08-04 Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP Qihang Yu et.al. 2308.02487v1 link
2023-08-04 On the Inherent Anonymity of Gossiping Rachid Guerraoui et.al. 2308.02477v1 null
2023-08-04 Towards Generalist Foundation Model for Radiology Chaoyi Wu et.al. 2308.02463v1 link
2023-08-04 Getting the Ball Rolling: Learning a Dexterous Policy for a Biomimetic Tendon-Driven Hand with Rolling Contact Joints Yasunori Toshimitsu et.al. 2308.02453v1 link
2023-08-03 The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World Weiyun Wang et.al. 2308.01907v1 link
2023-08-03 Revisiting Deformable Convolution for Depth Completion Xinglong Sun et.al. 2308.01905v1 link
2023-08-03 UniSim: A Neural Closed-Loop Sensor Simulator Ze Yang et.al. 2308.01898v1 null
2023-08-03 Strategies for optimizing plasmonic grating couplers with topology-based inverse design Michael Efseaff et.al. 2308.01893v1 null
2023-08-02 ELIXR: Towards a general purpose X-ray artificial intelligence system through alignment of large language models and radiology vision encoders Shawn Xu et.al. 2308.01317v1 null
2023-08-02 Patched Denoising Diffusion Models For High-Resolution Image Synthesis Zheng Ding et.al. 2308.01316v1 link
2023-08-02 More Context, Less Distraction: Visual Classification by Inferring and Conditioning on Contextual Attributes Bang An et.al. 2308.01313v1 link
2023-08-02 TEASMA: A Practical Approach for the Test Assessment of Deep Neural Networks using Mutation Analysis Amin Abbasishahkoo et.al. 2308.01311v1 null
2023-08-02 Revisiting DETR Pre-training for Object Detection Yan Ma et.al. 2308.01300v1 null
2023-08-01 LISA: Reasoning Segmentation via Large Language Model Xin Lai et.al. 2308.00692v1 link
2023-08-01 AnyLoc: Towards Universal Visual Place Recognition Nikhil Keetha et.al. 2308.00688v1 link
2023-08-01 Learning from Hypervectors: A Survey on Hypervector Encoding Sercan Aygun et.al. 2308.00685v1 null
2023-07-31 Conformal PID Control for Time Series Prediction Anastasios N. Angelopoulos et.al. 2307.16895v1 link
2023-07-31 A reduced order model for geometrically parameterized two-scale simulations of elasto-plastic microstructures under large deformations Theron Guo et.al. 2307.16894v1 null
2023-07-31 LEONARDO: A Pan-European Pre-Exascale Supercomputer for HPC and AI Applications Matteo Turisini et.al. 2307.16885v1 null
2023-07-31 HAGRID: A Human-LLM Collaborative Dataset for Generative Information-Seeking with Attribution Ehsan Kamalloo et.al. 2307.16883v1 link
2023-07-31 Image Synthesis under Limited Data: A Survey and Taxonomy Mengping Yang et.al. 2307.16879v1 link
2023-07-31 Revisiting the Parameter Efficiency of Adapters from the Perspective of Precision Redundancy Shibo Jie et.al. 2307.16867v1 link
2023-07-28 Uncertainty in Natural Language Generation: From Theory to Applications Joris Baan et.al. 2307.15703v1 null
2023-07-28 The Strong Maximum Circulation Algorithm: A New Method for Aggregating Preference Rankings Nathan Atkinson et.al. 2307.15702v1 null
2023-07-31 MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking Ruopeng Gao et.al. 2307.15700v2 link
2023-07-28 PatchMixer: Rethinking network design to boost generalization for 3D point cloud understanding Davide Boscaini et.al. 2307.15692v1 link
2023-07-28 Benchmarking Offline Reinforcement Learning on Real-Robot Hardware Nico Gürtler et.al. 2307.15690v1 link
2023-07-27 PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking Yang Zheng et.al. 2307.15055v1 link
2023-07-27 A Geometric Notion of Causal Probing Clément Guerner et.al. 2307.15054v1 null
2023-07-27 A Transformer-based Approach for Arabic Offline Handwritten Text Recognition Saleh Momeni et.al. 2307.15045v1 null
2023-07-27 Universal and Transferable Adversarial Attacks on Aligned Language Models Andy Zou et.al. 2307.15043v1 link
2023-07-27 3-Coloring $C_4$ or $C_3$ -free Diameter Two Graphs Tereza Klimošová et.al. 2307.15036v1 null
2023-07-26 WavJourney: Compositional Audio Creation with Large Language Models Xubo Liu et.al. 2307.14335v1 link
2023-07-26 Towards Generalist Biomedical AI Tao Tu et.al. 2307.14334v1 null
2023-07-26 Waypoint-Based Imitation Learning for Robotic Manipulation Lucy Xiaoyang Shi et.al. 2307.14326v1 null
2023-07-25 Benchmarking and Analyzing Generative Data for Visual Recognition Bo Li et.al. 2307.13697v1 null
2023-07-25 A Compact DAG for Storing and Searching Maximal Common Subsequences Alessio Conte et.al. 2307.13695v1 null
2023-07-25 A Comprehensive Review of Recent Research Trends on UAVs Kaled Telli et.al. 2307.13691v1 null
2023-07-25 Single reference treatment of strongly correlated H $4$ and H${10}$ isomers with Richardson-Gaudin states Paul A. Johnson et.al. 2307.13690v1 null
2023-07-25 All-optical GeV electron bunch generation in a laser-plasma accelerator via truncated-channel injection A. Picksley et.al. 2307.13689v1 null
2023-07-25 The Visual Language of Fabrics Valentin Deschaintre et.al. 2307.13681v1 null
2023-07-25 High Probability Analysis for Non-Convex Stochastic Optimization with Clipping Shaojie Li et.al. 2307.13680v1 null
2023-07-24 A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models Jindong Gu et.al. 2307.12980v1 link
2023-07-24 Evaluating the Ripple Effects of Knowledge Editing in Language Models Roi Cohen et.al. 2307.12976v1 link
2023-07-24 Volcanic ash delimitation using Artificial Intelligence based on Pix2Pix Christian Carrillo et.al. 2307.12970v1 null
2023-07-24 Aligning Large Language Models with Human: A Survey Yufei Wang et.al. 2307.12966v1 link
2023-07-24 RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment Kevin Yang et.al. 2307.12950v1 link
2023-07-24 Boosting Punctuation Restoration with Data Generation and Reinforcement Learning Viet Dac Lai et.al. 2307.12949v1 link
2023-07-21 Advancing Ad Auction Realism: Practical Insights & Modeling Implications Ming Chen et.al. 2307.11732v1 null
2023-07-21 OUTFOX: LLM-generated Essay Detection through In-context Learning with Adversarially Generated Examples Ryuto Koike et.al. 2307.11729v1 link
2023-07-21 Benchmark datasets for biomedical knowledge graphs with negative statements Rita T. Sousa et.al. 2307.11719v1 null
2023-07-20 L-Eval: Instituting Standardized Evaluation for Long Context Language Models Chenxin An et.al. 2307.11088v1 link
2023-07-20 AlignDet: Aligning Pre-training and Fine-tuning in Object Detection Ming Li et.al. 2307.11077v1 link
2023-07-20 OBJECT 3DIT: Language-guided 3D-aware Image Editing Oscar Michel et.al. 2307.11073v1 null
2023-07-19 Adversarial Latent Autoencoder with Self-Attention for Structural Image Synthesis Jiajie Fan et.al. 2307.10166v1 null
2023-07-19 Rethinking Backdoor Attacks Alaa Khaddaj et.al. 2307.10163v1 null
2023-07-19 Robust Driving Policy Learning with Guided Meta Reinforcement Learning Kanghoon Lee et.al. 2307.10160v1 null
2023-07-19 FABRIC: Personalizing Diffusion Models with Iterative Feedback Dimitri von Rütte et.al. 2307.10159v1 link
2023-07-19 Contact-aware Shaping and Maintenance of Deformable Linear Objects With Fixtures Kejia Chen et.al. 2307.10153v1 null
2023-07-18 Forecasting the steam mass flow in a powerplant using the parallel hybrid network Andrii Kurkin et.al. 2307.09483v1 null
2023-07-18 AnyDoor: Zero-shot Object-level Image Customization Xi Chen et.al. 2307.09481v1 link
2023-07-18 ChatSpot: Bootstrapping Multimodal LLMs via Precise Referring Instruction Tuning Liang Zhao et.al. 2307.09474v1 null
2023-07-18 Optimal Vehicle Trajectory Planning for Static Obstacle Avoidance using Nonlinear Optimization Yajia Zhang et.al. 2307.09466v1 null
2023-07-19 Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla Tom Lieberum et.al. 2307.09458v2 null
2023-07-19 A comparative analysis of SRGAN models Fatemeh Rezapoor Nikroo et.al. 2307.09456v2 null
2023-07-18 Solving Knapsack with Small Items via L0-Proximity Ce Jin et.al. 2307.09454v1 null
2023-07-17 Diffusion Models Beat GANs on Image Classification Soumik Mukhopadhyay et.al. 2307.08702v1 null
2023-07-17 AlpaGasus: Training A Better Alpaca with Fewer Data Lichang Chen et.al. 2307.08701v1 link
2023-07-17 Fast model inference and training on-board of Satellites Vít Růžička et.al. 2307.08700v1 link
2023-07-17 Pair then Relation: Pair-Net for Panoptic Scene Graph Generation Jinghao Wang et.al. 2307.08699v1 link
2023-07-17 Flow Matching in Latent Space Quan Dao et.al. 2307.08698v1 link
2023-07-17 FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning Tri Dao et.al. 2307.08691v1 link
2023-07-17 COLLIE: Systematic Construction of Constrained Text Generation Tasks Shunyu Yao et.al. 2307.08689v1 link
2023-07-14 NIFTY: Neural Object Interaction Fields for Guided Human Motion Synthesis Nilesh Kulkarni et.al. 2307.07511v1 null
2023-07-14 A Poisson Decomposition for Information and the Information-Event Diagram Cheuk Ting Li et.al. 2307.07506v1 null
2023-07-14 Exhaustive Generation of Linear Orthogonal Cellular Automata Enrico Formenti et.al. 2307.07505v1 null
2023-07-14 TALL: Thumbnail Layout for Deepfake Video Detection Yuting Xu et.al. 2307.07494v1 link
2023-07-14 BehAVExplor: Behavior Diversity Guided Testing for Autonomous Driving Systems Mingfei Cheng et.al. 2307.07493v1 null
2023-07-14 PseudoCal: A Source-Free Approach to Unsupervised Uncertainty Calibration in Domain Adaptation Dapeng Hu et.al. 2307.07489v1 null
2023-07-13 HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models Nataniel Ruiz et.al. 2307.06949v1 null
2023-07-13 Self-regulating Prompts: Foundational Model Adaptation without Forgetting Muhammad Uzair Khattak et.al. 2307.06948v1 link
2023-07-13 In-context Autoencoder for Context Compression in a Large Language Model Tao Ge et.al. 2307.06945v1 link
2023-07-13 InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation Yi Wang et.al. 2307.06942v1 link
2023-07-13 Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation Yingqing He et.al. 2307.06940v1 link
2023-07-12 Diagnosis, Feedback, Adaptation: A Human-in-the-Loop Framework for Test-Time Policy Adaptation Andi Peng et.al. 2307.06333v1 null
2023-07-12 Deep Learning of Crystalline Defects from TEM images: A Solution for the Problem of “Never Enough Training Data” Kishan Govind et.al. 2307.06322v1 null
2023-07-12 Facial Reenactment Through a Personalized Generator Ariel Elazary et.al. 2307.06307v1 null
2023-07-12 Locally Adaptive Federated Learning via Stochastic Polyak Stepsizes Sohom Mukherjee et.al. 2307.06306v1 link
2023-07-11 Scale Alone Does not Improve Mechanistic Interpretability in Vision Models Roland S. Zimmermann et.al. 2307.05471v1 null
2023-07-12 My3DGen: Building Lightweight Personalized 3D Generative Model Luchao Qi et.al. 2307.05468v2 null
2023-07-11 EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone Shraman Pramanick et.al. 2307.05463v1 link
2023-07-11 Efficient 3D Articulated Human Generation with Layered Surface Volumes Yinghao Xu et.al. 2307.05462v1 null
2023-07-10 Semantic-SAM: Segment and Recognize Anything at Any Granularity Feng Li et.al. 2307.04767v1 link
2023-07-10 Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos Sagnik Majumder et.al. 2307.04760v1 null
2023-07-10 Information decomposition to identify relevant variation in complex systems with machine learning Kieran A. Murphy et.al. 2307.04755v1 link
2023-07-10 Shelving, Stacking, Hanging: Relational Pose Diffusion for Multi-modal Rearrangement Anthony Simeonov et.al. 2307.04751v1 null
2023-07-10 Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback Jaskirat Singh et.al. 2307.04749v1 null
2023-07-07 On the Efficacy of Sampling Adapters Clara Meister et.al. 2307.03749v1 link
2023-07-07 Comparing Traditional and LLM-based Search for Consumer Choice: A Randomized Experiment Sofia Eleni Spatharioti et.al. 2307.03744v1 null
2023-07-07 QIGen: Generating Efficient Kernels for Quantized Inference on Large Language Models Tommaso Pegolotti et.al. 2307.03738v1 link
2023-07-06 Simulating Nelsonian Quantum Field Theory Andrea Carosso et.al. 2307.03188v1 null
2023-07-06 Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers Yuan Gong et.al. 2307.03183v1 link
2023-07-06 Markov Persuasion Processes with Endogenous Agent Beliefs Krishnamurthy Iyer et.al. 2307.03181v1 null
2023-07-07 IPO-LDM: Depth-aided 360-degree Indoor RGB Panorama Outpainting via Latent Diffusion Model Tianhao Wu et.al. 2307.03177v2 null
2023-07-06 Push Past Green: Learning to Look Behind Plant Foliage by Moving It Xiaoyu Zhang et.al. 2307.03175v1 null
2023-07-06 Risk-Averse Trajectory Optimization via Sample Average Approximation Thomas Lew et.al. 2307.03167v1 link
2023-07-06 VideoGLUE: Video General Understanding Evaluation of Foundation Models Liangzhe Yuan et.al. 2307.03166v1 link
2023-07-05 LongNet: Scaling Transformers to 1,000,000,000 Tokens Jiayu Ding et.al. 2307.02486v1 link
2023-07-05 Elastic Decision Transformer Yueh-Hua Wu et.al. 2307.02484v1 link
2023-07-05 Jailbroken: How Does LLM Safety Training Fail? Alexander Wei et.al. 2307.02483v1 null
2023-07-05 Reasoning or Reciting? Exploring the Capabilities and Limitations of Language Models Through Counterfactual Tasks Zhaofeng Wu et.al. 2307.02477v1 link
2023-07-05 The Calissons Puzzle Jean-Marie Favreau et.al. 2307.02475v1 null
2023-07-06 Deductive Additivity for Planning of Natural Language Proofs Zayne Sprague et.al. 2307.02472v2 link
2023-07-05 What Matters in Training a GPT4-Style Language Model with Multimodal Inputs? Yan Zeng et.al. 2307.02469v1 null
2023-07-03 Real-time Monocular Full-body Capture in World Space via Sequential Proxy-to-Motion Learning Yuxiang Zhang et.al. 2307.01200v1 null
2023-07-03 NeuBTF: Neural fields for BTF encoding and transfer Carlos Rodriguez-Pardo et.al. 2307.01199v1 null
2023-07-03 Improved sampling via learned diffusions Lorenz Richter et.al. 2307.01198v1 null
2023-07-03 Segment Anything Meets Point Tracking Frano Rajič et.al. 2307.01197v1 link
2023-07-03 Squeezing Large-Scale Diffusion Models for Mobile Jiwoong Choi et.al. 2307.01193v1 null
2023-07-03 SAMAug: Point Prompt Augmentation for Segment Anything Model Haixing Dai et.al. 2307.01187v1 link
2023-07-03 Continuously Red-Shift and Blue-Shift Wavelength-Tuneable, Narrowband, High Harmonics in the EUV - X-ray Regime for Resonance Imaging and Spectroscopies Dimitar Popmintchev et.al. 2307.01182v1 null
2023-06-30 Hardwiring ViT Patch Selectivity into CNNs using Patch Mixing Ariel N. Lee et.al. 2306.17848v1 null
2023-06-30 Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors Guocheng Qian et.al. 2306.17843v1 link
2023-07-03 SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs Lijun Yu et.al. 2306.17842v2 link
2023-07-03 Statler: State-Maintaining Language Models for Embodied Reasoning Takuma Yoneda et.al. 2306.17840v2 null
2023-06-30 Federated Ensemble YOLOv5 - A Better Generalized Object Detection Algorithm Vinit Hegiste et.al. 2306.17829v1 null
2023-06-30 Understanding Unfairness via Training Concept Influence Yuanshun Yao et.al. 2306.17828v1 null
2023-06-29 An Efficient General-Purpose Modular Vision Model via Multi-Task Heterogeneous Training Zitian Chen et.al. 2306.17165v1 null
2023-06-30 Generative AI for Programming Education: Benchmarking ChatGPT, GPT-4, and Human Tutors Tung Phung et.al. 2306.17156v2 null
2023-06-29 Generate Anything Anywhere in Any Scene Yuheng Li et.al. 2306.17154v1 null
2023-06-28 MultiZoo & MultiBench: A Standardized Toolkit for Multimodal Deep Learning Paul Pu Liang et.al. 2306.16413v1 link
2023-06-29 Even order contributions to relative energies vanish for antisymmetric perturbations O. Anatole von Lilienfeld et.al. 2306.16409v2 null
2023-06-27 Physion++: Evaluating Physical Scene Understanding that Requires Online Inference of Different Physical Properties Hsiao-Yu Tung et.al. 2306.15668v1 null
2023-06-28 PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment Jianyuan Wang et.al. 2306.15667v2 null
2023-06-27 SparseOptimizer: Sparsify Language Models through Moreau-Yosida Regularization and Accelerate through Compiler Co-design Fu-Ming Guo et.al. 2306.15656v1 null
2023-06-27 Optimal Area-Sensitive Bounds for Polytope Approximation Sunil Arya et.al. 2306.15648v1 null
2023-06-26 FunQA: Towards Surprising Video Comprehension Binzhu Xie et.al. 2306.14899v1 link
2023-06-27 InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback John Yang et.al. 2306.14898v2 link
2023-06-26 Supervised Pretraining Can Learn In-Context Reinforcement Learning Jonathan N. Lee et.al. 2306.14892v1 null
2023-06-26 Value of Information in Games with Multiple Strategic Information Providers Raj Kiriti Velicheti et.al. 2306.14886v1 null
2023-06-26 Restart Sampling for Improving Generative Processes Yilun Xu et.al. 2306.14878v1 link
2023-06-26 Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits Yuwei Luo et.al. 2306.14872v1 null
2023-06-26 Composing Parameter-Efficient Modules with Arithmetic Operations Jinghan Zhang et.al. 2306.14870v1 link
2023-06-23 GKD: Generalized Knowledge Distillation for Auto-regressive Sequence Models Rishabh Agarwal et.al. 2306.13649v1 null
2023-06-23 Offline Skill Graph (OSG): A Framework for Learning and Planning using Offline Reinforcement Learning Skills Ben-ya Halevy et.al. 2306.13630v1 null
2023-06-22 Evading Forensic Classifiers with Attribute-Conditioned Adversarial Faces Fahad Shamshad et.al. 2306.13091v1 link
2023-06-22 PromptIR: Prompting for All-in-One Blind Image Restoration Vaishnav Potlapalli et.al. 2306.13090v1 link
2023-06-22 Improved Signal Detection for Ambient Backscatter Communications S. Zargari et.al. 2306.13083v1 null
2023-06-21 VisoGender: A dataset for benchmarking gender bias in image-text pronoun resolution Siobhan Mackenzie Hall et.al. 2306.12424v1 link
2023-06-21 Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase Qiuyu Wang et.al. 2306.12423v1 link
2023-06-21 LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models Shizhe Diao et.al. 2306.12420v1 link
2023-06-21 Coqlex: Generating Formally Verified Lexers Wendlasida Ouedraogo et.al. 2306.12411v1 null
2023-06-20 Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning Huiguo He et.al. 2306.11731v1 null
2023-06-20 Dense Video Object Captioning from Disjoint Supervision Xingyi Zhou et.al. 2306.11729v1 link
2023-06-20 Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision Ayush Tewari et.al. 2306.11719v1 null
2023-06-20 Multi-Fidelity Active Learning with GFlowNets Alex Hernandez-Garcia et.al. 2306.11715v1 link
2023-06-20 Data-Driven but Privacy-Conscious: Pedestrian Dataset De-identification via Full-Body Person Synthesis Maxim Maximov et.al. 2306.11710v1 null
2023-06-16 Just One Byte (per gradient): A Note on Low-Bandwidth Decentralized Language Model Finetuning Using Shared Randomness Eric Zelikman et.al. 2306.10015v1 link
2023-06-20 CLIP2Protect: Protecting Facial Privacy using Text-Guided Makeup via Adversarial Latent Search Fahad Shamshad et.al. 2306.10008v2 link
2023-06-16 C2F2NeUS: Cascade Cost Frustum Fusion for High Fidelity and Generalizable Neural Surface Reconstruction Luoyuan Xu et.al. 2306.10003v1 null
2023-06-16 SLACK: Stable Learning of Augmentations with Cold-start and KL regularization Juliette Marrie et.al. 2306.09998v1 null
2023-06-16 Fairness in Preference-based Reinforcement Learning Umer Siddique et.al. 2306.09995v1 null
2023-06-16 Rosetta Neurons: Mining the Common Units in a Model Zoo Amil Dravid et.al. 2306.09346v2 null
2023-06-15 Evaluating Data Attribution for Text-to-Image Models Sheng-Yu Wang et.al. 2306.09345v1 link
2023-06-15 DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data Stephanie Fu et.al. 2306.09344v1 link
2023-06-15 Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis Xiaoshi Wu et.al. 2306.09341v1 link
2023-06-15 Span-Selective Linear Attention Transformers for Effective and Robust Schema-Guided Dialogue State Tracking Björn Bebensee et.al. 2306.09340v1 null
2023-06-15 From BERT to GPT-3 Codex: Harnessing the Potential of Very Large Language Models for Data Management Immanuel Trummer et.al. 2306.09339v1 null
2023-06-15 Generative Proxemics: A Prior for 3D Social Interaction from Images Lea Müller et.al. 2306.09337v1 link
2023-06-15 Fit Like You Sample: Sample-Efficient Generalized Score Matching from Fast Mixing Markov Chains Yilong Qin et.al. 2306.09332v1 null
2023-06-15 ArtFusion: Arbitrary Style Transfer using Dual Conditional Latent Diffusion Models Dar-Yen Chen et.al. 2306.09330v1 link
2023-06-13 XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models Omkar Thawkar et.al. 2306.07971v1 link
2023-06-13 GeneCIS: A Benchmark for General Conditional Image Similarity Sagar Vaze et.al. 2306.07969v1 null
2023-06-13 One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning Arnav Chavan et.al. 2306.07967v1 link
2023-06-13 Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation Shuai Yang et.al. 2306.07954v1 null
2023-06-12 Waffling around for Performance: Visual Classification with Random Words and Broad Concepts Karsten Roth et.al. 2306.07282v1 link
2023-06-12 Controlling Text-to-Image Diffusion by Orthogonal Finetuning Zeju Qiu et.al. 2306.07280v1 null
2023-06-12 Scalable 3D Captioning with Pretrained Models Tiange Luo et.al. 2306.07279v1 link
2023-06-12 Mathematical conjecture generation using machine intelligence Challenger Mishra et.al. 2306.07277v1 null
2023-06-12 Operator Learning with Neural Fields: Tackling PDEs on General Geometries Louis Serrano et.al. 2306.07266v1 link
2023-06-12 On the Collocated Form with Input Decoupling of Lagrangian Systems Pietro Pustina et.al. 2306.07258v1 null
2023-06-09 Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding Mu Cai et.al. 2306.06094v1 null
2023-06-09 HyP-NeRF: Learning Improved NeRF Priors using a HyperNetwork Bipasha Sen et.al. 2306.06093v1 null
2023-06-09 Computational Flash Photography through Intrinsics Sepideh Sarajian Maralan et.al. 2306.06089v1 null
2023-06-09 SENS: Sketch-based Implicit Neural Shape Modeling Alexandre Binninger et.al. 2306.06088v1 null
2023-06-09 Learning Not to Spoof David Byrd et.al. 2306.06087v1 null
2023-06-09 Developing Speech Processing Pipelines for Police Accountability Anjalie Field et.al. 2306.06086v1 null
2023-06-08 Background Prompting for Improved Object Depth Manel Baradad et.al. 2306.05428v1 null
2023-06-08 Grounded Text-to-Image Synthesis with Attention Refocusing Quynh Phung et.al. 2306.05427v1 null
2023-06-08 SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking Chris Cundy et.al. 2306.05426v1 null
2023-06-08 MIMIC-IT: Multi-Modal In-Context Instruction Tuning Bo Li et.al. 2306.05425v1 link
2023-06-08 Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models Muhammad Maaz et.al. 2306.05424v1 link
2023-06-08 ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process Changyao Tian et.al. 2306.05423v1 null
2023-06-08 Stochastic Multi-Person 3D Motion Forecasting Sirui Xu et.al. 2306.05421v1 link
2023-06-08 Scaling Spherical CNNs Carlos Esteves et.al. 2306.05420v1 link
2023-06-08 2D Supervised Monocular 3D Object Detection by Global-to-Local 3D Reconstruction Jiawei He et.al. 2306.05418v1 null
2023-06-07 Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection Yu Bai et.al. 2306.04637v1 link
2023-06-07 GP-UNIT: Generative Prior for Versatile Unsupervised Image-to-Image Translation Shuai Yang et.al. 2306.04636v1 link
2023-06-07 On the Reliability of Watermarks for Large Language Models John Kirchenbauer et.al. 2306.04634v1 link
2023-06-07 Designing a Better Asymmetric VQGAN for StableDiffusion Zixin Zhu et.al. 2306.04632v1 link
2023-06-07 Goal-conditioned GFlowNets for Controllable Multi-Objective Molecular Design Julien Roy et.al. 2306.04620v1 null
2023-06-07 Helicity-dependent optical control of the magnetization state emerging from the Landau-Lifshitz-Gilbert equation Benjamin Assouline et.al. 2306.04617v1 null
2023-06-07 ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory Chenxu Hu et.al. 2306.03901v2 null
2023-06-06 Model Spider: Learning to Rank Pre-Trained Models Efficiently Yi-Kai Zhang et.al. 2306.03900v1 null
2023-06-06 Towards Label-free Scene Understanding by Vision Foundation Models Runnan Chen et.al. 2306.03899v1 link
2023-06-05 Is ChatGPT a Good Teacher Coach? Measuring Zero-Shot Performance For Scoring and Providing Actionable Insights on Classroom Instruction Rose E. Wang et.al. 2306.03090v1 link
2023-06-05 Brain Diffusion for Visual Exploration: Cortical Discovery using Large Scale Generative Models Andrew F. Luo et.al. 2306.03089v1 null
2023-06-05 DeepGraphDMD: Interpretable Spatio-Temporal Decomposition of Non-linear Functional Brain Network Dynamics Md Asadullah Turja et.al. 2306.03088v1 link
2023-06-05 MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion Chiyu Max Jiang et.al. 2306.03083v1 null
2023-06-05 InstructZero: Efficient Instruction Optimization for Black-Box Large Language Models Lichang Chen et.al. 2306.03082v1 link
2023-06-05 Sequential Monte Carlo Steering of Large Language Models using Probabilistic Programs Alexander K. Lew et.al. 2306.03081v1 link
2023-06-05 A General Perspective on Objectives of Reinforcement Learning Long Yang et.al. 2306.03074v1 null
2023-06-05 Explore to Generalize in Zero-Shot RL Ev Zisselman et.al. 2306.03072v1 link
2023-06-02 Multilingual Conceptual Coverage in Text-to-Image Models Michael Saxon et.al. 2306.01735v1 link
2023-06-02 DocFormerv2: Local Features for Document Understanding Srikar Appalaraju et.al. 2306.01733v1 null
2023-06-02 Video Colorization with Pre-trained Text-to-Image Diffusion Models Hanyuan Liu et.al. 2306.01732v1 null
2023-06-02 Improving Generalization in Task-oriented Dialogues with Workflows and Action Plans Stefania Raimondo et.al. 2306.01729v1 null
2023-06-02 Denoising Diffusion Semantic Segmentation with Mask Prior Modeling Zeqiang Lai et.al. 2306.01721v1 link
2023-06-02 Fresh Content Needs More Attention: Multi-funnel Fresh Content Recommendation Jianling Wang et.al. 2306.01720v1 null
2023-06-02 Discreteness of asymptotic tensor ranks Jop Briët et.al. 2306.01718v1 null
2023-06-01 StyleGAN knows Normal, Depth, Albedo, and More Anand Bhattad et.al. 2306.00987v1 null
2023-06-02 Diffusion Self-Guidance for Controllable Image Generation Dave Epstein et.al. 2306.00986v2 null
2023-06-01 StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners Yonglong Tian et.al. 2306.00984v1 link
2023-06-01 StyleDrop: Text-to-Image Generation in Any Style Kihyuk Sohn et.al. 2306.00983v1 null
2023-06-01 SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds Yanyu Li et.al. 2306.00980v1 link
2023-06-01 AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration Ji Lin et.al. 2306.00978v1 link
2023-06-01 Intriguing Properties of Text-guided Diffusion Models Qihao Liu et.al. 2306.00974v1 link
2023-06-01 Intelligent Grimm – Open-ended Visual Storytelling via Latent Diffusion Models Chang Liu et.al. 2306.00973v1 link
2023-06-01 Too Large; Data Reduction for Vision-Language Pre-Training Alex Jinpeng Wang et.al. 2305.20087v2 link
2023-05-31 Understanding and Mitigating Copying in Diffusion Models Gowthami Somepalli et.al. 2305.20086v1 link
2023-05-31 Control4D: Dynamic Portrait Editing by Learning 4D GAN from 2D Diffusion-based Editor Ruizhi Shao et.al. 2305.20082v1 null
2023-05-31 On the Capacity of Secure $K$ -user Product Computation over a Quantum MAC Yuxiang Lu et.al. 2305.20073v1 null
2023-05-31 Latent Exploration for Reinforcement Learning Alberto Silvio Chiappa et.al. 2305.20065v1 link
2023-05-31 Chatting Makes Perfect – Chat-based Image Retrieval Matan Levy et.al. 2305.20062v1 link
2023-05-30 Concise Answers to Complex Questions: Summarization of Long-form Answers Abhilash Potluri et.al. 2305.19271v1 link
2023-05-30 Microfluidics Generation of Millimeter-sized Matrigel Droplets Cory Arnold et.al. 2305.19261v1 null
2023-05-30 Shuffle SGD is Always Better than SGD: Improved Analysis of SGD with Arbitrary Data Orders Anastasia Koloskova et.al. 2305.19259v1 null
2023-05-30 Ambient Diffusion: Learning Clean Distributions from Corrupted Data Giannis Daras et.al. 2305.19256v1 link
2023-05-30 What Can We Learn from Unlearnable Datasets? Pedro Sandoval-Segura et.al. 2305.19254v1 link
2023-05-29 RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths Zeyue Xue et.al. 2305.18295v1 null
2023-05-29 Transformer Language Models Handle Word Frequency in Prediction Head Goro Kobayashi et.al. 2305.18294v1 null
2023-05-29 Direct Preference Optimization: Your Language Model is Secretly a Reward Model Rafael Rafailov et.al. 2305.18290v1 link
2023-05-29 LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections M. Jehanzeb Mirza et.al. 2305.18287v1 null
2023-05-29 Characterization and evasion of backscattered light in the squeezed-light enhanced gravitational wave interferometer GEO 600 Fabio Bergamin et.al. 2305.18284v1 null
2023-05-29 Contextual Object Detection with Multimodal Large Language Models Yuhang Zang et.al. 2305.18279v1 link
2023-05-26 NeuManifold: Neural Watertight Manifold Reconstruction with Efficient and High-Quality Rendering Support Xinyue Wei et.al. 2305.17134v1 null
2023-05-26 RAMP: Retrieval and Attribute-Marking Enhanced Prompting for Attribute-Controlled Translation Gabriele Sarti et.al. 2305.17131v1 null
2023-05-26 Characterizing and Measuring Linguistic Dataset Drift Tyler A. Chang et.al. 2305.17127v1 link
2023-05-26 Large Language Models as Tool Makers Tianle Cai et.al. 2305.17126v1 link
2023-05-26 Manifold Regularization for Memory-Efficient Training of Deep Neural Networks Shadi Sartipi et.al. 2305.17119v1 null
2023-05-26 Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time Zichang Liu et.al. 2305.17118v1 null
2023-05-26 Improving accuracy of GPT-3/4 results on biomedical data using a retrieval-augmented language model David Soong et.al. 2305.17116v1 null
2023-05-25 Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models Shihao Zhao et.al. 2305.16322v1 link
2023-05-25 Parallel Sampling of Diffusion Models Andy Shih et.al. 2305.16317v1 link
2023-05-25 NAP: Neural 3D Articulation Prior Jiahui Lei et.al. 2305.16315v1 null
2023-05-26 Banana: Banach Fixed-Point Network for Pointcloud Segmentation with Inter-Part Equivariance Congyue Deng et.al. 2305.16314v2 null
2023-05-25 UMat: Uncertainty-Aware Single Image High Resolution Material Capture Carlos Rodriguez-Pardo et.al. 2305.16312v1 null
2023-05-25 Break-A-Scene: Extracting Multiple Concepts from a Single Image Omri Avrahami et.al. 2305.16311v1 link
2023-05-25 Securing Deep Generative Models with Universal Adversarial Signature Yu Zeng et.al. 2305.16310v1 link
2023-05-25 Imitating Task and Motion Planning with Visuomotor Transformers Murtaza Dalal et.al. 2305.16309v1 null
2023-05-25 Fine-Grained Complexity Analysis of Multi-Agent Path Finding on 2D Grids Tzvika Geft et.al. 2305.16303v1 null
2023-05-24 Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective Guhao Feng et.al. 2305.15408v1 link
2023-05-24 Balancing the Picture: Debiasing Vision-Language Datasets with Synthetic Contrast Sets Brandon Smith et.al. 2305.15407v1 link
2023-05-24 Sin3DM: Learning a Diffusion Model from a Single 3D Textured Shape Rundi Wu et.al. 2305.15399v1 link
2023-05-24 LayoutGPT: Compositional Visual Planning and Generation with Large Language Models Weixi Feng et.al. 2305.15393v1 link
2023-05-24 A Neural Space-Time Representation for Text-to-Image Personalization Yuval Alaluf et.al. 2305.15391v1 link
2023-05-24 Peek Across: Improving Multi-Document Modeling via Cross-Document Question-Answering Avi Caciularu et.al. 2305.15387v1 link
2023-05-23 NCHO: Unsupervised Learning for Neural 3D Composition of Humans and Objects Taeksoo Kim et.al. 2305.14345v1 link
2023-05-23 Video Prediction Models as Rewards for Reinforcement Learning Alejandro Escontrela et.al. 2305.14343v1 null
2023-05-23 APPLS: A Meta-evaluation Testbed for Plain Language Summarization Yue Guo et.al. 2305.14341v1 link
2023-05-23 Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence Grace Luo et.al. 2305.14334v1 null
2023-05-23 Evaluating and Modeling Attribution for Cross-Lingual Question Answering Benjamin Muller et.al. 2305.14332v1 null
2023-05-23 Large Language Models are Frame-level Directors for Zero-shot Text-to-Video Generation Susung Hong et.al. 2305.14330v1 link
2023-05-23 Zero-sum Polymatrix Markov Games: Equilibrium Collapse and Efficient Computation of Nash Equilibria Fivos Kalogiannis et.al. 2305.14329v1 null
2023-05-23 Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation Da Yin et.al. 2305.14327v1 link
2023-05-22 Contextualising Implicit Representations for Semantic Tasks Theo W. Costain et.al. 2305.13312v1 null
2023-05-22 VDT: An Empirical Study on Video Diffusion with Transformers Haoyu Lu et.al. 2305.13311v1 link
2023-05-22 Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching Yang Liu et.al. 2305.13310v1 link
2023-05-22 Evaluating Factual Consistency of Texts with Semantic Role Labeling Jing Fan et.al. 2305.13309v1 link
2023-05-22 If at First You Don’t Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection Shyamgopal Karthik et.al. 2305.13308v1 link
2023-05-22 NeRFuser: Large-Scale Scene Representation by NeRF Fusion Jiading Fang et.al. 2305.13307v1 link
2023-05-22 Growth of ultrawide-bandgap BN/diamond heterostructures by pulsed laser deposition Abhijit Biswas et.al. 2305.13306v1 null
2023-05-22 RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text Wangchunshu Zhou et.al. 2305.13304v1 link
2023-05-23 Training Diffusion Models with Reinforcement Learning Kevin Black et.al. 2305.13301v2 link
2023-05-22 Measuring Inductive Biases of In-Context Learning with Underspecified Demonstrations Chenglei Si et.al. 2305.13299v1 link
2023-05-19 Chupa: Carving 3D Clothed Humans from Skinned Shape Priors using 2D Diffusion Probabilistic Models Byungjun Kim et.al. 2305.11870v1 link
2023-05-19 Reducing Sequence Length by Predicting Edit Operations with Large Language Models Masahiro Kaneko et.al. 2305.11862v1 null
2023-05-19 Video Killed the HD-Map: Predicting Driving Behavior Directly From Drone Images Yunpeng Liu et.al. 2305.11856v1 null
2023-05-19 Multimodal Web Navigation with Instruction-Finetuned Foundation Models Hiroki Furuta et.al. 2305.11854v1 null
2023-05-19 Poincare and Einstein on Mass-Energy Equivalence: A Modern Perspective on their 1900 and 1905 Papers Patrick Moylan et.al. 2305.11852v1 null
2023-05-19 Any-to-Any Generation via Composable Diffusion Zineng Tang et.al. 2305.11846v1 link
2023-05-18 Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model Siyuan Huang et.al. 2305.11176v1 link
2023-05-18 VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks Wenhai Wang et.al. 2305.11175v1 link
2023-05-18 Going Denser with Open-Vocabulary Part Segmentation Peize Sun et.al. 2305.11173v1 link
2023-05-18 ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities Peng Wang et.al. 2305.11172v1 link
2023-05-18 TrueTeacher: Learning Factual Consistency Evaluation with Large Language Models Zorik Gekhman et.al. 2305.11171v1 link
2023-05-18 Efficient Prompting via Dynamic In-Context Learning Wangchunshu Zhou et.al. 2305.11170v1 null
2023-05-18 Evidence of Meaning in Language Models Trained on Programs Charles Jin et.al. 2305.11169v1 null
2023-05-17 FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention Guangxuan Xiao et.al. 2305.10431v1 link
2023-05-17 CLIP-GCD: Simple Language Guided Generalized Category Discovery Rabah Ouldnoughi et.al. 2305.10420v1 null
2023-05-17 Towards Multi-Layered 3D Garments Animation Yidi Shao et.al. 2305.10418v1 null
2023-05-17 Scratch Copilot Evaluation: Assessing AI-Assisted Creative Coding for Families Stefania Druga et.al. 2305.10417v1 null
2023-05-18 PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering Xiaoman Zhang et.al. 2305.10415v2 link
2023-05-17 AI Friends: A Design Framework for AI-Powered Creative Programming for Youth Stefania Druga et.al. 2305.10412v1 null
2023-05-17 Data Extraction via Semantic Regular Expression Synthesis Qiaochu Chen et.al. 2305.10401v1 null
2023-05-16 Understanding 3D Object Interaction from a Single Image Shengyi Qian et.al. 2305.09664v1 link
2023-05-16 Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation Samaneh Azadi et.al. 2305.09662v1 null
2023-05-16 Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage Jose Blanchet et.al. 2305.09659v1 null
2023-05-16 Newad: A register map automation tool for Verilog Vamsi K Vytla et.al. 2305.09657v1 null
2023-05-17 Satisfiability-Aided Language Models Using Declarative Prompting Xi Ye et.al. 2305.09656v2 link
2023-05-16 Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation Yuxin Ren et.al. 2305.09651v1 link
2023-05-16 Wavelet-based Unsupervised Label-to-Image Translation George Eskandar et.al. 2305.09647v1 link
2023-05-15 Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models Antoni Bigata Casademunt et.al. 2305.08854v1 link
2023-05-15 CQE: A Comprehensive Quantity Extractor Satya Almasian et.al. 2305.08853v1 link
2023-05-15 MV-Map: Offboard HD-Map Generation with Multi-view Consistency Ziyang Xie et.al. 2305.08851v1 link
2023-05-15 Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts Yuyang Zhao et.al. 2305.08850v1 null
2023-05-15 Privacy Auditing with One (1) Training Run Thomas Steinke et.al. 2305.08846v1 null
2023-05-15 Large Language Models are Zero-Shot Rankers for Recommender Systems Yupeng Hou et.al. 2305.08845v1 link
2023-05-15 RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs Afra Feyza Akyürek et.al. 2305.08844v1 link
2023-05-15 Straightening Out the Straight-Through Estimator: Overcoming Optimization Challenges in Vector Quantized Networks Minyoung Huh et.al. 2305.08842v1 null
2023-05-15 Attacking Perceptual Similarity Metrics Abhijay Ghildyal et.al. 2305.08840v1 null
2023-05-12 Text2Cohort: Democratizing the NCI Imaging Data Commons with Natural Language Cohort Discovery Pranav Kulkarni et.al. 2305.07637v1 link
2023-05-12 Development of MC/DC: a performant, scalable, and portable Python-based Monte Carlo neutron transport code Ilham Variansyah et.al. 2305.07636v1 link
2023-05-12 Zero-shot Item-based Recommendation via Multi-task Product Knowledge Graph Pre-Training Ziwei Fan et.al. 2305.07633v1 null
2023-05-12 Design, Development, and Evaluation of an Interactive Personalized Social Robot to Monitor and Coach Post-Stroke Rehabilitation Exercises Min Hun Lee et.al. 2305.07632v1 null
2023-05-11 SparseGNV: Generating Novel Views of Indoor Scenes with Sparse Input Views Weihao Cheng et.al. 2305.07024v1 link
2023-05-11 Simple Token-Level Confidence Improves Caption Correctness Suzanne Petryk et.al. 2305.07021v1 null
2023-05-11 A General-Purpose Multilingual Document Encoder Onur Galoğlu et.al. 2305.07016v1 link
2023-05-11 Exploiting Diffusion Prior for Real-World Image Super-Resolution Jianyi Wang et.al. 2305.07015v1 link
2023-05-11 Occam’s razor for AI: Coarse-graining Hammett Inspired Product Ansatz in Chemical Space Marco Bragato et.al. 2305.07010v1 null
2023-05-11 Fair Price Discrimination Siddhartha Banerjee et.al. 2305.07006v1 null
2023-05-11 Subword Segmental Machine Translation: Unifying Segmentation and Target Sentence Generation Francois Meyer et.al. 2305.07005v1 link
2023-05-11 Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting Haoyang Huang et.al. 2305.07004v1 null
2023-05-11 Real-time Manipulation of Liquid Droplets using Photo-responsive Surfactant Xichen Liang et.al. 2305.07002v1 null
2023-05-10 Generalizations and Extensions to Lifting Constructions for Coded Caching V. R. Aravind et.al. 2305.06352v1 null
2023-05-10 RECKONING: Reasoning through Dynamic Knowledge Encoding Zeming Chen et.al. 2305.06349v1 link
2023-05-10 Frequency-Supported Neural Networks for Nonlinear Dynamical System Identification Krzysztof Zając et.al. 2305.06344v1 link
2023-05-10 Incorporating Structured Representations into Pretrained Vision & Language Models Using Scene Graphs Roei Herzig et.al. 2305.06343v1 null
2023-05-10 Generalized Stratified Sampling for Efficient Reliability Assessment of Structures Against Natural Hazards Srinivasan Arunachalam et.al. 2305.06338v1 null
2023-05-10 K-UniMorph: Korean Universal Morphology and its Feature Schema Eunkyul Leah Jo et.al. 2305.06335v1 link
2023-05-10 Direct-Laser-Written Polymer Nanowire Waveguides for Broadband Single Photon Collection from Epitaxial Quantum Dots into a Gaussian-like Mode Edgar Perez et.al. 2305.06333v1 null
2023-05-09 Policy Gradient Methods in the Presence of Symmetries and State Abstractions Prakash Panangaden et.al. 2305.05666v1 link
2023-05-09 ImageBind: One Embedding Space To Bind Them All Rohit Girdhar et.al. 2305.05665v1 link
2023-05-10 InternChat: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language Zhaoyang Liu et.al. 2305.05662v2 link
2023-05-09 TidyBot: Personalized Robot Assistance with Large Language Models Jimmy Wu et.al. 2305.05658v1 link
2023-05-09 Using Knowledge Units of Programming Languages to Recommend Reviewers for Pull Requests: An Empirical Study Md Ahasanuzzaman et.al. 2305.05654v1 null
2023-05-09 Asymmetric $X$-Secure $T$ -Private Information Retrieval: More Databases is Not Always Better Mohamed Nomeir et.al. 2305.05649v1 null
2023-05-08 Learning to Evaluate the Artness of AI-generated Images Junyu Chen et.al. 2305.04923v1 null
2023-05-08 DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models Sicheng Yang et.al. 2305.04919v1 link
2023-05-08 What Do Patients Say About Their Disease Symptoms? Deep Multilabel Text Classification With Human-in-the-Loop Curation for Automatic Labeling of Patient Self Reports of Problems Lakshmi Arbatti et.al. 2305.04905v1 null
2023-05-08 Robust Positivity Problems for low-order Linear Recurrence Sequences Mihir Vahanwala et.al. 2305.04870v1 null
2023-05-05 On the Benefits of Semi-Supervised Test Case Generation for Cyber-Physical Systems Xiao Ling et.al. 2305.03714v1 null
2023-05-05 Avatar Fingerprinting for Authorized Use of Synthetic Talking-Head Videos Ekta Prashnani et.al. 2305.03713v1 null
2023-05-08 On the characterization of the convective heat flux in turbulent Rayleigh-Bénard convection Bérengère Podvin et.al. 2305.03708v2 null
2023-05-05 LMEye: An Interactive Perception Network for Large Language Models Yunxin Li et.al. 2305.03701v1 link
2023-05-05 Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements Jiacheng Liu et.al. 2305.03695v1 link
2023-05-05 Mining bias-target Alignment from Voronoi Cells Rémi Nahon et.al. 2305.03691v1 link
2023-05-05 COLA: How to adapt vision-language models to Compose Objects Localized with Attributes? Arijit Ray et.al. 2305.03689v1 link
2023-05-04 ZipIt! Merging Models from Different Tasks without Training George Stoica et.al. 2305.03053v1 link
2023-05-04 Controllable Visual-Tactile Synthesis Ruihan Gao et.al. 2305.03051v1 link
2023-05-04 NeuralEditor: Editing Neural Radiance Fields via Manipulating Point Clouds Jun-Kun Chen et.al. 2305.03049v1 null
2023-05-04 Personalize Segment Anything Model with One Shot Renrui Zhang et.al. 2305.03048v1 link
2023-05-04 Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision Zhiqing Sun et.al. 2305.03047v1 link
2023-05-04 OctFormer: Octree-based Transformers for 3D Point Clouds Peng-Shuai Wang et.al. 2305.03045v1 link
2023-05-04 Single-Shot Implicit Morphable Faces with Consistent Texture Parameterization Connor Z. Lin et.al. 2305.03043v1 null
2023-05-04 Are VAEs Bad at Reconstructing Molecular Graphs? Hagen Muenkler et.al. 2305.03041v1 null
2023-05-04 TUVF: Learning Generalizable Texture UV Radiance Fields An-Chieh Cheng et.al. 2305.03040v1 null
2023-05-03 Characterizing Political Bias in Automatic Summaries: A Case Study of Trump and Biden Karen Zhou et.al. 2305.02321v1 link
2023-05-03 Generating Synthetic Documents for Cross-Encoder Re-Rankers: A Comparative Study of ChatGPT and Human Experts Arian Askari et.al. 2305.02320v1 link
2023-05-03 Visual Chain of Thought: Bridging Logical Gaps with Multimodal Infillings Daniel Rose et.al. 2305.02317v1 null
2023-05-03 AG3D: Learning to Generate 3D Avatars from 2D Image Collections Zijian Dong et.al. 2305.02312v1 null
2023-05-03 Real-Time Radiance Fields for Single-Image Portrait View Synthesis Alex Trevithick et.al. 2305.02310v1 null
2023-05-03 Calibrated Explanations: with Uncertainty Information and Counterfactuals Helena Lofstrom et.al. 2305.02305v1 link
2023-05-02 Humans as Light Bulbs: 3D Human Reconstruction from Thermal Reflection Ruoshi Liu et.al. 2305.01652v1 null
2023-05-02 Generalizing Dataset Distillation via Deep Generative Prior George Cazenavette et.al. 2305.01649v1 link
2023-05-02 Sequence Modeling with Multiresolution Convolutional Memory Jiaxin Shi et.al. 2305.01638v1 link
2023-05-02 The Benefits of Bad Advice: Autocontrastive Decoding across Model Layers Ariel Gera et.al. 2305.01628v1 link
2023-05-02 Basic syntax from speech: Spontaneous concatenation in unsupervised deep neural networks Gašper Beguš et.al. 2305.01626v1 null
2023-05-02 TMR: Text-to-Motion Retrieval Using Contrastive 3D Human Motion Synthesis Mathis Petrovich et.al. 2305.00976v1 null
2023-05-01 ArK: Augmented Reality with Knowledge Interactive Emergent Ability Qiuyuan Huang et.al. 2305.00970v1 null
2023-05-01 PMDG: Privacy for Multi-Perspective Process Mining through Data Generalization Ryan Hildebrant et.al. 2305.00960v1 null
2023-05-01 Non-Binary LDPC Code Design for Energy-Time Entanglement Quantum Key Distribution Debarnab Mitra et.al. 2305.00956v1 null
2023-05-01 Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation Patrick Fernandes et.al. 2305.00955v1 null
2023-04-28 LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model Peng Gao et.al. 2304.15010v1 link
2023-04-28 Empirical Analysis of the Strengths and Weaknesses of PEFT Techniques for LLMs George Pu et.al. 2304.14999v1 null
2023-04-28 ChatGPT – a Blessing or a Curse for Undergraduate Computer Science Students and Instructors? Ishika Joshi et.al. 2304.14993v1 null
2023-04-28 Robust Stackelberg Equilibria Jiarui Gan et.al. 2304.14990v1 null
2023-04-28 Interpreting Vision and Language Generative Models with Semantic Visual Priors Michele Cafagna et.al. 2304.14986v1 null
2023-04-28 Optimal majority rules and quantitative Condorcet properties of setwise Kemeny voting schemes Xuan Kien Phung et.al. 2304.14980v1 null
2023-04-28 MLCopilot: Unleashing the Power of Large Language Models in Solving Machine Learning Tasks Lei Zhang et.al. 2304.14979v1 link
2023-04-27 ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System Junke Wang et.al. 2304.14407v1 null
2023-04-27 Motion-Conditioned Diffusion Model for Controllable Video Synthesis Tsai-Shien Chen et.al. 2304.14404v1 null
2023-04-27 LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions Minghao Wu et.al. 2304.14402v1 link
2023-04-27 ActorsNeRF: Animatable Few-shot Human Rendering with Generalizable NeRFs Jiteng Mu et.al. 2304.14401v1 null
2023-04-27 IconShop: Text-Based Vector Icon Synthesis with Autoregressive Transformers Ronghuan Wu et.al. 2304.14400v1 null
2023-04-27 We’re Afraid Language Models Aren’t Modeling Ambiguity Alisa Liu et.al. 2304.14399v1 link
2023-04-27 Maximizing Model Generalization for Manufacturing with Self-Supervised Learning and Federated Learning Matthew Russell et.al. 2304.14398v1 null
2023-04-27 Learning Articulated Shape with Keypoint Pseudo-labels from Web Images Anastasis Stathopoulos et.al. 2304.14396v1 null
2023-04-27 SeqTrack: Sequence to Sequence Learning for Visual Object Tracking Xin Chen et.al. 2304.14394v1 link
2023-04-26 Controllable Image Generation via Collage Representations Arantxa Casanova et.al. 2304.13722v1 null
2023-04-26 Evaluation of GPT-3.5 and GPT-4 for supporting real-world information needs in healthcare delivery Debadutta Dash et.al. 2304.13714v1 null
2023-04-27 Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond Jingfeng Yang et.al. 2304.13712v2 link
2023-04-26 UniNeXt: Exploring A Unified Architecture for Vision Recognition Fangjian Lin et.al. 2304.13700v1 link
2023-04-26 Hitting Subgraphs in Sparse Graphs and Geometric Intersection Graphs Daniel Lokshtanov et.al. 2304.13695v1 null
2023-04-26 HeySQuAD: A Spoken Question Answering Dataset Yijing Wu et.al. 2304.13689v1 link
2023-04-25 DQS3D: Densely-matched Quantization-aware Semi-supervised 3D Detection Huan-ang Gao et.al. 2304.13031v1 link
2023-04-25 On the mechanism of polaritonic rate suppression from quantum transition paths Michelle C. Anderson et.al. 2304.13024v1 null
2023-04-25 Seeing is not always believing: A Quantitative Study on Human Perception of AI-Generated Images Zeyu Lu et.al. 2304.13023v1 link
2023-04-25 Certifying Ensembles: A General Certification Theory with S-Lipschitzness Aleksandar Petrov et.al. 2304.13019v1 null
2023-04-25 Bibliometric Data Fusion for Biomedical Information Retrieval Timo Breuer et.al. 2304.13012v1 null
2023-04-25 The Potential of Visual ChatGPT For Remote Sensing Lucas Prado Osco et.al. 2304.13009v1 null
2023-04-25 Answering Questions by Meta-Reasoning over Multiple Chains of Thought Ori Yoran et.al. 2304.13007v1 link
2023-04-24 Explicit Correspondence Matching for Generalizable Neural Radiance Fields Yuedong Chen et.al. 2304.12294v1 link
2023-04-24 Synthpop++: A Hybrid Framework for Generating A Country-scale Synthetic Population Bhavesh Neekhra et.al. 2304.12284v1 link
2023-04-21 Deep-Learning-based Fast and Accurate 3D CT Deformable Image Registration in Lung Cancer Yuzhen Ding et.al. 2304.11135v1 null
2023-04-20 Learning Sparse and Low-Rank Priors for Image Recovery via Iterative Reweighted Least Squares Minimization Stamatios Lefkimmiatis et.al. 2304.10536v1 null
2023-04-20 Farm3D: Learning Articulated 3D Animals by Distilling 2D Diffusion Tomas Jakab et.al. 2304.10535v1 null
2023-04-20 Collaborative Diffusion for Multi-Modal Face Generation and Editing Ziqi Huang et.al. 2304.10530v1 link
2023-04-20 Generalizing Neural Human Fitting to Unseen Poses With Articulated SE(3) Equivariance Haiwen Feng et.al. 2304.10528v1 null
2023-04-20 Multidimensional Uncertainty Quantification for Deep Neural Networks Xujiang Zhao et.al. 2304.10527v1 null
2023-04-20 GenCorres: Consistent Shape Matching via Coupled Implicit-Explicit Shape Generative Models Haitao Yang et.al. 2304.10523v1 link
2023-04-20 Contrastive Tuning: A Little Help to Make Masked Autoencoders Forget Johannes Lehner et.al. 2304.10520v1 link
2023-04-19 LipsFormer: Introducing Lipschitz Continuity to Vision Transformers Xianbiao Qi et.al. 2304.09856v1 link
2023-04-19 Bridging RL Theory and Practice with the Effective Horizon Cassidy Laidlaw et.al. 2304.09853v1 link
2023-04-19 Evaluating Verifiability in Generative Search Engines Nelson F. Liu et.al. 2304.09848v1 link
2023-04-19 Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models Pan Lu et.al. 2304.09842v1 link
2023-04-19 Points of non-linearity of functions generated by random neural networks David Holmes et.al. 2304.09837v1 null
2023-04-18 Optimal PAC Bounds Without Uniform Convergence Ishaq Aden-Ali et.al. 2304.09167v1 null
2023-04-18 Exploring the Trade-Offs: Unified Large Language Models vs Local Fine-Tuned Models for Highly-Specific Radiology NLI Task Zihao Wu et.al. 2304.09138v1 null
2023-04-17 Conditional Generation of Audio from Video via Foley Analogies Yuexi Du et.al. 2304.08490v1 link
2023-04-17 Hyper-Decision Transformer for Efficient Online Policy Adaptation Mengdi Xu et.al. 2304.08487v1 null
2023-04-17 Visual Instruction Tuning Haotian Liu et.al. 2304.08485v1 link
2023-04-17 Text2Performer: Text-Driven Human Video Generation Yuming Jiang et.al. 2304.08483v1 link
2023-04-17 Towards Robust Prompts on Vision-Language Models Jindong Gu et.al. 2304.08479v1 null
2023-04-18 Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation Jie An et.al. 2304.08477v2 null
2023-04-14 Cross-Entropy Loss Functions: Theoretical Analysis and Applications Anqi Mao et.al. 2304.07288v1 null
2023-04-14 Solving Unique Games over Globally Hypercontractive Graphs Mitali Bafna et.al. 2304.07284v1 null
2023-04-14 Synthetically Generating Human-like Data for Sequential Decision Making Tasks via Reward-Shaped Imitation Learning Bryan Brandt et.al. 2304.07280v1 null
2023-04-17 Identifying Cluttering Edges in Near-Planar Graphs Simon van Wageningen et.al. 2304.07274v2 link
2023-04-13 Expressive Text-to-Image Generation with Rich Text Songwei Ge et.al. 2304.06720v1 null
2023-04-13 Single-Stage Diffusion NeRF: A Unified Approach to 3D Generation and Reconstruction Hansheng Chen et.al. 2304.06714v1 link
2023-04-13 What does CLIP know about a red circle? Visual prompt engineering for VLMs Aleksandar Shtedritski et.al. 2304.06712v1 null
2023-04-13 DiffusionRig: Learning Personalized Priors for Facial Appearance Editing Zheng Ding et.al. 2304.06711v1 link
2023-04-13 How Will It Drape Like? Capturing Fabric Mechanics from Depth Images Carlos Rodriguez-Pardo et.al. 2304.06704v1 null
2023-04-13 Learning Controllable 3D Diffusion Models from Single-view Images Jiatao Gu et.al. 2304.06700v1 null
2023-04-13 Improving novelty detection with generative adversarial networks on hand gesture data Miguel Simão et.al. 2304.06696v1 null
2023-04-12 Continual Diffusion: Continual Customization of Text-to-Image Diffusion with C-LoRA James Seale Smith et.al. 2304.06027v1 null
2023-04-12 DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion Johanna Karras et.al. 2304.06025v1 null
2023-04-12 Probabilistic Human Mesh Recovery in 3D Scenes from Egocentric Views Siwei Zhang et.al. 2304.06024v1 link
2023-04-12 SAM Struggles in Concealed Scenes – Empirical Study on “Segment Anything” Ge-Peng Ji et.al. 2304.06022v1 null
2023-04-12 Crowd Counting with Sparse Annotation Shiwei Zhang et.al. 2304.06021v1 null
2023-04-12 VidStyleODE: Disentangled Video Editing via StyleGAN and NeuralODEs Moayed Haji Ali et.al. 2304.06020v1 null
2023-04-12 Generating Aligned Pseudo-Supervision from Non-Aligned Data for Image Restoration in Under-Display Camera Ruicheng Feng et.al. 2304.06019v1 link
2023-04-12 Bi-level Latent Variable Model for Sample-Efficient Multi-Agent Reinforcement Learning Aravind Venugopal et.al. 2304.06011v1 null
2023-04-11 HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models Eslam Mohamed Bakr et.al. 2304.05390v1 link
2023-04-11 Human-AI Co-Creation Approach to Find Forever Chemicals Replacements Juliana Jansen Ferreira et.al. 2304.05389v1 null
2023-04-11 MOST: Multiple Object localization with Self-supervised Transformers for object discovery Sai Saketh Rambhatla et.al. 2304.05387v1 null
2023-04-11 Bloom filters for molecules Jorge Medina et.al. 2304.05386v1 link
2023-04-10 A Cheaper and Better Diffusion Language Model with Soft-Masked Noise Jiaao Chen et.al. 2304.04746v1 link
2023-04-10 Ambiguous Medical Image Segmentation using Diffusion Models Aimon Rahman et.al. 2304.04745v1 link
2023-04-10 On the Possibilities of AI-Generated Text Detection Souradip Chakraborty et.al. 2304.04736v1 null
2023-04-07 Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following Mingyu Ding et.al. 2304.03767v1 null
2023-04-07 Language Models are Causal Knowledge Extractors for Zero-shot Video Question Answering Hung-Ting Su et.al. 2304.03754v1 null
2023-04-07 V3Det: Vast Vocabulary Visual Detection Dataset Jiaqi Wang et.al. 2304.03752v1 null
2023-04-07 Perspectives on AI Architectures and Co-design for Earth System Predictability Maruti K. Mudunuru et.al. 2304.03748v1 null
2023-04-07 Assessing Perceived Fairness from Machine Learning Developer’s Perspective Anoop Mishra et.al. 2304.03745v1 null
2023-04-06 Diffusion Models as Masked Autoencoders Chen Wei et.al. 2304.03283v1 null
2023-04-06 Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark Alexander Pan et.al. 2304.03279v1 link
2023-04-06 How Do US Congress Members Advertise Climate Change: An Analysis Of Ads Run On Meta’s Platforms Laurenz Aisenpreis et.al. 2304.03278v1 null
2023-04-06 Instruction Tuning with GPT-4 Baolin Peng et.al. 2304.03277v1 link
2023-04-06 That’s What I Said: Fully-Controllable Talking Face Generation Youngjoon Jang et.al. 2304.03275v1 null
2023-04-06 Towards self-driving laboratories in chemistry and materials sciences: The central role of DFT in the era of AI Bing Huang et.al. 2304.03272v1 null
2023-04-06 Causal Discovery with Score Matching on Additive Models with Arbitrary Noise Francesco Montagna et.al. 2304.03265v1 null
2023-04-05 Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models Xuhui Jia et.al. 2304.02642v1 null
2023-04-05 ENTL: Embodied Navigation Trajectory Learner Klemen Kotar et.al. 2304.02639v1 null
2023-04-05 GenPhys: From Physical Processes to Generative Models Ziming Liu et.al. 2304.02637v1 null
2023-04-05 HNeRV: A Hybrid Neural Representation for Videos Hao Chen et.al. 2304.02633v1 link
2023-04-05 Towards Explainable AI Writing Assistants for Non-native English Speakers Yewon Kim et.al. 2304.02625v1 null
2023-04-05 High-fidelity Pseudo-labels for Boosting Weakly-Supervised Segmentation Arvi Jonnarth et.al. 2304.02621v1 link
2023-04-04 Large Language Models are Edge-Case Fuzzers: Testing Deep Learning Libraries via FuzzGPT Yinlin Deng et.al. 2304.02014v1 null
2023-04-04 NPC: Neural Point Characters from Video Shih-Yang Su et.al. 2304.02013v1 null
2023-04-04 EGC: Image Generation and Classification via a Single Energy-Based Model Qiushan Guo et.al. 2304.02012v1 link
2023-04-04 FakET: Simulating Cryo-Electron Tomograms with Neural Style Transfer Pavol Harar et.al. 2304.02011v1 link
2023-04-04 OrienterNet: Visual Localization in 2D Public Maps with Neural Matching Paul-Edouard Sarlin et.al. 2304.02009v1 null
2023-04-04 MonoHuman: Animatable Human Neural Field from Monocular Video Zhengming Yu et.al. 2304.02001v1 null
2023-04-04 Revisiting the Evaluation of Image Synthesis with GANs Mengping Yang et.al. 2304.01999v1 link
2023-04-03 Video Instance Segmentation in an Open-World Omkar Thawakar et.al. 2304.01200v1 link
2023-04-03 Zero-Shot Semantic Segmentation with Decoupled One-Pass Network Cong Han et.al. 2304.01198v1 link
2023-04-03 Bringing Telepresence to Every Desk Shengze Wang et.al. 2304.01197v1 null
2023-04-04 Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data Canwen Xu et.al. 2304.01196v2 link
2023-04-03 Burstormer: Burst Image Restoration and Enhancement Transformer Akshay Dudhane et.al. 2304.01194v1 link
2023-04-03 Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos Yue Ma et.al. 2304.01186v1 link
2023-04-03 Whistler Wave Observations by \textit{Parker Solar Probe} During Encounter $1$ : Counter-Propagating Whistlers Collocated with Magnetic Field Inhomogeneities and their Application to Electric Field Measurement Calibration S. Karbashewski et.al. 2304.01185v1 null
2023-03-31 Towards Flexible Multi-modal Document Models Naoto Inoue et.al. 2303.18248v1 link
2023-03-31 Speeding up Madgraph5 aMC@NLO through CPU vectorization and GPU offloading: towards a first alpha release Andrea Valassi et.al. 2303.18244v1 null
2023-03-31 $\infty$ -Diff: Infinite Resolution Diffusion with Subsampled Mollified States Sam Bond-Taylor et.al. 2303.18242v1 link
2023-03-31 Procedure-Aware Pretraining for Instructional Video Understanding Honglu Zhou et.al. 2303.18230v1 link
2023-03-31 A Survey of Large Language Models Wayne Xin Zhao et.al. 2303.18223v1 link
2023-03-31 SemHint-MD: Learning from Noisy Semantic Labels for Self-Supervised Monocular Depth Estimation Shan Lin et.al. 2303.18219v1 null
2023-03-31 A Closer Look at Few-Shot 3D Point Cloud Classification Chuangguan Ye et.al. 2303.18210v1 link
2023-03-30 AvatarCraft: Transforming Text into Neural Human Avatars with Parameterized Shape and Pose Control Ruixiang Jiang et.al. 2303.17606v1 link
2023-03-30 Token Merging for Fast Stable Diffusion Daniel Bolya et.al. 2303.17604v1 link
2023-03-30 NeRF-Supervised Deep Stereo Fabio Tosi et.al. 2303.17603v1 link
2023-03-30 Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks Weihua Chen et.al. 2303.17602v1 link
2023-03-30 When Learning Is Out of Reach, Reset: Generalization in Autonomous Visuomotor Reinforcement Learning Zichen Zhang et.al. 2303.17600v1 null
2023-03-30 Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models Wen Wang et.al. 2303.17599v1 link
2023-03-30 Consistent View Synthesis with Pose-Guided Diffusion Models Hung-Yu Tseng et.al. 2303.17598v1 null
2023-03-30 MobileInst: Video Instance Segmentation on the Mobile Renhong Zhang et.al. 2303.17594v1 null
2023-03-29 AutoAD: Movie Description in Context Tengda Han et.al. 2303.16899v1 link
2023-03-29 Bagging by Learning to Singulate Layers Using Interactive Perception Lawrence Yunliang Chen et.al. 2303.16898v1 null
2023-03-29 Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos Kun Su et.al. 2303.16897v1 null
2023-03-29 Multi-scale Hierarchical Vision Transformer with Cascaded Attention Decoding for Medical Image Segmentation Md Mostafijur Rahman et.al. 2303.16892v1 link
2023-03-29 Mask-free OVIS: Open-Vocabulary Instance Segmentation without Manual Mask Annotations Vibashan VS et.al. 2303.16891v1 null
2023-03-29 DPF: Learning Dense Prediction Fields with Weak Supervision Xiaoxue Chen et.al. 2303.16890v1 link
2023-03-29 Towards Understanding the Effect of Pretraining Label Granularity Guan Zhe Hong et.al. 2303.16887v1 null
2023-03-29 End-to-End $n$ -ary Relation Extraction for Combination Drug Therapies Yuhang Jiang et.al. 2303.16886v1 link
2023-03-29 Instant Neural Radiance Fields Stylization Shaoxu Li et.al. 2303.16884v1 link
2023-03-29 Your Diffusion Model is Secretly a Zero-Shot Classifier Alexander C. Li et.al. 2303.16203v2 link
2023-03-28 LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention Renrui Zhang et.al. 2303.16199v1 link
2023-03-28 BC-IRL: Learning Generalizable Reward Functions from Demonstrations Andrew Szot et.al. 2303.16194v1 null
2023-03-28 Planning with Sequence Models through Iterative Energy Minimization Hongyi Chen et.al. 2303.16189v1 null
2023-03-28 Visual Chain-of-Thought Diffusion Models William Harvey et.al. 2303.16187v1 link
2023-03-28 Label Smoothing Improves Neural Source Code Summarization Sakib Haque et.al. 2303.16178v1 null
2023-03-27 IRFL: Image Recognition of Figurative Language Ron Yosef et.al. 2303.15445v1 link
2023-03-27 Zero-shot Model Diagnosis Jinqi Luo et.al. 2303.15441v1 null
2023-03-27 FaceLit: Neural 3D Relightable Faces Anurag Ranjan et.al. 2303.15437v1 null
2023-03-27 The Stable Signature: Rooting Watermarks in Latent Diffusion Models Pierre Fernandez et.al. 2303.15435v1 link
2023-03-27 Anti-DreamBooth: Protecting users from personalized text-to-image synthesis Thanh Van Le et.al. 2303.15433v1 link
2023-03-27 TextMI: Textualize Multimodal Information for Integrating Non-verbal Cues in Pre-trained Language Models Md Kamrul Hasan et.al. 2303.15430v1 null
2023-03-27 JAWS: Just A Wild Shot for Cinematic Transfer in Neural Radiance Fields Xi Wang et.al. 2303.15427v1 link
2023-03-24 Masked Scene Contrast: A Scalable Framework for Unsupervised 3D Representation Learning Xiaoyang Wu et.al. 2303.14191v1 link
2023-03-24 Learning from Few Demonstrations with Frame-Weighted Motion Generation Jianyong Sun et.al. 2303.14188v1 null
2023-03-24 Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior Junshu Tang et.al. 2303.14184v1 link
2023-03-24 Scaling Expert Language Models with Unsupervised Domain Discovery Suchin Gururangan et.al. 2303.14177v1 link
2023-03-24 A Hybrid ANN-SNN Architecture for Low-Power and Low-Latency Visual Perception Asude Aydin et.al. 2303.14176v1 null
2023-03-24 UrbanGIRAFFE: Representing Urban Scenes as Compositional Generative Neural Feature Fields Yuanbo Yang et.al. 2303.14167v1 null
2023-03-23 Ablating Concepts in Text-to-Image Diffusion Models Nupur Kumari et.al. 2303.13516v1 link
2023-03-23 Persistent Nature: A Generative Model of Unbounded 3D Worlds Lucy Chai et.al. 2303.13515v1 link
2023-03-23 DreamBooth3D: Subject-Driven Text-to-3D Generation Amit Raj et.al. 2303.13508v1 null
2023-03-23 A Large-scale Study of Spatiotemporal Representation Learning with a New Benchmark on Action Recognition Andong Deng et.al. 2303.13505v1 link
2023-03-23 Chordal Averaging on Flag Manifolds and Its Applications Nathan Mankovich et.al. 2303.13501v1 link
2023-03-23 A Closer Look at Model Adaptation using Feature Distortion and Simplicity Bias Puja Trivedi et.al. 2303.13500v1 null
2023-03-23 TriPlaneNet: An Encoder for EG3D Inversion Ananta R. Bhattarai et.al. 2303.13497v1 null
2023-03-22 Diffuse-Denoise-Count: Accurate Crowd-Counting with Diffusion Models Yasiru Ranasinghe et.al. 2303.12790v1 link
2023-03-22 EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation Hansheng Chen et.al. 2303.12787v1 link
2023-03-22 Localization-based OFDM framework for RIS-aided systems Fabio Saggese et.al. 2303.12763v1 link
2023-03-22 MaskCon: Masked Contrastive Learning for Coarse-Labelled Dataset Chen Feng et.al. 2303.12756v1 link
2023-03-22 Invariants for time-dependent Hamiltonian systems Jürgen Struckmeier et.al. 2303.12746v1 null
2023-03-22 Comment on the elastica section in Thorne and Blandford “Modern Classical Physics”, the shape of things, and the aspect ratio of reality J. A. Hanna et.al. 2303.12729v1 null
2023-03-21 Natural Language-Assisted Sign Language Recognition Ronglai Zuo et.al. 2303.12080v1 link
2023-03-21 Two-shot Video Object Segmentation Kun Yan et.al. 2303.12078v1 link
2023-03-21 CC3D: Layout-Conditioned Generation of Compositional 3D Scenes Sherwin Bahmani et.al. 2303.12074v1 null
2023-03-21 ProphNet: Efficient Agent-Centric Motion Forecasting with Anchor-Informed Proposals Xishun Wang et.al. 2303.12071v1 null
2023-03-21 Machine Learning for Brain Disorders: Transformers and Visual Transformers Robin Courant et.al. 2303.12068v1 null
2023-03-20 EVA-02: A Visual Representation for Neon Genesis Yuxin Fang et.al. 2303.11331v1 link
2023-03-20 Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation Ziyang Chen et.al. 2303.11329v1 link
2023-03-20 Zero-1-to-3: Zero-shot One Image to 3D Object Ruoshi Liu et.al. 2303.11328v1 link
2023-03-20 Open-vocabulary Panoptic Segmentation with Embedding Modulation Xi Chen et.al. 2303.11324v1 null
2023-03-20 ScribbleSeg: Scribble-based Interactive Image Segmentation Xi Chen et.al. 2303.11320v1 null
2023-03-20 Generative Semantic Segmentation Jiaqi Chen et.al. 2303.11316v1 link
2023-03-20 waywiser: Ergonomic Methods for Assessing Spatial Models Michael J Mahoney et.al. 2303.11312v1 link
2023-03-17 Data-centric Artificial Intelligence: A Survey Daochen Zha et.al. 2303.10158v1 link
2023-03-17 CoVIO: Online Continual Learning for Visual-Inertial Odometry Niclas Vödisch et.al. 2303.10149v1 link
2023-03-17 CoDEPS: Online Continual Learning for Depth Estimation and Panoptic Segmentation Niclas Vödisch et.al. 2303.10147v1 link
2023-03-17 Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting Nicolai Dorka et.al. 2303.10144v1 link
2023-03-16 Efficient Diffusion Training via Min-SNR Weighting Strategy Tiankai Hang et.al. 2303.09556v1 link
2023-03-16 PartNeRF: Generating Part-Aware Editable 3D Shapes without 3D Supervision Konstantinos Tertikas et.al. 2303.09554v1 null
2023-03-16 SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving Yi Wei et.al. 2303.09551v1 link
2023-03-16 Diffusion-HPC: Generating Synthetic Images with Realistic Humans Zhenzhen Weng et.al. 2303.09541v1 link
2023-03-16 Deep Metric Learning for Unsupervised Remote Sensing Change Detection Wele Gedara Chaminda Bandara et.al. 2303.09536v1 link
2023-03-17 FateZero: Fusing Attentions for Zero-shot Text-based Video Editing Chenyang Qi et.al. 2303.09535v2 link
2023-03-16 Tackling Clutter in Radar Data – Label Generation and Detection Using PointNet++ Johannes Kopp et.al. 2303.09530v1 link
2023-03-15 Borda Regret Minimization for Generalized Linear Dueling Bandits Yue Wu et.al. 2303.08816v1 null
2023-03-15 BiFormer: Vision Transformer with Bi-Level Routing Attention Lei Zhu et.al. 2303.08810v1 link
2023-03-15 Stochastic Interpolants: A Unifying Framework for Flows and Diffusions Michael S. Albergo et.al. 2303.08797v1 null
2023-03-15 PLEX: Making the Most of the Available Data for Robotic Manipulation Pretraining Garrett Thomas et.al. 2303.08789v1 null
2023-03-14 Diversity-Aware Meta Visual Prompting Qidong Huang et.al. 2303.08138v1 link
2023-03-14 LayoutDM: Discrete Diffusion Model for Controllable Layout Generation Naoto Inoue et.al. 2303.08137v1 link
2023-03-15 Manipulate by Seeing: Creating Manipulation Controllers from Pre-Trained Representations Jianren Wang et.al. 2303.08135v2 null
2023-03-14 MeshDiffusion: Score-based Generative 3D Mesh Modeling Zhen Liu et.al. 2303.08133v1 link
2023-03-15 A Simple Framework for Open-Vocabulary Segmentation and Detection Hao Zhang et.al. 2303.08131v2 link
2023-03-14 ViperGPT: Visual Inference via Python Execution for Reasoning Dídac Surís et.al. 2303.08128v1 link
2023-03-14 Blind Video Deflickering by Neural Filtering with a Flawed Atlas Chenyang Lei et.al. 2303.08120v1 link
2023-03-14 Parameterised Approximation of the Fixation Probability of the Dominant Mutation in the Multi-Type Moran Process Leslie Ann Goldberg et.al. 2303.08118v1 null
2023-03-13 Revisiting Class-Incremental Learning with Pre-Trained Models: Generalizability and Adaptivity are All You Need Da-Wei Zhou et.al. 2303.07338v1 link
2023-03-13 Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR Feng Li et.al. 2303.07335v1 link
2023-03-13 A Smoothing Algorithm for Minimum Sensing Path Plans in Gaussian Belief Space Ali Reza Pedram et.al. 2303.07326v1 null
2023-03-13 Collision Cross-entropy and EM Algorithm for Self-labeled Classification Zhongwen Zhang et.al. 2303.07321v1 null
2023-03-13 Linear regularized 13-moment equations with Onsager boundary conditions for general gas molecules Zhenning Cai et.al. 2303.07314v1 null
2023-03-13 An efficient phase-field model of shear fractures using deviatoric stress split Ehsan Haghighat et.al. 2303.07309v1 link
2023-03-10 Multiple Hands Make Light Work: Enhancing Quality and Diversity using MAP-Elites with Multiple Parallel Evolution Strategies Manon Flageat et.al. 2303.06137v1 null
2023-03-10 Rewarding Chatbots for Real-World Engagement with Millions of Users Robert Irvine et.al. 2303.06135v1 null
2023-03-10 Imaging the crustal and upper mantle structure of the North Anatolian Fault: A Transmission Matrix Framework for Local Adaptive Focusing Rita Touma et.al. 2303.06123v1 null
2023-03-10 Ignorance is Bliss: Robust Control via Information Gating Manan Tomar et.al. 2303.06121v1 null
2023-03-11 Wave-function parametrization of a probability measure Leonardo Pedro et.al. 2303.06069v1 null
2023-03-09 Scaling up GANs for Text-to-Image Synthesis Minguk Kang et.al. 2303.05511v1 null
2023-03-09 Planning with Large Language Models for Code Generation Shun Zhang et.al. 2303.05510v1 null
2023-03-09 Cherry-Picking with Reinforcement Learning Yunchu Zhang et.al. 2303.05508v1 null
2023-03-09 TANGOS: Regularizing Tabular Neural Networks through Gradient Orthogonalization and Specialization Alan Jeffares et.al. 2303.05506v1 link
2023-03-09 Open-world Instance Segmentation: Top-down Learning with Bottom-up Supervision Tarun Kalluri et.al. 2303.05503v1 null
2023-03-09 PDSketch: Integrated Planning Domain Programming and Learning Jiayuan Mao et.al. 2303.05501v1 null
2023-03-10 Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection Shilong Liu et.al. 2303.05499v2 link
2023-03-09 Learning Stationary Markov Processes with Contrastive Adjustment Ludvig Bergenstråhle et.al. 2303.05497v1 link
2023-03-09 Sparse and Local Networks for Hypergraph Reasoning Guangxuan Xiao et.al. 2303.05496v1 null
2023-03-08 Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models Jiarui Xu et.al. 2303.04803v1 link
2023-03-08 Stabilized profunctors and stable species of structures Marcelo Fiore et.al. 2303.04795v1 null
2023-03-08 Multilevel Diffusion: Infinite Dimensional Score-Based Diffusion Models for Image Generation Paul Hagemann et.al. 2303.04772v1 link
2023-03-08 SMaLL: A Software Framework for portable Machine Learning Libraries Upasana Sridhar et.al. 2303.04769v1 null
2023-03-07 Benign Overfitting for Two-layer ReLU Networks Yiwen Kou et.al. 2303.04145v1 link
2023-03-07 Toward Defining a Domain Complexity Measure Across Domains Katarina Doctor et.al. 2303.04141v1 null
2023-03-07 Diffusion Policy: Visuomotor Policy Learning via Action Diffusion Cheng Chi et.al. 2303.04137v1 null
2023-03-07 Inadequacy of equivalent circuits in nonlinear systems with inherent memory V. Lopez-Richard et.al. 2303.04135v1 null
2023-03-07 Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction Martin Josifoski et.al. 2303.04132v1 link
2023-03-07 Foundation Models for Decision Making: Problems, Methods, and Opportunities Sherry Yang et.al. 2303.04129v1 null
2023-03-07 Private Read-Update-Write with Controllable Information Leakage for Storage-Efficient Federated Learning with Top $r$ Sparsification Sajani Vithana et.al. 2303.04123v1 null
2023-03-06 Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-Type Samplers Sitan Chen et.al. 2303.03384v1 null
2023-03-06 SUREL+: Moving from Walks to Sets for Scalable Subgraph-based Graph Representation Learning Haoteng Yin et.al. 2303.03379v1 link
2023-03-06 PaLM-E: An Embodied Multimodal Language Model Danny Driess et.al. 2303.03378v1 null
2023-03-06 MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning Mikayel Samvelyan et.al. 2303.03376v1 null
2023-03-06 Detecting Human-Object Contact in Images Yixin Chen et.al. 2303.03373v1 link
2023-03-06 ALMOST: Adversarial Learning to Mitigate Oracle-less ML Attacks via Synthesis Tuning Animesh Basak Chowdhury et.al. 2303.03372v1 null
2023-03-06 Complex Systems of Secrecy: The Offshore Networks of Oligarchs Ho-Chun Herbert Chang et.al. 2303.03371v1 null
2023-03-06 Multimodal Prompting with Missing Modalities for Visual Recognition Yi-Lun Lee et.al. 2303.03369v1 link
2023-03-06 Referring Multi-Object Tracking Dongming Wu et.al. 2303.03366v1 link
2023-03-06 Efficient Skill Acquisition for Complex Manipulation Tasks in Obstructed Environments Jun Yamada et.al. 2303.03365v1 null
2023-03-03 Unleashing Text-to-Image Diffusion Models for Visual Perception Wenliang Zhao et.al. 2303.02153v1 link
2023-03-03 Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners Renrui Zhang et.al. 2303.02151v1 link
2023-03-03 Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together! Shiwei Liu et.al. 2303.02141v1 link
2023-03-03 Eventual Discounting Temporal Logic Counterfactual Experience Replay Cameron Voloshin et.al. 2303.02135v1 null
2023-03-02 Dropout Reduces Underfitting Zhuang Liu et.al. 2303.01500v1 link
2023-03-02 Predicting Motion Plans for Articulating Everyday Objects Arjun Gupta et.al. 2303.01484v1 null
2023-03-02 Faster exact and approximation algorithms for packing and covering matroids via push-relabel Kent Quanrud et.al. 2303.01478v1 null
2023-03-01 StraIT: Non-autoregressive Generation with Stratified Image Transformer Shengju Qian et.al. 2303.00750v1 null
2023-03-01 Coordination of Multiple Robots along Given Paths with Bounded Junction Complexity Mikkel Abrahamsen et.al. 2303.00745v1 null
2023-03-01 READ Avatars: Realistic Emotion-controllable Audio Driven Avatars Jack Saunders et.al. 2303.00744v1 null
2023-03-01 R-U-SURE? Uncertainty-Aware Code Suggestions By Maximizing Utility Across Random User Intents Daniel D. Johnson et.al. 2303.00732v1 link
2023-03-01 A Systematic Analysis of Vocabulary and BPE Settings for Optimal Fine-tuning of NMT: A Case Study of In-domain Translation J. Pourmostafa Roshan Sharami et.al. 2303.00722v1 null
2023-02-28 An Efficient Tester-Learner for Halfspaces Aravind Gollakota et.al. 2302.14853v1 null
2023-02-27 Internet Explorer: Targeted Representation Learning on the Open Web Alexander C. Li et.al. 2302.14051v1 link
2023-02-27 Language Is Not All You Need: Aligning Perception with Language Models Shaohan Huang et.al. 2302.14045v1 link
2023-02-27 Permutation Equivariant Neural Functionals Allan Zhou et.al. 2302.14040v1 link
2023-02-27 Measurement of Orbital Angular Momentum of Light using Stokes Parameters and Barnett’s Formalism Anirban Debnath et.al. 2302.14025v1 null
2023-02-27 Diacritic Recognition Performance in Arabic ASR Hanan Aldarmaki et.al. 2302.14022v1 null
2023-02-27 Full Stack Optimization of Transformer Inference: a Survey Sehoon Kim et.al. 2302.14017v1 null
2023-02-24 SplineCam: Exact Visualization and Characterization of Deep Network Geometry and Decision Boundaries Ahmed Imtiaz Humayun et.al. 2302.12828v1 link
2023-02-24 Generative Models of Huge Objects Lunjia Hu et.al. 2302.12823v1 null
2023-02-24 Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data KaShun Shum et.al. 2302.12822v1 link
2023-02-24 GraphSR: A Data Augmentation Algorithm for Imbalanced Node Classification Mengting Zhou et.al. 2302.12814v1 null
2023-02-24 Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback Baolin Peng et.al. 2302.12813v1 null
2023-02-23 Change is Hard: A Closer Look at Subpopulation Shift Yuzhe Yang et.al. 2302.12254v1 link
2023-02-23 Boosting Adversarial Transferability using Dynamic Cues Muzammal Naseer et.al. 2302.12252v1 null
2023-02-23 VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion Yiming Li et.al. 2302.12251v1 link
2023-02-23 Sequence-Based Incremental Concolic Testing of RTL Models Hasini Witharana et.al. 2302.12241v1 null
2023-02-23 What makes a language easy to deep-learn? Lukas Galke et.al. 2302.12239v1 link
2023-02-23 Improving Adaptive Conformal Prediction Using Self-Supervised Learning Nabeel Seedat et.al. 2302.12238v1 link
2023-02-23 Learning Neural Volumetric Representations of Dynamic Humans in Minutes Chen Geng et.al. 2302.12237v1 link
2023-02-23 DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models Jamie Wynn et.al. 2302.12231v1 link
2023-02-22 Beyond optimal disturbances: a statistical framework for transient growth Peter Frame et.al. 2302.11564v1 null
2023-02-22 Uncovering Bias in Face Generation Models Cristian Muñoz et.al. 2302.11562v1 null
2023-02-22 Equivariant Polynomials for Graph Neural Networks Omri Puny et.al. 2302.11556v1 null
2023-02-22 RoboNinja: Learning an Adaptive Cutting Policy for Multi-Material Objects Zhenjia Xu et.al. 2302.11553v1 null
2023-02-22 Reduce, Reuse, Recycle: Compositional Generation with Energy-Based Diffusion Models and MCMC Yilun Du et.al. 2302.11552v1 link
2023-02-22 Scaling Robot Learning with Semantically Imagined Experience Tianhe Yu et.al. 2302.11550v1 null
2023-02-21 Some Fundamental Aspects about Lipschitz Continuity of Neural Network Functions Grigory Khromov et.al. 2302.10886v1 null
2023-02-21 Context-Aware Timewise VAEs for Real-Time Vehicle Trajectory Prediction Pei Xu et.al. 2302.10873v1 link
2023-02-21 Efficient CTC Regularization via Coarse Labels for End-to-End Speech Translation Biao Zhang et.al. 2302.10871v1 link
2023-02-21 Provable Copyright Protection for Generative Models Nikhil Vyas et.al. 2302.10870v1 null
2023-02-21 A Unifying Perspective on Multi-Calibration: Unleashing Game Dynamics for Multi-Objective Learning Nika Haghtalab et.al. 2302.10863v1 null
2023-02-20 Towards Universal Fake Image Detectors that Generalize Across Generative Models Utkarsh Ojha et.al. 2302.10174v1 link
2023-02-20 Identity-Based Attribute Prototypes Distinguish Communities on Twitter Thomas Magelinski et.al. 2302.10172v1 null
2023-02-20 Compressed Error HARQ: Feedback Communication on Noise-Asymmetric Channels Sravan Kumar Ankireddy et.al. 2302.10170v1 link
2023-02-20 Learning Deep Semantics for Test Completion Pengyu Nie et.al. 2302.10166v1 link
2023-02-20 Sparse PCA Beyond Covariance Thresholding Gleb Novikov et.al. 2302.10158v1 null
2023-02-17 Consistent Diffusion Models: Mitigating Sampling Drift by Learning to be Consistent Giannis Daras et.al. 2302.09057v1 link
2023-02-17 Geometric description of clustering in directed networks Antoine Allard et.al. 2302.09055v1 link
2023-02-17 MiDi: Mixed Graph and 3D Denoising Diffusion for Molecule Generation Clement Vignac et.al. 2302.09048v1 link
2023-02-17 From User Perceptions to Technical Improvement: Enabling People Who Stuter to Beter Use Speech Recognition Colin Lea et.al. 2302.09044v1 null
2023-02-17 Privately Customizing Prefinetuning to Better Match User Data in Federated Learning Charlie Hou et.al. 2302.09042v1 null
2023-02-16 Text-driven Visual Synthesis with Latent Diffusion Prior Ting-Hsuan Liao et.al. 2302.08510v1 null
2023-02-16 3D-aware Conditional Image Synthesis Kangle Deng et.al. 2302.08509v1 link
2023-02-16 The Scope of Multicalibration: Characterizing Multicalibration via Property Elicitation Georgy Noarov et.al. 2302.08507v1 null
2023-02-15 Target Specific De Novo Design of Drug Candidate Molecules with Graph Transformer-based Generative Adversarial Networks Atabey Ünlü et.al. 2302.07868v1 link
2023-02-15 Learning Performance-Improving Code Edits Aman Madaan et.al. 2302.07867v1 link
2023-02-15 Dataset Interfaces: Diagnosing Model Failures Using Controllable Counterfactual Generation Joshua Vendrow et.al. 2302.07865v1 link
2023-02-15 Big Little Transformer Decoder Sehoon Kim et.al. 2302.07863v1 link
2023-02-15 One-Shot Face Video Re-enactment using Hybrid Latent Spaces of StyleGAN2 Trevine Oorloff et.al. 2302.07848v1 null
2023-02-15 NL2CMD: An Updated Workflow for Natural Language to Bash Commands Translation Quchen Fu et.al. 2302.07845v1 link
2023-02-14 Where to Diffuse, How to Diffuse, and How to Get Back: Automated Learning for Multivariate Diffusions Raghav Singhal et.al. 2302.07261v1 null
2023-02-14 ChatCAD: Interactive Computer-Aided Diagnosis on Medical Image using Large Language Models Sheng Wang et.al. 2302.07257v1 link
2023-02-14 Energy Transformer Benjamin Hoover et.al. 2302.07253v1 link
2023-02-14 Generation Probabilities Are Not Enough: Exploring the Effectiveness of Uncertainty Highlighting in AI-Powered Code Completions Helena Vasconcelos et.al. 2302.07248v1 null
2023-02-14 A Deep Probabilistic Spatiotemporal Framework for Dynamic Graph Representation Learning with Application to Brain Disorder Identification Junn Yong Loo et.al. 2302.07243v1 null
2023-02-14 Parker Solar Probe Observations of High Plasma Beta Solar Wind from Streamer Belt Jia Huang et.al. 2302.07230v1 null
2023-02-13 3D-aware Blending with Generative NeRFs Hyunsu Kim et.al. 2302.06608v1 link
2023-02-13 Generative Adversarial Equilibrium Solvers Denizalp Goktas et.al. 2302.06607v1 null
2023-02-13 Breaking the Curse of Multiagency: Provably Efficient Decentralized Multi-Agent RL with Function Approximation Yuanhao Wang et.al. 2302.06606v1 null
2023-02-13 FilFL: Accelerating Federated Learning via Client Filtering Fares Fourati et.al. 2302.06599v1 null
2023-02-13 The Impact of AI on Developer Productivity: Evidence from GitHub Copilot Sida Peng et.al. 2302.06590v1 null
2023-02-13 Improving Out-of-Distribution Generalization of Neural Rerankers with Contextualized Late Interaction Xinyu Zhang et.al. 2302.06589v1 null
2023-02-13 Raising the Cost of Malicious AI-Powered Image Editing Hadi Salman et.al. 2302.06588v1 link
2023-02-13 AbLit: A Resource for Analyzing and Generating Abridged Versions of English Literature Melissa Roemmele et.al. 2302.06579v1 link
2023-02-10 Project and Probe: Sample-Efficient Domain Adaptation by Interpolating Orthogonal Features Annie S. Chen et.al. 2302.05441v1 null
2023-02-09 RelightableHands: Efficient Neural Relighting of Articulated Hand Models Shun Iwase et.al. 2302.04866v1 null
2023-02-09 Polynomial Neural Fields for Subband Decomposition and Manipulation Guandao Yang et.al. 2302.04862v1 link
2023-02-09 Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning Zhuolin Yang et.al. 2302.04858v1 null
2023-02-09 One-shot Visual Imitation via Attributed Waypoints and Demonstration Augmentation Matthew Chang et.al. 2302.04856v1 null
2023-02-09 SparseProp: Efficient Sparse Backpropagation for Faster Training of Neural Networks Mahdi Nikdan et.al. 2302.04852v1 link
2023-02-09 Robot Synesthesia: A Sound and Emotion Guided AI Painter Vihaan Misra et.al. 2302.04850v1 link
2023-02-09 Accurate and Interpretable Solution of the Inverse Rig for Realistic Blendshape Models with Quadratic Corrective Terms Stevo Racković et.al. 2302.04843v1 null
2023-02-09 Is This Loss Informative? Speeding Up Textual Inversion with Deterministic Objective Evaluation Anton Voronov et.al. 2302.04841v1 link
2023-02-08 PFGM++: Unlocking the Potential of Physics-Inspired Generative Models Yilun Xu et.al. 2302.04265v1 link
2023-02-08 Learning How to Infer Partial MDPs for In-Context Adaptation and Exploration Chentian Jiang et.al. 2302.04250v1 null
2023-02-08 Federated Minimax Optimization with Client Heterogeneity Pranay Sharma et.al. 2302.04249v1 null
2023-02-08 Shortcut Detection with Variational Autoencoders Nicolas M. Müller et.al. 2302.04246v1 link
2023-02-07 Long Horizon Temperature Scaling Andy Shih et.al. 2302.03686v1 link
2023-02-07 Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications Johannes Kirschner et.al. 2302.03683v1 null
2023-02-07 Auditing Gender Presentation Differences in Text-to-Image Models Yanzhe Zhang et.al. 2302.03675v1 link
2023-02-07 Proportionality in Approval-Based Participatory Budgeting Markus Brill et.al. 2302.03672v1 null
2023-02-07 Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery Yuxin Wen et.al. 2302.03668v1 link
2023-02-07 HumanMAC: Masked Motion Completion for Human Motion Prediction Ling-Hao Chen et.al. 2302.03665v1 link
2023-02-07 SDYN-GANs: Adversarial Learning Methods for Multistep Generative Models for General Order Stochastic Dynamics Panos Stinis et.al. 2302.03663v1 null
2023-02-06 Zero-shot Image-to-Image Translation Gaurav Parmar et.al. 2302.03027v1 link
2023-02-06 AIM: Adapting Image Models for Efficient Video Action Recognition Taojiannan Yang et.al. 2302.03024v1 null
2023-02-06 Geometry of contact: contact planning for multi-legged robots via spin models duality Baxi Chong et.al. 2302.03019v1 null
2023-02-06 Structure and Content-Guided Video Synthesis with Diffusion Models Patrick Esser et.al. 2302.03011v1 null
2023-02-06 A novel Doppler backscattering (DBS) system to simultaneously monitor radio frequency plasma fluctuations and low frequency turbulence S. Chowdhury et.al. 2302.03009v1 null
2023-02-03 Understanding the Issues, Their Causes and Solutions in Microservices Systems: An Empirical Study Muhammad Waseem et.al. 2302.01894v1 null
2023-02-03 Enhancing Once-For-All: A Study on Parallel Blocks, Skip Connections and Early Exits Simone Sarti et.al. 2302.01888v1 null
2023-02-03 Analyzing the impact of climate change on critical infrastructure from the scientific literature: A weakly supervised NLP approach Tanwi Mallick et.al. 2302.01887v1 null
2023-02-03 LIDAR-based Stabilization, Navigation and Localization for UAVs Operating in Dark Indoor Environments Matěj Petrl' ik et.al. 2302.01883v1 null
2023-02-03 IKEA-Manual: Seeing Shape Assembly Step by Step Ruocheng Wang et.al. 2302.01881v1 null
2023-02-02 STEPS: Joint Self-supervised Nighttime Image Enhancement and Depth Estimation Yupeng Zheng et.al. 2302.01334v1 link
2023-02-02 Bayesian Metric Learning for Uncertainty Quantification in Image Retrieval Frederik Warburg et.al. 2302.01332v1 link
2023-02-02 SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections Zhaoxi Chen et.al. 2302.01330v1 link
2023-02-02 Dreamix: Video Diffusion Models are General Video Editors Eyal Molad et.al. 2302.01329v1 null
2023-02-02 $IC^3$ : Image Captioning by Committee Consensus David M. Chan et.al. 2302.01328v1 link
2023-02-02 Randomized Greedy Learning for Non-monotone Stochastic Submodular Maximization Under Full-bandit Feedback Fares Fourati et.al. 2302.01324v1 null
2023-02-02 Signatures for strong-field QED physics in the quantum limit of beamstrahlung W. L. Zhang et.al. 2302.01321v1 null
2023-02-01 Improving Few-Shot Generalization by Exploring and Exploiting Auxiliary Data Alon Albalak et.al. 2302.00674v1 link
2023-02-01 ‘Generative CI’ through Collective Response Systems Aviv Ovadya et.al. 2302.00672v1 null
2023-02-01 Efficient Multi-Task Reinforcement Learning via Selective Behavior Sharing Grace Zhang et.al. 2302.00671v1 null
2023-02-01 Stable Target Field for Reduced Variance Score Estimation in Diffusion Models Yilun Xu et.al. 2302.00670v1 link
2023-02-01 Does Vision Accelerate Hierarchical Generalization of Neural Language Learners? Tatsuki Kuribayashi et.al. 2302.00667v1 null
2023-02-01 Extrinsic Calibration of 2D mm-Wavelength Radar Pairs Using Ego-Velocity Estimates Qilong Cheng et.al. 2302.00660v1 null
2023-02-01 Graph Neural Operators for Classification of Spatial Transcriptomics Data Junaid Ahmed et.al. 2302.00658v1 null
2023-01-31 Reverse engineering adversarial attacks with fingerprints from adversarial examples David Aaron Nicholson et.al. 2301.13869v1 null
2023-01-31 PADL: Language-Directed Physics-Based Character Control Jordan Juravsky et.al. 2301.13868v1 link
2023-01-31 Zero-Memory Graph Exploration with Unknown Inports Hans-Joachim Böckenhauer et.al. 2301.13860v1 null
2023-01-31 Interpreting Robustness Proofs of Deep Neural Networks Debangshu Banerjee et.al. 2301.13845v1 null
2023-01-31 Do Multi-Document Summarization Models Synthesize? Jay DeYoung et.al. 2301.13844v1 null
2023-01-31 RIS-Assisted Interference Mitigation for Uplink NOMA Azadeh Tabeshnezhad et.al. 2301.13841v1 null
2023-01-30 Looped Transformers as Programmable Computers Angeliki Giannou et.al. 2301.13196v1 null
2023-01-30 Adaptive Computation with Elastic Input Sequence Fuzhao Xue et.al. 2301.13195v1 link
2023-01-30 Audio-Visual Segmentation with Semantics Jinxing Zhou et.al. 2301.13190v1 link
2023-01-30 Extracting Training Data from Diffusion Models Nicholas Carlini et.al. 2301.13188v1 null
2023-01-30 Weighted flow diffusion for local graph clustering with node attributes: an algorithm and statistical guarantees Shenghao Yang et.al. 2301.13187v1 link
2023-01-30 Optimal Decision Tree Policies for Markov Decision Processes Daniël Vos et.al. 2301.13185v1 link
2023-01-27 Incorporating Background Knowledge in Symbolic Regression using a Computer Algebra System Charles Fox et.al. 2301.11919v1 null
2023-01-27 OccRob: Efficient SMT-Based Occlusion Robustness Verification of Deep Neural Networks Xingwu Guo et.al. 2301.11912v1 null
2023-01-27 Multi-dimensional concept discovery (MCD): A unifying framework with completeness guarantees Johanna Vielhaben et.al. 2301.11911v1 link
2023-01-27 Tree-structured Policy Planning with Learned Behavior Models Yuxiao Chen et.al. 2301.11902v1 null
2023-01-26 Conservative Safety Monitors of Stochastic Dynamical Systems Matthew Cleaveland et.al. 2301.11330v1 null
2023-01-26 MusicLM: Generating Music From Text Andrea Agostinelli et.al. 2301.11325v1 null
2023-01-26 Joint Training of Deep Ensembles Fails Due to Learner Collusion Alan Jeffares et.al. 2301.11323v1 null
2023-01-26 Cut and Learn for Unsupervised Object Detection and Instance Segmentation Xudong Wang et.al. 2301.11320v1 link
2023-01-26 Learning Good Features to Transfer Across Tasks and Domains Pierluigi Zama Ramirez et.al. 2301.11310v1 null
2023-01-26 SemSup-XC: Semantic Supervision for Zero and Few-shot Extreme Classification Pranjal Aggarwal et.al. 2301.11309v1 link
2023-01-26 Neural Continuous-Discrete State Space Models for Irregularly-Sampled Time Series Abdul Fatir Ansari et.al. 2301.11308v1 link
2023-01-26 DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature Eric Mitchell et.al. 2301.11305v1 link
2023-01-25 Fillers in Spoken Language Understanding: Computational and Psycholinguistic Perspectives Tanvi Dinkar et.al. 2301.10761v1 null
2023-01-25 Efficient Flow-Guided Multi-frame De-fencing Stavros Tsogkas et.al. 2301.10759v1 null
2023-01-25 Room-Temperature Sputtered Ultralow-loss Silicon Nitride for Hybrid Photonic Integration Shuangyou Zhang et.al. 2301.10758v1 null
2023-01-25 Generating large-scale network analyses of scientific landscapes in seconds using Dimensions on Google BigQuery Michele Pasin et.al. 2301.10736v1 null
2023-01-25 The Synchronic Web Thien-Nam Dinh et.al. 2301.10733v1 null
2023-01-24 A Watermark for Large Language Models John Kirchenbauer et.al. 2301.10226v1 link
2023-01-24 Evolution of cooperation under a generalized death-birth process Chaoqian Wang et.al. 2301.10205v1 null
2023-01-24 A general epidemic model and its application to mask design considering different preferences towards masks Chaoqian Wang et.al. 2301.10202v1 null
2023-01-23 InfiniCity: Infinite-Scale City Synthesis Chieh Hubert Lin et.al. 2301.09637v1 null
2023-01-23 Feature construction using explanations of individual predictions Boštjan Vouk et.al. 2301.09631v1 null
2023-01-23 Tracking the industrial growth of modern China with high-resolution panchromatic imagery: A sequential convolutional approach Ethan Brewer et.al. 2301.09620v1 null
2023-01-23 Asymptotic Convergence and Performance of Multi-Agent Q-Learning Dynamics Aamal Abbas Hussain et.al. 2301.09619v1 null
2023-01-20 The stochastic digital human is now enrolling for in silico imaging trials – Methods and tools for generating digital cohorts A Badano et.al. 2301.08719v1 null
2023-01-20 Massively Parallel Genetic Optimization through Asynchronous Propagation of Populations Oskar Taubert et.al. 2301.08713v1 link
2023-01-19 Multiview Compressive Coding for 3D Reconstruction Chao-Yuan Wu et.al. 2301.08247v1 link
2023-01-19 Booster: a Benchmark for Depth from Images of Specular and Transparent Surfaces Pierluigi Zama Ramirez et.al. 2301.08245v1 null
2023-01-19 Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture Mahmoud Assran et.al. 2301.08243v1 link
2023-01-19 Radiation-induced secondary emissions in solid-state devices as a possible contribution to quasiparticle poisoning of superconducting circuits Francisco Ponce et.al. 2301.08239v1 null
2023-01-18 Robust Zero-crossings Detection in Noisy Signals using Topological Signal Processing Sunia Tanweer et.al. 2301.07703v1 null
2023-01-18 Learning 3D-aware Image Synthesis with Unknown Pose Distribution Zifan Shi et.al. 2301.07702v1 null
2023-01-18 Prony-Based Super-Resolution Phase Retrieval of Sparse, Multivariate Signals Robert Beinert et.al. 2301.07696v1 null
2023-01-18 Private Federated Submodel Learning via Private Set Union Zhusheng Wang et.al. 2301.07686v1 null
2023-01-18 SFQEDtoolkit: a high-performance library for the accurate modeling of strong-field QED processes in PIC and Monte Carlo codes Samuele Montefiori et.al. 2301.07684v1 link
2023-01-18 OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation Tong Wu et.al. 2301.07525v1 null
2023-01-17 Three Dimensional Odd Viscosity in Ferrofluids with Vorticity-Magnetization Coupling Dylan Reynolds et.al. 2301.07096v1 null
2023-01-17 On the State of German (Abstractive) Text Summarization Dennis Aumiller et.al. 2301.07095v1 link
2023-01-17 Learning Customized Visual Models with Retrieval-Augmented Knowledge Haotian Liu et.al. 2301.07094v1 link
2023-01-17 GLIGEN: Open-Set Grounded Text-to-Image Generation Yuheng Li et.al. 2301.07093v1 link
2023-01-17 Vision Learners Meet Web Image-Text Pairs Bingchen Zhao et.al. 2301.07088v1 null
2023-01-17 MooseNet: A trainable metric for synthesized speech with plda backend Ondřej Plátek et.al. 2301.07087v1 link
2023-01-17 Transformers as Algorithms: Generalization and Implicit Model Selection in In-context Learning Yingcong Li et.al. 2301.07067v1 link
2023-01-13 Non-Stochastic CDF Estimation Using Threshold Queries Princewill Okoroafor et.al. 2301.05682v1 null
2023-01-12 See, Think, Confirm: Interactive Prompting Between Vision and Language Models for Knowledge-based Visual Reasoning Zhenfang Chen et.al. 2301.05226v1 null
2023-01-12 Domain Expansion of Image Generators Yotam Nitzan et.al. 2301.05225v1 null
2023-01-12 Guiding Text-to-Image Diffusion Model Towards Grounded Generation Ziyi Li et.al. 2301.05221v1 null
2023-01-12 Adversarial Adaptation for French Named Entity Recognition Arjun Choudhry et.al. 2301.05220v1 link
2023-01-12 NDNSD: Service Publishing and Discovery in NDN Saurab Dulal et.al. 2301.05218v1 null

(<a href=#Updated-on-20240404>back to top</a>)