《Computational Visual Media》

作品数:421被引量:934H指数:12
导出分析报告
《Computational Visual Media》
主办单位:清华大学
最新期次:2025年1期更多>>
发文主题:RENDERINGSURVEYLEARNINGSEGMENTATIONDEEP更多>>
发文领域:自动化与计算机技术文化科学理学医药卫生更多>>
发文基金:国家自然科学基金国家重点基础研究发展计划北京市自然科学基金中国博士后科学基金更多>>
-

检索结果分析

结果分析中...
条 记 录,以下是1-10
视图:
排序:
Diffusion models for 3D generation: A survey
《Computational Visual Media》2025年第1期1-28,共28页Chen Wang Hao-Yang Peng Ying-Tian Liu Jiatao Gu Shi-Min Hu 
Denoising diffusion models have demonstrated tremendous success in modeling data distributions and synthesizing high-quality samples.In the 2D image domain,they have become the state-of-the-art and are capable of gene...
关键词:diffusion models 3D generation generative models AIG 
Mindstorms in natural language-based societies of mind
《Computational Visual Media》2025年第1期29-81,共53页Mingchen Zhuge Haozhe Liu Francesco Faccio Dylan R.Ashley Róbert Csordás Anand Gopalakrishnan Abdullah Hamdi Hasan Abed Al Kader Hammoud Vincent Herrmann Kazuki Irie Louis Kirsch Bing Li Guohao Li Shuming Liu Jinjie Mai Piotr Piękos Aditya A.Ramesh Imanol Schlag Weimin Shi Aleksandar Stanić Wenyi Wang Yuhui Wang Mengmeng Xu Deng-Ping Fan Bernard Ghanem and Jürgen Schmidhuber 
supported by the European Research Council(ERC,Advanced Grant Number 742870;the Swiss National Science Foundation(SNF,Grant Numbers 200021 and 192356);the National Natural Science Foundation of China(Grant Number 62476143).
Inspired by Minsky’s Society of Mind,Schmidhuber’s Learning to Think,and other more 9-16 recent works,this paper proposes and advocates for the concept of natural language-based societies of mind(NLSOMs).We imagine ...
关键词:mindstorm society of mind(SOM) large languagemodels(LLMs) multimodal learning learning to think 
Swin3D: A pretrained transformer backbone for 3D indoor scene understanding
《Computational Visual Media》2025年第1期83-101,共19页Yu-Qi Yang Yu-Xiao Guo Jian-Yu Xiong Yang Liu Hao Pan Peng-Shuai Wang Xin Tong Baining Guo 
The use of pretrained backbones with finetuning has shown success for 2D vision and natural language processing tasks,with advantages over taskspecific networks.In this paper,we introduce a pretrained 3D backbone,call...
关键词:3D pretraining ponitcloud analysis trans-former backbone Swin Transformer 3D semantic segmentation 3D object detection 
Script-to-Storyboard: A new contextual retrieval dataset and benchmark
《Computational Visual Media》2025年第1期103-122,共20页Xi Tian Yong-Liang Yang Qi Wu 
supported by RCUK grant CAMERA(EP/M023281/1,EP/T022523/1);the Centre for Augmented Reasoning(CAR)at the Australian Institute for Machine Learning,and a gift from Adobe.
Storyboards comprising key illustrations and images help filmmakers to outline ideas,key moments,and story events when filming movies.Inspired by this,we introduce the first contextual benchmark dataset Script-to-Stor...
关键词:DATASET BENCHMARK text-based image retrieval MOVIE 
Hybrid mesh-neural representation for 3D transparent object reconstruction
《Computational Visual Media》2025年第1期123-140,共18页Jiamin Xu Zihan Zhu Hujun Bao Weiwei Xu 
supported by“Pioneer”and“Leading Goose”R&D Program of Zhejiang(No.2023C01181);supported by National Natural Science Foundation of China(No.62302134);Zhejiang Provincial Natural Science Foundation(No.LQ24F020031);supported by Information Technology Center and State Key Lab of CAD&CG,Zhejiang University.
In this study,we propose a novel method to reconstruct the 3D shapes of transparent objects using images captured by handheld cameras under natural lighting conditions.It combines the advantages of an explicit mesh an...
关键词:transparent object 3D reconstruction environment matting neural rendering 
FACNet: Feature alignment fast point cloud completion network
《Computational Visual Media》2025年第1期141-157,共17页Xinxing Yu Jianyi Li Chi-Chong Wong Chi-Man Vong Yanyan Liang 
supported by the Zhuhai Industry-University-Research Project(No.2220004002411);National Key R&D Program of China(No.2021YFE0205700);Science and Technology Development Fund of Macao(Nos.0070/2020/AMJ,00123/2022/A3,and 0096/2023/RIA2);Zhuhai City Polytechnic Research Project(No.2024KYBS02);Shenzhen Science and Technology Innovation Committee(No.SGDX20220530111001006);the University of Macao under Grants MYRG(Nos.GRG2023-00061-FST UMDF and 2022-00084-FST)。
Point cloud completion aims to infer complete point clouds based on partial 3D point cloud inputs.Various previous methods apply coarseto-fine strategy networks for generating complete point clouds.However,such method...
关键词:3D point clouds shape completion geometry processing deep learning 
Exploring contextual priors for real-world image super-resolution
《Computational Visual Media》2025年第1期159-177,共19页Shixiang Wu Chao Dong Yu Qiao 
Real-world blind image super-resolution is a challenging problem due to the absence of target high resolution images for training.Inspired by the recent success of the single image generation based method SinGAN,we ta...
关键词:unsupervised learning blind super-resolution image context image generation 
LucIE: Language-guided local image editing for fashion images
《Computational Visual Media》2025年第1期179-194,共16页Huanglu Wen Shaodi You Ying Fu 
Language-guided fashion image editing is challenging,as fashion image editing is local and requires high precision,while natural language cannot provide precise visual information for guidance.In this paper,we propose...
关键词:deep learning language-guided image editing local image editing content preservation fashion images 
FCDFusion: A fast, low color deviation method for fusing visible and infrared image pairs
《Computational Visual Media》2025年第1期195-211,共17页Hesong Li Ying Fu 
supported by the National Natural Science Foundation of China under Grant Nos.62171038,61827901,and 62088101.
Visible and infrared image fusion(VIF)aims to combine information from visible and infrared images into a single fused image.Previous VIF methods usually employ a color space transformation to keep the hue and saturat...
关键词:infrared images visible and infrared image fusion(VIF) gamma correction real-time display color metrics color deviation 
Multi-scale enhancement and aggregation network for singleimage deraining
《Computational Visual Media》2025年第1期213-226,共14页Rui Zhang Yuetong Liu Huijian Han Yong Zheng Tao Zhang Yunfeng Zhang 
supported by the National Natural Science Foundation of China(No.61972227);the Natural Science Foundation of Shandong Province(No.ZR201808160102);Shandong Provincial Natural Science Foundation Key Project(No.ZR2020KF015);the Key Research and Development Project of Shandong Province(No.2019GSF109112);the Science and Technology Plan for Young Talents in Colleges and Universities of Shandong Province(No.2020KJN007);the Scientific Research Studio in Colleges and Universities of Ji’nan City(No.2021GXRC092);the Science and Technology Research Program for Colleges and Universities in Shandong Province(No.KJ2018BZN029).
Rain streaks in an image appear in different sizes and orientations,resulting in severe blurring and visual quality degradation.Previous CNNbased algorithms have achieved encouraging deraining results although there a...
关键词:single-image deraining multi-scale enhan-cement and aggregation(MEA) encoder-decoder network 
检索报告 对象比较 聚类工具 使用帮助 返回顶部