TOMMCCAP - volume 18 - 2022 论文列表 |
点击这里查看 ACM Transactions on Multimedia Computing, Communications, and Applications 的JCR分区、影响因子等信息 |
Zhihan Lv Zengchen Yu Shuxuan Xie Atif Alamri
Mask or Non-Mask? Robust Face Mask Detector via Triplet-Consistency Representation Learning.Chun-Wei Yang Thanh Hai Phung Hong-Han Shuai Wen-Huang Cheng
TT-TSVD: A Multi-modal Tensor Train Decomposition with Its Application in Convolutional Neural Networks for Smart Healthcare.Debin Liu Laurence T. Yang Puming Wang Ruonan Zhao Qingchen Zhang
A Convolutional Neural Network Model Using Weighted Loss Function to Detect Diabetic Retinopathy.Mehedi Masud Mohammed F. Alhamid Yin Zhang
A Multi-feature and Time-aware-based Stress Evaluation Mechanism for Mental Status Adjustment.Min Chen Wenjing Xiao Miao Li Yixue Hao Long Hu Guangming Tao
Special Section on AI-empowered Multimedia Data Analytics for Smart Healthcare.M. Shamim Hossain Rita Cucchiara Ghulam Muhammad Diana Patricia Tobón Vallejo Abdulmotaleb El-Saddik
From Coarse to Fine: Hierarchical Structure-aware Video Summarization.Wenxu Li Gang Pan Chen Wang Zhen Xing Zhenjun Han
ECCNAS: Efficient Crowd Counting Neural Architecture Search.Yabin Wang Zhiheng Ma Xing Wei Shuai Zheng Yaowei Wang Xiaopeng Hong
Exploring Relations in Untrimmed Videos for Self-Supervised Learning.Dezhao Luo Yu Zhou Bo Fang Yucan Zhou Dayan Wu Weiping Wang
Fine-Grained Adversarial Semi-Supervised Learning.Daniele Mugnai Federico Pernici Francesco Turchini Alberto Del Bimbo
Instance Correlation Graph for Unsupervised Domain Adaptation.Lei Wu Hefei Ling Yuxuan Shi Baiyan Zhang
Fine-grained Human Analysis under Occlusions and Perspective Constraints in Multimedia Surveillance. Fine-grained Image Classification via Multi-scale Selective Hierarchical Biquadratic Pooling.Min Tan Fu Yuan Jun Yu Guijun Wang Xiaoling Gu
Rectified Meta-learning from Noisy Labels for Robust Image-based Plant Disease Classification.Deming Zhai Ruifeng Shi Junjun Jiang Xianming Liu
Age-Invariant Face Recognition by Multi-Feature Fusionand Decomposition with Self-attention.Chenggang Yan Lixuan Meng Liang Li Jiehua Zhang Zhan Wang Jian Yin Jiyong Zhang Yaoqi Sun Bolun Zheng
Seeing Crucial Parts: Vehicle Model Verification via a Discriminative Representation Model.Liqian Liang Congyan Lang Zun Li Jian Zhao Tao Wang Songhe Feng
JoT-GAN: A Framework for Jointly Training GAN and Person Re-Identification Model.Zhongwei Zhao Ran Song Qian Zhang Peng Duan Youmei Zhang
BiRe-ID: Binary Neural Network for Efficient Person Re-ID.Sheng Xu Chang Liu Baochang Zhang Jinhu Lü Guodong Guo David S. Doermann
Hybrid Modality Metric Learning for Visible-Infrared Person Re-Identification.La Zhang Haiyun Guo Kuan Zhu Honglin Qiao Gaopan Huang Sen Zhang Huichen Zhang Jian Sun Jinqiao Wang
Introduction to the Special Issue on Fine-Grained Visual Recognition and Re-Identification.Shiliang Zhang Guorong Li Weigang Zhang Qingming Huang Tiejun Huang Mubarak Shah Nicu Sebe
Kedar Nath Singh Amit Kumar Singh
Scribble-Supervised Meibomian Glands Segmentation in Infrared Images.Xiaoming Liu Shuo Wang Ying Zhang Quan Yuan
Deep Illumination-Enhanced Face Super-Resolution Network for Low-Light Images.Kehua Guo Min Hu Sheng Ren Fangfang Li Jian Zhang Haifu Guo Xiaoyan Kui
Blockchain-Based Audio Watermarking Technique for Multimedia Copyright Protection in Distribution Networks.Iynkaran Natgunanathan Purathani Praitheeshan Longxiang Gao Yong Xiang Lei Pan
A Format-compatible Searchable Encryption Scheme for JPEG Images Using Bag-of-words.Zhihua Xia Qiuju Ji Qi Gu Chengsheng Yuan Fengjun Xiao
Improving Crowd Density Estimation by Fusing Aerial Images and Radio Signals.Kai-Wei Yang Yen-Yun Huang Jen-Wei Huang Ya-Rou Hsu Chang-Lin Wan Hong-Han Shuai Li-Chun Wang Wen-Huang Cheng
When Pairs Meet Triplets: Improving Low-Resource Captioning via Multi-Objective Optimization.Yike Wu Shiwan Zhao Ying Zhang Xiaojie Yuan Zhong Su
Multilayer Video Encoding for QoS Managing of Video Streaming in VANET Environment. Distributed Gateway Selection for Video Streaming in VANET Using IP Multicast.Debanjan Roy Chowdhury Sukumar Nandi Diganta Goswami
Skeleton Sequence and RGB Frame Based Multi-Modality Feature Fusion Network for Action Recognition.Xiaoguang Zhu Ye Zhu Haoyu Wang Honglin Wen Yan Yan Peilin Liu
GraSP: Local Grassmannian Spatio-Temporal Patterns for Unsupervised Pose Sequence Recognition.Himanshu Buckchash Balasubramanian Raman
A Multimodal Framework for Large-Scale Emotion Recognition by Fusing Music and Electrodermal Activity Signals.Guanghao Yin Shouqian Sun Dian Yu Dejian Li Kejun Zhang
Matching Faces and Attributes Between the Artistic and the Real Domain: the PersonArt Approach.Marcella Cornia Matteo Tomei Lorenzo Baraldi Rita Cucchiara
Inner Knowledge-based Img2Doc Scheme for Visual Question Answering.Qun Li Fu Xiao Bir Bhanu Biyun Sheng Richang Hong
Recognizing Gaits Across Walking and Running Speeds.Lingxiang Yao Worapan Kusakunniran Qiang Wu Jingsong Xu Jian Zhang
CRAR: Accelerating Stereo Matching with Cascaded Residual Regression and Adaptive Refinement. Objective Object Segmentation Visual Quality Evaluation: Quality Measure and Pooling Method.Ran Shi Jing Ma King Ngi Ngan Jian Xiong Tong Qiao
Domain-invariant Graph for Adaptive Semi-supervised Domain Adaptation.Jinfeng Li Weifeng Liu Yicong Zhou Jun Yu Dapeng Tao Changsheng Xu
Enhanced 3D Shape Reconstruction With Knowledge Graph of Category Concept.Guofei Sun Yongkang Wong Mohan S. Kankanhalli Xiangdong Li Weidong Geng
Learning Adaptive Spatial-Temporal Context-Aware Correlation Filters for UAV Tracking.Di Yuan Xiaojun Chang Zhihui Li Zhenyu He
Shuffle-invariant Network for Action Recognition in Videos.Qinghongya Shi Hong-Bo Zhang Zhe Li Ji-Xiang Du Qing Lei Jing-Hua Liu
Interactive Re-ranking via Object Entropy-Guided Question Answering for Cross-Modal Image Retrieval. Causal Inference with Knowledge Distilling and Curriculum Learning for Unbiased VQA.Wenlin Zhuang Congyi Wang Jinxiang Chai Yangang Wang Ming Shao Siyu Xia
Fully Unsupervised Person Re-Identification via Selective Contrastive Learning.Bo Pang Deming Zhai Junjun Jiang Xianming Liu
Adversarial Multi-Grained Embedding Network for Cross-Modal Text-Video Retrieval.Ning Han Jingjing Chen Hao Zhang Huanwen Wang Hao Chen
Transform, Warp, and Dress: A New Transformation-guided Model for Virtual Try-on.Matteo Fincato Marcella Cornia Federico Landi Fabio Cesari Rita Cucchiara
Non-Acted Text and Keystrokes Database and Learning Methods to Recognize Emotions.Madiha Tahir Zahid Halim Atta Ur Rahman Muhammad Waqas Shanshan Tu Sheng Chen Zhu Han
Structure-aware Meta-fusion for Image Super-resolution.Haoyu Ma Bingchen Gong Yizhou Yu
Tell, Imagine, and Search: End-to-end Learning for Composing Text and Image to Image Retrieval.Feifei Zhang Mingliang Xu Changsheng Xu
SADnet: Semi-supervised Single Image Dehazing Method Based on an Attention Mechanism.Ziyi Sun Yunfeng Zhang Fangxun Bao Ping Wang Xunxiang Yao Caiming Zhang
Learning Transferable Perturbations for Image Captioning.Hanjie Wu Yongtuo Liu Hongmin Cai Shengfeng He
Moment is Important: Language-Based Video Moment Retrieval via Adversarial Learning.Yawen Zeng Da Cao Shaofei Lu Hanling Zhang Jiao Xu Zheng Qin
Deep Semantic and Attentive Network for Unsupervised Video Summarization.Sheng-Hua Zhong Jingxu Lin Jianglin Lu Ahmed Fares Tongwei Ren
Will You Ever Become Popular? Learning to Predict Virality of Dance Clips.Jiahao Wang Yunhong Wang Nina Weng Tianrui Chai Annan Li Faxi Zhang Sansi Yu
An l½ and Graph Regularized Subspace Clustering Method for Robust Image Segmentation.Jobin Francis M. Baburaj Sudhish N. George
Response Generation by Jointly Modeling Personalized Linguistic Styles and Emotions.Teng Sun Chun Wang Xuemeng Song Fuli Feng Liqiang Nie
Generating Virtual Wire Sculptural Art from 3D Models.Chih-Kuo Yeh Thi Ngoc Hanh Le Zhi-Ying Hou Tong-Yee Lee
Machine Learning Based Content-Agnostic Viewport Prediction for 360-Degree Video.Sam Van Damme Maria Torres Vega Filip De Turck
Cascaded Structure-Learning Network with Using Adversarial Training for Robust Facial Landmark Detection.Shenming Feng Xingzhong Nong Haifeng Hu
Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training.Yehao Li Jiahao Fan Yingwei Pan Ting Yao Weiyao Lin Tao Mei
An Effective Forest Fire Detection Framework Using Heterogeneous Wireless Multimedia Sensor Networks.Burak Kizilkaya Enver Ever Hakan Yekta Yatbaz Adnan Yazici
Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based Action Recognition.Yansong Tang Xingyu Liu Xumin Yu Danyang Zhang Jiwen Lu Jie Zhou
Evaluation of an Intervention Program Based on Mobile Apps to Learn Sexism Prevention in Teenagers.Pedro Morillo José J. Navarro-Pérez Juan M. Orduña Marcos Fernández
Efficient Light Field Image Compression with Enhanced Random Access.Hadi Amirpour António M. G. Pinheiro Manuela Pereira Fernando J. P. Lopes Mohammed Ghanbari
Changming Liu Xiaojing Ma Sixing Cao Jiayun Fu Bin B. Zhu
Authentication of LINE Chat History Files by Information Hiding. LogoDet-3K: A Large-scale Image Dataset for Logo Detection.Jing Wang Weiqing Min Sujuan Hou Shengnan Ma Yuanjie Zheng Shuqiang Jiang
Robust Unsupervised Gaze Calibration Using Conversation and Manipulation Attention Priors.Rémy Siegfried Jean-Marc Odobez
Optimizing Immersive Video Coding Configurations Using Deep Learning: A Case Study on TMIV.Chih-Fan Hsu Tse-Hou Hung Cheng-Hsin Hsu
RD-IOD: Two-Level Residual-Distillation-Based Triple-Network for Incremental Object Detection.Dongbao Yang Yu Zhou Wei Shi Dayan Wu Weiping Wang
An Empirical Method for Causal Inference of Constructs for QoE in Haptic-Audiovisual Communications. Hyperspectral Image Reconstruction Using Multi-scale Fusion Learning.Xian-Hua Han Yinqiang Zheng Yen-Wei Chen
Defining Scents: A Systematic Literature Review of Olfactory-based Computing Systems.Amanda K. Holloman Chris S. Crawford
CAPTAIN: Comprehensive Composition Assistance for Photo Taking.Farshid Farhat Mohammad Mahdi Kamani James Z. Wang
Diversely-Supervised Visual Product Search.William Thong Cees G. M. Snoek
Mimicking Individual Media Quality Perception with Neural Network based Artificial Observers.Lohic Fotio Tiotsop Tomas Mizdos Marcus Barkowsky Peter Pocta Antonio Servetti Enrico Masala
Mask-Guided Deformation Adaptive Network for Human Parsing.Aihua Mao Yuan Liang Jianbo Jiao Yongtuo Liu Shengfeng He
Learning Hierarchical Video Graph Networks for One-Stop Video Delivery.Yaguang Song Junyu Gao Xiaoshan Yang Changsheng Xu
The Impact of Artificial Intelligence on the Creativity of Videos.Ana Daniela Peres Rebelo Guedes De Oliveira Inês D. E. Verboom Damion
Facial-expression-aware Emotional Color Transfer Based on Convolutional Neural Network.Shiguang Liu Huixin Wang Min Pei
A Novel Multi-Modal Network-Based Dynamic Scene Understanding.Md Azher Uddin Joolekha Bibi Joolee Young-Koo Lee Kyung-Ah Sohn
Multi-feature Fusion VoteNet for 3D Object Detection.Zhoutao Wang Qian Xie Mingqiang Wei Kun Long Jun Wang
MMSUM Digital Twins: A Multi-view Multi-modality Summarization Framework for Sporting Events.Samah Aloufi Abdulmotaleb El-Saddik
TTV Regularized LRTA Technique for the Estimation of Haze Model Parameters in Video Dehazing. Modeling the User Experience of Watching 360° Videos with Head-Mounted Displays.Ching-Ling Fan Tse-Hou Hung Cheng-Hsin Hsu
Online Learning for Adaptive Video Streaming in Mobile Networks.Theodoros Karagkioules Georgios S. Paschos Nikolaos Liakopoulos Attilio Fiandrotti Dimitrios Tsilimantos Marco Cagnazzo
Sparse LIDAR Measurement Fusion with Joint Updating Cost for Fast Stereo Matching.