作者: ciliary-body 時(shí)間: 2025-3-21 21:56
Zero-Shot Facial Expression Recognition with Multi-label Label Propagationmotional classes to describe the varied and nuancing meaning conveyed by facial expression. However, it is almost impossible to enumerate all the emotional categories and collect adequate annotated samples for each category. To this end, we propose a zero-shot learning framework with multi-label lab作者: capsaicin 時(shí)間: 2025-3-22 03:53 作者: 傳染 時(shí)間: 2025-3-22 05:01
COSONet: Compact Second-Order Network for Video Face Recognitionpose, and also suffer from video-type noises such as motion blur, out-of-focus blur and low resolution. To tackle these two types of challenges, we propose an extensive framework which contains three aspects: neural network design, training data augmentation, and loss function. First, we devise an e作者: 細(xì)節(jié) 時(shí)間: 2025-3-22 09:11 作者: theta-waves 時(shí)間: 2025-3-22 13:04 作者: theta-waves 時(shí)間: 2025-3-22 17:23 作者: Fortify 時(shí)間: 2025-3-22 22:54
Understanding Individual Decisions of CNNs via Contrastive Backpropagationr understand individual decisions of deep convolutional neural networks. The saliency maps produced by them are proven to be non-discriminative. Recently, the Layer-wise Relevance Propagation (LRP) approach was proposed to explain the classification decisions of rectifier neural networks. In this wo作者: 沒(méi)有貧窮 時(shí)間: 2025-3-23 02:32
Say Yes to the Dress: Shape and Style Transfer Using Conditional GANsmage, while maintaining the image content and object shapes. In this paper we transfer both the shape and style of chosen objects between images, leaving the remaining areas unaltered. To tackle this problem, we propose a two stage method, where each stage contains a generative adversarial network, 作者: Virtues 時(shí)間: 2025-3-23 06:46
Towards Multi-class Object Detection in Unconstrained Remote Sensing Imageryfic monitoring and disaster management. The huge variation in object scale, orientation, category, and complex backgrounds, as well as the different camera sensors pose great challenges for current algorithms. In this work, we propose a new method consisting of a novel joint image cascade and featur作者: 小平面 時(shí)間: 2025-3-23 11:26
Panorama from Representative Frames of Unconstrained Videos Using DiffeoMeshesgnment of the frames taken from a hand-held video is often very difficult to perform. The method proposed here aims to generate a panorama view of the video shot given as input. The proposed framework for panorama creation consists of four stages: The first stage performs a sparse frame selection ba作者: fiction 時(shí)間: 2025-3-23 15:43
Robust and Efficient Ellipse Fitting Using Tangent Chord Distancemains unresolved due to many practical challenges such as occlusion, background clutter, noise and outlier, and so forth. In this paper, we introduce a novel geometric distance, called Tangent Chord Distance (TCD), to formulate the ellipse fitting problem. Under the least squares framework, TCD is u作者: 客觀 時(shí)間: 2025-3-23 21:41 作者: 尊敬 時(shí)間: 2025-3-24 01:39
Bidirectional Conditional Generative Adversarial Networksnd known auxiliary information (.). We propose the Bidirectional cGAN (BiCoGAN), which effectively disentangles . and . in the generation process and provides an encoder that learns inverse mappings from . to both . and ., trained jointly with the generator and the discriminator. We present crucial 作者: 壓倒 時(shí)間: 2025-3-24 05:30
Cross-Resolution Person Re-identification with Deep Antithetical Learningrson ReID model to handle the image resolution variations for improving its generalization ability. However, most existing person ReID methods pay little attention to this resolution discrepancy problem. One paradigm to deal with this problem is to use some complicated methods for mapping all images作者: 減震 時(shí)間: 2025-3-24 08:05
A Temporally-Aware Interpolation Network for Video Frame Inpaintingl video inpainting, frame interpolation, and video prediction. We devise a pipeline composed of two modules: a bidirectional video prediction module and a temporally-aware frame interpolation module. The prediction module makes two intermediate predictions of the missing frames, each conditioned on 作者: Pelvic-Floor 時(shí)間: 2025-3-24 12:33 作者: GORGE 時(shí)間: 2025-3-24 17:27 作者: Indelible 時(shí)間: 2025-3-24 19:57
Andris Ambainis,Mike Hamburg,Dominique Unruhing the remaining areas unaltered. To tackle this problem, we propose a two stage method, where each stage contains a generative adversarial network, that will alter the shape and style of objects in a subject image to reflect a donor image. We demonstrate the effectiveness of our method by transferring clothing between images.作者: barium-study 時(shí)間: 2025-3-25 02:26 作者: fatty-acids 時(shí)間: 2025-3-25 06:09 作者: 河潭 時(shí)間: 2025-3-25 07:45
Conference proceedings 2019object detection and categorization, vision and language, video analysis and event recognition, face and gesture analysis, statistical methods and learning, performance evaluation, medical image analysis, document analysis, optimization methods, RGBD and depth camera processing, robotic vision, applications of computer vision..作者: 大都市 時(shí)間: 2025-3-25 14:34
Advances in Cryptology – CRYPTO 2018r new emotion labels via a learned semantic space. To evaluate the proposed method, we collect a multi-label FER dataset FaceME. Experimental results on FaceME and two other FER datasets demonstrate that Z-ML.P framework improves the state-of-the-art zero-shot learning methods in recognizing both seen or unseen emotions.作者: escalate 時(shí)間: 2025-3-25 17:47 作者: flammable 時(shí)間: 2025-3-25 23:13
Zero-Shot Facial Expression Recognition with Multi-label Label Propagationr new emotion labels via a learned semantic space. To evaluate the proposed method, we collect a multi-label FER dataset FaceME. Experimental results on FaceME and two other FER datasets demonstrate that Z-ML.P framework improves the state-of-the-art zero-shot learning methods in recognizing both seen or unseen emotions.作者: CHYME 時(shí)間: 2025-3-26 00:55 作者: exophthalmos 時(shí)間: 2025-3-26 04:32
0302-9743 ds and learning, performance evaluation, medical image analysis, document analysis, optimization methods, RGBD and depth camera processing, robotic vision, applications of computer vision..978-3-030-20892-9978-3-030-20893-6Series ISSN 0302-9743 Series E-ISSN 1611-3349 作者: prolate 時(shí)間: 2025-3-26 10:34
Advances in Cryptology – CRYPTO 2018 Wasserstein GAN framework, we generate colored 3D shapes from text. Our method is the first to connect natural language text with realistic 3D objects exhibiting rich variations in color, texture, and shape detail.作者: 雪崩 時(shí)間: 2025-3-26 14:06
Mihir Bellare,Ruth Ng,Bj?rn Tackmanntudent networks have less than 1% accuracy loss comparing to their teacher models for CIFAR-100 datasets. The student networks are 2–6 times faster than their teacher models for inference, and the model size of MobileNet is less than half of DenseNet-100’s.作者: 存在主義 時(shí)間: 2025-3-26 20:14 作者: 考古學(xué) 時(shí)間: 2025-3-27 00:58 作者: Bernstein-test 時(shí)間: 2025-3-27 01:15 作者: Parabola 時(shí)間: 2025-3-27 07:47
Knowledge Distillation with Feature Maps for Image Classificationtudent networks have less than 1% accuracy loss comparing to their teacher models for CIFAR-100 datasets. The student networks are 2–6 times faster than their teacher models for inference, and the model size of MobileNet is less than half of DenseNet-100’s.作者: indignant 時(shí)間: 2025-3-27 10:20
A Temporally-Aware Interpolation Network for Video Frame Inpainting experiments demonstrate that our approach produces more accurate and qualitatively satisfying results than a state-of-the-art video prediction method and many strong frame inpainting baselines. Our code is available at ..作者: Dignant 時(shí)間: 2025-3-27 15:08
Linear Solution to the Minimal Absolute Pose Rolling Shutter Probleminimal number of 9 correspondences but provides even better results than the state-of-the-art R6P. Moreover, all proposed linear solvers provide a single solution while the state-of-the-art R6P provides up?to 20 solutions which have to be pruned by expensive verification.作者: 無(wú)價(jià)值 時(shí)間: 2025-3-27 21:19 作者: 先兆 時(shí)間: 2025-3-27 22:21 作者: frugal 時(shí)間: 2025-3-28 06:02
Advances in Cryptology – CRYPTO 2018f RankGAN. We focus on face images from the CelebA dataset in our work and show visual as well as quantitative improvements in face generation and completion tasks over other GAN approaches, including WGAN and LSGAN.作者: Rustproof 時(shí)間: 2025-3-28 07:43 作者: 爭(zhēng)吵 時(shí)間: 2025-3-28 11:14
https://doi.org/10.1007/978-3-030-20893-6artificial intelligence; computer vision; estimation; image coding; image processing; image reconstructio作者: 陳舊 時(shí)間: 2025-3-28 15:18
978-3-030-20892-9Springer Nature Switzerland AG 2019作者: Bridle 時(shí)間: 2025-3-28 22:35 作者: inventory 時(shí)間: 2025-3-29 00:31 作者: TEM 時(shí)間: 2025-3-29 04:31
Viet Tung Hoang,Stefano Tessaro,Ni Trieuically, in the first stage, both head image and its position are fed into a gaze direction pathway to predict the gaze direction, and then multi-scale gaze direction fields are generated to characterize the distribution of gaze points without considering the scene contents. In the second stage, the 作者: Ischemia 時(shí)間: 2025-3-29 09:13 作者: 魯莽 時(shí)間: 2025-3-29 11:34 作者: spinal-stenosis 時(shí)間: 2025-3-29 19:25
Advances in Cryptology – CRYPTO 2018 of existing approaches attempt to solve the problem in two or multiple stages, which is considered to be the bottleneck to optimize the overall performance. To address this issue, we propose an end-to-end trainable network architecture, named?., which is able to simultaneously localize and recogniz作者: embolus 時(shí)間: 2025-3-29 20:17
Advances in Cryptology – CRYPTO 2018 and colored 3D shapes. Our model combines and extends learning by association and metric learning approaches to learn implicit cross-modal connections, and produces a joint representation that captures the many-to-many relations between language and physical properties of 3D shapes such as color an作者: Licentious 時(shí)間: 2025-3-30 02:02
Alexandra Boldyreva,Daniele Micciancior understand individual decisions of deep convolutional neural networks. The saliency maps produced by them are proven to be non-discriminative. Recently, the Layer-wise Relevance Propagation (LRP) approach was proposed to explain the classification decisions of rectifier neural networks. In this wo作者: 憂傷 時(shí)間: 2025-3-30 05:30 作者: expunge 時(shí)間: 2025-3-30 08:42
https://doi.org/10.1007/978-3-030-26951-7fic monitoring and disaster management. The huge variation in object scale, orientation, category, and complex backgrounds, as well as the different camera sensors pose great challenges for current algorithms. In this work, we propose a new method consisting of a novel joint image cascade and featur作者: Felicitous 時(shí)間: 2025-3-30 14:22
Alexandra Boldyreva,Daniele Miccianciognment of the frames taken from a hand-held video is often very difficult to perform. The method proposed here aims to generate a panorama view of the video shot given as input. The proposed framework for panorama creation consists of four stages: The first stage performs a sparse frame selection ba作者: HAVOC 時(shí)間: 2025-3-30 20:04
Advances in Cryptology – CRYPTO 2019mains unresolved due to many practical challenges such as occlusion, background clutter, noise and outlier, and so forth. In this paper, we introduce a novel geometric distance, called Tangent Chord Distance (TCD), to formulate the ellipse fitting problem. Under the least squares framework, TCD is u作者: A精確的 時(shí)間: 2025-3-30 21:09 作者: Pandemic 時(shí)間: 2025-3-31 01:54 作者: Commission 時(shí)間: 2025-3-31 05:08
Advances in Cryptology – CRYPTO 2019rson ReID model to handle the image resolution variations for improving its generalization ability. However, most existing person ReID methods pay little attention to this resolution discrepancy problem. One paradigm to deal with this problem is to use some complicated methods for mapping all images作者: Overstate 時(shí)間: 2025-3-31 09:51
Lecture Notes in Computer Sciencel video inpainting, frame interpolation, and video prediction. We devise a pipeline composed of two modules: a bidirectional video prediction module and a temporally-aware frame interpolation module. The prediction module makes two intermediate predictions of the missing frames, each conditioned on 作者: Contort 時(shí)間: 2025-3-31 16:40
Lecture Notes in Computer Scienceproach the problem using simple and fast linear solvers in an iterative scheme. We present several solutions based on fixing different sets of variables and investigate the performance of them thoroughly. We design a new alternation strategy that estimates all parameters in each iteration linearly b作者: Medley 時(shí)間: 2025-3-31 21:08 作者: syring 時(shí)間: 2025-4-1 00:37 作者: 慎重 時(shí)間: 2025-4-1 02:31