image recognition, pornography identification, medical AI
facial landmark tracking, face identity validation, liveness detection, face retrieval
original voice recognition and humming recognition, original voice elimination, speech synthesis
ID card OCR, business card OCR, vehicle license OCR, driver's license OCR, business license OCR, bank card OCR, universal OCR
Virtual face beautification (or makeup) has become a common operation in camera and image-processing apps, yet it is actually deceptive. In this paper, we propose the task of restoring a portrait image from this process. As the first attempt along this line, we assume unknown global operations on human faces and aim to tackle the two issues of skin smoothing and skin color change. These two tasks, intriguingly, pose very different difficulties: estimating subtle details and estimating major color variation. We propose a Component Regression Network (CRN) and address the limitation of using Euclidean loss in blind reversion. CRN maps edited portrait images back to the originals without knowing the details of the beautification operations. Our experiments demonstrate the effectiveness of the system for this novel task.
ICCV · 2017
Previous CNN-based video super-resolution approaches need to align multiple frames to the reference. In this paper, we show that proper frame alignment and motion compensation are crucial for achieving high-quality results. We accordingly propose a “sub-pixel motion compensation” (SPMC) layer in a CNN framework. Analysis and experiments show the suitability of this layer for video SR. The final end-to-end, scalable CNN framework effectively incorporates the SPMC layer and fuses multiple frames to reveal image details. Our implementation generates visually and quantitatively high-quality results, superior to the current state of the art, without the need for parameter tuning.
ICCV · 2017
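The abstract above does not spell out how the SPMC layer works. As a rough illustration only, one way to realize sub-pixel motion compensation is to forward-warp each low-resolution pixel onto a finer grid and distribute its value to the nearest grid points with bilinear weights. The function name `spmc_forward_warp`, the flow format, and the splatting scheme below are assumptions made for this sketch, not details taken from the text.

```python
import math

def spmc_forward_warp(lr, flow, scale):
    """Splat each low-res pixel onto a high-res grid at its motion-compensated position.

    lr    : 2D list of pixel intensities (rows of floats)
    flow  : 2D list of (u, v) displacements in low-res pixel units (assumed format)
    scale : integer upscaling factor
    """
    h, w = len(lr), len(lr[0])
    hr = [[0.0] * (w * scale) for _ in range(h * scale)]
    for y in range(h):
        for x in range(w):
            u, v = flow[y][x]
            # Sub-pixel target position on the high-res grid.
            xs, ys = scale * (x + u), scale * (y + v)
            x0, y0 = math.floor(xs), math.floor(ys)
            # Distribute the value to the four surrounding grid points.
            for qy in (y0, y0 + 1):
                for qx in (x0, x0 + 1):
                    if 0 <= qy < h * scale and 0 <= qx < w * scale:
                        weight = max(0.0, 1 - abs(xs - qx)) * max(0.0, 1 - abs(ys - qy))
                        hr[qy][qx] += lr[y][x] * weight
    return hr

# A single pixel shifted by a quarter of a low-res pixel lands halfway
# between two high-res columns when scale = 2.
print(spmc_forward_warp([[8.0]], [[(0.25, 0.0)]], 2))
```

In a network, an operation like this would have to be differentiable with respect to both the pixel values and the flow; the plain loops here only show the geometry.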
RGBD semantic segmentation requires joint reasoning about 2D appearance and 3D geometric information. In this paper, we propose a 3D graph neural network (3DGNN) that builds a k-nearest neighbor graph on top of a 3D point cloud. Each node in the graph corresponds to a set of points and is associated with a hidden representation vector initialized with an appearance feature extracted by a unary CNN from 2D images. Relying on recurrent functions, every node dynamically updates its hidden representation based on its current status and incoming messages from its neighbors. This propagation model is unrolled for a certain number of time steps, and the final per-node representation is used for predicting the semantic class of each pixel. We use back-propagation through time to train the model. Extensive experiments on the NYUD2 and SUN-RGBD datasets demonstrate the effectiveness of our approach.
ICCV · 2017
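The two mechanics the 3DGNN abstract describes, building a k-nearest neighbor graph over 3D points and unrolling a message-passing update for a fixed number of steps, can be sketched in a few lines. This is a toy illustration, not the paper's model: the learned recurrent update is replaced by a fixed blend of each node's state with the mean of its neighbors' states, and the names `knn_graph`, `propagate`, and `mix` are hypothetical.

```python
import math

def knn_graph(points, k):
    """Return, for each 3D point, the indices of its k nearest neighbors (k >= 1)."""
    neighbors = []
    for i, p in enumerate(points):
        dists = sorted(
            (math.dist(p, q), j) for j, q in enumerate(points) if j != i
        )
        neighbors.append([j for _, j in dists[:k]])
    return neighbors

def propagate(hidden, neighbors, steps, mix=0.5):
    """Unroll a toy update: blend each node's state with its neighbors' mean state.

    A real model would use a learned recurrent function here; the fixed
    linear blend is a stand-in chosen for this sketch.
    """
    for _ in range(steps):
        updated = []
        for i, h in enumerate(hidden):
            msgs = [hidden[j] for j in neighbors[i]]
            mean_msg = [sum(vals) / len(msgs) for vals in zip(*msgs)]
            updated.append([(1 - mix) * a + mix * b for a, b in zip(h, mean_msg)])
        hidden = updated
    return hidden

# Two well-separated pairs of points: information only flows within each pair.
points = [(0.0, 0.0, 0.0), (0.1, 0.0, 0.0), (5.0, 5.0, 5.0), (5.1, 5.0, 5.0)]
hidden = [[1.0, 0.0], [0.0, 0.0], [0.0, 1.0], [0.0, 0.0]]
graph = knn_graph(points, k=1)
print(graph)
print(propagate(hidden, graph, steps=1))
```

Because the graph is built from 3D distances rather than 2D pixel adjacency, nodes that are close in the image but far apart in depth do not exchange messages.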
The Youtu GrandEye Police system is a face retrieval engine built for security and protection needs. It draws on high-volume, multi-scene databases and integrates self-developed deep learning and cluster computing. It is an intelligent, large-scale face retrieval solution for scenarios such as search, deployment and control, criminal investigation and case handling, security and protection activities, and social services.
The Youtu GrandEye Searching Engine is designed to find missing persons more effectively and is built on a world-leading facial recognition system. It can locate a target face precisely among millions of faces. Thanks to its generality and practicality, it has been used to quickly identify the most similar person in missing-persons databases.
The smart solution launched by Youtu for the transportation industry can detect and track vehicles, store vehicle information in structured form, and automatically identify traffic violations, assisting in the construction of urban intelligent transportation.
Youtu FaceIn face verification validates user identity through 1:1 face verification and liveness detection. It compares a selfie video (or a selfie) with another photo, which may come from an ID card or from a selfie stored in advance, to confirm that the current user is who they claim to be.
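The decision logic behind 1:1 verification with liveness detection can be sketched as follows. This is a minimal illustration under stated assumptions, not Youtu's implementation: it assumes each face has already been converted to an embedding vector by some model, that a separate liveness check has produced a boolean, and that cosine similarity with a hypothetical threshold of 0.8 decides a match.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two embedding vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def verify_identity(selfie_embedding, reference_embedding, is_live, threshold=0.8):
    """Accept only if the liveness check passed and the two faces are similar enough.

    The threshold value is a placeholder; a real system would tune it
    against its own false-accept and false-reject requirements.
    """
    if not is_live:
        return False
    return cosine_similarity(selfie_embedding, reference_embedding) >= threshold

print(verify_identity([1.0, 0.0], [1.0, 0.0], is_live=True))   # same direction
print(verify_identity([1.0, 0.0], [0.0, 1.0], is_live=True))   # orthogonal
print(verify_identity([1.0, 0.0], [1.0, 0.0], is_live=False))  # liveness failed
```

Checking liveness first means a replayed photo or video is rejected regardless of how well it matches the reference.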
The Youtu FaceIn conference sign-in system is based on the self-developed UFace face recognition technology. It supports conferences of all sizes, such as company meetings, training classes, and large exhibitions. On site, attendees complete sign-in quickly with a face scan, which effectively eliminates proxy sign-in, impersonation, and similar behaviors.
Tencent Youtu Lab has launched a complete product solution, including anchor (host) identity validation and special beautification effects, for Internet products such as short-video shooting, live broadcasting, and image-processing tools. It ensures the safety and compliance of short-video and live-broadcast products while keeping them entertaining.
The Youtu Security Censorship Solution detects pornographic images and directly filters out those flagged with high confidence. Images with mid-level scores then undergo manual review. The result is an effective one-stop supervision service that replaces manual labor in screening out unlawful content and keeps risk under control.
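The two-tier routing described above (automatic filtering at high confidence, manual review for mid-level scores) amounts to a pair of thresholds on the classifier score. The sketch below assumes a score in the range 0 to 1; the function name `triage` and the threshold values 0.9 and 0.5 are hypothetical placeholders, not values from the product.

```python
def triage(score, block_threshold=0.9, review_threshold=0.5):
    """Route an image by its classifier score (assumed to lie in [0, 1]).

    Returns "block" for high-confidence detections, "review" for
    mid-level scores that need a human check, and "pass" otherwise.
    """
    if score >= block_threshold:
        return "block"
    if score >= review_threshold:
        return "review"
    return "pass"

for s in (0.97, 0.62, 0.10):
    print(s, triage(s))
```

Lowering `review_threshold` sends more images to human reviewers and reduces missed detections, at the cost of more manual work.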