Applications - 3D Face Modeling, Analysis and Recognition

Graphics Reference

In-Depth Information

Table 5.3

3D static and dynamic (3D+time) face data sets for 3D facial expressions recognition

Database

Format

Subjects

Scans

Expressions

Image

Resolution

Year

BU-3DFE

static

100

2500

Color

1040

1329

2006

Bosphorus

static

105

4666

Color

1600

1200

2008

ZJU-3DFE

static

360

Color

2006

ADSIP

dynamic

210

Color

601 × 549

2009

BU-4DFE

dynamic

101

606

Color

1024 × 681

2008

Hi4D-ADSIP

dynamic

3360

Color

1024 × 1728

2011

and at the Bogazi¸i University (Bosphorus database) (Savran et al., 2008), have contributed to

push the research on 3D facial expression recognition and classification. Recently, some 4D

databases that include dynamic sequences of 3D scans along the time have been also released

for public use (Matuszewski et al., 2011, 2012; Yin et al., 2008). These 4D data are mainly

intended for studying facial expressions which are intrinsically related to the time variation of

the face.

More details on the individual data sets are reported in the following paragraphs, starting

with 3D static data sets and then presenting the most recent 4D dynamic data sets.

The Face Recognition Grand Challenge (FRGC) database

The Face Recognition Grand Challenge version 2.0 (FRGC v2.0) database (Phillips et al.,

2005) is the largest 3D database in terms of subjects enrolled and the most widely used in

order to compare methods for 3D face recognition. However, it does not provide categorized

facial expressions; hence it is less useful for the purpose of 3D facial expression recognition.

It includes 3D face scans partitioned in three sets, namely, Spring2003 set (943 scans of 275

individuals), Fall2003 and Spring2004 (4007 scans of 466 subjects in total). The Spring2003

set is also referred to as FRGC v1.0 an is used for training, set up, and tuning of methods,

whereas the Fall2003 and Spring2004 are referred to as FRGC v2.0 and are used for testing.

Face scans are given as matrices of points of size 480

640, with a binary mask indicating

the valid points of the face. Because of different distances of the subjects from the sensor

during acquisition, the actual number of points representing a face can vary. Individuals have

been acquired with frontal view from the shoulder level, with very small pose variations.

About 60 percent of the faces have neutral expression, and the others show expressions of

disgust, happiness, sadness, and surprise. In particular, according to a classification originally

performed at Geometrix (Maurer et al., 2005) and subsequently suggested by the FRGC

organizers, there are 2,469 neutral expression scans, 796 small cooperative expression scans,

and 742 large non-cooperative expression scans. Some scans have also occlusions because of

hair. Guidelines suggest using the Spring2003 set for training and the remaining two sets for

validation.

It is also interesting to note that a 2D color image of the subject is also provided for each

3D scan. In fact, acquisitions are performed using the Minolta Vivid 910 laser scanner that

permits a 2D color image capturing at the same time of the 3D acquisition (actually, the 2D

image is acquired just with a very short time delay, because different sensors are used for 2D

3D Face Modeling, Analysis and Recognition

Search WWH ::

Custom Search

Home