CASE STUDY - Configuring and Customizing Sahana for the Lost Person Finder (LPF) Project

Aspects of Appearance - Eyes, Hair, Skin, Race, Ethnicity

Original author and date: Glenn Pearson, April 29, 2009

The following tables show the “old code” and “old string” values that are the default for a 0.6.x install, and the “new “replacement values chosen for LPF, based on the standard mentioned beneath each table. Also indicated are speculative possible mappings if LPF needed to interchange data with a default Sahana instance.

Field code: opt_eye_color

New Code New String Old Code Old String Comments/Suggested Mappings
XXX Unknown unk Unknown
BLK Black bla Black
BLU Blue blu Blue
BRO Brown bro Light Brown
GRY Gray GRY –> oth
GRN Green GRN –> oth
HAZ Hazel HAZ –> oth
MAR Maroon MAR –> oth
MUL Multicolored MUL –> oth
PNK Pink PNK –> oth
oth Other oth –> XXX

Source for new code & strings: FBI’s National Crime Information Center (NCIC) 2000 – Personal Descriptors. (For a full reference to NCIC 2000, see the CASE STUDY page concerning Location Hierarchy.)

Field code: opt_skin_color

New Code New String Old Code Old String Comments/Suggested Mappings
XXX Unknown unk Unknown Added for consistency; NCIC 2000 does not have “XXX - Unknown” for skin color
ALB Albino ALB –> whi
BLK Black bla Black
DRK Dark DRK –> bro
DBR Dark Brown bro Dark Brown
FAR Fair fai Fair
LGT Light LGT –> whi
LBR Light Brown LBR –> bro
MED Medium MED –> bro
MBR Medium Brown MBR –> bro
OLV Olive OLV –> oth
RUD Ruddy RUD –> oth
SAL Sallow SAL –> oth
whi White whi –> FAR
YEL Yellow YEL –> oth
oth Other oth –> XXX

Source for new code & strings: FBI’s National Crime Information Center (NCIC) 2000 – Personal Descriptors

COMMENTARY: This representation does not make much sense if considered as mapping to absolute skin tones, but as a collection of words that people might say as a description. Terms like “light”, “medium”, “dark” seem to have nothing to qualify, and perhaps are most meaningful in conjunction with a “race” selection, i.e., dark[-skinned] white, light[-skinned] black. In Sahana usage, it is not expected that color matching against a absolute standard would be possible. Skin-tone systems that use color-chip matching [1][2][3] or instrumental readings [4][5] are thus not so useful. It is possible to infer skin tone from photographs if the image includes a calibrated color chart [6], or the photographic setup (camera, lighting, subject placement) is calibrated. Photography of driver's licensee's may approach the latter, but the accompanying calibration images needed for skin tone assessment would not be available to Sahana.

Skin Tone References:

[1] Felix von Luschan, “Volker, Rassen, Sprachen”. For a modern rendering of historic von Luschen color tiles, see C. Burnett, http://commons.wikimedia.org/wiki/File:Felix_von_Luscahn_Skin_Color_Chart.svg

[2] Jean de Rigal, Marie-Laurence Abella, Franck Giron, Laurence Caisey, Marc Andre Lefebvre, “Development and validation of a new Skin Color Chart”, Skin Res. & Tech., 13(1), Wiley & Sons, pp 101-107, 22 Jan 2007.

[3] Others mentioned in [4] include the Hyperpigmentation Scale of Dr. Susan Taylor, and the commercial IMS Human Skin Tone Chart.

[4] Gary Grove, Charles Zerweck, Jonn Damia, “Human Skin Coloration using the RGB Color Space Model”, 4th Intl Symp, L'Oreal Inst for Ethnic Hair & Skin Res, Nov. 9-11, 2007. This reports a cyberDERM/Cortex joint project to develop a low-cost skin tone sensor.

[5] As reported in [4], Cortex Technology's DermaSpec and Minolta Chromameter are traditional, expensive instruments for skin tone evaluation.

[6] J. Marguier, N. Bhatti, H. Baker, M. Harville, S. Su¨sstrunk, “Assessing Human Skin Color from Uncalibrated Images”, Int J Imaging Syst Technol, 17, 143–151, 2007, Wiley; Online (www.interscience.wiley.com). DOI 10.1002/ima.20114 . also: http://infoscience.epfl.ch/record/104380/IJIST_ICIP_07_marguier.pdf

Field code: opt_hair_color

New Code New String Old Code Old String Comments/Suggested Mappings
XXX Unknown unk Unknown III interpretation. (Strict NCIC interpretation of XXX is ‘Unknown or Completely Bald’)
BLD Bald Code from Interstate Identification Index (III). If mapping to strict NCIC: BLD –> XXX, add ‘BALD’ to distinguishing marks)
BLK Black bla Black
BLN Blond or Strawberry blo Blond
BLU Blue BLU –> oth
BRO Brown bro Brown
GRY Gray or Partially Gray GRY –> oth
GRN Green GRN –> oth
ONG Orange ONG –> oth
PLE Purple PLE –> oth
PNK Pink PNK –> oth
RED Red or Auburn red Red
SDY Sandy SDY –> blo
WHI White WHI –> oth
oth Other oth –> XXX

Sources for new code & strings:

  • FBI’s National Crime Information Center (NCIC) 2000 – Personal Descriptors
  • Interstate Identification Index (III)

Field code: opt_race

It is proposed to use the US federal designators for Race (and possibly Ethnicity), in spite of their manifest shortcomings, because these are widely used in America for both investigative missing person reporting and health research analysis. (Alternatively, and friendlier, one could just have a free-text field to let the person reporting a missing person use his/her own words.)

New Code New String Old Code Old String NEMSIS Code NCIC Code (and String) Comments
U n/a unk Unknown U (Unknown) ‘U’ is from NCIC, CDC has no Unknown code.
R1 American Indian or Alaska Native 660 I (American Indian/Alaska Native)
R2 Asian 665 A (Asian or Pacific Islander) A–> R2 or R4
R3 Black or African American 670 B (Black)
R4 Native Hawaiian or Other Pacific Islander 675 A (Asian or Pacific Islander) A –> R2 or R4
R5 White 680 W (White)
filip Filipino
R9 Other Race other Other 685 n/a

In addition, the display_order field is set, giving:

Code String Display Order
R1 American Indian or Alaska Native 1
R2 Asian 2
R3 Black or African American 3
R4 Native Hawaiian or Other Pacific Islander 4
R5 White 5
R9 Other Race 6
U Unknown 7

Sources for new code & strings:

Center for Disease Control (CDC) Codes for Race and Ethnicity, circa 2000. The proposed categorization here reflects a minimal top-level hierarchy imposed government wide by the Office of Management and Budget (OMB). “New Code” values are the CDC “Hierarchical Codes”. Also shown is NEMSIS Encoding (Race), which has same hierarchy but different code values. CDC has defined additional sub-categorization codes (not shown), developed in conjunction with the Bureau of Census. Note: Census in 2000 allowed multiple races to be selected by an individual. NCIC 2000 and Sahana allow a single choice.

Possible Future Field code: opt_ethnicity [possibly as a checkbox]

New Code New String Old Code Old String NEMSIS Code Comments
E1 Hispanic or Latino 690 NCIC 2000 does not encode this, but can collect country of origin, and recommends denoting race based on individual characteristics
E2 Not Hispanic or Latino 695

Sources for new code & strings:

Center for Disease Control (CDC) Codes for Race and Ethnicity, circa 2000. The proposed categorization here reflects a minimal top-level hierarchy imposed government wide by the Office of Management and Budget (OMB). “New Code” values are the CDC “Hierarchical Codes”. Also shown is NEMSIS Encoding (Ethnicity), which has same hierarchy but different code values. CDC has defined additional E1 sub-categorization codes (not shown), developed in conjunction with the Bureau of Census. Note: Census in 2000 allowed multiple races/ethnicities to be selected by an individual.


Navigation
QR Code
QR Code dep:nlm-bhepp:casestudy:appearance (generated for current page)