Deriving age and gender from forenames for consumer analytics

被引:26
|
作者
Lansley, Guy [1 ]
Longley, Paul [1 ]
机构
[1] UCL, Dept Geog, Gower St, London WC1E 6BT, England
基金
英国经济与社会研究理事会; 英国工程与自然科学研究理事会;
关键词
Age; Gender; Geodemographics; Big Data; Social media; DEMOGRAPHICS; CHOICE; NAMES;
D O I
10.1016/j.jretconser.2016.02.007
中图分类号
F [经济];
学科分类号
02 ;
摘要
This paper explores the age and gender distributions of the bearers of British forenames and identifies key trends in British naming conventions. Age and gender characteristics are known to greatly influence consumption behaviour, and so extracting and using names to indicate these characteristics from consumer datasets is of clear value to the retail and marketing industries. Data representing over 17 million individuals sourced from birth certificates and market data have been modelled to estimate the total age and gender distributions of 32,000 unique forenames in Britain. When aggregated into five year age bands for each gender, the data reveal distinctive age profiles for different names, which are largely a product of the rise and decline in popularity of different baby names over the past 90 years. The names database produced can be used to infer the expected age and gender structures of many consumer datasets, as well as to anticipate key characteristics of consumers at the level of the individual. (C) 2016 The Authors. Published by Elsevier Ltd.
引用
收藏
页码:271 / 278
页数:8
相关论文
共 50 条