Urban-semantic computer vision: a framework for contextual understanding of people in urban spaces

Authors
Anthony Vanky
Ri Le
Affiliation
[1] Columbia University, Graduate School of Architecture, Planning and Preservation
Source
AI & SOCIETY | 2023 / Vol. 38
Keywords
Artificial intelligence; Computer vision; Urban space; Urban context; Urbanism; Semantic meaning; Thick description; Evaluation
DOI
Not available
Abstract
Increasing computational power and improving deep learning methods have made computer vision technologies pervasive in urban environments. Their applications in policing, traffic management, and documenting public spaces are increasingly common (Ridgeway 2018, Coifman et al. 1998, Sun et al. 2020). Beyond the often-discussed biases in the algorithms' training and their unequally borne benefits (Khosla et al. 2012), almost all applications reduce urban experiences to simplistic, mechanistic measures. These practices lack the context, depth, and specificity that would enable semantic knowledge or analysis within urban contexts, especially regarding how people use and occupy urban space. This paper critiques existing uses of artificial intelligence and computer vision in urban practices and proposes a new framework for understanding people, action, and public space. It revisits Geertz's (1973) use of thick description in generating interpretive theories of culture and activity, and uses this lens to establish a framework for evaluating how the varied uses of computer vision technologies weigh meaning. By discussing implemented examples of urban computer vision, from LinkNYC and Numina's urban measurements to the Detroit Police's use of DataWorks Plus's facial recognition technology, it proposes a framework for evaluating the thickness of an algorithm's conclusions against the complexity of the computational method required to produce that outcome. Further, we discuss how the framework's positioning may differ (and conflict) among different users of the technology, from engineer to urban planner and policymaker to citizen.
This paper also discusses how the current use and training of deep learning algorithms limit semantic learning, and proposes three potential methodologies for gaining a more contextually specific, urban-semantic description of urban space relevant to urbanists. It contributes to the critical conversations regarding the proliferation of artificial intelligence by challenging the current applications of these technologies in the urban environment and highlighting their failures within this context, while also proposing an evolution of these algorithms that may ultimately make them sensitive and useful within this spatial and cultural milieu.
Pages: 1193-1207
Page count: 14