What is the social history of the classification system dividing a categorical variable in the dataset? How do social groups talk about this classification system? How has it been contested?

Annotations

Enter a comma separated list of user names.
Martina Klausner's picture
February 19, 2024

our knowledge of the history of the particular dataset is still a bit limited. From the interview, we learn how the decision for installing the scanner was made or at least the intentions behind it. Interesting ist that there are several units involved that are at the intersection of common interest and economic interests. The unit that installed part of the scanners is linked and funded by the city of Frankfurt with a focus on enhancing retail in the city. The company that offers the scanners and had already installed one before the city "jumped on the train", is part of an investment company for commercial real estate. The company describes itself as coming from a catholic tradition and describes itself as conservative in its investment strategies. One of the central aims is to preserve the values of "lively inner cities" which, as claimed on their website, are bound to "Heimat and Identität". I don't see how this is linked to the data set / classification system itself but it seems worthwhile to consider the insterests and values claimed by those working with it. The aim of Hystreet (which was founded as part of the investment company's strategy for enhancing data analytics) is to provide retailers and urban planners with the data basis to adjust their strategies for preserving retail in inner cities. The same is true for the economic development unit. This seems to cover many of the stories of "urban data": the interests and values that led to generate ana analyze certain data are very diverse and somehow fade into the background when put together to create more representative data.

Looking at the actual dataset and its rows and colums immediately shows one difference to the bike data: weather seems to be particularly important to interprete the pedestrian data in the right way. Most probabaly, becuase such data are easy to collect in addition and in the same resolution. 

It seems the interesting decisions are less within the classifications system itself, than with regard how the data are generated (interview: we decided against hanging them close to the subway entrance as it traffic is not the core question - so the SITES of data production seem important) and how they are interpreted (interview: the red light model that was understood/used in completely opposite ways).