In this thesis, we develop methods for modeling route choice behavior using smartphone data. The developing global positioning system (GPS) technology and the popularity of smartphones have revolutionized the revealed preference route choice data collection. Nowadays, smartphones are embedded with various kinds of sensors that are able to provide mobility related information. These sensors include GPS, accelerometer and bluetooth. The recorded raw data is not directly applicable to travel behavior study, information such as the paths and transport modes of travels have to be inferred. The inference procedure is challenging due to the poor quality and the variety of the data. This thesis deals with these challenges by proposing probabilistic methods that account for errors in the data, and fusing various kinds of smartphone data in an integrated framework. Based on the inference methods, a route choice modeling framework exploiting GPS data is developed. The low cost sensors of smartphones observe measurements with significant errors. Moreover, due to practical constraints, such as the limits on smartphone battery volume and the cost of data transmitted via wireless networks, data are usually recorded in a relatively large time interval (low frequency). These drawbacks preclude path identification (a.k.a. map-matching, MM) algorithms that are designed for dense and accurate data from dedicated GPS devices. Therefore, we first propose a probabilistic unimodal MM method that infers the traveled paths from GPS data recorded during a car trip. Instead of deterministically matching a sequence of GPS points to one path, it generates a probabilistic path observation which is composed of a set of candidate paths, and a measurement likelihood for each path. The candidate paths are generated by a candidate path generation algorithm from GPS data. It is capable of dealing with both accurate and dense data (1 second interval) from dedicated GPS devices, and noisy and sparse data (more than 10 seconds interval) from smartphones. A probabilistic measurement model is constructed to calculate the measurement likelihood, which is the likelihood that the observed GPS data is recorded along a given path. The probabilistic measurement model employs structural equation modeling techniques, and the latent status for each measurement is defined as the true location where the measurement is observed. A GPS sensor measurement model relates the status to each GPS measurement; a structural travel model captures the status over time in the network. In this approach, besides geographical coordinates, speed and time recorded from GPS also contribute to the identification of the true path. Applications and analyses on real data illustrate the robustness and effectiveness of the proposed approach. Based on the framework designed for the unimodal MM, a multimodal MM method is developed to deal with a more general problem where the trips can be multimodal and the modes are unknown
Jan Skaloud, Gabriel François Laupré