
Learning from text-based close call data

Hughes, Peter, Figueres-Esteban, Miguel and Van Gulijk, Coen (2016) Learning from text-based close call data. Safety and Reliability: SaRS Journal. ISSN 0961-7353

PDF - Accepted Version


A key feature of big data is the variety of data sources available, which include not just numerical data but also image data, video data and even free text. The GB railways collect a large volume of free-text data daily from railway workers in the form of close call hazard reports: instances where an accident could have occurred but did not. These close call reports contain valuable safety information that could be useful in managing safety on the railway, but which can be lost in the very large volume of data, far more than a human analyst can viably read. This paper describes the application of rudimentary natural language processing (NLP) techniques to uncover safety information from close calls. The analysis has demonstrated that basic information extraction is possible using the rudimentary techniques, but has also identified some limitations that arise when only basic techniques are used. Building on these findings, further research in this area will look at how the techniques proven to date can be improved through more advanced NLP techniques coupled with machine learning.
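The abstract does not say which rudimentary techniques the paper uses, so the following is only a minimal sketch of one common basic approach: tokenising each free-text report and matching the tokens against keyword lists. The category names, keywords and sample reports are all hypothetical, invented for illustration, and are not taken from the paper or from real close call data.

```python
import re
from collections import Counter

# Hypothetical categories and keywords, invented for illustration only.
CATEGORY_KEYWORDS = {
    "trip hazard": {"trip", "tripping", "cable", "uneven"},
    "vegetation": {"vegetation", "overgrown", "branches"},
    "fencing": {"fence", "fencing", "gate"},
}

# Made-up example reports; not real close call data.
SAMPLE_REPORTS = [
    "Loose cable across walkway, tripping risk for staff.",
    "Overgrown vegetation obscuring signal near gate.",
    "Fence damaged, public could access track.",
    "Driver reported nothing unusual.",
]


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def categorize(report):
    """Return the sorted categories whose keywords appear in the report."""
    tokens = set(tokenize(report))
    return sorted(
        category
        for category, keywords in CATEGORY_KEYWORDS.items()
        if tokens & keywords
    )


def count_categories(reports):
    """Count how many reports mention each category at least once."""
    counts = Counter()
    for report in reports:
        counts.update(categorize(report))
    return counts
```

Calling `count_categories(SAMPLE_REPORTS)` tallies the reports per category. In this sketch the second sample report is tagged as both "fencing" and "vegetation" only because it contains the word "gate", which shows how plain keyword matching can mislabel a report when it ignores context.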

Item Type: Article
Subjects: Q Science > Q Science (General); T Technology > T Technology (General)
Schools: School of Computing and Engineering; School of Computing and Engineering > Institute of Railway Research
Depositing User: Peter Hughes
Date Deposited: 05 Jan 2017 12:17
Last Modified: 28 Aug 2021 16:27

