Computing and Library Services - delivering an inspiring information environment

Framework for the Optimisation of Full Text Queries on Safety Incident Reports

Newall, Matthew (2020) Framework for the Optimisation of Full Text Queries on Safety Incident Reports. Masters thesis, University of Huddersfield.

PDF - Accepted Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Download (1MB) | Preview


The Railway Industry in Great Britain is on the brink of a seed change in the methods used to carry out safety and risk management. Advances in technology over the past few decades have enabled the collection of varied and detailed information regarding the state of track, vehicles, stations, facilities, and personnel at an ever-increasing scale. As more and more industry processes embrace data driven techniques, the growing volume of data being recorded poses a significant challenge in terms of processing and knowledge extraction.

In these chapters, a case study is made of the Close Call system used by railway organisations in Great Britain. Personnel working on the railway network can make reports to this system by phone, email, and in-app, if they see something they consider to have the potential to cause harm or damage. The near ubiquity of mobile devices allows all staff to make free-text reports at any time which has great potential for the completeness of reporting but comes with some drawbacks. The free-text nature of the reports means that basic processing is inadequate for tasks such as categorisation, knowledge extraction, and reporter feedback, requiring ongoing research into appropriate natural language processing techniques. A variety of techniques have shown promise, but the vast and growing quantity of close call reports being made has meant that computing capacity has limited the number of reports that can be processed, and work so far has often been applied only to a small subset of reports.

The aim of this thesis is to demonstrate that it is possible to process or otherwise transform the close call text to allow existing and future analysis techniques pertaining to this text (and data sources like it) to be applied on a larger scale or smaller timeframe. A novel text indexing technique is presented and evaluated, and when compared to other text indexing and pattern matching methods, it is shown that a significant speed increase is possible over the methods previously used.

Versions of the matching technique described in this work have been presented at the European Safety and Reliability Conference in 2017 and 2019.

Item Type: Thesis (Masters)
Subjects: T Technology > T Technology (General)
T Technology > TF Railroad engineering and operation
Schools: School of Computing and Engineering
School of Computing and Engineering > Institute of Railway Research
Depositing User: Annabel Danson-Darbyshire
Date Deposited: 15 Nov 2021 11:57
Last Modified: 15 Nov 2021 12:00


Downloads per month over past year

Repository Staff Only: item control page

View Item View Item

University of Huddersfield, Queensgate, Huddersfield, HD1 3DH Copyright and Disclaimer All rights reserved ©