Cai, Di and McCluskey, T.L. (2014) A General Framework of Generating Estimation Functions for Computing the Mutual Information of Terms. International Journal of Advanced Computer Science and Applications, 4 (11). pp. 198-208. ISSN 2158-107X
|
PDF
- Published Version
Available under License Creative Commons Attribution. Download (249kB) | Preview |
Abstract
Computing statistical dependence of terms in textual documents is a widely studied subject and a core problem in many areas of science. This study focuses on such a problem and explores the techniques of estimation using the expected mutual information measure. A general framework is established for tackling a variety of estimations: (i) general forms of estimation functions are introduced; (ii) a set of constraints for the estimation functions is discussed; (iii) general forms of probability distributions are defined; (iv) general forms of the measures for calculating mutual information of terms (MIT) are formalised; (v) properties of the MIT measures are studied and, (vi) relations between the MIT measures are revealed. Four estimation methods, as examples, are proposed and mathematical meanings of the individual methods are respectively interpreted. The methods may be directly applied to practical problems for computing dependence values of individual term pairs. Due to its generality, our method is applicable to various areas, involving statistical semantic analysis of textual data
Item Type: | Article |
---|---|
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Schools: | School of Computing and Engineering |
Related URLs: | |
Depositing User: | Thomas Leo Mccluskey |
Date Deposited: | 28 Feb 2017 11:26 |
Last Modified: | 28 Aug 2021 16:15 |
URI: | http://eprints.hud.ac.uk/id/eprint/31277 |
Downloads
Downloads per month over past year
Repository Staff Only: item control page
![]() |
View Item |