Natural Language Processing for Detecting Forward Reference in a Document

Daniel Siahaan, Izzatul Umami

Abstract


Meyer's seven sins have been recognized as types of mistakes that a requirements specialist are often fallen to when specifying requirements. Such mistakes play a significant role in plunging a project into failure. Many researchers were focusing in ambiguity and contradiction type of mistakes. Other types of mistakes have been given less attentions. Those mistakes often happened in reality and may equally costly as the first two mistakes. This paper introduces an approach to detect forward reference. It traverses through a requirements document, extracts, and processes each statement. During the statement extraction, any terms that may reside in the statement is also extracted. Based on certain rules which utilize POS patterns, the statement is classified as a term definition or not. For each term definition, a term is added to a list of defined terms. At the same time, every time a new term is found in a statement, it is check against the list of defined terms. If it is not found, then the requirements statement is classified as statement with forward reference. The experimentation on 30 requirements documents from various domains of software project shows that the approach has considerably almost perfect agreement with domain expert in detecting forward reference, given 0.83 kappa index value.

Keywords


Forward Reference; Natural Language Processing; Term

Full Text:

PDF

References


Meyer, B. 1985. On Formalism in Specifications. IEEE Software, 2(1), January 1985, 6–26.

Muliawan, I. W. Muliawan and Siahaan, D.O. 2012. Software Requirements Ambiguity Analysis based on SMART Requirements (Analisis Ambiguitas Kebutuhan Perangkat Lunak Berdasarkan Acuan SMART Requirements). In Manajemen Teknologi Informasi, SEMNAS XIV, Surabaya, Indonesia, 2012.

Hussain, I., Ormandjieva, O., and Kosseim, L. 2007. Automatic Quality Assessment of SRS Text by Means of a Decision-Tree- Based Text Classifier. In Proceeding of 7th International Conference on Quality Software, Portland USA, p.209-218.

Gnesi, S., Fabbrini, F. Fusani, M., and Trentanni, G. 2005. An automatic tool for the analysis of natural language requirements. International Journal of Computer Systems Science & Engineering, vol. 20(1), pp. 53–62.

Kamsties, E., Berry, D. M., and Paech, B. 2001., Detecting Ambiguities in Requirements Documents Using Inspections. in Proceedings of the First Workshop on Inspection in Software Engineering (WISE’01), pp. 68–80.

Denger, C., Berry, D. M., and Kamsties, E. 2003. Higher quality requirements specifications through natural language patterns. In Proc. of the IEEE Int. Conf. on Software – Sci. Tech. and Eng, pp. 80–91.

Marneffe, M.D., Rafferty, A.N., and Manning, C. D. 2008. Finding contradictions in text. In ACL 2008.

Siahaan, D. 2012. Software Requirements Analysis (Analisa Kebutuhan Dalam Rekayasa Perangkat Lunak. Penerbit Andi.

Cohen, J. 1960. A Coefficient of Agreement for Nominal Scales. I. Educ. Psycho!, Meat. 20:37-46.

J. R. Landis, J.R. and Koch, G.G. 1977. The Measurement of Observer Agreement for Categorical Data. Biometrics, vol. 33(1), pp. 159–174.




DOI: http://dx.doi.org/10.12962/j20882033.v23i4.99

Refbacks

  • There are currently no refbacks.


Creative Commons License

IPTEK Journal of Science and Technology by Lembaga Penelitian dan Pengabdian kepada Masyarakat, ITS is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Based on a work at https://iptek.its.ac.id/index.php/jts.