loading page

Extracting Variable Definitions from Documents on Chemical Processes Utilizing Semantic Information on Variables
  • Masaki Numoto,
  • Shota Kato,
  • Manabu Kano
Masaki Numoto
Kyoto University Graduate School of Informatics
Author Profile
Shota Kato
Kyoto University Graduate School of Informatics

Corresponding Author:[email protected]

Author Profile
Manabu Kano
Kyoto University Graduate School of Informatics
Author Profile

Abstract

Mathematical formulas are essential tools for conveying mathematical concepts. Definitions of symbols in mathematical formulas often vary among different documents; thus, knowing the definitions is fundamental to grasping the semantics of the formulas. This research targets how to extract definitions of symbols representing variables from documents on chemical processes. We defined three features focusing on the unique usage of variable symbols and definitions in these documents and proposed a new variable definition extraction method. We compared the performance of the proposed method with that of a representative conventional method using 45 papers on five chemical processes. The proposed method achieved higher accuracy than the conventional one for four processes. We also demonstrated that our newly defined features contributed to the performance improvement and that the proposed method can achieve high accuracy with a small number of training datasets.