MatNexus: A comprehensive text mining and analysis suite for materials discovery

  • In the evolving landscape of materials science, the exponential growth in scientific publications presents both an opportunity and a challenge. Efficiently extracting valuable insights from this vast volume of literature requires specialized tools that go beyond traditional methods. Here, we introduce MatNexus, a software package designed for the automated collection, processing, and analysis of text from scientific articles in the realm of materials science. MatNexus stands out with its integrated suite of modules, which retrieves scientific articles, processes textual data to uncover latent knowledge, generates vector representations suitable for machine learning applications, and offers advanced visualization capabilities for these word embeddings. Our tool addresses the critical need for effective and reproducible text mining in materials science, an area marked by increasing complexity and data intensity. By making the exploration of materials more efficient and insightful, as exemplified in our case study on electrocatalysts, MatNexus represents a significant advancement in the field. It offers an end-to-end solution for harnessing the wealth of information available in scientific literature, thus aiding the discovery and innovation process in materials science.
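The processing and embedding stages named in the abstract can be illustrated with a minimal, standard-library-only sketch. It does not use the MatNexus API and is not the authors' method: the toy corpus, the stopword list, and the function names `tokenize`, `cooccurrence_vectors`, and `cosine` are all hypothetical, and a simple count-based co-occurrence vector stands in for a trained word-embedding model.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical toy corpus; a real workflow would use retrieved article abstracts.
ABSTRACTS = [
    "Platinum is an active electrocatalyst for hydrogen evolution.",
    "Palladium is an active electrocatalyst for hydrogen evolution.",
    "Steel is a structural alloy for bridge construction.",
]

# Assumed minimal stopword list, for illustration only.
STOPWORDS = {"a", "an", "is", "for", "the", "of"}


def tokenize(text):
    """Lowercase the text, keep alphabetic tokens, and drop stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]


def cooccurrence_vectors(docs, window=2):
    """Map each word to a Counter of the words seen within `window` tokens of it."""
    vectors = defaultdict(Counter)
    for doc in docs:
        tokens = tokenize(doc)
        for i, word in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vectors[word][tokens[j]] += 1
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse count vectors (0.0 if either is empty)."""
    dot = sum(u[k] * v[k] for k in u.keys() & v.keys())
    norm = math.sqrt(sum(x * x for x in u.values())) * math.sqrt(
        sum(x * x for x in v.values())
    )
    return dot / norm if norm else 0.0


if __name__ == "__main__":
    vecs = cooccurrence_vectors(ABSTRACTS)
    # Words used in similar contexts should score higher than unrelated ones.
    print("platinum vs palladium:", cosine(vecs["platinum"], vecs["palladium"]))
    print("platinum vs steel:", cosine(vecs["platinum"], vecs["steel"]))
```

In this toy corpus, "platinum" and "palladium" share the same neighbouring words, so they score as highly similar, while "platinum" and "steel" share none. Ranking candidate materials by their similarity to a target term follows the same idea on a much larger corpus.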

Metadata
Author:Lei Zhang, Markus Stricker
URN:urn:nbn:de:hbz:294-120625
DOI:https://doi.org/10.1016/j.softx.2024.101654
Parent Title (English):SoftwareX
Publisher:Elsevier B.V.
Place of publication:Amsterdam
Document Type:Article
Language:English
Date of Publication (online):2025/01/27
Date of first Publication:2024/02/15
Publishing Institution:Ruhr-Universität Bochum, Universitätsbibliothek
Tag:Open Access Fonds
Electrocatalyst; Machine learning; Material science; Scientific papers; Text mining; Word embeddings
Volume:26
Issue:Article 101654
First Page:101654-1
Last Page:101654-8
Note:
Article Processing Charge funded by the Deutsche Forschungsgemeinschaft (DFG) and the Open Access Publication Fund of Ruhr-Universität Bochum.
Institutes/Facilities:Interdisciplinary Centre for Advanced Materials Simulation (ICAMS)
Dewey Decimal Classification:Technology, medicine, applied sciences / Engineering, mechanical engineering
open_access (DINI-Set):open_access
faculties:Faculty of Mechanical Engineering
Licence (English):Creative Commons - CC BY 4.0 - Attribution 4.0 International