MatNexus: A comprehensive text mining and analysis suite for materials discovery
In the evolving landscape of materials science, the exponential growth in scientific publications presents both an opportunity and a challenge. Efficiently extracting valuable insights from this vast volume of literature requires specialized tools that go beyond traditional methods. Here, we introduce MatNexus, a software package designed for the automated collection, processing, and analysis of text from scientific articles in the realm of materials science. MatNexus stands out with its integrated suite of modules, which retrieve scientific articles, process textual data to uncover latent knowledge, generate vector representations suitable for machine learning applications, and offer advanced visualization capabilities for these word embeddings. Our tool addresses the critical need for effective and reproducible text mining in materials science, an area marked by increasing complexity and data intensity. By making the exploration of materials more efficient and insightful, as exemplified in our case study on electrocatalysts, MatNexus represents a significant advancement in the field. It offers an end-to-end solution for harnessing the wealth of information available in scientific literature, thus aiding the discovery and innovation process in materials science.
Author: | Lei Zhang, Markus Stricker |
---|---|
URN: | urn:nbn:de:hbz:294-120625 |
DOI: | https://doi.org/10.1016/j.softx.2024.101654 |
Parent Title (English): | SoftwareX |
Publisher: | Elsevier B.V. |
Place of publication: | Amsterdam |
Document Type: | Article |
Language: | English |
Date of Publication (online): | 2025/01/27 |
Date of first Publication: | 2024/02/15 |
Publishing Institution: | Ruhr-Universität Bochum, Universitätsbibliothek |
Tag: | Open Access Fonds; Electrocatalyst; Machine learning; Material science; Scientific papers; Text mining; Word embeddings |
Volume: | 26 |
Issue: | Article 101654 |
First Page: | 101654-1 |
Last Page: | 101654-8 |
Note: | Article Processing Charge funded by the Deutsche Forschungsgemeinschaft (DFG) and the Open Access Publication Fund of Ruhr-Universität Bochum. |
Institutes/Facilities: | Interdisciplinary Centre for Advanced Materials Simulation (ICAMS) |
Dewey Decimal Classification: | Technology, medicine, applied sciences / Engineering, mechanical engineering |
open_access (DINI-Set): | open_access |
faculties: | Faculty of Mechanical Engineering |
Licence (English): | Creative Commons - CC BY 4.0 - Attribution 4.0 International |