Distributed and multicore QuBiLS-MIDAS software v2.0: Computing chiral, fuzzy, weighted and truncated geometrical molecular descriptors based on tensor algebra

César R. García-Jacas*, Yovani Marrero-Ponce, Ricardo Vivas-Reyes, José Suárez-Lezcano, Felix Martinez-Rios, Julio E. Terán, Longendri Aguilera-Mendoza

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

10 Scopus citations

Abstract

Advances to the distributed, multi-core and fully cross-platform QuBiLS-MIDAS software v2.0 (http://tomocomd.com/qubils-midas) are reported in this article since the v1.0 release. The QuBiLS-MIDAS software is the only one that computes atom-pair and alignment-free geometrical MDs (3D-MDs) from several distance metrics other than the Euclidean distance, as well as alignment-free 3D-MDs that codify structural information regarding the relations among three and four atoms of a molecule. The most recent features added to the QuBiLS-MIDAS software v2.0 are related (a) to the calculation of atomic weightings from indices based on the vertex-degree invariant (e.g., Alikhanidi index); (b) to consider central chirality during the molecular encoding; (c) to use measures based on clustering methods and statistical functions to codify structural information among more than two atoms; (d) to the use of a novel method based on fuzzy membership functions to spherically truncate inter-atomic relations; and (e) to the use of weighted and fuzzy aggregation operators to compute global 3D-MDs according to the importance and/or interrelation of the atoms of a molecule during the molecular encoding. Moreover, a novel module to compute QuBiLS-MIDAS 3D-MDs from their headings was also developed. This module can be used either by the graphical user interface or by means of the software library. By using the library, both the predictive models built with the QuBiLS-MIDAS 3D-MDs and the QuBiLS-MIDAS 3D-MDs calculation can be embedded in other tools. A set of predefined QuBiLS-MIDAS 3D-MDs with high information content and low redundancy on a set comprised of 20,469 compounds is also provided to be employed in further cheminformatics tasks. This set of predefined 3D-MDs evidenced better performance than all the universe of Dragon (v5.5) and PaDEL 0D-to-3D MDs in variability studies, whereas a linear independence study proved that these QuBiLS-MIDAS 3D-MDs codify chemical information orthogonal to the Dragon 0D-to-3D MDs. This set of predefined 3D-MDs would be periodically updated as long as new results be achieved. In general, this report highlights our continued efforts to provide a better tool for a most suitable characterization of compounds, and in this way, to contribute to obtaining better outcomes in future applications.

Original languageEnglish
Pages (from-to)1209-1227
Number of pages19
JournalJournal of Computational Chemistry
Volume41
Issue number12
DOIs
StatePublished - 5 May 2020

Bibliographical note

Publisher Copyright:
© 2020 Wiley Periodicals, Inc.

Funding

C.R.G.J. acknowledges to the program “Cátedras CONACYT” from “Consejo Nacional de Ciencia y Tecnología (CONACYT), México” by the support to the endowed chair 501/2018 at “Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE).” Y.M.P. thanks to the program “Profesor Invitado” for a post‐doctoral fellowship to work at Valencia University in 2018–2019. Y.M.P. also acknowledges the support from Universidad San Francisco de Quito Chancellor Grant 2017‐2018 (Project ID13525). C.R.G.J. acknowledges to the program ?C?tedras CONACYT? from ?Consejo Nacional de Ciencia y Tecnolog?a (CONACYT), M?xico? by the support to the endowed chair 501/2018 at ?Centro de Investigaci?n Cient?fica y de Educaci?n Superior de Ensenada (CICESE).? Y.M.P. thanks to the program ?Profesor Invitado? for a post-doctoral fellowship to work at Valencia University in 2018?2019. Y.M.P. also acknowledges the support from Universidad San Francisco de Quito Chancellor Grant 2017-2018 (Project ID13525).

FundersFunder number
Centro de Investigaci?n Cient?fica y de Educaci?n Superior de Ensenada
Cátedras CONACYT
Centro de Investigación Científica y de Educación Superior de Ensenada, Baja California
Consejo Nacional de Ciencia y Tecnología
Universitat de València
Universidad San Francisco de QuitoID13525, 2017‐2018

    Keywords

    • Choquet integral
    • GOWA aggregation operator
    • GOWAWA aggregation operator
    • QuBiLS-MIDAS software
    • chiral molecular descriptor
    • fuzzy membership function
    • fuzzy molecular descriptor
    • fuzzy spherical truncation
    • geometrical (3D) molecular descriptor
    • weighted molecular descriptor

    Fingerprint

    Dive into the research topics of 'Distributed and multicore QuBiLS-MIDAS software v2.0: Computing chiral, fuzzy, weighted and truncated geometrical molecular descriptors based on tensor algebra'. Together they form a unique fingerprint.

    Cite this