Articles

Genetics and precision health: the ecological fallacy and artificial intelligence solutions

Authors: Scott M. Williams and Jason H. Moore

Citation: BioData Mining 2023 16:9

Content type: Editorial Published on: 13 March 2023
Prediction of the risk of developing end-stage renal diseases in newly diagnosed type 2 diabetes mellitus using artificial intelligence algorithms

Type 2 diabetes mellitus (T2DM) imposes a great burden on healthcare systems, and these patients experience higher long-term risks for developing end-stage renal disease (ESRD). Managing diabetic nephropathy b...

Authors: Shuo-Ming Ou, Ming-Tsun Tsai, Kuo-Hua Lee, Wei-Cheng Tseng, Chih-Yu Yang, Tz-Heng Chen, Pin-Jie Bin, Tzeng-Ji Chen, Yao-Ping Lin, Wayne Huey-Herng Sheu, Yuan-Chia Chu and Der-Cherng Tarng

Citation: BioData Mining 2023 16:8

Content type: Research Published on: 10 March 2023
Signature literature review reveals AHCY, DPYSL3, and NME1 as the most recurrent prognostic genes for neuroblastoma

Neuroblastoma is a childhood neurological tumor which affects hundreds of thousands of children worldwide, and information about its prognosis can be pivotal for patients, their families, and clinicians. One o...

Authors: Davide Chicco, Tiziana Sanavia and Giuseppe Jurman

Citation: BioData Mining 2023 16:7

Content type: Research Published on: 4 March 2023
Ten simple rules for providing bioinformatics support within a hospital

Bioinformatics has become a key aspect of the biomedical research programmes of many hospitals’ scientific centres, and the establishment of bioinformatics facilities within hospitals has become a common pract...

Authors: Davide Chicco and Giuseppe Jurman

Citation: BioData Mining 2023 16:6

Content type: Brief Report Published on: 23 February 2023
iU-Net: a hybrid structured network with a novel feature fusion approach for medical image segmentation

In recent years, convolutional neural networks (CNNs) have made great achievements in the field of medical image segmentation, especially full convolutional neural networks based on U-shaped structures and ski...

Authors: Yun Jiang, Jinkun Dong, Tongtong Cheng, Yuan Zhang, Xin Lin and Jing Liang

Citation: BioData Mining 2023 16:5

Content type: Research Published on: 21 February 2023
The Matthews correlation coefficient (MCC) should replace the ROC AUC as the standard metric for assessing binary classification

Binary classification is a common task for which machine learning and computational statistics are used, and the area under the receiver operating characteristic curve (ROC AUC) has become the common standard ...

Authors: Davide Chicco and Giuseppe Jurman

Citation: BioData Mining 2023 16:4

Content type: Methodology Published on: 17 February 2023
LoFTK: a framework for fully automated calculation of predicted Loss-of-Function variants and genes

Loss-of-Function (LoF) variants in human genes are important due to their impact on clinical phenotypes and frequent occurrence in the genomes of healthy individuals. The association of LoF variants with compl...

Authors: Abdulrahman Alasiri, Konrad J. Karczewski, Brian Cole, Bao-Li Loza, Jason H. Moore, Sander W. van der Laan, Folkert W. Asselbergs, Brendan J. Keating and Jessica van Setten

Citation: BioData Mining 2023 16:3

Content type: Software Published on: 2 February 2023
Detection of iron deficiency anemia by medical images: a comparative study of machine learning algorithms

Anemia is one of the global public health problems that affect children and pregnant women. Anemia occurs when the level of red blood cells within the body decreases or when the structure of the red blood cell...

Authors: Peter Appiahene, Justice Williams Asare, Emmanuel Timmy Donkoh, Giovanni Dimauro and Rosalia Maglietta

Citation: BioData Mining 2023 16:2

Content type: Research Published on: 24 January 2023
Bacteria spatial tracking in Urban Park soils with MALDI-TOF Mass Spectrometry and Specific PCR

Urban parks constitute one of the main leisure areas, especially for the most vulnerable people in our society, children, and the elderly. Contact with soils can pose a health risk. Microbiological testing is ...

Authors: Diego Arnal, Celeste Moya, Luigi Filippelli, Jaume Segura-Garcia and Sergi Maicas

Citation: BioData Mining 2023 16:1

Content type: Research Published on: 14 January 2023
Robust and rigorous identification of tissue-specific genes by statistically extending tau score

In this study, we aimed to identify tissue-specific genes for various human tissues/organs more robustly and rigorously by extending the tau score algorithm.

Authors: Hatice Büşra Lüleci and Alper Yılmaz

Citation: BioData Mining 2022 15:31

Content type: Methodology Published on: 9 December 2022
Classification of breast cancer recurrence based on imputed data: a simulation study

Several studies have been conducted to classify various real-life events, but few are in medical fields, particularly on breast cancer recurrence using statistical techniques. To our knowledge, there is no reported...

Authors: Rahibu A. Abassi and Amina S. Msengwa

Citation: BioData Mining 2022 15:30

Content type: Research Published on: 7 December 2022
Detecting diseases in medical prescriptions using data mining methods

Every year, the health of millions of people around the world is compromised by misdiagnosis, which sometimes could even lead to death. In addition, it entails huge financial costs for patients, insurance comp...

Authors: Sana Nazari Nezhad, Mohammad H. Zahedi and Elham Farahani

Citation: BioData Mining 2022 15:29

Content type: Research Published on: 24 November 2022
Towards a potential pan-cancer prognostic signature for gene expression based on probesets and ensemble machine learning

Cancer is one of the leading causes of death worldwide and can be caused by environmental aspects (for example, exposure to asbestos), by human behavior (such as smoking), or by genetic factors. To understand ...

Authors: Davide Chicco, Abbas Alameer, Sara Rahmati and Giuseppe Jurman

Citation: BioData Mining 2022 15:28

Content type: Research Published on: 3 November 2022
An unsupervised image segmentation algorithm for coronary angiography

Computer visual systems can rapidly obtain a large amount of data and automatically process them with ease. These characteristics constitute advantages for the application of such systems in the automatic anal...

Authors: Zong-Xian Yin and Hong-Ming Xu

Citation: BioData Mining 2022 15:27

Content type: Methodology Published on: 21 October 2022
Expanding a database-derived biomedical knowledge graph via multi-relation extraction from biomedical abstracts

Knowledge graphs support biomedical research efforts by providing contextual information for biomedical entities, constructing networks, and supporting the interpretation of high-throughput analyses. These dat...

Authors: David N. Nicholson, Daniel S. Himmelstein and Casey S. Greene

Citation: BioData Mining 2022 15:26

Content type: Research Published on: 18 October 2022
EZCancerTarget: an open-access drug repurposing and data-collection tool to enhance target validation and optimize international research efforts against highly progressive cancers

The expanding body of potential therapeutic targets requires easily accessible, structured, and transparent real-time interpretation of molecular data. Open-access genomic, proteomic and drug-repurposing datab...

Authors: David Dora, Timea Dora, Gabor Szegvari, Csongor Gerdán and Zoltan Lohinai

Citation: BioData Mining 2022 15:25

Content type: Software article Published on: 1 October 2022
Effective hybrid feature selection using different bootstrap enhances cancers classification performance

Machine learning can be used to predict the different onset of human cancers. Highly dimensional data have enormous, complicated problems. One of these is an excessive number of genes plus over-fitting, fittin...

Authors: Noura Mohammed Abdelwahed, Gh. S. El-Tawel and M. A. Makhlouf

Citation: BioData Mining 2022 15:24

Content type: Research Published on: 30 September 2022
Polygenic risk modeling of tumor stage and survival in bladder cancer

Bladder cancer assessment with non-invasive gene expression signatures facilitates the detection of patients at risk and surveillance of their status, bypassing the discomforts given by cystoscopy. To achieve ...

Authors: Mauro Nascimben, Lia Rimondini, Davide Corà and Manolo Venturin

Citation: BioData Mining 2022 15:23

Content type: Research Published on: 30 September 2022
A Gated Recurrent Unit based architecture for recognizing ontology concepts from biological literature

Annotating scientific literature with ontology concepts is a critical task in biology and several other domains for knowledge discovery. Ontology based annotations can power large-scale comparative analyses in...

Authors: Pratik Devkota, Somya D. Mohanty and Prashanti Manda

Citation: BioData Mining 2022 15:22

Content type: Research Published on: 28 September 2022
Interpretable recurrent neural network models for dynamic prediction of the extubation failure risk in patients with invasive mechanical ventilation in the intensive care unit

The clinical decision to extubate is a challenge in the treatment of patients with invasive mechanical ventilation (IMV), since existing extubation protocols are not capable of precisely predicting extubation fai...

Authors: Zhixuan Zeng, Xianming Tang, Yang Liu, Zhengkun He and Xun Gong

Citation: BioData Mining 2022 15:21

Content type: Research Published on: 27 September 2022
Machine Learning Algorithms for understanding the determinants of under-five Mortality

Under-five mortality is a matter of serious concern for child health as well as the social development of any country. The paper aimed to find the accuracy of machine learning models in predicting under-five m...

Authors: Rakesh Kumar Saroj, Pawan Kumar Yadav, Rajneesh Singh and Obvious.N. Chilyabanyama

Citation: BioData Mining 2022 15:20

Content type: Methodology Published on: 24 September 2022
ParticleChromo3D: a Particle Swarm Optimization algorithm for chromosome 3D structure prediction from Hi-C data

The three-dimensional (3D) structure of chromatin has a massive effect on its function. Because of this, it is desirable to have an understanding of the 3D structural organization of chromatin. To gain greater...

Authors: David Vadnais, Michael Middleton and Oluwatosin Oluwadare

Citation: BioData Mining 2022 15:19

Content type: Methodology Published on: 21 September 2022
Learning and visualizing chronic latent representations using electronic health records

Nowadays, patients with chronic diseases such as diabetes and hypertension have reached alarming numbers worldwide. These diseases increase the risk of developing acute complications and involve a substantial ...

Authors: David Chushig-Muzo, Cristina Soguero-Ruiz, Pablo de Miguel Bohoyo and Inmaculada Mora-Jiménez

Citation: BioData Mining 2022 15:18

Content type: Research Published on: 5 September 2022
Analysis of risk factors progression of preterm delivery using electronic health records

Preterm deliveries have many negative health implications on both mother and child. Identifying the population level factors that increase the risk of preterm deliveries is an important step in the direction o...

Authors: Zeineb Safi, Neethu Venugopal, Haytham Ali, Michel Makhlouf, Faisal Farooq and Sabri Boughorbel

Citation: BioData Mining 2022 15:17

Content type: Research Published on: 17 August 2022
Neural network methods for diagnosing patient conditions from cardiopulmonary exercise testing data

Cardiopulmonary exercise testing (CPET) provides a reliable and reproducible approach to measuring fitness in patients and diagnosing their health problems. However, the data from CPET consist of multiple time...

Authors: Donald E. Brown, Suchetha Sharma, James A. Jablonski and Arthur Weltman

Citation: BioData Mining 2022 15:16

Content type: Research Published on: 13 August 2022
Benchmarking AutoML frameworks for disease prediction using medical claims

Ascertain and compare the performances of Automated Machine Learning (AutoML) tools on large, highly imbalanced healthcare datasets.

Authors: Roland Albert A. Romero, Mariefel Nicole Y. Deypalan, Suchit Mehrotra, John Titus Jungao, Natalie E. Sheils, Elisabetta Manduchi and Jason H. Moore

Citation: BioData Mining 2022 15:15

Content type: Short report Published on: 26 July 2022
Novel digital approaches to the assessment of problematic opioid use

The opioid epidemic continues to contribute to loss of life through overdose and significant social and economic burdens. Many individuals who develop problematic opioid use (POU) do so after being exposed to ...

Authors: Philip J. Freda Jr, Henry R. Kranzler and Jason H. Moore

Citation: BioData Mining 2022 15:14

Content type: Review Published on: 15 July 2022
Single_cell_GRN: gene regulatory network identification based on supervised learning method and Single-cell RNA-seq data

Single-cell RNA-seq overcomes the shortcomings of conventional transcriptome sequencing technology and could provide a powerful tool for distinguishing the transcriptome characteristics of various cell types i...

Authors: Bin Yang, Wenzheng Bao, Baitong Chen and Dan Song

Citation: BioData Mining 2022 15:13

Content type: Methodology Published on: 11 June 2022
Colorectal cancer subtype identification from differential gene expression levels using minimalist deep learning

Cancer molecular subtyping plays a critical role in individualized patient treatment. In previous studies, high-throughput gene expression signature-based methods have been proposed to identify cancer subtypes...

Authors: Shaochuan Li, Yuning Yang, Xin Wang, Jun Li, Jun Yu, Xiangtao Li and Ka-Chun Wong

Citation: BioData Mining 2022 15:12

Content type: Research Published on: 23 April 2022
Correction: Confounding of linkage disequilibrium patterns in large scale DNA based gene-gene interaction studies

Authors: Marc Joiret, Jestinah M. Mahachie John, Elena S. Gusareva and Kristel Van Steen

Citation: BioData Mining 2022 15:11

Content type: Correction Published on: 11 April 2022

The original article was published in BioData Mining 2019 12:11
DIVIS: a semantic DIstance to improve the VISualisation of heterogeneous phenotypic datasets

Thanks to the wider spread of high-throughput experimental techniques, biologists are accumulating large amounts of datasets which often mix quantitative and qualitative variables and are not always complete, ...

Authors: Rayan Eid, Claudine Landès, Alix Pernet, Emmanuel Benoît, Pierre Santagostini, Angelina El Ghaziri and Julie Bourbeillon

Citation: BioData Mining 2022 15:10

Content type: Research Published on: 4 April 2022
A new challenge for data analytics: transposons

Authors: Ralf E. Wellinger and Jesús S. Aguilar–Ruiz

Citation: BioData Mining 2022 15:9

Content type: Editorial Published on: 25 March 2022
mSRFR: a machine learning model using microalgal signature features for ncRNA classification

This work presents mSRFR (microalgae SMOTE Random Forest Relief model), a classification tool for noncoding RNAs (ncRNAs) in microalgae, including green algae, diatoms, golden algae, and cyanobacteria. First, ...

Authors: Songtham Anuntakarun, Supatcha Lertampaiporn, Teeraphan Laomettachit, Warin Wattanapornprom and Marasri Ruengjitchatchawalya

Citation: BioData Mining 2022 15:8

Content type: Research Published on: 21 March 2022
Predicting molecular initiating events using chemical target annotations and gene expression

The advent of high-throughput transcriptomic screening technologies has resulted in a wealth of publicly available gene expression data associated with chemical treatments. From a regulatory perspective, data ...

Authors: Joseph L. Bundy, Richard Judson, Antony J. Williams, Chris Grulke, Imran Shah and Logan J. Everett

Citation: BioData Mining 2022 15:7

Content type: Research Published on: 4 March 2022
PredictPTB: an interpretable preterm birth prediction model using attention-based recurrent neural networks

Early identification of pregnant women at risk for preterm birth (PTB), a major cause of infant mortality and morbidity, has a significant potential to improve prenatal care. However, we lack effective predict...

Authors: Rawan AlSaad, Qutaibah Malluhi and Sabri Boughorbel

Citation: BioData Mining 2022 15:6

Content type: Research Published on: 14 February 2022
Influenza, dengue and common cold detection using LSTM with fully connected neural network and keywords selection

Symptom-based machine learning models for disease detection are a way to reduce the workload of doctors when they have too many patients. Currently, there are many research studies on machine learning or deep ...

Authors: Wanchaloem Nadda, Waraporn Boonchieng and Ekkarat Boonchieng

Citation: BioData Mining 2022 15:5

Content type: Research Published on: 14 February 2022
Gene-Interaction-Sensitive enrichment analysis in congenital heart disease

Gene set enrichment analysis (GSEA) uses gene-level univariate associations to identify gene set-phenotype associations for hypothesis generation and interpretation. We propose that GSEA can be adapted to inco...

Authors: Alexa A. Woodward, Deanne M. Taylor, Elizabeth Goldmuntz, Laura E. Mitchell, A.J. Agopian, Jason H. Moore and Ryan J. Urbanowicz

Citation: BioData Mining 2022 15:4

Content type: Methodology Published on: 12 February 2022
iSuc-ChiDT: a computational method for identifying succinylation sites using statistical difference table encoding and the chi-square decision table classifier

Lysine succinylation is a type of protein post-translational modification which is widely involved in cell differentiation, cell metabolism and other important physiological activities. To study the molecular ...

Authors: Ying Zeng, Yuan Chen and Zheming Yuan

Citation: BioData Mining 2022 15:3

Content type: Research Published on: 10 February 2022
Polymorphisms in the mTOR-PI3K-Akt pathway, energy balance-related exposures and colorectal cancer risk in the Netherlands Cohort Study

The mTOR-PI3K-Akt pathway influences cell metabolism and (malignant) cell growth. We generated sex-specific polygenic risk scores capturing natural variation in 7 out of 10 top-ranked genes in this pathway. We...

Authors: Colinda C.J.M. Simons, Leo J. Schouten, Roger W.L. Godschalk, Frederik-Jan van Schooten, Monika Stoll, Kristel Van Steen, Piet A. van den Brandt and Matty P. Weijenberg

Citation: BioData Mining 2022 15:2

Content type: Research Published on: 10 January 2022
Integrating pathway knowledge with deep neural networks to reduce the dimensionality in single-cell RNA-seq data

Single-cell RNA sequencing (scRNA-seq) data provide valuable insights into cellular heterogeneity which is significantly improving the current knowledge on biology and human disease. One of the main applicatio...

Authors: Pelin Gundogdu, Carlos Loucera, Inmaculada Alamo-Alvarez, Joaquin Dopazo and Isabel Nepomuceno

Citation: BioData Mining 2022 15:1

Content type: Methodology Published on: 3 January 2022
Machine learning approaches for the genomic prediction of rheumatoid arthritis and systemic lupus erythematosus

Rheumatoid arthritis (RA) and systemic lupus erythematosus (SLE) are autoimmune rheumatic diseases that share a complex genetic background and common clinical features. This study’s purpose was to construct mac...

Authors: Chih-Wei Chung, Tzu-Hung Hsiao, Chih-Jen Huang, Yen-Ju Chen, Hsin-Hua Chen, Ching-Heng Lin, Seng-Cho Chou, Tzer-Shyong Chen, Yu-Fang Chung, Hwai-I Yang and Yi-Ming Chen

Citation: BioData Mining 2021 14:52

Content type: Research Published on: 11 December 2021
Identification of natural selection in genomic data with deep convolutional neural network

With the increase in the size of genomic datasets describing variability in populations, extracting relevant information becomes increasingly useful as well as complex. Recently, computational methodologies su...

Authors: Arnaud Nguembang Fadja, Fabrizio Riguzzi, Giorgio Bertorelle and Emiliano Trucchi

Citation: BioData Mining 2021 14:51

Content type: Research Published on: 4 December 2021
LPI-EnEDT: an ensemble framework with extra tree and decision tree classifiers for imbalanced lncRNA-protein interaction data classification

Long noncoding RNAs (lncRNAs) have dense linkages with various biological processes. Identifying interacting lncRNA-protein pairs contributes to understanding the functions and mechanisms of lncRNAs. Wet experime...

Authors: Lihong Peng, Ruya Yuan, Ling Shen, Pengfei Gao and Liqian Zhou

Citation: BioData Mining 2021 14:50

Content type: Research Published on: 3 December 2021
Gaussian noise up-sampling is better suited than SMOTE and ADASYN for clinical decision making

Clinical data sets have very special properties and suffer from many caveats in machine learning. They typically show a high class imbalance, have a small number of samples and a large number of parameters, an...

Authors: Jacqueline Beinecke and Dominik Heider

Citation: BioData Mining 2021 14:49

Content type: Short report Published on: 29 November 2021
Development of glaucoma predictive model and risk factors assessment based on supervised models

To develop and to propose a machine learning model for predicting glaucoma and identifying its risk factors.

Authors: Mahyar Sharifi, Toktam Khatibi, Mohammad Hassan Emamian, Somayeh Sadat, Hassan Hashemi and Akbar Fotouhi

Citation: BioData Mining 2021 14:48

Content type: Research Published on: 24 November 2021
Correction to: iGlioSub: an integrative transcriptomic and epigenomic classifier for glioblastoma molecular subtypes

Authors: Miquel Ensenyat-Mendez, Sandra Íñiguez-Muñoz, Borja Sesé and Diego M. Marzese

Citation: BioData Mining 2021 14:47

Content type: Correction Published on: 17 November 2021

The original article was published in BioData Mining 2021 14:42
Prediction of synergistic drug combinations using PCA-initialized deep learning

Cancer is one of the main causes of death worldwide. Combination drug therapy has been a mainstay of cancer treatment for decades and has been shown to reduce host toxicity and prevent the development of acqui...

Authors: Jun Ma and Alison Motsinger-Reif

Citation: BioData Mining 2021 14:46

Content type: Research Published on: 20 October 2021
Humans and machines in biomedical knowledge curation: hypertrophic cardiomyopathy molecular mechanisms’ representation

Biomedical knowledge is dispersed in scientific literature and is growing constantly. Curation is the extraction of knowledge from unstructured data into a computable form and could be done manually or automat...

Authors: Mila Glavaški and Lazar Velicki

Citation: BioData Mining 2021 14:45

Content type: Research Published on: 2 October 2021
Evaluation of different approaches for missing data imputation on features associated to genomic data

Missing data is a common issue in different fields, such as electronics, image processing, medical records and genomics. They can limit or even bias the posterior analysis. The data collection process can lead...

Authors: Ben Omega Petrazzini, Hugo Naya, Fernando Lopez-Bello, Gustavo Vazquez and Lucía Spangenberg

Citation: BioData Mining 2021 14:44

Content type: Research Published on: 3 September 2021
Taxonomy-based data representation for data mining: an example of the magnitude of risk associated with H. pylori infection

The amount of available and potentially significant data describing study subjects is ever growing with the introduction and integration of different registries and data banks. The single specific attribute of...

Authors: Inese Polaka, Danute Razuka-Ebela, Jin Young Park and Marcis Leja

Citation: BioData Mining 2021 14:43

Content type: Research Published on: 28 August 2021
