Skip to main content

Articles

Page 8 of 10

  1. We develop a new concept that reflects how genes are connected based on microarray data using the coefficient of determination (the squared Pearson correlation coefficient). Our gene rank combines a priori kno...

    Authors: Eugene Demidenko
    Citation: BioData Mining 2015 8:2
  2. The discovery of breast cancer subtypes and subsequent development of treatments aimed at them has allowed for a great reduction in the mortality of breast cancer. But despite this progress, tumors with simila...

    Authors: Nima Pouladi, Richard Cowper-Sallari and Jason H Moore
    Citation: BioData Mining 2014 7:27
  3. Identifying genetic interactions in data obtained from genome-wide association studies (GWASs) can help in understanding the genetic basis of complex diseases. The large number of single nucleotide polymorphis...

    Authors: Charalampos S Floudas, Nara Um, M Ilyas Kamboh, Michael M Barmada and Shyam Visweswaran
    Citation: BioData Mining 2014 7:35
  4. Using a collection of different terminal nodesize constructed random forests, each generating a synthetic feature, a synthetic random forest is defined as a kind of hyperforest, calculated using the new input ...

    Authors: Hemant Ishwaran and James D Malley
    Citation: BioData Mining 2014 7:28
  5. Human genomic variations, including single nucleotide polymorphisms (SNPs) and copy number variations (CNVs), are associated with several phenotypic traits varying from mild features to hereditary diseases. Se...

    Authors: Kirsti Laurila, Reija Autio, Lingjia Kong, Elisa Närvä, Samer Hussein, Timo Otonkoski, Riitta Lahesmaa and Harri Lähdesmäki
    Citation: BioData Mining 2014 7:32
  6. A network motif is a recurring subnetwork within a network, and it takes on certain functions in practical biological macromolecule applications. Previous algorithms have focused on the computational efficienc...

    Authors: Qiang Zhang and Yuan Xu
    Citation: BioData Mining 2014 8:29
  7. PGxClean is a new web application that performs quality control analyses for data produced by the Affymetrix DMET chip or other candidate gene technologies. Importantly, the software does not assume that varia...

    Authors: Daniel Rotroff, John Jack, Nathan Campbell, Scott Clark and Alison A Motsinger-Reif
    Citation: BioData Mining 2014 7:24
  8. The major histocompatibility complex (MHC) is responsible for presenting antigens (epitopes) on the surface of antigen-presenting cells (APCs). When pathogen-derived epitopes are presented by MHC class II on a...

    Authors: Clemente Aguilar-Bonavides, Reinaldo Sanchez-Arias and Cristina Lanzas
    Citation: BioData Mining 2014 7:23
  9. The emergence of massive datasets in a clinical setting presents both challenges and opportunities in data storage and analysis. This so called “big data” challenges traditional analytic tools and will increas...

    Authors: Emad A Mohammed, Behrouz H Far and Christopher Naugler
    Citation: BioData Mining 2014 7:22
  10. Genetic understanding of complex traits has developed immensely over the past decade but remains hampered by incomplete descriptions of contribution to phenotypic variance. Gene-environment (GxE) interactions ...

    Authors: Laurence D Parnell, Britt A Blokker, Hassan S Dashti, Paula-Dene Nesbeth, Brittany Elle Cooper, Yiyi Ma, Yu-Chi Lee, Ruixue Hou, Chao-Qiang Lai, Kris Richardson and José M Ordovás
    Citation: BioData Mining 2014 7:21
  11. Effective cancer clinical outcome prediction for understanding of the mechanism of various types of cancer has been pursued using molecular-based data such as gene expression profiles, an approach that has pro...

    Authors: Dokyoon Kim, Ruowang Li, Scott M Dudek, Alex T Frase, Sarah A Pendergrass and Marylyn D Ritchie
    Citation: BioData Mining 2014 7:20
  12. Genetic contributions to major depressive disorder (MDD) are thought to result from multiple genes interacting with each other. Different procedures have been proposed to detect such interactions. Which approa...

    Authors: Magnus Lekman, Ola Hössjer, Peter Andrews, Henrik Källberg, Daniel Uvehag, Dennis Charney, Husseini Manji, John A Rush, Francis J McMahon, Jason H Moore and Ingrid Kockum
    Citation: BioData Mining 2014 7:19
  13. Reference datasets are often used to compare, interpret or validate experimental data and analytical methods. In the field of gene expression, several reference datasets have been published. Typically, they co...

    Authors: Philip Zimmermann, Stefan Bleuler, Oliver Laule, Florian Martin, Nikolai V Ivanov, Prisca Campanoni, Karen Oishi, Nicolas Lugon-Moulin, Markus Wyss, Tomas Hruz and Wilhelm Gruissem
    Citation: BioData Mining 2014 7:18
  14. Alzheimer’s disease is the most common form of progressive dementia and there is currently no known cure. The cause of onset is not fully understood but genetic factors are expected to play a significant role....

    Authors: Amanda L Zieselman, Jonathan M Fisher, Ting Hu, Peter C Andrews, Casey S Greene, Li Shen, Andrew J Saykin and Jason H Moore
    Citation: BioData Mining 2014 7:17
  15. Next generation sequencing technologies are powerful new tools for investigating a wide range of biological and medical questions. Statistical and computational methods are key to analyzing massive and complex...

    Authors: Yan Zhou, Nan Lin and Baoxue Zhang
    Citation: BioData Mining 2014 7:15

    The Erratum to this article has been published in BioData Mining 2014 7:30

  16. The Consortium for Neuropsychiatric Phenomics (CNP) at UCLA was an investigation into the biological bases of traits such as memory and response inhibition phenotypes—to explore whether they are linked to synd...

    Authors: Douglass Stott Parker, Eliza Congdon and Robert M Bilder
    Citation: BioData Mining 2014 7:11
  17. In omic research, such as genome wide association studies, researchers seek to repeat their results in other datasets to reduce false positive findings and thus provide evidence for the existence of true assoc...

    Authors: Timothy H Ciesielski, Sarah A Pendergrass, Marquitta J White, Nuri Kodaman, Rafal S Sobota, Minjun Huang, Jacquelaine Bartlett, Jing Li, Qinxin Pan, Jiang Gui, Scott B Selleck, Christopher I Amos, Marylyn D Ritchie, Jason H Moore and Scott M Williams
    Citation: BioData Mining 2014 7:10
  18. Permutation testing is a robust and popular approach for significance testing in genomic research, which has the broad advantage of estimating significance non-parametrically, thereby safe guarding against inf...

    Authors: Ronglin Che, John R Jack, Alison A Motsinger-Reif and Chad C Brown
    Citation: BioData Mining 2014 7:9
  19. The statistical genetics phenomenon of epistasis is widely acknowledged to confound disease etiology. In order to evaluate strategies for detecting these complex multi-locus disease associations, simulation st...

    Authors: Ryan J Urbanowicz, Ambrose LS Granizo-Mackenzie, Jeff Kiralis and Jason H Moore
    Citation: BioData Mining 2014 7:8
  20. In silco Biology is increasingly important and is often based on public data. While the problem of contamination is well recognised in microbiology labs the corresponding problem of database corruption has recei...

    Authors: William B Langdon
    Citation: BioData Mining 2014 7:3
  21. Mitochondria play a critical role in the cell and have DNA independent of the nuclear genome. There is much evidence that mitochondrial DNA (mtDNA) variation plays a role in human health and disease, however, ...

    Authors: Sabrina L Mitchell, Jacob B Hall, Robert J Goodloe, Jonathan Boston, Eric Farber-Eger, Sarah A Pendergrass, William S Bush and Dana C Crawford
    Citation: BioData Mining 2014 7:6
  22. Several different genetic and environmental factors have been identified as independent risk factors for bladder cancer in population-based studies. Recent studies have turned to understanding the role of gene...

    Authors: Ting Hu, Qinxin Pan, Angeline S Andrew, Jillian M Langer, Michael D Cole, Craig R Tomlinson, Margaret R Karagas and Jason H Moore
    Citation: BioData Mining 2014 7:5
  23. Logistic regression has been the de facto, and often the only, model used in the description and analysis of relationships between a binary outcome and observed features. It is widely used to obtain the condition...

    Authors: Abhijit Dasgupta, Silke Szymczak, Jason H Moore, Joan E Bailey-Wilson and James D Malley
    Citation: BioData Mining 2014 7:2
  24. Networks are commonly used to represent and analyze large and complex systems of interacting elements. In systems biology, human disease networks show interactions between disorders sharing common genetic back...

    Authors: Christian Darabos, Marquitta J White, Britney E Graham, Derek N Leung, Scott M Williams and Jason H Moore
    Citation: BioData Mining 2014 7:1
  25. The ever-growing wealth of biological information available through multiple comprehensive database repositories can be leveraged for advanced analysis of data. We have now extensively revised and updated the ...

    Authors: Sarah A Pendergrass, Alex Frase, John Wallace, Daniel Wolfe, Neerja Katiyar, Carrie Moore and Marylyn D Ritchie
    Citation: BioData Mining 2013 6:25
  26. Personal genome analysis is now being considered for evaluation of disease risk in healthy individuals, utilizing both rare and common variants. Multiple scores have been developed to predict the deleteriousne...

    Authors: Thanawadee Preeprem and Greg Gibson
    Citation: BioData Mining 2013 6:24
  27. Gene expression profiles have been broadly used in cancer research as a diagnostic or prognostic signature for the clinical outcome prediction such as stage, grade, metastatic status, recurrence, and patient s...

    Authors: Dokyoon Kim, Ruowang Li, Scott M Dudek and Marylyn D Ritchie
    Citation: BioData Mining 2013 6:23
  28. Influx of newly determined crystal structures into primary structural databases is increasing at a rapid pace. This leads to updation of primary and their dependent secondary databases which makes large scale ...

    Authors: Eshita Mutt, Sudha Sane Rani and Ramanathan Sowdhamini
    Citation: BioData Mining 2013 6:20
  29. Currently there are definitions from many agencies and research societies defining “bioinformatics” as deriving knowledge from computational analysis of large volumes of biological and biomedical data. Should thi...

    Authors: Xiuzhen Huang, Barry Bruce, Alison Buchan, Clare Bates Congdon, Carole L Cramer, Steven F Jennings, Hongmei Jiang, Zenglu Li, Gail McClure, Rick McMullen, Jason H Moore, Bindu Nanduri, Joan Peckham, Andy Perkins, Shawn W Polson, Bhanu Rekepalli…
    Citation: BioData Mining 2013 6:19
  30. With the abundance of information and analysis results being collected for genetic loci, user-friendly and flexible data visualization approaches can inform and improve the analysis and dissemination of these ...

    Authors: Daniel Wolfe, Scott Dudek, Marylyn D Ritchie and Sarah A Pendergrass
    Citation: BioData Mining 2013 6:18
  31. Over-sampling methods based on Synthetic Minority Over-sampling Technique (SMOTE) have been proposed for classification problems of imbalanced biomedical data. However, the existing over-sampling methods achie...

    Authors: Munehiro Nakamura, Yusuke Kajiwara, Atsushi Otsuka and Haruhiko Kimura
    Citation: BioData Mining 2013 6:16
  32. Elucidating the content of a DNA sequence is critical to deeper understand and decode the genetic information for any biological system. As next generation sequencing (NGS) techniques have become cheaper and m...

    Authors: Georgios A Pavlopoulos, Anastasis Oulas, Ernesto Iacucci, Alejandro Sifrim, Yves Moreau, Reinhard Schneider, Jan Aerts and Ioannis Iliopoulos
    Citation: BioData Mining 2013 6:13
  33. Public domain databases nowadays provide multiple layers of genome-wide data e.g., promoter methylation, mRNA expression, and miRNA expression and should enable integrative modeling of the mechanisms of regula...

    Authors: Y-h Taguchi
    Citation: BioData Mining 2013 6:11

Annual Journal Metrics

  • Citation Impact 2023
    Journal Impact Factor: 4.0
    5-year Journal Impact Factor: 3.7
    Source Normalized Impact per Paper (SNIP): 1.413
    SCImago Journal Rank (SJR): 0.958

    Speed 2023
    Submission to first editorial decision (median days): 15
    Submission to acceptance (median days): 171

    Usage 2023
    Downloads: 400,374
    Altmetric mentions: 146