A citation-based method for searching scientific literature

Nuala A O'Leary, Mathew W Wright, J Rodney Brister, Stacy Ciufo, Diana Haddad, Rich McVeigh, Bhanu Rajput, Barbara Robbertse, Brian Smith-White, Danso Ako-Adjei, Alexander Astashyn, Azat Badretdin, Yiming Bao, Olga Blinkova, Vyacheslav Brover, Vyacheslav Chetvernin, Jinna Choi, Eric Cox, Olga Ermolaeva, Catherine M Farrell, Tamara Goldfarb, Tripti Gupta, Daniel Haft, Eneida Hatcher, Wratko Hlavina, Vinita S Joardar, Vamsi K Kodali, Wenjun Li, Donna Maglott, Patrick Masterson, Kelly M McGarvey, Michael R Murphy, Kathleen O'Neill, Shashikant Pujar, Sanjida H Rangwala, Daniel Rausch, Lillian D Riddick, Conrad Schoch, Andrei Shkeda, Susan S Storz, Hanzhen Sun, Francoise Thibaud-Nissen, Igor Tolstoy, Raymond E Tully, Anjana R Vatsan, Craig Wallin, David Webb, Wendy Wu, Melissa J Landrum, Avi Kimchi, Tatiana Tatusova, Michael DiCuccio, Paul Kitts, Terence D Murphy, Kim D Pruitt. Nucleic Acids Res 2016
Times Cited: 1859







List of co-cited articles
302 articles co-cited >1



Times Cited | Co-cited | Similarity


Basic local alignment search tool.
S F Altschul, W Gish, W Miller, E W Myers, D J Lipman. J Mol Biol 1990
14

BLAST+: architecture and applications.
Christiam Camacho, George Coulouris, Vahram Avagyan, Ning Ma, Jason Papadopoulos, Kevin Bealer, Thomas L Madden. BMC Bioinformatics 2009
11

The Pfam protein families database in 2019.
Sara El-Gebali, Jaina Mistry, Alex Bateman, Sean R Eddy, Aurélien Luciani, Simon C Potter, Matloob Qureshi, Lorna J Richardson, Gustavo A Salazar, Alfredo Smart,[...]. Nucleic Acids Res 2019
10

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
S F Altschul, T L Madden, A A Schäffer, J Zhang, Z Zhang, W Miller, D J Lipman. Nucleic Acids Res 1997
8


Trimmomatic: a flexible trimmer for Illumina sequence data.
Anthony M Bolger, Marc Lohse, Bjoern Usadel. Bioinformatics 2014
8

The EMBL-EBI search and sequence analysis tools APIs in 2019.
Fábio Madeira, Young Mi Park, Joon Lee, Nicola Buso, Tamer Gur, Nandana Madhusoodanan, Prasad Basutkar, Adrian R N Tivey, Simon C Potter, Robert D Finn,[...]. Nucleic Acids Res 2019
7

Ensembl 2018.
Daniel R Zerbino, Premanand Achuthan, Wasiu Akanni, M Ridwan Amode, Daniel Barrell, Jyothish Bhai, Konstantinos Billis, Carla Cummins, Astrid Gall, Carlos García Girón,[...]. Nucleic Acids Res 2018
6


Accelerated Profile HMM Searches.
Sean R Eddy. PLoS Comput Biol 2011
6


New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0.
Stéphane Guindon, Jean-François Dufayard, Vincent Lefort, Maria Anisimova, Wim Hordijk, Olivier Gascuel. Syst Biol 2010
6

MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.
Kazutaka Katoh, Kazuharu Misawa, Kei-ichi Kuma, Takashi Miyata. Nucleic Acids Res 2002
6


InterProScan 5: genome-scale protein function classification.
Philip Jones, David Binns, Hsin-Yu Chang, Matthew Fraser, Weizhong Li, Craig McAnulla, Hamish McWilliam, John Maslen, Alex Mitchell, Gift Nuka,[...]. Bioinformatics 2014
6

Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.
Sergey Koren, Brian P Walenz, Konstantin Berlin, Jason R Miller, Nicholas H Bergman, Adam M Phillippy. Genome Res 2017
6

The Sequence Alignment/Map format and SAMtools.
Heng Li, Bob Handsaker, Alec Wysoker, Tim Fennell, Jue Ruan, Nils Homer, Gabor Marth, Goncalo Abecasis, Richard Durbin. Bioinformatics 2009
5

BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs.
Felipe A Simão, Robert M Waterhouse, Panagiotis Ioannidis, Evgenia V Kriventseva, Evgeny M Zdobnov. Bioinformatics 2015
5

Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement.
Bruce J Walker, Thomas Abeel, Terrance Shea, Margaret Priest, Amr Abouelliel, Sharadha Sakthikumar, Christina A Cuomo, Qiandong Zeng, Jennifer Wortman, Sarah K Young,[...]. PLoS One 2014
5

Fast gapped-read alignment with Bowtie 2.
Ben Langmead, Steven L Salzberg. Nat Methods 2012
5

Cytoscape: a software environment for integrated models of biomolecular interaction networks.
Paul Shannon, Andrew Markiel, Owen Ozier, Nitin S Baliga, Jonathan T Wang, Daniel Ramage, Nada Amin, Benno Schwikowski, Trey Ideker. Genome Res 2003
5

trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses.
Salvador Capella-Gutiérrez, José M Silla-Martínez, Toni Gabaldón. Bioinformatics 2009
5

CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes.
Donovan H Parks, Michael Imelfort, Connor T Skennerton, Philip Hugenholtz, Gene W Tyson. Genome Res 2015
5


IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies.
Lam-Tung Nguyen, Heiko A Schmidt, Arndt von Haeseler, Bui Quang Minh. Mol Biol Evol 2015
5

Prodigal: prokaryotic gene recognition and translation initiation site identification.
Doug Hyatt, Gwo-Liang Chen, Philip F Locascio, Miriam L Land, Frank W Larimer, Loren J Hauser. BMC Bioinformatics 2010
5

antiSMASH 5.0: updates to the secondary metabolite genome mining pipeline.
Kai Blin, Simon Shaw, Katharina Steinke, Rasmus Villebro, Nadine Ziemert, Sang Yup Lee, Marnix H Medema, Tilmann Weber. Nucleic Acids Res 2019
5

Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2.
Michael I Love, Wolfgang Huber, Simon Anders. Genome Biol 2014
5


clusterProfiler: an R package for comparing biological themes among gene clusters.
Guangchuang Yu, Li-Gen Wang, Yanyan Han, Qing-Yu He. OMICS 2012
5


GENCODE reference annotation for the human and mouse genomes.
Adam Frankish, Mark Diekhans, Anne-Maud Ferreira, Rory Johnson, Irwin Jungreis, Jane Loveland, Jonathan M Mudge, Cristina Sisu, James Wright, Joel Armstrong,[...]. Nucleic Acids Res 2019
738
4

The human genome browser at UCSC.
W James Kent, Charles W Sugnet, Terrence S Furey, Krishna M Roskin, Tom H Pringle, Alan M Zahler, David Haussler. Genome Res 2002
4

KEGG: new perspectives on genomes, pathways, diseases and drugs.
Minoru Kanehisa, Miho Furumichi, Mao Tanabe, Yoko Sato, Kanae Morishima. Nucleic Acids Res 2017
4

The Protein Data Bank.
H M Berman, J Westbrook, Z Feng, G Gilliland, T N Bhat, H Weissig, I N Shindyalov, P E Bourne. Nucleic Acids Res 2000
4

dbSNP: the NCBI database of genetic variation.
S T Sherry, M H Ward, M Kholodov, J Baker, L Phan, E M Smigielski, K Sirotkin. Nucleic Acids Res 2001
4

Ensembl 2020.
Andrew D Yates, Premanand Achuthan, Wasiu Akanni, James Allen, Jamie Allen, Jorge Alvarez-Jarreta, M Ridwan Amode, Irina M Armean, Andrey G Azov, Ruth Bennett,[...]. Nucleic Acids Res 2020
462
4

A global reference for human genetic variation.
Adam Auton, Lisa D Brooks, Richard M Durbin, Erik P Garrison, Hyun Min Kang, Jan O Korbel, Jonathan L Marchini, Shane McCarthy, Gil A McVean, Gonçalo R Abecasis. Nature 2015
4

An efficient algorithm for large-scale detection of protein families.
A J Enright, S Van Dongen, C A Ouzounis. Nucleic Acids Res 2002
4

Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega.
Fabian Sievers, Andreas Wilm, David Dineen, Toby J Gibson, Kevin Karplus, Weizhong Li, Rodrigo Lopez, Hamish McWilliam, Michael Remmert, Johannes Söding,[...]. Mol Syst Biol 2011
4

GenBank.
Eric W Sayers, Mark Cavanaugh, Karen Clark, James Ostell, Kim D Pruitt, Ilene Karsch-Mizrachi. Nucleic Acids Res 2019
134
4

STAR: ultrafast universal RNA-seq aligner.
Alexander Dobin, Carrie A Davis, Felix Schlesinger, Jorg Drenkow, Chris Zaleski, Sonali Jha, Philippe Batut, Mark Chaisson, Thomas R Gingeras. Bioinformatics 2013
4

fastp: an ultra-fast all-in-one FASTQ preprocessor.
Shifu Chen, Yanqing Zhou, Yaru Chen, Jia Gu. Bioinformatics 2018
4

SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.
Anton Bankevich, Sergey Nurk, Dmitry Antipov, Alexey A Gurevich, Mikhail Dvorkin, Alexander S Kulikov, Valery M Lesin, Sergey I Nikolenko, Son Pham, Andrey D Prjibelski,[...]. J Comput Biol 2012
4


NCBI prokaryotic genome annotation pipeline.
Tatiana Tatusova, Michael DiCuccio, Azat Badretdin, Vyacheslav Chetvernin, Eric P Nawrocki, Leonid Zaslavsky, Alexandre Lomsadze, Kim D Pruitt, Mark Borodovsky, James Ostell. Nucleic Acids Res 2016
4

Fast and sensitive protein alignment using DIAMOND.
Benjamin Buchfink, Chao Xie, Daniel H Huson. Nat Methods 2015
4

COSMIC: the Catalogue Of Somatic Mutations In Cancer.
John G Tate, Sally Bamford, Harry C Jubb, Zbyslaw Sondka, David M Beare, Nidhi Bindal, Harry Boutselakis, Charlotte G Cole, Celestino Creatore, Elisabeth Dawson,[...]. Nucleic Acids Res 2019
4

ClinVar: improving access to variant interpretations and supporting evidence.
Melissa J Landrum, Jennifer M Lee, Mark Benson, Garth R Brown, Chen Chao, Shanmuga Chitipiralla, Baoshan Gu, Jennifer Hart, Douglas Hoffman, Wonhee Jang,[...]. Nucleic Acids Res 2018
887
4

TopHat: discovering splice junctions with RNA-Seq.
Cole Trapnell, Lior Pachter, Steven L Salzberg. Bioinformatics 2009
3


Co-cited is the co-citation frequency: the number of articles that cite the listed article together with the query article. Similarity is the co-citation frequency expressed as a percentage of the times-cited count of either the query article or the listed article, whichever is lower. For articles cited more than 100 times, these numbers are calculated from the last 100 citations only.
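
As a rough illustration of the similarity definition above, the following Python sketch computes the percentage from three counts. The function name, parameter names, and the way the 100-citation limit is applied (capping each times-cited count at 100 before taking the lower one) are assumptions made for this example; the site does not publish its implementation.

```python
def similarity(co_cited: int, query_times_cited: int,
               result_times_cited: int, window: int = 100) -> float:
    """Co-citation frequency as a percentage of the lower times-cited count.

    Hypothetical helper: the name and signature are illustrative only.
    """
    # Assumption: "calculated for the last 100 citations" is modelled by
    # capping each times-cited count at `window`.
    query_count = min(query_times_cited, window)
    result_count = min(result_times_cited, window)

    # The definition uses whichever of the two counts is lower.
    denominator = min(query_count, result_count)
    if denominator <= 0:
        raise ValueError("both articles must have at least one citation")

    return 100.0 * co_cited / denominator


if __name__ == "__main__":
    # Query article cited 1859 times, a listed article cited 738 times,
    # and an assumed co-citation frequency of 4 within the window.
    print(similarity(4, 1859, 738))
    # A listed article cited only 50 times: the lower count is 50.
    print(similarity(3, 1859, 50))
```

Under this reading, when both articles have more than 100 citations the denominator is always 100, so the similarity percentage equals the co-citation frequency counted over the last 100 citations.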