HOME  |  GenomeBank  |  Search  |  Views  |  Datasets  |  Submit your Dataset |  About GenomeMine  | 
 
XML RSS TEXT display as grid printer version show data provenance

1
Title Calculated Composition data for Bacteria : data for all protein genes
Date Last Updated26-06-2005
Curator(s)
Haruo Suzuki hsuzuki@uidaho.edu
Department of Biological Sciences, University of Idaho http://www.sci.uidaho.edu/biosci/labl/top/
Links


DescriptionComposition data calculated by Dr Haruo Suzuki
HTML view?action=composition_all
download?action=composition_all&format=text
variables?action=dataset_variables&table_name=composition_all


2
Title Calculated Composition data for Bacteria : data for ribosomal protein genes
Date Last Updated26-06-2005
Curator(s)
Haruo Suzuki hsuzuki@uidaho.edu
Department of Biological Sciences, University of Idaho http://www.sci.uidaho.edu/biosci/labl/top/
Links
DescriptionComposition data calculated by Dr Haruo Suzuki
HTML view?action=composition_rp
download?action=composition_rp&format=text
variables?action=dataset_variables&table_name=composition_rp


3
Title Calculated Composition data for Bacteria : data for whole genome, RNA genes, and spacers
Date Last Updated26-06-2005
Curator(s)
Haruo Suzuki hsuzuki@uidaho.edu
Department of Biological Sciences, University of Idaho http://www.sci.uidaho.edu/biosci/labl/top/
Links
DescriptionComposition data calculated by Dr Haruo Suzuki
HTML view?action=composition_genome
download?action=composition_genome&format=text
variables?action=dataset_variables&table_name=composition_genome


4
TitleComposition Data for Bacteria
Date Last Updated20-03-2005
Curator(s)
Haruo Suzuki hsuzuki@uidaho.edu
Department of Biological Sciences, University of Idaho http://www.sci.uidaho.edu/biosci/labl/top/
LinksG-language
http://www.g-language.org/

DescriptionComposition data calculated by Dr Haruo Suzuki including: base usage indices (ryr gac gcc gtc gcs tas Hbu); amino acid usage indices (Haau hydropathy aromaticity); codon usage indices (Hcu Dcu Dsyn Hs Hw Ew enc cbi icdi). Analyses are implemented in a G-language GAE and distributed with the software package at http://www.g-language.org/.
HTML view?action=composition
download?action=composition&format=text
variables?action=dataset_variables&table_name=composition


5
TitleCurated ecology for bacteria
Date Last Updated12-12-2004
Curator(s)
Dawn Field dfield@ceh.ac.uk
Centre for Ecology & Hydrology

Jennifer Hughes Jennifer_B_Hughes@brown.edu
Brown University
LinksGenomeMine
http://www.genomics.ceh.ac.uk/GMINE

Definition of Variables
http://www.genomics.ceh.ac.uk/gmine/genomebankbacterialinfo.html

DescriptionCurated genomic features and ecological information for bacterial genomes
HTML view?action=BacDec2004
download?action=BacDec2004&format=text
variables?action=dataset_variables&table_name=BacDec2004


6
TitleCurated information for eukaryotic genomes
Date Last Updated01-04-2005
Curator(s)
Dawn Field dfield@ceh.ac.uk
Centre for Ecology & Hydrology

Jennifer Hughes Jennifer_B_Hughes@brown.edu
Brown University
LinksGenomeMine
http://www.genomics.ceh.ac.uk/GMINE

DescriptionInformation curated from the primary genome reports of eukaryotic genomes.
HTML view?action=EukTax
download?action=EukTax&format=text
variables?action=dataset_variables&table_name=EukTax


7
TitleData from the NCBI Genome Project Database for Bacteria
Date Last Updated06-05-2005
Curator(s)
Dawn Field dfield@ceh.ac.uk
Centre for Ecology & Hydrology

Tanya Gray tgra@ceh.ac.uk
Centre for Ecology & Hydrology
LinksNCBI Genome Project
http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=genomeprj

DescriptionContains data retrieved from the NCBI GENOMEPRJ database.
HTML view?action=genomeprj_bacs
download?action=genomeprj_bacs&format=text
variables?action=dataset_variables&table_name=genomeprj_bacs


8
TitleData from the NCBI Genome Project Database for Eukaryotes
Date Last Updated06-05-2005
Curator(s)
Dawn Field dfield@ceh.ac.uk
Centre for Ecology & Hydrology

Tanya Gray tgra@ceh.ac.uk
Centre for Ecology & Hydrology
LinksNCBI Genome Project
http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=genomeprj

DescriptionContains data retrieved from the NCBI GENOMEPRJ database.
HTML view?action=genomeprj_euks
download?action=genomeprj_euks&format=text
variables?action=dataset_variables&table_name=genomeprj_euks


9
TitleData from the NCBI Genome Project Database for Plasmids and Organelles
Date Last Updated06-05-2005
Curator(s)
Dawn Field dfield@ceh.ac.uk
Centre for Ecology & Hydrology

Tanya Gray tgra@ceh.ac.uk
Centre for Ecology & Hydrology
LinksNCBI Genome Project
http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=genomeprj

DescriptionContains data retrieved from NCBI GENOMEPRJ database.
HTML view?action=genomeprj_other
download?action=genomeprj_other&format=text
variables?action=dataset_variables&table_name=genomeprj_other


10
TitleGenomic Features (calculated data for bacteria, organelles, plasmids, viruses and viroids)
Date Last Updated28-02-2006
Curator(s)
Tanya Gray tgra@ceh.ac.uk
Centre for Ecology & Hydrology
LinksGenomeMine
http://www.genomics.ceh.ac.uk/GMINE

DescriptionFeatures extracted and calculated from Genbank genome annotations
HTML view?action=genomic_features
download?action=genomic_features&format=text
variables?action=dataset_variables&table_name=genomic_features


11
TitleGOLD Genomes OnLine Database
Date Last Updated27-02-2006
Curator(s)
Nikos Kyrpides NCKyrpides@lbl.gov
DOE Joint Genome Institute
LinksGOLD
http://www.genomesonline.org

DescriptionInformation extracted from the table of completed genome projects for bacteria and eukaryotes curated by Nikos Krypides from his GOLD database. Post data retrieval edits by Tanya Gray (tgra@ceh.ac.uk) :- Genome size for Candida glabrata CBS138 retrieved from NCBI Genome Project database. Genome size for Mus musculus, Homo sapiens, Danio rerio (Zebrafish), and Rattus norvegicus (Rat) retrieved from Ensembl database (ensembl.org). NC number changed for record Gc00264 (Pseudomonas syringae syringae B728a) from NC_007004 (which no longer exists in NCBI genbank) to NC_007005.
HTML view?action=gold_genomes
download?action=gold_genomes&format=text
variables?action=dataset_variables&table_name=gold_genomes


12
TitleMicrosatellites in Microbial Genomes
Date Last Updated01-04-2005
Curator(s)
Milo Thurston mith@ceh.ac.uk
Centre for Ecology & Hydrology
LinksMsatfinder
http://www.genomics.ceh.ac.uk/msatfinder

DescriptionThe total number of mono- to hexanucleotide microsatellites longer than 15, 8, 8, 8, 8, 8 repeat units for bacteria, viruses, plasmids, and organellar genomes.
HTML view?action=microsat
download?action=microsat&format=text
variables?action=dataset_variables&table_name=microsat


13
TitleOrphan Gene Frequencies in Bacterial Genomes
Date Last Updated20-05-2005
Curator(s)
Gareth Wilson gawi@ceh.ac.uk
Centre for Ecology & Hydrology
LinksOrphanMine
http://www.genomics.ceh.ac.uk/orphan_mine/orphan_home.php

DescriptionDataset 1 --------- 150 complete bacterial genomes comprising 150 bacterial species/strains were used for this data analysis. Genome files (.faa) format were obtained from the NCBI and were used to generate a SELF BLAST database. Every predicted protein from every genome was blasted against every predicted protein from every genome, using the parameters displayed below: blastall -p blastp -d SELF_blast_database -e 1e-3 -b 500 -f 9 -F 'mS' -M BLOSUM45 Predicted proteins that did not find a statistically significant match (using an e-v alue threshold of 10-3) were designated as orphans. This process was performed by the suite of perl scripts known as QuickMine. Dataset 2 --------- Same as Dataset 1 except predicted proteins that were < 150 amino acids in length or contained regions of low complexity were removed. Dataset 3 --------- Same as Dataset 1 except 122 complete bacterial genomes comprising 122 bacterial species were used for this data analysis. Unlike Dataset 1 & 2 the orphans in this analysis are species specific. The first genome to be sequenced from each species was used to represent the species. Dataset 4 --------- Same as Dataset 3 except predicted proteins that were < 150 amino acids in length or contained regions of low complexity were removed.
HTML view?action=bacteria_orphan_genes
download?action=bacteria_orphan_genes&format=text
variables?action=dataset_variables&table_name=bacteria_orphan_genes


14
TitleSwytch - prokaryotic contingency loci with microsatellites
Date Last Updated15-11-2005
Curator(s)
Paul Swift pswi@ceh.ac.uk
Centre for Ecology & Hydrology
LinksSwytch
http://www.genomics.ceh.ac.uk/lab/swytch.php

DescriptionEach bacterial genome is categorised according to whether it has known (yes) or putative (putative) contingency loci containing microsatellites. Evidence is taken from the primary literature and if the associated reference is a primary genome report it is marked 'genome'. This minimal list of references is far from comprehesive. To date, we have not found evidence in the literature support the presence of such loci in the genomes marked as lacking such loci ('no') although these could exist.
HTML view?action=swytch
download?action=swytch&format=text
variables?action=dataset_variables&table_name=swytch


15
TitleTaxonomy data for bacteria
Date Last Updated01-10-2004
Curator(s)
Dawn Field dfield@ceh.ac.uk
Centre for Ecology & Hydrology
LinksNCBI Taxonomy
http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Taxonomy

DescriptionCurated taxonomy based on NCBI taxonomy.
HTML view?action=bactax
download?action=bactax&format=text
variables?action=dataset_variables&table_name=bactax


16
TitleTaxonomy data for organelles
Date Last Updated01-10-2004
Curator(s)
Milo Thurston mith@ceh.ac.uk
Centre for Ecology & Hydrology
LinksNCBI Taxonomy
http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Taxonomy

DescriptionCurated by Milo Thurston. Curated taxonomy based on NCBI taxonomy.
HTML view?action=orgtax
download?action=orgtax&format=text
variables?action=dataset_variables&table_name=orgtax


17
TitleTaxonomy data for plasmids
Date Last Updated01-10-2004
Curator(s)
Adrian Tett adet@ceh.ac.uk
Centre for Ecology & Hydrology
LinksNCBI Taxonomy
http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Taxonomy

DescriptionCurated by Adrian Tett. Curated taxonomy based on NCBI taxonomy.
HTML view?action=plasmidtax
download?action=plasmidtax&format=text
variables?action=dataset_variables&table_name=plasmidtax


18
TitleTaxonomy data for viruses, phages, and viroids
Date Last Updated01-10-2004
Curator(s)
Milo Thurston mith@ceh.ac.uk
Centre for Ecology & Hydrology
LinksNCBI Taxonomy
http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Taxonomy

ICTV
http://www.ncbi.nlm.nih.gov/ICTVdb/

DescriptionData retrieved from NCBI Genbank and ICTV. Additional curation by Milo Thurston.
HTML view?action=virtax
download?action=virtax&format=text
variables?action=dataset_variables&table_name=virtax