Decade of GeneCards Symposium
New Search (GeneCards Home)  |  GeneCards Guide  |    User Feedback WIN AN
 iPod!!!
 |  Terms of Use  |  Notice about third-party sites
  This service is provided free to academic non-profit institutions. ALL other users require a Commercial License from XenneX, Inc.
What's New
GeneCards Guide
  Getting Started
  About GeneCards
  Data Sources
  Citing This Resource
  Publications

Mirror sites

Weizmann Institute of
Science

Crown Human Genome
Center

Bioinformatics Unit

Jobs


Output File Format


This page provides information about the format of the output file supplied by the GeneALaCart Batch Queries engine of GeneCards.

General Comments

  • The development of the engine is still in progress. Therefore, some of the features are disabled and will be supplied in future versions.
  • Current input consists of gene cards symbols, gene cards symbols/aliases or an assortment of symbols and other identifiers (UniProt, Ensembl, Entrez Gene, Hgnc, Aliases and/or GeneCards IDs). Symbols should be separated by white space separators only (space, tab or carriage return).
  • You can enter your genes by pasting them or uploading a file. It is possible to choose the name for your output file.
  • If a gene was not found in our database, the comment "NOT FOUND" will appear in the Gene Name field in the output file.
  • By default, if your input contains more than one identifier for the same gene, it will appear only once in the output, with the redundancy noted. To preserve the order of entries (including unmatched) in the result list, the 'preserve original input file order' output file option (at the top of the page) should be selected.
  • The symbol checkbox cannot be unchecked.
  • At least one field, aside from the symbol, must be selected.
  • Fields that do not contain information for this gene are left blank. In the case of the chromosome or the strand fields "NA" stands for 'no information is available'.

    Output format and separators:

    • Fields are separated by tabs
    • List Items are separated by pipes (|)
    • Information on items is separated from the items by the sharp symbol (#)
    • Nested list items are separated by double pipes (||)

      Example of a complex field output:
      Allele1: MGI_id1#Phenotypes: |Allele2: MGI_id2#Phenotypes: phenotype_id1#phenotype_name||phenotype_id2#phenotype_name...

    The fields that are currently available to be downloaded are the following (other fields will be available in future versions):

    GeneCards sectionQueried fieldsSources
    Gene Symbol HGNC, NCBI, Ensembl
    GeneCards IDGeneCards IDGeneCards, GeneLoc
    CategoryCategoryEntrez Gene, Ensembl, GeneCards
    Gene Description HGNC, Entrez Gene
    ApprovalApproved, Not approvedHGNC
    Source HGNC, Entrez Gene, Ensembl
    Aliases and DescriptionsAliases, DescriptionsHGNC, UniProt, SwissProt, TrEMBL, NCBI, GDB, OMIM, GeneLoc
    Genomic LocationChromosome, strand, cytogenetic band, genomic location start/end, gene size, Mapped to contig (flag if the information is not genomic)GeneLoc, NCBI, Ensembl, Entrez Gene, HGNC, HORDE, miRBase
    ProteinsAliases, UniProt ID, protein name, size, cofactor, subunit, subcellular location, tissue specificity, developmental stage, ptm, miscellaneous, RefSeq ID, Ensembl IDUniProt, Entrez Gene(NCBI), Ensembl
    Protein Domains / FamiliesInterPro domains, UniProt ID, UniProt domains, UniProt similaritiesEBI, UniProt
    Gene FunctionUniProt ID, UniProt function, MGI mutant phenotypeUniProt, MGD
    OntologiesGO ID, GO termGO
    Pathways and InteractionsKEGG ID, KEGG pathway description, Invitrogen ID, Invitrogen pathway description, CST(Cell Signaling Technology) ID, CST pathway descriptionKEGG, iPath, CST
    AKS compoundsCompound nameAKS
    TranscriptsRefseq transcripts, Unigene cluster, Unigene cluster descriptionRefSeq, Unigene
    Expression in Human tissuesU95 probe-sets, binary expression patterns in tissues ordered as in the GeneCards display, sensitivity, specificityGeneNote, GeneAnnot
    Similar Genes in Other OrganismsOrganism, gene, percent protein similarity to the human gene, percent nucleotide similarity to the human geneHomoloGene, euGenes
    HomoloGene ParalogsgeneHomoloGene
    Ensembl ParalogsgeneEnsembl
    SNPs/Variantsnumber of NCBI snps, NCBI ID, location type, populations studied, validation, position, amino acid change, sequence, number of sources
    number of additional AB snps, AB ID
    NCBI, AB
    OMIM disordersOMIM ID, Disorder descriptionOMIM
    UniProt disordersUniProt ID, Disorder descriptionUniProt
    AKS disordersDisorder descriptionAKS
    Other Genome Wide ResourcesEntrezGene ID, Ensembl IDNCBI, Ensembl


    Field formats

    GeneCards_ID

    gc_id

    Category

    protein-coding, pseudogene, RNA gene, genetic locus, gene cluster, or uncategorized

    Gene Description

    Gene description according to HGNC or Entrez Gene

    Approval

    Approved, not approved according to HGNC

    Source

    Gene symbol from HGNC, Entrez Gene or Ensembl

    Aliases

    alias1|alias2|alias3|...

    Descriptions

    description1|description2|description3|...

    Chromosome

    1-22 or X,Y,MT (mitochondria). NA appears where chromosome is unknown.

    Strand

    Plus, Minus or NA (where strand is unknown)

    Cytogenetic band

    Cytogenetic band according to Entrez Gene, Ensembl and/or HGNC

    Gene start

    chromosomal coordinate in bp from pter

    Gene end

    chromosomal coordinate in bp from pter

    Gene size

    size of genomic sequence in bp

    UniProt Protein details

    source:aliases|uniprot protein_id1#protein name#size#cofactor & cofactor#subunit & subunit#subcellular location & subcellular location#tissue specificity & tissue specificity# developmental stage & developmental stage#ptm & ptm#miscellaneous & miscellaneous|
    uniprot protein_id2#protein name#size#cofactor..#subunit..#subcellular location..#tissue specificity..#developmental stage..#ptm..#miscellaneous..|
    uniprot protein_id3#protein name#size#cofactor..#subunit..#subcellular location..#tissue specificity..#developmental stage..#ptm..#miscellaneous..|...

    RefSeq Protein ID

    refseq protein_id1|refseq protein_id2|refseq protein_id3|...

    Ensembl Protein ID

    ensembl protein_id1|ensembl protein_id2|ensembl protein_id3|...

    InterPro domains and families

    InterPro_id1#domain_name|InterPro_id2#domain_name...

    UniProt Domains and Families

    source:uniprot_id1#domain & domain..#similarity & similarity..|uniprot_id2#domain & domain..#similarity & similarity..|uniprot_id3#domain & domain..#similarity & similarity..|...

    Gene Function - UniProt

    source:uniprot_id1#function|uniprot_id2#function|uniprot_id3#function...

    Gene Function - MGI mutant phenotype

    Allele1: MGI_id1#Phenotypes: |Allele2: MGI_id2#Phenotypes: phenotype_id1#phenotype_name||phenotype_id2#phenotype_name...

    Gene Ontologies (GO)

    go_id1#go_term1|go_id2#go_term2|go_id3#go_term3|...

    Pathways (KEGG)

    kegg_id1#kegg_pathway1|kegg_id2#kegg_pathway2|kegg_id3#kegg_pathway3|...

    Pathways (Invitrogen)

    Invitrogen_id1#Invitrogen_pathway1|Invitrogen_id2#Invitrogen_pathway2|Invitrogen_id3#Invitrogen_pathway3|...

    Pathways (CST)

    cst_id1#cst_pathway1|cst_id2#cst_pathway2|cst_id3#cst_pathway3|...

    AKS Compounds

    compound1|compound2|compound3|...

    Transcripts (Refseq)

    transcript1|transcript2|transcript3|...

    Transcripts (Unigene)

    ug_cluster1#description|ug_cluster2#description|ug_cluster3#description...

    Expression in Human tissues

    probe-set_id1#binary_pattern1#sensitivity1#specificity1|probe-set_id2#binary_pattern2#sensitivity2#specificity2|probe-set_id3#binary_pattern3#sensitivity3#specificity3...

    Similar Genes in Other Organisms

    organism1#gene1#percent protein similarity#percent nucleotide similarity|organism2#gene2#percent protein similarity#percent nucleotide similarity|organism3#gene3#percent protein similarity#percent nucleotide similarity|...

    The organisms are depicted by their two or three letter acronyms as follows:
    AcronymScientific nameCommon name
    AgaAnopheles gambiaeAfrican malaria mosquito
    AtArabidopsis thalianaThale cress
    BtBos taurusCow
    CelCaenorhabditis elegansWorm
    CfaCanis familiarisDog
    CinCiona intestinalisSea squirt
    CreChlamydomonas reinhardtiiGreen algae
    DdiDictyostelium discoideumAmoeba
    DmDrosophila melanogasterFruit fly
    DrDanio rerioZebrafish
    EgAshbya gossypiiA. gosspyii yeast
    GgaGallus gallusChicken
    GmaGlycine maxSoybean
    HvHordeum vulgareBarley
    KlKluyveromyces lactisK. lactis yeast
    LesLycopersicon esculentumTomato
    MgrMagnaporthe griseaRice blast fungus
    MmMus musculusMouse
    MtrMedicago truncatulaMedicago trunc
    NcrNeurospora crassaBread mold
    OmyOncorhynchus mykissRainbow trout
    OsOryza sativaRice
    PfPlasmodium falciparumMalaria parasite
    PtPan troglodytesChimpanzee
    PtaPinus taedaLoblolly pine
    RnRattus norvegicusRat
    SbiSorghum bicolorSorghum
    ScSaccharomyces cerevisiaeBaker's yeast
    SmaSchistosoma mansoniSchistosome parasite
    SofSaccharum officinarumSugarcane
    SpSchizosaccharomyces pombeFission yeast
    SscSus scrofaPig
    StrSilurana tropicalisTropical clawed frog
    TaTriticum aestivumWheat
    TgoToxoplasma gondiiToxoplasmosis
    VvaVitis viniferaAlicante grape
    XlXenopus laevisAfrican clawed frog
    ZmZea maysCorn


    Homologene Paralogs

    gene1|gene2|gene3|...

    Ensembl Paralogs

    gene1|gene2|gene3|...

    Snps (NCBI)

    number of snps: ncbi_id1#location type#populations studied#validation#position#amino acid change#sequence#number of sources|ncbi_id2#location type#populations studied#validation#position#amino acid change#sequence#number of sources...

    Snps (AB)

    additional number of snps: ab_id1, ab_id2, ab_id3...

    Disorders - Omim_ID & disorder description

    omim_id#disorder1|disorder2|disorder3|...

    UniProt Disorders

    source:uniprot_id1#disorder & disorder..|uniprot_id2#disorder & disorder..|uniprot_id3#disorder & disorder..|...

    AKS Disorders

    disorder1|disorder2|disorder3|...

    EntrezGene_ID

    entrezgene_id

    Ensembl_ID

    ensembl_id





  • Developed at the Crown Human Genome Center & at the Weizmann Institute of Science
    Back to top


    Copyright © 1997-2006, Weizmann Institute of Science. All Rights Reserved.