<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
    <title>The SEED Servers</title>
    <link rel="alternate" type="text/html" href="http://blog.theseed.org/servers/" />
    <link rel="self" type="application/atom+xml" href="http://blog.theseed.org/servers/atom.xml" />
    <id>tag:blog.theseed.org,2010-06-01:/servers//12</id>
    <updated>2011-04-20T03:26:51Z</updated>
    <subtitle>A network API for accessing the SEED data.</subtitle>
    <generator uri="http://www.sixapart.com/movabletype/">Movable Type Pro 5.02</generator>

<entry>
    <title>What SEED Do I Use?</title>
    <link rel="alternate" type="text/html" href="http://blog.theseed.org/servers/2011/04/what-seed-do-i-use.html" />
    <id>tag:blog.theseed.org,2011:/servers//12.972</id>

    <published>2011-04-20T03:26:20Z</published>
    <updated>2011-04-20T03:26:51Z</updated>

    <summary><![CDATA[ What SEED Do I Use? March 17, 2011 Ross Overbeek A number of distinct SEEDs have emerged over the years.&nbsp; Almost all users will find they wish to use just the following two: &nbsp; 1.&nbsp;&nbsp;&nbsp;&nbsp; The PubSEED will rapidly...]]></summary>
    <author>
        <name>The SEED Team</name>
        
    </author>
    
        <category term="News" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en-us" xml:base="http://blog.theseed.org/servers/">
        <![CDATA[<!--StartFragment-->

<h2><span lang="X-NONE" style="font-family:Calibri">What SEED Do I Use?<o:p></o:p></span></h2>

<p class="MsoNormal">March 17, 2011</p>

<p class="MsoNormal">Ross Overbeek</p>

<p class="MsoNormal">A number of distinct SEEDs have emerged over the years.<span style="mso-spacerun: yes">&nbsp; </span>Almost all users will find they wish to
use just the following two:</p>

<p class="MsoNormal"><o:p>&nbsp;</o:p></p>

<p class="MsoNormal" style="margin-left:.5in;text-indent:-.25in;mso-list:l0 level1 lfo1"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">1.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp; </span></span></span>The
<b style="mso-bidi-font-weight:normal">PubSEED </b>will rapidly become the
central SEED for use by the research community.<span style="mso-spacerun:
yes">&nbsp; </span>It will support access to the largest collection of
genomes.<span style="mso-spacerun: yes">&nbsp; </span>The constant influx of
new genomes will be to the PubSEED first.<span style="mso-spacerun: yes">&nbsp;
</span>The PubSEED will support the ability for registered users to make
annotations and subsystems (unfortunately, that implies that they will be able
to overwrite the work of others, too).<span style="mso-spacerun: yes">&nbsp;
</span>We will support the ability for registered users to install genomes from
RAST directly into the PubSEED.<span style="mso-spacerun: yes">&nbsp;
</span>Access and update capabilities, by genome, will require establishing a
notion of <i style="mso-bidi-font-style:normal">ownership</i> <i style="mso-bidi-font-style:normal">of genomes.<span style="mso-spacerun:
yes">&nbsp; </span></i>Users will be allowed to <i style="mso-bidi-font-style:
normal">copy a genome</i> creating a version with ownership rights that they
control.<span style="mso-spacerun: yes">&nbsp; </span>We will architect a few
basic rules, and then we will do our best to develop tools that support
reconciliation of conflicts, backup and recovery to specific points in time,
and so forth.<span style="mso-spacerun: yes">&nbsp; </span>This SEED will be
the center of much of our work.</p>

<p class="MsoNormal"><o:p>&nbsp;</o:p></p>

<p class="MsoNormal" style="margin-left:.5in;text-indent:-.25in;mso-list:l0 level1 lfo1"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">2.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp; </span></span></span>The
<b style="mso-bidi-font-weight:normal">UC-SEED </b>(i.e., the University of
Chicago SEED) will be rebuilt periodically<span style="mso-spacerun:
yes">&nbsp; </span>as a copy of the PubSEED.<span style="mso-spacerun:
yes">&nbsp; </span>It is a place where classes can be held, students can do
annotations and build subsystems, and so forth.<span style="mso-spacerun:
yes">&nbsp; </span>Subsystems built in the UC-SEED can be <i style="mso-bidi-font-style:
normal">exported to the Clearinghouse</i>, and then <i style="mso-bidi-font-style:
normal">imported to the PubSEED</i>.<span style="mso-spacerun: yes">&nbsp;
</span>This procedure allows users to save work and make it available, if they
wish.<span style="mso-spacerun: yes">&nbsp; </span>However, the whole system is
rebuilt and the existing contents destroyed on a periodic basis (usually
between semesters, and after several weeks in which there will be a posted
notice on the front page).</p>

<p class="MsoNormal"><o:p>&nbsp;</o:p></p>

<p class="MsoNormal"><o:p>&nbsp;</o:p></p>

<h3><span lang="X-NONE">Ownership of Genomes</span></h3>

<p class="MsoNormal">I am now discussing a basic position relating to genomes
that is not yet fully implemented.<span style="mso-spacerun: yes">&nbsp;
</span>I believe that it will move to the position I describe within 4-6
months.<b style="mso-bidi-font-weight:normal"><span style="font-size:14.0pt;
mso-bidi-font-size:11.0pt;line-height:115%"><o:p></o:p></span></b></p>

<p class="MsoNormal"><b style="mso-bidi-font-weight:normal"><span style="font-size:14.0pt;mso-bidi-font-size:11.0pt;line-height:115%"><o:p>&nbsp;</o:p></span></b></p>

<p class="MsoNormal">There will soon be hundreds of thousands of genomes.<span style="mso-spacerun: yes">&nbsp; </span>Many of these genomes will be either
identical or almost identical.<span style="mso-spacerun: yes">&nbsp; </span>In
some cases the genomic sequence data will be identical, but distinct user
groups will insist on the ability to annotate isolated copies that are
protected from unauthorized updates.<span style="mso-spacerun: yes">&nbsp;
</span>Determination of a protocol that effectively supports both sharing and
isolated annotation will require support for effectively managing privileges
and interactions in a way that minimally constrains experts attempting to
contribute.</p>

<p class="MsoNormal"><o:p>&nbsp;</o:p></p>

<p class="MsoNormal">It will rapidly become critical that we be able to talk
about genomes, contigs, genes, and proteins and to easily detect whether two
references are to the "same" entity.<span style="mso-spacerun: yes">&nbsp;
</span>As we move into a world with hundreds of thousands of genomes, some with
identical sequence, and others with sequences that differ by only a few
characters, it will become critical that we support basic <b style="mso-bidi-font-weight:
normal">ID Correspondence Services</b> in a consistent manner.</p>

<p class="MsoNormal"><o:p>&nbsp;</o:p></p>

<p class="MsoNormal">We suggest employing the following set of definitions for
what it means to be "the same" for <b style="mso-bidi-font-weight:normal">genomes,
contigs, genes, </b>and <b style="mso-bidi-font-weight:normal">proteins.<o:p></o:p></b></p>

<p class="MsoNormal"><b style="mso-bidi-font-weight:normal"><o:p>&nbsp;</o:p></b></p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:.5in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:-.25in;
line-height:normal;mso-list:l1 level1 lfo2"><b style="mso-bidi-font-weight:normal"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">1.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp;
</span></span></span></b>Two sequences are the same if the MD5
functions of the uppercase versions of the sequences are identical.<b style="mso-bidi-font-weight:normal"><o:p></o:p></b></p>

<p class="MsoNormal"><b style="mso-bidi-font-weight:normal"><o:p>&nbsp;</o:p></b></p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:.5in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:-.25in;
line-height:normal;mso-list:l1 level1 lfo2"><b style="mso-bidi-font-weight:normal"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">2.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp;
</span></span></span></b>Two contigs are considered the same if their
DNA sequences are the same.<span style="mso-spacerun: yes">&nbsp; </span><b style="mso-bidi-font-weight:normal"><o:p></o:p></b></p>

<p class="MsoNormal"><b style="mso-bidi-font-weight:normal"><o:p>&nbsp;</o:p></b></p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:.5in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:-.25in;
line-height:normal;mso-list:l1 level1 lfo2"><b style="mso-bidi-font-weight:normal"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">3.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp;
</span></span></span></b>Two genomes are considered the same iff <b style="mso-bidi-font-weight:normal"><o:p></o:p></b></p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:1.0in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:
-.25in;line-height:normal;mso-list:l1 level2 lfo2"><b style="mso-bidi-font-weight:normal"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">a.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp;
</span></span></span></b>They have the same number of contigs.<b style="mso-bidi-font-weight:normal"><o:p></o:p></b></p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:1.0in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:
-.25in;line-height:normal;mso-list:l1 level2 lfo2"><b style="mso-bidi-font-weight:normal"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">b.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp;
</span></span></span></b>The MD5 function of the sorted and
concatenated contig MD5s match.<span style="mso-spacerun: yes">&nbsp; </span>We
call the MD5 function of the sorted and concatenated contigs the <b style="mso-bidi-font-weight:normal">MD5 of the genome.<o:p></o:p></b></p>

<p class="MsoNormal" style="margin-left:1.0in"><b style="mso-bidi-font-weight:
normal"><o:p>&nbsp;</o:p></b></p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:.5in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:-.25in;
line-height:normal;mso-list:l1 level1 lfo2"><b style="mso-bidi-font-weight:normal"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">4.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp;
</span></span></span></b>Two genes are considered identical if they
are in genomes that are the same, and <b style="mso-bidi-font-weight:normal"><o:p></o:p></b></p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:1.0in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:
-.25in;line-height:normal;mso-list:l1 level2 lfo2"><b style="mso-bidi-font-weight:normal"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">a.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp;
</span></span></span></b>They occur in contigs that are the same, <b style="mso-bidi-font-weight:normal"><o:p></o:p></b></p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:1.0in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:
-.25in;line-height:normal;mso-list:l1 level2 lfo2"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">b.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp; </span></span></span>They
have identical start and stop positions in the two contigs.<span style="mso-spacerun: yes">&nbsp; </span></p>

<p class="MsoNormal" style="margin-left:1.0in"><b style="mso-bidi-font-weight:
normal"><o:p>&nbsp;</o:p></b></p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:.5in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:-.25in;
line-height:normal;mso-list:l1 level1 lfo2"><b style="mso-bidi-font-weight:normal"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">5.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp;
</span></span></span></b>Two proteins are the same if their sequences
are the same (note that this is not a notion that is equivalent to saying that
they are the gene products of two genes that are the same).<b style="mso-bidi-font-weight:
normal"><o:p></o:p></b></p>

<p class="MsoNormal"><o:p>&nbsp;</o:p></p>

<p class="MsoNormal">We will support the ability to rapidly determine which
genomes, genes, and proteins are identical.<span style="mso-spacerun:
yes">&nbsp; </span>Further, we will support the capability of users defining
sets of representative genomes and limiting displays to any selected set.<span style="font-size:14.0pt;mso-bidi-font-size:11.0pt;line-height:115%"><o:p></o:p></span></p>

<p class="MsoNormal"><span style="font-size:14.0pt;mso-bidi-font-size:11.0pt;
line-height:115%"><o:p>&nbsp;</o:p></span></p>

<p class="MsoNormal">We are architecting the SEED environment as a framework that
will be able to effectively integrate initially thousands, and within a short
period millions, of distinct genomes.<span style="mso-spacerun:
yes">&nbsp;&nbsp; </span>Genomes will enter the collection from a growing
number of sources.<span style="mso-spacerun: yes">&nbsp;&nbsp; </span><i style="mso-bidi-font-style:normal">Registerying a Genome</i> will amount to
claiming unique IDs for the genome and the features that occur within the
genome.<span style="mso-spacerun: yes">&nbsp; </span>This will inevitably lead
to multiple registrations for identical genomes.<span style="mso-spacerun:
yes">&nbsp; </span>Further, while we will not support alteration of the
sequence of a genome (i.e., such a change would lead to the creation and
registration of a new genome), we will support addition and deletion of
features on a genome.<span style="mso-spacerun: yes">&nbsp; </span>A deletion
will lead to recording a change in status (retaining a complete record of the
deleted feature indefinitely).<span style="mso-spacerun: yes">&nbsp; </span>The
addition of a feature would require the acquisition on a new ID.<span style="mso-spacerun: yes">&nbsp; </span>Changing the start location of a gene
would cause deletion of the existing feature and addition of a new feature,
which would inherit the appropriate attributes from the deleted feature.</p>

<p class="MsoNormal"><o:p>&nbsp;</o:p></p>

<p class="MsoNormal">The SEED environment will support the maintenance of genomes
and features via a set of services that will include:</p>

<p class="MsoNormal"><o:p>&nbsp;</o:p></p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:.75in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:
-.25in;line-height:normal;mso-list:l4 level1 lfo3"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">1.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp; </span></span></span>acquire_a_genome_ID<span style="mso-spacerun:
yes">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
</span>returns a genome ID to a registered user</p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:.75in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:
-.25in;line-height:normal;mso-list:l4 level1 lfo3"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">2.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp; </span></span></span>acquire_a_feature_ID(Genome,Contig,Start,Stop)<span style="mso-spacerun: yes">&nbsp; </span>returns a feature ID</p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:.75in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:
-.25in;line-height:normal;mso-list:l4 level1 lfo3"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">3.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp; </span></span></span>delete_a_feature(ID)<span style="mso-spacerun: yes">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span><span style="mso-spacerun:
yes">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</span>requires a update
privileges</p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:.75in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:
-.25in;line-height:normal;mso-list:l4 level1 lfo3"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">4.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp; </span></span></span>reactivate
a deleted genome(ID)<span style="mso-spacerun: yes">&nbsp;&nbsp;&nbsp;
</span>requires update privileges<span style="mso-spacerun:
yes">&nbsp;&nbsp;&nbsp;&nbsp; </span></p>

<p class="MsoNormal"><b style="mso-bidi-font-weight:normal"><span style="font-size:14.0pt;mso-bidi-font-size:11.0pt;line-height:115%"><o:p>&nbsp;</o:p></span></b></p>

<p class="MsoNormal">Registered users will be able to make any of these
operations against genomes for which they have the required privileges.<span style="mso-spacerun: yes">&nbsp; </span>Users owning genomes will have the
ability to restrict access to a specified set of users.<span style="mso-spacerun: yes">&nbsp; </span>That is, we will support <i style="mso-bidi-font-style:normal">private genomes</i> that are not seen by
everyone, and we will support the ability of owners to change the status of a
genome (from <i style="mso-bidi-font-style:normal">private </i>to <i style="mso-bidi-font-style:normal">public</i> and vice versa).</p>

<p class="MsoNormal"><o:p>&nbsp;</o:p></p>

<p class="MsoNormal">Perhaps a short summary of the decision procedures on
access/update rights would be as follows:</p>

<p class="MsoNormal"><o:p>&nbsp;</o:p></p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:.5in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:-.25in;
line-height:normal;mso-list:l2 level1 lfo4"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">1.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp; </span></span></span>We
have <b style="mso-bidi-font-weight:normal">registered users.<span style="mso-spacerun: yes">&nbsp; </span></b>Users are either <b style="mso-bidi-font-weight:normal">superusers</b> or <b style="mso-bidi-font-weight:
normal">normal users.</b></p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:.5in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:-.25in;
line-height:normal;mso-list:l2 level1 lfo4"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">2.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp; </span></span></span>We
have genomes.<span style="mso-spacerun: yes">&nbsp; </span>Genomes are either <b style="mso-bidi-font-weight:normal">private </b>or <b style="mso-bidi-font-weight:
normal">public.</b></p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:.5in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:-.25in;
line-height:normal;mso-list:l2 level1 lfo4"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">3.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp; </span></span></span>Anyone
attempting to access a genome or a feature of a genome will be given access if
and only if</p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:1.0in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:
-.25in;line-height:normal;mso-list:l2 level2 lfo4"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">a.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp; </span></span></span>the
genome is public, or</p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:1.0in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:
-.25in;line-height:normal;mso-list:l2 level2 lfo4"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">b.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp; </span></span></span>the
user is a superuser, or</p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:1.0in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:
-.25in;line-height:normal;mso-list:l2 level2 lfo4"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">c.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp; </span></span></span>the
user either owns the genome or has been granted access to the genome.</p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:.5in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:-.25in;
line-height:normal;mso-list:l2 level1 lfo4"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">4.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp; </span></span></span>Anyone
attempting to update a genome (which includes annotating features, deleting
features, and adding features) will be allowed to make the update iff and only
if</p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:1.0in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:
-.25in;line-height:normal;mso-list:l2 level2 lfo4"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">a.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp; </span></span></span>the
genome is public, or</p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:1.0in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:
-.25in;line-height:normal;mso-list:l2 level2 lfo4"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">b.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp; </span></span></span>the
user is a superuser, or</p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:1.0in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:
-.25in;line-height:normal;mso-list:l2 level2 lfo4"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">c.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp; </span></span></span>the
user either owns the genome or has been granted write privileges to the genome.</p>

<p class="MsoNormal"><b style="mso-bidi-font-weight:normal"><span style="font-size:14.0pt;mso-bidi-font-size:11.0pt;line-height:115%"><o:p>&nbsp;</o:p></span></b></p>

<h3><span lang="X-NONE">Setting Up an Annotation Group</span></h3>

<p class="MsoNormal">If a group wishes to use the SEED Environment as a resource
for supporting annotation and analysis of their genome, they would begin by
registering each member of the group, and then establishing a group containing
those members.</p>

<p class="MsoNormal">They would select the genomes they wish to annotate
(probably by importing a newly-annotated genome from RAST into the
PubSEED).<span style="mso-spacerun: yes">&nbsp; </span>They would decide
whether access and update privileges should be restricted to the group or not.</p>

<p class="MsoNormal"><o:p>&nbsp;</o:p></p>

<p class="MsoNormal">Then, they would use the framework we currently use to
support our annotators to examine and edit annotations, construct metabolic
models, or whatever.<span style="mso-spacerun: yes">&nbsp; </span>The set of
genomes that would be simultaneously be edited could all be public or all be
private.<span style="mso-spacerun: yes">&nbsp; </span>If private, they would be
imported from RAST or as copies of existing genomes. </p>

<p class="MsoNormal"><b style="mso-bidi-font-weight:normal"><span style="font-size:14.0pt;mso-bidi-font-size:11.0pt;line-height:115%"><o:p>&nbsp;</o:p></span></b></p>

<h3><span lang="X-NONE">Access of Data Via the Servers</span></h3>

<p class="MsoNormal">Most users of the SEED will use a web browser.<span style="mso-spacerun: yes">&nbsp; </span>However, a growing body of users will
also start using our <b style="mso-bidi-font-weight:normal">SEED Servers</b>,
which support a well-defined API to access and update data from a SEED.<span style="mso-spacerun: yes">&nbsp; </span>We will run servers for the
PubSEED.<span style="mso-spacerun: yes">&nbsp; </span>See </p>

<p class="MsoNormal"><o:p>&nbsp;</o:p></p>

<p class="MsoNormal" style="text-indent:.5in"><a href="http://servers.nmpdr.org/servers">http://servers.nmpdr.org/servers</a></p>

<p class="MsoNormal"><span style="mso-spacerun: yes">&nbsp;</span></p>

<p class="MsoNormal"><span style="mso-spacerun: yes">&nbsp;</span>for a
discussion of the servers, the APIs used to access them, and the command-line
services supported via the servers.<span style="mso-spacerun: yes">&nbsp;
</span>We believe that research groups may wish to use or help extend this
growing confederation of servers. </p>

<p class="MsoNormal"><b style="mso-bidi-font-weight:normal"><span style="font-size:14.0pt;mso-bidi-font-size:11.0pt;line-height:115%"><o:p>&nbsp;</o:p></span></b></p>

<h3><span lang="X-NONE">Maintenance of Subsystems</span></h3>

<p class="MsoNormal">The PubSEED, and UC-SEED both support development of
subsystems.<span style="mso-spacerun: yes">&nbsp; </span>From any of these
platforms, subsystems can be exported to the <b style="mso-bidi-font-weight:
normal">Clearinghouse</b>, and they can be imported into any other SEED (if you
have the appropriate privileges).<span style="mso-spacerun: yes">&nbsp;
</span>We anticipate that students in classes would use the UC-SEED to avoid
destroying the work of others.<span style="mso-spacerun: yes">&nbsp;
</span>Users wishing to make a more permanent contribution would use the
PubSEED.</p>

<p class="MsoNormal"><o:p>&nbsp;</o:p></p>

<p class="MsoNormal">We will try to install a few basic rules to prevent
bloodshed in instances in which incompatible annotations must be
reconciled.<span style="mso-spacerun: yes">&nbsp; </span>They would be
something like</p>

<p class="MsoNormal"><o:p>&nbsp;</o:p></p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:.5in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:-.25in;
line-height:normal;mso-list:l3 level1 lfo5"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">1.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp; </span></span></span>You
may overwrite any annotation that is not in a subsystem or is a duplicate in a
subsystem (i.e., a case in which two genes currently have the same assigned
functional role).</p>

<p class="MsoNormal"><o:p>&nbsp;</o:p></p>

<p class="MsoNormalCxSpMiddle" style="margin-top:0in;margin-right:0in;margin-bottom:
0in;margin-left:.5in;margin-bottom:.0001pt;mso-add-space:auto;text-indent:-.25in;
line-height:normal;mso-list:l3 level1 lfo5"><span style="mso-bidi-font-family:Calibri"><span style="mso-list:Ignore">2.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp;&nbsp;&nbsp; </span></span></span>Before
overwriting a function in which a gene plays a unique role in someone else's
subsystem, email them and ask for permission.<span style="mso-spacerun:
yes">&nbsp; </span>If they do not respond with a few days, proceed.</p>

<p class="MsoNormal"><o:p>&nbsp;</o:p></p>

<p class="MsoNormal">As the number of genomes grows rapidly, we believe that
fewer and fewer annotators will actually construct and maintain comprehensive
subsystems.<span style="mso-spacerun: yes">&nbsp; </span>Rather, there will be
a growing number of subsystems that contain only a subset of the actual genomes
that have the machinery.<span style="mso-spacerun: yes">&nbsp; </span>To handle
this situation, we will periodically produce estimates, for each genome, of the
subsystems that should contain the genome.<span style="mso-spacerun:
yes">&nbsp; </span>These will not impact any of the subsystems, but will allow
users to have a reasonable estimate of the molecular machinery that can be
identified.<span style="mso-spacerun: yes">&nbsp; </span>This mimics what is
now done in RAST, where a new genome contains estimates of which genes go into
which subsystems, but these estimates do not actually impact the subsystems
themselves.</p>

<p class="MsoNormal"><o:p>&nbsp;</o:p></p>

<p class="MsoNormal">The real point is that subsystems will no longer be thought
of as comprehensive.<span style="mso-spacerun: yes">&nbsp; </span>Up to this
point the goal was to provide the tools needed to support manual curation of
subsystems that were to contain as many genomes as possible from the existing
collection.<span style="mso-spacerun: yes">&nbsp;&nbsp; </span>The goal will
shift.<span style="mso-spacerun: yes">&nbsp; </span>We will think of subsystems
as containing a diverse collection of instances needed to support accurate
projection over the entire collection. The PubSEED will be used to house as
complete a collection as possible, to support experimentation, and will
inevitably lead to conflicts (that, hopefully, enrich the overall collection
and get resolved peaceably).</p>

<p class="MsoNormal"><o:p>&nbsp;</o:p></p>

<p class="MsoNormal" align="center" style="text-align:center"><b style="mso-bidi-font-weight:
normal"><span style="font-size:14.0pt;mso-bidi-font-size:11.0pt;line-height:
115%"><o:p>&nbsp;</o:p></span></b></p>

<p class="MsoNormal"><span class="MsoHyperlink"><o:p><span style="text-decoration:
 none">&nbsp;</span></o:p></span></p>

<!--EndFragment-->


]]>
        
    </content>
</entry>

<entry>
    <title>The SEED Book</title>
    <link rel="alternate" type="text/html" href="http://blog.theseed.org/servers/2011/04/the-seed-book.html" />
    <id>tag:blog.theseed.org,2011:/servers//12.942</id>

    <published>2011-04-04T16:01:32Z</published>
    <updated>2011-04-05T19:00:58Z</updated>

    <summary><![CDATA[We have compiled many of these blog entries into a book form. The book is viewable in PDF form at&nbsp;SEEDBook, the word version can be downloaded&nbsp;here,Data for the April 2011 tutorial is available here&nbsp;http://www.theseed.org/TheBook/GXP.htmRoss's Atomic Regulon Page is&nbsp;here...]]></summary>
    <author>
        <name>The SEED Team</name>
        
    </author>
    
        <category term="News" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en-us" xml:base="http://blog.theseed.org/servers/">
        <![CDATA[We have compiled many of these blog entries into a book form. The book is viewable in PDF form at&nbsp;<a href="http://www.theseed.org/TheBook/SEEDBook.pdf">SEEDBook</a>, the word version can be downloaded&nbsp;<a href="http://www.theseed.org/TheBook/SEEDBook.doc">here</a>,<div><br /></div><div>Data for the April 2011 tutorial is available here&nbsp;</div><div><br /></div><div><a href="http://www.theseed.org/TheBook/GXP.htm">http://www.theseed.org/TheBook/GXP.htm</a><br /></div><div><br /></div><div>Ross's Atomic Regulon Page is&nbsp;<a href="http://www.theseed.org/TheBook/atomic_regulons.htm">here</a></div><div><br /></div><div><br /></div><div><br /><div><br /></div></div>]]>
        
    </content>
</entry>

<entry>
    <title>Some simple Sapling examples</title>
    <link rel="alternate" type="text/html" href="http://blog.theseed.org/servers/2010/11/some-simple-sapling-examples.html" />
    <id>tag:blog.theseed.org,2010:/servers//12.922</id>

    <published>2010-11-16T22:14:51Z</published>
    <updated>2010-11-17T18:08:21Z</updated>

    <summary><![CDATA[At a brief session at the last workshop we built several simple applications using the Sapling API.The first is a simple replica of the svr_all_genomes script:&nbsp;&nbsp; &nbsp;&nbsp;allgenomesThe second allows one to recall all the proteins in the vibrio genomes:&nbsp;&nbsp; &nbsp;&nbsp;recall_vibriosThe...]]></summary>
    <author>
        <name>Robert Olson</name>
        
    </author>
    
        <category term="Coding Examples" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="Tutorials" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="Workshop" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="applicationprogramminginterface" label="Application programming interface" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en-us" xml:base="http://blog.theseed.org/servers/">
        <![CDATA[<div style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; padding-top: 10px; padding-right: 10px; padding-bottom: 10px; padding-left: 10px; height: 90%; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; background-image: initial; background-attachment: initial; background-origin: initial; background-clip: initial; background-color: rgb(255, 255, 255); color: rgb(51, 51, 51); font: normal normal normal 13px/normal arial, helvetica, hirakakupro-w3, osaka, 'ms pgothic', sans-serif; font-family: Times; font-size: medium; ">At a brief session at the last workshop we built several simple applications using the Sapling API.<div><br /></div><div>The first is a simple replica of the svr_all_genomes script:</div><div><br /></div><div>&nbsp;&nbsp; &nbsp;&nbsp;<a href="http://blog.theseed.org/downloads/tutorial-scripts/allgenomes" style="text-decoration: underline; ">allgenomes</a></div><div><br /></div><div>The second allows one to recall all the proteins in the vibrio genomes:</div><div><br /></div><div>&nbsp;&nbsp; &nbsp;&nbsp;<a href="http://blog.theseed.org/downloads/tutorial-scripts/recall_vibrios" style="text-decoration: underline; ">recall_vibrios</a></div><div><br /></div><div>The third allows one to recall the proteins in a given fasta file, and compare to a file of annotations for the proteins in that genome:</div><div><br /></div><div>&nbsp;&nbsp; &nbsp;<a href="http://blog.theseed.org/downloads/tutorial-scripts/compare_functions" style="text-decoration: underline; ">compare_functions</a>&nbsp;fasta-file function-file</div><div><br /></div><div>The last does a potentially dubious translation of the hits from a myRAST metagenomics output run into proteins.</div><div><br /></div><div>&nbsp;&nbsp; &nbsp;<a href="http://blog.theseed.org/downloads/tutorial-scripts/hits-to-protein">hits-to-protein</a> dna-fasta hits-file genetic-code</div><div><br /></div></div> 

<div class="zemanta-pixie" style="margin-top:10px;height:15px"><a class="zemanta-pixie-a" href="http://www.zemanta.com/" title="Enhanced by Zemanta"><img class="zemanta-pixie-img" src="http://img.zemanta.com/zemified_e.png?x-id=c83519d9-ba5b-4dee-96d5-b9d43c0f13d7" alt="Enhanced by Zemanta" style="border:none;float:right" /></a><span class="zem-script more-related pretty-attribution"><script type="text/javascript" src="http://static.zemanta.com/readside/loader.js" defer="defer"></script></span></div>]]>
        
    </content>
</entry>

<entry>
    <title>An Exercise to Get a List of Homologs of Virulence Factors</title>
    <link rel="alternate" type="text/html" href="http://blog.theseed.org/servers/2010/10/an-exercise-to-get-a-list-of-homologs-of-virulence-factors.html" />
    <id>tag:blog.theseed.org,2010:/servers//12.852</id>

    <published>2010-10-05T14:05:03Z</published>
    <updated>2010-10-06T19:00:56Z</updated>

    <summary><![CDATA[ An Exercise to Get a List of Homologs of Virulence Factors Suppose that you have a file containing data relating to known virulence factors.&nbsp;For example, here are the first few lines of a file given to me by a...]]></summary>
    <author>
        <name>The SEED Team</name>
        
    </author>
    
        <category term="Tutorials" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en-us" xml:base="http://blog.theseed.org/servers/">
        <![CDATA[<div><p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233"><span style="letter-spacing: 0.0px"><span class="Apple-tab-span" style="white-space:pre">	</span><b>An Exercise to Get a List of Homologs of Virulence Factors</b></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233; min-height: 15.0px"><span style="letter-spacing: 0.0px"></span><br /></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233"><span style="letter-spacing: 0.0px">Suppose that you have a file containing data relating to known virulence factors.&nbsp;For example, here are the first few lines of a file given to me by a participant&nbsp;in one of our tutorials (These columns are separated by tab characters):</span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233; min-height: 15.0px"><span style="letter-spacing: 0.0px"></span><br /></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233"><span style="letter-spacing: 0.0px">VFG_id<span class="Apple-tab-span" style="white-space:pre">	</span>gi_id<span class="Apple-tab-span" style="white-space:pre">	</span>Gene_name<span class="Apple-tab-span" style="white-space:pre">	</span>Product<span class="Apple-tab-span" style="white-space:pre">	</span>Organism<span class="Apple-tab-span" style="white-space:pre">	</span>VF_id</span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233; min-height: 15.0px"><span style="letter-spacing: 0.0px"></span><br /></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233"><span style="letter-spacing: 0.0px">VFG1293 21282773 &nbsp; &nbsp; &nbsp; &nbsp;hla &nbsp; &nbsp; Alpha-Hemolysin precursor &nbsp; &nbsp; &nbsp; Staphylococcus aureus MW2 &nbsp; &nbsp; &nbsp; VF0001</span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233"><span style="letter-spacing: 0.0px">VFG1798 46588 &nbsp; hlb &nbsp; &nbsp; beta-hemolysin &nbsp;Staphylococcus aureus &nbsp; VF0002</span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233; min-height: 15.0px"><span style="letter-spacing: 0.0px"></span><br /></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233"><span style="letter-spacing: 0.0px">The second column contains GI numbers (without the usual "gi|" prefix). &nbsp;The goal&nbsp;was to extract the set of FIGfams containing these genes, along with a list of the PEGs&nbsp;(and their functions).</span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233; min-height: 15.0px"><span style="letter-spacing: 0.0px"></span><br /></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233"><span style="letter-spacing: 0.0px">We can do these easily in two steps. We'll use the "svr" command line tools in this example. If you are using Windows, be sure to remember to use the <b>myRAST</b> shell, &nbsp;or, if you are using a Mac, &nbsp;ensure that the svr commands are in your path with this command in your shell window:</span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233; min-height: 15.0px"><span style="letter-spacing: 0.0px"></span><br /></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><b>export PATH=$PATH:/Applications/myRAST.app/bin</b></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica; min-height: 14.0px"><span style="letter-spacing: 0.0px"></span><br /></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px">Our solution requires that the servers use the data from the PSEED, so be sure to run this command in your shell window:</span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica; min-height: 14.0px"><span style="letter-spacing: 0.0px"></span><br /></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px">Mac users: &nbsp;<b>export SAS_SERVER=PSEED</b></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px">Windows Users:&nbsp;<b>set SAS_SERVER=PSEED</b></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 15.0px Arial; min-height: 17.0px"><span style="letter-spacing: 0.0px"></span><br /></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233"><span style="letter-spacing: 0.0px">&nbsp;Now, to begin, &nbsp;let's try</span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233; min-height: 15.0px"><span style="letter-spacing: 0.0px"></span><br /></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233"><span style="letter-spacing: 0.0px">svr_ids_to_figfams -c=2 -source=NCBI &lt; known.virulence.factors 2&gt; no.matches | svr_figfams_to_ids &gt; with.figfams</span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233; min-height: 15.0px"><span style="letter-spacing: 0.0px"></span><br /></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233"><span style="letter-spacing: 0.0px">This produces two files. &nbsp;The file "no.matches" contains a list of the lines from&nbsp;the input file ("known.virulence.factors") that have not yet been placed into FIGfams.&nbsp;The second file ("with.figfams") is a file in which two columns have been added: a FIGfam number&nbsp;and a PEG from that FIGfam. &nbsp;Now, if we wished to add a function of the PEG, as well as&nbsp;a readable version of the genome it is from, we would use</span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233; min-height: 15.0px"><span style="letter-spacing: 0.0px"></span><br /></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233"><span style="letter-spacing: 0.0px">svr_gene_data function genome-name &lt; with.figfams &gt; desired.table.txt</span></p><p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233"><span style="letter-spacing: 0.0px"><br /></span></p><p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px Arial; color: #333233"></p><p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 14.0px Helvetica"><span style="letter-spacing: 0.0px">If the user needs to see the "aliases" for the PEGs that have been found, they might try</span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 14.0px Helvetica; min-height: 17.0px"><span style="letter-spacing: 0.0px"></span><br /></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 14.0px Helvetica"><span style="letter-spacing: 0.0px">svr_aliases_of -c=9 &lt; desired.table.txt &gt; desired.table.aliases.txt</span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 14.0px Helvetica; min-height: 17.0px"><span style="letter-spacing: 0.0px"></span><br /></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 14.0px Helvetica"><span style="letter-spacing: 0.0px">but be warned: we return all of the IDs of protein sequences that we know about that have the same&nbsp;</span>protein sequence (from a potentially large collection of very similar genomes).</p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 14.0px Helvetica; min-height: 17.0px"><span style="letter-spacing: 0.0px"></span><br /></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 14.0px Helvetica"><span style="letter-spacing: 0.0px">Finally, we should note that the initial virulence factors may have multiple entries leading to&nbsp;</span>the same FIGfam (and, hence, to the same sets of PEGs). &nbsp;This can lead to the situation &nbsp;where multiple roles, while not identical, would have information on the same PEG. &nbsp;To&nbsp;collapse the table to unique, you need to sort on the 9th field, assuming tab-delimited fields.</p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 14.0px Helvetica"><span style="letter-spacing: 0.0px">This can be done using</span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 14.0px Helvetica; min-height: 17.0px"><span style="letter-spacing: 0.0px"></span><br /></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 14.0px Helvetica"><span style="letter-spacing: 0.0px"><span class="Apple-tab-span" style="white-space:pre">	</span>sort -k 9 -t $'\t' -u &lt; desired.table.aliases.txt &nbsp;&gt; sorted.txt</span></p><p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 14.0px Helvetica"><span style="letter-spacing: 0.0px"><br /></span></p><p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 14.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><br /></font></span></p><p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 14.0px Helvetica"><span style="letter-spacing: 0.0px"><p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">Let us consider an somewhat more useful alternative.</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica; min-height: 14.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><span style="letter-spacing: 0.0px"></span><br /></font></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">We begin by just connecting the aliases in column 2 to PEGs in the SEED:</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica; min-height: 14.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><span style="letter-spacing: 0.0px"></span><br /></font></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">svr_aliases_to_pegs -c=2 -source=NCBI -protein=1 &lt; known.virulence.factors 2&gt; no.matches.1 &gt; with.peg</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica; min-height: 14.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><span style="letter-spacing: 0.0px"></span><br /></font></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">This produces a file (no.matches.1) of lines that could not be connected to PEGs.</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica; min-height: 14.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><span style="letter-spacing: 0.0px"></span><br /></font></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">Now,</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica; min-height: 14.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><span style="letter-spacing: 0.0px"></span><br /></font></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">svr_ids_to_figfams &lt; with.peg &gt; with.ff 2&gt; no.matches.to.ff</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica; min-height: 14.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><span style="letter-spacing: 0.0px"></span><br /></font></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">can be used to add a FIGfam function and FIGfam ID to the end of each line</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">for which the identified PEG is in a FIGfam.&nbsp; Those lines that do not have a PEG</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">get written to a file (no.matches.to.ff).</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica; min-height: 14.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><span style="letter-spacing: 0.0px"></span><br /></font></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">Now, it is worth noting that we may have many lines pointing at the same FIGfam (i.e.,</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">many lines with the same FIGfam ID in the last column, which is column 9).</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">By using</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica; min-height: 14.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><span style="letter-spacing: 0.0px"></span><br /></font></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">&nbsp;&nbsp; sort -u -k 9 -t$'\t' with.ff &gt; with.ff.sorted</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica; min-height: 14.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><span style="letter-spacing: 0.0px"></span><br /></font></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">we sort the file on the FIGfam ID, deleting all but one of each set that share</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">a common FIGfam ID.</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica; min-height: 14.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><span style="letter-spacing: 0.0px"></span><br /></font></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">Now,</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica; min-height: 14.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><span style="letter-spacing: 0.0px"></span><br /></font></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">svr_figfams_to_ids &lt; with.ff.sorted &gt; with.ff.peg</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica; min-height: 14.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><span style="letter-spacing: 0.0px"></span><br /></font></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">can be used to expand each input line to a set of lines, each one containing a&nbsp;</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">distinct PEG from the FIGfam.&nbsp; Using&nbsp;</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica; min-height: 14.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><span style="letter-spacing: 0.0px"></span><br /></font></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">svr_gene_data function genome-name &lt; with.ff.peg &gt; complete</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica; min-height: 14.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><span style="letter-spacing: 0.0px"></span><br /></font></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">we add the PEG function and the genome-name to the end of each row.</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica; min-height: 14.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><span style="letter-spacing: 0.0px"></span><br /></font></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">Finally, we run</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica; min-height: 14.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><span style="letter-spacing: 0.0px"></span><br /></font></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">svr_aliases_of -c 10 &lt; complete &gt; complete.with.aliases</font></span></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica; min-height: 14.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><span style="letter-spacing: 0.0px"></span><br /></font></p>
<p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; ">to add known aliases for each PEG as the last column in the table.</font></span></p><div><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><br /></font></span></div></span></p><p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 14.0px Helvetica"><span style="letter-spacing: 0.0px"><font class="Apple-style-span" style="font-size: 1.25em; "><br /></font></span></p><p style="margin: 0.0px 0.0px 0.0px 0.0px; font: 14.0px Helvetica"><font class="Apple-style-span" style="font-size: 1.25em; "><br /></font></p><p></p></div><div><span style="letter-spacing: 0.0px"><br /></span></div> ]]>
        
    </content>
</entry>

<entry>
    <title>Accessing the Wireless Network at Argonne Tutorials and Workshops</title>
    <link rel="alternate" type="text/html" href="http://blog.theseed.org/servers/2010/10/accessing-the-wireless-network-at-argonne-tutorials-and-workshops.html" />
    <id>tag:blog.theseed.org,2010:/servers//12.842</id>

    <published>2010-10-04T14:20:34Z</published>
    <updated>2010-11-18T20:08:51Z</updated>

    <summary>When you are at one of our tutorials onsite at the Argonne National Laboratory, here are the instructions for accessing the wireless network.Turn on your wireless adapter if it is not already active.Connect to wireless network ANLTCSG-Guest (if you have...</summary>
    <author>
        <name>The SEED Team</name>
        
    </author>
    
        <category term="Workshop" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en-us" xml:base="http://blog.theseed.org/servers/">
        <![CDATA[When you are at one of our tutorials onsite at the Argonne National Laboratory, here are the instructions for accessing the wireless network.<br /><br /><ol><li>Turn on your wireless adapter if it is not already active.</li><li>Connect to wireless network<b> ANLTCSG-Guest</b> (if you have a normal 802.11b/g/n adapter) or <b>ANLTCSA-Guest</b> (if you have a 5Ghz-band 802.11a adapter).</li><li>Bring up a web browser and browse to <a href="http://wireless.anl.gov/">wireless.anl.gov</a> (or any other website). Instead of the website you've chosen, an ANL network page will come up.</li><li>Read the agreement and click the link at the bottom, then fill out the form that comes up.</li><li>You may need to wait a bit and turn your wireless off and back on again, or possibly reboot (but probably not that). If the registration page comes up again, reload the page in the browser. It might have cached the registration page for you and helpfully prevented you from seeing the real page.</li><li>If you are using Windows, you may be asked what type of network this is. Choose <i>Public</i>.<br /></li></ol>]]>
        
    </content>
</entry>

<entry>
    <title>An Etude Relating to a Metagenomics Sample</title>
    <link rel="alternate" type="text/html" href="http://blog.theseed.org/servers/2010/09/an-etude-relating-to-a-metagenomics-sample.html" />
    <id>tag:blog.theseed.org,2010:/servers//12.802</id>

    <published>2010-09-27T16:10:48Z</published>
    <updated>2010-09-27T16:15:49Z</updated>

    <summary><![CDATA[In another short note, I suggested using&nbsp;svr_assign_to_dna_using_figfams &lt; MG.sample | svr_summarize_MG_output &gt; function.summary 2&gt; otu.summaryto get summaries of function and population from a sample in the fileMG.sample.A participant in one of our tutorials took a sample and ran it throughboth...]]></summary>
    <author>
        <name>The SEED Team</name>
        
    </author>
    
        <category term="Tutorials" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en-us" xml:base="http://blog.theseed.org/servers/">
        <![CDATA[<div>In another short note, I suggested using</div><div><br /></div><div>&nbsp;svr_assign_to_dna_using_figfams &lt; MG.sample | svr_summarize_MG_output &gt; function.summary 2&gt; otu.summary</div><div><br /></div><div>to get summaries of function and population from a sample in the file</div><div>MG.sample.</div><div><br /></div><div>A participant in one of our tutorials took a sample and ran it through</div><div>both the two commands above and did a more thorough analysis using</div><div>MG-RAST. &nbsp;The estimates of the most highly represented phylogenetic</div><div>groups differed somewhat, and the participant asked me to look into</div><div>it.</div><div><br /></div><div>I am going to use this request as a motivation for showing how one</div><div>might use the server scripts effectively.</div><div><br /></div><div><br /></div> ]]>
        <![CDATA[<div><br /></div><div><div>To begin with, I suggest getting a feel for how the server scripts</div><div>might assign function and OTUs to data we think we understand. &nbsp;I</div><div>decicded to take the contigs of E.coli K12 and run them through</div><div>the svr_assign_to_dna_using_figfams tool to see what came out.</div><div><br /></div><div>This requires that I get a fasta file of the E.coli contigs. &nbsp;To do</div><div>that, I used</div><div><br /></div><div>&nbsp;&nbsp;svr_contigs_in_genome &lt; e.coli.id | svr_dna_seq -fasta &gt; ecoli.contigs</div><div><br /></div><div>This constructs a file of the contigs that make up the E.coli genome.</div><div><br /></div><div>Then I ran</div><div><br /></div><div>&nbsp;&nbsp; &nbsp; svr_assign_to_dna_using_figfams -by_location &lt; ecoli.contigs &gt; kmer.dna.output.default</div><div><br /></div><div>This produces the output of the tool on a piece of DNA that is well anotated.</div><div>I asked to have the proposed hits displayed in order by</div><div>location so that I could compare the proposed hits (and corresponding</div><div>OTUs) against the usual annotations of the E.coli K12 genome.</div><div><br /></div><div>The first few lines of output were as follows:</div><div><br /></div><div>Contig &nbsp; &nbsp; &nbsp; &nbsp;# hits &nbsp; &nbsp;location (region) &nbsp; &nbsp; &nbsp; proposed function &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; representative of an OTU</div><div><br /></div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>14<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_190_253<span class="Apple-tab-span" style="white-space: pre; ">	</span>Thr operon leader peptide<span class="Apple-tab-span" style="white-space: pre; ">	</span>Escherichia coli K12</div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>815<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_248_2797<span class="Apple-tab-span" style="white-space: pre; ">	</span>Aspartokinase (EC 2.7.2.4) / Homoserine dehydrogenase (EC 1.1.1.3)<span class="Apple-tab-span" style="white-space: pre; ">	</span>Escherichia coli K12</div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>5<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_572_1625<span class="Apple-tab-span" style="white-space: pre; ">	</span>hypothetical protein<span class="Apple-tab-span" style="white-space: pre; ">	</span>Anaeromyxobacter sp. K</div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>305<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_2801_3728<span class="Apple-tab-span" style="white-space: pre; ">	</span>Homoserine kinase (EC 2.7.1.39)<span class="Apple-tab-span" style="white-space: pre; ">	</span>Escherichia coli K12</div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>4<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_3213_3180<span class="Apple-tab-span" style="white-space: pre; ">	</span>Glycerol kinase (EC 2.7.1.30)<span class="Apple-tab-span" style="white-space: pre; ">	</span>Mycobacterium smegmatis str. MC2 155</div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>420<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_3734_5018<span class="Apple-tab-span" style="white-space: pre; ">	</span>Threonine synthase (EC 4.2.3.1)<span class="Apple-tab-span" style="white-space: pre; ">	</span>Escherichia coli K12</div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>136<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_5088_6022<span class="Apple-tab-span" style="white-space: pre; ">	</span>hypothetical protein<span class="Apple-tab-span" style="white-space: pre; ">	</span>Escherichia coli K12</div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>5<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_5600_5564<span class="Apple-tab-span" style="white-space: pre; ">	</span>Ribonucleotide reductase of class Ia (aerobic), beta subunit (EC 1.17.4.1)<span class="Apple-tab-span" style="white-space: pre; ">	</span>Escherichia coli K12</div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>100<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_6245_2551<span class="Apple-tab-span" style="white-space: pre; ">	</span>hypothetical protein<span class="Apple-tab-span" style="white-space: pre; ">	</span>Escherichia coli K12</div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>249<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_6459_5685<span class="Apple-tab-span" style="white-space: pre; ">	</span>UPF0246 protein YaaA<span class="Apple-tab-span" style="white-space: pre; ">	</span>Escherichia coli K12</div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>5<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_6967_7223<span class="Apple-tab-span" style="white-space: pre; ">	</span>hypothetical protein<span class="Apple-tab-span" style="white-space: pre; ">	</span>Azoarcus sp. EbN1</div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>3<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_7893_8614<span class="Apple-tab-span" style="white-space: pre; ">	</span>hypothetical protein<span class="Apple-tab-span" style="white-space: pre; ">	</span>Tsukamurella paurometabola DSM 20162</div><div><br /></div><div>----------------------</div><div><br /></div><div>To get the standard annotations for E.coli, I ran</div><div><br /></div><div>&nbsp;&nbsp; &nbsp;svr_all_features 83333.1 peg | svr_function_of | svr_gene_data -c 1 location &gt; ecoli.pegs</div><div><br /></div><div>The first few lines of output were</div><div><br /></div><div>fig|83333.1.peg.1<span class="Apple-tab-span" style="white-space: pre; ">	</span>Thr operon leader peptide<span class="Apple-tab-span" style="white-space: pre; ">	</span>83333.1:NC_000913_190+66</div><div>fig|83333.1.peg.2<span class="Apple-tab-span" style="white-space: pre; ">	</span>Aspartokinase (EC 2.7.2.4) / Homoserine dehydrogenase (EC 1.1.1.3)<span class="Apple-tab-span" style="white-space: pre; ">	</span>83333.1:NC_000913_337+2463</div><div>fig|83333.1.peg.3<span class="Apple-tab-span" style="white-space: pre; ">	</span>Homoserine kinase (EC 2.7.1.39)<span class="Apple-tab-span" style="white-space: pre; ">	</span>83333.1:NC_000913_2801+933</div><div>fig|83333.1.peg.4<span class="Apple-tab-span" style="white-space: pre; ">	</span>Threonine synthase (EC 4.2.3.1)<span class="Apple-tab-span" style="white-space: pre; ">	</span>83333.1:NC_000913_3734+1287</div><div>fig|83333.1.peg.5<span class="Apple-tab-span" style="white-space: pre; ">	</span>hypothetical protein<span class="Apple-tab-span" style="white-space: pre; ">	</span>83333.1:NC_000913_5234+297</div><div>fig|83333.1.peg.6<span class="Apple-tab-span" style="white-space: pre; ">	</span>UPF0246 protein YaaA<span class="Apple-tab-span" style="white-space: pre; ">	</span>83333.1:NC_000913_6459-777</div><div>fig|83333.1.peg.7<span class="Apple-tab-span" style="white-space: pre; ">	</span>Putative alanine/glycine transport protein<span class="Apple-tab-span" style="white-space: pre; ">	</span>83333.1:NC_000913_7959-1431</div><div>fig|83333.1.peg.8<span class="Apple-tab-span" style="white-space: pre; ">	</span>Transaldolase (EC 2.2.1.2)<span class="Apple-tab-span" style="white-space: pre; ">	</span>83333.1:NC_000913_8238+954</div><div><br /></div><div>----------------------</div><div><br /></div><div>If you compare the two outputs briefly, it becomes apparent that regions with hits of 5 or less&nbsp;</div><div>represent spurious hits. &nbsp;It is clear that the default parameters are producing matches that are not real.</div><div><br /></div><div>You have several parameters that determine the behavior of the "kmer searches":</div><div><br /></div><div>&nbsp;&nbsp; &nbsp;1. The most obvious is the size of the kmers. &nbsp;In protein sequences we use a default size of</div><div>&nbsp;&nbsp; &nbsp; &nbsp; 8 amino acids. &nbsp;An 8-mer is a "signature", if it has only been seen in proteins with a&nbsp;</div><div>&nbsp;&nbsp; &nbsp; &nbsp; given function. &nbsp;In DNA searches, if we use kmers of length 8, we search for 24-character</div><div>&nbsp;&nbsp; &nbsp; &nbsp; sequences that translate to a "signature kmer". &nbsp;Looking at the short piece of output relating</div><div>&nbsp;&nbsp; &nbsp; &nbsp; to E.coli, it is clear that using kmers of length 8 is (in that case) leading to a situation in</div><div>&nbsp;&nbsp; &nbsp; &nbsp; which there is a low-level (say, 1%) of the hits that are "spurious" in the sense that the</div><div>&nbsp;&nbsp; &nbsp; &nbsp; matches are almost certainly not instances of the specified function (which is usually "hypothetical</div><div>&nbsp;&nbsp; &nbsp; &nbsp; protein".</div><div><br /></div><div>&nbsp;&nbsp; &nbsp;2. A minimum number of hits (occurrences of a "signature kmer" within the region). &nbsp;In the short piece&nbsp;</div><div>&nbsp;&nbsp; &nbsp; &nbsp; of output displayed above, we were using a minimum number of hits of 3. &nbsp;It is clear from the</div><div>&nbsp;&nbsp; &nbsp; &nbsp; output that, with kmers of size 8, we were getting hits of as many as five due to random</div><div>&nbsp;&nbsp; &nbsp; &nbsp; occurrences within the DNA strings.</div><div><br /></div><div>&nbsp;&nbsp; &nbsp;3. A "maxGap" parameter requires that the gap between occurrences of "signature kmers" be less than</div><div>&nbsp;&nbsp; &nbsp; &nbsp; or equal to the specified value for the hits to be grouped into a single region. &nbsp;We were using a</div><div>&nbsp;&nbsp; &nbsp; &nbsp; length of 600, which is probably too long.</div><div><br /></div><div>&nbsp;&nbsp; &nbsp;4. Finally, we set a minimum size for a detected region. &nbsp;Note that you could recognize 5 occurrences</div><div>&nbsp;&nbsp; &nbsp; &nbsp; of 8-mers within a DNA sequence of length 28, which might or might not be desirable.</div><div><br /></div><div>I reran the experiment using kmers of length 9, a minimum number of hits of 3, a maximum gap of 300,&nbsp;</div><div>and a minimum size of 48. &nbsp;I did this by running</div><div><br /></div><div>&nbsp;&nbsp; &nbsp;svr_assign_to_dna_using_figfams -by_location -kmer 9 -maxGap 300 -minSize 48 -reliability 3 &lt; ecoli.contigs &gt; better</div><div><br /></div><div>This produced the following lines for the corresponding section of the E.coli genome:</div><div><br /></div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>13<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_190_253<span class="Apple-tab-span" style="white-space: pre; ">	</span>Thr operon leader peptide<span class="Apple-tab-span" style="white-space: pre; ">	</span>Escherichia coli K12</div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>797<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_248_2797<span class="Apple-tab-span" style="white-space: pre; ">	</span>Aspartokinase (EC 2.7.2.4) / Homoserine dehydrogenase (EC 1.1.1.3)<span class="Apple-tab-span" style="white-space: pre; ">	</span>Escherichia coli K12</div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>290<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_2801_3731<span class="Apple-tab-span" style="white-space: pre; ">	</span>Homoserine kinase (EC 2.7.1.39)<span class="Apple-tab-span" style="white-space: pre; ">	</span>Escherichia coli K12</div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>407<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_3734_5018<span class="Apple-tab-span" style="white-space: pre; ">	</span>Threonine synthase (EC 4.2.3.1)<span class="Apple-tab-span" style="white-space: pre; ">	</span>Escherichia coli K12</div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>122<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_5088_5674<span class="Apple-tab-span" style="white-space: pre; ">	</span>hypothetical protein<span class="Apple-tab-span" style="white-space: pre; ">	</span>Escherichia coli K12</div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>76<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_5666_5312<span class="Apple-tab-span" style="white-space: pre; ">	</span>hypothetical protein<span class="Apple-tab-span" style="white-space: pre; ">	</span>Escherichia coli K12</div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>238<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_6459_5685<span class="Apple-tab-span" style="white-space: pre; ">	</span>UPF0246 protein YaaA<span class="Apple-tab-span" style="white-space: pre; ">	</span>Escherichia coli K12</div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>425<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_7959_6531<span class="Apple-tab-span" style="white-space: pre; ">	</span>Putative alanine/glycine transport protein<span class="Apple-tab-span" style="white-space: pre; ">	</span>Escherichia coli K12</div><div>NC_000913<span class="Apple-tab-span" style="white-space: pre; ">	</span>310<span class="Apple-tab-span" style="white-space: pre; ">	</span>NC_000913_8178_9189<span class="Apple-tab-span" style="white-space: pre; ">	</span>Transaldolase (EC 2.2.1.2)<span class="Apple-tab-span" style="white-space: pre; ">	</span>Escherichia coli K12</div><div><br /></div><div><br /></div><div>This is substantially better. &nbsp;There is one bad call, but I suspect that it reflects actual</div><div>annotations in other close E.coli genomes. &nbsp;</div><div><br /></div><div>Now let us see whether the differences in parameters effects our ability to estimate populations in a real</div><div>metagenomic sample. &nbsp;To see I ran</div><div><br /></div><div>&nbsp;&nbsp; svr_assign_to_dna_using_figfams -kmer 8 -maxGap 600 -minSize 30 -reliability 3 &lt; MG.sample | svr_summarize_MG_output &gt; function.summary.loose &nbsp;2&gt; otu.summary.loose</div><div>and</div><div>&nbsp;&nbsp; svr_assign_to_dna_using_figfams -kmer 9 -maxGap 300 -minSize 54 -reliability 3 &lt; MG.sample | svr_summarize_MG_output &gt; function.summary.tighter &nbsp;2&gt; otu.summary.tighter</div><div><br /></div><div>The differences in estimates of population (i.e., a tabulation of the counts of hits against distinct OTUs) was</div><div><br /></div><div>loose:</div><div><br /></div><div>1714<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.110860<span class="Apple-tab-span" style="white-space: pre; ">	</span>Staphylococcus aureus subsp. aureus COL</div><div>780<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.050450<span class="Apple-tab-span" style="white-space: pre; ">	</span>Pseudomonas fluorescens SBW25</div><div>568<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.036738<span class="Apple-tab-span" style="white-space: pre; ">	</span>Acinetobacter baumannii AB307-0294</div><div>510<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.032986<span class="Apple-tab-span" style="white-space: pre; ">	</span>Acidovorax avenae subsp. citrulli AAC00-1</div><div>473<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.030593<span class="Apple-tab-span" style="white-space: pre; ">	</span>Natranaerobius thermophilus JW/NM-WN-LF</div><div>443<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.028653<span class="Apple-tab-span" style="white-space: pre; ">	</span>Streptococcus pneumoniae TIGR4</div><div>412<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.026648<span class="Apple-tab-span" style="white-space: pre; ">	</span>Escherichia coli K12</div><div>351<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.022702<span class="Apple-tab-span" style="white-space: pre; ">	</span>Stackebrandtia nassauensis DSM 44728</div><div>331<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.021409<span class="Apple-tab-span" style="white-space: pre; ">	</span>Verminephrobacter eiseniae EF01-2</div><div>287<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.018563<span class="Apple-tab-span" style="white-space: pre; ">	</span>Burkholderia cepacia R1808</div><div>269<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.017399<span class="Apple-tab-span" style="white-space: pre; ">	</span>Xanthomonas campestris pv. campestris ATCC 33913</div><div>268<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.017334<span class="Apple-tab-span" style="white-space: pre; ">	</span>Propionibacterium acnes KPA171202</div><div>234<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.015135<span class="Apple-tab-span" style="white-space: pre; ">	</span>Delftia acidovorans SPH-1</div><div>231<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.014941<span class="Apple-tab-span" style="white-space: pre; ">	</span>Synechococcus sp. WH 8102</div><div>227<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.014682<span class="Apple-tab-span" style="white-space: pre; ">	</span>Serratia marcescens Db11</div><div>222<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.014359<span class="Apple-tab-span" style="white-space: pre; ">	</span>Sphingomonas wittichii RW1</div><div>204<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.013194<span class="Apple-tab-span" style="white-space: pre; ">	</span>Burkholderia fungorum</div><div>193<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.012483<span class="Apple-tab-span" style="white-space: pre; ">	</span>Bradyrhizobium japonicum USDA 110</div><div>188<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.012160<span class="Apple-tab-span" style="white-space: pre; ">	</span>Bacillus anthracis str. 'Ames Ancestor'</div><div>165<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.010672<span class="Apple-tab-span" style="white-space: pre; ">	</span>Bordetella avium 197N</div><div>162<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.010478<span class="Apple-tab-span" style="white-space: pre; ">	</span>Sulfurospirillum deleyianum DSM 6946</div><div><br /></div><div>=================</div><div>tighter:</div><div><br /></div><div>1251<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.242301<span class="Apple-tab-span" style="white-space: pre; ">	</span>Staphylococcus aureus subsp. aureus COL</div><div>442<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.085609<span class="Apple-tab-span" style="white-space: pre; ">	</span>Pseudomonas fluorescens SBW25</div><div>329<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.063723<span class="Apple-tab-span" style="white-space: pre; ">	</span>Streptococcus pneumoniae TIGR4</div><div>325<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.062948<span class="Apple-tab-span" style="white-space: pre; ">	</span>Acidovorax avenae subsp. citrulli AAC00-1</div><div>319<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.061786<span class="Apple-tab-span" style="white-space: pre; ">	</span>Acinetobacter baumannii AB307-0294</div><div>232<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.044935<span class="Apple-tab-span" style="white-space: pre; ">	</span>Propionibacterium acnes KPA171202</div><div>227<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.043967<span class="Apple-tab-span" style="white-space: pre; ">	</span>Verminephrobacter eiseniae EF01-2</div><div>196<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.037962<span class="Apple-tab-span" style="white-space: pre; ">	</span>Escherichia coli K12</div><div>158<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.030602<span class="Apple-tab-span" style="white-space: pre; ">	</span>Serratia marcescens Db11</div><div>130<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.025179<span class="Apple-tab-span" style="white-space: pre; ">	</span>Delftia acidovorans SPH-1</div><div>107<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.020724<span class="Apple-tab-span" style="white-space: pre; ">	</span>Herminiimonas arsenicoxydans</div><div>69<span class="Apple-tab-span" style="white-space: pre; ">	</span>0.013364<span class="Apple-tab-span" style="white-space: pre; ">	</span>Polaromonas sp. JS666</div><div><br /></div><div>=================</div><div><br /></div><div>I have included counts of the OTUs that registered at least 1% of the total tabulated hits.</div><div><br /></div><div><br /></div><div>What does this mean?</div><div>--------------------</div><div><br /></div><div>I suppose that the obvious lessons are as follows:</div><div><br /></div><div>&nbsp;&nbsp;1. &nbsp;I am writing this short note largely to illustrate the power of the server scripts -- I</div><div>&nbsp;&nbsp; &nbsp; &nbsp;think that it should be clear that a user can learn a number of important lessons without writing custom Perl code.</div><div>&nbsp;&nbsp;</div><div>&nbsp;&nbsp;2. &nbsp;If one wished to do a serious analysis of a metagenomic sample using our simple tools, I would</div><div>&nbsp;&nbsp; &nbsp; &nbsp;encourage them to "calibrate" the tools on data they have already characterized.</div><div><br /></div><div>&nbsp;&nbsp;3. &nbsp;Using loose parameters to extract weak hits from lower-quality sequence should be viewed critically.</div><div>&nbsp;&nbsp; &nbsp; &nbsp;It seems to me that one might consider a 2-stage approach in which the population is first estimated</div><div>&nbsp;&nbsp; &nbsp; &nbsp;fairly conservatively (pulling out the clear matches), and then a second less stringent pass could be made</div><div>&nbsp;&nbsp; &nbsp; &nbsp;using the estimated population to "weight the results" (i.e., weak hits against OTUs that were shown to be</div><div>&nbsp;&nbsp; &nbsp; &nbsp;present by the first pass would be taken more seriously in the second pass).</div><div><br /></div><div>In any event, I hope that these comments make the tools somewhat more useful.</div><div><br /></div><div><br /></div></div>]]>
    </content>
</entry>

<entry>
    <title>Upcoming Tutorials</title>
    <link rel="alternate" type="text/html" href="http://blog.theseed.org/servers/2010/09/upcoming-tutorials.html" />
    <id>tag:blog.theseed.org,2010:/servers//12.792</id>

    <published>2010-09-21T17:02:13Z</published>
    <updated>2010-09-21T17:14:10Z</updated>

    <summary><![CDATA[On Aug 31 - Sept 2 we held a tutorial on use of the SEED servers, RAST, and myRAST to annotate genomes, formulate metabolic reconstructions, and model metabolic networks. &nbsp;We think that it went pretty well. &nbsp;Links to the presentations...]]></summary>
    <author>
        <name>The SEED Team</name>
        
    </author>
    
        <category term="News" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en-us" xml:base="http://blog.theseed.org/servers/">
        <![CDATA[<p class="MsoNormal" style="color: rgb(0, 0, 0); font-family: Helvetica; font-size: medium; "><p class="MsoNormal" style="color: rgb(0, 0, 0); font-family: Helvetica; font-size: medium; ">On Aug 31 - Sept 2 we held a tutorial on use of the SEED servers, RAST, and myRAST to annotate genomes, formulate metabolic reconstructions, and model metabolic networks. &nbsp;We think that it went pretty well. &nbsp;Links to the presentations used can be found <a href="http://blog.theseed.org/servers/rast-workshop-presentations.html">here</a>. &nbsp;We will use basically the same agenda on the four more classes that are now scheduled. These tutorials will be offered at Argonne National Laboratory on the following dates:&nbsp;</p><p class="MsoNormal" style="color: rgb(0, 0, 0); font-family: Helvetica; font-size: medium; ">· &nbsp; &nbsp; &nbsp; &nbsp;Oct 4-6 (Now full)</p><p class="MsoNormal" style="color: rgb(0, 0, 0); font-family: Helvetica; font-size: medium; ">· &nbsp; &nbsp; &nbsp; Oct 25-27</p><p class="MsoNormal" style="color: rgb(0, 0, 0); font-family: Helvetica; font-size: medium; ">· &nbsp; &nbsp; &nbsp; Nov 1-3</p><p class="MsoNormal" style="color: rgb(0, 0, 0); font-family: Helvetica; font-size: medium; ">· &nbsp; &nbsp; &nbsp; Nov 15-17</p><p class="MsoNormal" style="color: rgb(0, 0, 0); font-family: Helvetica; font-size: medium; ">If you are interested in attending any of these tutorials, please send a note to&nbsp;<a href="mailto:rast-tutorial@lists.mcs.anl.gov">rast-tutorial@lists.mcs.anl.gov</a>&nbsp;specifying the dates of interest (Oct 25-27, Nov 1-3 or Nov 15-17).<span>&nbsp;&nbsp;</span></p><div><br /></div></p><p class="MsoListParagraphCxSpLast" style="text-indent: -0.25in; color: rgb(0, 0, 0); font-family: Helvetica; font-size: medium; "><br /></p><p class="MsoListParagraphCxSpLast" style="text-indent: -0.25in; color: rgb(0, 0, 0); font-family: Helvetica; font-size: medium; "><br /></p><p class="MsoListParagraphCxSpLast" style="text-indent: -0.25in; color: rgb(0, 0, 0); font-family: Helvetica; font-size: medium; ">If</p><p class="MsoListParagraphCxSpLast" style="text-indent: -0.25in; color: rgb(0, 0, 0); font-family: Helvetica; font-size: medium; "><br /></p>]]>
        
    </content>
</entry>

<entry>
    <title>A Short Note on Mapping PEGs to Subsystems</title>
    <link rel="alternate" type="text/html" href="http://blog.theseed.org/servers/2010/09/a-short-note-on-mapping-pegs-to-subsystems.html" />
    <id>tag:blog.theseed.org,2010:/servers//12.782</id>

    <published>2010-09-14T16:32:16Z</published>
    <updated>2010-09-14T16:32:59Z</updated>

    <summary><![CDATA[A Short Note on Mapping PEGs to SubsystemsYou can use&nbsp;svr_all_features 83333.1 peg | svr_ids_to_subsystems 2&gt; /dev/null &gt; E.coli.pegs.in.subsystemsto get a 2-column file [PEG,Subsystem] for the PEGs in the genome with ID 83333.1(which happens to be E.coli). &nbsp;If you also want...]]></summary>
    <author>
        <name>The SEED Team</name>
        
    </author>
    
        <category term="Tutorials" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en-us" xml:base="http://blog.theseed.org/servers/">
        <![CDATA[<div>A Short Note on Mapping PEGs to Subsystems</div><div><br /></div><div>You can use</div><div><br /></div><div>&nbsp;svr_all_features 83333.1 peg | svr_ids_to_subsystems 2&gt; /dev/null &gt; E.coli.pegs.in.subsystems</div><div><br /></div><div>to get a 2-column file [PEG,Subsystem] for the PEGs in the genome with ID 83333.1</div><div>(which happens to be E.coli). &nbsp;If you also want the function of the PEGs, use</div><div><br /></div><div>&nbsp;svr_all_features 83333.1 peg | svr_ids_to_subsystems 2&gt; /dev/null | svr_function_of -c 1 &gt; E.coli.pegs.in.subsystems</div><div><br /></div><div><br /></div><div>The&nbsp;</div><div><br /></div><div>&nbsp;&nbsp; &nbsp;svr_function_of -c 1</div><div><br /></div><div>command takes as input a file in which each line is a tab-separated set of fields. &nbsp;The</div><div><br /></div><div><span class="Apple-tab-span" style="white-space:pre">	</span>-c 1</div><div><br /></div><div>says that the PEG ids for which functions are to be appended occur in the first column.</div><div><br /></div><div>This produces output like</div><div><br /></div><div>fig|83333.1.peg.2 &nbsp; &nbsp; &nbsp; CBSS-216591.1.peg.168 &nbsp; Aspartokinase (EC 2.7.2.4) / Homoserine dehydrogenase (EC 1.1.1.3)</div><div>fig|83333.1.peg.2 &nbsp; &nbsp; &nbsp; Lysine Biosynthesis DAP Pathway Aspartokinase (EC 2.7.2.4) / Homoserine dehydrogenase (EC 1.1.1.3)</div><div>fig|83333.1.peg.2 &nbsp; &nbsp; &nbsp; Threonine and Homoserine Biosynthesis &nbsp; Aspartokinase (EC 2.7.2.4) / Homoserine dehydrogenase (EC 1.1.1.3)</div><div>fig|83333.1.peg.2 &nbsp; &nbsp; &nbsp; Threonine and Homoserine Biosynthesis &nbsp; Aspartokinase (EC 2.7.2.4) / Homoserine dehydrogenase (EC 1.1.1.3)</div><div>fig|83333.1.peg.2 &nbsp; &nbsp; &nbsp; Methionine Biosynthesis Aspartokinase (EC 2.7.2.4) / Homoserine dehydrogenase (EC 1.1.1.3)</div><div>fig|83333.1.peg.3 &nbsp; &nbsp; &nbsp; Threonine and Homoserine Biosynthesis &nbsp; Homoserine kinase (EC 2.7.1.39)</div><div>fig|83333.1.peg.3 &nbsp; &nbsp; &nbsp; Methionine Biosynthesis Homoserine kinase (EC 2.7.1.39)</div><div>fig|83333.1.peg.3 &nbsp; &nbsp; &nbsp; CBSS-269482.1.peg.1294 &nbsp;Homoserine kinase (EC 2.7.1.39)</div><div>fig|83333.1.peg.4 &nbsp; &nbsp; &nbsp; Threonine and Homoserine Biosynthesis &nbsp; Threonine synthase (EC 4.2.3.1)</div><div>fig|83333.1.peg.6 &nbsp; &nbsp; &nbsp; YaaA &nbsp; &nbsp;UPF0246 protein YaaA</div><div><br /></div> ]]>
        
    </content>
</entry>

<entry>
    <title>A Short Note on Use of Server Scripts to Access Functional Coupling Scores</title>
    <link rel="alternate" type="text/html" href="http://blog.theseed.org/servers/2010/09/A Short Note on Use of Server Scripts to Access Functional Coupling Scores.html" />
    <id>tag:blog.theseed.org,2010:/servers//12.772</id>

    <published>2010-09-14T16:28:40Z</published>
    <updated>2010-09-14T19:52:20Z</updated>

    <summary><![CDATA[A Short Note on Use of Server Scripts to Access Functional Coupling ScoresThis short note will discuss the following command and what it accomplishes:svr_all_features 83333.1 peg | svr_ids_to_figfams | svr_fc_figfams -MinSc 100 | svr_figfams_to_ids 83333.1 | svr_function_of &gt; EC.dataIt chains...]]></summary>
    <author>
        <name>The SEED Team</name>
        
    </author>
    
        <category term="Tutorials" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en-us" xml:base="http://blog.theseed.org/servers/">
        <![CDATA[<div>A Short Note on Use of Server Scripts to Access Functional Coupling Scores</div><div><br /></div><div>This short note will discuss the following command and what it accomplishes:</div><div><br /></div><div><br /></div><div>svr_all_features 83333.1 peg | svr_ids_to_figfams | svr_fc_figfams -MinSc 100 | svr_figfams_to_ids 83333.1 | svr_function_of &gt; EC.data</div><div><br /></div><div>It chains together 5 svr scripts, which I will now discuss.</div><div><br /></div>]]>
        <![CDATA[<div><br /></div><div><div>&nbsp;&nbsp; svr_all_features 83333.1 pegs</div><div><br /></div><div>is used to generate a list of the features of type 'peg' that occur in the genome with ID 83333.1 (which is</div><div>Escherichia coli K12). &nbsp;It produces output of the form</div><div><br /></div><div>fig|83333.1.peg.1</div><div>fig|83333.1.peg.2</div><div>fig|83333.1.peg.3</div><div>fig|83333.1.peg.4</div><div>fig|83333.1.peg.5</div><div>fig|83333.1.peg.6</div><div>.</div><div>.</div><div>.</div><div><br /></div><div>&nbsp;&nbsp; svr_ids_to_figfams</div><div><br /></div><div>takes an input file if which one column is PEG IDs (by default the last column).</div><div>It writes two files. &nbsp;The file written to STDERR will be those lines of input in which&nbsp;the PEG was not included in any FIGfams. &nbsp;All other lines are written to STDOUT, and they&nbsp;will be the input line with two fields appended: the family function of the FIGfam containing&nbsp;the PEG and the FIGfam ID. &nbsp;Thus, the first few lines that are written to STDOUT would look like</div><div><br /></div><div>fig|83333.1.peg.1 &nbsp; &nbsp; &nbsp; Thr operon leader peptide &nbsp; &nbsp; &nbsp; FIG164298</div><div>fig|83333.1.peg.2 &nbsp; &nbsp; &nbsp; Aspartokinase (EC 2.7.2.4) / Homoserine dehydrogenase (EC 1.1.1.3) &nbsp; &nbsp; &nbsp;FIG000885</div><div>fig|83333.1.peg.3 &nbsp; &nbsp; &nbsp; Homoserine kinase (EC 2.7.1.39) FIG000582</div><div>fig|83333.1.peg.4 &nbsp; &nbsp; &nbsp; Threonine synthase (EC 4.2.3.1) FIG000134</div><div>fig|83333.1.peg.5 &nbsp; &nbsp; &nbsp; hypothetical protein &nbsp; &nbsp;FIG004675</div><div>fig|83333.1.peg.6 &nbsp; &nbsp; &nbsp; UPF0246 protein YaaA &nbsp; &nbsp;FIG002158</div><div>.</div><div>.</div><div>.</div><div><br /></div><div>The command</div><div><span class="Apple-tab-span" style="white-space: pre; ">		</span>svr_fc_figfams -MinSc 100</div><div><br /></div><div>takes an input stream in which one column (by default the last) contains a FIGfam ID.&nbsp;If the FIGfam does not show a tendency to co-occur (i.e., tends to occur within 5kb on &nbsp;the chromosome) with another FIGfam, it produces no output. &nbsp;However, if the FIGfam does&nbsp;tend to co-occur, a line will be written for every FIGfam that it tends to co-occur with.&nbsp;Two fields are added to the end of each output line: the number of OTUs in which the FIGfams co-occur and&nbsp;</div><div>the FIGfam ID of the co-occurring FIGfam. &nbsp;An OTU (for our purposes) is just a set of genomes &nbsp;that are very close phylogenetic neighbors. &nbsp;If you wish to get a list of the genomes that represent &nbsp;distinct OTUs, just run</div><div><br /></div><div><span class="Apple-tab-span" style="white-space: pre; ">	</span>&nbsp;svr_otus</div><div><br /></div><div>If you wish to also see the set of genomes included in each OTU, use</div><div><br /></div><div>&nbsp;&nbsp; &nbsp; &nbsp;&nbsp;<span class="Apple-tab-span" style="white-space: pre; ">	</span>&nbsp;svr_otus | svr_members_of_otu</div><div><br /></div><div>Anyway, the command&nbsp;</div><div><br /></div><div><span class="Apple-tab-span" style="white-space: pre; ">	</span>svr_fc_figfams -MinSc 100</div><div><br /></div><div>says "show me only FIGfams that occur close to the input FIGfam in at least 100 distinct OTUs".&nbsp;At this point in history (September, 2010), the SEED contains approximately 1000 distinct OTUs.</div><div><br /></div><div><br /></div><div>The command</div><div><br /></div><div><span class="Apple-tab-span" style="white-space: pre; ">	</span>svr_figfams_to_ids 83333.1&nbsp;</div><div><br /></div><div>takes an input stream (a tab-delimited table) that have a column containing FIGfam IDs (by&nbsp;default, the last column). &nbsp;The command line arguments say "restrict output to members of these&nbsp;genomes (in this case to a single genome -- 83333.1)". &nbsp;Foreach input line, a number of output lines will be written --</div><div>one for each PEG in the input FIGfam from the designated set of genomes).</div><div><br /></div><div>Finally, I used</div><div><br /></div><div><span class="Apple-tab-span" style="white-space: pre; ">	</span>&nbsp;svr_function_of</div><div><br /></div><div>to tack on the functions of the PEGs listed in the last column.</div><div><br /></div><div>I urger you to get used to thinking about this simple technique for extracting data.&nbsp;Used with the Unix tools of sort, cut, and grep they offer rich functionality.</div></div><div><br /></div>]]>
    </content>
</entry>

<entry>
    <title>Annotated list of Getting Started, Tutorial and Coding Examples</title>
    <link rel="alternate" type="text/html" href="http://blog.theseed.org/servers/2010/09/annotated-list-of-tutorial-and-coding-examples.html" />
    <id>tag:blog.theseed.org,2010:/servers//12.762</id>

    <published>2010-09-02T17:05:58Z</published>
    <updated>2010-09-02T20:43:56Z</updated>

    <summary><![CDATA[ Getting Started Links A page of links exploring the recent RAST tutorials RAST Tutorial Links &nbsp; &nbsp; An introduction to the Fellowship for the Interpretation of Genomes, creators of the SEED What is FIG &nbsp; &nbsp; How to get...]]></summary>
    <author>
        <name>The SEED Team</name>
        
    </author>
    
    
    <content type="html" xml:lang="en-us" xml:base="http://blog.theseed.org/servers/">
        <![CDATA[<!--StartFragment-->

<blockquote class="webkit-indent-blockquote" style="margin: 0 0 0 40px; border: none; padding: 0px;"><table class="MsoNormalTable" border="1" cellspacing="0" cellpadding="0" style="margin-left:-30.6pt;border-collapse:collapse;mso-table-layout-alt:fixed;
 border:none;mso-border-top-alt:solid #BCB5B0 1.0pt;mso-border-left-alt:solid #BCB5B0 1.0pt;
 mso-border-right-alt:solid #BCB5B0 1.0pt;mso-padding-alt:0in 5.4pt 0in 5.4pt">
 <tbody><tr style="mso-yfti-irow:0;mso-yfti-firstrow:yes">
  <td width="342" style="width:4.75in;border:solid #BCB5B0 1.0pt;border-right:
  none;background:#D4D5D5;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" align="center" style="text-align:center;line-height:13.0pt;
  mso-pagination:none;mso-layout-grid-align:none;text-autospace:none"><b><span style="font-size:11.0pt;font-family:&quot;Lucida Grande&quot;;mso-bidi-font-family:
  &quot;Lucida Grande&quot;;color:#202020;text-shadow:auto">Getting Started<o:p></o:p></span></b></p>
  </td>
  <td width="162" style="width:2.25in;border:solid #BCB5B0 1.0pt;background:#D4D5D5;
  padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:13.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><b><span style="font-size:11.0pt;font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020;text-shadow:auto">Links<o:p></o:p></span></b></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:1">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020">A page of links exploring
  the recent RAST tutorials<o:p></o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/09/rast-tutorial-links.html"><span style="color:#002653">RAST Tutorial Links</span></a><o:p></o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:2">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" align="center" style="text-align:center;line-height:14.0pt;
  mso-pagination:none;mso-layout-grid-align:none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;mso-bidi-font-family:&quot;Lucida Grande&quot;;
  color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:3">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020">An introduction to the
  Fellowship for the Interpretation of Genomes, creators of the SEED<o:p></o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/08/what-is-fig.html"><span style="color:#002653">What is FIG</span></a><o:p></o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:4">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" align="center" style="text-align:center;line-height:14.0pt;
  mso-pagination:none;mso-layout-grid-align:none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;mso-bidi-font-family:&quot;Lucida Grande&quot;;
  color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:5">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020">How to get a RAST account
  and access it to set your password<o:p></o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/08/video-tutorial-creating-a-rast-account.html"><span style="color:#002653">Video Tutorial: Creating a RAST Account</span></a><o:p></o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:6">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" align="center" style="text-align:center;line-height:14.0pt;
  mso-pagination:none;mso-layout-grid-align:none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;mso-bidi-font-family:&quot;Lucida Grande&quot;;
  color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:7">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020">How to download, install
  and try out the SEED servers client code<o:p></o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/06/accessing-the-seed-servers-getting-started.html"><span style="color:#002653">Accessing the SEED Servers: Getting Started</span></a><o:p></o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:8;mso-yfti-lastrow:yes">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
 </tr>
</tbody></table><br /></blockquote><blockquote class="webkit-indent-blockquote" style="margin: 0 0 0 40px; border: none; padding: 0px;"><!--StartFragment-->

<table class="MsoNormalTable" border="1" cellspacing="0" cellpadding="0" style="margin-left:-30.6pt;border-collapse:collapse;mso-table-layout-alt:fixed;
 border:none;mso-border-top-alt:solid #BCB5B0 1.0pt;mso-border-left-alt:solid #BCB5B0 1.0pt;
 mso-border-right-alt:solid #BCB5B0 1.0pt;mso-padding-alt:0in 5.4pt 0in 5.4pt">
 <tbody><tr style="mso-yfti-irow:0;mso-yfti-firstrow:yes">
  <td width="342" style="width:4.75in;border:solid #BCB5B0 1.0pt;border-right:
  none;background:#D4D5D5;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" align="center" style="text-align:center;line-height:13.0pt;
  mso-pagination:none;mso-layout-grid-align:none;text-autospace:none"><b><span style="font-size:11.0pt;font-family:&quot;Lucida Grande&quot;;mso-bidi-font-family:
  &quot;Lucida Grande&quot;;color:#202020;text-shadow:auto">Using the "svr" command line
  scripts<o:p></o:p></span></b></p>
  </td>
  <td width="162" style="width:2.25in;border:solid #BCB5B0 1.0pt;background:#D4D5D5;
  padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:13.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><b><span style="font-size:11.0pt;font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020;text-shadow:auto">Links<o:p></o:p></span></b></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:1">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal"><span style="font-family:&quot;Lucida Grande&quot;">Using svr
  command line scripts to retrieve all the features for a given genome. This is
  often the first step in an analysis sequence.</span><span style="font-family:
  &quot;Lucida Grande&quot;;mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p></o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/06/find-all-features-for-a-genome.html"><span style="color:#002653">Find all features for a genome.</span></a><o:p></o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:2">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal"><span style="font-family:&quot;Lucida Grande&quot;"><o:p>&nbsp;</o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:3">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-size:13.0pt;font-family:Arial;
  mso-bidi-font-family:Arial;color:#262626">Using the svr command-line scripts
  to download all the genes for a genome.</span><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p></o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/08/downloading-a-genome.html"><span style="color:#002653">Downloading a Genome</span></a><o:p></o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:4">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-size:13.0pt;font-family:Arial;
  mso-bidi-font-family:Arial;color:#262626"><o:p>&nbsp;</o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:5">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020">Using svr </span><span style="font-size:13.0pt;font-family:Arial;mso-bidi-font-family:Arial;
  color:#262626">command-line scripts to get the roles and features of a
  subsystem.</span><span style="font-family:&quot;Lucida Grande&quot;;mso-bidi-font-family:
  &quot;Lucida Grande&quot;;color:#202020"><o:p></o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/08/downloading-a-subsystem.html"><span style="color:#002653">Downloading a Subsystem</span></a><o:p></o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:6">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" align="center" style="text-align:center;line-height:14.0pt;
  mso-pagination:none;mso-layout-grid-align:none;text-autospace:none"><span style="font-size:13.0pt;font-family:Arial;mso-bidi-font-family:Arial;
  color:#262626"><o:p>&nbsp;</o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:7">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-size:13.0pt;font-family:Arial;
  mso-bidi-font-family:Arial;color:#262626">Using the svr command-line scripts
  to access the complete set of FIGfams.</span><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p></o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/08/downloading-the-figfams.html"><span style="color:#002653">Downloading the FIGfams</span></a><o:p></o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:8">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-size:13.0pt;font-family:Arial;
  mso-bidi-font-family:Arial;color:#262626"><o:p>&nbsp;</o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:9">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="mso-pagination:none;mso-layout-grid-align:none;
  text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;mso-bidi-font-family:
  &quot;Lucida Grande&quot;;color:#202020">Using the svr command line scripts to </span><span style="font-size:13.0pt;font-family:Arial;mso-bidi-font-family:Arial;
  color:#262626">access the Annotation Clearinghouse. &nbsp;How to see all of
  the assignments made by any group to a protein having the same sequence.<o:p></o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/06/getting-all-ids-aliases-and-assertions-of-function-for-one-or-more-protein-sequences.html"><span style="color:#002653">Getting all IDs, Aliases and Assertions of Function for
  One or more Protein sequences</span></a><o:p></o:p></span></p>
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:10">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal"><span style="font-family:&quot;Lucida Grande&quot;;mso-bidi-font-family:
  &quot;Lucida Grande&quot;;color:#202020">Using svr command line scripts to </span><span style="font-family:&quot;Lucida Grande&quot;">find PEGs that are in the neighborhood of
  a given PEG</span><span style="font-size:18.0pt;font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:Times">.</span><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p></o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/06/find-neighbors.html"><span style="color:#002653">Find Neighbors</span></a><o:p></o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:11">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal"><span style="font-family:&quot;Lucida Grande&quot;;mso-bidi-font-family:
  &quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:12">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal"><span style="font-family:&quot;Lucida Grande&quot;">Using svr
  command line scripts to see all the aliases by which a given feature or set
  of features is known in the SEED.</span><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p></o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/06/find-gene-aliases.html"><span style="color:#002653">Find Gene Aliases</span></a><o:p></o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:13">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal"><span style="font-family:&quot;Lucida Grande&quot;"><o:p>&nbsp;</o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:14">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal"><span style="font-family:&quot;Lucida Grande&quot;">Using svr command
  line scripts to retrieve the assigned function for each of a set of genes.</span><span style="font-family:&quot;Lucida Grande&quot;;mso-bidi-font-family:&quot;Lucida Grande&quot;;
  color:#202020"><o:p></o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/06/find-gene-function.html"><span style="color:#002653">Find Gene Function</span></a><o:p></o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:15">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal"><span style="font-family:&quot;Lucida Grande&quot;"><o:p>&nbsp;</o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:16">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal"><span style="font-family:&quot;Lucida Grande&quot;">Using svr
  command line scripts to retrieve all the features for a given genome. This is
  often the first step in an analysis sequence.</span><span style="font-family:
  &quot;Lucida Grande&quot;;mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p></o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/06/find-all-features-for-a-genome.html"><span style="color:#002653">Find all features for a genome.</span></a><o:p></o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:17">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:18;mso-yfti-lastrow:yes">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="mso-pagination:none;mso-layout-grid-align:none;
  text-autospace:none"><span style="font-size:13.0pt;font-family:Arial;
  mso-bidi-font-family:Arial;color:#262626">Two methods of doing annotations
  with the SEED servers<o:p></o:p></span></p>
  <p class="MsoNormal" style="mso-pagination:none;mso-layout-grid-align:none;
  text-autospace:none"><span style="font-size:13.0pt;font-family:Arial;
  mso-bidi-font-family:Arial;color:#262626"><o:p>&nbsp;</o:p></span></p>
  <p class="MsoNormal" style="mso-pagination:none;mso-layout-grid-align:none;
  text-autospace:none"><span style="font-size:13.0pt;font-family:Arial;
  mso-bidi-font-family:Arial;color:#262626">1. How to use svr command line
  scripts to do genome annotation: call the RNA-encoding genes and
  protein-encoding genes, asserts &nbsp;functions for the gene products, and
  produces an initial metabolic reconstruction.<o:p></o:p></span></p>
  <p class="MsoNormal" style="mso-pagination:none;mso-layout-grid-align:none;
  text-autospace:none"><span style="font-size:13.0pt;font-family:Arial;
  mso-bidi-font-family:Arial;color:#262626"><o:p>&nbsp;</o:p></span></p>
  <p class="MsoNormal" style="mso-pagination:none;mso-layout-grid-align:none;
  text-autospace:none"><span style="font-size:13.0pt;font-family:Arial;
  mso-bidi-font-family:Arial;color:#262626">2.<span style="mso-spacerun:
  yes">&nbsp; </span>How to use the Server Perl API to call the RNAs, PEGs,
  identify the functions, and generate an initial
  metabolic&nbsp;reconstruction.<o:p></o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/06/annotating-a-genome-via-the-seed-servers-via-the-command-line-or-perl-code.html"><span style="color:#002653">Annotating a genome using the SEED servers (via the command
  line or Perl code)</span></a><o:p></o:p></span></p>
  </td>
 </tr>
</tbody></table><br />

<!--EndFragment-->


</blockquote><blockquote class="webkit-indent-blockquote" style="margin: 0 0 0 40px; border: none; padding: 0px;"><br /></blockquote><blockquote class="webkit-indent-blockquote" style="margin: 0 0 0 40px; border: none; padding: 0px;"><!--StartFragment-->

<table class="MsoNormalTable" border="1" cellspacing="0" cellpadding="0" style="margin-left:-30.6pt;border-collapse:collapse;mso-table-layout-alt:fixed;
 border:none;mso-border-top-alt:solid #BCB5B0 1.0pt;mso-border-left-alt:solid #BCB5B0 1.0pt;
 mso-border-right-alt:solid #BCB5B0 1.0pt;mso-padding-alt:0in 5.4pt 0in 5.4pt">
 <tbody><tr style="mso-yfti-irow:0;mso-yfti-firstrow:yes">
  <td width="342" style="width:4.75in;border:solid #BCB5B0 1.0pt;border-right:
  none;background:#D4D5D5;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" align="center" style="text-align:center;line-height:13.0pt;
  mso-pagination:none;mso-layout-grid-align:none;text-autospace:none"><b><span style="font-size:11.0pt;font-family:&quot;Lucida Grande&quot;;mso-bidi-font-family:
  &quot;Lucida Grande&quot;;color:#202020;text-shadow:auto">Using the Server Perl API<o:p></o:p></span></b></p>
  </td>
  <td width="162" style="width:2.25in;border:solid #BCB5B0 1.0pt;background:#D4D5D5;
  padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:13.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><b><span style="font-size:11.0pt;font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020;text-shadow:auto">Links<o:p></o:p></span></b></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:1">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal"><span style="font-family:&quot;Lucida Grande&quot;"><o:p>&nbsp;</o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:2">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" align="center" style="text-align:center;line-height:14.0pt;
  mso-pagination:none;mso-layout-grid-align:none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;mso-bidi-font-family:&quot;Lucida Grande&quot;;
  color:#202020">Using the Server Perl API to:<o:p></o:p></span></p>
  <p class="MsoListParagraphCxSpFirst" style="text-indent:-.25in;mso-list:l0 level1 lfo1"><span style="font-family:&quot;Lucida Grande&quot;;mso-fareast-font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><span style="mso-list:
  Ignore">1.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp; </span></span></span><span style="font-family:&quot;Lucida Grande&quot;">Identify Genes<o:p></o:p></span></p>
  <p class="MsoListParagraphCxSpMiddle" style="text-indent:-.25in;mso-list:l0 level1 lfo1"><span style="font-family:&quot;Lucida Grande&quot;;mso-fareast-font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><span style="mso-list:
  Ignore">2.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp; </span></span></span><span style="font-family:&quot;Lucida Grande&quot;">Assign Functions to Encoded Proteins<o:p></o:p></span></p>
  <p class="MsoListParagraphCxSpLast" style="text-indent:-.25in;mso-list:l0 level1 lfo1"><span style="font-family:&quot;Lucida Grande&quot;;mso-fareast-font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><span style="mso-list:
  Ignore">3.<span style="font:7.0pt &quot;Times New Roman&quot;">&nbsp;&nbsp; </span></span></span><span style="font-family:&quot;Lucida Grande&quot;">Create a Metabolic Reconstruction</span><o:p></o:p></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/06/services-to-support-annotation-of-genes.html"><span style="color:#002653">Services to Support Annotation of Genes</span></a><o:p></o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:3">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" align="center" style="text-align:center;line-height:14.0pt;
  mso-pagination:none;mso-layout-grid-align:none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;mso-bidi-font-family:&quot;Lucida Grande&quot;;
  color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:4">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal"><span style="font-family:&quot;Lucida Grande&quot;">Discussion of
  the example (Example 4) that uses the SEED servers Perl API<span style="mso-spacerun: yes">&nbsp; </span>to access to the data used to compute
  co-occurrence scores.</span><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p></o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/06/example-4-access-to-functional-coupling-conserved-contiguity-data.html"><span style="color:#002653">Access to Functional Coupling (Conserved Contiguity)
  Data</span></a><o:p></o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:5">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal"><span style="font-family:&quot;Lucida Grande&quot;"><o:p>&nbsp;</o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:6">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal"><span style="font-family:&quot;Lucida Grande&quot;">Discussion of server
  Perl API Example 3 illustrating the functions required to determine the
  location of a SEED gene encoding a specific protein and to acquire the genes
  from a given region centered on that location.</span><span style="font-family:
  &quot;Lucida Grande&quot;;mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p></o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/06/example-3-creating-custom-interfaces.html"><span style="color:#002653">Creating Custom Interfaces</span></a><o:p></o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:7">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal"><span style="font-family:&quot;Lucida Grande&quot;"><o:p>&nbsp;</o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:8">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal"><span style="font-family:&quot;Lucida Grande&quot;;mso-bidi-font-family:
  &quot;Lucida Grande&quot;;color:#202020">Using the server Perl API </span><span style="font-family:&quot;Lucida Grande&quot;">to identify the subsystems can be inferred
  from a<span style="mso-spacerun: yes">&nbsp; </span>set of functional roles.</span><span style="font-family:&quot;Lucida Grande&quot;;mso-bidi-font-family:&quot;Lucida Grande&quot;;
  color:#202020"><o:p></o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/06/example-2-metabolic-reconstructions-provided-for-complete-prokaryotic-genomes.html"><span style="color:#002653">Metabolic Reconstructions Provided for Complete
  Prokaryotic Genomes</span></a><o:p></o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:9">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal"><span style="font-family:&quot;Lucida Grande&quot;;mso-bidi-font-family:
  &quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F0F0F0;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:10;mso-yfti-lastrow:yes">
  <td width="342" valign="top" style="width:4.75in;border:none;border-left:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" align="center" style="text-align:center;line-height:14.0pt;
  mso-pagination:none;mso-layout-grid-align:none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;">Using the server Perl API
  to&nbsp;illustrate basic capabilities that relate to determining the set of
  IDs attached to specific protein sequences</span><span style="font-size:18.0pt;
  font-family:Times;mso-bidi-font-family:Times">.</span><span style="font-family:
  &quot;Lucida Grande&quot;;mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"> <o:p></o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border:none;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/06/-example-1-conversion-of-gene-and-protein-ids.html"><span style="color:#002653">Conversion of Gene and Protein IDs</span></a><o:p></o:p></span></p>
  </td>
 </tr>
</tbody></table><br />

<!--EndFragment-->


</blockquote><blockquote class="webkit-indent-blockquote" style="margin: 0 0 0 40px; border: none; padding: 0px;"><br /></blockquote><blockquote class="webkit-indent-blockquote" style="margin: 0 0 0 40px; border: none; padding: 0px;"><!--StartFragment-->

<table class="MsoNormalTable" border="1" cellspacing="0" cellpadding="0" style="margin-left:-30.6pt;border-collapse:collapse;mso-table-layout-alt:fixed;
 border:none;mso-border-top-alt:solid #BCB5B0 1.0pt;mso-border-left-alt:solid #BCB5B0 1.0pt;
 mso-border-right-alt:solid #BCB5B0 1.0pt;mso-padding-alt:0in 5.4pt 0in 5.4pt">
 <tbody><tr style="mso-yfti-irow:0;mso-yfti-firstrow:yes">
  <td width="342" style="width:4.75in;border:solid #BCB5B0 1.0pt;border-right:
  none;background:#D4D5D5;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" align="center" style="text-align:center;line-height:13.0pt;
  mso-pagination:none;mso-layout-grid-align:none;text-autospace:none"><b><span style="font-size:11.0pt;font-family:&quot;Lucida Grande&quot;;mso-bidi-font-family:
  &quot;Lucida Grande&quot;;color:#202020;text-shadow:auto">Metagenomics<o:p></o:p></span></b></p>
  </td>
  <td width="162" style="width:2.25in;border:solid #BCB5B0 1.0pt;background:#D4D5D5;
  padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:13.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><b><span style="font-size:11.0pt;font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020;text-shadow:auto">Links<o:p></o:p></span></b></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:1">
  <td width="342" valign="top" style="width:4.75in;border:none;border-left:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" align="center" style="text-align:center;line-height:14.0pt;
  mso-pagination:none;mso-layout-grid-align:none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;"><o:p>&nbsp;</o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border:none;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><o:p>&nbsp;</o:p></span></p>
  </td>
 </tr>
 <tr style="mso-yfti-irow:2;mso-yfti-lastrow:yes">
  <td width="342" valign="top" style="width:4.75in;border-top:none;border-left:
  solid #BCB5B0 1.0pt;border-bottom:solid #F0F0F0 1.0pt;border-right:none;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" align="center" style="text-align:center;line-height:14.0pt;
  mso-pagination:none;mso-layout-grid-align:none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;mso-bidi-font-family:&quot;Lucida Grande&quot;;
  color:#202020">Using the RAST servers for Metagenomics<o:p></o:p></span></p>
  </td>
  <td width="162" valign="top" style="width:2.25in;border-top:none;border-left:
  none;border-bottom:solid #F0F0F0 1.0pt;border-right:solid #BCB5B0 1.0pt;
  background:#F6FAF6;padding:0in 5.4pt 0in 5.4pt">
  <p class="MsoNormal" style="line-height:14.0pt;mso-pagination:none;mso-layout-grid-align:
  none;text-autospace:none"><span style="font-family:&quot;Lucida Grande&quot;;
  mso-bidi-font-family:&quot;Lucida Grande&quot;;color:#202020"><a href="http://blog.theseed.org/servers/2010/08/getting-summaries-of-functional-content-and-otus-for-an-metagenomic-sample.html"><span style="color:#002653">Getting Summaries of Functional Content and OTUs for an
  Metagenomic Sample</span></a><o:p></o:p></span></p>
  </td>
 </tr>
</tbody></table>

<!--EndFragment-->


</blockquote>

<!--EndFragment-->





 ]]>
        
    </content>
</entry>

<entry>
    <title>RAST Tutorial Links</title>
    <link rel="alternate" type="text/html" href="http://blog.theseed.org/servers/2010/09/rast-tutorial-links.html" />
    <id>tag:blog.theseed.org,2010:/servers//12.752</id>

    <published>2010-09-02T16:49:11Z</published>
    <updated>2010-09-02T16:49:41Z</updated>

    <summary>These links open in new windows, allowing you to follow the presentations in sequence or explore topics of interest.OverviewThe ComponentsThe SEED and the P-SEEDP-SEED and A-SEED ServersRASTmyRASTAnnotationsSubsystemsFIGfamsSubmitting a job with web-based RASTAnnotations with RASTAnnotations with myRASTExporting from RASTExporting from myRAST...</summary>
    <author>
        <name>The SEED Team</name>
        
    </author>
    
        <category term="Tutorials" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en-us" xml:base="http://blog.theseed.org/servers/">
        <![CDATA[<div style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; padding-top: 10px; padding-right: 10px; padding-bottom: 10px; padding-left: 10px; height: 90%; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; background-image: initial; background-attachment: initial; background-origin: initial; background-clip: initial; background-color: rgb(255, 255, 255); color: rgb(51, 51, 51); font: normal normal normal 13px/normal arial, helvetica, hirakakupro-w3, osaka, 'ms pgothic', sans-serif; font-family: Times; font-size: medium; ">These links open in new windows, allowing you to follow the presentations in sequence or explore topics of interest.<div><br /></div><div><ol style="margin-top: 0px; margin-right: 0px; margin-bottom: 0.75em; margin-left: 20px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; list-style-type: decimal; list-style-position: outside; list-style-image: initial; background-repeat: no-repeat repeat; "><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; "><a href="http://blog.theseed.org/servers/presentations/t1/overview.html" target="_new" style="text-decoration: underline; ">Overview</a></li><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; "><a href="http://blog.theseed.org/servers/presentations/t1/the-components.html" target="_new" style="text-decoration: underline; ">The Components</a></li><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; "><a href="http://blog.theseed.org/servers/presentations/t1/the-seed-and-the-pseed.html" target="_new" style="text-decoration: underline; ">The SEED and the P-SEED</a></li><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; "><a href="http://blog.theseed.org/servers/presentations/t1/servers-overview.html" target="_new" style="text-decoration: underline; ">P-SEED and A-SEED Servers</a></li><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; "><a href="http://blog.theseed.org/servers/presentations/t1/rast.html" target="_new" style="text-decoration: underline; ">RAST</a></li><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; "><a href="http://blog.theseed.org/servers/presentations/t1/drast-overview.html" target="_new" style="text-decoration: underline; ">myRAST</a></li><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; "><a href="http://blog.theseed.org/servers/presentations/t1/annotations.html" target="_new" style="text-decoration: underline; ">Annotations</a></li><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; "><a href="http://blog.theseed.org/servers/presentations/t1/subsystems.html" target="_new" style="text-decoration: underline; ">Subsystems</a></li><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; "><a href="http://blog.theseed.org/servers/presentations/t1/figfams.html" target="_new" style="text-decoration: underline; ">FIGfams</a></li><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; "><a href="http://blog.theseed.org/servers/presentations/t1/running-a-job-with-web-based-rast.html" target="_new" style="text-decoration: underline; ">Submitting a job with web-based RAST</a></li><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; "><a href="http://blog.theseed.org/servers/presentations/t1/annotation-with-rast.html" target="_new" style="text-decoration: underline; ">Annotations with RAST</a></li><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; "><a href="http://blog.theseed.org/servers/presentations/t1/running-a-job-with-the-desktop-rast.html" target="_new" style="text-decoration: underline; ">Annotations with myRAST</a></li><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; "><a href="http://blog.theseed.org/servers/presentations/t1/exporting-from-rast.html" target="_new" style="text-decoration: underline; ">Exporting from RAST</a></li><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; "><a href="http://blog.theseed.org/servers/presentations/t1/exporting-from-drast.html" target="_new" style="text-decoration: underline; ">Exporting from myRAST</a></li></ol></div></div>]]>
        
    </content>
</entry>

<entry>
    <title>The Update Protocol for Maintenance of Annotations</title>
    <link rel="alternate" type="text/html" href="http://blog.theseed.org/servers/2010/08/the-update-protocol-for-maintenance-of-annotations.html" />
    <id>tag:blog.theseed.org,2010:/servers//12.692</id>

    <published>2010-08-30T17:56:17Z</published>
    <updated>2010-08-30T18:14:24Z</updated>

    <summary>This document describes the update protocol used to maintain annotations within the SEED Project.There are now two SEEDs used as components of the maintenance process:The Annotators SEED (A-SEED) remains the SEED used to support manual curations, which are largely cast...</summary>
    <author>
        <name>The SEED Team</name>
        
    </author>
    
        <category term="Technical Notes" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en-us" xml:base="http://blog.theseed.org/servers/">
        <![CDATA[<p>This document describes the update protocol used to maintain annotations within the SEED Project.<br /><br />There are now two SEEDs used as components of the maintenance process:<br /><br /></p><ol><li>The Annotators SEED (A-SEED) remains the SEED used to support manual curations, which are largely cast as creation and maintenance of subsystems.</li><li>The PATRIC-SEED (P-SEED)&nbsp; contains all of the complete prokaryotic genomes from RefSeq, as well as all of the complete genomes from the A-SEED.&nbsp; </li></ol><br />The need for two distinct SEEDs arises because the A-SEED represents a much more productive framework for manually constructing subsystems.&nbsp; There the annotators are not faced with hundreds of almost identical genomes or with genomes of very low quality.&nbsp; The A-SEED is intended to contain representative genomes of high quality.&nbsp; On the other hand, construction of "pan-genomes" or development of assertions of the form "gene X and gene Y play identical functions" are best done in a more comprehensive collection of genomes.<br /><br />The need to utilize two versions of the SEED necessarily leads to a somewhat more complex update protocol.&nbsp; The steps in the current protocol are now discussed.&nbsp; For purposes of this discussion, we will reference FIGfam collections used for a variety of purposes.&nbsp; The version of the FIGfams corresponding to the current A-SEED will be called A-FIGfams.&nbsp; A-FIGfams contain proteins from only genomes present in the A-SEED.&nbsp; The current version of the FIGfams used in the P-SEED will be called P-FIGfams, and these will contain proteins from genomes in the P-SEED.<br /><br />Each SEED maintains a set of representative complete genomes.&nbsp; These representatives are thought of as members of Operational Taxonomic Units (OTUs).&nbsp; An OTU contains genomes that are "very similar" (technically, we group genomes that have SSU rRNAs that are at least 97% identical).&nbsp;&nbsp; The set of similar genomes constitutes the OTU, and the representative genome name is used as the "name" of the OTU.&nbsp; It should be noted that OTUs must be maintained for each copy of the SEED.&nbsp; The protocol implemented in the steps below assumes that the OTUs are current.&nbsp; If currency is not ensured by the protocol for adding genomes, then it is important to update OTUs in both SEEDs before proceeding.&nbsp; <br /><br />Similarly, this protocol assumes that the flow of genomes into the P-SEED has been successful.&nbsp;&nbsp; It is critical that the flow of genomes into the PSEED include all complete prokaryotic genomes maintained in GenBank.<br /><br />To understand the following protocol, you must be familiar with our use of Kmers to annotate genomes.&nbsp; For the purposes of this discussion, the Kmers are 8-character sequences in the amino acid alphabet.&nbsp; We maintain a collection of signature Kmers, which are Kmers that have been detected only in protein sequences that have a common function (i.e.,&nbsp; the only known proteins containing the Kmer all have been assigned the same function).&nbsp;&nbsp; Thus, when we detect an instant of one of the signature Kmers in a new protein sequence, it is evidence of function.&nbsp; If we detect a large number (usually, more than 3), it becomes very likely that we can say that the protein probably shares the same function as the protein sequences from which the Kmer was derived.&nbsp;&nbsp; Currently, we derive signature Kmers for each of the FIGfam protein families.<br /><br />These are the steps required to update the annotations of the P-SEED genomes:<br /><br /><br />

<ol>
<li style="margin-bottom: 1.5em;"><strong>Create A-FIGfams-1, the restriction of P-FIGfams to A-SEED Genomes</strong><br /><br />The first step is to take the current P-FIGfams, which may have numerous assertions relating to the extended set of genomes maintained in P-SEED, and restricting those protein sets to genomes from A-SEED.&nbsp; Protein sets that condense to singleton sets are removed in the process.</li>

<li style="margin-bottom: 1.5em;"><strong>Create A-FIGfams-2 by Updating the A-FIGfams-1 in the A-SEED</strong><br /><br />This step (in effect) uses the current subsystems in the A-SEED, as well as any non-subsystem clustering done in the P-SEED, to create a reconciled set of FIGfams for the A-SEED.&nbsp; A kmer-update is generated from the contents of A-FIGfams-2.</li>

<li style="margin-bottom: 1.5em;"><strong>Recall Protein Functions and Subsystems in P-SEED</strong><br /><br />This, in effect, imports the manual subsystem maintenance into the annotations of P-SEED.</li>

<li style="margin-bottom: 1.5em;"><strong>Update P-FIGfams to P-FIGfams-1 in the Updated P-SEED</strong><br /><br />Since this is an update of P-FIGfams using the updated subsystem&nbsp;&nbsp; annotations, it retains the clustering work done in P-SEED, while keeping the annotations imported from A-SEED.&nbsp; Create an updated set of kmers from P-FIGfams-1.</li>

<li style="margin-bottom: 1.5em;"><strong>Install the new Kmers into the Kmer Server</strong><br /><br />This leaves one with the largest set of Kmers (suitable for recognition of pan-genomes, and offering a larger set of signature kmers than is possible using just genomes from the A-SEED).</li>

<li style="margin-bottom: 1.5em;"><strong>Recall the Functions of&nbsp; the Proteins in P-SEED using the New Kmers</strong><br /><br />The change should be minor, but will induce consistency.&nbsp; The subsystem assertions can then be finally updated.</li>
</ol>
<p>This completes the steps needed to update the A-FIGfams (to A-FIGfams-2), the annotations in the P-SEED, P-FIGfams (to P-FIGfams-1) and the Kmers.<br /><br />It remains essential that we maintain the list of induced changes and do manual inspections to enforce quality control on the process.</p>]]>
        
    </content>
</entry>

<entry>
    <title>Getting Summaries of Functional Content and OTUs for an Metagenomic Sample</title>
    <link rel="alternate" type="text/html" href="http://blog.theseed.org/servers/2010/08/getting-summaries-of-functional-content-and-otus-for-an-metagenomic-sample.html" />
    <id>tag:blog.theseed.org,2010:/servers//12.682</id>

    <published>2010-08-29T19:04:56Z</published>
    <updated>2010-08-31T19:00:19Z</updated>

    <summary>Getting Summaries of Functional Content and OTUs for an Metagenomic Sample(Note: in order to use the svr commands, you must have installed the myRAST app and set your environment correctly, see this post for instructions)It is worth mentioning that two...</summary>
    <author>
        <name>The SEED Team</name>
        
    </author>
    
        <category term="Tutorials" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en-us" xml:base="http://blog.theseed.org/servers/">
        <![CDATA[<div><font class="Apple-style-span" style="font-size: 1.25em; ">Getting Summaries of Functional Content and OTUs for an Metagenomic Sample</font></div><div><font class="Apple-style-span" style="font-size: 1.25em; "><br /></font></div><div><font class="Apple-style-span" style="font-size: 1.25em; ">(Note: in order to use the svr commands, you must have installed the myRAST app and set your environment correctly, see t<a href="http://blog.theseed.org/servers/installation/distribution-of-the-seed-server-packages.html">his</a> post for instructions)</font></div><div><font class="Apple-style-span" style="font-size: 1.25em; "><br /></font></div><div><font class="Apple-style-span" style="font-size: 1.25em; ">It is worth mentioning that two of the svr functions provide a means</font></div><div><font class="Apple-style-span" style="font-size: 1.25em; ">of getting quick summaries of content for a newly-sequenced metagenomic sample.</font></div><div><font class="Apple-style-span" style="font-size: 1.25em; "><br /></font></div><div><font class="Apple-style-span" style="font-size: 1.25em; ">&nbsp;svr_assign_to_dna_using_figfams &lt; MG.sample</font></div><div><font class="Apple-style-span" style="font-size: 1.25em; "><br /></font></div><div><font class="Apple-style-span" style="font-size: 1.25em; ">takes as input a set of DNA sequences in fasta format. &nbsp;It outputs a 5-column, tab-separated table containing:</font></div><div><font class="Apple-style-span" style="font-size: 1.25em; "><br /></font></div><div><ol><li><font class="Apple-style-span" style="font-size: 1.25em; ">The ID of one of the sequences</font></li><li><font class="Apple-style-span" style="font-size: 1.25em; ">&nbsp;The number of Kmer hits against the sequence</font></li><li><font class="Apple-style-span" style="font-size: 1.25em; ">&nbsp;The region identified as potentially supporting the function&nbsp;&nbsp;(in the form of a contig, begin, and end coordinates separated by underscores),</font></li><li><font class="Apple-style-span" style="font-size: 1.25em; ">&nbsp;The function associated with the region (which may just be "hypothetical protein"),&nbsp;</font></li><li><font class="Apple-style-span" style="font-size: 1.25em; ">&nbsp;&nbsp;A genome name that represents an "operational taxonomic unit" that appears to be the source of the hit.</font></li></ol></div><div><font class="Apple-style-span" style="font-size: 1.25em; "><br /></font></div><div><font class="Apple-style-span" style="font-size: 1.25em; ">This tab-separated table can be summarized using</font></div><div><font class="Apple-style-span" style="font-size: 1.25em; "><br /></font></div><div><font class="Apple-style-span" style="font-size: 1.25em; ">&nbsp;&nbsp;svr_summarize_MG_output &lt; table &gt; function.summary 2&gt; otu.summary</font></div><div><font class="Apple-style-span" style="font-size: 1.25em; "><br /></font></div><div><font class="Apple-style-span" style="font-size: 1.25em; ">Normally, these are just pipelined using</font></div><div><font class="Apple-style-span" style="font-size: 1.25em; "><br /></font></div><div><font class="Apple-style-span" style="font-size: 1.25em; ">&nbsp;svr_assign_to_dna_using_figfams &lt; MG.sample | svr_summarize_MG_output &gt; function.summary 2&gt; otu.summary</font></div><div><font class="Apple-style-span" style="font-size: 1.25em; "><br /></font></div><div><font class="Apple-style-span" style="font-size: 1.25em; ">The pipeline will usually process roughly 6-8 megabases of data per minute.</font></div><div><font class="Apple-style-span" style="font-size: 1.25em; "><br /></font></div><div><font class="Apple-style-span" style="font-size: 1.25em; ">Finally, you can use</font></div><div><font class="Apple-style-span" style="font-size: 1.25em; "><br /></font></div><div><span class="Apple-tab-span" style="white-space:pre"><font class="Apple-style-span" style="font-size: 1.25em; ">	</font></span><font class="Apple-style-span" style="font-size: 1.25em; "> svr_metabolic_reconstruction &lt; function.summary | cut -f 4,5 | sort -u</font></div><div><font class="Apple-style-span" style="font-size: 1.25em; "><br /></font></div><div><font class="Apple-style-span" style="font-size: 1.25em; ">to get a quick metabolic reconstruction summarizing the active subsystems that could be determined (along&nbsp;with the appropriate variant code).</font></div><div><br /></div> ]]>
        
    </content>
</entry>

<entry>
    <title>Upcoming RAST Tutorials</title>
    <link rel="alternate" type="text/html" href="http://blog.theseed.org/servers/2010/08/rast-tutorials.html" />
    <id>tag:blog.theseed.org,2010:/servers//12.672</id>

    <published>2010-08-29T18:30:24Z</published>
    <updated>2010-08-30T12:20:56Z</updated>

    <summary>We are announcing 5 upcoming RAST tutorials. The dates are:August 31 - Sep 2October 4-6October 25-27November 1-3November 15-18These tutorials are designed to help people make the most of our RAST and other bioinformatics resources. The tutorials are free and open...</summary>
    <author>
        <name>The SEED Team</name>
        
    </author>
    
        <category term="News" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en-us" xml:base="http://blog.theseed.org/servers/">
        <![CDATA[We are announcing 5 upcoming RAST tutorials. The dates are:<div><br /></div><div><ol><li>August 31 - Sep 2</li><li><font class="Apple-style-span" face="-editor-proxy">October 4-6</font></li><li><font class="Apple-style-span" face="-editor-proxy">October 25-27</font></li><li><font class="Apple-style-span" face="-editor-proxy">November 1-3</font></li><li><font class="Apple-style-span" face="-editor-proxy">November 15-18</font></li></ol><div><br /></div></div><div>These tutorials are designed to help people make the most of our RAST and other bioinformatics resources. The tutorials are free and open to the public. Attendees are encouraged to bring their own genome for some hands-on practice guided by our experienced staff. You can find an agenda for the tutorials right here on the blog at&nbsp;<a href="http://blog.theseed.org/servers/presentations/t1/overview.html">http://blog.theseed.org/servers/presentations/t1/overview.html</a>.</div><div><br /></div><div><br /></div><div>Email us at&nbsp;<a href="mailto:rast-tutorial@lists.mcs.anl.gov">rast-tutorial@lists.mcs.anl.gov</a>&nbsp;to register or find out more about these tutorials.</div>]]>
        
    </content>
</entry>

<entry>
    <title>What is FIG</title>
    <link rel="alternate" type="text/html" href="http://blog.theseed.org/servers/2010/08/what-is-fig.html" />
    <id>tag:blog.theseed.org,2010:/servers//12.642</id>

    <published>2010-08-25T18:47:43Z</published>
    <updated>2010-08-25T18:49:46Z</updated>

    <summary>What is FIG?FIG is a nonprofit organization devoted to providing support for those analyzing genomes.Sequencing of genomes is laying the foundation for advances in sciencethat will dramatically reshape our society. These advances willinitially occur in medicine, agriculture, and chemical production,...</summary>
    <author>
        <name>The SEED Team</name>
        
    </author>
    
    
    <content type="html" xml:lang="en-us" xml:base="http://blog.theseed.org/servers/">
        <![CDATA[<div>What is FIG?</div><div><br /></div><div>FIG is a nonprofit organization devoted to providing support for those analyzing genomes.</div><div><br /></div><div>Sequencing of genomes is laying the foundation for advances in science</div><div>that will dramatically reshape our society. These advances will</div><div>initially occur in medicine, agriculture, and chemical production, but</div><div>in the long term the impact will be pervasive. The computer revolution</div><div>started by impacting payrolls, but eventually allowed man to travel to</div><div>the moon. Similarly, the biological revolution is beginning by</div><div>reshaping the life sciences, but this will surely not be the the whole</div><div>story or even the most significant outcome.</div><div><br /></div><div>The interpretation of genomes will constitute the most exciting and</div><div>most significant science of the century. By rapidly advancing our</div><div>understanding of life, how it arose, and how it continues to change,</div><div>we will acquire the tools that will allow us to better understand and</div><div>improve our existence. Understanding will begin with relatively imple</div><div>forms of life -- unicellular organisms. While the central mechanisms</div><div>of life are shared by both these organisms and the most complex</div><div>animals and plants, they also contain a remarkable diversity. They</div><div>have an immense amount to teach us about life itself, and we will need</div><div>to master these lessons before full understanding of complex genomes</div><div>will be achievable.</div><div><br /></div><div>The Fellowship for Interpretation of Genomes will focus on organizing</div><div>the data needed to support interpretation of genomes, providing the</div><div>infrastructure needed by the world community in its efforts to achieve</div><div>understanding. In addition, we will ourselves pick specific, critical</div><div>problems and attempt to actively participate in the unravelling of the</div><div>secrets within these amazing entities. It is only by merging the work</div><div>of building infrastructure with the applications that use it that we</div><div>will more deeply understand what is needed at each step.</div><div><br /></div><div>FIG was started in May, 2003. The founders were Michael Fonstein,</div><div>Yakov Kogan, Andrei Osterman, Ross Overbeek, and Veronika Vonstein. An</div><div>early position paper began with the following comments:</div><div><br /></div><blockquote class="webkit-indent-blockquote" style="margin: 0 0 0 40px; border: none; padding: 0px;"><div><i>The FIG Architecture: the SEED</i></div><div><i><br /></i></div><div><i>We begin with the "seed" of FIG. The SEED contains the essential,</i></div><div><i>basic elements that are needed to sustain a scalable integration of</i></div><div><i>thousands of genomes. The later parts of this document will attempt to</i></div><div><i>offer precise notions of what makes up the seed of FIG. I will cover</i></div><div><i>the basic types of objects, make comments on what extensions will be</i></div><div><i>needed to support hundreds of thousands of genomes, and offer an</i></div><div><i>implementation plan.</i></div><div><i><br /></i></div><div><i>However, before we go into such detail, some broad notions should be</i></div><div><i>discussed. The idea of integrating hundreds of thousands of genomes</i></div><div><i>needs some clarification. Indeed, what is meant by integrating a bunch</i></div><div><i>of genomes, no matter what the number. In my mind, the notion of</i></div><div><i>integration is essentially "maintenance of notions of neighborhood,</i></div><div><i>allowing forms of access that can be used to easily explore</i></div><div><i>connections and comparisons between data from numerous genomes". This</i></div><div><i>may be viewed as a complicated way to say "a framework to support</i></div><div><i>comparative analysis". &nbsp;</i></div></blockquote><div><br /></div><div><br /></div><div>To be &nbsp;more precise:</div><div><br /></div><div>Genes from a single genomes are often "functionally related" in that</div><div>the participate in implementing a single pathway or subsystem. For any</div><div>single gene, the "functional neighborhood" of that gene is the set of</div><div>genes that are functionally related to the gene. To support access</div><div>relating to this notion of neighborhood requires an encoding of the</div><div>cellular machinery (e.g., pathways). &nbsp;Genes that occur close to each</div><div>other on a chromosome may be thought of as "postionally related". The</div><div>set of genes that are positionally related to a given gene amounts to</div><div>the "positional neighborhood" of the gene. One of the huge payouts of</div><div>integrations to data has been based on a correlation between the</div><div>neighborhoods imposed by "functionally related" and "positionally</div><div>related" in the case of prokaryotic genomes. &nbsp;Genes from one or more</div><div>genomes that share a common ancestor are called "homologous". Homology</div><div>induces yet another notion of neigborhood. One can build more</div><div>restricted neighborhoods upon this basic concept. Thus, we tend to</div><div>think of a protein family as a set of homologous genes that have a</div><div>common function (an imprecise notion, we grant). Maintenance of</div><div>protein families will, of course, be an absolutely essential part of</div><div>effectively integrating many thousands of genomes. &nbsp;Sets of very</div><div>closely related genomes may be viewed as a neighborhood (i.e., the</div><div>neighborhood of a genome becomes a set of closely related</div><div>genomes). One can layer a notion of "variation", including SNPs, on</div><div>the notion of closely related genomes, and then whole frameworks for</div><div>exploring minor variations become possible. &nbsp;The power in an</div><div>integration arises from mixing the different notions of</div><div>neighborhood. The tools for supporting effective use of a variety of</div><div>comparative notions constitute the computational framework for</div><div>comparative analysis, which is often abbreviated to the notion of</div><div>"integration".</div><div><br /></div><div>FIG offers the key services required to architect and implement a</div><div>comparative framwork for interpreting genomes.</div><div><br /></div><div>The Fellowship for Interpretation of Genomes is a 501 (c) (3) organization.</div><div><br /></div>]]>
        
    </content>
</entry>

</feed>

