Lab Logo
people  |  projects  |  pictures  |  publications  |  protocols  |  places  |  play

Summaries of lab projects:

 

All sequencing projects performed in the Pringle lab are deposited in SRA as they are generated. They can be freely downloaded for analysis. If you have questions or comments (we'd appreciate knowing who is using the data and how), please e-mail Erik Lehnert.

symb logo

Transcriptome of Symbiotic CC7 Aiptasia (with some algal sequences as well):

Any transcripts having 99% identity over an alignment that covers at least 20% of both transcripts were clustered and a single representative transcript retained in this GZ file.
Each representative transcript has been classified based on most probable species of origin based on TopSort (a machine-learning algorithm) and genomic read alignment (.txt file).

Transcriptome of aposymbiotic CC7 Aiptasia

We assembled the transcriptome of a clonal population of adult, aposymbiotic (dinoflagelate-free) Aiptasia pallida from ~208 million reads, yielding 58,018 contigs.  We demonstrated that many of these contigs represent full-length or near-full-length transcripts that encode proteins similar to those from a diverse array of pathways in other organisms, including various metabolic enzymes, cytoskeletal proteins, and neuropeptide precursors. These contigs were uploaded to the NCBI Transcriptome Assembly Shotgun Sequence Database, accession numbers JV077153-JV134524.
Adult aposymbiotic Aiptasia transcriptome version 1.1

Many alternatively spliced transcript were present in out transcriptome.  For the purposes of RNA-Seq experiments, transcripts with highly redundant sequence should be removed.  Below is a fasta file of containing the longest transcript from each cluster.
Adult aposymbiotic
Aiptasia transcriptome FOR MAPPING version 1.1

The contigs were annotated by sequence similarity, assigned GO terms, and scanned for conserved protein domains. 
You can download a tab-delimited annotations file here:
Adult aposymbiotic Aiptasia transcriptome version 1.1 annotations

 

Reads from gene expression experiments and genomic DNA

We are currently analyzing expression data comparing symbiotic and aposymbiotic anemones, symbiotic and cultured Symbiodinium, and heat stressed Aiptasia. We are also in the process of generating reads derived from aposymbiotic Aiptasia genomic DNA, which will be assembled into a genome. All of these reads are available at: http://www.ncbi.nlm.nih.gov/sra/?term=pringle

Fulcrum Read Collapser "Leveraging the computer power you have into the results you need"

Our Lab has developed a pipeline to collapse near-duplicate reads resulting from Ultra Hight Throughput Sequencing (UHTS), specifically Illumina and 454. The pipeline uses the Parallel Python module to spread work across processors or networked computers, and uses MapReduce to allow processing of even larger datasets. Included in this archive are detailed manuals and several helper scripts.
http://www.ncbi.nlm.nih.gov/pubmed/22419786

Updated, Oct. 2012: We have updated fulcrum to be compatible with the new Illumina fastq output.

fulcrum_v_043 (.zip)

We have added a new option to linenumfast05.py. -n [int] Length of longest read. Default is 101-bp. If you are using shorter, just change it AND change the variable MAX_LENGTH at the top of parallel05.py. Finally, if you are using something besides FASTQ 1.8+, change the OFFSET value in parallel05.py from 33 to 64.

Deprecated version: fulcrum_v_042 (.zip)