fgenes - softberry · 2005. 12. 7. · (salamov and solovyev, 2000) on the baylor college of...

79
FGENES ......................................................................................................................................... 1 FGENESH ...................................................................................................................................... 4 FGENESH++ ............................................................................................................................... 14 FGENESH+ .................................................................................................................................. 18 FGENESB .................................................................................................................................... 20 FGENESV .................................................................................................................................... 22 FGENES-M .................................................................................................................................. 24 BESTORF .................................................................................................................................... 25 PROTCOMP ................................................................................................................................ 26 BPROM ........................................................................................................................................ 30 SPLICEDB ................................................................................................................................... 33 SCAN2 .......................................................................................................................................... 35 PDISORDER................................................................................................................................ 36 SPL ................................................................................................................................................ 37 NSITE ........................................................................................................................................... 38 TSSP.............................................................................................................................................. 39 PLANTPROM ............................................................................................................................. 40 PROMH ........................................................................................................................................ 42 Other FGENESH ......................................................................................................................... 43 Various programs ........................................................................................................................ 74 FGENES Bioinformatics 2005 21(9):1776-1781; doi:10.1093/bioinformatics/bti283 Gene finding for the helical cytokines Darrell Conklin , Betty Haldeman and Zeren Gao 1,* 2 2 1 Department of Computing, City University London, United Kingdom 2 ZymoGenetics Inc. Seattle, USA * To whom correspondence should be addressed. ... For comparison purposes, three other gene finding methods were also run on the same dataset: Fgenes v1.6 (Solovyev et al., 1994), Genscan v1.0 (Burge and Karlin ... Genes, Chromosomes and Cancer Volume 43, Issue 1 , Pages 83 – 94 Published Online: 18 Feb 2005 Molecular cloning and characterization of FBXO47, a novel gene containing an F-box domain, located in the 17q12 band deleted in papillary renal cell carcinoma Barbara Simon-Kayser 1 * , Catherine Scoul 1 , Karine Renaudin 2 , Pascal Jezequel 3 , Olivier Bouchot 4 , Jérôme Rigaud 4 , Stéphane Bezieau 1 1 Laboratoire d'Etude du Polymorphisme de l' ADN, Faculté de Médecine, Nantes, France 2 Service d' anatomo-pathologie, Hôtel-Dieu, Centre Hospitalier Universitaire, Nantes, France 3 Centre de Recherche de Lutte Contre le Cancer René Gauducheau, Saint Herblain, France 4 Service d'urologie, Hôtel-Dieu, Centre Hospitalier Universitaire, Nantes, France Sequence data from this article have been deposited with the GenBank Data Library under Accession No. AY700575. ... html), GrailExp (http://grail.Isd.ornl.gov/grailexp), Fgenes (http://genomic.sanger. ac.uk/gf/gf.shtml), and Genmark (http://www2.ebi.ac.uk/genemark). ...

Upload: others

Post on 10-Mar-2021

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)

FGENES 1 FGENESH 4 FGENESH++ 14 FGENESH+18 FGENESB 20 FGENESV 22 FGENES-M 24 BESTORF 25 PROTCOMP26 BPROM 30 SPLICEDB 33 SCAN2 35 PDISORDER36 SPL37 NSITE 38 TSSP39 PLANTPROM 40 PROMH42 Other FGENESH43 Various programs74 FGENES Bioinformatics 2005 21(9)1776-1781 doi101093bioinformaticsbti283 Gene finding for the helical cytokines Darrell Conklin Betty Haldeman and Zeren Gao 1 2 2

1Department of Computing City University London United Kingdom 2ZymoGenetics Inc Seattle USA To whom correspondence should be addressed For comparison purposes three other gene finding methods were also run on the same dataset Fgenes v16 (Solovyev et al 1994) Genscan v10 (Burge and Karlin Genes Chromosomes and Cancer Volume 43 Issue 1 Pages 83 ndash 94 Published Online 18 Feb 2005 Molecular cloning and characterization of FBXO47 a novel gene containing an F-box domain located in the 17q12 band deleted in papillary renal cell carcinoma

Barbara Simon-Kayser 1 Catherine Scoul 1 Karine Renaudin 2 Pascal Jezequel 3 Olivier Bouchot 4 Jeacuterocircme Rigaud 4 Steacutephane Bezieau 1

1Laboratoire dEtude du Polymorphisme de l ADN Faculteacute de Meacutedecine Nantes France 2Service d anatomo-pathologie Hocirctel-Dieu Centre Hospitalier Universitaire Nantes France 3Centre de Recherche de Lutte Contre le Cancer Reneacute Gauducheau Saint Herblain France 4Service durologie Hocirctel-Dieu Centre Hospitalier Universitaire Nantes France

Sequence data from this article have been deposited with the GenBank Data Library under Accession No AY700575 html) GrailExp (httpgrailIsdornlgovgrailexp) Fgenes (httpgenomicsanger acukgfgfshtml) and Genmark (httpwww2ebiacukgenemark)

Journal of Neurobiology Volume 63 Issue 3 Pages 235 ndash 254 Published Online 4 Mar 2005 Shaw potassium channel genes in Drosophila James J L Hodge 1 2 James C Choi 2 Cahir J OKane 1 Leslie C Griffith 2

1Department of Genetics University of Cambridge Downing Site Cambridge CB2 3EH United Kingdom 2Department of Biology and Volen Center for Complex Systems Brandeis University MS008 415 South Street Waltham Massachusetts 02454-9110 Intronexon structure of Shawl was initially predicted using the FGENES program (Salamov and Solovyev 2000) on the Baylor College of Medicine Genefinder Current Proteomics January 2004 vol 1 no 1 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE1 Affiliations 1 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Nature Biotechnology 22 1146 ndash 1149 5-end SAGE for the analysis of transcriptional start sites Hashimoto et al (2004) from expressed sequence tag (EST) maps analysis of full-length cDNAs and computational annotation by Genscan Genie FGENES and other programs Hemoglobin Volume 28 Number 3 2004 255 ndash 259 An a-Thalassemia Phenotype in a Dutch Hindustani Caused by a New Point Mutation that Creates an Alternative Splice Donor Site in the First Exon of the a2-Globin Gene Cornelis L Harteveld A1 Pierre W Wijermans A2 Peter van Delft A1 Ellen Rasp A2 Hans L Haak A2 Piero C Giordano A1 A1 Hemoglobinopathies Laboratory Leiden University Medical Center PO Box 9503 2300 Leiden The Netherlands A2 Leyenburg Hospital The Hague The Netherlands and GRAIL gap2 (Gene Recognition and Assembly Internet Link Sequence exploration and gene discovery Version 13) Genefinder GENSCAN Fgenes and HMMGene Journal of Medical Genetics 200441e52 Genomic organisation of the UDP-N-acetylglucosamine-1-phosphotransferase gamma subunit (GNPTAG) and its mutations in mucolipidosis III

A Raas-Rothschild R Bargal O Goldman E Ben-Asher J E M Groener A Toutain E Stemmer Z Ben-Neriah H Flusser F A Beemer M Penttinen T Olender A J J T Rein G Bach and M Zeigler

1 1 1 2 3 4

1 1 5 6 7 2 8

1 11 Department of Human Genetics Hadassah University Medical Center Jerusalem Israel 2 Department of Molecular Genetics Weizmann Institute of Science Rehovot Israel 3 Departments of Pediatrics and Clinical Genetics Leiden University Medical Center Leiden The Netherlands 4 Service de Geacuteneacutetique CHU de Tours Hocircpital Bretonneau Tours France 5 Child Development Unit Soroka University Hospital Beer Sheva Israel 6 Department of Medical Genetics University Medical Center Utrecht The Netherlands 7 Clinical Genetics Unit Turku University Central Hospital Turku Finland 8 Division of Pediatric Cardiology Hadassah University Medical Center Jerusalem Israel Correspondence to A Raas-Rothschild MD Department of Human Genetics Hadassah University Hospital Jerusalem 91120 Israel annickmdhujiacil third intron is 9 kb This structure was consistent with the exon prediction of fgenes and Genscan Each splice donor and acceptor

FGENESH European Journal of Plant Pathology Issue Volume 112 Number 1 Date May 2005 Pages 23 - 29 DOI 101007s10658-004-7088-7 Leptosphaeria maculans a fungal pathogen of Brassica napus secretes a subtilisin-like serine protease Leanne M Wilson1 and Barbara J Howlett1

1 School of Botany The University of Melbourne Parkville Victoria 3010 Australia DNA and cDNA sequences were compared to identify intron positions which confirmed those predicted by FGENESH gene prediction software (wwwsoftberrycom) Current Genetics Issue Volume 47 Number 5 Date May 2005 Pages 307 ndash 315 DOI 101007s00294-004-0559-8 During attachment Phytophthora spores secrete proteins containing thrombospondin type 1 repeats Andrea V Robold1 and Adrienne R Hardham1

(1) Plant Cell Biology Group Research School of Biological Sciences The Australian National University Canberra ACT 2601 Australia infohtml) The DNA sequence was searched for introns using the soft- ware program FGENESH (httpwwwsoftberry comberryphtml Microbiology 151 (2005) 1499-1505 DOI 101099mic027759-0 Overproduction purification and characterization of FgaPT2 a dimethylallyltryptophan synthase from Aspergillus fumigatus Inge A Unsoumlld and Shu-Ming Li Pharmazeutische Biologie Pharmazeutisches Institut Eberhard-Karls-Universitaumlt Tuumlbingen Auf der Morgenstelle 8 72076 Tuumlbingen Germany Correspondence Shu-Ming Li shumingliuni-tuebingende FGENESH (Softberry wwwsoftberrycomberryphtml) and the DNASIS software package (version 21 Hitachi Software Engineering) were used for intron prediction Eukaryotic Cell March 2005 p 526-535 Vol 4 No 3 doi101128EC43526-5352005 Sex-Specific Homeodomain Proteins Sxi1 and Sxi2a Coordinately Regulate Sexual Development in Cryptococcus neoformans Christina M Hull1 Marie-Josee Boily and Joseph Heitman 1 12

Department of Molecular Genetics and Microbiology1 the Howard Hughes Medical Institute Duke University Medical Center Durham North Carolina2 Received 14 January 2005 Accepted 14 January 2005 Corresponding author Mailing address 322 CARL Building Box 3546 Department of Molecular Genetics and Microbiology 100 Research Dr Duke University Medical Center Durham NC 27710 Phone (919) 684-2824 Fax (919) 684-5458 E-mail heitm001dukeedu Present address Departments of Biomolecular Chemistry and Medical Microbiology amp

Immunology University of WisconsinmdashMadison Madison WI 53706

Sequence manipulations Splice predictions of candidate gene sequences for SXI2a were facilitated with a Softberry algorithm (wwwsoftberrycom)

Plant Mol Biol 2005 Feb57(3)445-60 Evaluation of five ab initio gene prediction programs for the discovery of maize genes Yao H Guo L Fu Y Borsuk LA Wen TJ Skibbe DS Cui X Scheffler BE Cao J Emrich SJ Ashlock DA Schnable PS Department of Genetics Development and Cell Biology Iowa State University Ames 50011-3650 USA hellipThe five programs were used in conjunction with RT-PCR to identify and establish the structures of two new genes in the a1-sh2 interval of the maize genome FGENESH GeneMarkhmm and GENSCAN were tested on a larger data set consisting of maize assembled genomic islands (MAGIs) that had been aligned to ESTs FGENESH GeneMarkhmm and GENSCAN correctly predicted gene models in 773 625 and 371 MAGIs respectively out of the 1353 MAGIs that comprise data set 2hellip New Phytologist 167 (1) July 2005 239-247 doi101111j1469-8137200501392x Identification of perennial ryegrass (Lolium perenne (L)) and meadow fescue (Festuca pratensis (Huds)) candidate orthologous sequences to the rice Hd1(Se1) and barley HvCO1 CONSTANS-like genes through comparative mapping and microsynteny P Armstead L Skoslasht L B Turner K Skoslasht I S Donnison M O Humphreys and I P King Author for correspondence Ian Armstead Tel +44 (0)1970 823108 Fax +44 (0)1970 823242 Email ianarmsteadbbsrcacuk Predictions of mRNA and protein sequences were carried out using fgenesh and fgenesh+ software ( http wwwsoftberrycom berryphtml ) Plant Physiology May 2005 Vol 138 pp 38-46 BIOINFORMATICS-PLANT DATABASES Databases and Information Integration for the Medicago truncatula Genome and Transcriptome1 Steven B Cannon John A Crow Michael L Heuer Xiaohong Wang Ethalinda KS Cannon Christopher Dwan Anne-Francoise Lamblin Jayprakash Vasdewani Joann Mudge Andrew Cook John Gish Foo Cheung Steve Kenton Timothy M Kunau Douglas Brown Gregory D May Dongjin Kim Douglas R Cook Bruce A Roe Chris D Town Nevin D Young and Ernest F Retzel

2 3

Department of Plant Pathology University of Minnesota St Paul Minnesota 55108 (SBC XW EKSC JV JM NDY) Center for Computational Genomics and Bioinformatics University of Minnesota Minneapolis Minnesota 55455 (JAC MLH CD AFL TMK

EFR) Department of Plant Pathology University of California Davis California 95616 (AC JG DJK DRC) The Institute for Genomic Research Rockville Maryland 20850 (FC CDT) Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma 73019 (BAR SK) Plant Biology Division The Samuel Roberts Noble Foundation Ardmore Oklahoma 73401 (GDM) and North Carolina State University Fungal Genomics Laboratory Department of Plant Pathology Raleigh North Carolina 27695 (DB) 1 This work was supported by the National Science Foundation (awards DBIndash0321460 DBIndash0196197 DBIndash0110206 DBIndash9975806 and DBIndash9872565) by the US Department of Agriculture Cooperative State Research Education and Extension ServiceNational Research Initiative Program and by the Samuel Noble Roberts Foundation 2 Present address BioTeam Inc Cambridge MA 3 Present address University of Minnesota Cancer Center MMC 806 420 Delaware St SE Minneapolis MN 55455 wwwplantphysiolorgcgidoi101104pp104059204 Corresponding author e-mail ernestccgbumnedu fax 612ndash626ndash6069 Received December 31 2004 returned for revision March 4 2005 accepted March 21 2005 Annotation involves a multi-institution pipeline relying on Medicago-trained FGENESH (Salamov and Solovyev 2000 ) predictions the EuGene (Foissac et al Plant Physiology May 2005 Vol 138 pp 18-26 BIOINFORMATICS-PLANT DATABASES The Institute for Genomic Research Osa1 Rice Genome Annotation Database1 Qiaoping Yuan Shu Ouyang Aihui Wang Wei Zhu Rama Maiti Haining Lin John Hamilton Brian Haas Razvan Sultana Foo Cheung Jennifer Wortman and C Robin Buell

2

The Institute for Genomic Research Rockville Maryland 20850 1 This work (on rice genome annotation) was supported by the National Science Foundation (grant no DBIndash0321538 to CRB) and the US Department of Agriculture (grant no 2003ndash35317ndash13173 to CRB) 2 Present address Laboratory of Neurogenetics NIAAA NIH 5625 Fishers Lane Suite 3532 MSC 9412 Bethesda MD 20892 wwwplantphysiolorgcgidoi101104pp104059063 Corresponding author e-mail rbuelltigrorg fax 301ndash838ndash0208 Received December 31 2004 returned for revision February 24 2005 accepted March 21 2005 The ab initio gene finders used in the rice EGC pipeline include FGENESH (monocot matrix Salamov and Solovyev 2000 ) GeneMarkhmm (rice matrix Lukashin and Genome Research 15577-582 2005 Closing in on the C elegans ORFeome by cloning TWINSCAN predictions Chaochun Wei Philippe Lamesch Manimozhiyan Arumugam Jennifer Rosenberg Ping Hu Marc Vidal and Michael R Brent

1 2 1 2

1 2 13

1 Laboratory for Computational Genomics and Department of Computer Science and Engineering Washington University St Louis Missouri 63130 USA 2 Center for Cancer Systems Biology Dana-Farber Cancer Institute and Department of Genetics Harvard Medical School Boston Massachusetts 02115 USA

3 Corresponding author E-mail brentcsewustledu fax (314) 935-7302 Finally we compared TWINSCAN with two other gene-prediction systems that have recently been developed for nematodesndashFGENESH (Salamov and Solovyev 2000 Plant Physiology April 2005 Vol 137 pp 1174-1181 UPDATE ON SEQUENCING MEDICAGO TRUNCATULA AND LOTUS JAPONICUS Sequencing the Genespaces of Medicago truncatula and Lotus japonicus1 Nevin D Young Steven B Cannon Shusei Sato Dongjin Kim Douglas R Cook Chris D Town Bruce A Roe and Satoshi Tabata Department of Plant Pathology University of Minnesota St Paul Minnesota 55108 (NDY SBC) Kazusa DNA Research Institute Kisarazu Chiba 292ndash0818 Japan (SS ST) Department of Plant Pathology University of California Davis California 95616 (DRC DK) The Institute for Genomic Research Rockville Maryland 20850 (CDT) and Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma 73019 (BAR) 1 The US component of the Medicago truncatula sequencing effort was initially supported by a grant from the Samuel Roberts Noble Foundation to BAR Current support comes from National Science Foundation Plant Genome Research Program (grant no 0110206 to DRC DK CDT and NDY and grant no 0321460 to NDY BAR and CDT) Funding for Lotus japonicus sequencing comes from the Kazusa DNA Research Institute Foundation wwwplantphysiolorgcgidoi101104pp104057034 Corresponding author e-mail nevinyumnedu fax 612ndash625ndash9728 hellip (These estimates are based on Fgenesh predictions [Salamov and Solovyev 2000 ] using a Mt-trained matrix retaining peptides with a BLASTP match at 10endash4 to the UniProt NREF100 database of peptides [Apweiler et al 2004 ] This estimate for Lj differs from the published value of 1 gene per 101 kb [Asamizu et al 2003a ] due to the use here of the Fgenesh gene-calling algorithm so Mt and Lj could be compared directly)hellip hellipThis estimate increases to 6500 when Lj genes are predicted by the Mt-trained Fgenesh algorithm described earlierhellip International Journal of CancerVolume 109 Issue 1 Pages 71 - 75Cancer Genetics Candidate regions of tumor suppressor locus on chromosome 9q311 in gastric cancer Naoto Kakinuma 1 Kazuyoshi Kohu 1 2 Masaaki Sato 1 Tatsuya Yamada 3 Motowo Nakajima 1 Tetsu Akiyama 2 Susumu Ohwada 3 Yasuhiko Shibanaka 1

1Novartis Pharma Tsukuba Research Institute Ibaraki Japan 2Laboratory of Molecular and Genetic Information Institute for Molecular and Cellular Biosciences University of Tokyo Tokyo Japan 3Second Department of Surgery Gunma University School of Medicine Gunma Japan email Yasuhiko Shibanaka (yasuhikoshibanakapharmanovartiscom) Correspondence to Yasuhiko Shibanaka Novartis Pharma Tsukuba Research Institute Ohkubo 8 Tsukuba-shi Ibaraki 300-2611 Japan Fax +81-29-865-2281

predict the genes between D9S277 and D9S127 in 9q311 the gene prediction tools also in the UCSC Genome Browser having Acembly Ensembl Fgenesh GenScan and Genome Research 1554-66 2005 Gene and alternative splicing annotation with AIR Liliana Florea145 Valentina Di Francesco2 Jason Miller1 Russell Turner1 Alison Yao2 Michael Harris2 Brian Walenz1 Clark Mobarry1 Gennady V Merkulov3 Rosane Charlab3 Ian Dew1 Zuoming Deng3 Sorin Istrail1 Peter Li2 and Granger Sutton1 1 Informatics Research Applied Biosystems Rockville Maryland 20850 USA 2 Advanced Solutions Celera Genomics Rockville Maryland 20850 USA 3 Scientific Content and Applications Celera Genomics Rockville Maryland 20850 USA Ab initio prediction programs such as GenScan (Burge and Karlin 1997 ) FGenesH (Salamov and Solovyev 2000 ) Genie (Kulp et al Nucleic Acids Research 2005 Vol 33 Database issue D399-D402 SilkDB a knowledgebase for silkworm biology and genomics Jing Wang Qingyou Xia Ximiao He Mingtao Dai Jue Ruan Jie Chen Guo Yu Haifeng Yuan Yafeng Hu Ruiqiang Li Tao Feng Chen Ye Cheng Lu Jun Wang Songgang Li Gane Ka-Shu Wong Huanming Yang Jian Wang Zhonghuai Xiang Zeyang Zhou and Jun Yu

1 2 34 3 34 3 3

3 3 3 3 3 2 135

1 36 35 35 2

2 35

1 College of Life Sciences Peking University Beijing 100871 China 2 The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China 3 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 4 Graduate School of the Chinese Academy of Sciences Yuquan Road 19A Beijing 100039 China 5 Key Laboratory of Bioinformatics of Zhejiang Province Hangzhou Genomics Institute James D Watson Institute of Genome Sciences of Zhejiang University Hangzhou 310007 China and 6 Department of Medicine University of Washington Genome Center Seattle WA 98195 USA To whom correspondence should be addressed Tel +86 10 80481455 Fax +86 10 80498676 Email junyugenomicsorgcn Correspondence may also be addressed to Zeyang Zhou Tel +86 23 68251123 Fax +86 23 68251128 Email zyzhouswaucqcn The authors wish it to be known that in their opinion the first three authors should be regarded as joint First Authors BGF is a self-developed ab initio program based on GenScan (9) and FgeneSH (10) and was successfully utilized for our rice genome annotation (11) Clinical Cancer Research Vol 11 4029-4036 June 1 2005 Imaging Diagnosis Prognosis A Molecular Signature in Superficial Bladder Carcinoma Predicts Clinical Outcome Lars Dyrskjoslasht Karsten Zieger Mogens Kruhoslashffer Thomas Thykjaer Jens L Jensen Hanne Primdahl Natasha Aziz Niels Marcussen Klaus Moslashller and Torben F Oslashrntoft

1 12 15 15 4

1 6 3 2 1

Authors Affiliations 1 Molecular Diagnostic Laboratory Department of Clinical Biochemistry 2 Department of Urology and 3 University Institute of Pathology Aarhus University Hospital 4 Departments of Theoretical Statistics

and Mathematical Sciences University of Aarhus 5 Aros Applied Biotechnology Aarhus Denmark and 6 Eos BiotechnologyProtein Design Labs Fremont California Requests for reprints Torben F Oslashrntoft Molecular Diagnostic Laboratory Department of Clinical Biochemistry Aarhus University Hospital Skejby DK-8200 Aarhus N Denmark Phone 45-89495100 Fax 45-89496018 E-mail orntoftkiaudk array comprising 59619 probe sets representing 46000 unique sequences including known genes expressed sequence tag clusters and FGENESH-predicted exons BMC Evolutionary Biology 2005 51 doi1011861471-2148-5-1 Research article The WRKY transcription factor superfamily its origin in eukaryotes and expansion in plants Yuanji Zhang and Liangjiang Wang Address Plant Biology Division The Samuel Roberts Noble Foundation Ardmore OK 73402 USA Email Yuanji Zhang - yjzhangnobleorg Liangjiang Wang - Kevinlwangaolcom Corresponding author Despite minor differences in the gene structure prediction both gene prediction programs FGENESH and GENSCAN agree on the major features of the protein Nature Immunology 2005 v6 n3 The immunoglobulin heavy-chain locus in zebrafish identification and expression of a previously hellip N Danilova J Bussmann K Jekosch LA Steiner - - naturecom Nature Immunology Full text access provided to Googlebot Access by Web Services PLoS Biol 2005 June 3(6) e181 Published online 2005 May 24 doi 101371journalpbio0030181 RAG1 Core and V(D)J Recombination Signal Sequences Were Derived from Transib Transposons Vladimir V Kapitonov 1 and Jerzy Jurka 1

1Genetic Information Research Institute Mountain View California United States of America David Nemazee Academic EditorScripps Research Institute United States of America

Corresponding author Vladimir V Kapitonov vladimirgirinstorg Jerzy Jurka jurkagirinstorg Using FGENESH [33] we detected that the RAG1 corendashlike open reading frame (ORF) in the contig 29068 forms a terminal exon (positions 1154ndash2947) of an Genetics Published Articles Ahead of Print published on February 16 2005 as 101534genetics104036327 Identification and Characterization of Regions of the Rice Genome Associated with Broad- Spectrum Quantitative Disease Resistance Randall J Wisser Qi Sundagger Scot H Hulbertsect Stephen Kresovich and Rebecca J Nelsondaggerdagger 1 Department of Plant Breeding and Genetics Institute for Genomic Diversity Cornell University Ithaca New York 14853 daggerComputational Biology Service Unit Cornell Theory

Center Cornell University Ithaca New York 14853 sectDepartment of Plant Pathology Kansas State University Manhattan Kansas 66506 and daggerdaggerDepartment of Plant Pathology Cornell University Ithaca New York 14853 1Corresponding author Rebecca J Nelson Department of Plant Pathology Cornell University 321 Plant Science Ithaca NY 14853 Email rjn7cornelledu GENSCAN (B URGE and K ARLIN 1997) and FGENESH (S ALAMOV and S OLOVYEV 2001) to predict open reading frames Further searches against Plant Physiology January 2005 Vol 137 pp 176-189 Annotations and Functional Analyses of the Rice WRKY Gene Superfamily Reveal Positive and Negative Regulators of Abscisic Acid Signaling in Aleurone Cells1[w]

Zhen Xie Zhong-Lin Zhang Xiaolu Zou Jie Huang Paul Ruas Daniel Thompson and Qingxi J Shen

2 23

Department of Biological Sciences University of Nevada Las Vegas Nevada 891541 This work was supported by the US Department of Agriculture (grant no 02ndash35301ndash12066) by the National Institutes of Health (Biomedical Research Infrastructure Network seed grant no P20 RR16464) and by the University of Las Vegas Nevada (start-up funds to QJS) XZ was supported by a National Science Foundation Experimental Program to Stimulate Competitive

Research (EPSCoR) Integrative Approaches to Abiotic Stress (EPSndash0132556) graduate assistantship and Z-LZ by a National Science Foundation EPSCoR Advanced Computing in Environmental Sciences postdoctoral fellowship 2 These authors contributed equally to the paper 3 Present address Department of Plant and Microbial Biology University of California Berkeley CA 94720 and Plant Gene Expression Center US Department of Agriculture Albany CA 94710 [w] The online version of this article contains Web-only data First of all three genes (OsWRKY41 -43 and -44) were reannotated using FGENESH (wwwsoftberrycom) because the first introns of these genes were too small PLoS Biol 3(6) e181 (2005) RAG1 Core and V(D)J Recombination Signal Sequences Were Derived from Transib Transposons Vladimir V Kapitonov1 Jerzy Jurka1 1 Genetic Information Research Institute Mountain View California United States of America To whom correspondence should be addressed E-mail vladimirgirinstorg (VVK) E-mail jurkagirinstorg (JJ) Using FGENESH [33] we detected that the RAG1 corendashlike open reading frame (ORF) in the contig 29068 forms a terminal exon (positions 1154ndash2947) of an Plant and Cell Physiology 2005 46(1)3-13 doi101093pcppci503 From Mapping to Sequencing Post-sequencing and Beyond Takuji Sasaki Takashi Matsumoto Baltazar A Antonio and Yoshiaki Nagamura 1

National Institute of Agrobiological Sciences 2-1-2 Kannondai Tsukuba Ibaraki 305-8602 Japan 1 Corresponding author E-mail tsasakiniasaffrcgojp

The gene predictions by programs such as Genescan (Burge and Karlin 1997 ) FGENESH [see Appendix 1 (4)] and Genemark [see Appendix 1 (5)] BLAST (Altschul et Improving the nutritional value of Golden Rice through increased pro-vitamin A content JA Paine CA Shipton S Chaggar RM Howells MJ hellip - Nature Biotechnology 2005 - naturecom Arabidopsis thaliana psy and rice psy (AY024351) genes identified genomic sequences of similarity in which genes were predicted using FGENESH algorithm with Genetics Published Articles Ahead of Print published on January 16 2005 as 101534genetics104035543 THE GENETIC BASIS FOR INFLORESCENCE VARIATION BETWEEN FOXTAIL AND GREEN MILLET (POACEAE) Andrew N Doust Katrien M Devosdagger1 Mike D Gadberry Mike D Galedagger amp Elizabeth A Kellogg University of Missouri-St Louis Department of Biology One University Boulevard St Louis MO 63121 USA daggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH UK Current address University of Georgia-Athens Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building Athens GA 30602 USA 1 Each of these contigs was scanned using FgeneSH (S ALAMOV and S OLOVYEV 2000) and open reading frames (ORFs) were translated and PLoS Biol 3(1) e13 January 2005

Sorghum Genome Sequencing by Methylation Filtration Joseph A Bedell1 Muhammad A Budiman2 Andrew Nunberg1 Robert W Citek1 Dan Robbins1 Joshua Jones2 Elizabeth Flick2 Theresa Rohlfing3 Jason Fries3 Kourtney Bradford3 Jennifer McMenamy3 Michael Smith4 Heather Holeman4 Bruce A Roe5 Graham Wiley5 Ian F Korf6 Pablo D Rabinowicz7 Nathan Lakey8 W Richard McCombie9 Jeffrey A Jeddeloh4 Robert A Martienssen9 1 Bioinformatics Orion Genomics Saint Louis Missouri United States of America 2 Library Construction Orion Genomics Saint Louis Missouri United States of America 3 Sequencing Orion Genomics Saint Louis Missouri United States of America 4 Biomarkers Orion Genomics Saint Louis Missouri United States of America 5 Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma United States of America 6 Genome Center University of California Davis California United States of America 7 The Institute for Genomic Research Rockville Maryland United States of America 8 Business Orion Genomics Saint Louis Missouri United States of America 9 Cold Spring Harbor Laboratory Cold Spring Harbor New York United States of America additional parameters wordmask=seg lcmask M=1 N=ndash1 Q=3 R=3 kap E=1e-10 hspmax=0 To look for potentially novel genes we used Fgenesh (httpwww BMC Genomics 2005 611 doi1011861471-2164-6-11 FAM20 an evolutionarily conserved family of secreted proteins expressed in hematopoietic cells Demet Nalbant1 Hyewon Youn1 3 S Isil Nalbant1 Savitha Sharma1 Everardo Cobos2 3 Elmus G Beale1 Yang Du1 and Simon C Williams1 3 1Department of Cell Biology and Biochemistry Texas Tech University Health Sciences Center

Lubbock Texas 79430 USA 2Department of Internal Medicine Texas Tech University Health Sciences Center Lubbock Texas 79430 USA 3Southwest Cancer Center at University Medical Center Lubbock Texas 79430 USA These results were compared against genes assembled by two gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmit Plant Physiology July 2005 Vol 138 pp 1205-1215 Complex Organization and Evolution of the Tomato Pericentromeric Region at the FER Gene Locus1[w] Romain Guyot Xudong Cheng Yan Su Zhukuan Cheng Edith Schlagenhauf Beat Keller and Hong-Qing Ling

2 2

Institute of Plant Biology University of Zurich 8008 Zurich Switzerland (RG ES BK H-QL) and Institute of Genetics and Developmental Biology Chinese Academy of Sciences Chaoyang District Beijing 100101 China (XC YS ZC H-QL) Putative genes were determined by a combination of coding region prediction software (GENSCAN FGENESH and MZEF with Arabidopsis andor monocot matrix J Gen Virol 86 (2005) 973-983 DOI 101099vir080833-0 Cloning characterization and analysis by RNA interference of various genes of the Chelonus inanitus polydnavirus Marianne Bonvin Dorothee Marti Stefan Wyder Dejan Kojic Marc Annaheim and Beatrice Lanzrein Institute of Cell Biology University of Berne Baltzerstrasse 4 CH-3012 Bern Switzerland Correspondence Beatrice Lanzrein beatricelanzreinizbunibech 12g1forw (5-GAGTCCATGCCGAATGTCAC-3) and 12g1rev (5-CTTCTTGCACAGCGACGAAC-3) were set to amplify the middle region of 12g1 as predicted with FGENESH 10 and The Plant Cell 17343-360 (2005) Evolution of DNA Sequence Nonhomologies among Maize Inbreds Stephan Brunner Kevin Fengler Michele Morgante Scott Tingey and Antoni Rafalski a1 a b a a

a DuPont Crop Genetics Research Wilmington Delaware 19880-353 b Universitaacute degli Studi di Udine Dipartimento di Scienze Agrarie ed Ambientali 33100 Udine Italy 1 To whom correspondence should be addressed E-mail stephanbrunnercgrdupontcom fax 302-695-2726 PNAS | February 1 2005 | vol 102 | no 5 | 1566-1571 Published online before print January 24 2005 101073pnas0409421102 A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome

Mark Yandell Adina M Bailey Sima Misra ShengQiang Shu Colin Wiel Martha Evans-Holm Susan E Celniker and Gerald M Rubin para para Howard Hughes Medical Institute and Department of Molecular and Cell Biology University of California Life Sciences Addition Berkeley CA 94720-3200 and paraDepartment of Genome Sciences Lawrence Berkeley National Laboratory One Cyclotron Road Mailstop 64-121 Berkeley CA 94720 genes based on a microarray-based approach that involved hybridizing randomly primed cDNA against probes corresponding to a large set of FGENESH predictions Insect Molecular Biology Volume 14 Issue 2 Page 113 - 119 April 2005 doi101111j1365-2583200400536x Detection and analysis of alternative splicing in the silkworm by aligning expressed sequence tags with the genomic sequence X-F Zha Q-Y Xia P Zhao J Li J Duan Z-L Wang J-F Qian and Z-H Xiang Correspondence Dr Qing-You Xia The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China Tel +86 23 68250748 fax +86 23 68251128 e-mail xiaqyswaucqcn previously predicted silkworm genes in the genomic sequences by BGF a newly developed program based on GENSCAN (Burge amp Karlin 1997) and Fgenesh (Salamov amp Microbiology 151 (2005) 2199-2207 DOI 101099mic027962-0 Overproduction purification and characterization of FtmPT1 a brevianamide F prenyltransferase from Aspergillus fumigatus Alexander Grundmann and Shu-Ming Li Pharmazeutische Biologie Pharmazeutisches Institut Eberhard-Karls-Universitaumlt Tuumlbingen Auf der Morgenstelle 8 72076 Tuumlbingen Germany Correspondence Shu-Ming Li shumingliuni-tuebingende FGENESH (Softberry Inc httpwwwsoftberrycomberryphtml) and the DNASIS software package (version 21 Hitachi Software Engineering) were used for

FGENESH++ Am J Hum Genet 76652-662 2005 Position Effects Due to Chromosome Breakpoints that Map 900 Kb Upstream and 13 Mb

Downstream of SOX9 in Two Patients with Campomelic Dysplasia

Gopalrao V N Velagaleti12 Gabriel A Bien-Willner3 Jill K Northup1 Lillian H Lockhart2

Judy C Hawkins2 Syed M Jalal6 Marjorie Withers3 James R Lupski345 and

Pawel Stankiewicz3 Departments of 1Pathology and 2Pediatrics University of Texas Medical Branch Galveston

Departments of 3Molecular and Human Genetics and 4Pediatrics Baylor College of Medicine and

5Texas Childrens Hospital Houston and 6Department of Laboratory Medicine and Pathology Mayo Clinic Rochester MN hellipIn an effort to identify possible transcripts that may be responsible for the CD phenotype we

used several gene-prediction programs and identified seven hypothetical transcripts in the region

that spans 100 kb in either direction from the breakpoint on chromosome 17 Ecgenes H17C123061 and H17C123081 SGP genes Chr17_15381 and Ch17_15391 Fgenesh++ gene C17001650 and Genscan genes NT_01064144 and NT_01064145hellip Genome Research 15566-576 2005 ECgene Genome-based EST clustering and gene modeling for alternative splicing Namshin Kim Seokmin Shin and Sanghyuk Lee12 2 13 1 Division of Molecular Life Sciences Ewha Womans University Seoul 120-750 Korea 2 School of Chemistry Seoul National University Seoul 151-747 Korea hellipthe structure of full-length mRNA can be inferred by examining the flanking genomic region especially with the aid of ab initio gene predicting programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh++ (Salamov and Solovyev 2000 )hellip Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA hellip we built a three-way synteny map based on chains of Fgenesh++-predicted (Solovyev 2002 ) exons rather than whole genes hellip Genome Research 14539-548 2004 Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of

Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA hellipThe gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom )hellip Nucleic Acids Research 2003 Vol 31 No 1 207-211 copy 2003 Oxford University Press The PEDANT genome database Dmitrij Frishman Martin Mokrejs Denis Kosykh Gabi Kastenmuumlller Grigory Kolesov Igor Zubrzycki Christian Gruber Birgitta Geier Andreas Kaps Kaj Albermann Andreas Volz Christian Wagner Matthias Fellenberg Klaus Heumann and Hans-Werner Mewes

1 1 1 1 1

1 2 2 2 2

2 2 2 2 13

1 Institute for Bioinformatics GSF - National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 85764 Neueherberg Germany 2 Biomax Informatics AG Lochhamer Straszlige 11 82152 Martinsried Germany 3 Department of Genome-oriented Bioinformatics Wissenschaftszentrum Weihenstephan Technische Universitaumlt Muumlnchen 85350 Freising Germany To whom correspondence should be addressed Tel +49 89 31874201 Fax +49 89 31873585 Email dfrishmangsfde The mouse database contains 20 chromosome contigs with 37 793 genes predicted using the Fgenesh++ software (wwwsoftberrycom) Reprint from Daily Biotech Updates www genengnewscom Vol 22 No 17 October 1 2002 DrugDiscovery Tech NoteAn Enhanced Human-Genome Database Transforming Raw Human Sequence Data Into Useful Information Christine Schuumlller PhD and Andreas Fritz PhD The Softberry analysis results for which Biomax has the exclusive world-wide commercial license contain approximately 40000 genes which agrees well with predictions of the total number of human genes (according to the International Human Genome Sequencing Consortium or IHGSC) hellip For example 50 of the genes in the Biomax Human Genome Database are not found in the Ensembl database These genes (identified by FGENESH++ and Biomax and not found in Ensembl database) comprise the following 6 of genes classified as known genes 50 classified as having some similarity to known genes and 90 of the genes not having similarity to known genes

For human genome applications the FGENESH++ software was first used to map known human genes using sequences available from the Reference Sequence (RefSeq) Project at the Nation al Center for Biotechnology Information (NCBI Bethesda MD wwwncbinlm nihgovLocusLinkrefseqhtml) REFERENCES 1 Salamov AA and Solovyev VVAb initio gene finding in Drosophila genomic DNA Genome Res 10 391ndash7 (2000) Genome Research 14539-548 2004 ISSN 1088-9051 $500 Letter Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA The gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom ) Published online before print June 12 2003 101101gr529803 Genome Research 131765-1774 2003 ISSN 1088-9051 $500 Identification of Promoter Regions in the Human Genome by Using a Retroviral Plasmid Library-Based Functional Reporter Gene Assay Shirin Khambata-Ford15 Yueyi Liu2 Christopher Gleason1 Mark Dickson3 Russ B Altman2 Serafim Batzoglou4 and Richard M Myers136 1 Department of Genetics Stanford University School of Medicine Stanford California 94305 USA 2 Stanford Medical Informatics Stanford University School of Medicine Stanford California 94305 USA 3 Stanford Human Genome Center Stanford University School of Medicine Stanford California 94305 USA 4 Department of Computer Science Stanford University Stanford California 94305 USA hellipOf 858 sequences 9 of GFP+ low clones and 8 of GFP+ high clones aligned to the 2-kb

segment upstream of the transcription start site of a predicted gene in at least two of four data sets of predicted genes from Genscan Ensembl Softberry (Fgenesh++) and Acembly (category

B in Table 1) Cell Vol 110 521ndash529 August 23 2002 Copyright 2002 by Cell Press HIV-1 Integration in the Human Genome Favors Active Genes and Local Hotspots

Astrid RW Schrouml der1 Paul Shinn2 Huaming Chen2 Charles Berry3 Joseph R Ecker2 and Frederic Bushman14 1Infectious Disease Laboratory 2Genomic Analysis Laboratory The Salk Institute 10010 North Torrey Pines Road La Jolla California 92037 3Department of FamilyPreventive Medicine School of Medicine University of California San Diego San Diego California 92093 hellipAn integration target sequence was scored as a part of a transcrip-tion unit if it was (1) a member of the Refseq set of well-studied genes (httpwwwncbinlmnihgovLocusLinkrefseqhtml) or (2) if it was predicted to be a transcription unit by the ENSEMBLE (httpwwwensemblorg) or Fgenesh++ (httpwwwsoftberrycomHelpfgeneshplus2htm) programs and if that assignment was supported by mRNA or spliced EST sequence evidence Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation

FGENESH+ Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA Fgenesh+ gene prediction is conducted on sequences with protein homology Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation Annual Review of Genomics and Human Genetics Vol 3 293-310 (Volume publication date September 2002) (doi101146annurevgenom3030502101529) DATABASES AND TOOLS FOR BROWSING GENOMES Ewan Birney 1 Michele Clamp and 2 Tim Hubbard2 1European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom e-mail birneyebiacuk 2Wellcome Trust Sanger Institute Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom michelesangeracuk thsangeracuk

Another predicted gene track on the UCSC browser comes from Softberry ( http wwwsoftberrycom ) and uses a program Fgenesh+ which is based on HMMs and

FGENESB Appl Environ Microbiol 2004 April 70(4) 2332ndash2341 Oxygen-Controlled Bacterial Growth in the Sponge Suberites domuncula toward a Molecular Understanding of the Symbiotic Relationships between Sponge and Bacteriadagger

Werner E G Muumlller Vladislav A Grebenjuk Narsinh L Thakur Archana N Thakur Renato Batel Anatoli Krasko Isabel M Muumlller and Hans J Breter Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz D-55099 Mainz Germany Corresponding author Mailing address Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz Duesbergweg 6 55099 Mainz Germany Phone 6131-3925910 Fax 6131-3925243 E-mail wmuellermailuni-mainzde For genes and potential promoter prediction we used the FGENESB-PatternMarkov chain-based bacterial operon and gene prediction program from the SoftBerry Journal of Theoretical Biology 230 (2004) 133ndash144 Computational prediction of conserved operons and phylogenetic footprinting of transcription regulatory elements in the metal-reducing bacterial family Geobacteraceae Bin Yana Barbara A Metheґ b Derek R Lovleyc Julia Krushkala aDepartment of Preventive Medicine Center of Genomics and Bioinformatics University of Tennesee Health Science Center 66 N Pauline St Ste 633 Memphis TN 38163 USA bThe Institute for Genomic Research Rockville MD USA cDepartment of Microbiology Morrill Science Center IV North University of Massachusetts 639 North Pleasant Str Amherst MA 01003 USA the conserved nature of the operons 2 Operons in Geobacter sulfurreducens were predicted ab initio by the public version of program FGENESB (V Solovyev and V Molecular Microbiology Volume 52 Issue 6 Page 1579 -1596 June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Copyright copy 2003 The National Academy of Sciences Proc Natl Acad Sci U S A 2003 October 28 100(22) 12830ndash12835 doi 101073pnas2133554100 Published online 2003 October 17 Evolution Proteorhodopsin genes are distributed among divergent marine bacterial taxa Joseacute R de la TorredaggerDagger Lynne M Christiansondagger Oded Beacutejagravedaggersect Marcelino T Suzukidaggerpara David M Karl John Heidelberg and Edward F DeLongdaggerdaggerdagger

daggerMonterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 sectDepartment of Biology Technion-Israel Institute of Technology Haifa 32000 Israel

paraChesapeake Biological Laboratory University of Maryland Solomons MD 20688 Department of Oceanography University of Hawaii Manoa HI 96822 and Institute for Genomic Research Rockville MD 20850 Edited by Sallie W Chisholm Massachusetts Institute of Technology Cambridge MA and approved August 21 2003 (received for review 2003 June 10) DaggerPresent address Department of Civil and Environmental Engineering University of Washington Seattle WA 98195 daggerdagger To whom correspondence should be addressed E-mail delongmbariorg hellip Analysis of the potential genes and protein-coding regions was performed by using a combination of the BLAST (11) GLIMMER 202 (TIGR) (12 13) FGENESB (Softberry Mount Kisco NY) and ARTEMIS (Sanger Center Cambridge University UK) (14) software packages Environmental Microbiology September 2004 vol 6 no 9 pp 903-910(8) DOI 101111j1462-2920200400676x Different SAR86 subgroups harbour divergent proteorhodopsins Gazalah Sabehi1 Oded Beacutejagrave1 Marcelino T Suzuki2 Christina M Preston3 Edward F DeLong4

Affiliations 1 Department of Biology Technion-Israel Institute of Technology Haifa 32000 Israel 2 Chesapeake Biological Laboratory University of Maryland Center for Environmental Sciences Solomons MD 20688 USA 3 Monterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 USA 4 Massachusetts Institute of Technology Cambridge MA 02139 USA

program FGENESB (Softberry) and the annotation was subsequently refined and curated manually using ARTEMIS (Sanger Center) Fig

FGENESV Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song Qi Wei Qin Jin Qiu Can Hua Huang Fan Wang and Choy Leong Hew1 2 1 1 1 1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2

A total of 162 ORFs predicted by the FGENESV program (available through httpwwwsoftberrycom) supplemented with Vector NTI suite 71 are indicated doi101023BVIRU000002577148128f8 Virus Genes 28 (3) 239-246 April 2004 Article ID 5269250 Complete Nucleotide Sequence of a Strawberry Isolate of Beet Pseudoyellows Virus Ioannis E Tzanetakis Molecular and Cellular Biology Program Department of Botany and Plant Pathology Oregon State University Corvallis 97331 USA Robert R Martin Horticultural Crops Research Laboratory USDA-ARS Corvallis OR 97330 USA E-mail martinrrscienceoregonstateedu httpwwwncbinlmnih govgorfgorfhtml) and the gene finder in viruses at httpwwwsoftberrycom The amino acid comparisons Geno Prot amp Bioinfo Vol 1 No 3 August 2003 226-235 Genome Organization of the SARS-CoV Jing Xu1 Jianfei Hu21 Jing Wang21 Yujun Han1 Yongwu Hu13 Jie Wen1 Yan Li1 Jia Ji1 Jia Ye14 Zizhang Zhang5 Wei Wei4 Songgang Li12 Jun Wang1 Jian Wang14 Jun Yu14 and Huanming Yang14 1 Beijing Genomics Institute Chinese Academy of Sciences Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3Wenzhou Medical College Wenzhou 325003 China 4 James D Watson Institute of Genome Sciences Zhijiang Campus Zhejiang University and Hangzhou Genomics Institute Hangzhou 310008 China 5 College of Materials Science and Chemical Engineering Yuquan Campus Zhejiang University Hangzhou 310027 China These authors contributed equally to this work Corresponding authors E-mail junyugenomicsorgcn yanghmgenomicsorgcn hellipFGENESV a program for gene prediction provided by Softberry Inc (Mount Kisco USA) through a web-based interface has been specially modimacred and trained with parameters for virus (httpwwwsoftberrycomberryphtmltopic= gfindv) hellip

The hypothetical minus sense ORF iden-timacred by FGENESV (from 48 to 203 nt on the minus strand or 29523 to 29678 nt on the plus strand) may be fake but we should not absolutely deny the prob-ability of the existence of minus ORFs hellipFurthermore we employed FGENESV to explore the sequences of MHV (NC 001846 in NCBI) and AIBV (NC 001451 in NCBI) and compared the re-sults with their previous annotations respectively Rapport de stage de DEA Juin 2003 Analyse du geacutenome du virus de lrsquoarcheacutee Pyrococcus abyssi (PAV1) ROUAULT Karen Laboratoire de Microbiologie et Biotechnologie des Extrecircmophiles IFREMER- Centre de Brest et Equipe Microbiologie LEMAR ndash Institut Universitaire Europeacuteen de la Mer [14] FGENESV httpwwwsoftberrycomberry phtmltopic=gfindv Virus ( gt10 kb) Modegraveles de Markov Forme du geacutenome Code geacuteneacutetique [40]

FGENES-M BMC Bioinformatics 2005 6 25 Published online 2005 February 10 doi 1011861471-2105-6-25 Integrating alternative splicing detection into gene prediction Sylvain Foissac 1 and Thomas Schiex11Uniteacute de Biomeacutetrie et Intelligence Artificielle INRA 31326 Castanet Tolosan France

Corresponding author Sylvain Foissac foissactoulouseinrafr Thomas Schiex tschiextoulouseinrafrReceived July 27 2004 Accepted February 10 2005 This approach has been applied eg in HMMgene or in FGENES-M (unpub)hellip DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39 - 43 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom) DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39-43 DOI 1010801042517032000160189 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom)

BESTORF Eukaryotic Cell March 2005 p 526-535 Vol 4 No 3 1535-977805$0800+0 doi101128EC43526-5352005 Sex-Specific Homeodomain Proteins Sxi1 and Sxi2a Coordinately Regulate Sexual Development in Cryptococcus neoformans Christina M Hull1 Marie-Josee Boily and Joseph Heitman1 12 Department of Molecular Genetics and Microbiology1 the Howard Hughes Medical Institute Duke University Medical Center Durham North Carolina2 Received 14 January 2005 Accepted 14 January 2005 Corresponding author Mailing address 322 CARL Building Box 3546 Department of Molecular Genetics and Microbiology 100 Research Dr Duke University Medical Center Durham NC 27710 Phone (919) 684-2824 Fax (919) 684-5458 E-mail heitm001dukeeduPresent address Departments of Biomolecular Chemistry and Medical Microbiology amp

Immunology University of WisconsinmdashMadison Madison WI 53706 Sequence manipulations Splice predictions of candidate gene sequences for SXI2a were facilitated with a Softberry algorithm (wwwsoftberrycom) hellipWe utilized the BESTORF gene prediction algorithm from Softberry Inc to electronically produce predicted spliced cDNA products encoded by a 10-kb regionhellip

PROTCOMP Genes and Immunity 2005 v5 n4 - naturecom Immune response in silico(IRIS) immune-specific genes identified from a compendium of microarray hellip AR Abbas D Baldwin Y Ma W Ouyang A Gurney F hellip The Protcomp algorithm (Softberry Inc) predicts for the 1589 IRIS genes with ORFs that 24 of the encoded proteins are in the plasma membrane 13 are MPMI Vol 17 No 7 2004 pp 789ndash797 Publication no M-2004-0426-01R copy 2004 The American Phytopathological Society Lotus japonicus LjKUP Is Induced Late During Nodule Development and Encodes a Potassium Transporter of the Plasma Membrane Guilhem Desbrosses Claudia Kopka Thomas Ott and Michael K Udvardi Max Planck Institute of Molecular Plant Physiology Am Muumlhlenberg 1 14476 Golm Germany Submitted 3 November 2003 Accepted 13 February 2004 hellipBoth PSORT and Protcomp predicted a PM location for LjKUPhellip Planta DOI 101007s00425-003-1182-5 Issue Volume 218 Number 6 Date April 2004 Pages 965 - 975 Biochemical and immunological characterization of pea nuclear intermediate filament proteins Sonal S D Blumenthal1 Gregory B Clark1 and Stanley J Roux1

(1) School of Biological Sciences Section of Molecular Cell and Developmental Biology The University of Texas Austin TX 78712 USA Stanley J Roux Email srouxutsccutexasedu html) BCM Search Launcher (Protein structure prediction http searchlauncher bcmtmcedu) SoftBerry (Protein subcellular localization Comparative and Functional Genomics Volume 5 Issue 4 Pages 342 - 353Published Online 20 May 2004 Research Paper Investigation into the use of C- and N-terminal GFP fusion proteins for subcellular localization studies using reverse transfection microarrays Ella Palmer Tom Freeman

MRC Rosalind Franklin Centre for Genomics Research (formerly the HGMP-Resource Centre) Genome Campus Hinxton Cambridge CB10 1SB UK email Tom Freeman (tfreemanrfcgrmrcacuk) Correspondence to Tom Freeman RFCGR Hinxton CambridgeCB10 1SB UK

ProtComp version 4 (Softberry) combines results with proteins of known subcellular localization and assumed subcellular localization (based on theoret- ical Plant Physiol2004 134 286-295 RHM2 Is Involved in Mucilage Pectin Synthesis and Is Required for the Development Usadel et al Tentative subcellular localization prediction by TargetP (Emanuelsson et al 2000 ) or ProtComp (httpwwwsoftberrycom) a prediction software trained on Journal of Cellular BiochemistryVolume 90 Issue 2 Pages 361 - 378Published Online 3 Sep 2003 A proteomic study of the arabidopsis nuclear matrix Tomasz T Calikowski 1 3 Tea Meulia 2 Iris Meier 1

1Department of Plant Biology and Plant Biotechnology Center Ohio State University Columbus Ohio 43210 2Molecular and Cellular Imaging Center Ohio Agricultural and Research Development Center Ohio State University Columbus Ohio 43210 3Institute of Biochemistry and Biophysics Polish Academy of Sciences UL Pawinskiego 5A 02-106 Warszawa Poland email Iris Meier (meier56osuedu)Correspondence to Iris Meier Department of Plant Biology and Plant Biotechnology Center Ohio State University 244 Rightmire Hall 1060 Carmack Rd Columbus OH 43210 For prediction of subcellular localization ProtComp 4 (Softberry Inc Mount Kisco NY httpwwwsoftberrycomberryphtmltopicfrac14 proteinloc) PSORT v64 Cellular Molecular Life Sciences 2003 in press Automatic prediction of protein function Burkhard Rost 1 2 3 Jinfeng Liu 1 3 4 Rajesh Nair 1 5 Kazimierz O Wrzeszczynski 1 and Yanay Ofran 16

1 CUBIC Dept of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 2 Columbia University Center for Computational Biology and Bioinformatics (C2B2) Russ Berrie Pavilion 1150 St Nicholas Avenue New York NY 10032 USA

3 North East Structural Genomics Consortium (NESG) Department of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 4 Dept of Pharmacology Columbia Univ 630 West 168th Street New York NY 10032 USA 5 Dept of Physics Columbia Univ 538 West 120th Street New York NY 10027 USA 6 Dept of Medical Informatics Columbia Univ 630 West 168th Street New York NY 10032 USA Corresponding author cubiccubicbioccolumbiaedu URL httpcubicbioccolumbiaedu Tel +1-212-305-4018 fax +1-212-305-7932

genomelocalize ProtComp predict localization for plants httpwwwsoftberrycomberryphtmltopic=proteinloc Predotar predict Published online before print September 15 2003 101101gr1293003 Genome Research 132265-2270 2003 The Secreted Protein Discovery Initiative (SPDI) a Large-Scale Effort to Identify Novel Human Secreted and Transmembrane Proteins A Bioinformatics Assessment Hilary F Clark1 Austin L Gurney Evangeline Abaya Kevin Baker Daryl Baldwin Jennifer Brush Jian Chen Bernard Chow Clarissa Chui Craig Crowley Bridget Currell Bethanne Deuel Patrick Dowd Dan Eaton Jessica Foster Christopher Grimaldi Qimin Gu Philip E Hass Sherry Heldens Arthur Huang Hok Seon Kim Laura Klimowski Yisheng Jin Stephanie Johnson James Lee Lhney Lewis Dongzhou Liao Melanie Mark Edward Robbie Celina Sanchez Jill Schoenfeld Somasekar Seshagiri Laura Simmons Jennifer Singh Victoria Smith Jeremy Stinson Alicia Vagts Richard Vandlen Colin Watanabe David Wieand Kathryn Woods Ming-Hong Xie Daniel Yansura Sothy Yi Guoying Yu Jean Yuan Min Zhang Zemin Zhang Audrey Goddard William I Wood and Paul Godowski Departments of Bioinformatics Molecular Biology and Protein Chemistry Genentech Inc South San Francisco California 94080 USA 1 Corresponding author E-MAIL hclarkgenecomFAX (650) 225-5389 An automated computational strategy was utilized to query each protein translation with the Signal Sensor Sighmm Tmdetect (T Wu unpubl) hmmpfam (Eddy 1998 ) and Protcomp (Softberry Inc) algorithms hellipThe Protcomp algorithm predicts the subcellular localization of a protein on the basis of homology to well-annotated proteins a neural net and various protein motifs In this case the Protcomp subcellular localization prediction was used to categorize these genes as Other Secreted Other Transmembrane or Other Cytoplasmic or Nuclear Plant Physiol February 2002 Vol 128 pp 336-340 Gene prediction in eukaryota Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 genomelocalize ProtComp predict localization for plants httpwww softberrycomberryphtmltopic=proteinloc Predotar predict Proc Natl Acad Sci U S A 2001 April 24 98(9) 5341ndash5346 doi 101073pnas101534498 Published online 2001 April 17 Plant Biology The Cia5 gene controls formation of the carbon concentrating mechanism in Chlamydomonas reinhardtii Youbin Xiang Jun Zhang and Donald P Weeks

Department of Biochemistry and School of Biological Sciences University of Nebraska Lincoln NE 68588-0664

Edited by Bob B Buchanan University of California Berkeley CA and approved March 14 2001 (received for review 2000 November 8) To whom reprint requests should be addressed E-mail dweeks1unledu hellipComputer-assisted analysis of the CIA5 aa sequence (PROTCOMP version 4 httpwwwsoftberrycom) predicted a nuclear localization of the protein hellipFinally computer program predictions (eg PROTCOMP version 4 httpwwwsoftberrycom) for a nuclear localization of CIA5 and the clear-cut nuclear localization of CIA5 in onion epidermal cells (Fig 3) provide additional weight to the argument that CIA5 may be a transcription factor Dissertation zur Erlangung des akademischen Grades Dr rer nat der Fakultaumlt der Naturwissenschaften der Universitaumlt Ulm Untersuchungen zur Identifizierung von Faktoren und Mechanismen der mRNA 3 Prozessierung und Degradation in Chloroplasten houmlherer Pflanzen vorgelegt von Michael Walter aus Immenstadt i Allgaumlu Abteilung Molekulare Botanik Universitaumlt Ulm Ulm November 2001 Tag der Promotion 19 Feb 2002 Algorithmen zur Vorhersage der subzellulaumlren Lokalisation - PSORT httppsortnibbacjp8800formhtml (Nakai und Kanehisa 1992) - ChloroP httpwwwcbsdtudkservicesChloroP (Emanuelsson et al 2000) - TargetP httpwwwcbsdtudkservicesTargetP (Emanuelsson et al 2000) - Predotar httpwwwinrafrInternetProduitsPredotar - Softberry httpwwwsoftberrycom

BPROM Extremophiles Issue Volume 9 Number 2 Date April 2005 Pages 99 ndash 109 DOI 101007s00792-004-0425-0

The genome of BCJA1c a bacteriophage active against the alkaliphilic bacterium Bacillus clarkii

Andrew M Kropinski1 Melissa Hayward1 M Dorothy Agnew1 and Ken F Jarrell1

(1) Department of Microbiology and Immunology Queens University Kingston ON K7L 3N6 Canada

al 2002) Promoters were predicted using Softberryrsquos BPROM program at httpwwwsoftberry comberry phtmltopic=promoter

Journal of Bacteriology February 2005 p 1091-1104 Vol 187 No 3 0021-919305$0800+0 doi101128JB18731091-11042005

The Generalized Transducing Salmonella Bacteriophage ES18 Complete Genome Sequence and DNA Packaging Strategy

Sherwood R Casjens12 Eddie B Gilcrease1 Danella A Winn-Stapley1 Petra Schicklmaier3 Horst Schmieger3 Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24 Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah1 Department of Biological Sciences4 Pittsburgh Bacteriophage Institute University of Pittsburgh Pittsburgh Pennsylvania 2 Institut fuumlr Genetik und Mikrobiologie Universitaumlt Muumlnchen Munich Germany3 Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu Present address Biology Department MIT Cambridge MA 02139 Present address Biogen Idec GmbH D-85737 Ismaning Germany

Received 1 September 2004 Accepted 3 November 2004

The DNA sequence analysis software used was DNA Strider (24) GeneMark (5) Staden programs (78) BLAST (2) BPROM (httpwwwsoftberrycomberryphtmltopic

Infection and Immunity May 2005 p 2899-2909 Vol 73 No 5 Characterization of the Major Secreted Zinc Metalloprotease- Dependent GlycerophospholipidCholesterol Acyltransferase PlaC of Legionella pneumophila Sangeeta Banerji1 Mayte Bewersdorff1 Bjoumlrn Hermes1 Nicholas P Cianciotto2 and Antje Flieger1 Robert Koch-Institut Berlin Germany1 Department of Microbiology-Immunology Northwestern University Medical School Chicago Illinois2 Received 25 October 2004 Returned for modification 18 November 2004 Accepted 22 December 2004

Corresponding author Mailing address Robert Koch-Institut Research Group NG5 Pathogenesis of Legionella Infections Nordufer 20 D-13353 Berlin Germany Phone 49-30-4547-2522 Fax 49-30-4547-2328 E-mail fliegerarkide MB and BH contributed equally to this work

legion) (12) Nucleotide sequences were also analyzed for promoters using the web-based program BPROM (wwwsoftberrycom) Sequence Journal of Bacteriology April 2005 p 2458-2468 Vol 187 No 7 The Type III-Dependent Hrp Pilus Is Required for Productive Interaction of Xanthomonas campestris pv vesicatoria with Pepper Host Plants Ernst Weber1 Tuula Ojanen-Reuhs2 Elisabeth Huguet3 Gerd Hause4 Martin Romantschuk2 Timo K Korhonen2 Ulla Bonas13 and Ralf Koebnik1 Institute of Genetics1 Biozentrum Martin Luther University Halle Germany4 General Microbiology Faculty of Biosciences University of Helsinki Helsinki Finland2 Institut des Sciences Veacutegeacutetales CNRS Gif-sur-Yvette France3 Received 10 November 2004 Accepted 28 December 2004 Corresponding author Mailing address Martin-Luther-Universitaumlt Institut fuumlr Genetik Weinbergweg 10 D-06120 Halle (Saale) Germany Phone 49 345 5526293 Fax 49 345 5527151 E-mail koebnikgmxde Present address Purdue University Department of Food Sciences West Lafayette IN 47907 Present address Institut de Recherche sur la Biologie de lInsecte UMR CNRS 6035 Faculteacute

des Sciences F-37200 Tours France Present address University of Helsinki Department of Ecological and Environmental Sciences

FIN-15140 Lahti Finland

The promoter recognition program BPROM (Softberry Inc Mt Kisco NY) was used for prediction of bacterial sigma70 promoter motifs RESULTS

J Bacteriol 2004 September 186(17) 5945ndash5949 doi 101128JB186175945-59492004 Identification of Operators and Promoters That Control SXT Conjugative Transfer John W Beaber and Matthew K Waldor

Department of Microbiology Tufts University School of Medicine and Howard Hughes Medical Institute Boston Massachusetts Corresponding author Mailing address Tufts University School of Medicine 136 Harrison Ave Jaharis 425 Boston MA 02111 Phone (617) 636-2730 Fax (617) 636-2723 E-mail matthewwaldortuftsedu Received April 1 2004 Accepted May 24 2004 hellipComputer algorithms and 5prime random amplification of cDNA ends (RACE) were used to define the setR and s086 transcription start sites Software for the identification of bacterial promoters (httpwwwsoftberrycomberryphtmltopic=bpromampgroup=programsampsubgroup=gfindb) identified putative minus10 and minus35 elements for both PL and PR (Fig 2) (23 24)hellip JOURNAL OF BACTERIOLOGY Mar 2004 p 1818ndash1832 Vol 186 No 6

The pKO2 Linear Plasmid Prophage of Klebsiella oxytoca Sherwood R Casjens12 Eddie B Gilcrease1 Wai Mun Huang1 Kim L Bunny3

Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24

Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah 841321 Pittsburgh Bacteriophage Institute2 and Department of Biological Sciences4 University of Pittsburgh Pittsburgh Pennsylvania 15260 and Section of Microbiology University of California at Davis Davis California 956163

Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu hellipThe DNA sequence analysis software packages used were DNA Strider (27) GeneMark (8) the Staden programs (94) BLAST (3) BPROM httpwwwsoftberrycomberryphtmltopic_gfindb) and DNA Master (J Lawrence [httpcobamide2biopittedu])hellip

BMC Microbiology 2004 44 Analysis of the lambdoid prophage element e14 in the E coli K-12 genome Preeti Mehta1 Sherwood Casjens2 and Sankaran Krishnaswamy1 Address 1Bioinformatics Centre School of Biotechnology Madurai Kamaraj University Madurai-625021 India and 2University of Utah Medical School Department of Pathology 90 North 1900 East Salt Lake City UT 84132-2501 USA Email Preeti Mehta - mehta_p74yahoocom Sherwood Casjens - sherwoodcasjenspathutahedu Sankaran Krishnaswamy - krishnamrnatnnicin Corresponding author This article is available from httpwwwbiomedcentralcom1471-218044 Putative promoters predicted using BPROM available at the website http wwwsoftberrycom Scores are as given by BPROM Promoters Plant Molecular Biology 53 (6) 865-876 December 2003 Prokaryotic orthologues of mitochondrial alternative oxidase and plastid terminal oxidase Allison E McDonald Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Sasan Amirsadeghi Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Greg C Vanlerberghe Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada (e-mail gregvutscutorontoca The A variabilis PTOX sequence was analyzed in the upstream region of the start codon with Softberryrsquos BPROM software (httpwwwsoftberrycom)

SPLICEDB Plant Molecular Biology DOI 101007s11103-005-0271-1 Issue Volume 57 Number 3 Date February 2005 Pages 445 - 460 Evaluation of five ab initio gene prediction programs for the discovery of maize genes Hong Yao1 4 Ling Guo1 6 Yan Fu1 4 Lisa A Borsuk1 6 Tsui-Jung Wen2 David S Skibbe1 5 Xiangqin Cui1 4 9 Brian E Scheffler8 Jun Cao1 4 Scott J Emrich6 Daniel A Ashlock3 6 and Patrick S Schnable1 2 4 5 6 7

(1)Department of Genetics Development and Cell Biology Iowa State University Ames Iowa 50011-3650 (2) Department of Agronomy Iowa State University Ames Iowa 50011-3650 (3) Department of Mathematics Iowa State University Ames Iowa 50011-3650 (4) Inderdepartmental Graduate Programs in Genetics Iowa State University Ames Iowa 50011-3650 (5) Department of Molecular Cellular and Developmental Biology Iowa State University Ames Iowa 50011-3650 (6) Department of Electrical and Computer Engineering and Department of Bioinformatics and Computational Biology Iowa State University Ames Iowa 50011-3650 (7) Center for Plant Genomics Iowa State University Ames Iowa 50011-3650 (8) Mid South Area Genomics Facility USDA-ARS Stoneville MS 38776-0038 USA(9) Present address Department of Biostatistics Birmingham AL 35294 USA model FGENESH httpwwwsoftberrycom berryphtmltopic=fgeneshampgroup= programsampsubgroup=gfind Monocots Yes Yes Yes GHMM a GeneMark Finding short DNA motifs using permuted markov models X Zhao H Huang TP Speed The data are human donor sequences from SpliceDB [9] a recently developed database of known mammalian splice site sequences (httpwwwsoftberrycomspldb Current Opinion in Structural Biology 2004 14273ndash282 The evolving roles of alternative splicing Liana F Lareau1 Richard E Green1 Rajiv S Bhatnagar23 and Steven E Brenner12_ Departments of 1Molecular and Cell Biology and 2Plant and Microbial Biology University of California Berkeley California 94720 USA 3Department of Dermatology University of California San Francisco California 94143 USA _e-mail brennercompbioberkeleyedu [79] SpliceDB httpwwwsoftberrycomberryphtmltopicfrac14splicedb Database and composition statistics for mammalian splice sites inferred from ESTs [80] Yearbook of Medical Informatics Review Paper 2004 121-136 Curated databases and their role in clinical bioinformatics CC Englbrecht M Han MT Mader A Osanger KFX Mayer MIPS Institute for Bioinformatics Address of the authors Claudia C Englbrecht Michael Han

Michael T Mader Andreas Osanger Klaus F X Mayer MIPS Institute for Bioinformatics GSF - National Research Center for Environment and Health 85758 Neuherberg Germany E-mail kmayergsfdeCorresponding author hellipSpliceDB httpwwwsoftberrycomspldbSpliceDBhtmlCanonical and non-canonical mammalian splice sites [122] 122Burset M Seledtsov IA Solovyev VV SpliceDB database of canonical and non-canonical mammalian splice sites Nucleic Acids Res 200129255-9 Nucleic Acids Research 2001 Vol 29 No 1 255-259 SpliceDB database of canonical and non-canonical mammalian splice sites M Burset I A Seledtsov1 and V V Solovyev The Sanger Centre Hinxton Cambridge CB10 1SA UK and 1Softberry Inc 108 Corporate Park Drive Suite 120 White Plains NY 10604 USA To whom correspondence should be addressed at present address EOS Biotechnology 225A Gateway Boulevard South San Francisco CA 94080 USA Tel +1 650 246 2331 Fax +1 650 583 3881 Email solovyeveosbiotechcom Present address M Burset Institut Municipal drsquoInvestigacioacute Megravedica (IMIM) CDr Aiguader 80 08003 Barcelona Spain

SCAN2 African Journal of Biotechnology Vol 2 (12) pp 714-718 December 2003 Available online at httpwwwacademicjournalsorgAJB ISSN 1684ndash5315 copy 2003 Academic Journal Accepted 14 November 2003 Minireview Web-based bioinformatic resources for protein and nucleic acids sequence alignment Kamel A Abd-Elsalam Molecular Markers Lab Plant Pathology Research Institute Agricultural Research Center Orman 12619 Giza Egypt E-mail kaabdelsalammsncom 16-SCAN2 program for aligning two multimegabyte-size sequences httpwwwsoftberrycomberryphtmltopic=scanhampprg= SCAN2 derived Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK The full-length sequences of gene pairs have been aligned by the SCAN2 program (httpsoftberrycomberryphtmltopic=scanhampprg=SCAN2) which can align

PDISORDER BMC Bioinformatics 2005 622 doi1011861471-2105-6-22 Research article Open Access Proteins with two SUMO-like domains in chromatin-associated complexes The RENi (Rad60-Esc2-NIP45) family Maria Novatchkova1 Andreas Bachmair3 Birgit Eisenhaber2 and Frank Eisenhaber2 Address 1Gregor Mendel-Institut GMI Austrian Academy of Sciences Vienna Biocenter A-1030 Vienna Austria 2Research Institute of Molecular Pathology Dr Bohr-Gasse 7 A-1030 Vienna Austria and 3Max Planck Institute for Plant Breeding Research Carl-von-Linneacute-Weg 10 D-50829 Cologne Germany Email Maria Novatchkova - marianovatchkovagmioeawacat Andreas Bachmair - bachmairmpiz-koelnmpgde Birgit Eisenhaber - b_eisenimpunivieacat Frank Eisenhaber - FrankEisenhaberimpunivieacat Corresponding author hellipInitial analysis of its sequence complexity shows that the disordered N-terminal half of the protein is followed by a likely globular segment (predicted using Pdisorder by Softberry Inc)hellip

SPL The National Academy of Sciences Proc Natl Acad Sci U S A 2003 November 25 100(Suppl 2) 14537ndash14542 doi 101073pnas2335847100 Molecular evolution of the insect chemoreceptor gene superfamily in Drosophila melanogaster Hugh M Robertsondagger Coral G WarrDaggersect and John R Carlsonsect Department of Entomology University of Illinois 505 South Goodwin Avenue Urbana IL 61801 DaggerSchool of Biological Sciences Monash University Clayton VIC 3800 Australia and sectDepartment of Molecular Cellular and Developmental Biology Yale University New Haven CT 06520 The genes were reconstructed manually in the PAUP editor (23) by using the expected exonintron structures as guides and the SPL program (Softberry wwwsoftberrycomberryphtml) to locate predicted introns

NSITE GENETIC ANALYSES OF BOVINE CARD15 A PUTATIVE DISEASE RESISTANCE GENE A Dissertation by KRISTEN HAWKINS TAYLOR Submitted to Texas AampM University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 2004 Major Subject Genetics hellipShort motifs identified as being conserved between the three species in these intronic regions as well as in the 5rsquoUTR and 3rsquoUTR were then analyzed using the TFSCAN (httpzeonwelloxacukgit-bintfscan) and NSITE (available through SoftBerry httpwwwsoftberrycomberryphtmltopic=promoter) programs to identify putative regulatory motifs Motifs selected for analysis required homology consisting of 6 or more bases with no more than 2 substitutions among the 3 specieshellip hellipSequence that included the SNPs located within intronic regions and in the 3rsquo and 5rsquoUTRs were analyzed using NSITE (available through SoftBerry at httpwwwsoftberrycomberryphtmltopic=promoter) to identify putative regulatory motifshellip

TSSP Bioinformatics 2005 21(14)3074-3081 doi101093bioinformaticsbti490 Cis-regulatory element based targeted gene finding genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana Weixiong Zhang 12 Jianhua Ruan 1 Tuan-hua David Ho 3 Youngsook You 3 Taotao Yu 1 and Ralph S Quatrano 3 1Department of Computer Science and Engineering Washington University in Saint Louis Saint Louis MO 63130 USA 2Department of Genetics Washington University in Saint Louis Saint Louis MO 63130 USA 3Department of Biology Washington University in Saint Louis Saint Louis MO 63130 USA To whom correspondence should be addressed

sites (TSSs) To predict TSSs we combined an Athaliana cDNA database and a software TSSP (SoftBerry httpwwwsoftberrycom) As

PLANTPROM BMC Genomics 2005 6 25 Genome wide analysis of Arabidopsis core promoters Carlos Molina12 and Erich Grotewold 11Department of Plant Cellular and Molecular Biology and Plant Biotechnology Center The Ohio State University Columbus OH 43210 2Departamento de Informaacutetica Universidad Teacutecnica Federico Santa Mariacutea Valparaiacuteso Chile search for TATA elements is carried out on the 12749 [-500 -1] regions 6316 sequences (using the MEME NFM) or 8776 (using the expanded PlantProm NFM) are Nucleic Acids Research 2005 Vol 33 No 3 1069ndash1076 doi101093nargki247 Plant promoter prediction with confidence estimation I A Shahmuradov1 V V Solovyev12 and A J Gammerman1 1Royal Holloway University of London Egham Surrey TW20 0EX UK and 2Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA MATERIALS AND METHODS Training and testing sequences For training and testing procedures we used 301 promoters with annotated TSS from PlantProm DB (22) Nucleic Acids Research 2004 Vol 32 Database issue D368plusmnD372 DOI 101093nargkh017 AthaMap an online resource for in silico transcription factor binding sites in the Arabidopsis thaliana genome Nils Ole Steffens Claudia Galuschka Martin Schindler Lorenz BuEgravelow and Reinhard Hehl Institut fuEgrave r Genetik Technische UniversitaEgrave t Braunschweig Spielmannstraucirce 7 D-38106 Braunschweig Germany hellipShahmuradovIA GammermanAJ HancockJM BramleyPM and SolovyevVV (2003) PlantProm a database of plant promoter sequences Plant Physiology October 2004 Vol 136 pp 3023-3033 GENOME ANALYSIS Utility of Different Gene Enrichment Approaches Toward Identifying and Sequencing the Maize Gene Space1[w] Nathan Michael Springer Xiequn Xu and W Brad Barbazuk Center for Plant and Microbial Genomics Department of Plant Biology University of Minnesota St Paul Minnesota 55108 (NMS) and Donald Danforth Plant Sciences Center St Louis Missouri 63132 (XX WBB) Text] Shahmuradov IA Gammerman AJ Hancock JM Bramley PM Solovyev VV (2003) PlantProm a database of plant promoter sequences

Plant Physiology April 2004 Vol 134 pp 1ndash12 wwwplantphysiolorg Characterization of Three Functional High-Affinity Ammonium Transporters in Lotus japonicus with Differential Transcriptional Regulation and Spatial Expression1

Enrica DrsquoApuzzo2 Alessandra Rogato2 Ulrike Simon-Rosin Hicham El Alaoui3 Ani Barbulova Marco Betti Maria Dimou Panagiotis Katinakis Antonio Marquez Anne-Marie Marini Michael K Udvardi and Maurizio Chiurazzi Institute of Genetics and Biophysics Via Marconi 12 80125 Napoli Italy (ED AR HEA AB MC) Molecular Plant Nutrition Group Max Planck Institute of Molecular Plant Physiology Am Muhlenberg 1 14476 Golm Germany (US-R MKU) Universiteacute Libre de Bruxelles Institut de Biologie de Meacutedecine Moleacuteculaires 6041 Gosselies Belgium (A-MM) Agricultural University of Athens Department of Agricultural Biotechnology 11855 Athens Greece (MD PK) and Departamento de Bioquıacutemica Vegetal y Biologıacutea Molecular Facultad de Quimica 41080 Seville Spain (MB AM) hellipPlantProm a database of plant promoter sequences Nucleic Acids Res 31 114ndash117 Nucleic Acids Research 1 January 2003 vol 31 no 1 pp 114-117(4) PlantProm a database of plant promoter sequences Authors Bramley PM1 Solovyev VV2 Shahmuradov IA Gammerman AJ Hancock JM Affiliations Department of Computer Science Royal Holloway University of London Egham Surrey TW20 0EX UK 1 School of Biological Sciences Royal Holloway University of London UK 2 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 3 To whom correspondence should be addressed Email victorsoftberrycom Present address John M Hancock MRC Mammalian Genetics Unit Harwell Oxfordshire UK

PROMH Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK Received February 15 2003 Revised and Accepted March 21 2003

Other FGENESH Molecular Microbiology Volume 52 Issue 6 Page 1579 - June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Affiliations Departments of Medicine and Pathobiology University of Washington Harborview Medical Center Box 359779 325 Ninth Ave Seattle WA 98104 USA E-mail acenturuwashingtonedu Tel (+1) 206 341 5364 Fax (+1) 206 341 5363 Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Microbiology 150 (2004) 518-520 DOI 101099mic026871-0 IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated with recognizable overlapping genes Mark W Silby1 Paul B Rainey23 and Stuart B Levy14 1 Center for Adaptation Genetics and Drug Resistance Department of Molecular Biology and Microbiology Tufts University School of Medicine Boston MA 02111 USA 2 Department of Plant Sciences University of Oxford South Parks Road Oxford OX1 3RB UK 3 School of Biological Sciences University of Auckland Private Bag 92019 Auckland New Zealand 4 Department of Medicine Tufts University School of Medicine Boston MA 02111 USA Correspondence Stuart B Levy (stuartlevytuftsedu) hellipUsing SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of the iiv5 ORF respectivelyhellip Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 DOI 101128JVI782212576-125902004

Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song1 Qi Wei Qin2 Jin Qiu1 Can Hua Huang1 Fan Wang1 and Choy Leong Hew1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2 Corresponding author Mailing address Department of Biological Sciences National University of Singapore 10 Kent Ridge Crescent Singapore 119260 Singapore Phone 65-68742692 Fax 65-67795671 E-mail dbshewclnusedusg or dbsheadnusedusg Received 19 March 2004 Accepted 29 June 2004

hellipThe whole genome was also submitted to httpwwwsoftberrycom (Softberry Inc Mount Kisco NY) for identification of all potential ORFshellip hellipCoding capacity of the viral genomic DNA sequence Prediction of presumptive genes was carried out by using the viral gene prediction program under the website httpwwwsoftberrycom supplemented with Vector NTI suite 71hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1695-8 Issue Volume 109 Number 4 Date August 2004 Pages 681 - 689

Linkage disequilibrium and sequence diversity in a 500-kbp region around the adh1 locus in elite maize germplasm Mark Jung1 Ada Ching1 Dinakar Bhattramakki2 Maureen Dolan1 Scott Tingey1 Michele Morgante1 3 and Antoni Rafalski1

(1) DuPont Crop Genetics Experimental Station PO Box 80353 Wilmington DE 19880-0353 USA (2) Pioneer Hi-Bred International Inc 7300 NW 62nd Avenue PO Box 1004 Johnston IA 50131-1004 USA (3) Dipartimento di Produzione Vegetale e Tecnologie Agrarie Universita di Udine Via delle Scienze 208 33100 Udine ItalyReceived 26 January 2004 Accepted 2 April 2004 Published online 6 August 2004 Mark Jung Email marktjungcgrdupontcom 1) Gene locations were defined by several methods Annotations provided in Tikhonov et al (1999) were first used then FGENESH gene-finding software DNA Sequence - The Journal of Sequencing and Mapping Issue Volume 15 Number 4 August 2004 Pages 269 ndash 276 DOI 10108010425170412331279648 Isolation Characterization and Expression Analysis of a Leaf-specific Phosphoenolpyruvate Carboxylase Gene in Oryza sativa Chang-Fa Lin A1 Chun Wei A1 Li-Zhi Jiang A1 A2 Ke-Gui Li A1 Xiao-Yin Qian A1 Kotb Attia A1 Jin-Shui Yang A1 A1 State Key Laboratory of Genetic Engineering Institute of Genetics School of Life Sciences Fudan University Shanghai 200433 PRChina A2 National Key Laboratory for Soil Erosion and Dry Land Farming on Loess Plateau Northwest Sci-Tech University of Agriculture and Forest Shannxi 712100 PRChina tools of GeneMark (httpopal biologygatechedugeneMark) and Softberry (httpwwwsoftberrycom) For the isolation of putative Plant Molecular Biology DOI 101023BPLAN00000382568980957 Issue Volume 54 Number 4 Date March 2004 Pages 519 ndash 532

Genome-Wide Analysis of the GRAS Gene Family in Rice and Arabidopsis Chaoguang Tian1 Ping Wan1 Shouhong Sun1 Jiayang Li1 and Mingsheng Chen1

(1) Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road Chaoyang District Beijing 100101 ChinaMingsheng Chen Email mschengeneticsaccn

database FgeneSH (Salamov and 90 Solovyev 2000) was used for gene prediction pre- 207 dicted by FgeneSH (minor discrepancies exist due 208 Mycological Research (2004) 108 853-857 Cambridge University Press doi 101017S095375620400067X Published Online 13 авг 2004 Combining transcriptome data with genomic and cDNA sequence alignments to make confident functional assignments for Aspergillus nidulans genes Andrew H SIMS a1 Manda E GENT a1 Geoffrey D ROBSON a1 Nigel S DUNN-COLEMAN a2 and Stephen G OLIVER a1c1 a1 School of Biological Sciences University of Manchester The Michael Smith Building Oxford Road Manchester M13 9PT UK E-mail steveolivermanacuk a2 Genencor International Inc 925 Page Mill Road Palo Alto CA 94304 USA c1 Corresponding author Kingdom Page 2 Genewise FgeneSH FgeneSH+) consisting of 9541 putative open reading frames (ORFs) was released in June 2003 We TAG Theoretical and Applied Genetics DOI 101007s00122-004-1603-2 Issue Volume 109 Number 1 Date June 2004 Pages 129 ndash 139

Gene content and density in banana (Musa acuminata) as revealed by genomic sequencing of BAC clones R Aert1 2 L Saacutegi2 and G Volckaert1

(1) Laboratory of Gene Technology Katholieke Universiteit Leuven Kasteelpark Arenberg 21 3001 Leuven Belgium

Present address Laboratory of Tropical Crop Improvement Katholieke Universiteit Leuven (2) Kasteelpark Arenberg 13 3001 Leuven Belgium

R Aert Email RitaAertagrkuleuvenacbe

gscrikengojp) fgenesh version 11 (Salamov and Solovyev 2000 httpwwwsoftberry com) genemarkhmm version 22a (Lukashin and Borodovsky 1998 http Genome Research 142503-2509 2004 Resources EAnnot A genome annotation tool using experimental evidence Li Ding1 Aniko Sabo Nicolas Berkowicz Rekha R Meyer Yoram Shotland Mark R Johnson Kymberlie H Pepin Richard K Wilson and John Spieth Genome Sequencing Center Washington University School of Medicine St Louis Missouri 63110 USA Article and publication are at httpwwwgenomeorgcgidoi101101gr3152604 1 Corresponding author E-mail ldingwatsonwustledu fax (314) 286-1810 Some ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov and Solovyev 2000 ) are based on intrinsic characteristics of coding hellipSome ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov

and Solovyev 2000 ) are based on intrinsic characteristics of coding sequence (eg codon usage consensus splice sites etc) and require training on known genes from the organismhellip

hellipTo further evaluate the performance of EAnnot we compared EAnnot predictions with Ensembl Genscan and Fgenesh predictions using manual annotation as a standard While Genscan and Fgenesh are ab initio programs Ensembl takes into account experimental data a feature shared with EAnnot Ensembl predicted 1037 known genes with 1798 transcripts and 1457 EST genes with 2308 transcripts for chromosome 6 (build 31) while Fgenesh and Genscan predicted 6230 and 6225 genes respectively We evaluated the performance of each program with respect to splice sites transcripts and genes across all of chromosome 6hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1758-x Issue Volume 109 Number 7 Date November 2004 Pages 1434 - 1447

Full-genome analysis of resistance gene homologues in rice

B Monosi1 R J Wisser2 L Pennill1 and S H Hulbert1

(1) Department of Plant Pathology Kansas State University Manhattan KS 66506-5502 USA (2) Department of Plant Pathology Cornell University Ithaca NY 14853 USAReceived 18 February 2004 Accepted 16 June 2004 Published online 10 August 2004 S H Hulbert Email shulbrtksuedu DNA sequences were analyzed using the gene prediction programs GENSCAN (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (Salamov and arXivq-bioGN0402046 v1 27 Feb 2004 Sublinear growth of Information in DNA sequences Giulia Menconi Dipartimento di Matematica Applicata and CISSC Centro Interdisciplinare per lo Studio dei Sistemi Complessi Universit`a di Pisa Via Bonanno Pisano 25b 56126 PISA - Italy menconimaildmunipiit October 23 2003 hellipAs a result four putative genes G1 G2 G3 and G4 have been located by means of Hidden Markov Model-based program FGENESH2 that has been created for predicting multiple genes and their structure in genomic DNA sequences The analysis via FGENESH has been exploited with respect to known genes in Arabidopsis thaliana Their predicted position is illustrated in Figure 13 hellip2This program is available at the website wwwsoftberrycom to which we refer con-cerning the reliability and e_ciency of the algorithmhellip Current Opinion in Plant Biology 2004 7732ndash736 DOI 101016jpbi200409003 Consistent over-estimation of gene number in complex plant genomes Jeffrey L Bennetzen14 Craig Coleman27 Renyi Liu15 Jianxin Ma16 and Wusirika Ramakrishna38 1 Department of Genetics University of Georgia Athens Georgia 30602 USA 2 Department of Plant and Animal Sciences Brigham Young University Provo Utah 84602 USA 3 Department of Biological Sciences Michigan Tech University Houghton Michigan 49931 USA

4e-mail maizeugaedu 5e-mail lryugaedu 6e-mail jmaugaedu 7e-mail biotechnologybyuedu 8e-mail wusirikamtuedu hellipWe have found that the standard gene-discovery programs FGENESH GeneMark and GENSCAN annotate segments of most retrotransposons and many invertedrepeat transposable elements as genes Using FGENESH to annotate maize BAC clones for instance 70ndash100 of the predicted genes are actually from transposable elementshellip The Plant Cell 162795-2808 (2004) Spotted leaf11 a Negative Regulator of Plant Cell Death and Defense Encodes a U-BoxArmadillo Repeat Protein Endowed with E3 Ubiquitin Ligase Activity Li-Rong Zenga Shaohong Qua Alicia Bordeosb Chengwei Yangc Marietta Baraoidanb Hongyan Yanc Qi Xiec Baek Hie Nahmd Hei Leungb and Guo-Liang Wanga1 a Department of Plant Pathology Ohio State University Columbus Ohio 43210 b International Rice Research Institute Metro Manila Philippines c State Key Lab for Biocontrol Sun Yat-sen (Zhongshan) University Guangzhou China 510275 d Department of Biological Science Myongji University Kyonggido Korea 449728 1 To whom correspondence should be addressed E-mail wang620osuedu fax 614-292-4455 in spl11 Exons predicted in G3 by the programs GENSCAN and Fgenesh using different matrixes are displayed in dark gray (D) RFLP Source Human Genomics Volume 1 Number 2 January 2004 pp 146-149(4) Publisher Henry Stewart Publications The truth about mouse human worms and yeast Authors David R Nelson1 Daniel W Nebert2 1 Department of Molecular Sciences and The UT Center of Excellence in Genomics and Bioinformatics University of Tennessee Memphis Tennessee 38163 USA 2 Department of Environmental Health and Center for Environmental Genetics (CEG) University of Cincinnati Medical Center Cincinnati Ohio 45267-0056 USA unpublished data 2003 see also Ref [7]) FGENESH 21 TWINSCAN 22 and the Ensembl annotation pipeline 23 The output of the four Genome Biology 2004 5R73 doi101186gb-2004-5-10-r73 A comprehensive transcript index of the human genome generated using microarrays and computational approaches Eric E Schadt 1 Stephen W Edwards 1 Debraj GuhaThakurta1 Dan Holder2 Lisa Ying2 Vladimir Svetnik2 Amy Leonardson1 Kyle W Hart3 Archie Russell1 Guoya Li1 Guy Cavet1 John Castle1 Paul McDonagh4 Zhengyan Kan1 Ronghua Chen1 Andrew Kasarskis1 Mihai Margarint1 Ramon M Caceres1 Jason M Johnson1

Christopher D Armour1 Philip W Garrett-Engele1 Nicholas F Tsinoremas5 and Daniel D Shoemaker1

1Rosetta Inpharmatics LLC 12040 115th Avenue NE Kirkland WA 98034 USA 2Merck Research Laboratories W42-213 Sumneytown Pike POB 4 Westpoint PA 19846 USA 3Rally Scientific 41 Fayette Street Suite 1 Watertown MA 02472 USA 4Amgen Inc 1201 Amgen Court W Seattle WA 98119 USA 5The Scripps Research Institute Jupiter FL 33458 USA hellipGrailEXP 40 [47] GENSCAN 10 [48] FGENESH [49] and FGENESH+ [49]ab initio gene-prediction algorithms were run independently across the entire genome assembly to augment alignment-based gene identification methods GrailEXP 40 GENSCAN 10 and FGENESH version 1c were run with default parameters for human sequence GrailEXP used expressed sequence evidence from RefSeq UniGene and DoubleTwist HGI to refine gene predictions FGENESH+ was run with protein sequences from BLASTX with E-score lower than 10-5 When multiple protein alignments overlapped all overlapping protein sequences were clustered with BLASTClust [50] and the lowest E-score hit was used by FGENESH+hellip hellipAdditionally 3 sim4 and 3 or 5 FGENESH+ predicted exons that were short andor distant from internal predicted exons were removedhellip Genome Research 14988-995 2004 ENSEMBL Special GeneWise and Genomewise Ewan Birney13 Michele Clamp2 and Richard Durbin2 1 The European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK hellipThere has been a long history of successful ab initio programs which do not use any additional

evidence to predict genes on genomic DNA of which Genscan (Burge and Karlin 1997 ) and Fgenesh (Solovyev and Salamov 1997 ) are two of the most successful caseshellip hellipAnother class of evidence-based gene prediction programs are ones which use external evidence to influence the scoring of potential exons including SGP-2 (Parra et al 2003 ) Genie

(Kulp et al 1996 ) Genomescan (Yeh et al 2001 ) HMMGene (Krogh 2000 ) and Fgenesh++ (Solovyev and Salamov 1997 )hellip Published online before print February 5 2004 101073pnas0308430100 PNAS | February 17 2004 | vol 101 | no 7 | 1910-1915 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin Nam Joonyul Kim para Shinyoung Lee Gynheung An Hong Ma and Masatoshi Nei Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State

University University Park PA 16802 and National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Contributed by Masatoshi Nei December 22 2003

hellipBecause annotation of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program FGENESH (wwwsoftberrycom) from the genome sequences obtained from TIGR and the Rice Genome Database (China) (25) Functional amp Integrative Genomics DOI 101007s10142-004-0109-y Issue Volume 4 Number 2 Date May 2004 Pages 102 - 117

Sequence analysis of the long arm of rice chromosome 11 for ricendashwheat synteny

Nagendra K Singh1 Saurabh Raghuvanshi2 Subodh K Srivastava1 Anupama Gaur2 Ajit K Pal1 Vivek Dalal1 Archana Singh1 Irfan A Ghazi1 Ashutosh Bhargav1 Mahavir Yadav1 Anupam Dixit1 Kamlesh Batra1 Kishor Gaikwad1 Tilak R Sharma1 Amitabh Mohanty2 Arvind K Bharti2 Anita Kapur2 Vikrant Gupta2 Dibyendu Kumar2 Shubha Vij2 Ravi Vydianathan2 Parul Khurana2 Sulabha Sharma2 W Richard McCombie3 Joachim Messing4 Rod Wing5 Takuji Sasaki6 Paramjit Khurana2 Trilochan Mohapatra1 Jitendra P Khurana2 and Akhilesh K Tyagi2

1 Indian Initiative for Rice Genome Sequencing National Research Centre on Plant Biotechnology

Indian Agricultural Research Institute New Delhi 110012 India 2 Indian Initiative for Rice Genome Sequencing Department of Plant Molecular Biology University of

Delhi South Campus New Delhi 110021 India 3 Genome Research Centre Cold Spring Harbor Laboratory 1 Bungtown Road New York USA 4 The Plant Genome Initiative at RutgersndashWaksman Institute Rutgers University 190 Frelinghuysen

Road Piscataway NJ 08873 USA 5 Department of Plant Sciences 303 Forbes Building Arizona Genomics Institute The University of

Arizona Tucson AZ 85721 USA 6 Department of Genome Research National Institute of Agrobiological Sciences 1-2 Kannondai 2-

chome Tsukuba Ibaraki 305-8602 Japan Akhilesh K Tyagi Email akhileshgenomeindiaorg Wherever RiceGAAS data were not available the genes were predicted by FGENESH trained for monocot plant species (http wwwsoftberrycomberryphtml) TAG Theoretical and Applied Genetics DOI 101007s00122-004-1621-0 Issue Volume 109 Number 1 Date June 2004 Pages 10 - 22

Annotation of a 95-kb Populus deltoides genomic sequence reveals a disease resistance gene cluster and novel class I and class II transposable elements

M Lescot1 4 S Rombauts1 J Zhang1 S Aubourg1 5 C Matheacute1 6 S Jansson2 P Rouzeacute1 3 and W Boerjan1

1 Department of Plant Systems Biology Flanders Interuniversity Institute for Biotechnology Ghent University Technologiepark 927 9052 Gent Belgium

2 Department of Plant Physiology University of Umearing 901 87 Umearing Sweden 3 Laboratoire Associeacute de l Institut National de la Recherche Agronomique (France) Ghent

University 9052 Gent Belgium 4 Present address CIRAD-Biotrop TA4003 34398 Montpellier Cedex 5 France 5 Pre sent address Uniteacute de Recherche en Geacutenomique Veacutegeacutetale INRA 91057 Evry Cedex

France 6 Present address Laboratoire de Biologie Vasculaire Institut de Pharmacologie et Biologie

Structurale 205 route de Narbonne 31077 Toulouse Cedex France

P Rouzeacute Email pierrerouzepsbugentbe 1999 httpwwwtigrorgtdbglimmermglmr_formhtml) and FgenesH for dicots or monocots (Salamov and Solovyev 2000 httpwwwsoftberrycom) BIOINFORMATICS 2004 vol20 N9 p1416-1427 J Yuan B Bush A Elbrecht Y Liu T Zhang W Zhao hellip - suchasGRAIL(Lopezetal 1994 Roberts 1991 Uberbacher et al 1996) GENESCOPE (Murakami and Takagi 1998) fgenesh (Salamov and Solovyev 2000) GeneMark Molecular Plant Pathology Volume 5 Issue 6 Page 515 - November 2004 Pathogen profile Heading for disaster Fusarium graminearum on cereal crops RUBELLA S GOSWAMI AND H CORBY KISTLERCorrespondence E-mail HCKISTumnedu This pipeline uses a combination of the programs FGENESH and FGENESH+ (Salamov and Solovyev 2000) modified by Softberry ( http wwwsoftberrycom ) with Nucleic Acids Research 2004 Vol 32 Database issue D41-D44 MIPS analysis and annotation of proteins from whole genomes H W Mewes12 C Amid1 R Arnold1 D Frishman2 U Guumlldener1 G Mannhaupt2 M Muumlnsterkoumltter1 P Pagel1 N Strack2 V Stuumlmpflen1 J Warfsmann1 and A Ruepp1 1 Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1 D-85764 Neuherberg Germany and 2 Technische Universitaumlt Muumlnchen Chair of Genome Oriented Bioinformatics Center of Life and Food Science D-85350 Freising-Weihenstephan Germany To whom correspondence should be addressed at Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 D-85764 Neuherberg Germany Tel +49 89 3187 3580 Fax +49 89 3187 3585 Email wmewesgsfde The genome of 40 Mb encodes 10 000 proteins automatically predicted by the program FGENESH (httpsoftberry com) specifically trained for Neurospora Annual Review of Genomics and Human Genetics Vol 5 15-56 (Volume publication date September 2004) COMPARATIVE GENOMICS Webb Miller Kateryna D Makova Anton Nekrutenko and Ross C Hardison The Center for Comparative Genomics and Bioinformatics The Huck Institutes of Life Sciences and the Departments of Biology Computer Science and Engineering and Biochemistry and Molecular Biology Pennsylvania State University University Park Pennsylvania email webbbxpsuedu kdm16psuedu antonbxpsuedu rch8psuedu These algorithms include Genscan the most popular gene prediction tool (24) GenMark (117) Fgenesh (155) GeneID (144) and others (for an excellent overview DNA and Cell Biology May 2004 Vol 23 No 5 311-324 Harbinger Transposons and an Ancient HARBI1 Gene Derived from a Transposase

Vladimir V Kapitonov Genetic Information Research Institute Mountain View California Jerzy Jurka Genetic Information Research Institute Mountain View California We used FGENESH (Salamov and Solovyev 2000) and GeneScan (Burge and Karlin 1997) for the identification of exons and introns The d N d S ratio Proc Natl Acad Sci U S A 2004 February 17 101(7) 1910ndash1915 Published online 2004 February 5 doi 101073pnas0308430100 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin NamdaggerDagger Joonyul Kimsectpara Shinyoung Leesect Gynheung Ansect Hong Madagger and Masatoshi Neidagger

daggerInstitute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University University Park PA 16802 and sectNational Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Dagger To whom correspondence should be addressed E-mail jyn101psuedu paraPresent address Michigan State UniversityndashDepartment of Energy Plant Research Laboratory and Department of Biochemistry and Molecular Biology Michigan State University East Lansing MI 48824 of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program fgenesh (wwwsoftberrycom Nucleic Acids Research 2004 Vol 32 Database issue D377-D382 BGI-RIS an integrated information resource and comparative analysis workbench for rice genomics Wenming Zhao1 Jing Wang2 Ximiao He1 Xiaobing Huang1 Yongzhi Jiao1 Mingtao Dai1 Shulin Wei1 Jian Fu1 Ye Chen1 Xiaoyu Ren1 Yong Zhang12 Peixiang Ni1 Jianguo Zhang1 Songgang Li12 Jian Wang1 Gane Ka-Shu Wong13 Hongyu Zhao4 Jun Yu1 Huanming Yang1 and Jun Wang1 1 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3 University of Washington Genome Center Department of Medicine Seattle WA 98195 USA and 4 Yale University School of Medicine Department of Epidemiology and Public Health New Haven CT 06520-8034 USA To whom correspondence should be addressed Tel +86 10 80481662 Fax +86 10 80498676 Email wangjgenomicsorgcn Correspondence may also be addressed to Huanming Yang Tel +86 10 80494969 Fax +86 10 80491181 Email yanghmgenomicsorgcn The authors wish it to be known that in their opinion the first four authors should be regarded as joint First Authors The contig sequences were annotated for gene content by using automated processes that involve ab initio gene finders such as FgeneSH (httpwwwsoftberrycom Genome Research 141932-1937 2004 Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome

Jinsheng Lai1 Nrisingha Dey2 Cheol-Soo Kim35 Arvind K Bharti1 Stephen Rudd46 Klaus FX Mayer4 Brian A Larkins3 Philip Becraft2 and Joachim Messing17 1 Waksman Institute Rutgers The State University of New Jersey Piscataway New Jersey 08854 USA 2 Department of Genetics Development amp Cell Biology Iowa State University Ames Iowa 50011 USA 3 Department of Plant Science University of Arizona Tucson Arizona 85721 USA 4 Munich Information Center for Protein Sequences Institute for Bioinformatics GSF Research Center for Environment and Health Neuherberg Germany A total of 54397 putative genes could be predicted for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for Plant Molecular Biology DOI 101023BPLAN000002876821587dc Issue Volume 54 Number 1 Date January 2004 Pages 55 - 69

Dynamics of the evolution of orthologous and paralogous portions of a complex locus region in two genomes of allopolyploid wheat

Xiu-Ying Kong1 2 Yong Qiang Gu3 Frank M You4 Jorge Dubcovsky4 and Olin D Anderson3

1 Genetic Resources Conservation Program University of California Davis CA 95616 USA 2 Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081

China 3 US Dept of Agriculture Western Regional Research Center Agricultural Research Service 800

Buchanan Street Albany CA 94710 USA 4 Department of Agronomy and Range Sciences University of California Davis CA 95616 USA FGENESH (httpwwwsoftberrycomberryphtml) and GENES- CAN (httpgenemark miteduGENESCANhtml) were used for gene prediction Current Genetics DOI 101007s00294-003-0451-y Issue Volume 44 Number 6 Date January 2004 Pages 329 - 338

Chromosome rearrangements in isolates that escape from het-c heterokaryon incompatibility in Neurospora crassa

Qijun Xiang1 and N Louise Glass1

Department of Plant and Microbial Biology University of California Berkeley CA 94720-3102 USA Hypothetical proteins are predicted from FGENESH calls with overlapping Blastx hits (but not with trusted homology) while Predicted Molecular Genetics and Genomics DOI 101007s00438-004-0990-z Issue Volume 271 Number 4 Date May 2004 Pages 402 - 415

Genome-wide identification of NBS genes in japonica rice reveals significant expansion of divergent non-TIR NBS-LRR genes

T Zhou1 Y Wang1 J-Q Chen1 H Araki2 Z Jing1 K Jiang1 J Shen1 and D Tian1

1 State Key Laboratory of Pharmaceutical Biotechnology Department of Biology Nanjing University 210093 Nanjing China

2 Department of Ecology and Evolution University of Chicago Chicago IL 60637 USA to 5000ndash10000 bp from both ends of the hits and then the expanded nucleotide fragments were reannotated using the gene-finding programs FGENESH (http www Proc Natl Acad Sci U S A 2004 June 15 101(24) 9045ndash9050 Genetics

Genetic control of branching in foxtail millet Andrew N Doust Katrien M DevosDaggerdagger Michael D Gadberrysect Mike D GaleDagger and Elizabeth A KelloggDepartment of Biology University of Missouri 8001 Natural Bridge Road St Louis MO 63121 and DaggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH United Kingdom dagger To whom correspondence should be addressed E-mail adoustumsledu sectPresent address Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building University of Georgia Athens GA 30602 Communicated by Peter H Raven Missouri Botanical Garden St Louis MO April 23 2004 Each of these contigs was scanned by using fgenesh (28) and identified ORFs were translated and compared with ORFs from other contigs from the same QTL region Mol Biol Evol 21(9)1769-1780 2004 Merlin a New Superfamily of DNA Transposons Identified in Diverse Animal Genomes and Related to Bacterial IS1016 Insertion Sequences Ceacutedric Feschotte1 Departments of Plant Biology and Genetics The University of Georgia Athens Correspondence E-mail cedricplantbiougaedu coding sequences were assembled by removing introns predicted with more than 85 confidence by NetGene2 (httpwwwcbsdtudk) andor FGENESH (httpgenomic Genome Research 141924ndash1931 copy2004 Gene Loss and Movement in the Maize Genome Jinsheng Lai1 Jianxin Ma23 Zuzana Swigonˇovб1 Wusirika Ramakrishna24

Eric Linton15 Victor Llaca16 Bahattin Tanyolac17 Yong-Jin Park28 O-Young Jeong29

Jeffrey L Bennetzen23 and Joachim Messing110 1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854-8020 USA 2Department of Biological Sciences Purdue University West Lafayette Indiana 47907-1392 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602-7223 USA 4Department of Biological Sciences 740 DOW Michigan Tech University Houghton MI 49931 USA 5Plant Biology Labs Michigan State University East Lansing MI 48824 USA 6Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880-0353 USA 7Department of Bioengineering Ege University Izmir 35100 Turkey 8National Institute of Agricultural Biotechnology Suwon 441-707 Republic of Korea 9National Institute of Crop Science Suwon 441-857 Republic of Korea 10Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 The FGENESH program predicted four of which three (gene 1d in the maize orp1 region gene 5a 5b in the rice r1 region) would produce truncated proteins

Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Molecular Microbiology Volume 54 Issue 2 Page 407 - October 2004 doi101111j1365-2958200404310x Cryptococcus neoformans Kin1 protein kinase homologue identified through a Caenorhabditis elegans screen promotes virulence in mammals Eleftherios Mylonakis1 Alexander Idnurm2 Roberto Moreno1 Joseph El Khoury134 James B Rottman5 Frederick M Ausubel67 Joseph Heitman28910 and Stephen B Calderwood1111Division of Infectious Diseases Massachusetts General Hospital Boston MA 02114 USA 2Department of Molecular Genetics and Microbiology Duke University Medical Center Durham NC 27710 USA 3Center for Immunology and Inflammatory Diseases and 4Division of Rheumatology Allergy and Immunology Massachusetts General Hospital Boston MA 02114 USA 5Archemix Corporation Cambridge MA 02139 USA 6Department of Molecular Biology Massachusetts General Hospital Boston MA 02114 USA 7Department of Genetics Harvard Medical School Boston MA 02115 USA 8Division of Infectious Diseases 9Department of Medicine and 10Howard Hughes Medical Institute Duke University Medical Center Durham NC 27710 USA 11Department of Microbiology and Molecular Genetics Harvard Medical School Boston MA 02115 USA E-mail scalderwoodpartnersorg

Sequences were compared with the H99 genome database at Duke University and genes predicted in these regions by FGENESH software ( http wwwsoftberrycom TAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Original Paper Characterization of soybean genomic features by analysis of its expressed sequence tags Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

1 Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

2 Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China Jin-Song Zhang Email jszhanggeneticsaccnShou-Yi Chen Email sychengeneticsaccn six BAC-contig sequences of M truncatula were analyzed and the results based on the gene prediction program FGENSH (Arabidopsis matchFGENESH prediction) (http Current Proteomics Volume 1 Number 1 January 2004 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE11 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Chinese Science Bulletin 2004 Vol 49 No 4 355-362 The VER2 promoter contains repeated sequences and requires vernalization for its activity in winter wheat (Triticum aestivum L) XU Wenzhong

1 WANG Xin

1 FENG Qi

2 ZHANG Lei

2 LIU Yaoguang

3 HAN Bin

2 CHONG

Kang1 XU Zhihong

1 amp TAN Kehui

1

1 Research Center for Molecular Developmental Biology Key Lab of Photosynthesis and Environmental Molecular Physiology Institute of Botany Chinese Academy of Sciences (CAS) Beijing 100093 China 2 National Center for Gene Research CAS Shanghai 200233 China 3 Genetic Engineering Laboratory College of Life Sciences South China Agricultural University Guangzhou 510642 China Correspondence should be addressed to Chong Kang (e-mail mailtochongknsibcasaccn) Sequence analyses were finished using biological softwares on Internet such as FGENESH 10 (Prediction of potential genes in Plant (Dct) genomic DNA)

TAG Theoretical and Applied Genetics DOI 101007s00122-003-1457-z Issue Volume 108 Number 3 Date February 2004 Pages 392 - 400

Sequence variations of simple sequence repeats on chromosome-4 in two subspecies of the Asian cultivated rice Can Li1 Yu Zhang1 Kai Ying1 Xiaolei Liang1 and Bin Han1

(1) National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China To characterize the possible relationship between SSRs and genes predicted by using FGENESH we investigated the distribution of SSRs in the rice chromosome-4 Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene TAG Theoretical and Applied Genetics DOI 101007s00122-004-1591-2 Issue Volume 108 Number 8 Date May 2004 Pages 1449 - 1457

Positional cloning of the rice Rf-1 gene a restorer of BT-type cytoplasmic male sterility that encodes a mitochondria-targeting PPR protein

H Akagi1 A Nakamura2 Y Yokozeki-Misono2 A Inagaki2 4 H Takahashi1 K Mori1 and T Fujimura3

1 Laboratory of Plant Breeding and Genetics Department of Biological Production Faculty of Bioresource Sciences Akita Prefectural University Kaidoubata-Nishi 241-7 Shimoshinjyo-Nakano 010-0195 Akita Japan

2 Biochemical Technology Section Life Science Laboratory Performance Materials RampD Center Mitsui Chemicals Togo 1144 297-0017 Mobara Japan

3 Institute of Agricultural and Forest Engineering University of Tsukuba Tennoudai 1-1-1 Tsukuba 305-8572 Ibaraki Japan

4 Present address Department of Synthetic Chemistry and Biological Chemistry Graduate School of Engineering Kyoto University Yoshida-Honmachi Sakyo-ku 606-8501 Kyoto japan

H Akagi Email akagiakita-puacjp Software Develop- ment Tokyo) Genomic sequences were also analyzed using gene prediction programs genescan and fgenesh Table 1 DNA Genome Research 14942-950 2004

The Ensembl Automatic Gene Annotation System Val Curwen1 Eduardo Eyras1 T Daniel Andrews1 Laura Clarke1 Emmanuel Mongin2 Steven MJ Searle1 and Michele Clamp34 1 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 EMBL European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SD UK 3 The Broad Institute Cambridge Massachusetts 02141 USA Commonly we use Genscan for ab initio prediction in human mouse and rat but the system is equally applicable to other methods such as FgenesH (Solovyev et Gene 324 (2004) 105ndash115 Transcript abundance of rml1 encoding a putative GT1-like factor in rice is up-regulated by Magnaporthe grisea and down-regulated by light Rong Wang ab Guofan Hongab Bin Hana a

National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China b

Shanghai Institute of Biochemistry and Cell Biology Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 320 Yueyang Road Shanghai 200031 China 1A and 2A) The structure of rml1 _ a is as same as that predicted by FGENESH software The 3V UTR of rml1 _ a is confirmed with the length of 596 bp Genome Biology 2004 5R46 doi101186gb-2004-5-7-r46 Identification of conserved gene structures and carboxy-terminal motifs in the Myb gene family of Arabidopsis and Oryza sativa L ssp Indica Cizhong Jiang1 Xun Gu1 2 and Thomas Peterson1 1Department of Genetics Development and Cell Biology and Department of Agronomy Iowa State University Ames IA 50011 USA 2LHB Center for Bioinformatics and Biological Statistics Iowa State University Ames IA 50011 USA FGeneSH has been used successfully to predict genes in rice [9] and GenScan was used together with it to predict genes by taking rice genomic sequences as Molecular Microbiology 53 (5) 1307-1318 - September 2004 doi 101111 j1365-2958200404215x The sirodesmin biosynthetic gene cluster of the plant pathogenic fungus Leptosphaeria maculans Donald M Gardiner1 Anton J Cozijnsen1 Leanne M Wilson1 M Soledade C Pedras2 and Barbara J Howlett1

1School of Botany The University of Melbourne Victoria Australia 3010 2Department of Chemistry University of Saskatchewan 110 Science Place Saskatoon SK Canada S7N 5C9 E-mail dgardinerpgradunimelbeduau http wwwtigrorg Putative genes were predicted using Fgenesh software at http wwwsoftberrycom Fungal culture The wild type Journal of Molecular Evolution DOI 101007s00239-004-2666-z Issue Volume 59 Number 6 Date December 2004 Pages 761 - 770 Analysis of the Molecular Evolutionary History of the Ascorbate Peroxidase Gene Family Inferences from the Rice Genome Felipe Karam Teixeira1 Larissa Menezes-Benavente1 Rogeacuterio Margis1 2 and Maacutercia Margis-Pinheiro1

(1) Laboratoacuterio de Geneacutetica Molecular Vegetal Departamento de Geneacutetica UFRJ 21944-970 Rio de Janeiro Brasil(2) Departamento de Bioquiacutemica Instituto de Quiacutemica UFRJ 21944-970 Rio de Janeiro Brasil Genomic se- quences were also analyzed in the FGENESH gene structure pre- diction program (httpwwwsoftberrycom) (Solovyev 2001) and GeneMark program (http Incomplete gene structure prediction with almost 100 specificity SL Chin J Xiong T Ioerger SH Sze - 2004 - txspacetamuedu iv Fgenesh-C The proposed gene structure prediction algorithm by far has the best 25 12 Comparison against Est2Genome Sim4 Spidey Fgenesh-c Journal of Biotechnology 109 (2004) 217ndash226

Preparation of single rice chromosome for construction of a DNA library using a laser microbeam trap

Xiaohui Liu a1 Haowei Wang b1 Yinmei Li b Yesheng Tang a Yilei Liu a Xin u a Peixin Jia a Kai Ying a Qi Feng a Jianping Guan a Chaoqing Jin a Lei Zhang a Liren Lou b Zhuan Zhou c Bin Han aA National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200233 China B Department of Physics University of Science and Technology of China Hefei 230026 China C Institute of Neuroscience Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200031 China Corresponding author Fax +86-21-64825775 E-mail address bhanncgraccn (B Han) ers These sequences were further annotated using gene-prediction software FGENESH to give the pos- sible protein-coding region Science 303 1364-1367 Medicago truncatula DMI1 Required for Bacterial and Fungal Symbioses in Legumes Ane et al (2004) 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberry phtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Proc Natl Acad Sci U S A 2004 August 24 101(34) 12404ndash12410 Inaugural Articles Rapid recent growth and divergence of rice nuclear genomes Jianxin Ma and Jeffrey L Bennetzen

Department of Genetics University of Georgia Athens GA 30602 To whom correspondence should be addressed E-mail maizeugaedu Contributed by Jeffrey L Bennetzen May 25 2004 Almost all LTR-retrotransposons including solo LTRs identified in our studies were predicted as genes by the gene-finding program fgenesh (data not shown) The Plant Journal Volume 37 Issue 4 Page 517 -527 - February 2004 doi101046j1365-313X200301976x Xa26 a gene conferring resistance to Xanthomonas oryzae pv oryzae in rice encodes an LRR receptor kinase-like protein Xinli Sun Yinglong Cao Zhifen Yang Caiguo Xu Xianghua Li Shiping Wang and Qifa Zhang National Key Laboratory of Crop Genetic Improvement National Center of Crop Molecular Breeding Huazhong Agricultural University Wuhan 430070 China For correspondence (fax +86 27 87287092 e-mail swangmailhzaueducn) al 1997) Gene prediction programs used were genscan (Burge and Karlin 1997) and fgenesh (httpwwwsoftberrycom) Promoter Genome Research 141916ndash1923 copy2004 by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonˇovaґ16 Jinsheng Lai16 Jianxin Ma23 Wusirika Ramakrishna24

Victor Llaca15 Jeffrey L Bennetzen23 and Joachim Messing17

1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854 USA

2Department of Biological Sciences and Genetics Program West Lafayette Indiana 47907 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602 USA 4Department of Biological Sciences Michigan Tech University MI 49931 USA 5Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880 USA 6These authors contributed equally to this work 7Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 Page 1 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonovaacute 16 Jinsheng Lai 16 Jianxin Ma 23 Wusirika Ramakrishna 24 TAG Theoretical and Applied Genetics DOI 101007s00122-004-1667-z Issue Volume 109 Number 3 Date August 2004 Pages 515 - 522

Physical mapping and putative candidate gene identification of a quantitative trait locus Ctb1 for cold tolerance at the booting stage of rice K Saito1 Y Hayano-Saito1 W Maruyama-Funatsuki1 Y Sato1 and A Kato1

(1) National Agricultural Research Center for Hokkaido Region Hitsujigaoka 1 Toyohira Sapporo Hokkaido 062-8555 JapanK Saito Email kjsaitoaffrcgojp GENSCAN RICEHMM FGENESH MZEF ) a splice prediction program ( SPLICEPREDIC- TOR ) homology search analysis programs ( BLAST HMMER TAG Theoretical and Applied Genetics DOI 101007s00122-004-1697-6 Issue Volume 109 Number 4 Date August 2004 Pages 690 - 699 The anthracnose resistance locus Co-4 of common bean is located on chromosome 3 and contains putative disease resistance-related genes M Melotto1 4 M F Coelho1 A Pedrosa-Harand2 J D Kelly3 and L E A Camargo1

1 Departamento de Fitopatologia Laboratoacuterio de Geneacutetica Molecular ESALQ Universidade de Satildeo Paulo Piracicaba SP CP 9 13418-900 Brazil

2 Department of Cell Biology and Genetics Institute of Botany University of Vienna Rennweg 14 Vienna 1030 Austria

3 Department of Crop and Soil Sciences Michigan State University East Lansing MI 48824 USA 4 Present address MSU-DOE Plant Research Laboratory Michigan State University 206 Plant Biology

Building East Lansing MI 48824 USA M Melotto Email melottommsuedu and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (httpwww softberrycom)mdashusing Arabidopsis as the model or- ganism Journal of Genetics Vol 83 No 1 P 79-99 April 2004 Structural and functional analysis of rice genome Tyagi A K Khurana J P Khurana P Raghuvanshi S Gaur A Kapur A Gupta V Kumar D Ravi V Vij S Khurana P and Sharma S Department of Plant Molecular Biology University of Delhi South Campus Benito Juarez Road New Delhi 110 021 India It inte- grates results from several gene prediction software such as GENSCAN (Burge and Karlin 1997) FGENESH (Sala- mov and Solovyev 2000) RiceHMM (Sakata The Plant Cell 161220-1234 (2004) Comparative Analysis of the Receptor-Like Kinase Family in Arabidopsis and Rice Shin-Han Shiua Wojciech M Karlowskib Runsun Panad Yun-Huei Tzengac Klaus F X Mayerb and Wen-Hsiung Lia1 a Department of Ecology and Evolution University of Chicago Chicago Illinois 60637 b Munich Information Center for Protein SequencesInstitute of Bioinformatics GSF National Research Center for Environment and Health Neuherberg 85764 Germany c Department of Mathematics National Tsing Hua University Hsinchu Taiwan 300 d Institute of Information Science Academia Sinica Taiwan 115

1 To whom correspondence should be addressed E-mail whliuchicagoedu fax 773-702-9740 a permissive E value cutoff of 1 The rice genes from the indica subspecies was predicted using the whole genome shotgun assembly with FGENESH (Solovyev 2002 Genome Research 141474ndash1482 (2004) copy by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Incongruent Patterns of Local and Global Genome Size Evolution in Cotton Corrinne E Grover1 HyeRan Kim2 Rod A Wing2 Andrew H Paterson3 and Jonathan F Wendel14 1Department of Ecology Evolution and Organismal Biology Iowa State University Ames Iowa 50011 USA 2Arizona Genomics Institute University of Arizona Tucson Arizona 85721 USA 3Plant Genome Mapping Laboratory University of Georgia Athens Georgia 30602 USA hellip Potential genes were predicted by three independent programs FGENESH (httpwwwsoftberrycom) Plant Physiology May 2004 Vol 135 pp 459-470 Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four Triticeae Genomes Yong Qiang Gu Devin Coleman-Derr Xiuying Kong and Olin D Anderson United States Department of Agriculture-Agricultural Research Service Western Regional Research Center Albany California 94710 (YQG DC-D ODA) and Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081 China (XK) FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology October 2004 Vol 136 pp 3177-3190 Comparative Sequence Analysis of the Region Harboring the Hardness Locus in Barley and Its Colinear Region in Rice1

Katherine S Caldwell2 Peter Langridge and Wayne Powell Scottish Crop Research Institute Invergowrie Dundee DD2 5DA United Kingdom (KSC WP) and School of Agriculture and Wine (KSC PL) and Australian Centre for Plant Functional Genomics (PL) University of Adelaide Waite Campus Glen Osmond South Australia 5064 Australia jp Sakata et al 2002 ) which couples the integration of several programs for the prediction of open reading frames (GENSCAN RiceHMM FGENESH MZEF) with GENES amp DEVELOPMENT 18687-699 2004 pyramus and thisbe FGF genes that pattern the mesoderm of Drosophila embryos

Angelike Stathopoulos1 Bergin Tam1 Matthew Ronshaugen1 Manfred Frasch2 and Michael Levine13 1 Department of Molecular and Cell Biology Division of Genetics amp Development University of California Berkeley California 94720-3204 USA 2 Brookdale Department of Molecular Cell and Developmental Biology Mount Sinai School of Medicine New York New York 10029 USA hellip FGF protein sequences used in alignment and phylogenetic reconstruction were gathered from GenBank or inferred from genomic sequence using GENESCAN (Burge and Karlin 1997 ) and FGENESHhellip Genome Research 141888-1901 2004 Organization and Evolution of a Gene-Rich Region of the Mouse Genome A 127-Mb Region Deleted in the Del(13)Svea36H Mouse Ann-Marie Mallon14 Laurens Wilming24 Joseph Weekes1 James GR Gilbert2 Jennifer Ashurst2 Sandrine Peyrefitte2 Lucy Matthews2 Matthew Cadman1 Richard McKeone1 Chris A Sellick1 Ruth Arkell1 Marc RM Botcherby3 Mark A Strivens1 R Duncan Campbell3 Simon Gregory25 Paul Denny1 John M Hancock16 Jane Rogers2 and Steve DM Brown1 1 Medical Research Council Mammalian Genetics Unit Harwell Oxfordshire United Kingdom 2 Wellcome Trust Sanger Institute Hinxton Genome Campus United Kingdom 3 Medical Research Council Rosalind Franklin Centre for Genomics Research Hinxton Genome Campus United Kingdom hellip Ab initio gene structures were predicted using FGENESH (Salamov and Solovyev 2000 ) and GENSCANhellip Current Proteomics January 2004 vol 1 no 1 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE1 Affiliations 1 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom

sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene Nucleic Acids Research 2003 Vol 31 No 4 1148-1155 Characterization of Arabidopsis thaliana ortholog of the human breast cancer susceptibility gene 1 AtBRCA1 strongly induced by gamma rays S Lafarge and M-H Montaneacute CEA Cadarache DSV-DEVM Laboratoire de Radiobiologie Veacutegeacutetale Bat 185 F-13108 St Paul Lez Durance Cedex France To whom correspondence should be addressed Tel +33 4 42 25 35 56 Fax + 33 4 42 25 26 25 Email marie-helenemontaneceafr Received November 18 2002 Accepted December 5 2002 DDBJEMBLGenBank accession no AF515728 hellipGene structure prediction was done on software implemented on the Softberry web page (httpwwwsoftberrycom) analysis of protein domains using the SMARThellip hellipThe gene structure of At4g21070 was determined with three gene structure prediction software packages (Softberry GenScan Grail) hellip To resolve this ambiguity in intronndashexon prediction we postulated the presence of two genes given by Softberry prediction software and performed northern blotting and 5 RACE to characterize the structural organization of the At4g21070 locushellip Proc Natl Acad Sci U S A 2003 July 22 100(15) 9055ndash9060 doi 101073pnas1032999100 Plant Biology Gene expression of a gene family in maize based on noncollinear haplotypes Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway NJ 08854-8020 Communicated by Brian A Larkins University of Arizona Tucson AZ May 19 2003 (received for review 2002 April 10) To whom correspondence should be addressed E-mail messingmbclrutgersedu The FGENESH program (Softberry Mount Kisco NY) was used for gene prediction analysis BMC Genomics 2003 4 22

doi 1011861471-2164-4-22 Published online 2003 June 3 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams 12 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA 3Department of Biological Sciences Wichita State University Wichita Kansas USA

Corresponding author Sreedhar Oduru odurusreedharttuhscedu Janee L Campbell janeecampbellttuhscedu SriTulasi Karri phrskttuhscedu William J Hendry williamhendrywichitaedu Shafiq A Khan shafiqkhanttuhscedu Simon C Williams simonwilliamsttuhscedu

Two gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=H Insect Molecular Biology Volume 12 Issue 4 Page 319 - August 2003 doi101046j1365-2583200300415x Expression of an Aedes aegypti cation-chloride cotransporter and its Drosophila homologues V Filippov K Aimanova and S S GillAffiliations Department of Cell Biology and Neuroscience University of California Riverside USA Correspondence Sarjeet S Gill 5429 Boyce Hall Environmental Toxicology Graduate Program University of California Riverside CA 92521 USA Tel +1 909 787 4621 Fax +1 909 787 3087 E-mail Sarjeetgillucredu significant similarity to the Drosophila genes were used for gene structure prediction with the FGENESH program available on site http wwwsoftberrycom Developmental Biology 256 (2003) 276ndash289 tcl-2 encodes a novel protein that acts synergistically with Wnt signaling pathways in C elegans Xiaojun Zhaoa Hitoshi Sawab and Michael A Hermana a Program in Molecular Cellular and Developmental Biology Division of Biology Kansas State University Manhattan KS 66506 USA b Laboratory for Cell Fate Decision RIKEN Center for Developmental Biology 2-2-3 Minatojima-minamimachi Chuo-ku Kobe 650-0047 Japan Received for publication 10 September 2002 revised 25 November 2002 accepted 19 December 2002 hellipCbTCL-2 is conceptually translated from a gene predicted by the FGENSH (Salamov and Solovyev 2000 httpwwwsoftberrycom) using defaults for C elegans genomic sequences Proc Natl Acad Sci U S A 2003 May 27 100(11) 6569ndash6574 doi 101073pnas0732024100 Evolution Molecular paleontology of transposable elements in the Drosophila melanogaster genome Vladimir V Kapitonov and Jerzy Jurka

Genetic Information Research Institute 2081 Landings Drive Mountain View CA 94043 Communicated by Margaret G Kidwell University of Arizona Tucson AZ April 7 2003 (received for review 2002 December 23)

To whom correspondence may be addressed E-mail vladimirulamgirinstorg or jurkagirinstorg hellipWe used FGENESH (ref 18 wwwsoftberrycom) for identifying genes encoded by TEs Genetics and Molecular Biology ISSN 1415-4757 versioacuten impresa Genet Mol Biol v26 n4 Satildeo Paulo dic 2003 Iron homeostasis related genes in rice Jeferson GrossI II Ricardo Joseacute SteinII Arthur Germano Fett-NetoI II Janette Palma FettI II

IUniversidade Federal do Rio Grande do Sul Centro de Biotecnologia Porto Alegre RS Brazil IIUniversidade Federal do Rio Grande do Sul Departamento de Botacircnica Porto Alegre RS Brazil IIIBotanical Institute of Ludwig-Maximilians-Universitaumlt Muumlnchen Germany The prediction algorithms were GenScan (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) GenomeScan (Burge and Karlin 1997 httpgenesmitedugenomescanhtml) FGENESH (Salamov and Solovyev 2000 httpwwwsoftberrycomberryphtmltopic= gfind) GeneMarkhmm (Borodovsky and Lukashin unpublished httpopalbiologygatecheduGeneMarkeukhmmcgi) and GrailEXP (Xu and Uberbacher 1997 httpcompbioornlgovgrailexp) Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome J Lai N Dey CS Kim AK Bharti S Rudd KFX Mayer hellip for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for monocotyledonous genes (httpwwwsoftberrycom) Published 23 September 2003 BMC Plant Biology 2003 36 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya12 Address 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia and 2NSW Agricultural Genomics Centre Wagga Wagga Australia Email Qian-Hao Zhu - qianhaozhucsiroau Mohammad Shamsul Hoque - mohammadhoquecsiroau Elizabeth S Dennis - lizdenniscsiroau Narayana M Upadhyaya - narayanaupadhyayacsiroau Corresponding author hellip Analyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCANhellip

The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Australasian Plant Pathology Volume 32 Number 4 2003 pp 511-519 Small scale functional genomics of the blackleg fungus Leptosphaeria maculans analysis of a 38 kb region Alexander Idnurm Janet L Taylor M Soledade C Pedras and Barbara J Howlett vertebrate and Arabidopsis settings Burge and Karlin 1997) and FGENESH on Neurospora crassa and Schizosaccharomyces pombe settings (wwwsoftberrycom) as Barley Genetics Newsletter Volume 32 Hard-copy edition pages 34 - 37 MAPPING AND SEQUENCING OF THE BARLEY PUTATIVE HYPERSENSITIVE INDUCED REACTION GENES Nils Rostoks1 David Kudrna1 and Andris Kleinhofs12

1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 2 School of Molecular Biosciences Washington State University Pullman WA 99164 The full length coding sequence was reconstructed using a combination of FGENESH gene prediction program (httpwwwsoftberrycom) and alignment with cDNAs from the other barley HIR groups TAG Theoretical and Applied Genetics DOI 101007s00294-003-0391-6 Issue Volume 43 Number 5 Date August 2003 Pages 351 - 357 Characterisation of the mating-type locus of the plant pathogenic ascomycete Leptosphaeria maculans Anton J Cozijnsen A1 and Barbara J Howlett A1 A1 School of Botany The University of Melbourne 3010 Victoria Australia hellipGenes introns exons and transcription initiation sites were predicted by analysis with FGENESH (wwwsoftberrycom) on Neurospora crassa andhellip BMC Plant Biol 2003 3 6

doi 1011861471-2229-3-6 Published online 2003 September 23 Received May 23 2003 Accepted September 23 2003 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya 12 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia 2NSW Agricultural Genomics Centre Wagga Wagga Australia

Corresponding author Qian-Hao Zhu qianhaozhucsiroau Mohammad Shamsul Hoque mohammadhoquecsiroau Elizabeth S Dennis lizdenniscsiroau Narayana M Upadhyaya narayanaupadhyayacsiroau hellipAnalyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml identified a single-exon gene capable of encoding a protein with the DNA binding domain of the EREBPAP2 family of plant transcription factors [2636] 1515 bp downstream from the Ds insertion The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Alignment of EREBPAP2 domains was performed using programs of Genetics Computer Group Wisconsin software suit [11] Genetics Vol 164 655-664 June 2003 Copyright copy 2003 Map-Based Cloning of Leaf Rust Resistance Gene Lr21 From the Large and Polyploid Genome of Bread Wheat Li Huanga Steven A Brooksa Wanlong Lia John P Fellersb Harold N Tricka and Bikram S Gilla a Wheat Genetics Resource Center Department of Plant Pathology Kansas State University Manhattan Kansas 66506-5502 b USDA-ARS Plant Science and Entomology Unit Kansas State University Manhattan Kansas 66506-5502 Corresponding author Bikram S Gill 4024 Throckmorton Kansas State University Manhattan KS 66506-5502 bsgksuedu (E-mail hellipIn addition FGENSH 11 (httpwwwsoftberrycom) was used for gene prediction (with monocot genomic DNA parameters) Nucleic Acids Research 2003 Vol 31 No 1 229-233 The TIGR rice genome annotation resource annotating the rice genome and creating resources for plant biologists Qiaoping Yuan Shu Ouyang Jia Liu Bernard Suh Foo Cheung Razvan Sultana Dan Lee John Quackenbush and C Robin Buell The Institute for Genomic Research 9712 Medical Center Dr Rockville MD 20850 USA To whom correspondence should be addressed Tel +1 301 8383558 Fax +1 301 8380208 Email rbuelltigrorg Received August 14 2002 Revised and Accepted October 2 2002 hellipThe rice sequences were processed with multiple ab initio gene finders including FGENESH (httpwwwsoftberrycom)hellip hellip Working models were generated using the FGENESH output and putative identification for the gene was obtained from the most significant database match while models with no significant database match were labeled as hypothetical proteins

JXB Advance Access originally published online on June 18 2003 Journal of Experimental Botany Vol 54 No 389 pp 1995-1996 August 1 2003 Received 21 April 2003 Accepted 25 April 2003 OsSET1 a novel SET-domain-containing gene from rice Yun-Kuan Liang Ying Wang Yong Zhang Song-Gang Li Xiao-Chun Lu Hong Li Cheng Zou Zhi-Hong Xu and Shu-Nong Bai PKU-Yale Joint Research Center of Agricultural and Plant Molecular Biology National Key Laboratory of Protein Engineering and Plant Gene Engineering College of Life Sciences Peking University 5 Yiheyuan Road Beijing 100871 PR China To whom correspondence should be addressed Fax +86 10 6275 1526 E-mail shunongbpkueducn It localizes at chromosome three in rice genome at the contig 1300 (httpwwwsoftberrycomberryphtmltopic=gfindampprg=FGENESH GenBank accession number BMC Genomics 2003 422 Published 3 June 2003 Received 31 January 2003 Accepted 3 June 2003 This article is available from httpwwwbiomedcentralcom1471-2164422 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams12 Address 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA and 3Department of Biological Sciences Wichita State University Wichita Kansas USA Email Sreedhar Oduru - odurusreedharttuhscedu Janee L Campbell - janeecampbellttuhscedu SriTulasi Karri - phrskttuhscedu William J Hendry - williamhendrywichitaedu Shafiq A Khan - shafiqkhanttuhscedu Simon C Williams - simonwilliamsttuhscedu Corresponding author hellipTwo gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=HsapiensTAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Characterization of soybean genomic features by analysis of its expressed sequence tags

Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

Jin-Song Zhang Email jszhanggeneticsaccn

Shou-Yi Chen Email sychengeneticsaccn Phone +86-10-64886859 Fax +86-10-64873428

(1) Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

(2) Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China

prediction of these BAC-contig sequences was based on the gene-prediction program FGENSH (Arabidopsis matchFGENESH DDT Vol 7 No 11 (Suppl) 2002 S70-S76 wwwdrugdiscoverytodaycom Genome annotation techniques new approaches and challenges Alistair G Rust Emmanuel Mongin and Ewan Birney European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridge UK CB10 1SD tel +44 1223 494420 fax +44 1223 494468 e-mail birneyebiacuk Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Box 1 Useful human genome annotation and browser URLs Human genome browsers bull UCSC Human Genome Browser httpgenomecseucsceducgi-binhgGateway bull Softberry Genome Explorer httpwwwsoftberrycomberryphtmltopic=genomexp Ab initio gene prediction programs Ab initio gene predictors rely on the statistical qualities of exons rather than on homologies Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Celerarsquos pipeline) and GrailEXP [16] (ORNL) Proc Natl Acad Sci U S A 2002 August 20 99(17) 11423ndash11428 doi 101073pnas162276199 Neurobiology Identification of G protein-coupled receptors for Drosophila PRXamide peptides CCAP corazonin and AKH supports a theory of ligand-receptor coevolution Yoonseong Parkdagger Young-Joon Kim and Michael E AdamsdaggerDagger

Departments of Entomology and daggerCell Biology and Neuroscience 5429 Boyce Hall University of California Riverside CA 92521 Edited by Lutz Birnbaumer National Institutes of Health Research Triangle Park NC and approved June 14 2002 (received for review 2002 May 7) DaggerTo whom reprint requests should be addressed E-mail adamsmailucredu

This article has been corrected See Proc Natl Acad Sci U S A 2002 October 15 99(21) 13961b For each Drosophila GPCR prediction of gene structure was made in FGENESH (wwwsoftberrycom ref 21) by using about 20 kb of genomic sequence surrounding highly conserved regions particularly for 5 prime and 3 prime ends of ORFs Putative Drosophila GPCRs in the database were amplified by RT-PCR using primers based on gene predictions in the FGENESH gene finder (wwwsoftberrycom ref 21) 21 Salamov A A amp Solovyev V V (2000) Genome Res 10 516-522

prediction httpwwwsoftberrycomberry

Eukaryotic Cell October 2002 p 719-724 Vol 1 No 5 Isocitrate Lyase Is Essential for Pathogenicity of the Fungus Leptosphaeria maculans to Canola (Brassica napus) Alexander Idnurm and Barbara J Howlett School of Botany The University of Melbourne Melbourne Victoria 3010 Australia Received 17 June 2002 Accepted 29 July 2002 hellip The DNA sequence obtained was compared to those in the GenBank database by using BLAST (1) and genes were predicted by using FGENESH software (httpwwwsoftberrycom) and GENSCAN (wwwbionavigatorcom) Bio-Almanac GLE ToxExpress P Offering CLG Annotated sequence data The genes are identified with the FGENESH11 gene modeling software exclusively li- censed from Softberry Inc Automatic Analysis of 106 kb of contiguous DNA sequence from the D genome of wheat reveals high gene density hellip SA Brooks L Huang BS Gill JP Fellers trix In addition FGENESH 11 (httpwwwsoftberrycom) was used for CDS prediction with monocot genomic DNA parameters Both Molecular Genetics and Genomics DOI 101007s00438-002-0706-1 Issue Volume 267 Number 6 Date August 2002 Pages 713 - 720 Genome sequencing of a 239-kb region of rice chromosome 10L reveals a high frequency of gene duplication and a large chloroplast DNA insertion Q Yuan J Hill J Hsiao K Moffat S Ouyang Z Cheng J Jiang C Buell A1 The Institute for Genomic Research 9712 Medical Center Drive Rockville MD 20850 USA A2 Department of Horticulture University of Wisconsin Madison WI 53706 USA The sequences were analyzed with several gene prediction programs including FGENESH (httpwwwsoftberrycom) Genemarkhmm (rice matrix httpopalbiology

Genetics Vol 162 1389-1400 November 2002 Copyright copy 2002 Different Types and Rates of Genome Evolution Detected by Comparative Sequence Analysis of Orthologous Segments From Four Cereal Genomes Wusirika Ramakrishnaa Jorge Dubcovskyb Yong-Jin Park1a Carlos Bussob John Embertona Phillip SanMiguelc and Jeffrey L Bennetzena a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Department of Agronomy and Range Science University of California Davis California 95616 c Purdue University Genomics Core WSLR Purdue University West Lafayette Indiana 47907

Corresponding author Jeffrey L Bennetzen Hansen Bldg Purdue University West Lafayette IN 47907 maizebilbobiopurdueedu (E-mail)

FGENESH (httpwwwsoftberrycomnucleohtml) with the maize training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and

GeneMarkhmm (httpgenemarkbiologygatecheduGene Mark) Functional amp Integrative Genomics DOI 101007s10142-002-0055-5 Issue Volume 2 Numbers 1-2 Date May 2002 Pages 51 - 59 Genomic sequencing reveals gene content genomic organization and recombination relationships in barley Nils Rostoks Yong-Jin Park Wusirika Ramakrishna Jianxin Ma Arnis Druka Bryan A Shiloff Phillip J SanMiguel Zeyu Jiang Robert Brueggeman Devinder Sandhu Kulvinder Gill Jeffrey L Bennetzen Andris Kleinhofs A1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA A2 Department of Biological Sciences Purdue University West Lafayette IN 47907 USA A3 National Center for Genome Resources 2935 Rodeo Park Drive East Santa Fe NM 87505 USA A4 G302 Agronomy Hall Iowa State University Ames IA 50011-1010 USA A5 Department of Agronomy University of Nebraska Lincoln NE 68583 USA A6 School of Molecular Biosciences and Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA version 10 with maize parameters The FGENESH predictions were run at httpwwwsoftberrycom BAC genomic regions were defined Structural organization of the barley D-hordein locus in comparison with its orthologous regions of hellip YQ Gu OD Anderson CF Londeore X Kong RN hellip et al 1997) to search for additional genes In addition FGENESH (httpwwwsoftberrycomberryphtml) and GENESCAN (httpgenes

Published online before print June 20 2002 101073pnas142284999 PNAS | July 9 2002 | vol 99 | no 14 | 9328-9333 The barley stem rust-resistance gene Rpg1 is a novel disease-resistance gene with homology to receptor kinases R Brueggeman N Rostoks D Kudrna A Kilian F Han J Chensect A Druka B Steffensonpara and A Kleinhofs Department of Crop and Soil Sciences Washington State University Pullman WA 99164-6420 para Department of Plant Pathology 495 Borlaug Hall 1991 Upper Buford Circle St Paul MN 55108-6030 and School of Molecular Biosciences Washington State University Pullman WA 99164-4234 Communicated by Diter von Wettstein Washington State University Pullman WA May 13 2002 (received for review March 25 2002) The gene prediction programs GENSCAN (httpgenesmiteduGENSCANhtml) and FGENESH (httpwwwsoftberrycom) as well as NEURAL NETWORK PROMOTER

PREDICTION (httpwwwfruitflyorgseq_toolspromoterhtml) localized the putative transcription start site of the gene about 400 bp upstream of the translation start site Plant Physiol 2002 December 130(4) 1626ndash1635 doi 101104pp012179 Received July 30 2002 Accepted October 1 2002 Contiguous Genomic DNA Sequence Comprising the 19-kD Zein Gene Family from Maize1

Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway New Jersey 08854ndash8020 Corresponding author e-mail messingmbclrutgersedu fax 732ndash445ndash0072 Draft sequences generated from high-throughput DNA sequencing (phase II) were subjected to gene prediction programs with FGENESH (Softberry Inc Mount Kisco NY) The Plant Cell Vol 14 3213-3223 December 2002 Copyright copy 2002 Received July 22 2002 accepted September 26 2002 Structural Analysis of the Maize Rp1 Complex Reveals Numerous Sites and Unexpected Mechanisms of Local Rearrangement Wusirika Ramakrishnaa John Embertona Matthew Ogdena Phillip SanMiguelb and Jeffrey L Bennetzen1a a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 1 To whom correspondence should be addressed E-mail maizebilbobiopurdueedu fax 765-496-1496 FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCAN Plant Physiol 2002 December 130(4) 1728ndash1738

doi 101104pp014951 Comparative Sequence Analysis of the Sorghum Rph Region and the Maize Rp1 Resistance Gene Complex Wusirika Ramakrishna John Emberton Phillip SanMiguel Matthew Ogden Victor Llaca Joachim Messing and Jeffrey L Bennetzen

Department of Biological Sciences Purdue University West Lafayette Indiana 47907 (WR JE MO JLB) Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 (PSM) and Waksman Institute Rutgers University Piscataway New Jersey 08854 (VL JM) Corresponding author e-mail maizebilbobiopurdueedu fax 765ndash496ndash1496 Received September 19 2002 Accepted October 8 2002 hellipAnnotation and sequence analysis were performed as described earlier (Dubcovsky et al 2001 Song et al 2001 Ramakrishna et al 2002a ) FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and GeneMarkhmm (httpopalbiologygatecheduGeneMarkeukhmmcgi) Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four hellipYQ Gu D Coleman-Derr X Kong OD Anderson FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology 2004 - plantphysiolorg - plantphysiolorg - intlplantphysiolorg - ncbinlmnihgov - all 5 versions raquoA Genome-Wide Screen Identifies Genes Required for Centromeric Cohesion JJ Doyle J Denarie F Debelle JC Prome BB Amor hellip 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberryphtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Various programs Plant Molecular Biology Issue Volume 58 Number 3 Date June 2005 Pages 421 ndash 433 DOI 101007s11103-005-5702-5 OsPPR1 a pentatricopeptide repeat protein of rice is essential for the chloroplast biogenesis Kodiveri M Gothandam1 Eun-Sook Kim1 Hongjoo Cho1 and Yong-Yoon Chung1

(1) School of Life Sciences and Biotechnology Korea University Sungbuk-ku 136-701 Seoul Anam-Dong Korea nucleotide and amino acid sequences were analyzed by the Basic Local Alignment SearchTool (BLAST) and the Soft berry prog- rame (httpwwwsoftberrycom)

Plant Physiol February 2002 Vol 128 pp 336-340 wwwplantphysiolorgcgidoi101104pp010875 Received September 25 2001 returned for revision September 29 2001 accepted November 2 2001 Cellulose Synthase-Like Genes of Rice1

Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 The Rice Genome Research Program cDNA clones were of high quality all but one were viable and accurately annotated The one exception D22177 was chimeric containing OsCSLA2 at one end and a predicted DNA-binding protein at the other For all sequences the corresponding

proteins were deduced using gene prediction software from GeneMark (Atlanta httpopalbiologygatecheduGeneMark) and Softberry Inc (White Plains NY httpwwwsoftberrycom) and by manual alignment with the Arabidopsis Csl proteins and with each other IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated Silby et al Microbiology2004 150 518-520 MW Silby PB Rainey SB Levy Using SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of Plant Cell 2002 September 14(9) 2107ndash2119 Received March 25 2002 Accepted June 14 2002 Two Novel Fungal Virulence Genes Specifically Expressed in Appressoria of the Rice Blast Fungus

Chaoyang Xuea Gyungsoon Parka Woobong Choib Li Zhengc Ralph A Deanb and Jin-Rong Xua1

aDepartment of Botany and Plant Pathology Purdue University West Lafayette Indiana 47907 bDepartment of Plant Pathology North Carolina State University Raleigh North Carolina 27606 cSyngenta Agribusiness Biotechnology Research Inc Research Triangle Park North Carolina 27709 1To whom correspondence should be addressed E-mail xubtnypurdueedu fax 765-494-0363 Approximately 12- and 14-kb upstream sequences of GAS1 and GAS2 were sequenced and analyzed with several programs including TRES (wwwbioportalbicnusedusgtres) Expasy (wwwexpasyorg) and SoftBerry (wwwsoftberrycom) European Journal of Neuroscience January 2002 vol 15 no 1 pp 79-86(8) Characterizing CGI-94 (comparative gene identification-94) which is down-regulated in the hippocampus of early stage Alzheimers disease brain Heese K Nakayama T Hata R Masumura M Akatsu H Li F Nagai Y Yamamoto T Kosaka K Suemoto T Sawada T Additionally protein sequence analysis was performed using the following programs at ExPASy httpwwwexpasych softberry httpwwwsoftberrycomindex Journal of Cellular Biochemistry Volume 91 Issue 5 Pages 1030 - 1042 Published Online 25 Feb 2004 Copyright copy 2004 Wiley-Liss Inc A Wiley Company Received 4 September 2003 Accepted 21 November 2003 Characterizing the new transcription regulator protein p60TRP K Heese 1 T Yamada 1 H Akatsu 2 T Yamamoto 2 K Kosaka 2 Y Nagai 1 T Sawada 1

1BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan 2Choju Medical Institute Fukushimura Hospital 19-14 Aza-Yamanaka Noyori Toyohashi Aichi 441-8124 Japan email K Heese (heeseksilverocnnejp) Correspondence to K Heese BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan expasych) softberry httpwwwsoftberry comindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorg toolsaacomp Genome Research 14929-933 2004 ISSN 1088-9051 $500 The Ensembl Core Software Libraries Arne Stabenau1 Graham McVicker1 Craig Melsopp1 Glenn Proctor1 Michele Clamp2 and Ewan Birney13 1 EMBL European Bioinformatics Institute Wellcome Trust Genome Campus Hinxton CB10 1SD UK 2 The Broad Institute Cambridge Massachusetts 02141-2023 USA

2003 ) Finally there have been several commercial genome management products based on proprietary technology from Softberry Celera and Doubletwist Generation of T-DNA tagging lines with a bidirectional gene trap vector 2 and the establishment of hellip G An annotated in the public databases we undertook 274 annotation with the Softberry program (http 275 wwwsoftberrycomberryphtml) Functional clas- 276 Journal of Bacteriology January 2002 p 183-190 Vol 184 No 1 0021-919301$0400+0 DOI 101128JB1841183-1902002 Received 25 July 2001 Accepted 11 October 2001 Regulation of the acuF Gene Encoding Phosphoenolpyruvate Carboxykinase in the Filamentous Fungus Aspergillus nidulans Michael J Hynes Oliver W Draht and Meryl A Davis Department of Genetics University of Melbourne Parkville Victoria 3010 Australia The Protein Sequence Analysis program (httpwwwsoftberrycomproteinhtml) predicted a PEPCK (ATP) signature sequence between amino acids 275 and 290 Proteins Structure Function and GeneticsVolume 53 Issue S6 Pages 352 - 368 Supplement Fifth Meeting on the Critical Assessment of Techniques for Protein Structure Prediction Published Online 15 Oct 2003 Received 4 March 2003 Accepted 23 June 2003 Digital Object Identifier (DOI) 101002prot10543 Comp Modeling Assessment Assessment of homology-based predictions in CASP5 Anna Tramontano 1 Veronica Morea 2

1Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Rome Italy 2CNR Institute of Molecular Biology and Pathology University of Rome La Sapienza Rome Italy email Anna Tramontano (AnnaTramontanouniromalit) Correspondence to Anna Tramontano Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Ple Aldo Moro 5-00185 Rome Italy hellipPage 1 Assessment of Homology-Based Predictions in CASP5 Anna Tramontano 1 and Veronica Morea 2 1 Department of Biochemical Sciences American Journal of HematologyVolume 73 Issue 3 Pages 161 - 168 Published Online 20 Jun 2003 Received 23 October 2002 Accepted 15 April 2003 Digital Object Identifier (DOI) 101002ajh10358

Spectrum of thalassemia mutations and HbF levels in the heterozygous Moroccan population Wafaa Lemsaddek 1 Isabel Picanccedilo 2 Filomena Seuanes 2 Lahoucine Mahmal 3 Saacircd Benchekroun 3 Mohammed Khattab 4 Paulo Nogueira 5 Leonor Osoacuterio-Almeida 1

1Laboratoacuterio de Geneacutetica Molecular Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa Caparica Portugal 2Laboratoacuterio de Hematologia Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal 3Service Heacutemato-Oncologie Hocircpital 20 Ao t CHU Ibn Rochd Casablanca Morocco 4Service Heacutemato-Oncologie Peacutediatrique Hocircpital dEnfants CHU Rabat Morocco 5Observatoacuterio Nacional de Sauacutede Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal email Leonor Osoacuterio-Almeida (mlofctunlpt) Correspondence to Leonor Osoacuterio-Almeida Laboratoacuterio de Geneacutetica Molecular Secccedilatildeo Autoacutenoma de Biotecnologia Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa 2829-516 Caparica Portugal hellip Page 1 Spectrum of Thalassemia Mutations and HbF Levels in the Heterozygous Moroccan Population Wafaa Lemsaddek 1 Isabel Picanccedilo Lecture Notes in Computer Science Publisher Springer-Verlag Heidelberg ISSN 0302-9743 Subject Computer Science Volume 2812 2003 Title Algorithms in Bioinformatics Third International Workshop WABI 2003 Budapest Hungary September 15-20 2003 Proceedings Editors Gary Benson Roderic Page ISBN 3-540-20076-2 DOI 101007b13243 Chapter pp 124 - 138 Online Date December 2003 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron1 and Irena Rusu1

(1) IRIN Universiteacute de Nantes 2 Rue de la Houssiniegravere BP 92208 44322 Nantes Cedex 3 France Page 1 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron and Irena Rusu Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Received 23 October 2003 accepted 15 December 2003 Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes

H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Brain Aging Vol 2 No 3 2002 Page 9-22 Long-term Efficacy of Cholinesterase Inhibitors Serge Gauthier McGill Centre for Studies in Aging Quebec Canada Correspondence Dr Serge Gauthier FRCPC McGill Centre for Studies in Aging 6825 LaSalle Boulevard Verdun Quebec Canada H4H 1R3 Tel +1 514 766 2010 Fax +1 514 888 4050 Email sergegauthiermcgillca hellipAdditionally protein sequence analysis was performed using the following programs at the ExPASyndashwwwndashserver (httpwwwexpasych) softberry httpwwwsoftberrycomindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorgtoolsaacomp Plant Physiology December 2003 Vol 133 pp 2040ndash2047 Generation and Analysis of End Sequence Database for T-DNA Tagging Lines in Rice1 Suyoung An2 Sunhee Park2 Dong-Hoon Jeong Dong-Yeon Lee Hong-Gyu Kang Jung-Hwa Yu Junghe Hur Sung-Ryul Kim Young-Hea Kim Miok Lee Soonki Han Soo-Jin Kim Jungwon Yang Eunjoo Kim Soo Jin Wi Hoo Sun Chung Jong-Pil Hong Vitnary Choe Hak-Kyung Lee Jung-Hee Choi Jongmin Nam Seong-Ryong Kim Phun-Bum Park Ky Young Park Woo Taek Kim Sunghwa Choe Chin-Bum Lee and Gynheung An National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790ndash784 Korea (SA SP D-HJ D-YL H-GK J-HY JH S-RK Y-HK ML GA) Department of Life Science Sogang University Seoul 121ndash742 Korea (SH S-JK S-RK) Department of Genetic Engineering Suwon University Suwon 445ndash743 Korea (JY EK P-BP) Department of Biology Sunchon National University Sunchon 540ndash742 Korea (SJW KYP) Department of

Biology Yonsei University Seoul 120ndash749 Korea (HSC J-PH WTK) Department of Biology Seoul National University Seoul 151ndash747 Korea (VC SC) Department of Biology Dong-eui University Pusan 614ndash714 Korea (H-KL J-HC C-BL) and Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University 208 Mueller Laboratory University Park Pennsylvania 16802 (JN) hellipIf a particular sequence had not yet been annotated in the public database the sequence surrounding the insertion site was annotated using the Softberry program (httpwwwsoftberrycom) and the GeneMark program (httpopalbiologygatecheduGeneMark) Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie F Moehrlen ndash Page 1 Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie Frank Moumlhrlen Heidelberg 2002 Page 2 INAUGURAL-DISSERTATION zur heidiubuni-heidelbergde

  • FGENES
  • FGENESH
  • FGENESH++
    • Genome Research 15566-576 2005
    • ECgene Genome-based EST clustering and gene modeling for alternative splicing
    • Genome Research 14685-692 2004
    • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
    • Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes
    • Genome Research
      • Computing Center Academia Sinica Taipei 11529 Taiwan
        • FGENESH+
          • Genome Research 14685-692 2004
          • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
            • Computing Center Academia Sinica Taipei 11529 Taiwan
                • FGENESB
                  • Proteorhodopsin genes are distributed among divergent marine bacterial taxa
                  • Different SAR86 subgroups harbour divergent proteorhodopsins
                    • FGENESV
                      • Genome Organization of the SARS-CoV
                        • FGENES-M
                        • BESTORF
                        • PROTCOMP
                          • A proteomic study of the arabidopsis nuclear matrix
                          • Gene prediction in eukaryota
                            • BPROM
                            • SPLICEDB
                              • The evolving roles of alternative splicing
                              • SpliceDB database of canonical and non-canonical mammalian splice sites
                                • SCAN2
                                  • PromH promoters identification using orthologous genomic sequences
                                    • PDISORDER
                                    • SPL
                                    • NSITE
                                    • TSSP
                                    • PLANTPROM
                                      • Plant promoter prediction with confidence estimation
                                      • PlantProm a database of plant promoter sequences
                                        • PROMH
                                        • Other FGENESH
                                          • TAG Theoretical and Applied Genetics
                                            • MIPS analysis and annotation of proteins from whole genomes
                                              • Gene expression of a gene family in maize based on noncollinear haplotypes
                                              • TAG Theoretical and Applied Genetics
                                              • Genome annotation techniques new approaches and challenges
                                                • Various programs
                                                  • Characterizing the new transcription regulator protein p60TRP
                                                  • The Ensembl Core Software Libraries
                                                  • Lecture Notes in Computer Science
                                                  • Long-term Efficacy of Cholinesterase Inhibitors
Page 2: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)

Journal of Neurobiology Volume 63 Issue 3 Pages 235 ndash 254 Published Online 4 Mar 2005 Shaw potassium channel genes in Drosophila James J L Hodge 1 2 James C Choi 2 Cahir J OKane 1 Leslie C Griffith 2

1Department of Genetics University of Cambridge Downing Site Cambridge CB2 3EH United Kingdom 2Department of Biology and Volen Center for Complex Systems Brandeis University MS008 415 South Street Waltham Massachusetts 02454-9110 Intronexon structure of Shawl was initially predicted using the FGENES program (Salamov and Solovyev 2000) on the Baylor College of Medicine Genefinder Current Proteomics January 2004 vol 1 no 1 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE1 Affiliations 1 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Nature Biotechnology 22 1146 ndash 1149 5-end SAGE for the analysis of transcriptional start sites Hashimoto et al (2004) from expressed sequence tag (EST) maps analysis of full-length cDNAs and computational annotation by Genscan Genie FGENES and other programs Hemoglobin Volume 28 Number 3 2004 255 ndash 259 An a-Thalassemia Phenotype in a Dutch Hindustani Caused by a New Point Mutation that Creates an Alternative Splice Donor Site in the First Exon of the a2-Globin Gene Cornelis L Harteveld A1 Pierre W Wijermans A2 Peter van Delft A1 Ellen Rasp A2 Hans L Haak A2 Piero C Giordano A1 A1 Hemoglobinopathies Laboratory Leiden University Medical Center PO Box 9503 2300 Leiden The Netherlands A2 Leyenburg Hospital The Hague The Netherlands and GRAIL gap2 (Gene Recognition and Assembly Internet Link Sequence exploration and gene discovery Version 13) Genefinder GENSCAN Fgenes and HMMGene Journal of Medical Genetics 200441e52 Genomic organisation of the UDP-N-acetylglucosamine-1-phosphotransferase gamma subunit (GNPTAG) and its mutations in mucolipidosis III

A Raas-Rothschild R Bargal O Goldman E Ben-Asher J E M Groener A Toutain E Stemmer Z Ben-Neriah H Flusser F A Beemer M Penttinen T Olender A J J T Rein G Bach and M Zeigler

1 1 1 2 3 4

1 1 5 6 7 2 8

1 11 Department of Human Genetics Hadassah University Medical Center Jerusalem Israel 2 Department of Molecular Genetics Weizmann Institute of Science Rehovot Israel 3 Departments of Pediatrics and Clinical Genetics Leiden University Medical Center Leiden The Netherlands 4 Service de Geacuteneacutetique CHU de Tours Hocircpital Bretonneau Tours France 5 Child Development Unit Soroka University Hospital Beer Sheva Israel 6 Department of Medical Genetics University Medical Center Utrecht The Netherlands 7 Clinical Genetics Unit Turku University Central Hospital Turku Finland 8 Division of Pediatric Cardiology Hadassah University Medical Center Jerusalem Israel Correspondence to A Raas-Rothschild MD Department of Human Genetics Hadassah University Hospital Jerusalem 91120 Israel annickmdhujiacil third intron is 9 kb This structure was consistent with the exon prediction of fgenes and Genscan Each splice donor and acceptor

FGENESH European Journal of Plant Pathology Issue Volume 112 Number 1 Date May 2005 Pages 23 - 29 DOI 101007s10658-004-7088-7 Leptosphaeria maculans a fungal pathogen of Brassica napus secretes a subtilisin-like serine protease Leanne M Wilson1 and Barbara J Howlett1

1 School of Botany The University of Melbourne Parkville Victoria 3010 Australia DNA and cDNA sequences were compared to identify intron positions which confirmed those predicted by FGENESH gene prediction software (wwwsoftberrycom) Current Genetics Issue Volume 47 Number 5 Date May 2005 Pages 307 ndash 315 DOI 101007s00294-004-0559-8 During attachment Phytophthora spores secrete proteins containing thrombospondin type 1 repeats Andrea V Robold1 and Adrienne R Hardham1

(1) Plant Cell Biology Group Research School of Biological Sciences The Australian National University Canberra ACT 2601 Australia infohtml) The DNA sequence was searched for introns using the soft- ware program FGENESH (httpwwwsoftberry comberryphtml Microbiology 151 (2005) 1499-1505 DOI 101099mic027759-0 Overproduction purification and characterization of FgaPT2 a dimethylallyltryptophan synthase from Aspergillus fumigatus Inge A Unsoumlld and Shu-Ming Li Pharmazeutische Biologie Pharmazeutisches Institut Eberhard-Karls-Universitaumlt Tuumlbingen Auf der Morgenstelle 8 72076 Tuumlbingen Germany Correspondence Shu-Ming Li shumingliuni-tuebingende FGENESH (Softberry wwwsoftberrycomberryphtml) and the DNASIS software package (version 21 Hitachi Software Engineering) were used for intron prediction Eukaryotic Cell March 2005 p 526-535 Vol 4 No 3 doi101128EC43526-5352005 Sex-Specific Homeodomain Proteins Sxi1 and Sxi2a Coordinately Regulate Sexual Development in Cryptococcus neoformans Christina M Hull1 Marie-Josee Boily and Joseph Heitman 1 12

Department of Molecular Genetics and Microbiology1 the Howard Hughes Medical Institute Duke University Medical Center Durham North Carolina2 Received 14 January 2005 Accepted 14 January 2005 Corresponding author Mailing address 322 CARL Building Box 3546 Department of Molecular Genetics and Microbiology 100 Research Dr Duke University Medical Center Durham NC 27710 Phone (919) 684-2824 Fax (919) 684-5458 E-mail heitm001dukeedu Present address Departments of Biomolecular Chemistry and Medical Microbiology amp

Immunology University of WisconsinmdashMadison Madison WI 53706

Sequence manipulations Splice predictions of candidate gene sequences for SXI2a were facilitated with a Softberry algorithm (wwwsoftberrycom)

Plant Mol Biol 2005 Feb57(3)445-60 Evaluation of five ab initio gene prediction programs for the discovery of maize genes Yao H Guo L Fu Y Borsuk LA Wen TJ Skibbe DS Cui X Scheffler BE Cao J Emrich SJ Ashlock DA Schnable PS Department of Genetics Development and Cell Biology Iowa State University Ames 50011-3650 USA hellipThe five programs were used in conjunction with RT-PCR to identify and establish the structures of two new genes in the a1-sh2 interval of the maize genome FGENESH GeneMarkhmm and GENSCAN were tested on a larger data set consisting of maize assembled genomic islands (MAGIs) that had been aligned to ESTs FGENESH GeneMarkhmm and GENSCAN correctly predicted gene models in 773 625 and 371 MAGIs respectively out of the 1353 MAGIs that comprise data set 2hellip New Phytologist 167 (1) July 2005 239-247 doi101111j1469-8137200501392x Identification of perennial ryegrass (Lolium perenne (L)) and meadow fescue (Festuca pratensis (Huds)) candidate orthologous sequences to the rice Hd1(Se1) and barley HvCO1 CONSTANS-like genes through comparative mapping and microsynteny P Armstead L Skoslasht L B Turner K Skoslasht I S Donnison M O Humphreys and I P King Author for correspondence Ian Armstead Tel +44 (0)1970 823108 Fax +44 (0)1970 823242 Email ianarmsteadbbsrcacuk Predictions of mRNA and protein sequences were carried out using fgenesh and fgenesh+ software ( http wwwsoftberrycom berryphtml ) Plant Physiology May 2005 Vol 138 pp 38-46 BIOINFORMATICS-PLANT DATABASES Databases and Information Integration for the Medicago truncatula Genome and Transcriptome1 Steven B Cannon John A Crow Michael L Heuer Xiaohong Wang Ethalinda KS Cannon Christopher Dwan Anne-Francoise Lamblin Jayprakash Vasdewani Joann Mudge Andrew Cook John Gish Foo Cheung Steve Kenton Timothy M Kunau Douglas Brown Gregory D May Dongjin Kim Douglas R Cook Bruce A Roe Chris D Town Nevin D Young and Ernest F Retzel

2 3

Department of Plant Pathology University of Minnesota St Paul Minnesota 55108 (SBC XW EKSC JV JM NDY) Center for Computational Genomics and Bioinformatics University of Minnesota Minneapolis Minnesota 55455 (JAC MLH CD AFL TMK

EFR) Department of Plant Pathology University of California Davis California 95616 (AC JG DJK DRC) The Institute for Genomic Research Rockville Maryland 20850 (FC CDT) Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma 73019 (BAR SK) Plant Biology Division The Samuel Roberts Noble Foundation Ardmore Oklahoma 73401 (GDM) and North Carolina State University Fungal Genomics Laboratory Department of Plant Pathology Raleigh North Carolina 27695 (DB) 1 This work was supported by the National Science Foundation (awards DBIndash0321460 DBIndash0196197 DBIndash0110206 DBIndash9975806 and DBIndash9872565) by the US Department of Agriculture Cooperative State Research Education and Extension ServiceNational Research Initiative Program and by the Samuel Noble Roberts Foundation 2 Present address BioTeam Inc Cambridge MA 3 Present address University of Minnesota Cancer Center MMC 806 420 Delaware St SE Minneapolis MN 55455 wwwplantphysiolorgcgidoi101104pp104059204 Corresponding author e-mail ernestccgbumnedu fax 612ndash626ndash6069 Received December 31 2004 returned for revision March 4 2005 accepted March 21 2005 Annotation involves a multi-institution pipeline relying on Medicago-trained FGENESH (Salamov and Solovyev 2000 ) predictions the EuGene (Foissac et al Plant Physiology May 2005 Vol 138 pp 18-26 BIOINFORMATICS-PLANT DATABASES The Institute for Genomic Research Osa1 Rice Genome Annotation Database1 Qiaoping Yuan Shu Ouyang Aihui Wang Wei Zhu Rama Maiti Haining Lin John Hamilton Brian Haas Razvan Sultana Foo Cheung Jennifer Wortman and C Robin Buell

2

The Institute for Genomic Research Rockville Maryland 20850 1 This work (on rice genome annotation) was supported by the National Science Foundation (grant no DBIndash0321538 to CRB) and the US Department of Agriculture (grant no 2003ndash35317ndash13173 to CRB) 2 Present address Laboratory of Neurogenetics NIAAA NIH 5625 Fishers Lane Suite 3532 MSC 9412 Bethesda MD 20892 wwwplantphysiolorgcgidoi101104pp104059063 Corresponding author e-mail rbuelltigrorg fax 301ndash838ndash0208 Received December 31 2004 returned for revision February 24 2005 accepted March 21 2005 The ab initio gene finders used in the rice EGC pipeline include FGENESH (monocot matrix Salamov and Solovyev 2000 ) GeneMarkhmm (rice matrix Lukashin and Genome Research 15577-582 2005 Closing in on the C elegans ORFeome by cloning TWINSCAN predictions Chaochun Wei Philippe Lamesch Manimozhiyan Arumugam Jennifer Rosenberg Ping Hu Marc Vidal and Michael R Brent

1 2 1 2

1 2 13

1 Laboratory for Computational Genomics and Department of Computer Science and Engineering Washington University St Louis Missouri 63130 USA 2 Center for Cancer Systems Biology Dana-Farber Cancer Institute and Department of Genetics Harvard Medical School Boston Massachusetts 02115 USA

3 Corresponding author E-mail brentcsewustledu fax (314) 935-7302 Finally we compared TWINSCAN with two other gene-prediction systems that have recently been developed for nematodesndashFGENESH (Salamov and Solovyev 2000 Plant Physiology April 2005 Vol 137 pp 1174-1181 UPDATE ON SEQUENCING MEDICAGO TRUNCATULA AND LOTUS JAPONICUS Sequencing the Genespaces of Medicago truncatula and Lotus japonicus1 Nevin D Young Steven B Cannon Shusei Sato Dongjin Kim Douglas R Cook Chris D Town Bruce A Roe and Satoshi Tabata Department of Plant Pathology University of Minnesota St Paul Minnesota 55108 (NDY SBC) Kazusa DNA Research Institute Kisarazu Chiba 292ndash0818 Japan (SS ST) Department of Plant Pathology University of California Davis California 95616 (DRC DK) The Institute for Genomic Research Rockville Maryland 20850 (CDT) and Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma 73019 (BAR) 1 The US component of the Medicago truncatula sequencing effort was initially supported by a grant from the Samuel Roberts Noble Foundation to BAR Current support comes from National Science Foundation Plant Genome Research Program (grant no 0110206 to DRC DK CDT and NDY and grant no 0321460 to NDY BAR and CDT) Funding for Lotus japonicus sequencing comes from the Kazusa DNA Research Institute Foundation wwwplantphysiolorgcgidoi101104pp104057034 Corresponding author e-mail nevinyumnedu fax 612ndash625ndash9728 hellip (These estimates are based on Fgenesh predictions [Salamov and Solovyev 2000 ] using a Mt-trained matrix retaining peptides with a BLASTP match at 10endash4 to the UniProt NREF100 database of peptides [Apweiler et al 2004 ] This estimate for Lj differs from the published value of 1 gene per 101 kb [Asamizu et al 2003a ] due to the use here of the Fgenesh gene-calling algorithm so Mt and Lj could be compared directly)hellip hellipThis estimate increases to 6500 when Lj genes are predicted by the Mt-trained Fgenesh algorithm described earlierhellip International Journal of CancerVolume 109 Issue 1 Pages 71 - 75Cancer Genetics Candidate regions of tumor suppressor locus on chromosome 9q311 in gastric cancer Naoto Kakinuma 1 Kazuyoshi Kohu 1 2 Masaaki Sato 1 Tatsuya Yamada 3 Motowo Nakajima 1 Tetsu Akiyama 2 Susumu Ohwada 3 Yasuhiko Shibanaka 1

1Novartis Pharma Tsukuba Research Institute Ibaraki Japan 2Laboratory of Molecular and Genetic Information Institute for Molecular and Cellular Biosciences University of Tokyo Tokyo Japan 3Second Department of Surgery Gunma University School of Medicine Gunma Japan email Yasuhiko Shibanaka (yasuhikoshibanakapharmanovartiscom) Correspondence to Yasuhiko Shibanaka Novartis Pharma Tsukuba Research Institute Ohkubo 8 Tsukuba-shi Ibaraki 300-2611 Japan Fax +81-29-865-2281

predict the genes between D9S277 and D9S127 in 9q311 the gene prediction tools also in the UCSC Genome Browser having Acembly Ensembl Fgenesh GenScan and Genome Research 1554-66 2005 Gene and alternative splicing annotation with AIR Liliana Florea145 Valentina Di Francesco2 Jason Miller1 Russell Turner1 Alison Yao2 Michael Harris2 Brian Walenz1 Clark Mobarry1 Gennady V Merkulov3 Rosane Charlab3 Ian Dew1 Zuoming Deng3 Sorin Istrail1 Peter Li2 and Granger Sutton1 1 Informatics Research Applied Biosystems Rockville Maryland 20850 USA 2 Advanced Solutions Celera Genomics Rockville Maryland 20850 USA 3 Scientific Content and Applications Celera Genomics Rockville Maryland 20850 USA Ab initio prediction programs such as GenScan (Burge and Karlin 1997 ) FGenesH (Salamov and Solovyev 2000 ) Genie (Kulp et al Nucleic Acids Research 2005 Vol 33 Database issue D399-D402 SilkDB a knowledgebase for silkworm biology and genomics Jing Wang Qingyou Xia Ximiao He Mingtao Dai Jue Ruan Jie Chen Guo Yu Haifeng Yuan Yafeng Hu Ruiqiang Li Tao Feng Chen Ye Cheng Lu Jun Wang Songgang Li Gane Ka-Shu Wong Huanming Yang Jian Wang Zhonghuai Xiang Zeyang Zhou and Jun Yu

1 2 34 3 34 3 3

3 3 3 3 3 2 135

1 36 35 35 2

2 35

1 College of Life Sciences Peking University Beijing 100871 China 2 The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China 3 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 4 Graduate School of the Chinese Academy of Sciences Yuquan Road 19A Beijing 100039 China 5 Key Laboratory of Bioinformatics of Zhejiang Province Hangzhou Genomics Institute James D Watson Institute of Genome Sciences of Zhejiang University Hangzhou 310007 China and 6 Department of Medicine University of Washington Genome Center Seattle WA 98195 USA To whom correspondence should be addressed Tel +86 10 80481455 Fax +86 10 80498676 Email junyugenomicsorgcn Correspondence may also be addressed to Zeyang Zhou Tel +86 23 68251123 Fax +86 23 68251128 Email zyzhouswaucqcn The authors wish it to be known that in their opinion the first three authors should be regarded as joint First Authors BGF is a self-developed ab initio program based on GenScan (9) and FgeneSH (10) and was successfully utilized for our rice genome annotation (11) Clinical Cancer Research Vol 11 4029-4036 June 1 2005 Imaging Diagnosis Prognosis A Molecular Signature in Superficial Bladder Carcinoma Predicts Clinical Outcome Lars Dyrskjoslasht Karsten Zieger Mogens Kruhoslashffer Thomas Thykjaer Jens L Jensen Hanne Primdahl Natasha Aziz Niels Marcussen Klaus Moslashller and Torben F Oslashrntoft

1 12 15 15 4

1 6 3 2 1

Authors Affiliations 1 Molecular Diagnostic Laboratory Department of Clinical Biochemistry 2 Department of Urology and 3 University Institute of Pathology Aarhus University Hospital 4 Departments of Theoretical Statistics

and Mathematical Sciences University of Aarhus 5 Aros Applied Biotechnology Aarhus Denmark and 6 Eos BiotechnologyProtein Design Labs Fremont California Requests for reprints Torben F Oslashrntoft Molecular Diagnostic Laboratory Department of Clinical Biochemistry Aarhus University Hospital Skejby DK-8200 Aarhus N Denmark Phone 45-89495100 Fax 45-89496018 E-mail orntoftkiaudk array comprising 59619 probe sets representing 46000 unique sequences including known genes expressed sequence tag clusters and FGENESH-predicted exons BMC Evolutionary Biology 2005 51 doi1011861471-2148-5-1 Research article The WRKY transcription factor superfamily its origin in eukaryotes and expansion in plants Yuanji Zhang and Liangjiang Wang Address Plant Biology Division The Samuel Roberts Noble Foundation Ardmore OK 73402 USA Email Yuanji Zhang - yjzhangnobleorg Liangjiang Wang - Kevinlwangaolcom Corresponding author Despite minor differences in the gene structure prediction both gene prediction programs FGENESH and GENSCAN agree on the major features of the protein Nature Immunology 2005 v6 n3 The immunoglobulin heavy-chain locus in zebrafish identification and expression of a previously hellip N Danilova J Bussmann K Jekosch LA Steiner - - naturecom Nature Immunology Full text access provided to Googlebot Access by Web Services PLoS Biol 2005 June 3(6) e181 Published online 2005 May 24 doi 101371journalpbio0030181 RAG1 Core and V(D)J Recombination Signal Sequences Were Derived from Transib Transposons Vladimir V Kapitonov 1 and Jerzy Jurka 1

1Genetic Information Research Institute Mountain View California United States of America David Nemazee Academic EditorScripps Research Institute United States of America

Corresponding author Vladimir V Kapitonov vladimirgirinstorg Jerzy Jurka jurkagirinstorg Using FGENESH [33] we detected that the RAG1 corendashlike open reading frame (ORF) in the contig 29068 forms a terminal exon (positions 1154ndash2947) of an Genetics Published Articles Ahead of Print published on February 16 2005 as 101534genetics104036327 Identification and Characterization of Regions of the Rice Genome Associated with Broad- Spectrum Quantitative Disease Resistance Randall J Wisser Qi Sundagger Scot H Hulbertsect Stephen Kresovich and Rebecca J Nelsondaggerdagger 1 Department of Plant Breeding and Genetics Institute for Genomic Diversity Cornell University Ithaca New York 14853 daggerComputational Biology Service Unit Cornell Theory

Center Cornell University Ithaca New York 14853 sectDepartment of Plant Pathology Kansas State University Manhattan Kansas 66506 and daggerdaggerDepartment of Plant Pathology Cornell University Ithaca New York 14853 1Corresponding author Rebecca J Nelson Department of Plant Pathology Cornell University 321 Plant Science Ithaca NY 14853 Email rjn7cornelledu GENSCAN (B URGE and K ARLIN 1997) and FGENESH (S ALAMOV and S OLOVYEV 2001) to predict open reading frames Further searches against Plant Physiology January 2005 Vol 137 pp 176-189 Annotations and Functional Analyses of the Rice WRKY Gene Superfamily Reveal Positive and Negative Regulators of Abscisic Acid Signaling in Aleurone Cells1[w]

Zhen Xie Zhong-Lin Zhang Xiaolu Zou Jie Huang Paul Ruas Daniel Thompson and Qingxi J Shen

2 23

Department of Biological Sciences University of Nevada Las Vegas Nevada 891541 This work was supported by the US Department of Agriculture (grant no 02ndash35301ndash12066) by the National Institutes of Health (Biomedical Research Infrastructure Network seed grant no P20 RR16464) and by the University of Las Vegas Nevada (start-up funds to QJS) XZ was supported by a National Science Foundation Experimental Program to Stimulate Competitive

Research (EPSCoR) Integrative Approaches to Abiotic Stress (EPSndash0132556) graduate assistantship and Z-LZ by a National Science Foundation EPSCoR Advanced Computing in Environmental Sciences postdoctoral fellowship 2 These authors contributed equally to the paper 3 Present address Department of Plant and Microbial Biology University of California Berkeley CA 94720 and Plant Gene Expression Center US Department of Agriculture Albany CA 94710 [w] The online version of this article contains Web-only data First of all three genes (OsWRKY41 -43 and -44) were reannotated using FGENESH (wwwsoftberrycom) because the first introns of these genes were too small PLoS Biol 3(6) e181 (2005) RAG1 Core and V(D)J Recombination Signal Sequences Were Derived from Transib Transposons Vladimir V Kapitonov1 Jerzy Jurka1 1 Genetic Information Research Institute Mountain View California United States of America To whom correspondence should be addressed E-mail vladimirgirinstorg (VVK) E-mail jurkagirinstorg (JJ) Using FGENESH [33] we detected that the RAG1 corendashlike open reading frame (ORF) in the contig 29068 forms a terminal exon (positions 1154ndash2947) of an Plant and Cell Physiology 2005 46(1)3-13 doi101093pcppci503 From Mapping to Sequencing Post-sequencing and Beyond Takuji Sasaki Takashi Matsumoto Baltazar A Antonio and Yoshiaki Nagamura 1

National Institute of Agrobiological Sciences 2-1-2 Kannondai Tsukuba Ibaraki 305-8602 Japan 1 Corresponding author E-mail tsasakiniasaffrcgojp

The gene predictions by programs such as Genescan (Burge and Karlin 1997 ) FGENESH [see Appendix 1 (4)] and Genemark [see Appendix 1 (5)] BLAST (Altschul et Improving the nutritional value of Golden Rice through increased pro-vitamin A content JA Paine CA Shipton S Chaggar RM Howells MJ hellip - Nature Biotechnology 2005 - naturecom Arabidopsis thaliana psy and rice psy (AY024351) genes identified genomic sequences of similarity in which genes were predicted using FGENESH algorithm with Genetics Published Articles Ahead of Print published on January 16 2005 as 101534genetics104035543 THE GENETIC BASIS FOR INFLORESCENCE VARIATION BETWEEN FOXTAIL AND GREEN MILLET (POACEAE) Andrew N Doust Katrien M Devosdagger1 Mike D Gadberry Mike D Galedagger amp Elizabeth A Kellogg University of Missouri-St Louis Department of Biology One University Boulevard St Louis MO 63121 USA daggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH UK Current address University of Georgia-Athens Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building Athens GA 30602 USA 1 Each of these contigs was scanned using FgeneSH (S ALAMOV and S OLOVYEV 2000) and open reading frames (ORFs) were translated and PLoS Biol 3(1) e13 January 2005

Sorghum Genome Sequencing by Methylation Filtration Joseph A Bedell1 Muhammad A Budiman2 Andrew Nunberg1 Robert W Citek1 Dan Robbins1 Joshua Jones2 Elizabeth Flick2 Theresa Rohlfing3 Jason Fries3 Kourtney Bradford3 Jennifer McMenamy3 Michael Smith4 Heather Holeman4 Bruce A Roe5 Graham Wiley5 Ian F Korf6 Pablo D Rabinowicz7 Nathan Lakey8 W Richard McCombie9 Jeffrey A Jeddeloh4 Robert A Martienssen9 1 Bioinformatics Orion Genomics Saint Louis Missouri United States of America 2 Library Construction Orion Genomics Saint Louis Missouri United States of America 3 Sequencing Orion Genomics Saint Louis Missouri United States of America 4 Biomarkers Orion Genomics Saint Louis Missouri United States of America 5 Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma United States of America 6 Genome Center University of California Davis California United States of America 7 The Institute for Genomic Research Rockville Maryland United States of America 8 Business Orion Genomics Saint Louis Missouri United States of America 9 Cold Spring Harbor Laboratory Cold Spring Harbor New York United States of America additional parameters wordmask=seg lcmask M=1 N=ndash1 Q=3 R=3 kap E=1e-10 hspmax=0 To look for potentially novel genes we used Fgenesh (httpwww BMC Genomics 2005 611 doi1011861471-2164-6-11 FAM20 an evolutionarily conserved family of secreted proteins expressed in hematopoietic cells Demet Nalbant1 Hyewon Youn1 3 S Isil Nalbant1 Savitha Sharma1 Everardo Cobos2 3 Elmus G Beale1 Yang Du1 and Simon C Williams1 3 1Department of Cell Biology and Biochemistry Texas Tech University Health Sciences Center

Lubbock Texas 79430 USA 2Department of Internal Medicine Texas Tech University Health Sciences Center Lubbock Texas 79430 USA 3Southwest Cancer Center at University Medical Center Lubbock Texas 79430 USA These results were compared against genes assembled by two gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmit Plant Physiology July 2005 Vol 138 pp 1205-1215 Complex Organization and Evolution of the Tomato Pericentromeric Region at the FER Gene Locus1[w] Romain Guyot Xudong Cheng Yan Su Zhukuan Cheng Edith Schlagenhauf Beat Keller and Hong-Qing Ling

2 2

Institute of Plant Biology University of Zurich 8008 Zurich Switzerland (RG ES BK H-QL) and Institute of Genetics and Developmental Biology Chinese Academy of Sciences Chaoyang District Beijing 100101 China (XC YS ZC H-QL) Putative genes were determined by a combination of coding region prediction software (GENSCAN FGENESH and MZEF with Arabidopsis andor monocot matrix J Gen Virol 86 (2005) 973-983 DOI 101099vir080833-0 Cloning characterization and analysis by RNA interference of various genes of the Chelonus inanitus polydnavirus Marianne Bonvin Dorothee Marti Stefan Wyder Dejan Kojic Marc Annaheim and Beatrice Lanzrein Institute of Cell Biology University of Berne Baltzerstrasse 4 CH-3012 Bern Switzerland Correspondence Beatrice Lanzrein beatricelanzreinizbunibech 12g1forw (5-GAGTCCATGCCGAATGTCAC-3) and 12g1rev (5-CTTCTTGCACAGCGACGAAC-3) were set to amplify the middle region of 12g1 as predicted with FGENESH 10 and The Plant Cell 17343-360 (2005) Evolution of DNA Sequence Nonhomologies among Maize Inbreds Stephan Brunner Kevin Fengler Michele Morgante Scott Tingey and Antoni Rafalski a1 a b a a

a DuPont Crop Genetics Research Wilmington Delaware 19880-353 b Universitaacute degli Studi di Udine Dipartimento di Scienze Agrarie ed Ambientali 33100 Udine Italy 1 To whom correspondence should be addressed E-mail stephanbrunnercgrdupontcom fax 302-695-2726 PNAS | February 1 2005 | vol 102 | no 5 | 1566-1571 Published online before print January 24 2005 101073pnas0409421102 A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome

Mark Yandell Adina M Bailey Sima Misra ShengQiang Shu Colin Wiel Martha Evans-Holm Susan E Celniker and Gerald M Rubin para para Howard Hughes Medical Institute and Department of Molecular and Cell Biology University of California Life Sciences Addition Berkeley CA 94720-3200 and paraDepartment of Genome Sciences Lawrence Berkeley National Laboratory One Cyclotron Road Mailstop 64-121 Berkeley CA 94720 genes based on a microarray-based approach that involved hybridizing randomly primed cDNA against probes corresponding to a large set of FGENESH predictions Insect Molecular Biology Volume 14 Issue 2 Page 113 - 119 April 2005 doi101111j1365-2583200400536x Detection and analysis of alternative splicing in the silkworm by aligning expressed sequence tags with the genomic sequence X-F Zha Q-Y Xia P Zhao J Li J Duan Z-L Wang J-F Qian and Z-H Xiang Correspondence Dr Qing-You Xia The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China Tel +86 23 68250748 fax +86 23 68251128 e-mail xiaqyswaucqcn previously predicted silkworm genes in the genomic sequences by BGF a newly developed program based on GENSCAN (Burge amp Karlin 1997) and Fgenesh (Salamov amp Microbiology 151 (2005) 2199-2207 DOI 101099mic027962-0 Overproduction purification and characterization of FtmPT1 a brevianamide F prenyltransferase from Aspergillus fumigatus Alexander Grundmann and Shu-Ming Li Pharmazeutische Biologie Pharmazeutisches Institut Eberhard-Karls-Universitaumlt Tuumlbingen Auf der Morgenstelle 8 72076 Tuumlbingen Germany Correspondence Shu-Ming Li shumingliuni-tuebingende FGENESH (Softberry Inc httpwwwsoftberrycomberryphtml) and the DNASIS software package (version 21 Hitachi Software Engineering) were used for

FGENESH++ Am J Hum Genet 76652-662 2005 Position Effects Due to Chromosome Breakpoints that Map 900 Kb Upstream and 13 Mb

Downstream of SOX9 in Two Patients with Campomelic Dysplasia

Gopalrao V N Velagaleti12 Gabriel A Bien-Willner3 Jill K Northup1 Lillian H Lockhart2

Judy C Hawkins2 Syed M Jalal6 Marjorie Withers3 James R Lupski345 and

Pawel Stankiewicz3 Departments of 1Pathology and 2Pediatrics University of Texas Medical Branch Galveston

Departments of 3Molecular and Human Genetics and 4Pediatrics Baylor College of Medicine and

5Texas Childrens Hospital Houston and 6Department of Laboratory Medicine and Pathology Mayo Clinic Rochester MN hellipIn an effort to identify possible transcripts that may be responsible for the CD phenotype we

used several gene-prediction programs and identified seven hypothetical transcripts in the region

that spans 100 kb in either direction from the breakpoint on chromosome 17 Ecgenes H17C123061 and H17C123081 SGP genes Chr17_15381 and Ch17_15391 Fgenesh++ gene C17001650 and Genscan genes NT_01064144 and NT_01064145hellip Genome Research 15566-576 2005 ECgene Genome-based EST clustering and gene modeling for alternative splicing Namshin Kim Seokmin Shin and Sanghyuk Lee12 2 13 1 Division of Molecular Life Sciences Ewha Womans University Seoul 120-750 Korea 2 School of Chemistry Seoul National University Seoul 151-747 Korea hellipthe structure of full-length mRNA can be inferred by examining the flanking genomic region especially with the aid of ab initio gene predicting programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh++ (Salamov and Solovyev 2000 )hellip Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA hellip we built a three-way synteny map based on chains of Fgenesh++-predicted (Solovyev 2002 ) exons rather than whole genes hellip Genome Research 14539-548 2004 Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of

Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA hellipThe gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom )hellip Nucleic Acids Research 2003 Vol 31 No 1 207-211 copy 2003 Oxford University Press The PEDANT genome database Dmitrij Frishman Martin Mokrejs Denis Kosykh Gabi Kastenmuumlller Grigory Kolesov Igor Zubrzycki Christian Gruber Birgitta Geier Andreas Kaps Kaj Albermann Andreas Volz Christian Wagner Matthias Fellenberg Klaus Heumann and Hans-Werner Mewes

1 1 1 1 1

1 2 2 2 2

2 2 2 2 13

1 Institute for Bioinformatics GSF - National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 85764 Neueherberg Germany 2 Biomax Informatics AG Lochhamer Straszlige 11 82152 Martinsried Germany 3 Department of Genome-oriented Bioinformatics Wissenschaftszentrum Weihenstephan Technische Universitaumlt Muumlnchen 85350 Freising Germany To whom correspondence should be addressed Tel +49 89 31874201 Fax +49 89 31873585 Email dfrishmangsfde The mouse database contains 20 chromosome contigs with 37 793 genes predicted using the Fgenesh++ software (wwwsoftberrycom) Reprint from Daily Biotech Updates www genengnewscom Vol 22 No 17 October 1 2002 DrugDiscovery Tech NoteAn Enhanced Human-Genome Database Transforming Raw Human Sequence Data Into Useful Information Christine Schuumlller PhD and Andreas Fritz PhD The Softberry analysis results for which Biomax has the exclusive world-wide commercial license contain approximately 40000 genes which agrees well with predictions of the total number of human genes (according to the International Human Genome Sequencing Consortium or IHGSC) hellip For example 50 of the genes in the Biomax Human Genome Database are not found in the Ensembl database These genes (identified by FGENESH++ and Biomax and not found in Ensembl database) comprise the following 6 of genes classified as known genes 50 classified as having some similarity to known genes and 90 of the genes not having similarity to known genes

For human genome applications the FGENESH++ software was first used to map known human genes using sequences available from the Reference Sequence (RefSeq) Project at the Nation al Center for Biotechnology Information (NCBI Bethesda MD wwwncbinlm nihgovLocusLinkrefseqhtml) REFERENCES 1 Salamov AA and Solovyev VVAb initio gene finding in Drosophila genomic DNA Genome Res 10 391ndash7 (2000) Genome Research 14539-548 2004 ISSN 1088-9051 $500 Letter Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA The gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom ) Published online before print June 12 2003 101101gr529803 Genome Research 131765-1774 2003 ISSN 1088-9051 $500 Identification of Promoter Regions in the Human Genome by Using a Retroviral Plasmid Library-Based Functional Reporter Gene Assay Shirin Khambata-Ford15 Yueyi Liu2 Christopher Gleason1 Mark Dickson3 Russ B Altman2 Serafim Batzoglou4 and Richard M Myers136 1 Department of Genetics Stanford University School of Medicine Stanford California 94305 USA 2 Stanford Medical Informatics Stanford University School of Medicine Stanford California 94305 USA 3 Stanford Human Genome Center Stanford University School of Medicine Stanford California 94305 USA 4 Department of Computer Science Stanford University Stanford California 94305 USA hellipOf 858 sequences 9 of GFP+ low clones and 8 of GFP+ high clones aligned to the 2-kb

segment upstream of the transcription start site of a predicted gene in at least two of four data sets of predicted genes from Genscan Ensembl Softberry (Fgenesh++) and Acembly (category

B in Table 1) Cell Vol 110 521ndash529 August 23 2002 Copyright 2002 by Cell Press HIV-1 Integration in the Human Genome Favors Active Genes and Local Hotspots

Astrid RW Schrouml der1 Paul Shinn2 Huaming Chen2 Charles Berry3 Joseph R Ecker2 and Frederic Bushman14 1Infectious Disease Laboratory 2Genomic Analysis Laboratory The Salk Institute 10010 North Torrey Pines Road La Jolla California 92037 3Department of FamilyPreventive Medicine School of Medicine University of California San Diego San Diego California 92093 hellipAn integration target sequence was scored as a part of a transcrip-tion unit if it was (1) a member of the Refseq set of well-studied genes (httpwwwncbinlmnihgovLocusLinkrefseqhtml) or (2) if it was predicted to be a transcription unit by the ENSEMBLE (httpwwwensemblorg) or Fgenesh++ (httpwwwsoftberrycomHelpfgeneshplus2htm) programs and if that assignment was supported by mRNA or spliced EST sequence evidence Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation

FGENESH+ Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA Fgenesh+ gene prediction is conducted on sequences with protein homology Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation Annual Review of Genomics and Human Genetics Vol 3 293-310 (Volume publication date September 2002) (doi101146annurevgenom3030502101529) DATABASES AND TOOLS FOR BROWSING GENOMES Ewan Birney 1 Michele Clamp and 2 Tim Hubbard2 1European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom e-mail birneyebiacuk 2Wellcome Trust Sanger Institute Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom michelesangeracuk thsangeracuk

Another predicted gene track on the UCSC browser comes from Softberry ( http wwwsoftberrycom ) and uses a program Fgenesh+ which is based on HMMs and

FGENESB Appl Environ Microbiol 2004 April 70(4) 2332ndash2341 Oxygen-Controlled Bacterial Growth in the Sponge Suberites domuncula toward a Molecular Understanding of the Symbiotic Relationships between Sponge and Bacteriadagger

Werner E G Muumlller Vladislav A Grebenjuk Narsinh L Thakur Archana N Thakur Renato Batel Anatoli Krasko Isabel M Muumlller and Hans J Breter Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz D-55099 Mainz Germany Corresponding author Mailing address Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz Duesbergweg 6 55099 Mainz Germany Phone 6131-3925910 Fax 6131-3925243 E-mail wmuellermailuni-mainzde For genes and potential promoter prediction we used the FGENESB-PatternMarkov chain-based bacterial operon and gene prediction program from the SoftBerry Journal of Theoretical Biology 230 (2004) 133ndash144 Computational prediction of conserved operons and phylogenetic footprinting of transcription regulatory elements in the metal-reducing bacterial family Geobacteraceae Bin Yana Barbara A Metheґ b Derek R Lovleyc Julia Krushkala aDepartment of Preventive Medicine Center of Genomics and Bioinformatics University of Tennesee Health Science Center 66 N Pauline St Ste 633 Memphis TN 38163 USA bThe Institute for Genomic Research Rockville MD USA cDepartment of Microbiology Morrill Science Center IV North University of Massachusetts 639 North Pleasant Str Amherst MA 01003 USA the conserved nature of the operons 2 Operons in Geobacter sulfurreducens were predicted ab initio by the public version of program FGENESB (V Solovyev and V Molecular Microbiology Volume 52 Issue 6 Page 1579 -1596 June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Copyright copy 2003 The National Academy of Sciences Proc Natl Acad Sci U S A 2003 October 28 100(22) 12830ndash12835 doi 101073pnas2133554100 Published online 2003 October 17 Evolution Proteorhodopsin genes are distributed among divergent marine bacterial taxa Joseacute R de la TorredaggerDagger Lynne M Christiansondagger Oded Beacutejagravedaggersect Marcelino T Suzukidaggerpara David M Karl John Heidelberg and Edward F DeLongdaggerdaggerdagger

daggerMonterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 sectDepartment of Biology Technion-Israel Institute of Technology Haifa 32000 Israel

paraChesapeake Biological Laboratory University of Maryland Solomons MD 20688 Department of Oceanography University of Hawaii Manoa HI 96822 and Institute for Genomic Research Rockville MD 20850 Edited by Sallie W Chisholm Massachusetts Institute of Technology Cambridge MA and approved August 21 2003 (received for review 2003 June 10) DaggerPresent address Department of Civil and Environmental Engineering University of Washington Seattle WA 98195 daggerdagger To whom correspondence should be addressed E-mail delongmbariorg hellip Analysis of the potential genes and protein-coding regions was performed by using a combination of the BLAST (11) GLIMMER 202 (TIGR) (12 13) FGENESB (Softberry Mount Kisco NY) and ARTEMIS (Sanger Center Cambridge University UK) (14) software packages Environmental Microbiology September 2004 vol 6 no 9 pp 903-910(8) DOI 101111j1462-2920200400676x Different SAR86 subgroups harbour divergent proteorhodopsins Gazalah Sabehi1 Oded Beacutejagrave1 Marcelino T Suzuki2 Christina M Preston3 Edward F DeLong4

Affiliations 1 Department of Biology Technion-Israel Institute of Technology Haifa 32000 Israel 2 Chesapeake Biological Laboratory University of Maryland Center for Environmental Sciences Solomons MD 20688 USA 3 Monterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 USA 4 Massachusetts Institute of Technology Cambridge MA 02139 USA

program FGENESB (Softberry) and the annotation was subsequently refined and curated manually using ARTEMIS (Sanger Center) Fig

FGENESV Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song Qi Wei Qin Jin Qiu Can Hua Huang Fan Wang and Choy Leong Hew1 2 1 1 1 1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2

A total of 162 ORFs predicted by the FGENESV program (available through httpwwwsoftberrycom) supplemented with Vector NTI suite 71 are indicated doi101023BVIRU000002577148128f8 Virus Genes 28 (3) 239-246 April 2004 Article ID 5269250 Complete Nucleotide Sequence of a Strawberry Isolate of Beet Pseudoyellows Virus Ioannis E Tzanetakis Molecular and Cellular Biology Program Department of Botany and Plant Pathology Oregon State University Corvallis 97331 USA Robert R Martin Horticultural Crops Research Laboratory USDA-ARS Corvallis OR 97330 USA E-mail martinrrscienceoregonstateedu httpwwwncbinlmnih govgorfgorfhtml) and the gene finder in viruses at httpwwwsoftberrycom The amino acid comparisons Geno Prot amp Bioinfo Vol 1 No 3 August 2003 226-235 Genome Organization of the SARS-CoV Jing Xu1 Jianfei Hu21 Jing Wang21 Yujun Han1 Yongwu Hu13 Jie Wen1 Yan Li1 Jia Ji1 Jia Ye14 Zizhang Zhang5 Wei Wei4 Songgang Li12 Jun Wang1 Jian Wang14 Jun Yu14 and Huanming Yang14 1 Beijing Genomics Institute Chinese Academy of Sciences Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3Wenzhou Medical College Wenzhou 325003 China 4 James D Watson Institute of Genome Sciences Zhijiang Campus Zhejiang University and Hangzhou Genomics Institute Hangzhou 310008 China 5 College of Materials Science and Chemical Engineering Yuquan Campus Zhejiang University Hangzhou 310027 China These authors contributed equally to this work Corresponding authors E-mail junyugenomicsorgcn yanghmgenomicsorgcn hellipFGENESV a program for gene prediction provided by Softberry Inc (Mount Kisco USA) through a web-based interface has been specially modimacred and trained with parameters for virus (httpwwwsoftberrycomberryphtmltopic= gfindv) hellip

The hypothetical minus sense ORF iden-timacred by FGENESV (from 48 to 203 nt on the minus strand or 29523 to 29678 nt on the plus strand) may be fake but we should not absolutely deny the prob-ability of the existence of minus ORFs hellipFurthermore we employed FGENESV to explore the sequences of MHV (NC 001846 in NCBI) and AIBV (NC 001451 in NCBI) and compared the re-sults with their previous annotations respectively Rapport de stage de DEA Juin 2003 Analyse du geacutenome du virus de lrsquoarcheacutee Pyrococcus abyssi (PAV1) ROUAULT Karen Laboratoire de Microbiologie et Biotechnologie des Extrecircmophiles IFREMER- Centre de Brest et Equipe Microbiologie LEMAR ndash Institut Universitaire Europeacuteen de la Mer [14] FGENESV httpwwwsoftberrycomberry phtmltopic=gfindv Virus ( gt10 kb) Modegraveles de Markov Forme du geacutenome Code geacuteneacutetique [40]

FGENES-M BMC Bioinformatics 2005 6 25 Published online 2005 February 10 doi 1011861471-2105-6-25 Integrating alternative splicing detection into gene prediction Sylvain Foissac 1 and Thomas Schiex11Uniteacute de Biomeacutetrie et Intelligence Artificielle INRA 31326 Castanet Tolosan France

Corresponding author Sylvain Foissac foissactoulouseinrafr Thomas Schiex tschiextoulouseinrafrReceived July 27 2004 Accepted February 10 2005 This approach has been applied eg in HMMgene or in FGENES-M (unpub)hellip DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39 - 43 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom) DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39-43 DOI 1010801042517032000160189 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom)

BESTORF Eukaryotic Cell March 2005 p 526-535 Vol 4 No 3 1535-977805$0800+0 doi101128EC43526-5352005 Sex-Specific Homeodomain Proteins Sxi1 and Sxi2a Coordinately Regulate Sexual Development in Cryptococcus neoformans Christina M Hull1 Marie-Josee Boily and Joseph Heitman1 12 Department of Molecular Genetics and Microbiology1 the Howard Hughes Medical Institute Duke University Medical Center Durham North Carolina2 Received 14 January 2005 Accepted 14 January 2005 Corresponding author Mailing address 322 CARL Building Box 3546 Department of Molecular Genetics and Microbiology 100 Research Dr Duke University Medical Center Durham NC 27710 Phone (919) 684-2824 Fax (919) 684-5458 E-mail heitm001dukeeduPresent address Departments of Biomolecular Chemistry and Medical Microbiology amp

Immunology University of WisconsinmdashMadison Madison WI 53706 Sequence manipulations Splice predictions of candidate gene sequences for SXI2a were facilitated with a Softberry algorithm (wwwsoftberrycom) hellipWe utilized the BESTORF gene prediction algorithm from Softberry Inc to electronically produce predicted spliced cDNA products encoded by a 10-kb regionhellip

PROTCOMP Genes and Immunity 2005 v5 n4 - naturecom Immune response in silico(IRIS) immune-specific genes identified from a compendium of microarray hellip AR Abbas D Baldwin Y Ma W Ouyang A Gurney F hellip The Protcomp algorithm (Softberry Inc) predicts for the 1589 IRIS genes with ORFs that 24 of the encoded proteins are in the plasma membrane 13 are MPMI Vol 17 No 7 2004 pp 789ndash797 Publication no M-2004-0426-01R copy 2004 The American Phytopathological Society Lotus japonicus LjKUP Is Induced Late During Nodule Development and Encodes a Potassium Transporter of the Plasma Membrane Guilhem Desbrosses Claudia Kopka Thomas Ott and Michael K Udvardi Max Planck Institute of Molecular Plant Physiology Am Muumlhlenberg 1 14476 Golm Germany Submitted 3 November 2003 Accepted 13 February 2004 hellipBoth PSORT and Protcomp predicted a PM location for LjKUPhellip Planta DOI 101007s00425-003-1182-5 Issue Volume 218 Number 6 Date April 2004 Pages 965 - 975 Biochemical and immunological characterization of pea nuclear intermediate filament proteins Sonal S D Blumenthal1 Gregory B Clark1 and Stanley J Roux1

(1) School of Biological Sciences Section of Molecular Cell and Developmental Biology The University of Texas Austin TX 78712 USA Stanley J Roux Email srouxutsccutexasedu html) BCM Search Launcher (Protein structure prediction http searchlauncher bcmtmcedu) SoftBerry (Protein subcellular localization Comparative and Functional Genomics Volume 5 Issue 4 Pages 342 - 353Published Online 20 May 2004 Research Paper Investigation into the use of C- and N-terminal GFP fusion proteins for subcellular localization studies using reverse transfection microarrays Ella Palmer Tom Freeman

MRC Rosalind Franklin Centre for Genomics Research (formerly the HGMP-Resource Centre) Genome Campus Hinxton Cambridge CB10 1SB UK email Tom Freeman (tfreemanrfcgrmrcacuk) Correspondence to Tom Freeman RFCGR Hinxton CambridgeCB10 1SB UK

ProtComp version 4 (Softberry) combines results with proteins of known subcellular localization and assumed subcellular localization (based on theoret- ical Plant Physiol2004 134 286-295 RHM2 Is Involved in Mucilage Pectin Synthesis and Is Required for the Development Usadel et al Tentative subcellular localization prediction by TargetP (Emanuelsson et al 2000 ) or ProtComp (httpwwwsoftberrycom) a prediction software trained on Journal of Cellular BiochemistryVolume 90 Issue 2 Pages 361 - 378Published Online 3 Sep 2003 A proteomic study of the arabidopsis nuclear matrix Tomasz T Calikowski 1 3 Tea Meulia 2 Iris Meier 1

1Department of Plant Biology and Plant Biotechnology Center Ohio State University Columbus Ohio 43210 2Molecular and Cellular Imaging Center Ohio Agricultural and Research Development Center Ohio State University Columbus Ohio 43210 3Institute of Biochemistry and Biophysics Polish Academy of Sciences UL Pawinskiego 5A 02-106 Warszawa Poland email Iris Meier (meier56osuedu)Correspondence to Iris Meier Department of Plant Biology and Plant Biotechnology Center Ohio State University 244 Rightmire Hall 1060 Carmack Rd Columbus OH 43210 For prediction of subcellular localization ProtComp 4 (Softberry Inc Mount Kisco NY httpwwwsoftberrycomberryphtmltopicfrac14 proteinloc) PSORT v64 Cellular Molecular Life Sciences 2003 in press Automatic prediction of protein function Burkhard Rost 1 2 3 Jinfeng Liu 1 3 4 Rajesh Nair 1 5 Kazimierz O Wrzeszczynski 1 and Yanay Ofran 16

1 CUBIC Dept of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 2 Columbia University Center for Computational Biology and Bioinformatics (C2B2) Russ Berrie Pavilion 1150 St Nicholas Avenue New York NY 10032 USA

3 North East Structural Genomics Consortium (NESG) Department of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 4 Dept of Pharmacology Columbia Univ 630 West 168th Street New York NY 10032 USA 5 Dept of Physics Columbia Univ 538 West 120th Street New York NY 10027 USA 6 Dept of Medical Informatics Columbia Univ 630 West 168th Street New York NY 10032 USA Corresponding author cubiccubicbioccolumbiaedu URL httpcubicbioccolumbiaedu Tel +1-212-305-4018 fax +1-212-305-7932

genomelocalize ProtComp predict localization for plants httpwwwsoftberrycomberryphtmltopic=proteinloc Predotar predict Published online before print September 15 2003 101101gr1293003 Genome Research 132265-2270 2003 The Secreted Protein Discovery Initiative (SPDI) a Large-Scale Effort to Identify Novel Human Secreted and Transmembrane Proteins A Bioinformatics Assessment Hilary F Clark1 Austin L Gurney Evangeline Abaya Kevin Baker Daryl Baldwin Jennifer Brush Jian Chen Bernard Chow Clarissa Chui Craig Crowley Bridget Currell Bethanne Deuel Patrick Dowd Dan Eaton Jessica Foster Christopher Grimaldi Qimin Gu Philip E Hass Sherry Heldens Arthur Huang Hok Seon Kim Laura Klimowski Yisheng Jin Stephanie Johnson James Lee Lhney Lewis Dongzhou Liao Melanie Mark Edward Robbie Celina Sanchez Jill Schoenfeld Somasekar Seshagiri Laura Simmons Jennifer Singh Victoria Smith Jeremy Stinson Alicia Vagts Richard Vandlen Colin Watanabe David Wieand Kathryn Woods Ming-Hong Xie Daniel Yansura Sothy Yi Guoying Yu Jean Yuan Min Zhang Zemin Zhang Audrey Goddard William I Wood and Paul Godowski Departments of Bioinformatics Molecular Biology and Protein Chemistry Genentech Inc South San Francisco California 94080 USA 1 Corresponding author E-MAIL hclarkgenecomFAX (650) 225-5389 An automated computational strategy was utilized to query each protein translation with the Signal Sensor Sighmm Tmdetect (T Wu unpubl) hmmpfam (Eddy 1998 ) and Protcomp (Softberry Inc) algorithms hellipThe Protcomp algorithm predicts the subcellular localization of a protein on the basis of homology to well-annotated proteins a neural net and various protein motifs In this case the Protcomp subcellular localization prediction was used to categorize these genes as Other Secreted Other Transmembrane or Other Cytoplasmic or Nuclear Plant Physiol February 2002 Vol 128 pp 336-340 Gene prediction in eukaryota Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 genomelocalize ProtComp predict localization for plants httpwww softberrycomberryphtmltopic=proteinloc Predotar predict Proc Natl Acad Sci U S A 2001 April 24 98(9) 5341ndash5346 doi 101073pnas101534498 Published online 2001 April 17 Plant Biology The Cia5 gene controls formation of the carbon concentrating mechanism in Chlamydomonas reinhardtii Youbin Xiang Jun Zhang and Donald P Weeks

Department of Biochemistry and School of Biological Sciences University of Nebraska Lincoln NE 68588-0664

Edited by Bob B Buchanan University of California Berkeley CA and approved March 14 2001 (received for review 2000 November 8) To whom reprint requests should be addressed E-mail dweeks1unledu hellipComputer-assisted analysis of the CIA5 aa sequence (PROTCOMP version 4 httpwwwsoftberrycom) predicted a nuclear localization of the protein hellipFinally computer program predictions (eg PROTCOMP version 4 httpwwwsoftberrycom) for a nuclear localization of CIA5 and the clear-cut nuclear localization of CIA5 in onion epidermal cells (Fig 3) provide additional weight to the argument that CIA5 may be a transcription factor Dissertation zur Erlangung des akademischen Grades Dr rer nat der Fakultaumlt der Naturwissenschaften der Universitaumlt Ulm Untersuchungen zur Identifizierung von Faktoren und Mechanismen der mRNA 3 Prozessierung und Degradation in Chloroplasten houmlherer Pflanzen vorgelegt von Michael Walter aus Immenstadt i Allgaumlu Abteilung Molekulare Botanik Universitaumlt Ulm Ulm November 2001 Tag der Promotion 19 Feb 2002 Algorithmen zur Vorhersage der subzellulaumlren Lokalisation - PSORT httppsortnibbacjp8800formhtml (Nakai und Kanehisa 1992) - ChloroP httpwwwcbsdtudkservicesChloroP (Emanuelsson et al 2000) - TargetP httpwwwcbsdtudkservicesTargetP (Emanuelsson et al 2000) - Predotar httpwwwinrafrInternetProduitsPredotar - Softberry httpwwwsoftberrycom

BPROM Extremophiles Issue Volume 9 Number 2 Date April 2005 Pages 99 ndash 109 DOI 101007s00792-004-0425-0

The genome of BCJA1c a bacteriophage active against the alkaliphilic bacterium Bacillus clarkii

Andrew M Kropinski1 Melissa Hayward1 M Dorothy Agnew1 and Ken F Jarrell1

(1) Department of Microbiology and Immunology Queens University Kingston ON K7L 3N6 Canada

al 2002) Promoters were predicted using Softberryrsquos BPROM program at httpwwwsoftberry comberry phtmltopic=promoter

Journal of Bacteriology February 2005 p 1091-1104 Vol 187 No 3 0021-919305$0800+0 doi101128JB18731091-11042005

The Generalized Transducing Salmonella Bacteriophage ES18 Complete Genome Sequence and DNA Packaging Strategy

Sherwood R Casjens12 Eddie B Gilcrease1 Danella A Winn-Stapley1 Petra Schicklmaier3 Horst Schmieger3 Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24 Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah1 Department of Biological Sciences4 Pittsburgh Bacteriophage Institute University of Pittsburgh Pittsburgh Pennsylvania 2 Institut fuumlr Genetik und Mikrobiologie Universitaumlt Muumlnchen Munich Germany3 Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu Present address Biology Department MIT Cambridge MA 02139 Present address Biogen Idec GmbH D-85737 Ismaning Germany

Received 1 September 2004 Accepted 3 November 2004

The DNA sequence analysis software used was DNA Strider (24) GeneMark (5) Staden programs (78) BLAST (2) BPROM (httpwwwsoftberrycomberryphtmltopic

Infection and Immunity May 2005 p 2899-2909 Vol 73 No 5 Characterization of the Major Secreted Zinc Metalloprotease- Dependent GlycerophospholipidCholesterol Acyltransferase PlaC of Legionella pneumophila Sangeeta Banerji1 Mayte Bewersdorff1 Bjoumlrn Hermes1 Nicholas P Cianciotto2 and Antje Flieger1 Robert Koch-Institut Berlin Germany1 Department of Microbiology-Immunology Northwestern University Medical School Chicago Illinois2 Received 25 October 2004 Returned for modification 18 November 2004 Accepted 22 December 2004

Corresponding author Mailing address Robert Koch-Institut Research Group NG5 Pathogenesis of Legionella Infections Nordufer 20 D-13353 Berlin Germany Phone 49-30-4547-2522 Fax 49-30-4547-2328 E-mail fliegerarkide MB and BH contributed equally to this work

legion) (12) Nucleotide sequences were also analyzed for promoters using the web-based program BPROM (wwwsoftberrycom) Sequence Journal of Bacteriology April 2005 p 2458-2468 Vol 187 No 7 The Type III-Dependent Hrp Pilus Is Required for Productive Interaction of Xanthomonas campestris pv vesicatoria with Pepper Host Plants Ernst Weber1 Tuula Ojanen-Reuhs2 Elisabeth Huguet3 Gerd Hause4 Martin Romantschuk2 Timo K Korhonen2 Ulla Bonas13 and Ralf Koebnik1 Institute of Genetics1 Biozentrum Martin Luther University Halle Germany4 General Microbiology Faculty of Biosciences University of Helsinki Helsinki Finland2 Institut des Sciences Veacutegeacutetales CNRS Gif-sur-Yvette France3 Received 10 November 2004 Accepted 28 December 2004 Corresponding author Mailing address Martin-Luther-Universitaumlt Institut fuumlr Genetik Weinbergweg 10 D-06120 Halle (Saale) Germany Phone 49 345 5526293 Fax 49 345 5527151 E-mail koebnikgmxde Present address Purdue University Department of Food Sciences West Lafayette IN 47907 Present address Institut de Recherche sur la Biologie de lInsecte UMR CNRS 6035 Faculteacute

des Sciences F-37200 Tours France Present address University of Helsinki Department of Ecological and Environmental Sciences

FIN-15140 Lahti Finland

The promoter recognition program BPROM (Softberry Inc Mt Kisco NY) was used for prediction of bacterial sigma70 promoter motifs RESULTS

J Bacteriol 2004 September 186(17) 5945ndash5949 doi 101128JB186175945-59492004 Identification of Operators and Promoters That Control SXT Conjugative Transfer John W Beaber and Matthew K Waldor

Department of Microbiology Tufts University School of Medicine and Howard Hughes Medical Institute Boston Massachusetts Corresponding author Mailing address Tufts University School of Medicine 136 Harrison Ave Jaharis 425 Boston MA 02111 Phone (617) 636-2730 Fax (617) 636-2723 E-mail matthewwaldortuftsedu Received April 1 2004 Accepted May 24 2004 hellipComputer algorithms and 5prime random amplification of cDNA ends (RACE) were used to define the setR and s086 transcription start sites Software for the identification of bacterial promoters (httpwwwsoftberrycomberryphtmltopic=bpromampgroup=programsampsubgroup=gfindb) identified putative minus10 and minus35 elements for both PL and PR (Fig 2) (23 24)hellip JOURNAL OF BACTERIOLOGY Mar 2004 p 1818ndash1832 Vol 186 No 6

The pKO2 Linear Plasmid Prophage of Klebsiella oxytoca Sherwood R Casjens12 Eddie B Gilcrease1 Wai Mun Huang1 Kim L Bunny3

Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24

Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah 841321 Pittsburgh Bacteriophage Institute2 and Department of Biological Sciences4 University of Pittsburgh Pittsburgh Pennsylvania 15260 and Section of Microbiology University of California at Davis Davis California 956163

Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu hellipThe DNA sequence analysis software packages used were DNA Strider (27) GeneMark (8) the Staden programs (94) BLAST (3) BPROM httpwwwsoftberrycomberryphtmltopic_gfindb) and DNA Master (J Lawrence [httpcobamide2biopittedu])hellip

BMC Microbiology 2004 44 Analysis of the lambdoid prophage element e14 in the E coli K-12 genome Preeti Mehta1 Sherwood Casjens2 and Sankaran Krishnaswamy1 Address 1Bioinformatics Centre School of Biotechnology Madurai Kamaraj University Madurai-625021 India and 2University of Utah Medical School Department of Pathology 90 North 1900 East Salt Lake City UT 84132-2501 USA Email Preeti Mehta - mehta_p74yahoocom Sherwood Casjens - sherwoodcasjenspathutahedu Sankaran Krishnaswamy - krishnamrnatnnicin Corresponding author This article is available from httpwwwbiomedcentralcom1471-218044 Putative promoters predicted using BPROM available at the website http wwwsoftberrycom Scores are as given by BPROM Promoters Plant Molecular Biology 53 (6) 865-876 December 2003 Prokaryotic orthologues of mitochondrial alternative oxidase and plastid terminal oxidase Allison E McDonald Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Sasan Amirsadeghi Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Greg C Vanlerberghe Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada (e-mail gregvutscutorontoca The A variabilis PTOX sequence was analyzed in the upstream region of the start codon with Softberryrsquos BPROM software (httpwwwsoftberrycom)

SPLICEDB Plant Molecular Biology DOI 101007s11103-005-0271-1 Issue Volume 57 Number 3 Date February 2005 Pages 445 - 460 Evaluation of five ab initio gene prediction programs for the discovery of maize genes Hong Yao1 4 Ling Guo1 6 Yan Fu1 4 Lisa A Borsuk1 6 Tsui-Jung Wen2 David S Skibbe1 5 Xiangqin Cui1 4 9 Brian E Scheffler8 Jun Cao1 4 Scott J Emrich6 Daniel A Ashlock3 6 and Patrick S Schnable1 2 4 5 6 7

(1)Department of Genetics Development and Cell Biology Iowa State University Ames Iowa 50011-3650 (2) Department of Agronomy Iowa State University Ames Iowa 50011-3650 (3) Department of Mathematics Iowa State University Ames Iowa 50011-3650 (4) Inderdepartmental Graduate Programs in Genetics Iowa State University Ames Iowa 50011-3650 (5) Department of Molecular Cellular and Developmental Biology Iowa State University Ames Iowa 50011-3650 (6) Department of Electrical and Computer Engineering and Department of Bioinformatics and Computational Biology Iowa State University Ames Iowa 50011-3650 (7) Center for Plant Genomics Iowa State University Ames Iowa 50011-3650 (8) Mid South Area Genomics Facility USDA-ARS Stoneville MS 38776-0038 USA(9) Present address Department of Biostatistics Birmingham AL 35294 USA model FGENESH httpwwwsoftberrycom berryphtmltopic=fgeneshampgroup= programsampsubgroup=gfind Monocots Yes Yes Yes GHMM a GeneMark Finding short DNA motifs using permuted markov models X Zhao H Huang TP Speed The data are human donor sequences from SpliceDB [9] a recently developed database of known mammalian splice site sequences (httpwwwsoftberrycomspldb Current Opinion in Structural Biology 2004 14273ndash282 The evolving roles of alternative splicing Liana F Lareau1 Richard E Green1 Rajiv S Bhatnagar23 and Steven E Brenner12_ Departments of 1Molecular and Cell Biology and 2Plant and Microbial Biology University of California Berkeley California 94720 USA 3Department of Dermatology University of California San Francisco California 94143 USA _e-mail brennercompbioberkeleyedu [79] SpliceDB httpwwwsoftberrycomberryphtmltopicfrac14splicedb Database and composition statistics for mammalian splice sites inferred from ESTs [80] Yearbook of Medical Informatics Review Paper 2004 121-136 Curated databases and their role in clinical bioinformatics CC Englbrecht M Han MT Mader A Osanger KFX Mayer MIPS Institute for Bioinformatics Address of the authors Claudia C Englbrecht Michael Han

Michael T Mader Andreas Osanger Klaus F X Mayer MIPS Institute for Bioinformatics GSF - National Research Center for Environment and Health 85758 Neuherberg Germany E-mail kmayergsfdeCorresponding author hellipSpliceDB httpwwwsoftberrycomspldbSpliceDBhtmlCanonical and non-canonical mammalian splice sites [122] 122Burset M Seledtsov IA Solovyev VV SpliceDB database of canonical and non-canonical mammalian splice sites Nucleic Acids Res 200129255-9 Nucleic Acids Research 2001 Vol 29 No 1 255-259 SpliceDB database of canonical and non-canonical mammalian splice sites M Burset I A Seledtsov1 and V V Solovyev The Sanger Centre Hinxton Cambridge CB10 1SA UK and 1Softberry Inc 108 Corporate Park Drive Suite 120 White Plains NY 10604 USA To whom correspondence should be addressed at present address EOS Biotechnology 225A Gateway Boulevard South San Francisco CA 94080 USA Tel +1 650 246 2331 Fax +1 650 583 3881 Email solovyeveosbiotechcom Present address M Burset Institut Municipal drsquoInvestigacioacute Megravedica (IMIM) CDr Aiguader 80 08003 Barcelona Spain

SCAN2 African Journal of Biotechnology Vol 2 (12) pp 714-718 December 2003 Available online at httpwwwacademicjournalsorgAJB ISSN 1684ndash5315 copy 2003 Academic Journal Accepted 14 November 2003 Minireview Web-based bioinformatic resources for protein and nucleic acids sequence alignment Kamel A Abd-Elsalam Molecular Markers Lab Plant Pathology Research Institute Agricultural Research Center Orman 12619 Giza Egypt E-mail kaabdelsalammsncom 16-SCAN2 program for aligning two multimegabyte-size sequences httpwwwsoftberrycomberryphtmltopic=scanhampprg= SCAN2 derived Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK The full-length sequences of gene pairs have been aligned by the SCAN2 program (httpsoftberrycomberryphtmltopic=scanhampprg=SCAN2) which can align

PDISORDER BMC Bioinformatics 2005 622 doi1011861471-2105-6-22 Research article Open Access Proteins with two SUMO-like domains in chromatin-associated complexes The RENi (Rad60-Esc2-NIP45) family Maria Novatchkova1 Andreas Bachmair3 Birgit Eisenhaber2 and Frank Eisenhaber2 Address 1Gregor Mendel-Institut GMI Austrian Academy of Sciences Vienna Biocenter A-1030 Vienna Austria 2Research Institute of Molecular Pathology Dr Bohr-Gasse 7 A-1030 Vienna Austria and 3Max Planck Institute for Plant Breeding Research Carl-von-Linneacute-Weg 10 D-50829 Cologne Germany Email Maria Novatchkova - marianovatchkovagmioeawacat Andreas Bachmair - bachmairmpiz-koelnmpgde Birgit Eisenhaber - b_eisenimpunivieacat Frank Eisenhaber - FrankEisenhaberimpunivieacat Corresponding author hellipInitial analysis of its sequence complexity shows that the disordered N-terminal half of the protein is followed by a likely globular segment (predicted using Pdisorder by Softberry Inc)hellip

SPL The National Academy of Sciences Proc Natl Acad Sci U S A 2003 November 25 100(Suppl 2) 14537ndash14542 doi 101073pnas2335847100 Molecular evolution of the insect chemoreceptor gene superfamily in Drosophila melanogaster Hugh M Robertsondagger Coral G WarrDaggersect and John R Carlsonsect Department of Entomology University of Illinois 505 South Goodwin Avenue Urbana IL 61801 DaggerSchool of Biological Sciences Monash University Clayton VIC 3800 Australia and sectDepartment of Molecular Cellular and Developmental Biology Yale University New Haven CT 06520 The genes were reconstructed manually in the PAUP editor (23) by using the expected exonintron structures as guides and the SPL program (Softberry wwwsoftberrycomberryphtml) to locate predicted introns

NSITE GENETIC ANALYSES OF BOVINE CARD15 A PUTATIVE DISEASE RESISTANCE GENE A Dissertation by KRISTEN HAWKINS TAYLOR Submitted to Texas AampM University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 2004 Major Subject Genetics hellipShort motifs identified as being conserved between the three species in these intronic regions as well as in the 5rsquoUTR and 3rsquoUTR were then analyzed using the TFSCAN (httpzeonwelloxacukgit-bintfscan) and NSITE (available through SoftBerry httpwwwsoftberrycomberryphtmltopic=promoter) programs to identify putative regulatory motifs Motifs selected for analysis required homology consisting of 6 or more bases with no more than 2 substitutions among the 3 specieshellip hellipSequence that included the SNPs located within intronic regions and in the 3rsquo and 5rsquoUTRs were analyzed using NSITE (available through SoftBerry at httpwwwsoftberrycomberryphtmltopic=promoter) to identify putative regulatory motifshellip

TSSP Bioinformatics 2005 21(14)3074-3081 doi101093bioinformaticsbti490 Cis-regulatory element based targeted gene finding genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana Weixiong Zhang 12 Jianhua Ruan 1 Tuan-hua David Ho 3 Youngsook You 3 Taotao Yu 1 and Ralph S Quatrano 3 1Department of Computer Science and Engineering Washington University in Saint Louis Saint Louis MO 63130 USA 2Department of Genetics Washington University in Saint Louis Saint Louis MO 63130 USA 3Department of Biology Washington University in Saint Louis Saint Louis MO 63130 USA To whom correspondence should be addressed

sites (TSSs) To predict TSSs we combined an Athaliana cDNA database and a software TSSP (SoftBerry httpwwwsoftberrycom) As

PLANTPROM BMC Genomics 2005 6 25 Genome wide analysis of Arabidopsis core promoters Carlos Molina12 and Erich Grotewold 11Department of Plant Cellular and Molecular Biology and Plant Biotechnology Center The Ohio State University Columbus OH 43210 2Departamento de Informaacutetica Universidad Teacutecnica Federico Santa Mariacutea Valparaiacuteso Chile search for TATA elements is carried out on the 12749 [-500 -1] regions 6316 sequences (using the MEME NFM) or 8776 (using the expanded PlantProm NFM) are Nucleic Acids Research 2005 Vol 33 No 3 1069ndash1076 doi101093nargki247 Plant promoter prediction with confidence estimation I A Shahmuradov1 V V Solovyev12 and A J Gammerman1 1Royal Holloway University of London Egham Surrey TW20 0EX UK and 2Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA MATERIALS AND METHODS Training and testing sequences For training and testing procedures we used 301 promoters with annotated TSS from PlantProm DB (22) Nucleic Acids Research 2004 Vol 32 Database issue D368plusmnD372 DOI 101093nargkh017 AthaMap an online resource for in silico transcription factor binding sites in the Arabidopsis thaliana genome Nils Ole Steffens Claudia Galuschka Martin Schindler Lorenz BuEgravelow and Reinhard Hehl Institut fuEgrave r Genetik Technische UniversitaEgrave t Braunschweig Spielmannstraucirce 7 D-38106 Braunschweig Germany hellipShahmuradovIA GammermanAJ HancockJM BramleyPM and SolovyevVV (2003) PlantProm a database of plant promoter sequences Plant Physiology October 2004 Vol 136 pp 3023-3033 GENOME ANALYSIS Utility of Different Gene Enrichment Approaches Toward Identifying and Sequencing the Maize Gene Space1[w] Nathan Michael Springer Xiequn Xu and W Brad Barbazuk Center for Plant and Microbial Genomics Department of Plant Biology University of Minnesota St Paul Minnesota 55108 (NMS) and Donald Danforth Plant Sciences Center St Louis Missouri 63132 (XX WBB) Text] Shahmuradov IA Gammerman AJ Hancock JM Bramley PM Solovyev VV (2003) PlantProm a database of plant promoter sequences

Plant Physiology April 2004 Vol 134 pp 1ndash12 wwwplantphysiolorg Characterization of Three Functional High-Affinity Ammonium Transporters in Lotus japonicus with Differential Transcriptional Regulation and Spatial Expression1

Enrica DrsquoApuzzo2 Alessandra Rogato2 Ulrike Simon-Rosin Hicham El Alaoui3 Ani Barbulova Marco Betti Maria Dimou Panagiotis Katinakis Antonio Marquez Anne-Marie Marini Michael K Udvardi and Maurizio Chiurazzi Institute of Genetics and Biophysics Via Marconi 12 80125 Napoli Italy (ED AR HEA AB MC) Molecular Plant Nutrition Group Max Planck Institute of Molecular Plant Physiology Am Muhlenberg 1 14476 Golm Germany (US-R MKU) Universiteacute Libre de Bruxelles Institut de Biologie de Meacutedecine Moleacuteculaires 6041 Gosselies Belgium (A-MM) Agricultural University of Athens Department of Agricultural Biotechnology 11855 Athens Greece (MD PK) and Departamento de Bioquıacutemica Vegetal y Biologıacutea Molecular Facultad de Quimica 41080 Seville Spain (MB AM) hellipPlantProm a database of plant promoter sequences Nucleic Acids Res 31 114ndash117 Nucleic Acids Research 1 January 2003 vol 31 no 1 pp 114-117(4) PlantProm a database of plant promoter sequences Authors Bramley PM1 Solovyev VV2 Shahmuradov IA Gammerman AJ Hancock JM Affiliations Department of Computer Science Royal Holloway University of London Egham Surrey TW20 0EX UK 1 School of Biological Sciences Royal Holloway University of London UK 2 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 3 To whom correspondence should be addressed Email victorsoftberrycom Present address John M Hancock MRC Mammalian Genetics Unit Harwell Oxfordshire UK

PROMH Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK Received February 15 2003 Revised and Accepted March 21 2003

Other FGENESH Molecular Microbiology Volume 52 Issue 6 Page 1579 - June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Affiliations Departments of Medicine and Pathobiology University of Washington Harborview Medical Center Box 359779 325 Ninth Ave Seattle WA 98104 USA E-mail acenturuwashingtonedu Tel (+1) 206 341 5364 Fax (+1) 206 341 5363 Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Microbiology 150 (2004) 518-520 DOI 101099mic026871-0 IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated with recognizable overlapping genes Mark W Silby1 Paul B Rainey23 and Stuart B Levy14 1 Center for Adaptation Genetics and Drug Resistance Department of Molecular Biology and Microbiology Tufts University School of Medicine Boston MA 02111 USA 2 Department of Plant Sciences University of Oxford South Parks Road Oxford OX1 3RB UK 3 School of Biological Sciences University of Auckland Private Bag 92019 Auckland New Zealand 4 Department of Medicine Tufts University School of Medicine Boston MA 02111 USA Correspondence Stuart B Levy (stuartlevytuftsedu) hellipUsing SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of the iiv5 ORF respectivelyhellip Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 DOI 101128JVI782212576-125902004

Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song1 Qi Wei Qin2 Jin Qiu1 Can Hua Huang1 Fan Wang1 and Choy Leong Hew1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2 Corresponding author Mailing address Department of Biological Sciences National University of Singapore 10 Kent Ridge Crescent Singapore 119260 Singapore Phone 65-68742692 Fax 65-67795671 E-mail dbshewclnusedusg or dbsheadnusedusg Received 19 March 2004 Accepted 29 June 2004

hellipThe whole genome was also submitted to httpwwwsoftberrycom (Softberry Inc Mount Kisco NY) for identification of all potential ORFshellip hellipCoding capacity of the viral genomic DNA sequence Prediction of presumptive genes was carried out by using the viral gene prediction program under the website httpwwwsoftberrycom supplemented with Vector NTI suite 71hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1695-8 Issue Volume 109 Number 4 Date August 2004 Pages 681 - 689

Linkage disequilibrium and sequence diversity in a 500-kbp region around the adh1 locus in elite maize germplasm Mark Jung1 Ada Ching1 Dinakar Bhattramakki2 Maureen Dolan1 Scott Tingey1 Michele Morgante1 3 and Antoni Rafalski1

(1) DuPont Crop Genetics Experimental Station PO Box 80353 Wilmington DE 19880-0353 USA (2) Pioneer Hi-Bred International Inc 7300 NW 62nd Avenue PO Box 1004 Johnston IA 50131-1004 USA (3) Dipartimento di Produzione Vegetale e Tecnologie Agrarie Universita di Udine Via delle Scienze 208 33100 Udine ItalyReceived 26 January 2004 Accepted 2 April 2004 Published online 6 August 2004 Mark Jung Email marktjungcgrdupontcom 1) Gene locations were defined by several methods Annotations provided in Tikhonov et al (1999) were first used then FGENESH gene-finding software DNA Sequence - The Journal of Sequencing and Mapping Issue Volume 15 Number 4 August 2004 Pages 269 ndash 276 DOI 10108010425170412331279648 Isolation Characterization and Expression Analysis of a Leaf-specific Phosphoenolpyruvate Carboxylase Gene in Oryza sativa Chang-Fa Lin A1 Chun Wei A1 Li-Zhi Jiang A1 A2 Ke-Gui Li A1 Xiao-Yin Qian A1 Kotb Attia A1 Jin-Shui Yang A1 A1 State Key Laboratory of Genetic Engineering Institute of Genetics School of Life Sciences Fudan University Shanghai 200433 PRChina A2 National Key Laboratory for Soil Erosion and Dry Land Farming on Loess Plateau Northwest Sci-Tech University of Agriculture and Forest Shannxi 712100 PRChina tools of GeneMark (httpopal biologygatechedugeneMark) and Softberry (httpwwwsoftberrycom) For the isolation of putative Plant Molecular Biology DOI 101023BPLAN00000382568980957 Issue Volume 54 Number 4 Date March 2004 Pages 519 ndash 532

Genome-Wide Analysis of the GRAS Gene Family in Rice and Arabidopsis Chaoguang Tian1 Ping Wan1 Shouhong Sun1 Jiayang Li1 and Mingsheng Chen1

(1) Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road Chaoyang District Beijing 100101 ChinaMingsheng Chen Email mschengeneticsaccn

database FgeneSH (Salamov and 90 Solovyev 2000) was used for gene prediction pre- 207 dicted by FgeneSH (minor discrepancies exist due 208 Mycological Research (2004) 108 853-857 Cambridge University Press doi 101017S095375620400067X Published Online 13 авг 2004 Combining transcriptome data with genomic and cDNA sequence alignments to make confident functional assignments for Aspergillus nidulans genes Andrew H SIMS a1 Manda E GENT a1 Geoffrey D ROBSON a1 Nigel S DUNN-COLEMAN a2 and Stephen G OLIVER a1c1 a1 School of Biological Sciences University of Manchester The Michael Smith Building Oxford Road Manchester M13 9PT UK E-mail steveolivermanacuk a2 Genencor International Inc 925 Page Mill Road Palo Alto CA 94304 USA c1 Corresponding author Kingdom Page 2 Genewise FgeneSH FgeneSH+) consisting of 9541 putative open reading frames (ORFs) was released in June 2003 We TAG Theoretical and Applied Genetics DOI 101007s00122-004-1603-2 Issue Volume 109 Number 1 Date June 2004 Pages 129 ndash 139

Gene content and density in banana (Musa acuminata) as revealed by genomic sequencing of BAC clones R Aert1 2 L Saacutegi2 and G Volckaert1

(1) Laboratory of Gene Technology Katholieke Universiteit Leuven Kasteelpark Arenberg 21 3001 Leuven Belgium

Present address Laboratory of Tropical Crop Improvement Katholieke Universiteit Leuven (2) Kasteelpark Arenberg 13 3001 Leuven Belgium

R Aert Email RitaAertagrkuleuvenacbe

gscrikengojp) fgenesh version 11 (Salamov and Solovyev 2000 httpwwwsoftberry com) genemarkhmm version 22a (Lukashin and Borodovsky 1998 http Genome Research 142503-2509 2004 Resources EAnnot A genome annotation tool using experimental evidence Li Ding1 Aniko Sabo Nicolas Berkowicz Rekha R Meyer Yoram Shotland Mark R Johnson Kymberlie H Pepin Richard K Wilson and John Spieth Genome Sequencing Center Washington University School of Medicine St Louis Missouri 63110 USA Article and publication are at httpwwwgenomeorgcgidoi101101gr3152604 1 Corresponding author E-mail ldingwatsonwustledu fax (314) 286-1810 Some ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov and Solovyev 2000 ) are based on intrinsic characteristics of coding hellipSome ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov

and Solovyev 2000 ) are based on intrinsic characteristics of coding sequence (eg codon usage consensus splice sites etc) and require training on known genes from the organismhellip

hellipTo further evaluate the performance of EAnnot we compared EAnnot predictions with Ensembl Genscan and Fgenesh predictions using manual annotation as a standard While Genscan and Fgenesh are ab initio programs Ensembl takes into account experimental data a feature shared with EAnnot Ensembl predicted 1037 known genes with 1798 transcripts and 1457 EST genes with 2308 transcripts for chromosome 6 (build 31) while Fgenesh and Genscan predicted 6230 and 6225 genes respectively We evaluated the performance of each program with respect to splice sites transcripts and genes across all of chromosome 6hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1758-x Issue Volume 109 Number 7 Date November 2004 Pages 1434 - 1447

Full-genome analysis of resistance gene homologues in rice

B Monosi1 R J Wisser2 L Pennill1 and S H Hulbert1

(1) Department of Plant Pathology Kansas State University Manhattan KS 66506-5502 USA (2) Department of Plant Pathology Cornell University Ithaca NY 14853 USAReceived 18 February 2004 Accepted 16 June 2004 Published online 10 August 2004 S H Hulbert Email shulbrtksuedu DNA sequences were analyzed using the gene prediction programs GENSCAN (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (Salamov and arXivq-bioGN0402046 v1 27 Feb 2004 Sublinear growth of Information in DNA sequences Giulia Menconi Dipartimento di Matematica Applicata and CISSC Centro Interdisciplinare per lo Studio dei Sistemi Complessi Universit`a di Pisa Via Bonanno Pisano 25b 56126 PISA - Italy menconimaildmunipiit October 23 2003 hellipAs a result four putative genes G1 G2 G3 and G4 have been located by means of Hidden Markov Model-based program FGENESH2 that has been created for predicting multiple genes and their structure in genomic DNA sequences The analysis via FGENESH has been exploited with respect to known genes in Arabidopsis thaliana Their predicted position is illustrated in Figure 13 hellip2This program is available at the website wwwsoftberrycom to which we refer con-cerning the reliability and e_ciency of the algorithmhellip Current Opinion in Plant Biology 2004 7732ndash736 DOI 101016jpbi200409003 Consistent over-estimation of gene number in complex plant genomes Jeffrey L Bennetzen14 Craig Coleman27 Renyi Liu15 Jianxin Ma16 and Wusirika Ramakrishna38 1 Department of Genetics University of Georgia Athens Georgia 30602 USA 2 Department of Plant and Animal Sciences Brigham Young University Provo Utah 84602 USA 3 Department of Biological Sciences Michigan Tech University Houghton Michigan 49931 USA

4e-mail maizeugaedu 5e-mail lryugaedu 6e-mail jmaugaedu 7e-mail biotechnologybyuedu 8e-mail wusirikamtuedu hellipWe have found that the standard gene-discovery programs FGENESH GeneMark and GENSCAN annotate segments of most retrotransposons and many invertedrepeat transposable elements as genes Using FGENESH to annotate maize BAC clones for instance 70ndash100 of the predicted genes are actually from transposable elementshellip The Plant Cell 162795-2808 (2004) Spotted leaf11 a Negative Regulator of Plant Cell Death and Defense Encodes a U-BoxArmadillo Repeat Protein Endowed with E3 Ubiquitin Ligase Activity Li-Rong Zenga Shaohong Qua Alicia Bordeosb Chengwei Yangc Marietta Baraoidanb Hongyan Yanc Qi Xiec Baek Hie Nahmd Hei Leungb and Guo-Liang Wanga1 a Department of Plant Pathology Ohio State University Columbus Ohio 43210 b International Rice Research Institute Metro Manila Philippines c State Key Lab for Biocontrol Sun Yat-sen (Zhongshan) University Guangzhou China 510275 d Department of Biological Science Myongji University Kyonggido Korea 449728 1 To whom correspondence should be addressed E-mail wang620osuedu fax 614-292-4455 in spl11 Exons predicted in G3 by the programs GENSCAN and Fgenesh using different matrixes are displayed in dark gray (D) RFLP Source Human Genomics Volume 1 Number 2 January 2004 pp 146-149(4) Publisher Henry Stewart Publications The truth about mouse human worms and yeast Authors David R Nelson1 Daniel W Nebert2 1 Department of Molecular Sciences and The UT Center of Excellence in Genomics and Bioinformatics University of Tennessee Memphis Tennessee 38163 USA 2 Department of Environmental Health and Center for Environmental Genetics (CEG) University of Cincinnati Medical Center Cincinnati Ohio 45267-0056 USA unpublished data 2003 see also Ref [7]) FGENESH 21 TWINSCAN 22 and the Ensembl annotation pipeline 23 The output of the four Genome Biology 2004 5R73 doi101186gb-2004-5-10-r73 A comprehensive transcript index of the human genome generated using microarrays and computational approaches Eric E Schadt 1 Stephen W Edwards 1 Debraj GuhaThakurta1 Dan Holder2 Lisa Ying2 Vladimir Svetnik2 Amy Leonardson1 Kyle W Hart3 Archie Russell1 Guoya Li1 Guy Cavet1 John Castle1 Paul McDonagh4 Zhengyan Kan1 Ronghua Chen1 Andrew Kasarskis1 Mihai Margarint1 Ramon M Caceres1 Jason M Johnson1

Christopher D Armour1 Philip W Garrett-Engele1 Nicholas F Tsinoremas5 and Daniel D Shoemaker1

1Rosetta Inpharmatics LLC 12040 115th Avenue NE Kirkland WA 98034 USA 2Merck Research Laboratories W42-213 Sumneytown Pike POB 4 Westpoint PA 19846 USA 3Rally Scientific 41 Fayette Street Suite 1 Watertown MA 02472 USA 4Amgen Inc 1201 Amgen Court W Seattle WA 98119 USA 5The Scripps Research Institute Jupiter FL 33458 USA hellipGrailEXP 40 [47] GENSCAN 10 [48] FGENESH [49] and FGENESH+ [49]ab initio gene-prediction algorithms were run independently across the entire genome assembly to augment alignment-based gene identification methods GrailEXP 40 GENSCAN 10 and FGENESH version 1c were run with default parameters for human sequence GrailEXP used expressed sequence evidence from RefSeq UniGene and DoubleTwist HGI to refine gene predictions FGENESH+ was run with protein sequences from BLASTX with E-score lower than 10-5 When multiple protein alignments overlapped all overlapping protein sequences were clustered with BLASTClust [50] and the lowest E-score hit was used by FGENESH+hellip hellipAdditionally 3 sim4 and 3 or 5 FGENESH+ predicted exons that were short andor distant from internal predicted exons were removedhellip Genome Research 14988-995 2004 ENSEMBL Special GeneWise and Genomewise Ewan Birney13 Michele Clamp2 and Richard Durbin2 1 The European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK hellipThere has been a long history of successful ab initio programs which do not use any additional

evidence to predict genes on genomic DNA of which Genscan (Burge and Karlin 1997 ) and Fgenesh (Solovyev and Salamov 1997 ) are two of the most successful caseshellip hellipAnother class of evidence-based gene prediction programs are ones which use external evidence to influence the scoring of potential exons including SGP-2 (Parra et al 2003 ) Genie

(Kulp et al 1996 ) Genomescan (Yeh et al 2001 ) HMMGene (Krogh 2000 ) and Fgenesh++ (Solovyev and Salamov 1997 )hellip Published online before print February 5 2004 101073pnas0308430100 PNAS | February 17 2004 | vol 101 | no 7 | 1910-1915 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin Nam Joonyul Kim para Shinyoung Lee Gynheung An Hong Ma and Masatoshi Nei Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State

University University Park PA 16802 and National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Contributed by Masatoshi Nei December 22 2003

hellipBecause annotation of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program FGENESH (wwwsoftberrycom) from the genome sequences obtained from TIGR and the Rice Genome Database (China) (25) Functional amp Integrative Genomics DOI 101007s10142-004-0109-y Issue Volume 4 Number 2 Date May 2004 Pages 102 - 117

Sequence analysis of the long arm of rice chromosome 11 for ricendashwheat synteny

Nagendra K Singh1 Saurabh Raghuvanshi2 Subodh K Srivastava1 Anupama Gaur2 Ajit K Pal1 Vivek Dalal1 Archana Singh1 Irfan A Ghazi1 Ashutosh Bhargav1 Mahavir Yadav1 Anupam Dixit1 Kamlesh Batra1 Kishor Gaikwad1 Tilak R Sharma1 Amitabh Mohanty2 Arvind K Bharti2 Anita Kapur2 Vikrant Gupta2 Dibyendu Kumar2 Shubha Vij2 Ravi Vydianathan2 Parul Khurana2 Sulabha Sharma2 W Richard McCombie3 Joachim Messing4 Rod Wing5 Takuji Sasaki6 Paramjit Khurana2 Trilochan Mohapatra1 Jitendra P Khurana2 and Akhilesh K Tyagi2

1 Indian Initiative for Rice Genome Sequencing National Research Centre on Plant Biotechnology

Indian Agricultural Research Institute New Delhi 110012 India 2 Indian Initiative for Rice Genome Sequencing Department of Plant Molecular Biology University of

Delhi South Campus New Delhi 110021 India 3 Genome Research Centre Cold Spring Harbor Laboratory 1 Bungtown Road New York USA 4 The Plant Genome Initiative at RutgersndashWaksman Institute Rutgers University 190 Frelinghuysen

Road Piscataway NJ 08873 USA 5 Department of Plant Sciences 303 Forbes Building Arizona Genomics Institute The University of

Arizona Tucson AZ 85721 USA 6 Department of Genome Research National Institute of Agrobiological Sciences 1-2 Kannondai 2-

chome Tsukuba Ibaraki 305-8602 Japan Akhilesh K Tyagi Email akhileshgenomeindiaorg Wherever RiceGAAS data were not available the genes were predicted by FGENESH trained for monocot plant species (http wwwsoftberrycomberryphtml) TAG Theoretical and Applied Genetics DOI 101007s00122-004-1621-0 Issue Volume 109 Number 1 Date June 2004 Pages 10 - 22

Annotation of a 95-kb Populus deltoides genomic sequence reveals a disease resistance gene cluster and novel class I and class II transposable elements

M Lescot1 4 S Rombauts1 J Zhang1 S Aubourg1 5 C Matheacute1 6 S Jansson2 P Rouzeacute1 3 and W Boerjan1

1 Department of Plant Systems Biology Flanders Interuniversity Institute for Biotechnology Ghent University Technologiepark 927 9052 Gent Belgium

2 Department of Plant Physiology University of Umearing 901 87 Umearing Sweden 3 Laboratoire Associeacute de l Institut National de la Recherche Agronomique (France) Ghent

University 9052 Gent Belgium 4 Present address CIRAD-Biotrop TA4003 34398 Montpellier Cedex 5 France 5 Pre sent address Uniteacute de Recherche en Geacutenomique Veacutegeacutetale INRA 91057 Evry Cedex

France 6 Present address Laboratoire de Biologie Vasculaire Institut de Pharmacologie et Biologie

Structurale 205 route de Narbonne 31077 Toulouse Cedex France

P Rouzeacute Email pierrerouzepsbugentbe 1999 httpwwwtigrorgtdbglimmermglmr_formhtml) and FgenesH for dicots or monocots (Salamov and Solovyev 2000 httpwwwsoftberrycom) BIOINFORMATICS 2004 vol20 N9 p1416-1427 J Yuan B Bush A Elbrecht Y Liu T Zhang W Zhao hellip - suchasGRAIL(Lopezetal 1994 Roberts 1991 Uberbacher et al 1996) GENESCOPE (Murakami and Takagi 1998) fgenesh (Salamov and Solovyev 2000) GeneMark Molecular Plant Pathology Volume 5 Issue 6 Page 515 - November 2004 Pathogen profile Heading for disaster Fusarium graminearum on cereal crops RUBELLA S GOSWAMI AND H CORBY KISTLERCorrespondence E-mail HCKISTumnedu This pipeline uses a combination of the programs FGENESH and FGENESH+ (Salamov and Solovyev 2000) modified by Softberry ( http wwwsoftberrycom ) with Nucleic Acids Research 2004 Vol 32 Database issue D41-D44 MIPS analysis and annotation of proteins from whole genomes H W Mewes12 C Amid1 R Arnold1 D Frishman2 U Guumlldener1 G Mannhaupt2 M Muumlnsterkoumltter1 P Pagel1 N Strack2 V Stuumlmpflen1 J Warfsmann1 and A Ruepp1 1 Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1 D-85764 Neuherberg Germany and 2 Technische Universitaumlt Muumlnchen Chair of Genome Oriented Bioinformatics Center of Life and Food Science D-85350 Freising-Weihenstephan Germany To whom correspondence should be addressed at Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 D-85764 Neuherberg Germany Tel +49 89 3187 3580 Fax +49 89 3187 3585 Email wmewesgsfde The genome of 40 Mb encodes 10 000 proteins automatically predicted by the program FGENESH (httpsoftberry com) specifically trained for Neurospora Annual Review of Genomics and Human Genetics Vol 5 15-56 (Volume publication date September 2004) COMPARATIVE GENOMICS Webb Miller Kateryna D Makova Anton Nekrutenko and Ross C Hardison The Center for Comparative Genomics and Bioinformatics The Huck Institutes of Life Sciences and the Departments of Biology Computer Science and Engineering and Biochemistry and Molecular Biology Pennsylvania State University University Park Pennsylvania email webbbxpsuedu kdm16psuedu antonbxpsuedu rch8psuedu These algorithms include Genscan the most popular gene prediction tool (24) GenMark (117) Fgenesh (155) GeneID (144) and others (for an excellent overview DNA and Cell Biology May 2004 Vol 23 No 5 311-324 Harbinger Transposons and an Ancient HARBI1 Gene Derived from a Transposase

Vladimir V Kapitonov Genetic Information Research Institute Mountain View California Jerzy Jurka Genetic Information Research Institute Mountain View California We used FGENESH (Salamov and Solovyev 2000) and GeneScan (Burge and Karlin 1997) for the identification of exons and introns The d N d S ratio Proc Natl Acad Sci U S A 2004 February 17 101(7) 1910ndash1915 Published online 2004 February 5 doi 101073pnas0308430100 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin NamdaggerDagger Joonyul Kimsectpara Shinyoung Leesect Gynheung Ansect Hong Madagger and Masatoshi Neidagger

daggerInstitute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University University Park PA 16802 and sectNational Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Dagger To whom correspondence should be addressed E-mail jyn101psuedu paraPresent address Michigan State UniversityndashDepartment of Energy Plant Research Laboratory and Department of Biochemistry and Molecular Biology Michigan State University East Lansing MI 48824 of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program fgenesh (wwwsoftberrycom Nucleic Acids Research 2004 Vol 32 Database issue D377-D382 BGI-RIS an integrated information resource and comparative analysis workbench for rice genomics Wenming Zhao1 Jing Wang2 Ximiao He1 Xiaobing Huang1 Yongzhi Jiao1 Mingtao Dai1 Shulin Wei1 Jian Fu1 Ye Chen1 Xiaoyu Ren1 Yong Zhang12 Peixiang Ni1 Jianguo Zhang1 Songgang Li12 Jian Wang1 Gane Ka-Shu Wong13 Hongyu Zhao4 Jun Yu1 Huanming Yang1 and Jun Wang1 1 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3 University of Washington Genome Center Department of Medicine Seattle WA 98195 USA and 4 Yale University School of Medicine Department of Epidemiology and Public Health New Haven CT 06520-8034 USA To whom correspondence should be addressed Tel +86 10 80481662 Fax +86 10 80498676 Email wangjgenomicsorgcn Correspondence may also be addressed to Huanming Yang Tel +86 10 80494969 Fax +86 10 80491181 Email yanghmgenomicsorgcn The authors wish it to be known that in their opinion the first four authors should be regarded as joint First Authors The contig sequences were annotated for gene content by using automated processes that involve ab initio gene finders such as FgeneSH (httpwwwsoftberrycom Genome Research 141932-1937 2004 Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome

Jinsheng Lai1 Nrisingha Dey2 Cheol-Soo Kim35 Arvind K Bharti1 Stephen Rudd46 Klaus FX Mayer4 Brian A Larkins3 Philip Becraft2 and Joachim Messing17 1 Waksman Institute Rutgers The State University of New Jersey Piscataway New Jersey 08854 USA 2 Department of Genetics Development amp Cell Biology Iowa State University Ames Iowa 50011 USA 3 Department of Plant Science University of Arizona Tucson Arizona 85721 USA 4 Munich Information Center for Protein Sequences Institute for Bioinformatics GSF Research Center for Environment and Health Neuherberg Germany A total of 54397 putative genes could be predicted for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for Plant Molecular Biology DOI 101023BPLAN000002876821587dc Issue Volume 54 Number 1 Date January 2004 Pages 55 - 69

Dynamics of the evolution of orthologous and paralogous portions of a complex locus region in two genomes of allopolyploid wheat

Xiu-Ying Kong1 2 Yong Qiang Gu3 Frank M You4 Jorge Dubcovsky4 and Olin D Anderson3

1 Genetic Resources Conservation Program University of California Davis CA 95616 USA 2 Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081

China 3 US Dept of Agriculture Western Regional Research Center Agricultural Research Service 800

Buchanan Street Albany CA 94710 USA 4 Department of Agronomy and Range Sciences University of California Davis CA 95616 USA FGENESH (httpwwwsoftberrycomberryphtml) and GENES- CAN (httpgenemark miteduGENESCANhtml) were used for gene prediction Current Genetics DOI 101007s00294-003-0451-y Issue Volume 44 Number 6 Date January 2004 Pages 329 - 338

Chromosome rearrangements in isolates that escape from het-c heterokaryon incompatibility in Neurospora crassa

Qijun Xiang1 and N Louise Glass1

Department of Plant and Microbial Biology University of California Berkeley CA 94720-3102 USA Hypothetical proteins are predicted from FGENESH calls with overlapping Blastx hits (but not with trusted homology) while Predicted Molecular Genetics and Genomics DOI 101007s00438-004-0990-z Issue Volume 271 Number 4 Date May 2004 Pages 402 - 415

Genome-wide identification of NBS genes in japonica rice reveals significant expansion of divergent non-TIR NBS-LRR genes

T Zhou1 Y Wang1 J-Q Chen1 H Araki2 Z Jing1 K Jiang1 J Shen1 and D Tian1

1 State Key Laboratory of Pharmaceutical Biotechnology Department of Biology Nanjing University 210093 Nanjing China

2 Department of Ecology and Evolution University of Chicago Chicago IL 60637 USA to 5000ndash10000 bp from both ends of the hits and then the expanded nucleotide fragments were reannotated using the gene-finding programs FGENESH (http www Proc Natl Acad Sci U S A 2004 June 15 101(24) 9045ndash9050 Genetics

Genetic control of branching in foxtail millet Andrew N Doust Katrien M DevosDaggerdagger Michael D Gadberrysect Mike D GaleDagger and Elizabeth A KelloggDepartment of Biology University of Missouri 8001 Natural Bridge Road St Louis MO 63121 and DaggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH United Kingdom dagger To whom correspondence should be addressed E-mail adoustumsledu sectPresent address Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building University of Georgia Athens GA 30602 Communicated by Peter H Raven Missouri Botanical Garden St Louis MO April 23 2004 Each of these contigs was scanned by using fgenesh (28) and identified ORFs were translated and compared with ORFs from other contigs from the same QTL region Mol Biol Evol 21(9)1769-1780 2004 Merlin a New Superfamily of DNA Transposons Identified in Diverse Animal Genomes and Related to Bacterial IS1016 Insertion Sequences Ceacutedric Feschotte1 Departments of Plant Biology and Genetics The University of Georgia Athens Correspondence E-mail cedricplantbiougaedu coding sequences were assembled by removing introns predicted with more than 85 confidence by NetGene2 (httpwwwcbsdtudk) andor FGENESH (httpgenomic Genome Research 141924ndash1931 copy2004 Gene Loss and Movement in the Maize Genome Jinsheng Lai1 Jianxin Ma23 Zuzana Swigonˇovб1 Wusirika Ramakrishna24

Eric Linton15 Victor Llaca16 Bahattin Tanyolac17 Yong-Jin Park28 O-Young Jeong29

Jeffrey L Bennetzen23 and Joachim Messing110 1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854-8020 USA 2Department of Biological Sciences Purdue University West Lafayette Indiana 47907-1392 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602-7223 USA 4Department of Biological Sciences 740 DOW Michigan Tech University Houghton MI 49931 USA 5Plant Biology Labs Michigan State University East Lansing MI 48824 USA 6Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880-0353 USA 7Department of Bioengineering Ege University Izmir 35100 Turkey 8National Institute of Agricultural Biotechnology Suwon 441-707 Republic of Korea 9National Institute of Crop Science Suwon 441-857 Republic of Korea 10Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 The FGENESH program predicted four of which three (gene 1d in the maize orp1 region gene 5a 5b in the rice r1 region) would produce truncated proteins

Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Molecular Microbiology Volume 54 Issue 2 Page 407 - October 2004 doi101111j1365-2958200404310x Cryptococcus neoformans Kin1 protein kinase homologue identified through a Caenorhabditis elegans screen promotes virulence in mammals Eleftherios Mylonakis1 Alexander Idnurm2 Roberto Moreno1 Joseph El Khoury134 James B Rottman5 Frederick M Ausubel67 Joseph Heitman28910 and Stephen B Calderwood1111Division of Infectious Diseases Massachusetts General Hospital Boston MA 02114 USA 2Department of Molecular Genetics and Microbiology Duke University Medical Center Durham NC 27710 USA 3Center for Immunology and Inflammatory Diseases and 4Division of Rheumatology Allergy and Immunology Massachusetts General Hospital Boston MA 02114 USA 5Archemix Corporation Cambridge MA 02139 USA 6Department of Molecular Biology Massachusetts General Hospital Boston MA 02114 USA 7Department of Genetics Harvard Medical School Boston MA 02115 USA 8Division of Infectious Diseases 9Department of Medicine and 10Howard Hughes Medical Institute Duke University Medical Center Durham NC 27710 USA 11Department of Microbiology and Molecular Genetics Harvard Medical School Boston MA 02115 USA E-mail scalderwoodpartnersorg

Sequences were compared with the H99 genome database at Duke University and genes predicted in these regions by FGENESH software ( http wwwsoftberrycom TAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Original Paper Characterization of soybean genomic features by analysis of its expressed sequence tags Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

1 Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

2 Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China Jin-Song Zhang Email jszhanggeneticsaccnShou-Yi Chen Email sychengeneticsaccn six BAC-contig sequences of M truncatula were analyzed and the results based on the gene prediction program FGENSH (Arabidopsis matchFGENESH prediction) (http Current Proteomics Volume 1 Number 1 January 2004 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE11 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Chinese Science Bulletin 2004 Vol 49 No 4 355-362 The VER2 promoter contains repeated sequences and requires vernalization for its activity in winter wheat (Triticum aestivum L) XU Wenzhong

1 WANG Xin

1 FENG Qi

2 ZHANG Lei

2 LIU Yaoguang

3 HAN Bin

2 CHONG

Kang1 XU Zhihong

1 amp TAN Kehui

1

1 Research Center for Molecular Developmental Biology Key Lab of Photosynthesis and Environmental Molecular Physiology Institute of Botany Chinese Academy of Sciences (CAS) Beijing 100093 China 2 National Center for Gene Research CAS Shanghai 200233 China 3 Genetic Engineering Laboratory College of Life Sciences South China Agricultural University Guangzhou 510642 China Correspondence should be addressed to Chong Kang (e-mail mailtochongknsibcasaccn) Sequence analyses were finished using biological softwares on Internet such as FGENESH 10 (Prediction of potential genes in Plant (Dct) genomic DNA)

TAG Theoretical and Applied Genetics DOI 101007s00122-003-1457-z Issue Volume 108 Number 3 Date February 2004 Pages 392 - 400

Sequence variations of simple sequence repeats on chromosome-4 in two subspecies of the Asian cultivated rice Can Li1 Yu Zhang1 Kai Ying1 Xiaolei Liang1 and Bin Han1

(1) National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China To characterize the possible relationship between SSRs and genes predicted by using FGENESH we investigated the distribution of SSRs in the rice chromosome-4 Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene TAG Theoretical and Applied Genetics DOI 101007s00122-004-1591-2 Issue Volume 108 Number 8 Date May 2004 Pages 1449 - 1457

Positional cloning of the rice Rf-1 gene a restorer of BT-type cytoplasmic male sterility that encodes a mitochondria-targeting PPR protein

H Akagi1 A Nakamura2 Y Yokozeki-Misono2 A Inagaki2 4 H Takahashi1 K Mori1 and T Fujimura3

1 Laboratory of Plant Breeding and Genetics Department of Biological Production Faculty of Bioresource Sciences Akita Prefectural University Kaidoubata-Nishi 241-7 Shimoshinjyo-Nakano 010-0195 Akita Japan

2 Biochemical Technology Section Life Science Laboratory Performance Materials RampD Center Mitsui Chemicals Togo 1144 297-0017 Mobara Japan

3 Institute of Agricultural and Forest Engineering University of Tsukuba Tennoudai 1-1-1 Tsukuba 305-8572 Ibaraki Japan

4 Present address Department of Synthetic Chemistry and Biological Chemistry Graduate School of Engineering Kyoto University Yoshida-Honmachi Sakyo-ku 606-8501 Kyoto japan

H Akagi Email akagiakita-puacjp Software Develop- ment Tokyo) Genomic sequences were also analyzed using gene prediction programs genescan and fgenesh Table 1 DNA Genome Research 14942-950 2004

The Ensembl Automatic Gene Annotation System Val Curwen1 Eduardo Eyras1 T Daniel Andrews1 Laura Clarke1 Emmanuel Mongin2 Steven MJ Searle1 and Michele Clamp34 1 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 EMBL European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SD UK 3 The Broad Institute Cambridge Massachusetts 02141 USA Commonly we use Genscan for ab initio prediction in human mouse and rat but the system is equally applicable to other methods such as FgenesH (Solovyev et Gene 324 (2004) 105ndash115 Transcript abundance of rml1 encoding a putative GT1-like factor in rice is up-regulated by Magnaporthe grisea and down-regulated by light Rong Wang ab Guofan Hongab Bin Hana a

National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China b

Shanghai Institute of Biochemistry and Cell Biology Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 320 Yueyang Road Shanghai 200031 China 1A and 2A) The structure of rml1 _ a is as same as that predicted by FGENESH software The 3V UTR of rml1 _ a is confirmed with the length of 596 bp Genome Biology 2004 5R46 doi101186gb-2004-5-7-r46 Identification of conserved gene structures and carboxy-terminal motifs in the Myb gene family of Arabidopsis and Oryza sativa L ssp Indica Cizhong Jiang1 Xun Gu1 2 and Thomas Peterson1 1Department of Genetics Development and Cell Biology and Department of Agronomy Iowa State University Ames IA 50011 USA 2LHB Center for Bioinformatics and Biological Statistics Iowa State University Ames IA 50011 USA FGeneSH has been used successfully to predict genes in rice [9] and GenScan was used together with it to predict genes by taking rice genomic sequences as Molecular Microbiology 53 (5) 1307-1318 - September 2004 doi 101111 j1365-2958200404215x The sirodesmin biosynthetic gene cluster of the plant pathogenic fungus Leptosphaeria maculans Donald M Gardiner1 Anton J Cozijnsen1 Leanne M Wilson1 M Soledade C Pedras2 and Barbara J Howlett1

1School of Botany The University of Melbourne Victoria Australia 3010 2Department of Chemistry University of Saskatchewan 110 Science Place Saskatoon SK Canada S7N 5C9 E-mail dgardinerpgradunimelbeduau http wwwtigrorg Putative genes were predicted using Fgenesh software at http wwwsoftberrycom Fungal culture The wild type Journal of Molecular Evolution DOI 101007s00239-004-2666-z Issue Volume 59 Number 6 Date December 2004 Pages 761 - 770 Analysis of the Molecular Evolutionary History of the Ascorbate Peroxidase Gene Family Inferences from the Rice Genome Felipe Karam Teixeira1 Larissa Menezes-Benavente1 Rogeacuterio Margis1 2 and Maacutercia Margis-Pinheiro1

(1) Laboratoacuterio de Geneacutetica Molecular Vegetal Departamento de Geneacutetica UFRJ 21944-970 Rio de Janeiro Brasil(2) Departamento de Bioquiacutemica Instituto de Quiacutemica UFRJ 21944-970 Rio de Janeiro Brasil Genomic se- quences were also analyzed in the FGENESH gene structure pre- diction program (httpwwwsoftberrycom) (Solovyev 2001) and GeneMark program (http Incomplete gene structure prediction with almost 100 specificity SL Chin J Xiong T Ioerger SH Sze - 2004 - txspacetamuedu iv Fgenesh-C The proposed gene structure prediction algorithm by far has the best 25 12 Comparison against Est2Genome Sim4 Spidey Fgenesh-c Journal of Biotechnology 109 (2004) 217ndash226

Preparation of single rice chromosome for construction of a DNA library using a laser microbeam trap

Xiaohui Liu a1 Haowei Wang b1 Yinmei Li b Yesheng Tang a Yilei Liu a Xin u a Peixin Jia a Kai Ying a Qi Feng a Jianping Guan a Chaoqing Jin a Lei Zhang a Liren Lou b Zhuan Zhou c Bin Han aA National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200233 China B Department of Physics University of Science and Technology of China Hefei 230026 China C Institute of Neuroscience Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200031 China Corresponding author Fax +86-21-64825775 E-mail address bhanncgraccn (B Han) ers These sequences were further annotated using gene-prediction software FGENESH to give the pos- sible protein-coding region Science 303 1364-1367 Medicago truncatula DMI1 Required for Bacterial and Fungal Symbioses in Legumes Ane et al (2004) 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberry phtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Proc Natl Acad Sci U S A 2004 August 24 101(34) 12404ndash12410 Inaugural Articles Rapid recent growth and divergence of rice nuclear genomes Jianxin Ma and Jeffrey L Bennetzen

Department of Genetics University of Georgia Athens GA 30602 To whom correspondence should be addressed E-mail maizeugaedu Contributed by Jeffrey L Bennetzen May 25 2004 Almost all LTR-retrotransposons including solo LTRs identified in our studies were predicted as genes by the gene-finding program fgenesh (data not shown) The Plant Journal Volume 37 Issue 4 Page 517 -527 - February 2004 doi101046j1365-313X200301976x Xa26 a gene conferring resistance to Xanthomonas oryzae pv oryzae in rice encodes an LRR receptor kinase-like protein Xinli Sun Yinglong Cao Zhifen Yang Caiguo Xu Xianghua Li Shiping Wang and Qifa Zhang National Key Laboratory of Crop Genetic Improvement National Center of Crop Molecular Breeding Huazhong Agricultural University Wuhan 430070 China For correspondence (fax +86 27 87287092 e-mail swangmailhzaueducn) al 1997) Gene prediction programs used were genscan (Burge and Karlin 1997) and fgenesh (httpwwwsoftberrycom) Promoter Genome Research 141916ndash1923 copy2004 by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonˇovaґ16 Jinsheng Lai16 Jianxin Ma23 Wusirika Ramakrishna24

Victor Llaca15 Jeffrey L Bennetzen23 and Joachim Messing17

1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854 USA

2Department of Biological Sciences and Genetics Program West Lafayette Indiana 47907 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602 USA 4Department of Biological Sciences Michigan Tech University MI 49931 USA 5Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880 USA 6These authors contributed equally to this work 7Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 Page 1 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonovaacute 16 Jinsheng Lai 16 Jianxin Ma 23 Wusirika Ramakrishna 24 TAG Theoretical and Applied Genetics DOI 101007s00122-004-1667-z Issue Volume 109 Number 3 Date August 2004 Pages 515 - 522

Physical mapping and putative candidate gene identification of a quantitative trait locus Ctb1 for cold tolerance at the booting stage of rice K Saito1 Y Hayano-Saito1 W Maruyama-Funatsuki1 Y Sato1 and A Kato1

(1) National Agricultural Research Center for Hokkaido Region Hitsujigaoka 1 Toyohira Sapporo Hokkaido 062-8555 JapanK Saito Email kjsaitoaffrcgojp GENSCAN RICEHMM FGENESH MZEF ) a splice prediction program ( SPLICEPREDIC- TOR ) homology search analysis programs ( BLAST HMMER TAG Theoretical and Applied Genetics DOI 101007s00122-004-1697-6 Issue Volume 109 Number 4 Date August 2004 Pages 690 - 699 The anthracnose resistance locus Co-4 of common bean is located on chromosome 3 and contains putative disease resistance-related genes M Melotto1 4 M F Coelho1 A Pedrosa-Harand2 J D Kelly3 and L E A Camargo1

1 Departamento de Fitopatologia Laboratoacuterio de Geneacutetica Molecular ESALQ Universidade de Satildeo Paulo Piracicaba SP CP 9 13418-900 Brazil

2 Department of Cell Biology and Genetics Institute of Botany University of Vienna Rennweg 14 Vienna 1030 Austria

3 Department of Crop and Soil Sciences Michigan State University East Lansing MI 48824 USA 4 Present address MSU-DOE Plant Research Laboratory Michigan State University 206 Plant Biology

Building East Lansing MI 48824 USA M Melotto Email melottommsuedu and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (httpwww softberrycom)mdashusing Arabidopsis as the model or- ganism Journal of Genetics Vol 83 No 1 P 79-99 April 2004 Structural and functional analysis of rice genome Tyagi A K Khurana J P Khurana P Raghuvanshi S Gaur A Kapur A Gupta V Kumar D Ravi V Vij S Khurana P and Sharma S Department of Plant Molecular Biology University of Delhi South Campus Benito Juarez Road New Delhi 110 021 India It inte- grates results from several gene prediction software such as GENSCAN (Burge and Karlin 1997) FGENESH (Sala- mov and Solovyev 2000) RiceHMM (Sakata The Plant Cell 161220-1234 (2004) Comparative Analysis of the Receptor-Like Kinase Family in Arabidopsis and Rice Shin-Han Shiua Wojciech M Karlowskib Runsun Panad Yun-Huei Tzengac Klaus F X Mayerb and Wen-Hsiung Lia1 a Department of Ecology and Evolution University of Chicago Chicago Illinois 60637 b Munich Information Center for Protein SequencesInstitute of Bioinformatics GSF National Research Center for Environment and Health Neuherberg 85764 Germany c Department of Mathematics National Tsing Hua University Hsinchu Taiwan 300 d Institute of Information Science Academia Sinica Taiwan 115

1 To whom correspondence should be addressed E-mail whliuchicagoedu fax 773-702-9740 a permissive E value cutoff of 1 The rice genes from the indica subspecies was predicted using the whole genome shotgun assembly with FGENESH (Solovyev 2002 Genome Research 141474ndash1482 (2004) copy by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Incongruent Patterns of Local and Global Genome Size Evolution in Cotton Corrinne E Grover1 HyeRan Kim2 Rod A Wing2 Andrew H Paterson3 and Jonathan F Wendel14 1Department of Ecology Evolution and Organismal Biology Iowa State University Ames Iowa 50011 USA 2Arizona Genomics Institute University of Arizona Tucson Arizona 85721 USA 3Plant Genome Mapping Laboratory University of Georgia Athens Georgia 30602 USA hellip Potential genes were predicted by three independent programs FGENESH (httpwwwsoftberrycom) Plant Physiology May 2004 Vol 135 pp 459-470 Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four Triticeae Genomes Yong Qiang Gu Devin Coleman-Derr Xiuying Kong and Olin D Anderson United States Department of Agriculture-Agricultural Research Service Western Regional Research Center Albany California 94710 (YQG DC-D ODA) and Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081 China (XK) FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology October 2004 Vol 136 pp 3177-3190 Comparative Sequence Analysis of the Region Harboring the Hardness Locus in Barley and Its Colinear Region in Rice1

Katherine S Caldwell2 Peter Langridge and Wayne Powell Scottish Crop Research Institute Invergowrie Dundee DD2 5DA United Kingdom (KSC WP) and School of Agriculture and Wine (KSC PL) and Australian Centre for Plant Functional Genomics (PL) University of Adelaide Waite Campus Glen Osmond South Australia 5064 Australia jp Sakata et al 2002 ) which couples the integration of several programs for the prediction of open reading frames (GENSCAN RiceHMM FGENESH MZEF) with GENES amp DEVELOPMENT 18687-699 2004 pyramus and thisbe FGF genes that pattern the mesoderm of Drosophila embryos

Angelike Stathopoulos1 Bergin Tam1 Matthew Ronshaugen1 Manfred Frasch2 and Michael Levine13 1 Department of Molecular and Cell Biology Division of Genetics amp Development University of California Berkeley California 94720-3204 USA 2 Brookdale Department of Molecular Cell and Developmental Biology Mount Sinai School of Medicine New York New York 10029 USA hellip FGF protein sequences used in alignment and phylogenetic reconstruction were gathered from GenBank or inferred from genomic sequence using GENESCAN (Burge and Karlin 1997 ) and FGENESHhellip Genome Research 141888-1901 2004 Organization and Evolution of a Gene-Rich Region of the Mouse Genome A 127-Mb Region Deleted in the Del(13)Svea36H Mouse Ann-Marie Mallon14 Laurens Wilming24 Joseph Weekes1 James GR Gilbert2 Jennifer Ashurst2 Sandrine Peyrefitte2 Lucy Matthews2 Matthew Cadman1 Richard McKeone1 Chris A Sellick1 Ruth Arkell1 Marc RM Botcherby3 Mark A Strivens1 R Duncan Campbell3 Simon Gregory25 Paul Denny1 John M Hancock16 Jane Rogers2 and Steve DM Brown1 1 Medical Research Council Mammalian Genetics Unit Harwell Oxfordshire United Kingdom 2 Wellcome Trust Sanger Institute Hinxton Genome Campus United Kingdom 3 Medical Research Council Rosalind Franklin Centre for Genomics Research Hinxton Genome Campus United Kingdom hellip Ab initio gene structures were predicted using FGENESH (Salamov and Solovyev 2000 ) and GENSCANhellip Current Proteomics January 2004 vol 1 no 1 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE1 Affiliations 1 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom

sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene Nucleic Acids Research 2003 Vol 31 No 4 1148-1155 Characterization of Arabidopsis thaliana ortholog of the human breast cancer susceptibility gene 1 AtBRCA1 strongly induced by gamma rays S Lafarge and M-H Montaneacute CEA Cadarache DSV-DEVM Laboratoire de Radiobiologie Veacutegeacutetale Bat 185 F-13108 St Paul Lez Durance Cedex France To whom correspondence should be addressed Tel +33 4 42 25 35 56 Fax + 33 4 42 25 26 25 Email marie-helenemontaneceafr Received November 18 2002 Accepted December 5 2002 DDBJEMBLGenBank accession no AF515728 hellipGene structure prediction was done on software implemented on the Softberry web page (httpwwwsoftberrycom) analysis of protein domains using the SMARThellip hellipThe gene structure of At4g21070 was determined with three gene structure prediction software packages (Softberry GenScan Grail) hellip To resolve this ambiguity in intronndashexon prediction we postulated the presence of two genes given by Softberry prediction software and performed northern blotting and 5 RACE to characterize the structural organization of the At4g21070 locushellip Proc Natl Acad Sci U S A 2003 July 22 100(15) 9055ndash9060 doi 101073pnas1032999100 Plant Biology Gene expression of a gene family in maize based on noncollinear haplotypes Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway NJ 08854-8020 Communicated by Brian A Larkins University of Arizona Tucson AZ May 19 2003 (received for review 2002 April 10) To whom correspondence should be addressed E-mail messingmbclrutgersedu The FGENESH program (Softberry Mount Kisco NY) was used for gene prediction analysis BMC Genomics 2003 4 22

doi 1011861471-2164-4-22 Published online 2003 June 3 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams 12 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA 3Department of Biological Sciences Wichita State University Wichita Kansas USA

Corresponding author Sreedhar Oduru odurusreedharttuhscedu Janee L Campbell janeecampbellttuhscedu SriTulasi Karri phrskttuhscedu William J Hendry williamhendrywichitaedu Shafiq A Khan shafiqkhanttuhscedu Simon C Williams simonwilliamsttuhscedu

Two gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=H Insect Molecular Biology Volume 12 Issue 4 Page 319 - August 2003 doi101046j1365-2583200300415x Expression of an Aedes aegypti cation-chloride cotransporter and its Drosophila homologues V Filippov K Aimanova and S S GillAffiliations Department of Cell Biology and Neuroscience University of California Riverside USA Correspondence Sarjeet S Gill 5429 Boyce Hall Environmental Toxicology Graduate Program University of California Riverside CA 92521 USA Tel +1 909 787 4621 Fax +1 909 787 3087 E-mail Sarjeetgillucredu significant similarity to the Drosophila genes were used for gene structure prediction with the FGENESH program available on site http wwwsoftberrycom Developmental Biology 256 (2003) 276ndash289 tcl-2 encodes a novel protein that acts synergistically with Wnt signaling pathways in C elegans Xiaojun Zhaoa Hitoshi Sawab and Michael A Hermana a Program in Molecular Cellular and Developmental Biology Division of Biology Kansas State University Manhattan KS 66506 USA b Laboratory for Cell Fate Decision RIKEN Center for Developmental Biology 2-2-3 Minatojima-minamimachi Chuo-ku Kobe 650-0047 Japan Received for publication 10 September 2002 revised 25 November 2002 accepted 19 December 2002 hellipCbTCL-2 is conceptually translated from a gene predicted by the FGENSH (Salamov and Solovyev 2000 httpwwwsoftberrycom) using defaults for C elegans genomic sequences Proc Natl Acad Sci U S A 2003 May 27 100(11) 6569ndash6574 doi 101073pnas0732024100 Evolution Molecular paleontology of transposable elements in the Drosophila melanogaster genome Vladimir V Kapitonov and Jerzy Jurka

Genetic Information Research Institute 2081 Landings Drive Mountain View CA 94043 Communicated by Margaret G Kidwell University of Arizona Tucson AZ April 7 2003 (received for review 2002 December 23)

To whom correspondence may be addressed E-mail vladimirulamgirinstorg or jurkagirinstorg hellipWe used FGENESH (ref 18 wwwsoftberrycom) for identifying genes encoded by TEs Genetics and Molecular Biology ISSN 1415-4757 versioacuten impresa Genet Mol Biol v26 n4 Satildeo Paulo dic 2003 Iron homeostasis related genes in rice Jeferson GrossI II Ricardo Joseacute SteinII Arthur Germano Fett-NetoI II Janette Palma FettI II

IUniversidade Federal do Rio Grande do Sul Centro de Biotecnologia Porto Alegre RS Brazil IIUniversidade Federal do Rio Grande do Sul Departamento de Botacircnica Porto Alegre RS Brazil IIIBotanical Institute of Ludwig-Maximilians-Universitaumlt Muumlnchen Germany The prediction algorithms were GenScan (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) GenomeScan (Burge and Karlin 1997 httpgenesmitedugenomescanhtml) FGENESH (Salamov and Solovyev 2000 httpwwwsoftberrycomberryphtmltopic= gfind) GeneMarkhmm (Borodovsky and Lukashin unpublished httpopalbiologygatecheduGeneMarkeukhmmcgi) and GrailEXP (Xu and Uberbacher 1997 httpcompbioornlgovgrailexp) Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome J Lai N Dey CS Kim AK Bharti S Rudd KFX Mayer hellip for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for monocotyledonous genes (httpwwwsoftberrycom) Published 23 September 2003 BMC Plant Biology 2003 36 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya12 Address 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia and 2NSW Agricultural Genomics Centre Wagga Wagga Australia Email Qian-Hao Zhu - qianhaozhucsiroau Mohammad Shamsul Hoque - mohammadhoquecsiroau Elizabeth S Dennis - lizdenniscsiroau Narayana M Upadhyaya - narayanaupadhyayacsiroau Corresponding author hellip Analyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCANhellip

The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Australasian Plant Pathology Volume 32 Number 4 2003 pp 511-519 Small scale functional genomics of the blackleg fungus Leptosphaeria maculans analysis of a 38 kb region Alexander Idnurm Janet L Taylor M Soledade C Pedras and Barbara J Howlett vertebrate and Arabidopsis settings Burge and Karlin 1997) and FGENESH on Neurospora crassa and Schizosaccharomyces pombe settings (wwwsoftberrycom) as Barley Genetics Newsletter Volume 32 Hard-copy edition pages 34 - 37 MAPPING AND SEQUENCING OF THE BARLEY PUTATIVE HYPERSENSITIVE INDUCED REACTION GENES Nils Rostoks1 David Kudrna1 and Andris Kleinhofs12

1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 2 School of Molecular Biosciences Washington State University Pullman WA 99164 The full length coding sequence was reconstructed using a combination of FGENESH gene prediction program (httpwwwsoftberrycom) and alignment with cDNAs from the other barley HIR groups TAG Theoretical and Applied Genetics DOI 101007s00294-003-0391-6 Issue Volume 43 Number 5 Date August 2003 Pages 351 - 357 Characterisation of the mating-type locus of the plant pathogenic ascomycete Leptosphaeria maculans Anton J Cozijnsen A1 and Barbara J Howlett A1 A1 School of Botany The University of Melbourne 3010 Victoria Australia hellipGenes introns exons and transcription initiation sites were predicted by analysis with FGENESH (wwwsoftberrycom) on Neurospora crassa andhellip BMC Plant Biol 2003 3 6

doi 1011861471-2229-3-6 Published online 2003 September 23 Received May 23 2003 Accepted September 23 2003 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya 12 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia 2NSW Agricultural Genomics Centre Wagga Wagga Australia

Corresponding author Qian-Hao Zhu qianhaozhucsiroau Mohammad Shamsul Hoque mohammadhoquecsiroau Elizabeth S Dennis lizdenniscsiroau Narayana M Upadhyaya narayanaupadhyayacsiroau hellipAnalyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml identified a single-exon gene capable of encoding a protein with the DNA binding domain of the EREBPAP2 family of plant transcription factors [2636] 1515 bp downstream from the Ds insertion The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Alignment of EREBPAP2 domains was performed using programs of Genetics Computer Group Wisconsin software suit [11] Genetics Vol 164 655-664 June 2003 Copyright copy 2003 Map-Based Cloning of Leaf Rust Resistance Gene Lr21 From the Large and Polyploid Genome of Bread Wheat Li Huanga Steven A Brooksa Wanlong Lia John P Fellersb Harold N Tricka and Bikram S Gilla a Wheat Genetics Resource Center Department of Plant Pathology Kansas State University Manhattan Kansas 66506-5502 b USDA-ARS Plant Science and Entomology Unit Kansas State University Manhattan Kansas 66506-5502 Corresponding author Bikram S Gill 4024 Throckmorton Kansas State University Manhattan KS 66506-5502 bsgksuedu (E-mail hellipIn addition FGENSH 11 (httpwwwsoftberrycom) was used for gene prediction (with monocot genomic DNA parameters) Nucleic Acids Research 2003 Vol 31 No 1 229-233 The TIGR rice genome annotation resource annotating the rice genome and creating resources for plant biologists Qiaoping Yuan Shu Ouyang Jia Liu Bernard Suh Foo Cheung Razvan Sultana Dan Lee John Quackenbush and C Robin Buell The Institute for Genomic Research 9712 Medical Center Dr Rockville MD 20850 USA To whom correspondence should be addressed Tel +1 301 8383558 Fax +1 301 8380208 Email rbuelltigrorg Received August 14 2002 Revised and Accepted October 2 2002 hellipThe rice sequences were processed with multiple ab initio gene finders including FGENESH (httpwwwsoftberrycom)hellip hellip Working models were generated using the FGENESH output and putative identification for the gene was obtained from the most significant database match while models with no significant database match were labeled as hypothetical proteins

JXB Advance Access originally published online on June 18 2003 Journal of Experimental Botany Vol 54 No 389 pp 1995-1996 August 1 2003 Received 21 April 2003 Accepted 25 April 2003 OsSET1 a novel SET-domain-containing gene from rice Yun-Kuan Liang Ying Wang Yong Zhang Song-Gang Li Xiao-Chun Lu Hong Li Cheng Zou Zhi-Hong Xu and Shu-Nong Bai PKU-Yale Joint Research Center of Agricultural and Plant Molecular Biology National Key Laboratory of Protein Engineering and Plant Gene Engineering College of Life Sciences Peking University 5 Yiheyuan Road Beijing 100871 PR China To whom correspondence should be addressed Fax +86 10 6275 1526 E-mail shunongbpkueducn It localizes at chromosome three in rice genome at the contig 1300 (httpwwwsoftberrycomberryphtmltopic=gfindampprg=FGENESH GenBank accession number BMC Genomics 2003 422 Published 3 June 2003 Received 31 January 2003 Accepted 3 June 2003 This article is available from httpwwwbiomedcentralcom1471-2164422 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams12 Address 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA and 3Department of Biological Sciences Wichita State University Wichita Kansas USA Email Sreedhar Oduru - odurusreedharttuhscedu Janee L Campbell - janeecampbellttuhscedu SriTulasi Karri - phrskttuhscedu William J Hendry - williamhendrywichitaedu Shafiq A Khan - shafiqkhanttuhscedu Simon C Williams - simonwilliamsttuhscedu Corresponding author hellipTwo gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=HsapiensTAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Characterization of soybean genomic features by analysis of its expressed sequence tags

Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

Jin-Song Zhang Email jszhanggeneticsaccn

Shou-Yi Chen Email sychengeneticsaccn Phone +86-10-64886859 Fax +86-10-64873428

(1) Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

(2) Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China

prediction of these BAC-contig sequences was based on the gene-prediction program FGENSH (Arabidopsis matchFGENESH DDT Vol 7 No 11 (Suppl) 2002 S70-S76 wwwdrugdiscoverytodaycom Genome annotation techniques new approaches and challenges Alistair G Rust Emmanuel Mongin and Ewan Birney European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridge UK CB10 1SD tel +44 1223 494420 fax +44 1223 494468 e-mail birneyebiacuk Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Box 1 Useful human genome annotation and browser URLs Human genome browsers bull UCSC Human Genome Browser httpgenomecseucsceducgi-binhgGateway bull Softberry Genome Explorer httpwwwsoftberrycomberryphtmltopic=genomexp Ab initio gene prediction programs Ab initio gene predictors rely on the statistical qualities of exons rather than on homologies Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Celerarsquos pipeline) and GrailEXP [16] (ORNL) Proc Natl Acad Sci U S A 2002 August 20 99(17) 11423ndash11428 doi 101073pnas162276199 Neurobiology Identification of G protein-coupled receptors for Drosophila PRXamide peptides CCAP corazonin and AKH supports a theory of ligand-receptor coevolution Yoonseong Parkdagger Young-Joon Kim and Michael E AdamsdaggerDagger

Departments of Entomology and daggerCell Biology and Neuroscience 5429 Boyce Hall University of California Riverside CA 92521 Edited by Lutz Birnbaumer National Institutes of Health Research Triangle Park NC and approved June 14 2002 (received for review 2002 May 7) DaggerTo whom reprint requests should be addressed E-mail adamsmailucredu

This article has been corrected See Proc Natl Acad Sci U S A 2002 October 15 99(21) 13961b For each Drosophila GPCR prediction of gene structure was made in FGENESH (wwwsoftberrycom ref 21) by using about 20 kb of genomic sequence surrounding highly conserved regions particularly for 5 prime and 3 prime ends of ORFs Putative Drosophila GPCRs in the database were amplified by RT-PCR using primers based on gene predictions in the FGENESH gene finder (wwwsoftberrycom ref 21) 21 Salamov A A amp Solovyev V V (2000) Genome Res 10 516-522

prediction httpwwwsoftberrycomberry

Eukaryotic Cell October 2002 p 719-724 Vol 1 No 5 Isocitrate Lyase Is Essential for Pathogenicity of the Fungus Leptosphaeria maculans to Canola (Brassica napus) Alexander Idnurm and Barbara J Howlett School of Botany The University of Melbourne Melbourne Victoria 3010 Australia Received 17 June 2002 Accepted 29 July 2002 hellip The DNA sequence obtained was compared to those in the GenBank database by using BLAST (1) and genes were predicted by using FGENESH software (httpwwwsoftberrycom) and GENSCAN (wwwbionavigatorcom) Bio-Almanac GLE ToxExpress P Offering CLG Annotated sequence data The genes are identified with the FGENESH11 gene modeling software exclusively li- censed from Softberry Inc Automatic Analysis of 106 kb of contiguous DNA sequence from the D genome of wheat reveals high gene density hellip SA Brooks L Huang BS Gill JP Fellers trix In addition FGENESH 11 (httpwwwsoftberrycom) was used for CDS prediction with monocot genomic DNA parameters Both Molecular Genetics and Genomics DOI 101007s00438-002-0706-1 Issue Volume 267 Number 6 Date August 2002 Pages 713 - 720 Genome sequencing of a 239-kb region of rice chromosome 10L reveals a high frequency of gene duplication and a large chloroplast DNA insertion Q Yuan J Hill J Hsiao K Moffat S Ouyang Z Cheng J Jiang C Buell A1 The Institute for Genomic Research 9712 Medical Center Drive Rockville MD 20850 USA A2 Department of Horticulture University of Wisconsin Madison WI 53706 USA The sequences were analyzed with several gene prediction programs including FGENESH (httpwwwsoftberrycom) Genemarkhmm (rice matrix httpopalbiology

Genetics Vol 162 1389-1400 November 2002 Copyright copy 2002 Different Types and Rates of Genome Evolution Detected by Comparative Sequence Analysis of Orthologous Segments From Four Cereal Genomes Wusirika Ramakrishnaa Jorge Dubcovskyb Yong-Jin Park1a Carlos Bussob John Embertona Phillip SanMiguelc and Jeffrey L Bennetzena a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Department of Agronomy and Range Science University of California Davis California 95616 c Purdue University Genomics Core WSLR Purdue University West Lafayette Indiana 47907

Corresponding author Jeffrey L Bennetzen Hansen Bldg Purdue University West Lafayette IN 47907 maizebilbobiopurdueedu (E-mail)

FGENESH (httpwwwsoftberrycomnucleohtml) with the maize training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and

GeneMarkhmm (httpgenemarkbiologygatecheduGene Mark) Functional amp Integrative Genomics DOI 101007s10142-002-0055-5 Issue Volume 2 Numbers 1-2 Date May 2002 Pages 51 - 59 Genomic sequencing reveals gene content genomic organization and recombination relationships in barley Nils Rostoks Yong-Jin Park Wusirika Ramakrishna Jianxin Ma Arnis Druka Bryan A Shiloff Phillip J SanMiguel Zeyu Jiang Robert Brueggeman Devinder Sandhu Kulvinder Gill Jeffrey L Bennetzen Andris Kleinhofs A1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA A2 Department of Biological Sciences Purdue University West Lafayette IN 47907 USA A3 National Center for Genome Resources 2935 Rodeo Park Drive East Santa Fe NM 87505 USA A4 G302 Agronomy Hall Iowa State University Ames IA 50011-1010 USA A5 Department of Agronomy University of Nebraska Lincoln NE 68583 USA A6 School of Molecular Biosciences and Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA version 10 with maize parameters The FGENESH predictions were run at httpwwwsoftberrycom BAC genomic regions were defined Structural organization of the barley D-hordein locus in comparison with its orthologous regions of hellip YQ Gu OD Anderson CF Londeore X Kong RN hellip et al 1997) to search for additional genes In addition FGENESH (httpwwwsoftberrycomberryphtml) and GENESCAN (httpgenes

Published online before print June 20 2002 101073pnas142284999 PNAS | July 9 2002 | vol 99 | no 14 | 9328-9333 The barley stem rust-resistance gene Rpg1 is a novel disease-resistance gene with homology to receptor kinases R Brueggeman N Rostoks D Kudrna A Kilian F Han J Chensect A Druka B Steffensonpara and A Kleinhofs Department of Crop and Soil Sciences Washington State University Pullman WA 99164-6420 para Department of Plant Pathology 495 Borlaug Hall 1991 Upper Buford Circle St Paul MN 55108-6030 and School of Molecular Biosciences Washington State University Pullman WA 99164-4234 Communicated by Diter von Wettstein Washington State University Pullman WA May 13 2002 (received for review March 25 2002) The gene prediction programs GENSCAN (httpgenesmiteduGENSCANhtml) and FGENESH (httpwwwsoftberrycom) as well as NEURAL NETWORK PROMOTER

PREDICTION (httpwwwfruitflyorgseq_toolspromoterhtml) localized the putative transcription start site of the gene about 400 bp upstream of the translation start site Plant Physiol 2002 December 130(4) 1626ndash1635 doi 101104pp012179 Received July 30 2002 Accepted October 1 2002 Contiguous Genomic DNA Sequence Comprising the 19-kD Zein Gene Family from Maize1

Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway New Jersey 08854ndash8020 Corresponding author e-mail messingmbclrutgersedu fax 732ndash445ndash0072 Draft sequences generated from high-throughput DNA sequencing (phase II) were subjected to gene prediction programs with FGENESH (Softberry Inc Mount Kisco NY) The Plant Cell Vol 14 3213-3223 December 2002 Copyright copy 2002 Received July 22 2002 accepted September 26 2002 Structural Analysis of the Maize Rp1 Complex Reveals Numerous Sites and Unexpected Mechanisms of Local Rearrangement Wusirika Ramakrishnaa John Embertona Matthew Ogdena Phillip SanMiguelb and Jeffrey L Bennetzen1a a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 1 To whom correspondence should be addressed E-mail maizebilbobiopurdueedu fax 765-496-1496 FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCAN Plant Physiol 2002 December 130(4) 1728ndash1738

doi 101104pp014951 Comparative Sequence Analysis of the Sorghum Rph Region and the Maize Rp1 Resistance Gene Complex Wusirika Ramakrishna John Emberton Phillip SanMiguel Matthew Ogden Victor Llaca Joachim Messing and Jeffrey L Bennetzen

Department of Biological Sciences Purdue University West Lafayette Indiana 47907 (WR JE MO JLB) Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 (PSM) and Waksman Institute Rutgers University Piscataway New Jersey 08854 (VL JM) Corresponding author e-mail maizebilbobiopurdueedu fax 765ndash496ndash1496 Received September 19 2002 Accepted October 8 2002 hellipAnnotation and sequence analysis were performed as described earlier (Dubcovsky et al 2001 Song et al 2001 Ramakrishna et al 2002a ) FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and GeneMarkhmm (httpopalbiologygatecheduGeneMarkeukhmmcgi) Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four hellipYQ Gu D Coleman-Derr X Kong OD Anderson FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology 2004 - plantphysiolorg - plantphysiolorg - intlplantphysiolorg - ncbinlmnihgov - all 5 versions raquoA Genome-Wide Screen Identifies Genes Required for Centromeric Cohesion JJ Doyle J Denarie F Debelle JC Prome BB Amor hellip 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberryphtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Various programs Plant Molecular Biology Issue Volume 58 Number 3 Date June 2005 Pages 421 ndash 433 DOI 101007s11103-005-5702-5 OsPPR1 a pentatricopeptide repeat protein of rice is essential for the chloroplast biogenesis Kodiveri M Gothandam1 Eun-Sook Kim1 Hongjoo Cho1 and Yong-Yoon Chung1

(1) School of Life Sciences and Biotechnology Korea University Sungbuk-ku 136-701 Seoul Anam-Dong Korea nucleotide and amino acid sequences were analyzed by the Basic Local Alignment SearchTool (BLAST) and the Soft berry prog- rame (httpwwwsoftberrycom)

Plant Physiol February 2002 Vol 128 pp 336-340 wwwplantphysiolorgcgidoi101104pp010875 Received September 25 2001 returned for revision September 29 2001 accepted November 2 2001 Cellulose Synthase-Like Genes of Rice1

Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 The Rice Genome Research Program cDNA clones were of high quality all but one were viable and accurately annotated The one exception D22177 was chimeric containing OsCSLA2 at one end and a predicted DNA-binding protein at the other For all sequences the corresponding

proteins were deduced using gene prediction software from GeneMark (Atlanta httpopalbiologygatecheduGeneMark) and Softberry Inc (White Plains NY httpwwwsoftberrycom) and by manual alignment with the Arabidopsis Csl proteins and with each other IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated Silby et al Microbiology2004 150 518-520 MW Silby PB Rainey SB Levy Using SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of Plant Cell 2002 September 14(9) 2107ndash2119 Received March 25 2002 Accepted June 14 2002 Two Novel Fungal Virulence Genes Specifically Expressed in Appressoria of the Rice Blast Fungus

Chaoyang Xuea Gyungsoon Parka Woobong Choib Li Zhengc Ralph A Deanb and Jin-Rong Xua1

aDepartment of Botany and Plant Pathology Purdue University West Lafayette Indiana 47907 bDepartment of Plant Pathology North Carolina State University Raleigh North Carolina 27606 cSyngenta Agribusiness Biotechnology Research Inc Research Triangle Park North Carolina 27709 1To whom correspondence should be addressed E-mail xubtnypurdueedu fax 765-494-0363 Approximately 12- and 14-kb upstream sequences of GAS1 and GAS2 were sequenced and analyzed with several programs including TRES (wwwbioportalbicnusedusgtres) Expasy (wwwexpasyorg) and SoftBerry (wwwsoftberrycom) European Journal of Neuroscience January 2002 vol 15 no 1 pp 79-86(8) Characterizing CGI-94 (comparative gene identification-94) which is down-regulated in the hippocampus of early stage Alzheimers disease brain Heese K Nakayama T Hata R Masumura M Akatsu H Li F Nagai Y Yamamoto T Kosaka K Suemoto T Sawada T Additionally protein sequence analysis was performed using the following programs at ExPASy httpwwwexpasych softberry httpwwwsoftberrycomindex Journal of Cellular Biochemistry Volume 91 Issue 5 Pages 1030 - 1042 Published Online 25 Feb 2004 Copyright copy 2004 Wiley-Liss Inc A Wiley Company Received 4 September 2003 Accepted 21 November 2003 Characterizing the new transcription regulator protein p60TRP K Heese 1 T Yamada 1 H Akatsu 2 T Yamamoto 2 K Kosaka 2 Y Nagai 1 T Sawada 1

1BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan 2Choju Medical Institute Fukushimura Hospital 19-14 Aza-Yamanaka Noyori Toyohashi Aichi 441-8124 Japan email K Heese (heeseksilverocnnejp) Correspondence to K Heese BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan expasych) softberry httpwwwsoftberry comindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorg toolsaacomp Genome Research 14929-933 2004 ISSN 1088-9051 $500 The Ensembl Core Software Libraries Arne Stabenau1 Graham McVicker1 Craig Melsopp1 Glenn Proctor1 Michele Clamp2 and Ewan Birney13 1 EMBL European Bioinformatics Institute Wellcome Trust Genome Campus Hinxton CB10 1SD UK 2 The Broad Institute Cambridge Massachusetts 02141-2023 USA

2003 ) Finally there have been several commercial genome management products based on proprietary technology from Softberry Celera and Doubletwist Generation of T-DNA tagging lines with a bidirectional gene trap vector 2 and the establishment of hellip G An annotated in the public databases we undertook 274 annotation with the Softberry program (http 275 wwwsoftberrycomberryphtml) Functional clas- 276 Journal of Bacteriology January 2002 p 183-190 Vol 184 No 1 0021-919301$0400+0 DOI 101128JB1841183-1902002 Received 25 July 2001 Accepted 11 October 2001 Regulation of the acuF Gene Encoding Phosphoenolpyruvate Carboxykinase in the Filamentous Fungus Aspergillus nidulans Michael J Hynes Oliver W Draht and Meryl A Davis Department of Genetics University of Melbourne Parkville Victoria 3010 Australia The Protein Sequence Analysis program (httpwwwsoftberrycomproteinhtml) predicted a PEPCK (ATP) signature sequence between amino acids 275 and 290 Proteins Structure Function and GeneticsVolume 53 Issue S6 Pages 352 - 368 Supplement Fifth Meeting on the Critical Assessment of Techniques for Protein Structure Prediction Published Online 15 Oct 2003 Received 4 March 2003 Accepted 23 June 2003 Digital Object Identifier (DOI) 101002prot10543 Comp Modeling Assessment Assessment of homology-based predictions in CASP5 Anna Tramontano 1 Veronica Morea 2

1Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Rome Italy 2CNR Institute of Molecular Biology and Pathology University of Rome La Sapienza Rome Italy email Anna Tramontano (AnnaTramontanouniromalit) Correspondence to Anna Tramontano Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Ple Aldo Moro 5-00185 Rome Italy hellipPage 1 Assessment of Homology-Based Predictions in CASP5 Anna Tramontano 1 and Veronica Morea 2 1 Department of Biochemical Sciences American Journal of HematologyVolume 73 Issue 3 Pages 161 - 168 Published Online 20 Jun 2003 Received 23 October 2002 Accepted 15 April 2003 Digital Object Identifier (DOI) 101002ajh10358

Spectrum of thalassemia mutations and HbF levels in the heterozygous Moroccan population Wafaa Lemsaddek 1 Isabel Picanccedilo 2 Filomena Seuanes 2 Lahoucine Mahmal 3 Saacircd Benchekroun 3 Mohammed Khattab 4 Paulo Nogueira 5 Leonor Osoacuterio-Almeida 1

1Laboratoacuterio de Geneacutetica Molecular Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa Caparica Portugal 2Laboratoacuterio de Hematologia Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal 3Service Heacutemato-Oncologie Hocircpital 20 Ao t CHU Ibn Rochd Casablanca Morocco 4Service Heacutemato-Oncologie Peacutediatrique Hocircpital dEnfants CHU Rabat Morocco 5Observatoacuterio Nacional de Sauacutede Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal email Leonor Osoacuterio-Almeida (mlofctunlpt) Correspondence to Leonor Osoacuterio-Almeida Laboratoacuterio de Geneacutetica Molecular Secccedilatildeo Autoacutenoma de Biotecnologia Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa 2829-516 Caparica Portugal hellip Page 1 Spectrum of Thalassemia Mutations and HbF Levels in the Heterozygous Moroccan Population Wafaa Lemsaddek 1 Isabel Picanccedilo Lecture Notes in Computer Science Publisher Springer-Verlag Heidelberg ISSN 0302-9743 Subject Computer Science Volume 2812 2003 Title Algorithms in Bioinformatics Third International Workshop WABI 2003 Budapest Hungary September 15-20 2003 Proceedings Editors Gary Benson Roderic Page ISBN 3-540-20076-2 DOI 101007b13243 Chapter pp 124 - 138 Online Date December 2003 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron1 and Irena Rusu1

(1) IRIN Universiteacute de Nantes 2 Rue de la Houssiniegravere BP 92208 44322 Nantes Cedex 3 France Page 1 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron and Irena Rusu Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Received 23 October 2003 accepted 15 December 2003 Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes

H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Brain Aging Vol 2 No 3 2002 Page 9-22 Long-term Efficacy of Cholinesterase Inhibitors Serge Gauthier McGill Centre for Studies in Aging Quebec Canada Correspondence Dr Serge Gauthier FRCPC McGill Centre for Studies in Aging 6825 LaSalle Boulevard Verdun Quebec Canada H4H 1R3 Tel +1 514 766 2010 Fax +1 514 888 4050 Email sergegauthiermcgillca hellipAdditionally protein sequence analysis was performed using the following programs at the ExPASyndashwwwndashserver (httpwwwexpasych) softberry httpwwwsoftberrycomindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorgtoolsaacomp Plant Physiology December 2003 Vol 133 pp 2040ndash2047 Generation and Analysis of End Sequence Database for T-DNA Tagging Lines in Rice1 Suyoung An2 Sunhee Park2 Dong-Hoon Jeong Dong-Yeon Lee Hong-Gyu Kang Jung-Hwa Yu Junghe Hur Sung-Ryul Kim Young-Hea Kim Miok Lee Soonki Han Soo-Jin Kim Jungwon Yang Eunjoo Kim Soo Jin Wi Hoo Sun Chung Jong-Pil Hong Vitnary Choe Hak-Kyung Lee Jung-Hee Choi Jongmin Nam Seong-Ryong Kim Phun-Bum Park Ky Young Park Woo Taek Kim Sunghwa Choe Chin-Bum Lee and Gynheung An National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790ndash784 Korea (SA SP D-HJ D-YL H-GK J-HY JH S-RK Y-HK ML GA) Department of Life Science Sogang University Seoul 121ndash742 Korea (SH S-JK S-RK) Department of Genetic Engineering Suwon University Suwon 445ndash743 Korea (JY EK P-BP) Department of Biology Sunchon National University Sunchon 540ndash742 Korea (SJW KYP) Department of

Biology Yonsei University Seoul 120ndash749 Korea (HSC J-PH WTK) Department of Biology Seoul National University Seoul 151ndash747 Korea (VC SC) Department of Biology Dong-eui University Pusan 614ndash714 Korea (H-KL J-HC C-BL) and Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University 208 Mueller Laboratory University Park Pennsylvania 16802 (JN) hellipIf a particular sequence had not yet been annotated in the public database the sequence surrounding the insertion site was annotated using the Softberry program (httpwwwsoftberrycom) and the GeneMark program (httpopalbiologygatecheduGeneMark) Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie F Moehrlen ndash Page 1 Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie Frank Moumlhrlen Heidelberg 2002 Page 2 INAUGURAL-DISSERTATION zur heidiubuni-heidelbergde

  • FGENES
  • FGENESH
  • FGENESH++
    • Genome Research 15566-576 2005
    • ECgene Genome-based EST clustering and gene modeling for alternative splicing
    • Genome Research 14685-692 2004
    • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
    • Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes
    • Genome Research
      • Computing Center Academia Sinica Taipei 11529 Taiwan
        • FGENESH+
          • Genome Research 14685-692 2004
          • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
            • Computing Center Academia Sinica Taipei 11529 Taiwan
                • FGENESB
                  • Proteorhodopsin genes are distributed among divergent marine bacterial taxa
                  • Different SAR86 subgroups harbour divergent proteorhodopsins
                    • FGENESV
                      • Genome Organization of the SARS-CoV
                        • FGENES-M
                        • BESTORF
                        • PROTCOMP
                          • A proteomic study of the arabidopsis nuclear matrix
                          • Gene prediction in eukaryota
                            • BPROM
                            • SPLICEDB
                              • The evolving roles of alternative splicing
                              • SpliceDB database of canonical and non-canonical mammalian splice sites
                                • SCAN2
                                  • PromH promoters identification using orthologous genomic sequences
                                    • PDISORDER
                                    • SPL
                                    • NSITE
                                    • TSSP
                                    • PLANTPROM
                                      • Plant promoter prediction with confidence estimation
                                      • PlantProm a database of plant promoter sequences
                                        • PROMH
                                        • Other FGENESH
                                          • TAG Theoretical and Applied Genetics
                                            • MIPS analysis and annotation of proteins from whole genomes
                                              • Gene expression of a gene family in maize based on noncollinear haplotypes
                                              • TAG Theoretical and Applied Genetics
                                              • Genome annotation techniques new approaches and challenges
                                                • Various programs
                                                  • Characterizing the new transcription regulator protein p60TRP
                                                  • The Ensembl Core Software Libraries
                                                  • Lecture Notes in Computer Science
                                                  • Long-term Efficacy of Cholinesterase Inhibitors
Page 3: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)

A Raas-Rothschild R Bargal O Goldman E Ben-Asher J E M Groener A Toutain E Stemmer Z Ben-Neriah H Flusser F A Beemer M Penttinen T Olender A J J T Rein G Bach and M Zeigler

1 1 1 2 3 4

1 1 5 6 7 2 8

1 11 Department of Human Genetics Hadassah University Medical Center Jerusalem Israel 2 Department of Molecular Genetics Weizmann Institute of Science Rehovot Israel 3 Departments of Pediatrics and Clinical Genetics Leiden University Medical Center Leiden The Netherlands 4 Service de Geacuteneacutetique CHU de Tours Hocircpital Bretonneau Tours France 5 Child Development Unit Soroka University Hospital Beer Sheva Israel 6 Department of Medical Genetics University Medical Center Utrecht The Netherlands 7 Clinical Genetics Unit Turku University Central Hospital Turku Finland 8 Division of Pediatric Cardiology Hadassah University Medical Center Jerusalem Israel Correspondence to A Raas-Rothschild MD Department of Human Genetics Hadassah University Hospital Jerusalem 91120 Israel annickmdhujiacil third intron is 9 kb This structure was consistent with the exon prediction of fgenes and Genscan Each splice donor and acceptor

FGENESH European Journal of Plant Pathology Issue Volume 112 Number 1 Date May 2005 Pages 23 - 29 DOI 101007s10658-004-7088-7 Leptosphaeria maculans a fungal pathogen of Brassica napus secretes a subtilisin-like serine protease Leanne M Wilson1 and Barbara J Howlett1

1 School of Botany The University of Melbourne Parkville Victoria 3010 Australia DNA and cDNA sequences were compared to identify intron positions which confirmed those predicted by FGENESH gene prediction software (wwwsoftberrycom) Current Genetics Issue Volume 47 Number 5 Date May 2005 Pages 307 ndash 315 DOI 101007s00294-004-0559-8 During attachment Phytophthora spores secrete proteins containing thrombospondin type 1 repeats Andrea V Robold1 and Adrienne R Hardham1

(1) Plant Cell Biology Group Research School of Biological Sciences The Australian National University Canberra ACT 2601 Australia infohtml) The DNA sequence was searched for introns using the soft- ware program FGENESH (httpwwwsoftberry comberryphtml Microbiology 151 (2005) 1499-1505 DOI 101099mic027759-0 Overproduction purification and characterization of FgaPT2 a dimethylallyltryptophan synthase from Aspergillus fumigatus Inge A Unsoumlld and Shu-Ming Li Pharmazeutische Biologie Pharmazeutisches Institut Eberhard-Karls-Universitaumlt Tuumlbingen Auf der Morgenstelle 8 72076 Tuumlbingen Germany Correspondence Shu-Ming Li shumingliuni-tuebingende FGENESH (Softberry wwwsoftberrycomberryphtml) and the DNASIS software package (version 21 Hitachi Software Engineering) were used for intron prediction Eukaryotic Cell March 2005 p 526-535 Vol 4 No 3 doi101128EC43526-5352005 Sex-Specific Homeodomain Proteins Sxi1 and Sxi2a Coordinately Regulate Sexual Development in Cryptococcus neoformans Christina M Hull1 Marie-Josee Boily and Joseph Heitman 1 12

Department of Molecular Genetics and Microbiology1 the Howard Hughes Medical Institute Duke University Medical Center Durham North Carolina2 Received 14 January 2005 Accepted 14 January 2005 Corresponding author Mailing address 322 CARL Building Box 3546 Department of Molecular Genetics and Microbiology 100 Research Dr Duke University Medical Center Durham NC 27710 Phone (919) 684-2824 Fax (919) 684-5458 E-mail heitm001dukeedu Present address Departments of Biomolecular Chemistry and Medical Microbiology amp

Immunology University of WisconsinmdashMadison Madison WI 53706

Sequence manipulations Splice predictions of candidate gene sequences for SXI2a were facilitated with a Softberry algorithm (wwwsoftberrycom)

Plant Mol Biol 2005 Feb57(3)445-60 Evaluation of five ab initio gene prediction programs for the discovery of maize genes Yao H Guo L Fu Y Borsuk LA Wen TJ Skibbe DS Cui X Scheffler BE Cao J Emrich SJ Ashlock DA Schnable PS Department of Genetics Development and Cell Biology Iowa State University Ames 50011-3650 USA hellipThe five programs were used in conjunction with RT-PCR to identify and establish the structures of two new genes in the a1-sh2 interval of the maize genome FGENESH GeneMarkhmm and GENSCAN were tested on a larger data set consisting of maize assembled genomic islands (MAGIs) that had been aligned to ESTs FGENESH GeneMarkhmm and GENSCAN correctly predicted gene models in 773 625 and 371 MAGIs respectively out of the 1353 MAGIs that comprise data set 2hellip New Phytologist 167 (1) July 2005 239-247 doi101111j1469-8137200501392x Identification of perennial ryegrass (Lolium perenne (L)) and meadow fescue (Festuca pratensis (Huds)) candidate orthologous sequences to the rice Hd1(Se1) and barley HvCO1 CONSTANS-like genes through comparative mapping and microsynteny P Armstead L Skoslasht L B Turner K Skoslasht I S Donnison M O Humphreys and I P King Author for correspondence Ian Armstead Tel +44 (0)1970 823108 Fax +44 (0)1970 823242 Email ianarmsteadbbsrcacuk Predictions of mRNA and protein sequences were carried out using fgenesh and fgenesh+ software ( http wwwsoftberrycom berryphtml ) Plant Physiology May 2005 Vol 138 pp 38-46 BIOINFORMATICS-PLANT DATABASES Databases and Information Integration for the Medicago truncatula Genome and Transcriptome1 Steven B Cannon John A Crow Michael L Heuer Xiaohong Wang Ethalinda KS Cannon Christopher Dwan Anne-Francoise Lamblin Jayprakash Vasdewani Joann Mudge Andrew Cook John Gish Foo Cheung Steve Kenton Timothy M Kunau Douglas Brown Gregory D May Dongjin Kim Douglas R Cook Bruce A Roe Chris D Town Nevin D Young and Ernest F Retzel

2 3

Department of Plant Pathology University of Minnesota St Paul Minnesota 55108 (SBC XW EKSC JV JM NDY) Center for Computational Genomics and Bioinformatics University of Minnesota Minneapolis Minnesota 55455 (JAC MLH CD AFL TMK

EFR) Department of Plant Pathology University of California Davis California 95616 (AC JG DJK DRC) The Institute for Genomic Research Rockville Maryland 20850 (FC CDT) Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma 73019 (BAR SK) Plant Biology Division The Samuel Roberts Noble Foundation Ardmore Oklahoma 73401 (GDM) and North Carolina State University Fungal Genomics Laboratory Department of Plant Pathology Raleigh North Carolina 27695 (DB) 1 This work was supported by the National Science Foundation (awards DBIndash0321460 DBIndash0196197 DBIndash0110206 DBIndash9975806 and DBIndash9872565) by the US Department of Agriculture Cooperative State Research Education and Extension ServiceNational Research Initiative Program and by the Samuel Noble Roberts Foundation 2 Present address BioTeam Inc Cambridge MA 3 Present address University of Minnesota Cancer Center MMC 806 420 Delaware St SE Minneapolis MN 55455 wwwplantphysiolorgcgidoi101104pp104059204 Corresponding author e-mail ernestccgbumnedu fax 612ndash626ndash6069 Received December 31 2004 returned for revision March 4 2005 accepted March 21 2005 Annotation involves a multi-institution pipeline relying on Medicago-trained FGENESH (Salamov and Solovyev 2000 ) predictions the EuGene (Foissac et al Plant Physiology May 2005 Vol 138 pp 18-26 BIOINFORMATICS-PLANT DATABASES The Institute for Genomic Research Osa1 Rice Genome Annotation Database1 Qiaoping Yuan Shu Ouyang Aihui Wang Wei Zhu Rama Maiti Haining Lin John Hamilton Brian Haas Razvan Sultana Foo Cheung Jennifer Wortman and C Robin Buell

2

The Institute for Genomic Research Rockville Maryland 20850 1 This work (on rice genome annotation) was supported by the National Science Foundation (grant no DBIndash0321538 to CRB) and the US Department of Agriculture (grant no 2003ndash35317ndash13173 to CRB) 2 Present address Laboratory of Neurogenetics NIAAA NIH 5625 Fishers Lane Suite 3532 MSC 9412 Bethesda MD 20892 wwwplantphysiolorgcgidoi101104pp104059063 Corresponding author e-mail rbuelltigrorg fax 301ndash838ndash0208 Received December 31 2004 returned for revision February 24 2005 accepted March 21 2005 The ab initio gene finders used in the rice EGC pipeline include FGENESH (monocot matrix Salamov and Solovyev 2000 ) GeneMarkhmm (rice matrix Lukashin and Genome Research 15577-582 2005 Closing in on the C elegans ORFeome by cloning TWINSCAN predictions Chaochun Wei Philippe Lamesch Manimozhiyan Arumugam Jennifer Rosenberg Ping Hu Marc Vidal and Michael R Brent

1 2 1 2

1 2 13

1 Laboratory for Computational Genomics and Department of Computer Science and Engineering Washington University St Louis Missouri 63130 USA 2 Center for Cancer Systems Biology Dana-Farber Cancer Institute and Department of Genetics Harvard Medical School Boston Massachusetts 02115 USA

3 Corresponding author E-mail brentcsewustledu fax (314) 935-7302 Finally we compared TWINSCAN with two other gene-prediction systems that have recently been developed for nematodesndashFGENESH (Salamov and Solovyev 2000 Plant Physiology April 2005 Vol 137 pp 1174-1181 UPDATE ON SEQUENCING MEDICAGO TRUNCATULA AND LOTUS JAPONICUS Sequencing the Genespaces of Medicago truncatula and Lotus japonicus1 Nevin D Young Steven B Cannon Shusei Sato Dongjin Kim Douglas R Cook Chris D Town Bruce A Roe and Satoshi Tabata Department of Plant Pathology University of Minnesota St Paul Minnesota 55108 (NDY SBC) Kazusa DNA Research Institute Kisarazu Chiba 292ndash0818 Japan (SS ST) Department of Plant Pathology University of California Davis California 95616 (DRC DK) The Institute for Genomic Research Rockville Maryland 20850 (CDT) and Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma 73019 (BAR) 1 The US component of the Medicago truncatula sequencing effort was initially supported by a grant from the Samuel Roberts Noble Foundation to BAR Current support comes from National Science Foundation Plant Genome Research Program (grant no 0110206 to DRC DK CDT and NDY and grant no 0321460 to NDY BAR and CDT) Funding for Lotus japonicus sequencing comes from the Kazusa DNA Research Institute Foundation wwwplantphysiolorgcgidoi101104pp104057034 Corresponding author e-mail nevinyumnedu fax 612ndash625ndash9728 hellip (These estimates are based on Fgenesh predictions [Salamov and Solovyev 2000 ] using a Mt-trained matrix retaining peptides with a BLASTP match at 10endash4 to the UniProt NREF100 database of peptides [Apweiler et al 2004 ] This estimate for Lj differs from the published value of 1 gene per 101 kb [Asamizu et al 2003a ] due to the use here of the Fgenesh gene-calling algorithm so Mt and Lj could be compared directly)hellip hellipThis estimate increases to 6500 when Lj genes are predicted by the Mt-trained Fgenesh algorithm described earlierhellip International Journal of CancerVolume 109 Issue 1 Pages 71 - 75Cancer Genetics Candidate regions of tumor suppressor locus on chromosome 9q311 in gastric cancer Naoto Kakinuma 1 Kazuyoshi Kohu 1 2 Masaaki Sato 1 Tatsuya Yamada 3 Motowo Nakajima 1 Tetsu Akiyama 2 Susumu Ohwada 3 Yasuhiko Shibanaka 1

1Novartis Pharma Tsukuba Research Institute Ibaraki Japan 2Laboratory of Molecular and Genetic Information Institute for Molecular and Cellular Biosciences University of Tokyo Tokyo Japan 3Second Department of Surgery Gunma University School of Medicine Gunma Japan email Yasuhiko Shibanaka (yasuhikoshibanakapharmanovartiscom) Correspondence to Yasuhiko Shibanaka Novartis Pharma Tsukuba Research Institute Ohkubo 8 Tsukuba-shi Ibaraki 300-2611 Japan Fax +81-29-865-2281

predict the genes between D9S277 and D9S127 in 9q311 the gene prediction tools also in the UCSC Genome Browser having Acembly Ensembl Fgenesh GenScan and Genome Research 1554-66 2005 Gene and alternative splicing annotation with AIR Liliana Florea145 Valentina Di Francesco2 Jason Miller1 Russell Turner1 Alison Yao2 Michael Harris2 Brian Walenz1 Clark Mobarry1 Gennady V Merkulov3 Rosane Charlab3 Ian Dew1 Zuoming Deng3 Sorin Istrail1 Peter Li2 and Granger Sutton1 1 Informatics Research Applied Biosystems Rockville Maryland 20850 USA 2 Advanced Solutions Celera Genomics Rockville Maryland 20850 USA 3 Scientific Content and Applications Celera Genomics Rockville Maryland 20850 USA Ab initio prediction programs such as GenScan (Burge and Karlin 1997 ) FGenesH (Salamov and Solovyev 2000 ) Genie (Kulp et al Nucleic Acids Research 2005 Vol 33 Database issue D399-D402 SilkDB a knowledgebase for silkworm biology and genomics Jing Wang Qingyou Xia Ximiao He Mingtao Dai Jue Ruan Jie Chen Guo Yu Haifeng Yuan Yafeng Hu Ruiqiang Li Tao Feng Chen Ye Cheng Lu Jun Wang Songgang Li Gane Ka-Shu Wong Huanming Yang Jian Wang Zhonghuai Xiang Zeyang Zhou and Jun Yu

1 2 34 3 34 3 3

3 3 3 3 3 2 135

1 36 35 35 2

2 35

1 College of Life Sciences Peking University Beijing 100871 China 2 The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China 3 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 4 Graduate School of the Chinese Academy of Sciences Yuquan Road 19A Beijing 100039 China 5 Key Laboratory of Bioinformatics of Zhejiang Province Hangzhou Genomics Institute James D Watson Institute of Genome Sciences of Zhejiang University Hangzhou 310007 China and 6 Department of Medicine University of Washington Genome Center Seattle WA 98195 USA To whom correspondence should be addressed Tel +86 10 80481455 Fax +86 10 80498676 Email junyugenomicsorgcn Correspondence may also be addressed to Zeyang Zhou Tel +86 23 68251123 Fax +86 23 68251128 Email zyzhouswaucqcn The authors wish it to be known that in their opinion the first three authors should be regarded as joint First Authors BGF is a self-developed ab initio program based on GenScan (9) and FgeneSH (10) and was successfully utilized for our rice genome annotation (11) Clinical Cancer Research Vol 11 4029-4036 June 1 2005 Imaging Diagnosis Prognosis A Molecular Signature in Superficial Bladder Carcinoma Predicts Clinical Outcome Lars Dyrskjoslasht Karsten Zieger Mogens Kruhoslashffer Thomas Thykjaer Jens L Jensen Hanne Primdahl Natasha Aziz Niels Marcussen Klaus Moslashller and Torben F Oslashrntoft

1 12 15 15 4

1 6 3 2 1

Authors Affiliations 1 Molecular Diagnostic Laboratory Department of Clinical Biochemistry 2 Department of Urology and 3 University Institute of Pathology Aarhus University Hospital 4 Departments of Theoretical Statistics

and Mathematical Sciences University of Aarhus 5 Aros Applied Biotechnology Aarhus Denmark and 6 Eos BiotechnologyProtein Design Labs Fremont California Requests for reprints Torben F Oslashrntoft Molecular Diagnostic Laboratory Department of Clinical Biochemistry Aarhus University Hospital Skejby DK-8200 Aarhus N Denmark Phone 45-89495100 Fax 45-89496018 E-mail orntoftkiaudk array comprising 59619 probe sets representing 46000 unique sequences including known genes expressed sequence tag clusters and FGENESH-predicted exons BMC Evolutionary Biology 2005 51 doi1011861471-2148-5-1 Research article The WRKY transcription factor superfamily its origin in eukaryotes and expansion in plants Yuanji Zhang and Liangjiang Wang Address Plant Biology Division The Samuel Roberts Noble Foundation Ardmore OK 73402 USA Email Yuanji Zhang - yjzhangnobleorg Liangjiang Wang - Kevinlwangaolcom Corresponding author Despite minor differences in the gene structure prediction both gene prediction programs FGENESH and GENSCAN agree on the major features of the protein Nature Immunology 2005 v6 n3 The immunoglobulin heavy-chain locus in zebrafish identification and expression of a previously hellip N Danilova J Bussmann K Jekosch LA Steiner - - naturecom Nature Immunology Full text access provided to Googlebot Access by Web Services PLoS Biol 2005 June 3(6) e181 Published online 2005 May 24 doi 101371journalpbio0030181 RAG1 Core and V(D)J Recombination Signal Sequences Were Derived from Transib Transposons Vladimir V Kapitonov 1 and Jerzy Jurka 1

1Genetic Information Research Institute Mountain View California United States of America David Nemazee Academic EditorScripps Research Institute United States of America

Corresponding author Vladimir V Kapitonov vladimirgirinstorg Jerzy Jurka jurkagirinstorg Using FGENESH [33] we detected that the RAG1 corendashlike open reading frame (ORF) in the contig 29068 forms a terminal exon (positions 1154ndash2947) of an Genetics Published Articles Ahead of Print published on February 16 2005 as 101534genetics104036327 Identification and Characterization of Regions of the Rice Genome Associated with Broad- Spectrum Quantitative Disease Resistance Randall J Wisser Qi Sundagger Scot H Hulbertsect Stephen Kresovich and Rebecca J Nelsondaggerdagger 1 Department of Plant Breeding and Genetics Institute for Genomic Diversity Cornell University Ithaca New York 14853 daggerComputational Biology Service Unit Cornell Theory

Center Cornell University Ithaca New York 14853 sectDepartment of Plant Pathology Kansas State University Manhattan Kansas 66506 and daggerdaggerDepartment of Plant Pathology Cornell University Ithaca New York 14853 1Corresponding author Rebecca J Nelson Department of Plant Pathology Cornell University 321 Plant Science Ithaca NY 14853 Email rjn7cornelledu GENSCAN (B URGE and K ARLIN 1997) and FGENESH (S ALAMOV and S OLOVYEV 2001) to predict open reading frames Further searches against Plant Physiology January 2005 Vol 137 pp 176-189 Annotations and Functional Analyses of the Rice WRKY Gene Superfamily Reveal Positive and Negative Regulators of Abscisic Acid Signaling in Aleurone Cells1[w]

Zhen Xie Zhong-Lin Zhang Xiaolu Zou Jie Huang Paul Ruas Daniel Thompson and Qingxi J Shen

2 23

Department of Biological Sciences University of Nevada Las Vegas Nevada 891541 This work was supported by the US Department of Agriculture (grant no 02ndash35301ndash12066) by the National Institutes of Health (Biomedical Research Infrastructure Network seed grant no P20 RR16464) and by the University of Las Vegas Nevada (start-up funds to QJS) XZ was supported by a National Science Foundation Experimental Program to Stimulate Competitive

Research (EPSCoR) Integrative Approaches to Abiotic Stress (EPSndash0132556) graduate assistantship and Z-LZ by a National Science Foundation EPSCoR Advanced Computing in Environmental Sciences postdoctoral fellowship 2 These authors contributed equally to the paper 3 Present address Department of Plant and Microbial Biology University of California Berkeley CA 94720 and Plant Gene Expression Center US Department of Agriculture Albany CA 94710 [w] The online version of this article contains Web-only data First of all three genes (OsWRKY41 -43 and -44) were reannotated using FGENESH (wwwsoftberrycom) because the first introns of these genes were too small PLoS Biol 3(6) e181 (2005) RAG1 Core and V(D)J Recombination Signal Sequences Were Derived from Transib Transposons Vladimir V Kapitonov1 Jerzy Jurka1 1 Genetic Information Research Institute Mountain View California United States of America To whom correspondence should be addressed E-mail vladimirgirinstorg (VVK) E-mail jurkagirinstorg (JJ) Using FGENESH [33] we detected that the RAG1 corendashlike open reading frame (ORF) in the contig 29068 forms a terminal exon (positions 1154ndash2947) of an Plant and Cell Physiology 2005 46(1)3-13 doi101093pcppci503 From Mapping to Sequencing Post-sequencing and Beyond Takuji Sasaki Takashi Matsumoto Baltazar A Antonio and Yoshiaki Nagamura 1

National Institute of Agrobiological Sciences 2-1-2 Kannondai Tsukuba Ibaraki 305-8602 Japan 1 Corresponding author E-mail tsasakiniasaffrcgojp

The gene predictions by programs such as Genescan (Burge and Karlin 1997 ) FGENESH [see Appendix 1 (4)] and Genemark [see Appendix 1 (5)] BLAST (Altschul et Improving the nutritional value of Golden Rice through increased pro-vitamin A content JA Paine CA Shipton S Chaggar RM Howells MJ hellip - Nature Biotechnology 2005 - naturecom Arabidopsis thaliana psy and rice psy (AY024351) genes identified genomic sequences of similarity in which genes were predicted using FGENESH algorithm with Genetics Published Articles Ahead of Print published on January 16 2005 as 101534genetics104035543 THE GENETIC BASIS FOR INFLORESCENCE VARIATION BETWEEN FOXTAIL AND GREEN MILLET (POACEAE) Andrew N Doust Katrien M Devosdagger1 Mike D Gadberry Mike D Galedagger amp Elizabeth A Kellogg University of Missouri-St Louis Department of Biology One University Boulevard St Louis MO 63121 USA daggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH UK Current address University of Georgia-Athens Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building Athens GA 30602 USA 1 Each of these contigs was scanned using FgeneSH (S ALAMOV and S OLOVYEV 2000) and open reading frames (ORFs) were translated and PLoS Biol 3(1) e13 January 2005

Sorghum Genome Sequencing by Methylation Filtration Joseph A Bedell1 Muhammad A Budiman2 Andrew Nunberg1 Robert W Citek1 Dan Robbins1 Joshua Jones2 Elizabeth Flick2 Theresa Rohlfing3 Jason Fries3 Kourtney Bradford3 Jennifer McMenamy3 Michael Smith4 Heather Holeman4 Bruce A Roe5 Graham Wiley5 Ian F Korf6 Pablo D Rabinowicz7 Nathan Lakey8 W Richard McCombie9 Jeffrey A Jeddeloh4 Robert A Martienssen9 1 Bioinformatics Orion Genomics Saint Louis Missouri United States of America 2 Library Construction Orion Genomics Saint Louis Missouri United States of America 3 Sequencing Orion Genomics Saint Louis Missouri United States of America 4 Biomarkers Orion Genomics Saint Louis Missouri United States of America 5 Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma United States of America 6 Genome Center University of California Davis California United States of America 7 The Institute for Genomic Research Rockville Maryland United States of America 8 Business Orion Genomics Saint Louis Missouri United States of America 9 Cold Spring Harbor Laboratory Cold Spring Harbor New York United States of America additional parameters wordmask=seg lcmask M=1 N=ndash1 Q=3 R=3 kap E=1e-10 hspmax=0 To look for potentially novel genes we used Fgenesh (httpwww BMC Genomics 2005 611 doi1011861471-2164-6-11 FAM20 an evolutionarily conserved family of secreted proteins expressed in hematopoietic cells Demet Nalbant1 Hyewon Youn1 3 S Isil Nalbant1 Savitha Sharma1 Everardo Cobos2 3 Elmus G Beale1 Yang Du1 and Simon C Williams1 3 1Department of Cell Biology and Biochemistry Texas Tech University Health Sciences Center

Lubbock Texas 79430 USA 2Department of Internal Medicine Texas Tech University Health Sciences Center Lubbock Texas 79430 USA 3Southwest Cancer Center at University Medical Center Lubbock Texas 79430 USA These results were compared against genes assembled by two gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmit Plant Physiology July 2005 Vol 138 pp 1205-1215 Complex Organization and Evolution of the Tomato Pericentromeric Region at the FER Gene Locus1[w] Romain Guyot Xudong Cheng Yan Su Zhukuan Cheng Edith Schlagenhauf Beat Keller and Hong-Qing Ling

2 2

Institute of Plant Biology University of Zurich 8008 Zurich Switzerland (RG ES BK H-QL) and Institute of Genetics and Developmental Biology Chinese Academy of Sciences Chaoyang District Beijing 100101 China (XC YS ZC H-QL) Putative genes were determined by a combination of coding region prediction software (GENSCAN FGENESH and MZEF with Arabidopsis andor monocot matrix J Gen Virol 86 (2005) 973-983 DOI 101099vir080833-0 Cloning characterization and analysis by RNA interference of various genes of the Chelonus inanitus polydnavirus Marianne Bonvin Dorothee Marti Stefan Wyder Dejan Kojic Marc Annaheim and Beatrice Lanzrein Institute of Cell Biology University of Berne Baltzerstrasse 4 CH-3012 Bern Switzerland Correspondence Beatrice Lanzrein beatricelanzreinizbunibech 12g1forw (5-GAGTCCATGCCGAATGTCAC-3) and 12g1rev (5-CTTCTTGCACAGCGACGAAC-3) were set to amplify the middle region of 12g1 as predicted with FGENESH 10 and The Plant Cell 17343-360 (2005) Evolution of DNA Sequence Nonhomologies among Maize Inbreds Stephan Brunner Kevin Fengler Michele Morgante Scott Tingey and Antoni Rafalski a1 a b a a

a DuPont Crop Genetics Research Wilmington Delaware 19880-353 b Universitaacute degli Studi di Udine Dipartimento di Scienze Agrarie ed Ambientali 33100 Udine Italy 1 To whom correspondence should be addressed E-mail stephanbrunnercgrdupontcom fax 302-695-2726 PNAS | February 1 2005 | vol 102 | no 5 | 1566-1571 Published online before print January 24 2005 101073pnas0409421102 A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome

Mark Yandell Adina M Bailey Sima Misra ShengQiang Shu Colin Wiel Martha Evans-Holm Susan E Celniker and Gerald M Rubin para para Howard Hughes Medical Institute and Department of Molecular and Cell Biology University of California Life Sciences Addition Berkeley CA 94720-3200 and paraDepartment of Genome Sciences Lawrence Berkeley National Laboratory One Cyclotron Road Mailstop 64-121 Berkeley CA 94720 genes based on a microarray-based approach that involved hybridizing randomly primed cDNA against probes corresponding to a large set of FGENESH predictions Insect Molecular Biology Volume 14 Issue 2 Page 113 - 119 April 2005 doi101111j1365-2583200400536x Detection and analysis of alternative splicing in the silkworm by aligning expressed sequence tags with the genomic sequence X-F Zha Q-Y Xia P Zhao J Li J Duan Z-L Wang J-F Qian and Z-H Xiang Correspondence Dr Qing-You Xia The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China Tel +86 23 68250748 fax +86 23 68251128 e-mail xiaqyswaucqcn previously predicted silkworm genes in the genomic sequences by BGF a newly developed program based on GENSCAN (Burge amp Karlin 1997) and Fgenesh (Salamov amp Microbiology 151 (2005) 2199-2207 DOI 101099mic027962-0 Overproduction purification and characterization of FtmPT1 a brevianamide F prenyltransferase from Aspergillus fumigatus Alexander Grundmann and Shu-Ming Li Pharmazeutische Biologie Pharmazeutisches Institut Eberhard-Karls-Universitaumlt Tuumlbingen Auf der Morgenstelle 8 72076 Tuumlbingen Germany Correspondence Shu-Ming Li shumingliuni-tuebingende FGENESH (Softberry Inc httpwwwsoftberrycomberryphtml) and the DNASIS software package (version 21 Hitachi Software Engineering) were used for

FGENESH++ Am J Hum Genet 76652-662 2005 Position Effects Due to Chromosome Breakpoints that Map 900 Kb Upstream and 13 Mb

Downstream of SOX9 in Two Patients with Campomelic Dysplasia

Gopalrao V N Velagaleti12 Gabriel A Bien-Willner3 Jill K Northup1 Lillian H Lockhart2

Judy C Hawkins2 Syed M Jalal6 Marjorie Withers3 James R Lupski345 and

Pawel Stankiewicz3 Departments of 1Pathology and 2Pediatrics University of Texas Medical Branch Galveston

Departments of 3Molecular and Human Genetics and 4Pediatrics Baylor College of Medicine and

5Texas Childrens Hospital Houston and 6Department of Laboratory Medicine and Pathology Mayo Clinic Rochester MN hellipIn an effort to identify possible transcripts that may be responsible for the CD phenotype we

used several gene-prediction programs and identified seven hypothetical transcripts in the region

that spans 100 kb in either direction from the breakpoint on chromosome 17 Ecgenes H17C123061 and H17C123081 SGP genes Chr17_15381 and Ch17_15391 Fgenesh++ gene C17001650 and Genscan genes NT_01064144 and NT_01064145hellip Genome Research 15566-576 2005 ECgene Genome-based EST clustering and gene modeling for alternative splicing Namshin Kim Seokmin Shin and Sanghyuk Lee12 2 13 1 Division of Molecular Life Sciences Ewha Womans University Seoul 120-750 Korea 2 School of Chemistry Seoul National University Seoul 151-747 Korea hellipthe structure of full-length mRNA can be inferred by examining the flanking genomic region especially with the aid of ab initio gene predicting programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh++ (Salamov and Solovyev 2000 )hellip Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA hellip we built a three-way synteny map based on chains of Fgenesh++-predicted (Solovyev 2002 ) exons rather than whole genes hellip Genome Research 14539-548 2004 Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of

Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA hellipThe gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom )hellip Nucleic Acids Research 2003 Vol 31 No 1 207-211 copy 2003 Oxford University Press The PEDANT genome database Dmitrij Frishman Martin Mokrejs Denis Kosykh Gabi Kastenmuumlller Grigory Kolesov Igor Zubrzycki Christian Gruber Birgitta Geier Andreas Kaps Kaj Albermann Andreas Volz Christian Wagner Matthias Fellenberg Klaus Heumann and Hans-Werner Mewes

1 1 1 1 1

1 2 2 2 2

2 2 2 2 13

1 Institute for Bioinformatics GSF - National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 85764 Neueherberg Germany 2 Biomax Informatics AG Lochhamer Straszlige 11 82152 Martinsried Germany 3 Department of Genome-oriented Bioinformatics Wissenschaftszentrum Weihenstephan Technische Universitaumlt Muumlnchen 85350 Freising Germany To whom correspondence should be addressed Tel +49 89 31874201 Fax +49 89 31873585 Email dfrishmangsfde The mouse database contains 20 chromosome contigs with 37 793 genes predicted using the Fgenesh++ software (wwwsoftberrycom) Reprint from Daily Biotech Updates www genengnewscom Vol 22 No 17 October 1 2002 DrugDiscovery Tech NoteAn Enhanced Human-Genome Database Transforming Raw Human Sequence Data Into Useful Information Christine Schuumlller PhD and Andreas Fritz PhD The Softberry analysis results for which Biomax has the exclusive world-wide commercial license contain approximately 40000 genes which agrees well with predictions of the total number of human genes (according to the International Human Genome Sequencing Consortium or IHGSC) hellip For example 50 of the genes in the Biomax Human Genome Database are not found in the Ensembl database These genes (identified by FGENESH++ and Biomax and not found in Ensembl database) comprise the following 6 of genes classified as known genes 50 classified as having some similarity to known genes and 90 of the genes not having similarity to known genes

For human genome applications the FGENESH++ software was first used to map known human genes using sequences available from the Reference Sequence (RefSeq) Project at the Nation al Center for Biotechnology Information (NCBI Bethesda MD wwwncbinlm nihgovLocusLinkrefseqhtml) REFERENCES 1 Salamov AA and Solovyev VVAb initio gene finding in Drosophila genomic DNA Genome Res 10 391ndash7 (2000) Genome Research 14539-548 2004 ISSN 1088-9051 $500 Letter Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA The gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom ) Published online before print June 12 2003 101101gr529803 Genome Research 131765-1774 2003 ISSN 1088-9051 $500 Identification of Promoter Regions in the Human Genome by Using a Retroviral Plasmid Library-Based Functional Reporter Gene Assay Shirin Khambata-Ford15 Yueyi Liu2 Christopher Gleason1 Mark Dickson3 Russ B Altman2 Serafim Batzoglou4 and Richard M Myers136 1 Department of Genetics Stanford University School of Medicine Stanford California 94305 USA 2 Stanford Medical Informatics Stanford University School of Medicine Stanford California 94305 USA 3 Stanford Human Genome Center Stanford University School of Medicine Stanford California 94305 USA 4 Department of Computer Science Stanford University Stanford California 94305 USA hellipOf 858 sequences 9 of GFP+ low clones and 8 of GFP+ high clones aligned to the 2-kb

segment upstream of the transcription start site of a predicted gene in at least two of four data sets of predicted genes from Genscan Ensembl Softberry (Fgenesh++) and Acembly (category

B in Table 1) Cell Vol 110 521ndash529 August 23 2002 Copyright 2002 by Cell Press HIV-1 Integration in the Human Genome Favors Active Genes and Local Hotspots

Astrid RW Schrouml der1 Paul Shinn2 Huaming Chen2 Charles Berry3 Joseph R Ecker2 and Frederic Bushman14 1Infectious Disease Laboratory 2Genomic Analysis Laboratory The Salk Institute 10010 North Torrey Pines Road La Jolla California 92037 3Department of FamilyPreventive Medicine School of Medicine University of California San Diego San Diego California 92093 hellipAn integration target sequence was scored as a part of a transcrip-tion unit if it was (1) a member of the Refseq set of well-studied genes (httpwwwncbinlmnihgovLocusLinkrefseqhtml) or (2) if it was predicted to be a transcription unit by the ENSEMBLE (httpwwwensemblorg) or Fgenesh++ (httpwwwsoftberrycomHelpfgeneshplus2htm) programs and if that assignment was supported by mRNA or spliced EST sequence evidence Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation

FGENESH+ Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA Fgenesh+ gene prediction is conducted on sequences with protein homology Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation Annual Review of Genomics and Human Genetics Vol 3 293-310 (Volume publication date September 2002) (doi101146annurevgenom3030502101529) DATABASES AND TOOLS FOR BROWSING GENOMES Ewan Birney 1 Michele Clamp and 2 Tim Hubbard2 1European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom e-mail birneyebiacuk 2Wellcome Trust Sanger Institute Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom michelesangeracuk thsangeracuk

Another predicted gene track on the UCSC browser comes from Softberry ( http wwwsoftberrycom ) and uses a program Fgenesh+ which is based on HMMs and

FGENESB Appl Environ Microbiol 2004 April 70(4) 2332ndash2341 Oxygen-Controlled Bacterial Growth in the Sponge Suberites domuncula toward a Molecular Understanding of the Symbiotic Relationships between Sponge and Bacteriadagger

Werner E G Muumlller Vladislav A Grebenjuk Narsinh L Thakur Archana N Thakur Renato Batel Anatoli Krasko Isabel M Muumlller and Hans J Breter Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz D-55099 Mainz Germany Corresponding author Mailing address Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz Duesbergweg 6 55099 Mainz Germany Phone 6131-3925910 Fax 6131-3925243 E-mail wmuellermailuni-mainzde For genes and potential promoter prediction we used the FGENESB-PatternMarkov chain-based bacterial operon and gene prediction program from the SoftBerry Journal of Theoretical Biology 230 (2004) 133ndash144 Computational prediction of conserved operons and phylogenetic footprinting of transcription regulatory elements in the metal-reducing bacterial family Geobacteraceae Bin Yana Barbara A Metheґ b Derek R Lovleyc Julia Krushkala aDepartment of Preventive Medicine Center of Genomics and Bioinformatics University of Tennesee Health Science Center 66 N Pauline St Ste 633 Memphis TN 38163 USA bThe Institute for Genomic Research Rockville MD USA cDepartment of Microbiology Morrill Science Center IV North University of Massachusetts 639 North Pleasant Str Amherst MA 01003 USA the conserved nature of the operons 2 Operons in Geobacter sulfurreducens were predicted ab initio by the public version of program FGENESB (V Solovyev and V Molecular Microbiology Volume 52 Issue 6 Page 1579 -1596 June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Copyright copy 2003 The National Academy of Sciences Proc Natl Acad Sci U S A 2003 October 28 100(22) 12830ndash12835 doi 101073pnas2133554100 Published online 2003 October 17 Evolution Proteorhodopsin genes are distributed among divergent marine bacterial taxa Joseacute R de la TorredaggerDagger Lynne M Christiansondagger Oded Beacutejagravedaggersect Marcelino T Suzukidaggerpara David M Karl John Heidelberg and Edward F DeLongdaggerdaggerdagger

daggerMonterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 sectDepartment of Biology Technion-Israel Institute of Technology Haifa 32000 Israel

paraChesapeake Biological Laboratory University of Maryland Solomons MD 20688 Department of Oceanography University of Hawaii Manoa HI 96822 and Institute for Genomic Research Rockville MD 20850 Edited by Sallie W Chisholm Massachusetts Institute of Technology Cambridge MA and approved August 21 2003 (received for review 2003 June 10) DaggerPresent address Department of Civil and Environmental Engineering University of Washington Seattle WA 98195 daggerdagger To whom correspondence should be addressed E-mail delongmbariorg hellip Analysis of the potential genes and protein-coding regions was performed by using a combination of the BLAST (11) GLIMMER 202 (TIGR) (12 13) FGENESB (Softberry Mount Kisco NY) and ARTEMIS (Sanger Center Cambridge University UK) (14) software packages Environmental Microbiology September 2004 vol 6 no 9 pp 903-910(8) DOI 101111j1462-2920200400676x Different SAR86 subgroups harbour divergent proteorhodopsins Gazalah Sabehi1 Oded Beacutejagrave1 Marcelino T Suzuki2 Christina M Preston3 Edward F DeLong4

Affiliations 1 Department of Biology Technion-Israel Institute of Technology Haifa 32000 Israel 2 Chesapeake Biological Laboratory University of Maryland Center for Environmental Sciences Solomons MD 20688 USA 3 Monterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 USA 4 Massachusetts Institute of Technology Cambridge MA 02139 USA

program FGENESB (Softberry) and the annotation was subsequently refined and curated manually using ARTEMIS (Sanger Center) Fig

FGENESV Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song Qi Wei Qin Jin Qiu Can Hua Huang Fan Wang and Choy Leong Hew1 2 1 1 1 1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2

A total of 162 ORFs predicted by the FGENESV program (available through httpwwwsoftberrycom) supplemented with Vector NTI suite 71 are indicated doi101023BVIRU000002577148128f8 Virus Genes 28 (3) 239-246 April 2004 Article ID 5269250 Complete Nucleotide Sequence of a Strawberry Isolate of Beet Pseudoyellows Virus Ioannis E Tzanetakis Molecular and Cellular Biology Program Department of Botany and Plant Pathology Oregon State University Corvallis 97331 USA Robert R Martin Horticultural Crops Research Laboratory USDA-ARS Corvallis OR 97330 USA E-mail martinrrscienceoregonstateedu httpwwwncbinlmnih govgorfgorfhtml) and the gene finder in viruses at httpwwwsoftberrycom The amino acid comparisons Geno Prot amp Bioinfo Vol 1 No 3 August 2003 226-235 Genome Organization of the SARS-CoV Jing Xu1 Jianfei Hu21 Jing Wang21 Yujun Han1 Yongwu Hu13 Jie Wen1 Yan Li1 Jia Ji1 Jia Ye14 Zizhang Zhang5 Wei Wei4 Songgang Li12 Jun Wang1 Jian Wang14 Jun Yu14 and Huanming Yang14 1 Beijing Genomics Institute Chinese Academy of Sciences Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3Wenzhou Medical College Wenzhou 325003 China 4 James D Watson Institute of Genome Sciences Zhijiang Campus Zhejiang University and Hangzhou Genomics Institute Hangzhou 310008 China 5 College of Materials Science and Chemical Engineering Yuquan Campus Zhejiang University Hangzhou 310027 China These authors contributed equally to this work Corresponding authors E-mail junyugenomicsorgcn yanghmgenomicsorgcn hellipFGENESV a program for gene prediction provided by Softberry Inc (Mount Kisco USA) through a web-based interface has been specially modimacred and trained with parameters for virus (httpwwwsoftberrycomberryphtmltopic= gfindv) hellip

The hypothetical minus sense ORF iden-timacred by FGENESV (from 48 to 203 nt on the minus strand or 29523 to 29678 nt on the plus strand) may be fake but we should not absolutely deny the prob-ability of the existence of minus ORFs hellipFurthermore we employed FGENESV to explore the sequences of MHV (NC 001846 in NCBI) and AIBV (NC 001451 in NCBI) and compared the re-sults with their previous annotations respectively Rapport de stage de DEA Juin 2003 Analyse du geacutenome du virus de lrsquoarcheacutee Pyrococcus abyssi (PAV1) ROUAULT Karen Laboratoire de Microbiologie et Biotechnologie des Extrecircmophiles IFREMER- Centre de Brest et Equipe Microbiologie LEMAR ndash Institut Universitaire Europeacuteen de la Mer [14] FGENESV httpwwwsoftberrycomberry phtmltopic=gfindv Virus ( gt10 kb) Modegraveles de Markov Forme du geacutenome Code geacuteneacutetique [40]

FGENES-M BMC Bioinformatics 2005 6 25 Published online 2005 February 10 doi 1011861471-2105-6-25 Integrating alternative splicing detection into gene prediction Sylvain Foissac 1 and Thomas Schiex11Uniteacute de Biomeacutetrie et Intelligence Artificielle INRA 31326 Castanet Tolosan France

Corresponding author Sylvain Foissac foissactoulouseinrafr Thomas Schiex tschiextoulouseinrafrReceived July 27 2004 Accepted February 10 2005 This approach has been applied eg in HMMgene or in FGENES-M (unpub)hellip DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39 - 43 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom) DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39-43 DOI 1010801042517032000160189 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom)

BESTORF Eukaryotic Cell March 2005 p 526-535 Vol 4 No 3 1535-977805$0800+0 doi101128EC43526-5352005 Sex-Specific Homeodomain Proteins Sxi1 and Sxi2a Coordinately Regulate Sexual Development in Cryptococcus neoformans Christina M Hull1 Marie-Josee Boily and Joseph Heitman1 12 Department of Molecular Genetics and Microbiology1 the Howard Hughes Medical Institute Duke University Medical Center Durham North Carolina2 Received 14 January 2005 Accepted 14 January 2005 Corresponding author Mailing address 322 CARL Building Box 3546 Department of Molecular Genetics and Microbiology 100 Research Dr Duke University Medical Center Durham NC 27710 Phone (919) 684-2824 Fax (919) 684-5458 E-mail heitm001dukeeduPresent address Departments of Biomolecular Chemistry and Medical Microbiology amp

Immunology University of WisconsinmdashMadison Madison WI 53706 Sequence manipulations Splice predictions of candidate gene sequences for SXI2a were facilitated with a Softberry algorithm (wwwsoftberrycom) hellipWe utilized the BESTORF gene prediction algorithm from Softberry Inc to electronically produce predicted spliced cDNA products encoded by a 10-kb regionhellip

PROTCOMP Genes and Immunity 2005 v5 n4 - naturecom Immune response in silico(IRIS) immune-specific genes identified from a compendium of microarray hellip AR Abbas D Baldwin Y Ma W Ouyang A Gurney F hellip The Protcomp algorithm (Softberry Inc) predicts for the 1589 IRIS genes with ORFs that 24 of the encoded proteins are in the plasma membrane 13 are MPMI Vol 17 No 7 2004 pp 789ndash797 Publication no M-2004-0426-01R copy 2004 The American Phytopathological Society Lotus japonicus LjKUP Is Induced Late During Nodule Development and Encodes a Potassium Transporter of the Plasma Membrane Guilhem Desbrosses Claudia Kopka Thomas Ott and Michael K Udvardi Max Planck Institute of Molecular Plant Physiology Am Muumlhlenberg 1 14476 Golm Germany Submitted 3 November 2003 Accepted 13 February 2004 hellipBoth PSORT and Protcomp predicted a PM location for LjKUPhellip Planta DOI 101007s00425-003-1182-5 Issue Volume 218 Number 6 Date April 2004 Pages 965 - 975 Biochemical and immunological characterization of pea nuclear intermediate filament proteins Sonal S D Blumenthal1 Gregory B Clark1 and Stanley J Roux1

(1) School of Biological Sciences Section of Molecular Cell and Developmental Biology The University of Texas Austin TX 78712 USA Stanley J Roux Email srouxutsccutexasedu html) BCM Search Launcher (Protein structure prediction http searchlauncher bcmtmcedu) SoftBerry (Protein subcellular localization Comparative and Functional Genomics Volume 5 Issue 4 Pages 342 - 353Published Online 20 May 2004 Research Paper Investigation into the use of C- and N-terminal GFP fusion proteins for subcellular localization studies using reverse transfection microarrays Ella Palmer Tom Freeman

MRC Rosalind Franklin Centre for Genomics Research (formerly the HGMP-Resource Centre) Genome Campus Hinxton Cambridge CB10 1SB UK email Tom Freeman (tfreemanrfcgrmrcacuk) Correspondence to Tom Freeman RFCGR Hinxton CambridgeCB10 1SB UK

ProtComp version 4 (Softberry) combines results with proteins of known subcellular localization and assumed subcellular localization (based on theoret- ical Plant Physiol2004 134 286-295 RHM2 Is Involved in Mucilage Pectin Synthesis and Is Required for the Development Usadel et al Tentative subcellular localization prediction by TargetP (Emanuelsson et al 2000 ) or ProtComp (httpwwwsoftberrycom) a prediction software trained on Journal of Cellular BiochemistryVolume 90 Issue 2 Pages 361 - 378Published Online 3 Sep 2003 A proteomic study of the arabidopsis nuclear matrix Tomasz T Calikowski 1 3 Tea Meulia 2 Iris Meier 1

1Department of Plant Biology and Plant Biotechnology Center Ohio State University Columbus Ohio 43210 2Molecular and Cellular Imaging Center Ohio Agricultural and Research Development Center Ohio State University Columbus Ohio 43210 3Institute of Biochemistry and Biophysics Polish Academy of Sciences UL Pawinskiego 5A 02-106 Warszawa Poland email Iris Meier (meier56osuedu)Correspondence to Iris Meier Department of Plant Biology and Plant Biotechnology Center Ohio State University 244 Rightmire Hall 1060 Carmack Rd Columbus OH 43210 For prediction of subcellular localization ProtComp 4 (Softberry Inc Mount Kisco NY httpwwwsoftberrycomberryphtmltopicfrac14 proteinloc) PSORT v64 Cellular Molecular Life Sciences 2003 in press Automatic prediction of protein function Burkhard Rost 1 2 3 Jinfeng Liu 1 3 4 Rajesh Nair 1 5 Kazimierz O Wrzeszczynski 1 and Yanay Ofran 16

1 CUBIC Dept of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 2 Columbia University Center for Computational Biology and Bioinformatics (C2B2) Russ Berrie Pavilion 1150 St Nicholas Avenue New York NY 10032 USA

3 North East Structural Genomics Consortium (NESG) Department of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 4 Dept of Pharmacology Columbia Univ 630 West 168th Street New York NY 10032 USA 5 Dept of Physics Columbia Univ 538 West 120th Street New York NY 10027 USA 6 Dept of Medical Informatics Columbia Univ 630 West 168th Street New York NY 10032 USA Corresponding author cubiccubicbioccolumbiaedu URL httpcubicbioccolumbiaedu Tel +1-212-305-4018 fax +1-212-305-7932

genomelocalize ProtComp predict localization for plants httpwwwsoftberrycomberryphtmltopic=proteinloc Predotar predict Published online before print September 15 2003 101101gr1293003 Genome Research 132265-2270 2003 The Secreted Protein Discovery Initiative (SPDI) a Large-Scale Effort to Identify Novel Human Secreted and Transmembrane Proteins A Bioinformatics Assessment Hilary F Clark1 Austin L Gurney Evangeline Abaya Kevin Baker Daryl Baldwin Jennifer Brush Jian Chen Bernard Chow Clarissa Chui Craig Crowley Bridget Currell Bethanne Deuel Patrick Dowd Dan Eaton Jessica Foster Christopher Grimaldi Qimin Gu Philip E Hass Sherry Heldens Arthur Huang Hok Seon Kim Laura Klimowski Yisheng Jin Stephanie Johnson James Lee Lhney Lewis Dongzhou Liao Melanie Mark Edward Robbie Celina Sanchez Jill Schoenfeld Somasekar Seshagiri Laura Simmons Jennifer Singh Victoria Smith Jeremy Stinson Alicia Vagts Richard Vandlen Colin Watanabe David Wieand Kathryn Woods Ming-Hong Xie Daniel Yansura Sothy Yi Guoying Yu Jean Yuan Min Zhang Zemin Zhang Audrey Goddard William I Wood and Paul Godowski Departments of Bioinformatics Molecular Biology and Protein Chemistry Genentech Inc South San Francisco California 94080 USA 1 Corresponding author E-MAIL hclarkgenecomFAX (650) 225-5389 An automated computational strategy was utilized to query each protein translation with the Signal Sensor Sighmm Tmdetect (T Wu unpubl) hmmpfam (Eddy 1998 ) and Protcomp (Softberry Inc) algorithms hellipThe Protcomp algorithm predicts the subcellular localization of a protein on the basis of homology to well-annotated proteins a neural net and various protein motifs In this case the Protcomp subcellular localization prediction was used to categorize these genes as Other Secreted Other Transmembrane or Other Cytoplasmic or Nuclear Plant Physiol February 2002 Vol 128 pp 336-340 Gene prediction in eukaryota Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 genomelocalize ProtComp predict localization for plants httpwww softberrycomberryphtmltopic=proteinloc Predotar predict Proc Natl Acad Sci U S A 2001 April 24 98(9) 5341ndash5346 doi 101073pnas101534498 Published online 2001 April 17 Plant Biology The Cia5 gene controls formation of the carbon concentrating mechanism in Chlamydomonas reinhardtii Youbin Xiang Jun Zhang and Donald P Weeks

Department of Biochemistry and School of Biological Sciences University of Nebraska Lincoln NE 68588-0664

Edited by Bob B Buchanan University of California Berkeley CA and approved March 14 2001 (received for review 2000 November 8) To whom reprint requests should be addressed E-mail dweeks1unledu hellipComputer-assisted analysis of the CIA5 aa sequence (PROTCOMP version 4 httpwwwsoftberrycom) predicted a nuclear localization of the protein hellipFinally computer program predictions (eg PROTCOMP version 4 httpwwwsoftberrycom) for a nuclear localization of CIA5 and the clear-cut nuclear localization of CIA5 in onion epidermal cells (Fig 3) provide additional weight to the argument that CIA5 may be a transcription factor Dissertation zur Erlangung des akademischen Grades Dr rer nat der Fakultaumlt der Naturwissenschaften der Universitaumlt Ulm Untersuchungen zur Identifizierung von Faktoren und Mechanismen der mRNA 3 Prozessierung und Degradation in Chloroplasten houmlherer Pflanzen vorgelegt von Michael Walter aus Immenstadt i Allgaumlu Abteilung Molekulare Botanik Universitaumlt Ulm Ulm November 2001 Tag der Promotion 19 Feb 2002 Algorithmen zur Vorhersage der subzellulaumlren Lokalisation - PSORT httppsortnibbacjp8800formhtml (Nakai und Kanehisa 1992) - ChloroP httpwwwcbsdtudkservicesChloroP (Emanuelsson et al 2000) - TargetP httpwwwcbsdtudkservicesTargetP (Emanuelsson et al 2000) - Predotar httpwwwinrafrInternetProduitsPredotar - Softberry httpwwwsoftberrycom

BPROM Extremophiles Issue Volume 9 Number 2 Date April 2005 Pages 99 ndash 109 DOI 101007s00792-004-0425-0

The genome of BCJA1c a bacteriophage active against the alkaliphilic bacterium Bacillus clarkii

Andrew M Kropinski1 Melissa Hayward1 M Dorothy Agnew1 and Ken F Jarrell1

(1) Department of Microbiology and Immunology Queens University Kingston ON K7L 3N6 Canada

al 2002) Promoters were predicted using Softberryrsquos BPROM program at httpwwwsoftberry comberry phtmltopic=promoter

Journal of Bacteriology February 2005 p 1091-1104 Vol 187 No 3 0021-919305$0800+0 doi101128JB18731091-11042005

The Generalized Transducing Salmonella Bacteriophage ES18 Complete Genome Sequence and DNA Packaging Strategy

Sherwood R Casjens12 Eddie B Gilcrease1 Danella A Winn-Stapley1 Petra Schicklmaier3 Horst Schmieger3 Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24 Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah1 Department of Biological Sciences4 Pittsburgh Bacteriophage Institute University of Pittsburgh Pittsburgh Pennsylvania 2 Institut fuumlr Genetik und Mikrobiologie Universitaumlt Muumlnchen Munich Germany3 Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu Present address Biology Department MIT Cambridge MA 02139 Present address Biogen Idec GmbH D-85737 Ismaning Germany

Received 1 September 2004 Accepted 3 November 2004

The DNA sequence analysis software used was DNA Strider (24) GeneMark (5) Staden programs (78) BLAST (2) BPROM (httpwwwsoftberrycomberryphtmltopic

Infection and Immunity May 2005 p 2899-2909 Vol 73 No 5 Characterization of the Major Secreted Zinc Metalloprotease- Dependent GlycerophospholipidCholesterol Acyltransferase PlaC of Legionella pneumophila Sangeeta Banerji1 Mayte Bewersdorff1 Bjoumlrn Hermes1 Nicholas P Cianciotto2 and Antje Flieger1 Robert Koch-Institut Berlin Germany1 Department of Microbiology-Immunology Northwestern University Medical School Chicago Illinois2 Received 25 October 2004 Returned for modification 18 November 2004 Accepted 22 December 2004

Corresponding author Mailing address Robert Koch-Institut Research Group NG5 Pathogenesis of Legionella Infections Nordufer 20 D-13353 Berlin Germany Phone 49-30-4547-2522 Fax 49-30-4547-2328 E-mail fliegerarkide MB and BH contributed equally to this work

legion) (12) Nucleotide sequences were also analyzed for promoters using the web-based program BPROM (wwwsoftberrycom) Sequence Journal of Bacteriology April 2005 p 2458-2468 Vol 187 No 7 The Type III-Dependent Hrp Pilus Is Required for Productive Interaction of Xanthomonas campestris pv vesicatoria with Pepper Host Plants Ernst Weber1 Tuula Ojanen-Reuhs2 Elisabeth Huguet3 Gerd Hause4 Martin Romantschuk2 Timo K Korhonen2 Ulla Bonas13 and Ralf Koebnik1 Institute of Genetics1 Biozentrum Martin Luther University Halle Germany4 General Microbiology Faculty of Biosciences University of Helsinki Helsinki Finland2 Institut des Sciences Veacutegeacutetales CNRS Gif-sur-Yvette France3 Received 10 November 2004 Accepted 28 December 2004 Corresponding author Mailing address Martin-Luther-Universitaumlt Institut fuumlr Genetik Weinbergweg 10 D-06120 Halle (Saale) Germany Phone 49 345 5526293 Fax 49 345 5527151 E-mail koebnikgmxde Present address Purdue University Department of Food Sciences West Lafayette IN 47907 Present address Institut de Recherche sur la Biologie de lInsecte UMR CNRS 6035 Faculteacute

des Sciences F-37200 Tours France Present address University of Helsinki Department of Ecological and Environmental Sciences

FIN-15140 Lahti Finland

The promoter recognition program BPROM (Softberry Inc Mt Kisco NY) was used for prediction of bacterial sigma70 promoter motifs RESULTS

J Bacteriol 2004 September 186(17) 5945ndash5949 doi 101128JB186175945-59492004 Identification of Operators and Promoters That Control SXT Conjugative Transfer John W Beaber and Matthew K Waldor

Department of Microbiology Tufts University School of Medicine and Howard Hughes Medical Institute Boston Massachusetts Corresponding author Mailing address Tufts University School of Medicine 136 Harrison Ave Jaharis 425 Boston MA 02111 Phone (617) 636-2730 Fax (617) 636-2723 E-mail matthewwaldortuftsedu Received April 1 2004 Accepted May 24 2004 hellipComputer algorithms and 5prime random amplification of cDNA ends (RACE) were used to define the setR and s086 transcription start sites Software for the identification of bacterial promoters (httpwwwsoftberrycomberryphtmltopic=bpromampgroup=programsampsubgroup=gfindb) identified putative minus10 and minus35 elements for both PL and PR (Fig 2) (23 24)hellip JOURNAL OF BACTERIOLOGY Mar 2004 p 1818ndash1832 Vol 186 No 6

The pKO2 Linear Plasmid Prophage of Klebsiella oxytoca Sherwood R Casjens12 Eddie B Gilcrease1 Wai Mun Huang1 Kim L Bunny3

Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24

Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah 841321 Pittsburgh Bacteriophage Institute2 and Department of Biological Sciences4 University of Pittsburgh Pittsburgh Pennsylvania 15260 and Section of Microbiology University of California at Davis Davis California 956163

Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu hellipThe DNA sequence analysis software packages used were DNA Strider (27) GeneMark (8) the Staden programs (94) BLAST (3) BPROM httpwwwsoftberrycomberryphtmltopic_gfindb) and DNA Master (J Lawrence [httpcobamide2biopittedu])hellip

BMC Microbiology 2004 44 Analysis of the lambdoid prophage element e14 in the E coli K-12 genome Preeti Mehta1 Sherwood Casjens2 and Sankaran Krishnaswamy1 Address 1Bioinformatics Centre School of Biotechnology Madurai Kamaraj University Madurai-625021 India and 2University of Utah Medical School Department of Pathology 90 North 1900 East Salt Lake City UT 84132-2501 USA Email Preeti Mehta - mehta_p74yahoocom Sherwood Casjens - sherwoodcasjenspathutahedu Sankaran Krishnaswamy - krishnamrnatnnicin Corresponding author This article is available from httpwwwbiomedcentralcom1471-218044 Putative promoters predicted using BPROM available at the website http wwwsoftberrycom Scores are as given by BPROM Promoters Plant Molecular Biology 53 (6) 865-876 December 2003 Prokaryotic orthologues of mitochondrial alternative oxidase and plastid terminal oxidase Allison E McDonald Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Sasan Amirsadeghi Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Greg C Vanlerberghe Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada (e-mail gregvutscutorontoca The A variabilis PTOX sequence was analyzed in the upstream region of the start codon with Softberryrsquos BPROM software (httpwwwsoftberrycom)

SPLICEDB Plant Molecular Biology DOI 101007s11103-005-0271-1 Issue Volume 57 Number 3 Date February 2005 Pages 445 - 460 Evaluation of five ab initio gene prediction programs for the discovery of maize genes Hong Yao1 4 Ling Guo1 6 Yan Fu1 4 Lisa A Borsuk1 6 Tsui-Jung Wen2 David S Skibbe1 5 Xiangqin Cui1 4 9 Brian E Scheffler8 Jun Cao1 4 Scott J Emrich6 Daniel A Ashlock3 6 and Patrick S Schnable1 2 4 5 6 7

(1)Department of Genetics Development and Cell Biology Iowa State University Ames Iowa 50011-3650 (2) Department of Agronomy Iowa State University Ames Iowa 50011-3650 (3) Department of Mathematics Iowa State University Ames Iowa 50011-3650 (4) Inderdepartmental Graduate Programs in Genetics Iowa State University Ames Iowa 50011-3650 (5) Department of Molecular Cellular and Developmental Biology Iowa State University Ames Iowa 50011-3650 (6) Department of Electrical and Computer Engineering and Department of Bioinformatics and Computational Biology Iowa State University Ames Iowa 50011-3650 (7) Center for Plant Genomics Iowa State University Ames Iowa 50011-3650 (8) Mid South Area Genomics Facility USDA-ARS Stoneville MS 38776-0038 USA(9) Present address Department of Biostatistics Birmingham AL 35294 USA model FGENESH httpwwwsoftberrycom berryphtmltopic=fgeneshampgroup= programsampsubgroup=gfind Monocots Yes Yes Yes GHMM a GeneMark Finding short DNA motifs using permuted markov models X Zhao H Huang TP Speed The data are human donor sequences from SpliceDB [9] a recently developed database of known mammalian splice site sequences (httpwwwsoftberrycomspldb Current Opinion in Structural Biology 2004 14273ndash282 The evolving roles of alternative splicing Liana F Lareau1 Richard E Green1 Rajiv S Bhatnagar23 and Steven E Brenner12_ Departments of 1Molecular and Cell Biology and 2Plant and Microbial Biology University of California Berkeley California 94720 USA 3Department of Dermatology University of California San Francisco California 94143 USA _e-mail brennercompbioberkeleyedu [79] SpliceDB httpwwwsoftberrycomberryphtmltopicfrac14splicedb Database and composition statistics for mammalian splice sites inferred from ESTs [80] Yearbook of Medical Informatics Review Paper 2004 121-136 Curated databases and their role in clinical bioinformatics CC Englbrecht M Han MT Mader A Osanger KFX Mayer MIPS Institute for Bioinformatics Address of the authors Claudia C Englbrecht Michael Han

Michael T Mader Andreas Osanger Klaus F X Mayer MIPS Institute for Bioinformatics GSF - National Research Center for Environment and Health 85758 Neuherberg Germany E-mail kmayergsfdeCorresponding author hellipSpliceDB httpwwwsoftberrycomspldbSpliceDBhtmlCanonical and non-canonical mammalian splice sites [122] 122Burset M Seledtsov IA Solovyev VV SpliceDB database of canonical and non-canonical mammalian splice sites Nucleic Acids Res 200129255-9 Nucleic Acids Research 2001 Vol 29 No 1 255-259 SpliceDB database of canonical and non-canonical mammalian splice sites M Burset I A Seledtsov1 and V V Solovyev The Sanger Centre Hinxton Cambridge CB10 1SA UK and 1Softberry Inc 108 Corporate Park Drive Suite 120 White Plains NY 10604 USA To whom correspondence should be addressed at present address EOS Biotechnology 225A Gateway Boulevard South San Francisco CA 94080 USA Tel +1 650 246 2331 Fax +1 650 583 3881 Email solovyeveosbiotechcom Present address M Burset Institut Municipal drsquoInvestigacioacute Megravedica (IMIM) CDr Aiguader 80 08003 Barcelona Spain

SCAN2 African Journal of Biotechnology Vol 2 (12) pp 714-718 December 2003 Available online at httpwwwacademicjournalsorgAJB ISSN 1684ndash5315 copy 2003 Academic Journal Accepted 14 November 2003 Minireview Web-based bioinformatic resources for protein and nucleic acids sequence alignment Kamel A Abd-Elsalam Molecular Markers Lab Plant Pathology Research Institute Agricultural Research Center Orman 12619 Giza Egypt E-mail kaabdelsalammsncom 16-SCAN2 program for aligning two multimegabyte-size sequences httpwwwsoftberrycomberryphtmltopic=scanhampprg= SCAN2 derived Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK The full-length sequences of gene pairs have been aligned by the SCAN2 program (httpsoftberrycomberryphtmltopic=scanhampprg=SCAN2) which can align

PDISORDER BMC Bioinformatics 2005 622 doi1011861471-2105-6-22 Research article Open Access Proteins with two SUMO-like domains in chromatin-associated complexes The RENi (Rad60-Esc2-NIP45) family Maria Novatchkova1 Andreas Bachmair3 Birgit Eisenhaber2 and Frank Eisenhaber2 Address 1Gregor Mendel-Institut GMI Austrian Academy of Sciences Vienna Biocenter A-1030 Vienna Austria 2Research Institute of Molecular Pathology Dr Bohr-Gasse 7 A-1030 Vienna Austria and 3Max Planck Institute for Plant Breeding Research Carl-von-Linneacute-Weg 10 D-50829 Cologne Germany Email Maria Novatchkova - marianovatchkovagmioeawacat Andreas Bachmair - bachmairmpiz-koelnmpgde Birgit Eisenhaber - b_eisenimpunivieacat Frank Eisenhaber - FrankEisenhaberimpunivieacat Corresponding author hellipInitial analysis of its sequence complexity shows that the disordered N-terminal half of the protein is followed by a likely globular segment (predicted using Pdisorder by Softberry Inc)hellip

SPL The National Academy of Sciences Proc Natl Acad Sci U S A 2003 November 25 100(Suppl 2) 14537ndash14542 doi 101073pnas2335847100 Molecular evolution of the insect chemoreceptor gene superfamily in Drosophila melanogaster Hugh M Robertsondagger Coral G WarrDaggersect and John R Carlsonsect Department of Entomology University of Illinois 505 South Goodwin Avenue Urbana IL 61801 DaggerSchool of Biological Sciences Monash University Clayton VIC 3800 Australia and sectDepartment of Molecular Cellular and Developmental Biology Yale University New Haven CT 06520 The genes were reconstructed manually in the PAUP editor (23) by using the expected exonintron structures as guides and the SPL program (Softberry wwwsoftberrycomberryphtml) to locate predicted introns

NSITE GENETIC ANALYSES OF BOVINE CARD15 A PUTATIVE DISEASE RESISTANCE GENE A Dissertation by KRISTEN HAWKINS TAYLOR Submitted to Texas AampM University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 2004 Major Subject Genetics hellipShort motifs identified as being conserved between the three species in these intronic regions as well as in the 5rsquoUTR and 3rsquoUTR were then analyzed using the TFSCAN (httpzeonwelloxacukgit-bintfscan) and NSITE (available through SoftBerry httpwwwsoftberrycomberryphtmltopic=promoter) programs to identify putative regulatory motifs Motifs selected for analysis required homology consisting of 6 or more bases with no more than 2 substitutions among the 3 specieshellip hellipSequence that included the SNPs located within intronic regions and in the 3rsquo and 5rsquoUTRs were analyzed using NSITE (available through SoftBerry at httpwwwsoftberrycomberryphtmltopic=promoter) to identify putative regulatory motifshellip

TSSP Bioinformatics 2005 21(14)3074-3081 doi101093bioinformaticsbti490 Cis-regulatory element based targeted gene finding genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana Weixiong Zhang 12 Jianhua Ruan 1 Tuan-hua David Ho 3 Youngsook You 3 Taotao Yu 1 and Ralph S Quatrano 3 1Department of Computer Science and Engineering Washington University in Saint Louis Saint Louis MO 63130 USA 2Department of Genetics Washington University in Saint Louis Saint Louis MO 63130 USA 3Department of Biology Washington University in Saint Louis Saint Louis MO 63130 USA To whom correspondence should be addressed

sites (TSSs) To predict TSSs we combined an Athaliana cDNA database and a software TSSP (SoftBerry httpwwwsoftberrycom) As

PLANTPROM BMC Genomics 2005 6 25 Genome wide analysis of Arabidopsis core promoters Carlos Molina12 and Erich Grotewold 11Department of Plant Cellular and Molecular Biology and Plant Biotechnology Center The Ohio State University Columbus OH 43210 2Departamento de Informaacutetica Universidad Teacutecnica Federico Santa Mariacutea Valparaiacuteso Chile search for TATA elements is carried out on the 12749 [-500 -1] regions 6316 sequences (using the MEME NFM) or 8776 (using the expanded PlantProm NFM) are Nucleic Acids Research 2005 Vol 33 No 3 1069ndash1076 doi101093nargki247 Plant promoter prediction with confidence estimation I A Shahmuradov1 V V Solovyev12 and A J Gammerman1 1Royal Holloway University of London Egham Surrey TW20 0EX UK and 2Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA MATERIALS AND METHODS Training and testing sequences For training and testing procedures we used 301 promoters with annotated TSS from PlantProm DB (22) Nucleic Acids Research 2004 Vol 32 Database issue D368plusmnD372 DOI 101093nargkh017 AthaMap an online resource for in silico transcription factor binding sites in the Arabidopsis thaliana genome Nils Ole Steffens Claudia Galuschka Martin Schindler Lorenz BuEgravelow and Reinhard Hehl Institut fuEgrave r Genetik Technische UniversitaEgrave t Braunschweig Spielmannstraucirce 7 D-38106 Braunschweig Germany hellipShahmuradovIA GammermanAJ HancockJM BramleyPM and SolovyevVV (2003) PlantProm a database of plant promoter sequences Plant Physiology October 2004 Vol 136 pp 3023-3033 GENOME ANALYSIS Utility of Different Gene Enrichment Approaches Toward Identifying and Sequencing the Maize Gene Space1[w] Nathan Michael Springer Xiequn Xu and W Brad Barbazuk Center for Plant and Microbial Genomics Department of Plant Biology University of Minnesota St Paul Minnesota 55108 (NMS) and Donald Danforth Plant Sciences Center St Louis Missouri 63132 (XX WBB) Text] Shahmuradov IA Gammerman AJ Hancock JM Bramley PM Solovyev VV (2003) PlantProm a database of plant promoter sequences

Plant Physiology April 2004 Vol 134 pp 1ndash12 wwwplantphysiolorg Characterization of Three Functional High-Affinity Ammonium Transporters in Lotus japonicus with Differential Transcriptional Regulation and Spatial Expression1

Enrica DrsquoApuzzo2 Alessandra Rogato2 Ulrike Simon-Rosin Hicham El Alaoui3 Ani Barbulova Marco Betti Maria Dimou Panagiotis Katinakis Antonio Marquez Anne-Marie Marini Michael K Udvardi and Maurizio Chiurazzi Institute of Genetics and Biophysics Via Marconi 12 80125 Napoli Italy (ED AR HEA AB MC) Molecular Plant Nutrition Group Max Planck Institute of Molecular Plant Physiology Am Muhlenberg 1 14476 Golm Germany (US-R MKU) Universiteacute Libre de Bruxelles Institut de Biologie de Meacutedecine Moleacuteculaires 6041 Gosselies Belgium (A-MM) Agricultural University of Athens Department of Agricultural Biotechnology 11855 Athens Greece (MD PK) and Departamento de Bioquıacutemica Vegetal y Biologıacutea Molecular Facultad de Quimica 41080 Seville Spain (MB AM) hellipPlantProm a database of plant promoter sequences Nucleic Acids Res 31 114ndash117 Nucleic Acids Research 1 January 2003 vol 31 no 1 pp 114-117(4) PlantProm a database of plant promoter sequences Authors Bramley PM1 Solovyev VV2 Shahmuradov IA Gammerman AJ Hancock JM Affiliations Department of Computer Science Royal Holloway University of London Egham Surrey TW20 0EX UK 1 School of Biological Sciences Royal Holloway University of London UK 2 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 3 To whom correspondence should be addressed Email victorsoftberrycom Present address John M Hancock MRC Mammalian Genetics Unit Harwell Oxfordshire UK

PROMH Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK Received February 15 2003 Revised and Accepted March 21 2003

Other FGENESH Molecular Microbiology Volume 52 Issue 6 Page 1579 - June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Affiliations Departments of Medicine and Pathobiology University of Washington Harborview Medical Center Box 359779 325 Ninth Ave Seattle WA 98104 USA E-mail acenturuwashingtonedu Tel (+1) 206 341 5364 Fax (+1) 206 341 5363 Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Microbiology 150 (2004) 518-520 DOI 101099mic026871-0 IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated with recognizable overlapping genes Mark W Silby1 Paul B Rainey23 and Stuart B Levy14 1 Center for Adaptation Genetics and Drug Resistance Department of Molecular Biology and Microbiology Tufts University School of Medicine Boston MA 02111 USA 2 Department of Plant Sciences University of Oxford South Parks Road Oxford OX1 3RB UK 3 School of Biological Sciences University of Auckland Private Bag 92019 Auckland New Zealand 4 Department of Medicine Tufts University School of Medicine Boston MA 02111 USA Correspondence Stuart B Levy (stuartlevytuftsedu) hellipUsing SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of the iiv5 ORF respectivelyhellip Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 DOI 101128JVI782212576-125902004

Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song1 Qi Wei Qin2 Jin Qiu1 Can Hua Huang1 Fan Wang1 and Choy Leong Hew1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2 Corresponding author Mailing address Department of Biological Sciences National University of Singapore 10 Kent Ridge Crescent Singapore 119260 Singapore Phone 65-68742692 Fax 65-67795671 E-mail dbshewclnusedusg or dbsheadnusedusg Received 19 March 2004 Accepted 29 June 2004

hellipThe whole genome was also submitted to httpwwwsoftberrycom (Softberry Inc Mount Kisco NY) for identification of all potential ORFshellip hellipCoding capacity of the viral genomic DNA sequence Prediction of presumptive genes was carried out by using the viral gene prediction program under the website httpwwwsoftberrycom supplemented with Vector NTI suite 71hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1695-8 Issue Volume 109 Number 4 Date August 2004 Pages 681 - 689

Linkage disequilibrium and sequence diversity in a 500-kbp region around the adh1 locus in elite maize germplasm Mark Jung1 Ada Ching1 Dinakar Bhattramakki2 Maureen Dolan1 Scott Tingey1 Michele Morgante1 3 and Antoni Rafalski1

(1) DuPont Crop Genetics Experimental Station PO Box 80353 Wilmington DE 19880-0353 USA (2) Pioneer Hi-Bred International Inc 7300 NW 62nd Avenue PO Box 1004 Johnston IA 50131-1004 USA (3) Dipartimento di Produzione Vegetale e Tecnologie Agrarie Universita di Udine Via delle Scienze 208 33100 Udine ItalyReceived 26 January 2004 Accepted 2 April 2004 Published online 6 August 2004 Mark Jung Email marktjungcgrdupontcom 1) Gene locations were defined by several methods Annotations provided in Tikhonov et al (1999) were first used then FGENESH gene-finding software DNA Sequence - The Journal of Sequencing and Mapping Issue Volume 15 Number 4 August 2004 Pages 269 ndash 276 DOI 10108010425170412331279648 Isolation Characterization and Expression Analysis of a Leaf-specific Phosphoenolpyruvate Carboxylase Gene in Oryza sativa Chang-Fa Lin A1 Chun Wei A1 Li-Zhi Jiang A1 A2 Ke-Gui Li A1 Xiao-Yin Qian A1 Kotb Attia A1 Jin-Shui Yang A1 A1 State Key Laboratory of Genetic Engineering Institute of Genetics School of Life Sciences Fudan University Shanghai 200433 PRChina A2 National Key Laboratory for Soil Erosion and Dry Land Farming on Loess Plateau Northwest Sci-Tech University of Agriculture and Forest Shannxi 712100 PRChina tools of GeneMark (httpopal biologygatechedugeneMark) and Softberry (httpwwwsoftberrycom) For the isolation of putative Plant Molecular Biology DOI 101023BPLAN00000382568980957 Issue Volume 54 Number 4 Date March 2004 Pages 519 ndash 532

Genome-Wide Analysis of the GRAS Gene Family in Rice and Arabidopsis Chaoguang Tian1 Ping Wan1 Shouhong Sun1 Jiayang Li1 and Mingsheng Chen1

(1) Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road Chaoyang District Beijing 100101 ChinaMingsheng Chen Email mschengeneticsaccn

database FgeneSH (Salamov and 90 Solovyev 2000) was used for gene prediction pre- 207 dicted by FgeneSH (minor discrepancies exist due 208 Mycological Research (2004) 108 853-857 Cambridge University Press doi 101017S095375620400067X Published Online 13 авг 2004 Combining transcriptome data with genomic and cDNA sequence alignments to make confident functional assignments for Aspergillus nidulans genes Andrew H SIMS a1 Manda E GENT a1 Geoffrey D ROBSON a1 Nigel S DUNN-COLEMAN a2 and Stephen G OLIVER a1c1 a1 School of Biological Sciences University of Manchester The Michael Smith Building Oxford Road Manchester M13 9PT UK E-mail steveolivermanacuk a2 Genencor International Inc 925 Page Mill Road Palo Alto CA 94304 USA c1 Corresponding author Kingdom Page 2 Genewise FgeneSH FgeneSH+) consisting of 9541 putative open reading frames (ORFs) was released in June 2003 We TAG Theoretical and Applied Genetics DOI 101007s00122-004-1603-2 Issue Volume 109 Number 1 Date June 2004 Pages 129 ndash 139

Gene content and density in banana (Musa acuminata) as revealed by genomic sequencing of BAC clones R Aert1 2 L Saacutegi2 and G Volckaert1

(1) Laboratory of Gene Technology Katholieke Universiteit Leuven Kasteelpark Arenberg 21 3001 Leuven Belgium

Present address Laboratory of Tropical Crop Improvement Katholieke Universiteit Leuven (2) Kasteelpark Arenberg 13 3001 Leuven Belgium

R Aert Email RitaAertagrkuleuvenacbe

gscrikengojp) fgenesh version 11 (Salamov and Solovyev 2000 httpwwwsoftberry com) genemarkhmm version 22a (Lukashin and Borodovsky 1998 http Genome Research 142503-2509 2004 Resources EAnnot A genome annotation tool using experimental evidence Li Ding1 Aniko Sabo Nicolas Berkowicz Rekha R Meyer Yoram Shotland Mark R Johnson Kymberlie H Pepin Richard K Wilson and John Spieth Genome Sequencing Center Washington University School of Medicine St Louis Missouri 63110 USA Article and publication are at httpwwwgenomeorgcgidoi101101gr3152604 1 Corresponding author E-mail ldingwatsonwustledu fax (314) 286-1810 Some ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov and Solovyev 2000 ) are based on intrinsic characteristics of coding hellipSome ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov

and Solovyev 2000 ) are based on intrinsic characteristics of coding sequence (eg codon usage consensus splice sites etc) and require training on known genes from the organismhellip

hellipTo further evaluate the performance of EAnnot we compared EAnnot predictions with Ensembl Genscan and Fgenesh predictions using manual annotation as a standard While Genscan and Fgenesh are ab initio programs Ensembl takes into account experimental data a feature shared with EAnnot Ensembl predicted 1037 known genes with 1798 transcripts and 1457 EST genes with 2308 transcripts for chromosome 6 (build 31) while Fgenesh and Genscan predicted 6230 and 6225 genes respectively We evaluated the performance of each program with respect to splice sites transcripts and genes across all of chromosome 6hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1758-x Issue Volume 109 Number 7 Date November 2004 Pages 1434 - 1447

Full-genome analysis of resistance gene homologues in rice

B Monosi1 R J Wisser2 L Pennill1 and S H Hulbert1

(1) Department of Plant Pathology Kansas State University Manhattan KS 66506-5502 USA (2) Department of Plant Pathology Cornell University Ithaca NY 14853 USAReceived 18 February 2004 Accepted 16 June 2004 Published online 10 August 2004 S H Hulbert Email shulbrtksuedu DNA sequences were analyzed using the gene prediction programs GENSCAN (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (Salamov and arXivq-bioGN0402046 v1 27 Feb 2004 Sublinear growth of Information in DNA sequences Giulia Menconi Dipartimento di Matematica Applicata and CISSC Centro Interdisciplinare per lo Studio dei Sistemi Complessi Universit`a di Pisa Via Bonanno Pisano 25b 56126 PISA - Italy menconimaildmunipiit October 23 2003 hellipAs a result four putative genes G1 G2 G3 and G4 have been located by means of Hidden Markov Model-based program FGENESH2 that has been created for predicting multiple genes and their structure in genomic DNA sequences The analysis via FGENESH has been exploited with respect to known genes in Arabidopsis thaliana Their predicted position is illustrated in Figure 13 hellip2This program is available at the website wwwsoftberrycom to which we refer con-cerning the reliability and e_ciency of the algorithmhellip Current Opinion in Plant Biology 2004 7732ndash736 DOI 101016jpbi200409003 Consistent over-estimation of gene number in complex plant genomes Jeffrey L Bennetzen14 Craig Coleman27 Renyi Liu15 Jianxin Ma16 and Wusirika Ramakrishna38 1 Department of Genetics University of Georgia Athens Georgia 30602 USA 2 Department of Plant and Animal Sciences Brigham Young University Provo Utah 84602 USA 3 Department of Biological Sciences Michigan Tech University Houghton Michigan 49931 USA

4e-mail maizeugaedu 5e-mail lryugaedu 6e-mail jmaugaedu 7e-mail biotechnologybyuedu 8e-mail wusirikamtuedu hellipWe have found that the standard gene-discovery programs FGENESH GeneMark and GENSCAN annotate segments of most retrotransposons and many invertedrepeat transposable elements as genes Using FGENESH to annotate maize BAC clones for instance 70ndash100 of the predicted genes are actually from transposable elementshellip The Plant Cell 162795-2808 (2004) Spotted leaf11 a Negative Regulator of Plant Cell Death and Defense Encodes a U-BoxArmadillo Repeat Protein Endowed with E3 Ubiquitin Ligase Activity Li-Rong Zenga Shaohong Qua Alicia Bordeosb Chengwei Yangc Marietta Baraoidanb Hongyan Yanc Qi Xiec Baek Hie Nahmd Hei Leungb and Guo-Liang Wanga1 a Department of Plant Pathology Ohio State University Columbus Ohio 43210 b International Rice Research Institute Metro Manila Philippines c State Key Lab for Biocontrol Sun Yat-sen (Zhongshan) University Guangzhou China 510275 d Department of Biological Science Myongji University Kyonggido Korea 449728 1 To whom correspondence should be addressed E-mail wang620osuedu fax 614-292-4455 in spl11 Exons predicted in G3 by the programs GENSCAN and Fgenesh using different matrixes are displayed in dark gray (D) RFLP Source Human Genomics Volume 1 Number 2 January 2004 pp 146-149(4) Publisher Henry Stewart Publications The truth about mouse human worms and yeast Authors David R Nelson1 Daniel W Nebert2 1 Department of Molecular Sciences and The UT Center of Excellence in Genomics and Bioinformatics University of Tennessee Memphis Tennessee 38163 USA 2 Department of Environmental Health and Center for Environmental Genetics (CEG) University of Cincinnati Medical Center Cincinnati Ohio 45267-0056 USA unpublished data 2003 see also Ref [7]) FGENESH 21 TWINSCAN 22 and the Ensembl annotation pipeline 23 The output of the four Genome Biology 2004 5R73 doi101186gb-2004-5-10-r73 A comprehensive transcript index of the human genome generated using microarrays and computational approaches Eric E Schadt 1 Stephen W Edwards 1 Debraj GuhaThakurta1 Dan Holder2 Lisa Ying2 Vladimir Svetnik2 Amy Leonardson1 Kyle W Hart3 Archie Russell1 Guoya Li1 Guy Cavet1 John Castle1 Paul McDonagh4 Zhengyan Kan1 Ronghua Chen1 Andrew Kasarskis1 Mihai Margarint1 Ramon M Caceres1 Jason M Johnson1

Christopher D Armour1 Philip W Garrett-Engele1 Nicholas F Tsinoremas5 and Daniel D Shoemaker1

1Rosetta Inpharmatics LLC 12040 115th Avenue NE Kirkland WA 98034 USA 2Merck Research Laboratories W42-213 Sumneytown Pike POB 4 Westpoint PA 19846 USA 3Rally Scientific 41 Fayette Street Suite 1 Watertown MA 02472 USA 4Amgen Inc 1201 Amgen Court W Seattle WA 98119 USA 5The Scripps Research Institute Jupiter FL 33458 USA hellipGrailEXP 40 [47] GENSCAN 10 [48] FGENESH [49] and FGENESH+ [49]ab initio gene-prediction algorithms were run independently across the entire genome assembly to augment alignment-based gene identification methods GrailEXP 40 GENSCAN 10 and FGENESH version 1c were run with default parameters for human sequence GrailEXP used expressed sequence evidence from RefSeq UniGene and DoubleTwist HGI to refine gene predictions FGENESH+ was run with protein sequences from BLASTX with E-score lower than 10-5 When multiple protein alignments overlapped all overlapping protein sequences were clustered with BLASTClust [50] and the lowest E-score hit was used by FGENESH+hellip hellipAdditionally 3 sim4 and 3 or 5 FGENESH+ predicted exons that were short andor distant from internal predicted exons were removedhellip Genome Research 14988-995 2004 ENSEMBL Special GeneWise and Genomewise Ewan Birney13 Michele Clamp2 and Richard Durbin2 1 The European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK hellipThere has been a long history of successful ab initio programs which do not use any additional

evidence to predict genes on genomic DNA of which Genscan (Burge and Karlin 1997 ) and Fgenesh (Solovyev and Salamov 1997 ) are two of the most successful caseshellip hellipAnother class of evidence-based gene prediction programs are ones which use external evidence to influence the scoring of potential exons including SGP-2 (Parra et al 2003 ) Genie

(Kulp et al 1996 ) Genomescan (Yeh et al 2001 ) HMMGene (Krogh 2000 ) and Fgenesh++ (Solovyev and Salamov 1997 )hellip Published online before print February 5 2004 101073pnas0308430100 PNAS | February 17 2004 | vol 101 | no 7 | 1910-1915 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin Nam Joonyul Kim para Shinyoung Lee Gynheung An Hong Ma and Masatoshi Nei Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State

University University Park PA 16802 and National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Contributed by Masatoshi Nei December 22 2003

hellipBecause annotation of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program FGENESH (wwwsoftberrycom) from the genome sequences obtained from TIGR and the Rice Genome Database (China) (25) Functional amp Integrative Genomics DOI 101007s10142-004-0109-y Issue Volume 4 Number 2 Date May 2004 Pages 102 - 117

Sequence analysis of the long arm of rice chromosome 11 for ricendashwheat synteny

Nagendra K Singh1 Saurabh Raghuvanshi2 Subodh K Srivastava1 Anupama Gaur2 Ajit K Pal1 Vivek Dalal1 Archana Singh1 Irfan A Ghazi1 Ashutosh Bhargav1 Mahavir Yadav1 Anupam Dixit1 Kamlesh Batra1 Kishor Gaikwad1 Tilak R Sharma1 Amitabh Mohanty2 Arvind K Bharti2 Anita Kapur2 Vikrant Gupta2 Dibyendu Kumar2 Shubha Vij2 Ravi Vydianathan2 Parul Khurana2 Sulabha Sharma2 W Richard McCombie3 Joachim Messing4 Rod Wing5 Takuji Sasaki6 Paramjit Khurana2 Trilochan Mohapatra1 Jitendra P Khurana2 and Akhilesh K Tyagi2

1 Indian Initiative for Rice Genome Sequencing National Research Centre on Plant Biotechnology

Indian Agricultural Research Institute New Delhi 110012 India 2 Indian Initiative for Rice Genome Sequencing Department of Plant Molecular Biology University of

Delhi South Campus New Delhi 110021 India 3 Genome Research Centre Cold Spring Harbor Laboratory 1 Bungtown Road New York USA 4 The Plant Genome Initiative at RutgersndashWaksman Institute Rutgers University 190 Frelinghuysen

Road Piscataway NJ 08873 USA 5 Department of Plant Sciences 303 Forbes Building Arizona Genomics Institute The University of

Arizona Tucson AZ 85721 USA 6 Department of Genome Research National Institute of Agrobiological Sciences 1-2 Kannondai 2-

chome Tsukuba Ibaraki 305-8602 Japan Akhilesh K Tyagi Email akhileshgenomeindiaorg Wherever RiceGAAS data were not available the genes were predicted by FGENESH trained for monocot plant species (http wwwsoftberrycomberryphtml) TAG Theoretical and Applied Genetics DOI 101007s00122-004-1621-0 Issue Volume 109 Number 1 Date June 2004 Pages 10 - 22

Annotation of a 95-kb Populus deltoides genomic sequence reveals a disease resistance gene cluster and novel class I and class II transposable elements

M Lescot1 4 S Rombauts1 J Zhang1 S Aubourg1 5 C Matheacute1 6 S Jansson2 P Rouzeacute1 3 and W Boerjan1

1 Department of Plant Systems Biology Flanders Interuniversity Institute for Biotechnology Ghent University Technologiepark 927 9052 Gent Belgium

2 Department of Plant Physiology University of Umearing 901 87 Umearing Sweden 3 Laboratoire Associeacute de l Institut National de la Recherche Agronomique (France) Ghent

University 9052 Gent Belgium 4 Present address CIRAD-Biotrop TA4003 34398 Montpellier Cedex 5 France 5 Pre sent address Uniteacute de Recherche en Geacutenomique Veacutegeacutetale INRA 91057 Evry Cedex

France 6 Present address Laboratoire de Biologie Vasculaire Institut de Pharmacologie et Biologie

Structurale 205 route de Narbonne 31077 Toulouse Cedex France

P Rouzeacute Email pierrerouzepsbugentbe 1999 httpwwwtigrorgtdbglimmermglmr_formhtml) and FgenesH for dicots or monocots (Salamov and Solovyev 2000 httpwwwsoftberrycom) BIOINFORMATICS 2004 vol20 N9 p1416-1427 J Yuan B Bush A Elbrecht Y Liu T Zhang W Zhao hellip - suchasGRAIL(Lopezetal 1994 Roberts 1991 Uberbacher et al 1996) GENESCOPE (Murakami and Takagi 1998) fgenesh (Salamov and Solovyev 2000) GeneMark Molecular Plant Pathology Volume 5 Issue 6 Page 515 - November 2004 Pathogen profile Heading for disaster Fusarium graminearum on cereal crops RUBELLA S GOSWAMI AND H CORBY KISTLERCorrespondence E-mail HCKISTumnedu This pipeline uses a combination of the programs FGENESH and FGENESH+ (Salamov and Solovyev 2000) modified by Softberry ( http wwwsoftberrycom ) with Nucleic Acids Research 2004 Vol 32 Database issue D41-D44 MIPS analysis and annotation of proteins from whole genomes H W Mewes12 C Amid1 R Arnold1 D Frishman2 U Guumlldener1 G Mannhaupt2 M Muumlnsterkoumltter1 P Pagel1 N Strack2 V Stuumlmpflen1 J Warfsmann1 and A Ruepp1 1 Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1 D-85764 Neuherberg Germany and 2 Technische Universitaumlt Muumlnchen Chair of Genome Oriented Bioinformatics Center of Life and Food Science D-85350 Freising-Weihenstephan Germany To whom correspondence should be addressed at Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 D-85764 Neuherberg Germany Tel +49 89 3187 3580 Fax +49 89 3187 3585 Email wmewesgsfde The genome of 40 Mb encodes 10 000 proteins automatically predicted by the program FGENESH (httpsoftberry com) specifically trained for Neurospora Annual Review of Genomics and Human Genetics Vol 5 15-56 (Volume publication date September 2004) COMPARATIVE GENOMICS Webb Miller Kateryna D Makova Anton Nekrutenko and Ross C Hardison The Center for Comparative Genomics and Bioinformatics The Huck Institutes of Life Sciences and the Departments of Biology Computer Science and Engineering and Biochemistry and Molecular Biology Pennsylvania State University University Park Pennsylvania email webbbxpsuedu kdm16psuedu antonbxpsuedu rch8psuedu These algorithms include Genscan the most popular gene prediction tool (24) GenMark (117) Fgenesh (155) GeneID (144) and others (for an excellent overview DNA and Cell Biology May 2004 Vol 23 No 5 311-324 Harbinger Transposons and an Ancient HARBI1 Gene Derived from a Transposase

Vladimir V Kapitonov Genetic Information Research Institute Mountain View California Jerzy Jurka Genetic Information Research Institute Mountain View California We used FGENESH (Salamov and Solovyev 2000) and GeneScan (Burge and Karlin 1997) for the identification of exons and introns The d N d S ratio Proc Natl Acad Sci U S A 2004 February 17 101(7) 1910ndash1915 Published online 2004 February 5 doi 101073pnas0308430100 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin NamdaggerDagger Joonyul Kimsectpara Shinyoung Leesect Gynheung Ansect Hong Madagger and Masatoshi Neidagger

daggerInstitute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University University Park PA 16802 and sectNational Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Dagger To whom correspondence should be addressed E-mail jyn101psuedu paraPresent address Michigan State UniversityndashDepartment of Energy Plant Research Laboratory and Department of Biochemistry and Molecular Biology Michigan State University East Lansing MI 48824 of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program fgenesh (wwwsoftberrycom Nucleic Acids Research 2004 Vol 32 Database issue D377-D382 BGI-RIS an integrated information resource and comparative analysis workbench for rice genomics Wenming Zhao1 Jing Wang2 Ximiao He1 Xiaobing Huang1 Yongzhi Jiao1 Mingtao Dai1 Shulin Wei1 Jian Fu1 Ye Chen1 Xiaoyu Ren1 Yong Zhang12 Peixiang Ni1 Jianguo Zhang1 Songgang Li12 Jian Wang1 Gane Ka-Shu Wong13 Hongyu Zhao4 Jun Yu1 Huanming Yang1 and Jun Wang1 1 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3 University of Washington Genome Center Department of Medicine Seattle WA 98195 USA and 4 Yale University School of Medicine Department of Epidemiology and Public Health New Haven CT 06520-8034 USA To whom correspondence should be addressed Tel +86 10 80481662 Fax +86 10 80498676 Email wangjgenomicsorgcn Correspondence may also be addressed to Huanming Yang Tel +86 10 80494969 Fax +86 10 80491181 Email yanghmgenomicsorgcn The authors wish it to be known that in their opinion the first four authors should be regarded as joint First Authors The contig sequences were annotated for gene content by using automated processes that involve ab initio gene finders such as FgeneSH (httpwwwsoftberrycom Genome Research 141932-1937 2004 Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome

Jinsheng Lai1 Nrisingha Dey2 Cheol-Soo Kim35 Arvind K Bharti1 Stephen Rudd46 Klaus FX Mayer4 Brian A Larkins3 Philip Becraft2 and Joachim Messing17 1 Waksman Institute Rutgers The State University of New Jersey Piscataway New Jersey 08854 USA 2 Department of Genetics Development amp Cell Biology Iowa State University Ames Iowa 50011 USA 3 Department of Plant Science University of Arizona Tucson Arizona 85721 USA 4 Munich Information Center for Protein Sequences Institute for Bioinformatics GSF Research Center for Environment and Health Neuherberg Germany A total of 54397 putative genes could be predicted for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for Plant Molecular Biology DOI 101023BPLAN000002876821587dc Issue Volume 54 Number 1 Date January 2004 Pages 55 - 69

Dynamics of the evolution of orthologous and paralogous portions of a complex locus region in two genomes of allopolyploid wheat

Xiu-Ying Kong1 2 Yong Qiang Gu3 Frank M You4 Jorge Dubcovsky4 and Olin D Anderson3

1 Genetic Resources Conservation Program University of California Davis CA 95616 USA 2 Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081

China 3 US Dept of Agriculture Western Regional Research Center Agricultural Research Service 800

Buchanan Street Albany CA 94710 USA 4 Department of Agronomy and Range Sciences University of California Davis CA 95616 USA FGENESH (httpwwwsoftberrycomberryphtml) and GENES- CAN (httpgenemark miteduGENESCANhtml) were used for gene prediction Current Genetics DOI 101007s00294-003-0451-y Issue Volume 44 Number 6 Date January 2004 Pages 329 - 338

Chromosome rearrangements in isolates that escape from het-c heterokaryon incompatibility in Neurospora crassa

Qijun Xiang1 and N Louise Glass1

Department of Plant and Microbial Biology University of California Berkeley CA 94720-3102 USA Hypothetical proteins are predicted from FGENESH calls with overlapping Blastx hits (but not with trusted homology) while Predicted Molecular Genetics and Genomics DOI 101007s00438-004-0990-z Issue Volume 271 Number 4 Date May 2004 Pages 402 - 415

Genome-wide identification of NBS genes in japonica rice reveals significant expansion of divergent non-TIR NBS-LRR genes

T Zhou1 Y Wang1 J-Q Chen1 H Araki2 Z Jing1 K Jiang1 J Shen1 and D Tian1

1 State Key Laboratory of Pharmaceutical Biotechnology Department of Biology Nanjing University 210093 Nanjing China

2 Department of Ecology and Evolution University of Chicago Chicago IL 60637 USA to 5000ndash10000 bp from both ends of the hits and then the expanded nucleotide fragments were reannotated using the gene-finding programs FGENESH (http www Proc Natl Acad Sci U S A 2004 June 15 101(24) 9045ndash9050 Genetics

Genetic control of branching in foxtail millet Andrew N Doust Katrien M DevosDaggerdagger Michael D Gadberrysect Mike D GaleDagger and Elizabeth A KelloggDepartment of Biology University of Missouri 8001 Natural Bridge Road St Louis MO 63121 and DaggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH United Kingdom dagger To whom correspondence should be addressed E-mail adoustumsledu sectPresent address Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building University of Georgia Athens GA 30602 Communicated by Peter H Raven Missouri Botanical Garden St Louis MO April 23 2004 Each of these contigs was scanned by using fgenesh (28) and identified ORFs were translated and compared with ORFs from other contigs from the same QTL region Mol Biol Evol 21(9)1769-1780 2004 Merlin a New Superfamily of DNA Transposons Identified in Diverse Animal Genomes and Related to Bacterial IS1016 Insertion Sequences Ceacutedric Feschotte1 Departments of Plant Biology and Genetics The University of Georgia Athens Correspondence E-mail cedricplantbiougaedu coding sequences were assembled by removing introns predicted with more than 85 confidence by NetGene2 (httpwwwcbsdtudk) andor FGENESH (httpgenomic Genome Research 141924ndash1931 copy2004 Gene Loss and Movement in the Maize Genome Jinsheng Lai1 Jianxin Ma23 Zuzana Swigonˇovб1 Wusirika Ramakrishna24

Eric Linton15 Victor Llaca16 Bahattin Tanyolac17 Yong-Jin Park28 O-Young Jeong29

Jeffrey L Bennetzen23 and Joachim Messing110 1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854-8020 USA 2Department of Biological Sciences Purdue University West Lafayette Indiana 47907-1392 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602-7223 USA 4Department of Biological Sciences 740 DOW Michigan Tech University Houghton MI 49931 USA 5Plant Biology Labs Michigan State University East Lansing MI 48824 USA 6Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880-0353 USA 7Department of Bioengineering Ege University Izmir 35100 Turkey 8National Institute of Agricultural Biotechnology Suwon 441-707 Republic of Korea 9National Institute of Crop Science Suwon 441-857 Republic of Korea 10Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 The FGENESH program predicted four of which three (gene 1d in the maize orp1 region gene 5a 5b in the rice r1 region) would produce truncated proteins

Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Molecular Microbiology Volume 54 Issue 2 Page 407 - October 2004 doi101111j1365-2958200404310x Cryptococcus neoformans Kin1 protein kinase homologue identified through a Caenorhabditis elegans screen promotes virulence in mammals Eleftherios Mylonakis1 Alexander Idnurm2 Roberto Moreno1 Joseph El Khoury134 James B Rottman5 Frederick M Ausubel67 Joseph Heitman28910 and Stephen B Calderwood1111Division of Infectious Diseases Massachusetts General Hospital Boston MA 02114 USA 2Department of Molecular Genetics and Microbiology Duke University Medical Center Durham NC 27710 USA 3Center for Immunology and Inflammatory Diseases and 4Division of Rheumatology Allergy and Immunology Massachusetts General Hospital Boston MA 02114 USA 5Archemix Corporation Cambridge MA 02139 USA 6Department of Molecular Biology Massachusetts General Hospital Boston MA 02114 USA 7Department of Genetics Harvard Medical School Boston MA 02115 USA 8Division of Infectious Diseases 9Department of Medicine and 10Howard Hughes Medical Institute Duke University Medical Center Durham NC 27710 USA 11Department of Microbiology and Molecular Genetics Harvard Medical School Boston MA 02115 USA E-mail scalderwoodpartnersorg

Sequences were compared with the H99 genome database at Duke University and genes predicted in these regions by FGENESH software ( http wwwsoftberrycom TAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Original Paper Characterization of soybean genomic features by analysis of its expressed sequence tags Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

1 Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

2 Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China Jin-Song Zhang Email jszhanggeneticsaccnShou-Yi Chen Email sychengeneticsaccn six BAC-contig sequences of M truncatula were analyzed and the results based on the gene prediction program FGENSH (Arabidopsis matchFGENESH prediction) (http Current Proteomics Volume 1 Number 1 January 2004 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE11 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Chinese Science Bulletin 2004 Vol 49 No 4 355-362 The VER2 promoter contains repeated sequences and requires vernalization for its activity in winter wheat (Triticum aestivum L) XU Wenzhong

1 WANG Xin

1 FENG Qi

2 ZHANG Lei

2 LIU Yaoguang

3 HAN Bin

2 CHONG

Kang1 XU Zhihong

1 amp TAN Kehui

1

1 Research Center for Molecular Developmental Biology Key Lab of Photosynthesis and Environmental Molecular Physiology Institute of Botany Chinese Academy of Sciences (CAS) Beijing 100093 China 2 National Center for Gene Research CAS Shanghai 200233 China 3 Genetic Engineering Laboratory College of Life Sciences South China Agricultural University Guangzhou 510642 China Correspondence should be addressed to Chong Kang (e-mail mailtochongknsibcasaccn) Sequence analyses were finished using biological softwares on Internet such as FGENESH 10 (Prediction of potential genes in Plant (Dct) genomic DNA)

TAG Theoretical and Applied Genetics DOI 101007s00122-003-1457-z Issue Volume 108 Number 3 Date February 2004 Pages 392 - 400

Sequence variations of simple sequence repeats on chromosome-4 in two subspecies of the Asian cultivated rice Can Li1 Yu Zhang1 Kai Ying1 Xiaolei Liang1 and Bin Han1

(1) National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China To characterize the possible relationship between SSRs and genes predicted by using FGENESH we investigated the distribution of SSRs in the rice chromosome-4 Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene TAG Theoretical and Applied Genetics DOI 101007s00122-004-1591-2 Issue Volume 108 Number 8 Date May 2004 Pages 1449 - 1457

Positional cloning of the rice Rf-1 gene a restorer of BT-type cytoplasmic male sterility that encodes a mitochondria-targeting PPR protein

H Akagi1 A Nakamura2 Y Yokozeki-Misono2 A Inagaki2 4 H Takahashi1 K Mori1 and T Fujimura3

1 Laboratory of Plant Breeding and Genetics Department of Biological Production Faculty of Bioresource Sciences Akita Prefectural University Kaidoubata-Nishi 241-7 Shimoshinjyo-Nakano 010-0195 Akita Japan

2 Biochemical Technology Section Life Science Laboratory Performance Materials RampD Center Mitsui Chemicals Togo 1144 297-0017 Mobara Japan

3 Institute of Agricultural and Forest Engineering University of Tsukuba Tennoudai 1-1-1 Tsukuba 305-8572 Ibaraki Japan

4 Present address Department of Synthetic Chemistry and Biological Chemistry Graduate School of Engineering Kyoto University Yoshida-Honmachi Sakyo-ku 606-8501 Kyoto japan

H Akagi Email akagiakita-puacjp Software Develop- ment Tokyo) Genomic sequences were also analyzed using gene prediction programs genescan and fgenesh Table 1 DNA Genome Research 14942-950 2004

The Ensembl Automatic Gene Annotation System Val Curwen1 Eduardo Eyras1 T Daniel Andrews1 Laura Clarke1 Emmanuel Mongin2 Steven MJ Searle1 and Michele Clamp34 1 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 EMBL European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SD UK 3 The Broad Institute Cambridge Massachusetts 02141 USA Commonly we use Genscan for ab initio prediction in human mouse and rat but the system is equally applicable to other methods such as FgenesH (Solovyev et Gene 324 (2004) 105ndash115 Transcript abundance of rml1 encoding a putative GT1-like factor in rice is up-regulated by Magnaporthe grisea and down-regulated by light Rong Wang ab Guofan Hongab Bin Hana a

National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China b

Shanghai Institute of Biochemistry and Cell Biology Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 320 Yueyang Road Shanghai 200031 China 1A and 2A) The structure of rml1 _ a is as same as that predicted by FGENESH software The 3V UTR of rml1 _ a is confirmed with the length of 596 bp Genome Biology 2004 5R46 doi101186gb-2004-5-7-r46 Identification of conserved gene structures and carboxy-terminal motifs in the Myb gene family of Arabidopsis and Oryza sativa L ssp Indica Cizhong Jiang1 Xun Gu1 2 and Thomas Peterson1 1Department of Genetics Development and Cell Biology and Department of Agronomy Iowa State University Ames IA 50011 USA 2LHB Center for Bioinformatics and Biological Statistics Iowa State University Ames IA 50011 USA FGeneSH has been used successfully to predict genes in rice [9] and GenScan was used together with it to predict genes by taking rice genomic sequences as Molecular Microbiology 53 (5) 1307-1318 - September 2004 doi 101111 j1365-2958200404215x The sirodesmin biosynthetic gene cluster of the plant pathogenic fungus Leptosphaeria maculans Donald M Gardiner1 Anton J Cozijnsen1 Leanne M Wilson1 M Soledade C Pedras2 and Barbara J Howlett1

1School of Botany The University of Melbourne Victoria Australia 3010 2Department of Chemistry University of Saskatchewan 110 Science Place Saskatoon SK Canada S7N 5C9 E-mail dgardinerpgradunimelbeduau http wwwtigrorg Putative genes were predicted using Fgenesh software at http wwwsoftberrycom Fungal culture The wild type Journal of Molecular Evolution DOI 101007s00239-004-2666-z Issue Volume 59 Number 6 Date December 2004 Pages 761 - 770 Analysis of the Molecular Evolutionary History of the Ascorbate Peroxidase Gene Family Inferences from the Rice Genome Felipe Karam Teixeira1 Larissa Menezes-Benavente1 Rogeacuterio Margis1 2 and Maacutercia Margis-Pinheiro1

(1) Laboratoacuterio de Geneacutetica Molecular Vegetal Departamento de Geneacutetica UFRJ 21944-970 Rio de Janeiro Brasil(2) Departamento de Bioquiacutemica Instituto de Quiacutemica UFRJ 21944-970 Rio de Janeiro Brasil Genomic se- quences were also analyzed in the FGENESH gene structure pre- diction program (httpwwwsoftberrycom) (Solovyev 2001) and GeneMark program (http Incomplete gene structure prediction with almost 100 specificity SL Chin J Xiong T Ioerger SH Sze - 2004 - txspacetamuedu iv Fgenesh-C The proposed gene structure prediction algorithm by far has the best 25 12 Comparison against Est2Genome Sim4 Spidey Fgenesh-c Journal of Biotechnology 109 (2004) 217ndash226

Preparation of single rice chromosome for construction of a DNA library using a laser microbeam trap

Xiaohui Liu a1 Haowei Wang b1 Yinmei Li b Yesheng Tang a Yilei Liu a Xin u a Peixin Jia a Kai Ying a Qi Feng a Jianping Guan a Chaoqing Jin a Lei Zhang a Liren Lou b Zhuan Zhou c Bin Han aA National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200233 China B Department of Physics University of Science and Technology of China Hefei 230026 China C Institute of Neuroscience Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200031 China Corresponding author Fax +86-21-64825775 E-mail address bhanncgraccn (B Han) ers These sequences were further annotated using gene-prediction software FGENESH to give the pos- sible protein-coding region Science 303 1364-1367 Medicago truncatula DMI1 Required for Bacterial and Fungal Symbioses in Legumes Ane et al (2004) 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberry phtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Proc Natl Acad Sci U S A 2004 August 24 101(34) 12404ndash12410 Inaugural Articles Rapid recent growth and divergence of rice nuclear genomes Jianxin Ma and Jeffrey L Bennetzen

Department of Genetics University of Georgia Athens GA 30602 To whom correspondence should be addressed E-mail maizeugaedu Contributed by Jeffrey L Bennetzen May 25 2004 Almost all LTR-retrotransposons including solo LTRs identified in our studies were predicted as genes by the gene-finding program fgenesh (data not shown) The Plant Journal Volume 37 Issue 4 Page 517 -527 - February 2004 doi101046j1365-313X200301976x Xa26 a gene conferring resistance to Xanthomonas oryzae pv oryzae in rice encodes an LRR receptor kinase-like protein Xinli Sun Yinglong Cao Zhifen Yang Caiguo Xu Xianghua Li Shiping Wang and Qifa Zhang National Key Laboratory of Crop Genetic Improvement National Center of Crop Molecular Breeding Huazhong Agricultural University Wuhan 430070 China For correspondence (fax +86 27 87287092 e-mail swangmailhzaueducn) al 1997) Gene prediction programs used were genscan (Burge and Karlin 1997) and fgenesh (httpwwwsoftberrycom) Promoter Genome Research 141916ndash1923 copy2004 by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonˇovaґ16 Jinsheng Lai16 Jianxin Ma23 Wusirika Ramakrishna24

Victor Llaca15 Jeffrey L Bennetzen23 and Joachim Messing17

1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854 USA

2Department of Biological Sciences and Genetics Program West Lafayette Indiana 47907 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602 USA 4Department of Biological Sciences Michigan Tech University MI 49931 USA 5Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880 USA 6These authors contributed equally to this work 7Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 Page 1 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonovaacute 16 Jinsheng Lai 16 Jianxin Ma 23 Wusirika Ramakrishna 24 TAG Theoretical and Applied Genetics DOI 101007s00122-004-1667-z Issue Volume 109 Number 3 Date August 2004 Pages 515 - 522

Physical mapping and putative candidate gene identification of a quantitative trait locus Ctb1 for cold tolerance at the booting stage of rice K Saito1 Y Hayano-Saito1 W Maruyama-Funatsuki1 Y Sato1 and A Kato1

(1) National Agricultural Research Center for Hokkaido Region Hitsujigaoka 1 Toyohira Sapporo Hokkaido 062-8555 JapanK Saito Email kjsaitoaffrcgojp GENSCAN RICEHMM FGENESH MZEF ) a splice prediction program ( SPLICEPREDIC- TOR ) homology search analysis programs ( BLAST HMMER TAG Theoretical and Applied Genetics DOI 101007s00122-004-1697-6 Issue Volume 109 Number 4 Date August 2004 Pages 690 - 699 The anthracnose resistance locus Co-4 of common bean is located on chromosome 3 and contains putative disease resistance-related genes M Melotto1 4 M F Coelho1 A Pedrosa-Harand2 J D Kelly3 and L E A Camargo1

1 Departamento de Fitopatologia Laboratoacuterio de Geneacutetica Molecular ESALQ Universidade de Satildeo Paulo Piracicaba SP CP 9 13418-900 Brazil

2 Department of Cell Biology and Genetics Institute of Botany University of Vienna Rennweg 14 Vienna 1030 Austria

3 Department of Crop and Soil Sciences Michigan State University East Lansing MI 48824 USA 4 Present address MSU-DOE Plant Research Laboratory Michigan State University 206 Plant Biology

Building East Lansing MI 48824 USA M Melotto Email melottommsuedu and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (httpwww softberrycom)mdashusing Arabidopsis as the model or- ganism Journal of Genetics Vol 83 No 1 P 79-99 April 2004 Structural and functional analysis of rice genome Tyagi A K Khurana J P Khurana P Raghuvanshi S Gaur A Kapur A Gupta V Kumar D Ravi V Vij S Khurana P and Sharma S Department of Plant Molecular Biology University of Delhi South Campus Benito Juarez Road New Delhi 110 021 India It inte- grates results from several gene prediction software such as GENSCAN (Burge and Karlin 1997) FGENESH (Sala- mov and Solovyev 2000) RiceHMM (Sakata The Plant Cell 161220-1234 (2004) Comparative Analysis of the Receptor-Like Kinase Family in Arabidopsis and Rice Shin-Han Shiua Wojciech M Karlowskib Runsun Panad Yun-Huei Tzengac Klaus F X Mayerb and Wen-Hsiung Lia1 a Department of Ecology and Evolution University of Chicago Chicago Illinois 60637 b Munich Information Center for Protein SequencesInstitute of Bioinformatics GSF National Research Center for Environment and Health Neuherberg 85764 Germany c Department of Mathematics National Tsing Hua University Hsinchu Taiwan 300 d Institute of Information Science Academia Sinica Taiwan 115

1 To whom correspondence should be addressed E-mail whliuchicagoedu fax 773-702-9740 a permissive E value cutoff of 1 The rice genes from the indica subspecies was predicted using the whole genome shotgun assembly with FGENESH (Solovyev 2002 Genome Research 141474ndash1482 (2004) copy by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Incongruent Patterns of Local and Global Genome Size Evolution in Cotton Corrinne E Grover1 HyeRan Kim2 Rod A Wing2 Andrew H Paterson3 and Jonathan F Wendel14 1Department of Ecology Evolution and Organismal Biology Iowa State University Ames Iowa 50011 USA 2Arizona Genomics Institute University of Arizona Tucson Arizona 85721 USA 3Plant Genome Mapping Laboratory University of Georgia Athens Georgia 30602 USA hellip Potential genes were predicted by three independent programs FGENESH (httpwwwsoftberrycom) Plant Physiology May 2004 Vol 135 pp 459-470 Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four Triticeae Genomes Yong Qiang Gu Devin Coleman-Derr Xiuying Kong and Olin D Anderson United States Department of Agriculture-Agricultural Research Service Western Regional Research Center Albany California 94710 (YQG DC-D ODA) and Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081 China (XK) FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology October 2004 Vol 136 pp 3177-3190 Comparative Sequence Analysis of the Region Harboring the Hardness Locus in Barley and Its Colinear Region in Rice1

Katherine S Caldwell2 Peter Langridge and Wayne Powell Scottish Crop Research Institute Invergowrie Dundee DD2 5DA United Kingdom (KSC WP) and School of Agriculture and Wine (KSC PL) and Australian Centre for Plant Functional Genomics (PL) University of Adelaide Waite Campus Glen Osmond South Australia 5064 Australia jp Sakata et al 2002 ) which couples the integration of several programs for the prediction of open reading frames (GENSCAN RiceHMM FGENESH MZEF) with GENES amp DEVELOPMENT 18687-699 2004 pyramus and thisbe FGF genes that pattern the mesoderm of Drosophila embryos

Angelike Stathopoulos1 Bergin Tam1 Matthew Ronshaugen1 Manfred Frasch2 and Michael Levine13 1 Department of Molecular and Cell Biology Division of Genetics amp Development University of California Berkeley California 94720-3204 USA 2 Brookdale Department of Molecular Cell and Developmental Biology Mount Sinai School of Medicine New York New York 10029 USA hellip FGF protein sequences used in alignment and phylogenetic reconstruction were gathered from GenBank or inferred from genomic sequence using GENESCAN (Burge and Karlin 1997 ) and FGENESHhellip Genome Research 141888-1901 2004 Organization and Evolution of a Gene-Rich Region of the Mouse Genome A 127-Mb Region Deleted in the Del(13)Svea36H Mouse Ann-Marie Mallon14 Laurens Wilming24 Joseph Weekes1 James GR Gilbert2 Jennifer Ashurst2 Sandrine Peyrefitte2 Lucy Matthews2 Matthew Cadman1 Richard McKeone1 Chris A Sellick1 Ruth Arkell1 Marc RM Botcherby3 Mark A Strivens1 R Duncan Campbell3 Simon Gregory25 Paul Denny1 John M Hancock16 Jane Rogers2 and Steve DM Brown1 1 Medical Research Council Mammalian Genetics Unit Harwell Oxfordshire United Kingdom 2 Wellcome Trust Sanger Institute Hinxton Genome Campus United Kingdom 3 Medical Research Council Rosalind Franklin Centre for Genomics Research Hinxton Genome Campus United Kingdom hellip Ab initio gene structures were predicted using FGENESH (Salamov and Solovyev 2000 ) and GENSCANhellip Current Proteomics January 2004 vol 1 no 1 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE1 Affiliations 1 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom

sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene Nucleic Acids Research 2003 Vol 31 No 4 1148-1155 Characterization of Arabidopsis thaliana ortholog of the human breast cancer susceptibility gene 1 AtBRCA1 strongly induced by gamma rays S Lafarge and M-H Montaneacute CEA Cadarache DSV-DEVM Laboratoire de Radiobiologie Veacutegeacutetale Bat 185 F-13108 St Paul Lez Durance Cedex France To whom correspondence should be addressed Tel +33 4 42 25 35 56 Fax + 33 4 42 25 26 25 Email marie-helenemontaneceafr Received November 18 2002 Accepted December 5 2002 DDBJEMBLGenBank accession no AF515728 hellipGene structure prediction was done on software implemented on the Softberry web page (httpwwwsoftberrycom) analysis of protein domains using the SMARThellip hellipThe gene structure of At4g21070 was determined with three gene structure prediction software packages (Softberry GenScan Grail) hellip To resolve this ambiguity in intronndashexon prediction we postulated the presence of two genes given by Softberry prediction software and performed northern blotting and 5 RACE to characterize the structural organization of the At4g21070 locushellip Proc Natl Acad Sci U S A 2003 July 22 100(15) 9055ndash9060 doi 101073pnas1032999100 Plant Biology Gene expression of a gene family in maize based on noncollinear haplotypes Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway NJ 08854-8020 Communicated by Brian A Larkins University of Arizona Tucson AZ May 19 2003 (received for review 2002 April 10) To whom correspondence should be addressed E-mail messingmbclrutgersedu The FGENESH program (Softberry Mount Kisco NY) was used for gene prediction analysis BMC Genomics 2003 4 22

doi 1011861471-2164-4-22 Published online 2003 June 3 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams 12 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA 3Department of Biological Sciences Wichita State University Wichita Kansas USA

Corresponding author Sreedhar Oduru odurusreedharttuhscedu Janee L Campbell janeecampbellttuhscedu SriTulasi Karri phrskttuhscedu William J Hendry williamhendrywichitaedu Shafiq A Khan shafiqkhanttuhscedu Simon C Williams simonwilliamsttuhscedu

Two gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=H Insect Molecular Biology Volume 12 Issue 4 Page 319 - August 2003 doi101046j1365-2583200300415x Expression of an Aedes aegypti cation-chloride cotransporter and its Drosophila homologues V Filippov K Aimanova and S S GillAffiliations Department of Cell Biology and Neuroscience University of California Riverside USA Correspondence Sarjeet S Gill 5429 Boyce Hall Environmental Toxicology Graduate Program University of California Riverside CA 92521 USA Tel +1 909 787 4621 Fax +1 909 787 3087 E-mail Sarjeetgillucredu significant similarity to the Drosophila genes were used for gene structure prediction with the FGENESH program available on site http wwwsoftberrycom Developmental Biology 256 (2003) 276ndash289 tcl-2 encodes a novel protein that acts synergistically with Wnt signaling pathways in C elegans Xiaojun Zhaoa Hitoshi Sawab and Michael A Hermana a Program in Molecular Cellular and Developmental Biology Division of Biology Kansas State University Manhattan KS 66506 USA b Laboratory for Cell Fate Decision RIKEN Center for Developmental Biology 2-2-3 Minatojima-minamimachi Chuo-ku Kobe 650-0047 Japan Received for publication 10 September 2002 revised 25 November 2002 accepted 19 December 2002 hellipCbTCL-2 is conceptually translated from a gene predicted by the FGENSH (Salamov and Solovyev 2000 httpwwwsoftberrycom) using defaults for C elegans genomic sequences Proc Natl Acad Sci U S A 2003 May 27 100(11) 6569ndash6574 doi 101073pnas0732024100 Evolution Molecular paleontology of transposable elements in the Drosophila melanogaster genome Vladimir V Kapitonov and Jerzy Jurka

Genetic Information Research Institute 2081 Landings Drive Mountain View CA 94043 Communicated by Margaret G Kidwell University of Arizona Tucson AZ April 7 2003 (received for review 2002 December 23)

To whom correspondence may be addressed E-mail vladimirulamgirinstorg or jurkagirinstorg hellipWe used FGENESH (ref 18 wwwsoftberrycom) for identifying genes encoded by TEs Genetics and Molecular Biology ISSN 1415-4757 versioacuten impresa Genet Mol Biol v26 n4 Satildeo Paulo dic 2003 Iron homeostasis related genes in rice Jeferson GrossI II Ricardo Joseacute SteinII Arthur Germano Fett-NetoI II Janette Palma FettI II

IUniversidade Federal do Rio Grande do Sul Centro de Biotecnologia Porto Alegre RS Brazil IIUniversidade Federal do Rio Grande do Sul Departamento de Botacircnica Porto Alegre RS Brazil IIIBotanical Institute of Ludwig-Maximilians-Universitaumlt Muumlnchen Germany The prediction algorithms were GenScan (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) GenomeScan (Burge and Karlin 1997 httpgenesmitedugenomescanhtml) FGENESH (Salamov and Solovyev 2000 httpwwwsoftberrycomberryphtmltopic= gfind) GeneMarkhmm (Borodovsky and Lukashin unpublished httpopalbiologygatecheduGeneMarkeukhmmcgi) and GrailEXP (Xu and Uberbacher 1997 httpcompbioornlgovgrailexp) Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome J Lai N Dey CS Kim AK Bharti S Rudd KFX Mayer hellip for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for monocotyledonous genes (httpwwwsoftberrycom) Published 23 September 2003 BMC Plant Biology 2003 36 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya12 Address 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia and 2NSW Agricultural Genomics Centre Wagga Wagga Australia Email Qian-Hao Zhu - qianhaozhucsiroau Mohammad Shamsul Hoque - mohammadhoquecsiroau Elizabeth S Dennis - lizdenniscsiroau Narayana M Upadhyaya - narayanaupadhyayacsiroau Corresponding author hellip Analyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCANhellip

The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Australasian Plant Pathology Volume 32 Number 4 2003 pp 511-519 Small scale functional genomics of the blackleg fungus Leptosphaeria maculans analysis of a 38 kb region Alexander Idnurm Janet L Taylor M Soledade C Pedras and Barbara J Howlett vertebrate and Arabidopsis settings Burge and Karlin 1997) and FGENESH on Neurospora crassa and Schizosaccharomyces pombe settings (wwwsoftberrycom) as Barley Genetics Newsletter Volume 32 Hard-copy edition pages 34 - 37 MAPPING AND SEQUENCING OF THE BARLEY PUTATIVE HYPERSENSITIVE INDUCED REACTION GENES Nils Rostoks1 David Kudrna1 and Andris Kleinhofs12

1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 2 School of Molecular Biosciences Washington State University Pullman WA 99164 The full length coding sequence was reconstructed using a combination of FGENESH gene prediction program (httpwwwsoftberrycom) and alignment with cDNAs from the other barley HIR groups TAG Theoretical and Applied Genetics DOI 101007s00294-003-0391-6 Issue Volume 43 Number 5 Date August 2003 Pages 351 - 357 Characterisation of the mating-type locus of the plant pathogenic ascomycete Leptosphaeria maculans Anton J Cozijnsen A1 and Barbara J Howlett A1 A1 School of Botany The University of Melbourne 3010 Victoria Australia hellipGenes introns exons and transcription initiation sites were predicted by analysis with FGENESH (wwwsoftberrycom) on Neurospora crassa andhellip BMC Plant Biol 2003 3 6

doi 1011861471-2229-3-6 Published online 2003 September 23 Received May 23 2003 Accepted September 23 2003 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya 12 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia 2NSW Agricultural Genomics Centre Wagga Wagga Australia

Corresponding author Qian-Hao Zhu qianhaozhucsiroau Mohammad Shamsul Hoque mohammadhoquecsiroau Elizabeth S Dennis lizdenniscsiroau Narayana M Upadhyaya narayanaupadhyayacsiroau hellipAnalyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml identified a single-exon gene capable of encoding a protein with the DNA binding domain of the EREBPAP2 family of plant transcription factors [2636] 1515 bp downstream from the Ds insertion The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Alignment of EREBPAP2 domains was performed using programs of Genetics Computer Group Wisconsin software suit [11] Genetics Vol 164 655-664 June 2003 Copyright copy 2003 Map-Based Cloning of Leaf Rust Resistance Gene Lr21 From the Large and Polyploid Genome of Bread Wheat Li Huanga Steven A Brooksa Wanlong Lia John P Fellersb Harold N Tricka and Bikram S Gilla a Wheat Genetics Resource Center Department of Plant Pathology Kansas State University Manhattan Kansas 66506-5502 b USDA-ARS Plant Science and Entomology Unit Kansas State University Manhattan Kansas 66506-5502 Corresponding author Bikram S Gill 4024 Throckmorton Kansas State University Manhattan KS 66506-5502 bsgksuedu (E-mail hellipIn addition FGENSH 11 (httpwwwsoftberrycom) was used for gene prediction (with monocot genomic DNA parameters) Nucleic Acids Research 2003 Vol 31 No 1 229-233 The TIGR rice genome annotation resource annotating the rice genome and creating resources for plant biologists Qiaoping Yuan Shu Ouyang Jia Liu Bernard Suh Foo Cheung Razvan Sultana Dan Lee John Quackenbush and C Robin Buell The Institute for Genomic Research 9712 Medical Center Dr Rockville MD 20850 USA To whom correspondence should be addressed Tel +1 301 8383558 Fax +1 301 8380208 Email rbuelltigrorg Received August 14 2002 Revised and Accepted October 2 2002 hellipThe rice sequences were processed with multiple ab initio gene finders including FGENESH (httpwwwsoftberrycom)hellip hellip Working models were generated using the FGENESH output and putative identification for the gene was obtained from the most significant database match while models with no significant database match were labeled as hypothetical proteins

JXB Advance Access originally published online on June 18 2003 Journal of Experimental Botany Vol 54 No 389 pp 1995-1996 August 1 2003 Received 21 April 2003 Accepted 25 April 2003 OsSET1 a novel SET-domain-containing gene from rice Yun-Kuan Liang Ying Wang Yong Zhang Song-Gang Li Xiao-Chun Lu Hong Li Cheng Zou Zhi-Hong Xu and Shu-Nong Bai PKU-Yale Joint Research Center of Agricultural and Plant Molecular Biology National Key Laboratory of Protein Engineering and Plant Gene Engineering College of Life Sciences Peking University 5 Yiheyuan Road Beijing 100871 PR China To whom correspondence should be addressed Fax +86 10 6275 1526 E-mail shunongbpkueducn It localizes at chromosome three in rice genome at the contig 1300 (httpwwwsoftberrycomberryphtmltopic=gfindampprg=FGENESH GenBank accession number BMC Genomics 2003 422 Published 3 June 2003 Received 31 January 2003 Accepted 3 June 2003 This article is available from httpwwwbiomedcentralcom1471-2164422 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams12 Address 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA and 3Department of Biological Sciences Wichita State University Wichita Kansas USA Email Sreedhar Oduru - odurusreedharttuhscedu Janee L Campbell - janeecampbellttuhscedu SriTulasi Karri - phrskttuhscedu William J Hendry - williamhendrywichitaedu Shafiq A Khan - shafiqkhanttuhscedu Simon C Williams - simonwilliamsttuhscedu Corresponding author hellipTwo gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=HsapiensTAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Characterization of soybean genomic features by analysis of its expressed sequence tags

Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

Jin-Song Zhang Email jszhanggeneticsaccn

Shou-Yi Chen Email sychengeneticsaccn Phone +86-10-64886859 Fax +86-10-64873428

(1) Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

(2) Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China

prediction of these BAC-contig sequences was based on the gene-prediction program FGENSH (Arabidopsis matchFGENESH DDT Vol 7 No 11 (Suppl) 2002 S70-S76 wwwdrugdiscoverytodaycom Genome annotation techniques new approaches and challenges Alistair G Rust Emmanuel Mongin and Ewan Birney European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridge UK CB10 1SD tel +44 1223 494420 fax +44 1223 494468 e-mail birneyebiacuk Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Box 1 Useful human genome annotation and browser URLs Human genome browsers bull UCSC Human Genome Browser httpgenomecseucsceducgi-binhgGateway bull Softberry Genome Explorer httpwwwsoftberrycomberryphtmltopic=genomexp Ab initio gene prediction programs Ab initio gene predictors rely on the statistical qualities of exons rather than on homologies Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Celerarsquos pipeline) and GrailEXP [16] (ORNL) Proc Natl Acad Sci U S A 2002 August 20 99(17) 11423ndash11428 doi 101073pnas162276199 Neurobiology Identification of G protein-coupled receptors for Drosophila PRXamide peptides CCAP corazonin and AKH supports a theory of ligand-receptor coevolution Yoonseong Parkdagger Young-Joon Kim and Michael E AdamsdaggerDagger

Departments of Entomology and daggerCell Biology and Neuroscience 5429 Boyce Hall University of California Riverside CA 92521 Edited by Lutz Birnbaumer National Institutes of Health Research Triangle Park NC and approved June 14 2002 (received for review 2002 May 7) DaggerTo whom reprint requests should be addressed E-mail adamsmailucredu

This article has been corrected See Proc Natl Acad Sci U S A 2002 October 15 99(21) 13961b For each Drosophila GPCR prediction of gene structure was made in FGENESH (wwwsoftberrycom ref 21) by using about 20 kb of genomic sequence surrounding highly conserved regions particularly for 5 prime and 3 prime ends of ORFs Putative Drosophila GPCRs in the database were amplified by RT-PCR using primers based on gene predictions in the FGENESH gene finder (wwwsoftberrycom ref 21) 21 Salamov A A amp Solovyev V V (2000) Genome Res 10 516-522

prediction httpwwwsoftberrycomberry

Eukaryotic Cell October 2002 p 719-724 Vol 1 No 5 Isocitrate Lyase Is Essential for Pathogenicity of the Fungus Leptosphaeria maculans to Canola (Brassica napus) Alexander Idnurm and Barbara J Howlett School of Botany The University of Melbourne Melbourne Victoria 3010 Australia Received 17 June 2002 Accepted 29 July 2002 hellip The DNA sequence obtained was compared to those in the GenBank database by using BLAST (1) and genes were predicted by using FGENESH software (httpwwwsoftberrycom) and GENSCAN (wwwbionavigatorcom) Bio-Almanac GLE ToxExpress P Offering CLG Annotated sequence data The genes are identified with the FGENESH11 gene modeling software exclusively li- censed from Softberry Inc Automatic Analysis of 106 kb of contiguous DNA sequence from the D genome of wheat reveals high gene density hellip SA Brooks L Huang BS Gill JP Fellers trix In addition FGENESH 11 (httpwwwsoftberrycom) was used for CDS prediction with monocot genomic DNA parameters Both Molecular Genetics and Genomics DOI 101007s00438-002-0706-1 Issue Volume 267 Number 6 Date August 2002 Pages 713 - 720 Genome sequencing of a 239-kb region of rice chromosome 10L reveals a high frequency of gene duplication and a large chloroplast DNA insertion Q Yuan J Hill J Hsiao K Moffat S Ouyang Z Cheng J Jiang C Buell A1 The Institute for Genomic Research 9712 Medical Center Drive Rockville MD 20850 USA A2 Department of Horticulture University of Wisconsin Madison WI 53706 USA The sequences were analyzed with several gene prediction programs including FGENESH (httpwwwsoftberrycom) Genemarkhmm (rice matrix httpopalbiology

Genetics Vol 162 1389-1400 November 2002 Copyright copy 2002 Different Types and Rates of Genome Evolution Detected by Comparative Sequence Analysis of Orthologous Segments From Four Cereal Genomes Wusirika Ramakrishnaa Jorge Dubcovskyb Yong-Jin Park1a Carlos Bussob John Embertona Phillip SanMiguelc and Jeffrey L Bennetzena a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Department of Agronomy and Range Science University of California Davis California 95616 c Purdue University Genomics Core WSLR Purdue University West Lafayette Indiana 47907

Corresponding author Jeffrey L Bennetzen Hansen Bldg Purdue University West Lafayette IN 47907 maizebilbobiopurdueedu (E-mail)

FGENESH (httpwwwsoftberrycomnucleohtml) with the maize training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and

GeneMarkhmm (httpgenemarkbiologygatecheduGene Mark) Functional amp Integrative Genomics DOI 101007s10142-002-0055-5 Issue Volume 2 Numbers 1-2 Date May 2002 Pages 51 - 59 Genomic sequencing reveals gene content genomic organization and recombination relationships in barley Nils Rostoks Yong-Jin Park Wusirika Ramakrishna Jianxin Ma Arnis Druka Bryan A Shiloff Phillip J SanMiguel Zeyu Jiang Robert Brueggeman Devinder Sandhu Kulvinder Gill Jeffrey L Bennetzen Andris Kleinhofs A1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA A2 Department of Biological Sciences Purdue University West Lafayette IN 47907 USA A3 National Center for Genome Resources 2935 Rodeo Park Drive East Santa Fe NM 87505 USA A4 G302 Agronomy Hall Iowa State University Ames IA 50011-1010 USA A5 Department of Agronomy University of Nebraska Lincoln NE 68583 USA A6 School of Molecular Biosciences and Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA version 10 with maize parameters The FGENESH predictions were run at httpwwwsoftberrycom BAC genomic regions were defined Structural organization of the barley D-hordein locus in comparison with its orthologous regions of hellip YQ Gu OD Anderson CF Londeore X Kong RN hellip et al 1997) to search for additional genes In addition FGENESH (httpwwwsoftberrycomberryphtml) and GENESCAN (httpgenes

Published online before print June 20 2002 101073pnas142284999 PNAS | July 9 2002 | vol 99 | no 14 | 9328-9333 The barley stem rust-resistance gene Rpg1 is a novel disease-resistance gene with homology to receptor kinases R Brueggeman N Rostoks D Kudrna A Kilian F Han J Chensect A Druka B Steffensonpara and A Kleinhofs Department of Crop and Soil Sciences Washington State University Pullman WA 99164-6420 para Department of Plant Pathology 495 Borlaug Hall 1991 Upper Buford Circle St Paul MN 55108-6030 and School of Molecular Biosciences Washington State University Pullman WA 99164-4234 Communicated by Diter von Wettstein Washington State University Pullman WA May 13 2002 (received for review March 25 2002) The gene prediction programs GENSCAN (httpgenesmiteduGENSCANhtml) and FGENESH (httpwwwsoftberrycom) as well as NEURAL NETWORK PROMOTER

PREDICTION (httpwwwfruitflyorgseq_toolspromoterhtml) localized the putative transcription start site of the gene about 400 bp upstream of the translation start site Plant Physiol 2002 December 130(4) 1626ndash1635 doi 101104pp012179 Received July 30 2002 Accepted October 1 2002 Contiguous Genomic DNA Sequence Comprising the 19-kD Zein Gene Family from Maize1

Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway New Jersey 08854ndash8020 Corresponding author e-mail messingmbclrutgersedu fax 732ndash445ndash0072 Draft sequences generated from high-throughput DNA sequencing (phase II) were subjected to gene prediction programs with FGENESH (Softberry Inc Mount Kisco NY) The Plant Cell Vol 14 3213-3223 December 2002 Copyright copy 2002 Received July 22 2002 accepted September 26 2002 Structural Analysis of the Maize Rp1 Complex Reveals Numerous Sites and Unexpected Mechanisms of Local Rearrangement Wusirika Ramakrishnaa John Embertona Matthew Ogdena Phillip SanMiguelb and Jeffrey L Bennetzen1a a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 1 To whom correspondence should be addressed E-mail maizebilbobiopurdueedu fax 765-496-1496 FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCAN Plant Physiol 2002 December 130(4) 1728ndash1738

doi 101104pp014951 Comparative Sequence Analysis of the Sorghum Rph Region and the Maize Rp1 Resistance Gene Complex Wusirika Ramakrishna John Emberton Phillip SanMiguel Matthew Ogden Victor Llaca Joachim Messing and Jeffrey L Bennetzen

Department of Biological Sciences Purdue University West Lafayette Indiana 47907 (WR JE MO JLB) Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 (PSM) and Waksman Institute Rutgers University Piscataway New Jersey 08854 (VL JM) Corresponding author e-mail maizebilbobiopurdueedu fax 765ndash496ndash1496 Received September 19 2002 Accepted October 8 2002 hellipAnnotation and sequence analysis were performed as described earlier (Dubcovsky et al 2001 Song et al 2001 Ramakrishna et al 2002a ) FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and GeneMarkhmm (httpopalbiologygatecheduGeneMarkeukhmmcgi) Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four hellipYQ Gu D Coleman-Derr X Kong OD Anderson FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology 2004 - plantphysiolorg - plantphysiolorg - intlplantphysiolorg - ncbinlmnihgov - all 5 versions raquoA Genome-Wide Screen Identifies Genes Required for Centromeric Cohesion JJ Doyle J Denarie F Debelle JC Prome BB Amor hellip 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberryphtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Various programs Plant Molecular Biology Issue Volume 58 Number 3 Date June 2005 Pages 421 ndash 433 DOI 101007s11103-005-5702-5 OsPPR1 a pentatricopeptide repeat protein of rice is essential for the chloroplast biogenesis Kodiveri M Gothandam1 Eun-Sook Kim1 Hongjoo Cho1 and Yong-Yoon Chung1

(1) School of Life Sciences and Biotechnology Korea University Sungbuk-ku 136-701 Seoul Anam-Dong Korea nucleotide and amino acid sequences were analyzed by the Basic Local Alignment SearchTool (BLAST) and the Soft berry prog- rame (httpwwwsoftberrycom)

Plant Physiol February 2002 Vol 128 pp 336-340 wwwplantphysiolorgcgidoi101104pp010875 Received September 25 2001 returned for revision September 29 2001 accepted November 2 2001 Cellulose Synthase-Like Genes of Rice1

Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 The Rice Genome Research Program cDNA clones were of high quality all but one were viable and accurately annotated The one exception D22177 was chimeric containing OsCSLA2 at one end and a predicted DNA-binding protein at the other For all sequences the corresponding

proteins were deduced using gene prediction software from GeneMark (Atlanta httpopalbiologygatecheduGeneMark) and Softberry Inc (White Plains NY httpwwwsoftberrycom) and by manual alignment with the Arabidopsis Csl proteins and with each other IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated Silby et al Microbiology2004 150 518-520 MW Silby PB Rainey SB Levy Using SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of Plant Cell 2002 September 14(9) 2107ndash2119 Received March 25 2002 Accepted June 14 2002 Two Novel Fungal Virulence Genes Specifically Expressed in Appressoria of the Rice Blast Fungus

Chaoyang Xuea Gyungsoon Parka Woobong Choib Li Zhengc Ralph A Deanb and Jin-Rong Xua1

aDepartment of Botany and Plant Pathology Purdue University West Lafayette Indiana 47907 bDepartment of Plant Pathology North Carolina State University Raleigh North Carolina 27606 cSyngenta Agribusiness Biotechnology Research Inc Research Triangle Park North Carolina 27709 1To whom correspondence should be addressed E-mail xubtnypurdueedu fax 765-494-0363 Approximately 12- and 14-kb upstream sequences of GAS1 and GAS2 were sequenced and analyzed with several programs including TRES (wwwbioportalbicnusedusgtres) Expasy (wwwexpasyorg) and SoftBerry (wwwsoftberrycom) European Journal of Neuroscience January 2002 vol 15 no 1 pp 79-86(8) Characterizing CGI-94 (comparative gene identification-94) which is down-regulated in the hippocampus of early stage Alzheimers disease brain Heese K Nakayama T Hata R Masumura M Akatsu H Li F Nagai Y Yamamoto T Kosaka K Suemoto T Sawada T Additionally protein sequence analysis was performed using the following programs at ExPASy httpwwwexpasych softberry httpwwwsoftberrycomindex Journal of Cellular Biochemistry Volume 91 Issue 5 Pages 1030 - 1042 Published Online 25 Feb 2004 Copyright copy 2004 Wiley-Liss Inc A Wiley Company Received 4 September 2003 Accepted 21 November 2003 Characterizing the new transcription regulator protein p60TRP K Heese 1 T Yamada 1 H Akatsu 2 T Yamamoto 2 K Kosaka 2 Y Nagai 1 T Sawada 1

1BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan 2Choju Medical Institute Fukushimura Hospital 19-14 Aza-Yamanaka Noyori Toyohashi Aichi 441-8124 Japan email K Heese (heeseksilverocnnejp) Correspondence to K Heese BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan expasych) softberry httpwwwsoftberry comindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorg toolsaacomp Genome Research 14929-933 2004 ISSN 1088-9051 $500 The Ensembl Core Software Libraries Arne Stabenau1 Graham McVicker1 Craig Melsopp1 Glenn Proctor1 Michele Clamp2 and Ewan Birney13 1 EMBL European Bioinformatics Institute Wellcome Trust Genome Campus Hinxton CB10 1SD UK 2 The Broad Institute Cambridge Massachusetts 02141-2023 USA

2003 ) Finally there have been several commercial genome management products based on proprietary technology from Softberry Celera and Doubletwist Generation of T-DNA tagging lines with a bidirectional gene trap vector 2 and the establishment of hellip G An annotated in the public databases we undertook 274 annotation with the Softberry program (http 275 wwwsoftberrycomberryphtml) Functional clas- 276 Journal of Bacteriology January 2002 p 183-190 Vol 184 No 1 0021-919301$0400+0 DOI 101128JB1841183-1902002 Received 25 July 2001 Accepted 11 October 2001 Regulation of the acuF Gene Encoding Phosphoenolpyruvate Carboxykinase in the Filamentous Fungus Aspergillus nidulans Michael J Hynes Oliver W Draht and Meryl A Davis Department of Genetics University of Melbourne Parkville Victoria 3010 Australia The Protein Sequence Analysis program (httpwwwsoftberrycomproteinhtml) predicted a PEPCK (ATP) signature sequence between amino acids 275 and 290 Proteins Structure Function and GeneticsVolume 53 Issue S6 Pages 352 - 368 Supplement Fifth Meeting on the Critical Assessment of Techniques for Protein Structure Prediction Published Online 15 Oct 2003 Received 4 March 2003 Accepted 23 June 2003 Digital Object Identifier (DOI) 101002prot10543 Comp Modeling Assessment Assessment of homology-based predictions in CASP5 Anna Tramontano 1 Veronica Morea 2

1Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Rome Italy 2CNR Institute of Molecular Biology and Pathology University of Rome La Sapienza Rome Italy email Anna Tramontano (AnnaTramontanouniromalit) Correspondence to Anna Tramontano Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Ple Aldo Moro 5-00185 Rome Italy hellipPage 1 Assessment of Homology-Based Predictions in CASP5 Anna Tramontano 1 and Veronica Morea 2 1 Department of Biochemical Sciences American Journal of HematologyVolume 73 Issue 3 Pages 161 - 168 Published Online 20 Jun 2003 Received 23 October 2002 Accepted 15 April 2003 Digital Object Identifier (DOI) 101002ajh10358

Spectrum of thalassemia mutations and HbF levels in the heterozygous Moroccan population Wafaa Lemsaddek 1 Isabel Picanccedilo 2 Filomena Seuanes 2 Lahoucine Mahmal 3 Saacircd Benchekroun 3 Mohammed Khattab 4 Paulo Nogueira 5 Leonor Osoacuterio-Almeida 1

1Laboratoacuterio de Geneacutetica Molecular Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa Caparica Portugal 2Laboratoacuterio de Hematologia Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal 3Service Heacutemato-Oncologie Hocircpital 20 Ao t CHU Ibn Rochd Casablanca Morocco 4Service Heacutemato-Oncologie Peacutediatrique Hocircpital dEnfants CHU Rabat Morocco 5Observatoacuterio Nacional de Sauacutede Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal email Leonor Osoacuterio-Almeida (mlofctunlpt) Correspondence to Leonor Osoacuterio-Almeida Laboratoacuterio de Geneacutetica Molecular Secccedilatildeo Autoacutenoma de Biotecnologia Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa 2829-516 Caparica Portugal hellip Page 1 Spectrum of Thalassemia Mutations and HbF Levels in the Heterozygous Moroccan Population Wafaa Lemsaddek 1 Isabel Picanccedilo Lecture Notes in Computer Science Publisher Springer-Verlag Heidelberg ISSN 0302-9743 Subject Computer Science Volume 2812 2003 Title Algorithms in Bioinformatics Third International Workshop WABI 2003 Budapest Hungary September 15-20 2003 Proceedings Editors Gary Benson Roderic Page ISBN 3-540-20076-2 DOI 101007b13243 Chapter pp 124 - 138 Online Date December 2003 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron1 and Irena Rusu1

(1) IRIN Universiteacute de Nantes 2 Rue de la Houssiniegravere BP 92208 44322 Nantes Cedex 3 France Page 1 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron and Irena Rusu Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Received 23 October 2003 accepted 15 December 2003 Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes

H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Brain Aging Vol 2 No 3 2002 Page 9-22 Long-term Efficacy of Cholinesterase Inhibitors Serge Gauthier McGill Centre for Studies in Aging Quebec Canada Correspondence Dr Serge Gauthier FRCPC McGill Centre for Studies in Aging 6825 LaSalle Boulevard Verdun Quebec Canada H4H 1R3 Tel +1 514 766 2010 Fax +1 514 888 4050 Email sergegauthiermcgillca hellipAdditionally protein sequence analysis was performed using the following programs at the ExPASyndashwwwndashserver (httpwwwexpasych) softberry httpwwwsoftberrycomindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorgtoolsaacomp Plant Physiology December 2003 Vol 133 pp 2040ndash2047 Generation and Analysis of End Sequence Database for T-DNA Tagging Lines in Rice1 Suyoung An2 Sunhee Park2 Dong-Hoon Jeong Dong-Yeon Lee Hong-Gyu Kang Jung-Hwa Yu Junghe Hur Sung-Ryul Kim Young-Hea Kim Miok Lee Soonki Han Soo-Jin Kim Jungwon Yang Eunjoo Kim Soo Jin Wi Hoo Sun Chung Jong-Pil Hong Vitnary Choe Hak-Kyung Lee Jung-Hee Choi Jongmin Nam Seong-Ryong Kim Phun-Bum Park Ky Young Park Woo Taek Kim Sunghwa Choe Chin-Bum Lee and Gynheung An National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790ndash784 Korea (SA SP D-HJ D-YL H-GK J-HY JH S-RK Y-HK ML GA) Department of Life Science Sogang University Seoul 121ndash742 Korea (SH S-JK S-RK) Department of Genetic Engineering Suwon University Suwon 445ndash743 Korea (JY EK P-BP) Department of Biology Sunchon National University Sunchon 540ndash742 Korea (SJW KYP) Department of

Biology Yonsei University Seoul 120ndash749 Korea (HSC J-PH WTK) Department of Biology Seoul National University Seoul 151ndash747 Korea (VC SC) Department of Biology Dong-eui University Pusan 614ndash714 Korea (H-KL J-HC C-BL) and Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University 208 Mueller Laboratory University Park Pennsylvania 16802 (JN) hellipIf a particular sequence had not yet been annotated in the public database the sequence surrounding the insertion site was annotated using the Softberry program (httpwwwsoftberrycom) and the GeneMark program (httpopalbiologygatecheduGeneMark) Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie F Moehrlen ndash Page 1 Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie Frank Moumlhrlen Heidelberg 2002 Page 2 INAUGURAL-DISSERTATION zur heidiubuni-heidelbergde

  • FGENES
  • FGENESH
  • FGENESH++
    • Genome Research 15566-576 2005
    • ECgene Genome-based EST clustering and gene modeling for alternative splicing
    • Genome Research 14685-692 2004
    • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
    • Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes
    • Genome Research
      • Computing Center Academia Sinica Taipei 11529 Taiwan
        • FGENESH+
          • Genome Research 14685-692 2004
          • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
            • Computing Center Academia Sinica Taipei 11529 Taiwan
                • FGENESB
                  • Proteorhodopsin genes are distributed among divergent marine bacterial taxa
                  • Different SAR86 subgroups harbour divergent proteorhodopsins
                    • FGENESV
                      • Genome Organization of the SARS-CoV
                        • FGENES-M
                        • BESTORF
                        • PROTCOMP
                          • A proteomic study of the arabidopsis nuclear matrix
                          • Gene prediction in eukaryota
                            • BPROM
                            • SPLICEDB
                              • The evolving roles of alternative splicing
                              • SpliceDB database of canonical and non-canonical mammalian splice sites
                                • SCAN2
                                  • PromH promoters identification using orthologous genomic sequences
                                    • PDISORDER
                                    • SPL
                                    • NSITE
                                    • TSSP
                                    • PLANTPROM
                                      • Plant promoter prediction with confidence estimation
                                      • PlantProm a database of plant promoter sequences
                                        • PROMH
                                        • Other FGENESH
                                          • TAG Theoretical and Applied Genetics
                                            • MIPS analysis and annotation of proteins from whole genomes
                                              • Gene expression of a gene family in maize based on noncollinear haplotypes
                                              • TAG Theoretical and Applied Genetics
                                              • Genome annotation techniques new approaches and challenges
                                                • Various programs
                                                  • Characterizing the new transcription regulator protein p60TRP
                                                  • The Ensembl Core Software Libraries
                                                  • Lecture Notes in Computer Science
                                                  • Long-term Efficacy of Cholinesterase Inhibitors
Page 4: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)

FGENESH European Journal of Plant Pathology Issue Volume 112 Number 1 Date May 2005 Pages 23 - 29 DOI 101007s10658-004-7088-7 Leptosphaeria maculans a fungal pathogen of Brassica napus secretes a subtilisin-like serine protease Leanne M Wilson1 and Barbara J Howlett1

1 School of Botany The University of Melbourne Parkville Victoria 3010 Australia DNA and cDNA sequences were compared to identify intron positions which confirmed those predicted by FGENESH gene prediction software (wwwsoftberrycom) Current Genetics Issue Volume 47 Number 5 Date May 2005 Pages 307 ndash 315 DOI 101007s00294-004-0559-8 During attachment Phytophthora spores secrete proteins containing thrombospondin type 1 repeats Andrea V Robold1 and Adrienne R Hardham1

(1) Plant Cell Biology Group Research School of Biological Sciences The Australian National University Canberra ACT 2601 Australia infohtml) The DNA sequence was searched for introns using the soft- ware program FGENESH (httpwwwsoftberry comberryphtml Microbiology 151 (2005) 1499-1505 DOI 101099mic027759-0 Overproduction purification and characterization of FgaPT2 a dimethylallyltryptophan synthase from Aspergillus fumigatus Inge A Unsoumlld and Shu-Ming Li Pharmazeutische Biologie Pharmazeutisches Institut Eberhard-Karls-Universitaumlt Tuumlbingen Auf der Morgenstelle 8 72076 Tuumlbingen Germany Correspondence Shu-Ming Li shumingliuni-tuebingende FGENESH (Softberry wwwsoftberrycomberryphtml) and the DNASIS software package (version 21 Hitachi Software Engineering) were used for intron prediction Eukaryotic Cell March 2005 p 526-535 Vol 4 No 3 doi101128EC43526-5352005 Sex-Specific Homeodomain Proteins Sxi1 and Sxi2a Coordinately Regulate Sexual Development in Cryptococcus neoformans Christina M Hull1 Marie-Josee Boily and Joseph Heitman 1 12

Department of Molecular Genetics and Microbiology1 the Howard Hughes Medical Institute Duke University Medical Center Durham North Carolina2 Received 14 January 2005 Accepted 14 January 2005 Corresponding author Mailing address 322 CARL Building Box 3546 Department of Molecular Genetics and Microbiology 100 Research Dr Duke University Medical Center Durham NC 27710 Phone (919) 684-2824 Fax (919) 684-5458 E-mail heitm001dukeedu Present address Departments of Biomolecular Chemistry and Medical Microbiology amp

Immunology University of WisconsinmdashMadison Madison WI 53706

Sequence manipulations Splice predictions of candidate gene sequences for SXI2a were facilitated with a Softberry algorithm (wwwsoftberrycom)

Plant Mol Biol 2005 Feb57(3)445-60 Evaluation of five ab initio gene prediction programs for the discovery of maize genes Yao H Guo L Fu Y Borsuk LA Wen TJ Skibbe DS Cui X Scheffler BE Cao J Emrich SJ Ashlock DA Schnable PS Department of Genetics Development and Cell Biology Iowa State University Ames 50011-3650 USA hellipThe five programs were used in conjunction with RT-PCR to identify and establish the structures of two new genes in the a1-sh2 interval of the maize genome FGENESH GeneMarkhmm and GENSCAN were tested on a larger data set consisting of maize assembled genomic islands (MAGIs) that had been aligned to ESTs FGENESH GeneMarkhmm and GENSCAN correctly predicted gene models in 773 625 and 371 MAGIs respectively out of the 1353 MAGIs that comprise data set 2hellip New Phytologist 167 (1) July 2005 239-247 doi101111j1469-8137200501392x Identification of perennial ryegrass (Lolium perenne (L)) and meadow fescue (Festuca pratensis (Huds)) candidate orthologous sequences to the rice Hd1(Se1) and barley HvCO1 CONSTANS-like genes through comparative mapping and microsynteny P Armstead L Skoslasht L B Turner K Skoslasht I S Donnison M O Humphreys and I P King Author for correspondence Ian Armstead Tel +44 (0)1970 823108 Fax +44 (0)1970 823242 Email ianarmsteadbbsrcacuk Predictions of mRNA and protein sequences were carried out using fgenesh and fgenesh+ software ( http wwwsoftberrycom berryphtml ) Plant Physiology May 2005 Vol 138 pp 38-46 BIOINFORMATICS-PLANT DATABASES Databases and Information Integration for the Medicago truncatula Genome and Transcriptome1 Steven B Cannon John A Crow Michael L Heuer Xiaohong Wang Ethalinda KS Cannon Christopher Dwan Anne-Francoise Lamblin Jayprakash Vasdewani Joann Mudge Andrew Cook John Gish Foo Cheung Steve Kenton Timothy M Kunau Douglas Brown Gregory D May Dongjin Kim Douglas R Cook Bruce A Roe Chris D Town Nevin D Young and Ernest F Retzel

2 3

Department of Plant Pathology University of Minnesota St Paul Minnesota 55108 (SBC XW EKSC JV JM NDY) Center for Computational Genomics and Bioinformatics University of Minnesota Minneapolis Minnesota 55455 (JAC MLH CD AFL TMK

EFR) Department of Plant Pathology University of California Davis California 95616 (AC JG DJK DRC) The Institute for Genomic Research Rockville Maryland 20850 (FC CDT) Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma 73019 (BAR SK) Plant Biology Division The Samuel Roberts Noble Foundation Ardmore Oklahoma 73401 (GDM) and North Carolina State University Fungal Genomics Laboratory Department of Plant Pathology Raleigh North Carolina 27695 (DB) 1 This work was supported by the National Science Foundation (awards DBIndash0321460 DBIndash0196197 DBIndash0110206 DBIndash9975806 and DBIndash9872565) by the US Department of Agriculture Cooperative State Research Education and Extension ServiceNational Research Initiative Program and by the Samuel Noble Roberts Foundation 2 Present address BioTeam Inc Cambridge MA 3 Present address University of Minnesota Cancer Center MMC 806 420 Delaware St SE Minneapolis MN 55455 wwwplantphysiolorgcgidoi101104pp104059204 Corresponding author e-mail ernestccgbumnedu fax 612ndash626ndash6069 Received December 31 2004 returned for revision March 4 2005 accepted March 21 2005 Annotation involves a multi-institution pipeline relying on Medicago-trained FGENESH (Salamov and Solovyev 2000 ) predictions the EuGene (Foissac et al Plant Physiology May 2005 Vol 138 pp 18-26 BIOINFORMATICS-PLANT DATABASES The Institute for Genomic Research Osa1 Rice Genome Annotation Database1 Qiaoping Yuan Shu Ouyang Aihui Wang Wei Zhu Rama Maiti Haining Lin John Hamilton Brian Haas Razvan Sultana Foo Cheung Jennifer Wortman and C Robin Buell

2

The Institute for Genomic Research Rockville Maryland 20850 1 This work (on rice genome annotation) was supported by the National Science Foundation (grant no DBIndash0321538 to CRB) and the US Department of Agriculture (grant no 2003ndash35317ndash13173 to CRB) 2 Present address Laboratory of Neurogenetics NIAAA NIH 5625 Fishers Lane Suite 3532 MSC 9412 Bethesda MD 20892 wwwplantphysiolorgcgidoi101104pp104059063 Corresponding author e-mail rbuelltigrorg fax 301ndash838ndash0208 Received December 31 2004 returned for revision February 24 2005 accepted March 21 2005 The ab initio gene finders used in the rice EGC pipeline include FGENESH (monocot matrix Salamov and Solovyev 2000 ) GeneMarkhmm (rice matrix Lukashin and Genome Research 15577-582 2005 Closing in on the C elegans ORFeome by cloning TWINSCAN predictions Chaochun Wei Philippe Lamesch Manimozhiyan Arumugam Jennifer Rosenberg Ping Hu Marc Vidal and Michael R Brent

1 2 1 2

1 2 13

1 Laboratory for Computational Genomics and Department of Computer Science and Engineering Washington University St Louis Missouri 63130 USA 2 Center for Cancer Systems Biology Dana-Farber Cancer Institute and Department of Genetics Harvard Medical School Boston Massachusetts 02115 USA

3 Corresponding author E-mail brentcsewustledu fax (314) 935-7302 Finally we compared TWINSCAN with two other gene-prediction systems that have recently been developed for nematodesndashFGENESH (Salamov and Solovyev 2000 Plant Physiology April 2005 Vol 137 pp 1174-1181 UPDATE ON SEQUENCING MEDICAGO TRUNCATULA AND LOTUS JAPONICUS Sequencing the Genespaces of Medicago truncatula and Lotus japonicus1 Nevin D Young Steven B Cannon Shusei Sato Dongjin Kim Douglas R Cook Chris D Town Bruce A Roe and Satoshi Tabata Department of Plant Pathology University of Minnesota St Paul Minnesota 55108 (NDY SBC) Kazusa DNA Research Institute Kisarazu Chiba 292ndash0818 Japan (SS ST) Department of Plant Pathology University of California Davis California 95616 (DRC DK) The Institute for Genomic Research Rockville Maryland 20850 (CDT) and Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma 73019 (BAR) 1 The US component of the Medicago truncatula sequencing effort was initially supported by a grant from the Samuel Roberts Noble Foundation to BAR Current support comes from National Science Foundation Plant Genome Research Program (grant no 0110206 to DRC DK CDT and NDY and grant no 0321460 to NDY BAR and CDT) Funding for Lotus japonicus sequencing comes from the Kazusa DNA Research Institute Foundation wwwplantphysiolorgcgidoi101104pp104057034 Corresponding author e-mail nevinyumnedu fax 612ndash625ndash9728 hellip (These estimates are based on Fgenesh predictions [Salamov and Solovyev 2000 ] using a Mt-trained matrix retaining peptides with a BLASTP match at 10endash4 to the UniProt NREF100 database of peptides [Apweiler et al 2004 ] This estimate for Lj differs from the published value of 1 gene per 101 kb [Asamizu et al 2003a ] due to the use here of the Fgenesh gene-calling algorithm so Mt and Lj could be compared directly)hellip hellipThis estimate increases to 6500 when Lj genes are predicted by the Mt-trained Fgenesh algorithm described earlierhellip International Journal of CancerVolume 109 Issue 1 Pages 71 - 75Cancer Genetics Candidate regions of tumor suppressor locus on chromosome 9q311 in gastric cancer Naoto Kakinuma 1 Kazuyoshi Kohu 1 2 Masaaki Sato 1 Tatsuya Yamada 3 Motowo Nakajima 1 Tetsu Akiyama 2 Susumu Ohwada 3 Yasuhiko Shibanaka 1

1Novartis Pharma Tsukuba Research Institute Ibaraki Japan 2Laboratory of Molecular and Genetic Information Institute for Molecular and Cellular Biosciences University of Tokyo Tokyo Japan 3Second Department of Surgery Gunma University School of Medicine Gunma Japan email Yasuhiko Shibanaka (yasuhikoshibanakapharmanovartiscom) Correspondence to Yasuhiko Shibanaka Novartis Pharma Tsukuba Research Institute Ohkubo 8 Tsukuba-shi Ibaraki 300-2611 Japan Fax +81-29-865-2281

predict the genes between D9S277 and D9S127 in 9q311 the gene prediction tools also in the UCSC Genome Browser having Acembly Ensembl Fgenesh GenScan and Genome Research 1554-66 2005 Gene and alternative splicing annotation with AIR Liliana Florea145 Valentina Di Francesco2 Jason Miller1 Russell Turner1 Alison Yao2 Michael Harris2 Brian Walenz1 Clark Mobarry1 Gennady V Merkulov3 Rosane Charlab3 Ian Dew1 Zuoming Deng3 Sorin Istrail1 Peter Li2 and Granger Sutton1 1 Informatics Research Applied Biosystems Rockville Maryland 20850 USA 2 Advanced Solutions Celera Genomics Rockville Maryland 20850 USA 3 Scientific Content and Applications Celera Genomics Rockville Maryland 20850 USA Ab initio prediction programs such as GenScan (Burge and Karlin 1997 ) FGenesH (Salamov and Solovyev 2000 ) Genie (Kulp et al Nucleic Acids Research 2005 Vol 33 Database issue D399-D402 SilkDB a knowledgebase for silkworm biology and genomics Jing Wang Qingyou Xia Ximiao He Mingtao Dai Jue Ruan Jie Chen Guo Yu Haifeng Yuan Yafeng Hu Ruiqiang Li Tao Feng Chen Ye Cheng Lu Jun Wang Songgang Li Gane Ka-Shu Wong Huanming Yang Jian Wang Zhonghuai Xiang Zeyang Zhou and Jun Yu

1 2 34 3 34 3 3

3 3 3 3 3 2 135

1 36 35 35 2

2 35

1 College of Life Sciences Peking University Beijing 100871 China 2 The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China 3 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 4 Graduate School of the Chinese Academy of Sciences Yuquan Road 19A Beijing 100039 China 5 Key Laboratory of Bioinformatics of Zhejiang Province Hangzhou Genomics Institute James D Watson Institute of Genome Sciences of Zhejiang University Hangzhou 310007 China and 6 Department of Medicine University of Washington Genome Center Seattle WA 98195 USA To whom correspondence should be addressed Tel +86 10 80481455 Fax +86 10 80498676 Email junyugenomicsorgcn Correspondence may also be addressed to Zeyang Zhou Tel +86 23 68251123 Fax +86 23 68251128 Email zyzhouswaucqcn The authors wish it to be known that in their opinion the first three authors should be regarded as joint First Authors BGF is a self-developed ab initio program based on GenScan (9) and FgeneSH (10) and was successfully utilized for our rice genome annotation (11) Clinical Cancer Research Vol 11 4029-4036 June 1 2005 Imaging Diagnosis Prognosis A Molecular Signature in Superficial Bladder Carcinoma Predicts Clinical Outcome Lars Dyrskjoslasht Karsten Zieger Mogens Kruhoslashffer Thomas Thykjaer Jens L Jensen Hanne Primdahl Natasha Aziz Niels Marcussen Klaus Moslashller and Torben F Oslashrntoft

1 12 15 15 4

1 6 3 2 1

Authors Affiliations 1 Molecular Diagnostic Laboratory Department of Clinical Biochemistry 2 Department of Urology and 3 University Institute of Pathology Aarhus University Hospital 4 Departments of Theoretical Statistics

and Mathematical Sciences University of Aarhus 5 Aros Applied Biotechnology Aarhus Denmark and 6 Eos BiotechnologyProtein Design Labs Fremont California Requests for reprints Torben F Oslashrntoft Molecular Diagnostic Laboratory Department of Clinical Biochemistry Aarhus University Hospital Skejby DK-8200 Aarhus N Denmark Phone 45-89495100 Fax 45-89496018 E-mail orntoftkiaudk array comprising 59619 probe sets representing 46000 unique sequences including known genes expressed sequence tag clusters and FGENESH-predicted exons BMC Evolutionary Biology 2005 51 doi1011861471-2148-5-1 Research article The WRKY transcription factor superfamily its origin in eukaryotes and expansion in plants Yuanji Zhang and Liangjiang Wang Address Plant Biology Division The Samuel Roberts Noble Foundation Ardmore OK 73402 USA Email Yuanji Zhang - yjzhangnobleorg Liangjiang Wang - Kevinlwangaolcom Corresponding author Despite minor differences in the gene structure prediction both gene prediction programs FGENESH and GENSCAN agree on the major features of the protein Nature Immunology 2005 v6 n3 The immunoglobulin heavy-chain locus in zebrafish identification and expression of a previously hellip N Danilova J Bussmann K Jekosch LA Steiner - - naturecom Nature Immunology Full text access provided to Googlebot Access by Web Services PLoS Biol 2005 June 3(6) e181 Published online 2005 May 24 doi 101371journalpbio0030181 RAG1 Core and V(D)J Recombination Signal Sequences Were Derived from Transib Transposons Vladimir V Kapitonov 1 and Jerzy Jurka 1

1Genetic Information Research Institute Mountain View California United States of America David Nemazee Academic EditorScripps Research Institute United States of America

Corresponding author Vladimir V Kapitonov vladimirgirinstorg Jerzy Jurka jurkagirinstorg Using FGENESH [33] we detected that the RAG1 corendashlike open reading frame (ORF) in the contig 29068 forms a terminal exon (positions 1154ndash2947) of an Genetics Published Articles Ahead of Print published on February 16 2005 as 101534genetics104036327 Identification and Characterization of Regions of the Rice Genome Associated with Broad- Spectrum Quantitative Disease Resistance Randall J Wisser Qi Sundagger Scot H Hulbertsect Stephen Kresovich and Rebecca J Nelsondaggerdagger 1 Department of Plant Breeding and Genetics Institute for Genomic Diversity Cornell University Ithaca New York 14853 daggerComputational Biology Service Unit Cornell Theory

Center Cornell University Ithaca New York 14853 sectDepartment of Plant Pathology Kansas State University Manhattan Kansas 66506 and daggerdaggerDepartment of Plant Pathology Cornell University Ithaca New York 14853 1Corresponding author Rebecca J Nelson Department of Plant Pathology Cornell University 321 Plant Science Ithaca NY 14853 Email rjn7cornelledu GENSCAN (B URGE and K ARLIN 1997) and FGENESH (S ALAMOV and S OLOVYEV 2001) to predict open reading frames Further searches against Plant Physiology January 2005 Vol 137 pp 176-189 Annotations and Functional Analyses of the Rice WRKY Gene Superfamily Reveal Positive and Negative Regulators of Abscisic Acid Signaling in Aleurone Cells1[w]

Zhen Xie Zhong-Lin Zhang Xiaolu Zou Jie Huang Paul Ruas Daniel Thompson and Qingxi J Shen

2 23

Department of Biological Sciences University of Nevada Las Vegas Nevada 891541 This work was supported by the US Department of Agriculture (grant no 02ndash35301ndash12066) by the National Institutes of Health (Biomedical Research Infrastructure Network seed grant no P20 RR16464) and by the University of Las Vegas Nevada (start-up funds to QJS) XZ was supported by a National Science Foundation Experimental Program to Stimulate Competitive

Research (EPSCoR) Integrative Approaches to Abiotic Stress (EPSndash0132556) graduate assistantship and Z-LZ by a National Science Foundation EPSCoR Advanced Computing in Environmental Sciences postdoctoral fellowship 2 These authors contributed equally to the paper 3 Present address Department of Plant and Microbial Biology University of California Berkeley CA 94720 and Plant Gene Expression Center US Department of Agriculture Albany CA 94710 [w] The online version of this article contains Web-only data First of all three genes (OsWRKY41 -43 and -44) were reannotated using FGENESH (wwwsoftberrycom) because the first introns of these genes were too small PLoS Biol 3(6) e181 (2005) RAG1 Core and V(D)J Recombination Signal Sequences Were Derived from Transib Transposons Vladimir V Kapitonov1 Jerzy Jurka1 1 Genetic Information Research Institute Mountain View California United States of America To whom correspondence should be addressed E-mail vladimirgirinstorg (VVK) E-mail jurkagirinstorg (JJ) Using FGENESH [33] we detected that the RAG1 corendashlike open reading frame (ORF) in the contig 29068 forms a terminal exon (positions 1154ndash2947) of an Plant and Cell Physiology 2005 46(1)3-13 doi101093pcppci503 From Mapping to Sequencing Post-sequencing and Beyond Takuji Sasaki Takashi Matsumoto Baltazar A Antonio and Yoshiaki Nagamura 1

National Institute of Agrobiological Sciences 2-1-2 Kannondai Tsukuba Ibaraki 305-8602 Japan 1 Corresponding author E-mail tsasakiniasaffrcgojp

The gene predictions by programs such as Genescan (Burge and Karlin 1997 ) FGENESH [see Appendix 1 (4)] and Genemark [see Appendix 1 (5)] BLAST (Altschul et Improving the nutritional value of Golden Rice through increased pro-vitamin A content JA Paine CA Shipton S Chaggar RM Howells MJ hellip - Nature Biotechnology 2005 - naturecom Arabidopsis thaliana psy and rice psy (AY024351) genes identified genomic sequences of similarity in which genes were predicted using FGENESH algorithm with Genetics Published Articles Ahead of Print published on January 16 2005 as 101534genetics104035543 THE GENETIC BASIS FOR INFLORESCENCE VARIATION BETWEEN FOXTAIL AND GREEN MILLET (POACEAE) Andrew N Doust Katrien M Devosdagger1 Mike D Gadberry Mike D Galedagger amp Elizabeth A Kellogg University of Missouri-St Louis Department of Biology One University Boulevard St Louis MO 63121 USA daggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH UK Current address University of Georgia-Athens Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building Athens GA 30602 USA 1 Each of these contigs was scanned using FgeneSH (S ALAMOV and S OLOVYEV 2000) and open reading frames (ORFs) were translated and PLoS Biol 3(1) e13 January 2005

Sorghum Genome Sequencing by Methylation Filtration Joseph A Bedell1 Muhammad A Budiman2 Andrew Nunberg1 Robert W Citek1 Dan Robbins1 Joshua Jones2 Elizabeth Flick2 Theresa Rohlfing3 Jason Fries3 Kourtney Bradford3 Jennifer McMenamy3 Michael Smith4 Heather Holeman4 Bruce A Roe5 Graham Wiley5 Ian F Korf6 Pablo D Rabinowicz7 Nathan Lakey8 W Richard McCombie9 Jeffrey A Jeddeloh4 Robert A Martienssen9 1 Bioinformatics Orion Genomics Saint Louis Missouri United States of America 2 Library Construction Orion Genomics Saint Louis Missouri United States of America 3 Sequencing Orion Genomics Saint Louis Missouri United States of America 4 Biomarkers Orion Genomics Saint Louis Missouri United States of America 5 Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma United States of America 6 Genome Center University of California Davis California United States of America 7 The Institute for Genomic Research Rockville Maryland United States of America 8 Business Orion Genomics Saint Louis Missouri United States of America 9 Cold Spring Harbor Laboratory Cold Spring Harbor New York United States of America additional parameters wordmask=seg lcmask M=1 N=ndash1 Q=3 R=3 kap E=1e-10 hspmax=0 To look for potentially novel genes we used Fgenesh (httpwww BMC Genomics 2005 611 doi1011861471-2164-6-11 FAM20 an evolutionarily conserved family of secreted proteins expressed in hematopoietic cells Demet Nalbant1 Hyewon Youn1 3 S Isil Nalbant1 Savitha Sharma1 Everardo Cobos2 3 Elmus G Beale1 Yang Du1 and Simon C Williams1 3 1Department of Cell Biology and Biochemistry Texas Tech University Health Sciences Center

Lubbock Texas 79430 USA 2Department of Internal Medicine Texas Tech University Health Sciences Center Lubbock Texas 79430 USA 3Southwest Cancer Center at University Medical Center Lubbock Texas 79430 USA These results were compared against genes assembled by two gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmit Plant Physiology July 2005 Vol 138 pp 1205-1215 Complex Organization and Evolution of the Tomato Pericentromeric Region at the FER Gene Locus1[w] Romain Guyot Xudong Cheng Yan Su Zhukuan Cheng Edith Schlagenhauf Beat Keller and Hong-Qing Ling

2 2

Institute of Plant Biology University of Zurich 8008 Zurich Switzerland (RG ES BK H-QL) and Institute of Genetics and Developmental Biology Chinese Academy of Sciences Chaoyang District Beijing 100101 China (XC YS ZC H-QL) Putative genes were determined by a combination of coding region prediction software (GENSCAN FGENESH and MZEF with Arabidopsis andor monocot matrix J Gen Virol 86 (2005) 973-983 DOI 101099vir080833-0 Cloning characterization and analysis by RNA interference of various genes of the Chelonus inanitus polydnavirus Marianne Bonvin Dorothee Marti Stefan Wyder Dejan Kojic Marc Annaheim and Beatrice Lanzrein Institute of Cell Biology University of Berne Baltzerstrasse 4 CH-3012 Bern Switzerland Correspondence Beatrice Lanzrein beatricelanzreinizbunibech 12g1forw (5-GAGTCCATGCCGAATGTCAC-3) and 12g1rev (5-CTTCTTGCACAGCGACGAAC-3) were set to amplify the middle region of 12g1 as predicted with FGENESH 10 and The Plant Cell 17343-360 (2005) Evolution of DNA Sequence Nonhomologies among Maize Inbreds Stephan Brunner Kevin Fengler Michele Morgante Scott Tingey and Antoni Rafalski a1 a b a a

a DuPont Crop Genetics Research Wilmington Delaware 19880-353 b Universitaacute degli Studi di Udine Dipartimento di Scienze Agrarie ed Ambientali 33100 Udine Italy 1 To whom correspondence should be addressed E-mail stephanbrunnercgrdupontcom fax 302-695-2726 PNAS | February 1 2005 | vol 102 | no 5 | 1566-1571 Published online before print January 24 2005 101073pnas0409421102 A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome

Mark Yandell Adina M Bailey Sima Misra ShengQiang Shu Colin Wiel Martha Evans-Holm Susan E Celniker and Gerald M Rubin para para Howard Hughes Medical Institute and Department of Molecular and Cell Biology University of California Life Sciences Addition Berkeley CA 94720-3200 and paraDepartment of Genome Sciences Lawrence Berkeley National Laboratory One Cyclotron Road Mailstop 64-121 Berkeley CA 94720 genes based on a microarray-based approach that involved hybridizing randomly primed cDNA against probes corresponding to a large set of FGENESH predictions Insect Molecular Biology Volume 14 Issue 2 Page 113 - 119 April 2005 doi101111j1365-2583200400536x Detection and analysis of alternative splicing in the silkworm by aligning expressed sequence tags with the genomic sequence X-F Zha Q-Y Xia P Zhao J Li J Duan Z-L Wang J-F Qian and Z-H Xiang Correspondence Dr Qing-You Xia The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China Tel +86 23 68250748 fax +86 23 68251128 e-mail xiaqyswaucqcn previously predicted silkworm genes in the genomic sequences by BGF a newly developed program based on GENSCAN (Burge amp Karlin 1997) and Fgenesh (Salamov amp Microbiology 151 (2005) 2199-2207 DOI 101099mic027962-0 Overproduction purification and characterization of FtmPT1 a brevianamide F prenyltransferase from Aspergillus fumigatus Alexander Grundmann and Shu-Ming Li Pharmazeutische Biologie Pharmazeutisches Institut Eberhard-Karls-Universitaumlt Tuumlbingen Auf der Morgenstelle 8 72076 Tuumlbingen Germany Correspondence Shu-Ming Li shumingliuni-tuebingende FGENESH (Softberry Inc httpwwwsoftberrycomberryphtml) and the DNASIS software package (version 21 Hitachi Software Engineering) were used for

FGENESH++ Am J Hum Genet 76652-662 2005 Position Effects Due to Chromosome Breakpoints that Map 900 Kb Upstream and 13 Mb

Downstream of SOX9 in Two Patients with Campomelic Dysplasia

Gopalrao V N Velagaleti12 Gabriel A Bien-Willner3 Jill K Northup1 Lillian H Lockhart2

Judy C Hawkins2 Syed M Jalal6 Marjorie Withers3 James R Lupski345 and

Pawel Stankiewicz3 Departments of 1Pathology and 2Pediatrics University of Texas Medical Branch Galveston

Departments of 3Molecular and Human Genetics and 4Pediatrics Baylor College of Medicine and

5Texas Childrens Hospital Houston and 6Department of Laboratory Medicine and Pathology Mayo Clinic Rochester MN hellipIn an effort to identify possible transcripts that may be responsible for the CD phenotype we

used several gene-prediction programs and identified seven hypothetical transcripts in the region

that spans 100 kb in either direction from the breakpoint on chromosome 17 Ecgenes H17C123061 and H17C123081 SGP genes Chr17_15381 and Ch17_15391 Fgenesh++ gene C17001650 and Genscan genes NT_01064144 and NT_01064145hellip Genome Research 15566-576 2005 ECgene Genome-based EST clustering and gene modeling for alternative splicing Namshin Kim Seokmin Shin and Sanghyuk Lee12 2 13 1 Division of Molecular Life Sciences Ewha Womans University Seoul 120-750 Korea 2 School of Chemistry Seoul National University Seoul 151-747 Korea hellipthe structure of full-length mRNA can be inferred by examining the flanking genomic region especially with the aid of ab initio gene predicting programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh++ (Salamov and Solovyev 2000 )hellip Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA hellip we built a three-way synteny map based on chains of Fgenesh++-predicted (Solovyev 2002 ) exons rather than whole genes hellip Genome Research 14539-548 2004 Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of

Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA hellipThe gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom )hellip Nucleic Acids Research 2003 Vol 31 No 1 207-211 copy 2003 Oxford University Press The PEDANT genome database Dmitrij Frishman Martin Mokrejs Denis Kosykh Gabi Kastenmuumlller Grigory Kolesov Igor Zubrzycki Christian Gruber Birgitta Geier Andreas Kaps Kaj Albermann Andreas Volz Christian Wagner Matthias Fellenberg Klaus Heumann and Hans-Werner Mewes

1 1 1 1 1

1 2 2 2 2

2 2 2 2 13

1 Institute for Bioinformatics GSF - National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 85764 Neueherberg Germany 2 Biomax Informatics AG Lochhamer Straszlige 11 82152 Martinsried Germany 3 Department of Genome-oriented Bioinformatics Wissenschaftszentrum Weihenstephan Technische Universitaumlt Muumlnchen 85350 Freising Germany To whom correspondence should be addressed Tel +49 89 31874201 Fax +49 89 31873585 Email dfrishmangsfde The mouse database contains 20 chromosome contigs with 37 793 genes predicted using the Fgenesh++ software (wwwsoftberrycom) Reprint from Daily Biotech Updates www genengnewscom Vol 22 No 17 October 1 2002 DrugDiscovery Tech NoteAn Enhanced Human-Genome Database Transforming Raw Human Sequence Data Into Useful Information Christine Schuumlller PhD and Andreas Fritz PhD The Softberry analysis results for which Biomax has the exclusive world-wide commercial license contain approximately 40000 genes which agrees well with predictions of the total number of human genes (according to the International Human Genome Sequencing Consortium or IHGSC) hellip For example 50 of the genes in the Biomax Human Genome Database are not found in the Ensembl database These genes (identified by FGENESH++ and Biomax and not found in Ensembl database) comprise the following 6 of genes classified as known genes 50 classified as having some similarity to known genes and 90 of the genes not having similarity to known genes

For human genome applications the FGENESH++ software was first used to map known human genes using sequences available from the Reference Sequence (RefSeq) Project at the Nation al Center for Biotechnology Information (NCBI Bethesda MD wwwncbinlm nihgovLocusLinkrefseqhtml) REFERENCES 1 Salamov AA and Solovyev VVAb initio gene finding in Drosophila genomic DNA Genome Res 10 391ndash7 (2000) Genome Research 14539-548 2004 ISSN 1088-9051 $500 Letter Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA The gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom ) Published online before print June 12 2003 101101gr529803 Genome Research 131765-1774 2003 ISSN 1088-9051 $500 Identification of Promoter Regions in the Human Genome by Using a Retroviral Plasmid Library-Based Functional Reporter Gene Assay Shirin Khambata-Ford15 Yueyi Liu2 Christopher Gleason1 Mark Dickson3 Russ B Altman2 Serafim Batzoglou4 and Richard M Myers136 1 Department of Genetics Stanford University School of Medicine Stanford California 94305 USA 2 Stanford Medical Informatics Stanford University School of Medicine Stanford California 94305 USA 3 Stanford Human Genome Center Stanford University School of Medicine Stanford California 94305 USA 4 Department of Computer Science Stanford University Stanford California 94305 USA hellipOf 858 sequences 9 of GFP+ low clones and 8 of GFP+ high clones aligned to the 2-kb

segment upstream of the transcription start site of a predicted gene in at least two of four data sets of predicted genes from Genscan Ensembl Softberry (Fgenesh++) and Acembly (category

B in Table 1) Cell Vol 110 521ndash529 August 23 2002 Copyright 2002 by Cell Press HIV-1 Integration in the Human Genome Favors Active Genes and Local Hotspots

Astrid RW Schrouml der1 Paul Shinn2 Huaming Chen2 Charles Berry3 Joseph R Ecker2 and Frederic Bushman14 1Infectious Disease Laboratory 2Genomic Analysis Laboratory The Salk Institute 10010 North Torrey Pines Road La Jolla California 92037 3Department of FamilyPreventive Medicine School of Medicine University of California San Diego San Diego California 92093 hellipAn integration target sequence was scored as a part of a transcrip-tion unit if it was (1) a member of the Refseq set of well-studied genes (httpwwwncbinlmnihgovLocusLinkrefseqhtml) or (2) if it was predicted to be a transcription unit by the ENSEMBLE (httpwwwensemblorg) or Fgenesh++ (httpwwwsoftberrycomHelpfgeneshplus2htm) programs and if that assignment was supported by mRNA or spliced EST sequence evidence Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation

FGENESH+ Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA Fgenesh+ gene prediction is conducted on sequences with protein homology Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation Annual Review of Genomics and Human Genetics Vol 3 293-310 (Volume publication date September 2002) (doi101146annurevgenom3030502101529) DATABASES AND TOOLS FOR BROWSING GENOMES Ewan Birney 1 Michele Clamp and 2 Tim Hubbard2 1European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom e-mail birneyebiacuk 2Wellcome Trust Sanger Institute Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom michelesangeracuk thsangeracuk

Another predicted gene track on the UCSC browser comes from Softberry ( http wwwsoftberrycom ) and uses a program Fgenesh+ which is based on HMMs and

FGENESB Appl Environ Microbiol 2004 April 70(4) 2332ndash2341 Oxygen-Controlled Bacterial Growth in the Sponge Suberites domuncula toward a Molecular Understanding of the Symbiotic Relationships between Sponge and Bacteriadagger

Werner E G Muumlller Vladislav A Grebenjuk Narsinh L Thakur Archana N Thakur Renato Batel Anatoli Krasko Isabel M Muumlller and Hans J Breter Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz D-55099 Mainz Germany Corresponding author Mailing address Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz Duesbergweg 6 55099 Mainz Germany Phone 6131-3925910 Fax 6131-3925243 E-mail wmuellermailuni-mainzde For genes and potential promoter prediction we used the FGENESB-PatternMarkov chain-based bacterial operon and gene prediction program from the SoftBerry Journal of Theoretical Biology 230 (2004) 133ndash144 Computational prediction of conserved operons and phylogenetic footprinting of transcription regulatory elements in the metal-reducing bacterial family Geobacteraceae Bin Yana Barbara A Metheґ b Derek R Lovleyc Julia Krushkala aDepartment of Preventive Medicine Center of Genomics and Bioinformatics University of Tennesee Health Science Center 66 N Pauline St Ste 633 Memphis TN 38163 USA bThe Institute for Genomic Research Rockville MD USA cDepartment of Microbiology Morrill Science Center IV North University of Massachusetts 639 North Pleasant Str Amherst MA 01003 USA the conserved nature of the operons 2 Operons in Geobacter sulfurreducens were predicted ab initio by the public version of program FGENESB (V Solovyev and V Molecular Microbiology Volume 52 Issue 6 Page 1579 -1596 June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Copyright copy 2003 The National Academy of Sciences Proc Natl Acad Sci U S A 2003 October 28 100(22) 12830ndash12835 doi 101073pnas2133554100 Published online 2003 October 17 Evolution Proteorhodopsin genes are distributed among divergent marine bacterial taxa Joseacute R de la TorredaggerDagger Lynne M Christiansondagger Oded Beacutejagravedaggersect Marcelino T Suzukidaggerpara David M Karl John Heidelberg and Edward F DeLongdaggerdaggerdagger

daggerMonterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 sectDepartment of Biology Technion-Israel Institute of Technology Haifa 32000 Israel

paraChesapeake Biological Laboratory University of Maryland Solomons MD 20688 Department of Oceanography University of Hawaii Manoa HI 96822 and Institute for Genomic Research Rockville MD 20850 Edited by Sallie W Chisholm Massachusetts Institute of Technology Cambridge MA and approved August 21 2003 (received for review 2003 June 10) DaggerPresent address Department of Civil and Environmental Engineering University of Washington Seattle WA 98195 daggerdagger To whom correspondence should be addressed E-mail delongmbariorg hellip Analysis of the potential genes and protein-coding regions was performed by using a combination of the BLAST (11) GLIMMER 202 (TIGR) (12 13) FGENESB (Softberry Mount Kisco NY) and ARTEMIS (Sanger Center Cambridge University UK) (14) software packages Environmental Microbiology September 2004 vol 6 no 9 pp 903-910(8) DOI 101111j1462-2920200400676x Different SAR86 subgroups harbour divergent proteorhodopsins Gazalah Sabehi1 Oded Beacutejagrave1 Marcelino T Suzuki2 Christina M Preston3 Edward F DeLong4

Affiliations 1 Department of Biology Technion-Israel Institute of Technology Haifa 32000 Israel 2 Chesapeake Biological Laboratory University of Maryland Center for Environmental Sciences Solomons MD 20688 USA 3 Monterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 USA 4 Massachusetts Institute of Technology Cambridge MA 02139 USA

program FGENESB (Softberry) and the annotation was subsequently refined and curated manually using ARTEMIS (Sanger Center) Fig

FGENESV Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song Qi Wei Qin Jin Qiu Can Hua Huang Fan Wang and Choy Leong Hew1 2 1 1 1 1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2

A total of 162 ORFs predicted by the FGENESV program (available through httpwwwsoftberrycom) supplemented with Vector NTI suite 71 are indicated doi101023BVIRU000002577148128f8 Virus Genes 28 (3) 239-246 April 2004 Article ID 5269250 Complete Nucleotide Sequence of a Strawberry Isolate of Beet Pseudoyellows Virus Ioannis E Tzanetakis Molecular and Cellular Biology Program Department of Botany and Plant Pathology Oregon State University Corvallis 97331 USA Robert R Martin Horticultural Crops Research Laboratory USDA-ARS Corvallis OR 97330 USA E-mail martinrrscienceoregonstateedu httpwwwncbinlmnih govgorfgorfhtml) and the gene finder in viruses at httpwwwsoftberrycom The amino acid comparisons Geno Prot amp Bioinfo Vol 1 No 3 August 2003 226-235 Genome Organization of the SARS-CoV Jing Xu1 Jianfei Hu21 Jing Wang21 Yujun Han1 Yongwu Hu13 Jie Wen1 Yan Li1 Jia Ji1 Jia Ye14 Zizhang Zhang5 Wei Wei4 Songgang Li12 Jun Wang1 Jian Wang14 Jun Yu14 and Huanming Yang14 1 Beijing Genomics Institute Chinese Academy of Sciences Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3Wenzhou Medical College Wenzhou 325003 China 4 James D Watson Institute of Genome Sciences Zhijiang Campus Zhejiang University and Hangzhou Genomics Institute Hangzhou 310008 China 5 College of Materials Science and Chemical Engineering Yuquan Campus Zhejiang University Hangzhou 310027 China These authors contributed equally to this work Corresponding authors E-mail junyugenomicsorgcn yanghmgenomicsorgcn hellipFGENESV a program for gene prediction provided by Softberry Inc (Mount Kisco USA) through a web-based interface has been specially modimacred and trained with parameters for virus (httpwwwsoftberrycomberryphtmltopic= gfindv) hellip

The hypothetical minus sense ORF iden-timacred by FGENESV (from 48 to 203 nt on the minus strand or 29523 to 29678 nt on the plus strand) may be fake but we should not absolutely deny the prob-ability of the existence of minus ORFs hellipFurthermore we employed FGENESV to explore the sequences of MHV (NC 001846 in NCBI) and AIBV (NC 001451 in NCBI) and compared the re-sults with their previous annotations respectively Rapport de stage de DEA Juin 2003 Analyse du geacutenome du virus de lrsquoarcheacutee Pyrococcus abyssi (PAV1) ROUAULT Karen Laboratoire de Microbiologie et Biotechnologie des Extrecircmophiles IFREMER- Centre de Brest et Equipe Microbiologie LEMAR ndash Institut Universitaire Europeacuteen de la Mer [14] FGENESV httpwwwsoftberrycomberry phtmltopic=gfindv Virus ( gt10 kb) Modegraveles de Markov Forme du geacutenome Code geacuteneacutetique [40]

FGENES-M BMC Bioinformatics 2005 6 25 Published online 2005 February 10 doi 1011861471-2105-6-25 Integrating alternative splicing detection into gene prediction Sylvain Foissac 1 and Thomas Schiex11Uniteacute de Biomeacutetrie et Intelligence Artificielle INRA 31326 Castanet Tolosan France

Corresponding author Sylvain Foissac foissactoulouseinrafr Thomas Schiex tschiextoulouseinrafrReceived July 27 2004 Accepted February 10 2005 This approach has been applied eg in HMMgene or in FGENES-M (unpub)hellip DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39 - 43 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom) DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39-43 DOI 1010801042517032000160189 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom)

BESTORF Eukaryotic Cell March 2005 p 526-535 Vol 4 No 3 1535-977805$0800+0 doi101128EC43526-5352005 Sex-Specific Homeodomain Proteins Sxi1 and Sxi2a Coordinately Regulate Sexual Development in Cryptococcus neoformans Christina M Hull1 Marie-Josee Boily and Joseph Heitman1 12 Department of Molecular Genetics and Microbiology1 the Howard Hughes Medical Institute Duke University Medical Center Durham North Carolina2 Received 14 January 2005 Accepted 14 January 2005 Corresponding author Mailing address 322 CARL Building Box 3546 Department of Molecular Genetics and Microbiology 100 Research Dr Duke University Medical Center Durham NC 27710 Phone (919) 684-2824 Fax (919) 684-5458 E-mail heitm001dukeeduPresent address Departments of Biomolecular Chemistry and Medical Microbiology amp

Immunology University of WisconsinmdashMadison Madison WI 53706 Sequence manipulations Splice predictions of candidate gene sequences for SXI2a were facilitated with a Softberry algorithm (wwwsoftberrycom) hellipWe utilized the BESTORF gene prediction algorithm from Softberry Inc to electronically produce predicted spliced cDNA products encoded by a 10-kb regionhellip

PROTCOMP Genes and Immunity 2005 v5 n4 - naturecom Immune response in silico(IRIS) immune-specific genes identified from a compendium of microarray hellip AR Abbas D Baldwin Y Ma W Ouyang A Gurney F hellip The Protcomp algorithm (Softberry Inc) predicts for the 1589 IRIS genes with ORFs that 24 of the encoded proteins are in the plasma membrane 13 are MPMI Vol 17 No 7 2004 pp 789ndash797 Publication no M-2004-0426-01R copy 2004 The American Phytopathological Society Lotus japonicus LjKUP Is Induced Late During Nodule Development and Encodes a Potassium Transporter of the Plasma Membrane Guilhem Desbrosses Claudia Kopka Thomas Ott and Michael K Udvardi Max Planck Institute of Molecular Plant Physiology Am Muumlhlenberg 1 14476 Golm Germany Submitted 3 November 2003 Accepted 13 February 2004 hellipBoth PSORT and Protcomp predicted a PM location for LjKUPhellip Planta DOI 101007s00425-003-1182-5 Issue Volume 218 Number 6 Date April 2004 Pages 965 - 975 Biochemical and immunological characterization of pea nuclear intermediate filament proteins Sonal S D Blumenthal1 Gregory B Clark1 and Stanley J Roux1

(1) School of Biological Sciences Section of Molecular Cell and Developmental Biology The University of Texas Austin TX 78712 USA Stanley J Roux Email srouxutsccutexasedu html) BCM Search Launcher (Protein structure prediction http searchlauncher bcmtmcedu) SoftBerry (Protein subcellular localization Comparative and Functional Genomics Volume 5 Issue 4 Pages 342 - 353Published Online 20 May 2004 Research Paper Investigation into the use of C- and N-terminal GFP fusion proteins for subcellular localization studies using reverse transfection microarrays Ella Palmer Tom Freeman

MRC Rosalind Franklin Centre for Genomics Research (formerly the HGMP-Resource Centre) Genome Campus Hinxton Cambridge CB10 1SB UK email Tom Freeman (tfreemanrfcgrmrcacuk) Correspondence to Tom Freeman RFCGR Hinxton CambridgeCB10 1SB UK

ProtComp version 4 (Softberry) combines results with proteins of known subcellular localization and assumed subcellular localization (based on theoret- ical Plant Physiol2004 134 286-295 RHM2 Is Involved in Mucilage Pectin Synthesis and Is Required for the Development Usadel et al Tentative subcellular localization prediction by TargetP (Emanuelsson et al 2000 ) or ProtComp (httpwwwsoftberrycom) a prediction software trained on Journal of Cellular BiochemistryVolume 90 Issue 2 Pages 361 - 378Published Online 3 Sep 2003 A proteomic study of the arabidopsis nuclear matrix Tomasz T Calikowski 1 3 Tea Meulia 2 Iris Meier 1

1Department of Plant Biology and Plant Biotechnology Center Ohio State University Columbus Ohio 43210 2Molecular and Cellular Imaging Center Ohio Agricultural and Research Development Center Ohio State University Columbus Ohio 43210 3Institute of Biochemistry and Biophysics Polish Academy of Sciences UL Pawinskiego 5A 02-106 Warszawa Poland email Iris Meier (meier56osuedu)Correspondence to Iris Meier Department of Plant Biology and Plant Biotechnology Center Ohio State University 244 Rightmire Hall 1060 Carmack Rd Columbus OH 43210 For prediction of subcellular localization ProtComp 4 (Softberry Inc Mount Kisco NY httpwwwsoftberrycomberryphtmltopicfrac14 proteinloc) PSORT v64 Cellular Molecular Life Sciences 2003 in press Automatic prediction of protein function Burkhard Rost 1 2 3 Jinfeng Liu 1 3 4 Rajesh Nair 1 5 Kazimierz O Wrzeszczynski 1 and Yanay Ofran 16

1 CUBIC Dept of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 2 Columbia University Center for Computational Biology and Bioinformatics (C2B2) Russ Berrie Pavilion 1150 St Nicholas Avenue New York NY 10032 USA

3 North East Structural Genomics Consortium (NESG) Department of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 4 Dept of Pharmacology Columbia Univ 630 West 168th Street New York NY 10032 USA 5 Dept of Physics Columbia Univ 538 West 120th Street New York NY 10027 USA 6 Dept of Medical Informatics Columbia Univ 630 West 168th Street New York NY 10032 USA Corresponding author cubiccubicbioccolumbiaedu URL httpcubicbioccolumbiaedu Tel +1-212-305-4018 fax +1-212-305-7932

genomelocalize ProtComp predict localization for plants httpwwwsoftberrycomberryphtmltopic=proteinloc Predotar predict Published online before print September 15 2003 101101gr1293003 Genome Research 132265-2270 2003 The Secreted Protein Discovery Initiative (SPDI) a Large-Scale Effort to Identify Novel Human Secreted and Transmembrane Proteins A Bioinformatics Assessment Hilary F Clark1 Austin L Gurney Evangeline Abaya Kevin Baker Daryl Baldwin Jennifer Brush Jian Chen Bernard Chow Clarissa Chui Craig Crowley Bridget Currell Bethanne Deuel Patrick Dowd Dan Eaton Jessica Foster Christopher Grimaldi Qimin Gu Philip E Hass Sherry Heldens Arthur Huang Hok Seon Kim Laura Klimowski Yisheng Jin Stephanie Johnson James Lee Lhney Lewis Dongzhou Liao Melanie Mark Edward Robbie Celina Sanchez Jill Schoenfeld Somasekar Seshagiri Laura Simmons Jennifer Singh Victoria Smith Jeremy Stinson Alicia Vagts Richard Vandlen Colin Watanabe David Wieand Kathryn Woods Ming-Hong Xie Daniel Yansura Sothy Yi Guoying Yu Jean Yuan Min Zhang Zemin Zhang Audrey Goddard William I Wood and Paul Godowski Departments of Bioinformatics Molecular Biology and Protein Chemistry Genentech Inc South San Francisco California 94080 USA 1 Corresponding author E-MAIL hclarkgenecomFAX (650) 225-5389 An automated computational strategy was utilized to query each protein translation with the Signal Sensor Sighmm Tmdetect (T Wu unpubl) hmmpfam (Eddy 1998 ) and Protcomp (Softberry Inc) algorithms hellipThe Protcomp algorithm predicts the subcellular localization of a protein on the basis of homology to well-annotated proteins a neural net and various protein motifs In this case the Protcomp subcellular localization prediction was used to categorize these genes as Other Secreted Other Transmembrane or Other Cytoplasmic or Nuclear Plant Physiol February 2002 Vol 128 pp 336-340 Gene prediction in eukaryota Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 genomelocalize ProtComp predict localization for plants httpwww softberrycomberryphtmltopic=proteinloc Predotar predict Proc Natl Acad Sci U S A 2001 April 24 98(9) 5341ndash5346 doi 101073pnas101534498 Published online 2001 April 17 Plant Biology The Cia5 gene controls formation of the carbon concentrating mechanism in Chlamydomonas reinhardtii Youbin Xiang Jun Zhang and Donald P Weeks

Department of Biochemistry and School of Biological Sciences University of Nebraska Lincoln NE 68588-0664

Edited by Bob B Buchanan University of California Berkeley CA and approved March 14 2001 (received for review 2000 November 8) To whom reprint requests should be addressed E-mail dweeks1unledu hellipComputer-assisted analysis of the CIA5 aa sequence (PROTCOMP version 4 httpwwwsoftberrycom) predicted a nuclear localization of the protein hellipFinally computer program predictions (eg PROTCOMP version 4 httpwwwsoftberrycom) for a nuclear localization of CIA5 and the clear-cut nuclear localization of CIA5 in onion epidermal cells (Fig 3) provide additional weight to the argument that CIA5 may be a transcription factor Dissertation zur Erlangung des akademischen Grades Dr rer nat der Fakultaumlt der Naturwissenschaften der Universitaumlt Ulm Untersuchungen zur Identifizierung von Faktoren und Mechanismen der mRNA 3 Prozessierung und Degradation in Chloroplasten houmlherer Pflanzen vorgelegt von Michael Walter aus Immenstadt i Allgaumlu Abteilung Molekulare Botanik Universitaumlt Ulm Ulm November 2001 Tag der Promotion 19 Feb 2002 Algorithmen zur Vorhersage der subzellulaumlren Lokalisation - PSORT httppsortnibbacjp8800formhtml (Nakai und Kanehisa 1992) - ChloroP httpwwwcbsdtudkservicesChloroP (Emanuelsson et al 2000) - TargetP httpwwwcbsdtudkservicesTargetP (Emanuelsson et al 2000) - Predotar httpwwwinrafrInternetProduitsPredotar - Softberry httpwwwsoftberrycom

BPROM Extremophiles Issue Volume 9 Number 2 Date April 2005 Pages 99 ndash 109 DOI 101007s00792-004-0425-0

The genome of BCJA1c a bacteriophage active against the alkaliphilic bacterium Bacillus clarkii

Andrew M Kropinski1 Melissa Hayward1 M Dorothy Agnew1 and Ken F Jarrell1

(1) Department of Microbiology and Immunology Queens University Kingston ON K7L 3N6 Canada

al 2002) Promoters were predicted using Softberryrsquos BPROM program at httpwwwsoftberry comberry phtmltopic=promoter

Journal of Bacteriology February 2005 p 1091-1104 Vol 187 No 3 0021-919305$0800+0 doi101128JB18731091-11042005

The Generalized Transducing Salmonella Bacteriophage ES18 Complete Genome Sequence and DNA Packaging Strategy

Sherwood R Casjens12 Eddie B Gilcrease1 Danella A Winn-Stapley1 Petra Schicklmaier3 Horst Schmieger3 Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24 Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah1 Department of Biological Sciences4 Pittsburgh Bacteriophage Institute University of Pittsburgh Pittsburgh Pennsylvania 2 Institut fuumlr Genetik und Mikrobiologie Universitaumlt Muumlnchen Munich Germany3 Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu Present address Biology Department MIT Cambridge MA 02139 Present address Biogen Idec GmbH D-85737 Ismaning Germany

Received 1 September 2004 Accepted 3 November 2004

The DNA sequence analysis software used was DNA Strider (24) GeneMark (5) Staden programs (78) BLAST (2) BPROM (httpwwwsoftberrycomberryphtmltopic

Infection and Immunity May 2005 p 2899-2909 Vol 73 No 5 Characterization of the Major Secreted Zinc Metalloprotease- Dependent GlycerophospholipidCholesterol Acyltransferase PlaC of Legionella pneumophila Sangeeta Banerji1 Mayte Bewersdorff1 Bjoumlrn Hermes1 Nicholas P Cianciotto2 and Antje Flieger1 Robert Koch-Institut Berlin Germany1 Department of Microbiology-Immunology Northwestern University Medical School Chicago Illinois2 Received 25 October 2004 Returned for modification 18 November 2004 Accepted 22 December 2004

Corresponding author Mailing address Robert Koch-Institut Research Group NG5 Pathogenesis of Legionella Infections Nordufer 20 D-13353 Berlin Germany Phone 49-30-4547-2522 Fax 49-30-4547-2328 E-mail fliegerarkide MB and BH contributed equally to this work

legion) (12) Nucleotide sequences were also analyzed for promoters using the web-based program BPROM (wwwsoftberrycom) Sequence Journal of Bacteriology April 2005 p 2458-2468 Vol 187 No 7 The Type III-Dependent Hrp Pilus Is Required for Productive Interaction of Xanthomonas campestris pv vesicatoria with Pepper Host Plants Ernst Weber1 Tuula Ojanen-Reuhs2 Elisabeth Huguet3 Gerd Hause4 Martin Romantschuk2 Timo K Korhonen2 Ulla Bonas13 and Ralf Koebnik1 Institute of Genetics1 Biozentrum Martin Luther University Halle Germany4 General Microbiology Faculty of Biosciences University of Helsinki Helsinki Finland2 Institut des Sciences Veacutegeacutetales CNRS Gif-sur-Yvette France3 Received 10 November 2004 Accepted 28 December 2004 Corresponding author Mailing address Martin-Luther-Universitaumlt Institut fuumlr Genetik Weinbergweg 10 D-06120 Halle (Saale) Germany Phone 49 345 5526293 Fax 49 345 5527151 E-mail koebnikgmxde Present address Purdue University Department of Food Sciences West Lafayette IN 47907 Present address Institut de Recherche sur la Biologie de lInsecte UMR CNRS 6035 Faculteacute

des Sciences F-37200 Tours France Present address University of Helsinki Department of Ecological and Environmental Sciences

FIN-15140 Lahti Finland

The promoter recognition program BPROM (Softberry Inc Mt Kisco NY) was used for prediction of bacterial sigma70 promoter motifs RESULTS

J Bacteriol 2004 September 186(17) 5945ndash5949 doi 101128JB186175945-59492004 Identification of Operators and Promoters That Control SXT Conjugative Transfer John W Beaber and Matthew K Waldor

Department of Microbiology Tufts University School of Medicine and Howard Hughes Medical Institute Boston Massachusetts Corresponding author Mailing address Tufts University School of Medicine 136 Harrison Ave Jaharis 425 Boston MA 02111 Phone (617) 636-2730 Fax (617) 636-2723 E-mail matthewwaldortuftsedu Received April 1 2004 Accepted May 24 2004 hellipComputer algorithms and 5prime random amplification of cDNA ends (RACE) were used to define the setR and s086 transcription start sites Software for the identification of bacterial promoters (httpwwwsoftberrycomberryphtmltopic=bpromampgroup=programsampsubgroup=gfindb) identified putative minus10 and minus35 elements for both PL and PR (Fig 2) (23 24)hellip JOURNAL OF BACTERIOLOGY Mar 2004 p 1818ndash1832 Vol 186 No 6

The pKO2 Linear Plasmid Prophage of Klebsiella oxytoca Sherwood R Casjens12 Eddie B Gilcrease1 Wai Mun Huang1 Kim L Bunny3

Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24

Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah 841321 Pittsburgh Bacteriophage Institute2 and Department of Biological Sciences4 University of Pittsburgh Pittsburgh Pennsylvania 15260 and Section of Microbiology University of California at Davis Davis California 956163

Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu hellipThe DNA sequence analysis software packages used were DNA Strider (27) GeneMark (8) the Staden programs (94) BLAST (3) BPROM httpwwwsoftberrycomberryphtmltopic_gfindb) and DNA Master (J Lawrence [httpcobamide2biopittedu])hellip

BMC Microbiology 2004 44 Analysis of the lambdoid prophage element e14 in the E coli K-12 genome Preeti Mehta1 Sherwood Casjens2 and Sankaran Krishnaswamy1 Address 1Bioinformatics Centre School of Biotechnology Madurai Kamaraj University Madurai-625021 India and 2University of Utah Medical School Department of Pathology 90 North 1900 East Salt Lake City UT 84132-2501 USA Email Preeti Mehta - mehta_p74yahoocom Sherwood Casjens - sherwoodcasjenspathutahedu Sankaran Krishnaswamy - krishnamrnatnnicin Corresponding author This article is available from httpwwwbiomedcentralcom1471-218044 Putative promoters predicted using BPROM available at the website http wwwsoftberrycom Scores are as given by BPROM Promoters Plant Molecular Biology 53 (6) 865-876 December 2003 Prokaryotic orthologues of mitochondrial alternative oxidase and plastid terminal oxidase Allison E McDonald Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Sasan Amirsadeghi Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Greg C Vanlerberghe Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada (e-mail gregvutscutorontoca The A variabilis PTOX sequence was analyzed in the upstream region of the start codon with Softberryrsquos BPROM software (httpwwwsoftberrycom)

SPLICEDB Plant Molecular Biology DOI 101007s11103-005-0271-1 Issue Volume 57 Number 3 Date February 2005 Pages 445 - 460 Evaluation of five ab initio gene prediction programs for the discovery of maize genes Hong Yao1 4 Ling Guo1 6 Yan Fu1 4 Lisa A Borsuk1 6 Tsui-Jung Wen2 David S Skibbe1 5 Xiangqin Cui1 4 9 Brian E Scheffler8 Jun Cao1 4 Scott J Emrich6 Daniel A Ashlock3 6 and Patrick S Schnable1 2 4 5 6 7

(1)Department of Genetics Development and Cell Biology Iowa State University Ames Iowa 50011-3650 (2) Department of Agronomy Iowa State University Ames Iowa 50011-3650 (3) Department of Mathematics Iowa State University Ames Iowa 50011-3650 (4) Inderdepartmental Graduate Programs in Genetics Iowa State University Ames Iowa 50011-3650 (5) Department of Molecular Cellular and Developmental Biology Iowa State University Ames Iowa 50011-3650 (6) Department of Electrical and Computer Engineering and Department of Bioinformatics and Computational Biology Iowa State University Ames Iowa 50011-3650 (7) Center for Plant Genomics Iowa State University Ames Iowa 50011-3650 (8) Mid South Area Genomics Facility USDA-ARS Stoneville MS 38776-0038 USA(9) Present address Department of Biostatistics Birmingham AL 35294 USA model FGENESH httpwwwsoftberrycom berryphtmltopic=fgeneshampgroup= programsampsubgroup=gfind Monocots Yes Yes Yes GHMM a GeneMark Finding short DNA motifs using permuted markov models X Zhao H Huang TP Speed The data are human donor sequences from SpliceDB [9] a recently developed database of known mammalian splice site sequences (httpwwwsoftberrycomspldb Current Opinion in Structural Biology 2004 14273ndash282 The evolving roles of alternative splicing Liana F Lareau1 Richard E Green1 Rajiv S Bhatnagar23 and Steven E Brenner12_ Departments of 1Molecular and Cell Biology and 2Plant and Microbial Biology University of California Berkeley California 94720 USA 3Department of Dermatology University of California San Francisco California 94143 USA _e-mail brennercompbioberkeleyedu [79] SpliceDB httpwwwsoftberrycomberryphtmltopicfrac14splicedb Database and composition statistics for mammalian splice sites inferred from ESTs [80] Yearbook of Medical Informatics Review Paper 2004 121-136 Curated databases and their role in clinical bioinformatics CC Englbrecht M Han MT Mader A Osanger KFX Mayer MIPS Institute for Bioinformatics Address of the authors Claudia C Englbrecht Michael Han

Michael T Mader Andreas Osanger Klaus F X Mayer MIPS Institute for Bioinformatics GSF - National Research Center for Environment and Health 85758 Neuherberg Germany E-mail kmayergsfdeCorresponding author hellipSpliceDB httpwwwsoftberrycomspldbSpliceDBhtmlCanonical and non-canonical mammalian splice sites [122] 122Burset M Seledtsov IA Solovyev VV SpliceDB database of canonical and non-canonical mammalian splice sites Nucleic Acids Res 200129255-9 Nucleic Acids Research 2001 Vol 29 No 1 255-259 SpliceDB database of canonical and non-canonical mammalian splice sites M Burset I A Seledtsov1 and V V Solovyev The Sanger Centre Hinxton Cambridge CB10 1SA UK and 1Softberry Inc 108 Corporate Park Drive Suite 120 White Plains NY 10604 USA To whom correspondence should be addressed at present address EOS Biotechnology 225A Gateway Boulevard South San Francisco CA 94080 USA Tel +1 650 246 2331 Fax +1 650 583 3881 Email solovyeveosbiotechcom Present address M Burset Institut Municipal drsquoInvestigacioacute Megravedica (IMIM) CDr Aiguader 80 08003 Barcelona Spain

SCAN2 African Journal of Biotechnology Vol 2 (12) pp 714-718 December 2003 Available online at httpwwwacademicjournalsorgAJB ISSN 1684ndash5315 copy 2003 Academic Journal Accepted 14 November 2003 Minireview Web-based bioinformatic resources for protein and nucleic acids sequence alignment Kamel A Abd-Elsalam Molecular Markers Lab Plant Pathology Research Institute Agricultural Research Center Orman 12619 Giza Egypt E-mail kaabdelsalammsncom 16-SCAN2 program for aligning two multimegabyte-size sequences httpwwwsoftberrycomberryphtmltopic=scanhampprg= SCAN2 derived Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK The full-length sequences of gene pairs have been aligned by the SCAN2 program (httpsoftberrycomberryphtmltopic=scanhampprg=SCAN2) which can align

PDISORDER BMC Bioinformatics 2005 622 doi1011861471-2105-6-22 Research article Open Access Proteins with two SUMO-like domains in chromatin-associated complexes The RENi (Rad60-Esc2-NIP45) family Maria Novatchkova1 Andreas Bachmair3 Birgit Eisenhaber2 and Frank Eisenhaber2 Address 1Gregor Mendel-Institut GMI Austrian Academy of Sciences Vienna Biocenter A-1030 Vienna Austria 2Research Institute of Molecular Pathology Dr Bohr-Gasse 7 A-1030 Vienna Austria and 3Max Planck Institute for Plant Breeding Research Carl-von-Linneacute-Weg 10 D-50829 Cologne Germany Email Maria Novatchkova - marianovatchkovagmioeawacat Andreas Bachmair - bachmairmpiz-koelnmpgde Birgit Eisenhaber - b_eisenimpunivieacat Frank Eisenhaber - FrankEisenhaberimpunivieacat Corresponding author hellipInitial analysis of its sequence complexity shows that the disordered N-terminal half of the protein is followed by a likely globular segment (predicted using Pdisorder by Softberry Inc)hellip

SPL The National Academy of Sciences Proc Natl Acad Sci U S A 2003 November 25 100(Suppl 2) 14537ndash14542 doi 101073pnas2335847100 Molecular evolution of the insect chemoreceptor gene superfamily in Drosophila melanogaster Hugh M Robertsondagger Coral G WarrDaggersect and John R Carlsonsect Department of Entomology University of Illinois 505 South Goodwin Avenue Urbana IL 61801 DaggerSchool of Biological Sciences Monash University Clayton VIC 3800 Australia and sectDepartment of Molecular Cellular and Developmental Biology Yale University New Haven CT 06520 The genes were reconstructed manually in the PAUP editor (23) by using the expected exonintron structures as guides and the SPL program (Softberry wwwsoftberrycomberryphtml) to locate predicted introns

NSITE GENETIC ANALYSES OF BOVINE CARD15 A PUTATIVE DISEASE RESISTANCE GENE A Dissertation by KRISTEN HAWKINS TAYLOR Submitted to Texas AampM University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 2004 Major Subject Genetics hellipShort motifs identified as being conserved between the three species in these intronic regions as well as in the 5rsquoUTR and 3rsquoUTR were then analyzed using the TFSCAN (httpzeonwelloxacukgit-bintfscan) and NSITE (available through SoftBerry httpwwwsoftberrycomberryphtmltopic=promoter) programs to identify putative regulatory motifs Motifs selected for analysis required homology consisting of 6 or more bases with no more than 2 substitutions among the 3 specieshellip hellipSequence that included the SNPs located within intronic regions and in the 3rsquo and 5rsquoUTRs were analyzed using NSITE (available through SoftBerry at httpwwwsoftberrycomberryphtmltopic=promoter) to identify putative regulatory motifshellip

TSSP Bioinformatics 2005 21(14)3074-3081 doi101093bioinformaticsbti490 Cis-regulatory element based targeted gene finding genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana Weixiong Zhang 12 Jianhua Ruan 1 Tuan-hua David Ho 3 Youngsook You 3 Taotao Yu 1 and Ralph S Quatrano 3 1Department of Computer Science and Engineering Washington University in Saint Louis Saint Louis MO 63130 USA 2Department of Genetics Washington University in Saint Louis Saint Louis MO 63130 USA 3Department of Biology Washington University in Saint Louis Saint Louis MO 63130 USA To whom correspondence should be addressed

sites (TSSs) To predict TSSs we combined an Athaliana cDNA database and a software TSSP (SoftBerry httpwwwsoftberrycom) As

PLANTPROM BMC Genomics 2005 6 25 Genome wide analysis of Arabidopsis core promoters Carlos Molina12 and Erich Grotewold 11Department of Plant Cellular and Molecular Biology and Plant Biotechnology Center The Ohio State University Columbus OH 43210 2Departamento de Informaacutetica Universidad Teacutecnica Federico Santa Mariacutea Valparaiacuteso Chile search for TATA elements is carried out on the 12749 [-500 -1] regions 6316 sequences (using the MEME NFM) or 8776 (using the expanded PlantProm NFM) are Nucleic Acids Research 2005 Vol 33 No 3 1069ndash1076 doi101093nargki247 Plant promoter prediction with confidence estimation I A Shahmuradov1 V V Solovyev12 and A J Gammerman1 1Royal Holloway University of London Egham Surrey TW20 0EX UK and 2Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA MATERIALS AND METHODS Training and testing sequences For training and testing procedures we used 301 promoters with annotated TSS from PlantProm DB (22) Nucleic Acids Research 2004 Vol 32 Database issue D368plusmnD372 DOI 101093nargkh017 AthaMap an online resource for in silico transcription factor binding sites in the Arabidopsis thaliana genome Nils Ole Steffens Claudia Galuschka Martin Schindler Lorenz BuEgravelow and Reinhard Hehl Institut fuEgrave r Genetik Technische UniversitaEgrave t Braunschweig Spielmannstraucirce 7 D-38106 Braunschweig Germany hellipShahmuradovIA GammermanAJ HancockJM BramleyPM and SolovyevVV (2003) PlantProm a database of plant promoter sequences Plant Physiology October 2004 Vol 136 pp 3023-3033 GENOME ANALYSIS Utility of Different Gene Enrichment Approaches Toward Identifying and Sequencing the Maize Gene Space1[w] Nathan Michael Springer Xiequn Xu and W Brad Barbazuk Center for Plant and Microbial Genomics Department of Plant Biology University of Minnesota St Paul Minnesota 55108 (NMS) and Donald Danforth Plant Sciences Center St Louis Missouri 63132 (XX WBB) Text] Shahmuradov IA Gammerman AJ Hancock JM Bramley PM Solovyev VV (2003) PlantProm a database of plant promoter sequences

Plant Physiology April 2004 Vol 134 pp 1ndash12 wwwplantphysiolorg Characterization of Three Functional High-Affinity Ammonium Transporters in Lotus japonicus with Differential Transcriptional Regulation and Spatial Expression1

Enrica DrsquoApuzzo2 Alessandra Rogato2 Ulrike Simon-Rosin Hicham El Alaoui3 Ani Barbulova Marco Betti Maria Dimou Panagiotis Katinakis Antonio Marquez Anne-Marie Marini Michael K Udvardi and Maurizio Chiurazzi Institute of Genetics and Biophysics Via Marconi 12 80125 Napoli Italy (ED AR HEA AB MC) Molecular Plant Nutrition Group Max Planck Institute of Molecular Plant Physiology Am Muhlenberg 1 14476 Golm Germany (US-R MKU) Universiteacute Libre de Bruxelles Institut de Biologie de Meacutedecine Moleacuteculaires 6041 Gosselies Belgium (A-MM) Agricultural University of Athens Department of Agricultural Biotechnology 11855 Athens Greece (MD PK) and Departamento de Bioquıacutemica Vegetal y Biologıacutea Molecular Facultad de Quimica 41080 Seville Spain (MB AM) hellipPlantProm a database of plant promoter sequences Nucleic Acids Res 31 114ndash117 Nucleic Acids Research 1 January 2003 vol 31 no 1 pp 114-117(4) PlantProm a database of plant promoter sequences Authors Bramley PM1 Solovyev VV2 Shahmuradov IA Gammerman AJ Hancock JM Affiliations Department of Computer Science Royal Holloway University of London Egham Surrey TW20 0EX UK 1 School of Biological Sciences Royal Holloway University of London UK 2 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 3 To whom correspondence should be addressed Email victorsoftberrycom Present address John M Hancock MRC Mammalian Genetics Unit Harwell Oxfordshire UK

PROMH Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK Received February 15 2003 Revised and Accepted March 21 2003

Other FGENESH Molecular Microbiology Volume 52 Issue 6 Page 1579 - June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Affiliations Departments of Medicine and Pathobiology University of Washington Harborview Medical Center Box 359779 325 Ninth Ave Seattle WA 98104 USA E-mail acenturuwashingtonedu Tel (+1) 206 341 5364 Fax (+1) 206 341 5363 Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Microbiology 150 (2004) 518-520 DOI 101099mic026871-0 IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated with recognizable overlapping genes Mark W Silby1 Paul B Rainey23 and Stuart B Levy14 1 Center for Adaptation Genetics and Drug Resistance Department of Molecular Biology and Microbiology Tufts University School of Medicine Boston MA 02111 USA 2 Department of Plant Sciences University of Oxford South Parks Road Oxford OX1 3RB UK 3 School of Biological Sciences University of Auckland Private Bag 92019 Auckland New Zealand 4 Department of Medicine Tufts University School of Medicine Boston MA 02111 USA Correspondence Stuart B Levy (stuartlevytuftsedu) hellipUsing SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of the iiv5 ORF respectivelyhellip Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 DOI 101128JVI782212576-125902004

Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song1 Qi Wei Qin2 Jin Qiu1 Can Hua Huang1 Fan Wang1 and Choy Leong Hew1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2 Corresponding author Mailing address Department of Biological Sciences National University of Singapore 10 Kent Ridge Crescent Singapore 119260 Singapore Phone 65-68742692 Fax 65-67795671 E-mail dbshewclnusedusg or dbsheadnusedusg Received 19 March 2004 Accepted 29 June 2004

hellipThe whole genome was also submitted to httpwwwsoftberrycom (Softberry Inc Mount Kisco NY) for identification of all potential ORFshellip hellipCoding capacity of the viral genomic DNA sequence Prediction of presumptive genes was carried out by using the viral gene prediction program under the website httpwwwsoftberrycom supplemented with Vector NTI suite 71hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1695-8 Issue Volume 109 Number 4 Date August 2004 Pages 681 - 689

Linkage disequilibrium and sequence diversity in a 500-kbp region around the adh1 locus in elite maize germplasm Mark Jung1 Ada Ching1 Dinakar Bhattramakki2 Maureen Dolan1 Scott Tingey1 Michele Morgante1 3 and Antoni Rafalski1

(1) DuPont Crop Genetics Experimental Station PO Box 80353 Wilmington DE 19880-0353 USA (2) Pioneer Hi-Bred International Inc 7300 NW 62nd Avenue PO Box 1004 Johnston IA 50131-1004 USA (3) Dipartimento di Produzione Vegetale e Tecnologie Agrarie Universita di Udine Via delle Scienze 208 33100 Udine ItalyReceived 26 January 2004 Accepted 2 April 2004 Published online 6 August 2004 Mark Jung Email marktjungcgrdupontcom 1) Gene locations were defined by several methods Annotations provided in Tikhonov et al (1999) were first used then FGENESH gene-finding software DNA Sequence - The Journal of Sequencing and Mapping Issue Volume 15 Number 4 August 2004 Pages 269 ndash 276 DOI 10108010425170412331279648 Isolation Characterization and Expression Analysis of a Leaf-specific Phosphoenolpyruvate Carboxylase Gene in Oryza sativa Chang-Fa Lin A1 Chun Wei A1 Li-Zhi Jiang A1 A2 Ke-Gui Li A1 Xiao-Yin Qian A1 Kotb Attia A1 Jin-Shui Yang A1 A1 State Key Laboratory of Genetic Engineering Institute of Genetics School of Life Sciences Fudan University Shanghai 200433 PRChina A2 National Key Laboratory for Soil Erosion and Dry Land Farming on Loess Plateau Northwest Sci-Tech University of Agriculture and Forest Shannxi 712100 PRChina tools of GeneMark (httpopal biologygatechedugeneMark) and Softberry (httpwwwsoftberrycom) For the isolation of putative Plant Molecular Biology DOI 101023BPLAN00000382568980957 Issue Volume 54 Number 4 Date March 2004 Pages 519 ndash 532

Genome-Wide Analysis of the GRAS Gene Family in Rice and Arabidopsis Chaoguang Tian1 Ping Wan1 Shouhong Sun1 Jiayang Li1 and Mingsheng Chen1

(1) Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road Chaoyang District Beijing 100101 ChinaMingsheng Chen Email mschengeneticsaccn

database FgeneSH (Salamov and 90 Solovyev 2000) was used for gene prediction pre- 207 dicted by FgeneSH (minor discrepancies exist due 208 Mycological Research (2004) 108 853-857 Cambridge University Press doi 101017S095375620400067X Published Online 13 авг 2004 Combining transcriptome data with genomic and cDNA sequence alignments to make confident functional assignments for Aspergillus nidulans genes Andrew H SIMS a1 Manda E GENT a1 Geoffrey D ROBSON a1 Nigel S DUNN-COLEMAN a2 and Stephen G OLIVER a1c1 a1 School of Biological Sciences University of Manchester The Michael Smith Building Oxford Road Manchester M13 9PT UK E-mail steveolivermanacuk a2 Genencor International Inc 925 Page Mill Road Palo Alto CA 94304 USA c1 Corresponding author Kingdom Page 2 Genewise FgeneSH FgeneSH+) consisting of 9541 putative open reading frames (ORFs) was released in June 2003 We TAG Theoretical and Applied Genetics DOI 101007s00122-004-1603-2 Issue Volume 109 Number 1 Date June 2004 Pages 129 ndash 139

Gene content and density in banana (Musa acuminata) as revealed by genomic sequencing of BAC clones R Aert1 2 L Saacutegi2 and G Volckaert1

(1) Laboratory of Gene Technology Katholieke Universiteit Leuven Kasteelpark Arenberg 21 3001 Leuven Belgium

Present address Laboratory of Tropical Crop Improvement Katholieke Universiteit Leuven (2) Kasteelpark Arenberg 13 3001 Leuven Belgium

R Aert Email RitaAertagrkuleuvenacbe

gscrikengojp) fgenesh version 11 (Salamov and Solovyev 2000 httpwwwsoftberry com) genemarkhmm version 22a (Lukashin and Borodovsky 1998 http Genome Research 142503-2509 2004 Resources EAnnot A genome annotation tool using experimental evidence Li Ding1 Aniko Sabo Nicolas Berkowicz Rekha R Meyer Yoram Shotland Mark R Johnson Kymberlie H Pepin Richard K Wilson and John Spieth Genome Sequencing Center Washington University School of Medicine St Louis Missouri 63110 USA Article and publication are at httpwwwgenomeorgcgidoi101101gr3152604 1 Corresponding author E-mail ldingwatsonwustledu fax (314) 286-1810 Some ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov and Solovyev 2000 ) are based on intrinsic characteristics of coding hellipSome ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov

and Solovyev 2000 ) are based on intrinsic characteristics of coding sequence (eg codon usage consensus splice sites etc) and require training on known genes from the organismhellip

hellipTo further evaluate the performance of EAnnot we compared EAnnot predictions with Ensembl Genscan and Fgenesh predictions using manual annotation as a standard While Genscan and Fgenesh are ab initio programs Ensembl takes into account experimental data a feature shared with EAnnot Ensembl predicted 1037 known genes with 1798 transcripts and 1457 EST genes with 2308 transcripts for chromosome 6 (build 31) while Fgenesh and Genscan predicted 6230 and 6225 genes respectively We evaluated the performance of each program with respect to splice sites transcripts and genes across all of chromosome 6hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1758-x Issue Volume 109 Number 7 Date November 2004 Pages 1434 - 1447

Full-genome analysis of resistance gene homologues in rice

B Monosi1 R J Wisser2 L Pennill1 and S H Hulbert1

(1) Department of Plant Pathology Kansas State University Manhattan KS 66506-5502 USA (2) Department of Plant Pathology Cornell University Ithaca NY 14853 USAReceived 18 February 2004 Accepted 16 June 2004 Published online 10 August 2004 S H Hulbert Email shulbrtksuedu DNA sequences were analyzed using the gene prediction programs GENSCAN (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (Salamov and arXivq-bioGN0402046 v1 27 Feb 2004 Sublinear growth of Information in DNA sequences Giulia Menconi Dipartimento di Matematica Applicata and CISSC Centro Interdisciplinare per lo Studio dei Sistemi Complessi Universit`a di Pisa Via Bonanno Pisano 25b 56126 PISA - Italy menconimaildmunipiit October 23 2003 hellipAs a result four putative genes G1 G2 G3 and G4 have been located by means of Hidden Markov Model-based program FGENESH2 that has been created for predicting multiple genes and their structure in genomic DNA sequences The analysis via FGENESH has been exploited with respect to known genes in Arabidopsis thaliana Their predicted position is illustrated in Figure 13 hellip2This program is available at the website wwwsoftberrycom to which we refer con-cerning the reliability and e_ciency of the algorithmhellip Current Opinion in Plant Biology 2004 7732ndash736 DOI 101016jpbi200409003 Consistent over-estimation of gene number in complex plant genomes Jeffrey L Bennetzen14 Craig Coleman27 Renyi Liu15 Jianxin Ma16 and Wusirika Ramakrishna38 1 Department of Genetics University of Georgia Athens Georgia 30602 USA 2 Department of Plant and Animal Sciences Brigham Young University Provo Utah 84602 USA 3 Department of Biological Sciences Michigan Tech University Houghton Michigan 49931 USA

4e-mail maizeugaedu 5e-mail lryugaedu 6e-mail jmaugaedu 7e-mail biotechnologybyuedu 8e-mail wusirikamtuedu hellipWe have found that the standard gene-discovery programs FGENESH GeneMark and GENSCAN annotate segments of most retrotransposons and many invertedrepeat transposable elements as genes Using FGENESH to annotate maize BAC clones for instance 70ndash100 of the predicted genes are actually from transposable elementshellip The Plant Cell 162795-2808 (2004) Spotted leaf11 a Negative Regulator of Plant Cell Death and Defense Encodes a U-BoxArmadillo Repeat Protein Endowed with E3 Ubiquitin Ligase Activity Li-Rong Zenga Shaohong Qua Alicia Bordeosb Chengwei Yangc Marietta Baraoidanb Hongyan Yanc Qi Xiec Baek Hie Nahmd Hei Leungb and Guo-Liang Wanga1 a Department of Plant Pathology Ohio State University Columbus Ohio 43210 b International Rice Research Institute Metro Manila Philippines c State Key Lab for Biocontrol Sun Yat-sen (Zhongshan) University Guangzhou China 510275 d Department of Biological Science Myongji University Kyonggido Korea 449728 1 To whom correspondence should be addressed E-mail wang620osuedu fax 614-292-4455 in spl11 Exons predicted in G3 by the programs GENSCAN and Fgenesh using different matrixes are displayed in dark gray (D) RFLP Source Human Genomics Volume 1 Number 2 January 2004 pp 146-149(4) Publisher Henry Stewart Publications The truth about mouse human worms and yeast Authors David R Nelson1 Daniel W Nebert2 1 Department of Molecular Sciences and The UT Center of Excellence in Genomics and Bioinformatics University of Tennessee Memphis Tennessee 38163 USA 2 Department of Environmental Health and Center for Environmental Genetics (CEG) University of Cincinnati Medical Center Cincinnati Ohio 45267-0056 USA unpublished data 2003 see also Ref [7]) FGENESH 21 TWINSCAN 22 and the Ensembl annotation pipeline 23 The output of the four Genome Biology 2004 5R73 doi101186gb-2004-5-10-r73 A comprehensive transcript index of the human genome generated using microarrays and computational approaches Eric E Schadt 1 Stephen W Edwards 1 Debraj GuhaThakurta1 Dan Holder2 Lisa Ying2 Vladimir Svetnik2 Amy Leonardson1 Kyle W Hart3 Archie Russell1 Guoya Li1 Guy Cavet1 John Castle1 Paul McDonagh4 Zhengyan Kan1 Ronghua Chen1 Andrew Kasarskis1 Mihai Margarint1 Ramon M Caceres1 Jason M Johnson1

Christopher D Armour1 Philip W Garrett-Engele1 Nicholas F Tsinoremas5 and Daniel D Shoemaker1

1Rosetta Inpharmatics LLC 12040 115th Avenue NE Kirkland WA 98034 USA 2Merck Research Laboratories W42-213 Sumneytown Pike POB 4 Westpoint PA 19846 USA 3Rally Scientific 41 Fayette Street Suite 1 Watertown MA 02472 USA 4Amgen Inc 1201 Amgen Court W Seattle WA 98119 USA 5The Scripps Research Institute Jupiter FL 33458 USA hellipGrailEXP 40 [47] GENSCAN 10 [48] FGENESH [49] and FGENESH+ [49]ab initio gene-prediction algorithms were run independently across the entire genome assembly to augment alignment-based gene identification methods GrailEXP 40 GENSCAN 10 and FGENESH version 1c were run with default parameters for human sequence GrailEXP used expressed sequence evidence from RefSeq UniGene and DoubleTwist HGI to refine gene predictions FGENESH+ was run with protein sequences from BLASTX with E-score lower than 10-5 When multiple protein alignments overlapped all overlapping protein sequences were clustered with BLASTClust [50] and the lowest E-score hit was used by FGENESH+hellip hellipAdditionally 3 sim4 and 3 or 5 FGENESH+ predicted exons that were short andor distant from internal predicted exons were removedhellip Genome Research 14988-995 2004 ENSEMBL Special GeneWise and Genomewise Ewan Birney13 Michele Clamp2 and Richard Durbin2 1 The European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK hellipThere has been a long history of successful ab initio programs which do not use any additional

evidence to predict genes on genomic DNA of which Genscan (Burge and Karlin 1997 ) and Fgenesh (Solovyev and Salamov 1997 ) are two of the most successful caseshellip hellipAnother class of evidence-based gene prediction programs are ones which use external evidence to influence the scoring of potential exons including SGP-2 (Parra et al 2003 ) Genie

(Kulp et al 1996 ) Genomescan (Yeh et al 2001 ) HMMGene (Krogh 2000 ) and Fgenesh++ (Solovyev and Salamov 1997 )hellip Published online before print February 5 2004 101073pnas0308430100 PNAS | February 17 2004 | vol 101 | no 7 | 1910-1915 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin Nam Joonyul Kim para Shinyoung Lee Gynheung An Hong Ma and Masatoshi Nei Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State

University University Park PA 16802 and National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Contributed by Masatoshi Nei December 22 2003

hellipBecause annotation of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program FGENESH (wwwsoftberrycom) from the genome sequences obtained from TIGR and the Rice Genome Database (China) (25) Functional amp Integrative Genomics DOI 101007s10142-004-0109-y Issue Volume 4 Number 2 Date May 2004 Pages 102 - 117

Sequence analysis of the long arm of rice chromosome 11 for ricendashwheat synteny

Nagendra K Singh1 Saurabh Raghuvanshi2 Subodh K Srivastava1 Anupama Gaur2 Ajit K Pal1 Vivek Dalal1 Archana Singh1 Irfan A Ghazi1 Ashutosh Bhargav1 Mahavir Yadav1 Anupam Dixit1 Kamlesh Batra1 Kishor Gaikwad1 Tilak R Sharma1 Amitabh Mohanty2 Arvind K Bharti2 Anita Kapur2 Vikrant Gupta2 Dibyendu Kumar2 Shubha Vij2 Ravi Vydianathan2 Parul Khurana2 Sulabha Sharma2 W Richard McCombie3 Joachim Messing4 Rod Wing5 Takuji Sasaki6 Paramjit Khurana2 Trilochan Mohapatra1 Jitendra P Khurana2 and Akhilesh K Tyagi2

1 Indian Initiative for Rice Genome Sequencing National Research Centre on Plant Biotechnology

Indian Agricultural Research Institute New Delhi 110012 India 2 Indian Initiative for Rice Genome Sequencing Department of Plant Molecular Biology University of

Delhi South Campus New Delhi 110021 India 3 Genome Research Centre Cold Spring Harbor Laboratory 1 Bungtown Road New York USA 4 The Plant Genome Initiative at RutgersndashWaksman Institute Rutgers University 190 Frelinghuysen

Road Piscataway NJ 08873 USA 5 Department of Plant Sciences 303 Forbes Building Arizona Genomics Institute The University of

Arizona Tucson AZ 85721 USA 6 Department of Genome Research National Institute of Agrobiological Sciences 1-2 Kannondai 2-

chome Tsukuba Ibaraki 305-8602 Japan Akhilesh K Tyagi Email akhileshgenomeindiaorg Wherever RiceGAAS data were not available the genes were predicted by FGENESH trained for monocot plant species (http wwwsoftberrycomberryphtml) TAG Theoretical and Applied Genetics DOI 101007s00122-004-1621-0 Issue Volume 109 Number 1 Date June 2004 Pages 10 - 22

Annotation of a 95-kb Populus deltoides genomic sequence reveals a disease resistance gene cluster and novel class I and class II transposable elements

M Lescot1 4 S Rombauts1 J Zhang1 S Aubourg1 5 C Matheacute1 6 S Jansson2 P Rouzeacute1 3 and W Boerjan1

1 Department of Plant Systems Biology Flanders Interuniversity Institute for Biotechnology Ghent University Technologiepark 927 9052 Gent Belgium

2 Department of Plant Physiology University of Umearing 901 87 Umearing Sweden 3 Laboratoire Associeacute de l Institut National de la Recherche Agronomique (France) Ghent

University 9052 Gent Belgium 4 Present address CIRAD-Biotrop TA4003 34398 Montpellier Cedex 5 France 5 Pre sent address Uniteacute de Recherche en Geacutenomique Veacutegeacutetale INRA 91057 Evry Cedex

France 6 Present address Laboratoire de Biologie Vasculaire Institut de Pharmacologie et Biologie

Structurale 205 route de Narbonne 31077 Toulouse Cedex France

P Rouzeacute Email pierrerouzepsbugentbe 1999 httpwwwtigrorgtdbglimmermglmr_formhtml) and FgenesH for dicots or monocots (Salamov and Solovyev 2000 httpwwwsoftberrycom) BIOINFORMATICS 2004 vol20 N9 p1416-1427 J Yuan B Bush A Elbrecht Y Liu T Zhang W Zhao hellip - suchasGRAIL(Lopezetal 1994 Roberts 1991 Uberbacher et al 1996) GENESCOPE (Murakami and Takagi 1998) fgenesh (Salamov and Solovyev 2000) GeneMark Molecular Plant Pathology Volume 5 Issue 6 Page 515 - November 2004 Pathogen profile Heading for disaster Fusarium graminearum on cereal crops RUBELLA S GOSWAMI AND H CORBY KISTLERCorrespondence E-mail HCKISTumnedu This pipeline uses a combination of the programs FGENESH and FGENESH+ (Salamov and Solovyev 2000) modified by Softberry ( http wwwsoftberrycom ) with Nucleic Acids Research 2004 Vol 32 Database issue D41-D44 MIPS analysis and annotation of proteins from whole genomes H W Mewes12 C Amid1 R Arnold1 D Frishman2 U Guumlldener1 G Mannhaupt2 M Muumlnsterkoumltter1 P Pagel1 N Strack2 V Stuumlmpflen1 J Warfsmann1 and A Ruepp1 1 Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1 D-85764 Neuherberg Germany and 2 Technische Universitaumlt Muumlnchen Chair of Genome Oriented Bioinformatics Center of Life and Food Science D-85350 Freising-Weihenstephan Germany To whom correspondence should be addressed at Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 D-85764 Neuherberg Germany Tel +49 89 3187 3580 Fax +49 89 3187 3585 Email wmewesgsfde The genome of 40 Mb encodes 10 000 proteins automatically predicted by the program FGENESH (httpsoftberry com) specifically trained for Neurospora Annual Review of Genomics and Human Genetics Vol 5 15-56 (Volume publication date September 2004) COMPARATIVE GENOMICS Webb Miller Kateryna D Makova Anton Nekrutenko and Ross C Hardison The Center for Comparative Genomics and Bioinformatics The Huck Institutes of Life Sciences and the Departments of Biology Computer Science and Engineering and Biochemistry and Molecular Biology Pennsylvania State University University Park Pennsylvania email webbbxpsuedu kdm16psuedu antonbxpsuedu rch8psuedu These algorithms include Genscan the most popular gene prediction tool (24) GenMark (117) Fgenesh (155) GeneID (144) and others (for an excellent overview DNA and Cell Biology May 2004 Vol 23 No 5 311-324 Harbinger Transposons and an Ancient HARBI1 Gene Derived from a Transposase

Vladimir V Kapitonov Genetic Information Research Institute Mountain View California Jerzy Jurka Genetic Information Research Institute Mountain View California We used FGENESH (Salamov and Solovyev 2000) and GeneScan (Burge and Karlin 1997) for the identification of exons and introns The d N d S ratio Proc Natl Acad Sci U S A 2004 February 17 101(7) 1910ndash1915 Published online 2004 February 5 doi 101073pnas0308430100 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin NamdaggerDagger Joonyul Kimsectpara Shinyoung Leesect Gynheung Ansect Hong Madagger and Masatoshi Neidagger

daggerInstitute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University University Park PA 16802 and sectNational Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Dagger To whom correspondence should be addressed E-mail jyn101psuedu paraPresent address Michigan State UniversityndashDepartment of Energy Plant Research Laboratory and Department of Biochemistry and Molecular Biology Michigan State University East Lansing MI 48824 of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program fgenesh (wwwsoftberrycom Nucleic Acids Research 2004 Vol 32 Database issue D377-D382 BGI-RIS an integrated information resource and comparative analysis workbench for rice genomics Wenming Zhao1 Jing Wang2 Ximiao He1 Xiaobing Huang1 Yongzhi Jiao1 Mingtao Dai1 Shulin Wei1 Jian Fu1 Ye Chen1 Xiaoyu Ren1 Yong Zhang12 Peixiang Ni1 Jianguo Zhang1 Songgang Li12 Jian Wang1 Gane Ka-Shu Wong13 Hongyu Zhao4 Jun Yu1 Huanming Yang1 and Jun Wang1 1 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3 University of Washington Genome Center Department of Medicine Seattle WA 98195 USA and 4 Yale University School of Medicine Department of Epidemiology and Public Health New Haven CT 06520-8034 USA To whom correspondence should be addressed Tel +86 10 80481662 Fax +86 10 80498676 Email wangjgenomicsorgcn Correspondence may also be addressed to Huanming Yang Tel +86 10 80494969 Fax +86 10 80491181 Email yanghmgenomicsorgcn The authors wish it to be known that in their opinion the first four authors should be regarded as joint First Authors The contig sequences were annotated for gene content by using automated processes that involve ab initio gene finders such as FgeneSH (httpwwwsoftberrycom Genome Research 141932-1937 2004 Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome

Jinsheng Lai1 Nrisingha Dey2 Cheol-Soo Kim35 Arvind K Bharti1 Stephen Rudd46 Klaus FX Mayer4 Brian A Larkins3 Philip Becraft2 and Joachim Messing17 1 Waksman Institute Rutgers The State University of New Jersey Piscataway New Jersey 08854 USA 2 Department of Genetics Development amp Cell Biology Iowa State University Ames Iowa 50011 USA 3 Department of Plant Science University of Arizona Tucson Arizona 85721 USA 4 Munich Information Center for Protein Sequences Institute for Bioinformatics GSF Research Center for Environment and Health Neuherberg Germany A total of 54397 putative genes could be predicted for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for Plant Molecular Biology DOI 101023BPLAN000002876821587dc Issue Volume 54 Number 1 Date January 2004 Pages 55 - 69

Dynamics of the evolution of orthologous and paralogous portions of a complex locus region in two genomes of allopolyploid wheat

Xiu-Ying Kong1 2 Yong Qiang Gu3 Frank M You4 Jorge Dubcovsky4 and Olin D Anderson3

1 Genetic Resources Conservation Program University of California Davis CA 95616 USA 2 Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081

China 3 US Dept of Agriculture Western Regional Research Center Agricultural Research Service 800

Buchanan Street Albany CA 94710 USA 4 Department of Agronomy and Range Sciences University of California Davis CA 95616 USA FGENESH (httpwwwsoftberrycomberryphtml) and GENES- CAN (httpgenemark miteduGENESCANhtml) were used for gene prediction Current Genetics DOI 101007s00294-003-0451-y Issue Volume 44 Number 6 Date January 2004 Pages 329 - 338

Chromosome rearrangements in isolates that escape from het-c heterokaryon incompatibility in Neurospora crassa

Qijun Xiang1 and N Louise Glass1

Department of Plant and Microbial Biology University of California Berkeley CA 94720-3102 USA Hypothetical proteins are predicted from FGENESH calls with overlapping Blastx hits (but not with trusted homology) while Predicted Molecular Genetics and Genomics DOI 101007s00438-004-0990-z Issue Volume 271 Number 4 Date May 2004 Pages 402 - 415

Genome-wide identification of NBS genes in japonica rice reveals significant expansion of divergent non-TIR NBS-LRR genes

T Zhou1 Y Wang1 J-Q Chen1 H Araki2 Z Jing1 K Jiang1 J Shen1 and D Tian1

1 State Key Laboratory of Pharmaceutical Biotechnology Department of Biology Nanjing University 210093 Nanjing China

2 Department of Ecology and Evolution University of Chicago Chicago IL 60637 USA to 5000ndash10000 bp from both ends of the hits and then the expanded nucleotide fragments were reannotated using the gene-finding programs FGENESH (http www Proc Natl Acad Sci U S A 2004 June 15 101(24) 9045ndash9050 Genetics

Genetic control of branching in foxtail millet Andrew N Doust Katrien M DevosDaggerdagger Michael D Gadberrysect Mike D GaleDagger and Elizabeth A KelloggDepartment of Biology University of Missouri 8001 Natural Bridge Road St Louis MO 63121 and DaggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH United Kingdom dagger To whom correspondence should be addressed E-mail adoustumsledu sectPresent address Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building University of Georgia Athens GA 30602 Communicated by Peter H Raven Missouri Botanical Garden St Louis MO April 23 2004 Each of these contigs was scanned by using fgenesh (28) and identified ORFs were translated and compared with ORFs from other contigs from the same QTL region Mol Biol Evol 21(9)1769-1780 2004 Merlin a New Superfamily of DNA Transposons Identified in Diverse Animal Genomes and Related to Bacterial IS1016 Insertion Sequences Ceacutedric Feschotte1 Departments of Plant Biology and Genetics The University of Georgia Athens Correspondence E-mail cedricplantbiougaedu coding sequences were assembled by removing introns predicted with more than 85 confidence by NetGene2 (httpwwwcbsdtudk) andor FGENESH (httpgenomic Genome Research 141924ndash1931 copy2004 Gene Loss and Movement in the Maize Genome Jinsheng Lai1 Jianxin Ma23 Zuzana Swigonˇovб1 Wusirika Ramakrishna24

Eric Linton15 Victor Llaca16 Bahattin Tanyolac17 Yong-Jin Park28 O-Young Jeong29

Jeffrey L Bennetzen23 and Joachim Messing110 1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854-8020 USA 2Department of Biological Sciences Purdue University West Lafayette Indiana 47907-1392 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602-7223 USA 4Department of Biological Sciences 740 DOW Michigan Tech University Houghton MI 49931 USA 5Plant Biology Labs Michigan State University East Lansing MI 48824 USA 6Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880-0353 USA 7Department of Bioengineering Ege University Izmir 35100 Turkey 8National Institute of Agricultural Biotechnology Suwon 441-707 Republic of Korea 9National Institute of Crop Science Suwon 441-857 Republic of Korea 10Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 The FGENESH program predicted four of which three (gene 1d in the maize orp1 region gene 5a 5b in the rice r1 region) would produce truncated proteins

Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Molecular Microbiology Volume 54 Issue 2 Page 407 - October 2004 doi101111j1365-2958200404310x Cryptococcus neoformans Kin1 protein kinase homologue identified through a Caenorhabditis elegans screen promotes virulence in mammals Eleftherios Mylonakis1 Alexander Idnurm2 Roberto Moreno1 Joseph El Khoury134 James B Rottman5 Frederick M Ausubel67 Joseph Heitman28910 and Stephen B Calderwood1111Division of Infectious Diseases Massachusetts General Hospital Boston MA 02114 USA 2Department of Molecular Genetics and Microbiology Duke University Medical Center Durham NC 27710 USA 3Center for Immunology and Inflammatory Diseases and 4Division of Rheumatology Allergy and Immunology Massachusetts General Hospital Boston MA 02114 USA 5Archemix Corporation Cambridge MA 02139 USA 6Department of Molecular Biology Massachusetts General Hospital Boston MA 02114 USA 7Department of Genetics Harvard Medical School Boston MA 02115 USA 8Division of Infectious Diseases 9Department of Medicine and 10Howard Hughes Medical Institute Duke University Medical Center Durham NC 27710 USA 11Department of Microbiology and Molecular Genetics Harvard Medical School Boston MA 02115 USA E-mail scalderwoodpartnersorg

Sequences were compared with the H99 genome database at Duke University and genes predicted in these regions by FGENESH software ( http wwwsoftberrycom TAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Original Paper Characterization of soybean genomic features by analysis of its expressed sequence tags Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

1 Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

2 Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China Jin-Song Zhang Email jszhanggeneticsaccnShou-Yi Chen Email sychengeneticsaccn six BAC-contig sequences of M truncatula were analyzed and the results based on the gene prediction program FGENSH (Arabidopsis matchFGENESH prediction) (http Current Proteomics Volume 1 Number 1 January 2004 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE11 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Chinese Science Bulletin 2004 Vol 49 No 4 355-362 The VER2 promoter contains repeated sequences and requires vernalization for its activity in winter wheat (Triticum aestivum L) XU Wenzhong

1 WANG Xin

1 FENG Qi

2 ZHANG Lei

2 LIU Yaoguang

3 HAN Bin

2 CHONG

Kang1 XU Zhihong

1 amp TAN Kehui

1

1 Research Center for Molecular Developmental Biology Key Lab of Photosynthesis and Environmental Molecular Physiology Institute of Botany Chinese Academy of Sciences (CAS) Beijing 100093 China 2 National Center for Gene Research CAS Shanghai 200233 China 3 Genetic Engineering Laboratory College of Life Sciences South China Agricultural University Guangzhou 510642 China Correspondence should be addressed to Chong Kang (e-mail mailtochongknsibcasaccn) Sequence analyses were finished using biological softwares on Internet such as FGENESH 10 (Prediction of potential genes in Plant (Dct) genomic DNA)

TAG Theoretical and Applied Genetics DOI 101007s00122-003-1457-z Issue Volume 108 Number 3 Date February 2004 Pages 392 - 400

Sequence variations of simple sequence repeats on chromosome-4 in two subspecies of the Asian cultivated rice Can Li1 Yu Zhang1 Kai Ying1 Xiaolei Liang1 and Bin Han1

(1) National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China To characterize the possible relationship between SSRs and genes predicted by using FGENESH we investigated the distribution of SSRs in the rice chromosome-4 Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene TAG Theoretical and Applied Genetics DOI 101007s00122-004-1591-2 Issue Volume 108 Number 8 Date May 2004 Pages 1449 - 1457

Positional cloning of the rice Rf-1 gene a restorer of BT-type cytoplasmic male sterility that encodes a mitochondria-targeting PPR protein

H Akagi1 A Nakamura2 Y Yokozeki-Misono2 A Inagaki2 4 H Takahashi1 K Mori1 and T Fujimura3

1 Laboratory of Plant Breeding and Genetics Department of Biological Production Faculty of Bioresource Sciences Akita Prefectural University Kaidoubata-Nishi 241-7 Shimoshinjyo-Nakano 010-0195 Akita Japan

2 Biochemical Technology Section Life Science Laboratory Performance Materials RampD Center Mitsui Chemicals Togo 1144 297-0017 Mobara Japan

3 Institute of Agricultural and Forest Engineering University of Tsukuba Tennoudai 1-1-1 Tsukuba 305-8572 Ibaraki Japan

4 Present address Department of Synthetic Chemistry and Biological Chemistry Graduate School of Engineering Kyoto University Yoshida-Honmachi Sakyo-ku 606-8501 Kyoto japan

H Akagi Email akagiakita-puacjp Software Develop- ment Tokyo) Genomic sequences were also analyzed using gene prediction programs genescan and fgenesh Table 1 DNA Genome Research 14942-950 2004

The Ensembl Automatic Gene Annotation System Val Curwen1 Eduardo Eyras1 T Daniel Andrews1 Laura Clarke1 Emmanuel Mongin2 Steven MJ Searle1 and Michele Clamp34 1 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 EMBL European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SD UK 3 The Broad Institute Cambridge Massachusetts 02141 USA Commonly we use Genscan for ab initio prediction in human mouse and rat but the system is equally applicable to other methods such as FgenesH (Solovyev et Gene 324 (2004) 105ndash115 Transcript abundance of rml1 encoding a putative GT1-like factor in rice is up-regulated by Magnaporthe grisea and down-regulated by light Rong Wang ab Guofan Hongab Bin Hana a

National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China b

Shanghai Institute of Biochemistry and Cell Biology Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 320 Yueyang Road Shanghai 200031 China 1A and 2A) The structure of rml1 _ a is as same as that predicted by FGENESH software The 3V UTR of rml1 _ a is confirmed with the length of 596 bp Genome Biology 2004 5R46 doi101186gb-2004-5-7-r46 Identification of conserved gene structures and carboxy-terminal motifs in the Myb gene family of Arabidopsis and Oryza sativa L ssp Indica Cizhong Jiang1 Xun Gu1 2 and Thomas Peterson1 1Department of Genetics Development and Cell Biology and Department of Agronomy Iowa State University Ames IA 50011 USA 2LHB Center for Bioinformatics and Biological Statistics Iowa State University Ames IA 50011 USA FGeneSH has been used successfully to predict genes in rice [9] and GenScan was used together with it to predict genes by taking rice genomic sequences as Molecular Microbiology 53 (5) 1307-1318 - September 2004 doi 101111 j1365-2958200404215x The sirodesmin biosynthetic gene cluster of the plant pathogenic fungus Leptosphaeria maculans Donald M Gardiner1 Anton J Cozijnsen1 Leanne M Wilson1 M Soledade C Pedras2 and Barbara J Howlett1

1School of Botany The University of Melbourne Victoria Australia 3010 2Department of Chemistry University of Saskatchewan 110 Science Place Saskatoon SK Canada S7N 5C9 E-mail dgardinerpgradunimelbeduau http wwwtigrorg Putative genes were predicted using Fgenesh software at http wwwsoftberrycom Fungal culture The wild type Journal of Molecular Evolution DOI 101007s00239-004-2666-z Issue Volume 59 Number 6 Date December 2004 Pages 761 - 770 Analysis of the Molecular Evolutionary History of the Ascorbate Peroxidase Gene Family Inferences from the Rice Genome Felipe Karam Teixeira1 Larissa Menezes-Benavente1 Rogeacuterio Margis1 2 and Maacutercia Margis-Pinheiro1

(1) Laboratoacuterio de Geneacutetica Molecular Vegetal Departamento de Geneacutetica UFRJ 21944-970 Rio de Janeiro Brasil(2) Departamento de Bioquiacutemica Instituto de Quiacutemica UFRJ 21944-970 Rio de Janeiro Brasil Genomic se- quences were also analyzed in the FGENESH gene structure pre- diction program (httpwwwsoftberrycom) (Solovyev 2001) and GeneMark program (http Incomplete gene structure prediction with almost 100 specificity SL Chin J Xiong T Ioerger SH Sze - 2004 - txspacetamuedu iv Fgenesh-C The proposed gene structure prediction algorithm by far has the best 25 12 Comparison against Est2Genome Sim4 Spidey Fgenesh-c Journal of Biotechnology 109 (2004) 217ndash226

Preparation of single rice chromosome for construction of a DNA library using a laser microbeam trap

Xiaohui Liu a1 Haowei Wang b1 Yinmei Li b Yesheng Tang a Yilei Liu a Xin u a Peixin Jia a Kai Ying a Qi Feng a Jianping Guan a Chaoqing Jin a Lei Zhang a Liren Lou b Zhuan Zhou c Bin Han aA National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200233 China B Department of Physics University of Science and Technology of China Hefei 230026 China C Institute of Neuroscience Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200031 China Corresponding author Fax +86-21-64825775 E-mail address bhanncgraccn (B Han) ers These sequences were further annotated using gene-prediction software FGENESH to give the pos- sible protein-coding region Science 303 1364-1367 Medicago truncatula DMI1 Required for Bacterial and Fungal Symbioses in Legumes Ane et al (2004) 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberry phtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Proc Natl Acad Sci U S A 2004 August 24 101(34) 12404ndash12410 Inaugural Articles Rapid recent growth and divergence of rice nuclear genomes Jianxin Ma and Jeffrey L Bennetzen

Department of Genetics University of Georgia Athens GA 30602 To whom correspondence should be addressed E-mail maizeugaedu Contributed by Jeffrey L Bennetzen May 25 2004 Almost all LTR-retrotransposons including solo LTRs identified in our studies were predicted as genes by the gene-finding program fgenesh (data not shown) The Plant Journal Volume 37 Issue 4 Page 517 -527 - February 2004 doi101046j1365-313X200301976x Xa26 a gene conferring resistance to Xanthomonas oryzae pv oryzae in rice encodes an LRR receptor kinase-like protein Xinli Sun Yinglong Cao Zhifen Yang Caiguo Xu Xianghua Li Shiping Wang and Qifa Zhang National Key Laboratory of Crop Genetic Improvement National Center of Crop Molecular Breeding Huazhong Agricultural University Wuhan 430070 China For correspondence (fax +86 27 87287092 e-mail swangmailhzaueducn) al 1997) Gene prediction programs used were genscan (Burge and Karlin 1997) and fgenesh (httpwwwsoftberrycom) Promoter Genome Research 141916ndash1923 copy2004 by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonˇovaґ16 Jinsheng Lai16 Jianxin Ma23 Wusirika Ramakrishna24

Victor Llaca15 Jeffrey L Bennetzen23 and Joachim Messing17

1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854 USA

2Department of Biological Sciences and Genetics Program West Lafayette Indiana 47907 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602 USA 4Department of Biological Sciences Michigan Tech University MI 49931 USA 5Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880 USA 6These authors contributed equally to this work 7Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 Page 1 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonovaacute 16 Jinsheng Lai 16 Jianxin Ma 23 Wusirika Ramakrishna 24 TAG Theoretical and Applied Genetics DOI 101007s00122-004-1667-z Issue Volume 109 Number 3 Date August 2004 Pages 515 - 522

Physical mapping and putative candidate gene identification of a quantitative trait locus Ctb1 for cold tolerance at the booting stage of rice K Saito1 Y Hayano-Saito1 W Maruyama-Funatsuki1 Y Sato1 and A Kato1

(1) National Agricultural Research Center for Hokkaido Region Hitsujigaoka 1 Toyohira Sapporo Hokkaido 062-8555 JapanK Saito Email kjsaitoaffrcgojp GENSCAN RICEHMM FGENESH MZEF ) a splice prediction program ( SPLICEPREDIC- TOR ) homology search analysis programs ( BLAST HMMER TAG Theoretical and Applied Genetics DOI 101007s00122-004-1697-6 Issue Volume 109 Number 4 Date August 2004 Pages 690 - 699 The anthracnose resistance locus Co-4 of common bean is located on chromosome 3 and contains putative disease resistance-related genes M Melotto1 4 M F Coelho1 A Pedrosa-Harand2 J D Kelly3 and L E A Camargo1

1 Departamento de Fitopatologia Laboratoacuterio de Geneacutetica Molecular ESALQ Universidade de Satildeo Paulo Piracicaba SP CP 9 13418-900 Brazil

2 Department of Cell Biology and Genetics Institute of Botany University of Vienna Rennweg 14 Vienna 1030 Austria

3 Department of Crop and Soil Sciences Michigan State University East Lansing MI 48824 USA 4 Present address MSU-DOE Plant Research Laboratory Michigan State University 206 Plant Biology

Building East Lansing MI 48824 USA M Melotto Email melottommsuedu and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (httpwww softberrycom)mdashusing Arabidopsis as the model or- ganism Journal of Genetics Vol 83 No 1 P 79-99 April 2004 Structural and functional analysis of rice genome Tyagi A K Khurana J P Khurana P Raghuvanshi S Gaur A Kapur A Gupta V Kumar D Ravi V Vij S Khurana P and Sharma S Department of Plant Molecular Biology University of Delhi South Campus Benito Juarez Road New Delhi 110 021 India It inte- grates results from several gene prediction software such as GENSCAN (Burge and Karlin 1997) FGENESH (Sala- mov and Solovyev 2000) RiceHMM (Sakata The Plant Cell 161220-1234 (2004) Comparative Analysis of the Receptor-Like Kinase Family in Arabidopsis and Rice Shin-Han Shiua Wojciech M Karlowskib Runsun Panad Yun-Huei Tzengac Klaus F X Mayerb and Wen-Hsiung Lia1 a Department of Ecology and Evolution University of Chicago Chicago Illinois 60637 b Munich Information Center for Protein SequencesInstitute of Bioinformatics GSF National Research Center for Environment and Health Neuherberg 85764 Germany c Department of Mathematics National Tsing Hua University Hsinchu Taiwan 300 d Institute of Information Science Academia Sinica Taiwan 115

1 To whom correspondence should be addressed E-mail whliuchicagoedu fax 773-702-9740 a permissive E value cutoff of 1 The rice genes from the indica subspecies was predicted using the whole genome shotgun assembly with FGENESH (Solovyev 2002 Genome Research 141474ndash1482 (2004) copy by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Incongruent Patterns of Local and Global Genome Size Evolution in Cotton Corrinne E Grover1 HyeRan Kim2 Rod A Wing2 Andrew H Paterson3 and Jonathan F Wendel14 1Department of Ecology Evolution and Organismal Biology Iowa State University Ames Iowa 50011 USA 2Arizona Genomics Institute University of Arizona Tucson Arizona 85721 USA 3Plant Genome Mapping Laboratory University of Georgia Athens Georgia 30602 USA hellip Potential genes were predicted by three independent programs FGENESH (httpwwwsoftberrycom) Plant Physiology May 2004 Vol 135 pp 459-470 Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four Triticeae Genomes Yong Qiang Gu Devin Coleman-Derr Xiuying Kong and Olin D Anderson United States Department of Agriculture-Agricultural Research Service Western Regional Research Center Albany California 94710 (YQG DC-D ODA) and Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081 China (XK) FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology October 2004 Vol 136 pp 3177-3190 Comparative Sequence Analysis of the Region Harboring the Hardness Locus in Barley and Its Colinear Region in Rice1

Katherine S Caldwell2 Peter Langridge and Wayne Powell Scottish Crop Research Institute Invergowrie Dundee DD2 5DA United Kingdom (KSC WP) and School of Agriculture and Wine (KSC PL) and Australian Centre for Plant Functional Genomics (PL) University of Adelaide Waite Campus Glen Osmond South Australia 5064 Australia jp Sakata et al 2002 ) which couples the integration of several programs for the prediction of open reading frames (GENSCAN RiceHMM FGENESH MZEF) with GENES amp DEVELOPMENT 18687-699 2004 pyramus and thisbe FGF genes that pattern the mesoderm of Drosophila embryos

Angelike Stathopoulos1 Bergin Tam1 Matthew Ronshaugen1 Manfred Frasch2 and Michael Levine13 1 Department of Molecular and Cell Biology Division of Genetics amp Development University of California Berkeley California 94720-3204 USA 2 Brookdale Department of Molecular Cell and Developmental Biology Mount Sinai School of Medicine New York New York 10029 USA hellip FGF protein sequences used in alignment and phylogenetic reconstruction were gathered from GenBank or inferred from genomic sequence using GENESCAN (Burge and Karlin 1997 ) and FGENESHhellip Genome Research 141888-1901 2004 Organization and Evolution of a Gene-Rich Region of the Mouse Genome A 127-Mb Region Deleted in the Del(13)Svea36H Mouse Ann-Marie Mallon14 Laurens Wilming24 Joseph Weekes1 James GR Gilbert2 Jennifer Ashurst2 Sandrine Peyrefitte2 Lucy Matthews2 Matthew Cadman1 Richard McKeone1 Chris A Sellick1 Ruth Arkell1 Marc RM Botcherby3 Mark A Strivens1 R Duncan Campbell3 Simon Gregory25 Paul Denny1 John M Hancock16 Jane Rogers2 and Steve DM Brown1 1 Medical Research Council Mammalian Genetics Unit Harwell Oxfordshire United Kingdom 2 Wellcome Trust Sanger Institute Hinxton Genome Campus United Kingdom 3 Medical Research Council Rosalind Franklin Centre for Genomics Research Hinxton Genome Campus United Kingdom hellip Ab initio gene structures were predicted using FGENESH (Salamov and Solovyev 2000 ) and GENSCANhellip Current Proteomics January 2004 vol 1 no 1 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE1 Affiliations 1 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom

sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene Nucleic Acids Research 2003 Vol 31 No 4 1148-1155 Characterization of Arabidopsis thaliana ortholog of the human breast cancer susceptibility gene 1 AtBRCA1 strongly induced by gamma rays S Lafarge and M-H Montaneacute CEA Cadarache DSV-DEVM Laboratoire de Radiobiologie Veacutegeacutetale Bat 185 F-13108 St Paul Lez Durance Cedex France To whom correspondence should be addressed Tel +33 4 42 25 35 56 Fax + 33 4 42 25 26 25 Email marie-helenemontaneceafr Received November 18 2002 Accepted December 5 2002 DDBJEMBLGenBank accession no AF515728 hellipGene structure prediction was done on software implemented on the Softberry web page (httpwwwsoftberrycom) analysis of protein domains using the SMARThellip hellipThe gene structure of At4g21070 was determined with three gene structure prediction software packages (Softberry GenScan Grail) hellip To resolve this ambiguity in intronndashexon prediction we postulated the presence of two genes given by Softberry prediction software and performed northern blotting and 5 RACE to characterize the structural organization of the At4g21070 locushellip Proc Natl Acad Sci U S A 2003 July 22 100(15) 9055ndash9060 doi 101073pnas1032999100 Plant Biology Gene expression of a gene family in maize based on noncollinear haplotypes Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway NJ 08854-8020 Communicated by Brian A Larkins University of Arizona Tucson AZ May 19 2003 (received for review 2002 April 10) To whom correspondence should be addressed E-mail messingmbclrutgersedu The FGENESH program (Softberry Mount Kisco NY) was used for gene prediction analysis BMC Genomics 2003 4 22

doi 1011861471-2164-4-22 Published online 2003 June 3 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams 12 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA 3Department of Biological Sciences Wichita State University Wichita Kansas USA

Corresponding author Sreedhar Oduru odurusreedharttuhscedu Janee L Campbell janeecampbellttuhscedu SriTulasi Karri phrskttuhscedu William J Hendry williamhendrywichitaedu Shafiq A Khan shafiqkhanttuhscedu Simon C Williams simonwilliamsttuhscedu

Two gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=H Insect Molecular Biology Volume 12 Issue 4 Page 319 - August 2003 doi101046j1365-2583200300415x Expression of an Aedes aegypti cation-chloride cotransporter and its Drosophila homologues V Filippov K Aimanova and S S GillAffiliations Department of Cell Biology and Neuroscience University of California Riverside USA Correspondence Sarjeet S Gill 5429 Boyce Hall Environmental Toxicology Graduate Program University of California Riverside CA 92521 USA Tel +1 909 787 4621 Fax +1 909 787 3087 E-mail Sarjeetgillucredu significant similarity to the Drosophila genes were used for gene structure prediction with the FGENESH program available on site http wwwsoftberrycom Developmental Biology 256 (2003) 276ndash289 tcl-2 encodes a novel protein that acts synergistically with Wnt signaling pathways in C elegans Xiaojun Zhaoa Hitoshi Sawab and Michael A Hermana a Program in Molecular Cellular and Developmental Biology Division of Biology Kansas State University Manhattan KS 66506 USA b Laboratory for Cell Fate Decision RIKEN Center for Developmental Biology 2-2-3 Minatojima-minamimachi Chuo-ku Kobe 650-0047 Japan Received for publication 10 September 2002 revised 25 November 2002 accepted 19 December 2002 hellipCbTCL-2 is conceptually translated from a gene predicted by the FGENSH (Salamov and Solovyev 2000 httpwwwsoftberrycom) using defaults for C elegans genomic sequences Proc Natl Acad Sci U S A 2003 May 27 100(11) 6569ndash6574 doi 101073pnas0732024100 Evolution Molecular paleontology of transposable elements in the Drosophila melanogaster genome Vladimir V Kapitonov and Jerzy Jurka

Genetic Information Research Institute 2081 Landings Drive Mountain View CA 94043 Communicated by Margaret G Kidwell University of Arizona Tucson AZ April 7 2003 (received for review 2002 December 23)

To whom correspondence may be addressed E-mail vladimirulamgirinstorg or jurkagirinstorg hellipWe used FGENESH (ref 18 wwwsoftberrycom) for identifying genes encoded by TEs Genetics and Molecular Biology ISSN 1415-4757 versioacuten impresa Genet Mol Biol v26 n4 Satildeo Paulo dic 2003 Iron homeostasis related genes in rice Jeferson GrossI II Ricardo Joseacute SteinII Arthur Germano Fett-NetoI II Janette Palma FettI II

IUniversidade Federal do Rio Grande do Sul Centro de Biotecnologia Porto Alegre RS Brazil IIUniversidade Federal do Rio Grande do Sul Departamento de Botacircnica Porto Alegre RS Brazil IIIBotanical Institute of Ludwig-Maximilians-Universitaumlt Muumlnchen Germany The prediction algorithms were GenScan (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) GenomeScan (Burge and Karlin 1997 httpgenesmitedugenomescanhtml) FGENESH (Salamov and Solovyev 2000 httpwwwsoftberrycomberryphtmltopic= gfind) GeneMarkhmm (Borodovsky and Lukashin unpublished httpopalbiologygatecheduGeneMarkeukhmmcgi) and GrailEXP (Xu and Uberbacher 1997 httpcompbioornlgovgrailexp) Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome J Lai N Dey CS Kim AK Bharti S Rudd KFX Mayer hellip for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for monocotyledonous genes (httpwwwsoftberrycom) Published 23 September 2003 BMC Plant Biology 2003 36 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya12 Address 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia and 2NSW Agricultural Genomics Centre Wagga Wagga Australia Email Qian-Hao Zhu - qianhaozhucsiroau Mohammad Shamsul Hoque - mohammadhoquecsiroau Elizabeth S Dennis - lizdenniscsiroau Narayana M Upadhyaya - narayanaupadhyayacsiroau Corresponding author hellip Analyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCANhellip

The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Australasian Plant Pathology Volume 32 Number 4 2003 pp 511-519 Small scale functional genomics of the blackleg fungus Leptosphaeria maculans analysis of a 38 kb region Alexander Idnurm Janet L Taylor M Soledade C Pedras and Barbara J Howlett vertebrate and Arabidopsis settings Burge and Karlin 1997) and FGENESH on Neurospora crassa and Schizosaccharomyces pombe settings (wwwsoftberrycom) as Barley Genetics Newsletter Volume 32 Hard-copy edition pages 34 - 37 MAPPING AND SEQUENCING OF THE BARLEY PUTATIVE HYPERSENSITIVE INDUCED REACTION GENES Nils Rostoks1 David Kudrna1 and Andris Kleinhofs12

1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 2 School of Molecular Biosciences Washington State University Pullman WA 99164 The full length coding sequence was reconstructed using a combination of FGENESH gene prediction program (httpwwwsoftberrycom) and alignment with cDNAs from the other barley HIR groups TAG Theoretical and Applied Genetics DOI 101007s00294-003-0391-6 Issue Volume 43 Number 5 Date August 2003 Pages 351 - 357 Characterisation of the mating-type locus of the plant pathogenic ascomycete Leptosphaeria maculans Anton J Cozijnsen A1 and Barbara J Howlett A1 A1 School of Botany The University of Melbourne 3010 Victoria Australia hellipGenes introns exons and transcription initiation sites were predicted by analysis with FGENESH (wwwsoftberrycom) on Neurospora crassa andhellip BMC Plant Biol 2003 3 6

doi 1011861471-2229-3-6 Published online 2003 September 23 Received May 23 2003 Accepted September 23 2003 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya 12 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia 2NSW Agricultural Genomics Centre Wagga Wagga Australia

Corresponding author Qian-Hao Zhu qianhaozhucsiroau Mohammad Shamsul Hoque mohammadhoquecsiroau Elizabeth S Dennis lizdenniscsiroau Narayana M Upadhyaya narayanaupadhyayacsiroau hellipAnalyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml identified a single-exon gene capable of encoding a protein with the DNA binding domain of the EREBPAP2 family of plant transcription factors [2636] 1515 bp downstream from the Ds insertion The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Alignment of EREBPAP2 domains was performed using programs of Genetics Computer Group Wisconsin software suit [11] Genetics Vol 164 655-664 June 2003 Copyright copy 2003 Map-Based Cloning of Leaf Rust Resistance Gene Lr21 From the Large and Polyploid Genome of Bread Wheat Li Huanga Steven A Brooksa Wanlong Lia John P Fellersb Harold N Tricka and Bikram S Gilla a Wheat Genetics Resource Center Department of Plant Pathology Kansas State University Manhattan Kansas 66506-5502 b USDA-ARS Plant Science and Entomology Unit Kansas State University Manhattan Kansas 66506-5502 Corresponding author Bikram S Gill 4024 Throckmorton Kansas State University Manhattan KS 66506-5502 bsgksuedu (E-mail hellipIn addition FGENSH 11 (httpwwwsoftberrycom) was used for gene prediction (with monocot genomic DNA parameters) Nucleic Acids Research 2003 Vol 31 No 1 229-233 The TIGR rice genome annotation resource annotating the rice genome and creating resources for plant biologists Qiaoping Yuan Shu Ouyang Jia Liu Bernard Suh Foo Cheung Razvan Sultana Dan Lee John Quackenbush and C Robin Buell The Institute for Genomic Research 9712 Medical Center Dr Rockville MD 20850 USA To whom correspondence should be addressed Tel +1 301 8383558 Fax +1 301 8380208 Email rbuelltigrorg Received August 14 2002 Revised and Accepted October 2 2002 hellipThe rice sequences were processed with multiple ab initio gene finders including FGENESH (httpwwwsoftberrycom)hellip hellip Working models were generated using the FGENESH output and putative identification for the gene was obtained from the most significant database match while models with no significant database match were labeled as hypothetical proteins

JXB Advance Access originally published online on June 18 2003 Journal of Experimental Botany Vol 54 No 389 pp 1995-1996 August 1 2003 Received 21 April 2003 Accepted 25 April 2003 OsSET1 a novel SET-domain-containing gene from rice Yun-Kuan Liang Ying Wang Yong Zhang Song-Gang Li Xiao-Chun Lu Hong Li Cheng Zou Zhi-Hong Xu and Shu-Nong Bai PKU-Yale Joint Research Center of Agricultural and Plant Molecular Biology National Key Laboratory of Protein Engineering and Plant Gene Engineering College of Life Sciences Peking University 5 Yiheyuan Road Beijing 100871 PR China To whom correspondence should be addressed Fax +86 10 6275 1526 E-mail shunongbpkueducn It localizes at chromosome three in rice genome at the contig 1300 (httpwwwsoftberrycomberryphtmltopic=gfindampprg=FGENESH GenBank accession number BMC Genomics 2003 422 Published 3 June 2003 Received 31 January 2003 Accepted 3 June 2003 This article is available from httpwwwbiomedcentralcom1471-2164422 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams12 Address 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA and 3Department of Biological Sciences Wichita State University Wichita Kansas USA Email Sreedhar Oduru - odurusreedharttuhscedu Janee L Campbell - janeecampbellttuhscedu SriTulasi Karri - phrskttuhscedu William J Hendry - williamhendrywichitaedu Shafiq A Khan - shafiqkhanttuhscedu Simon C Williams - simonwilliamsttuhscedu Corresponding author hellipTwo gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=HsapiensTAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Characterization of soybean genomic features by analysis of its expressed sequence tags

Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

Jin-Song Zhang Email jszhanggeneticsaccn

Shou-Yi Chen Email sychengeneticsaccn Phone +86-10-64886859 Fax +86-10-64873428

(1) Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

(2) Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China

prediction of these BAC-contig sequences was based on the gene-prediction program FGENSH (Arabidopsis matchFGENESH DDT Vol 7 No 11 (Suppl) 2002 S70-S76 wwwdrugdiscoverytodaycom Genome annotation techniques new approaches and challenges Alistair G Rust Emmanuel Mongin and Ewan Birney European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridge UK CB10 1SD tel +44 1223 494420 fax +44 1223 494468 e-mail birneyebiacuk Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Box 1 Useful human genome annotation and browser URLs Human genome browsers bull UCSC Human Genome Browser httpgenomecseucsceducgi-binhgGateway bull Softberry Genome Explorer httpwwwsoftberrycomberryphtmltopic=genomexp Ab initio gene prediction programs Ab initio gene predictors rely on the statistical qualities of exons rather than on homologies Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Celerarsquos pipeline) and GrailEXP [16] (ORNL) Proc Natl Acad Sci U S A 2002 August 20 99(17) 11423ndash11428 doi 101073pnas162276199 Neurobiology Identification of G protein-coupled receptors for Drosophila PRXamide peptides CCAP corazonin and AKH supports a theory of ligand-receptor coevolution Yoonseong Parkdagger Young-Joon Kim and Michael E AdamsdaggerDagger

Departments of Entomology and daggerCell Biology and Neuroscience 5429 Boyce Hall University of California Riverside CA 92521 Edited by Lutz Birnbaumer National Institutes of Health Research Triangle Park NC and approved June 14 2002 (received for review 2002 May 7) DaggerTo whom reprint requests should be addressed E-mail adamsmailucredu

This article has been corrected See Proc Natl Acad Sci U S A 2002 October 15 99(21) 13961b For each Drosophila GPCR prediction of gene structure was made in FGENESH (wwwsoftberrycom ref 21) by using about 20 kb of genomic sequence surrounding highly conserved regions particularly for 5 prime and 3 prime ends of ORFs Putative Drosophila GPCRs in the database were amplified by RT-PCR using primers based on gene predictions in the FGENESH gene finder (wwwsoftberrycom ref 21) 21 Salamov A A amp Solovyev V V (2000) Genome Res 10 516-522

prediction httpwwwsoftberrycomberry

Eukaryotic Cell October 2002 p 719-724 Vol 1 No 5 Isocitrate Lyase Is Essential for Pathogenicity of the Fungus Leptosphaeria maculans to Canola (Brassica napus) Alexander Idnurm and Barbara J Howlett School of Botany The University of Melbourne Melbourne Victoria 3010 Australia Received 17 June 2002 Accepted 29 July 2002 hellip The DNA sequence obtained was compared to those in the GenBank database by using BLAST (1) and genes were predicted by using FGENESH software (httpwwwsoftberrycom) and GENSCAN (wwwbionavigatorcom) Bio-Almanac GLE ToxExpress P Offering CLG Annotated sequence data The genes are identified with the FGENESH11 gene modeling software exclusively li- censed from Softberry Inc Automatic Analysis of 106 kb of contiguous DNA sequence from the D genome of wheat reveals high gene density hellip SA Brooks L Huang BS Gill JP Fellers trix In addition FGENESH 11 (httpwwwsoftberrycom) was used for CDS prediction with monocot genomic DNA parameters Both Molecular Genetics and Genomics DOI 101007s00438-002-0706-1 Issue Volume 267 Number 6 Date August 2002 Pages 713 - 720 Genome sequencing of a 239-kb region of rice chromosome 10L reveals a high frequency of gene duplication and a large chloroplast DNA insertion Q Yuan J Hill J Hsiao K Moffat S Ouyang Z Cheng J Jiang C Buell A1 The Institute for Genomic Research 9712 Medical Center Drive Rockville MD 20850 USA A2 Department of Horticulture University of Wisconsin Madison WI 53706 USA The sequences were analyzed with several gene prediction programs including FGENESH (httpwwwsoftberrycom) Genemarkhmm (rice matrix httpopalbiology

Genetics Vol 162 1389-1400 November 2002 Copyright copy 2002 Different Types and Rates of Genome Evolution Detected by Comparative Sequence Analysis of Orthologous Segments From Four Cereal Genomes Wusirika Ramakrishnaa Jorge Dubcovskyb Yong-Jin Park1a Carlos Bussob John Embertona Phillip SanMiguelc and Jeffrey L Bennetzena a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Department of Agronomy and Range Science University of California Davis California 95616 c Purdue University Genomics Core WSLR Purdue University West Lafayette Indiana 47907

Corresponding author Jeffrey L Bennetzen Hansen Bldg Purdue University West Lafayette IN 47907 maizebilbobiopurdueedu (E-mail)

FGENESH (httpwwwsoftberrycomnucleohtml) with the maize training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and

GeneMarkhmm (httpgenemarkbiologygatecheduGene Mark) Functional amp Integrative Genomics DOI 101007s10142-002-0055-5 Issue Volume 2 Numbers 1-2 Date May 2002 Pages 51 - 59 Genomic sequencing reveals gene content genomic organization and recombination relationships in barley Nils Rostoks Yong-Jin Park Wusirika Ramakrishna Jianxin Ma Arnis Druka Bryan A Shiloff Phillip J SanMiguel Zeyu Jiang Robert Brueggeman Devinder Sandhu Kulvinder Gill Jeffrey L Bennetzen Andris Kleinhofs A1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA A2 Department of Biological Sciences Purdue University West Lafayette IN 47907 USA A3 National Center for Genome Resources 2935 Rodeo Park Drive East Santa Fe NM 87505 USA A4 G302 Agronomy Hall Iowa State University Ames IA 50011-1010 USA A5 Department of Agronomy University of Nebraska Lincoln NE 68583 USA A6 School of Molecular Biosciences and Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA version 10 with maize parameters The FGENESH predictions were run at httpwwwsoftberrycom BAC genomic regions were defined Structural organization of the barley D-hordein locus in comparison with its orthologous regions of hellip YQ Gu OD Anderson CF Londeore X Kong RN hellip et al 1997) to search for additional genes In addition FGENESH (httpwwwsoftberrycomberryphtml) and GENESCAN (httpgenes

Published online before print June 20 2002 101073pnas142284999 PNAS | July 9 2002 | vol 99 | no 14 | 9328-9333 The barley stem rust-resistance gene Rpg1 is a novel disease-resistance gene with homology to receptor kinases R Brueggeman N Rostoks D Kudrna A Kilian F Han J Chensect A Druka B Steffensonpara and A Kleinhofs Department of Crop and Soil Sciences Washington State University Pullman WA 99164-6420 para Department of Plant Pathology 495 Borlaug Hall 1991 Upper Buford Circle St Paul MN 55108-6030 and School of Molecular Biosciences Washington State University Pullman WA 99164-4234 Communicated by Diter von Wettstein Washington State University Pullman WA May 13 2002 (received for review March 25 2002) The gene prediction programs GENSCAN (httpgenesmiteduGENSCANhtml) and FGENESH (httpwwwsoftberrycom) as well as NEURAL NETWORK PROMOTER

PREDICTION (httpwwwfruitflyorgseq_toolspromoterhtml) localized the putative transcription start site of the gene about 400 bp upstream of the translation start site Plant Physiol 2002 December 130(4) 1626ndash1635 doi 101104pp012179 Received July 30 2002 Accepted October 1 2002 Contiguous Genomic DNA Sequence Comprising the 19-kD Zein Gene Family from Maize1

Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway New Jersey 08854ndash8020 Corresponding author e-mail messingmbclrutgersedu fax 732ndash445ndash0072 Draft sequences generated from high-throughput DNA sequencing (phase II) were subjected to gene prediction programs with FGENESH (Softberry Inc Mount Kisco NY) The Plant Cell Vol 14 3213-3223 December 2002 Copyright copy 2002 Received July 22 2002 accepted September 26 2002 Structural Analysis of the Maize Rp1 Complex Reveals Numerous Sites and Unexpected Mechanisms of Local Rearrangement Wusirika Ramakrishnaa John Embertona Matthew Ogdena Phillip SanMiguelb and Jeffrey L Bennetzen1a a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 1 To whom correspondence should be addressed E-mail maizebilbobiopurdueedu fax 765-496-1496 FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCAN Plant Physiol 2002 December 130(4) 1728ndash1738

doi 101104pp014951 Comparative Sequence Analysis of the Sorghum Rph Region and the Maize Rp1 Resistance Gene Complex Wusirika Ramakrishna John Emberton Phillip SanMiguel Matthew Ogden Victor Llaca Joachim Messing and Jeffrey L Bennetzen

Department of Biological Sciences Purdue University West Lafayette Indiana 47907 (WR JE MO JLB) Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 (PSM) and Waksman Institute Rutgers University Piscataway New Jersey 08854 (VL JM) Corresponding author e-mail maizebilbobiopurdueedu fax 765ndash496ndash1496 Received September 19 2002 Accepted October 8 2002 hellipAnnotation and sequence analysis were performed as described earlier (Dubcovsky et al 2001 Song et al 2001 Ramakrishna et al 2002a ) FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and GeneMarkhmm (httpopalbiologygatecheduGeneMarkeukhmmcgi) Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four hellipYQ Gu D Coleman-Derr X Kong OD Anderson FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology 2004 - plantphysiolorg - plantphysiolorg - intlplantphysiolorg - ncbinlmnihgov - all 5 versions raquoA Genome-Wide Screen Identifies Genes Required for Centromeric Cohesion JJ Doyle J Denarie F Debelle JC Prome BB Amor hellip 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberryphtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Various programs Plant Molecular Biology Issue Volume 58 Number 3 Date June 2005 Pages 421 ndash 433 DOI 101007s11103-005-5702-5 OsPPR1 a pentatricopeptide repeat protein of rice is essential for the chloroplast biogenesis Kodiveri M Gothandam1 Eun-Sook Kim1 Hongjoo Cho1 and Yong-Yoon Chung1

(1) School of Life Sciences and Biotechnology Korea University Sungbuk-ku 136-701 Seoul Anam-Dong Korea nucleotide and amino acid sequences were analyzed by the Basic Local Alignment SearchTool (BLAST) and the Soft berry prog- rame (httpwwwsoftberrycom)

Plant Physiol February 2002 Vol 128 pp 336-340 wwwplantphysiolorgcgidoi101104pp010875 Received September 25 2001 returned for revision September 29 2001 accepted November 2 2001 Cellulose Synthase-Like Genes of Rice1

Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 The Rice Genome Research Program cDNA clones were of high quality all but one were viable and accurately annotated The one exception D22177 was chimeric containing OsCSLA2 at one end and a predicted DNA-binding protein at the other For all sequences the corresponding

proteins were deduced using gene prediction software from GeneMark (Atlanta httpopalbiologygatecheduGeneMark) and Softberry Inc (White Plains NY httpwwwsoftberrycom) and by manual alignment with the Arabidopsis Csl proteins and with each other IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated Silby et al Microbiology2004 150 518-520 MW Silby PB Rainey SB Levy Using SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of Plant Cell 2002 September 14(9) 2107ndash2119 Received March 25 2002 Accepted June 14 2002 Two Novel Fungal Virulence Genes Specifically Expressed in Appressoria of the Rice Blast Fungus

Chaoyang Xuea Gyungsoon Parka Woobong Choib Li Zhengc Ralph A Deanb and Jin-Rong Xua1

aDepartment of Botany and Plant Pathology Purdue University West Lafayette Indiana 47907 bDepartment of Plant Pathology North Carolina State University Raleigh North Carolina 27606 cSyngenta Agribusiness Biotechnology Research Inc Research Triangle Park North Carolina 27709 1To whom correspondence should be addressed E-mail xubtnypurdueedu fax 765-494-0363 Approximately 12- and 14-kb upstream sequences of GAS1 and GAS2 were sequenced and analyzed with several programs including TRES (wwwbioportalbicnusedusgtres) Expasy (wwwexpasyorg) and SoftBerry (wwwsoftberrycom) European Journal of Neuroscience January 2002 vol 15 no 1 pp 79-86(8) Characterizing CGI-94 (comparative gene identification-94) which is down-regulated in the hippocampus of early stage Alzheimers disease brain Heese K Nakayama T Hata R Masumura M Akatsu H Li F Nagai Y Yamamoto T Kosaka K Suemoto T Sawada T Additionally protein sequence analysis was performed using the following programs at ExPASy httpwwwexpasych softberry httpwwwsoftberrycomindex Journal of Cellular Biochemistry Volume 91 Issue 5 Pages 1030 - 1042 Published Online 25 Feb 2004 Copyright copy 2004 Wiley-Liss Inc A Wiley Company Received 4 September 2003 Accepted 21 November 2003 Characterizing the new transcription regulator protein p60TRP K Heese 1 T Yamada 1 H Akatsu 2 T Yamamoto 2 K Kosaka 2 Y Nagai 1 T Sawada 1

1BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan 2Choju Medical Institute Fukushimura Hospital 19-14 Aza-Yamanaka Noyori Toyohashi Aichi 441-8124 Japan email K Heese (heeseksilverocnnejp) Correspondence to K Heese BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan expasych) softberry httpwwwsoftberry comindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorg toolsaacomp Genome Research 14929-933 2004 ISSN 1088-9051 $500 The Ensembl Core Software Libraries Arne Stabenau1 Graham McVicker1 Craig Melsopp1 Glenn Proctor1 Michele Clamp2 and Ewan Birney13 1 EMBL European Bioinformatics Institute Wellcome Trust Genome Campus Hinxton CB10 1SD UK 2 The Broad Institute Cambridge Massachusetts 02141-2023 USA

2003 ) Finally there have been several commercial genome management products based on proprietary technology from Softberry Celera and Doubletwist Generation of T-DNA tagging lines with a bidirectional gene trap vector 2 and the establishment of hellip G An annotated in the public databases we undertook 274 annotation with the Softberry program (http 275 wwwsoftberrycomberryphtml) Functional clas- 276 Journal of Bacteriology January 2002 p 183-190 Vol 184 No 1 0021-919301$0400+0 DOI 101128JB1841183-1902002 Received 25 July 2001 Accepted 11 October 2001 Regulation of the acuF Gene Encoding Phosphoenolpyruvate Carboxykinase in the Filamentous Fungus Aspergillus nidulans Michael J Hynes Oliver W Draht and Meryl A Davis Department of Genetics University of Melbourne Parkville Victoria 3010 Australia The Protein Sequence Analysis program (httpwwwsoftberrycomproteinhtml) predicted a PEPCK (ATP) signature sequence between amino acids 275 and 290 Proteins Structure Function and GeneticsVolume 53 Issue S6 Pages 352 - 368 Supplement Fifth Meeting on the Critical Assessment of Techniques for Protein Structure Prediction Published Online 15 Oct 2003 Received 4 March 2003 Accepted 23 June 2003 Digital Object Identifier (DOI) 101002prot10543 Comp Modeling Assessment Assessment of homology-based predictions in CASP5 Anna Tramontano 1 Veronica Morea 2

1Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Rome Italy 2CNR Institute of Molecular Biology and Pathology University of Rome La Sapienza Rome Italy email Anna Tramontano (AnnaTramontanouniromalit) Correspondence to Anna Tramontano Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Ple Aldo Moro 5-00185 Rome Italy hellipPage 1 Assessment of Homology-Based Predictions in CASP5 Anna Tramontano 1 and Veronica Morea 2 1 Department of Biochemical Sciences American Journal of HematologyVolume 73 Issue 3 Pages 161 - 168 Published Online 20 Jun 2003 Received 23 October 2002 Accepted 15 April 2003 Digital Object Identifier (DOI) 101002ajh10358

Spectrum of thalassemia mutations and HbF levels in the heterozygous Moroccan population Wafaa Lemsaddek 1 Isabel Picanccedilo 2 Filomena Seuanes 2 Lahoucine Mahmal 3 Saacircd Benchekroun 3 Mohammed Khattab 4 Paulo Nogueira 5 Leonor Osoacuterio-Almeida 1

1Laboratoacuterio de Geneacutetica Molecular Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa Caparica Portugal 2Laboratoacuterio de Hematologia Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal 3Service Heacutemato-Oncologie Hocircpital 20 Ao t CHU Ibn Rochd Casablanca Morocco 4Service Heacutemato-Oncologie Peacutediatrique Hocircpital dEnfants CHU Rabat Morocco 5Observatoacuterio Nacional de Sauacutede Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal email Leonor Osoacuterio-Almeida (mlofctunlpt) Correspondence to Leonor Osoacuterio-Almeida Laboratoacuterio de Geneacutetica Molecular Secccedilatildeo Autoacutenoma de Biotecnologia Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa 2829-516 Caparica Portugal hellip Page 1 Spectrum of Thalassemia Mutations and HbF Levels in the Heterozygous Moroccan Population Wafaa Lemsaddek 1 Isabel Picanccedilo Lecture Notes in Computer Science Publisher Springer-Verlag Heidelberg ISSN 0302-9743 Subject Computer Science Volume 2812 2003 Title Algorithms in Bioinformatics Third International Workshop WABI 2003 Budapest Hungary September 15-20 2003 Proceedings Editors Gary Benson Roderic Page ISBN 3-540-20076-2 DOI 101007b13243 Chapter pp 124 - 138 Online Date December 2003 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron1 and Irena Rusu1

(1) IRIN Universiteacute de Nantes 2 Rue de la Houssiniegravere BP 92208 44322 Nantes Cedex 3 France Page 1 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron and Irena Rusu Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Received 23 October 2003 accepted 15 December 2003 Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes

H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Brain Aging Vol 2 No 3 2002 Page 9-22 Long-term Efficacy of Cholinesterase Inhibitors Serge Gauthier McGill Centre for Studies in Aging Quebec Canada Correspondence Dr Serge Gauthier FRCPC McGill Centre for Studies in Aging 6825 LaSalle Boulevard Verdun Quebec Canada H4H 1R3 Tel +1 514 766 2010 Fax +1 514 888 4050 Email sergegauthiermcgillca hellipAdditionally protein sequence analysis was performed using the following programs at the ExPASyndashwwwndashserver (httpwwwexpasych) softberry httpwwwsoftberrycomindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorgtoolsaacomp Plant Physiology December 2003 Vol 133 pp 2040ndash2047 Generation and Analysis of End Sequence Database for T-DNA Tagging Lines in Rice1 Suyoung An2 Sunhee Park2 Dong-Hoon Jeong Dong-Yeon Lee Hong-Gyu Kang Jung-Hwa Yu Junghe Hur Sung-Ryul Kim Young-Hea Kim Miok Lee Soonki Han Soo-Jin Kim Jungwon Yang Eunjoo Kim Soo Jin Wi Hoo Sun Chung Jong-Pil Hong Vitnary Choe Hak-Kyung Lee Jung-Hee Choi Jongmin Nam Seong-Ryong Kim Phun-Bum Park Ky Young Park Woo Taek Kim Sunghwa Choe Chin-Bum Lee and Gynheung An National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790ndash784 Korea (SA SP D-HJ D-YL H-GK J-HY JH S-RK Y-HK ML GA) Department of Life Science Sogang University Seoul 121ndash742 Korea (SH S-JK S-RK) Department of Genetic Engineering Suwon University Suwon 445ndash743 Korea (JY EK P-BP) Department of Biology Sunchon National University Sunchon 540ndash742 Korea (SJW KYP) Department of

Biology Yonsei University Seoul 120ndash749 Korea (HSC J-PH WTK) Department of Biology Seoul National University Seoul 151ndash747 Korea (VC SC) Department of Biology Dong-eui University Pusan 614ndash714 Korea (H-KL J-HC C-BL) and Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University 208 Mueller Laboratory University Park Pennsylvania 16802 (JN) hellipIf a particular sequence had not yet been annotated in the public database the sequence surrounding the insertion site was annotated using the Softberry program (httpwwwsoftberrycom) and the GeneMark program (httpopalbiologygatecheduGeneMark) Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie F Moehrlen ndash Page 1 Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie Frank Moumlhrlen Heidelberg 2002 Page 2 INAUGURAL-DISSERTATION zur heidiubuni-heidelbergde

  • FGENES
  • FGENESH
  • FGENESH++
    • Genome Research 15566-576 2005
    • ECgene Genome-based EST clustering and gene modeling for alternative splicing
    • Genome Research 14685-692 2004
    • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
    • Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes
    • Genome Research
      • Computing Center Academia Sinica Taipei 11529 Taiwan
        • FGENESH+
          • Genome Research 14685-692 2004
          • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
            • Computing Center Academia Sinica Taipei 11529 Taiwan
                • FGENESB
                  • Proteorhodopsin genes are distributed among divergent marine bacterial taxa
                  • Different SAR86 subgroups harbour divergent proteorhodopsins
                    • FGENESV
                      • Genome Organization of the SARS-CoV
                        • FGENES-M
                        • BESTORF
                        • PROTCOMP
                          • A proteomic study of the arabidopsis nuclear matrix
                          • Gene prediction in eukaryota
                            • BPROM
                            • SPLICEDB
                              • The evolving roles of alternative splicing
                              • SpliceDB database of canonical and non-canonical mammalian splice sites
                                • SCAN2
                                  • PromH promoters identification using orthologous genomic sequences
                                    • PDISORDER
                                    • SPL
                                    • NSITE
                                    • TSSP
                                    • PLANTPROM
                                      • Plant promoter prediction with confidence estimation
                                      • PlantProm a database of plant promoter sequences
                                        • PROMH
                                        • Other FGENESH
                                          • TAG Theoretical and Applied Genetics
                                            • MIPS analysis and annotation of proteins from whole genomes
                                              • Gene expression of a gene family in maize based on noncollinear haplotypes
                                              • TAG Theoretical and Applied Genetics
                                              • Genome annotation techniques new approaches and challenges
                                                • Various programs
                                                  • Characterizing the new transcription regulator protein p60TRP
                                                  • The Ensembl Core Software Libraries
                                                  • Lecture Notes in Computer Science
                                                  • Long-term Efficacy of Cholinesterase Inhibitors
Page 5: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)

Sequence manipulations Splice predictions of candidate gene sequences for SXI2a were facilitated with a Softberry algorithm (wwwsoftberrycom)

Plant Mol Biol 2005 Feb57(3)445-60 Evaluation of five ab initio gene prediction programs for the discovery of maize genes Yao H Guo L Fu Y Borsuk LA Wen TJ Skibbe DS Cui X Scheffler BE Cao J Emrich SJ Ashlock DA Schnable PS Department of Genetics Development and Cell Biology Iowa State University Ames 50011-3650 USA hellipThe five programs were used in conjunction with RT-PCR to identify and establish the structures of two new genes in the a1-sh2 interval of the maize genome FGENESH GeneMarkhmm and GENSCAN were tested on a larger data set consisting of maize assembled genomic islands (MAGIs) that had been aligned to ESTs FGENESH GeneMarkhmm and GENSCAN correctly predicted gene models in 773 625 and 371 MAGIs respectively out of the 1353 MAGIs that comprise data set 2hellip New Phytologist 167 (1) July 2005 239-247 doi101111j1469-8137200501392x Identification of perennial ryegrass (Lolium perenne (L)) and meadow fescue (Festuca pratensis (Huds)) candidate orthologous sequences to the rice Hd1(Se1) and barley HvCO1 CONSTANS-like genes through comparative mapping and microsynteny P Armstead L Skoslasht L B Turner K Skoslasht I S Donnison M O Humphreys and I P King Author for correspondence Ian Armstead Tel +44 (0)1970 823108 Fax +44 (0)1970 823242 Email ianarmsteadbbsrcacuk Predictions of mRNA and protein sequences were carried out using fgenesh and fgenesh+ software ( http wwwsoftberrycom berryphtml ) Plant Physiology May 2005 Vol 138 pp 38-46 BIOINFORMATICS-PLANT DATABASES Databases and Information Integration for the Medicago truncatula Genome and Transcriptome1 Steven B Cannon John A Crow Michael L Heuer Xiaohong Wang Ethalinda KS Cannon Christopher Dwan Anne-Francoise Lamblin Jayprakash Vasdewani Joann Mudge Andrew Cook John Gish Foo Cheung Steve Kenton Timothy M Kunau Douglas Brown Gregory D May Dongjin Kim Douglas R Cook Bruce A Roe Chris D Town Nevin D Young and Ernest F Retzel

2 3

Department of Plant Pathology University of Minnesota St Paul Minnesota 55108 (SBC XW EKSC JV JM NDY) Center for Computational Genomics and Bioinformatics University of Minnesota Minneapolis Minnesota 55455 (JAC MLH CD AFL TMK

EFR) Department of Plant Pathology University of California Davis California 95616 (AC JG DJK DRC) The Institute for Genomic Research Rockville Maryland 20850 (FC CDT) Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma 73019 (BAR SK) Plant Biology Division The Samuel Roberts Noble Foundation Ardmore Oklahoma 73401 (GDM) and North Carolina State University Fungal Genomics Laboratory Department of Plant Pathology Raleigh North Carolina 27695 (DB) 1 This work was supported by the National Science Foundation (awards DBIndash0321460 DBIndash0196197 DBIndash0110206 DBIndash9975806 and DBIndash9872565) by the US Department of Agriculture Cooperative State Research Education and Extension ServiceNational Research Initiative Program and by the Samuel Noble Roberts Foundation 2 Present address BioTeam Inc Cambridge MA 3 Present address University of Minnesota Cancer Center MMC 806 420 Delaware St SE Minneapolis MN 55455 wwwplantphysiolorgcgidoi101104pp104059204 Corresponding author e-mail ernestccgbumnedu fax 612ndash626ndash6069 Received December 31 2004 returned for revision March 4 2005 accepted March 21 2005 Annotation involves a multi-institution pipeline relying on Medicago-trained FGENESH (Salamov and Solovyev 2000 ) predictions the EuGene (Foissac et al Plant Physiology May 2005 Vol 138 pp 18-26 BIOINFORMATICS-PLANT DATABASES The Institute for Genomic Research Osa1 Rice Genome Annotation Database1 Qiaoping Yuan Shu Ouyang Aihui Wang Wei Zhu Rama Maiti Haining Lin John Hamilton Brian Haas Razvan Sultana Foo Cheung Jennifer Wortman and C Robin Buell

2

The Institute for Genomic Research Rockville Maryland 20850 1 This work (on rice genome annotation) was supported by the National Science Foundation (grant no DBIndash0321538 to CRB) and the US Department of Agriculture (grant no 2003ndash35317ndash13173 to CRB) 2 Present address Laboratory of Neurogenetics NIAAA NIH 5625 Fishers Lane Suite 3532 MSC 9412 Bethesda MD 20892 wwwplantphysiolorgcgidoi101104pp104059063 Corresponding author e-mail rbuelltigrorg fax 301ndash838ndash0208 Received December 31 2004 returned for revision February 24 2005 accepted March 21 2005 The ab initio gene finders used in the rice EGC pipeline include FGENESH (monocot matrix Salamov and Solovyev 2000 ) GeneMarkhmm (rice matrix Lukashin and Genome Research 15577-582 2005 Closing in on the C elegans ORFeome by cloning TWINSCAN predictions Chaochun Wei Philippe Lamesch Manimozhiyan Arumugam Jennifer Rosenberg Ping Hu Marc Vidal and Michael R Brent

1 2 1 2

1 2 13

1 Laboratory for Computational Genomics and Department of Computer Science and Engineering Washington University St Louis Missouri 63130 USA 2 Center for Cancer Systems Biology Dana-Farber Cancer Institute and Department of Genetics Harvard Medical School Boston Massachusetts 02115 USA

3 Corresponding author E-mail brentcsewustledu fax (314) 935-7302 Finally we compared TWINSCAN with two other gene-prediction systems that have recently been developed for nematodesndashFGENESH (Salamov and Solovyev 2000 Plant Physiology April 2005 Vol 137 pp 1174-1181 UPDATE ON SEQUENCING MEDICAGO TRUNCATULA AND LOTUS JAPONICUS Sequencing the Genespaces of Medicago truncatula and Lotus japonicus1 Nevin D Young Steven B Cannon Shusei Sato Dongjin Kim Douglas R Cook Chris D Town Bruce A Roe and Satoshi Tabata Department of Plant Pathology University of Minnesota St Paul Minnesota 55108 (NDY SBC) Kazusa DNA Research Institute Kisarazu Chiba 292ndash0818 Japan (SS ST) Department of Plant Pathology University of California Davis California 95616 (DRC DK) The Institute for Genomic Research Rockville Maryland 20850 (CDT) and Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma 73019 (BAR) 1 The US component of the Medicago truncatula sequencing effort was initially supported by a grant from the Samuel Roberts Noble Foundation to BAR Current support comes from National Science Foundation Plant Genome Research Program (grant no 0110206 to DRC DK CDT and NDY and grant no 0321460 to NDY BAR and CDT) Funding for Lotus japonicus sequencing comes from the Kazusa DNA Research Institute Foundation wwwplantphysiolorgcgidoi101104pp104057034 Corresponding author e-mail nevinyumnedu fax 612ndash625ndash9728 hellip (These estimates are based on Fgenesh predictions [Salamov and Solovyev 2000 ] using a Mt-trained matrix retaining peptides with a BLASTP match at 10endash4 to the UniProt NREF100 database of peptides [Apweiler et al 2004 ] This estimate for Lj differs from the published value of 1 gene per 101 kb [Asamizu et al 2003a ] due to the use here of the Fgenesh gene-calling algorithm so Mt and Lj could be compared directly)hellip hellipThis estimate increases to 6500 when Lj genes are predicted by the Mt-trained Fgenesh algorithm described earlierhellip International Journal of CancerVolume 109 Issue 1 Pages 71 - 75Cancer Genetics Candidate regions of tumor suppressor locus on chromosome 9q311 in gastric cancer Naoto Kakinuma 1 Kazuyoshi Kohu 1 2 Masaaki Sato 1 Tatsuya Yamada 3 Motowo Nakajima 1 Tetsu Akiyama 2 Susumu Ohwada 3 Yasuhiko Shibanaka 1

1Novartis Pharma Tsukuba Research Institute Ibaraki Japan 2Laboratory of Molecular and Genetic Information Institute for Molecular and Cellular Biosciences University of Tokyo Tokyo Japan 3Second Department of Surgery Gunma University School of Medicine Gunma Japan email Yasuhiko Shibanaka (yasuhikoshibanakapharmanovartiscom) Correspondence to Yasuhiko Shibanaka Novartis Pharma Tsukuba Research Institute Ohkubo 8 Tsukuba-shi Ibaraki 300-2611 Japan Fax +81-29-865-2281

predict the genes between D9S277 and D9S127 in 9q311 the gene prediction tools also in the UCSC Genome Browser having Acembly Ensembl Fgenesh GenScan and Genome Research 1554-66 2005 Gene and alternative splicing annotation with AIR Liliana Florea145 Valentina Di Francesco2 Jason Miller1 Russell Turner1 Alison Yao2 Michael Harris2 Brian Walenz1 Clark Mobarry1 Gennady V Merkulov3 Rosane Charlab3 Ian Dew1 Zuoming Deng3 Sorin Istrail1 Peter Li2 and Granger Sutton1 1 Informatics Research Applied Biosystems Rockville Maryland 20850 USA 2 Advanced Solutions Celera Genomics Rockville Maryland 20850 USA 3 Scientific Content and Applications Celera Genomics Rockville Maryland 20850 USA Ab initio prediction programs such as GenScan (Burge and Karlin 1997 ) FGenesH (Salamov and Solovyev 2000 ) Genie (Kulp et al Nucleic Acids Research 2005 Vol 33 Database issue D399-D402 SilkDB a knowledgebase for silkworm biology and genomics Jing Wang Qingyou Xia Ximiao He Mingtao Dai Jue Ruan Jie Chen Guo Yu Haifeng Yuan Yafeng Hu Ruiqiang Li Tao Feng Chen Ye Cheng Lu Jun Wang Songgang Li Gane Ka-Shu Wong Huanming Yang Jian Wang Zhonghuai Xiang Zeyang Zhou and Jun Yu

1 2 34 3 34 3 3

3 3 3 3 3 2 135

1 36 35 35 2

2 35

1 College of Life Sciences Peking University Beijing 100871 China 2 The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China 3 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 4 Graduate School of the Chinese Academy of Sciences Yuquan Road 19A Beijing 100039 China 5 Key Laboratory of Bioinformatics of Zhejiang Province Hangzhou Genomics Institute James D Watson Institute of Genome Sciences of Zhejiang University Hangzhou 310007 China and 6 Department of Medicine University of Washington Genome Center Seattle WA 98195 USA To whom correspondence should be addressed Tel +86 10 80481455 Fax +86 10 80498676 Email junyugenomicsorgcn Correspondence may also be addressed to Zeyang Zhou Tel +86 23 68251123 Fax +86 23 68251128 Email zyzhouswaucqcn The authors wish it to be known that in their opinion the first three authors should be regarded as joint First Authors BGF is a self-developed ab initio program based on GenScan (9) and FgeneSH (10) and was successfully utilized for our rice genome annotation (11) Clinical Cancer Research Vol 11 4029-4036 June 1 2005 Imaging Diagnosis Prognosis A Molecular Signature in Superficial Bladder Carcinoma Predicts Clinical Outcome Lars Dyrskjoslasht Karsten Zieger Mogens Kruhoslashffer Thomas Thykjaer Jens L Jensen Hanne Primdahl Natasha Aziz Niels Marcussen Klaus Moslashller and Torben F Oslashrntoft

1 12 15 15 4

1 6 3 2 1

Authors Affiliations 1 Molecular Diagnostic Laboratory Department of Clinical Biochemistry 2 Department of Urology and 3 University Institute of Pathology Aarhus University Hospital 4 Departments of Theoretical Statistics

and Mathematical Sciences University of Aarhus 5 Aros Applied Biotechnology Aarhus Denmark and 6 Eos BiotechnologyProtein Design Labs Fremont California Requests for reprints Torben F Oslashrntoft Molecular Diagnostic Laboratory Department of Clinical Biochemistry Aarhus University Hospital Skejby DK-8200 Aarhus N Denmark Phone 45-89495100 Fax 45-89496018 E-mail orntoftkiaudk array comprising 59619 probe sets representing 46000 unique sequences including known genes expressed sequence tag clusters and FGENESH-predicted exons BMC Evolutionary Biology 2005 51 doi1011861471-2148-5-1 Research article The WRKY transcription factor superfamily its origin in eukaryotes and expansion in plants Yuanji Zhang and Liangjiang Wang Address Plant Biology Division The Samuel Roberts Noble Foundation Ardmore OK 73402 USA Email Yuanji Zhang - yjzhangnobleorg Liangjiang Wang - Kevinlwangaolcom Corresponding author Despite minor differences in the gene structure prediction both gene prediction programs FGENESH and GENSCAN agree on the major features of the protein Nature Immunology 2005 v6 n3 The immunoglobulin heavy-chain locus in zebrafish identification and expression of a previously hellip N Danilova J Bussmann K Jekosch LA Steiner - - naturecom Nature Immunology Full text access provided to Googlebot Access by Web Services PLoS Biol 2005 June 3(6) e181 Published online 2005 May 24 doi 101371journalpbio0030181 RAG1 Core and V(D)J Recombination Signal Sequences Were Derived from Transib Transposons Vladimir V Kapitonov 1 and Jerzy Jurka 1

1Genetic Information Research Institute Mountain View California United States of America David Nemazee Academic EditorScripps Research Institute United States of America

Corresponding author Vladimir V Kapitonov vladimirgirinstorg Jerzy Jurka jurkagirinstorg Using FGENESH [33] we detected that the RAG1 corendashlike open reading frame (ORF) in the contig 29068 forms a terminal exon (positions 1154ndash2947) of an Genetics Published Articles Ahead of Print published on February 16 2005 as 101534genetics104036327 Identification and Characterization of Regions of the Rice Genome Associated with Broad- Spectrum Quantitative Disease Resistance Randall J Wisser Qi Sundagger Scot H Hulbertsect Stephen Kresovich and Rebecca J Nelsondaggerdagger 1 Department of Plant Breeding and Genetics Institute for Genomic Diversity Cornell University Ithaca New York 14853 daggerComputational Biology Service Unit Cornell Theory

Center Cornell University Ithaca New York 14853 sectDepartment of Plant Pathology Kansas State University Manhattan Kansas 66506 and daggerdaggerDepartment of Plant Pathology Cornell University Ithaca New York 14853 1Corresponding author Rebecca J Nelson Department of Plant Pathology Cornell University 321 Plant Science Ithaca NY 14853 Email rjn7cornelledu GENSCAN (B URGE and K ARLIN 1997) and FGENESH (S ALAMOV and S OLOVYEV 2001) to predict open reading frames Further searches against Plant Physiology January 2005 Vol 137 pp 176-189 Annotations and Functional Analyses of the Rice WRKY Gene Superfamily Reveal Positive and Negative Regulators of Abscisic Acid Signaling in Aleurone Cells1[w]

Zhen Xie Zhong-Lin Zhang Xiaolu Zou Jie Huang Paul Ruas Daniel Thompson and Qingxi J Shen

2 23

Department of Biological Sciences University of Nevada Las Vegas Nevada 891541 This work was supported by the US Department of Agriculture (grant no 02ndash35301ndash12066) by the National Institutes of Health (Biomedical Research Infrastructure Network seed grant no P20 RR16464) and by the University of Las Vegas Nevada (start-up funds to QJS) XZ was supported by a National Science Foundation Experimental Program to Stimulate Competitive

Research (EPSCoR) Integrative Approaches to Abiotic Stress (EPSndash0132556) graduate assistantship and Z-LZ by a National Science Foundation EPSCoR Advanced Computing in Environmental Sciences postdoctoral fellowship 2 These authors contributed equally to the paper 3 Present address Department of Plant and Microbial Biology University of California Berkeley CA 94720 and Plant Gene Expression Center US Department of Agriculture Albany CA 94710 [w] The online version of this article contains Web-only data First of all three genes (OsWRKY41 -43 and -44) were reannotated using FGENESH (wwwsoftberrycom) because the first introns of these genes were too small PLoS Biol 3(6) e181 (2005) RAG1 Core and V(D)J Recombination Signal Sequences Were Derived from Transib Transposons Vladimir V Kapitonov1 Jerzy Jurka1 1 Genetic Information Research Institute Mountain View California United States of America To whom correspondence should be addressed E-mail vladimirgirinstorg (VVK) E-mail jurkagirinstorg (JJ) Using FGENESH [33] we detected that the RAG1 corendashlike open reading frame (ORF) in the contig 29068 forms a terminal exon (positions 1154ndash2947) of an Plant and Cell Physiology 2005 46(1)3-13 doi101093pcppci503 From Mapping to Sequencing Post-sequencing and Beyond Takuji Sasaki Takashi Matsumoto Baltazar A Antonio and Yoshiaki Nagamura 1

National Institute of Agrobiological Sciences 2-1-2 Kannondai Tsukuba Ibaraki 305-8602 Japan 1 Corresponding author E-mail tsasakiniasaffrcgojp

The gene predictions by programs such as Genescan (Burge and Karlin 1997 ) FGENESH [see Appendix 1 (4)] and Genemark [see Appendix 1 (5)] BLAST (Altschul et Improving the nutritional value of Golden Rice through increased pro-vitamin A content JA Paine CA Shipton S Chaggar RM Howells MJ hellip - Nature Biotechnology 2005 - naturecom Arabidopsis thaliana psy and rice psy (AY024351) genes identified genomic sequences of similarity in which genes were predicted using FGENESH algorithm with Genetics Published Articles Ahead of Print published on January 16 2005 as 101534genetics104035543 THE GENETIC BASIS FOR INFLORESCENCE VARIATION BETWEEN FOXTAIL AND GREEN MILLET (POACEAE) Andrew N Doust Katrien M Devosdagger1 Mike D Gadberry Mike D Galedagger amp Elizabeth A Kellogg University of Missouri-St Louis Department of Biology One University Boulevard St Louis MO 63121 USA daggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH UK Current address University of Georgia-Athens Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building Athens GA 30602 USA 1 Each of these contigs was scanned using FgeneSH (S ALAMOV and S OLOVYEV 2000) and open reading frames (ORFs) were translated and PLoS Biol 3(1) e13 January 2005

Sorghum Genome Sequencing by Methylation Filtration Joseph A Bedell1 Muhammad A Budiman2 Andrew Nunberg1 Robert W Citek1 Dan Robbins1 Joshua Jones2 Elizabeth Flick2 Theresa Rohlfing3 Jason Fries3 Kourtney Bradford3 Jennifer McMenamy3 Michael Smith4 Heather Holeman4 Bruce A Roe5 Graham Wiley5 Ian F Korf6 Pablo D Rabinowicz7 Nathan Lakey8 W Richard McCombie9 Jeffrey A Jeddeloh4 Robert A Martienssen9 1 Bioinformatics Orion Genomics Saint Louis Missouri United States of America 2 Library Construction Orion Genomics Saint Louis Missouri United States of America 3 Sequencing Orion Genomics Saint Louis Missouri United States of America 4 Biomarkers Orion Genomics Saint Louis Missouri United States of America 5 Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma United States of America 6 Genome Center University of California Davis California United States of America 7 The Institute for Genomic Research Rockville Maryland United States of America 8 Business Orion Genomics Saint Louis Missouri United States of America 9 Cold Spring Harbor Laboratory Cold Spring Harbor New York United States of America additional parameters wordmask=seg lcmask M=1 N=ndash1 Q=3 R=3 kap E=1e-10 hspmax=0 To look for potentially novel genes we used Fgenesh (httpwww BMC Genomics 2005 611 doi1011861471-2164-6-11 FAM20 an evolutionarily conserved family of secreted proteins expressed in hematopoietic cells Demet Nalbant1 Hyewon Youn1 3 S Isil Nalbant1 Savitha Sharma1 Everardo Cobos2 3 Elmus G Beale1 Yang Du1 and Simon C Williams1 3 1Department of Cell Biology and Biochemistry Texas Tech University Health Sciences Center

Lubbock Texas 79430 USA 2Department of Internal Medicine Texas Tech University Health Sciences Center Lubbock Texas 79430 USA 3Southwest Cancer Center at University Medical Center Lubbock Texas 79430 USA These results were compared against genes assembled by two gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmit Plant Physiology July 2005 Vol 138 pp 1205-1215 Complex Organization and Evolution of the Tomato Pericentromeric Region at the FER Gene Locus1[w] Romain Guyot Xudong Cheng Yan Su Zhukuan Cheng Edith Schlagenhauf Beat Keller and Hong-Qing Ling

2 2

Institute of Plant Biology University of Zurich 8008 Zurich Switzerland (RG ES BK H-QL) and Institute of Genetics and Developmental Biology Chinese Academy of Sciences Chaoyang District Beijing 100101 China (XC YS ZC H-QL) Putative genes were determined by a combination of coding region prediction software (GENSCAN FGENESH and MZEF with Arabidopsis andor monocot matrix J Gen Virol 86 (2005) 973-983 DOI 101099vir080833-0 Cloning characterization and analysis by RNA interference of various genes of the Chelonus inanitus polydnavirus Marianne Bonvin Dorothee Marti Stefan Wyder Dejan Kojic Marc Annaheim and Beatrice Lanzrein Institute of Cell Biology University of Berne Baltzerstrasse 4 CH-3012 Bern Switzerland Correspondence Beatrice Lanzrein beatricelanzreinizbunibech 12g1forw (5-GAGTCCATGCCGAATGTCAC-3) and 12g1rev (5-CTTCTTGCACAGCGACGAAC-3) were set to amplify the middle region of 12g1 as predicted with FGENESH 10 and The Plant Cell 17343-360 (2005) Evolution of DNA Sequence Nonhomologies among Maize Inbreds Stephan Brunner Kevin Fengler Michele Morgante Scott Tingey and Antoni Rafalski a1 a b a a

a DuPont Crop Genetics Research Wilmington Delaware 19880-353 b Universitaacute degli Studi di Udine Dipartimento di Scienze Agrarie ed Ambientali 33100 Udine Italy 1 To whom correspondence should be addressed E-mail stephanbrunnercgrdupontcom fax 302-695-2726 PNAS | February 1 2005 | vol 102 | no 5 | 1566-1571 Published online before print January 24 2005 101073pnas0409421102 A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome

Mark Yandell Adina M Bailey Sima Misra ShengQiang Shu Colin Wiel Martha Evans-Holm Susan E Celniker and Gerald M Rubin para para Howard Hughes Medical Institute and Department of Molecular and Cell Biology University of California Life Sciences Addition Berkeley CA 94720-3200 and paraDepartment of Genome Sciences Lawrence Berkeley National Laboratory One Cyclotron Road Mailstop 64-121 Berkeley CA 94720 genes based on a microarray-based approach that involved hybridizing randomly primed cDNA against probes corresponding to a large set of FGENESH predictions Insect Molecular Biology Volume 14 Issue 2 Page 113 - 119 April 2005 doi101111j1365-2583200400536x Detection and analysis of alternative splicing in the silkworm by aligning expressed sequence tags with the genomic sequence X-F Zha Q-Y Xia P Zhao J Li J Duan Z-L Wang J-F Qian and Z-H Xiang Correspondence Dr Qing-You Xia The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China Tel +86 23 68250748 fax +86 23 68251128 e-mail xiaqyswaucqcn previously predicted silkworm genes in the genomic sequences by BGF a newly developed program based on GENSCAN (Burge amp Karlin 1997) and Fgenesh (Salamov amp Microbiology 151 (2005) 2199-2207 DOI 101099mic027962-0 Overproduction purification and characterization of FtmPT1 a brevianamide F prenyltransferase from Aspergillus fumigatus Alexander Grundmann and Shu-Ming Li Pharmazeutische Biologie Pharmazeutisches Institut Eberhard-Karls-Universitaumlt Tuumlbingen Auf der Morgenstelle 8 72076 Tuumlbingen Germany Correspondence Shu-Ming Li shumingliuni-tuebingende FGENESH (Softberry Inc httpwwwsoftberrycomberryphtml) and the DNASIS software package (version 21 Hitachi Software Engineering) were used for

FGENESH++ Am J Hum Genet 76652-662 2005 Position Effects Due to Chromosome Breakpoints that Map 900 Kb Upstream and 13 Mb

Downstream of SOX9 in Two Patients with Campomelic Dysplasia

Gopalrao V N Velagaleti12 Gabriel A Bien-Willner3 Jill K Northup1 Lillian H Lockhart2

Judy C Hawkins2 Syed M Jalal6 Marjorie Withers3 James R Lupski345 and

Pawel Stankiewicz3 Departments of 1Pathology and 2Pediatrics University of Texas Medical Branch Galveston

Departments of 3Molecular and Human Genetics and 4Pediatrics Baylor College of Medicine and

5Texas Childrens Hospital Houston and 6Department of Laboratory Medicine and Pathology Mayo Clinic Rochester MN hellipIn an effort to identify possible transcripts that may be responsible for the CD phenotype we

used several gene-prediction programs and identified seven hypothetical transcripts in the region

that spans 100 kb in either direction from the breakpoint on chromosome 17 Ecgenes H17C123061 and H17C123081 SGP genes Chr17_15381 and Ch17_15391 Fgenesh++ gene C17001650 and Genscan genes NT_01064144 and NT_01064145hellip Genome Research 15566-576 2005 ECgene Genome-based EST clustering and gene modeling for alternative splicing Namshin Kim Seokmin Shin and Sanghyuk Lee12 2 13 1 Division of Molecular Life Sciences Ewha Womans University Seoul 120-750 Korea 2 School of Chemistry Seoul National University Seoul 151-747 Korea hellipthe structure of full-length mRNA can be inferred by examining the flanking genomic region especially with the aid of ab initio gene predicting programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh++ (Salamov and Solovyev 2000 )hellip Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA hellip we built a three-way synteny map based on chains of Fgenesh++-predicted (Solovyev 2002 ) exons rather than whole genes hellip Genome Research 14539-548 2004 Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of

Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA hellipThe gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom )hellip Nucleic Acids Research 2003 Vol 31 No 1 207-211 copy 2003 Oxford University Press The PEDANT genome database Dmitrij Frishman Martin Mokrejs Denis Kosykh Gabi Kastenmuumlller Grigory Kolesov Igor Zubrzycki Christian Gruber Birgitta Geier Andreas Kaps Kaj Albermann Andreas Volz Christian Wagner Matthias Fellenberg Klaus Heumann and Hans-Werner Mewes

1 1 1 1 1

1 2 2 2 2

2 2 2 2 13

1 Institute for Bioinformatics GSF - National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 85764 Neueherberg Germany 2 Biomax Informatics AG Lochhamer Straszlige 11 82152 Martinsried Germany 3 Department of Genome-oriented Bioinformatics Wissenschaftszentrum Weihenstephan Technische Universitaumlt Muumlnchen 85350 Freising Germany To whom correspondence should be addressed Tel +49 89 31874201 Fax +49 89 31873585 Email dfrishmangsfde The mouse database contains 20 chromosome contigs with 37 793 genes predicted using the Fgenesh++ software (wwwsoftberrycom) Reprint from Daily Biotech Updates www genengnewscom Vol 22 No 17 October 1 2002 DrugDiscovery Tech NoteAn Enhanced Human-Genome Database Transforming Raw Human Sequence Data Into Useful Information Christine Schuumlller PhD and Andreas Fritz PhD The Softberry analysis results for which Biomax has the exclusive world-wide commercial license contain approximately 40000 genes which agrees well with predictions of the total number of human genes (according to the International Human Genome Sequencing Consortium or IHGSC) hellip For example 50 of the genes in the Biomax Human Genome Database are not found in the Ensembl database These genes (identified by FGENESH++ and Biomax and not found in Ensembl database) comprise the following 6 of genes classified as known genes 50 classified as having some similarity to known genes and 90 of the genes not having similarity to known genes

For human genome applications the FGENESH++ software was first used to map known human genes using sequences available from the Reference Sequence (RefSeq) Project at the Nation al Center for Biotechnology Information (NCBI Bethesda MD wwwncbinlm nihgovLocusLinkrefseqhtml) REFERENCES 1 Salamov AA and Solovyev VVAb initio gene finding in Drosophila genomic DNA Genome Res 10 391ndash7 (2000) Genome Research 14539-548 2004 ISSN 1088-9051 $500 Letter Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA The gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom ) Published online before print June 12 2003 101101gr529803 Genome Research 131765-1774 2003 ISSN 1088-9051 $500 Identification of Promoter Regions in the Human Genome by Using a Retroviral Plasmid Library-Based Functional Reporter Gene Assay Shirin Khambata-Ford15 Yueyi Liu2 Christopher Gleason1 Mark Dickson3 Russ B Altman2 Serafim Batzoglou4 and Richard M Myers136 1 Department of Genetics Stanford University School of Medicine Stanford California 94305 USA 2 Stanford Medical Informatics Stanford University School of Medicine Stanford California 94305 USA 3 Stanford Human Genome Center Stanford University School of Medicine Stanford California 94305 USA 4 Department of Computer Science Stanford University Stanford California 94305 USA hellipOf 858 sequences 9 of GFP+ low clones and 8 of GFP+ high clones aligned to the 2-kb

segment upstream of the transcription start site of a predicted gene in at least two of four data sets of predicted genes from Genscan Ensembl Softberry (Fgenesh++) and Acembly (category

B in Table 1) Cell Vol 110 521ndash529 August 23 2002 Copyright 2002 by Cell Press HIV-1 Integration in the Human Genome Favors Active Genes and Local Hotspots

Astrid RW Schrouml der1 Paul Shinn2 Huaming Chen2 Charles Berry3 Joseph R Ecker2 and Frederic Bushman14 1Infectious Disease Laboratory 2Genomic Analysis Laboratory The Salk Institute 10010 North Torrey Pines Road La Jolla California 92037 3Department of FamilyPreventive Medicine School of Medicine University of California San Diego San Diego California 92093 hellipAn integration target sequence was scored as a part of a transcrip-tion unit if it was (1) a member of the Refseq set of well-studied genes (httpwwwncbinlmnihgovLocusLinkrefseqhtml) or (2) if it was predicted to be a transcription unit by the ENSEMBLE (httpwwwensemblorg) or Fgenesh++ (httpwwwsoftberrycomHelpfgeneshplus2htm) programs and if that assignment was supported by mRNA or spliced EST sequence evidence Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation

FGENESH+ Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA Fgenesh+ gene prediction is conducted on sequences with protein homology Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation Annual Review of Genomics and Human Genetics Vol 3 293-310 (Volume publication date September 2002) (doi101146annurevgenom3030502101529) DATABASES AND TOOLS FOR BROWSING GENOMES Ewan Birney 1 Michele Clamp and 2 Tim Hubbard2 1European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom e-mail birneyebiacuk 2Wellcome Trust Sanger Institute Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom michelesangeracuk thsangeracuk

Another predicted gene track on the UCSC browser comes from Softberry ( http wwwsoftberrycom ) and uses a program Fgenesh+ which is based on HMMs and

FGENESB Appl Environ Microbiol 2004 April 70(4) 2332ndash2341 Oxygen-Controlled Bacterial Growth in the Sponge Suberites domuncula toward a Molecular Understanding of the Symbiotic Relationships between Sponge and Bacteriadagger

Werner E G Muumlller Vladislav A Grebenjuk Narsinh L Thakur Archana N Thakur Renato Batel Anatoli Krasko Isabel M Muumlller and Hans J Breter Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz D-55099 Mainz Germany Corresponding author Mailing address Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz Duesbergweg 6 55099 Mainz Germany Phone 6131-3925910 Fax 6131-3925243 E-mail wmuellermailuni-mainzde For genes and potential promoter prediction we used the FGENESB-PatternMarkov chain-based bacterial operon and gene prediction program from the SoftBerry Journal of Theoretical Biology 230 (2004) 133ndash144 Computational prediction of conserved operons and phylogenetic footprinting of transcription regulatory elements in the metal-reducing bacterial family Geobacteraceae Bin Yana Barbara A Metheґ b Derek R Lovleyc Julia Krushkala aDepartment of Preventive Medicine Center of Genomics and Bioinformatics University of Tennesee Health Science Center 66 N Pauline St Ste 633 Memphis TN 38163 USA bThe Institute for Genomic Research Rockville MD USA cDepartment of Microbiology Morrill Science Center IV North University of Massachusetts 639 North Pleasant Str Amherst MA 01003 USA the conserved nature of the operons 2 Operons in Geobacter sulfurreducens were predicted ab initio by the public version of program FGENESB (V Solovyev and V Molecular Microbiology Volume 52 Issue 6 Page 1579 -1596 June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Copyright copy 2003 The National Academy of Sciences Proc Natl Acad Sci U S A 2003 October 28 100(22) 12830ndash12835 doi 101073pnas2133554100 Published online 2003 October 17 Evolution Proteorhodopsin genes are distributed among divergent marine bacterial taxa Joseacute R de la TorredaggerDagger Lynne M Christiansondagger Oded Beacutejagravedaggersect Marcelino T Suzukidaggerpara David M Karl John Heidelberg and Edward F DeLongdaggerdaggerdagger

daggerMonterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 sectDepartment of Biology Technion-Israel Institute of Technology Haifa 32000 Israel

paraChesapeake Biological Laboratory University of Maryland Solomons MD 20688 Department of Oceanography University of Hawaii Manoa HI 96822 and Institute for Genomic Research Rockville MD 20850 Edited by Sallie W Chisholm Massachusetts Institute of Technology Cambridge MA and approved August 21 2003 (received for review 2003 June 10) DaggerPresent address Department of Civil and Environmental Engineering University of Washington Seattle WA 98195 daggerdagger To whom correspondence should be addressed E-mail delongmbariorg hellip Analysis of the potential genes and protein-coding regions was performed by using a combination of the BLAST (11) GLIMMER 202 (TIGR) (12 13) FGENESB (Softberry Mount Kisco NY) and ARTEMIS (Sanger Center Cambridge University UK) (14) software packages Environmental Microbiology September 2004 vol 6 no 9 pp 903-910(8) DOI 101111j1462-2920200400676x Different SAR86 subgroups harbour divergent proteorhodopsins Gazalah Sabehi1 Oded Beacutejagrave1 Marcelino T Suzuki2 Christina M Preston3 Edward F DeLong4

Affiliations 1 Department of Biology Technion-Israel Institute of Technology Haifa 32000 Israel 2 Chesapeake Biological Laboratory University of Maryland Center for Environmental Sciences Solomons MD 20688 USA 3 Monterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 USA 4 Massachusetts Institute of Technology Cambridge MA 02139 USA

program FGENESB (Softberry) and the annotation was subsequently refined and curated manually using ARTEMIS (Sanger Center) Fig

FGENESV Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song Qi Wei Qin Jin Qiu Can Hua Huang Fan Wang and Choy Leong Hew1 2 1 1 1 1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2

A total of 162 ORFs predicted by the FGENESV program (available through httpwwwsoftberrycom) supplemented with Vector NTI suite 71 are indicated doi101023BVIRU000002577148128f8 Virus Genes 28 (3) 239-246 April 2004 Article ID 5269250 Complete Nucleotide Sequence of a Strawberry Isolate of Beet Pseudoyellows Virus Ioannis E Tzanetakis Molecular and Cellular Biology Program Department of Botany and Plant Pathology Oregon State University Corvallis 97331 USA Robert R Martin Horticultural Crops Research Laboratory USDA-ARS Corvallis OR 97330 USA E-mail martinrrscienceoregonstateedu httpwwwncbinlmnih govgorfgorfhtml) and the gene finder in viruses at httpwwwsoftberrycom The amino acid comparisons Geno Prot amp Bioinfo Vol 1 No 3 August 2003 226-235 Genome Organization of the SARS-CoV Jing Xu1 Jianfei Hu21 Jing Wang21 Yujun Han1 Yongwu Hu13 Jie Wen1 Yan Li1 Jia Ji1 Jia Ye14 Zizhang Zhang5 Wei Wei4 Songgang Li12 Jun Wang1 Jian Wang14 Jun Yu14 and Huanming Yang14 1 Beijing Genomics Institute Chinese Academy of Sciences Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3Wenzhou Medical College Wenzhou 325003 China 4 James D Watson Institute of Genome Sciences Zhijiang Campus Zhejiang University and Hangzhou Genomics Institute Hangzhou 310008 China 5 College of Materials Science and Chemical Engineering Yuquan Campus Zhejiang University Hangzhou 310027 China These authors contributed equally to this work Corresponding authors E-mail junyugenomicsorgcn yanghmgenomicsorgcn hellipFGENESV a program for gene prediction provided by Softberry Inc (Mount Kisco USA) through a web-based interface has been specially modimacred and trained with parameters for virus (httpwwwsoftberrycomberryphtmltopic= gfindv) hellip

The hypothetical minus sense ORF iden-timacred by FGENESV (from 48 to 203 nt on the minus strand or 29523 to 29678 nt on the plus strand) may be fake but we should not absolutely deny the prob-ability of the existence of minus ORFs hellipFurthermore we employed FGENESV to explore the sequences of MHV (NC 001846 in NCBI) and AIBV (NC 001451 in NCBI) and compared the re-sults with their previous annotations respectively Rapport de stage de DEA Juin 2003 Analyse du geacutenome du virus de lrsquoarcheacutee Pyrococcus abyssi (PAV1) ROUAULT Karen Laboratoire de Microbiologie et Biotechnologie des Extrecircmophiles IFREMER- Centre de Brest et Equipe Microbiologie LEMAR ndash Institut Universitaire Europeacuteen de la Mer [14] FGENESV httpwwwsoftberrycomberry phtmltopic=gfindv Virus ( gt10 kb) Modegraveles de Markov Forme du geacutenome Code geacuteneacutetique [40]

FGENES-M BMC Bioinformatics 2005 6 25 Published online 2005 February 10 doi 1011861471-2105-6-25 Integrating alternative splicing detection into gene prediction Sylvain Foissac 1 and Thomas Schiex11Uniteacute de Biomeacutetrie et Intelligence Artificielle INRA 31326 Castanet Tolosan France

Corresponding author Sylvain Foissac foissactoulouseinrafr Thomas Schiex tschiextoulouseinrafrReceived July 27 2004 Accepted February 10 2005 This approach has been applied eg in HMMgene or in FGENES-M (unpub)hellip DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39 - 43 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom) DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39-43 DOI 1010801042517032000160189 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom)

BESTORF Eukaryotic Cell March 2005 p 526-535 Vol 4 No 3 1535-977805$0800+0 doi101128EC43526-5352005 Sex-Specific Homeodomain Proteins Sxi1 and Sxi2a Coordinately Regulate Sexual Development in Cryptococcus neoformans Christina M Hull1 Marie-Josee Boily and Joseph Heitman1 12 Department of Molecular Genetics and Microbiology1 the Howard Hughes Medical Institute Duke University Medical Center Durham North Carolina2 Received 14 January 2005 Accepted 14 January 2005 Corresponding author Mailing address 322 CARL Building Box 3546 Department of Molecular Genetics and Microbiology 100 Research Dr Duke University Medical Center Durham NC 27710 Phone (919) 684-2824 Fax (919) 684-5458 E-mail heitm001dukeeduPresent address Departments of Biomolecular Chemistry and Medical Microbiology amp

Immunology University of WisconsinmdashMadison Madison WI 53706 Sequence manipulations Splice predictions of candidate gene sequences for SXI2a were facilitated with a Softberry algorithm (wwwsoftberrycom) hellipWe utilized the BESTORF gene prediction algorithm from Softberry Inc to electronically produce predicted spliced cDNA products encoded by a 10-kb regionhellip

PROTCOMP Genes and Immunity 2005 v5 n4 - naturecom Immune response in silico(IRIS) immune-specific genes identified from a compendium of microarray hellip AR Abbas D Baldwin Y Ma W Ouyang A Gurney F hellip The Protcomp algorithm (Softberry Inc) predicts for the 1589 IRIS genes with ORFs that 24 of the encoded proteins are in the plasma membrane 13 are MPMI Vol 17 No 7 2004 pp 789ndash797 Publication no M-2004-0426-01R copy 2004 The American Phytopathological Society Lotus japonicus LjKUP Is Induced Late During Nodule Development and Encodes a Potassium Transporter of the Plasma Membrane Guilhem Desbrosses Claudia Kopka Thomas Ott and Michael K Udvardi Max Planck Institute of Molecular Plant Physiology Am Muumlhlenberg 1 14476 Golm Germany Submitted 3 November 2003 Accepted 13 February 2004 hellipBoth PSORT and Protcomp predicted a PM location for LjKUPhellip Planta DOI 101007s00425-003-1182-5 Issue Volume 218 Number 6 Date April 2004 Pages 965 - 975 Biochemical and immunological characterization of pea nuclear intermediate filament proteins Sonal S D Blumenthal1 Gregory B Clark1 and Stanley J Roux1

(1) School of Biological Sciences Section of Molecular Cell and Developmental Biology The University of Texas Austin TX 78712 USA Stanley J Roux Email srouxutsccutexasedu html) BCM Search Launcher (Protein structure prediction http searchlauncher bcmtmcedu) SoftBerry (Protein subcellular localization Comparative and Functional Genomics Volume 5 Issue 4 Pages 342 - 353Published Online 20 May 2004 Research Paper Investigation into the use of C- and N-terminal GFP fusion proteins for subcellular localization studies using reverse transfection microarrays Ella Palmer Tom Freeman

MRC Rosalind Franklin Centre for Genomics Research (formerly the HGMP-Resource Centre) Genome Campus Hinxton Cambridge CB10 1SB UK email Tom Freeman (tfreemanrfcgrmrcacuk) Correspondence to Tom Freeman RFCGR Hinxton CambridgeCB10 1SB UK

ProtComp version 4 (Softberry) combines results with proteins of known subcellular localization and assumed subcellular localization (based on theoret- ical Plant Physiol2004 134 286-295 RHM2 Is Involved in Mucilage Pectin Synthesis and Is Required for the Development Usadel et al Tentative subcellular localization prediction by TargetP (Emanuelsson et al 2000 ) or ProtComp (httpwwwsoftberrycom) a prediction software trained on Journal of Cellular BiochemistryVolume 90 Issue 2 Pages 361 - 378Published Online 3 Sep 2003 A proteomic study of the arabidopsis nuclear matrix Tomasz T Calikowski 1 3 Tea Meulia 2 Iris Meier 1

1Department of Plant Biology and Plant Biotechnology Center Ohio State University Columbus Ohio 43210 2Molecular and Cellular Imaging Center Ohio Agricultural and Research Development Center Ohio State University Columbus Ohio 43210 3Institute of Biochemistry and Biophysics Polish Academy of Sciences UL Pawinskiego 5A 02-106 Warszawa Poland email Iris Meier (meier56osuedu)Correspondence to Iris Meier Department of Plant Biology and Plant Biotechnology Center Ohio State University 244 Rightmire Hall 1060 Carmack Rd Columbus OH 43210 For prediction of subcellular localization ProtComp 4 (Softberry Inc Mount Kisco NY httpwwwsoftberrycomberryphtmltopicfrac14 proteinloc) PSORT v64 Cellular Molecular Life Sciences 2003 in press Automatic prediction of protein function Burkhard Rost 1 2 3 Jinfeng Liu 1 3 4 Rajesh Nair 1 5 Kazimierz O Wrzeszczynski 1 and Yanay Ofran 16

1 CUBIC Dept of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 2 Columbia University Center for Computational Biology and Bioinformatics (C2B2) Russ Berrie Pavilion 1150 St Nicholas Avenue New York NY 10032 USA

3 North East Structural Genomics Consortium (NESG) Department of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 4 Dept of Pharmacology Columbia Univ 630 West 168th Street New York NY 10032 USA 5 Dept of Physics Columbia Univ 538 West 120th Street New York NY 10027 USA 6 Dept of Medical Informatics Columbia Univ 630 West 168th Street New York NY 10032 USA Corresponding author cubiccubicbioccolumbiaedu URL httpcubicbioccolumbiaedu Tel +1-212-305-4018 fax +1-212-305-7932

genomelocalize ProtComp predict localization for plants httpwwwsoftberrycomberryphtmltopic=proteinloc Predotar predict Published online before print September 15 2003 101101gr1293003 Genome Research 132265-2270 2003 The Secreted Protein Discovery Initiative (SPDI) a Large-Scale Effort to Identify Novel Human Secreted and Transmembrane Proteins A Bioinformatics Assessment Hilary F Clark1 Austin L Gurney Evangeline Abaya Kevin Baker Daryl Baldwin Jennifer Brush Jian Chen Bernard Chow Clarissa Chui Craig Crowley Bridget Currell Bethanne Deuel Patrick Dowd Dan Eaton Jessica Foster Christopher Grimaldi Qimin Gu Philip E Hass Sherry Heldens Arthur Huang Hok Seon Kim Laura Klimowski Yisheng Jin Stephanie Johnson James Lee Lhney Lewis Dongzhou Liao Melanie Mark Edward Robbie Celina Sanchez Jill Schoenfeld Somasekar Seshagiri Laura Simmons Jennifer Singh Victoria Smith Jeremy Stinson Alicia Vagts Richard Vandlen Colin Watanabe David Wieand Kathryn Woods Ming-Hong Xie Daniel Yansura Sothy Yi Guoying Yu Jean Yuan Min Zhang Zemin Zhang Audrey Goddard William I Wood and Paul Godowski Departments of Bioinformatics Molecular Biology and Protein Chemistry Genentech Inc South San Francisco California 94080 USA 1 Corresponding author E-MAIL hclarkgenecomFAX (650) 225-5389 An automated computational strategy was utilized to query each protein translation with the Signal Sensor Sighmm Tmdetect (T Wu unpubl) hmmpfam (Eddy 1998 ) and Protcomp (Softberry Inc) algorithms hellipThe Protcomp algorithm predicts the subcellular localization of a protein on the basis of homology to well-annotated proteins a neural net and various protein motifs In this case the Protcomp subcellular localization prediction was used to categorize these genes as Other Secreted Other Transmembrane or Other Cytoplasmic or Nuclear Plant Physiol February 2002 Vol 128 pp 336-340 Gene prediction in eukaryota Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 genomelocalize ProtComp predict localization for plants httpwww softberrycomberryphtmltopic=proteinloc Predotar predict Proc Natl Acad Sci U S A 2001 April 24 98(9) 5341ndash5346 doi 101073pnas101534498 Published online 2001 April 17 Plant Biology The Cia5 gene controls formation of the carbon concentrating mechanism in Chlamydomonas reinhardtii Youbin Xiang Jun Zhang and Donald P Weeks

Department of Biochemistry and School of Biological Sciences University of Nebraska Lincoln NE 68588-0664

Edited by Bob B Buchanan University of California Berkeley CA and approved March 14 2001 (received for review 2000 November 8) To whom reprint requests should be addressed E-mail dweeks1unledu hellipComputer-assisted analysis of the CIA5 aa sequence (PROTCOMP version 4 httpwwwsoftberrycom) predicted a nuclear localization of the protein hellipFinally computer program predictions (eg PROTCOMP version 4 httpwwwsoftberrycom) for a nuclear localization of CIA5 and the clear-cut nuclear localization of CIA5 in onion epidermal cells (Fig 3) provide additional weight to the argument that CIA5 may be a transcription factor Dissertation zur Erlangung des akademischen Grades Dr rer nat der Fakultaumlt der Naturwissenschaften der Universitaumlt Ulm Untersuchungen zur Identifizierung von Faktoren und Mechanismen der mRNA 3 Prozessierung und Degradation in Chloroplasten houmlherer Pflanzen vorgelegt von Michael Walter aus Immenstadt i Allgaumlu Abteilung Molekulare Botanik Universitaumlt Ulm Ulm November 2001 Tag der Promotion 19 Feb 2002 Algorithmen zur Vorhersage der subzellulaumlren Lokalisation - PSORT httppsortnibbacjp8800formhtml (Nakai und Kanehisa 1992) - ChloroP httpwwwcbsdtudkservicesChloroP (Emanuelsson et al 2000) - TargetP httpwwwcbsdtudkservicesTargetP (Emanuelsson et al 2000) - Predotar httpwwwinrafrInternetProduitsPredotar - Softberry httpwwwsoftberrycom

BPROM Extremophiles Issue Volume 9 Number 2 Date April 2005 Pages 99 ndash 109 DOI 101007s00792-004-0425-0

The genome of BCJA1c a bacteriophage active against the alkaliphilic bacterium Bacillus clarkii

Andrew M Kropinski1 Melissa Hayward1 M Dorothy Agnew1 and Ken F Jarrell1

(1) Department of Microbiology and Immunology Queens University Kingston ON K7L 3N6 Canada

al 2002) Promoters were predicted using Softberryrsquos BPROM program at httpwwwsoftberry comberry phtmltopic=promoter

Journal of Bacteriology February 2005 p 1091-1104 Vol 187 No 3 0021-919305$0800+0 doi101128JB18731091-11042005

The Generalized Transducing Salmonella Bacteriophage ES18 Complete Genome Sequence and DNA Packaging Strategy

Sherwood R Casjens12 Eddie B Gilcrease1 Danella A Winn-Stapley1 Petra Schicklmaier3 Horst Schmieger3 Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24 Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah1 Department of Biological Sciences4 Pittsburgh Bacteriophage Institute University of Pittsburgh Pittsburgh Pennsylvania 2 Institut fuumlr Genetik und Mikrobiologie Universitaumlt Muumlnchen Munich Germany3 Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu Present address Biology Department MIT Cambridge MA 02139 Present address Biogen Idec GmbH D-85737 Ismaning Germany

Received 1 September 2004 Accepted 3 November 2004

The DNA sequence analysis software used was DNA Strider (24) GeneMark (5) Staden programs (78) BLAST (2) BPROM (httpwwwsoftberrycomberryphtmltopic

Infection and Immunity May 2005 p 2899-2909 Vol 73 No 5 Characterization of the Major Secreted Zinc Metalloprotease- Dependent GlycerophospholipidCholesterol Acyltransferase PlaC of Legionella pneumophila Sangeeta Banerji1 Mayte Bewersdorff1 Bjoumlrn Hermes1 Nicholas P Cianciotto2 and Antje Flieger1 Robert Koch-Institut Berlin Germany1 Department of Microbiology-Immunology Northwestern University Medical School Chicago Illinois2 Received 25 October 2004 Returned for modification 18 November 2004 Accepted 22 December 2004

Corresponding author Mailing address Robert Koch-Institut Research Group NG5 Pathogenesis of Legionella Infections Nordufer 20 D-13353 Berlin Germany Phone 49-30-4547-2522 Fax 49-30-4547-2328 E-mail fliegerarkide MB and BH contributed equally to this work

legion) (12) Nucleotide sequences were also analyzed for promoters using the web-based program BPROM (wwwsoftberrycom) Sequence Journal of Bacteriology April 2005 p 2458-2468 Vol 187 No 7 The Type III-Dependent Hrp Pilus Is Required for Productive Interaction of Xanthomonas campestris pv vesicatoria with Pepper Host Plants Ernst Weber1 Tuula Ojanen-Reuhs2 Elisabeth Huguet3 Gerd Hause4 Martin Romantschuk2 Timo K Korhonen2 Ulla Bonas13 and Ralf Koebnik1 Institute of Genetics1 Biozentrum Martin Luther University Halle Germany4 General Microbiology Faculty of Biosciences University of Helsinki Helsinki Finland2 Institut des Sciences Veacutegeacutetales CNRS Gif-sur-Yvette France3 Received 10 November 2004 Accepted 28 December 2004 Corresponding author Mailing address Martin-Luther-Universitaumlt Institut fuumlr Genetik Weinbergweg 10 D-06120 Halle (Saale) Germany Phone 49 345 5526293 Fax 49 345 5527151 E-mail koebnikgmxde Present address Purdue University Department of Food Sciences West Lafayette IN 47907 Present address Institut de Recherche sur la Biologie de lInsecte UMR CNRS 6035 Faculteacute

des Sciences F-37200 Tours France Present address University of Helsinki Department of Ecological and Environmental Sciences

FIN-15140 Lahti Finland

The promoter recognition program BPROM (Softberry Inc Mt Kisco NY) was used for prediction of bacterial sigma70 promoter motifs RESULTS

J Bacteriol 2004 September 186(17) 5945ndash5949 doi 101128JB186175945-59492004 Identification of Operators and Promoters That Control SXT Conjugative Transfer John W Beaber and Matthew K Waldor

Department of Microbiology Tufts University School of Medicine and Howard Hughes Medical Institute Boston Massachusetts Corresponding author Mailing address Tufts University School of Medicine 136 Harrison Ave Jaharis 425 Boston MA 02111 Phone (617) 636-2730 Fax (617) 636-2723 E-mail matthewwaldortuftsedu Received April 1 2004 Accepted May 24 2004 hellipComputer algorithms and 5prime random amplification of cDNA ends (RACE) were used to define the setR and s086 transcription start sites Software for the identification of bacterial promoters (httpwwwsoftberrycomberryphtmltopic=bpromampgroup=programsampsubgroup=gfindb) identified putative minus10 and minus35 elements for both PL and PR (Fig 2) (23 24)hellip JOURNAL OF BACTERIOLOGY Mar 2004 p 1818ndash1832 Vol 186 No 6

The pKO2 Linear Plasmid Prophage of Klebsiella oxytoca Sherwood R Casjens12 Eddie B Gilcrease1 Wai Mun Huang1 Kim L Bunny3

Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24

Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah 841321 Pittsburgh Bacteriophage Institute2 and Department of Biological Sciences4 University of Pittsburgh Pittsburgh Pennsylvania 15260 and Section of Microbiology University of California at Davis Davis California 956163

Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu hellipThe DNA sequence analysis software packages used were DNA Strider (27) GeneMark (8) the Staden programs (94) BLAST (3) BPROM httpwwwsoftberrycomberryphtmltopic_gfindb) and DNA Master (J Lawrence [httpcobamide2biopittedu])hellip

BMC Microbiology 2004 44 Analysis of the lambdoid prophage element e14 in the E coli K-12 genome Preeti Mehta1 Sherwood Casjens2 and Sankaran Krishnaswamy1 Address 1Bioinformatics Centre School of Biotechnology Madurai Kamaraj University Madurai-625021 India and 2University of Utah Medical School Department of Pathology 90 North 1900 East Salt Lake City UT 84132-2501 USA Email Preeti Mehta - mehta_p74yahoocom Sherwood Casjens - sherwoodcasjenspathutahedu Sankaran Krishnaswamy - krishnamrnatnnicin Corresponding author This article is available from httpwwwbiomedcentralcom1471-218044 Putative promoters predicted using BPROM available at the website http wwwsoftberrycom Scores are as given by BPROM Promoters Plant Molecular Biology 53 (6) 865-876 December 2003 Prokaryotic orthologues of mitochondrial alternative oxidase and plastid terminal oxidase Allison E McDonald Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Sasan Amirsadeghi Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Greg C Vanlerberghe Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada (e-mail gregvutscutorontoca The A variabilis PTOX sequence was analyzed in the upstream region of the start codon with Softberryrsquos BPROM software (httpwwwsoftberrycom)

SPLICEDB Plant Molecular Biology DOI 101007s11103-005-0271-1 Issue Volume 57 Number 3 Date February 2005 Pages 445 - 460 Evaluation of five ab initio gene prediction programs for the discovery of maize genes Hong Yao1 4 Ling Guo1 6 Yan Fu1 4 Lisa A Borsuk1 6 Tsui-Jung Wen2 David S Skibbe1 5 Xiangqin Cui1 4 9 Brian E Scheffler8 Jun Cao1 4 Scott J Emrich6 Daniel A Ashlock3 6 and Patrick S Schnable1 2 4 5 6 7

(1)Department of Genetics Development and Cell Biology Iowa State University Ames Iowa 50011-3650 (2) Department of Agronomy Iowa State University Ames Iowa 50011-3650 (3) Department of Mathematics Iowa State University Ames Iowa 50011-3650 (4) Inderdepartmental Graduate Programs in Genetics Iowa State University Ames Iowa 50011-3650 (5) Department of Molecular Cellular and Developmental Biology Iowa State University Ames Iowa 50011-3650 (6) Department of Electrical and Computer Engineering and Department of Bioinformatics and Computational Biology Iowa State University Ames Iowa 50011-3650 (7) Center for Plant Genomics Iowa State University Ames Iowa 50011-3650 (8) Mid South Area Genomics Facility USDA-ARS Stoneville MS 38776-0038 USA(9) Present address Department of Biostatistics Birmingham AL 35294 USA model FGENESH httpwwwsoftberrycom berryphtmltopic=fgeneshampgroup= programsampsubgroup=gfind Monocots Yes Yes Yes GHMM a GeneMark Finding short DNA motifs using permuted markov models X Zhao H Huang TP Speed The data are human donor sequences from SpliceDB [9] a recently developed database of known mammalian splice site sequences (httpwwwsoftberrycomspldb Current Opinion in Structural Biology 2004 14273ndash282 The evolving roles of alternative splicing Liana F Lareau1 Richard E Green1 Rajiv S Bhatnagar23 and Steven E Brenner12_ Departments of 1Molecular and Cell Biology and 2Plant and Microbial Biology University of California Berkeley California 94720 USA 3Department of Dermatology University of California San Francisco California 94143 USA _e-mail brennercompbioberkeleyedu [79] SpliceDB httpwwwsoftberrycomberryphtmltopicfrac14splicedb Database and composition statistics for mammalian splice sites inferred from ESTs [80] Yearbook of Medical Informatics Review Paper 2004 121-136 Curated databases and their role in clinical bioinformatics CC Englbrecht M Han MT Mader A Osanger KFX Mayer MIPS Institute for Bioinformatics Address of the authors Claudia C Englbrecht Michael Han

Michael T Mader Andreas Osanger Klaus F X Mayer MIPS Institute for Bioinformatics GSF - National Research Center for Environment and Health 85758 Neuherberg Germany E-mail kmayergsfdeCorresponding author hellipSpliceDB httpwwwsoftberrycomspldbSpliceDBhtmlCanonical and non-canonical mammalian splice sites [122] 122Burset M Seledtsov IA Solovyev VV SpliceDB database of canonical and non-canonical mammalian splice sites Nucleic Acids Res 200129255-9 Nucleic Acids Research 2001 Vol 29 No 1 255-259 SpliceDB database of canonical and non-canonical mammalian splice sites M Burset I A Seledtsov1 and V V Solovyev The Sanger Centre Hinxton Cambridge CB10 1SA UK and 1Softberry Inc 108 Corporate Park Drive Suite 120 White Plains NY 10604 USA To whom correspondence should be addressed at present address EOS Biotechnology 225A Gateway Boulevard South San Francisco CA 94080 USA Tel +1 650 246 2331 Fax +1 650 583 3881 Email solovyeveosbiotechcom Present address M Burset Institut Municipal drsquoInvestigacioacute Megravedica (IMIM) CDr Aiguader 80 08003 Barcelona Spain

SCAN2 African Journal of Biotechnology Vol 2 (12) pp 714-718 December 2003 Available online at httpwwwacademicjournalsorgAJB ISSN 1684ndash5315 copy 2003 Academic Journal Accepted 14 November 2003 Minireview Web-based bioinformatic resources for protein and nucleic acids sequence alignment Kamel A Abd-Elsalam Molecular Markers Lab Plant Pathology Research Institute Agricultural Research Center Orman 12619 Giza Egypt E-mail kaabdelsalammsncom 16-SCAN2 program for aligning two multimegabyte-size sequences httpwwwsoftberrycomberryphtmltopic=scanhampprg= SCAN2 derived Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK The full-length sequences of gene pairs have been aligned by the SCAN2 program (httpsoftberrycomberryphtmltopic=scanhampprg=SCAN2) which can align

PDISORDER BMC Bioinformatics 2005 622 doi1011861471-2105-6-22 Research article Open Access Proteins with two SUMO-like domains in chromatin-associated complexes The RENi (Rad60-Esc2-NIP45) family Maria Novatchkova1 Andreas Bachmair3 Birgit Eisenhaber2 and Frank Eisenhaber2 Address 1Gregor Mendel-Institut GMI Austrian Academy of Sciences Vienna Biocenter A-1030 Vienna Austria 2Research Institute of Molecular Pathology Dr Bohr-Gasse 7 A-1030 Vienna Austria and 3Max Planck Institute for Plant Breeding Research Carl-von-Linneacute-Weg 10 D-50829 Cologne Germany Email Maria Novatchkova - marianovatchkovagmioeawacat Andreas Bachmair - bachmairmpiz-koelnmpgde Birgit Eisenhaber - b_eisenimpunivieacat Frank Eisenhaber - FrankEisenhaberimpunivieacat Corresponding author hellipInitial analysis of its sequence complexity shows that the disordered N-terminal half of the protein is followed by a likely globular segment (predicted using Pdisorder by Softberry Inc)hellip

SPL The National Academy of Sciences Proc Natl Acad Sci U S A 2003 November 25 100(Suppl 2) 14537ndash14542 doi 101073pnas2335847100 Molecular evolution of the insect chemoreceptor gene superfamily in Drosophila melanogaster Hugh M Robertsondagger Coral G WarrDaggersect and John R Carlsonsect Department of Entomology University of Illinois 505 South Goodwin Avenue Urbana IL 61801 DaggerSchool of Biological Sciences Monash University Clayton VIC 3800 Australia and sectDepartment of Molecular Cellular and Developmental Biology Yale University New Haven CT 06520 The genes were reconstructed manually in the PAUP editor (23) by using the expected exonintron structures as guides and the SPL program (Softberry wwwsoftberrycomberryphtml) to locate predicted introns

NSITE GENETIC ANALYSES OF BOVINE CARD15 A PUTATIVE DISEASE RESISTANCE GENE A Dissertation by KRISTEN HAWKINS TAYLOR Submitted to Texas AampM University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 2004 Major Subject Genetics hellipShort motifs identified as being conserved between the three species in these intronic regions as well as in the 5rsquoUTR and 3rsquoUTR were then analyzed using the TFSCAN (httpzeonwelloxacukgit-bintfscan) and NSITE (available through SoftBerry httpwwwsoftberrycomberryphtmltopic=promoter) programs to identify putative regulatory motifs Motifs selected for analysis required homology consisting of 6 or more bases with no more than 2 substitutions among the 3 specieshellip hellipSequence that included the SNPs located within intronic regions and in the 3rsquo and 5rsquoUTRs were analyzed using NSITE (available through SoftBerry at httpwwwsoftberrycomberryphtmltopic=promoter) to identify putative regulatory motifshellip

TSSP Bioinformatics 2005 21(14)3074-3081 doi101093bioinformaticsbti490 Cis-regulatory element based targeted gene finding genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana Weixiong Zhang 12 Jianhua Ruan 1 Tuan-hua David Ho 3 Youngsook You 3 Taotao Yu 1 and Ralph S Quatrano 3 1Department of Computer Science and Engineering Washington University in Saint Louis Saint Louis MO 63130 USA 2Department of Genetics Washington University in Saint Louis Saint Louis MO 63130 USA 3Department of Biology Washington University in Saint Louis Saint Louis MO 63130 USA To whom correspondence should be addressed

sites (TSSs) To predict TSSs we combined an Athaliana cDNA database and a software TSSP (SoftBerry httpwwwsoftberrycom) As

PLANTPROM BMC Genomics 2005 6 25 Genome wide analysis of Arabidopsis core promoters Carlos Molina12 and Erich Grotewold 11Department of Plant Cellular and Molecular Biology and Plant Biotechnology Center The Ohio State University Columbus OH 43210 2Departamento de Informaacutetica Universidad Teacutecnica Federico Santa Mariacutea Valparaiacuteso Chile search for TATA elements is carried out on the 12749 [-500 -1] regions 6316 sequences (using the MEME NFM) or 8776 (using the expanded PlantProm NFM) are Nucleic Acids Research 2005 Vol 33 No 3 1069ndash1076 doi101093nargki247 Plant promoter prediction with confidence estimation I A Shahmuradov1 V V Solovyev12 and A J Gammerman1 1Royal Holloway University of London Egham Surrey TW20 0EX UK and 2Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA MATERIALS AND METHODS Training and testing sequences For training and testing procedures we used 301 promoters with annotated TSS from PlantProm DB (22) Nucleic Acids Research 2004 Vol 32 Database issue D368plusmnD372 DOI 101093nargkh017 AthaMap an online resource for in silico transcription factor binding sites in the Arabidopsis thaliana genome Nils Ole Steffens Claudia Galuschka Martin Schindler Lorenz BuEgravelow and Reinhard Hehl Institut fuEgrave r Genetik Technische UniversitaEgrave t Braunschweig Spielmannstraucirce 7 D-38106 Braunschweig Germany hellipShahmuradovIA GammermanAJ HancockJM BramleyPM and SolovyevVV (2003) PlantProm a database of plant promoter sequences Plant Physiology October 2004 Vol 136 pp 3023-3033 GENOME ANALYSIS Utility of Different Gene Enrichment Approaches Toward Identifying and Sequencing the Maize Gene Space1[w] Nathan Michael Springer Xiequn Xu and W Brad Barbazuk Center for Plant and Microbial Genomics Department of Plant Biology University of Minnesota St Paul Minnesota 55108 (NMS) and Donald Danforth Plant Sciences Center St Louis Missouri 63132 (XX WBB) Text] Shahmuradov IA Gammerman AJ Hancock JM Bramley PM Solovyev VV (2003) PlantProm a database of plant promoter sequences

Plant Physiology April 2004 Vol 134 pp 1ndash12 wwwplantphysiolorg Characterization of Three Functional High-Affinity Ammonium Transporters in Lotus japonicus with Differential Transcriptional Regulation and Spatial Expression1

Enrica DrsquoApuzzo2 Alessandra Rogato2 Ulrike Simon-Rosin Hicham El Alaoui3 Ani Barbulova Marco Betti Maria Dimou Panagiotis Katinakis Antonio Marquez Anne-Marie Marini Michael K Udvardi and Maurizio Chiurazzi Institute of Genetics and Biophysics Via Marconi 12 80125 Napoli Italy (ED AR HEA AB MC) Molecular Plant Nutrition Group Max Planck Institute of Molecular Plant Physiology Am Muhlenberg 1 14476 Golm Germany (US-R MKU) Universiteacute Libre de Bruxelles Institut de Biologie de Meacutedecine Moleacuteculaires 6041 Gosselies Belgium (A-MM) Agricultural University of Athens Department of Agricultural Biotechnology 11855 Athens Greece (MD PK) and Departamento de Bioquıacutemica Vegetal y Biologıacutea Molecular Facultad de Quimica 41080 Seville Spain (MB AM) hellipPlantProm a database of plant promoter sequences Nucleic Acids Res 31 114ndash117 Nucleic Acids Research 1 January 2003 vol 31 no 1 pp 114-117(4) PlantProm a database of plant promoter sequences Authors Bramley PM1 Solovyev VV2 Shahmuradov IA Gammerman AJ Hancock JM Affiliations Department of Computer Science Royal Holloway University of London Egham Surrey TW20 0EX UK 1 School of Biological Sciences Royal Holloway University of London UK 2 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 3 To whom correspondence should be addressed Email victorsoftberrycom Present address John M Hancock MRC Mammalian Genetics Unit Harwell Oxfordshire UK

PROMH Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK Received February 15 2003 Revised and Accepted March 21 2003

Other FGENESH Molecular Microbiology Volume 52 Issue 6 Page 1579 - June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Affiliations Departments of Medicine and Pathobiology University of Washington Harborview Medical Center Box 359779 325 Ninth Ave Seattle WA 98104 USA E-mail acenturuwashingtonedu Tel (+1) 206 341 5364 Fax (+1) 206 341 5363 Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Microbiology 150 (2004) 518-520 DOI 101099mic026871-0 IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated with recognizable overlapping genes Mark W Silby1 Paul B Rainey23 and Stuart B Levy14 1 Center for Adaptation Genetics and Drug Resistance Department of Molecular Biology and Microbiology Tufts University School of Medicine Boston MA 02111 USA 2 Department of Plant Sciences University of Oxford South Parks Road Oxford OX1 3RB UK 3 School of Biological Sciences University of Auckland Private Bag 92019 Auckland New Zealand 4 Department of Medicine Tufts University School of Medicine Boston MA 02111 USA Correspondence Stuart B Levy (stuartlevytuftsedu) hellipUsing SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of the iiv5 ORF respectivelyhellip Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 DOI 101128JVI782212576-125902004

Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song1 Qi Wei Qin2 Jin Qiu1 Can Hua Huang1 Fan Wang1 and Choy Leong Hew1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2 Corresponding author Mailing address Department of Biological Sciences National University of Singapore 10 Kent Ridge Crescent Singapore 119260 Singapore Phone 65-68742692 Fax 65-67795671 E-mail dbshewclnusedusg or dbsheadnusedusg Received 19 March 2004 Accepted 29 June 2004

hellipThe whole genome was also submitted to httpwwwsoftberrycom (Softberry Inc Mount Kisco NY) for identification of all potential ORFshellip hellipCoding capacity of the viral genomic DNA sequence Prediction of presumptive genes was carried out by using the viral gene prediction program under the website httpwwwsoftberrycom supplemented with Vector NTI suite 71hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1695-8 Issue Volume 109 Number 4 Date August 2004 Pages 681 - 689

Linkage disequilibrium and sequence diversity in a 500-kbp region around the adh1 locus in elite maize germplasm Mark Jung1 Ada Ching1 Dinakar Bhattramakki2 Maureen Dolan1 Scott Tingey1 Michele Morgante1 3 and Antoni Rafalski1

(1) DuPont Crop Genetics Experimental Station PO Box 80353 Wilmington DE 19880-0353 USA (2) Pioneer Hi-Bred International Inc 7300 NW 62nd Avenue PO Box 1004 Johnston IA 50131-1004 USA (3) Dipartimento di Produzione Vegetale e Tecnologie Agrarie Universita di Udine Via delle Scienze 208 33100 Udine ItalyReceived 26 January 2004 Accepted 2 April 2004 Published online 6 August 2004 Mark Jung Email marktjungcgrdupontcom 1) Gene locations were defined by several methods Annotations provided in Tikhonov et al (1999) were first used then FGENESH gene-finding software DNA Sequence - The Journal of Sequencing and Mapping Issue Volume 15 Number 4 August 2004 Pages 269 ndash 276 DOI 10108010425170412331279648 Isolation Characterization and Expression Analysis of a Leaf-specific Phosphoenolpyruvate Carboxylase Gene in Oryza sativa Chang-Fa Lin A1 Chun Wei A1 Li-Zhi Jiang A1 A2 Ke-Gui Li A1 Xiao-Yin Qian A1 Kotb Attia A1 Jin-Shui Yang A1 A1 State Key Laboratory of Genetic Engineering Institute of Genetics School of Life Sciences Fudan University Shanghai 200433 PRChina A2 National Key Laboratory for Soil Erosion and Dry Land Farming on Loess Plateau Northwest Sci-Tech University of Agriculture and Forest Shannxi 712100 PRChina tools of GeneMark (httpopal biologygatechedugeneMark) and Softberry (httpwwwsoftberrycom) For the isolation of putative Plant Molecular Biology DOI 101023BPLAN00000382568980957 Issue Volume 54 Number 4 Date March 2004 Pages 519 ndash 532

Genome-Wide Analysis of the GRAS Gene Family in Rice and Arabidopsis Chaoguang Tian1 Ping Wan1 Shouhong Sun1 Jiayang Li1 and Mingsheng Chen1

(1) Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road Chaoyang District Beijing 100101 ChinaMingsheng Chen Email mschengeneticsaccn

database FgeneSH (Salamov and 90 Solovyev 2000) was used for gene prediction pre- 207 dicted by FgeneSH (minor discrepancies exist due 208 Mycological Research (2004) 108 853-857 Cambridge University Press doi 101017S095375620400067X Published Online 13 авг 2004 Combining transcriptome data with genomic and cDNA sequence alignments to make confident functional assignments for Aspergillus nidulans genes Andrew H SIMS a1 Manda E GENT a1 Geoffrey D ROBSON a1 Nigel S DUNN-COLEMAN a2 and Stephen G OLIVER a1c1 a1 School of Biological Sciences University of Manchester The Michael Smith Building Oxford Road Manchester M13 9PT UK E-mail steveolivermanacuk a2 Genencor International Inc 925 Page Mill Road Palo Alto CA 94304 USA c1 Corresponding author Kingdom Page 2 Genewise FgeneSH FgeneSH+) consisting of 9541 putative open reading frames (ORFs) was released in June 2003 We TAG Theoretical and Applied Genetics DOI 101007s00122-004-1603-2 Issue Volume 109 Number 1 Date June 2004 Pages 129 ndash 139

Gene content and density in banana (Musa acuminata) as revealed by genomic sequencing of BAC clones R Aert1 2 L Saacutegi2 and G Volckaert1

(1) Laboratory of Gene Technology Katholieke Universiteit Leuven Kasteelpark Arenberg 21 3001 Leuven Belgium

Present address Laboratory of Tropical Crop Improvement Katholieke Universiteit Leuven (2) Kasteelpark Arenberg 13 3001 Leuven Belgium

R Aert Email RitaAertagrkuleuvenacbe

gscrikengojp) fgenesh version 11 (Salamov and Solovyev 2000 httpwwwsoftberry com) genemarkhmm version 22a (Lukashin and Borodovsky 1998 http Genome Research 142503-2509 2004 Resources EAnnot A genome annotation tool using experimental evidence Li Ding1 Aniko Sabo Nicolas Berkowicz Rekha R Meyer Yoram Shotland Mark R Johnson Kymberlie H Pepin Richard K Wilson and John Spieth Genome Sequencing Center Washington University School of Medicine St Louis Missouri 63110 USA Article and publication are at httpwwwgenomeorgcgidoi101101gr3152604 1 Corresponding author E-mail ldingwatsonwustledu fax (314) 286-1810 Some ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov and Solovyev 2000 ) are based on intrinsic characteristics of coding hellipSome ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov

and Solovyev 2000 ) are based on intrinsic characteristics of coding sequence (eg codon usage consensus splice sites etc) and require training on known genes from the organismhellip

hellipTo further evaluate the performance of EAnnot we compared EAnnot predictions with Ensembl Genscan and Fgenesh predictions using manual annotation as a standard While Genscan and Fgenesh are ab initio programs Ensembl takes into account experimental data a feature shared with EAnnot Ensembl predicted 1037 known genes with 1798 transcripts and 1457 EST genes with 2308 transcripts for chromosome 6 (build 31) while Fgenesh and Genscan predicted 6230 and 6225 genes respectively We evaluated the performance of each program with respect to splice sites transcripts and genes across all of chromosome 6hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1758-x Issue Volume 109 Number 7 Date November 2004 Pages 1434 - 1447

Full-genome analysis of resistance gene homologues in rice

B Monosi1 R J Wisser2 L Pennill1 and S H Hulbert1

(1) Department of Plant Pathology Kansas State University Manhattan KS 66506-5502 USA (2) Department of Plant Pathology Cornell University Ithaca NY 14853 USAReceived 18 February 2004 Accepted 16 June 2004 Published online 10 August 2004 S H Hulbert Email shulbrtksuedu DNA sequences were analyzed using the gene prediction programs GENSCAN (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (Salamov and arXivq-bioGN0402046 v1 27 Feb 2004 Sublinear growth of Information in DNA sequences Giulia Menconi Dipartimento di Matematica Applicata and CISSC Centro Interdisciplinare per lo Studio dei Sistemi Complessi Universit`a di Pisa Via Bonanno Pisano 25b 56126 PISA - Italy menconimaildmunipiit October 23 2003 hellipAs a result four putative genes G1 G2 G3 and G4 have been located by means of Hidden Markov Model-based program FGENESH2 that has been created for predicting multiple genes and their structure in genomic DNA sequences The analysis via FGENESH has been exploited with respect to known genes in Arabidopsis thaliana Their predicted position is illustrated in Figure 13 hellip2This program is available at the website wwwsoftberrycom to which we refer con-cerning the reliability and e_ciency of the algorithmhellip Current Opinion in Plant Biology 2004 7732ndash736 DOI 101016jpbi200409003 Consistent over-estimation of gene number in complex plant genomes Jeffrey L Bennetzen14 Craig Coleman27 Renyi Liu15 Jianxin Ma16 and Wusirika Ramakrishna38 1 Department of Genetics University of Georgia Athens Georgia 30602 USA 2 Department of Plant and Animal Sciences Brigham Young University Provo Utah 84602 USA 3 Department of Biological Sciences Michigan Tech University Houghton Michigan 49931 USA

4e-mail maizeugaedu 5e-mail lryugaedu 6e-mail jmaugaedu 7e-mail biotechnologybyuedu 8e-mail wusirikamtuedu hellipWe have found that the standard gene-discovery programs FGENESH GeneMark and GENSCAN annotate segments of most retrotransposons and many invertedrepeat transposable elements as genes Using FGENESH to annotate maize BAC clones for instance 70ndash100 of the predicted genes are actually from transposable elementshellip The Plant Cell 162795-2808 (2004) Spotted leaf11 a Negative Regulator of Plant Cell Death and Defense Encodes a U-BoxArmadillo Repeat Protein Endowed with E3 Ubiquitin Ligase Activity Li-Rong Zenga Shaohong Qua Alicia Bordeosb Chengwei Yangc Marietta Baraoidanb Hongyan Yanc Qi Xiec Baek Hie Nahmd Hei Leungb and Guo-Liang Wanga1 a Department of Plant Pathology Ohio State University Columbus Ohio 43210 b International Rice Research Institute Metro Manila Philippines c State Key Lab for Biocontrol Sun Yat-sen (Zhongshan) University Guangzhou China 510275 d Department of Biological Science Myongji University Kyonggido Korea 449728 1 To whom correspondence should be addressed E-mail wang620osuedu fax 614-292-4455 in spl11 Exons predicted in G3 by the programs GENSCAN and Fgenesh using different matrixes are displayed in dark gray (D) RFLP Source Human Genomics Volume 1 Number 2 January 2004 pp 146-149(4) Publisher Henry Stewart Publications The truth about mouse human worms and yeast Authors David R Nelson1 Daniel W Nebert2 1 Department of Molecular Sciences and The UT Center of Excellence in Genomics and Bioinformatics University of Tennessee Memphis Tennessee 38163 USA 2 Department of Environmental Health and Center for Environmental Genetics (CEG) University of Cincinnati Medical Center Cincinnati Ohio 45267-0056 USA unpublished data 2003 see also Ref [7]) FGENESH 21 TWINSCAN 22 and the Ensembl annotation pipeline 23 The output of the four Genome Biology 2004 5R73 doi101186gb-2004-5-10-r73 A comprehensive transcript index of the human genome generated using microarrays and computational approaches Eric E Schadt 1 Stephen W Edwards 1 Debraj GuhaThakurta1 Dan Holder2 Lisa Ying2 Vladimir Svetnik2 Amy Leonardson1 Kyle W Hart3 Archie Russell1 Guoya Li1 Guy Cavet1 John Castle1 Paul McDonagh4 Zhengyan Kan1 Ronghua Chen1 Andrew Kasarskis1 Mihai Margarint1 Ramon M Caceres1 Jason M Johnson1

Christopher D Armour1 Philip W Garrett-Engele1 Nicholas F Tsinoremas5 and Daniel D Shoemaker1

1Rosetta Inpharmatics LLC 12040 115th Avenue NE Kirkland WA 98034 USA 2Merck Research Laboratories W42-213 Sumneytown Pike POB 4 Westpoint PA 19846 USA 3Rally Scientific 41 Fayette Street Suite 1 Watertown MA 02472 USA 4Amgen Inc 1201 Amgen Court W Seattle WA 98119 USA 5The Scripps Research Institute Jupiter FL 33458 USA hellipGrailEXP 40 [47] GENSCAN 10 [48] FGENESH [49] and FGENESH+ [49]ab initio gene-prediction algorithms were run independently across the entire genome assembly to augment alignment-based gene identification methods GrailEXP 40 GENSCAN 10 and FGENESH version 1c were run with default parameters for human sequence GrailEXP used expressed sequence evidence from RefSeq UniGene and DoubleTwist HGI to refine gene predictions FGENESH+ was run with protein sequences from BLASTX with E-score lower than 10-5 When multiple protein alignments overlapped all overlapping protein sequences were clustered with BLASTClust [50] and the lowest E-score hit was used by FGENESH+hellip hellipAdditionally 3 sim4 and 3 or 5 FGENESH+ predicted exons that were short andor distant from internal predicted exons were removedhellip Genome Research 14988-995 2004 ENSEMBL Special GeneWise and Genomewise Ewan Birney13 Michele Clamp2 and Richard Durbin2 1 The European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK hellipThere has been a long history of successful ab initio programs which do not use any additional

evidence to predict genes on genomic DNA of which Genscan (Burge and Karlin 1997 ) and Fgenesh (Solovyev and Salamov 1997 ) are two of the most successful caseshellip hellipAnother class of evidence-based gene prediction programs are ones which use external evidence to influence the scoring of potential exons including SGP-2 (Parra et al 2003 ) Genie

(Kulp et al 1996 ) Genomescan (Yeh et al 2001 ) HMMGene (Krogh 2000 ) and Fgenesh++ (Solovyev and Salamov 1997 )hellip Published online before print February 5 2004 101073pnas0308430100 PNAS | February 17 2004 | vol 101 | no 7 | 1910-1915 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin Nam Joonyul Kim para Shinyoung Lee Gynheung An Hong Ma and Masatoshi Nei Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State

University University Park PA 16802 and National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Contributed by Masatoshi Nei December 22 2003

hellipBecause annotation of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program FGENESH (wwwsoftberrycom) from the genome sequences obtained from TIGR and the Rice Genome Database (China) (25) Functional amp Integrative Genomics DOI 101007s10142-004-0109-y Issue Volume 4 Number 2 Date May 2004 Pages 102 - 117

Sequence analysis of the long arm of rice chromosome 11 for ricendashwheat synteny

Nagendra K Singh1 Saurabh Raghuvanshi2 Subodh K Srivastava1 Anupama Gaur2 Ajit K Pal1 Vivek Dalal1 Archana Singh1 Irfan A Ghazi1 Ashutosh Bhargav1 Mahavir Yadav1 Anupam Dixit1 Kamlesh Batra1 Kishor Gaikwad1 Tilak R Sharma1 Amitabh Mohanty2 Arvind K Bharti2 Anita Kapur2 Vikrant Gupta2 Dibyendu Kumar2 Shubha Vij2 Ravi Vydianathan2 Parul Khurana2 Sulabha Sharma2 W Richard McCombie3 Joachim Messing4 Rod Wing5 Takuji Sasaki6 Paramjit Khurana2 Trilochan Mohapatra1 Jitendra P Khurana2 and Akhilesh K Tyagi2

1 Indian Initiative for Rice Genome Sequencing National Research Centre on Plant Biotechnology

Indian Agricultural Research Institute New Delhi 110012 India 2 Indian Initiative for Rice Genome Sequencing Department of Plant Molecular Biology University of

Delhi South Campus New Delhi 110021 India 3 Genome Research Centre Cold Spring Harbor Laboratory 1 Bungtown Road New York USA 4 The Plant Genome Initiative at RutgersndashWaksman Institute Rutgers University 190 Frelinghuysen

Road Piscataway NJ 08873 USA 5 Department of Plant Sciences 303 Forbes Building Arizona Genomics Institute The University of

Arizona Tucson AZ 85721 USA 6 Department of Genome Research National Institute of Agrobiological Sciences 1-2 Kannondai 2-

chome Tsukuba Ibaraki 305-8602 Japan Akhilesh K Tyagi Email akhileshgenomeindiaorg Wherever RiceGAAS data were not available the genes were predicted by FGENESH trained for monocot plant species (http wwwsoftberrycomberryphtml) TAG Theoretical and Applied Genetics DOI 101007s00122-004-1621-0 Issue Volume 109 Number 1 Date June 2004 Pages 10 - 22

Annotation of a 95-kb Populus deltoides genomic sequence reveals a disease resistance gene cluster and novel class I and class II transposable elements

M Lescot1 4 S Rombauts1 J Zhang1 S Aubourg1 5 C Matheacute1 6 S Jansson2 P Rouzeacute1 3 and W Boerjan1

1 Department of Plant Systems Biology Flanders Interuniversity Institute for Biotechnology Ghent University Technologiepark 927 9052 Gent Belgium

2 Department of Plant Physiology University of Umearing 901 87 Umearing Sweden 3 Laboratoire Associeacute de l Institut National de la Recherche Agronomique (France) Ghent

University 9052 Gent Belgium 4 Present address CIRAD-Biotrop TA4003 34398 Montpellier Cedex 5 France 5 Pre sent address Uniteacute de Recherche en Geacutenomique Veacutegeacutetale INRA 91057 Evry Cedex

France 6 Present address Laboratoire de Biologie Vasculaire Institut de Pharmacologie et Biologie

Structurale 205 route de Narbonne 31077 Toulouse Cedex France

P Rouzeacute Email pierrerouzepsbugentbe 1999 httpwwwtigrorgtdbglimmermglmr_formhtml) and FgenesH for dicots or monocots (Salamov and Solovyev 2000 httpwwwsoftberrycom) BIOINFORMATICS 2004 vol20 N9 p1416-1427 J Yuan B Bush A Elbrecht Y Liu T Zhang W Zhao hellip - suchasGRAIL(Lopezetal 1994 Roberts 1991 Uberbacher et al 1996) GENESCOPE (Murakami and Takagi 1998) fgenesh (Salamov and Solovyev 2000) GeneMark Molecular Plant Pathology Volume 5 Issue 6 Page 515 - November 2004 Pathogen profile Heading for disaster Fusarium graminearum on cereal crops RUBELLA S GOSWAMI AND H CORBY KISTLERCorrespondence E-mail HCKISTumnedu This pipeline uses a combination of the programs FGENESH and FGENESH+ (Salamov and Solovyev 2000) modified by Softberry ( http wwwsoftberrycom ) with Nucleic Acids Research 2004 Vol 32 Database issue D41-D44 MIPS analysis and annotation of proteins from whole genomes H W Mewes12 C Amid1 R Arnold1 D Frishman2 U Guumlldener1 G Mannhaupt2 M Muumlnsterkoumltter1 P Pagel1 N Strack2 V Stuumlmpflen1 J Warfsmann1 and A Ruepp1 1 Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1 D-85764 Neuherberg Germany and 2 Technische Universitaumlt Muumlnchen Chair of Genome Oriented Bioinformatics Center of Life and Food Science D-85350 Freising-Weihenstephan Germany To whom correspondence should be addressed at Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 D-85764 Neuherberg Germany Tel +49 89 3187 3580 Fax +49 89 3187 3585 Email wmewesgsfde The genome of 40 Mb encodes 10 000 proteins automatically predicted by the program FGENESH (httpsoftberry com) specifically trained for Neurospora Annual Review of Genomics and Human Genetics Vol 5 15-56 (Volume publication date September 2004) COMPARATIVE GENOMICS Webb Miller Kateryna D Makova Anton Nekrutenko and Ross C Hardison The Center for Comparative Genomics and Bioinformatics The Huck Institutes of Life Sciences and the Departments of Biology Computer Science and Engineering and Biochemistry and Molecular Biology Pennsylvania State University University Park Pennsylvania email webbbxpsuedu kdm16psuedu antonbxpsuedu rch8psuedu These algorithms include Genscan the most popular gene prediction tool (24) GenMark (117) Fgenesh (155) GeneID (144) and others (for an excellent overview DNA and Cell Biology May 2004 Vol 23 No 5 311-324 Harbinger Transposons and an Ancient HARBI1 Gene Derived from a Transposase

Vladimir V Kapitonov Genetic Information Research Institute Mountain View California Jerzy Jurka Genetic Information Research Institute Mountain View California We used FGENESH (Salamov and Solovyev 2000) and GeneScan (Burge and Karlin 1997) for the identification of exons and introns The d N d S ratio Proc Natl Acad Sci U S A 2004 February 17 101(7) 1910ndash1915 Published online 2004 February 5 doi 101073pnas0308430100 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin NamdaggerDagger Joonyul Kimsectpara Shinyoung Leesect Gynheung Ansect Hong Madagger and Masatoshi Neidagger

daggerInstitute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University University Park PA 16802 and sectNational Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Dagger To whom correspondence should be addressed E-mail jyn101psuedu paraPresent address Michigan State UniversityndashDepartment of Energy Plant Research Laboratory and Department of Biochemistry and Molecular Biology Michigan State University East Lansing MI 48824 of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program fgenesh (wwwsoftberrycom Nucleic Acids Research 2004 Vol 32 Database issue D377-D382 BGI-RIS an integrated information resource and comparative analysis workbench for rice genomics Wenming Zhao1 Jing Wang2 Ximiao He1 Xiaobing Huang1 Yongzhi Jiao1 Mingtao Dai1 Shulin Wei1 Jian Fu1 Ye Chen1 Xiaoyu Ren1 Yong Zhang12 Peixiang Ni1 Jianguo Zhang1 Songgang Li12 Jian Wang1 Gane Ka-Shu Wong13 Hongyu Zhao4 Jun Yu1 Huanming Yang1 and Jun Wang1 1 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3 University of Washington Genome Center Department of Medicine Seattle WA 98195 USA and 4 Yale University School of Medicine Department of Epidemiology and Public Health New Haven CT 06520-8034 USA To whom correspondence should be addressed Tel +86 10 80481662 Fax +86 10 80498676 Email wangjgenomicsorgcn Correspondence may also be addressed to Huanming Yang Tel +86 10 80494969 Fax +86 10 80491181 Email yanghmgenomicsorgcn The authors wish it to be known that in their opinion the first four authors should be regarded as joint First Authors The contig sequences were annotated for gene content by using automated processes that involve ab initio gene finders such as FgeneSH (httpwwwsoftberrycom Genome Research 141932-1937 2004 Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome

Jinsheng Lai1 Nrisingha Dey2 Cheol-Soo Kim35 Arvind K Bharti1 Stephen Rudd46 Klaus FX Mayer4 Brian A Larkins3 Philip Becraft2 and Joachim Messing17 1 Waksman Institute Rutgers The State University of New Jersey Piscataway New Jersey 08854 USA 2 Department of Genetics Development amp Cell Biology Iowa State University Ames Iowa 50011 USA 3 Department of Plant Science University of Arizona Tucson Arizona 85721 USA 4 Munich Information Center for Protein Sequences Institute for Bioinformatics GSF Research Center for Environment and Health Neuherberg Germany A total of 54397 putative genes could be predicted for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for Plant Molecular Biology DOI 101023BPLAN000002876821587dc Issue Volume 54 Number 1 Date January 2004 Pages 55 - 69

Dynamics of the evolution of orthologous and paralogous portions of a complex locus region in two genomes of allopolyploid wheat

Xiu-Ying Kong1 2 Yong Qiang Gu3 Frank M You4 Jorge Dubcovsky4 and Olin D Anderson3

1 Genetic Resources Conservation Program University of California Davis CA 95616 USA 2 Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081

China 3 US Dept of Agriculture Western Regional Research Center Agricultural Research Service 800

Buchanan Street Albany CA 94710 USA 4 Department of Agronomy and Range Sciences University of California Davis CA 95616 USA FGENESH (httpwwwsoftberrycomberryphtml) and GENES- CAN (httpgenemark miteduGENESCANhtml) were used for gene prediction Current Genetics DOI 101007s00294-003-0451-y Issue Volume 44 Number 6 Date January 2004 Pages 329 - 338

Chromosome rearrangements in isolates that escape from het-c heterokaryon incompatibility in Neurospora crassa

Qijun Xiang1 and N Louise Glass1

Department of Plant and Microbial Biology University of California Berkeley CA 94720-3102 USA Hypothetical proteins are predicted from FGENESH calls with overlapping Blastx hits (but not with trusted homology) while Predicted Molecular Genetics and Genomics DOI 101007s00438-004-0990-z Issue Volume 271 Number 4 Date May 2004 Pages 402 - 415

Genome-wide identification of NBS genes in japonica rice reveals significant expansion of divergent non-TIR NBS-LRR genes

T Zhou1 Y Wang1 J-Q Chen1 H Araki2 Z Jing1 K Jiang1 J Shen1 and D Tian1

1 State Key Laboratory of Pharmaceutical Biotechnology Department of Biology Nanjing University 210093 Nanjing China

2 Department of Ecology and Evolution University of Chicago Chicago IL 60637 USA to 5000ndash10000 bp from both ends of the hits and then the expanded nucleotide fragments were reannotated using the gene-finding programs FGENESH (http www Proc Natl Acad Sci U S A 2004 June 15 101(24) 9045ndash9050 Genetics

Genetic control of branching in foxtail millet Andrew N Doust Katrien M DevosDaggerdagger Michael D Gadberrysect Mike D GaleDagger and Elizabeth A KelloggDepartment of Biology University of Missouri 8001 Natural Bridge Road St Louis MO 63121 and DaggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH United Kingdom dagger To whom correspondence should be addressed E-mail adoustumsledu sectPresent address Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building University of Georgia Athens GA 30602 Communicated by Peter H Raven Missouri Botanical Garden St Louis MO April 23 2004 Each of these contigs was scanned by using fgenesh (28) and identified ORFs were translated and compared with ORFs from other contigs from the same QTL region Mol Biol Evol 21(9)1769-1780 2004 Merlin a New Superfamily of DNA Transposons Identified in Diverse Animal Genomes and Related to Bacterial IS1016 Insertion Sequences Ceacutedric Feschotte1 Departments of Plant Biology and Genetics The University of Georgia Athens Correspondence E-mail cedricplantbiougaedu coding sequences were assembled by removing introns predicted with more than 85 confidence by NetGene2 (httpwwwcbsdtudk) andor FGENESH (httpgenomic Genome Research 141924ndash1931 copy2004 Gene Loss and Movement in the Maize Genome Jinsheng Lai1 Jianxin Ma23 Zuzana Swigonˇovб1 Wusirika Ramakrishna24

Eric Linton15 Victor Llaca16 Bahattin Tanyolac17 Yong-Jin Park28 O-Young Jeong29

Jeffrey L Bennetzen23 and Joachim Messing110 1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854-8020 USA 2Department of Biological Sciences Purdue University West Lafayette Indiana 47907-1392 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602-7223 USA 4Department of Biological Sciences 740 DOW Michigan Tech University Houghton MI 49931 USA 5Plant Biology Labs Michigan State University East Lansing MI 48824 USA 6Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880-0353 USA 7Department of Bioengineering Ege University Izmir 35100 Turkey 8National Institute of Agricultural Biotechnology Suwon 441-707 Republic of Korea 9National Institute of Crop Science Suwon 441-857 Republic of Korea 10Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 The FGENESH program predicted four of which three (gene 1d in the maize orp1 region gene 5a 5b in the rice r1 region) would produce truncated proteins

Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Molecular Microbiology Volume 54 Issue 2 Page 407 - October 2004 doi101111j1365-2958200404310x Cryptococcus neoformans Kin1 protein kinase homologue identified through a Caenorhabditis elegans screen promotes virulence in mammals Eleftherios Mylonakis1 Alexander Idnurm2 Roberto Moreno1 Joseph El Khoury134 James B Rottman5 Frederick M Ausubel67 Joseph Heitman28910 and Stephen B Calderwood1111Division of Infectious Diseases Massachusetts General Hospital Boston MA 02114 USA 2Department of Molecular Genetics and Microbiology Duke University Medical Center Durham NC 27710 USA 3Center for Immunology and Inflammatory Diseases and 4Division of Rheumatology Allergy and Immunology Massachusetts General Hospital Boston MA 02114 USA 5Archemix Corporation Cambridge MA 02139 USA 6Department of Molecular Biology Massachusetts General Hospital Boston MA 02114 USA 7Department of Genetics Harvard Medical School Boston MA 02115 USA 8Division of Infectious Diseases 9Department of Medicine and 10Howard Hughes Medical Institute Duke University Medical Center Durham NC 27710 USA 11Department of Microbiology and Molecular Genetics Harvard Medical School Boston MA 02115 USA E-mail scalderwoodpartnersorg

Sequences were compared with the H99 genome database at Duke University and genes predicted in these regions by FGENESH software ( http wwwsoftberrycom TAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Original Paper Characterization of soybean genomic features by analysis of its expressed sequence tags Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

1 Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

2 Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China Jin-Song Zhang Email jszhanggeneticsaccnShou-Yi Chen Email sychengeneticsaccn six BAC-contig sequences of M truncatula were analyzed and the results based on the gene prediction program FGENSH (Arabidopsis matchFGENESH prediction) (http Current Proteomics Volume 1 Number 1 January 2004 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE11 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Chinese Science Bulletin 2004 Vol 49 No 4 355-362 The VER2 promoter contains repeated sequences and requires vernalization for its activity in winter wheat (Triticum aestivum L) XU Wenzhong

1 WANG Xin

1 FENG Qi

2 ZHANG Lei

2 LIU Yaoguang

3 HAN Bin

2 CHONG

Kang1 XU Zhihong

1 amp TAN Kehui

1

1 Research Center for Molecular Developmental Biology Key Lab of Photosynthesis and Environmental Molecular Physiology Institute of Botany Chinese Academy of Sciences (CAS) Beijing 100093 China 2 National Center for Gene Research CAS Shanghai 200233 China 3 Genetic Engineering Laboratory College of Life Sciences South China Agricultural University Guangzhou 510642 China Correspondence should be addressed to Chong Kang (e-mail mailtochongknsibcasaccn) Sequence analyses were finished using biological softwares on Internet such as FGENESH 10 (Prediction of potential genes in Plant (Dct) genomic DNA)

TAG Theoretical and Applied Genetics DOI 101007s00122-003-1457-z Issue Volume 108 Number 3 Date February 2004 Pages 392 - 400

Sequence variations of simple sequence repeats on chromosome-4 in two subspecies of the Asian cultivated rice Can Li1 Yu Zhang1 Kai Ying1 Xiaolei Liang1 and Bin Han1

(1) National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China To characterize the possible relationship between SSRs and genes predicted by using FGENESH we investigated the distribution of SSRs in the rice chromosome-4 Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene TAG Theoretical and Applied Genetics DOI 101007s00122-004-1591-2 Issue Volume 108 Number 8 Date May 2004 Pages 1449 - 1457

Positional cloning of the rice Rf-1 gene a restorer of BT-type cytoplasmic male sterility that encodes a mitochondria-targeting PPR protein

H Akagi1 A Nakamura2 Y Yokozeki-Misono2 A Inagaki2 4 H Takahashi1 K Mori1 and T Fujimura3

1 Laboratory of Plant Breeding and Genetics Department of Biological Production Faculty of Bioresource Sciences Akita Prefectural University Kaidoubata-Nishi 241-7 Shimoshinjyo-Nakano 010-0195 Akita Japan

2 Biochemical Technology Section Life Science Laboratory Performance Materials RampD Center Mitsui Chemicals Togo 1144 297-0017 Mobara Japan

3 Institute of Agricultural and Forest Engineering University of Tsukuba Tennoudai 1-1-1 Tsukuba 305-8572 Ibaraki Japan

4 Present address Department of Synthetic Chemistry and Biological Chemistry Graduate School of Engineering Kyoto University Yoshida-Honmachi Sakyo-ku 606-8501 Kyoto japan

H Akagi Email akagiakita-puacjp Software Develop- ment Tokyo) Genomic sequences were also analyzed using gene prediction programs genescan and fgenesh Table 1 DNA Genome Research 14942-950 2004

The Ensembl Automatic Gene Annotation System Val Curwen1 Eduardo Eyras1 T Daniel Andrews1 Laura Clarke1 Emmanuel Mongin2 Steven MJ Searle1 and Michele Clamp34 1 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 EMBL European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SD UK 3 The Broad Institute Cambridge Massachusetts 02141 USA Commonly we use Genscan for ab initio prediction in human mouse and rat but the system is equally applicable to other methods such as FgenesH (Solovyev et Gene 324 (2004) 105ndash115 Transcript abundance of rml1 encoding a putative GT1-like factor in rice is up-regulated by Magnaporthe grisea and down-regulated by light Rong Wang ab Guofan Hongab Bin Hana a

National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China b

Shanghai Institute of Biochemistry and Cell Biology Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 320 Yueyang Road Shanghai 200031 China 1A and 2A) The structure of rml1 _ a is as same as that predicted by FGENESH software The 3V UTR of rml1 _ a is confirmed with the length of 596 bp Genome Biology 2004 5R46 doi101186gb-2004-5-7-r46 Identification of conserved gene structures and carboxy-terminal motifs in the Myb gene family of Arabidopsis and Oryza sativa L ssp Indica Cizhong Jiang1 Xun Gu1 2 and Thomas Peterson1 1Department of Genetics Development and Cell Biology and Department of Agronomy Iowa State University Ames IA 50011 USA 2LHB Center for Bioinformatics and Biological Statistics Iowa State University Ames IA 50011 USA FGeneSH has been used successfully to predict genes in rice [9] and GenScan was used together with it to predict genes by taking rice genomic sequences as Molecular Microbiology 53 (5) 1307-1318 - September 2004 doi 101111 j1365-2958200404215x The sirodesmin biosynthetic gene cluster of the plant pathogenic fungus Leptosphaeria maculans Donald M Gardiner1 Anton J Cozijnsen1 Leanne M Wilson1 M Soledade C Pedras2 and Barbara J Howlett1

1School of Botany The University of Melbourne Victoria Australia 3010 2Department of Chemistry University of Saskatchewan 110 Science Place Saskatoon SK Canada S7N 5C9 E-mail dgardinerpgradunimelbeduau http wwwtigrorg Putative genes were predicted using Fgenesh software at http wwwsoftberrycom Fungal culture The wild type Journal of Molecular Evolution DOI 101007s00239-004-2666-z Issue Volume 59 Number 6 Date December 2004 Pages 761 - 770 Analysis of the Molecular Evolutionary History of the Ascorbate Peroxidase Gene Family Inferences from the Rice Genome Felipe Karam Teixeira1 Larissa Menezes-Benavente1 Rogeacuterio Margis1 2 and Maacutercia Margis-Pinheiro1

(1) Laboratoacuterio de Geneacutetica Molecular Vegetal Departamento de Geneacutetica UFRJ 21944-970 Rio de Janeiro Brasil(2) Departamento de Bioquiacutemica Instituto de Quiacutemica UFRJ 21944-970 Rio de Janeiro Brasil Genomic se- quences were also analyzed in the FGENESH gene structure pre- diction program (httpwwwsoftberrycom) (Solovyev 2001) and GeneMark program (http Incomplete gene structure prediction with almost 100 specificity SL Chin J Xiong T Ioerger SH Sze - 2004 - txspacetamuedu iv Fgenesh-C The proposed gene structure prediction algorithm by far has the best 25 12 Comparison against Est2Genome Sim4 Spidey Fgenesh-c Journal of Biotechnology 109 (2004) 217ndash226

Preparation of single rice chromosome for construction of a DNA library using a laser microbeam trap

Xiaohui Liu a1 Haowei Wang b1 Yinmei Li b Yesheng Tang a Yilei Liu a Xin u a Peixin Jia a Kai Ying a Qi Feng a Jianping Guan a Chaoqing Jin a Lei Zhang a Liren Lou b Zhuan Zhou c Bin Han aA National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200233 China B Department of Physics University of Science and Technology of China Hefei 230026 China C Institute of Neuroscience Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200031 China Corresponding author Fax +86-21-64825775 E-mail address bhanncgraccn (B Han) ers These sequences were further annotated using gene-prediction software FGENESH to give the pos- sible protein-coding region Science 303 1364-1367 Medicago truncatula DMI1 Required for Bacterial and Fungal Symbioses in Legumes Ane et al (2004) 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberry phtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Proc Natl Acad Sci U S A 2004 August 24 101(34) 12404ndash12410 Inaugural Articles Rapid recent growth and divergence of rice nuclear genomes Jianxin Ma and Jeffrey L Bennetzen

Department of Genetics University of Georgia Athens GA 30602 To whom correspondence should be addressed E-mail maizeugaedu Contributed by Jeffrey L Bennetzen May 25 2004 Almost all LTR-retrotransposons including solo LTRs identified in our studies were predicted as genes by the gene-finding program fgenesh (data not shown) The Plant Journal Volume 37 Issue 4 Page 517 -527 - February 2004 doi101046j1365-313X200301976x Xa26 a gene conferring resistance to Xanthomonas oryzae pv oryzae in rice encodes an LRR receptor kinase-like protein Xinli Sun Yinglong Cao Zhifen Yang Caiguo Xu Xianghua Li Shiping Wang and Qifa Zhang National Key Laboratory of Crop Genetic Improvement National Center of Crop Molecular Breeding Huazhong Agricultural University Wuhan 430070 China For correspondence (fax +86 27 87287092 e-mail swangmailhzaueducn) al 1997) Gene prediction programs used were genscan (Burge and Karlin 1997) and fgenesh (httpwwwsoftberrycom) Promoter Genome Research 141916ndash1923 copy2004 by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonˇovaґ16 Jinsheng Lai16 Jianxin Ma23 Wusirika Ramakrishna24

Victor Llaca15 Jeffrey L Bennetzen23 and Joachim Messing17

1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854 USA

2Department of Biological Sciences and Genetics Program West Lafayette Indiana 47907 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602 USA 4Department of Biological Sciences Michigan Tech University MI 49931 USA 5Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880 USA 6These authors contributed equally to this work 7Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 Page 1 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonovaacute 16 Jinsheng Lai 16 Jianxin Ma 23 Wusirika Ramakrishna 24 TAG Theoretical and Applied Genetics DOI 101007s00122-004-1667-z Issue Volume 109 Number 3 Date August 2004 Pages 515 - 522

Physical mapping and putative candidate gene identification of a quantitative trait locus Ctb1 for cold tolerance at the booting stage of rice K Saito1 Y Hayano-Saito1 W Maruyama-Funatsuki1 Y Sato1 and A Kato1

(1) National Agricultural Research Center for Hokkaido Region Hitsujigaoka 1 Toyohira Sapporo Hokkaido 062-8555 JapanK Saito Email kjsaitoaffrcgojp GENSCAN RICEHMM FGENESH MZEF ) a splice prediction program ( SPLICEPREDIC- TOR ) homology search analysis programs ( BLAST HMMER TAG Theoretical and Applied Genetics DOI 101007s00122-004-1697-6 Issue Volume 109 Number 4 Date August 2004 Pages 690 - 699 The anthracnose resistance locus Co-4 of common bean is located on chromosome 3 and contains putative disease resistance-related genes M Melotto1 4 M F Coelho1 A Pedrosa-Harand2 J D Kelly3 and L E A Camargo1

1 Departamento de Fitopatologia Laboratoacuterio de Geneacutetica Molecular ESALQ Universidade de Satildeo Paulo Piracicaba SP CP 9 13418-900 Brazil

2 Department of Cell Biology and Genetics Institute of Botany University of Vienna Rennweg 14 Vienna 1030 Austria

3 Department of Crop and Soil Sciences Michigan State University East Lansing MI 48824 USA 4 Present address MSU-DOE Plant Research Laboratory Michigan State University 206 Plant Biology

Building East Lansing MI 48824 USA M Melotto Email melottommsuedu and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (httpwww softberrycom)mdashusing Arabidopsis as the model or- ganism Journal of Genetics Vol 83 No 1 P 79-99 April 2004 Structural and functional analysis of rice genome Tyagi A K Khurana J P Khurana P Raghuvanshi S Gaur A Kapur A Gupta V Kumar D Ravi V Vij S Khurana P and Sharma S Department of Plant Molecular Biology University of Delhi South Campus Benito Juarez Road New Delhi 110 021 India It inte- grates results from several gene prediction software such as GENSCAN (Burge and Karlin 1997) FGENESH (Sala- mov and Solovyev 2000) RiceHMM (Sakata The Plant Cell 161220-1234 (2004) Comparative Analysis of the Receptor-Like Kinase Family in Arabidopsis and Rice Shin-Han Shiua Wojciech M Karlowskib Runsun Panad Yun-Huei Tzengac Klaus F X Mayerb and Wen-Hsiung Lia1 a Department of Ecology and Evolution University of Chicago Chicago Illinois 60637 b Munich Information Center for Protein SequencesInstitute of Bioinformatics GSF National Research Center for Environment and Health Neuherberg 85764 Germany c Department of Mathematics National Tsing Hua University Hsinchu Taiwan 300 d Institute of Information Science Academia Sinica Taiwan 115

1 To whom correspondence should be addressed E-mail whliuchicagoedu fax 773-702-9740 a permissive E value cutoff of 1 The rice genes from the indica subspecies was predicted using the whole genome shotgun assembly with FGENESH (Solovyev 2002 Genome Research 141474ndash1482 (2004) copy by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Incongruent Patterns of Local and Global Genome Size Evolution in Cotton Corrinne E Grover1 HyeRan Kim2 Rod A Wing2 Andrew H Paterson3 and Jonathan F Wendel14 1Department of Ecology Evolution and Organismal Biology Iowa State University Ames Iowa 50011 USA 2Arizona Genomics Institute University of Arizona Tucson Arizona 85721 USA 3Plant Genome Mapping Laboratory University of Georgia Athens Georgia 30602 USA hellip Potential genes were predicted by three independent programs FGENESH (httpwwwsoftberrycom) Plant Physiology May 2004 Vol 135 pp 459-470 Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four Triticeae Genomes Yong Qiang Gu Devin Coleman-Derr Xiuying Kong and Olin D Anderson United States Department of Agriculture-Agricultural Research Service Western Regional Research Center Albany California 94710 (YQG DC-D ODA) and Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081 China (XK) FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology October 2004 Vol 136 pp 3177-3190 Comparative Sequence Analysis of the Region Harboring the Hardness Locus in Barley and Its Colinear Region in Rice1

Katherine S Caldwell2 Peter Langridge and Wayne Powell Scottish Crop Research Institute Invergowrie Dundee DD2 5DA United Kingdom (KSC WP) and School of Agriculture and Wine (KSC PL) and Australian Centre for Plant Functional Genomics (PL) University of Adelaide Waite Campus Glen Osmond South Australia 5064 Australia jp Sakata et al 2002 ) which couples the integration of several programs for the prediction of open reading frames (GENSCAN RiceHMM FGENESH MZEF) with GENES amp DEVELOPMENT 18687-699 2004 pyramus and thisbe FGF genes that pattern the mesoderm of Drosophila embryos

Angelike Stathopoulos1 Bergin Tam1 Matthew Ronshaugen1 Manfred Frasch2 and Michael Levine13 1 Department of Molecular and Cell Biology Division of Genetics amp Development University of California Berkeley California 94720-3204 USA 2 Brookdale Department of Molecular Cell and Developmental Biology Mount Sinai School of Medicine New York New York 10029 USA hellip FGF protein sequences used in alignment and phylogenetic reconstruction were gathered from GenBank or inferred from genomic sequence using GENESCAN (Burge and Karlin 1997 ) and FGENESHhellip Genome Research 141888-1901 2004 Organization and Evolution of a Gene-Rich Region of the Mouse Genome A 127-Mb Region Deleted in the Del(13)Svea36H Mouse Ann-Marie Mallon14 Laurens Wilming24 Joseph Weekes1 James GR Gilbert2 Jennifer Ashurst2 Sandrine Peyrefitte2 Lucy Matthews2 Matthew Cadman1 Richard McKeone1 Chris A Sellick1 Ruth Arkell1 Marc RM Botcherby3 Mark A Strivens1 R Duncan Campbell3 Simon Gregory25 Paul Denny1 John M Hancock16 Jane Rogers2 and Steve DM Brown1 1 Medical Research Council Mammalian Genetics Unit Harwell Oxfordshire United Kingdom 2 Wellcome Trust Sanger Institute Hinxton Genome Campus United Kingdom 3 Medical Research Council Rosalind Franklin Centre for Genomics Research Hinxton Genome Campus United Kingdom hellip Ab initio gene structures were predicted using FGENESH (Salamov and Solovyev 2000 ) and GENSCANhellip Current Proteomics January 2004 vol 1 no 1 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE1 Affiliations 1 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom

sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene Nucleic Acids Research 2003 Vol 31 No 4 1148-1155 Characterization of Arabidopsis thaliana ortholog of the human breast cancer susceptibility gene 1 AtBRCA1 strongly induced by gamma rays S Lafarge and M-H Montaneacute CEA Cadarache DSV-DEVM Laboratoire de Radiobiologie Veacutegeacutetale Bat 185 F-13108 St Paul Lez Durance Cedex France To whom correspondence should be addressed Tel +33 4 42 25 35 56 Fax + 33 4 42 25 26 25 Email marie-helenemontaneceafr Received November 18 2002 Accepted December 5 2002 DDBJEMBLGenBank accession no AF515728 hellipGene structure prediction was done on software implemented on the Softberry web page (httpwwwsoftberrycom) analysis of protein domains using the SMARThellip hellipThe gene structure of At4g21070 was determined with three gene structure prediction software packages (Softberry GenScan Grail) hellip To resolve this ambiguity in intronndashexon prediction we postulated the presence of two genes given by Softberry prediction software and performed northern blotting and 5 RACE to characterize the structural organization of the At4g21070 locushellip Proc Natl Acad Sci U S A 2003 July 22 100(15) 9055ndash9060 doi 101073pnas1032999100 Plant Biology Gene expression of a gene family in maize based on noncollinear haplotypes Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway NJ 08854-8020 Communicated by Brian A Larkins University of Arizona Tucson AZ May 19 2003 (received for review 2002 April 10) To whom correspondence should be addressed E-mail messingmbclrutgersedu The FGENESH program (Softberry Mount Kisco NY) was used for gene prediction analysis BMC Genomics 2003 4 22

doi 1011861471-2164-4-22 Published online 2003 June 3 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams 12 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA 3Department of Biological Sciences Wichita State University Wichita Kansas USA

Corresponding author Sreedhar Oduru odurusreedharttuhscedu Janee L Campbell janeecampbellttuhscedu SriTulasi Karri phrskttuhscedu William J Hendry williamhendrywichitaedu Shafiq A Khan shafiqkhanttuhscedu Simon C Williams simonwilliamsttuhscedu

Two gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=H Insect Molecular Biology Volume 12 Issue 4 Page 319 - August 2003 doi101046j1365-2583200300415x Expression of an Aedes aegypti cation-chloride cotransporter and its Drosophila homologues V Filippov K Aimanova and S S GillAffiliations Department of Cell Biology and Neuroscience University of California Riverside USA Correspondence Sarjeet S Gill 5429 Boyce Hall Environmental Toxicology Graduate Program University of California Riverside CA 92521 USA Tel +1 909 787 4621 Fax +1 909 787 3087 E-mail Sarjeetgillucredu significant similarity to the Drosophila genes were used for gene structure prediction with the FGENESH program available on site http wwwsoftberrycom Developmental Biology 256 (2003) 276ndash289 tcl-2 encodes a novel protein that acts synergistically with Wnt signaling pathways in C elegans Xiaojun Zhaoa Hitoshi Sawab and Michael A Hermana a Program in Molecular Cellular and Developmental Biology Division of Biology Kansas State University Manhattan KS 66506 USA b Laboratory for Cell Fate Decision RIKEN Center for Developmental Biology 2-2-3 Minatojima-minamimachi Chuo-ku Kobe 650-0047 Japan Received for publication 10 September 2002 revised 25 November 2002 accepted 19 December 2002 hellipCbTCL-2 is conceptually translated from a gene predicted by the FGENSH (Salamov and Solovyev 2000 httpwwwsoftberrycom) using defaults for C elegans genomic sequences Proc Natl Acad Sci U S A 2003 May 27 100(11) 6569ndash6574 doi 101073pnas0732024100 Evolution Molecular paleontology of transposable elements in the Drosophila melanogaster genome Vladimir V Kapitonov and Jerzy Jurka

Genetic Information Research Institute 2081 Landings Drive Mountain View CA 94043 Communicated by Margaret G Kidwell University of Arizona Tucson AZ April 7 2003 (received for review 2002 December 23)

To whom correspondence may be addressed E-mail vladimirulamgirinstorg or jurkagirinstorg hellipWe used FGENESH (ref 18 wwwsoftberrycom) for identifying genes encoded by TEs Genetics and Molecular Biology ISSN 1415-4757 versioacuten impresa Genet Mol Biol v26 n4 Satildeo Paulo dic 2003 Iron homeostasis related genes in rice Jeferson GrossI II Ricardo Joseacute SteinII Arthur Germano Fett-NetoI II Janette Palma FettI II

IUniversidade Federal do Rio Grande do Sul Centro de Biotecnologia Porto Alegre RS Brazil IIUniversidade Federal do Rio Grande do Sul Departamento de Botacircnica Porto Alegre RS Brazil IIIBotanical Institute of Ludwig-Maximilians-Universitaumlt Muumlnchen Germany The prediction algorithms were GenScan (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) GenomeScan (Burge and Karlin 1997 httpgenesmitedugenomescanhtml) FGENESH (Salamov and Solovyev 2000 httpwwwsoftberrycomberryphtmltopic= gfind) GeneMarkhmm (Borodovsky and Lukashin unpublished httpopalbiologygatecheduGeneMarkeukhmmcgi) and GrailEXP (Xu and Uberbacher 1997 httpcompbioornlgovgrailexp) Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome J Lai N Dey CS Kim AK Bharti S Rudd KFX Mayer hellip for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for monocotyledonous genes (httpwwwsoftberrycom) Published 23 September 2003 BMC Plant Biology 2003 36 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya12 Address 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia and 2NSW Agricultural Genomics Centre Wagga Wagga Australia Email Qian-Hao Zhu - qianhaozhucsiroau Mohammad Shamsul Hoque - mohammadhoquecsiroau Elizabeth S Dennis - lizdenniscsiroau Narayana M Upadhyaya - narayanaupadhyayacsiroau Corresponding author hellip Analyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCANhellip

The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Australasian Plant Pathology Volume 32 Number 4 2003 pp 511-519 Small scale functional genomics of the blackleg fungus Leptosphaeria maculans analysis of a 38 kb region Alexander Idnurm Janet L Taylor M Soledade C Pedras and Barbara J Howlett vertebrate and Arabidopsis settings Burge and Karlin 1997) and FGENESH on Neurospora crassa and Schizosaccharomyces pombe settings (wwwsoftberrycom) as Barley Genetics Newsletter Volume 32 Hard-copy edition pages 34 - 37 MAPPING AND SEQUENCING OF THE BARLEY PUTATIVE HYPERSENSITIVE INDUCED REACTION GENES Nils Rostoks1 David Kudrna1 and Andris Kleinhofs12

1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 2 School of Molecular Biosciences Washington State University Pullman WA 99164 The full length coding sequence was reconstructed using a combination of FGENESH gene prediction program (httpwwwsoftberrycom) and alignment with cDNAs from the other barley HIR groups TAG Theoretical and Applied Genetics DOI 101007s00294-003-0391-6 Issue Volume 43 Number 5 Date August 2003 Pages 351 - 357 Characterisation of the mating-type locus of the plant pathogenic ascomycete Leptosphaeria maculans Anton J Cozijnsen A1 and Barbara J Howlett A1 A1 School of Botany The University of Melbourne 3010 Victoria Australia hellipGenes introns exons and transcription initiation sites were predicted by analysis with FGENESH (wwwsoftberrycom) on Neurospora crassa andhellip BMC Plant Biol 2003 3 6

doi 1011861471-2229-3-6 Published online 2003 September 23 Received May 23 2003 Accepted September 23 2003 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya 12 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia 2NSW Agricultural Genomics Centre Wagga Wagga Australia

Corresponding author Qian-Hao Zhu qianhaozhucsiroau Mohammad Shamsul Hoque mohammadhoquecsiroau Elizabeth S Dennis lizdenniscsiroau Narayana M Upadhyaya narayanaupadhyayacsiroau hellipAnalyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml identified a single-exon gene capable of encoding a protein with the DNA binding domain of the EREBPAP2 family of plant transcription factors [2636] 1515 bp downstream from the Ds insertion The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Alignment of EREBPAP2 domains was performed using programs of Genetics Computer Group Wisconsin software suit [11] Genetics Vol 164 655-664 June 2003 Copyright copy 2003 Map-Based Cloning of Leaf Rust Resistance Gene Lr21 From the Large and Polyploid Genome of Bread Wheat Li Huanga Steven A Brooksa Wanlong Lia John P Fellersb Harold N Tricka and Bikram S Gilla a Wheat Genetics Resource Center Department of Plant Pathology Kansas State University Manhattan Kansas 66506-5502 b USDA-ARS Plant Science and Entomology Unit Kansas State University Manhattan Kansas 66506-5502 Corresponding author Bikram S Gill 4024 Throckmorton Kansas State University Manhattan KS 66506-5502 bsgksuedu (E-mail hellipIn addition FGENSH 11 (httpwwwsoftberrycom) was used for gene prediction (with monocot genomic DNA parameters) Nucleic Acids Research 2003 Vol 31 No 1 229-233 The TIGR rice genome annotation resource annotating the rice genome and creating resources for plant biologists Qiaoping Yuan Shu Ouyang Jia Liu Bernard Suh Foo Cheung Razvan Sultana Dan Lee John Quackenbush and C Robin Buell The Institute for Genomic Research 9712 Medical Center Dr Rockville MD 20850 USA To whom correspondence should be addressed Tel +1 301 8383558 Fax +1 301 8380208 Email rbuelltigrorg Received August 14 2002 Revised and Accepted October 2 2002 hellipThe rice sequences were processed with multiple ab initio gene finders including FGENESH (httpwwwsoftberrycom)hellip hellip Working models were generated using the FGENESH output and putative identification for the gene was obtained from the most significant database match while models with no significant database match were labeled as hypothetical proteins

JXB Advance Access originally published online on June 18 2003 Journal of Experimental Botany Vol 54 No 389 pp 1995-1996 August 1 2003 Received 21 April 2003 Accepted 25 April 2003 OsSET1 a novel SET-domain-containing gene from rice Yun-Kuan Liang Ying Wang Yong Zhang Song-Gang Li Xiao-Chun Lu Hong Li Cheng Zou Zhi-Hong Xu and Shu-Nong Bai PKU-Yale Joint Research Center of Agricultural and Plant Molecular Biology National Key Laboratory of Protein Engineering and Plant Gene Engineering College of Life Sciences Peking University 5 Yiheyuan Road Beijing 100871 PR China To whom correspondence should be addressed Fax +86 10 6275 1526 E-mail shunongbpkueducn It localizes at chromosome three in rice genome at the contig 1300 (httpwwwsoftberrycomberryphtmltopic=gfindampprg=FGENESH GenBank accession number BMC Genomics 2003 422 Published 3 June 2003 Received 31 January 2003 Accepted 3 June 2003 This article is available from httpwwwbiomedcentralcom1471-2164422 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams12 Address 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA and 3Department of Biological Sciences Wichita State University Wichita Kansas USA Email Sreedhar Oduru - odurusreedharttuhscedu Janee L Campbell - janeecampbellttuhscedu SriTulasi Karri - phrskttuhscedu William J Hendry - williamhendrywichitaedu Shafiq A Khan - shafiqkhanttuhscedu Simon C Williams - simonwilliamsttuhscedu Corresponding author hellipTwo gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=HsapiensTAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Characterization of soybean genomic features by analysis of its expressed sequence tags

Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

Jin-Song Zhang Email jszhanggeneticsaccn

Shou-Yi Chen Email sychengeneticsaccn Phone +86-10-64886859 Fax +86-10-64873428

(1) Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

(2) Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China

prediction of these BAC-contig sequences was based on the gene-prediction program FGENSH (Arabidopsis matchFGENESH DDT Vol 7 No 11 (Suppl) 2002 S70-S76 wwwdrugdiscoverytodaycom Genome annotation techniques new approaches and challenges Alistair G Rust Emmanuel Mongin and Ewan Birney European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridge UK CB10 1SD tel +44 1223 494420 fax +44 1223 494468 e-mail birneyebiacuk Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Box 1 Useful human genome annotation and browser URLs Human genome browsers bull UCSC Human Genome Browser httpgenomecseucsceducgi-binhgGateway bull Softberry Genome Explorer httpwwwsoftberrycomberryphtmltopic=genomexp Ab initio gene prediction programs Ab initio gene predictors rely on the statistical qualities of exons rather than on homologies Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Celerarsquos pipeline) and GrailEXP [16] (ORNL) Proc Natl Acad Sci U S A 2002 August 20 99(17) 11423ndash11428 doi 101073pnas162276199 Neurobiology Identification of G protein-coupled receptors for Drosophila PRXamide peptides CCAP corazonin and AKH supports a theory of ligand-receptor coevolution Yoonseong Parkdagger Young-Joon Kim and Michael E AdamsdaggerDagger

Departments of Entomology and daggerCell Biology and Neuroscience 5429 Boyce Hall University of California Riverside CA 92521 Edited by Lutz Birnbaumer National Institutes of Health Research Triangle Park NC and approved June 14 2002 (received for review 2002 May 7) DaggerTo whom reprint requests should be addressed E-mail adamsmailucredu

This article has been corrected See Proc Natl Acad Sci U S A 2002 October 15 99(21) 13961b For each Drosophila GPCR prediction of gene structure was made in FGENESH (wwwsoftberrycom ref 21) by using about 20 kb of genomic sequence surrounding highly conserved regions particularly for 5 prime and 3 prime ends of ORFs Putative Drosophila GPCRs in the database were amplified by RT-PCR using primers based on gene predictions in the FGENESH gene finder (wwwsoftberrycom ref 21) 21 Salamov A A amp Solovyev V V (2000) Genome Res 10 516-522

prediction httpwwwsoftberrycomberry

Eukaryotic Cell October 2002 p 719-724 Vol 1 No 5 Isocitrate Lyase Is Essential for Pathogenicity of the Fungus Leptosphaeria maculans to Canola (Brassica napus) Alexander Idnurm and Barbara J Howlett School of Botany The University of Melbourne Melbourne Victoria 3010 Australia Received 17 June 2002 Accepted 29 July 2002 hellip The DNA sequence obtained was compared to those in the GenBank database by using BLAST (1) and genes were predicted by using FGENESH software (httpwwwsoftberrycom) and GENSCAN (wwwbionavigatorcom) Bio-Almanac GLE ToxExpress P Offering CLG Annotated sequence data The genes are identified with the FGENESH11 gene modeling software exclusively li- censed from Softberry Inc Automatic Analysis of 106 kb of contiguous DNA sequence from the D genome of wheat reveals high gene density hellip SA Brooks L Huang BS Gill JP Fellers trix In addition FGENESH 11 (httpwwwsoftberrycom) was used for CDS prediction with monocot genomic DNA parameters Both Molecular Genetics and Genomics DOI 101007s00438-002-0706-1 Issue Volume 267 Number 6 Date August 2002 Pages 713 - 720 Genome sequencing of a 239-kb region of rice chromosome 10L reveals a high frequency of gene duplication and a large chloroplast DNA insertion Q Yuan J Hill J Hsiao K Moffat S Ouyang Z Cheng J Jiang C Buell A1 The Institute for Genomic Research 9712 Medical Center Drive Rockville MD 20850 USA A2 Department of Horticulture University of Wisconsin Madison WI 53706 USA The sequences were analyzed with several gene prediction programs including FGENESH (httpwwwsoftberrycom) Genemarkhmm (rice matrix httpopalbiology

Genetics Vol 162 1389-1400 November 2002 Copyright copy 2002 Different Types and Rates of Genome Evolution Detected by Comparative Sequence Analysis of Orthologous Segments From Four Cereal Genomes Wusirika Ramakrishnaa Jorge Dubcovskyb Yong-Jin Park1a Carlos Bussob John Embertona Phillip SanMiguelc and Jeffrey L Bennetzena a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Department of Agronomy and Range Science University of California Davis California 95616 c Purdue University Genomics Core WSLR Purdue University West Lafayette Indiana 47907

Corresponding author Jeffrey L Bennetzen Hansen Bldg Purdue University West Lafayette IN 47907 maizebilbobiopurdueedu (E-mail)

FGENESH (httpwwwsoftberrycomnucleohtml) with the maize training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and

GeneMarkhmm (httpgenemarkbiologygatecheduGene Mark) Functional amp Integrative Genomics DOI 101007s10142-002-0055-5 Issue Volume 2 Numbers 1-2 Date May 2002 Pages 51 - 59 Genomic sequencing reveals gene content genomic organization and recombination relationships in barley Nils Rostoks Yong-Jin Park Wusirika Ramakrishna Jianxin Ma Arnis Druka Bryan A Shiloff Phillip J SanMiguel Zeyu Jiang Robert Brueggeman Devinder Sandhu Kulvinder Gill Jeffrey L Bennetzen Andris Kleinhofs A1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA A2 Department of Biological Sciences Purdue University West Lafayette IN 47907 USA A3 National Center for Genome Resources 2935 Rodeo Park Drive East Santa Fe NM 87505 USA A4 G302 Agronomy Hall Iowa State University Ames IA 50011-1010 USA A5 Department of Agronomy University of Nebraska Lincoln NE 68583 USA A6 School of Molecular Biosciences and Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA version 10 with maize parameters The FGENESH predictions were run at httpwwwsoftberrycom BAC genomic regions were defined Structural organization of the barley D-hordein locus in comparison with its orthologous regions of hellip YQ Gu OD Anderson CF Londeore X Kong RN hellip et al 1997) to search for additional genes In addition FGENESH (httpwwwsoftberrycomberryphtml) and GENESCAN (httpgenes

Published online before print June 20 2002 101073pnas142284999 PNAS | July 9 2002 | vol 99 | no 14 | 9328-9333 The barley stem rust-resistance gene Rpg1 is a novel disease-resistance gene with homology to receptor kinases R Brueggeman N Rostoks D Kudrna A Kilian F Han J Chensect A Druka B Steffensonpara and A Kleinhofs Department of Crop and Soil Sciences Washington State University Pullman WA 99164-6420 para Department of Plant Pathology 495 Borlaug Hall 1991 Upper Buford Circle St Paul MN 55108-6030 and School of Molecular Biosciences Washington State University Pullman WA 99164-4234 Communicated by Diter von Wettstein Washington State University Pullman WA May 13 2002 (received for review March 25 2002) The gene prediction programs GENSCAN (httpgenesmiteduGENSCANhtml) and FGENESH (httpwwwsoftberrycom) as well as NEURAL NETWORK PROMOTER

PREDICTION (httpwwwfruitflyorgseq_toolspromoterhtml) localized the putative transcription start site of the gene about 400 bp upstream of the translation start site Plant Physiol 2002 December 130(4) 1626ndash1635 doi 101104pp012179 Received July 30 2002 Accepted October 1 2002 Contiguous Genomic DNA Sequence Comprising the 19-kD Zein Gene Family from Maize1

Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway New Jersey 08854ndash8020 Corresponding author e-mail messingmbclrutgersedu fax 732ndash445ndash0072 Draft sequences generated from high-throughput DNA sequencing (phase II) were subjected to gene prediction programs with FGENESH (Softberry Inc Mount Kisco NY) The Plant Cell Vol 14 3213-3223 December 2002 Copyright copy 2002 Received July 22 2002 accepted September 26 2002 Structural Analysis of the Maize Rp1 Complex Reveals Numerous Sites and Unexpected Mechanisms of Local Rearrangement Wusirika Ramakrishnaa John Embertona Matthew Ogdena Phillip SanMiguelb and Jeffrey L Bennetzen1a a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 1 To whom correspondence should be addressed E-mail maizebilbobiopurdueedu fax 765-496-1496 FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCAN Plant Physiol 2002 December 130(4) 1728ndash1738

doi 101104pp014951 Comparative Sequence Analysis of the Sorghum Rph Region and the Maize Rp1 Resistance Gene Complex Wusirika Ramakrishna John Emberton Phillip SanMiguel Matthew Ogden Victor Llaca Joachim Messing and Jeffrey L Bennetzen

Department of Biological Sciences Purdue University West Lafayette Indiana 47907 (WR JE MO JLB) Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 (PSM) and Waksman Institute Rutgers University Piscataway New Jersey 08854 (VL JM) Corresponding author e-mail maizebilbobiopurdueedu fax 765ndash496ndash1496 Received September 19 2002 Accepted October 8 2002 hellipAnnotation and sequence analysis were performed as described earlier (Dubcovsky et al 2001 Song et al 2001 Ramakrishna et al 2002a ) FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and GeneMarkhmm (httpopalbiologygatecheduGeneMarkeukhmmcgi) Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four hellipYQ Gu D Coleman-Derr X Kong OD Anderson FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology 2004 - plantphysiolorg - plantphysiolorg - intlplantphysiolorg - ncbinlmnihgov - all 5 versions raquoA Genome-Wide Screen Identifies Genes Required for Centromeric Cohesion JJ Doyle J Denarie F Debelle JC Prome BB Amor hellip 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberryphtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Various programs Plant Molecular Biology Issue Volume 58 Number 3 Date June 2005 Pages 421 ndash 433 DOI 101007s11103-005-5702-5 OsPPR1 a pentatricopeptide repeat protein of rice is essential for the chloroplast biogenesis Kodiveri M Gothandam1 Eun-Sook Kim1 Hongjoo Cho1 and Yong-Yoon Chung1

(1) School of Life Sciences and Biotechnology Korea University Sungbuk-ku 136-701 Seoul Anam-Dong Korea nucleotide and amino acid sequences were analyzed by the Basic Local Alignment SearchTool (BLAST) and the Soft berry prog- rame (httpwwwsoftberrycom)

Plant Physiol February 2002 Vol 128 pp 336-340 wwwplantphysiolorgcgidoi101104pp010875 Received September 25 2001 returned for revision September 29 2001 accepted November 2 2001 Cellulose Synthase-Like Genes of Rice1

Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 The Rice Genome Research Program cDNA clones were of high quality all but one were viable and accurately annotated The one exception D22177 was chimeric containing OsCSLA2 at one end and a predicted DNA-binding protein at the other For all sequences the corresponding

proteins were deduced using gene prediction software from GeneMark (Atlanta httpopalbiologygatecheduGeneMark) and Softberry Inc (White Plains NY httpwwwsoftberrycom) and by manual alignment with the Arabidopsis Csl proteins and with each other IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated Silby et al Microbiology2004 150 518-520 MW Silby PB Rainey SB Levy Using SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of Plant Cell 2002 September 14(9) 2107ndash2119 Received March 25 2002 Accepted June 14 2002 Two Novel Fungal Virulence Genes Specifically Expressed in Appressoria of the Rice Blast Fungus

Chaoyang Xuea Gyungsoon Parka Woobong Choib Li Zhengc Ralph A Deanb and Jin-Rong Xua1

aDepartment of Botany and Plant Pathology Purdue University West Lafayette Indiana 47907 bDepartment of Plant Pathology North Carolina State University Raleigh North Carolina 27606 cSyngenta Agribusiness Biotechnology Research Inc Research Triangle Park North Carolina 27709 1To whom correspondence should be addressed E-mail xubtnypurdueedu fax 765-494-0363 Approximately 12- and 14-kb upstream sequences of GAS1 and GAS2 were sequenced and analyzed with several programs including TRES (wwwbioportalbicnusedusgtres) Expasy (wwwexpasyorg) and SoftBerry (wwwsoftberrycom) European Journal of Neuroscience January 2002 vol 15 no 1 pp 79-86(8) Characterizing CGI-94 (comparative gene identification-94) which is down-regulated in the hippocampus of early stage Alzheimers disease brain Heese K Nakayama T Hata R Masumura M Akatsu H Li F Nagai Y Yamamoto T Kosaka K Suemoto T Sawada T Additionally protein sequence analysis was performed using the following programs at ExPASy httpwwwexpasych softberry httpwwwsoftberrycomindex Journal of Cellular Biochemistry Volume 91 Issue 5 Pages 1030 - 1042 Published Online 25 Feb 2004 Copyright copy 2004 Wiley-Liss Inc A Wiley Company Received 4 September 2003 Accepted 21 November 2003 Characterizing the new transcription regulator protein p60TRP K Heese 1 T Yamada 1 H Akatsu 2 T Yamamoto 2 K Kosaka 2 Y Nagai 1 T Sawada 1

1BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan 2Choju Medical Institute Fukushimura Hospital 19-14 Aza-Yamanaka Noyori Toyohashi Aichi 441-8124 Japan email K Heese (heeseksilverocnnejp) Correspondence to K Heese BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan expasych) softberry httpwwwsoftberry comindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorg toolsaacomp Genome Research 14929-933 2004 ISSN 1088-9051 $500 The Ensembl Core Software Libraries Arne Stabenau1 Graham McVicker1 Craig Melsopp1 Glenn Proctor1 Michele Clamp2 and Ewan Birney13 1 EMBL European Bioinformatics Institute Wellcome Trust Genome Campus Hinxton CB10 1SD UK 2 The Broad Institute Cambridge Massachusetts 02141-2023 USA

2003 ) Finally there have been several commercial genome management products based on proprietary technology from Softberry Celera and Doubletwist Generation of T-DNA tagging lines with a bidirectional gene trap vector 2 and the establishment of hellip G An annotated in the public databases we undertook 274 annotation with the Softberry program (http 275 wwwsoftberrycomberryphtml) Functional clas- 276 Journal of Bacteriology January 2002 p 183-190 Vol 184 No 1 0021-919301$0400+0 DOI 101128JB1841183-1902002 Received 25 July 2001 Accepted 11 October 2001 Regulation of the acuF Gene Encoding Phosphoenolpyruvate Carboxykinase in the Filamentous Fungus Aspergillus nidulans Michael J Hynes Oliver W Draht and Meryl A Davis Department of Genetics University of Melbourne Parkville Victoria 3010 Australia The Protein Sequence Analysis program (httpwwwsoftberrycomproteinhtml) predicted a PEPCK (ATP) signature sequence between amino acids 275 and 290 Proteins Structure Function and GeneticsVolume 53 Issue S6 Pages 352 - 368 Supplement Fifth Meeting on the Critical Assessment of Techniques for Protein Structure Prediction Published Online 15 Oct 2003 Received 4 March 2003 Accepted 23 June 2003 Digital Object Identifier (DOI) 101002prot10543 Comp Modeling Assessment Assessment of homology-based predictions in CASP5 Anna Tramontano 1 Veronica Morea 2

1Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Rome Italy 2CNR Institute of Molecular Biology and Pathology University of Rome La Sapienza Rome Italy email Anna Tramontano (AnnaTramontanouniromalit) Correspondence to Anna Tramontano Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Ple Aldo Moro 5-00185 Rome Italy hellipPage 1 Assessment of Homology-Based Predictions in CASP5 Anna Tramontano 1 and Veronica Morea 2 1 Department of Biochemical Sciences American Journal of HematologyVolume 73 Issue 3 Pages 161 - 168 Published Online 20 Jun 2003 Received 23 October 2002 Accepted 15 April 2003 Digital Object Identifier (DOI) 101002ajh10358

Spectrum of thalassemia mutations and HbF levels in the heterozygous Moroccan population Wafaa Lemsaddek 1 Isabel Picanccedilo 2 Filomena Seuanes 2 Lahoucine Mahmal 3 Saacircd Benchekroun 3 Mohammed Khattab 4 Paulo Nogueira 5 Leonor Osoacuterio-Almeida 1

1Laboratoacuterio de Geneacutetica Molecular Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa Caparica Portugal 2Laboratoacuterio de Hematologia Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal 3Service Heacutemato-Oncologie Hocircpital 20 Ao t CHU Ibn Rochd Casablanca Morocco 4Service Heacutemato-Oncologie Peacutediatrique Hocircpital dEnfants CHU Rabat Morocco 5Observatoacuterio Nacional de Sauacutede Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal email Leonor Osoacuterio-Almeida (mlofctunlpt) Correspondence to Leonor Osoacuterio-Almeida Laboratoacuterio de Geneacutetica Molecular Secccedilatildeo Autoacutenoma de Biotecnologia Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa 2829-516 Caparica Portugal hellip Page 1 Spectrum of Thalassemia Mutations and HbF Levels in the Heterozygous Moroccan Population Wafaa Lemsaddek 1 Isabel Picanccedilo Lecture Notes in Computer Science Publisher Springer-Verlag Heidelberg ISSN 0302-9743 Subject Computer Science Volume 2812 2003 Title Algorithms in Bioinformatics Third International Workshop WABI 2003 Budapest Hungary September 15-20 2003 Proceedings Editors Gary Benson Roderic Page ISBN 3-540-20076-2 DOI 101007b13243 Chapter pp 124 - 138 Online Date December 2003 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron1 and Irena Rusu1

(1) IRIN Universiteacute de Nantes 2 Rue de la Houssiniegravere BP 92208 44322 Nantes Cedex 3 France Page 1 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron and Irena Rusu Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Received 23 October 2003 accepted 15 December 2003 Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes

H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Brain Aging Vol 2 No 3 2002 Page 9-22 Long-term Efficacy of Cholinesterase Inhibitors Serge Gauthier McGill Centre for Studies in Aging Quebec Canada Correspondence Dr Serge Gauthier FRCPC McGill Centre for Studies in Aging 6825 LaSalle Boulevard Verdun Quebec Canada H4H 1R3 Tel +1 514 766 2010 Fax +1 514 888 4050 Email sergegauthiermcgillca hellipAdditionally protein sequence analysis was performed using the following programs at the ExPASyndashwwwndashserver (httpwwwexpasych) softberry httpwwwsoftberrycomindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorgtoolsaacomp Plant Physiology December 2003 Vol 133 pp 2040ndash2047 Generation and Analysis of End Sequence Database for T-DNA Tagging Lines in Rice1 Suyoung An2 Sunhee Park2 Dong-Hoon Jeong Dong-Yeon Lee Hong-Gyu Kang Jung-Hwa Yu Junghe Hur Sung-Ryul Kim Young-Hea Kim Miok Lee Soonki Han Soo-Jin Kim Jungwon Yang Eunjoo Kim Soo Jin Wi Hoo Sun Chung Jong-Pil Hong Vitnary Choe Hak-Kyung Lee Jung-Hee Choi Jongmin Nam Seong-Ryong Kim Phun-Bum Park Ky Young Park Woo Taek Kim Sunghwa Choe Chin-Bum Lee and Gynheung An National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790ndash784 Korea (SA SP D-HJ D-YL H-GK J-HY JH S-RK Y-HK ML GA) Department of Life Science Sogang University Seoul 121ndash742 Korea (SH S-JK S-RK) Department of Genetic Engineering Suwon University Suwon 445ndash743 Korea (JY EK P-BP) Department of Biology Sunchon National University Sunchon 540ndash742 Korea (SJW KYP) Department of

Biology Yonsei University Seoul 120ndash749 Korea (HSC J-PH WTK) Department of Biology Seoul National University Seoul 151ndash747 Korea (VC SC) Department of Biology Dong-eui University Pusan 614ndash714 Korea (H-KL J-HC C-BL) and Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University 208 Mueller Laboratory University Park Pennsylvania 16802 (JN) hellipIf a particular sequence had not yet been annotated in the public database the sequence surrounding the insertion site was annotated using the Softberry program (httpwwwsoftberrycom) and the GeneMark program (httpopalbiologygatecheduGeneMark) Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie F Moehrlen ndash Page 1 Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie Frank Moumlhrlen Heidelberg 2002 Page 2 INAUGURAL-DISSERTATION zur heidiubuni-heidelbergde

  • FGENES
  • FGENESH
  • FGENESH++
    • Genome Research 15566-576 2005
    • ECgene Genome-based EST clustering and gene modeling for alternative splicing
    • Genome Research 14685-692 2004
    • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
    • Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes
    • Genome Research
      • Computing Center Academia Sinica Taipei 11529 Taiwan
        • FGENESH+
          • Genome Research 14685-692 2004
          • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
            • Computing Center Academia Sinica Taipei 11529 Taiwan
                • FGENESB
                  • Proteorhodopsin genes are distributed among divergent marine bacterial taxa
                  • Different SAR86 subgroups harbour divergent proteorhodopsins
                    • FGENESV
                      • Genome Organization of the SARS-CoV
                        • FGENES-M
                        • BESTORF
                        • PROTCOMP
                          • A proteomic study of the arabidopsis nuclear matrix
                          • Gene prediction in eukaryota
                            • BPROM
                            • SPLICEDB
                              • The evolving roles of alternative splicing
                              • SpliceDB database of canonical and non-canonical mammalian splice sites
                                • SCAN2
                                  • PromH promoters identification using orthologous genomic sequences
                                    • PDISORDER
                                    • SPL
                                    • NSITE
                                    • TSSP
                                    • PLANTPROM
                                      • Plant promoter prediction with confidence estimation
                                      • PlantProm a database of plant promoter sequences
                                        • PROMH
                                        • Other FGENESH
                                          • TAG Theoretical and Applied Genetics
                                            • MIPS analysis and annotation of proteins from whole genomes
                                              • Gene expression of a gene family in maize based on noncollinear haplotypes
                                              • TAG Theoretical and Applied Genetics
                                              • Genome annotation techniques new approaches and challenges
                                                • Various programs
                                                  • Characterizing the new transcription regulator protein p60TRP
                                                  • The Ensembl Core Software Libraries
                                                  • Lecture Notes in Computer Science
                                                  • Long-term Efficacy of Cholinesterase Inhibitors
Page 6: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)

EFR) Department of Plant Pathology University of California Davis California 95616 (AC JG DJK DRC) The Institute for Genomic Research Rockville Maryland 20850 (FC CDT) Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma 73019 (BAR SK) Plant Biology Division The Samuel Roberts Noble Foundation Ardmore Oklahoma 73401 (GDM) and North Carolina State University Fungal Genomics Laboratory Department of Plant Pathology Raleigh North Carolina 27695 (DB) 1 This work was supported by the National Science Foundation (awards DBIndash0321460 DBIndash0196197 DBIndash0110206 DBIndash9975806 and DBIndash9872565) by the US Department of Agriculture Cooperative State Research Education and Extension ServiceNational Research Initiative Program and by the Samuel Noble Roberts Foundation 2 Present address BioTeam Inc Cambridge MA 3 Present address University of Minnesota Cancer Center MMC 806 420 Delaware St SE Minneapolis MN 55455 wwwplantphysiolorgcgidoi101104pp104059204 Corresponding author e-mail ernestccgbumnedu fax 612ndash626ndash6069 Received December 31 2004 returned for revision March 4 2005 accepted March 21 2005 Annotation involves a multi-institution pipeline relying on Medicago-trained FGENESH (Salamov and Solovyev 2000 ) predictions the EuGene (Foissac et al Plant Physiology May 2005 Vol 138 pp 18-26 BIOINFORMATICS-PLANT DATABASES The Institute for Genomic Research Osa1 Rice Genome Annotation Database1 Qiaoping Yuan Shu Ouyang Aihui Wang Wei Zhu Rama Maiti Haining Lin John Hamilton Brian Haas Razvan Sultana Foo Cheung Jennifer Wortman and C Robin Buell

2

The Institute for Genomic Research Rockville Maryland 20850 1 This work (on rice genome annotation) was supported by the National Science Foundation (grant no DBIndash0321538 to CRB) and the US Department of Agriculture (grant no 2003ndash35317ndash13173 to CRB) 2 Present address Laboratory of Neurogenetics NIAAA NIH 5625 Fishers Lane Suite 3532 MSC 9412 Bethesda MD 20892 wwwplantphysiolorgcgidoi101104pp104059063 Corresponding author e-mail rbuelltigrorg fax 301ndash838ndash0208 Received December 31 2004 returned for revision February 24 2005 accepted March 21 2005 The ab initio gene finders used in the rice EGC pipeline include FGENESH (monocot matrix Salamov and Solovyev 2000 ) GeneMarkhmm (rice matrix Lukashin and Genome Research 15577-582 2005 Closing in on the C elegans ORFeome by cloning TWINSCAN predictions Chaochun Wei Philippe Lamesch Manimozhiyan Arumugam Jennifer Rosenberg Ping Hu Marc Vidal and Michael R Brent

1 2 1 2

1 2 13

1 Laboratory for Computational Genomics and Department of Computer Science and Engineering Washington University St Louis Missouri 63130 USA 2 Center for Cancer Systems Biology Dana-Farber Cancer Institute and Department of Genetics Harvard Medical School Boston Massachusetts 02115 USA

3 Corresponding author E-mail brentcsewustledu fax (314) 935-7302 Finally we compared TWINSCAN with two other gene-prediction systems that have recently been developed for nematodesndashFGENESH (Salamov and Solovyev 2000 Plant Physiology April 2005 Vol 137 pp 1174-1181 UPDATE ON SEQUENCING MEDICAGO TRUNCATULA AND LOTUS JAPONICUS Sequencing the Genespaces of Medicago truncatula and Lotus japonicus1 Nevin D Young Steven B Cannon Shusei Sato Dongjin Kim Douglas R Cook Chris D Town Bruce A Roe and Satoshi Tabata Department of Plant Pathology University of Minnesota St Paul Minnesota 55108 (NDY SBC) Kazusa DNA Research Institute Kisarazu Chiba 292ndash0818 Japan (SS ST) Department of Plant Pathology University of California Davis California 95616 (DRC DK) The Institute for Genomic Research Rockville Maryland 20850 (CDT) and Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma 73019 (BAR) 1 The US component of the Medicago truncatula sequencing effort was initially supported by a grant from the Samuel Roberts Noble Foundation to BAR Current support comes from National Science Foundation Plant Genome Research Program (grant no 0110206 to DRC DK CDT and NDY and grant no 0321460 to NDY BAR and CDT) Funding for Lotus japonicus sequencing comes from the Kazusa DNA Research Institute Foundation wwwplantphysiolorgcgidoi101104pp104057034 Corresponding author e-mail nevinyumnedu fax 612ndash625ndash9728 hellip (These estimates are based on Fgenesh predictions [Salamov and Solovyev 2000 ] using a Mt-trained matrix retaining peptides with a BLASTP match at 10endash4 to the UniProt NREF100 database of peptides [Apweiler et al 2004 ] This estimate for Lj differs from the published value of 1 gene per 101 kb [Asamizu et al 2003a ] due to the use here of the Fgenesh gene-calling algorithm so Mt and Lj could be compared directly)hellip hellipThis estimate increases to 6500 when Lj genes are predicted by the Mt-trained Fgenesh algorithm described earlierhellip International Journal of CancerVolume 109 Issue 1 Pages 71 - 75Cancer Genetics Candidate regions of tumor suppressor locus on chromosome 9q311 in gastric cancer Naoto Kakinuma 1 Kazuyoshi Kohu 1 2 Masaaki Sato 1 Tatsuya Yamada 3 Motowo Nakajima 1 Tetsu Akiyama 2 Susumu Ohwada 3 Yasuhiko Shibanaka 1

1Novartis Pharma Tsukuba Research Institute Ibaraki Japan 2Laboratory of Molecular and Genetic Information Institute for Molecular and Cellular Biosciences University of Tokyo Tokyo Japan 3Second Department of Surgery Gunma University School of Medicine Gunma Japan email Yasuhiko Shibanaka (yasuhikoshibanakapharmanovartiscom) Correspondence to Yasuhiko Shibanaka Novartis Pharma Tsukuba Research Institute Ohkubo 8 Tsukuba-shi Ibaraki 300-2611 Japan Fax +81-29-865-2281

predict the genes between D9S277 and D9S127 in 9q311 the gene prediction tools also in the UCSC Genome Browser having Acembly Ensembl Fgenesh GenScan and Genome Research 1554-66 2005 Gene and alternative splicing annotation with AIR Liliana Florea145 Valentina Di Francesco2 Jason Miller1 Russell Turner1 Alison Yao2 Michael Harris2 Brian Walenz1 Clark Mobarry1 Gennady V Merkulov3 Rosane Charlab3 Ian Dew1 Zuoming Deng3 Sorin Istrail1 Peter Li2 and Granger Sutton1 1 Informatics Research Applied Biosystems Rockville Maryland 20850 USA 2 Advanced Solutions Celera Genomics Rockville Maryland 20850 USA 3 Scientific Content and Applications Celera Genomics Rockville Maryland 20850 USA Ab initio prediction programs such as GenScan (Burge and Karlin 1997 ) FGenesH (Salamov and Solovyev 2000 ) Genie (Kulp et al Nucleic Acids Research 2005 Vol 33 Database issue D399-D402 SilkDB a knowledgebase for silkworm biology and genomics Jing Wang Qingyou Xia Ximiao He Mingtao Dai Jue Ruan Jie Chen Guo Yu Haifeng Yuan Yafeng Hu Ruiqiang Li Tao Feng Chen Ye Cheng Lu Jun Wang Songgang Li Gane Ka-Shu Wong Huanming Yang Jian Wang Zhonghuai Xiang Zeyang Zhou and Jun Yu

1 2 34 3 34 3 3

3 3 3 3 3 2 135

1 36 35 35 2

2 35

1 College of Life Sciences Peking University Beijing 100871 China 2 The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China 3 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 4 Graduate School of the Chinese Academy of Sciences Yuquan Road 19A Beijing 100039 China 5 Key Laboratory of Bioinformatics of Zhejiang Province Hangzhou Genomics Institute James D Watson Institute of Genome Sciences of Zhejiang University Hangzhou 310007 China and 6 Department of Medicine University of Washington Genome Center Seattle WA 98195 USA To whom correspondence should be addressed Tel +86 10 80481455 Fax +86 10 80498676 Email junyugenomicsorgcn Correspondence may also be addressed to Zeyang Zhou Tel +86 23 68251123 Fax +86 23 68251128 Email zyzhouswaucqcn The authors wish it to be known that in their opinion the first three authors should be regarded as joint First Authors BGF is a self-developed ab initio program based on GenScan (9) and FgeneSH (10) and was successfully utilized for our rice genome annotation (11) Clinical Cancer Research Vol 11 4029-4036 June 1 2005 Imaging Diagnosis Prognosis A Molecular Signature in Superficial Bladder Carcinoma Predicts Clinical Outcome Lars Dyrskjoslasht Karsten Zieger Mogens Kruhoslashffer Thomas Thykjaer Jens L Jensen Hanne Primdahl Natasha Aziz Niels Marcussen Klaus Moslashller and Torben F Oslashrntoft

1 12 15 15 4

1 6 3 2 1

Authors Affiliations 1 Molecular Diagnostic Laboratory Department of Clinical Biochemistry 2 Department of Urology and 3 University Institute of Pathology Aarhus University Hospital 4 Departments of Theoretical Statistics

and Mathematical Sciences University of Aarhus 5 Aros Applied Biotechnology Aarhus Denmark and 6 Eos BiotechnologyProtein Design Labs Fremont California Requests for reprints Torben F Oslashrntoft Molecular Diagnostic Laboratory Department of Clinical Biochemistry Aarhus University Hospital Skejby DK-8200 Aarhus N Denmark Phone 45-89495100 Fax 45-89496018 E-mail orntoftkiaudk array comprising 59619 probe sets representing 46000 unique sequences including known genes expressed sequence tag clusters and FGENESH-predicted exons BMC Evolutionary Biology 2005 51 doi1011861471-2148-5-1 Research article The WRKY transcription factor superfamily its origin in eukaryotes and expansion in plants Yuanji Zhang and Liangjiang Wang Address Plant Biology Division The Samuel Roberts Noble Foundation Ardmore OK 73402 USA Email Yuanji Zhang - yjzhangnobleorg Liangjiang Wang - Kevinlwangaolcom Corresponding author Despite minor differences in the gene structure prediction both gene prediction programs FGENESH and GENSCAN agree on the major features of the protein Nature Immunology 2005 v6 n3 The immunoglobulin heavy-chain locus in zebrafish identification and expression of a previously hellip N Danilova J Bussmann K Jekosch LA Steiner - - naturecom Nature Immunology Full text access provided to Googlebot Access by Web Services PLoS Biol 2005 June 3(6) e181 Published online 2005 May 24 doi 101371journalpbio0030181 RAG1 Core and V(D)J Recombination Signal Sequences Were Derived from Transib Transposons Vladimir V Kapitonov 1 and Jerzy Jurka 1

1Genetic Information Research Institute Mountain View California United States of America David Nemazee Academic EditorScripps Research Institute United States of America

Corresponding author Vladimir V Kapitonov vladimirgirinstorg Jerzy Jurka jurkagirinstorg Using FGENESH [33] we detected that the RAG1 corendashlike open reading frame (ORF) in the contig 29068 forms a terminal exon (positions 1154ndash2947) of an Genetics Published Articles Ahead of Print published on February 16 2005 as 101534genetics104036327 Identification and Characterization of Regions of the Rice Genome Associated with Broad- Spectrum Quantitative Disease Resistance Randall J Wisser Qi Sundagger Scot H Hulbertsect Stephen Kresovich and Rebecca J Nelsondaggerdagger 1 Department of Plant Breeding and Genetics Institute for Genomic Diversity Cornell University Ithaca New York 14853 daggerComputational Biology Service Unit Cornell Theory

Center Cornell University Ithaca New York 14853 sectDepartment of Plant Pathology Kansas State University Manhattan Kansas 66506 and daggerdaggerDepartment of Plant Pathology Cornell University Ithaca New York 14853 1Corresponding author Rebecca J Nelson Department of Plant Pathology Cornell University 321 Plant Science Ithaca NY 14853 Email rjn7cornelledu GENSCAN (B URGE and K ARLIN 1997) and FGENESH (S ALAMOV and S OLOVYEV 2001) to predict open reading frames Further searches against Plant Physiology January 2005 Vol 137 pp 176-189 Annotations and Functional Analyses of the Rice WRKY Gene Superfamily Reveal Positive and Negative Regulators of Abscisic Acid Signaling in Aleurone Cells1[w]

Zhen Xie Zhong-Lin Zhang Xiaolu Zou Jie Huang Paul Ruas Daniel Thompson and Qingxi J Shen

2 23

Department of Biological Sciences University of Nevada Las Vegas Nevada 891541 This work was supported by the US Department of Agriculture (grant no 02ndash35301ndash12066) by the National Institutes of Health (Biomedical Research Infrastructure Network seed grant no P20 RR16464) and by the University of Las Vegas Nevada (start-up funds to QJS) XZ was supported by a National Science Foundation Experimental Program to Stimulate Competitive

Research (EPSCoR) Integrative Approaches to Abiotic Stress (EPSndash0132556) graduate assistantship and Z-LZ by a National Science Foundation EPSCoR Advanced Computing in Environmental Sciences postdoctoral fellowship 2 These authors contributed equally to the paper 3 Present address Department of Plant and Microbial Biology University of California Berkeley CA 94720 and Plant Gene Expression Center US Department of Agriculture Albany CA 94710 [w] The online version of this article contains Web-only data First of all three genes (OsWRKY41 -43 and -44) were reannotated using FGENESH (wwwsoftberrycom) because the first introns of these genes were too small PLoS Biol 3(6) e181 (2005) RAG1 Core and V(D)J Recombination Signal Sequences Were Derived from Transib Transposons Vladimir V Kapitonov1 Jerzy Jurka1 1 Genetic Information Research Institute Mountain View California United States of America To whom correspondence should be addressed E-mail vladimirgirinstorg (VVK) E-mail jurkagirinstorg (JJ) Using FGENESH [33] we detected that the RAG1 corendashlike open reading frame (ORF) in the contig 29068 forms a terminal exon (positions 1154ndash2947) of an Plant and Cell Physiology 2005 46(1)3-13 doi101093pcppci503 From Mapping to Sequencing Post-sequencing and Beyond Takuji Sasaki Takashi Matsumoto Baltazar A Antonio and Yoshiaki Nagamura 1

National Institute of Agrobiological Sciences 2-1-2 Kannondai Tsukuba Ibaraki 305-8602 Japan 1 Corresponding author E-mail tsasakiniasaffrcgojp

The gene predictions by programs such as Genescan (Burge and Karlin 1997 ) FGENESH [see Appendix 1 (4)] and Genemark [see Appendix 1 (5)] BLAST (Altschul et Improving the nutritional value of Golden Rice through increased pro-vitamin A content JA Paine CA Shipton S Chaggar RM Howells MJ hellip - Nature Biotechnology 2005 - naturecom Arabidopsis thaliana psy and rice psy (AY024351) genes identified genomic sequences of similarity in which genes were predicted using FGENESH algorithm with Genetics Published Articles Ahead of Print published on January 16 2005 as 101534genetics104035543 THE GENETIC BASIS FOR INFLORESCENCE VARIATION BETWEEN FOXTAIL AND GREEN MILLET (POACEAE) Andrew N Doust Katrien M Devosdagger1 Mike D Gadberry Mike D Galedagger amp Elizabeth A Kellogg University of Missouri-St Louis Department of Biology One University Boulevard St Louis MO 63121 USA daggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH UK Current address University of Georgia-Athens Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building Athens GA 30602 USA 1 Each of these contigs was scanned using FgeneSH (S ALAMOV and S OLOVYEV 2000) and open reading frames (ORFs) were translated and PLoS Biol 3(1) e13 January 2005

Sorghum Genome Sequencing by Methylation Filtration Joseph A Bedell1 Muhammad A Budiman2 Andrew Nunberg1 Robert W Citek1 Dan Robbins1 Joshua Jones2 Elizabeth Flick2 Theresa Rohlfing3 Jason Fries3 Kourtney Bradford3 Jennifer McMenamy3 Michael Smith4 Heather Holeman4 Bruce A Roe5 Graham Wiley5 Ian F Korf6 Pablo D Rabinowicz7 Nathan Lakey8 W Richard McCombie9 Jeffrey A Jeddeloh4 Robert A Martienssen9 1 Bioinformatics Orion Genomics Saint Louis Missouri United States of America 2 Library Construction Orion Genomics Saint Louis Missouri United States of America 3 Sequencing Orion Genomics Saint Louis Missouri United States of America 4 Biomarkers Orion Genomics Saint Louis Missouri United States of America 5 Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma United States of America 6 Genome Center University of California Davis California United States of America 7 The Institute for Genomic Research Rockville Maryland United States of America 8 Business Orion Genomics Saint Louis Missouri United States of America 9 Cold Spring Harbor Laboratory Cold Spring Harbor New York United States of America additional parameters wordmask=seg lcmask M=1 N=ndash1 Q=3 R=3 kap E=1e-10 hspmax=0 To look for potentially novel genes we used Fgenesh (httpwww BMC Genomics 2005 611 doi1011861471-2164-6-11 FAM20 an evolutionarily conserved family of secreted proteins expressed in hematopoietic cells Demet Nalbant1 Hyewon Youn1 3 S Isil Nalbant1 Savitha Sharma1 Everardo Cobos2 3 Elmus G Beale1 Yang Du1 and Simon C Williams1 3 1Department of Cell Biology and Biochemistry Texas Tech University Health Sciences Center

Lubbock Texas 79430 USA 2Department of Internal Medicine Texas Tech University Health Sciences Center Lubbock Texas 79430 USA 3Southwest Cancer Center at University Medical Center Lubbock Texas 79430 USA These results were compared against genes assembled by two gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmit Plant Physiology July 2005 Vol 138 pp 1205-1215 Complex Organization and Evolution of the Tomato Pericentromeric Region at the FER Gene Locus1[w] Romain Guyot Xudong Cheng Yan Su Zhukuan Cheng Edith Schlagenhauf Beat Keller and Hong-Qing Ling

2 2

Institute of Plant Biology University of Zurich 8008 Zurich Switzerland (RG ES BK H-QL) and Institute of Genetics and Developmental Biology Chinese Academy of Sciences Chaoyang District Beijing 100101 China (XC YS ZC H-QL) Putative genes were determined by a combination of coding region prediction software (GENSCAN FGENESH and MZEF with Arabidopsis andor monocot matrix J Gen Virol 86 (2005) 973-983 DOI 101099vir080833-0 Cloning characterization and analysis by RNA interference of various genes of the Chelonus inanitus polydnavirus Marianne Bonvin Dorothee Marti Stefan Wyder Dejan Kojic Marc Annaheim and Beatrice Lanzrein Institute of Cell Biology University of Berne Baltzerstrasse 4 CH-3012 Bern Switzerland Correspondence Beatrice Lanzrein beatricelanzreinizbunibech 12g1forw (5-GAGTCCATGCCGAATGTCAC-3) and 12g1rev (5-CTTCTTGCACAGCGACGAAC-3) were set to amplify the middle region of 12g1 as predicted with FGENESH 10 and The Plant Cell 17343-360 (2005) Evolution of DNA Sequence Nonhomologies among Maize Inbreds Stephan Brunner Kevin Fengler Michele Morgante Scott Tingey and Antoni Rafalski a1 a b a a

a DuPont Crop Genetics Research Wilmington Delaware 19880-353 b Universitaacute degli Studi di Udine Dipartimento di Scienze Agrarie ed Ambientali 33100 Udine Italy 1 To whom correspondence should be addressed E-mail stephanbrunnercgrdupontcom fax 302-695-2726 PNAS | February 1 2005 | vol 102 | no 5 | 1566-1571 Published online before print January 24 2005 101073pnas0409421102 A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome

Mark Yandell Adina M Bailey Sima Misra ShengQiang Shu Colin Wiel Martha Evans-Holm Susan E Celniker and Gerald M Rubin para para Howard Hughes Medical Institute and Department of Molecular and Cell Biology University of California Life Sciences Addition Berkeley CA 94720-3200 and paraDepartment of Genome Sciences Lawrence Berkeley National Laboratory One Cyclotron Road Mailstop 64-121 Berkeley CA 94720 genes based on a microarray-based approach that involved hybridizing randomly primed cDNA against probes corresponding to a large set of FGENESH predictions Insect Molecular Biology Volume 14 Issue 2 Page 113 - 119 April 2005 doi101111j1365-2583200400536x Detection and analysis of alternative splicing in the silkworm by aligning expressed sequence tags with the genomic sequence X-F Zha Q-Y Xia P Zhao J Li J Duan Z-L Wang J-F Qian and Z-H Xiang Correspondence Dr Qing-You Xia The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China Tel +86 23 68250748 fax +86 23 68251128 e-mail xiaqyswaucqcn previously predicted silkworm genes in the genomic sequences by BGF a newly developed program based on GENSCAN (Burge amp Karlin 1997) and Fgenesh (Salamov amp Microbiology 151 (2005) 2199-2207 DOI 101099mic027962-0 Overproduction purification and characterization of FtmPT1 a brevianamide F prenyltransferase from Aspergillus fumigatus Alexander Grundmann and Shu-Ming Li Pharmazeutische Biologie Pharmazeutisches Institut Eberhard-Karls-Universitaumlt Tuumlbingen Auf der Morgenstelle 8 72076 Tuumlbingen Germany Correspondence Shu-Ming Li shumingliuni-tuebingende FGENESH (Softberry Inc httpwwwsoftberrycomberryphtml) and the DNASIS software package (version 21 Hitachi Software Engineering) were used for

FGENESH++ Am J Hum Genet 76652-662 2005 Position Effects Due to Chromosome Breakpoints that Map 900 Kb Upstream and 13 Mb

Downstream of SOX9 in Two Patients with Campomelic Dysplasia

Gopalrao V N Velagaleti12 Gabriel A Bien-Willner3 Jill K Northup1 Lillian H Lockhart2

Judy C Hawkins2 Syed M Jalal6 Marjorie Withers3 James R Lupski345 and

Pawel Stankiewicz3 Departments of 1Pathology and 2Pediatrics University of Texas Medical Branch Galveston

Departments of 3Molecular and Human Genetics and 4Pediatrics Baylor College of Medicine and

5Texas Childrens Hospital Houston and 6Department of Laboratory Medicine and Pathology Mayo Clinic Rochester MN hellipIn an effort to identify possible transcripts that may be responsible for the CD phenotype we

used several gene-prediction programs and identified seven hypothetical transcripts in the region

that spans 100 kb in either direction from the breakpoint on chromosome 17 Ecgenes H17C123061 and H17C123081 SGP genes Chr17_15381 and Ch17_15391 Fgenesh++ gene C17001650 and Genscan genes NT_01064144 and NT_01064145hellip Genome Research 15566-576 2005 ECgene Genome-based EST clustering and gene modeling for alternative splicing Namshin Kim Seokmin Shin and Sanghyuk Lee12 2 13 1 Division of Molecular Life Sciences Ewha Womans University Seoul 120-750 Korea 2 School of Chemistry Seoul National University Seoul 151-747 Korea hellipthe structure of full-length mRNA can be inferred by examining the flanking genomic region especially with the aid of ab initio gene predicting programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh++ (Salamov and Solovyev 2000 )hellip Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA hellip we built a three-way synteny map based on chains of Fgenesh++-predicted (Solovyev 2002 ) exons rather than whole genes hellip Genome Research 14539-548 2004 Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of

Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA hellipThe gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom )hellip Nucleic Acids Research 2003 Vol 31 No 1 207-211 copy 2003 Oxford University Press The PEDANT genome database Dmitrij Frishman Martin Mokrejs Denis Kosykh Gabi Kastenmuumlller Grigory Kolesov Igor Zubrzycki Christian Gruber Birgitta Geier Andreas Kaps Kaj Albermann Andreas Volz Christian Wagner Matthias Fellenberg Klaus Heumann and Hans-Werner Mewes

1 1 1 1 1

1 2 2 2 2

2 2 2 2 13

1 Institute for Bioinformatics GSF - National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 85764 Neueherberg Germany 2 Biomax Informatics AG Lochhamer Straszlige 11 82152 Martinsried Germany 3 Department of Genome-oriented Bioinformatics Wissenschaftszentrum Weihenstephan Technische Universitaumlt Muumlnchen 85350 Freising Germany To whom correspondence should be addressed Tel +49 89 31874201 Fax +49 89 31873585 Email dfrishmangsfde The mouse database contains 20 chromosome contigs with 37 793 genes predicted using the Fgenesh++ software (wwwsoftberrycom) Reprint from Daily Biotech Updates www genengnewscom Vol 22 No 17 October 1 2002 DrugDiscovery Tech NoteAn Enhanced Human-Genome Database Transforming Raw Human Sequence Data Into Useful Information Christine Schuumlller PhD and Andreas Fritz PhD The Softberry analysis results for which Biomax has the exclusive world-wide commercial license contain approximately 40000 genes which agrees well with predictions of the total number of human genes (according to the International Human Genome Sequencing Consortium or IHGSC) hellip For example 50 of the genes in the Biomax Human Genome Database are not found in the Ensembl database These genes (identified by FGENESH++ and Biomax and not found in Ensembl database) comprise the following 6 of genes classified as known genes 50 classified as having some similarity to known genes and 90 of the genes not having similarity to known genes

For human genome applications the FGENESH++ software was first used to map known human genes using sequences available from the Reference Sequence (RefSeq) Project at the Nation al Center for Biotechnology Information (NCBI Bethesda MD wwwncbinlm nihgovLocusLinkrefseqhtml) REFERENCES 1 Salamov AA and Solovyev VVAb initio gene finding in Drosophila genomic DNA Genome Res 10 391ndash7 (2000) Genome Research 14539-548 2004 ISSN 1088-9051 $500 Letter Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA The gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom ) Published online before print June 12 2003 101101gr529803 Genome Research 131765-1774 2003 ISSN 1088-9051 $500 Identification of Promoter Regions in the Human Genome by Using a Retroviral Plasmid Library-Based Functional Reporter Gene Assay Shirin Khambata-Ford15 Yueyi Liu2 Christopher Gleason1 Mark Dickson3 Russ B Altman2 Serafim Batzoglou4 and Richard M Myers136 1 Department of Genetics Stanford University School of Medicine Stanford California 94305 USA 2 Stanford Medical Informatics Stanford University School of Medicine Stanford California 94305 USA 3 Stanford Human Genome Center Stanford University School of Medicine Stanford California 94305 USA 4 Department of Computer Science Stanford University Stanford California 94305 USA hellipOf 858 sequences 9 of GFP+ low clones and 8 of GFP+ high clones aligned to the 2-kb

segment upstream of the transcription start site of a predicted gene in at least two of four data sets of predicted genes from Genscan Ensembl Softberry (Fgenesh++) and Acembly (category

B in Table 1) Cell Vol 110 521ndash529 August 23 2002 Copyright 2002 by Cell Press HIV-1 Integration in the Human Genome Favors Active Genes and Local Hotspots

Astrid RW Schrouml der1 Paul Shinn2 Huaming Chen2 Charles Berry3 Joseph R Ecker2 and Frederic Bushman14 1Infectious Disease Laboratory 2Genomic Analysis Laboratory The Salk Institute 10010 North Torrey Pines Road La Jolla California 92037 3Department of FamilyPreventive Medicine School of Medicine University of California San Diego San Diego California 92093 hellipAn integration target sequence was scored as a part of a transcrip-tion unit if it was (1) a member of the Refseq set of well-studied genes (httpwwwncbinlmnihgovLocusLinkrefseqhtml) or (2) if it was predicted to be a transcription unit by the ENSEMBLE (httpwwwensemblorg) or Fgenesh++ (httpwwwsoftberrycomHelpfgeneshplus2htm) programs and if that assignment was supported by mRNA or spliced EST sequence evidence Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation

FGENESH+ Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA Fgenesh+ gene prediction is conducted on sequences with protein homology Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation Annual Review of Genomics and Human Genetics Vol 3 293-310 (Volume publication date September 2002) (doi101146annurevgenom3030502101529) DATABASES AND TOOLS FOR BROWSING GENOMES Ewan Birney 1 Michele Clamp and 2 Tim Hubbard2 1European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom e-mail birneyebiacuk 2Wellcome Trust Sanger Institute Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom michelesangeracuk thsangeracuk

Another predicted gene track on the UCSC browser comes from Softberry ( http wwwsoftberrycom ) and uses a program Fgenesh+ which is based on HMMs and

FGENESB Appl Environ Microbiol 2004 April 70(4) 2332ndash2341 Oxygen-Controlled Bacterial Growth in the Sponge Suberites domuncula toward a Molecular Understanding of the Symbiotic Relationships between Sponge and Bacteriadagger

Werner E G Muumlller Vladislav A Grebenjuk Narsinh L Thakur Archana N Thakur Renato Batel Anatoli Krasko Isabel M Muumlller and Hans J Breter Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz D-55099 Mainz Germany Corresponding author Mailing address Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz Duesbergweg 6 55099 Mainz Germany Phone 6131-3925910 Fax 6131-3925243 E-mail wmuellermailuni-mainzde For genes and potential promoter prediction we used the FGENESB-PatternMarkov chain-based bacterial operon and gene prediction program from the SoftBerry Journal of Theoretical Biology 230 (2004) 133ndash144 Computational prediction of conserved operons and phylogenetic footprinting of transcription regulatory elements in the metal-reducing bacterial family Geobacteraceae Bin Yana Barbara A Metheґ b Derek R Lovleyc Julia Krushkala aDepartment of Preventive Medicine Center of Genomics and Bioinformatics University of Tennesee Health Science Center 66 N Pauline St Ste 633 Memphis TN 38163 USA bThe Institute for Genomic Research Rockville MD USA cDepartment of Microbiology Morrill Science Center IV North University of Massachusetts 639 North Pleasant Str Amherst MA 01003 USA the conserved nature of the operons 2 Operons in Geobacter sulfurreducens were predicted ab initio by the public version of program FGENESB (V Solovyev and V Molecular Microbiology Volume 52 Issue 6 Page 1579 -1596 June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Copyright copy 2003 The National Academy of Sciences Proc Natl Acad Sci U S A 2003 October 28 100(22) 12830ndash12835 doi 101073pnas2133554100 Published online 2003 October 17 Evolution Proteorhodopsin genes are distributed among divergent marine bacterial taxa Joseacute R de la TorredaggerDagger Lynne M Christiansondagger Oded Beacutejagravedaggersect Marcelino T Suzukidaggerpara David M Karl John Heidelberg and Edward F DeLongdaggerdaggerdagger

daggerMonterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 sectDepartment of Biology Technion-Israel Institute of Technology Haifa 32000 Israel

paraChesapeake Biological Laboratory University of Maryland Solomons MD 20688 Department of Oceanography University of Hawaii Manoa HI 96822 and Institute for Genomic Research Rockville MD 20850 Edited by Sallie W Chisholm Massachusetts Institute of Technology Cambridge MA and approved August 21 2003 (received for review 2003 June 10) DaggerPresent address Department of Civil and Environmental Engineering University of Washington Seattle WA 98195 daggerdagger To whom correspondence should be addressed E-mail delongmbariorg hellip Analysis of the potential genes and protein-coding regions was performed by using a combination of the BLAST (11) GLIMMER 202 (TIGR) (12 13) FGENESB (Softberry Mount Kisco NY) and ARTEMIS (Sanger Center Cambridge University UK) (14) software packages Environmental Microbiology September 2004 vol 6 no 9 pp 903-910(8) DOI 101111j1462-2920200400676x Different SAR86 subgroups harbour divergent proteorhodopsins Gazalah Sabehi1 Oded Beacutejagrave1 Marcelino T Suzuki2 Christina M Preston3 Edward F DeLong4

Affiliations 1 Department of Biology Technion-Israel Institute of Technology Haifa 32000 Israel 2 Chesapeake Biological Laboratory University of Maryland Center for Environmental Sciences Solomons MD 20688 USA 3 Monterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 USA 4 Massachusetts Institute of Technology Cambridge MA 02139 USA

program FGENESB (Softberry) and the annotation was subsequently refined and curated manually using ARTEMIS (Sanger Center) Fig

FGENESV Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song Qi Wei Qin Jin Qiu Can Hua Huang Fan Wang and Choy Leong Hew1 2 1 1 1 1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2

A total of 162 ORFs predicted by the FGENESV program (available through httpwwwsoftberrycom) supplemented with Vector NTI suite 71 are indicated doi101023BVIRU000002577148128f8 Virus Genes 28 (3) 239-246 April 2004 Article ID 5269250 Complete Nucleotide Sequence of a Strawberry Isolate of Beet Pseudoyellows Virus Ioannis E Tzanetakis Molecular and Cellular Biology Program Department of Botany and Plant Pathology Oregon State University Corvallis 97331 USA Robert R Martin Horticultural Crops Research Laboratory USDA-ARS Corvallis OR 97330 USA E-mail martinrrscienceoregonstateedu httpwwwncbinlmnih govgorfgorfhtml) and the gene finder in viruses at httpwwwsoftberrycom The amino acid comparisons Geno Prot amp Bioinfo Vol 1 No 3 August 2003 226-235 Genome Organization of the SARS-CoV Jing Xu1 Jianfei Hu21 Jing Wang21 Yujun Han1 Yongwu Hu13 Jie Wen1 Yan Li1 Jia Ji1 Jia Ye14 Zizhang Zhang5 Wei Wei4 Songgang Li12 Jun Wang1 Jian Wang14 Jun Yu14 and Huanming Yang14 1 Beijing Genomics Institute Chinese Academy of Sciences Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3Wenzhou Medical College Wenzhou 325003 China 4 James D Watson Institute of Genome Sciences Zhijiang Campus Zhejiang University and Hangzhou Genomics Institute Hangzhou 310008 China 5 College of Materials Science and Chemical Engineering Yuquan Campus Zhejiang University Hangzhou 310027 China These authors contributed equally to this work Corresponding authors E-mail junyugenomicsorgcn yanghmgenomicsorgcn hellipFGENESV a program for gene prediction provided by Softberry Inc (Mount Kisco USA) through a web-based interface has been specially modimacred and trained with parameters for virus (httpwwwsoftberrycomberryphtmltopic= gfindv) hellip

The hypothetical minus sense ORF iden-timacred by FGENESV (from 48 to 203 nt on the minus strand or 29523 to 29678 nt on the plus strand) may be fake but we should not absolutely deny the prob-ability of the existence of minus ORFs hellipFurthermore we employed FGENESV to explore the sequences of MHV (NC 001846 in NCBI) and AIBV (NC 001451 in NCBI) and compared the re-sults with their previous annotations respectively Rapport de stage de DEA Juin 2003 Analyse du geacutenome du virus de lrsquoarcheacutee Pyrococcus abyssi (PAV1) ROUAULT Karen Laboratoire de Microbiologie et Biotechnologie des Extrecircmophiles IFREMER- Centre de Brest et Equipe Microbiologie LEMAR ndash Institut Universitaire Europeacuteen de la Mer [14] FGENESV httpwwwsoftberrycomberry phtmltopic=gfindv Virus ( gt10 kb) Modegraveles de Markov Forme du geacutenome Code geacuteneacutetique [40]

FGENES-M BMC Bioinformatics 2005 6 25 Published online 2005 February 10 doi 1011861471-2105-6-25 Integrating alternative splicing detection into gene prediction Sylvain Foissac 1 and Thomas Schiex11Uniteacute de Biomeacutetrie et Intelligence Artificielle INRA 31326 Castanet Tolosan France

Corresponding author Sylvain Foissac foissactoulouseinrafr Thomas Schiex tschiextoulouseinrafrReceived July 27 2004 Accepted February 10 2005 This approach has been applied eg in HMMgene or in FGENES-M (unpub)hellip DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39 - 43 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom) DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39-43 DOI 1010801042517032000160189 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom)

BESTORF Eukaryotic Cell March 2005 p 526-535 Vol 4 No 3 1535-977805$0800+0 doi101128EC43526-5352005 Sex-Specific Homeodomain Proteins Sxi1 and Sxi2a Coordinately Regulate Sexual Development in Cryptococcus neoformans Christina M Hull1 Marie-Josee Boily and Joseph Heitman1 12 Department of Molecular Genetics and Microbiology1 the Howard Hughes Medical Institute Duke University Medical Center Durham North Carolina2 Received 14 January 2005 Accepted 14 January 2005 Corresponding author Mailing address 322 CARL Building Box 3546 Department of Molecular Genetics and Microbiology 100 Research Dr Duke University Medical Center Durham NC 27710 Phone (919) 684-2824 Fax (919) 684-5458 E-mail heitm001dukeeduPresent address Departments of Biomolecular Chemistry and Medical Microbiology amp

Immunology University of WisconsinmdashMadison Madison WI 53706 Sequence manipulations Splice predictions of candidate gene sequences for SXI2a were facilitated with a Softberry algorithm (wwwsoftberrycom) hellipWe utilized the BESTORF gene prediction algorithm from Softberry Inc to electronically produce predicted spliced cDNA products encoded by a 10-kb regionhellip

PROTCOMP Genes and Immunity 2005 v5 n4 - naturecom Immune response in silico(IRIS) immune-specific genes identified from a compendium of microarray hellip AR Abbas D Baldwin Y Ma W Ouyang A Gurney F hellip The Protcomp algorithm (Softberry Inc) predicts for the 1589 IRIS genes with ORFs that 24 of the encoded proteins are in the plasma membrane 13 are MPMI Vol 17 No 7 2004 pp 789ndash797 Publication no M-2004-0426-01R copy 2004 The American Phytopathological Society Lotus japonicus LjKUP Is Induced Late During Nodule Development and Encodes a Potassium Transporter of the Plasma Membrane Guilhem Desbrosses Claudia Kopka Thomas Ott and Michael K Udvardi Max Planck Institute of Molecular Plant Physiology Am Muumlhlenberg 1 14476 Golm Germany Submitted 3 November 2003 Accepted 13 February 2004 hellipBoth PSORT and Protcomp predicted a PM location for LjKUPhellip Planta DOI 101007s00425-003-1182-5 Issue Volume 218 Number 6 Date April 2004 Pages 965 - 975 Biochemical and immunological characterization of pea nuclear intermediate filament proteins Sonal S D Blumenthal1 Gregory B Clark1 and Stanley J Roux1

(1) School of Biological Sciences Section of Molecular Cell and Developmental Biology The University of Texas Austin TX 78712 USA Stanley J Roux Email srouxutsccutexasedu html) BCM Search Launcher (Protein structure prediction http searchlauncher bcmtmcedu) SoftBerry (Protein subcellular localization Comparative and Functional Genomics Volume 5 Issue 4 Pages 342 - 353Published Online 20 May 2004 Research Paper Investigation into the use of C- and N-terminal GFP fusion proteins for subcellular localization studies using reverse transfection microarrays Ella Palmer Tom Freeman

MRC Rosalind Franklin Centre for Genomics Research (formerly the HGMP-Resource Centre) Genome Campus Hinxton Cambridge CB10 1SB UK email Tom Freeman (tfreemanrfcgrmrcacuk) Correspondence to Tom Freeman RFCGR Hinxton CambridgeCB10 1SB UK

ProtComp version 4 (Softberry) combines results with proteins of known subcellular localization and assumed subcellular localization (based on theoret- ical Plant Physiol2004 134 286-295 RHM2 Is Involved in Mucilage Pectin Synthesis and Is Required for the Development Usadel et al Tentative subcellular localization prediction by TargetP (Emanuelsson et al 2000 ) or ProtComp (httpwwwsoftberrycom) a prediction software trained on Journal of Cellular BiochemistryVolume 90 Issue 2 Pages 361 - 378Published Online 3 Sep 2003 A proteomic study of the arabidopsis nuclear matrix Tomasz T Calikowski 1 3 Tea Meulia 2 Iris Meier 1

1Department of Plant Biology and Plant Biotechnology Center Ohio State University Columbus Ohio 43210 2Molecular and Cellular Imaging Center Ohio Agricultural and Research Development Center Ohio State University Columbus Ohio 43210 3Institute of Biochemistry and Biophysics Polish Academy of Sciences UL Pawinskiego 5A 02-106 Warszawa Poland email Iris Meier (meier56osuedu)Correspondence to Iris Meier Department of Plant Biology and Plant Biotechnology Center Ohio State University 244 Rightmire Hall 1060 Carmack Rd Columbus OH 43210 For prediction of subcellular localization ProtComp 4 (Softberry Inc Mount Kisco NY httpwwwsoftberrycomberryphtmltopicfrac14 proteinloc) PSORT v64 Cellular Molecular Life Sciences 2003 in press Automatic prediction of protein function Burkhard Rost 1 2 3 Jinfeng Liu 1 3 4 Rajesh Nair 1 5 Kazimierz O Wrzeszczynski 1 and Yanay Ofran 16

1 CUBIC Dept of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 2 Columbia University Center for Computational Biology and Bioinformatics (C2B2) Russ Berrie Pavilion 1150 St Nicholas Avenue New York NY 10032 USA

3 North East Structural Genomics Consortium (NESG) Department of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 4 Dept of Pharmacology Columbia Univ 630 West 168th Street New York NY 10032 USA 5 Dept of Physics Columbia Univ 538 West 120th Street New York NY 10027 USA 6 Dept of Medical Informatics Columbia Univ 630 West 168th Street New York NY 10032 USA Corresponding author cubiccubicbioccolumbiaedu URL httpcubicbioccolumbiaedu Tel +1-212-305-4018 fax +1-212-305-7932

genomelocalize ProtComp predict localization for plants httpwwwsoftberrycomberryphtmltopic=proteinloc Predotar predict Published online before print September 15 2003 101101gr1293003 Genome Research 132265-2270 2003 The Secreted Protein Discovery Initiative (SPDI) a Large-Scale Effort to Identify Novel Human Secreted and Transmembrane Proteins A Bioinformatics Assessment Hilary F Clark1 Austin L Gurney Evangeline Abaya Kevin Baker Daryl Baldwin Jennifer Brush Jian Chen Bernard Chow Clarissa Chui Craig Crowley Bridget Currell Bethanne Deuel Patrick Dowd Dan Eaton Jessica Foster Christopher Grimaldi Qimin Gu Philip E Hass Sherry Heldens Arthur Huang Hok Seon Kim Laura Klimowski Yisheng Jin Stephanie Johnson James Lee Lhney Lewis Dongzhou Liao Melanie Mark Edward Robbie Celina Sanchez Jill Schoenfeld Somasekar Seshagiri Laura Simmons Jennifer Singh Victoria Smith Jeremy Stinson Alicia Vagts Richard Vandlen Colin Watanabe David Wieand Kathryn Woods Ming-Hong Xie Daniel Yansura Sothy Yi Guoying Yu Jean Yuan Min Zhang Zemin Zhang Audrey Goddard William I Wood and Paul Godowski Departments of Bioinformatics Molecular Biology and Protein Chemistry Genentech Inc South San Francisco California 94080 USA 1 Corresponding author E-MAIL hclarkgenecomFAX (650) 225-5389 An automated computational strategy was utilized to query each protein translation with the Signal Sensor Sighmm Tmdetect (T Wu unpubl) hmmpfam (Eddy 1998 ) and Protcomp (Softberry Inc) algorithms hellipThe Protcomp algorithm predicts the subcellular localization of a protein on the basis of homology to well-annotated proteins a neural net and various protein motifs In this case the Protcomp subcellular localization prediction was used to categorize these genes as Other Secreted Other Transmembrane or Other Cytoplasmic or Nuclear Plant Physiol February 2002 Vol 128 pp 336-340 Gene prediction in eukaryota Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 genomelocalize ProtComp predict localization for plants httpwww softberrycomberryphtmltopic=proteinloc Predotar predict Proc Natl Acad Sci U S A 2001 April 24 98(9) 5341ndash5346 doi 101073pnas101534498 Published online 2001 April 17 Plant Biology The Cia5 gene controls formation of the carbon concentrating mechanism in Chlamydomonas reinhardtii Youbin Xiang Jun Zhang and Donald P Weeks

Department of Biochemistry and School of Biological Sciences University of Nebraska Lincoln NE 68588-0664

Edited by Bob B Buchanan University of California Berkeley CA and approved March 14 2001 (received for review 2000 November 8) To whom reprint requests should be addressed E-mail dweeks1unledu hellipComputer-assisted analysis of the CIA5 aa sequence (PROTCOMP version 4 httpwwwsoftberrycom) predicted a nuclear localization of the protein hellipFinally computer program predictions (eg PROTCOMP version 4 httpwwwsoftberrycom) for a nuclear localization of CIA5 and the clear-cut nuclear localization of CIA5 in onion epidermal cells (Fig 3) provide additional weight to the argument that CIA5 may be a transcription factor Dissertation zur Erlangung des akademischen Grades Dr rer nat der Fakultaumlt der Naturwissenschaften der Universitaumlt Ulm Untersuchungen zur Identifizierung von Faktoren und Mechanismen der mRNA 3 Prozessierung und Degradation in Chloroplasten houmlherer Pflanzen vorgelegt von Michael Walter aus Immenstadt i Allgaumlu Abteilung Molekulare Botanik Universitaumlt Ulm Ulm November 2001 Tag der Promotion 19 Feb 2002 Algorithmen zur Vorhersage der subzellulaumlren Lokalisation - PSORT httppsortnibbacjp8800formhtml (Nakai und Kanehisa 1992) - ChloroP httpwwwcbsdtudkservicesChloroP (Emanuelsson et al 2000) - TargetP httpwwwcbsdtudkservicesTargetP (Emanuelsson et al 2000) - Predotar httpwwwinrafrInternetProduitsPredotar - Softberry httpwwwsoftberrycom

BPROM Extremophiles Issue Volume 9 Number 2 Date April 2005 Pages 99 ndash 109 DOI 101007s00792-004-0425-0

The genome of BCJA1c a bacteriophage active against the alkaliphilic bacterium Bacillus clarkii

Andrew M Kropinski1 Melissa Hayward1 M Dorothy Agnew1 and Ken F Jarrell1

(1) Department of Microbiology and Immunology Queens University Kingston ON K7L 3N6 Canada

al 2002) Promoters were predicted using Softberryrsquos BPROM program at httpwwwsoftberry comberry phtmltopic=promoter

Journal of Bacteriology February 2005 p 1091-1104 Vol 187 No 3 0021-919305$0800+0 doi101128JB18731091-11042005

The Generalized Transducing Salmonella Bacteriophage ES18 Complete Genome Sequence and DNA Packaging Strategy

Sherwood R Casjens12 Eddie B Gilcrease1 Danella A Winn-Stapley1 Petra Schicklmaier3 Horst Schmieger3 Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24 Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah1 Department of Biological Sciences4 Pittsburgh Bacteriophage Institute University of Pittsburgh Pittsburgh Pennsylvania 2 Institut fuumlr Genetik und Mikrobiologie Universitaumlt Muumlnchen Munich Germany3 Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu Present address Biology Department MIT Cambridge MA 02139 Present address Biogen Idec GmbH D-85737 Ismaning Germany

Received 1 September 2004 Accepted 3 November 2004

The DNA sequence analysis software used was DNA Strider (24) GeneMark (5) Staden programs (78) BLAST (2) BPROM (httpwwwsoftberrycomberryphtmltopic

Infection and Immunity May 2005 p 2899-2909 Vol 73 No 5 Characterization of the Major Secreted Zinc Metalloprotease- Dependent GlycerophospholipidCholesterol Acyltransferase PlaC of Legionella pneumophila Sangeeta Banerji1 Mayte Bewersdorff1 Bjoumlrn Hermes1 Nicholas P Cianciotto2 and Antje Flieger1 Robert Koch-Institut Berlin Germany1 Department of Microbiology-Immunology Northwestern University Medical School Chicago Illinois2 Received 25 October 2004 Returned for modification 18 November 2004 Accepted 22 December 2004

Corresponding author Mailing address Robert Koch-Institut Research Group NG5 Pathogenesis of Legionella Infections Nordufer 20 D-13353 Berlin Germany Phone 49-30-4547-2522 Fax 49-30-4547-2328 E-mail fliegerarkide MB and BH contributed equally to this work

legion) (12) Nucleotide sequences were also analyzed for promoters using the web-based program BPROM (wwwsoftberrycom) Sequence Journal of Bacteriology April 2005 p 2458-2468 Vol 187 No 7 The Type III-Dependent Hrp Pilus Is Required for Productive Interaction of Xanthomonas campestris pv vesicatoria with Pepper Host Plants Ernst Weber1 Tuula Ojanen-Reuhs2 Elisabeth Huguet3 Gerd Hause4 Martin Romantschuk2 Timo K Korhonen2 Ulla Bonas13 and Ralf Koebnik1 Institute of Genetics1 Biozentrum Martin Luther University Halle Germany4 General Microbiology Faculty of Biosciences University of Helsinki Helsinki Finland2 Institut des Sciences Veacutegeacutetales CNRS Gif-sur-Yvette France3 Received 10 November 2004 Accepted 28 December 2004 Corresponding author Mailing address Martin-Luther-Universitaumlt Institut fuumlr Genetik Weinbergweg 10 D-06120 Halle (Saale) Germany Phone 49 345 5526293 Fax 49 345 5527151 E-mail koebnikgmxde Present address Purdue University Department of Food Sciences West Lafayette IN 47907 Present address Institut de Recherche sur la Biologie de lInsecte UMR CNRS 6035 Faculteacute

des Sciences F-37200 Tours France Present address University of Helsinki Department of Ecological and Environmental Sciences

FIN-15140 Lahti Finland

The promoter recognition program BPROM (Softberry Inc Mt Kisco NY) was used for prediction of bacterial sigma70 promoter motifs RESULTS

J Bacteriol 2004 September 186(17) 5945ndash5949 doi 101128JB186175945-59492004 Identification of Operators and Promoters That Control SXT Conjugative Transfer John W Beaber and Matthew K Waldor

Department of Microbiology Tufts University School of Medicine and Howard Hughes Medical Institute Boston Massachusetts Corresponding author Mailing address Tufts University School of Medicine 136 Harrison Ave Jaharis 425 Boston MA 02111 Phone (617) 636-2730 Fax (617) 636-2723 E-mail matthewwaldortuftsedu Received April 1 2004 Accepted May 24 2004 hellipComputer algorithms and 5prime random amplification of cDNA ends (RACE) were used to define the setR and s086 transcription start sites Software for the identification of bacterial promoters (httpwwwsoftberrycomberryphtmltopic=bpromampgroup=programsampsubgroup=gfindb) identified putative minus10 and minus35 elements for both PL and PR (Fig 2) (23 24)hellip JOURNAL OF BACTERIOLOGY Mar 2004 p 1818ndash1832 Vol 186 No 6

The pKO2 Linear Plasmid Prophage of Klebsiella oxytoca Sherwood R Casjens12 Eddie B Gilcrease1 Wai Mun Huang1 Kim L Bunny3

Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24

Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah 841321 Pittsburgh Bacteriophage Institute2 and Department of Biological Sciences4 University of Pittsburgh Pittsburgh Pennsylvania 15260 and Section of Microbiology University of California at Davis Davis California 956163

Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu hellipThe DNA sequence analysis software packages used were DNA Strider (27) GeneMark (8) the Staden programs (94) BLAST (3) BPROM httpwwwsoftberrycomberryphtmltopic_gfindb) and DNA Master (J Lawrence [httpcobamide2biopittedu])hellip

BMC Microbiology 2004 44 Analysis of the lambdoid prophage element e14 in the E coli K-12 genome Preeti Mehta1 Sherwood Casjens2 and Sankaran Krishnaswamy1 Address 1Bioinformatics Centre School of Biotechnology Madurai Kamaraj University Madurai-625021 India and 2University of Utah Medical School Department of Pathology 90 North 1900 East Salt Lake City UT 84132-2501 USA Email Preeti Mehta - mehta_p74yahoocom Sherwood Casjens - sherwoodcasjenspathutahedu Sankaran Krishnaswamy - krishnamrnatnnicin Corresponding author This article is available from httpwwwbiomedcentralcom1471-218044 Putative promoters predicted using BPROM available at the website http wwwsoftberrycom Scores are as given by BPROM Promoters Plant Molecular Biology 53 (6) 865-876 December 2003 Prokaryotic orthologues of mitochondrial alternative oxidase and plastid terminal oxidase Allison E McDonald Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Sasan Amirsadeghi Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Greg C Vanlerberghe Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada (e-mail gregvutscutorontoca The A variabilis PTOX sequence was analyzed in the upstream region of the start codon with Softberryrsquos BPROM software (httpwwwsoftberrycom)

SPLICEDB Plant Molecular Biology DOI 101007s11103-005-0271-1 Issue Volume 57 Number 3 Date February 2005 Pages 445 - 460 Evaluation of five ab initio gene prediction programs for the discovery of maize genes Hong Yao1 4 Ling Guo1 6 Yan Fu1 4 Lisa A Borsuk1 6 Tsui-Jung Wen2 David S Skibbe1 5 Xiangqin Cui1 4 9 Brian E Scheffler8 Jun Cao1 4 Scott J Emrich6 Daniel A Ashlock3 6 and Patrick S Schnable1 2 4 5 6 7

(1)Department of Genetics Development and Cell Biology Iowa State University Ames Iowa 50011-3650 (2) Department of Agronomy Iowa State University Ames Iowa 50011-3650 (3) Department of Mathematics Iowa State University Ames Iowa 50011-3650 (4) Inderdepartmental Graduate Programs in Genetics Iowa State University Ames Iowa 50011-3650 (5) Department of Molecular Cellular and Developmental Biology Iowa State University Ames Iowa 50011-3650 (6) Department of Electrical and Computer Engineering and Department of Bioinformatics and Computational Biology Iowa State University Ames Iowa 50011-3650 (7) Center for Plant Genomics Iowa State University Ames Iowa 50011-3650 (8) Mid South Area Genomics Facility USDA-ARS Stoneville MS 38776-0038 USA(9) Present address Department of Biostatistics Birmingham AL 35294 USA model FGENESH httpwwwsoftberrycom berryphtmltopic=fgeneshampgroup= programsampsubgroup=gfind Monocots Yes Yes Yes GHMM a GeneMark Finding short DNA motifs using permuted markov models X Zhao H Huang TP Speed The data are human donor sequences from SpliceDB [9] a recently developed database of known mammalian splice site sequences (httpwwwsoftberrycomspldb Current Opinion in Structural Biology 2004 14273ndash282 The evolving roles of alternative splicing Liana F Lareau1 Richard E Green1 Rajiv S Bhatnagar23 and Steven E Brenner12_ Departments of 1Molecular and Cell Biology and 2Plant and Microbial Biology University of California Berkeley California 94720 USA 3Department of Dermatology University of California San Francisco California 94143 USA _e-mail brennercompbioberkeleyedu [79] SpliceDB httpwwwsoftberrycomberryphtmltopicfrac14splicedb Database and composition statistics for mammalian splice sites inferred from ESTs [80] Yearbook of Medical Informatics Review Paper 2004 121-136 Curated databases and their role in clinical bioinformatics CC Englbrecht M Han MT Mader A Osanger KFX Mayer MIPS Institute for Bioinformatics Address of the authors Claudia C Englbrecht Michael Han

Michael T Mader Andreas Osanger Klaus F X Mayer MIPS Institute for Bioinformatics GSF - National Research Center for Environment and Health 85758 Neuherberg Germany E-mail kmayergsfdeCorresponding author hellipSpliceDB httpwwwsoftberrycomspldbSpliceDBhtmlCanonical and non-canonical mammalian splice sites [122] 122Burset M Seledtsov IA Solovyev VV SpliceDB database of canonical and non-canonical mammalian splice sites Nucleic Acids Res 200129255-9 Nucleic Acids Research 2001 Vol 29 No 1 255-259 SpliceDB database of canonical and non-canonical mammalian splice sites M Burset I A Seledtsov1 and V V Solovyev The Sanger Centre Hinxton Cambridge CB10 1SA UK and 1Softberry Inc 108 Corporate Park Drive Suite 120 White Plains NY 10604 USA To whom correspondence should be addressed at present address EOS Biotechnology 225A Gateway Boulevard South San Francisco CA 94080 USA Tel +1 650 246 2331 Fax +1 650 583 3881 Email solovyeveosbiotechcom Present address M Burset Institut Municipal drsquoInvestigacioacute Megravedica (IMIM) CDr Aiguader 80 08003 Barcelona Spain

SCAN2 African Journal of Biotechnology Vol 2 (12) pp 714-718 December 2003 Available online at httpwwwacademicjournalsorgAJB ISSN 1684ndash5315 copy 2003 Academic Journal Accepted 14 November 2003 Minireview Web-based bioinformatic resources for protein and nucleic acids sequence alignment Kamel A Abd-Elsalam Molecular Markers Lab Plant Pathology Research Institute Agricultural Research Center Orman 12619 Giza Egypt E-mail kaabdelsalammsncom 16-SCAN2 program for aligning two multimegabyte-size sequences httpwwwsoftberrycomberryphtmltopic=scanhampprg= SCAN2 derived Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK The full-length sequences of gene pairs have been aligned by the SCAN2 program (httpsoftberrycomberryphtmltopic=scanhampprg=SCAN2) which can align

PDISORDER BMC Bioinformatics 2005 622 doi1011861471-2105-6-22 Research article Open Access Proteins with two SUMO-like domains in chromatin-associated complexes The RENi (Rad60-Esc2-NIP45) family Maria Novatchkova1 Andreas Bachmair3 Birgit Eisenhaber2 and Frank Eisenhaber2 Address 1Gregor Mendel-Institut GMI Austrian Academy of Sciences Vienna Biocenter A-1030 Vienna Austria 2Research Institute of Molecular Pathology Dr Bohr-Gasse 7 A-1030 Vienna Austria and 3Max Planck Institute for Plant Breeding Research Carl-von-Linneacute-Weg 10 D-50829 Cologne Germany Email Maria Novatchkova - marianovatchkovagmioeawacat Andreas Bachmair - bachmairmpiz-koelnmpgde Birgit Eisenhaber - b_eisenimpunivieacat Frank Eisenhaber - FrankEisenhaberimpunivieacat Corresponding author hellipInitial analysis of its sequence complexity shows that the disordered N-terminal half of the protein is followed by a likely globular segment (predicted using Pdisorder by Softberry Inc)hellip

SPL The National Academy of Sciences Proc Natl Acad Sci U S A 2003 November 25 100(Suppl 2) 14537ndash14542 doi 101073pnas2335847100 Molecular evolution of the insect chemoreceptor gene superfamily in Drosophila melanogaster Hugh M Robertsondagger Coral G WarrDaggersect and John R Carlsonsect Department of Entomology University of Illinois 505 South Goodwin Avenue Urbana IL 61801 DaggerSchool of Biological Sciences Monash University Clayton VIC 3800 Australia and sectDepartment of Molecular Cellular and Developmental Biology Yale University New Haven CT 06520 The genes were reconstructed manually in the PAUP editor (23) by using the expected exonintron structures as guides and the SPL program (Softberry wwwsoftberrycomberryphtml) to locate predicted introns

NSITE GENETIC ANALYSES OF BOVINE CARD15 A PUTATIVE DISEASE RESISTANCE GENE A Dissertation by KRISTEN HAWKINS TAYLOR Submitted to Texas AampM University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 2004 Major Subject Genetics hellipShort motifs identified as being conserved between the three species in these intronic regions as well as in the 5rsquoUTR and 3rsquoUTR were then analyzed using the TFSCAN (httpzeonwelloxacukgit-bintfscan) and NSITE (available through SoftBerry httpwwwsoftberrycomberryphtmltopic=promoter) programs to identify putative regulatory motifs Motifs selected for analysis required homology consisting of 6 or more bases with no more than 2 substitutions among the 3 specieshellip hellipSequence that included the SNPs located within intronic regions and in the 3rsquo and 5rsquoUTRs were analyzed using NSITE (available through SoftBerry at httpwwwsoftberrycomberryphtmltopic=promoter) to identify putative regulatory motifshellip

TSSP Bioinformatics 2005 21(14)3074-3081 doi101093bioinformaticsbti490 Cis-regulatory element based targeted gene finding genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana Weixiong Zhang 12 Jianhua Ruan 1 Tuan-hua David Ho 3 Youngsook You 3 Taotao Yu 1 and Ralph S Quatrano 3 1Department of Computer Science and Engineering Washington University in Saint Louis Saint Louis MO 63130 USA 2Department of Genetics Washington University in Saint Louis Saint Louis MO 63130 USA 3Department of Biology Washington University in Saint Louis Saint Louis MO 63130 USA To whom correspondence should be addressed

sites (TSSs) To predict TSSs we combined an Athaliana cDNA database and a software TSSP (SoftBerry httpwwwsoftberrycom) As

PLANTPROM BMC Genomics 2005 6 25 Genome wide analysis of Arabidopsis core promoters Carlos Molina12 and Erich Grotewold 11Department of Plant Cellular and Molecular Biology and Plant Biotechnology Center The Ohio State University Columbus OH 43210 2Departamento de Informaacutetica Universidad Teacutecnica Federico Santa Mariacutea Valparaiacuteso Chile search for TATA elements is carried out on the 12749 [-500 -1] regions 6316 sequences (using the MEME NFM) or 8776 (using the expanded PlantProm NFM) are Nucleic Acids Research 2005 Vol 33 No 3 1069ndash1076 doi101093nargki247 Plant promoter prediction with confidence estimation I A Shahmuradov1 V V Solovyev12 and A J Gammerman1 1Royal Holloway University of London Egham Surrey TW20 0EX UK and 2Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA MATERIALS AND METHODS Training and testing sequences For training and testing procedures we used 301 promoters with annotated TSS from PlantProm DB (22) Nucleic Acids Research 2004 Vol 32 Database issue D368plusmnD372 DOI 101093nargkh017 AthaMap an online resource for in silico transcription factor binding sites in the Arabidopsis thaliana genome Nils Ole Steffens Claudia Galuschka Martin Schindler Lorenz BuEgravelow and Reinhard Hehl Institut fuEgrave r Genetik Technische UniversitaEgrave t Braunschweig Spielmannstraucirce 7 D-38106 Braunschweig Germany hellipShahmuradovIA GammermanAJ HancockJM BramleyPM and SolovyevVV (2003) PlantProm a database of plant promoter sequences Plant Physiology October 2004 Vol 136 pp 3023-3033 GENOME ANALYSIS Utility of Different Gene Enrichment Approaches Toward Identifying and Sequencing the Maize Gene Space1[w] Nathan Michael Springer Xiequn Xu and W Brad Barbazuk Center for Plant and Microbial Genomics Department of Plant Biology University of Minnesota St Paul Minnesota 55108 (NMS) and Donald Danforth Plant Sciences Center St Louis Missouri 63132 (XX WBB) Text] Shahmuradov IA Gammerman AJ Hancock JM Bramley PM Solovyev VV (2003) PlantProm a database of plant promoter sequences

Plant Physiology April 2004 Vol 134 pp 1ndash12 wwwplantphysiolorg Characterization of Three Functional High-Affinity Ammonium Transporters in Lotus japonicus with Differential Transcriptional Regulation and Spatial Expression1

Enrica DrsquoApuzzo2 Alessandra Rogato2 Ulrike Simon-Rosin Hicham El Alaoui3 Ani Barbulova Marco Betti Maria Dimou Panagiotis Katinakis Antonio Marquez Anne-Marie Marini Michael K Udvardi and Maurizio Chiurazzi Institute of Genetics and Biophysics Via Marconi 12 80125 Napoli Italy (ED AR HEA AB MC) Molecular Plant Nutrition Group Max Planck Institute of Molecular Plant Physiology Am Muhlenberg 1 14476 Golm Germany (US-R MKU) Universiteacute Libre de Bruxelles Institut de Biologie de Meacutedecine Moleacuteculaires 6041 Gosselies Belgium (A-MM) Agricultural University of Athens Department of Agricultural Biotechnology 11855 Athens Greece (MD PK) and Departamento de Bioquıacutemica Vegetal y Biologıacutea Molecular Facultad de Quimica 41080 Seville Spain (MB AM) hellipPlantProm a database of plant promoter sequences Nucleic Acids Res 31 114ndash117 Nucleic Acids Research 1 January 2003 vol 31 no 1 pp 114-117(4) PlantProm a database of plant promoter sequences Authors Bramley PM1 Solovyev VV2 Shahmuradov IA Gammerman AJ Hancock JM Affiliations Department of Computer Science Royal Holloway University of London Egham Surrey TW20 0EX UK 1 School of Biological Sciences Royal Holloway University of London UK 2 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 3 To whom correspondence should be addressed Email victorsoftberrycom Present address John M Hancock MRC Mammalian Genetics Unit Harwell Oxfordshire UK

PROMH Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK Received February 15 2003 Revised and Accepted March 21 2003

Other FGENESH Molecular Microbiology Volume 52 Issue 6 Page 1579 - June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Affiliations Departments of Medicine and Pathobiology University of Washington Harborview Medical Center Box 359779 325 Ninth Ave Seattle WA 98104 USA E-mail acenturuwashingtonedu Tel (+1) 206 341 5364 Fax (+1) 206 341 5363 Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Microbiology 150 (2004) 518-520 DOI 101099mic026871-0 IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated with recognizable overlapping genes Mark W Silby1 Paul B Rainey23 and Stuart B Levy14 1 Center for Adaptation Genetics and Drug Resistance Department of Molecular Biology and Microbiology Tufts University School of Medicine Boston MA 02111 USA 2 Department of Plant Sciences University of Oxford South Parks Road Oxford OX1 3RB UK 3 School of Biological Sciences University of Auckland Private Bag 92019 Auckland New Zealand 4 Department of Medicine Tufts University School of Medicine Boston MA 02111 USA Correspondence Stuart B Levy (stuartlevytuftsedu) hellipUsing SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of the iiv5 ORF respectivelyhellip Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 DOI 101128JVI782212576-125902004

Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song1 Qi Wei Qin2 Jin Qiu1 Can Hua Huang1 Fan Wang1 and Choy Leong Hew1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2 Corresponding author Mailing address Department of Biological Sciences National University of Singapore 10 Kent Ridge Crescent Singapore 119260 Singapore Phone 65-68742692 Fax 65-67795671 E-mail dbshewclnusedusg or dbsheadnusedusg Received 19 March 2004 Accepted 29 June 2004

hellipThe whole genome was also submitted to httpwwwsoftberrycom (Softberry Inc Mount Kisco NY) for identification of all potential ORFshellip hellipCoding capacity of the viral genomic DNA sequence Prediction of presumptive genes was carried out by using the viral gene prediction program under the website httpwwwsoftberrycom supplemented with Vector NTI suite 71hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1695-8 Issue Volume 109 Number 4 Date August 2004 Pages 681 - 689

Linkage disequilibrium and sequence diversity in a 500-kbp region around the adh1 locus in elite maize germplasm Mark Jung1 Ada Ching1 Dinakar Bhattramakki2 Maureen Dolan1 Scott Tingey1 Michele Morgante1 3 and Antoni Rafalski1

(1) DuPont Crop Genetics Experimental Station PO Box 80353 Wilmington DE 19880-0353 USA (2) Pioneer Hi-Bred International Inc 7300 NW 62nd Avenue PO Box 1004 Johnston IA 50131-1004 USA (3) Dipartimento di Produzione Vegetale e Tecnologie Agrarie Universita di Udine Via delle Scienze 208 33100 Udine ItalyReceived 26 January 2004 Accepted 2 April 2004 Published online 6 August 2004 Mark Jung Email marktjungcgrdupontcom 1) Gene locations were defined by several methods Annotations provided in Tikhonov et al (1999) were first used then FGENESH gene-finding software DNA Sequence - The Journal of Sequencing and Mapping Issue Volume 15 Number 4 August 2004 Pages 269 ndash 276 DOI 10108010425170412331279648 Isolation Characterization and Expression Analysis of a Leaf-specific Phosphoenolpyruvate Carboxylase Gene in Oryza sativa Chang-Fa Lin A1 Chun Wei A1 Li-Zhi Jiang A1 A2 Ke-Gui Li A1 Xiao-Yin Qian A1 Kotb Attia A1 Jin-Shui Yang A1 A1 State Key Laboratory of Genetic Engineering Institute of Genetics School of Life Sciences Fudan University Shanghai 200433 PRChina A2 National Key Laboratory for Soil Erosion and Dry Land Farming on Loess Plateau Northwest Sci-Tech University of Agriculture and Forest Shannxi 712100 PRChina tools of GeneMark (httpopal biologygatechedugeneMark) and Softberry (httpwwwsoftberrycom) For the isolation of putative Plant Molecular Biology DOI 101023BPLAN00000382568980957 Issue Volume 54 Number 4 Date March 2004 Pages 519 ndash 532

Genome-Wide Analysis of the GRAS Gene Family in Rice and Arabidopsis Chaoguang Tian1 Ping Wan1 Shouhong Sun1 Jiayang Li1 and Mingsheng Chen1

(1) Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road Chaoyang District Beijing 100101 ChinaMingsheng Chen Email mschengeneticsaccn

database FgeneSH (Salamov and 90 Solovyev 2000) was used for gene prediction pre- 207 dicted by FgeneSH (minor discrepancies exist due 208 Mycological Research (2004) 108 853-857 Cambridge University Press doi 101017S095375620400067X Published Online 13 авг 2004 Combining transcriptome data with genomic and cDNA sequence alignments to make confident functional assignments for Aspergillus nidulans genes Andrew H SIMS a1 Manda E GENT a1 Geoffrey D ROBSON a1 Nigel S DUNN-COLEMAN a2 and Stephen G OLIVER a1c1 a1 School of Biological Sciences University of Manchester The Michael Smith Building Oxford Road Manchester M13 9PT UK E-mail steveolivermanacuk a2 Genencor International Inc 925 Page Mill Road Palo Alto CA 94304 USA c1 Corresponding author Kingdom Page 2 Genewise FgeneSH FgeneSH+) consisting of 9541 putative open reading frames (ORFs) was released in June 2003 We TAG Theoretical and Applied Genetics DOI 101007s00122-004-1603-2 Issue Volume 109 Number 1 Date June 2004 Pages 129 ndash 139

Gene content and density in banana (Musa acuminata) as revealed by genomic sequencing of BAC clones R Aert1 2 L Saacutegi2 and G Volckaert1

(1) Laboratory of Gene Technology Katholieke Universiteit Leuven Kasteelpark Arenberg 21 3001 Leuven Belgium

Present address Laboratory of Tropical Crop Improvement Katholieke Universiteit Leuven (2) Kasteelpark Arenberg 13 3001 Leuven Belgium

R Aert Email RitaAertagrkuleuvenacbe

gscrikengojp) fgenesh version 11 (Salamov and Solovyev 2000 httpwwwsoftberry com) genemarkhmm version 22a (Lukashin and Borodovsky 1998 http Genome Research 142503-2509 2004 Resources EAnnot A genome annotation tool using experimental evidence Li Ding1 Aniko Sabo Nicolas Berkowicz Rekha R Meyer Yoram Shotland Mark R Johnson Kymberlie H Pepin Richard K Wilson and John Spieth Genome Sequencing Center Washington University School of Medicine St Louis Missouri 63110 USA Article and publication are at httpwwwgenomeorgcgidoi101101gr3152604 1 Corresponding author E-mail ldingwatsonwustledu fax (314) 286-1810 Some ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov and Solovyev 2000 ) are based on intrinsic characteristics of coding hellipSome ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov

and Solovyev 2000 ) are based on intrinsic characteristics of coding sequence (eg codon usage consensus splice sites etc) and require training on known genes from the organismhellip

hellipTo further evaluate the performance of EAnnot we compared EAnnot predictions with Ensembl Genscan and Fgenesh predictions using manual annotation as a standard While Genscan and Fgenesh are ab initio programs Ensembl takes into account experimental data a feature shared with EAnnot Ensembl predicted 1037 known genes with 1798 transcripts and 1457 EST genes with 2308 transcripts for chromosome 6 (build 31) while Fgenesh and Genscan predicted 6230 and 6225 genes respectively We evaluated the performance of each program with respect to splice sites transcripts and genes across all of chromosome 6hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1758-x Issue Volume 109 Number 7 Date November 2004 Pages 1434 - 1447

Full-genome analysis of resistance gene homologues in rice

B Monosi1 R J Wisser2 L Pennill1 and S H Hulbert1

(1) Department of Plant Pathology Kansas State University Manhattan KS 66506-5502 USA (2) Department of Plant Pathology Cornell University Ithaca NY 14853 USAReceived 18 February 2004 Accepted 16 June 2004 Published online 10 August 2004 S H Hulbert Email shulbrtksuedu DNA sequences were analyzed using the gene prediction programs GENSCAN (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (Salamov and arXivq-bioGN0402046 v1 27 Feb 2004 Sublinear growth of Information in DNA sequences Giulia Menconi Dipartimento di Matematica Applicata and CISSC Centro Interdisciplinare per lo Studio dei Sistemi Complessi Universit`a di Pisa Via Bonanno Pisano 25b 56126 PISA - Italy menconimaildmunipiit October 23 2003 hellipAs a result four putative genes G1 G2 G3 and G4 have been located by means of Hidden Markov Model-based program FGENESH2 that has been created for predicting multiple genes and their structure in genomic DNA sequences The analysis via FGENESH has been exploited with respect to known genes in Arabidopsis thaliana Their predicted position is illustrated in Figure 13 hellip2This program is available at the website wwwsoftberrycom to which we refer con-cerning the reliability and e_ciency of the algorithmhellip Current Opinion in Plant Biology 2004 7732ndash736 DOI 101016jpbi200409003 Consistent over-estimation of gene number in complex plant genomes Jeffrey L Bennetzen14 Craig Coleman27 Renyi Liu15 Jianxin Ma16 and Wusirika Ramakrishna38 1 Department of Genetics University of Georgia Athens Georgia 30602 USA 2 Department of Plant and Animal Sciences Brigham Young University Provo Utah 84602 USA 3 Department of Biological Sciences Michigan Tech University Houghton Michigan 49931 USA

4e-mail maizeugaedu 5e-mail lryugaedu 6e-mail jmaugaedu 7e-mail biotechnologybyuedu 8e-mail wusirikamtuedu hellipWe have found that the standard gene-discovery programs FGENESH GeneMark and GENSCAN annotate segments of most retrotransposons and many invertedrepeat transposable elements as genes Using FGENESH to annotate maize BAC clones for instance 70ndash100 of the predicted genes are actually from transposable elementshellip The Plant Cell 162795-2808 (2004) Spotted leaf11 a Negative Regulator of Plant Cell Death and Defense Encodes a U-BoxArmadillo Repeat Protein Endowed with E3 Ubiquitin Ligase Activity Li-Rong Zenga Shaohong Qua Alicia Bordeosb Chengwei Yangc Marietta Baraoidanb Hongyan Yanc Qi Xiec Baek Hie Nahmd Hei Leungb and Guo-Liang Wanga1 a Department of Plant Pathology Ohio State University Columbus Ohio 43210 b International Rice Research Institute Metro Manila Philippines c State Key Lab for Biocontrol Sun Yat-sen (Zhongshan) University Guangzhou China 510275 d Department of Biological Science Myongji University Kyonggido Korea 449728 1 To whom correspondence should be addressed E-mail wang620osuedu fax 614-292-4455 in spl11 Exons predicted in G3 by the programs GENSCAN and Fgenesh using different matrixes are displayed in dark gray (D) RFLP Source Human Genomics Volume 1 Number 2 January 2004 pp 146-149(4) Publisher Henry Stewart Publications The truth about mouse human worms and yeast Authors David R Nelson1 Daniel W Nebert2 1 Department of Molecular Sciences and The UT Center of Excellence in Genomics and Bioinformatics University of Tennessee Memphis Tennessee 38163 USA 2 Department of Environmental Health and Center for Environmental Genetics (CEG) University of Cincinnati Medical Center Cincinnati Ohio 45267-0056 USA unpublished data 2003 see also Ref [7]) FGENESH 21 TWINSCAN 22 and the Ensembl annotation pipeline 23 The output of the four Genome Biology 2004 5R73 doi101186gb-2004-5-10-r73 A comprehensive transcript index of the human genome generated using microarrays and computational approaches Eric E Schadt 1 Stephen W Edwards 1 Debraj GuhaThakurta1 Dan Holder2 Lisa Ying2 Vladimir Svetnik2 Amy Leonardson1 Kyle W Hart3 Archie Russell1 Guoya Li1 Guy Cavet1 John Castle1 Paul McDonagh4 Zhengyan Kan1 Ronghua Chen1 Andrew Kasarskis1 Mihai Margarint1 Ramon M Caceres1 Jason M Johnson1

Christopher D Armour1 Philip W Garrett-Engele1 Nicholas F Tsinoremas5 and Daniel D Shoemaker1

1Rosetta Inpharmatics LLC 12040 115th Avenue NE Kirkland WA 98034 USA 2Merck Research Laboratories W42-213 Sumneytown Pike POB 4 Westpoint PA 19846 USA 3Rally Scientific 41 Fayette Street Suite 1 Watertown MA 02472 USA 4Amgen Inc 1201 Amgen Court W Seattle WA 98119 USA 5The Scripps Research Institute Jupiter FL 33458 USA hellipGrailEXP 40 [47] GENSCAN 10 [48] FGENESH [49] and FGENESH+ [49]ab initio gene-prediction algorithms were run independently across the entire genome assembly to augment alignment-based gene identification methods GrailEXP 40 GENSCAN 10 and FGENESH version 1c were run with default parameters for human sequence GrailEXP used expressed sequence evidence from RefSeq UniGene and DoubleTwist HGI to refine gene predictions FGENESH+ was run with protein sequences from BLASTX with E-score lower than 10-5 When multiple protein alignments overlapped all overlapping protein sequences were clustered with BLASTClust [50] and the lowest E-score hit was used by FGENESH+hellip hellipAdditionally 3 sim4 and 3 or 5 FGENESH+ predicted exons that were short andor distant from internal predicted exons were removedhellip Genome Research 14988-995 2004 ENSEMBL Special GeneWise and Genomewise Ewan Birney13 Michele Clamp2 and Richard Durbin2 1 The European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK hellipThere has been a long history of successful ab initio programs which do not use any additional

evidence to predict genes on genomic DNA of which Genscan (Burge and Karlin 1997 ) and Fgenesh (Solovyev and Salamov 1997 ) are two of the most successful caseshellip hellipAnother class of evidence-based gene prediction programs are ones which use external evidence to influence the scoring of potential exons including SGP-2 (Parra et al 2003 ) Genie

(Kulp et al 1996 ) Genomescan (Yeh et al 2001 ) HMMGene (Krogh 2000 ) and Fgenesh++ (Solovyev and Salamov 1997 )hellip Published online before print February 5 2004 101073pnas0308430100 PNAS | February 17 2004 | vol 101 | no 7 | 1910-1915 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin Nam Joonyul Kim para Shinyoung Lee Gynheung An Hong Ma and Masatoshi Nei Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State

University University Park PA 16802 and National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Contributed by Masatoshi Nei December 22 2003

hellipBecause annotation of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program FGENESH (wwwsoftberrycom) from the genome sequences obtained from TIGR and the Rice Genome Database (China) (25) Functional amp Integrative Genomics DOI 101007s10142-004-0109-y Issue Volume 4 Number 2 Date May 2004 Pages 102 - 117

Sequence analysis of the long arm of rice chromosome 11 for ricendashwheat synteny

Nagendra K Singh1 Saurabh Raghuvanshi2 Subodh K Srivastava1 Anupama Gaur2 Ajit K Pal1 Vivek Dalal1 Archana Singh1 Irfan A Ghazi1 Ashutosh Bhargav1 Mahavir Yadav1 Anupam Dixit1 Kamlesh Batra1 Kishor Gaikwad1 Tilak R Sharma1 Amitabh Mohanty2 Arvind K Bharti2 Anita Kapur2 Vikrant Gupta2 Dibyendu Kumar2 Shubha Vij2 Ravi Vydianathan2 Parul Khurana2 Sulabha Sharma2 W Richard McCombie3 Joachim Messing4 Rod Wing5 Takuji Sasaki6 Paramjit Khurana2 Trilochan Mohapatra1 Jitendra P Khurana2 and Akhilesh K Tyagi2

1 Indian Initiative for Rice Genome Sequencing National Research Centre on Plant Biotechnology

Indian Agricultural Research Institute New Delhi 110012 India 2 Indian Initiative for Rice Genome Sequencing Department of Plant Molecular Biology University of

Delhi South Campus New Delhi 110021 India 3 Genome Research Centre Cold Spring Harbor Laboratory 1 Bungtown Road New York USA 4 The Plant Genome Initiative at RutgersndashWaksman Institute Rutgers University 190 Frelinghuysen

Road Piscataway NJ 08873 USA 5 Department of Plant Sciences 303 Forbes Building Arizona Genomics Institute The University of

Arizona Tucson AZ 85721 USA 6 Department of Genome Research National Institute of Agrobiological Sciences 1-2 Kannondai 2-

chome Tsukuba Ibaraki 305-8602 Japan Akhilesh K Tyagi Email akhileshgenomeindiaorg Wherever RiceGAAS data were not available the genes were predicted by FGENESH trained for monocot plant species (http wwwsoftberrycomberryphtml) TAG Theoretical and Applied Genetics DOI 101007s00122-004-1621-0 Issue Volume 109 Number 1 Date June 2004 Pages 10 - 22

Annotation of a 95-kb Populus deltoides genomic sequence reveals a disease resistance gene cluster and novel class I and class II transposable elements

M Lescot1 4 S Rombauts1 J Zhang1 S Aubourg1 5 C Matheacute1 6 S Jansson2 P Rouzeacute1 3 and W Boerjan1

1 Department of Plant Systems Biology Flanders Interuniversity Institute for Biotechnology Ghent University Technologiepark 927 9052 Gent Belgium

2 Department of Plant Physiology University of Umearing 901 87 Umearing Sweden 3 Laboratoire Associeacute de l Institut National de la Recherche Agronomique (France) Ghent

University 9052 Gent Belgium 4 Present address CIRAD-Biotrop TA4003 34398 Montpellier Cedex 5 France 5 Pre sent address Uniteacute de Recherche en Geacutenomique Veacutegeacutetale INRA 91057 Evry Cedex

France 6 Present address Laboratoire de Biologie Vasculaire Institut de Pharmacologie et Biologie

Structurale 205 route de Narbonne 31077 Toulouse Cedex France

P Rouzeacute Email pierrerouzepsbugentbe 1999 httpwwwtigrorgtdbglimmermglmr_formhtml) and FgenesH for dicots or monocots (Salamov and Solovyev 2000 httpwwwsoftberrycom) BIOINFORMATICS 2004 vol20 N9 p1416-1427 J Yuan B Bush A Elbrecht Y Liu T Zhang W Zhao hellip - suchasGRAIL(Lopezetal 1994 Roberts 1991 Uberbacher et al 1996) GENESCOPE (Murakami and Takagi 1998) fgenesh (Salamov and Solovyev 2000) GeneMark Molecular Plant Pathology Volume 5 Issue 6 Page 515 - November 2004 Pathogen profile Heading for disaster Fusarium graminearum on cereal crops RUBELLA S GOSWAMI AND H CORBY KISTLERCorrespondence E-mail HCKISTumnedu This pipeline uses a combination of the programs FGENESH and FGENESH+ (Salamov and Solovyev 2000) modified by Softberry ( http wwwsoftberrycom ) with Nucleic Acids Research 2004 Vol 32 Database issue D41-D44 MIPS analysis and annotation of proteins from whole genomes H W Mewes12 C Amid1 R Arnold1 D Frishman2 U Guumlldener1 G Mannhaupt2 M Muumlnsterkoumltter1 P Pagel1 N Strack2 V Stuumlmpflen1 J Warfsmann1 and A Ruepp1 1 Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1 D-85764 Neuherberg Germany and 2 Technische Universitaumlt Muumlnchen Chair of Genome Oriented Bioinformatics Center of Life and Food Science D-85350 Freising-Weihenstephan Germany To whom correspondence should be addressed at Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 D-85764 Neuherberg Germany Tel +49 89 3187 3580 Fax +49 89 3187 3585 Email wmewesgsfde The genome of 40 Mb encodes 10 000 proteins automatically predicted by the program FGENESH (httpsoftberry com) specifically trained for Neurospora Annual Review of Genomics and Human Genetics Vol 5 15-56 (Volume publication date September 2004) COMPARATIVE GENOMICS Webb Miller Kateryna D Makova Anton Nekrutenko and Ross C Hardison The Center for Comparative Genomics and Bioinformatics The Huck Institutes of Life Sciences and the Departments of Biology Computer Science and Engineering and Biochemistry and Molecular Biology Pennsylvania State University University Park Pennsylvania email webbbxpsuedu kdm16psuedu antonbxpsuedu rch8psuedu These algorithms include Genscan the most popular gene prediction tool (24) GenMark (117) Fgenesh (155) GeneID (144) and others (for an excellent overview DNA and Cell Biology May 2004 Vol 23 No 5 311-324 Harbinger Transposons and an Ancient HARBI1 Gene Derived from a Transposase

Vladimir V Kapitonov Genetic Information Research Institute Mountain View California Jerzy Jurka Genetic Information Research Institute Mountain View California We used FGENESH (Salamov and Solovyev 2000) and GeneScan (Burge and Karlin 1997) for the identification of exons and introns The d N d S ratio Proc Natl Acad Sci U S A 2004 February 17 101(7) 1910ndash1915 Published online 2004 February 5 doi 101073pnas0308430100 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin NamdaggerDagger Joonyul Kimsectpara Shinyoung Leesect Gynheung Ansect Hong Madagger and Masatoshi Neidagger

daggerInstitute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University University Park PA 16802 and sectNational Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Dagger To whom correspondence should be addressed E-mail jyn101psuedu paraPresent address Michigan State UniversityndashDepartment of Energy Plant Research Laboratory and Department of Biochemistry and Molecular Biology Michigan State University East Lansing MI 48824 of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program fgenesh (wwwsoftberrycom Nucleic Acids Research 2004 Vol 32 Database issue D377-D382 BGI-RIS an integrated information resource and comparative analysis workbench for rice genomics Wenming Zhao1 Jing Wang2 Ximiao He1 Xiaobing Huang1 Yongzhi Jiao1 Mingtao Dai1 Shulin Wei1 Jian Fu1 Ye Chen1 Xiaoyu Ren1 Yong Zhang12 Peixiang Ni1 Jianguo Zhang1 Songgang Li12 Jian Wang1 Gane Ka-Shu Wong13 Hongyu Zhao4 Jun Yu1 Huanming Yang1 and Jun Wang1 1 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3 University of Washington Genome Center Department of Medicine Seattle WA 98195 USA and 4 Yale University School of Medicine Department of Epidemiology and Public Health New Haven CT 06520-8034 USA To whom correspondence should be addressed Tel +86 10 80481662 Fax +86 10 80498676 Email wangjgenomicsorgcn Correspondence may also be addressed to Huanming Yang Tel +86 10 80494969 Fax +86 10 80491181 Email yanghmgenomicsorgcn The authors wish it to be known that in their opinion the first four authors should be regarded as joint First Authors The contig sequences were annotated for gene content by using automated processes that involve ab initio gene finders such as FgeneSH (httpwwwsoftberrycom Genome Research 141932-1937 2004 Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome

Jinsheng Lai1 Nrisingha Dey2 Cheol-Soo Kim35 Arvind K Bharti1 Stephen Rudd46 Klaus FX Mayer4 Brian A Larkins3 Philip Becraft2 and Joachim Messing17 1 Waksman Institute Rutgers The State University of New Jersey Piscataway New Jersey 08854 USA 2 Department of Genetics Development amp Cell Biology Iowa State University Ames Iowa 50011 USA 3 Department of Plant Science University of Arizona Tucson Arizona 85721 USA 4 Munich Information Center for Protein Sequences Institute for Bioinformatics GSF Research Center for Environment and Health Neuherberg Germany A total of 54397 putative genes could be predicted for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for Plant Molecular Biology DOI 101023BPLAN000002876821587dc Issue Volume 54 Number 1 Date January 2004 Pages 55 - 69

Dynamics of the evolution of orthologous and paralogous portions of a complex locus region in two genomes of allopolyploid wheat

Xiu-Ying Kong1 2 Yong Qiang Gu3 Frank M You4 Jorge Dubcovsky4 and Olin D Anderson3

1 Genetic Resources Conservation Program University of California Davis CA 95616 USA 2 Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081

China 3 US Dept of Agriculture Western Regional Research Center Agricultural Research Service 800

Buchanan Street Albany CA 94710 USA 4 Department of Agronomy and Range Sciences University of California Davis CA 95616 USA FGENESH (httpwwwsoftberrycomberryphtml) and GENES- CAN (httpgenemark miteduGENESCANhtml) were used for gene prediction Current Genetics DOI 101007s00294-003-0451-y Issue Volume 44 Number 6 Date January 2004 Pages 329 - 338

Chromosome rearrangements in isolates that escape from het-c heterokaryon incompatibility in Neurospora crassa

Qijun Xiang1 and N Louise Glass1

Department of Plant and Microbial Biology University of California Berkeley CA 94720-3102 USA Hypothetical proteins are predicted from FGENESH calls with overlapping Blastx hits (but not with trusted homology) while Predicted Molecular Genetics and Genomics DOI 101007s00438-004-0990-z Issue Volume 271 Number 4 Date May 2004 Pages 402 - 415

Genome-wide identification of NBS genes in japonica rice reveals significant expansion of divergent non-TIR NBS-LRR genes

T Zhou1 Y Wang1 J-Q Chen1 H Araki2 Z Jing1 K Jiang1 J Shen1 and D Tian1

1 State Key Laboratory of Pharmaceutical Biotechnology Department of Biology Nanjing University 210093 Nanjing China

2 Department of Ecology and Evolution University of Chicago Chicago IL 60637 USA to 5000ndash10000 bp from both ends of the hits and then the expanded nucleotide fragments were reannotated using the gene-finding programs FGENESH (http www Proc Natl Acad Sci U S A 2004 June 15 101(24) 9045ndash9050 Genetics

Genetic control of branching in foxtail millet Andrew N Doust Katrien M DevosDaggerdagger Michael D Gadberrysect Mike D GaleDagger and Elizabeth A KelloggDepartment of Biology University of Missouri 8001 Natural Bridge Road St Louis MO 63121 and DaggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH United Kingdom dagger To whom correspondence should be addressed E-mail adoustumsledu sectPresent address Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building University of Georgia Athens GA 30602 Communicated by Peter H Raven Missouri Botanical Garden St Louis MO April 23 2004 Each of these contigs was scanned by using fgenesh (28) and identified ORFs were translated and compared with ORFs from other contigs from the same QTL region Mol Biol Evol 21(9)1769-1780 2004 Merlin a New Superfamily of DNA Transposons Identified in Diverse Animal Genomes and Related to Bacterial IS1016 Insertion Sequences Ceacutedric Feschotte1 Departments of Plant Biology and Genetics The University of Georgia Athens Correspondence E-mail cedricplantbiougaedu coding sequences were assembled by removing introns predicted with more than 85 confidence by NetGene2 (httpwwwcbsdtudk) andor FGENESH (httpgenomic Genome Research 141924ndash1931 copy2004 Gene Loss and Movement in the Maize Genome Jinsheng Lai1 Jianxin Ma23 Zuzana Swigonˇovб1 Wusirika Ramakrishna24

Eric Linton15 Victor Llaca16 Bahattin Tanyolac17 Yong-Jin Park28 O-Young Jeong29

Jeffrey L Bennetzen23 and Joachim Messing110 1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854-8020 USA 2Department of Biological Sciences Purdue University West Lafayette Indiana 47907-1392 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602-7223 USA 4Department of Biological Sciences 740 DOW Michigan Tech University Houghton MI 49931 USA 5Plant Biology Labs Michigan State University East Lansing MI 48824 USA 6Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880-0353 USA 7Department of Bioengineering Ege University Izmir 35100 Turkey 8National Institute of Agricultural Biotechnology Suwon 441-707 Republic of Korea 9National Institute of Crop Science Suwon 441-857 Republic of Korea 10Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 The FGENESH program predicted four of which three (gene 1d in the maize orp1 region gene 5a 5b in the rice r1 region) would produce truncated proteins

Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Molecular Microbiology Volume 54 Issue 2 Page 407 - October 2004 doi101111j1365-2958200404310x Cryptococcus neoformans Kin1 protein kinase homologue identified through a Caenorhabditis elegans screen promotes virulence in mammals Eleftherios Mylonakis1 Alexander Idnurm2 Roberto Moreno1 Joseph El Khoury134 James B Rottman5 Frederick M Ausubel67 Joseph Heitman28910 and Stephen B Calderwood1111Division of Infectious Diseases Massachusetts General Hospital Boston MA 02114 USA 2Department of Molecular Genetics and Microbiology Duke University Medical Center Durham NC 27710 USA 3Center for Immunology and Inflammatory Diseases and 4Division of Rheumatology Allergy and Immunology Massachusetts General Hospital Boston MA 02114 USA 5Archemix Corporation Cambridge MA 02139 USA 6Department of Molecular Biology Massachusetts General Hospital Boston MA 02114 USA 7Department of Genetics Harvard Medical School Boston MA 02115 USA 8Division of Infectious Diseases 9Department of Medicine and 10Howard Hughes Medical Institute Duke University Medical Center Durham NC 27710 USA 11Department of Microbiology and Molecular Genetics Harvard Medical School Boston MA 02115 USA E-mail scalderwoodpartnersorg

Sequences were compared with the H99 genome database at Duke University and genes predicted in these regions by FGENESH software ( http wwwsoftberrycom TAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Original Paper Characterization of soybean genomic features by analysis of its expressed sequence tags Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

1 Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

2 Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China Jin-Song Zhang Email jszhanggeneticsaccnShou-Yi Chen Email sychengeneticsaccn six BAC-contig sequences of M truncatula were analyzed and the results based on the gene prediction program FGENSH (Arabidopsis matchFGENESH prediction) (http Current Proteomics Volume 1 Number 1 January 2004 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE11 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Chinese Science Bulletin 2004 Vol 49 No 4 355-362 The VER2 promoter contains repeated sequences and requires vernalization for its activity in winter wheat (Triticum aestivum L) XU Wenzhong

1 WANG Xin

1 FENG Qi

2 ZHANG Lei

2 LIU Yaoguang

3 HAN Bin

2 CHONG

Kang1 XU Zhihong

1 amp TAN Kehui

1

1 Research Center for Molecular Developmental Biology Key Lab of Photosynthesis and Environmental Molecular Physiology Institute of Botany Chinese Academy of Sciences (CAS) Beijing 100093 China 2 National Center for Gene Research CAS Shanghai 200233 China 3 Genetic Engineering Laboratory College of Life Sciences South China Agricultural University Guangzhou 510642 China Correspondence should be addressed to Chong Kang (e-mail mailtochongknsibcasaccn) Sequence analyses were finished using biological softwares on Internet such as FGENESH 10 (Prediction of potential genes in Plant (Dct) genomic DNA)

TAG Theoretical and Applied Genetics DOI 101007s00122-003-1457-z Issue Volume 108 Number 3 Date February 2004 Pages 392 - 400

Sequence variations of simple sequence repeats on chromosome-4 in two subspecies of the Asian cultivated rice Can Li1 Yu Zhang1 Kai Ying1 Xiaolei Liang1 and Bin Han1

(1) National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China To characterize the possible relationship between SSRs and genes predicted by using FGENESH we investigated the distribution of SSRs in the rice chromosome-4 Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene TAG Theoretical and Applied Genetics DOI 101007s00122-004-1591-2 Issue Volume 108 Number 8 Date May 2004 Pages 1449 - 1457

Positional cloning of the rice Rf-1 gene a restorer of BT-type cytoplasmic male sterility that encodes a mitochondria-targeting PPR protein

H Akagi1 A Nakamura2 Y Yokozeki-Misono2 A Inagaki2 4 H Takahashi1 K Mori1 and T Fujimura3

1 Laboratory of Plant Breeding and Genetics Department of Biological Production Faculty of Bioresource Sciences Akita Prefectural University Kaidoubata-Nishi 241-7 Shimoshinjyo-Nakano 010-0195 Akita Japan

2 Biochemical Technology Section Life Science Laboratory Performance Materials RampD Center Mitsui Chemicals Togo 1144 297-0017 Mobara Japan

3 Institute of Agricultural and Forest Engineering University of Tsukuba Tennoudai 1-1-1 Tsukuba 305-8572 Ibaraki Japan

4 Present address Department of Synthetic Chemistry and Biological Chemistry Graduate School of Engineering Kyoto University Yoshida-Honmachi Sakyo-ku 606-8501 Kyoto japan

H Akagi Email akagiakita-puacjp Software Develop- ment Tokyo) Genomic sequences were also analyzed using gene prediction programs genescan and fgenesh Table 1 DNA Genome Research 14942-950 2004

The Ensembl Automatic Gene Annotation System Val Curwen1 Eduardo Eyras1 T Daniel Andrews1 Laura Clarke1 Emmanuel Mongin2 Steven MJ Searle1 and Michele Clamp34 1 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 EMBL European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SD UK 3 The Broad Institute Cambridge Massachusetts 02141 USA Commonly we use Genscan for ab initio prediction in human mouse and rat but the system is equally applicable to other methods such as FgenesH (Solovyev et Gene 324 (2004) 105ndash115 Transcript abundance of rml1 encoding a putative GT1-like factor in rice is up-regulated by Magnaporthe grisea and down-regulated by light Rong Wang ab Guofan Hongab Bin Hana a

National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China b

Shanghai Institute of Biochemistry and Cell Biology Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 320 Yueyang Road Shanghai 200031 China 1A and 2A) The structure of rml1 _ a is as same as that predicted by FGENESH software The 3V UTR of rml1 _ a is confirmed with the length of 596 bp Genome Biology 2004 5R46 doi101186gb-2004-5-7-r46 Identification of conserved gene structures and carboxy-terminal motifs in the Myb gene family of Arabidopsis and Oryza sativa L ssp Indica Cizhong Jiang1 Xun Gu1 2 and Thomas Peterson1 1Department of Genetics Development and Cell Biology and Department of Agronomy Iowa State University Ames IA 50011 USA 2LHB Center for Bioinformatics and Biological Statistics Iowa State University Ames IA 50011 USA FGeneSH has been used successfully to predict genes in rice [9] and GenScan was used together with it to predict genes by taking rice genomic sequences as Molecular Microbiology 53 (5) 1307-1318 - September 2004 doi 101111 j1365-2958200404215x The sirodesmin biosynthetic gene cluster of the plant pathogenic fungus Leptosphaeria maculans Donald M Gardiner1 Anton J Cozijnsen1 Leanne M Wilson1 M Soledade C Pedras2 and Barbara J Howlett1

1School of Botany The University of Melbourne Victoria Australia 3010 2Department of Chemistry University of Saskatchewan 110 Science Place Saskatoon SK Canada S7N 5C9 E-mail dgardinerpgradunimelbeduau http wwwtigrorg Putative genes were predicted using Fgenesh software at http wwwsoftberrycom Fungal culture The wild type Journal of Molecular Evolution DOI 101007s00239-004-2666-z Issue Volume 59 Number 6 Date December 2004 Pages 761 - 770 Analysis of the Molecular Evolutionary History of the Ascorbate Peroxidase Gene Family Inferences from the Rice Genome Felipe Karam Teixeira1 Larissa Menezes-Benavente1 Rogeacuterio Margis1 2 and Maacutercia Margis-Pinheiro1

(1) Laboratoacuterio de Geneacutetica Molecular Vegetal Departamento de Geneacutetica UFRJ 21944-970 Rio de Janeiro Brasil(2) Departamento de Bioquiacutemica Instituto de Quiacutemica UFRJ 21944-970 Rio de Janeiro Brasil Genomic se- quences were also analyzed in the FGENESH gene structure pre- diction program (httpwwwsoftberrycom) (Solovyev 2001) and GeneMark program (http Incomplete gene structure prediction with almost 100 specificity SL Chin J Xiong T Ioerger SH Sze - 2004 - txspacetamuedu iv Fgenesh-C The proposed gene structure prediction algorithm by far has the best 25 12 Comparison against Est2Genome Sim4 Spidey Fgenesh-c Journal of Biotechnology 109 (2004) 217ndash226

Preparation of single rice chromosome for construction of a DNA library using a laser microbeam trap

Xiaohui Liu a1 Haowei Wang b1 Yinmei Li b Yesheng Tang a Yilei Liu a Xin u a Peixin Jia a Kai Ying a Qi Feng a Jianping Guan a Chaoqing Jin a Lei Zhang a Liren Lou b Zhuan Zhou c Bin Han aA National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200233 China B Department of Physics University of Science and Technology of China Hefei 230026 China C Institute of Neuroscience Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200031 China Corresponding author Fax +86-21-64825775 E-mail address bhanncgraccn (B Han) ers These sequences were further annotated using gene-prediction software FGENESH to give the pos- sible protein-coding region Science 303 1364-1367 Medicago truncatula DMI1 Required for Bacterial and Fungal Symbioses in Legumes Ane et al (2004) 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberry phtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Proc Natl Acad Sci U S A 2004 August 24 101(34) 12404ndash12410 Inaugural Articles Rapid recent growth and divergence of rice nuclear genomes Jianxin Ma and Jeffrey L Bennetzen

Department of Genetics University of Georgia Athens GA 30602 To whom correspondence should be addressed E-mail maizeugaedu Contributed by Jeffrey L Bennetzen May 25 2004 Almost all LTR-retrotransposons including solo LTRs identified in our studies were predicted as genes by the gene-finding program fgenesh (data not shown) The Plant Journal Volume 37 Issue 4 Page 517 -527 - February 2004 doi101046j1365-313X200301976x Xa26 a gene conferring resistance to Xanthomonas oryzae pv oryzae in rice encodes an LRR receptor kinase-like protein Xinli Sun Yinglong Cao Zhifen Yang Caiguo Xu Xianghua Li Shiping Wang and Qifa Zhang National Key Laboratory of Crop Genetic Improvement National Center of Crop Molecular Breeding Huazhong Agricultural University Wuhan 430070 China For correspondence (fax +86 27 87287092 e-mail swangmailhzaueducn) al 1997) Gene prediction programs used were genscan (Burge and Karlin 1997) and fgenesh (httpwwwsoftberrycom) Promoter Genome Research 141916ndash1923 copy2004 by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonˇovaґ16 Jinsheng Lai16 Jianxin Ma23 Wusirika Ramakrishna24

Victor Llaca15 Jeffrey L Bennetzen23 and Joachim Messing17

1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854 USA

2Department of Biological Sciences and Genetics Program West Lafayette Indiana 47907 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602 USA 4Department of Biological Sciences Michigan Tech University MI 49931 USA 5Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880 USA 6These authors contributed equally to this work 7Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 Page 1 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonovaacute 16 Jinsheng Lai 16 Jianxin Ma 23 Wusirika Ramakrishna 24 TAG Theoretical and Applied Genetics DOI 101007s00122-004-1667-z Issue Volume 109 Number 3 Date August 2004 Pages 515 - 522

Physical mapping and putative candidate gene identification of a quantitative trait locus Ctb1 for cold tolerance at the booting stage of rice K Saito1 Y Hayano-Saito1 W Maruyama-Funatsuki1 Y Sato1 and A Kato1

(1) National Agricultural Research Center for Hokkaido Region Hitsujigaoka 1 Toyohira Sapporo Hokkaido 062-8555 JapanK Saito Email kjsaitoaffrcgojp GENSCAN RICEHMM FGENESH MZEF ) a splice prediction program ( SPLICEPREDIC- TOR ) homology search analysis programs ( BLAST HMMER TAG Theoretical and Applied Genetics DOI 101007s00122-004-1697-6 Issue Volume 109 Number 4 Date August 2004 Pages 690 - 699 The anthracnose resistance locus Co-4 of common bean is located on chromosome 3 and contains putative disease resistance-related genes M Melotto1 4 M F Coelho1 A Pedrosa-Harand2 J D Kelly3 and L E A Camargo1

1 Departamento de Fitopatologia Laboratoacuterio de Geneacutetica Molecular ESALQ Universidade de Satildeo Paulo Piracicaba SP CP 9 13418-900 Brazil

2 Department of Cell Biology and Genetics Institute of Botany University of Vienna Rennweg 14 Vienna 1030 Austria

3 Department of Crop and Soil Sciences Michigan State University East Lansing MI 48824 USA 4 Present address MSU-DOE Plant Research Laboratory Michigan State University 206 Plant Biology

Building East Lansing MI 48824 USA M Melotto Email melottommsuedu and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (httpwww softberrycom)mdashusing Arabidopsis as the model or- ganism Journal of Genetics Vol 83 No 1 P 79-99 April 2004 Structural and functional analysis of rice genome Tyagi A K Khurana J P Khurana P Raghuvanshi S Gaur A Kapur A Gupta V Kumar D Ravi V Vij S Khurana P and Sharma S Department of Plant Molecular Biology University of Delhi South Campus Benito Juarez Road New Delhi 110 021 India It inte- grates results from several gene prediction software such as GENSCAN (Burge and Karlin 1997) FGENESH (Sala- mov and Solovyev 2000) RiceHMM (Sakata The Plant Cell 161220-1234 (2004) Comparative Analysis of the Receptor-Like Kinase Family in Arabidopsis and Rice Shin-Han Shiua Wojciech M Karlowskib Runsun Panad Yun-Huei Tzengac Klaus F X Mayerb and Wen-Hsiung Lia1 a Department of Ecology and Evolution University of Chicago Chicago Illinois 60637 b Munich Information Center for Protein SequencesInstitute of Bioinformatics GSF National Research Center for Environment and Health Neuherberg 85764 Germany c Department of Mathematics National Tsing Hua University Hsinchu Taiwan 300 d Institute of Information Science Academia Sinica Taiwan 115

1 To whom correspondence should be addressed E-mail whliuchicagoedu fax 773-702-9740 a permissive E value cutoff of 1 The rice genes from the indica subspecies was predicted using the whole genome shotgun assembly with FGENESH (Solovyev 2002 Genome Research 141474ndash1482 (2004) copy by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Incongruent Patterns of Local and Global Genome Size Evolution in Cotton Corrinne E Grover1 HyeRan Kim2 Rod A Wing2 Andrew H Paterson3 and Jonathan F Wendel14 1Department of Ecology Evolution and Organismal Biology Iowa State University Ames Iowa 50011 USA 2Arizona Genomics Institute University of Arizona Tucson Arizona 85721 USA 3Plant Genome Mapping Laboratory University of Georgia Athens Georgia 30602 USA hellip Potential genes were predicted by three independent programs FGENESH (httpwwwsoftberrycom) Plant Physiology May 2004 Vol 135 pp 459-470 Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four Triticeae Genomes Yong Qiang Gu Devin Coleman-Derr Xiuying Kong and Olin D Anderson United States Department of Agriculture-Agricultural Research Service Western Regional Research Center Albany California 94710 (YQG DC-D ODA) and Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081 China (XK) FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology October 2004 Vol 136 pp 3177-3190 Comparative Sequence Analysis of the Region Harboring the Hardness Locus in Barley and Its Colinear Region in Rice1

Katherine S Caldwell2 Peter Langridge and Wayne Powell Scottish Crop Research Institute Invergowrie Dundee DD2 5DA United Kingdom (KSC WP) and School of Agriculture and Wine (KSC PL) and Australian Centre for Plant Functional Genomics (PL) University of Adelaide Waite Campus Glen Osmond South Australia 5064 Australia jp Sakata et al 2002 ) which couples the integration of several programs for the prediction of open reading frames (GENSCAN RiceHMM FGENESH MZEF) with GENES amp DEVELOPMENT 18687-699 2004 pyramus and thisbe FGF genes that pattern the mesoderm of Drosophila embryos

Angelike Stathopoulos1 Bergin Tam1 Matthew Ronshaugen1 Manfred Frasch2 and Michael Levine13 1 Department of Molecular and Cell Biology Division of Genetics amp Development University of California Berkeley California 94720-3204 USA 2 Brookdale Department of Molecular Cell and Developmental Biology Mount Sinai School of Medicine New York New York 10029 USA hellip FGF protein sequences used in alignment and phylogenetic reconstruction were gathered from GenBank or inferred from genomic sequence using GENESCAN (Burge and Karlin 1997 ) and FGENESHhellip Genome Research 141888-1901 2004 Organization and Evolution of a Gene-Rich Region of the Mouse Genome A 127-Mb Region Deleted in the Del(13)Svea36H Mouse Ann-Marie Mallon14 Laurens Wilming24 Joseph Weekes1 James GR Gilbert2 Jennifer Ashurst2 Sandrine Peyrefitte2 Lucy Matthews2 Matthew Cadman1 Richard McKeone1 Chris A Sellick1 Ruth Arkell1 Marc RM Botcherby3 Mark A Strivens1 R Duncan Campbell3 Simon Gregory25 Paul Denny1 John M Hancock16 Jane Rogers2 and Steve DM Brown1 1 Medical Research Council Mammalian Genetics Unit Harwell Oxfordshire United Kingdom 2 Wellcome Trust Sanger Institute Hinxton Genome Campus United Kingdom 3 Medical Research Council Rosalind Franklin Centre for Genomics Research Hinxton Genome Campus United Kingdom hellip Ab initio gene structures were predicted using FGENESH (Salamov and Solovyev 2000 ) and GENSCANhellip Current Proteomics January 2004 vol 1 no 1 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE1 Affiliations 1 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom

sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene Nucleic Acids Research 2003 Vol 31 No 4 1148-1155 Characterization of Arabidopsis thaliana ortholog of the human breast cancer susceptibility gene 1 AtBRCA1 strongly induced by gamma rays S Lafarge and M-H Montaneacute CEA Cadarache DSV-DEVM Laboratoire de Radiobiologie Veacutegeacutetale Bat 185 F-13108 St Paul Lez Durance Cedex France To whom correspondence should be addressed Tel +33 4 42 25 35 56 Fax + 33 4 42 25 26 25 Email marie-helenemontaneceafr Received November 18 2002 Accepted December 5 2002 DDBJEMBLGenBank accession no AF515728 hellipGene structure prediction was done on software implemented on the Softberry web page (httpwwwsoftberrycom) analysis of protein domains using the SMARThellip hellipThe gene structure of At4g21070 was determined with three gene structure prediction software packages (Softberry GenScan Grail) hellip To resolve this ambiguity in intronndashexon prediction we postulated the presence of two genes given by Softberry prediction software and performed northern blotting and 5 RACE to characterize the structural organization of the At4g21070 locushellip Proc Natl Acad Sci U S A 2003 July 22 100(15) 9055ndash9060 doi 101073pnas1032999100 Plant Biology Gene expression of a gene family in maize based on noncollinear haplotypes Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway NJ 08854-8020 Communicated by Brian A Larkins University of Arizona Tucson AZ May 19 2003 (received for review 2002 April 10) To whom correspondence should be addressed E-mail messingmbclrutgersedu The FGENESH program (Softberry Mount Kisco NY) was used for gene prediction analysis BMC Genomics 2003 4 22

doi 1011861471-2164-4-22 Published online 2003 June 3 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams 12 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA 3Department of Biological Sciences Wichita State University Wichita Kansas USA

Corresponding author Sreedhar Oduru odurusreedharttuhscedu Janee L Campbell janeecampbellttuhscedu SriTulasi Karri phrskttuhscedu William J Hendry williamhendrywichitaedu Shafiq A Khan shafiqkhanttuhscedu Simon C Williams simonwilliamsttuhscedu

Two gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=H Insect Molecular Biology Volume 12 Issue 4 Page 319 - August 2003 doi101046j1365-2583200300415x Expression of an Aedes aegypti cation-chloride cotransporter and its Drosophila homologues V Filippov K Aimanova and S S GillAffiliations Department of Cell Biology and Neuroscience University of California Riverside USA Correspondence Sarjeet S Gill 5429 Boyce Hall Environmental Toxicology Graduate Program University of California Riverside CA 92521 USA Tel +1 909 787 4621 Fax +1 909 787 3087 E-mail Sarjeetgillucredu significant similarity to the Drosophila genes were used for gene structure prediction with the FGENESH program available on site http wwwsoftberrycom Developmental Biology 256 (2003) 276ndash289 tcl-2 encodes a novel protein that acts synergistically with Wnt signaling pathways in C elegans Xiaojun Zhaoa Hitoshi Sawab and Michael A Hermana a Program in Molecular Cellular and Developmental Biology Division of Biology Kansas State University Manhattan KS 66506 USA b Laboratory for Cell Fate Decision RIKEN Center for Developmental Biology 2-2-3 Minatojima-minamimachi Chuo-ku Kobe 650-0047 Japan Received for publication 10 September 2002 revised 25 November 2002 accepted 19 December 2002 hellipCbTCL-2 is conceptually translated from a gene predicted by the FGENSH (Salamov and Solovyev 2000 httpwwwsoftberrycom) using defaults for C elegans genomic sequences Proc Natl Acad Sci U S A 2003 May 27 100(11) 6569ndash6574 doi 101073pnas0732024100 Evolution Molecular paleontology of transposable elements in the Drosophila melanogaster genome Vladimir V Kapitonov and Jerzy Jurka

Genetic Information Research Institute 2081 Landings Drive Mountain View CA 94043 Communicated by Margaret G Kidwell University of Arizona Tucson AZ April 7 2003 (received for review 2002 December 23)

To whom correspondence may be addressed E-mail vladimirulamgirinstorg or jurkagirinstorg hellipWe used FGENESH (ref 18 wwwsoftberrycom) for identifying genes encoded by TEs Genetics and Molecular Biology ISSN 1415-4757 versioacuten impresa Genet Mol Biol v26 n4 Satildeo Paulo dic 2003 Iron homeostasis related genes in rice Jeferson GrossI II Ricardo Joseacute SteinII Arthur Germano Fett-NetoI II Janette Palma FettI II

IUniversidade Federal do Rio Grande do Sul Centro de Biotecnologia Porto Alegre RS Brazil IIUniversidade Federal do Rio Grande do Sul Departamento de Botacircnica Porto Alegre RS Brazil IIIBotanical Institute of Ludwig-Maximilians-Universitaumlt Muumlnchen Germany The prediction algorithms were GenScan (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) GenomeScan (Burge and Karlin 1997 httpgenesmitedugenomescanhtml) FGENESH (Salamov and Solovyev 2000 httpwwwsoftberrycomberryphtmltopic= gfind) GeneMarkhmm (Borodovsky and Lukashin unpublished httpopalbiologygatecheduGeneMarkeukhmmcgi) and GrailEXP (Xu and Uberbacher 1997 httpcompbioornlgovgrailexp) Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome J Lai N Dey CS Kim AK Bharti S Rudd KFX Mayer hellip for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for monocotyledonous genes (httpwwwsoftberrycom) Published 23 September 2003 BMC Plant Biology 2003 36 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya12 Address 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia and 2NSW Agricultural Genomics Centre Wagga Wagga Australia Email Qian-Hao Zhu - qianhaozhucsiroau Mohammad Shamsul Hoque - mohammadhoquecsiroau Elizabeth S Dennis - lizdenniscsiroau Narayana M Upadhyaya - narayanaupadhyayacsiroau Corresponding author hellip Analyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCANhellip

The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Australasian Plant Pathology Volume 32 Number 4 2003 pp 511-519 Small scale functional genomics of the blackleg fungus Leptosphaeria maculans analysis of a 38 kb region Alexander Idnurm Janet L Taylor M Soledade C Pedras and Barbara J Howlett vertebrate and Arabidopsis settings Burge and Karlin 1997) and FGENESH on Neurospora crassa and Schizosaccharomyces pombe settings (wwwsoftberrycom) as Barley Genetics Newsletter Volume 32 Hard-copy edition pages 34 - 37 MAPPING AND SEQUENCING OF THE BARLEY PUTATIVE HYPERSENSITIVE INDUCED REACTION GENES Nils Rostoks1 David Kudrna1 and Andris Kleinhofs12

1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 2 School of Molecular Biosciences Washington State University Pullman WA 99164 The full length coding sequence was reconstructed using a combination of FGENESH gene prediction program (httpwwwsoftberrycom) and alignment with cDNAs from the other barley HIR groups TAG Theoretical and Applied Genetics DOI 101007s00294-003-0391-6 Issue Volume 43 Number 5 Date August 2003 Pages 351 - 357 Characterisation of the mating-type locus of the plant pathogenic ascomycete Leptosphaeria maculans Anton J Cozijnsen A1 and Barbara J Howlett A1 A1 School of Botany The University of Melbourne 3010 Victoria Australia hellipGenes introns exons and transcription initiation sites were predicted by analysis with FGENESH (wwwsoftberrycom) on Neurospora crassa andhellip BMC Plant Biol 2003 3 6

doi 1011861471-2229-3-6 Published online 2003 September 23 Received May 23 2003 Accepted September 23 2003 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya 12 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia 2NSW Agricultural Genomics Centre Wagga Wagga Australia

Corresponding author Qian-Hao Zhu qianhaozhucsiroau Mohammad Shamsul Hoque mohammadhoquecsiroau Elizabeth S Dennis lizdenniscsiroau Narayana M Upadhyaya narayanaupadhyayacsiroau hellipAnalyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml identified a single-exon gene capable of encoding a protein with the DNA binding domain of the EREBPAP2 family of plant transcription factors [2636] 1515 bp downstream from the Ds insertion The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Alignment of EREBPAP2 domains was performed using programs of Genetics Computer Group Wisconsin software suit [11] Genetics Vol 164 655-664 June 2003 Copyright copy 2003 Map-Based Cloning of Leaf Rust Resistance Gene Lr21 From the Large and Polyploid Genome of Bread Wheat Li Huanga Steven A Brooksa Wanlong Lia John P Fellersb Harold N Tricka and Bikram S Gilla a Wheat Genetics Resource Center Department of Plant Pathology Kansas State University Manhattan Kansas 66506-5502 b USDA-ARS Plant Science and Entomology Unit Kansas State University Manhattan Kansas 66506-5502 Corresponding author Bikram S Gill 4024 Throckmorton Kansas State University Manhattan KS 66506-5502 bsgksuedu (E-mail hellipIn addition FGENSH 11 (httpwwwsoftberrycom) was used for gene prediction (with monocot genomic DNA parameters) Nucleic Acids Research 2003 Vol 31 No 1 229-233 The TIGR rice genome annotation resource annotating the rice genome and creating resources for plant biologists Qiaoping Yuan Shu Ouyang Jia Liu Bernard Suh Foo Cheung Razvan Sultana Dan Lee John Quackenbush and C Robin Buell The Institute for Genomic Research 9712 Medical Center Dr Rockville MD 20850 USA To whom correspondence should be addressed Tel +1 301 8383558 Fax +1 301 8380208 Email rbuelltigrorg Received August 14 2002 Revised and Accepted October 2 2002 hellipThe rice sequences were processed with multiple ab initio gene finders including FGENESH (httpwwwsoftberrycom)hellip hellip Working models were generated using the FGENESH output and putative identification for the gene was obtained from the most significant database match while models with no significant database match were labeled as hypothetical proteins

JXB Advance Access originally published online on June 18 2003 Journal of Experimental Botany Vol 54 No 389 pp 1995-1996 August 1 2003 Received 21 April 2003 Accepted 25 April 2003 OsSET1 a novel SET-domain-containing gene from rice Yun-Kuan Liang Ying Wang Yong Zhang Song-Gang Li Xiao-Chun Lu Hong Li Cheng Zou Zhi-Hong Xu and Shu-Nong Bai PKU-Yale Joint Research Center of Agricultural and Plant Molecular Biology National Key Laboratory of Protein Engineering and Plant Gene Engineering College of Life Sciences Peking University 5 Yiheyuan Road Beijing 100871 PR China To whom correspondence should be addressed Fax +86 10 6275 1526 E-mail shunongbpkueducn It localizes at chromosome three in rice genome at the contig 1300 (httpwwwsoftberrycomberryphtmltopic=gfindampprg=FGENESH GenBank accession number BMC Genomics 2003 422 Published 3 June 2003 Received 31 January 2003 Accepted 3 June 2003 This article is available from httpwwwbiomedcentralcom1471-2164422 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams12 Address 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA and 3Department of Biological Sciences Wichita State University Wichita Kansas USA Email Sreedhar Oduru - odurusreedharttuhscedu Janee L Campbell - janeecampbellttuhscedu SriTulasi Karri - phrskttuhscedu William J Hendry - williamhendrywichitaedu Shafiq A Khan - shafiqkhanttuhscedu Simon C Williams - simonwilliamsttuhscedu Corresponding author hellipTwo gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=HsapiensTAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Characterization of soybean genomic features by analysis of its expressed sequence tags

Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

Jin-Song Zhang Email jszhanggeneticsaccn

Shou-Yi Chen Email sychengeneticsaccn Phone +86-10-64886859 Fax +86-10-64873428

(1) Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

(2) Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China

prediction of these BAC-contig sequences was based on the gene-prediction program FGENSH (Arabidopsis matchFGENESH DDT Vol 7 No 11 (Suppl) 2002 S70-S76 wwwdrugdiscoverytodaycom Genome annotation techniques new approaches and challenges Alistair G Rust Emmanuel Mongin and Ewan Birney European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridge UK CB10 1SD tel +44 1223 494420 fax +44 1223 494468 e-mail birneyebiacuk Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Box 1 Useful human genome annotation and browser URLs Human genome browsers bull UCSC Human Genome Browser httpgenomecseucsceducgi-binhgGateway bull Softberry Genome Explorer httpwwwsoftberrycomberryphtmltopic=genomexp Ab initio gene prediction programs Ab initio gene predictors rely on the statistical qualities of exons rather than on homologies Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Celerarsquos pipeline) and GrailEXP [16] (ORNL) Proc Natl Acad Sci U S A 2002 August 20 99(17) 11423ndash11428 doi 101073pnas162276199 Neurobiology Identification of G protein-coupled receptors for Drosophila PRXamide peptides CCAP corazonin and AKH supports a theory of ligand-receptor coevolution Yoonseong Parkdagger Young-Joon Kim and Michael E AdamsdaggerDagger

Departments of Entomology and daggerCell Biology and Neuroscience 5429 Boyce Hall University of California Riverside CA 92521 Edited by Lutz Birnbaumer National Institutes of Health Research Triangle Park NC and approved June 14 2002 (received for review 2002 May 7) DaggerTo whom reprint requests should be addressed E-mail adamsmailucredu

This article has been corrected See Proc Natl Acad Sci U S A 2002 October 15 99(21) 13961b For each Drosophila GPCR prediction of gene structure was made in FGENESH (wwwsoftberrycom ref 21) by using about 20 kb of genomic sequence surrounding highly conserved regions particularly for 5 prime and 3 prime ends of ORFs Putative Drosophila GPCRs in the database were amplified by RT-PCR using primers based on gene predictions in the FGENESH gene finder (wwwsoftberrycom ref 21) 21 Salamov A A amp Solovyev V V (2000) Genome Res 10 516-522

prediction httpwwwsoftberrycomberry

Eukaryotic Cell October 2002 p 719-724 Vol 1 No 5 Isocitrate Lyase Is Essential for Pathogenicity of the Fungus Leptosphaeria maculans to Canola (Brassica napus) Alexander Idnurm and Barbara J Howlett School of Botany The University of Melbourne Melbourne Victoria 3010 Australia Received 17 June 2002 Accepted 29 July 2002 hellip The DNA sequence obtained was compared to those in the GenBank database by using BLAST (1) and genes were predicted by using FGENESH software (httpwwwsoftberrycom) and GENSCAN (wwwbionavigatorcom) Bio-Almanac GLE ToxExpress P Offering CLG Annotated sequence data The genes are identified with the FGENESH11 gene modeling software exclusively li- censed from Softberry Inc Automatic Analysis of 106 kb of contiguous DNA sequence from the D genome of wheat reveals high gene density hellip SA Brooks L Huang BS Gill JP Fellers trix In addition FGENESH 11 (httpwwwsoftberrycom) was used for CDS prediction with monocot genomic DNA parameters Both Molecular Genetics and Genomics DOI 101007s00438-002-0706-1 Issue Volume 267 Number 6 Date August 2002 Pages 713 - 720 Genome sequencing of a 239-kb region of rice chromosome 10L reveals a high frequency of gene duplication and a large chloroplast DNA insertion Q Yuan J Hill J Hsiao K Moffat S Ouyang Z Cheng J Jiang C Buell A1 The Institute for Genomic Research 9712 Medical Center Drive Rockville MD 20850 USA A2 Department of Horticulture University of Wisconsin Madison WI 53706 USA The sequences were analyzed with several gene prediction programs including FGENESH (httpwwwsoftberrycom) Genemarkhmm (rice matrix httpopalbiology

Genetics Vol 162 1389-1400 November 2002 Copyright copy 2002 Different Types and Rates of Genome Evolution Detected by Comparative Sequence Analysis of Orthologous Segments From Four Cereal Genomes Wusirika Ramakrishnaa Jorge Dubcovskyb Yong-Jin Park1a Carlos Bussob John Embertona Phillip SanMiguelc and Jeffrey L Bennetzena a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Department of Agronomy and Range Science University of California Davis California 95616 c Purdue University Genomics Core WSLR Purdue University West Lafayette Indiana 47907

Corresponding author Jeffrey L Bennetzen Hansen Bldg Purdue University West Lafayette IN 47907 maizebilbobiopurdueedu (E-mail)

FGENESH (httpwwwsoftberrycomnucleohtml) with the maize training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and

GeneMarkhmm (httpgenemarkbiologygatecheduGene Mark) Functional amp Integrative Genomics DOI 101007s10142-002-0055-5 Issue Volume 2 Numbers 1-2 Date May 2002 Pages 51 - 59 Genomic sequencing reveals gene content genomic organization and recombination relationships in barley Nils Rostoks Yong-Jin Park Wusirika Ramakrishna Jianxin Ma Arnis Druka Bryan A Shiloff Phillip J SanMiguel Zeyu Jiang Robert Brueggeman Devinder Sandhu Kulvinder Gill Jeffrey L Bennetzen Andris Kleinhofs A1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA A2 Department of Biological Sciences Purdue University West Lafayette IN 47907 USA A3 National Center for Genome Resources 2935 Rodeo Park Drive East Santa Fe NM 87505 USA A4 G302 Agronomy Hall Iowa State University Ames IA 50011-1010 USA A5 Department of Agronomy University of Nebraska Lincoln NE 68583 USA A6 School of Molecular Biosciences and Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA version 10 with maize parameters The FGENESH predictions were run at httpwwwsoftberrycom BAC genomic regions were defined Structural organization of the barley D-hordein locus in comparison with its orthologous regions of hellip YQ Gu OD Anderson CF Londeore X Kong RN hellip et al 1997) to search for additional genes In addition FGENESH (httpwwwsoftberrycomberryphtml) and GENESCAN (httpgenes

Published online before print June 20 2002 101073pnas142284999 PNAS | July 9 2002 | vol 99 | no 14 | 9328-9333 The barley stem rust-resistance gene Rpg1 is a novel disease-resistance gene with homology to receptor kinases R Brueggeman N Rostoks D Kudrna A Kilian F Han J Chensect A Druka B Steffensonpara and A Kleinhofs Department of Crop and Soil Sciences Washington State University Pullman WA 99164-6420 para Department of Plant Pathology 495 Borlaug Hall 1991 Upper Buford Circle St Paul MN 55108-6030 and School of Molecular Biosciences Washington State University Pullman WA 99164-4234 Communicated by Diter von Wettstein Washington State University Pullman WA May 13 2002 (received for review March 25 2002) The gene prediction programs GENSCAN (httpgenesmiteduGENSCANhtml) and FGENESH (httpwwwsoftberrycom) as well as NEURAL NETWORK PROMOTER

PREDICTION (httpwwwfruitflyorgseq_toolspromoterhtml) localized the putative transcription start site of the gene about 400 bp upstream of the translation start site Plant Physiol 2002 December 130(4) 1626ndash1635 doi 101104pp012179 Received July 30 2002 Accepted October 1 2002 Contiguous Genomic DNA Sequence Comprising the 19-kD Zein Gene Family from Maize1

Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway New Jersey 08854ndash8020 Corresponding author e-mail messingmbclrutgersedu fax 732ndash445ndash0072 Draft sequences generated from high-throughput DNA sequencing (phase II) were subjected to gene prediction programs with FGENESH (Softberry Inc Mount Kisco NY) The Plant Cell Vol 14 3213-3223 December 2002 Copyright copy 2002 Received July 22 2002 accepted September 26 2002 Structural Analysis of the Maize Rp1 Complex Reveals Numerous Sites and Unexpected Mechanisms of Local Rearrangement Wusirika Ramakrishnaa John Embertona Matthew Ogdena Phillip SanMiguelb and Jeffrey L Bennetzen1a a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 1 To whom correspondence should be addressed E-mail maizebilbobiopurdueedu fax 765-496-1496 FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCAN Plant Physiol 2002 December 130(4) 1728ndash1738

doi 101104pp014951 Comparative Sequence Analysis of the Sorghum Rph Region and the Maize Rp1 Resistance Gene Complex Wusirika Ramakrishna John Emberton Phillip SanMiguel Matthew Ogden Victor Llaca Joachim Messing and Jeffrey L Bennetzen

Department of Biological Sciences Purdue University West Lafayette Indiana 47907 (WR JE MO JLB) Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 (PSM) and Waksman Institute Rutgers University Piscataway New Jersey 08854 (VL JM) Corresponding author e-mail maizebilbobiopurdueedu fax 765ndash496ndash1496 Received September 19 2002 Accepted October 8 2002 hellipAnnotation and sequence analysis were performed as described earlier (Dubcovsky et al 2001 Song et al 2001 Ramakrishna et al 2002a ) FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and GeneMarkhmm (httpopalbiologygatecheduGeneMarkeukhmmcgi) Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four hellipYQ Gu D Coleman-Derr X Kong OD Anderson FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology 2004 - plantphysiolorg - plantphysiolorg - intlplantphysiolorg - ncbinlmnihgov - all 5 versions raquoA Genome-Wide Screen Identifies Genes Required for Centromeric Cohesion JJ Doyle J Denarie F Debelle JC Prome BB Amor hellip 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberryphtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Various programs Plant Molecular Biology Issue Volume 58 Number 3 Date June 2005 Pages 421 ndash 433 DOI 101007s11103-005-5702-5 OsPPR1 a pentatricopeptide repeat protein of rice is essential for the chloroplast biogenesis Kodiveri M Gothandam1 Eun-Sook Kim1 Hongjoo Cho1 and Yong-Yoon Chung1

(1) School of Life Sciences and Biotechnology Korea University Sungbuk-ku 136-701 Seoul Anam-Dong Korea nucleotide and amino acid sequences were analyzed by the Basic Local Alignment SearchTool (BLAST) and the Soft berry prog- rame (httpwwwsoftberrycom)

Plant Physiol February 2002 Vol 128 pp 336-340 wwwplantphysiolorgcgidoi101104pp010875 Received September 25 2001 returned for revision September 29 2001 accepted November 2 2001 Cellulose Synthase-Like Genes of Rice1

Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 The Rice Genome Research Program cDNA clones were of high quality all but one were viable and accurately annotated The one exception D22177 was chimeric containing OsCSLA2 at one end and a predicted DNA-binding protein at the other For all sequences the corresponding

proteins were deduced using gene prediction software from GeneMark (Atlanta httpopalbiologygatecheduGeneMark) and Softberry Inc (White Plains NY httpwwwsoftberrycom) and by manual alignment with the Arabidopsis Csl proteins and with each other IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated Silby et al Microbiology2004 150 518-520 MW Silby PB Rainey SB Levy Using SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of Plant Cell 2002 September 14(9) 2107ndash2119 Received March 25 2002 Accepted June 14 2002 Two Novel Fungal Virulence Genes Specifically Expressed in Appressoria of the Rice Blast Fungus

Chaoyang Xuea Gyungsoon Parka Woobong Choib Li Zhengc Ralph A Deanb and Jin-Rong Xua1

aDepartment of Botany and Plant Pathology Purdue University West Lafayette Indiana 47907 bDepartment of Plant Pathology North Carolina State University Raleigh North Carolina 27606 cSyngenta Agribusiness Biotechnology Research Inc Research Triangle Park North Carolina 27709 1To whom correspondence should be addressed E-mail xubtnypurdueedu fax 765-494-0363 Approximately 12- and 14-kb upstream sequences of GAS1 and GAS2 were sequenced and analyzed with several programs including TRES (wwwbioportalbicnusedusgtres) Expasy (wwwexpasyorg) and SoftBerry (wwwsoftberrycom) European Journal of Neuroscience January 2002 vol 15 no 1 pp 79-86(8) Characterizing CGI-94 (comparative gene identification-94) which is down-regulated in the hippocampus of early stage Alzheimers disease brain Heese K Nakayama T Hata R Masumura M Akatsu H Li F Nagai Y Yamamoto T Kosaka K Suemoto T Sawada T Additionally protein sequence analysis was performed using the following programs at ExPASy httpwwwexpasych softberry httpwwwsoftberrycomindex Journal of Cellular Biochemistry Volume 91 Issue 5 Pages 1030 - 1042 Published Online 25 Feb 2004 Copyright copy 2004 Wiley-Liss Inc A Wiley Company Received 4 September 2003 Accepted 21 November 2003 Characterizing the new transcription regulator protein p60TRP K Heese 1 T Yamada 1 H Akatsu 2 T Yamamoto 2 K Kosaka 2 Y Nagai 1 T Sawada 1

1BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan 2Choju Medical Institute Fukushimura Hospital 19-14 Aza-Yamanaka Noyori Toyohashi Aichi 441-8124 Japan email K Heese (heeseksilverocnnejp) Correspondence to K Heese BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan expasych) softberry httpwwwsoftberry comindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorg toolsaacomp Genome Research 14929-933 2004 ISSN 1088-9051 $500 The Ensembl Core Software Libraries Arne Stabenau1 Graham McVicker1 Craig Melsopp1 Glenn Proctor1 Michele Clamp2 and Ewan Birney13 1 EMBL European Bioinformatics Institute Wellcome Trust Genome Campus Hinxton CB10 1SD UK 2 The Broad Institute Cambridge Massachusetts 02141-2023 USA

2003 ) Finally there have been several commercial genome management products based on proprietary technology from Softberry Celera and Doubletwist Generation of T-DNA tagging lines with a bidirectional gene trap vector 2 and the establishment of hellip G An annotated in the public databases we undertook 274 annotation with the Softberry program (http 275 wwwsoftberrycomberryphtml) Functional clas- 276 Journal of Bacteriology January 2002 p 183-190 Vol 184 No 1 0021-919301$0400+0 DOI 101128JB1841183-1902002 Received 25 July 2001 Accepted 11 October 2001 Regulation of the acuF Gene Encoding Phosphoenolpyruvate Carboxykinase in the Filamentous Fungus Aspergillus nidulans Michael J Hynes Oliver W Draht and Meryl A Davis Department of Genetics University of Melbourne Parkville Victoria 3010 Australia The Protein Sequence Analysis program (httpwwwsoftberrycomproteinhtml) predicted a PEPCK (ATP) signature sequence between amino acids 275 and 290 Proteins Structure Function and GeneticsVolume 53 Issue S6 Pages 352 - 368 Supplement Fifth Meeting on the Critical Assessment of Techniques for Protein Structure Prediction Published Online 15 Oct 2003 Received 4 March 2003 Accepted 23 June 2003 Digital Object Identifier (DOI) 101002prot10543 Comp Modeling Assessment Assessment of homology-based predictions in CASP5 Anna Tramontano 1 Veronica Morea 2

1Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Rome Italy 2CNR Institute of Molecular Biology and Pathology University of Rome La Sapienza Rome Italy email Anna Tramontano (AnnaTramontanouniromalit) Correspondence to Anna Tramontano Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Ple Aldo Moro 5-00185 Rome Italy hellipPage 1 Assessment of Homology-Based Predictions in CASP5 Anna Tramontano 1 and Veronica Morea 2 1 Department of Biochemical Sciences American Journal of HematologyVolume 73 Issue 3 Pages 161 - 168 Published Online 20 Jun 2003 Received 23 October 2002 Accepted 15 April 2003 Digital Object Identifier (DOI) 101002ajh10358

Spectrum of thalassemia mutations and HbF levels in the heterozygous Moroccan population Wafaa Lemsaddek 1 Isabel Picanccedilo 2 Filomena Seuanes 2 Lahoucine Mahmal 3 Saacircd Benchekroun 3 Mohammed Khattab 4 Paulo Nogueira 5 Leonor Osoacuterio-Almeida 1

1Laboratoacuterio de Geneacutetica Molecular Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa Caparica Portugal 2Laboratoacuterio de Hematologia Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal 3Service Heacutemato-Oncologie Hocircpital 20 Ao t CHU Ibn Rochd Casablanca Morocco 4Service Heacutemato-Oncologie Peacutediatrique Hocircpital dEnfants CHU Rabat Morocco 5Observatoacuterio Nacional de Sauacutede Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal email Leonor Osoacuterio-Almeida (mlofctunlpt) Correspondence to Leonor Osoacuterio-Almeida Laboratoacuterio de Geneacutetica Molecular Secccedilatildeo Autoacutenoma de Biotecnologia Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa 2829-516 Caparica Portugal hellip Page 1 Spectrum of Thalassemia Mutations and HbF Levels in the Heterozygous Moroccan Population Wafaa Lemsaddek 1 Isabel Picanccedilo Lecture Notes in Computer Science Publisher Springer-Verlag Heidelberg ISSN 0302-9743 Subject Computer Science Volume 2812 2003 Title Algorithms in Bioinformatics Third International Workshop WABI 2003 Budapest Hungary September 15-20 2003 Proceedings Editors Gary Benson Roderic Page ISBN 3-540-20076-2 DOI 101007b13243 Chapter pp 124 - 138 Online Date December 2003 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron1 and Irena Rusu1

(1) IRIN Universiteacute de Nantes 2 Rue de la Houssiniegravere BP 92208 44322 Nantes Cedex 3 France Page 1 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron and Irena Rusu Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Received 23 October 2003 accepted 15 December 2003 Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes

H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Brain Aging Vol 2 No 3 2002 Page 9-22 Long-term Efficacy of Cholinesterase Inhibitors Serge Gauthier McGill Centre for Studies in Aging Quebec Canada Correspondence Dr Serge Gauthier FRCPC McGill Centre for Studies in Aging 6825 LaSalle Boulevard Verdun Quebec Canada H4H 1R3 Tel +1 514 766 2010 Fax +1 514 888 4050 Email sergegauthiermcgillca hellipAdditionally protein sequence analysis was performed using the following programs at the ExPASyndashwwwndashserver (httpwwwexpasych) softberry httpwwwsoftberrycomindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorgtoolsaacomp Plant Physiology December 2003 Vol 133 pp 2040ndash2047 Generation and Analysis of End Sequence Database for T-DNA Tagging Lines in Rice1 Suyoung An2 Sunhee Park2 Dong-Hoon Jeong Dong-Yeon Lee Hong-Gyu Kang Jung-Hwa Yu Junghe Hur Sung-Ryul Kim Young-Hea Kim Miok Lee Soonki Han Soo-Jin Kim Jungwon Yang Eunjoo Kim Soo Jin Wi Hoo Sun Chung Jong-Pil Hong Vitnary Choe Hak-Kyung Lee Jung-Hee Choi Jongmin Nam Seong-Ryong Kim Phun-Bum Park Ky Young Park Woo Taek Kim Sunghwa Choe Chin-Bum Lee and Gynheung An National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790ndash784 Korea (SA SP D-HJ D-YL H-GK J-HY JH S-RK Y-HK ML GA) Department of Life Science Sogang University Seoul 121ndash742 Korea (SH S-JK S-RK) Department of Genetic Engineering Suwon University Suwon 445ndash743 Korea (JY EK P-BP) Department of Biology Sunchon National University Sunchon 540ndash742 Korea (SJW KYP) Department of

Biology Yonsei University Seoul 120ndash749 Korea (HSC J-PH WTK) Department of Biology Seoul National University Seoul 151ndash747 Korea (VC SC) Department of Biology Dong-eui University Pusan 614ndash714 Korea (H-KL J-HC C-BL) and Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University 208 Mueller Laboratory University Park Pennsylvania 16802 (JN) hellipIf a particular sequence had not yet been annotated in the public database the sequence surrounding the insertion site was annotated using the Softberry program (httpwwwsoftberrycom) and the GeneMark program (httpopalbiologygatecheduGeneMark) Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie F Moehrlen ndash Page 1 Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie Frank Moumlhrlen Heidelberg 2002 Page 2 INAUGURAL-DISSERTATION zur heidiubuni-heidelbergde

  • FGENES
  • FGENESH
  • FGENESH++
    • Genome Research 15566-576 2005
    • ECgene Genome-based EST clustering and gene modeling for alternative splicing
    • Genome Research 14685-692 2004
    • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
    • Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes
    • Genome Research
      • Computing Center Academia Sinica Taipei 11529 Taiwan
        • FGENESH+
          • Genome Research 14685-692 2004
          • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
            • Computing Center Academia Sinica Taipei 11529 Taiwan
                • FGENESB
                  • Proteorhodopsin genes are distributed among divergent marine bacterial taxa
                  • Different SAR86 subgroups harbour divergent proteorhodopsins
                    • FGENESV
                      • Genome Organization of the SARS-CoV
                        • FGENES-M
                        • BESTORF
                        • PROTCOMP
                          • A proteomic study of the arabidopsis nuclear matrix
                          • Gene prediction in eukaryota
                            • BPROM
                            • SPLICEDB
                              • The evolving roles of alternative splicing
                              • SpliceDB database of canonical and non-canonical mammalian splice sites
                                • SCAN2
                                  • PromH promoters identification using orthologous genomic sequences
                                    • PDISORDER
                                    • SPL
                                    • NSITE
                                    • TSSP
                                    • PLANTPROM
                                      • Plant promoter prediction with confidence estimation
                                      • PlantProm a database of plant promoter sequences
                                        • PROMH
                                        • Other FGENESH
                                          • TAG Theoretical and Applied Genetics
                                            • MIPS analysis and annotation of proteins from whole genomes
                                              • Gene expression of a gene family in maize based on noncollinear haplotypes
                                              • TAG Theoretical and Applied Genetics
                                              • Genome annotation techniques new approaches and challenges
                                                • Various programs
                                                  • Characterizing the new transcription regulator protein p60TRP
                                                  • The Ensembl Core Software Libraries
                                                  • Lecture Notes in Computer Science
                                                  • Long-term Efficacy of Cholinesterase Inhibitors
Page 7: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)

3 Corresponding author E-mail brentcsewustledu fax (314) 935-7302 Finally we compared TWINSCAN with two other gene-prediction systems that have recently been developed for nematodesndashFGENESH (Salamov and Solovyev 2000 Plant Physiology April 2005 Vol 137 pp 1174-1181 UPDATE ON SEQUENCING MEDICAGO TRUNCATULA AND LOTUS JAPONICUS Sequencing the Genespaces of Medicago truncatula and Lotus japonicus1 Nevin D Young Steven B Cannon Shusei Sato Dongjin Kim Douglas R Cook Chris D Town Bruce A Roe and Satoshi Tabata Department of Plant Pathology University of Minnesota St Paul Minnesota 55108 (NDY SBC) Kazusa DNA Research Institute Kisarazu Chiba 292ndash0818 Japan (SS ST) Department of Plant Pathology University of California Davis California 95616 (DRC DK) The Institute for Genomic Research Rockville Maryland 20850 (CDT) and Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma 73019 (BAR) 1 The US component of the Medicago truncatula sequencing effort was initially supported by a grant from the Samuel Roberts Noble Foundation to BAR Current support comes from National Science Foundation Plant Genome Research Program (grant no 0110206 to DRC DK CDT and NDY and grant no 0321460 to NDY BAR and CDT) Funding for Lotus japonicus sequencing comes from the Kazusa DNA Research Institute Foundation wwwplantphysiolorgcgidoi101104pp104057034 Corresponding author e-mail nevinyumnedu fax 612ndash625ndash9728 hellip (These estimates are based on Fgenesh predictions [Salamov and Solovyev 2000 ] using a Mt-trained matrix retaining peptides with a BLASTP match at 10endash4 to the UniProt NREF100 database of peptides [Apweiler et al 2004 ] This estimate for Lj differs from the published value of 1 gene per 101 kb [Asamizu et al 2003a ] due to the use here of the Fgenesh gene-calling algorithm so Mt and Lj could be compared directly)hellip hellipThis estimate increases to 6500 when Lj genes are predicted by the Mt-trained Fgenesh algorithm described earlierhellip International Journal of CancerVolume 109 Issue 1 Pages 71 - 75Cancer Genetics Candidate regions of tumor suppressor locus on chromosome 9q311 in gastric cancer Naoto Kakinuma 1 Kazuyoshi Kohu 1 2 Masaaki Sato 1 Tatsuya Yamada 3 Motowo Nakajima 1 Tetsu Akiyama 2 Susumu Ohwada 3 Yasuhiko Shibanaka 1

1Novartis Pharma Tsukuba Research Institute Ibaraki Japan 2Laboratory of Molecular and Genetic Information Institute for Molecular and Cellular Biosciences University of Tokyo Tokyo Japan 3Second Department of Surgery Gunma University School of Medicine Gunma Japan email Yasuhiko Shibanaka (yasuhikoshibanakapharmanovartiscom) Correspondence to Yasuhiko Shibanaka Novartis Pharma Tsukuba Research Institute Ohkubo 8 Tsukuba-shi Ibaraki 300-2611 Japan Fax +81-29-865-2281

predict the genes between D9S277 and D9S127 in 9q311 the gene prediction tools also in the UCSC Genome Browser having Acembly Ensembl Fgenesh GenScan and Genome Research 1554-66 2005 Gene and alternative splicing annotation with AIR Liliana Florea145 Valentina Di Francesco2 Jason Miller1 Russell Turner1 Alison Yao2 Michael Harris2 Brian Walenz1 Clark Mobarry1 Gennady V Merkulov3 Rosane Charlab3 Ian Dew1 Zuoming Deng3 Sorin Istrail1 Peter Li2 and Granger Sutton1 1 Informatics Research Applied Biosystems Rockville Maryland 20850 USA 2 Advanced Solutions Celera Genomics Rockville Maryland 20850 USA 3 Scientific Content and Applications Celera Genomics Rockville Maryland 20850 USA Ab initio prediction programs such as GenScan (Burge and Karlin 1997 ) FGenesH (Salamov and Solovyev 2000 ) Genie (Kulp et al Nucleic Acids Research 2005 Vol 33 Database issue D399-D402 SilkDB a knowledgebase for silkworm biology and genomics Jing Wang Qingyou Xia Ximiao He Mingtao Dai Jue Ruan Jie Chen Guo Yu Haifeng Yuan Yafeng Hu Ruiqiang Li Tao Feng Chen Ye Cheng Lu Jun Wang Songgang Li Gane Ka-Shu Wong Huanming Yang Jian Wang Zhonghuai Xiang Zeyang Zhou and Jun Yu

1 2 34 3 34 3 3

3 3 3 3 3 2 135

1 36 35 35 2

2 35

1 College of Life Sciences Peking University Beijing 100871 China 2 The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China 3 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 4 Graduate School of the Chinese Academy of Sciences Yuquan Road 19A Beijing 100039 China 5 Key Laboratory of Bioinformatics of Zhejiang Province Hangzhou Genomics Institute James D Watson Institute of Genome Sciences of Zhejiang University Hangzhou 310007 China and 6 Department of Medicine University of Washington Genome Center Seattle WA 98195 USA To whom correspondence should be addressed Tel +86 10 80481455 Fax +86 10 80498676 Email junyugenomicsorgcn Correspondence may also be addressed to Zeyang Zhou Tel +86 23 68251123 Fax +86 23 68251128 Email zyzhouswaucqcn The authors wish it to be known that in their opinion the first three authors should be regarded as joint First Authors BGF is a self-developed ab initio program based on GenScan (9) and FgeneSH (10) and was successfully utilized for our rice genome annotation (11) Clinical Cancer Research Vol 11 4029-4036 June 1 2005 Imaging Diagnosis Prognosis A Molecular Signature in Superficial Bladder Carcinoma Predicts Clinical Outcome Lars Dyrskjoslasht Karsten Zieger Mogens Kruhoslashffer Thomas Thykjaer Jens L Jensen Hanne Primdahl Natasha Aziz Niels Marcussen Klaus Moslashller and Torben F Oslashrntoft

1 12 15 15 4

1 6 3 2 1

Authors Affiliations 1 Molecular Diagnostic Laboratory Department of Clinical Biochemistry 2 Department of Urology and 3 University Institute of Pathology Aarhus University Hospital 4 Departments of Theoretical Statistics

and Mathematical Sciences University of Aarhus 5 Aros Applied Biotechnology Aarhus Denmark and 6 Eos BiotechnologyProtein Design Labs Fremont California Requests for reprints Torben F Oslashrntoft Molecular Diagnostic Laboratory Department of Clinical Biochemistry Aarhus University Hospital Skejby DK-8200 Aarhus N Denmark Phone 45-89495100 Fax 45-89496018 E-mail orntoftkiaudk array comprising 59619 probe sets representing 46000 unique sequences including known genes expressed sequence tag clusters and FGENESH-predicted exons BMC Evolutionary Biology 2005 51 doi1011861471-2148-5-1 Research article The WRKY transcription factor superfamily its origin in eukaryotes and expansion in plants Yuanji Zhang and Liangjiang Wang Address Plant Biology Division The Samuel Roberts Noble Foundation Ardmore OK 73402 USA Email Yuanji Zhang - yjzhangnobleorg Liangjiang Wang - Kevinlwangaolcom Corresponding author Despite minor differences in the gene structure prediction both gene prediction programs FGENESH and GENSCAN agree on the major features of the protein Nature Immunology 2005 v6 n3 The immunoglobulin heavy-chain locus in zebrafish identification and expression of a previously hellip N Danilova J Bussmann K Jekosch LA Steiner - - naturecom Nature Immunology Full text access provided to Googlebot Access by Web Services PLoS Biol 2005 June 3(6) e181 Published online 2005 May 24 doi 101371journalpbio0030181 RAG1 Core and V(D)J Recombination Signal Sequences Were Derived from Transib Transposons Vladimir V Kapitonov 1 and Jerzy Jurka 1

1Genetic Information Research Institute Mountain View California United States of America David Nemazee Academic EditorScripps Research Institute United States of America

Corresponding author Vladimir V Kapitonov vladimirgirinstorg Jerzy Jurka jurkagirinstorg Using FGENESH [33] we detected that the RAG1 corendashlike open reading frame (ORF) in the contig 29068 forms a terminal exon (positions 1154ndash2947) of an Genetics Published Articles Ahead of Print published on February 16 2005 as 101534genetics104036327 Identification and Characterization of Regions of the Rice Genome Associated with Broad- Spectrum Quantitative Disease Resistance Randall J Wisser Qi Sundagger Scot H Hulbertsect Stephen Kresovich and Rebecca J Nelsondaggerdagger 1 Department of Plant Breeding and Genetics Institute for Genomic Diversity Cornell University Ithaca New York 14853 daggerComputational Biology Service Unit Cornell Theory

Center Cornell University Ithaca New York 14853 sectDepartment of Plant Pathology Kansas State University Manhattan Kansas 66506 and daggerdaggerDepartment of Plant Pathology Cornell University Ithaca New York 14853 1Corresponding author Rebecca J Nelson Department of Plant Pathology Cornell University 321 Plant Science Ithaca NY 14853 Email rjn7cornelledu GENSCAN (B URGE and K ARLIN 1997) and FGENESH (S ALAMOV and S OLOVYEV 2001) to predict open reading frames Further searches against Plant Physiology January 2005 Vol 137 pp 176-189 Annotations and Functional Analyses of the Rice WRKY Gene Superfamily Reveal Positive and Negative Regulators of Abscisic Acid Signaling in Aleurone Cells1[w]

Zhen Xie Zhong-Lin Zhang Xiaolu Zou Jie Huang Paul Ruas Daniel Thompson and Qingxi J Shen

2 23

Department of Biological Sciences University of Nevada Las Vegas Nevada 891541 This work was supported by the US Department of Agriculture (grant no 02ndash35301ndash12066) by the National Institutes of Health (Biomedical Research Infrastructure Network seed grant no P20 RR16464) and by the University of Las Vegas Nevada (start-up funds to QJS) XZ was supported by a National Science Foundation Experimental Program to Stimulate Competitive

Research (EPSCoR) Integrative Approaches to Abiotic Stress (EPSndash0132556) graduate assistantship and Z-LZ by a National Science Foundation EPSCoR Advanced Computing in Environmental Sciences postdoctoral fellowship 2 These authors contributed equally to the paper 3 Present address Department of Plant and Microbial Biology University of California Berkeley CA 94720 and Plant Gene Expression Center US Department of Agriculture Albany CA 94710 [w] The online version of this article contains Web-only data First of all three genes (OsWRKY41 -43 and -44) were reannotated using FGENESH (wwwsoftberrycom) because the first introns of these genes were too small PLoS Biol 3(6) e181 (2005) RAG1 Core and V(D)J Recombination Signal Sequences Were Derived from Transib Transposons Vladimir V Kapitonov1 Jerzy Jurka1 1 Genetic Information Research Institute Mountain View California United States of America To whom correspondence should be addressed E-mail vladimirgirinstorg (VVK) E-mail jurkagirinstorg (JJ) Using FGENESH [33] we detected that the RAG1 corendashlike open reading frame (ORF) in the contig 29068 forms a terminal exon (positions 1154ndash2947) of an Plant and Cell Physiology 2005 46(1)3-13 doi101093pcppci503 From Mapping to Sequencing Post-sequencing and Beyond Takuji Sasaki Takashi Matsumoto Baltazar A Antonio and Yoshiaki Nagamura 1

National Institute of Agrobiological Sciences 2-1-2 Kannondai Tsukuba Ibaraki 305-8602 Japan 1 Corresponding author E-mail tsasakiniasaffrcgojp

The gene predictions by programs such as Genescan (Burge and Karlin 1997 ) FGENESH [see Appendix 1 (4)] and Genemark [see Appendix 1 (5)] BLAST (Altschul et Improving the nutritional value of Golden Rice through increased pro-vitamin A content JA Paine CA Shipton S Chaggar RM Howells MJ hellip - Nature Biotechnology 2005 - naturecom Arabidopsis thaliana psy and rice psy (AY024351) genes identified genomic sequences of similarity in which genes were predicted using FGENESH algorithm with Genetics Published Articles Ahead of Print published on January 16 2005 as 101534genetics104035543 THE GENETIC BASIS FOR INFLORESCENCE VARIATION BETWEEN FOXTAIL AND GREEN MILLET (POACEAE) Andrew N Doust Katrien M Devosdagger1 Mike D Gadberry Mike D Galedagger amp Elizabeth A Kellogg University of Missouri-St Louis Department of Biology One University Boulevard St Louis MO 63121 USA daggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH UK Current address University of Georgia-Athens Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building Athens GA 30602 USA 1 Each of these contigs was scanned using FgeneSH (S ALAMOV and S OLOVYEV 2000) and open reading frames (ORFs) were translated and PLoS Biol 3(1) e13 January 2005

Sorghum Genome Sequencing by Methylation Filtration Joseph A Bedell1 Muhammad A Budiman2 Andrew Nunberg1 Robert W Citek1 Dan Robbins1 Joshua Jones2 Elizabeth Flick2 Theresa Rohlfing3 Jason Fries3 Kourtney Bradford3 Jennifer McMenamy3 Michael Smith4 Heather Holeman4 Bruce A Roe5 Graham Wiley5 Ian F Korf6 Pablo D Rabinowicz7 Nathan Lakey8 W Richard McCombie9 Jeffrey A Jeddeloh4 Robert A Martienssen9 1 Bioinformatics Orion Genomics Saint Louis Missouri United States of America 2 Library Construction Orion Genomics Saint Louis Missouri United States of America 3 Sequencing Orion Genomics Saint Louis Missouri United States of America 4 Biomarkers Orion Genomics Saint Louis Missouri United States of America 5 Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma United States of America 6 Genome Center University of California Davis California United States of America 7 The Institute for Genomic Research Rockville Maryland United States of America 8 Business Orion Genomics Saint Louis Missouri United States of America 9 Cold Spring Harbor Laboratory Cold Spring Harbor New York United States of America additional parameters wordmask=seg lcmask M=1 N=ndash1 Q=3 R=3 kap E=1e-10 hspmax=0 To look for potentially novel genes we used Fgenesh (httpwww BMC Genomics 2005 611 doi1011861471-2164-6-11 FAM20 an evolutionarily conserved family of secreted proteins expressed in hematopoietic cells Demet Nalbant1 Hyewon Youn1 3 S Isil Nalbant1 Savitha Sharma1 Everardo Cobos2 3 Elmus G Beale1 Yang Du1 and Simon C Williams1 3 1Department of Cell Biology and Biochemistry Texas Tech University Health Sciences Center

Lubbock Texas 79430 USA 2Department of Internal Medicine Texas Tech University Health Sciences Center Lubbock Texas 79430 USA 3Southwest Cancer Center at University Medical Center Lubbock Texas 79430 USA These results were compared against genes assembled by two gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmit Plant Physiology July 2005 Vol 138 pp 1205-1215 Complex Organization and Evolution of the Tomato Pericentromeric Region at the FER Gene Locus1[w] Romain Guyot Xudong Cheng Yan Su Zhukuan Cheng Edith Schlagenhauf Beat Keller and Hong-Qing Ling

2 2

Institute of Plant Biology University of Zurich 8008 Zurich Switzerland (RG ES BK H-QL) and Institute of Genetics and Developmental Biology Chinese Academy of Sciences Chaoyang District Beijing 100101 China (XC YS ZC H-QL) Putative genes were determined by a combination of coding region prediction software (GENSCAN FGENESH and MZEF with Arabidopsis andor monocot matrix J Gen Virol 86 (2005) 973-983 DOI 101099vir080833-0 Cloning characterization and analysis by RNA interference of various genes of the Chelonus inanitus polydnavirus Marianne Bonvin Dorothee Marti Stefan Wyder Dejan Kojic Marc Annaheim and Beatrice Lanzrein Institute of Cell Biology University of Berne Baltzerstrasse 4 CH-3012 Bern Switzerland Correspondence Beatrice Lanzrein beatricelanzreinizbunibech 12g1forw (5-GAGTCCATGCCGAATGTCAC-3) and 12g1rev (5-CTTCTTGCACAGCGACGAAC-3) were set to amplify the middle region of 12g1 as predicted with FGENESH 10 and The Plant Cell 17343-360 (2005) Evolution of DNA Sequence Nonhomologies among Maize Inbreds Stephan Brunner Kevin Fengler Michele Morgante Scott Tingey and Antoni Rafalski a1 a b a a

a DuPont Crop Genetics Research Wilmington Delaware 19880-353 b Universitaacute degli Studi di Udine Dipartimento di Scienze Agrarie ed Ambientali 33100 Udine Italy 1 To whom correspondence should be addressed E-mail stephanbrunnercgrdupontcom fax 302-695-2726 PNAS | February 1 2005 | vol 102 | no 5 | 1566-1571 Published online before print January 24 2005 101073pnas0409421102 A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome

Mark Yandell Adina M Bailey Sima Misra ShengQiang Shu Colin Wiel Martha Evans-Holm Susan E Celniker and Gerald M Rubin para para Howard Hughes Medical Institute and Department of Molecular and Cell Biology University of California Life Sciences Addition Berkeley CA 94720-3200 and paraDepartment of Genome Sciences Lawrence Berkeley National Laboratory One Cyclotron Road Mailstop 64-121 Berkeley CA 94720 genes based on a microarray-based approach that involved hybridizing randomly primed cDNA against probes corresponding to a large set of FGENESH predictions Insect Molecular Biology Volume 14 Issue 2 Page 113 - 119 April 2005 doi101111j1365-2583200400536x Detection and analysis of alternative splicing in the silkworm by aligning expressed sequence tags with the genomic sequence X-F Zha Q-Y Xia P Zhao J Li J Duan Z-L Wang J-F Qian and Z-H Xiang Correspondence Dr Qing-You Xia The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China Tel +86 23 68250748 fax +86 23 68251128 e-mail xiaqyswaucqcn previously predicted silkworm genes in the genomic sequences by BGF a newly developed program based on GENSCAN (Burge amp Karlin 1997) and Fgenesh (Salamov amp Microbiology 151 (2005) 2199-2207 DOI 101099mic027962-0 Overproduction purification and characterization of FtmPT1 a brevianamide F prenyltransferase from Aspergillus fumigatus Alexander Grundmann and Shu-Ming Li Pharmazeutische Biologie Pharmazeutisches Institut Eberhard-Karls-Universitaumlt Tuumlbingen Auf der Morgenstelle 8 72076 Tuumlbingen Germany Correspondence Shu-Ming Li shumingliuni-tuebingende FGENESH (Softberry Inc httpwwwsoftberrycomberryphtml) and the DNASIS software package (version 21 Hitachi Software Engineering) were used for

FGENESH++ Am J Hum Genet 76652-662 2005 Position Effects Due to Chromosome Breakpoints that Map 900 Kb Upstream and 13 Mb

Downstream of SOX9 in Two Patients with Campomelic Dysplasia

Gopalrao V N Velagaleti12 Gabriel A Bien-Willner3 Jill K Northup1 Lillian H Lockhart2

Judy C Hawkins2 Syed M Jalal6 Marjorie Withers3 James R Lupski345 and

Pawel Stankiewicz3 Departments of 1Pathology and 2Pediatrics University of Texas Medical Branch Galveston

Departments of 3Molecular and Human Genetics and 4Pediatrics Baylor College of Medicine and

5Texas Childrens Hospital Houston and 6Department of Laboratory Medicine and Pathology Mayo Clinic Rochester MN hellipIn an effort to identify possible transcripts that may be responsible for the CD phenotype we

used several gene-prediction programs and identified seven hypothetical transcripts in the region

that spans 100 kb in either direction from the breakpoint on chromosome 17 Ecgenes H17C123061 and H17C123081 SGP genes Chr17_15381 and Ch17_15391 Fgenesh++ gene C17001650 and Genscan genes NT_01064144 and NT_01064145hellip Genome Research 15566-576 2005 ECgene Genome-based EST clustering and gene modeling for alternative splicing Namshin Kim Seokmin Shin and Sanghyuk Lee12 2 13 1 Division of Molecular Life Sciences Ewha Womans University Seoul 120-750 Korea 2 School of Chemistry Seoul National University Seoul 151-747 Korea hellipthe structure of full-length mRNA can be inferred by examining the flanking genomic region especially with the aid of ab initio gene predicting programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh++ (Salamov and Solovyev 2000 )hellip Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA hellip we built a three-way synteny map based on chains of Fgenesh++-predicted (Solovyev 2002 ) exons rather than whole genes hellip Genome Research 14539-548 2004 Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of

Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA hellipThe gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom )hellip Nucleic Acids Research 2003 Vol 31 No 1 207-211 copy 2003 Oxford University Press The PEDANT genome database Dmitrij Frishman Martin Mokrejs Denis Kosykh Gabi Kastenmuumlller Grigory Kolesov Igor Zubrzycki Christian Gruber Birgitta Geier Andreas Kaps Kaj Albermann Andreas Volz Christian Wagner Matthias Fellenberg Klaus Heumann and Hans-Werner Mewes

1 1 1 1 1

1 2 2 2 2

2 2 2 2 13

1 Institute for Bioinformatics GSF - National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 85764 Neueherberg Germany 2 Biomax Informatics AG Lochhamer Straszlige 11 82152 Martinsried Germany 3 Department of Genome-oriented Bioinformatics Wissenschaftszentrum Weihenstephan Technische Universitaumlt Muumlnchen 85350 Freising Germany To whom correspondence should be addressed Tel +49 89 31874201 Fax +49 89 31873585 Email dfrishmangsfde The mouse database contains 20 chromosome contigs with 37 793 genes predicted using the Fgenesh++ software (wwwsoftberrycom) Reprint from Daily Biotech Updates www genengnewscom Vol 22 No 17 October 1 2002 DrugDiscovery Tech NoteAn Enhanced Human-Genome Database Transforming Raw Human Sequence Data Into Useful Information Christine Schuumlller PhD and Andreas Fritz PhD The Softberry analysis results for which Biomax has the exclusive world-wide commercial license contain approximately 40000 genes which agrees well with predictions of the total number of human genes (according to the International Human Genome Sequencing Consortium or IHGSC) hellip For example 50 of the genes in the Biomax Human Genome Database are not found in the Ensembl database These genes (identified by FGENESH++ and Biomax and not found in Ensembl database) comprise the following 6 of genes classified as known genes 50 classified as having some similarity to known genes and 90 of the genes not having similarity to known genes

For human genome applications the FGENESH++ software was first used to map known human genes using sequences available from the Reference Sequence (RefSeq) Project at the Nation al Center for Biotechnology Information (NCBI Bethesda MD wwwncbinlm nihgovLocusLinkrefseqhtml) REFERENCES 1 Salamov AA and Solovyev VVAb initio gene finding in Drosophila genomic DNA Genome Res 10 391ndash7 (2000) Genome Research 14539-548 2004 ISSN 1088-9051 $500 Letter Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA The gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom ) Published online before print June 12 2003 101101gr529803 Genome Research 131765-1774 2003 ISSN 1088-9051 $500 Identification of Promoter Regions in the Human Genome by Using a Retroviral Plasmid Library-Based Functional Reporter Gene Assay Shirin Khambata-Ford15 Yueyi Liu2 Christopher Gleason1 Mark Dickson3 Russ B Altman2 Serafim Batzoglou4 and Richard M Myers136 1 Department of Genetics Stanford University School of Medicine Stanford California 94305 USA 2 Stanford Medical Informatics Stanford University School of Medicine Stanford California 94305 USA 3 Stanford Human Genome Center Stanford University School of Medicine Stanford California 94305 USA 4 Department of Computer Science Stanford University Stanford California 94305 USA hellipOf 858 sequences 9 of GFP+ low clones and 8 of GFP+ high clones aligned to the 2-kb

segment upstream of the transcription start site of a predicted gene in at least two of four data sets of predicted genes from Genscan Ensembl Softberry (Fgenesh++) and Acembly (category

B in Table 1) Cell Vol 110 521ndash529 August 23 2002 Copyright 2002 by Cell Press HIV-1 Integration in the Human Genome Favors Active Genes and Local Hotspots

Astrid RW Schrouml der1 Paul Shinn2 Huaming Chen2 Charles Berry3 Joseph R Ecker2 and Frederic Bushman14 1Infectious Disease Laboratory 2Genomic Analysis Laboratory The Salk Institute 10010 North Torrey Pines Road La Jolla California 92037 3Department of FamilyPreventive Medicine School of Medicine University of California San Diego San Diego California 92093 hellipAn integration target sequence was scored as a part of a transcrip-tion unit if it was (1) a member of the Refseq set of well-studied genes (httpwwwncbinlmnihgovLocusLinkrefseqhtml) or (2) if it was predicted to be a transcription unit by the ENSEMBLE (httpwwwensemblorg) or Fgenesh++ (httpwwwsoftberrycomHelpfgeneshplus2htm) programs and if that assignment was supported by mRNA or spliced EST sequence evidence Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation

FGENESH+ Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA Fgenesh+ gene prediction is conducted on sequences with protein homology Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation Annual Review of Genomics and Human Genetics Vol 3 293-310 (Volume publication date September 2002) (doi101146annurevgenom3030502101529) DATABASES AND TOOLS FOR BROWSING GENOMES Ewan Birney 1 Michele Clamp and 2 Tim Hubbard2 1European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom e-mail birneyebiacuk 2Wellcome Trust Sanger Institute Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom michelesangeracuk thsangeracuk

Another predicted gene track on the UCSC browser comes from Softberry ( http wwwsoftberrycom ) and uses a program Fgenesh+ which is based on HMMs and

FGENESB Appl Environ Microbiol 2004 April 70(4) 2332ndash2341 Oxygen-Controlled Bacterial Growth in the Sponge Suberites domuncula toward a Molecular Understanding of the Symbiotic Relationships between Sponge and Bacteriadagger

Werner E G Muumlller Vladislav A Grebenjuk Narsinh L Thakur Archana N Thakur Renato Batel Anatoli Krasko Isabel M Muumlller and Hans J Breter Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz D-55099 Mainz Germany Corresponding author Mailing address Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz Duesbergweg 6 55099 Mainz Germany Phone 6131-3925910 Fax 6131-3925243 E-mail wmuellermailuni-mainzde For genes and potential promoter prediction we used the FGENESB-PatternMarkov chain-based bacterial operon and gene prediction program from the SoftBerry Journal of Theoretical Biology 230 (2004) 133ndash144 Computational prediction of conserved operons and phylogenetic footprinting of transcription regulatory elements in the metal-reducing bacterial family Geobacteraceae Bin Yana Barbara A Metheґ b Derek R Lovleyc Julia Krushkala aDepartment of Preventive Medicine Center of Genomics and Bioinformatics University of Tennesee Health Science Center 66 N Pauline St Ste 633 Memphis TN 38163 USA bThe Institute for Genomic Research Rockville MD USA cDepartment of Microbiology Morrill Science Center IV North University of Massachusetts 639 North Pleasant Str Amherst MA 01003 USA the conserved nature of the operons 2 Operons in Geobacter sulfurreducens were predicted ab initio by the public version of program FGENESB (V Solovyev and V Molecular Microbiology Volume 52 Issue 6 Page 1579 -1596 June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Copyright copy 2003 The National Academy of Sciences Proc Natl Acad Sci U S A 2003 October 28 100(22) 12830ndash12835 doi 101073pnas2133554100 Published online 2003 October 17 Evolution Proteorhodopsin genes are distributed among divergent marine bacterial taxa Joseacute R de la TorredaggerDagger Lynne M Christiansondagger Oded Beacutejagravedaggersect Marcelino T Suzukidaggerpara David M Karl John Heidelberg and Edward F DeLongdaggerdaggerdagger

daggerMonterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 sectDepartment of Biology Technion-Israel Institute of Technology Haifa 32000 Israel

paraChesapeake Biological Laboratory University of Maryland Solomons MD 20688 Department of Oceanography University of Hawaii Manoa HI 96822 and Institute for Genomic Research Rockville MD 20850 Edited by Sallie W Chisholm Massachusetts Institute of Technology Cambridge MA and approved August 21 2003 (received for review 2003 June 10) DaggerPresent address Department of Civil and Environmental Engineering University of Washington Seattle WA 98195 daggerdagger To whom correspondence should be addressed E-mail delongmbariorg hellip Analysis of the potential genes and protein-coding regions was performed by using a combination of the BLAST (11) GLIMMER 202 (TIGR) (12 13) FGENESB (Softberry Mount Kisco NY) and ARTEMIS (Sanger Center Cambridge University UK) (14) software packages Environmental Microbiology September 2004 vol 6 no 9 pp 903-910(8) DOI 101111j1462-2920200400676x Different SAR86 subgroups harbour divergent proteorhodopsins Gazalah Sabehi1 Oded Beacutejagrave1 Marcelino T Suzuki2 Christina M Preston3 Edward F DeLong4

Affiliations 1 Department of Biology Technion-Israel Institute of Technology Haifa 32000 Israel 2 Chesapeake Biological Laboratory University of Maryland Center for Environmental Sciences Solomons MD 20688 USA 3 Monterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 USA 4 Massachusetts Institute of Technology Cambridge MA 02139 USA

program FGENESB (Softberry) and the annotation was subsequently refined and curated manually using ARTEMIS (Sanger Center) Fig

FGENESV Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song Qi Wei Qin Jin Qiu Can Hua Huang Fan Wang and Choy Leong Hew1 2 1 1 1 1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2

A total of 162 ORFs predicted by the FGENESV program (available through httpwwwsoftberrycom) supplemented with Vector NTI suite 71 are indicated doi101023BVIRU000002577148128f8 Virus Genes 28 (3) 239-246 April 2004 Article ID 5269250 Complete Nucleotide Sequence of a Strawberry Isolate of Beet Pseudoyellows Virus Ioannis E Tzanetakis Molecular and Cellular Biology Program Department of Botany and Plant Pathology Oregon State University Corvallis 97331 USA Robert R Martin Horticultural Crops Research Laboratory USDA-ARS Corvallis OR 97330 USA E-mail martinrrscienceoregonstateedu httpwwwncbinlmnih govgorfgorfhtml) and the gene finder in viruses at httpwwwsoftberrycom The amino acid comparisons Geno Prot amp Bioinfo Vol 1 No 3 August 2003 226-235 Genome Organization of the SARS-CoV Jing Xu1 Jianfei Hu21 Jing Wang21 Yujun Han1 Yongwu Hu13 Jie Wen1 Yan Li1 Jia Ji1 Jia Ye14 Zizhang Zhang5 Wei Wei4 Songgang Li12 Jun Wang1 Jian Wang14 Jun Yu14 and Huanming Yang14 1 Beijing Genomics Institute Chinese Academy of Sciences Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3Wenzhou Medical College Wenzhou 325003 China 4 James D Watson Institute of Genome Sciences Zhijiang Campus Zhejiang University and Hangzhou Genomics Institute Hangzhou 310008 China 5 College of Materials Science and Chemical Engineering Yuquan Campus Zhejiang University Hangzhou 310027 China These authors contributed equally to this work Corresponding authors E-mail junyugenomicsorgcn yanghmgenomicsorgcn hellipFGENESV a program for gene prediction provided by Softberry Inc (Mount Kisco USA) through a web-based interface has been specially modimacred and trained with parameters for virus (httpwwwsoftberrycomberryphtmltopic= gfindv) hellip

The hypothetical minus sense ORF iden-timacred by FGENESV (from 48 to 203 nt on the minus strand or 29523 to 29678 nt on the plus strand) may be fake but we should not absolutely deny the prob-ability of the existence of minus ORFs hellipFurthermore we employed FGENESV to explore the sequences of MHV (NC 001846 in NCBI) and AIBV (NC 001451 in NCBI) and compared the re-sults with their previous annotations respectively Rapport de stage de DEA Juin 2003 Analyse du geacutenome du virus de lrsquoarcheacutee Pyrococcus abyssi (PAV1) ROUAULT Karen Laboratoire de Microbiologie et Biotechnologie des Extrecircmophiles IFREMER- Centre de Brest et Equipe Microbiologie LEMAR ndash Institut Universitaire Europeacuteen de la Mer [14] FGENESV httpwwwsoftberrycomberry phtmltopic=gfindv Virus ( gt10 kb) Modegraveles de Markov Forme du geacutenome Code geacuteneacutetique [40]

FGENES-M BMC Bioinformatics 2005 6 25 Published online 2005 February 10 doi 1011861471-2105-6-25 Integrating alternative splicing detection into gene prediction Sylvain Foissac 1 and Thomas Schiex11Uniteacute de Biomeacutetrie et Intelligence Artificielle INRA 31326 Castanet Tolosan France

Corresponding author Sylvain Foissac foissactoulouseinrafr Thomas Schiex tschiextoulouseinrafrReceived July 27 2004 Accepted February 10 2005 This approach has been applied eg in HMMgene or in FGENES-M (unpub)hellip DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39 - 43 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom) DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39-43 DOI 1010801042517032000160189 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom)

BESTORF Eukaryotic Cell March 2005 p 526-535 Vol 4 No 3 1535-977805$0800+0 doi101128EC43526-5352005 Sex-Specific Homeodomain Proteins Sxi1 and Sxi2a Coordinately Regulate Sexual Development in Cryptococcus neoformans Christina M Hull1 Marie-Josee Boily and Joseph Heitman1 12 Department of Molecular Genetics and Microbiology1 the Howard Hughes Medical Institute Duke University Medical Center Durham North Carolina2 Received 14 January 2005 Accepted 14 January 2005 Corresponding author Mailing address 322 CARL Building Box 3546 Department of Molecular Genetics and Microbiology 100 Research Dr Duke University Medical Center Durham NC 27710 Phone (919) 684-2824 Fax (919) 684-5458 E-mail heitm001dukeeduPresent address Departments of Biomolecular Chemistry and Medical Microbiology amp

Immunology University of WisconsinmdashMadison Madison WI 53706 Sequence manipulations Splice predictions of candidate gene sequences for SXI2a were facilitated with a Softberry algorithm (wwwsoftberrycom) hellipWe utilized the BESTORF gene prediction algorithm from Softberry Inc to electronically produce predicted spliced cDNA products encoded by a 10-kb regionhellip

PROTCOMP Genes and Immunity 2005 v5 n4 - naturecom Immune response in silico(IRIS) immune-specific genes identified from a compendium of microarray hellip AR Abbas D Baldwin Y Ma W Ouyang A Gurney F hellip The Protcomp algorithm (Softberry Inc) predicts for the 1589 IRIS genes with ORFs that 24 of the encoded proteins are in the plasma membrane 13 are MPMI Vol 17 No 7 2004 pp 789ndash797 Publication no M-2004-0426-01R copy 2004 The American Phytopathological Society Lotus japonicus LjKUP Is Induced Late During Nodule Development and Encodes a Potassium Transporter of the Plasma Membrane Guilhem Desbrosses Claudia Kopka Thomas Ott and Michael K Udvardi Max Planck Institute of Molecular Plant Physiology Am Muumlhlenberg 1 14476 Golm Germany Submitted 3 November 2003 Accepted 13 February 2004 hellipBoth PSORT and Protcomp predicted a PM location for LjKUPhellip Planta DOI 101007s00425-003-1182-5 Issue Volume 218 Number 6 Date April 2004 Pages 965 - 975 Biochemical and immunological characterization of pea nuclear intermediate filament proteins Sonal S D Blumenthal1 Gregory B Clark1 and Stanley J Roux1

(1) School of Biological Sciences Section of Molecular Cell and Developmental Biology The University of Texas Austin TX 78712 USA Stanley J Roux Email srouxutsccutexasedu html) BCM Search Launcher (Protein structure prediction http searchlauncher bcmtmcedu) SoftBerry (Protein subcellular localization Comparative and Functional Genomics Volume 5 Issue 4 Pages 342 - 353Published Online 20 May 2004 Research Paper Investigation into the use of C- and N-terminal GFP fusion proteins for subcellular localization studies using reverse transfection microarrays Ella Palmer Tom Freeman

MRC Rosalind Franklin Centre for Genomics Research (formerly the HGMP-Resource Centre) Genome Campus Hinxton Cambridge CB10 1SB UK email Tom Freeman (tfreemanrfcgrmrcacuk) Correspondence to Tom Freeman RFCGR Hinxton CambridgeCB10 1SB UK

ProtComp version 4 (Softberry) combines results with proteins of known subcellular localization and assumed subcellular localization (based on theoret- ical Plant Physiol2004 134 286-295 RHM2 Is Involved in Mucilage Pectin Synthesis and Is Required for the Development Usadel et al Tentative subcellular localization prediction by TargetP (Emanuelsson et al 2000 ) or ProtComp (httpwwwsoftberrycom) a prediction software trained on Journal of Cellular BiochemistryVolume 90 Issue 2 Pages 361 - 378Published Online 3 Sep 2003 A proteomic study of the arabidopsis nuclear matrix Tomasz T Calikowski 1 3 Tea Meulia 2 Iris Meier 1

1Department of Plant Biology and Plant Biotechnology Center Ohio State University Columbus Ohio 43210 2Molecular and Cellular Imaging Center Ohio Agricultural and Research Development Center Ohio State University Columbus Ohio 43210 3Institute of Biochemistry and Biophysics Polish Academy of Sciences UL Pawinskiego 5A 02-106 Warszawa Poland email Iris Meier (meier56osuedu)Correspondence to Iris Meier Department of Plant Biology and Plant Biotechnology Center Ohio State University 244 Rightmire Hall 1060 Carmack Rd Columbus OH 43210 For prediction of subcellular localization ProtComp 4 (Softberry Inc Mount Kisco NY httpwwwsoftberrycomberryphtmltopicfrac14 proteinloc) PSORT v64 Cellular Molecular Life Sciences 2003 in press Automatic prediction of protein function Burkhard Rost 1 2 3 Jinfeng Liu 1 3 4 Rajesh Nair 1 5 Kazimierz O Wrzeszczynski 1 and Yanay Ofran 16

1 CUBIC Dept of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 2 Columbia University Center for Computational Biology and Bioinformatics (C2B2) Russ Berrie Pavilion 1150 St Nicholas Avenue New York NY 10032 USA

3 North East Structural Genomics Consortium (NESG) Department of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 4 Dept of Pharmacology Columbia Univ 630 West 168th Street New York NY 10032 USA 5 Dept of Physics Columbia Univ 538 West 120th Street New York NY 10027 USA 6 Dept of Medical Informatics Columbia Univ 630 West 168th Street New York NY 10032 USA Corresponding author cubiccubicbioccolumbiaedu URL httpcubicbioccolumbiaedu Tel +1-212-305-4018 fax +1-212-305-7932

genomelocalize ProtComp predict localization for plants httpwwwsoftberrycomberryphtmltopic=proteinloc Predotar predict Published online before print September 15 2003 101101gr1293003 Genome Research 132265-2270 2003 The Secreted Protein Discovery Initiative (SPDI) a Large-Scale Effort to Identify Novel Human Secreted and Transmembrane Proteins A Bioinformatics Assessment Hilary F Clark1 Austin L Gurney Evangeline Abaya Kevin Baker Daryl Baldwin Jennifer Brush Jian Chen Bernard Chow Clarissa Chui Craig Crowley Bridget Currell Bethanne Deuel Patrick Dowd Dan Eaton Jessica Foster Christopher Grimaldi Qimin Gu Philip E Hass Sherry Heldens Arthur Huang Hok Seon Kim Laura Klimowski Yisheng Jin Stephanie Johnson James Lee Lhney Lewis Dongzhou Liao Melanie Mark Edward Robbie Celina Sanchez Jill Schoenfeld Somasekar Seshagiri Laura Simmons Jennifer Singh Victoria Smith Jeremy Stinson Alicia Vagts Richard Vandlen Colin Watanabe David Wieand Kathryn Woods Ming-Hong Xie Daniel Yansura Sothy Yi Guoying Yu Jean Yuan Min Zhang Zemin Zhang Audrey Goddard William I Wood and Paul Godowski Departments of Bioinformatics Molecular Biology and Protein Chemistry Genentech Inc South San Francisco California 94080 USA 1 Corresponding author E-MAIL hclarkgenecomFAX (650) 225-5389 An automated computational strategy was utilized to query each protein translation with the Signal Sensor Sighmm Tmdetect (T Wu unpubl) hmmpfam (Eddy 1998 ) and Protcomp (Softberry Inc) algorithms hellipThe Protcomp algorithm predicts the subcellular localization of a protein on the basis of homology to well-annotated proteins a neural net and various protein motifs In this case the Protcomp subcellular localization prediction was used to categorize these genes as Other Secreted Other Transmembrane or Other Cytoplasmic or Nuclear Plant Physiol February 2002 Vol 128 pp 336-340 Gene prediction in eukaryota Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 genomelocalize ProtComp predict localization for plants httpwww softberrycomberryphtmltopic=proteinloc Predotar predict Proc Natl Acad Sci U S A 2001 April 24 98(9) 5341ndash5346 doi 101073pnas101534498 Published online 2001 April 17 Plant Biology The Cia5 gene controls formation of the carbon concentrating mechanism in Chlamydomonas reinhardtii Youbin Xiang Jun Zhang and Donald P Weeks

Department of Biochemistry and School of Biological Sciences University of Nebraska Lincoln NE 68588-0664

Edited by Bob B Buchanan University of California Berkeley CA and approved March 14 2001 (received for review 2000 November 8) To whom reprint requests should be addressed E-mail dweeks1unledu hellipComputer-assisted analysis of the CIA5 aa sequence (PROTCOMP version 4 httpwwwsoftberrycom) predicted a nuclear localization of the protein hellipFinally computer program predictions (eg PROTCOMP version 4 httpwwwsoftberrycom) for a nuclear localization of CIA5 and the clear-cut nuclear localization of CIA5 in onion epidermal cells (Fig 3) provide additional weight to the argument that CIA5 may be a transcription factor Dissertation zur Erlangung des akademischen Grades Dr rer nat der Fakultaumlt der Naturwissenschaften der Universitaumlt Ulm Untersuchungen zur Identifizierung von Faktoren und Mechanismen der mRNA 3 Prozessierung und Degradation in Chloroplasten houmlherer Pflanzen vorgelegt von Michael Walter aus Immenstadt i Allgaumlu Abteilung Molekulare Botanik Universitaumlt Ulm Ulm November 2001 Tag der Promotion 19 Feb 2002 Algorithmen zur Vorhersage der subzellulaumlren Lokalisation - PSORT httppsortnibbacjp8800formhtml (Nakai und Kanehisa 1992) - ChloroP httpwwwcbsdtudkservicesChloroP (Emanuelsson et al 2000) - TargetP httpwwwcbsdtudkservicesTargetP (Emanuelsson et al 2000) - Predotar httpwwwinrafrInternetProduitsPredotar - Softberry httpwwwsoftberrycom

BPROM Extremophiles Issue Volume 9 Number 2 Date April 2005 Pages 99 ndash 109 DOI 101007s00792-004-0425-0

The genome of BCJA1c a bacteriophage active against the alkaliphilic bacterium Bacillus clarkii

Andrew M Kropinski1 Melissa Hayward1 M Dorothy Agnew1 and Ken F Jarrell1

(1) Department of Microbiology and Immunology Queens University Kingston ON K7L 3N6 Canada

al 2002) Promoters were predicted using Softberryrsquos BPROM program at httpwwwsoftberry comberry phtmltopic=promoter

Journal of Bacteriology February 2005 p 1091-1104 Vol 187 No 3 0021-919305$0800+0 doi101128JB18731091-11042005

The Generalized Transducing Salmonella Bacteriophage ES18 Complete Genome Sequence and DNA Packaging Strategy

Sherwood R Casjens12 Eddie B Gilcrease1 Danella A Winn-Stapley1 Petra Schicklmaier3 Horst Schmieger3 Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24 Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah1 Department of Biological Sciences4 Pittsburgh Bacteriophage Institute University of Pittsburgh Pittsburgh Pennsylvania 2 Institut fuumlr Genetik und Mikrobiologie Universitaumlt Muumlnchen Munich Germany3 Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu Present address Biology Department MIT Cambridge MA 02139 Present address Biogen Idec GmbH D-85737 Ismaning Germany

Received 1 September 2004 Accepted 3 November 2004

The DNA sequence analysis software used was DNA Strider (24) GeneMark (5) Staden programs (78) BLAST (2) BPROM (httpwwwsoftberrycomberryphtmltopic

Infection and Immunity May 2005 p 2899-2909 Vol 73 No 5 Characterization of the Major Secreted Zinc Metalloprotease- Dependent GlycerophospholipidCholesterol Acyltransferase PlaC of Legionella pneumophila Sangeeta Banerji1 Mayte Bewersdorff1 Bjoumlrn Hermes1 Nicholas P Cianciotto2 and Antje Flieger1 Robert Koch-Institut Berlin Germany1 Department of Microbiology-Immunology Northwestern University Medical School Chicago Illinois2 Received 25 October 2004 Returned for modification 18 November 2004 Accepted 22 December 2004

Corresponding author Mailing address Robert Koch-Institut Research Group NG5 Pathogenesis of Legionella Infections Nordufer 20 D-13353 Berlin Germany Phone 49-30-4547-2522 Fax 49-30-4547-2328 E-mail fliegerarkide MB and BH contributed equally to this work

legion) (12) Nucleotide sequences were also analyzed for promoters using the web-based program BPROM (wwwsoftberrycom) Sequence Journal of Bacteriology April 2005 p 2458-2468 Vol 187 No 7 The Type III-Dependent Hrp Pilus Is Required for Productive Interaction of Xanthomonas campestris pv vesicatoria with Pepper Host Plants Ernst Weber1 Tuula Ojanen-Reuhs2 Elisabeth Huguet3 Gerd Hause4 Martin Romantschuk2 Timo K Korhonen2 Ulla Bonas13 and Ralf Koebnik1 Institute of Genetics1 Biozentrum Martin Luther University Halle Germany4 General Microbiology Faculty of Biosciences University of Helsinki Helsinki Finland2 Institut des Sciences Veacutegeacutetales CNRS Gif-sur-Yvette France3 Received 10 November 2004 Accepted 28 December 2004 Corresponding author Mailing address Martin-Luther-Universitaumlt Institut fuumlr Genetik Weinbergweg 10 D-06120 Halle (Saale) Germany Phone 49 345 5526293 Fax 49 345 5527151 E-mail koebnikgmxde Present address Purdue University Department of Food Sciences West Lafayette IN 47907 Present address Institut de Recherche sur la Biologie de lInsecte UMR CNRS 6035 Faculteacute

des Sciences F-37200 Tours France Present address University of Helsinki Department of Ecological and Environmental Sciences

FIN-15140 Lahti Finland

The promoter recognition program BPROM (Softberry Inc Mt Kisco NY) was used for prediction of bacterial sigma70 promoter motifs RESULTS

J Bacteriol 2004 September 186(17) 5945ndash5949 doi 101128JB186175945-59492004 Identification of Operators and Promoters That Control SXT Conjugative Transfer John W Beaber and Matthew K Waldor

Department of Microbiology Tufts University School of Medicine and Howard Hughes Medical Institute Boston Massachusetts Corresponding author Mailing address Tufts University School of Medicine 136 Harrison Ave Jaharis 425 Boston MA 02111 Phone (617) 636-2730 Fax (617) 636-2723 E-mail matthewwaldortuftsedu Received April 1 2004 Accepted May 24 2004 hellipComputer algorithms and 5prime random amplification of cDNA ends (RACE) were used to define the setR and s086 transcription start sites Software for the identification of bacterial promoters (httpwwwsoftberrycomberryphtmltopic=bpromampgroup=programsampsubgroup=gfindb) identified putative minus10 and minus35 elements for both PL and PR (Fig 2) (23 24)hellip JOURNAL OF BACTERIOLOGY Mar 2004 p 1818ndash1832 Vol 186 No 6

The pKO2 Linear Plasmid Prophage of Klebsiella oxytoca Sherwood R Casjens12 Eddie B Gilcrease1 Wai Mun Huang1 Kim L Bunny3

Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24

Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah 841321 Pittsburgh Bacteriophage Institute2 and Department of Biological Sciences4 University of Pittsburgh Pittsburgh Pennsylvania 15260 and Section of Microbiology University of California at Davis Davis California 956163

Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu hellipThe DNA sequence analysis software packages used were DNA Strider (27) GeneMark (8) the Staden programs (94) BLAST (3) BPROM httpwwwsoftberrycomberryphtmltopic_gfindb) and DNA Master (J Lawrence [httpcobamide2biopittedu])hellip

BMC Microbiology 2004 44 Analysis of the lambdoid prophage element e14 in the E coli K-12 genome Preeti Mehta1 Sherwood Casjens2 and Sankaran Krishnaswamy1 Address 1Bioinformatics Centre School of Biotechnology Madurai Kamaraj University Madurai-625021 India and 2University of Utah Medical School Department of Pathology 90 North 1900 East Salt Lake City UT 84132-2501 USA Email Preeti Mehta - mehta_p74yahoocom Sherwood Casjens - sherwoodcasjenspathutahedu Sankaran Krishnaswamy - krishnamrnatnnicin Corresponding author This article is available from httpwwwbiomedcentralcom1471-218044 Putative promoters predicted using BPROM available at the website http wwwsoftberrycom Scores are as given by BPROM Promoters Plant Molecular Biology 53 (6) 865-876 December 2003 Prokaryotic orthologues of mitochondrial alternative oxidase and plastid terminal oxidase Allison E McDonald Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Sasan Amirsadeghi Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Greg C Vanlerberghe Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada (e-mail gregvutscutorontoca The A variabilis PTOX sequence was analyzed in the upstream region of the start codon with Softberryrsquos BPROM software (httpwwwsoftberrycom)

SPLICEDB Plant Molecular Biology DOI 101007s11103-005-0271-1 Issue Volume 57 Number 3 Date February 2005 Pages 445 - 460 Evaluation of five ab initio gene prediction programs for the discovery of maize genes Hong Yao1 4 Ling Guo1 6 Yan Fu1 4 Lisa A Borsuk1 6 Tsui-Jung Wen2 David S Skibbe1 5 Xiangqin Cui1 4 9 Brian E Scheffler8 Jun Cao1 4 Scott J Emrich6 Daniel A Ashlock3 6 and Patrick S Schnable1 2 4 5 6 7

(1)Department of Genetics Development and Cell Biology Iowa State University Ames Iowa 50011-3650 (2) Department of Agronomy Iowa State University Ames Iowa 50011-3650 (3) Department of Mathematics Iowa State University Ames Iowa 50011-3650 (4) Inderdepartmental Graduate Programs in Genetics Iowa State University Ames Iowa 50011-3650 (5) Department of Molecular Cellular and Developmental Biology Iowa State University Ames Iowa 50011-3650 (6) Department of Electrical and Computer Engineering and Department of Bioinformatics and Computational Biology Iowa State University Ames Iowa 50011-3650 (7) Center for Plant Genomics Iowa State University Ames Iowa 50011-3650 (8) Mid South Area Genomics Facility USDA-ARS Stoneville MS 38776-0038 USA(9) Present address Department of Biostatistics Birmingham AL 35294 USA model FGENESH httpwwwsoftberrycom berryphtmltopic=fgeneshampgroup= programsampsubgroup=gfind Monocots Yes Yes Yes GHMM a GeneMark Finding short DNA motifs using permuted markov models X Zhao H Huang TP Speed The data are human donor sequences from SpliceDB [9] a recently developed database of known mammalian splice site sequences (httpwwwsoftberrycomspldb Current Opinion in Structural Biology 2004 14273ndash282 The evolving roles of alternative splicing Liana F Lareau1 Richard E Green1 Rajiv S Bhatnagar23 and Steven E Brenner12_ Departments of 1Molecular and Cell Biology and 2Plant and Microbial Biology University of California Berkeley California 94720 USA 3Department of Dermatology University of California San Francisco California 94143 USA _e-mail brennercompbioberkeleyedu [79] SpliceDB httpwwwsoftberrycomberryphtmltopicfrac14splicedb Database and composition statistics for mammalian splice sites inferred from ESTs [80] Yearbook of Medical Informatics Review Paper 2004 121-136 Curated databases and their role in clinical bioinformatics CC Englbrecht M Han MT Mader A Osanger KFX Mayer MIPS Institute for Bioinformatics Address of the authors Claudia C Englbrecht Michael Han

Michael T Mader Andreas Osanger Klaus F X Mayer MIPS Institute for Bioinformatics GSF - National Research Center for Environment and Health 85758 Neuherberg Germany E-mail kmayergsfdeCorresponding author hellipSpliceDB httpwwwsoftberrycomspldbSpliceDBhtmlCanonical and non-canonical mammalian splice sites [122] 122Burset M Seledtsov IA Solovyev VV SpliceDB database of canonical and non-canonical mammalian splice sites Nucleic Acids Res 200129255-9 Nucleic Acids Research 2001 Vol 29 No 1 255-259 SpliceDB database of canonical and non-canonical mammalian splice sites M Burset I A Seledtsov1 and V V Solovyev The Sanger Centre Hinxton Cambridge CB10 1SA UK and 1Softberry Inc 108 Corporate Park Drive Suite 120 White Plains NY 10604 USA To whom correspondence should be addressed at present address EOS Biotechnology 225A Gateway Boulevard South San Francisco CA 94080 USA Tel +1 650 246 2331 Fax +1 650 583 3881 Email solovyeveosbiotechcom Present address M Burset Institut Municipal drsquoInvestigacioacute Megravedica (IMIM) CDr Aiguader 80 08003 Barcelona Spain

SCAN2 African Journal of Biotechnology Vol 2 (12) pp 714-718 December 2003 Available online at httpwwwacademicjournalsorgAJB ISSN 1684ndash5315 copy 2003 Academic Journal Accepted 14 November 2003 Minireview Web-based bioinformatic resources for protein and nucleic acids sequence alignment Kamel A Abd-Elsalam Molecular Markers Lab Plant Pathology Research Institute Agricultural Research Center Orman 12619 Giza Egypt E-mail kaabdelsalammsncom 16-SCAN2 program for aligning two multimegabyte-size sequences httpwwwsoftberrycomberryphtmltopic=scanhampprg= SCAN2 derived Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK The full-length sequences of gene pairs have been aligned by the SCAN2 program (httpsoftberrycomberryphtmltopic=scanhampprg=SCAN2) which can align

PDISORDER BMC Bioinformatics 2005 622 doi1011861471-2105-6-22 Research article Open Access Proteins with two SUMO-like domains in chromatin-associated complexes The RENi (Rad60-Esc2-NIP45) family Maria Novatchkova1 Andreas Bachmair3 Birgit Eisenhaber2 and Frank Eisenhaber2 Address 1Gregor Mendel-Institut GMI Austrian Academy of Sciences Vienna Biocenter A-1030 Vienna Austria 2Research Institute of Molecular Pathology Dr Bohr-Gasse 7 A-1030 Vienna Austria and 3Max Planck Institute for Plant Breeding Research Carl-von-Linneacute-Weg 10 D-50829 Cologne Germany Email Maria Novatchkova - marianovatchkovagmioeawacat Andreas Bachmair - bachmairmpiz-koelnmpgde Birgit Eisenhaber - b_eisenimpunivieacat Frank Eisenhaber - FrankEisenhaberimpunivieacat Corresponding author hellipInitial analysis of its sequence complexity shows that the disordered N-terminal half of the protein is followed by a likely globular segment (predicted using Pdisorder by Softberry Inc)hellip

SPL The National Academy of Sciences Proc Natl Acad Sci U S A 2003 November 25 100(Suppl 2) 14537ndash14542 doi 101073pnas2335847100 Molecular evolution of the insect chemoreceptor gene superfamily in Drosophila melanogaster Hugh M Robertsondagger Coral G WarrDaggersect and John R Carlsonsect Department of Entomology University of Illinois 505 South Goodwin Avenue Urbana IL 61801 DaggerSchool of Biological Sciences Monash University Clayton VIC 3800 Australia and sectDepartment of Molecular Cellular and Developmental Biology Yale University New Haven CT 06520 The genes were reconstructed manually in the PAUP editor (23) by using the expected exonintron structures as guides and the SPL program (Softberry wwwsoftberrycomberryphtml) to locate predicted introns

NSITE GENETIC ANALYSES OF BOVINE CARD15 A PUTATIVE DISEASE RESISTANCE GENE A Dissertation by KRISTEN HAWKINS TAYLOR Submitted to Texas AampM University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 2004 Major Subject Genetics hellipShort motifs identified as being conserved between the three species in these intronic regions as well as in the 5rsquoUTR and 3rsquoUTR were then analyzed using the TFSCAN (httpzeonwelloxacukgit-bintfscan) and NSITE (available through SoftBerry httpwwwsoftberrycomberryphtmltopic=promoter) programs to identify putative regulatory motifs Motifs selected for analysis required homology consisting of 6 or more bases with no more than 2 substitutions among the 3 specieshellip hellipSequence that included the SNPs located within intronic regions and in the 3rsquo and 5rsquoUTRs were analyzed using NSITE (available through SoftBerry at httpwwwsoftberrycomberryphtmltopic=promoter) to identify putative regulatory motifshellip

TSSP Bioinformatics 2005 21(14)3074-3081 doi101093bioinformaticsbti490 Cis-regulatory element based targeted gene finding genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana Weixiong Zhang 12 Jianhua Ruan 1 Tuan-hua David Ho 3 Youngsook You 3 Taotao Yu 1 and Ralph S Quatrano 3 1Department of Computer Science and Engineering Washington University in Saint Louis Saint Louis MO 63130 USA 2Department of Genetics Washington University in Saint Louis Saint Louis MO 63130 USA 3Department of Biology Washington University in Saint Louis Saint Louis MO 63130 USA To whom correspondence should be addressed

sites (TSSs) To predict TSSs we combined an Athaliana cDNA database and a software TSSP (SoftBerry httpwwwsoftberrycom) As

PLANTPROM BMC Genomics 2005 6 25 Genome wide analysis of Arabidopsis core promoters Carlos Molina12 and Erich Grotewold 11Department of Plant Cellular and Molecular Biology and Plant Biotechnology Center The Ohio State University Columbus OH 43210 2Departamento de Informaacutetica Universidad Teacutecnica Federico Santa Mariacutea Valparaiacuteso Chile search for TATA elements is carried out on the 12749 [-500 -1] regions 6316 sequences (using the MEME NFM) or 8776 (using the expanded PlantProm NFM) are Nucleic Acids Research 2005 Vol 33 No 3 1069ndash1076 doi101093nargki247 Plant promoter prediction with confidence estimation I A Shahmuradov1 V V Solovyev12 and A J Gammerman1 1Royal Holloway University of London Egham Surrey TW20 0EX UK and 2Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA MATERIALS AND METHODS Training and testing sequences For training and testing procedures we used 301 promoters with annotated TSS from PlantProm DB (22) Nucleic Acids Research 2004 Vol 32 Database issue D368plusmnD372 DOI 101093nargkh017 AthaMap an online resource for in silico transcription factor binding sites in the Arabidopsis thaliana genome Nils Ole Steffens Claudia Galuschka Martin Schindler Lorenz BuEgravelow and Reinhard Hehl Institut fuEgrave r Genetik Technische UniversitaEgrave t Braunschweig Spielmannstraucirce 7 D-38106 Braunschweig Germany hellipShahmuradovIA GammermanAJ HancockJM BramleyPM and SolovyevVV (2003) PlantProm a database of plant promoter sequences Plant Physiology October 2004 Vol 136 pp 3023-3033 GENOME ANALYSIS Utility of Different Gene Enrichment Approaches Toward Identifying and Sequencing the Maize Gene Space1[w] Nathan Michael Springer Xiequn Xu and W Brad Barbazuk Center for Plant and Microbial Genomics Department of Plant Biology University of Minnesota St Paul Minnesota 55108 (NMS) and Donald Danforth Plant Sciences Center St Louis Missouri 63132 (XX WBB) Text] Shahmuradov IA Gammerman AJ Hancock JM Bramley PM Solovyev VV (2003) PlantProm a database of plant promoter sequences

Plant Physiology April 2004 Vol 134 pp 1ndash12 wwwplantphysiolorg Characterization of Three Functional High-Affinity Ammonium Transporters in Lotus japonicus with Differential Transcriptional Regulation and Spatial Expression1

Enrica DrsquoApuzzo2 Alessandra Rogato2 Ulrike Simon-Rosin Hicham El Alaoui3 Ani Barbulova Marco Betti Maria Dimou Panagiotis Katinakis Antonio Marquez Anne-Marie Marini Michael K Udvardi and Maurizio Chiurazzi Institute of Genetics and Biophysics Via Marconi 12 80125 Napoli Italy (ED AR HEA AB MC) Molecular Plant Nutrition Group Max Planck Institute of Molecular Plant Physiology Am Muhlenberg 1 14476 Golm Germany (US-R MKU) Universiteacute Libre de Bruxelles Institut de Biologie de Meacutedecine Moleacuteculaires 6041 Gosselies Belgium (A-MM) Agricultural University of Athens Department of Agricultural Biotechnology 11855 Athens Greece (MD PK) and Departamento de Bioquıacutemica Vegetal y Biologıacutea Molecular Facultad de Quimica 41080 Seville Spain (MB AM) hellipPlantProm a database of plant promoter sequences Nucleic Acids Res 31 114ndash117 Nucleic Acids Research 1 January 2003 vol 31 no 1 pp 114-117(4) PlantProm a database of plant promoter sequences Authors Bramley PM1 Solovyev VV2 Shahmuradov IA Gammerman AJ Hancock JM Affiliations Department of Computer Science Royal Holloway University of London Egham Surrey TW20 0EX UK 1 School of Biological Sciences Royal Holloway University of London UK 2 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 3 To whom correspondence should be addressed Email victorsoftberrycom Present address John M Hancock MRC Mammalian Genetics Unit Harwell Oxfordshire UK

PROMH Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK Received February 15 2003 Revised and Accepted March 21 2003

Other FGENESH Molecular Microbiology Volume 52 Issue 6 Page 1579 - June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Affiliations Departments of Medicine and Pathobiology University of Washington Harborview Medical Center Box 359779 325 Ninth Ave Seattle WA 98104 USA E-mail acenturuwashingtonedu Tel (+1) 206 341 5364 Fax (+1) 206 341 5363 Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Microbiology 150 (2004) 518-520 DOI 101099mic026871-0 IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated with recognizable overlapping genes Mark W Silby1 Paul B Rainey23 and Stuart B Levy14 1 Center for Adaptation Genetics and Drug Resistance Department of Molecular Biology and Microbiology Tufts University School of Medicine Boston MA 02111 USA 2 Department of Plant Sciences University of Oxford South Parks Road Oxford OX1 3RB UK 3 School of Biological Sciences University of Auckland Private Bag 92019 Auckland New Zealand 4 Department of Medicine Tufts University School of Medicine Boston MA 02111 USA Correspondence Stuart B Levy (stuartlevytuftsedu) hellipUsing SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of the iiv5 ORF respectivelyhellip Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 DOI 101128JVI782212576-125902004

Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song1 Qi Wei Qin2 Jin Qiu1 Can Hua Huang1 Fan Wang1 and Choy Leong Hew1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2 Corresponding author Mailing address Department of Biological Sciences National University of Singapore 10 Kent Ridge Crescent Singapore 119260 Singapore Phone 65-68742692 Fax 65-67795671 E-mail dbshewclnusedusg or dbsheadnusedusg Received 19 March 2004 Accepted 29 June 2004

hellipThe whole genome was also submitted to httpwwwsoftberrycom (Softberry Inc Mount Kisco NY) for identification of all potential ORFshellip hellipCoding capacity of the viral genomic DNA sequence Prediction of presumptive genes was carried out by using the viral gene prediction program under the website httpwwwsoftberrycom supplemented with Vector NTI suite 71hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1695-8 Issue Volume 109 Number 4 Date August 2004 Pages 681 - 689

Linkage disequilibrium and sequence diversity in a 500-kbp region around the adh1 locus in elite maize germplasm Mark Jung1 Ada Ching1 Dinakar Bhattramakki2 Maureen Dolan1 Scott Tingey1 Michele Morgante1 3 and Antoni Rafalski1

(1) DuPont Crop Genetics Experimental Station PO Box 80353 Wilmington DE 19880-0353 USA (2) Pioneer Hi-Bred International Inc 7300 NW 62nd Avenue PO Box 1004 Johnston IA 50131-1004 USA (3) Dipartimento di Produzione Vegetale e Tecnologie Agrarie Universita di Udine Via delle Scienze 208 33100 Udine ItalyReceived 26 January 2004 Accepted 2 April 2004 Published online 6 August 2004 Mark Jung Email marktjungcgrdupontcom 1) Gene locations were defined by several methods Annotations provided in Tikhonov et al (1999) were first used then FGENESH gene-finding software DNA Sequence - The Journal of Sequencing and Mapping Issue Volume 15 Number 4 August 2004 Pages 269 ndash 276 DOI 10108010425170412331279648 Isolation Characterization and Expression Analysis of a Leaf-specific Phosphoenolpyruvate Carboxylase Gene in Oryza sativa Chang-Fa Lin A1 Chun Wei A1 Li-Zhi Jiang A1 A2 Ke-Gui Li A1 Xiao-Yin Qian A1 Kotb Attia A1 Jin-Shui Yang A1 A1 State Key Laboratory of Genetic Engineering Institute of Genetics School of Life Sciences Fudan University Shanghai 200433 PRChina A2 National Key Laboratory for Soil Erosion and Dry Land Farming on Loess Plateau Northwest Sci-Tech University of Agriculture and Forest Shannxi 712100 PRChina tools of GeneMark (httpopal biologygatechedugeneMark) and Softberry (httpwwwsoftberrycom) For the isolation of putative Plant Molecular Biology DOI 101023BPLAN00000382568980957 Issue Volume 54 Number 4 Date March 2004 Pages 519 ndash 532

Genome-Wide Analysis of the GRAS Gene Family in Rice and Arabidopsis Chaoguang Tian1 Ping Wan1 Shouhong Sun1 Jiayang Li1 and Mingsheng Chen1

(1) Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road Chaoyang District Beijing 100101 ChinaMingsheng Chen Email mschengeneticsaccn

database FgeneSH (Salamov and 90 Solovyev 2000) was used for gene prediction pre- 207 dicted by FgeneSH (minor discrepancies exist due 208 Mycological Research (2004) 108 853-857 Cambridge University Press doi 101017S095375620400067X Published Online 13 авг 2004 Combining transcriptome data with genomic and cDNA sequence alignments to make confident functional assignments for Aspergillus nidulans genes Andrew H SIMS a1 Manda E GENT a1 Geoffrey D ROBSON a1 Nigel S DUNN-COLEMAN a2 and Stephen G OLIVER a1c1 a1 School of Biological Sciences University of Manchester The Michael Smith Building Oxford Road Manchester M13 9PT UK E-mail steveolivermanacuk a2 Genencor International Inc 925 Page Mill Road Palo Alto CA 94304 USA c1 Corresponding author Kingdom Page 2 Genewise FgeneSH FgeneSH+) consisting of 9541 putative open reading frames (ORFs) was released in June 2003 We TAG Theoretical and Applied Genetics DOI 101007s00122-004-1603-2 Issue Volume 109 Number 1 Date June 2004 Pages 129 ndash 139

Gene content and density in banana (Musa acuminata) as revealed by genomic sequencing of BAC clones R Aert1 2 L Saacutegi2 and G Volckaert1

(1) Laboratory of Gene Technology Katholieke Universiteit Leuven Kasteelpark Arenberg 21 3001 Leuven Belgium

Present address Laboratory of Tropical Crop Improvement Katholieke Universiteit Leuven (2) Kasteelpark Arenberg 13 3001 Leuven Belgium

R Aert Email RitaAertagrkuleuvenacbe

gscrikengojp) fgenesh version 11 (Salamov and Solovyev 2000 httpwwwsoftberry com) genemarkhmm version 22a (Lukashin and Borodovsky 1998 http Genome Research 142503-2509 2004 Resources EAnnot A genome annotation tool using experimental evidence Li Ding1 Aniko Sabo Nicolas Berkowicz Rekha R Meyer Yoram Shotland Mark R Johnson Kymberlie H Pepin Richard K Wilson and John Spieth Genome Sequencing Center Washington University School of Medicine St Louis Missouri 63110 USA Article and publication are at httpwwwgenomeorgcgidoi101101gr3152604 1 Corresponding author E-mail ldingwatsonwustledu fax (314) 286-1810 Some ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov and Solovyev 2000 ) are based on intrinsic characteristics of coding hellipSome ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov

and Solovyev 2000 ) are based on intrinsic characteristics of coding sequence (eg codon usage consensus splice sites etc) and require training on known genes from the organismhellip

hellipTo further evaluate the performance of EAnnot we compared EAnnot predictions with Ensembl Genscan and Fgenesh predictions using manual annotation as a standard While Genscan and Fgenesh are ab initio programs Ensembl takes into account experimental data a feature shared with EAnnot Ensembl predicted 1037 known genes with 1798 transcripts and 1457 EST genes with 2308 transcripts for chromosome 6 (build 31) while Fgenesh and Genscan predicted 6230 and 6225 genes respectively We evaluated the performance of each program with respect to splice sites transcripts and genes across all of chromosome 6hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1758-x Issue Volume 109 Number 7 Date November 2004 Pages 1434 - 1447

Full-genome analysis of resistance gene homologues in rice

B Monosi1 R J Wisser2 L Pennill1 and S H Hulbert1

(1) Department of Plant Pathology Kansas State University Manhattan KS 66506-5502 USA (2) Department of Plant Pathology Cornell University Ithaca NY 14853 USAReceived 18 February 2004 Accepted 16 June 2004 Published online 10 August 2004 S H Hulbert Email shulbrtksuedu DNA sequences were analyzed using the gene prediction programs GENSCAN (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (Salamov and arXivq-bioGN0402046 v1 27 Feb 2004 Sublinear growth of Information in DNA sequences Giulia Menconi Dipartimento di Matematica Applicata and CISSC Centro Interdisciplinare per lo Studio dei Sistemi Complessi Universit`a di Pisa Via Bonanno Pisano 25b 56126 PISA - Italy menconimaildmunipiit October 23 2003 hellipAs a result four putative genes G1 G2 G3 and G4 have been located by means of Hidden Markov Model-based program FGENESH2 that has been created for predicting multiple genes and their structure in genomic DNA sequences The analysis via FGENESH has been exploited with respect to known genes in Arabidopsis thaliana Their predicted position is illustrated in Figure 13 hellip2This program is available at the website wwwsoftberrycom to which we refer con-cerning the reliability and e_ciency of the algorithmhellip Current Opinion in Plant Biology 2004 7732ndash736 DOI 101016jpbi200409003 Consistent over-estimation of gene number in complex plant genomes Jeffrey L Bennetzen14 Craig Coleman27 Renyi Liu15 Jianxin Ma16 and Wusirika Ramakrishna38 1 Department of Genetics University of Georgia Athens Georgia 30602 USA 2 Department of Plant and Animal Sciences Brigham Young University Provo Utah 84602 USA 3 Department of Biological Sciences Michigan Tech University Houghton Michigan 49931 USA

4e-mail maizeugaedu 5e-mail lryugaedu 6e-mail jmaugaedu 7e-mail biotechnologybyuedu 8e-mail wusirikamtuedu hellipWe have found that the standard gene-discovery programs FGENESH GeneMark and GENSCAN annotate segments of most retrotransposons and many invertedrepeat transposable elements as genes Using FGENESH to annotate maize BAC clones for instance 70ndash100 of the predicted genes are actually from transposable elementshellip The Plant Cell 162795-2808 (2004) Spotted leaf11 a Negative Regulator of Plant Cell Death and Defense Encodes a U-BoxArmadillo Repeat Protein Endowed with E3 Ubiquitin Ligase Activity Li-Rong Zenga Shaohong Qua Alicia Bordeosb Chengwei Yangc Marietta Baraoidanb Hongyan Yanc Qi Xiec Baek Hie Nahmd Hei Leungb and Guo-Liang Wanga1 a Department of Plant Pathology Ohio State University Columbus Ohio 43210 b International Rice Research Institute Metro Manila Philippines c State Key Lab for Biocontrol Sun Yat-sen (Zhongshan) University Guangzhou China 510275 d Department of Biological Science Myongji University Kyonggido Korea 449728 1 To whom correspondence should be addressed E-mail wang620osuedu fax 614-292-4455 in spl11 Exons predicted in G3 by the programs GENSCAN and Fgenesh using different matrixes are displayed in dark gray (D) RFLP Source Human Genomics Volume 1 Number 2 January 2004 pp 146-149(4) Publisher Henry Stewart Publications The truth about mouse human worms and yeast Authors David R Nelson1 Daniel W Nebert2 1 Department of Molecular Sciences and The UT Center of Excellence in Genomics and Bioinformatics University of Tennessee Memphis Tennessee 38163 USA 2 Department of Environmental Health and Center for Environmental Genetics (CEG) University of Cincinnati Medical Center Cincinnati Ohio 45267-0056 USA unpublished data 2003 see also Ref [7]) FGENESH 21 TWINSCAN 22 and the Ensembl annotation pipeline 23 The output of the four Genome Biology 2004 5R73 doi101186gb-2004-5-10-r73 A comprehensive transcript index of the human genome generated using microarrays and computational approaches Eric E Schadt 1 Stephen W Edwards 1 Debraj GuhaThakurta1 Dan Holder2 Lisa Ying2 Vladimir Svetnik2 Amy Leonardson1 Kyle W Hart3 Archie Russell1 Guoya Li1 Guy Cavet1 John Castle1 Paul McDonagh4 Zhengyan Kan1 Ronghua Chen1 Andrew Kasarskis1 Mihai Margarint1 Ramon M Caceres1 Jason M Johnson1

Christopher D Armour1 Philip W Garrett-Engele1 Nicholas F Tsinoremas5 and Daniel D Shoemaker1

1Rosetta Inpharmatics LLC 12040 115th Avenue NE Kirkland WA 98034 USA 2Merck Research Laboratories W42-213 Sumneytown Pike POB 4 Westpoint PA 19846 USA 3Rally Scientific 41 Fayette Street Suite 1 Watertown MA 02472 USA 4Amgen Inc 1201 Amgen Court W Seattle WA 98119 USA 5The Scripps Research Institute Jupiter FL 33458 USA hellipGrailEXP 40 [47] GENSCAN 10 [48] FGENESH [49] and FGENESH+ [49]ab initio gene-prediction algorithms were run independently across the entire genome assembly to augment alignment-based gene identification methods GrailEXP 40 GENSCAN 10 and FGENESH version 1c were run with default parameters for human sequence GrailEXP used expressed sequence evidence from RefSeq UniGene and DoubleTwist HGI to refine gene predictions FGENESH+ was run with protein sequences from BLASTX with E-score lower than 10-5 When multiple protein alignments overlapped all overlapping protein sequences were clustered with BLASTClust [50] and the lowest E-score hit was used by FGENESH+hellip hellipAdditionally 3 sim4 and 3 or 5 FGENESH+ predicted exons that were short andor distant from internal predicted exons were removedhellip Genome Research 14988-995 2004 ENSEMBL Special GeneWise and Genomewise Ewan Birney13 Michele Clamp2 and Richard Durbin2 1 The European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK hellipThere has been a long history of successful ab initio programs which do not use any additional

evidence to predict genes on genomic DNA of which Genscan (Burge and Karlin 1997 ) and Fgenesh (Solovyev and Salamov 1997 ) are two of the most successful caseshellip hellipAnother class of evidence-based gene prediction programs are ones which use external evidence to influence the scoring of potential exons including SGP-2 (Parra et al 2003 ) Genie

(Kulp et al 1996 ) Genomescan (Yeh et al 2001 ) HMMGene (Krogh 2000 ) and Fgenesh++ (Solovyev and Salamov 1997 )hellip Published online before print February 5 2004 101073pnas0308430100 PNAS | February 17 2004 | vol 101 | no 7 | 1910-1915 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin Nam Joonyul Kim para Shinyoung Lee Gynheung An Hong Ma and Masatoshi Nei Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State

University University Park PA 16802 and National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Contributed by Masatoshi Nei December 22 2003

hellipBecause annotation of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program FGENESH (wwwsoftberrycom) from the genome sequences obtained from TIGR and the Rice Genome Database (China) (25) Functional amp Integrative Genomics DOI 101007s10142-004-0109-y Issue Volume 4 Number 2 Date May 2004 Pages 102 - 117

Sequence analysis of the long arm of rice chromosome 11 for ricendashwheat synteny

Nagendra K Singh1 Saurabh Raghuvanshi2 Subodh K Srivastava1 Anupama Gaur2 Ajit K Pal1 Vivek Dalal1 Archana Singh1 Irfan A Ghazi1 Ashutosh Bhargav1 Mahavir Yadav1 Anupam Dixit1 Kamlesh Batra1 Kishor Gaikwad1 Tilak R Sharma1 Amitabh Mohanty2 Arvind K Bharti2 Anita Kapur2 Vikrant Gupta2 Dibyendu Kumar2 Shubha Vij2 Ravi Vydianathan2 Parul Khurana2 Sulabha Sharma2 W Richard McCombie3 Joachim Messing4 Rod Wing5 Takuji Sasaki6 Paramjit Khurana2 Trilochan Mohapatra1 Jitendra P Khurana2 and Akhilesh K Tyagi2

1 Indian Initiative for Rice Genome Sequencing National Research Centre on Plant Biotechnology

Indian Agricultural Research Institute New Delhi 110012 India 2 Indian Initiative for Rice Genome Sequencing Department of Plant Molecular Biology University of

Delhi South Campus New Delhi 110021 India 3 Genome Research Centre Cold Spring Harbor Laboratory 1 Bungtown Road New York USA 4 The Plant Genome Initiative at RutgersndashWaksman Institute Rutgers University 190 Frelinghuysen

Road Piscataway NJ 08873 USA 5 Department of Plant Sciences 303 Forbes Building Arizona Genomics Institute The University of

Arizona Tucson AZ 85721 USA 6 Department of Genome Research National Institute of Agrobiological Sciences 1-2 Kannondai 2-

chome Tsukuba Ibaraki 305-8602 Japan Akhilesh K Tyagi Email akhileshgenomeindiaorg Wherever RiceGAAS data were not available the genes were predicted by FGENESH trained for monocot plant species (http wwwsoftberrycomberryphtml) TAG Theoretical and Applied Genetics DOI 101007s00122-004-1621-0 Issue Volume 109 Number 1 Date June 2004 Pages 10 - 22

Annotation of a 95-kb Populus deltoides genomic sequence reveals a disease resistance gene cluster and novel class I and class II transposable elements

M Lescot1 4 S Rombauts1 J Zhang1 S Aubourg1 5 C Matheacute1 6 S Jansson2 P Rouzeacute1 3 and W Boerjan1

1 Department of Plant Systems Biology Flanders Interuniversity Institute for Biotechnology Ghent University Technologiepark 927 9052 Gent Belgium

2 Department of Plant Physiology University of Umearing 901 87 Umearing Sweden 3 Laboratoire Associeacute de l Institut National de la Recherche Agronomique (France) Ghent

University 9052 Gent Belgium 4 Present address CIRAD-Biotrop TA4003 34398 Montpellier Cedex 5 France 5 Pre sent address Uniteacute de Recherche en Geacutenomique Veacutegeacutetale INRA 91057 Evry Cedex

France 6 Present address Laboratoire de Biologie Vasculaire Institut de Pharmacologie et Biologie

Structurale 205 route de Narbonne 31077 Toulouse Cedex France

P Rouzeacute Email pierrerouzepsbugentbe 1999 httpwwwtigrorgtdbglimmermglmr_formhtml) and FgenesH for dicots or monocots (Salamov and Solovyev 2000 httpwwwsoftberrycom) BIOINFORMATICS 2004 vol20 N9 p1416-1427 J Yuan B Bush A Elbrecht Y Liu T Zhang W Zhao hellip - suchasGRAIL(Lopezetal 1994 Roberts 1991 Uberbacher et al 1996) GENESCOPE (Murakami and Takagi 1998) fgenesh (Salamov and Solovyev 2000) GeneMark Molecular Plant Pathology Volume 5 Issue 6 Page 515 - November 2004 Pathogen profile Heading for disaster Fusarium graminearum on cereal crops RUBELLA S GOSWAMI AND H CORBY KISTLERCorrespondence E-mail HCKISTumnedu This pipeline uses a combination of the programs FGENESH and FGENESH+ (Salamov and Solovyev 2000) modified by Softberry ( http wwwsoftberrycom ) with Nucleic Acids Research 2004 Vol 32 Database issue D41-D44 MIPS analysis and annotation of proteins from whole genomes H W Mewes12 C Amid1 R Arnold1 D Frishman2 U Guumlldener1 G Mannhaupt2 M Muumlnsterkoumltter1 P Pagel1 N Strack2 V Stuumlmpflen1 J Warfsmann1 and A Ruepp1 1 Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1 D-85764 Neuherberg Germany and 2 Technische Universitaumlt Muumlnchen Chair of Genome Oriented Bioinformatics Center of Life and Food Science D-85350 Freising-Weihenstephan Germany To whom correspondence should be addressed at Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 D-85764 Neuherberg Germany Tel +49 89 3187 3580 Fax +49 89 3187 3585 Email wmewesgsfde The genome of 40 Mb encodes 10 000 proteins automatically predicted by the program FGENESH (httpsoftberry com) specifically trained for Neurospora Annual Review of Genomics and Human Genetics Vol 5 15-56 (Volume publication date September 2004) COMPARATIVE GENOMICS Webb Miller Kateryna D Makova Anton Nekrutenko and Ross C Hardison The Center for Comparative Genomics and Bioinformatics The Huck Institutes of Life Sciences and the Departments of Biology Computer Science and Engineering and Biochemistry and Molecular Biology Pennsylvania State University University Park Pennsylvania email webbbxpsuedu kdm16psuedu antonbxpsuedu rch8psuedu These algorithms include Genscan the most popular gene prediction tool (24) GenMark (117) Fgenesh (155) GeneID (144) and others (for an excellent overview DNA and Cell Biology May 2004 Vol 23 No 5 311-324 Harbinger Transposons and an Ancient HARBI1 Gene Derived from a Transposase

Vladimir V Kapitonov Genetic Information Research Institute Mountain View California Jerzy Jurka Genetic Information Research Institute Mountain View California We used FGENESH (Salamov and Solovyev 2000) and GeneScan (Burge and Karlin 1997) for the identification of exons and introns The d N d S ratio Proc Natl Acad Sci U S A 2004 February 17 101(7) 1910ndash1915 Published online 2004 February 5 doi 101073pnas0308430100 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin NamdaggerDagger Joonyul Kimsectpara Shinyoung Leesect Gynheung Ansect Hong Madagger and Masatoshi Neidagger

daggerInstitute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University University Park PA 16802 and sectNational Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Dagger To whom correspondence should be addressed E-mail jyn101psuedu paraPresent address Michigan State UniversityndashDepartment of Energy Plant Research Laboratory and Department of Biochemistry and Molecular Biology Michigan State University East Lansing MI 48824 of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program fgenesh (wwwsoftberrycom Nucleic Acids Research 2004 Vol 32 Database issue D377-D382 BGI-RIS an integrated information resource and comparative analysis workbench for rice genomics Wenming Zhao1 Jing Wang2 Ximiao He1 Xiaobing Huang1 Yongzhi Jiao1 Mingtao Dai1 Shulin Wei1 Jian Fu1 Ye Chen1 Xiaoyu Ren1 Yong Zhang12 Peixiang Ni1 Jianguo Zhang1 Songgang Li12 Jian Wang1 Gane Ka-Shu Wong13 Hongyu Zhao4 Jun Yu1 Huanming Yang1 and Jun Wang1 1 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3 University of Washington Genome Center Department of Medicine Seattle WA 98195 USA and 4 Yale University School of Medicine Department of Epidemiology and Public Health New Haven CT 06520-8034 USA To whom correspondence should be addressed Tel +86 10 80481662 Fax +86 10 80498676 Email wangjgenomicsorgcn Correspondence may also be addressed to Huanming Yang Tel +86 10 80494969 Fax +86 10 80491181 Email yanghmgenomicsorgcn The authors wish it to be known that in their opinion the first four authors should be regarded as joint First Authors The contig sequences were annotated for gene content by using automated processes that involve ab initio gene finders such as FgeneSH (httpwwwsoftberrycom Genome Research 141932-1937 2004 Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome

Jinsheng Lai1 Nrisingha Dey2 Cheol-Soo Kim35 Arvind K Bharti1 Stephen Rudd46 Klaus FX Mayer4 Brian A Larkins3 Philip Becraft2 and Joachim Messing17 1 Waksman Institute Rutgers The State University of New Jersey Piscataway New Jersey 08854 USA 2 Department of Genetics Development amp Cell Biology Iowa State University Ames Iowa 50011 USA 3 Department of Plant Science University of Arizona Tucson Arizona 85721 USA 4 Munich Information Center for Protein Sequences Institute for Bioinformatics GSF Research Center for Environment and Health Neuherberg Germany A total of 54397 putative genes could be predicted for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for Plant Molecular Biology DOI 101023BPLAN000002876821587dc Issue Volume 54 Number 1 Date January 2004 Pages 55 - 69

Dynamics of the evolution of orthologous and paralogous portions of a complex locus region in two genomes of allopolyploid wheat

Xiu-Ying Kong1 2 Yong Qiang Gu3 Frank M You4 Jorge Dubcovsky4 and Olin D Anderson3

1 Genetic Resources Conservation Program University of California Davis CA 95616 USA 2 Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081

China 3 US Dept of Agriculture Western Regional Research Center Agricultural Research Service 800

Buchanan Street Albany CA 94710 USA 4 Department of Agronomy and Range Sciences University of California Davis CA 95616 USA FGENESH (httpwwwsoftberrycomberryphtml) and GENES- CAN (httpgenemark miteduGENESCANhtml) were used for gene prediction Current Genetics DOI 101007s00294-003-0451-y Issue Volume 44 Number 6 Date January 2004 Pages 329 - 338

Chromosome rearrangements in isolates that escape from het-c heterokaryon incompatibility in Neurospora crassa

Qijun Xiang1 and N Louise Glass1

Department of Plant and Microbial Biology University of California Berkeley CA 94720-3102 USA Hypothetical proteins are predicted from FGENESH calls with overlapping Blastx hits (but not with trusted homology) while Predicted Molecular Genetics and Genomics DOI 101007s00438-004-0990-z Issue Volume 271 Number 4 Date May 2004 Pages 402 - 415

Genome-wide identification of NBS genes in japonica rice reveals significant expansion of divergent non-TIR NBS-LRR genes

T Zhou1 Y Wang1 J-Q Chen1 H Araki2 Z Jing1 K Jiang1 J Shen1 and D Tian1

1 State Key Laboratory of Pharmaceutical Biotechnology Department of Biology Nanjing University 210093 Nanjing China

2 Department of Ecology and Evolution University of Chicago Chicago IL 60637 USA to 5000ndash10000 bp from both ends of the hits and then the expanded nucleotide fragments were reannotated using the gene-finding programs FGENESH (http www Proc Natl Acad Sci U S A 2004 June 15 101(24) 9045ndash9050 Genetics

Genetic control of branching in foxtail millet Andrew N Doust Katrien M DevosDaggerdagger Michael D Gadberrysect Mike D GaleDagger and Elizabeth A KelloggDepartment of Biology University of Missouri 8001 Natural Bridge Road St Louis MO 63121 and DaggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH United Kingdom dagger To whom correspondence should be addressed E-mail adoustumsledu sectPresent address Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building University of Georgia Athens GA 30602 Communicated by Peter H Raven Missouri Botanical Garden St Louis MO April 23 2004 Each of these contigs was scanned by using fgenesh (28) and identified ORFs were translated and compared with ORFs from other contigs from the same QTL region Mol Biol Evol 21(9)1769-1780 2004 Merlin a New Superfamily of DNA Transposons Identified in Diverse Animal Genomes and Related to Bacterial IS1016 Insertion Sequences Ceacutedric Feschotte1 Departments of Plant Biology and Genetics The University of Georgia Athens Correspondence E-mail cedricplantbiougaedu coding sequences were assembled by removing introns predicted with more than 85 confidence by NetGene2 (httpwwwcbsdtudk) andor FGENESH (httpgenomic Genome Research 141924ndash1931 copy2004 Gene Loss and Movement in the Maize Genome Jinsheng Lai1 Jianxin Ma23 Zuzana Swigonˇovб1 Wusirika Ramakrishna24

Eric Linton15 Victor Llaca16 Bahattin Tanyolac17 Yong-Jin Park28 O-Young Jeong29

Jeffrey L Bennetzen23 and Joachim Messing110 1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854-8020 USA 2Department of Biological Sciences Purdue University West Lafayette Indiana 47907-1392 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602-7223 USA 4Department of Biological Sciences 740 DOW Michigan Tech University Houghton MI 49931 USA 5Plant Biology Labs Michigan State University East Lansing MI 48824 USA 6Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880-0353 USA 7Department of Bioengineering Ege University Izmir 35100 Turkey 8National Institute of Agricultural Biotechnology Suwon 441-707 Republic of Korea 9National Institute of Crop Science Suwon 441-857 Republic of Korea 10Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 The FGENESH program predicted four of which three (gene 1d in the maize orp1 region gene 5a 5b in the rice r1 region) would produce truncated proteins

Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Molecular Microbiology Volume 54 Issue 2 Page 407 - October 2004 doi101111j1365-2958200404310x Cryptococcus neoformans Kin1 protein kinase homologue identified through a Caenorhabditis elegans screen promotes virulence in mammals Eleftherios Mylonakis1 Alexander Idnurm2 Roberto Moreno1 Joseph El Khoury134 James B Rottman5 Frederick M Ausubel67 Joseph Heitman28910 and Stephen B Calderwood1111Division of Infectious Diseases Massachusetts General Hospital Boston MA 02114 USA 2Department of Molecular Genetics and Microbiology Duke University Medical Center Durham NC 27710 USA 3Center for Immunology and Inflammatory Diseases and 4Division of Rheumatology Allergy and Immunology Massachusetts General Hospital Boston MA 02114 USA 5Archemix Corporation Cambridge MA 02139 USA 6Department of Molecular Biology Massachusetts General Hospital Boston MA 02114 USA 7Department of Genetics Harvard Medical School Boston MA 02115 USA 8Division of Infectious Diseases 9Department of Medicine and 10Howard Hughes Medical Institute Duke University Medical Center Durham NC 27710 USA 11Department of Microbiology and Molecular Genetics Harvard Medical School Boston MA 02115 USA E-mail scalderwoodpartnersorg

Sequences were compared with the H99 genome database at Duke University and genes predicted in these regions by FGENESH software ( http wwwsoftberrycom TAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Original Paper Characterization of soybean genomic features by analysis of its expressed sequence tags Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

1 Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

2 Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China Jin-Song Zhang Email jszhanggeneticsaccnShou-Yi Chen Email sychengeneticsaccn six BAC-contig sequences of M truncatula were analyzed and the results based on the gene prediction program FGENSH (Arabidopsis matchFGENESH prediction) (http Current Proteomics Volume 1 Number 1 January 2004 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE11 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Chinese Science Bulletin 2004 Vol 49 No 4 355-362 The VER2 promoter contains repeated sequences and requires vernalization for its activity in winter wheat (Triticum aestivum L) XU Wenzhong

1 WANG Xin

1 FENG Qi

2 ZHANG Lei

2 LIU Yaoguang

3 HAN Bin

2 CHONG

Kang1 XU Zhihong

1 amp TAN Kehui

1

1 Research Center for Molecular Developmental Biology Key Lab of Photosynthesis and Environmental Molecular Physiology Institute of Botany Chinese Academy of Sciences (CAS) Beijing 100093 China 2 National Center for Gene Research CAS Shanghai 200233 China 3 Genetic Engineering Laboratory College of Life Sciences South China Agricultural University Guangzhou 510642 China Correspondence should be addressed to Chong Kang (e-mail mailtochongknsibcasaccn) Sequence analyses were finished using biological softwares on Internet such as FGENESH 10 (Prediction of potential genes in Plant (Dct) genomic DNA)

TAG Theoretical and Applied Genetics DOI 101007s00122-003-1457-z Issue Volume 108 Number 3 Date February 2004 Pages 392 - 400

Sequence variations of simple sequence repeats on chromosome-4 in two subspecies of the Asian cultivated rice Can Li1 Yu Zhang1 Kai Ying1 Xiaolei Liang1 and Bin Han1

(1) National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China To characterize the possible relationship between SSRs and genes predicted by using FGENESH we investigated the distribution of SSRs in the rice chromosome-4 Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene TAG Theoretical and Applied Genetics DOI 101007s00122-004-1591-2 Issue Volume 108 Number 8 Date May 2004 Pages 1449 - 1457

Positional cloning of the rice Rf-1 gene a restorer of BT-type cytoplasmic male sterility that encodes a mitochondria-targeting PPR protein

H Akagi1 A Nakamura2 Y Yokozeki-Misono2 A Inagaki2 4 H Takahashi1 K Mori1 and T Fujimura3

1 Laboratory of Plant Breeding and Genetics Department of Biological Production Faculty of Bioresource Sciences Akita Prefectural University Kaidoubata-Nishi 241-7 Shimoshinjyo-Nakano 010-0195 Akita Japan

2 Biochemical Technology Section Life Science Laboratory Performance Materials RampD Center Mitsui Chemicals Togo 1144 297-0017 Mobara Japan

3 Institute of Agricultural and Forest Engineering University of Tsukuba Tennoudai 1-1-1 Tsukuba 305-8572 Ibaraki Japan

4 Present address Department of Synthetic Chemistry and Biological Chemistry Graduate School of Engineering Kyoto University Yoshida-Honmachi Sakyo-ku 606-8501 Kyoto japan

H Akagi Email akagiakita-puacjp Software Develop- ment Tokyo) Genomic sequences were also analyzed using gene prediction programs genescan and fgenesh Table 1 DNA Genome Research 14942-950 2004

The Ensembl Automatic Gene Annotation System Val Curwen1 Eduardo Eyras1 T Daniel Andrews1 Laura Clarke1 Emmanuel Mongin2 Steven MJ Searle1 and Michele Clamp34 1 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 EMBL European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SD UK 3 The Broad Institute Cambridge Massachusetts 02141 USA Commonly we use Genscan for ab initio prediction in human mouse and rat but the system is equally applicable to other methods such as FgenesH (Solovyev et Gene 324 (2004) 105ndash115 Transcript abundance of rml1 encoding a putative GT1-like factor in rice is up-regulated by Magnaporthe grisea and down-regulated by light Rong Wang ab Guofan Hongab Bin Hana a

National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China b

Shanghai Institute of Biochemistry and Cell Biology Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 320 Yueyang Road Shanghai 200031 China 1A and 2A) The structure of rml1 _ a is as same as that predicted by FGENESH software The 3V UTR of rml1 _ a is confirmed with the length of 596 bp Genome Biology 2004 5R46 doi101186gb-2004-5-7-r46 Identification of conserved gene structures and carboxy-terminal motifs in the Myb gene family of Arabidopsis and Oryza sativa L ssp Indica Cizhong Jiang1 Xun Gu1 2 and Thomas Peterson1 1Department of Genetics Development and Cell Biology and Department of Agronomy Iowa State University Ames IA 50011 USA 2LHB Center for Bioinformatics and Biological Statistics Iowa State University Ames IA 50011 USA FGeneSH has been used successfully to predict genes in rice [9] and GenScan was used together with it to predict genes by taking rice genomic sequences as Molecular Microbiology 53 (5) 1307-1318 - September 2004 doi 101111 j1365-2958200404215x The sirodesmin biosynthetic gene cluster of the plant pathogenic fungus Leptosphaeria maculans Donald M Gardiner1 Anton J Cozijnsen1 Leanne M Wilson1 M Soledade C Pedras2 and Barbara J Howlett1

1School of Botany The University of Melbourne Victoria Australia 3010 2Department of Chemistry University of Saskatchewan 110 Science Place Saskatoon SK Canada S7N 5C9 E-mail dgardinerpgradunimelbeduau http wwwtigrorg Putative genes were predicted using Fgenesh software at http wwwsoftberrycom Fungal culture The wild type Journal of Molecular Evolution DOI 101007s00239-004-2666-z Issue Volume 59 Number 6 Date December 2004 Pages 761 - 770 Analysis of the Molecular Evolutionary History of the Ascorbate Peroxidase Gene Family Inferences from the Rice Genome Felipe Karam Teixeira1 Larissa Menezes-Benavente1 Rogeacuterio Margis1 2 and Maacutercia Margis-Pinheiro1

(1) Laboratoacuterio de Geneacutetica Molecular Vegetal Departamento de Geneacutetica UFRJ 21944-970 Rio de Janeiro Brasil(2) Departamento de Bioquiacutemica Instituto de Quiacutemica UFRJ 21944-970 Rio de Janeiro Brasil Genomic se- quences were also analyzed in the FGENESH gene structure pre- diction program (httpwwwsoftberrycom) (Solovyev 2001) and GeneMark program (http Incomplete gene structure prediction with almost 100 specificity SL Chin J Xiong T Ioerger SH Sze - 2004 - txspacetamuedu iv Fgenesh-C The proposed gene structure prediction algorithm by far has the best 25 12 Comparison against Est2Genome Sim4 Spidey Fgenesh-c Journal of Biotechnology 109 (2004) 217ndash226

Preparation of single rice chromosome for construction of a DNA library using a laser microbeam trap

Xiaohui Liu a1 Haowei Wang b1 Yinmei Li b Yesheng Tang a Yilei Liu a Xin u a Peixin Jia a Kai Ying a Qi Feng a Jianping Guan a Chaoqing Jin a Lei Zhang a Liren Lou b Zhuan Zhou c Bin Han aA National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200233 China B Department of Physics University of Science and Technology of China Hefei 230026 China C Institute of Neuroscience Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200031 China Corresponding author Fax +86-21-64825775 E-mail address bhanncgraccn (B Han) ers These sequences were further annotated using gene-prediction software FGENESH to give the pos- sible protein-coding region Science 303 1364-1367 Medicago truncatula DMI1 Required for Bacterial and Fungal Symbioses in Legumes Ane et al (2004) 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberry phtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Proc Natl Acad Sci U S A 2004 August 24 101(34) 12404ndash12410 Inaugural Articles Rapid recent growth and divergence of rice nuclear genomes Jianxin Ma and Jeffrey L Bennetzen

Department of Genetics University of Georgia Athens GA 30602 To whom correspondence should be addressed E-mail maizeugaedu Contributed by Jeffrey L Bennetzen May 25 2004 Almost all LTR-retrotransposons including solo LTRs identified in our studies were predicted as genes by the gene-finding program fgenesh (data not shown) The Plant Journal Volume 37 Issue 4 Page 517 -527 - February 2004 doi101046j1365-313X200301976x Xa26 a gene conferring resistance to Xanthomonas oryzae pv oryzae in rice encodes an LRR receptor kinase-like protein Xinli Sun Yinglong Cao Zhifen Yang Caiguo Xu Xianghua Li Shiping Wang and Qifa Zhang National Key Laboratory of Crop Genetic Improvement National Center of Crop Molecular Breeding Huazhong Agricultural University Wuhan 430070 China For correspondence (fax +86 27 87287092 e-mail swangmailhzaueducn) al 1997) Gene prediction programs used were genscan (Burge and Karlin 1997) and fgenesh (httpwwwsoftberrycom) Promoter Genome Research 141916ndash1923 copy2004 by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonˇovaґ16 Jinsheng Lai16 Jianxin Ma23 Wusirika Ramakrishna24

Victor Llaca15 Jeffrey L Bennetzen23 and Joachim Messing17

1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854 USA

2Department of Biological Sciences and Genetics Program West Lafayette Indiana 47907 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602 USA 4Department of Biological Sciences Michigan Tech University MI 49931 USA 5Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880 USA 6These authors contributed equally to this work 7Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 Page 1 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonovaacute 16 Jinsheng Lai 16 Jianxin Ma 23 Wusirika Ramakrishna 24 TAG Theoretical and Applied Genetics DOI 101007s00122-004-1667-z Issue Volume 109 Number 3 Date August 2004 Pages 515 - 522

Physical mapping and putative candidate gene identification of a quantitative trait locus Ctb1 for cold tolerance at the booting stage of rice K Saito1 Y Hayano-Saito1 W Maruyama-Funatsuki1 Y Sato1 and A Kato1

(1) National Agricultural Research Center for Hokkaido Region Hitsujigaoka 1 Toyohira Sapporo Hokkaido 062-8555 JapanK Saito Email kjsaitoaffrcgojp GENSCAN RICEHMM FGENESH MZEF ) a splice prediction program ( SPLICEPREDIC- TOR ) homology search analysis programs ( BLAST HMMER TAG Theoretical and Applied Genetics DOI 101007s00122-004-1697-6 Issue Volume 109 Number 4 Date August 2004 Pages 690 - 699 The anthracnose resistance locus Co-4 of common bean is located on chromosome 3 and contains putative disease resistance-related genes M Melotto1 4 M F Coelho1 A Pedrosa-Harand2 J D Kelly3 and L E A Camargo1

1 Departamento de Fitopatologia Laboratoacuterio de Geneacutetica Molecular ESALQ Universidade de Satildeo Paulo Piracicaba SP CP 9 13418-900 Brazil

2 Department of Cell Biology and Genetics Institute of Botany University of Vienna Rennweg 14 Vienna 1030 Austria

3 Department of Crop and Soil Sciences Michigan State University East Lansing MI 48824 USA 4 Present address MSU-DOE Plant Research Laboratory Michigan State University 206 Plant Biology

Building East Lansing MI 48824 USA M Melotto Email melottommsuedu and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (httpwww softberrycom)mdashusing Arabidopsis as the model or- ganism Journal of Genetics Vol 83 No 1 P 79-99 April 2004 Structural and functional analysis of rice genome Tyagi A K Khurana J P Khurana P Raghuvanshi S Gaur A Kapur A Gupta V Kumar D Ravi V Vij S Khurana P and Sharma S Department of Plant Molecular Biology University of Delhi South Campus Benito Juarez Road New Delhi 110 021 India It inte- grates results from several gene prediction software such as GENSCAN (Burge and Karlin 1997) FGENESH (Sala- mov and Solovyev 2000) RiceHMM (Sakata The Plant Cell 161220-1234 (2004) Comparative Analysis of the Receptor-Like Kinase Family in Arabidopsis and Rice Shin-Han Shiua Wojciech M Karlowskib Runsun Panad Yun-Huei Tzengac Klaus F X Mayerb and Wen-Hsiung Lia1 a Department of Ecology and Evolution University of Chicago Chicago Illinois 60637 b Munich Information Center for Protein SequencesInstitute of Bioinformatics GSF National Research Center for Environment and Health Neuherberg 85764 Germany c Department of Mathematics National Tsing Hua University Hsinchu Taiwan 300 d Institute of Information Science Academia Sinica Taiwan 115

1 To whom correspondence should be addressed E-mail whliuchicagoedu fax 773-702-9740 a permissive E value cutoff of 1 The rice genes from the indica subspecies was predicted using the whole genome shotgun assembly with FGENESH (Solovyev 2002 Genome Research 141474ndash1482 (2004) copy by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Incongruent Patterns of Local and Global Genome Size Evolution in Cotton Corrinne E Grover1 HyeRan Kim2 Rod A Wing2 Andrew H Paterson3 and Jonathan F Wendel14 1Department of Ecology Evolution and Organismal Biology Iowa State University Ames Iowa 50011 USA 2Arizona Genomics Institute University of Arizona Tucson Arizona 85721 USA 3Plant Genome Mapping Laboratory University of Georgia Athens Georgia 30602 USA hellip Potential genes were predicted by three independent programs FGENESH (httpwwwsoftberrycom) Plant Physiology May 2004 Vol 135 pp 459-470 Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four Triticeae Genomes Yong Qiang Gu Devin Coleman-Derr Xiuying Kong and Olin D Anderson United States Department of Agriculture-Agricultural Research Service Western Regional Research Center Albany California 94710 (YQG DC-D ODA) and Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081 China (XK) FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology October 2004 Vol 136 pp 3177-3190 Comparative Sequence Analysis of the Region Harboring the Hardness Locus in Barley and Its Colinear Region in Rice1

Katherine S Caldwell2 Peter Langridge and Wayne Powell Scottish Crop Research Institute Invergowrie Dundee DD2 5DA United Kingdom (KSC WP) and School of Agriculture and Wine (KSC PL) and Australian Centre for Plant Functional Genomics (PL) University of Adelaide Waite Campus Glen Osmond South Australia 5064 Australia jp Sakata et al 2002 ) which couples the integration of several programs for the prediction of open reading frames (GENSCAN RiceHMM FGENESH MZEF) with GENES amp DEVELOPMENT 18687-699 2004 pyramus and thisbe FGF genes that pattern the mesoderm of Drosophila embryos

Angelike Stathopoulos1 Bergin Tam1 Matthew Ronshaugen1 Manfred Frasch2 and Michael Levine13 1 Department of Molecular and Cell Biology Division of Genetics amp Development University of California Berkeley California 94720-3204 USA 2 Brookdale Department of Molecular Cell and Developmental Biology Mount Sinai School of Medicine New York New York 10029 USA hellip FGF protein sequences used in alignment and phylogenetic reconstruction were gathered from GenBank or inferred from genomic sequence using GENESCAN (Burge and Karlin 1997 ) and FGENESHhellip Genome Research 141888-1901 2004 Organization and Evolution of a Gene-Rich Region of the Mouse Genome A 127-Mb Region Deleted in the Del(13)Svea36H Mouse Ann-Marie Mallon14 Laurens Wilming24 Joseph Weekes1 James GR Gilbert2 Jennifer Ashurst2 Sandrine Peyrefitte2 Lucy Matthews2 Matthew Cadman1 Richard McKeone1 Chris A Sellick1 Ruth Arkell1 Marc RM Botcherby3 Mark A Strivens1 R Duncan Campbell3 Simon Gregory25 Paul Denny1 John M Hancock16 Jane Rogers2 and Steve DM Brown1 1 Medical Research Council Mammalian Genetics Unit Harwell Oxfordshire United Kingdom 2 Wellcome Trust Sanger Institute Hinxton Genome Campus United Kingdom 3 Medical Research Council Rosalind Franklin Centre for Genomics Research Hinxton Genome Campus United Kingdom hellip Ab initio gene structures were predicted using FGENESH (Salamov and Solovyev 2000 ) and GENSCANhellip Current Proteomics January 2004 vol 1 no 1 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE1 Affiliations 1 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom

sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene Nucleic Acids Research 2003 Vol 31 No 4 1148-1155 Characterization of Arabidopsis thaliana ortholog of the human breast cancer susceptibility gene 1 AtBRCA1 strongly induced by gamma rays S Lafarge and M-H Montaneacute CEA Cadarache DSV-DEVM Laboratoire de Radiobiologie Veacutegeacutetale Bat 185 F-13108 St Paul Lez Durance Cedex France To whom correspondence should be addressed Tel +33 4 42 25 35 56 Fax + 33 4 42 25 26 25 Email marie-helenemontaneceafr Received November 18 2002 Accepted December 5 2002 DDBJEMBLGenBank accession no AF515728 hellipGene structure prediction was done on software implemented on the Softberry web page (httpwwwsoftberrycom) analysis of protein domains using the SMARThellip hellipThe gene structure of At4g21070 was determined with three gene structure prediction software packages (Softberry GenScan Grail) hellip To resolve this ambiguity in intronndashexon prediction we postulated the presence of two genes given by Softberry prediction software and performed northern blotting and 5 RACE to characterize the structural organization of the At4g21070 locushellip Proc Natl Acad Sci U S A 2003 July 22 100(15) 9055ndash9060 doi 101073pnas1032999100 Plant Biology Gene expression of a gene family in maize based on noncollinear haplotypes Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway NJ 08854-8020 Communicated by Brian A Larkins University of Arizona Tucson AZ May 19 2003 (received for review 2002 April 10) To whom correspondence should be addressed E-mail messingmbclrutgersedu The FGENESH program (Softberry Mount Kisco NY) was used for gene prediction analysis BMC Genomics 2003 4 22

doi 1011861471-2164-4-22 Published online 2003 June 3 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams 12 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA 3Department of Biological Sciences Wichita State University Wichita Kansas USA

Corresponding author Sreedhar Oduru odurusreedharttuhscedu Janee L Campbell janeecampbellttuhscedu SriTulasi Karri phrskttuhscedu William J Hendry williamhendrywichitaedu Shafiq A Khan shafiqkhanttuhscedu Simon C Williams simonwilliamsttuhscedu

Two gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=H Insect Molecular Biology Volume 12 Issue 4 Page 319 - August 2003 doi101046j1365-2583200300415x Expression of an Aedes aegypti cation-chloride cotransporter and its Drosophila homologues V Filippov K Aimanova and S S GillAffiliations Department of Cell Biology and Neuroscience University of California Riverside USA Correspondence Sarjeet S Gill 5429 Boyce Hall Environmental Toxicology Graduate Program University of California Riverside CA 92521 USA Tel +1 909 787 4621 Fax +1 909 787 3087 E-mail Sarjeetgillucredu significant similarity to the Drosophila genes were used for gene structure prediction with the FGENESH program available on site http wwwsoftberrycom Developmental Biology 256 (2003) 276ndash289 tcl-2 encodes a novel protein that acts synergistically with Wnt signaling pathways in C elegans Xiaojun Zhaoa Hitoshi Sawab and Michael A Hermana a Program in Molecular Cellular and Developmental Biology Division of Biology Kansas State University Manhattan KS 66506 USA b Laboratory for Cell Fate Decision RIKEN Center for Developmental Biology 2-2-3 Minatojima-minamimachi Chuo-ku Kobe 650-0047 Japan Received for publication 10 September 2002 revised 25 November 2002 accepted 19 December 2002 hellipCbTCL-2 is conceptually translated from a gene predicted by the FGENSH (Salamov and Solovyev 2000 httpwwwsoftberrycom) using defaults for C elegans genomic sequences Proc Natl Acad Sci U S A 2003 May 27 100(11) 6569ndash6574 doi 101073pnas0732024100 Evolution Molecular paleontology of transposable elements in the Drosophila melanogaster genome Vladimir V Kapitonov and Jerzy Jurka

Genetic Information Research Institute 2081 Landings Drive Mountain View CA 94043 Communicated by Margaret G Kidwell University of Arizona Tucson AZ April 7 2003 (received for review 2002 December 23)

To whom correspondence may be addressed E-mail vladimirulamgirinstorg or jurkagirinstorg hellipWe used FGENESH (ref 18 wwwsoftberrycom) for identifying genes encoded by TEs Genetics and Molecular Biology ISSN 1415-4757 versioacuten impresa Genet Mol Biol v26 n4 Satildeo Paulo dic 2003 Iron homeostasis related genes in rice Jeferson GrossI II Ricardo Joseacute SteinII Arthur Germano Fett-NetoI II Janette Palma FettI II

IUniversidade Federal do Rio Grande do Sul Centro de Biotecnologia Porto Alegre RS Brazil IIUniversidade Federal do Rio Grande do Sul Departamento de Botacircnica Porto Alegre RS Brazil IIIBotanical Institute of Ludwig-Maximilians-Universitaumlt Muumlnchen Germany The prediction algorithms were GenScan (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) GenomeScan (Burge and Karlin 1997 httpgenesmitedugenomescanhtml) FGENESH (Salamov and Solovyev 2000 httpwwwsoftberrycomberryphtmltopic= gfind) GeneMarkhmm (Borodovsky and Lukashin unpublished httpopalbiologygatecheduGeneMarkeukhmmcgi) and GrailEXP (Xu and Uberbacher 1997 httpcompbioornlgovgrailexp) Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome J Lai N Dey CS Kim AK Bharti S Rudd KFX Mayer hellip for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for monocotyledonous genes (httpwwwsoftberrycom) Published 23 September 2003 BMC Plant Biology 2003 36 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya12 Address 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia and 2NSW Agricultural Genomics Centre Wagga Wagga Australia Email Qian-Hao Zhu - qianhaozhucsiroau Mohammad Shamsul Hoque - mohammadhoquecsiroau Elizabeth S Dennis - lizdenniscsiroau Narayana M Upadhyaya - narayanaupadhyayacsiroau Corresponding author hellip Analyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCANhellip

The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Australasian Plant Pathology Volume 32 Number 4 2003 pp 511-519 Small scale functional genomics of the blackleg fungus Leptosphaeria maculans analysis of a 38 kb region Alexander Idnurm Janet L Taylor M Soledade C Pedras and Barbara J Howlett vertebrate and Arabidopsis settings Burge and Karlin 1997) and FGENESH on Neurospora crassa and Schizosaccharomyces pombe settings (wwwsoftberrycom) as Barley Genetics Newsletter Volume 32 Hard-copy edition pages 34 - 37 MAPPING AND SEQUENCING OF THE BARLEY PUTATIVE HYPERSENSITIVE INDUCED REACTION GENES Nils Rostoks1 David Kudrna1 and Andris Kleinhofs12

1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 2 School of Molecular Biosciences Washington State University Pullman WA 99164 The full length coding sequence was reconstructed using a combination of FGENESH gene prediction program (httpwwwsoftberrycom) and alignment with cDNAs from the other barley HIR groups TAG Theoretical and Applied Genetics DOI 101007s00294-003-0391-6 Issue Volume 43 Number 5 Date August 2003 Pages 351 - 357 Characterisation of the mating-type locus of the plant pathogenic ascomycete Leptosphaeria maculans Anton J Cozijnsen A1 and Barbara J Howlett A1 A1 School of Botany The University of Melbourne 3010 Victoria Australia hellipGenes introns exons and transcription initiation sites were predicted by analysis with FGENESH (wwwsoftberrycom) on Neurospora crassa andhellip BMC Plant Biol 2003 3 6

doi 1011861471-2229-3-6 Published online 2003 September 23 Received May 23 2003 Accepted September 23 2003 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya 12 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia 2NSW Agricultural Genomics Centre Wagga Wagga Australia

Corresponding author Qian-Hao Zhu qianhaozhucsiroau Mohammad Shamsul Hoque mohammadhoquecsiroau Elizabeth S Dennis lizdenniscsiroau Narayana M Upadhyaya narayanaupadhyayacsiroau hellipAnalyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml identified a single-exon gene capable of encoding a protein with the DNA binding domain of the EREBPAP2 family of plant transcription factors [2636] 1515 bp downstream from the Ds insertion The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Alignment of EREBPAP2 domains was performed using programs of Genetics Computer Group Wisconsin software suit [11] Genetics Vol 164 655-664 June 2003 Copyright copy 2003 Map-Based Cloning of Leaf Rust Resistance Gene Lr21 From the Large and Polyploid Genome of Bread Wheat Li Huanga Steven A Brooksa Wanlong Lia John P Fellersb Harold N Tricka and Bikram S Gilla a Wheat Genetics Resource Center Department of Plant Pathology Kansas State University Manhattan Kansas 66506-5502 b USDA-ARS Plant Science and Entomology Unit Kansas State University Manhattan Kansas 66506-5502 Corresponding author Bikram S Gill 4024 Throckmorton Kansas State University Manhattan KS 66506-5502 bsgksuedu (E-mail hellipIn addition FGENSH 11 (httpwwwsoftberrycom) was used for gene prediction (with monocot genomic DNA parameters) Nucleic Acids Research 2003 Vol 31 No 1 229-233 The TIGR rice genome annotation resource annotating the rice genome and creating resources for plant biologists Qiaoping Yuan Shu Ouyang Jia Liu Bernard Suh Foo Cheung Razvan Sultana Dan Lee John Quackenbush and C Robin Buell The Institute for Genomic Research 9712 Medical Center Dr Rockville MD 20850 USA To whom correspondence should be addressed Tel +1 301 8383558 Fax +1 301 8380208 Email rbuelltigrorg Received August 14 2002 Revised and Accepted October 2 2002 hellipThe rice sequences were processed with multiple ab initio gene finders including FGENESH (httpwwwsoftberrycom)hellip hellip Working models were generated using the FGENESH output and putative identification for the gene was obtained from the most significant database match while models with no significant database match were labeled as hypothetical proteins

JXB Advance Access originally published online on June 18 2003 Journal of Experimental Botany Vol 54 No 389 pp 1995-1996 August 1 2003 Received 21 April 2003 Accepted 25 April 2003 OsSET1 a novel SET-domain-containing gene from rice Yun-Kuan Liang Ying Wang Yong Zhang Song-Gang Li Xiao-Chun Lu Hong Li Cheng Zou Zhi-Hong Xu and Shu-Nong Bai PKU-Yale Joint Research Center of Agricultural and Plant Molecular Biology National Key Laboratory of Protein Engineering and Plant Gene Engineering College of Life Sciences Peking University 5 Yiheyuan Road Beijing 100871 PR China To whom correspondence should be addressed Fax +86 10 6275 1526 E-mail shunongbpkueducn It localizes at chromosome three in rice genome at the contig 1300 (httpwwwsoftberrycomberryphtmltopic=gfindampprg=FGENESH GenBank accession number BMC Genomics 2003 422 Published 3 June 2003 Received 31 January 2003 Accepted 3 June 2003 This article is available from httpwwwbiomedcentralcom1471-2164422 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams12 Address 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA and 3Department of Biological Sciences Wichita State University Wichita Kansas USA Email Sreedhar Oduru - odurusreedharttuhscedu Janee L Campbell - janeecampbellttuhscedu SriTulasi Karri - phrskttuhscedu William J Hendry - williamhendrywichitaedu Shafiq A Khan - shafiqkhanttuhscedu Simon C Williams - simonwilliamsttuhscedu Corresponding author hellipTwo gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=HsapiensTAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Characterization of soybean genomic features by analysis of its expressed sequence tags

Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

Jin-Song Zhang Email jszhanggeneticsaccn

Shou-Yi Chen Email sychengeneticsaccn Phone +86-10-64886859 Fax +86-10-64873428

(1) Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

(2) Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China

prediction of these BAC-contig sequences was based on the gene-prediction program FGENSH (Arabidopsis matchFGENESH DDT Vol 7 No 11 (Suppl) 2002 S70-S76 wwwdrugdiscoverytodaycom Genome annotation techniques new approaches and challenges Alistair G Rust Emmanuel Mongin and Ewan Birney European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridge UK CB10 1SD tel +44 1223 494420 fax +44 1223 494468 e-mail birneyebiacuk Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Box 1 Useful human genome annotation and browser URLs Human genome browsers bull UCSC Human Genome Browser httpgenomecseucsceducgi-binhgGateway bull Softberry Genome Explorer httpwwwsoftberrycomberryphtmltopic=genomexp Ab initio gene prediction programs Ab initio gene predictors rely on the statistical qualities of exons rather than on homologies Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Celerarsquos pipeline) and GrailEXP [16] (ORNL) Proc Natl Acad Sci U S A 2002 August 20 99(17) 11423ndash11428 doi 101073pnas162276199 Neurobiology Identification of G protein-coupled receptors for Drosophila PRXamide peptides CCAP corazonin and AKH supports a theory of ligand-receptor coevolution Yoonseong Parkdagger Young-Joon Kim and Michael E AdamsdaggerDagger

Departments of Entomology and daggerCell Biology and Neuroscience 5429 Boyce Hall University of California Riverside CA 92521 Edited by Lutz Birnbaumer National Institutes of Health Research Triangle Park NC and approved June 14 2002 (received for review 2002 May 7) DaggerTo whom reprint requests should be addressed E-mail adamsmailucredu

This article has been corrected See Proc Natl Acad Sci U S A 2002 October 15 99(21) 13961b For each Drosophila GPCR prediction of gene structure was made in FGENESH (wwwsoftberrycom ref 21) by using about 20 kb of genomic sequence surrounding highly conserved regions particularly for 5 prime and 3 prime ends of ORFs Putative Drosophila GPCRs in the database were amplified by RT-PCR using primers based on gene predictions in the FGENESH gene finder (wwwsoftberrycom ref 21) 21 Salamov A A amp Solovyev V V (2000) Genome Res 10 516-522

prediction httpwwwsoftberrycomberry

Eukaryotic Cell October 2002 p 719-724 Vol 1 No 5 Isocitrate Lyase Is Essential for Pathogenicity of the Fungus Leptosphaeria maculans to Canola (Brassica napus) Alexander Idnurm and Barbara J Howlett School of Botany The University of Melbourne Melbourne Victoria 3010 Australia Received 17 June 2002 Accepted 29 July 2002 hellip The DNA sequence obtained was compared to those in the GenBank database by using BLAST (1) and genes were predicted by using FGENESH software (httpwwwsoftberrycom) and GENSCAN (wwwbionavigatorcom) Bio-Almanac GLE ToxExpress P Offering CLG Annotated sequence data The genes are identified with the FGENESH11 gene modeling software exclusively li- censed from Softberry Inc Automatic Analysis of 106 kb of contiguous DNA sequence from the D genome of wheat reveals high gene density hellip SA Brooks L Huang BS Gill JP Fellers trix In addition FGENESH 11 (httpwwwsoftberrycom) was used for CDS prediction with monocot genomic DNA parameters Both Molecular Genetics and Genomics DOI 101007s00438-002-0706-1 Issue Volume 267 Number 6 Date August 2002 Pages 713 - 720 Genome sequencing of a 239-kb region of rice chromosome 10L reveals a high frequency of gene duplication and a large chloroplast DNA insertion Q Yuan J Hill J Hsiao K Moffat S Ouyang Z Cheng J Jiang C Buell A1 The Institute for Genomic Research 9712 Medical Center Drive Rockville MD 20850 USA A2 Department of Horticulture University of Wisconsin Madison WI 53706 USA The sequences were analyzed with several gene prediction programs including FGENESH (httpwwwsoftberrycom) Genemarkhmm (rice matrix httpopalbiology

Genetics Vol 162 1389-1400 November 2002 Copyright copy 2002 Different Types and Rates of Genome Evolution Detected by Comparative Sequence Analysis of Orthologous Segments From Four Cereal Genomes Wusirika Ramakrishnaa Jorge Dubcovskyb Yong-Jin Park1a Carlos Bussob John Embertona Phillip SanMiguelc and Jeffrey L Bennetzena a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Department of Agronomy and Range Science University of California Davis California 95616 c Purdue University Genomics Core WSLR Purdue University West Lafayette Indiana 47907

Corresponding author Jeffrey L Bennetzen Hansen Bldg Purdue University West Lafayette IN 47907 maizebilbobiopurdueedu (E-mail)

FGENESH (httpwwwsoftberrycomnucleohtml) with the maize training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and

GeneMarkhmm (httpgenemarkbiologygatecheduGene Mark) Functional amp Integrative Genomics DOI 101007s10142-002-0055-5 Issue Volume 2 Numbers 1-2 Date May 2002 Pages 51 - 59 Genomic sequencing reveals gene content genomic organization and recombination relationships in barley Nils Rostoks Yong-Jin Park Wusirika Ramakrishna Jianxin Ma Arnis Druka Bryan A Shiloff Phillip J SanMiguel Zeyu Jiang Robert Brueggeman Devinder Sandhu Kulvinder Gill Jeffrey L Bennetzen Andris Kleinhofs A1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA A2 Department of Biological Sciences Purdue University West Lafayette IN 47907 USA A3 National Center for Genome Resources 2935 Rodeo Park Drive East Santa Fe NM 87505 USA A4 G302 Agronomy Hall Iowa State University Ames IA 50011-1010 USA A5 Department of Agronomy University of Nebraska Lincoln NE 68583 USA A6 School of Molecular Biosciences and Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA version 10 with maize parameters The FGENESH predictions were run at httpwwwsoftberrycom BAC genomic regions were defined Structural organization of the barley D-hordein locus in comparison with its orthologous regions of hellip YQ Gu OD Anderson CF Londeore X Kong RN hellip et al 1997) to search for additional genes In addition FGENESH (httpwwwsoftberrycomberryphtml) and GENESCAN (httpgenes

Published online before print June 20 2002 101073pnas142284999 PNAS | July 9 2002 | vol 99 | no 14 | 9328-9333 The barley stem rust-resistance gene Rpg1 is a novel disease-resistance gene with homology to receptor kinases R Brueggeman N Rostoks D Kudrna A Kilian F Han J Chensect A Druka B Steffensonpara and A Kleinhofs Department of Crop and Soil Sciences Washington State University Pullman WA 99164-6420 para Department of Plant Pathology 495 Borlaug Hall 1991 Upper Buford Circle St Paul MN 55108-6030 and School of Molecular Biosciences Washington State University Pullman WA 99164-4234 Communicated by Diter von Wettstein Washington State University Pullman WA May 13 2002 (received for review March 25 2002) The gene prediction programs GENSCAN (httpgenesmiteduGENSCANhtml) and FGENESH (httpwwwsoftberrycom) as well as NEURAL NETWORK PROMOTER

PREDICTION (httpwwwfruitflyorgseq_toolspromoterhtml) localized the putative transcription start site of the gene about 400 bp upstream of the translation start site Plant Physiol 2002 December 130(4) 1626ndash1635 doi 101104pp012179 Received July 30 2002 Accepted October 1 2002 Contiguous Genomic DNA Sequence Comprising the 19-kD Zein Gene Family from Maize1

Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway New Jersey 08854ndash8020 Corresponding author e-mail messingmbclrutgersedu fax 732ndash445ndash0072 Draft sequences generated from high-throughput DNA sequencing (phase II) were subjected to gene prediction programs with FGENESH (Softberry Inc Mount Kisco NY) The Plant Cell Vol 14 3213-3223 December 2002 Copyright copy 2002 Received July 22 2002 accepted September 26 2002 Structural Analysis of the Maize Rp1 Complex Reveals Numerous Sites and Unexpected Mechanisms of Local Rearrangement Wusirika Ramakrishnaa John Embertona Matthew Ogdena Phillip SanMiguelb and Jeffrey L Bennetzen1a a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 1 To whom correspondence should be addressed E-mail maizebilbobiopurdueedu fax 765-496-1496 FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCAN Plant Physiol 2002 December 130(4) 1728ndash1738

doi 101104pp014951 Comparative Sequence Analysis of the Sorghum Rph Region and the Maize Rp1 Resistance Gene Complex Wusirika Ramakrishna John Emberton Phillip SanMiguel Matthew Ogden Victor Llaca Joachim Messing and Jeffrey L Bennetzen

Department of Biological Sciences Purdue University West Lafayette Indiana 47907 (WR JE MO JLB) Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 (PSM) and Waksman Institute Rutgers University Piscataway New Jersey 08854 (VL JM) Corresponding author e-mail maizebilbobiopurdueedu fax 765ndash496ndash1496 Received September 19 2002 Accepted October 8 2002 hellipAnnotation and sequence analysis were performed as described earlier (Dubcovsky et al 2001 Song et al 2001 Ramakrishna et al 2002a ) FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and GeneMarkhmm (httpopalbiologygatecheduGeneMarkeukhmmcgi) Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four hellipYQ Gu D Coleman-Derr X Kong OD Anderson FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology 2004 - plantphysiolorg - plantphysiolorg - intlplantphysiolorg - ncbinlmnihgov - all 5 versions raquoA Genome-Wide Screen Identifies Genes Required for Centromeric Cohesion JJ Doyle J Denarie F Debelle JC Prome BB Amor hellip 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberryphtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Various programs Plant Molecular Biology Issue Volume 58 Number 3 Date June 2005 Pages 421 ndash 433 DOI 101007s11103-005-5702-5 OsPPR1 a pentatricopeptide repeat protein of rice is essential for the chloroplast biogenesis Kodiveri M Gothandam1 Eun-Sook Kim1 Hongjoo Cho1 and Yong-Yoon Chung1

(1) School of Life Sciences and Biotechnology Korea University Sungbuk-ku 136-701 Seoul Anam-Dong Korea nucleotide and amino acid sequences were analyzed by the Basic Local Alignment SearchTool (BLAST) and the Soft berry prog- rame (httpwwwsoftberrycom)

Plant Physiol February 2002 Vol 128 pp 336-340 wwwplantphysiolorgcgidoi101104pp010875 Received September 25 2001 returned for revision September 29 2001 accepted November 2 2001 Cellulose Synthase-Like Genes of Rice1

Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 The Rice Genome Research Program cDNA clones were of high quality all but one were viable and accurately annotated The one exception D22177 was chimeric containing OsCSLA2 at one end and a predicted DNA-binding protein at the other For all sequences the corresponding

proteins were deduced using gene prediction software from GeneMark (Atlanta httpopalbiologygatecheduGeneMark) and Softberry Inc (White Plains NY httpwwwsoftberrycom) and by manual alignment with the Arabidopsis Csl proteins and with each other IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated Silby et al Microbiology2004 150 518-520 MW Silby PB Rainey SB Levy Using SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of Plant Cell 2002 September 14(9) 2107ndash2119 Received March 25 2002 Accepted June 14 2002 Two Novel Fungal Virulence Genes Specifically Expressed in Appressoria of the Rice Blast Fungus

Chaoyang Xuea Gyungsoon Parka Woobong Choib Li Zhengc Ralph A Deanb and Jin-Rong Xua1

aDepartment of Botany and Plant Pathology Purdue University West Lafayette Indiana 47907 bDepartment of Plant Pathology North Carolina State University Raleigh North Carolina 27606 cSyngenta Agribusiness Biotechnology Research Inc Research Triangle Park North Carolina 27709 1To whom correspondence should be addressed E-mail xubtnypurdueedu fax 765-494-0363 Approximately 12- and 14-kb upstream sequences of GAS1 and GAS2 were sequenced and analyzed with several programs including TRES (wwwbioportalbicnusedusgtres) Expasy (wwwexpasyorg) and SoftBerry (wwwsoftberrycom) European Journal of Neuroscience January 2002 vol 15 no 1 pp 79-86(8) Characterizing CGI-94 (comparative gene identification-94) which is down-regulated in the hippocampus of early stage Alzheimers disease brain Heese K Nakayama T Hata R Masumura M Akatsu H Li F Nagai Y Yamamoto T Kosaka K Suemoto T Sawada T Additionally protein sequence analysis was performed using the following programs at ExPASy httpwwwexpasych softberry httpwwwsoftberrycomindex Journal of Cellular Biochemistry Volume 91 Issue 5 Pages 1030 - 1042 Published Online 25 Feb 2004 Copyright copy 2004 Wiley-Liss Inc A Wiley Company Received 4 September 2003 Accepted 21 November 2003 Characterizing the new transcription regulator protein p60TRP K Heese 1 T Yamada 1 H Akatsu 2 T Yamamoto 2 K Kosaka 2 Y Nagai 1 T Sawada 1

1BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan 2Choju Medical Institute Fukushimura Hospital 19-14 Aza-Yamanaka Noyori Toyohashi Aichi 441-8124 Japan email K Heese (heeseksilverocnnejp) Correspondence to K Heese BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan expasych) softberry httpwwwsoftberry comindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorg toolsaacomp Genome Research 14929-933 2004 ISSN 1088-9051 $500 The Ensembl Core Software Libraries Arne Stabenau1 Graham McVicker1 Craig Melsopp1 Glenn Proctor1 Michele Clamp2 and Ewan Birney13 1 EMBL European Bioinformatics Institute Wellcome Trust Genome Campus Hinxton CB10 1SD UK 2 The Broad Institute Cambridge Massachusetts 02141-2023 USA

2003 ) Finally there have been several commercial genome management products based on proprietary technology from Softberry Celera and Doubletwist Generation of T-DNA tagging lines with a bidirectional gene trap vector 2 and the establishment of hellip G An annotated in the public databases we undertook 274 annotation with the Softberry program (http 275 wwwsoftberrycomberryphtml) Functional clas- 276 Journal of Bacteriology January 2002 p 183-190 Vol 184 No 1 0021-919301$0400+0 DOI 101128JB1841183-1902002 Received 25 July 2001 Accepted 11 October 2001 Regulation of the acuF Gene Encoding Phosphoenolpyruvate Carboxykinase in the Filamentous Fungus Aspergillus nidulans Michael J Hynes Oliver W Draht and Meryl A Davis Department of Genetics University of Melbourne Parkville Victoria 3010 Australia The Protein Sequence Analysis program (httpwwwsoftberrycomproteinhtml) predicted a PEPCK (ATP) signature sequence between amino acids 275 and 290 Proteins Structure Function and GeneticsVolume 53 Issue S6 Pages 352 - 368 Supplement Fifth Meeting on the Critical Assessment of Techniques for Protein Structure Prediction Published Online 15 Oct 2003 Received 4 March 2003 Accepted 23 June 2003 Digital Object Identifier (DOI) 101002prot10543 Comp Modeling Assessment Assessment of homology-based predictions in CASP5 Anna Tramontano 1 Veronica Morea 2

1Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Rome Italy 2CNR Institute of Molecular Biology and Pathology University of Rome La Sapienza Rome Italy email Anna Tramontano (AnnaTramontanouniromalit) Correspondence to Anna Tramontano Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Ple Aldo Moro 5-00185 Rome Italy hellipPage 1 Assessment of Homology-Based Predictions in CASP5 Anna Tramontano 1 and Veronica Morea 2 1 Department of Biochemical Sciences American Journal of HematologyVolume 73 Issue 3 Pages 161 - 168 Published Online 20 Jun 2003 Received 23 October 2002 Accepted 15 April 2003 Digital Object Identifier (DOI) 101002ajh10358

Spectrum of thalassemia mutations and HbF levels in the heterozygous Moroccan population Wafaa Lemsaddek 1 Isabel Picanccedilo 2 Filomena Seuanes 2 Lahoucine Mahmal 3 Saacircd Benchekroun 3 Mohammed Khattab 4 Paulo Nogueira 5 Leonor Osoacuterio-Almeida 1

1Laboratoacuterio de Geneacutetica Molecular Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa Caparica Portugal 2Laboratoacuterio de Hematologia Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal 3Service Heacutemato-Oncologie Hocircpital 20 Ao t CHU Ibn Rochd Casablanca Morocco 4Service Heacutemato-Oncologie Peacutediatrique Hocircpital dEnfants CHU Rabat Morocco 5Observatoacuterio Nacional de Sauacutede Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal email Leonor Osoacuterio-Almeida (mlofctunlpt) Correspondence to Leonor Osoacuterio-Almeida Laboratoacuterio de Geneacutetica Molecular Secccedilatildeo Autoacutenoma de Biotecnologia Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa 2829-516 Caparica Portugal hellip Page 1 Spectrum of Thalassemia Mutations and HbF Levels in the Heterozygous Moroccan Population Wafaa Lemsaddek 1 Isabel Picanccedilo Lecture Notes in Computer Science Publisher Springer-Verlag Heidelberg ISSN 0302-9743 Subject Computer Science Volume 2812 2003 Title Algorithms in Bioinformatics Third International Workshop WABI 2003 Budapest Hungary September 15-20 2003 Proceedings Editors Gary Benson Roderic Page ISBN 3-540-20076-2 DOI 101007b13243 Chapter pp 124 - 138 Online Date December 2003 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron1 and Irena Rusu1

(1) IRIN Universiteacute de Nantes 2 Rue de la Houssiniegravere BP 92208 44322 Nantes Cedex 3 France Page 1 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron and Irena Rusu Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Received 23 October 2003 accepted 15 December 2003 Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes

H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Brain Aging Vol 2 No 3 2002 Page 9-22 Long-term Efficacy of Cholinesterase Inhibitors Serge Gauthier McGill Centre for Studies in Aging Quebec Canada Correspondence Dr Serge Gauthier FRCPC McGill Centre for Studies in Aging 6825 LaSalle Boulevard Verdun Quebec Canada H4H 1R3 Tel +1 514 766 2010 Fax +1 514 888 4050 Email sergegauthiermcgillca hellipAdditionally protein sequence analysis was performed using the following programs at the ExPASyndashwwwndashserver (httpwwwexpasych) softberry httpwwwsoftberrycomindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorgtoolsaacomp Plant Physiology December 2003 Vol 133 pp 2040ndash2047 Generation and Analysis of End Sequence Database for T-DNA Tagging Lines in Rice1 Suyoung An2 Sunhee Park2 Dong-Hoon Jeong Dong-Yeon Lee Hong-Gyu Kang Jung-Hwa Yu Junghe Hur Sung-Ryul Kim Young-Hea Kim Miok Lee Soonki Han Soo-Jin Kim Jungwon Yang Eunjoo Kim Soo Jin Wi Hoo Sun Chung Jong-Pil Hong Vitnary Choe Hak-Kyung Lee Jung-Hee Choi Jongmin Nam Seong-Ryong Kim Phun-Bum Park Ky Young Park Woo Taek Kim Sunghwa Choe Chin-Bum Lee and Gynheung An National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790ndash784 Korea (SA SP D-HJ D-YL H-GK J-HY JH S-RK Y-HK ML GA) Department of Life Science Sogang University Seoul 121ndash742 Korea (SH S-JK S-RK) Department of Genetic Engineering Suwon University Suwon 445ndash743 Korea (JY EK P-BP) Department of Biology Sunchon National University Sunchon 540ndash742 Korea (SJW KYP) Department of

Biology Yonsei University Seoul 120ndash749 Korea (HSC J-PH WTK) Department of Biology Seoul National University Seoul 151ndash747 Korea (VC SC) Department of Biology Dong-eui University Pusan 614ndash714 Korea (H-KL J-HC C-BL) and Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University 208 Mueller Laboratory University Park Pennsylvania 16802 (JN) hellipIf a particular sequence had not yet been annotated in the public database the sequence surrounding the insertion site was annotated using the Softberry program (httpwwwsoftberrycom) and the GeneMark program (httpopalbiologygatecheduGeneMark) Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie F Moehrlen ndash Page 1 Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie Frank Moumlhrlen Heidelberg 2002 Page 2 INAUGURAL-DISSERTATION zur heidiubuni-heidelbergde

  • FGENES
  • FGENESH
  • FGENESH++
    • Genome Research 15566-576 2005
    • ECgene Genome-based EST clustering and gene modeling for alternative splicing
    • Genome Research 14685-692 2004
    • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
    • Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes
    • Genome Research
      • Computing Center Academia Sinica Taipei 11529 Taiwan
        • FGENESH+
          • Genome Research 14685-692 2004
          • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
            • Computing Center Academia Sinica Taipei 11529 Taiwan
                • FGENESB
                  • Proteorhodopsin genes are distributed among divergent marine bacterial taxa
                  • Different SAR86 subgroups harbour divergent proteorhodopsins
                    • FGENESV
                      • Genome Organization of the SARS-CoV
                        • FGENES-M
                        • BESTORF
                        • PROTCOMP
                          • A proteomic study of the arabidopsis nuclear matrix
                          • Gene prediction in eukaryota
                            • BPROM
                            • SPLICEDB
                              • The evolving roles of alternative splicing
                              • SpliceDB database of canonical and non-canonical mammalian splice sites
                                • SCAN2
                                  • PromH promoters identification using orthologous genomic sequences
                                    • PDISORDER
                                    • SPL
                                    • NSITE
                                    • TSSP
                                    • PLANTPROM
                                      • Plant promoter prediction with confidence estimation
                                      • PlantProm a database of plant promoter sequences
                                        • PROMH
                                        • Other FGENESH
                                          • TAG Theoretical and Applied Genetics
                                            • MIPS analysis and annotation of proteins from whole genomes
                                              • Gene expression of a gene family in maize based on noncollinear haplotypes
                                              • TAG Theoretical and Applied Genetics
                                              • Genome annotation techniques new approaches and challenges
                                                • Various programs
                                                  • Characterizing the new transcription regulator protein p60TRP
                                                  • The Ensembl Core Software Libraries
                                                  • Lecture Notes in Computer Science
                                                  • Long-term Efficacy of Cholinesterase Inhibitors
Page 8: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)

predict the genes between D9S277 and D9S127 in 9q311 the gene prediction tools also in the UCSC Genome Browser having Acembly Ensembl Fgenesh GenScan and Genome Research 1554-66 2005 Gene and alternative splicing annotation with AIR Liliana Florea145 Valentina Di Francesco2 Jason Miller1 Russell Turner1 Alison Yao2 Michael Harris2 Brian Walenz1 Clark Mobarry1 Gennady V Merkulov3 Rosane Charlab3 Ian Dew1 Zuoming Deng3 Sorin Istrail1 Peter Li2 and Granger Sutton1 1 Informatics Research Applied Biosystems Rockville Maryland 20850 USA 2 Advanced Solutions Celera Genomics Rockville Maryland 20850 USA 3 Scientific Content and Applications Celera Genomics Rockville Maryland 20850 USA Ab initio prediction programs such as GenScan (Burge and Karlin 1997 ) FGenesH (Salamov and Solovyev 2000 ) Genie (Kulp et al Nucleic Acids Research 2005 Vol 33 Database issue D399-D402 SilkDB a knowledgebase for silkworm biology and genomics Jing Wang Qingyou Xia Ximiao He Mingtao Dai Jue Ruan Jie Chen Guo Yu Haifeng Yuan Yafeng Hu Ruiqiang Li Tao Feng Chen Ye Cheng Lu Jun Wang Songgang Li Gane Ka-Shu Wong Huanming Yang Jian Wang Zhonghuai Xiang Zeyang Zhou and Jun Yu

1 2 34 3 34 3 3

3 3 3 3 3 2 135

1 36 35 35 2

2 35

1 College of Life Sciences Peking University Beijing 100871 China 2 The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China 3 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 4 Graduate School of the Chinese Academy of Sciences Yuquan Road 19A Beijing 100039 China 5 Key Laboratory of Bioinformatics of Zhejiang Province Hangzhou Genomics Institute James D Watson Institute of Genome Sciences of Zhejiang University Hangzhou 310007 China and 6 Department of Medicine University of Washington Genome Center Seattle WA 98195 USA To whom correspondence should be addressed Tel +86 10 80481455 Fax +86 10 80498676 Email junyugenomicsorgcn Correspondence may also be addressed to Zeyang Zhou Tel +86 23 68251123 Fax +86 23 68251128 Email zyzhouswaucqcn The authors wish it to be known that in their opinion the first three authors should be regarded as joint First Authors BGF is a self-developed ab initio program based on GenScan (9) and FgeneSH (10) and was successfully utilized for our rice genome annotation (11) Clinical Cancer Research Vol 11 4029-4036 June 1 2005 Imaging Diagnosis Prognosis A Molecular Signature in Superficial Bladder Carcinoma Predicts Clinical Outcome Lars Dyrskjoslasht Karsten Zieger Mogens Kruhoslashffer Thomas Thykjaer Jens L Jensen Hanne Primdahl Natasha Aziz Niels Marcussen Klaus Moslashller and Torben F Oslashrntoft

1 12 15 15 4

1 6 3 2 1

Authors Affiliations 1 Molecular Diagnostic Laboratory Department of Clinical Biochemistry 2 Department of Urology and 3 University Institute of Pathology Aarhus University Hospital 4 Departments of Theoretical Statistics

and Mathematical Sciences University of Aarhus 5 Aros Applied Biotechnology Aarhus Denmark and 6 Eos BiotechnologyProtein Design Labs Fremont California Requests for reprints Torben F Oslashrntoft Molecular Diagnostic Laboratory Department of Clinical Biochemistry Aarhus University Hospital Skejby DK-8200 Aarhus N Denmark Phone 45-89495100 Fax 45-89496018 E-mail orntoftkiaudk array comprising 59619 probe sets representing 46000 unique sequences including known genes expressed sequence tag clusters and FGENESH-predicted exons BMC Evolutionary Biology 2005 51 doi1011861471-2148-5-1 Research article The WRKY transcription factor superfamily its origin in eukaryotes and expansion in plants Yuanji Zhang and Liangjiang Wang Address Plant Biology Division The Samuel Roberts Noble Foundation Ardmore OK 73402 USA Email Yuanji Zhang - yjzhangnobleorg Liangjiang Wang - Kevinlwangaolcom Corresponding author Despite minor differences in the gene structure prediction both gene prediction programs FGENESH and GENSCAN agree on the major features of the protein Nature Immunology 2005 v6 n3 The immunoglobulin heavy-chain locus in zebrafish identification and expression of a previously hellip N Danilova J Bussmann K Jekosch LA Steiner - - naturecom Nature Immunology Full text access provided to Googlebot Access by Web Services PLoS Biol 2005 June 3(6) e181 Published online 2005 May 24 doi 101371journalpbio0030181 RAG1 Core and V(D)J Recombination Signal Sequences Were Derived from Transib Transposons Vladimir V Kapitonov 1 and Jerzy Jurka 1

1Genetic Information Research Institute Mountain View California United States of America David Nemazee Academic EditorScripps Research Institute United States of America

Corresponding author Vladimir V Kapitonov vladimirgirinstorg Jerzy Jurka jurkagirinstorg Using FGENESH [33] we detected that the RAG1 corendashlike open reading frame (ORF) in the contig 29068 forms a terminal exon (positions 1154ndash2947) of an Genetics Published Articles Ahead of Print published on February 16 2005 as 101534genetics104036327 Identification and Characterization of Regions of the Rice Genome Associated with Broad- Spectrum Quantitative Disease Resistance Randall J Wisser Qi Sundagger Scot H Hulbertsect Stephen Kresovich and Rebecca J Nelsondaggerdagger 1 Department of Plant Breeding and Genetics Institute for Genomic Diversity Cornell University Ithaca New York 14853 daggerComputational Biology Service Unit Cornell Theory

Center Cornell University Ithaca New York 14853 sectDepartment of Plant Pathology Kansas State University Manhattan Kansas 66506 and daggerdaggerDepartment of Plant Pathology Cornell University Ithaca New York 14853 1Corresponding author Rebecca J Nelson Department of Plant Pathology Cornell University 321 Plant Science Ithaca NY 14853 Email rjn7cornelledu GENSCAN (B URGE and K ARLIN 1997) and FGENESH (S ALAMOV and S OLOVYEV 2001) to predict open reading frames Further searches against Plant Physiology January 2005 Vol 137 pp 176-189 Annotations and Functional Analyses of the Rice WRKY Gene Superfamily Reveal Positive and Negative Regulators of Abscisic Acid Signaling in Aleurone Cells1[w]

Zhen Xie Zhong-Lin Zhang Xiaolu Zou Jie Huang Paul Ruas Daniel Thompson and Qingxi J Shen

2 23

Department of Biological Sciences University of Nevada Las Vegas Nevada 891541 This work was supported by the US Department of Agriculture (grant no 02ndash35301ndash12066) by the National Institutes of Health (Biomedical Research Infrastructure Network seed grant no P20 RR16464) and by the University of Las Vegas Nevada (start-up funds to QJS) XZ was supported by a National Science Foundation Experimental Program to Stimulate Competitive

Research (EPSCoR) Integrative Approaches to Abiotic Stress (EPSndash0132556) graduate assistantship and Z-LZ by a National Science Foundation EPSCoR Advanced Computing in Environmental Sciences postdoctoral fellowship 2 These authors contributed equally to the paper 3 Present address Department of Plant and Microbial Biology University of California Berkeley CA 94720 and Plant Gene Expression Center US Department of Agriculture Albany CA 94710 [w] The online version of this article contains Web-only data First of all three genes (OsWRKY41 -43 and -44) were reannotated using FGENESH (wwwsoftberrycom) because the first introns of these genes were too small PLoS Biol 3(6) e181 (2005) RAG1 Core and V(D)J Recombination Signal Sequences Were Derived from Transib Transposons Vladimir V Kapitonov1 Jerzy Jurka1 1 Genetic Information Research Institute Mountain View California United States of America To whom correspondence should be addressed E-mail vladimirgirinstorg (VVK) E-mail jurkagirinstorg (JJ) Using FGENESH [33] we detected that the RAG1 corendashlike open reading frame (ORF) in the contig 29068 forms a terminal exon (positions 1154ndash2947) of an Plant and Cell Physiology 2005 46(1)3-13 doi101093pcppci503 From Mapping to Sequencing Post-sequencing and Beyond Takuji Sasaki Takashi Matsumoto Baltazar A Antonio and Yoshiaki Nagamura 1

National Institute of Agrobiological Sciences 2-1-2 Kannondai Tsukuba Ibaraki 305-8602 Japan 1 Corresponding author E-mail tsasakiniasaffrcgojp

The gene predictions by programs such as Genescan (Burge and Karlin 1997 ) FGENESH [see Appendix 1 (4)] and Genemark [see Appendix 1 (5)] BLAST (Altschul et Improving the nutritional value of Golden Rice through increased pro-vitamin A content JA Paine CA Shipton S Chaggar RM Howells MJ hellip - Nature Biotechnology 2005 - naturecom Arabidopsis thaliana psy and rice psy (AY024351) genes identified genomic sequences of similarity in which genes were predicted using FGENESH algorithm with Genetics Published Articles Ahead of Print published on January 16 2005 as 101534genetics104035543 THE GENETIC BASIS FOR INFLORESCENCE VARIATION BETWEEN FOXTAIL AND GREEN MILLET (POACEAE) Andrew N Doust Katrien M Devosdagger1 Mike D Gadberry Mike D Galedagger amp Elizabeth A Kellogg University of Missouri-St Louis Department of Biology One University Boulevard St Louis MO 63121 USA daggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH UK Current address University of Georgia-Athens Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building Athens GA 30602 USA 1 Each of these contigs was scanned using FgeneSH (S ALAMOV and S OLOVYEV 2000) and open reading frames (ORFs) were translated and PLoS Biol 3(1) e13 January 2005

Sorghum Genome Sequencing by Methylation Filtration Joseph A Bedell1 Muhammad A Budiman2 Andrew Nunberg1 Robert W Citek1 Dan Robbins1 Joshua Jones2 Elizabeth Flick2 Theresa Rohlfing3 Jason Fries3 Kourtney Bradford3 Jennifer McMenamy3 Michael Smith4 Heather Holeman4 Bruce A Roe5 Graham Wiley5 Ian F Korf6 Pablo D Rabinowicz7 Nathan Lakey8 W Richard McCombie9 Jeffrey A Jeddeloh4 Robert A Martienssen9 1 Bioinformatics Orion Genomics Saint Louis Missouri United States of America 2 Library Construction Orion Genomics Saint Louis Missouri United States of America 3 Sequencing Orion Genomics Saint Louis Missouri United States of America 4 Biomarkers Orion Genomics Saint Louis Missouri United States of America 5 Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma United States of America 6 Genome Center University of California Davis California United States of America 7 The Institute for Genomic Research Rockville Maryland United States of America 8 Business Orion Genomics Saint Louis Missouri United States of America 9 Cold Spring Harbor Laboratory Cold Spring Harbor New York United States of America additional parameters wordmask=seg lcmask M=1 N=ndash1 Q=3 R=3 kap E=1e-10 hspmax=0 To look for potentially novel genes we used Fgenesh (httpwww BMC Genomics 2005 611 doi1011861471-2164-6-11 FAM20 an evolutionarily conserved family of secreted proteins expressed in hematopoietic cells Demet Nalbant1 Hyewon Youn1 3 S Isil Nalbant1 Savitha Sharma1 Everardo Cobos2 3 Elmus G Beale1 Yang Du1 and Simon C Williams1 3 1Department of Cell Biology and Biochemistry Texas Tech University Health Sciences Center

Lubbock Texas 79430 USA 2Department of Internal Medicine Texas Tech University Health Sciences Center Lubbock Texas 79430 USA 3Southwest Cancer Center at University Medical Center Lubbock Texas 79430 USA These results were compared against genes assembled by two gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmit Plant Physiology July 2005 Vol 138 pp 1205-1215 Complex Organization and Evolution of the Tomato Pericentromeric Region at the FER Gene Locus1[w] Romain Guyot Xudong Cheng Yan Su Zhukuan Cheng Edith Schlagenhauf Beat Keller and Hong-Qing Ling

2 2

Institute of Plant Biology University of Zurich 8008 Zurich Switzerland (RG ES BK H-QL) and Institute of Genetics and Developmental Biology Chinese Academy of Sciences Chaoyang District Beijing 100101 China (XC YS ZC H-QL) Putative genes were determined by a combination of coding region prediction software (GENSCAN FGENESH and MZEF with Arabidopsis andor monocot matrix J Gen Virol 86 (2005) 973-983 DOI 101099vir080833-0 Cloning characterization and analysis by RNA interference of various genes of the Chelonus inanitus polydnavirus Marianne Bonvin Dorothee Marti Stefan Wyder Dejan Kojic Marc Annaheim and Beatrice Lanzrein Institute of Cell Biology University of Berne Baltzerstrasse 4 CH-3012 Bern Switzerland Correspondence Beatrice Lanzrein beatricelanzreinizbunibech 12g1forw (5-GAGTCCATGCCGAATGTCAC-3) and 12g1rev (5-CTTCTTGCACAGCGACGAAC-3) were set to amplify the middle region of 12g1 as predicted with FGENESH 10 and The Plant Cell 17343-360 (2005) Evolution of DNA Sequence Nonhomologies among Maize Inbreds Stephan Brunner Kevin Fengler Michele Morgante Scott Tingey and Antoni Rafalski a1 a b a a

a DuPont Crop Genetics Research Wilmington Delaware 19880-353 b Universitaacute degli Studi di Udine Dipartimento di Scienze Agrarie ed Ambientali 33100 Udine Italy 1 To whom correspondence should be addressed E-mail stephanbrunnercgrdupontcom fax 302-695-2726 PNAS | February 1 2005 | vol 102 | no 5 | 1566-1571 Published online before print January 24 2005 101073pnas0409421102 A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome

Mark Yandell Adina M Bailey Sima Misra ShengQiang Shu Colin Wiel Martha Evans-Holm Susan E Celniker and Gerald M Rubin para para Howard Hughes Medical Institute and Department of Molecular and Cell Biology University of California Life Sciences Addition Berkeley CA 94720-3200 and paraDepartment of Genome Sciences Lawrence Berkeley National Laboratory One Cyclotron Road Mailstop 64-121 Berkeley CA 94720 genes based on a microarray-based approach that involved hybridizing randomly primed cDNA against probes corresponding to a large set of FGENESH predictions Insect Molecular Biology Volume 14 Issue 2 Page 113 - 119 April 2005 doi101111j1365-2583200400536x Detection and analysis of alternative splicing in the silkworm by aligning expressed sequence tags with the genomic sequence X-F Zha Q-Y Xia P Zhao J Li J Duan Z-L Wang J-F Qian and Z-H Xiang Correspondence Dr Qing-You Xia The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China Tel +86 23 68250748 fax +86 23 68251128 e-mail xiaqyswaucqcn previously predicted silkworm genes in the genomic sequences by BGF a newly developed program based on GENSCAN (Burge amp Karlin 1997) and Fgenesh (Salamov amp Microbiology 151 (2005) 2199-2207 DOI 101099mic027962-0 Overproduction purification and characterization of FtmPT1 a brevianamide F prenyltransferase from Aspergillus fumigatus Alexander Grundmann and Shu-Ming Li Pharmazeutische Biologie Pharmazeutisches Institut Eberhard-Karls-Universitaumlt Tuumlbingen Auf der Morgenstelle 8 72076 Tuumlbingen Germany Correspondence Shu-Ming Li shumingliuni-tuebingende FGENESH (Softberry Inc httpwwwsoftberrycomberryphtml) and the DNASIS software package (version 21 Hitachi Software Engineering) were used for

FGENESH++ Am J Hum Genet 76652-662 2005 Position Effects Due to Chromosome Breakpoints that Map 900 Kb Upstream and 13 Mb

Downstream of SOX9 in Two Patients with Campomelic Dysplasia

Gopalrao V N Velagaleti12 Gabriel A Bien-Willner3 Jill K Northup1 Lillian H Lockhart2

Judy C Hawkins2 Syed M Jalal6 Marjorie Withers3 James R Lupski345 and

Pawel Stankiewicz3 Departments of 1Pathology and 2Pediatrics University of Texas Medical Branch Galveston

Departments of 3Molecular and Human Genetics and 4Pediatrics Baylor College of Medicine and

5Texas Childrens Hospital Houston and 6Department of Laboratory Medicine and Pathology Mayo Clinic Rochester MN hellipIn an effort to identify possible transcripts that may be responsible for the CD phenotype we

used several gene-prediction programs and identified seven hypothetical transcripts in the region

that spans 100 kb in either direction from the breakpoint on chromosome 17 Ecgenes H17C123061 and H17C123081 SGP genes Chr17_15381 and Ch17_15391 Fgenesh++ gene C17001650 and Genscan genes NT_01064144 and NT_01064145hellip Genome Research 15566-576 2005 ECgene Genome-based EST clustering and gene modeling for alternative splicing Namshin Kim Seokmin Shin and Sanghyuk Lee12 2 13 1 Division of Molecular Life Sciences Ewha Womans University Seoul 120-750 Korea 2 School of Chemistry Seoul National University Seoul 151-747 Korea hellipthe structure of full-length mRNA can be inferred by examining the flanking genomic region especially with the aid of ab initio gene predicting programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh++ (Salamov and Solovyev 2000 )hellip Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA hellip we built a three-way synteny map based on chains of Fgenesh++-predicted (Solovyev 2002 ) exons rather than whole genes hellip Genome Research 14539-548 2004 Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of

Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA hellipThe gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom )hellip Nucleic Acids Research 2003 Vol 31 No 1 207-211 copy 2003 Oxford University Press The PEDANT genome database Dmitrij Frishman Martin Mokrejs Denis Kosykh Gabi Kastenmuumlller Grigory Kolesov Igor Zubrzycki Christian Gruber Birgitta Geier Andreas Kaps Kaj Albermann Andreas Volz Christian Wagner Matthias Fellenberg Klaus Heumann and Hans-Werner Mewes

1 1 1 1 1

1 2 2 2 2

2 2 2 2 13

1 Institute for Bioinformatics GSF - National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 85764 Neueherberg Germany 2 Biomax Informatics AG Lochhamer Straszlige 11 82152 Martinsried Germany 3 Department of Genome-oriented Bioinformatics Wissenschaftszentrum Weihenstephan Technische Universitaumlt Muumlnchen 85350 Freising Germany To whom correspondence should be addressed Tel +49 89 31874201 Fax +49 89 31873585 Email dfrishmangsfde The mouse database contains 20 chromosome contigs with 37 793 genes predicted using the Fgenesh++ software (wwwsoftberrycom) Reprint from Daily Biotech Updates www genengnewscom Vol 22 No 17 October 1 2002 DrugDiscovery Tech NoteAn Enhanced Human-Genome Database Transforming Raw Human Sequence Data Into Useful Information Christine Schuumlller PhD and Andreas Fritz PhD The Softberry analysis results for which Biomax has the exclusive world-wide commercial license contain approximately 40000 genes which agrees well with predictions of the total number of human genes (according to the International Human Genome Sequencing Consortium or IHGSC) hellip For example 50 of the genes in the Biomax Human Genome Database are not found in the Ensembl database These genes (identified by FGENESH++ and Biomax and not found in Ensembl database) comprise the following 6 of genes classified as known genes 50 classified as having some similarity to known genes and 90 of the genes not having similarity to known genes

For human genome applications the FGENESH++ software was first used to map known human genes using sequences available from the Reference Sequence (RefSeq) Project at the Nation al Center for Biotechnology Information (NCBI Bethesda MD wwwncbinlm nihgovLocusLinkrefseqhtml) REFERENCES 1 Salamov AA and Solovyev VVAb initio gene finding in Drosophila genomic DNA Genome Res 10 391ndash7 (2000) Genome Research 14539-548 2004 ISSN 1088-9051 $500 Letter Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA The gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom ) Published online before print June 12 2003 101101gr529803 Genome Research 131765-1774 2003 ISSN 1088-9051 $500 Identification of Promoter Regions in the Human Genome by Using a Retroviral Plasmid Library-Based Functional Reporter Gene Assay Shirin Khambata-Ford15 Yueyi Liu2 Christopher Gleason1 Mark Dickson3 Russ B Altman2 Serafim Batzoglou4 and Richard M Myers136 1 Department of Genetics Stanford University School of Medicine Stanford California 94305 USA 2 Stanford Medical Informatics Stanford University School of Medicine Stanford California 94305 USA 3 Stanford Human Genome Center Stanford University School of Medicine Stanford California 94305 USA 4 Department of Computer Science Stanford University Stanford California 94305 USA hellipOf 858 sequences 9 of GFP+ low clones and 8 of GFP+ high clones aligned to the 2-kb

segment upstream of the transcription start site of a predicted gene in at least two of four data sets of predicted genes from Genscan Ensembl Softberry (Fgenesh++) and Acembly (category

B in Table 1) Cell Vol 110 521ndash529 August 23 2002 Copyright 2002 by Cell Press HIV-1 Integration in the Human Genome Favors Active Genes and Local Hotspots

Astrid RW Schrouml der1 Paul Shinn2 Huaming Chen2 Charles Berry3 Joseph R Ecker2 and Frederic Bushman14 1Infectious Disease Laboratory 2Genomic Analysis Laboratory The Salk Institute 10010 North Torrey Pines Road La Jolla California 92037 3Department of FamilyPreventive Medicine School of Medicine University of California San Diego San Diego California 92093 hellipAn integration target sequence was scored as a part of a transcrip-tion unit if it was (1) a member of the Refseq set of well-studied genes (httpwwwncbinlmnihgovLocusLinkrefseqhtml) or (2) if it was predicted to be a transcription unit by the ENSEMBLE (httpwwwensemblorg) or Fgenesh++ (httpwwwsoftberrycomHelpfgeneshplus2htm) programs and if that assignment was supported by mRNA or spliced EST sequence evidence Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation

FGENESH+ Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA Fgenesh+ gene prediction is conducted on sequences with protein homology Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation Annual Review of Genomics and Human Genetics Vol 3 293-310 (Volume publication date September 2002) (doi101146annurevgenom3030502101529) DATABASES AND TOOLS FOR BROWSING GENOMES Ewan Birney 1 Michele Clamp and 2 Tim Hubbard2 1European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom e-mail birneyebiacuk 2Wellcome Trust Sanger Institute Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom michelesangeracuk thsangeracuk

Another predicted gene track on the UCSC browser comes from Softberry ( http wwwsoftberrycom ) and uses a program Fgenesh+ which is based on HMMs and

FGENESB Appl Environ Microbiol 2004 April 70(4) 2332ndash2341 Oxygen-Controlled Bacterial Growth in the Sponge Suberites domuncula toward a Molecular Understanding of the Symbiotic Relationships between Sponge and Bacteriadagger

Werner E G Muumlller Vladislav A Grebenjuk Narsinh L Thakur Archana N Thakur Renato Batel Anatoli Krasko Isabel M Muumlller and Hans J Breter Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz D-55099 Mainz Germany Corresponding author Mailing address Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz Duesbergweg 6 55099 Mainz Germany Phone 6131-3925910 Fax 6131-3925243 E-mail wmuellermailuni-mainzde For genes and potential promoter prediction we used the FGENESB-PatternMarkov chain-based bacterial operon and gene prediction program from the SoftBerry Journal of Theoretical Biology 230 (2004) 133ndash144 Computational prediction of conserved operons and phylogenetic footprinting of transcription regulatory elements in the metal-reducing bacterial family Geobacteraceae Bin Yana Barbara A Metheґ b Derek R Lovleyc Julia Krushkala aDepartment of Preventive Medicine Center of Genomics and Bioinformatics University of Tennesee Health Science Center 66 N Pauline St Ste 633 Memphis TN 38163 USA bThe Institute for Genomic Research Rockville MD USA cDepartment of Microbiology Morrill Science Center IV North University of Massachusetts 639 North Pleasant Str Amherst MA 01003 USA the conserved nature of the operons 2 Operons in Geobacter sulfurreducens were predicted ab initio by the public version of program FGENESB (V Solovyev and V Molecular Microbiology Volume 52 Issue 6 Page 1579 -1596 June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Copyright copy 2003 The National Academy of Sciences Proc Natl Acad Sci U S A 2003 October 28 100(22) 12830ndash12835 doi 101073pnas2133554100 Published online 2003 October 17 Evolution Proteorhodopsin genes are distributed among divergent marine bacterial taxa Joseacute R de la TorredaggerDagger Lynne M Christiansondagger Oded Beacutejagravedaggersect Marcelino T Suzukidaggerpara David M Karl John Heidelberg and Edward F DeLongdaggerdaggerdagger

daggerMonterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 sectDepartment of Biology Technion-Israel Institute of Technology Haifa 32000 Israel

paraChesapeake Biological Laboratory University of Maryland Solomons MD 20688 Department of Oceanography University of Hawaii Manoa HI 96822 and Institute for Genomic Research Rockville MD 20850 Edited by Sallie W Chisholm Massachusetts Institute of Technology Cambridge MA and approved August 21 2003 (received for review 2003 June 10) DaggerPresent address Department of Civil and Environmental Engineering University of Washington Seattle WA 98195 daggerdagger To whom correspondence should be addressed E-mail delongmbariorg hellip Analysis of the potential genes and protein-coding regions was performed by using a combination of the BLAST (11) GLIMMER 202 (TIGR) (12 13) FGENESB (Softberry Mount Kisco NY) and ARTEMIS (Sanger Center Cambridge University UK) (14) software packages Environmental Microbiology September 2004 vol 6 no 9 pp 903-910(8) DOI 101111j1462-2920200400676x Different SAR86 subgroups harbour divergent proteorhodopsins Gazalah Sabehi1 Oded Beacutejagrave1 Marcelino T Suzuki2 Christina M Preston3 Edward F DeLong4

Affiliations 1 Department of Biology Technion-Israel Institute of Technology Haifa 32000 Israel 2 Chesapeake Biological Laboratory University of Maryland Center for Environmental Sciences Solomons MD 20688 USA 3 Monterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 USA 4 Massachusetts Institute of Technology Cambridge MA 02139 USA

program FGENESB (Softberry) and the annotation was subsequently refined and curated manually using ARTEMIS (Sanger Center) Fig

FGENESV Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song Qi Wei Qin Jin Qiu Can Hua Huang Fan Wang and Choy Leong Hew1 2 1 1 1 1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2

A total of 162 ORFs predicted by the FGENESV program (available through httpwwwsoftberrycom) supplemented with Vector NTI suite 71 are indicated doi101023BVIRU000002577148128f8 Virus Genes 28 (3) 239-246 April 2004 Article ID 5269250 Complete Nucleotide Sequence of a Strawberry Isolate of Beet Pseudoyellows Virus Ioannis E Tzanetakis Molecular and Cellular Biology Program Department of Botany and Plant Pathology Oregon State University Corvallis 97331 USA Robert R Martin Horticultural Crops Research Laboratory USDA-ARS Corvallis OR 97330 USA E-mail martinrrscienceoregonstateedu httpwwwncbinlmnih govgorfgorfhtml) and the gene finder in viruses at httpwwwsoftberrycom The amino acid comparisons Geno Prot amp Bioinfo Vol 1 No 3 August 2003 226-235 Genome Organization of the SARS-CoV Jing Xu1 Jianfei Hu21 Jing Wang21 Yujun Han1 Yongwu Hu13 Jie Wen1 Yan Li1 Jia Ji1 Jia Ye14 Zizhang Zhang5 Wei Wei4 Songgang Li12 Jun Wang1 Jian Wang14 Jun Yu14 and Huanming Yang14 1 Beijing Genomics Institute Chinese Academy of Sciences Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3Wenzhou Medical College Wenzhou 325003 China 4 James D Watson Institute of Genome Sciences Zhijiang Campus Zhejiang University and Hangzhou Genomics Institute Hangzhou 310008 China 5 College of Materials Science and Chemical Engineering Yuquan Campus Zhejiang University Hangzhou 310027 China These authors contributed equally to this work Corresponding authors E-mail junyugenomicsorgcn yanghmgenomicsorgcn hellipFGENESV a program for gene prediction provided by Softberry Inc (Mount Kisco USA) through a web-based interface has been specially modimacred and trained with parameters for virus (httpwwwsoftberrycomberryphtmltopic= gfindv) hellip

The hypothetical minus sense ORF iden-timacred by FGENESV (from 48 to 203 nt on the minus strand or 29523 to 29678 nt on the plus strand) may be fake but we should not absolutely deny the prob-ability of the existence of minus ORFs hellipFurthermore we employed FGENESV to explore the sequences of MHV (NC 001846 in NCBI) and AIBV (NC 001451 in NCBI) and compared the re-sults with their previous annotations respectively Rapport de stage de DEA Juin 2003 Analyse du geacutenome du virus de lrsquoarcheacutee Pyrococcus abyssi (PAV1) ROUAULT Karen Laboratoire de Microbiologie et Biotechnologie des Extrecircmophiles IFREMER- Centre de Brest et Equipe Microbiologie LEMAR ndash Institut Universitaire Europeacuteen de la Mer [14] FGENESV httpwwwsoftberrycomberry phtmltopic=gfindv Virus ( gt10 kb) Modegraveles de Markov Forme du geacutenome Code geacuteneacutetique [40]

FGENES-M BMC Bioinformatics 2005 6 25 Published online 2005 February 10 doi 1011861471-2105-6-25 Integrating alternative splicing detection into gene prediction Sylvain Foissac 1 and Thomas Schiex11Uniteacute de Biomeacutetrie et Intelligence Artificielle INRA 31326 Castanet Tolosan France

Corresponding author Sylvain Foissac foissactoulouseinrafr Thomas Schiex tschiextoulouseinrafrReceived July 27 2004 Accepted February 10 2005 This approach has been applied eg in HMMgene or in FGENES-M (unpub)hellip DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39 - 43 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom) DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39-43 DOI 1010801042517032000160189 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom)

BESTORF Eukaryotic Cell March 2005 p 526-535 Vol 4 No 3 1535-977805$0800+0 doi101128EC43526-5352005 Sex-Specific Homeodomain Proteins Sxi1 and Sxi2a Coordinately Regulate Sexual Development in Cryptococcus neoformans Christina M Hull1 Marie-Josee Boily and Joseph Heitman1 12 Department of Molecular Genetics and Microbiology1 the Howard Hughes Medical Institute Duke University Medical Center Durham North Carolina2 Received 14 January 2005 Accepted 14 January 2005 Corresponding author Mailing address 322 CARL Building Box 3546 Department of Molecular Genetics and Microbiology 100 Research Dr Duke University Medical Center Durham NC 27710 Phone (919) 684-2824 Fax (919) 684-5458 E-mail heitm001dukeeduPresent address Departments of Biomolecular Chemistry and Medical Microbiology amp

Immunology University of WisconsinmdashMadison Madison WI 53706 Sequence manipulations Splice predictions of candidate gene sequences for SXI2a were facilitated with a Softberry algorithm (wwwsoftberrycom) hellipWe utilized the BESTORF gene prediction algorithm from Softberry Inc to electronically produce predicted spliced cDNA products encoded by a 10-kb regionhellip

PROTCOMP Genes and Immunity 2005 v5 n4 - naturecom Immune response in silico(IRIS) immune-specific genes identified from a compendium of microarray hellip AR Abbas D Baldwin Y Ma W Ouyang A Gurney F hellip The Protcomp algorithm (Softberry Inc) predicts for the 1589 IRIS genes with ORFs that 24 of the encoded proteins are in the plasma membrane 13 are MPMI Vol 17 No 7 2004 pp 789ndash797 Publication no M-2004-0426-01R copy 2004 The American Phytopathological Society Lotus japonicus LjKUP Is Induced Late During Nodule Development and Encodes a Potassium Transporter of the Plasma Membrane Guilhem Desbrosses Claudia Kopka Thomas Ott and Michael K Udvardi Max Planck Institute of Molecular Plant Physiology Am Muumlhlenberg 1 14476 Golm Germany Submitted 3 November 2003 Accepted 13 February 2004 hellipBoth PSORT and Protcomp predicted a PM location for LjKUPhellip Planta DOI 101007s00425-003-1182-5 Issue Volume 218 Number 6 Date April 2004 Pages 965 - 975 Biochemical and immunological characterization of pea nuclear intermediate filament proteins Sonal S D Blumenthal1 Gregory B Clark1 and Stanley J Roux1

(1) School of Biological Sciences Section of Molecular Cell and Developmental Biology The University of Texas Austin TX 78712 USA Stanley J Roux Email srouxutsccutexasedu html) BCM Search Launcher (Protein structure prediction http searchlauncher bcmtmcedu) SoftBerry (Protein subcellular localization Comparative and Functional Genomics Volume 5 Issue 4 Pages 342 - 353Published Online 20 May 2004 Research Paper Investigation into the use of C- and N-terminal GFP fusion proteins for subcellular localization studies using reverse transfection microarrays Ella Palmer Tom Freeman

MRC Rosalind Franklin Centre for Genomics Research (formerly the HGMP-Resource Centre) Genome Campus Hinxton Cambridge CB10 1SB UK email Tom Freeman (tfreemanrfcgrmrcacuk) Correspondence to Tom Freeman RFCGR Hinxton CambridgeCB10 1SB UK

ProtComp version 4 (Softberry) combines results with proteins of known subcellular localization and assumed subcellular localization (based on theoret- ical Plant Physiol2004 134 286-295 RHM2 Is Involved in Mucilage Pectin Synthesis and Is Required for the Development Usadel et al Tentative subcellular localization prediction by TargetP (Emanuelsson et al 2000 ) or ProtComp (httpwwwsoftberrycom) a prediction software trained on Journal of Cellular BiochemistryVolume 90 Issue 2 Pages 361 - 378Published Online 3 Sep 2003 A proteomic study of the arabidopsis nuclear matrix Tomasz T Calikowski 1 3 Tea Meulia 2 Iris Meier 1

1Department of Plant Biology and Plant Biotechnology Center Ohio State University Columbus Ohio 43210 2Molecular and Cellular Imaging Center Ohio Agricultural and Research Development Center Ohio State University Columbus Ohio 43210 3Institute of Biochemistry and Biophysics Polish Academy of Sciences UL Pawinskiego 5A 02-106 Warszawa Poland email Iris Meier (meier56osuedu)Correspondence to Iris Meier Department of Plant Biology and Plant Biotechnology Center Ohio State University 244 Rightmire Hall 1060 Carmack Rd Columbus OH 43210 For prediction of subcellular localization ProtComp 4 (Softberry Inc Mount Kisco NY httpwwwsoftberrycomberryphtmltopicfrac14 proteinloc) PSORT v64 Cellular Molecular Life Sciences 2003 in press Automatic prediction of protein function Burkhard Rost 1 2 3 Jinfeng Liu 1 3 4 Rajesh Nair 1 5 Kazimierz O Wrzeszczynski 1 and Yanay Ofran 16

1 CUBIC Dept of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 2 Columbia University Center for Computational Biology and Bioinformatics (C2B2) Russ Berrie Pavilion 1150 St Nicholas Avenue New York NY 10032 USA

3 North East Structural Genomics Consortium (NESG) Department of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 4 Dept of Pharmacology Columbia Univ 630 West 168th Street New York NY 10032 USA 5 Dept of Physics Columbia Univ 538 West 120th Street New York NY 10027 USA 6 Dept of Medical Informatics Columbia Univ 630 West 168th Street New York NY 10032 USA Corresponding author cubiccubicbioccolumbiaedu URL httpcubicbioccolumbiaedu Tel +1-212-305-4018 fax +1-212-305-7932

genomelocalize ProtComp predict localization for plants httpwwwsoftberrycomberryphtmltopic=proteinloc Predotar predict Published online before print September 15 2003 101101gr1293003 Genome Research 132265-2270 2003 The Secreted Protein Discovery Initiative (SPDI) a Large-Scale Effort to Identify Novel Human Secreted and Transmembrane Proteins A Bioinformatics Assessment Hilary F Clark1 Austin L Gurney Evangeline Abaya Kevin Baker Daryl Baldwin Jennifer Brush Jian Chen Bernard Chow Clarissa Chui Craig Crowley Bridget Currell Bethanne Deuel Patrick Dowd Dan Eaton Jessica Foster Christopher Grimaldi Qimin Gu Philip E Hass Sherry Heldens Arthur Huang Hok Seon Kim Laura Klimowski Yisheng Jin Stephanie Johnson James Lee Lhney Lewis Dongzhou Liao Melanie Mark Edward Robbie Celina Sanchez Jill Schoenfeld Somasekar Seshagiri Laura Simmons Jennifer Singh Victoria Smith Jeremy Stinson Alicia Vagts Richard Vandlen Colin Watanabe David Wieand Kathryn Woods Ming-Hong Xie Daniel Yansura Sothy Yi Guoying Yu Jean Yuan Min Zhang Zemin Zhang Audrey Goddard William I Wood and Paul Godowski Departments of Bioinformatics Molecular Biology and Protein Chemistry Genentech Inc South San Francisco California 94080 USA 1 Corresponding author E-MAIL hclarkgenecomFAX (650) 225-5389 An automated computational strategy was utilized to query each protein translation with the Signal Sensor Sighmm Tmdetect (T Wu unpubl) hmmpfam (Eddy 1998 ) and Protcomp (Softberry Inc) algorithms hellipThe Protcomp algorithm predicts the subcellular localization of a protein on the basis of homology to well-annotated proteins a neural net and various protein motifs In this case the Protcomp subcellular localization prediction was used to categorize these genes as Other Secreted Other Transmembrane or Other Cytoplasmic or Nuclear Plant Physiol February 2002 Vol 128 pp 336-340 Gene prediction in eukaryota Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 genomelocalize ProtComp predict localization for plants httpwww softberrycomberryphtmltopic=proteinloc Predotar predict Proc Natl Acad Sci U S A 2001 April 24 98(9) 5341ndash5346 doi 101073pnas101534498 Published online 2001 April 17 Plant Biology The Cia5 gene controls formation of the carbon concentrating mechanism in Chlamydomonas reinhardtii Youbin Xiang Jun Zhang and Donald P Weeks

Department of Biochemistry and School of Biological Sciences University of Nebraska Lincoln NE 68588-0664

Edited by Bob B Buchanan University of California Berkeley CA and approved March 14 2001 (received for review 2000 November 8) To whom reprint requests should be addressed E-mail dweeks1unledu hellipComputer-assisted analysis of the CIA5 aa sequence (PROTCOMP version 4 httpwwwsoftberrycom) predicted a nuclear localization of the protein hellipFinally computer program predictions (eg PROTCOMP version 4 httpwwwsoftberrycom) for a nuclear localization of CIA5 and the clear-cut nuclear localization of CIA5 in onion epidermal cells (Fig 3) provide additional weight to the argument that CIA5 may be a transcription factor Dissertation zur Erlangung des akademischen Grades Dr rer nat der Fakultaumlt der Naturwissenschaften der Universitaumlt Ulm Untersuchungen zur Identifizierung von Faktoren und Mechanismen der mRNA 3 Prozessierung und Degradation in Chloroplasten houmlherer Pflanzen vorgelegt von Michael Walter aus Immenstadt i Allgaumlu Abteilung Molekulare Botanik Universitaumlt Ulm Ulm November 2001 Tag der Promotion 19 Feb 2002 Algorithmen zur Vorhersage der subzellulaumlren Lokalisation - PSORT httppsortnibbacjp8800formhtml (Nakai und Kanehisa 1992) - ChloroP httpwwwcbsdtudkservicesChloroP (Emanuelsson et al 2000) - TargetP httpwwwcbsdtudkservicesTargetP (Emanuelsson et al 2000) - Predotar httpwwwinrafrInternetProduitsPredotar - Softberry httpwwwsoftberrycom

BPROM Extremophiles Issue Volume 9 Number 2 Date April 2005 Pages 99 ndash 109 DOI 101007s00792-004-0425-0

The genome of BCJA1c a bacteriophage active against the alkaliphilic bacterium Bacillus clarkii

Andrew M Kropinski1 Melissa Hayward1 M Dorothy Agnew1 and Ken F Jarrell1

(1) Department of Microbiology and Immunology Queens University Kingston ON K7L 3N6 Canada

al 2002) Promoters were predicted using Softberryrsquos BPROM program at httpwwwsoftberry comberry phtmltopic=promoter

Journal of Bacteriology February 2005 p 1091-1104 Vol 187 No 3 0021-919305$0800+0 doi101128JB18731091-11042005

The Generalized Transducing Salmonella Bacteriophage ES18 Complete Genome Sequence and DNA Packaging Strategy

Sherwood R Casjens12 Eddie B Gilcrease1 Danella A Winn-Stapley1 Petra Schicklmaier3 Horst Schmieger3 Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24 Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah1 Department of Biological Sciences4 Pittsburgh Bacteriophage Institute University of Pittsburgh Pittsburgh Pennsylvania 2 Institut fuumlr Genetik und Mikrobiologie Universitaumlt Muumlnchen Munich Germany3 Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu Present address Biology Department MIT Cambridge MA 02139 Present address Biogen Idec GmbH D-85737 Ismaning Germany

Received 1 September 2004 Accepted 3 November 2004

The DNA sequence analysis software used was DNA Strider (24) GeneMark (5) Staden programs (78) BLAST (2) BPROM (httpwwwsoftberrycomberryphtmltopic

Infection and Immunity May 2005 p 2899-2909 Vol 73 No 5 Characterization of the Major Secreted Zinc Metalloprotease- Dependent GlycerophospholipidCholesterol Acyltransferase PlaC of Legionella pneumophila Sangeeta Banerji1 Mayte Bewersdorff1 Bjoumlrn Hermes1 Nicholas P Cianciotto2 and Antje Flieger1 Robert Koch-Institut Berlin Germany1 Department of Microbiology-Immunology Northwestern University Medical School Chicago Illinois2 Received 25 October 2004 Returned for modification 18 November 2004 Accepted 22 December 2004

Corresponding author Mailing address Robert Koch-Institut Research Group NG5 Pathogenesis of Legionella Infections Nordufer 20 D-13353 Berlin Germany Phone 49-30-4547-2522 Fax 49-30-4547-2328 E-mail fliegerarkide MB and BH contributed equally to this work

legion) (12) Nucleotide sequences were also analyzed for promoters using the web-based program BPROM (wwwsoftberrycom) Sequence Journal of Bacteriology April 2005 p 2458-2468 Vol 187 No 7 The Type III-Dependent Hrp Pilus Is Required for Productive Interaction of Xanthomonas campestris pv vesicatoria with Pepper Host Plants Ernst Weber1 Tuula Ojanen-Reuhs2 Elisabeth Huguet3 Gerd Hause4 Martin Romantschuk2 Timo K Korhonen2 Ulla Bonas13 and Ralf Koebnik1 Institute of Genetics1 Biozentrum Martin Luther University Halle Germany4 General Microbiology Faculty of Biosciences University of Helsinki Helsinki Finland2 Institut des Sciences Veacutegeacutetales CNRS Gif-sur-Yvette France3 Received 10 November 2004 Accepted 28 December 2004 Corresponding author Mailing address Martin-Luther-Universitaumlt Institut fuumlr Genetik Weinbergweg 10 D-06120 Halle (Saale) Germany Phone 49 345 5526293 Fax 49 345 5527151 E-mail koebnikgmxde Present address Purdue University Department of Food Sciences West Lafayette IN 47907 Present address Institut de Recherche sur la Biologie de lInsecte UMR CNRS 6035 Faculteacute

des Sciences F-37200 Tours France Present address University of Helsinki Department of Ecological and Environmental Sciences

FIN-15140 Lahti Finland

The promoter recognition program BPROM (Softberry Inc Mt Kisco NY) was used for prediction of bacterial sigma70 promoter motifs RESULTS

J Bacteriol 2004 September 186(17) 5945ndash5949 doi 101128JB186175945-59492004 Identification of Operators and Promoters That Control SXT Conjugative Transfer John W Beaber and Matthew K Waldor

Department of Microbiology Tufts University School of Medicine and Howard Hughes Medical Institute Boston Massachusetts Corresponding author Mailing address Tufts University School of Medicine 136 Harrison Ave Jaharis 425 Boston MA 02111 Phone (617) 636-2730 Fax (617) 636-2723 E-mail matthewwaldortuftsedu Received April 1 2004 Accepted May 24 2004 hellipComputer algorithms and 5prime random amplification of cDNA ends (RACE) were used to define the setR and s086 transcription start sites Software for the identification of bacterial promoters (httpwwwsoftberrycomberryphtmltopic=bpromampgroup=programsampsubgroup=gfindb) identified putative minus10 and minus35 elements for both PL and PR (Fig 2) (23 24)hellip JOURNAL OF BACTERIOLOGY Mar 2004 p 1818ndash1832 Vol 186 No 6

The pKO2 Linear Plasmid Prophage of Klebsiella oxytoca Sherwood R Casjens12 Eddie B Gilcrease1 Wai Mun Huang1 Kim L Bunny3

Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24

Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah 841321 Pittsburgh Bacteriophage Institute2 and Department of Biological Sciences4 University of Pittsburgh Pittsburgh Pennsylvania 15260 and Section of Microbiology University of California at Davis Davis California 956163

Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu hellipThe DNA sequence analysis software packages used were DNA Strider (27) GeneMark (8) the Staden programs (94) BLAST (3) BPROM httpwwwsoftberrycomberryphtmltopic_gfindb) and DNA Master (J Lawrence [httpcobamide2biopittedu])hellip

BMC Microbiology 2004 44 Analysis of the lambdoid prophage element e14 in the E coli K-12 genome Preeti Mehta1 Sherwood Casjens2 and Sankaran Krishnaswamy1 Address 1Bioinformatics Centre School of Biotechnology Madurai Kamaraj University Madurai-625021 India and 2University of Utah Medical School Department of Pathology 90 North 1900 East Salt Lake City UT 84132-2501 USA Email Preeti Mehta - mehta_p74yahoocom Sherwood Casjens - sherwoodcasjenspathutahedu Sankaran Krishnaswamy - krishnamrnatnnicin Corresponding author This article is available from httpwwwbiomedcentralcom1471-218044 Putative promoters predicted using BPROM available at the website http wwwsoftberrycom Scores are as given by BPROM Promoters Plant Molecular Biology 53 (6) 865-876 December 2003 Prokaryotic orthologues of mitochondrial alternative oxidase and plastid terminal oxidase Allison E McDonald Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Sasan Amirsadeghi Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Greg C Vanlerberghe Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada (e-mail gregvutscutorontoca The A variabilis PTOX sequence was analyzed in the upstream region of the start codon with Softberryrsquos BPROM software (httpwwwsoftberrycom)

SPLICEDB Plant Molecular Biology DOI 101007s11103-005-0271-1 Issue Volume 57 Number 3 Date February 2005 Pages 445 - 460 Evaluation of five ab initio gene prediction programs for the discovery of maize genes Hong Yao1 4 Ling Guo1 6 Yan Fu1 4 Lisa A Borsuk1 6 Tsui-Jung Wen2 David S Skibbe1 5 Xiangqin Cui1 4 9 Brian E Scheffler8 Jun Cao1 4 Scott J Emrich6 Daniel A Ashlock3 6 and Patrick S Schnable1 2 4 5 6 7

(1)Department of Genetics Development and Cell Biology Iowa State University Ames Iowa 50011-3650 (2) Department of Agronomy Iowa State University Ames Iowa 50011-3650 (3) Department of Mathematics Iowa State University Ames Iowa 50011-3650 (4) Inderdepartmental Graduate Programs in Genetics Iowa State University Ames Iowa 50011-3650 (5) Department of Molecular Cellular and Developmental Biology Iowa State University Ames Iowa 50011-3650 (6) Department of Electrical and Computer Engineering and Department of Bioinformatics and Computational Biology Iowa State University Ames Iowa 50011-3650 (7) Center for Plant Genomics Iowa State University Ames Iowa 50011-3650 (8) Mid South Area Genomics Facility USDA-ARS Stoneville MS 38776-0038 USA(9) Present address Department of Biostatistics Birmingham AL 35294 USA model FGENESH httpwwwsoftberrycom berryphtmltopic=fgeneshampgroup= programsampsubgroup=gfind Monocots Yes Yes Yes GHMM a GeneMark Finding short DNA motifs using permuted markov models X Zhao H Huang TP Speed The data are human donor sequences from SpliceDB [9] a recently developed database of known mammalian splice site sequences (httpwwwsoftberrycomspldb Current Opinion in Structural Biology 2004 14273ndash282 The evolving roles of alternative splicing Liana F Lareau1 Richard E Green1 Rajiv S Bhatnagar23 and Steven E Brenner12_ Departments of 1Molecular and Cell Biology and 2Plant and Microbial Biology University of California Berkeley California 94720 USA 3Department of Dermatology University of California San Francisco California 94143 USA _e-mail brennercompbioberkeleyedu [79] SpliceDB httpwwwsoftberrycomberryphtmltopicfrac14splicedb Database and composition statistics for mammalian splice sites inferred from ESTs [80] Yearbook of Medical Informatics Review Paper 2004 121-136 Curated databases and their role in clinical bioinformatics CC Englbrecht M Han MT Mader A Osanger KFX Mayer MIPS Institute for Bioinformatics Address of the authors Claudia C Englbrecht Michael Han

Michael T Mader Andreas Osanger Klaus F X Mayer MIPS Institute for Bioinformatics GSF - National Research Center for Environment and Health 85758 Neuherberg Germany E-mail kmayergsfdeCorresponding author hellipSpliceDB httpwwwsoftberrycomspldbSpliceDBhtmlCanonical and non-canonical mammalian splice sites [122] 122Burset M Seledtsov IA Solovyev VV SpliceDB database of canonical and non-canonical mammalian splice sites Nucleic Acids Res 200129255-9 Nucleic Acids Research 2001 Vol 29 No 1 255-259 SpliceDB database of canonical and non-canonical mammalian splice sites M Burset I A Seledtsov1 and V V Solovyev The Sanger Centre Hinxton Cambridge CB10 1SA UK and 1Softberry Inc 108 Corporate Park Drive Suite 120 White Plains NY 10604 USA To whom correspondence should be addressed at present address EOS Biotechnology 225A Gateway Boulevard South San Francisco CA 94080 USA Tel +1 650 246 2331 Fax +1 650 583 3881 Email solovyeveosbiotechcom Present address M Burset Institut Municipal drsquoInvestigacioacute Megravedica (IMIM) CDr Aiguader 80 08003 Barcelona Spain

SCAN2 African Journal of Biotechnology Vol 2 (12) pp 714-718 December 2003 Available online at httpwwwacademicjournalsorgAJB ISSN 1684ndash5315 copy 2003 Academic Journal Accepted 14 November 2003 Minireview Web-based bioinformatic resources for protein and nucleic acids sequence alignment Kamel A Abd-Elsalam Molecular Markers Lab Plant Pathology Research Institute Agricultural Research Center Orman 12619 Giza Egypt E-mail kaabdelsalammsncom 16-SCAN2 program for aligning two multimegabyte-size sequences httpwwwsoftberrycomberryphtmltopic=scanhampprg= SCAN2 derived Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK The full-length sequences of gene pairs have been aligned by the SCAN2 program (httpsoftberrycomberryphtmltopic=scanhampprg=SCAN2) which can align

PDISORDER BMC Bioinformatics 2005 622 doi1011861471-2105-6-22 Research article Open Access Proteins with two SUMO-like domains in chromatin-associated complexes The RENi (Rad60-Esc2-NIP45) family Maria Novatchkova1 Andreas Bachmair3 Birgit Eisenhaber2 and Frank Eisenhaber2 Address 1Gregor Mendel-Institut GMI Austrian Academy of Sciences Vienna Biocenter A-1030 Vienna Austria 2Research Institute of Molecular Pathology Dr Bohr-Gasse 7 A-1030 Vienna Austria and 3Max Planck Institute for Plant Breeding Research Carl-von-Linneacute-Weg 10 D-50829 Cologne Germany Email Maria Novatchkova - marianovatchkovagmioeawacat Andreas Bachmair - bachmairmpiz-koelnmpgde Birgit Eisenhaber - b_eisenimpunivieacat Frank Eisenhaber - FrankEisenhaberimpunivieacat Corresponding author hellipInitial analysis of its sequence complexity shows that the disordered N-terminal half of the protein is followed by a likely globular segment (predicted using Pdisorder by Softberry Inc)hellip

SPL The National Academy of Sciences Proc Natl Acad Sci U S A 2003 November 25 100(Suppl 2) 14537ndash14542 doi 101073pnas2335847100 Molecular evolution of the insect chemoreceptor gene superfamily in Drosophila melanogaster Hugh M Robertsondagger Coral G WarrDaggersect and John R Carlsonsect Department of Entomology University of Illinois 505 South Goodwin Avenue Urbana IL 61801 DaggerSchool of Biological Sciences Monash University Clayton VIC 3800 Australia and sectDepartment of Molecular Cellular and Developmental Biology Yale University New Haven CT 06520 The genes were reconstructed manually in the PAUP editor (23) by using the expected exonintron structures as guides and the SPL program (Softberry wwwsoftberrycomberryphtml) to locate predicted introns

NSITE GENETIC ANALYSES OF BOVINE CARD15 A PUTATIVE DISEASE RESISTANCE GENE A Dissertation by KRISTEN HAWKINS TAYLOR Submitted to Texas AampM University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 2004 Major Subject Genetics hellipShort motifs identified as being conserved between the three species in these intronic regions as well as in the 5rsquoUTR and 3rsquoUTR were then analyzed using the TFSCAN (httpzeonwelloxacukgit-bintfscan) and NSITE (available through SoftBerry httpwwwsoftberrycomberryphtmltopic=promoter) programs to identify putative regulatory motifs Motifs selected for analysis required homology consisting of 6 or more bases with no more than 2 substitutions among the 3 specieshellip hellipSequence that included the SNPs located within intronic regions and in the 3rsquo and 5rsquoUTRs were analyzed using NSITE (available through SoftBerry at httpwwwsoftberrycomberryphtmltopic=promoter) to identify putative regulatory motifshellip

TSSP Bioinformatics 2005 21(14)3074-3081 doi101093bioinformaticsbti490 Cis-regulatory element based targeted gene finding genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana Weixiong Zhang 12 Jianhua Ruan 1 Tuan-hua David Ho 3 Youngsook You 3 Taotao Yu 1 and Ralph S Quatrano 3 1Department of Computer Science and Engineering Washington University in Saint Louis Saint Louis MO 63130 USA 2Department of Genetics Washington University in Saint Louis Saint Louis MO 63130 USA 3Department of Biology Washington University in Saint Louis Saint Louis MO 63130 USA To whom correspondence should be addressed

sites (TSSs) To predict TSSs we combined an Athaliana cDNA database and a software TSSP (SoftBerry httpwwwsoftberrycom) As

PLANTPROM BMC Genomics 2005 6 25 Genome wide analysis of Arabidopsis core promoters Carlos Molina12 and Erich Grotewold 11Department of Plant Cellular and Molecular Biology and Plant Biotechnology Center The Ohio State University Columbus OH 43210 2Departamento de Informaacutetica Universidad Teacutecnica Federico Santa Mariacutea Valparaiacuteso Chile search for TATA elements is carried out on the 12749 [-500 -1] regions 6316 sequences (using the MEME NFM) or 8776 (using the expanded PlantProm NFM) are Nucleic Acids Research 2005 Vol 33 No 3 1069ndash1076 doi101093nargki247 Plant promoter prediction with confidence estimation I A Shahmuradov1 V V Solovyev12 and A J Gammerman1 1Royal Holloway University of London Egham Surrey TW20 0EX UK and 2Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA MATERIALS AND METHODS Training and testing sequences For training and testing procedures we used 301 promoters with annotated TSS from PlantProm DB (22) Nucleic Acids Research 2004 Vol 32 Database issue D368plusmnD372 DOI 101093nargkh017 AthaMap an online resource for in silico transcription factor binding sites in the Arabidopsis thaliana genome Nils Ole Steffens Claudia Galuschka Martin Schindler Lorenz BuEgravelow and Reinhard Hehl Institut fuEgrave r Genetik Technische UniversitaEgrave t Braunschweig Spielmannstraucirce 7 D-38106 Braunschweig Germany hellipShahmuradovIA GammermanAJ HancockJM BramleyPM and SolovyevVV (2003) PlantProm a database of plant promoter sequences Plant Physiology October 2004 Vol 136 pp 3023-3033 GENOME ANALYSIS Utility of Different Gene Enrichment Approaches Toward Identifying and Sequencing the Maize Gene Space1[w] Nathan Michael Springer Xiequn Xu and W Brad Barbazuk Center for Plant and Microbial Genomics Department of Plant Biology University of Minnesota St Paul Minnesota 55108 (NMS) and Donald Danforth Plant Sciences Center St Louis Missouri 63132 (XX WBB) Text] Shahmuradov IA Gammerman AJ Hancock JM Bramley PM Solovyev VV (2003) PlantProm a database of plant promoter sequences

Plant Physiology April 2004 Vol 134 pp 1ndash12 wwwplantphysiolorg Characterization of Three Functional High-Affinity Ammonium Transporters in Lotus japonicus with Differential Transcriptional Regulation and Spatial Expression1

Enrica DrsquoApuzzo2 Alessandra Rogato2 Ulrike Simon-Rosin Hicham El Alaoui3 Ani Barbulova Marco Betti Maria Dimou Panagiotis Katinakis Antonio Marquez Anne-Marie Marini Michael K Udvardi and Maurizio Chiurazzi Institute of Genetics and Biophysics Via Marconi 12 80125 Napoli Italy (ED AR HEA AB MC) Molecular Plant Nutrition Group Max Planck Institute of Molecular Plant Physiology Am Muhlenberg 1 14476 Golm Germany (US-R MKU) Universiteacute Libre de Bruxelles Institut de Biologie de Meacutedecine Moleacuteculaires 6041 Gosselies Belgium (A-MM) Agricultural University of Athens Department of Agricultural Biotechnology 11855 Athens Greece (MD PK) and Departamento de Bioquıacutemica Vegetal y Biologıacutea Molecular Facultad de Quimica 41080 Seville Spain (MB AM) hellipPlantProm a database of plant promoter sequences Nucleic Acids Res 31 114ndash117 Nucleic Acids Research 1 January 2003 vol 31 no 1 pp 114-117(4) PlantProm a database of plant promoter sequences Authors Bramley PM1 Solovyev VV2 Shahmuradov IA Gammerman AJ Hancock JM Affiliations Department of Computer Science Royal Holloway University of London Egham Surrey TW20 0EX UK 1 School of Biological Sciences Royal Holloway University of London UK 2 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 3 To whom correspondence should be addressed Email victorsoftberrycom Present address John M Hancock MRC Mammalian Genetics Unit Harwell Oxfordshire UK

PROMH Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK Received February 15 2003 Revised and Accepted March 21 2003

Other FGENESH Molecular Microbiology Volume 52 Issue 6 Page 1579 - June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Affiliations Departments of Medicine and Pathobiology University of Washington Harborview Medical Center Box 359779 325 Ninth Ave Seattle WA 98104 USA E-mail acenturuwashingtonedu Tel (+1) 206 341 5364 Fax (+1) 206 341 5363 Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Microbiology 150 (2004) 518-520 DOI 101099mic026871-0 IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated with recognizable overlapping genes Mark W Silby1 Paul B Rainey23 and Stuart B Levy14 1 Center for Adaptation Genetics and Drug Resistance Department of Molecular Biology and Microbiology Tufts University School of Medicine Boston MA 02111 USA 2 Department of Plant Sciences University of Oxford South Parks Road Oxford OX1 3RB UK 3 School of Biological Sciences University of Auckland Private Bag 92019 Auckland New Zealand 4 Department of Medicine Tufts University School of Medicine Boston MA 02111 USA Correspondence Stuart B Levy (stuartlevytuftsedu) hellipUsing SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of the iiv5 ORF respectivelyhellip Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 DOI 101128JVI782212576-125902004

Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song1 Qi Wei Qin2 Jin Qiu1 Can Hua Huang1 Fan Wang1 and Choy Leong Hew1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2 Corresponding author Mailing address Department of Biological Sciences National University of Singapore 10 Kent Ridge Crescent Singapore 119260 Singapore Phone 65-68742692 Fax 65-67795671 E-mail dbshewclnusedusg or dbsheadnusedusg Received 19 March 2004 Accepted 29 June 2004

hellipThe whole genome was also submitted to httpwwwsoftberrycom (Softberry Inc Mount Kisco NY) for identification of all potential ORFshellip hellipCoding capacity of the viral genomic DNA sequence Prediction of presumptive genes was carried out by using the viral gene prediction program under the website httpwwwsoftberrycom supplemented with Vector NTI suite 71hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1695-8 Issue Volume 109 Number 4 Date August 2004 Pages 681 - 689

Linkage disequilibrium and sequence diversity in a 500-kbp region around the adh1 locus in elite maize germplasm Mark Jung1 Ada Ching1 Dinakar Bhattramakki2 Maureen Dolan1 Scott Tingey1 Michele Morgante1 3 and Antoni Rafalski1

(1) DuPont Crop Genetics Experimental Station PO Box 80353 Wilmington DE 19880-0353 USA (2) Pioneer Hi-Bred International Inc 7300 NW 62nd Avenue PO Box 1004 Johnston IA 50131-1004 USA (3) Dipartimento di Produzione Vegetale e Tecnologie Agrarie Universita di Udine Via delle Scienze 208 33100 Udine ItalyReceived 26 January 2004 Accepted 2 April 2004 Published online 6 August 2004 Mark Jung Email marktjungcgrdupontcom 1) Gene locations were defined by several methods Annotations provided in Tikhonov et al (1999) were first used then FGENESH gene-finding software DNA Sequence - The Journal of Sequencing and Mapping Issue Volume 15 Number 4 August 2004 Pages 269 ndash 276 DOI 10108010425170412331279648 Isolation Characterization and Expression Analysis of a Leaf-specific Phosphoenolpyruvate Carboxylase Gene in Oryza sativa Chang-Fa Lin A1 Chun Wei A1 Li-Zhi Jiang A1 A2 Ke-Gui Li A1 Xiao-Yin Qian A1 Kotb Attia A1 Jin-Shui Yang A1 A1 State Key Laboratory of Genetic Engineering Institute of Genetics School of Life Sciences Fudan University Shanghai 200433 PRChina A2 National Key Laboratory for Soil Erosion and Dry Land Farming on Loess Plateau Northwest Sci-Tech University of Agriculture and Forest Shannxi 712100 PRChina tools of GeneMark (httpopal biologygatechedugeneMark) and Softberry (httpwwwsoftberrycom) For the isolation of putative Plant Molecular Biology DOI 101023BPLAN00000382568980957 Issue Volume 54 Number 4 Date March 2004 Pages 519 ndash 532

Genome-Wide Analysis of the GRAS Gene Family in Rice and Arabidopsis Chaoguang Tian1 Ping Wan1 Shouhong Sun1 Jiayang Li1 and Mingsheng Chen1

(1) Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road Chaoyang District Beijing 100101 ChinaMingsheng Chen Email mschengeneticsaccn

database FgeneSH (Salamov and 90 Solovyev 2000) was used for gene prediction pre- 207 dicted by FgeneSH (minor discrepancies exist due 208 Mycological Research (2004) 108 853-857 Cambridge University Press doi 101017S095375620400067X Published Online 13 авг 2004 Combining transcriptome data with genomic and cDNA sequence alignments to make confident functional assignments for Aspergillus nidulans genes Andrew H SIMS a1 Manda E GENT a1 Geoffrey D ROBSON a1 Nigel S DUNN-COLEMAN a2 and Stephen G OLIVER a1c1 a1 School of Biological Sciences University of Manchester The Michael Smith Building Oxford Road Manchester M13 9PT UK E-mail steveolivermanacuk a2 Genencor International Inc 925 Page Mill Road Palo Alto CA 94304 USA c1 Corresponding author Kingdom Page 2 Genewise FgeneSH FgeneSH+) consisting of 9541 putative open reading frames (ORFs) was released in June 2003 We TAG Theoretical and Applied Genetics DOI 101007s00122-004-1603-2 Issue Volume 109 Number 1 Date June 2004 Pages 129 ndash 139

Gene content and density in banana (Musa acuminata) as revealed by genomic sequencing of BAC clones R Aert1 2 L Saacutegi2 and G Volckaert1

(1) Laboratory of Gene Technology Katholieke Universiteit Leuven Kasteelpark Arenberg 21 3001 Leuven Belgium

Present address Laboratory of Tropical Crop Improvement Katholieke Universiteit Leuven (2) Kasteelpark Arenberg 13 3001 Leuven Belgium

R Aert Email RitaAertagrkuleuvenacbe

gscrikengojp) fgenesh version 11 (Salamov and Solovyev 2000 httpwwwsoftberry com) genemarkhmm version 22a (Lukashin and Borodovsky 1998 http Genome Research 142503-2509 2004 Resources EAnnot A genome annotation tool using experimental evidence Li Ding1 Aniko Sabo Nicolas Berkowicz Rekha R Meyer Yoram Shotland Mark R Johnson Kymberlie H Pepin Richard K Wilson and John Spieth Genome Sequencing Center Washington University School of Medicine St Louis Missouri 63110 USA Article and publication are at httpwwwgenomeorgcgidoi101101gr3152604 1 Corresponding author E-mail ldingwatsonwustledu fax (314) 286-1810 Some ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov and Solovyev 2000 ) are based on intrinsic characteristics of coding hellipSome ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov

and Solovyev 2000 ) are based on intrinsic characteristics of coding sequence (eg codon usage consensus splice sites etc) and require training on known genes from the organismhellip

hellipTo further evaluate the performance of EAnnot we compared EAnnot predictions with Ensembl Genscan and Fgenesh predictions using manual annotation as a standard While Genscan and Fgenesh are ab initio programs Ensembl takes into account experimental data a feature shared with EAnnot Ensembl predicted 1037 known genes with 1798 transcripts and 1457 EST genes with 2308 transcripts for chromosome 6 (build 31) while Fgenesh and Genscan predicted 6230 and 6225 genes respectively We evaluated the performance of each program with respect to splice sites transcripts and genes across all of chromosome 6hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1758-x Issue Volume 109 Number 7 Date November 2004 Pages 1434 - 1447

Full-genome analysis of resistance gene homologues in rice

B Monosi1 R J Wisser2 L Pennill1 and S H Hulbert1

(1) Department of Plant Pathology Kansas State University Manhattan KS 66506-5502 USA (2) Department of Plant Pathology Cornell University Ithaca NY 14853 USAReceived 18 February 2004 Accepted 16 June 2004 Published online 10 August 2004 S H Hulbert Email shulbrtksuedu DNA sequences were analyzed using the gene prediction programs GENSCAN (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (Salamov and arXivq-bioGN0402046 v1 27 Feb 2004 Sublinear growth of Information in DNA sequences Giulia Menconi Dipartimento di Matematica Applicata and CISSC Centro Interdisciplinare per lo Studio dei Sistemi Complessi Universit`a di Pisa Via Bonanno Pisano 25b 56126 PISA - Italy menconimaildmunipiit October 23 2003 hellipAs a result four putative genes G1 G2 G3 and G4 have been located by means of Hidden Markov Model-based program FGENESH2 that has been created for predicting multiple genes and their structure in genomic DNA sequences The analysis via FGENESH has been exploited with respect to known genes in Arabidopsis thaliana Their predicted position is illustrated in Figure 13 hellip2This program is available at the website wwwsoftberrycom to which we refer con-cerning the reliability and e_ciency of the algorithmhellip Current Opinion in Plant Biology 2004 7732ndash736 DOI 101016jpbi200409003 Consistent over-estimation of gene number in complex plant genomes Jeffrey L Bennetzen14 Craig Coleman27 Renyi Liu15 Jianxin Ma16 and Wusirika Ramakrishna38 1 Department of Genetics University of Georgia Athens Georgia 30602 USA 2 Department of Plant and Animal Sciences Brigham Young University Provo Utah 84602 USA 3 Department of Biological Sciences Michigan Tech University Houghton Michigan 49931 USA

4e-mail maizeugaedu 5e-mail lryugaedu 6e-mail jmaugaedu 7e-mail biotechnologybyuedu 8e-mail wusirikamtuedu hellipWe have found that the standard gene-discovery programs FGENESH GeneMark and GENSCAN annotate segments of most retrotransposons and many invertedrepeat transposable elements as genes Using FGENESH to annotate maize BAC clones for instance 70ndash100 of the predicted genes are actually from transposable elementshellip The Plant Cell 162795-2808 (2004) Spotted leaf11 a Negative Regulator of Plant Cell Death and Defense Encodes a U-BoxArmadillo Repeat Protein Endowed with E3 Ubiquitin Ligase Activity Li-Rong Zenga Shaohong Qua Alicia Bordeosb Chengwei Yangc Marietta Baraoidanb Hongyan Yanc Qi Xiec Baek Hie Nahmd Hei Leungb and Guo-Liang Wanga1 a Department of Plant Pathology Ohio State University Columbus Ohio 43210 b International Rice Research Institute Metro Manila Philippines c State Key Lab for Biocontrol Sun Yat-sen (Zhongshan) University Guangzhou China 510275 d Department of Biological Science Myongji University Kyonggido Korea 449728 1 To whom correspondence should be addressed E-mail wang620osuedu fax 614-292-4455 in spl11 Exons predicted in G3 by the programs GENSCAN and Fgenesh using different matrixes are displayed in dark gray (D) RFLP Source Human Genomics Volume 1 Number 2 January 2004 pp 146-149(4) Publisher Henry Stewart Publications The truth about mouse human worms and yeast Authors David R Nelson1 Daniel W Nebert2 1 Department of Molecular Sciences and The UT Center of Excellence in Genomics and Bioinformatics University of Tennessee Memphis Tennessee 38163 USA 2 Department of Environmental Health and Center for Environmental Genetics (CEG) University of Cincinnati Medical Center Cincinnati Ohio 45267-0056 USA unpublished data 2003 see also Ref [7]) FGENESH 21 TWINSCAN 22 and the Ensembl annotation pipeline 23 The output of the four Genome Biology 2004 5R73 doi101186gb-2004-5-10-r73 A comprehensive transcript index of the human genome generated using microarrays and computational approaches Eric E Schadt 1 Stephen W Edwards 1 Debraj GuhaThakurta1 Dan Holder2 Lisa Ying2 Vladimir Svetnik2 Amy Leonardson1 Kyle W Hart3 Archie Russell1 Guoya Li1 Guy Cavet1 John Castle1 Paul McDonagh4 Zhengyan Kan1 Ronghua Chen1 Andrew Kasarskis1 Mihai Margarint1 Ramon M Caceres1 Jason M Johnson1

Christopher D Armour1 Philip W Garrett-Engele1 Nicholas F Tsinoremas5 and Daniel D Shoemaker1

1Rosetta Inpharmatics LLC 12040 115th Avenue NE Kirkland WA 98034 USA 2Merck Research Laboratories W42-213 Sumneytown Pike POB 4 Westpoint PA 19846 USA 3Rally Scientific 41 Fayette Street Suite 1 Watertown MA 02472 USA 4Amgen Inc 1201 Amgen Court W Seattle WA 98119 USA 5The Scripps Research Institute Jupiter FL 33458 USA hellipGrailEXP 40 [47] GENSCAN 10 [48] FGENESH [49] and FGENESH+ [49]ab initio gene-prediction algorithms were run independently across the entire genome assembly to augment alignment-based gene identification methods GrailEXP 40 GENSCAN 10 and FGENESH version 1c were run with default parameters for human sequence GrailEXP used expressed sequence evidence from RefSeq UniGene and DoubleTwist HGI to refine gene predictions FGENESH+ was run with protein sequences from BLASTX with E-score lower than 10-5 When multiple protein alignments overlapped all overlapping protein sequences were clustered with BLASTClust [50] and the lowest E-score hit was used by FGENESH+hellip hellipAdditionally 3 sim4 and 3 or 5 FGENESH+ predicted exons that were short andor distant from internal predicted exons were removedhellip Genome Research 14988-995 2004 ENSEMBL Special GeneWise and Genomewise Ewan Birney13 Michele Clamp2 and Richard Durbin2 1 The European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK hellipThere has been a long history of successful ab initio programs which do not use any additional

evidence to predict genes on genomic DNA of which Genscan (Burge and Karlin 1997 ) and Fgenesh (Solovyev and Salamov 1997 ) are two of the most successful caseshellip hellipAnother class of evidence-based gene prediction programs are ones which use external evidence to influence the scoring of potential exons including SGP-2 (Parra et al 2003 ) Genie

(Kulp et al 1996 ) Genomescan (Yeh et al 2001 ) HMMGene (Krogh 2000 ) and Fgenesh++ (Solovyev and Salamov 1997 )hellip Published online before print February 5 2004 101073pnas0308430100 PNAS | February 17 2004 | vol 101 | no 7 | 1910-1915 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin Nam Joonyul Kim para Shinyoung Lee Gynheung An Hong Ma and Masatoshi Nei Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State

University University Park PA 16802 and National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Contributed by Masatoshi Nei December 22 2003

hellipBecause annotation of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program FGENESH (wwwsoftberrycom) from the genome sequences obtained from TIGR and the Rice Genome Database (China) (25) Functional amp Integrative Genomics DOI 101007s10142-004-0109-y Issue Volume 4 Number 2 Date May 2004 Pages 102 - 117

Sequence analysis of the long arm of rice chromosome 11 for ricendashwheat synteny

Nagendra K Singh1 Saurabh Raghuvanshi2 Subodh K Srivastava1 Anupama Gaur2 Ajit K Pal1 Vivek Dalal1 Archana Singh1 Irfan A Ghazi1 Ashutosh Bhargav1 Mahavir Yadav1 Anupam Dixit1 Kamlesh Batra1 Kishor Gaikwad1 Tilak R Sharma1 Amitabh Mohanty2 Arvind K Bharti2 Anita Kapur2 Vikrant Gupta2 Dibyendu Kumar2 Shubha Vij2 Ravi Vydianathan2 Parul Khurana2 Sulabha Sharma2 W Richard McCombie3 Joachim Messing4 Rod Wing5 Takuji Sasaki6 Paramjit Khurana2 Trilochan Mohapatra1 Jitendra P Khurana2 and Akhilesh K Tyagi2

1 Indian Initiative for Rice Genome Sequencing National Research Centre on Plant Biotechnology

Indian Agricultural Research Institute New Delhi 110012 India 2 Indian Initiative for Rice Genome Sequencing Department of Plant Molecular Biology University of

Delhi South Campus New Delhi 110021 India 3 Genome Research Centre Cold Spring Harbor Laboratory 1 Bungtown Road New York USA 4 The Plant Genome Initiative at RutgersndashWaksman Institute Rutgers University 190 Frelinghuysen

Road Piscataway NJ 08873 USA 5 Department of Plant Sciences 303 Forbes Building Arizona Genomics Institute The University of

Arizona Tucson AZ 85721 USA 6 Department of Genome Research National Institute of Agrobiological Sciences 1-2 Kannondai 2-

chome Tsukuba Ibaraki 305-8602 Japan Akhilesh K Tyagi Email akhileshgenomeindiaorg Wherever RiceGAAS data were not available the genes were predicted by FGENESH trained for monocot plant species (http wwwsoftberrycomberryphtml) TAG Theoretical and Applied Genetics DOI 101007s00122-004-1621-0 Issue Volume 109 Number 1 Date June 2004 Pages 10 - 22

Annotation of a 95-kb Populus deltoides genomic sequence reveals a disease resistance gene cluster and novel class I and class II transposable elements

M Lescot1 4 S Rombauts1 J Zhang1 S Aubourg1 5 C Matheacute1 6 S Jansson2 P Rouzeacute1 3 and W Boerjan1

1 Department of Plant Systems Biology Flanders Interuniversity Institute for Biotechnology Ghent University Technologiepark 927 9052 Gent Belgium

2 Department of Plant Physiology University of Umearing 901 87 Umearing Sweden 3 Laboratoire Associeacute de l Institut National de la Recherche Agronomique (France) Ghent

University 9052 Gent Belgium 4 Present address CIRAD-Biotrop TA4003 34398 Montpellier Cedex 5 France 5 Pre sent address Uniteacute de Recherche en Geacutenomique Veacutegeacutetale INRA 91057 Evry Cedex

France 6 Present address Laboratoire de Biologie Vasculaire Institut de Pharmacologie et Biologie

Structurale 205 route de Narbonne 31077 Toulouse Cedex France

P Rouzeacute Email pierrerouzepsbugentbe 1999 httpwwwtigrorgtdbglimmermglmr_formhtml) and FgenesH for dicots or monocots (Salamov and Solovyev 2000 httpwwwsoftberrycom) BIOINFORMATICS 2004 vol20 N9 p1416-1427 J Yuan B Bush A Elbrecht Y Liu T Zhang W Zhao hellip - suchasGRAIL(Lopezetal 1994 Roberts 1991 Uberbacher et al 1996) GENESCOPE (Murakami and Takagi 1998) fgenesh (Salamov and Solovyev 2000) GeneMark Molecular Plant Pathology Volume 5 Issue 6 Page 515 - November 2004 Pathogen profile Heading for disaster Fusarium graminearum on cereal crops RUBELLA S GOSWAMI AND H CORBY KISTLERCorrespondence E-mail HCKISTumnedu This pipeline uses a combination of the programs FGENESH and FGENESH+ (Salamov and Solovyev 2000) modified by Softberry ( http wwwsoftberrycom ) with Nucleic Acids Research 2004 Vol 32 Database issue D41-D44 MIPS analysis and annotation of proteins from whole genomes H W Mewes12 C Amid1 R Arnold1 D Frishman2 U Guumlldener1 G Mannhaupt2 M Muumlnsterkoumltter1 P Pagel1 N Strack2 V Stuumlmpflen1 J Warfsmann1 and A Ruepp1 1 Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1 D-85764 Neuherberg Germany and 2 Technische Universitaumlt Muumlnchen Chair of Genome Oriented Bioinformatics Center of Life and Food Science D-85350 Freising-Weihenstephan Germany To whom correspondence should be addressed at Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 D-85764 Neuherberg Germany Tel +49 89 3187 3580 Fax +49 89 3187 3585 Email wmewesgsfde The genome of 40 Mb encodes 10 000 proteins automatically predicted by the program FGENESH (httpsoftberry com) specifically trained for Neurospora Annual Review of Genomics and Human Genetics Vol 5 15-56 (Volume publication date September 2004) COMPARATIVE GENOMICS Webb Miller Kateryna D Makova Anton Nekrutenko and Ross C Hardison The Center for Comparative Genomics and Bioinformatics The Huck Institutes of Life Sciences and the Departments of Biology Computer Science and Engineering and Biochemistry and Molecular Biology Pennsylvania State University University Park Pennsylvania email webbbxpsuedu kdm16psuedu antonbxpsuedu rch8psuedu These algorithms include Genscan the most popular gene prediction tool (24) GenMark (117) Fgenesh (155) GeneID (144) and others (for an excellent overview DNA and Cell Biology May 2004 Vol 23 No 5 311-324 Harbinger Transposons and an Ancient HARBI1 Gene Derived from a Transposase

Vladimir V Kapitonov Genetic Information Research Institute Mountain View California Jerzy Jurka Genetic Information Research Institute Mountain View California We used FGENESH (Salamov and Solovyev 2000) and GeneScan (Burge and Karlin 1997) for the identification of exons and introns The d N d S ratio Proc Natl Acad Sci U S A 2004 February 17 101(7) 1910ndash1915 Published online 2004 February 5 doi 101073pnas0308430100 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin NamdaggerDagger Joonyul Kimsectpara Shinyoung Leesect Gynheung Ansect Hong Madagger and Masatoshi Neidagger

daggerInstitute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University University Park PA 16802 and sectNational Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Dagger To whom correspondence should be addressed E-mail jyn101psuedu paraPresent address Michigan State UniversityndashDepartment of Energy Plant Research Laboratory and Department of Biochemistry and Molecular Biology Michigan State University East Lansing MI 48824 of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program fgenesh (wwwsoftberrycom Nucleic Acids Research 2004 Vol 32 Database issue D377-D382 BGI-RIS an integrated information resource and comparative analysis workbench for rice genomics Wenming Zhao1 Jing Wang2 Ximiao He1 Xiaobing Huang1 Yongzhi Jiao1 Mingtao Dai1 Shulin Wei1 Jian Fu1 Ye Chen1 Xiaoyu Ren1 Yong Zhang12 Peixiang Ni1 Jianguo Zhang1 Songgang Li12 Jian Wang1 Gane Ka-Shu Wong13 Hongyu Zhao4 Jun Yu1 Huanming Yang1 and Jun Wang1 1 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3 University of Washington Genome Center Department of Medicine Seattle WA 98195 USA and 4 Yale University School of Medicine Department of Epidemiology and Public Health New Haven CT 06520-8034 USA To whom correspondence should be addressed Tel +86 10 80481662 Fax +86 10 80498676 Email wangjgenomicsorgcn Correspondence may also be addressed to Huanming Yang Tel +86 10 80494969 Fax +86 10 80491181 Email yanghmgenomicsorgcn The authors wish it to be known that in their opinion the first four authors should be regarded as joint First Authors The contig sequences were annotated for gene content by using automated processes that involve ab initio gene finders such as FgeneSH (httpwwwsoftberrycom Genome Research 141932-1937 2004 Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome

Jinsheng Lai1 Nrisingha Dey2 Cheol-Soo Kim35 Arvind K Bharti1 Stephen Rudd46 Klaus FX Mayer4 Brian A Larkins3 Philip Becraft2 and Joachim Messing17 1 Waksman Institute Rutgers The State University of New Jersey Piscataway New Jersey 08854 USA 2 Department of Genetics Development amp Cell Biology Iowa State University Ames Iowa 50011 USA 3 Department of Plant Science University of Arizona Tucson Arizona 85721 USA 4 Munich Information Center for Protein Sequences Institute for Bioinformatics GSF Research Center for Environment and Health Neuherberg Germany A total of 54397 putative genes could be predicted for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for Plant Molecular Biology DOI 101023BPLAN000002876821587dc Issue Volume 54 Number 1 Date January 2004 Pages 55 - 69

Dynamics of the evolution of orthologous and paralogous portions of a complex locus region in two genomes of allopolyploid wheat

Xiu-Ying Kong1 2 Yong Qiang Gu3 Frank M You4 Jorge Dubcovsky4 and Olin D Anderson3

1 Genetic Resources Conservation Program University of California Davis CA 95616 USA 2 Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081

China 3 US Dept of Agriculture Western Regional Research Center Agricultural Research Service 800

Buchanan Street Albany CA 94710 USA 4 Department of Agronomy and Range Sciences University of California Davis CA 95616 USA FGENESH (httpwwwsoftberrycomberryphtml) and GENES- CAN (httpgenemark miteduGENESCANhtml) were used for gene prediction Current Genetics DOI 101007s00294-003-0451-y Issue Volume 44 Number 6 Date January 2004 Pages 329 - 338

Chromosome rearrangements in isolates that escape from het-c heterokaryon incompatibility in Neurospora crassa

Qijun Xiang1 and N Louise Glass1

Department of Plant and Microbial Biology University of California Berkeley CA 94720-3102 USA Hypothetical proteins are predicted from FGENESH calls with overlapping Blastx hits (but not with trusted homology) while Predicted Molecular Genetics and Genomics DOI 101007s00438-004-0990-z Issue Volume 271 Number 4 Date May 2004 Pages 402 - 415

Genome-wide identification of NBS genes in japonica rice reveals significant expansion of divergent non-TIR NBS-LRR genes

T Zhou1 Y Wang1 J-Q Chen1 H Araki2 Z Jing1 K Jiang1 J Shen1 and D Tian1

1 State Key Laboratory of Pharmaceutical Biotechnology Department of Biology Nanjing University 210093 Nanjing China

2 Department of Ecology and Evolution University of Chicago Chicago IL 60637 USA to 5000ndash10000 bp from both ends of the hits and then the expanded nucleotide fragments were reannotated using the gene-finding programs FGENESH (http www Proc Natl Acad Sci U S A 2004 June 15 101(24) 9045ndash9050 Genetics

Genetic control of branching in foxtail millet Andrew N Doust Katrien M DevosDaggerdagger Michael D Gadberrysect Mike D GaleDagger and Elizabeth A KelloggDepartment of Biology University of Missouri 8001 Natural Bridge Road St Louis MO 63121 and DaggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH United Kingdom dagger To whom correspondence should be addressed E-mail adoustumsledu sectPresent address Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building University of Georgia Athens GA 30602 Communicated by Peter H Raven Missouri Botanical Garden St Louis MO April 23 2004 Each of these contigs was scanned by using fgenesh (28) and identified ORFs were translated and compared with ORFs from other contigs from the same QTL region Mol Biol Evol 21(9)1769-1780 2004 Merlin a New Superfamily of DNA Transposons Identified in Diverse Animal Genomes and Related to Bacterial IS1016 Insertion Sequences Ceacutedric Feschotte1 Departments of Plant Biology and Genetics The University of Georgia Athens Correspondence E-mail cedricplantbiougaedu coding sequences were assembled by removing introns predicted with more than 85 confidence by NetGene2 (httpwwwcbsdtudk) andor FGENESH (httpgenomic Genome Research 141924ndash1931 copy2004 Gene Loss and Movement in the Maize Genome Jinsheng Lai1 Jianxin Ma23 Zuzana Swigonˇovб1 Wusirika Ramakrishna24

Eric Linton15 Victor Llaca16 Bahattin Tanyolac17 Yong-Jin Park28 O-Young Jeong29

Jeffrey L Bennetzen23 and Joachim Messing110 1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854-8020 USA 2Department of Biological Sciences Purdue University West Lafayette Indiana 47907-1392 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602-7223 USA 4Department of Biological Sciences 740 DOW Michigan Tech University Houghton MI 49931 USA 5Plant Biology Labs Michigan State University East Lansing MI 48824 USA 6Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880-0353 USA 7Department of Bioengineering Ege University Izmir 35100 Turkey 8National Institute of Agricultural Biotechnology Suwon 441-707 Republic of Korea 9National Institute of Crop Science Suwon 441-857 Republic of Korea 10Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 The FGENESH program predicted four of which three (gene 1d in the maize orp1 region gene 5a 5b in the rice r1 region) would produce truncated proteins

Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Molecular Microbiology Volume 54 Issue 2 Page 407 - October 2004 doi101111j1365-2958200404310x Cryptococcus neoformans Kin1 protein kinase homologue identified through a Caenorhabditis elegans screen promotes virulence in mammals Eleftherios Mylonakis1 Alexander Idnurm2 Roberto Moreno1 Joseph El Khoury134 James B Rottman5 Frederick M Ausubel67 Joseph Heitman28910 and Stephen B Calderwood1111Division of Infectious Diseases Massachusetts General Hospital Boston MA 02114 USA 2Department of Molecular Genetics and Microbiology Duke University Medical Center Durham NC 27710 USA 3Center for Immunology and Inflammatory Diseases and 4Division of Rheumatology Allergy and Immunology Massachusetts General Hospital Boston MA 02114 USA 5Archemix Corporation Cambridge MA 02139 USA 6Department of Molecular Biology Massachusetts General Hospital Boston MA 02114 USA 7Department of Genetics Harvard Medical School Boston MA 02115 USA 8Division of Infectious Diseases 9Department of Medicine and 10Howard Hughes Medical Institute Duke University Medical Center Durham NC 27710 USA 11Department of Microbiology and Molecular Genetics Harvard Medical School Boston MA 02115 USA E-mail scalderwoodpartnersorg

Sequences were compared with the H99 genome database at Duke University and genes predicted in these regions by FGENESH software ( http wwwsoftberrycom TAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Original Paper Characterization of soybean genomic features by analysis of its expressed sequence tags Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

1 Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

2 Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China Jin-Song Zhang Email jszhanggeneticsaccnShou-Yi Chen Email sychengeneticsaccn six BAC-contig sequences of M truncatula were analyzed and the results based on the gene prediction program FGENSH (Arabidopsis matchFGENESH prediction) (http Current Proteomics Volume 1 Number 1 January 2004 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE11 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Chinese Science Bulletin 2004 Vol 49 No 4 355-362 The VER2 promoter contains repeated sequences and requires vernalization for its activity in winter wheat (Triticum aestivum L) XU Wenzhong

1 WANG Xin

1 FENG Qi

2 ZHANG Lei

2 LIU Yaoguang

3 HAN Bin

2 CHONG

Kang1 XU Zhihong

1 amp TAN Kehui

1

1 Research Center for Molecular Developmental Biology Key Lab of Photosynthesis and Environmental Molecular Physiology Institute of Botany Chinese Academy of Sciences (CAS) Beijing 100093 China 2 National Center for Gene Research CAS Shanghai 200233 China 3 Genetic Engineering Laboratory College of Life Sciences South China Agricultural University Guangzhou 510642 China Correspondence should be addressed to Chong Kang (e-mail mailtochongknsibcasaccn) Sequence analyses were finished using biological softwares on Internet such as FGENESH 10 (Prediction of potential genes in Plant (Dct) genomic DNA)

TAG Theoretical and Applied Genetics DOI 101007s00122-003-1457-z Issue Volume 108 Number 3 Date February 2004 Pages 392 - 400

Sequence variations of simple sequence repeats on chromosome-4 in two subspecies of the Asian cultivated rice Can Li1 Yu Zhang1 Kai Ying1 Xiaolei Liang1 and Bin Han1

(1) National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China To characterize the possible relationship between SSRs and genes predicted by using FGENESH we investigated the distribution of SSRs in the rice chromosome-4 Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene TAG Theoretical and Applied Genetics DOI 101007s00122-004-1591-2 Issue Volume 108 Number 8 Date May 2004 Pages 1449 - 1457

Positional cloning of the rice Rf-1 gene a restorer of BT-type cytoplasmic male sterility that encodes a mitochondria-targeting PPR protein

H Akagi1 A Nakamura2 Y Yokozeki-Misono2 A Inagaki2 4 H Takahashi1 K Mori1 and T Fujimura3

1 Laboratory of Plant Breeding and Genetics Department of Biological Production Faculty of Bioresource Sciences Akita Prefectural University Kaidoubata-Nishi 241-7 Shimoshinjyo-Nakano 010-0195 Akita Japan

2 Biochemical Technology Section Life Science Laboratory Performance Materials RampD Center Mitsui Chemicals Togo 1144 297-0017 Mobara Japan

3 Institute of Agricultural and Forest Engineering University of Tsukuba Tennoudai 1-1-1 Tsukuba 305-8572 Ibaraki Japan

4 Present address Department of Synthetic Chemistry and Biological Chemistry Graduate School of Engineering Kyoto University Yoshida-Honmachi Sakyo-ku 606-8501 Kyoto japan

H Akagi Email akagiakita-puacjp Software Develop- ment Tokyo) Genomic sequences were also analyzed using gene prediction programs genescan and fgenesh Table 1 DNA Genome Research 14942-950 2004

The Ensembl Automatic Gene Annotation System Val Curwen1 Eduardo Eyras1 T Daniel Andrews1 Laura Clarke1 Emmanuel Mongin2 Steven MJ Searle1 and Michele Clamp34 1 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 EMBL European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SD UK 3 The Broad Institute Cambridge Massachusetts 02141 USA Commonly we use Genscan for ab initio prediction in human mouse and rat but the system is equally applicable to other methods such as FgenesH (Solovyev et Gene 324 (2004) 105ndash115 Transcript abundance of rml1 encoding a putative GT1-like factor in rice is up-regulated by Magnaporthe grisea and down-regulated by light Rong Wang ab Guofan Hongab Bin Hana a

National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China b

Shanghai Institute of Biochemistry and Cell Biology Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 320 Yueyang Road Shanghai 200031 China 1A and 2A) The structure of rml1 _ a is as same as that predicted by FGENESH software The 3V UTR of rml1 _ a is confirmed with the length of 596 bp Genome Biology 2004 5R46 doi101186gb-2004-5-7-r46 Identification of conserved gene structures and carboxy-terminal motifs in the Myb gene family of Arabidopsis and Oryza sativa L ssp Indica Cizhong Jiang1 Xun Gu1 2 and Thomas Peterson1 1Department of Genetics Development and Cell Biology and Department of Agronomy Iowa State University Ames IA 50011 USA 2LHB Center for Bioinformatics and Biological Statistics Iowa State University Ames IA 50011 USA FGeneSH has been used successfully to predict genes in rice [9] and GenScan was used together with it to predict genes by taking rice genomic sequences as Molecular Microbiology 53 (5) 1307-1318 - September 2004 doi 101111 j1365-2958200404215x The sirodesmin biosynthetic gene cluster of the plant pathogenic fungus Leptosphaeria maculans Donald M Gardiner1 Anton J Cozijnsen1 Leanne M Wilson1 M Soledade C Pedras2 and Barbara J Howlett1

1School of Botany The University of Melbourne Victoria Australia 3010 2Department of Chemistry University of Saskatchewan 110 Science Place Saskatoon SK Canada S7N 5C9 E-mail dgardinerpgradunimelbeduau http wwwtigrorg Putative genes were predicted using Fgenesh software at http wwwsoftberrycom Fungal culture The wild type Journal of Molecular Evolution DOI 101007s00239-004-2666-z Issue Volume 59 Number 6 Date December 2004 Pages 761 - 770 Analysis of the Molecular Evolutionary History of the Ascorbate Peroxidase Gene Family Inferences from the Rice Genome Felipe Karam Teixeira1 Larissa Menezes-Benavente1 Rogeacuterio Margis1 2 and Maacutercia Margis-Pinheiro1

(1) Laboratoacuterio de Geneacutetica Molecular Vegetal Departamento de Geneacutetica UFRJ 21944-970 Rio de Janeiro Brasil(2) Departamento de Bioquiacutemica Instituto de Quiacutemica UFRJ 21944-970 Rio de Janeiro Brasil Genomic se- quences were also analyzed in the FGENESH gene structure pre- diction program (httpwwwsoftberrycom) (Solovyev 2001) and GeneMark program (http Incomplete gene structure prediction with almost 100 specificity SL Chin J Xiong T Ioerger SH Sze - 2004 - txspacetamuedu iv Fgenesh-C The proposed gene structure prediction algorithm by far has the best 25 12 Comparison against Est2Genome Sim4 Spidey Fgenesh-c Journal of Biotechnology 109 (2004) 217ndash226

Preparation of single rice chromosome for construction of a DNA library using a laser microbeam trap

Xiaohui Liu a1 Haowei Wang b1 Yinmei Li b Yesheng Tang a Yilei Liu a Xin u a Peixin Jia a Kai Ying a Qi Feng a Jianping Guan a Chaoqing Jin a Lei Zhang a Liren Lou b Zhuan Zhou c Bin Han aA National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200233 China B Department of Physics University of Science and Technology of China Hefei 230026 China C Institute of Neuroscience Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200031 China Corresponding author Fax +86-21-64825775 E-mail address bhanncgraccn (B Han) ers These sequences were further annotated using gene-prediction software FGENESH to give the pos- sible protein-coding region Science 303 1364-1367 Medicago truncatula DMI1 Required for Bacterial and Fungal Symbioses in Legumes Ane et al (2004) 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberry phtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Proc Natl Acad Sci U S A 2004 August 24 101(34) 12404ndash12410 Inaugural Articles Rapid recent growth and divergence of rice nuclear genomes Jianxin Ma and Jeffrey L Bennetzen

Department of Genetics University of Georgia Athens GA 30602 To whom correspondence should be addressed E-mail maizeugaedu Contributed by Jeffrey L Bennetzen May 25 2004 Almost all LTR-retrotransposons including solo LTRs identified in our studies were predicted as genes by the gene-finding program fgenesh (data not shown) The Plant Journal Volume 37 Issue 4 Page 517 -527 - February 2004 doi101046j1365-313X200301976x Xa26 a gene conferring resistance to Xanthomonas oryzae pv oryzae in rice encodes an LRR receptor kinase-like protein Xinli Sun Yinglong Cao Zhifen Yang Caiguo Xu Xianghua Li Shiping Wang and Qifa Zhang National Key Laboratory of Crop Genetic Improvement National Center of Crop Molecular Breeding Huazhong Agricultural University Wuhan 430070 China For correspondence (fax +86 27 87287092 e-mail swangmailhzaueducn) al 1997) Gene prediction programs used were genscan (Burge and Karlin 1997) and fgenesh (httpwwwsoftberrycom) Promoter Genome Research 141916ndash1923 copy2004 by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonˇovaґ16 Jinsheng Lai16 Jianxin Ma23 Wusirika Ramakrishna24

Victor Llaca15 Jeffrey L Bennetzen23 and Joachim Messing17

1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854 USA

2Department of Biological Sciences and Genetics Program West Lafayette Indiana 47907 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602 USA 4Department of Biological Sciences Michigan Tech University MI 49931 USA 5Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880 USA 6These authors contributed equally to this work 7Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 Page 1 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonovaacute 16 Jinsheng Lai 16 Jianxin Ma 23 Wusirika Ramakrishna 24 TAG Theoretical and Applied Genetics DOI 101007s00122-004-1667-z Issue Volume 109 Number 3 Date August 2004 Pages 515 - 522

Physical mapping and putative candidate gene identification of a quantitative trait locus Ctb1 for cold tolerance at the booting stage of rice K Saito1 Y Hayano-Saito1 W Maruyama-Funatsuki1 Y Sato1 and A Kato1

(1) National Agricultural Research Center for Hokkaido Region Hitsujigaoka 1 Toyohira Sapporo Hokkaido 062-8555 JapanK Saito Email kjsaitoaffrcgojp GENSCAN RICEHMM FGENESH MZEF ) a splice prediction program ( SPLICEPREDIC- TOR ) homology search analysis programs ( BLAST HMMER TAG Theoretical and Applied Genetics DOI 101007s00122-004-1697-6 Issue Volume 109 Number 4 Date August 2004 Pages 690 - 699 The anthracnose resistance locus Co-4 of common bean is located on chromosome 3 and contains putative disease resistance-related genes M Melotto1 4 M F Coelho1 A Pedrosa-Harand2 J D Kelly3 and L E A Camargo1

1 Departamento de Fitopatologia Laboratoacuterio de Geneacutetica Molecular ESALQ Universidade de Satildeo Paulo Piracicaba SP CP 9 13418-900 Brazil

2 Department of Cell Biology and Genetics Institute of Botany University of Vienna Rennweg 14 Vienna 1030 Austria

3 Department of Crop and Soil Sciences Michigan State University East Lansing MI 48824 USA 4 Present address MSU-DOE Plant Research Laboratory Michigan State University 206 Plant Biology

Building East Lansing MI 48824 USA M Melotto Email melottommsuedu and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (httpwww softberrycom)mdashusing Arabidopsis as the model or- ganism Journal of Genetics Vol 83 No 1 P 79-99 April 2004 Structural and functional analysis of rice genome Tyagi A K Khurana J P Khurana P Raghuvanshi S Gaur A Kapur A Gupta V Kumar D Ravi V Vij S Khurana P and Sharma S Department of Plant Molecular Biology University of Delhi South Campus Benito Juarez Road New Delhi 110 021 India It inte- grates results from several gene prediction software such as GENSCAN (Burge and Karlin 1997) FGENESH (Sala- mov and Solovyev 2000) RiceHMM (Sakata The Plant Cell 161220-1234 (2004) Comparative Analysis of the Receptor-Like Kinase Family in Arabidopsis and Rice Shin-Han Shiua Wojciech M Karlowskib Runsun Panad Yun-Huei Tzengac Klaus F X Mayerb and Wen-Hsiung Lia1 a Department of Ecology and Evolution University of Chicago Chicago Illinois 60637 b Munich Information Center for Protein SequencesInstitute of Bioinformatics GSF National Research Center for Environment and Health Neuherberg 85764 Germany c Department of Mathematics National Tsing Hua University Hsinchu Taiwan 300 d Institute of Information Science Academia Sinica Taiwan 115

1 To whom correspondence should be addressed E-mail whliuchicagoedu fax 773-702-9740 a permissive E value cutoff of 1 The rice genes from the indica subspecies was predicted using the whole genome shotgun assembly with FGENESH (Solovyev 2002 Genome Research 141474ndash1482 (2004) copy by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Incongruent Patterns of Local and Global Genome Size Evolution in Cotton Corrinne E Grover1 HyeRan Kim2 Rod A Wing2 Andrew H Paterson3 and Jonathan F Wendel14 1Department of Ecology Evolution and Organismal Biology Iowa State University Ames Iowa 50011 USA 2Arizona Genomics Institute University of Arizona Tucson Arizona 85721 USA 3Plant Genome Mapping Laboratory University of Georgia Athens Georgia 30602 USA hellip Potential genes were predicted by three independent programs FGENESH (httpwwwsoftberrycom) Plant Physiology May 2004 Vol 135 pp 459-470 Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four Triticeae Genomes Yong Qiang Gu Devin Coleman-Derr Xiuying Kong and Olin D Anderson United States Department of Agriculture-Agricultural Research Service Western Regional Research Center Albany California 94710 (YQG DC-D ODA) and Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081 China (XK) FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology October 2004 Vol 136 pp 3177-3190 Comparative Sequence Analysis of the Region Harboring the Hardness Locus in Barley and Its Colinear Region in Rice1

Katherine S Caldwell2 Peter Langridge and Wayne Powell Scottish Crop Research Institute Invergowrie Dundee DD2 5DA United Kingdom (KSC WP) and School of Agriculture and Wine (KSC PL) and Australian Centre for Plant Functional Genomics (PL) University of Adelaide Waite Campus Glen Osmond South Australia 5064 Australia jp Sakata et al 2002 ) which couples the integration of several programs for the prediction of open reading frames (GENSCAN RiceHMM FGENESH MZEF) with GENES amp DEVELOPMENT 18687-699 2004 pyramus and thisbe FGF genes that pattern the mesoderm of Drosophila embryos

Angelike Stathopoulos1 Bergin Tam1 Matthew Ronshaugen1 Manfred Frasch2 and Michael Levine13 1 Department of Molecular and Cell Biology Division of Genetics amp Development University of California Berkeley California 94720-3204 USA 2 Brookdale Department of Molecular Cell and Developmental Biology Mount Sinai School of Medicine New York New York 10029 USA hellip FGF protein sequences used in alignment and phylogenetic reconstruction were gathered from GenBank or inferred from genomic sequence using GENESCAN (Burge and Karlin 1997 ) and FGENESHhellip Genome Research 141888-1901 2004 Organization and Evolution of a Gene-Rich Region of the Mouse Genome A 127-Mb Region Deleted in the Del(13)Svea36H Mouse Ann-Marie Mallon14 Laurens Wilming24 Joseph Weekes1 James GR Gilbert2 Jennifer Ashurst2 Sandrine Peyrefitte2 Lucy Matthews2 Matthew Cadman1 Richard McKeone1 Chris A Sellick1 Ruth Arkell1 Marc RM Botcherby3 Mark A Strivens1 R Duncan Campbell3 Simon Gregory25 Paul Denny1 John M Hancock16 Jane Rogers2 and Steve DM Brown1 1 Medical Research Council Mammalian Genetics Unit Harwell Oxfordshire United Kingdom 2 Wellcome Trust Sanger Institute Hinxton Genome Campus United Kingdom 3 Medical Research Council Rosalind Franklin Centre for Genomics Research Hinxton Genome Campus United Kingdom hellip Ab initio gene structures were predicted using FGENESH (Salamov and Solovyev 2000 ) and GENSCANhellip Current Proteomics January 2004 vol 1 no 1 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE1 Affiliations 1 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom

sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene Nucleic Acids Research 2003 Vol 31 No 4 1148-1155 Characterization of Arabidopsis thaliana ortholog of the human breast cancer susceptibility gene 1 AtBRCA1 strongly induced by gamma rays S Lafarge and M-H Montaneacute CEA Cadarache DSV-DEVM Laboratoire de Radiobiologie Veacutegeacutetale Bat 185 F-13108 St Paul Lez Durance Cedex France To whom correspondence should be addressed Tel +33 4 42 25 35 56 Fax + 33 4 42 25 26 25 Email marie-helenemontaneceafr Received November 18 2002 Accepted December 5 2002 DDBJEMBLGenBank accession no AF515728 hellipGene structure prediction was done on software implemented on the Softberry web page (httpwwwsoftberrycom) analysis of protein domains using the SMARThellip hellipThe gene structure of At4g21070 was determined with three gene structure prediction software packages (Softberry GenScan Grail) hellip To resolve this ambiguity in intronndashexon prediction we postulated the presence of two genes given by Softberry prediction software and performed northern blotting and 5 RACE to characterize the structural organization of the At4g21070 locushellip Proc Natl Acad Sci U S A 2003 July 22 100(15) 9055ndash9060 doi 101073pnas1032999100 Plant Biology Gene expression of a gene family in maize based on noncollinear haplotypes Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway NJ 08854-8020 Communicated by Brian A Larkins University of Arizona Tucson AZ May 19 2003 (received for review 2002 April 10) To whom correspondence should be addressed E-mail messingmbclrutgersedu The FGENESH program (Softberry Mount Kisco NY) was used for gene prediction analysis BMC Genomics 2003 4 22

doi 1011861471-2164-4-22 Published online 2003 June 3 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams 12 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA 3Department of Biological Sciences Wichita State University Wichita Kansas USA

Corresponding author Sreedhar Oduru odurusreedharttuhscedu Janee L Campbell janeecampbellttuhscedu SriTulasi Karri phrskttuhscedu William J Hendry williamhendrywichitaedu Shafiq A Khan shafiqkhanttuhscedu Simon C Williams simonwilliamsttuhscedu

Two gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=H Insect Molecular Biology Volume 12 Issue 4 Page 319 - August 2003 doi101046j1365-2583200300415x Expression of an Aedes aegypti cation-chloride cotransporter and its Drosophila homologues V Filippov K Aimanova and S S GillAffiliations Department of Cell Biology and Neuroscience University of California Riverside USA Correspondence Sarjeet S Gill 5429 Boyce Hall Environmental Toxicology Graduate Program University of California Riverside CA 92521 USA Tel +1 909 787 4621 Fax +1 909 787 3087 E-mail Sarjeetgillucredu significant similarity to the Drosophila genes were used for gene structure prediction with the FGENESH program available on site http wwwsoftberrycom Developmental Biology 256 (2003) 276ndash289 tcl-2 encodes a novel protein that acts synergistically with Wnt signaling pathways in C elegans Xiaojun Zhaoa Hitoshi Sawab and Michael A Hermana a Program in Molecular Cellular and Developmental Biology Division of Biology Kansas State University Manhattan KS 66506 USA b Laboratory for Cell Fate Decision RIKEN Center for Developmental Biology 2-2-3 Minatojima-minamimachi Chuo-ku Kobe 650-0047 Japan Received for publication 10 September 2002 revised 25 November 2002 accepted 19 December 2002 hellipCbTCL-2 is conceptually translated from a gene predicted by the FGENSH (Salamov and Solovyev 2000 httpwwwsoftberrycom) using defaults for C elegans genomic sequences Proc Natl Acad Sci U S A 2003 May 27 100(11) 6569ndash6574 doi 101073pnas0732024100 Evolution Molecular paleontology of transposable elements in the Drosophila melanogaster genome Vladimir V Kapitonov and Jerzy Jurka

Genetic Information Research Institute 2081 Landings Drive Mountain View CA 94043 Communicated by Margaret G Kidwell University of Arizona Tucson AZ April 7 2003 (received for review 2002 December 23)

To whom correspondence may be addressed E-mail vladimirulamgirinstorg or jurkagirinstorg hellipWe used FGENESH (ref 18 wwwsoftberrycom) for identifying genes encoded by TEs Genetics and Molecular Biology ISSN 1415-4757 versioacuten impresa Genet Mol Biol v26 n4 Satildeo Paulo dic 2003 Iron homeostasis related genes in rice Jeferson GrossI II Ricardo Joseacute SteinII Arthur Germano Fett-NetoI II Janette Palma FettI II

IUniversidade Federal do Rio Grande do Sul Centro de Biotecnologia Porto Alegre RS Brazil IIUniversidade Federal do Rio Grande do Sul Departamento de Botacircnica Porto Alegre RS Brazil IIIBotanical Institute of Ludwig-Maximilians-Universitaumlt Muumlnchen Germany The prediction algorithms were GenScan (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) GenomeScan (Burge and Karlin 1997 httpgenesmitedugenomescanhtml) FGENESH (Salamov and Solovyev 2000 httpwwwsoftberrycomberryphtmltopic= gfind) GeneMarkhmm (Borodovsky and Lukashin unpublished httpopalbiologygatecheduGeneMarkeukhmmcgi) and GrailEXP (Xu and Uberbacher 1997 httpcompbioornlgovgrailexp) Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome J Lai N Dey CS Kim AK Bharti S Rudd KFX Mayer hellip for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for monocotyledonous genes (httpwwwsoftberrycom) Published 23 September 2003 BMC Plant Biology 2003 36 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya12 Address 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia and 2NSW Agricultural Genomics Centre Wagga Wagga Australia Email Qian-Hao Zhu - qianhaozhucsiroau Mohammad Shamsul Hoque - mohammadhoquecsiroau Elizabeth S Dennis - lizdenniscsiroau Narayana M Upadhyaya - narayanaupadhyayacsiroau Corresponding author hellip Analyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCANhellip

The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Australasian Plant Pathology Volume 32 Number 4 2003 pp 511-519 Small scale functional genomics of the blackleg fungus Leptosphaeria maculans analysis of a 38 kb region Alexander Idnurm Janet L Taylor M Soledade C Pedras and Barbara J Howlett vertebrate and Arabidopsis settings Burge and Karlin 1997) and FGENESH on Neurospora crassa and Schizosaccharomyces pombe settings (wwwsoftberrycom) as Barley Genetics Newsletter Volume 32 Hard-copy edition pages 34 - 37 MAPPING AND SEQUENCING OF THE BARLEY PUTATIVE HYPERSENSITIVE INDUCED REACTION GENES Nils Rostoks1 David Kudrna1 and Andris Kleinhofs12

1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 2 School of Molecular Biosciences Washington State University Pullman WA 99164 The full length coding sequence was reconstructed using a combination of FGENESH gene prediction program (httpwwwsoftberrycom) and alignment with cDNAs from the other barley HIR groups TAG Theoretical and Applied Genetics DOI 101007s00294-003-0391-6 Issue Volume 43 Number 5 Date August 2003 Pages 351 - 357 Characterisation of the mating-type locus of the plant pathogenic ascomycete Leptosphaeria maculans Anton J Cozijnsen A1 and Barbara J Howlett A1 A1 School of Botany The University of Melbourne 3010 Victoria Australia hellipGenes introns exons and transcription initiation sites were predicted by analysis with FGENESH (wwwsoftberrycom) on Neurospora crassa andhellip BMC Plant Biol 2003 3 6

doi 1011861471-2229-3-6 Published online 2003 September 23 Received May 23 2003 Accepted September 23 2003 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya 12 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia 2NSW Agricultural Genomics Centre Wagga Wagga Australia

Corresponding author Qian-Hao Zhu qianhaozhucsiroau Mohammad Shamsul Hoque mohammadhoquecsiroau Elizabeth S Dennis lizdenniscsiroau Narayana M Upadhyaya narayanaupadhyayacsiroau hellipAnalyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml identified a single-exon gene capable of encoding a protein with the DNA binding domain of the EREBPAP2 family of plant transcription factors [2636] 1515 bp downstream from the Ds insertion The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Alignment of EREBPAP2 domains was performed using programs of Genetics Computer Group Wisconsin software suit [11] Genetics Vol 164 655-664 June 2003 Copyright copy 2003 Map-Based Cloning of Leaf Rust Resistance Gene Lr21 From the Large and Polyploid Genome of Bread Wheat Li Huanga Steven A Brooksa Wanlong Lia John P Fellersb Harold N Tricka and Bikram S Gilla a Wheat Genetics Resource Center Department of Plant Pathology Kansas State University Manhattan Kansas 66506-5502 b USDA-ARS Plant Science and Entomology Unit Kansas State University Manhattan Kansas 66506-5502 Corresponding author Bikram S Gill 4024 Throckmorton Kansas State University Manhattan KS 66506-5502 bsgksuedu (E-mail hellipIn addition FGENSH 11 (httpwwwsoftberrycom) was used for gene prediction (with monocot genomic DNA parameters) Nucleic Acids Research 2003 Vol 31 No 1 229-233 The TIGR rice genome annotation resource annotating the rice genome and creating resources for plant biologists Qiaoping Yuan Shu Ouyang Jia Liu Bernard Suh Foo Cheung Razvan Sultana Dan Lee John Quackenbush and C Robin Buell The Institute for Genomic Research 9712 Medical Center Dr Rockville MD 20850 USA To whom correspondence should be addressed Tel +1 301 8383558 Fax +1 301 8380208 Email rbuelltigrorg Received August 14 2002 Revised and Accepted October 2 2002 hellipThe rice sequences were processed with multiple ab initio gene finders including FGENESH (httpwwwsoftberrycom)hellip hellip Working models were generated using the FGENESH output and putative identification for the gene was obtained from the most significant database match while models with no significant database match were labeled as hypothetical proteins

JXB Advance Access originally published online on June 18 2003 Journal of Experimental Botany Vol 54 No 389 pp 1995-1996 August 1 2003 Received 21 April 2003 Accepted 25 April 2003 OsSET1 a novel SET-domain-containing gene from rice Yun-Kuan Liang Ying Wang Yong Zhang Song-Gang Li Xiao-Chun Lu Hong Li Cheng Zou Zhi-Hong Xu and Shu-Nong Bai PKU-Yale Joint Research Center of Agricultural and Plant Molecular Biology National Key Laboratory of Protein Engineering and Plant Gene Engineering College of Life Sciences Peking University 5 Yiheyuan Road Beijing 100871 PR China To whom correspondence should be addressed Fax +86 10 6275 1526 E-mail shunongbpkueducn It localizes at chromosome three in rice genome at the contig 1300 (httpwwwsoftberrycomberryphtmltopic=gfindampprg=FGENESH GenBank accession number BMC Genomics 2003 422 Published 3 June 2003 Received 31 January 2003 Accepted 3 June 2003 This article is available from httpwwwbiomedcentralcom1471-2164422 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams12 Address 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA and 3Department of Biological Sciences Wichita State University Wichita Kansas USA Email Sreedhar Oduru - odurusreedharttuhscedu Janee L Campbell - janeecampbellttuhscedu SriTulasi Karri - phrskttuhscedu William J Hendry - williamhendrywichitaedu Shafiq A Khan - shafiqkhanttuhscedu Simon C Williams - simonwilliamsttuhscedu Corresponding author hellipTwo gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=HsapiensTAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Characterization of soybean genomic features by analysis of its expressed sequence tags

Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

Jin-Song Zhang Email jszhanggeneticsaccn

Shou-Yi Chen Email sychengeneticsaccn Phone +86-10-64886859 Fax +86-10-64873428

(1) Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

(2) Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China

prediction of these BAC-contig sequences was based on the gene-prediction program FGENSH (Arabidopsis matchFGENESH DDT Vol 7 No 11 (Suppl) 2002 S70-S76 wwwdrugdiscoverytodaycom Genome annotation techniques new approaches and challenges Alistair G Rust Emmanuel Mongin and Ewan Birney European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridge UK CB10 1SD tel +44 1223 494420 fax +44 1223 494468 e-mail birneyebiacuk Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Box 1 Useful human genome annotation and browser URLs Human genome browsers bull UCSC Human Genome Browser httpgenomecseucsceducgi-binhgGateway bull Softberry Genome Explorer httpwwwsoftberrycomberryphtmltopic=genomexp Ab initio gene prediction programs Ab initio gene predictors rely on the statistical qualities of exons rather than on homologies Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Celerarsquos pipeline) and GrailEXP [16] (ORNL) Proc Natl Acad Sci U S A 2002 August 20 99(17) 11423ndash11428 doi 101073pnas162276199 Neurobiology Identification of G protein-coupled receptors for Drosophila PRXamide peptides CCAP corazonin and AKH supports a theory of ligand-receptor coevolution Yoonseong Parkdagger Young-Joon Kim and Michael E AdamsdaggerDagger

Departments of Entomology and daggerCell Biology and Neuroscience 5429 Boyce Hall University of California Riverside CA 92521 Edited by Lutz Birnbaumer National Institutes of Health Research Triangle Park NC and approved June 14 2002 (received for review 2002 May 7) DaggerTo whom reprint requests should be addressed E-mail adamsmailucredu

This article has been corrected See Proc Natl Acad Sci U S A 2002 October 15 99(21) 13961b For each Drosophila GPCR prediction of gene structure was made in FGENESH (wwwsoftberrycom ref 21) by using about 20 kb of genomic sequence surrounding highly conserved regions particularly for 5 prime and 3 prime ends of ORFs Putative Drosophila GPCRs in the database were amplified by RT-PCR using primers based on gene predictions in the FGENESH gene finder (wwwsoftberrycom ref 21) 21 Salamov A A amp Solovyev V V (2000) Genome Res 10 516-522

prediction httpwwwsoftberrycomberry

Eukaryotic Cell October 2002 p 719-724 Vol 1 No 5 Isocitrate Lyase Is Essential for Pathogenicity of the Fungus Leptosphaeria maculans to Canola (Brassica napus) Alexander Idnurm and Barbara J Howlett School of Botany The University of Melbourne Melbourne Victoria 3010 Australia Received 17 June 2002 Accepted 29 July 2002 hellip The DNA sequence obtained was compared to those in the GenBank database by using BLAST (1) and genes were predicted by using FGENESH software (httpwwwsoftberrycom) and GENSCAN (wwwbionavigatorcom) Bio-Almanac GLE ToxExpress P Offering CLG Annotated sequence data The genes are identified with the FGENESH11 gene modeling software exclusively li- censed from Softberry Inc Automatic Analysis of 106 kb of contiguous DNA sequence from the D genome of wheat reveals high gene density hellip SA Brooks L Huang BS Gill JP Fellers trix In addition FGENESH 11 (httpwwwsoftberrycom) was used for CDS prediction with monocot genomic DNA parameters Both Molecular Genetics and Genomics DOI 101007s00438-002-0706-1 Issue Volume 267 Number 6 Date August 2002 Pages 713 - 720 Genome sequencing of a 239-kb region of rice chromosome 10L reveals a high frequency of gene duplication and a large chloroplast DNA insertion Q Yuan J Hill J Hsiao K Moffat S Ouyang Z Cheng J Jiang C Buell A1 The Institute for Genomic Research 9712 Medical Center Drive Rockville MD 20850 USA A2 Department of Horticulture University of Wisconsin Madison WI 53706 USA The sequences were analyzed with several gene prediction programs including FGENESH (httpwwwsoftberrycom) Genemarkhmm (rice matrix httpopalbiology

Genetics Vol 162 1389-1400 November 2002 Copyright copy 2002 Different Types and Rates of Genome Evolution Detected by Comparative Sequence Analysis of Orthologous Segments From Four Cereal Genomes Wusirika Ramakrishnaa Jorge Dubcovskyb Yong-Jin Park1a Carlos Bussob John Embertona Phillip SanMiguelc and Jeffrey L Bennetzena a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Department of Agronomy and Range Science University of California Davis California 95616 c Purdue University Genomics Core WSLR Purdue University West Lafayette Indiana 47907

Corresponding author Jeffrey L Bennetzen Hansen Bldg Purdue University West Lafayette IN 47907 maizebilbobiopurdueedu (E-mail)

FGENESH (httpwwwsoftberrycomnucleohtml) with the maize training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and

GeneMarkhmm (httpgenemarkbiologygatecheduGene Mark) Functional amp Integrative Genomics DOI 101007s10142-002-0055-5 Issue Volume 2 Numbers 1-2 Date May 2002 Pages 51 - 59 Genomic sequencing reveals gene content genomic organization and recombination relationships in barley Nils Rostoks Yong-Jin Park Wusirika Ramakrishna Jianxin Ma Arnis Druka Bryan A Shiloff Phillip J SanMiguel Zeyu Jiang Robert Brueggeman Devinder Sandhu Kulvinder Gill Jeffrey L Bennetzen Andris Kleinhofs A1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA A2 Department of Biological Sciences Purdue University West Lafayette IN 47907 USA A3 National Center for Genome Resources 2935 Rodeo Park Drive East Santa Fe NM 87505 USA A4 G302 Agronomy Hall Iowa State University Ames IA 50011-1010 USA A5 Department of Agronomy University of Nebraska Lincoln NE 68583 USA A6 School of Molecular Biosciences and Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA version 10 with maize parameters The FGENESH predictions were run at httpwwwsoftberrycom BAC genomic regions were defined Structural organization of the barley D-hordein locus in comparison with its orthologous regions of hellip YQ Gu OD Anderson CF Londeore X Kong RN hellip et al 1997) to search for additional genes In addition FGENESH (httpwwwsoftberrycomberryphtml) and GENESCAN (httpgenes

Published online before print June 20 2002 101073pnas142284999 PNAS | July 9 2002 | vol 99 | no 14 | 9328-9333 The barley stem rust-resistance gene Rpg1 is a novel disease-resistance gene with homology to receptor kinases R Brueggeman N Rostoks D Kudrna A Kilian F Han J Chensect A Druka B Steffensonpara and A Kleinhofs Department of Crop and Soil Sciences Washington State University Pullman WA 99164-6420 para Department of Plant Pathology 495 Borlaug Hall 1991 Upper Buford Circle St Paul MN 55108-6030 and School of Molecular Biosciences Washington State University Pullman WA 99164-4234 Communicated by Diter von Wettstein Washington State University Pullman WA May 13 2002 (received for review March 25 2002) The gene prediction programs GENSCAN (httpgenesmiteduGENSCANhtml) and FGENESH (httpwwwsoftberrycom) as well as NEURAL NETWORK PROMOTER

PREDICTION (httpwwwfruitflyorgseq_toolspromoterhtml) localized the putative transcription start site of the gene about 400 bp upstream of the translation start site Plant Physiol 2002 December 130(4) 1626ndash1635 doi 101104pp012179 Received July 30 2002 Accepted October 1 2002 Contiguous Genomic DNA Sequence Comprising the 19-kD Zein Gene Family from Maize1

Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway New Jersey 08854ndash8020 Corresponding author e-mail messingmbclrutgersedu fax 732ndash445ndash0072 Draft sequences generated from high-throughput DNA sequencing (phase II) were subjected to gene prediction programs with FGENESH (Softberry Inc Mount Kisco NY) The Plant Cell Vol 14 3213-3223 December 2002 Copyright copy 2002 Received July 22 2002 accepted September 26 2002 Structural Analysis of the Maize Rp1 Complex Reveals Numerous Sites and Unexpected Mechanisms of Local Rearrangement Wusirika Ramakrishnaa John Embertona Matthew Ogdena Phillip SanMiguelb and Jeffrey L Bennetzen1a a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 1 To whom correspondence should be addressed E-mail maizebilbobiopurdueedu fax 765-496-1496 FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCAN Plant Physiol 2002 December 130(4) 1728ndash1738

doi 101104pp014951 Comparative Sequence Analysis of the Sorghum Rph Region and the Maize Rp1 Resistance Gene Complex Wusirika Ramakrishna John Emberton Phillip SanMiguel Matthew Ogden Victor Llaca Joachim Messing and Jeffrey L Bennetzen

Department of Biological Sciences Purdue University West Lafayette Indiana 47907 (WR JE MO JLB) Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 (PSM) and Waksman Institute Rutgers University Piscataway New Jersey 08854 (VL JM) Corresponding author e-mail maizebilbobiopurdueedu fax 765ndash496ndash1496 Received September 19 2002 Accepted October 8 2002 hellipAnnotation and sequence analysis were performed as described earlier (Dubcovsky et al 2001 Song et al 2001 Ramakrishna et al 2002a ) FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and GeneMarkhmm (httpopalbiologygatecheduGeneMarkeukhmmcgi) Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four hellipYQ Gu D Coleman-Derr X Kong OD Anderson FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology 2004 - plantphysiolorg - plantphysiolorg - intlplantphysiolorg - ncbinlmnihgov - all 5 versions raquoA Genome-Wide Screen Identifies Genes Required for Centromeric Cohesion JJ Doyle J Denarie F Debelle JC Prome BB Amor hellip 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberryphtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Various programs Plant Molecular Biology Issue Volume 58 Number 3 Date June 2005 Pages 421 ndash 433 DOI 101007s11103-005-5702-5 OsPPR1 a pentatricopeptide repeat protein of rice is essential for the chloroplast biogenesis Kodiveri M Gothandam1 Eun-Sook Kim1 Hongjoo Cho1 and Yong-Yoon Chung1

(1) School of Life Sciences and Biotechnology Korea University Sungbuk-ku 136-701 Seoul Anam-Dong Korea nucleotide and amino acid sequences were analyzed by the Basic Local Alignment SearchTool (BLAST) and the Soft berry prog- rame (httpwwwsoftberrycom)

Plant Physiol February 2002 Vol 128 pp 336-340 wwwplantphysiolorgcgidoi101104pp010875 Received September 25 2001 returned for revision September 29 2001 accepted November 2 2001 Cellulose Synthase-Like Genes of Rice1

Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 The Rice Genome Research Program cDNA clones were of high quality all but one were viable and accurately annotated The one exception D22177 was chimeric containing OsCSLA2 at one end and a predicted DNA-binding protein at the other For all sequences the corresponding

proteins were deduced using gene prediction software from GeneMark (Atlanta httpopalbiologygatecheduGeneMark) and Softberry Inc (White Plains NY httpwwwsoftberrycom) and by manual alignment with the Arabidopsis Csl proteins and with each other IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated Silby et al Microbiology2004 150 518-520 MW Silby PB Rainey SB Levy Using SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of Plant Cell 2002 September 14(9) 2107ndash2119 Received March 25 2002 Accepted June 14 2002 Two Novel Fungal Virulence Genes Specifically Expressed in Appressoria of the Rice Blast Fungus

Chaoyang Xuea Gyungsoon Parka Woobong Choib Li Zhengc Ralph A Deanb and Jin-Rong Xua1

aDepartment of Botany and Plant Pathology Purdue University West Lafayette Indiana 47907 bDepartment of Plant Pathology North Carolina State University Raleigh North Carolina 27606 cSyngenta Agribusiness Biotechnology Research Inc Research Triangle Park North Carolina 27709 1To whom correspondence should be addressed E-mail xubtnypurdueedu fax 765-494-0363 Approximately 12- and 14-kb upstream sequences of GAS1 and GAS2 were sequenced and analyzed with several programs including TRES (wwwbioportalbicnusedusgtres) Expasy (wwwexpasyorg) and SoftBerry (wwwsoftberrycom) European Journal of Neuroscience January 2002 vol 15 no 1 pp 79-86(8) Characterizing CGI-94 (comparative gene identification-94) which is down-regulated in the hippocampus of early stage Alzheimers disease brain Heese K Nakayama T Hata R Masumura M Akatsu H Li F Nagai Y Yamamoto T Kosaka K Suemoto T Sawada T Additionally protein sequence analysis was performed using the following programs at ExPASy httpwwwexpasych softberry httpwwwsoftberrycomindex Journal of Cellular Biochemistry Volume 91 Issue 5 Pages 1030 - 1042 Published Online 25 Feb 2004 Copyright copy 2004 Wiley-Liss Inc A Wiley Company Received 4 September 2003 Accepted 21 November 2003 Characterizing the new transcription regulator protein p60TRP K Heese 1 T Yamada 1 H Akatsu 2 T Yamamoto 2 K Kosaka 2 Y Nagai 1 T Sawada 1

1BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan 2Choju Medical Institute Fukushimura Hospital 19-14 Aza-Yamanaka Noyori Toyohashi Aichi 441-8124 Japan email K Heese (heeseksilverocnnejp) Correspondence to K Heese BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan expasych) softberry httpwwwsoftberry comindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorg toolsaacomp Genome Research 14929-933 2004 ISSN 1088-9051 $500 The Ensembl Core Software Libraries Arne Stabenau1 Graham McVicker1 Craig Melsopp1 Glenn Proctor1 Michele Clamp2 and Ewan Birney13 1 EMBL European Bioinformatics Institute Wellcome Trust Genome Campus Hinxton CB10 1SD UK 2 The Broad Institute Cambridge Massachusetts 02141-2023 USA

2003 ) Finally there have been several commercial genome management products based on proprietary technology from Softberry Celera and Doubletwist Generation of T-DNA tagging lines with a bidirectional gene trap vector 2 and the establishment of hellip G An annotated in the public databases we undertook 274 annotation with the Softberry program (http 275 wwwsoftberrycomberryphtml) Functional clas- 276 Journal of Bacteriology January 2002 p 183-190 Vol 184 No 1 0021-919301$0400+0 DOI 101128JB1841183-1902002 Received 25 July 2001 Accepted 11 October 2001 Regulation of the acuF Gene Encoding Phosphoenolpyruvate Carboxykinase in the Filamentous Fungus Aspergillus nidulans Michael J Hynes Oliver W Draht and Meryl A Davis Department of Genetics University of Melbourne Parkville Victoria 3010 Australia The Protein Sequence Analysis program (httpwwwsoftberrycomproteinhtml) predicted a PEPCK (ATP) signature sequence between amino acids 275 and 290 Proteins Structure Function and GeneticsVolume 53 Issue S6 Pages 352 - 368 Supplement Fifth Meeting on the Critical Assessment of Techniques for Protein Structure Prediction Published Online 15 Oct 2003 Received 4 March 2003 Accepted 23 June 2003 Digital Object Identifier (DOI) 101002prot10543 Comp Modeling Assessment Assessment of homology-based predictions in CASP5 Anna Tramontano 1 Veronica Morea 2

1Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Rome Italy 2CNR Institute of Molecular Biology and Pathology University of Rome La Sapienza Rome Italy email Anna Tramontano (AnnaTramontanouniromalit) Correspondence to Anna Tramontano Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Ple Aldo Moro 5-00185 Rome Italy hellipPage 1 Assessment of Homology-Based Predictions in CASP5 Anna Tramontano 1 and Veronica Morea 2 1 Department of Biochemical Sciences American Journal of HematologyVolume 73 Issue 3 Pages 161 - 168 Published Online 20 Jun 2003 Received 23 October 2002 Accepted 15 April 2003 Digital Object Identifier (DOI) 101002ajh10358

Spectrum of thalassemia mutations and HbF levels in the heterozygous Moroccan population Wafaa Lemsaddek 1 Isabel Picanccedilo 2 Filomena Seuanes 2 Lahoucine Mahmal 3 Saacircd Benchekroun 3 Mohammed Khattab 4 Paulo Nogueira 5 Leonor Osoacuterio-Almeida 1

1Laboratoacuterio de Geneacutetica Molecular Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa Caparica Portugal 2Laboratoacuterio de Hematologia Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal 3Service Heacutemato-Oncologie Hocircpital 20 Ao t CHU Ibn Rochd Casablanca Morocco 4Service Heacutemato-Oncologie Peacutediatrique Hocircpital dEnfants CHU Rabat Morocco 5Observatoacuterio Nacional de Sauacutede Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal email Leonor Osoacuterio-Almeida (mlofctunlpt) Correspondence to Leonor Osoacuterio-Almeida Laboratoacuterio de Geneacutetica Molecular Secccedilatildeo Autoacutenoma de Biotecnologia Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa 2829-516 Caparica Portugal hellip Page 1 Spectrum of Thalassemia Mutations and HbF Levels in the Heterozygous Moroccan Population Wafaa Lemsaddek 1 Isabel Picanccedilo Lecture Notes in Computer Science Publisher Springer-Verlag Heidelberg ISSN 0302-9743 Subject Computer Science Volume 2812 2003 Title Algorithms in Bioinformatics Third International Workshop WABI 2003 Budapest Hungary September 15-20 2003 Proceedings Editors Gary Benson Roderic Page ISBN 3-540-20076-2 DOI 101007b13243 Chapter pp 124 - 138 Online Date December 2003 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron1 and Irena Rusu1

(1) IRIN Universiteacute de Nantes 2 Rue de la Houssiniegravere BP 92208 44322 Nantes Cedex 3 France Page 1 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron and Irena Rusu Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Received 23 October 2003 accepted 15 December 2003 Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes

H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Brain Aging Vol 2 No 3 2002 Page 9-22 Long-term Efficacy of Cholinesterase Inhibitors Serge Gauthier McGill Centre for Studies in Aging Quebec Canada Correspondence Dr Serge Gauthier FRCPC McGill Centre for Studies in Aging 6825 LaSalle Boulevard Verdun Quebec Canada H4H 1R3 Tel +1 514 766 2010 Fax +1 514 888 4050 Email sergegauthiermcgillca hellipAdditionally protein sequence analysis was performed using the following programs at the ExPASyndashwwwndashserver (httpwwwexpasych) softberry httpwwwsoftberrycomindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorgtoolsaacomp Plant Physiology December 2003 Vol 133 pp 2040ndash2047 Generation and Analysis of End Sequence Database for T-DNA Tagging Lines in Rice1 Suyoung An2 Sunhee Park2 Dong-Hoon Jeong Dong-Yeon Lee Hong-Gyu Kang Jung-Hwa Yu Junghe Hur Sung-Ryul Kim Young-Hea Kim Miok Lee Soonki Han Soo-Jin Kim Jungwon Yang Eunjoo Kim Soo Jin Wi Hoo Sun Chung Jong-Pil Hong Vitnary Choe Hak-Kyung Lee Jung-Hee Choi Jongmin Nam Seong-Ryong Kim Phun-Bum Park Ky Young Park Woo Taek Kim Sunghwa Choe Chin-Bum Lee and Gynheung An National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790ndash784 Korea (SA SP D-HJ D-YL H-GK J-HY JH S-RK Y-HK ML GA) Department of Life Science Sogang University Seoul 121ndash742 Korea (SH S-JK S-RK) Department of Genetic Engineering Suwon University Suwon 445ndash743 Korea (JY EK P-BP) Department of Biology Sunchon National University Sunchon 540ndash742 Korea (SJW KYP) Department of

Biology Yonsei University Seoul 120ndash749 Korea (HSC J-PH WTK) Department of Biology Seoul National University Seoul 151ndash747 Korea (VC SC) Department of Biology Dong-eui University Pusan 614ndash714 Korea (H-KL J-HC C-BL) and Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University 208 Mueller Laboratory University Park Pennsylvania 16802 (JN) hellipIf a particular sequence had not yet been annotated in the public database the sequence surrounding the insertion site was annotated using the Softberry program (httpwwwsoftberrycom) and the GeneMark program (httpopalbiologygatecheduGeneMark) Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie F Moehrlen ndash Page 1 Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie Frank Moumlhrlen Heidelberg 2002 Page 2 INAUGURAL-DISSERTATION zur heidiubuni-heidelbergde

  • FGENES
  • FGENESH
  • FGENESH++
    • Genome Research 15566-576 2005
    • ECgene Genome-based EST clustering and gene modeling for alternative splicing
    • Genome Research 14685-692 2004
    • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
    • Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes
    • Genome Research
      • Computing Center Academia Sinica Taipei 11529 Taiwan
        • FGENESH+
          • Genome Research 14685-692 2004
          • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
            • Computing Center Academia Sinica Taipei 11529 Taiwan
                • FGENESB
                  • Proteorhodopsin genes are distributed among divergent marine bacterial taxa
                  • Different SAR86 subgroups harbour divergent proteorhodopsins
                    • FGENESV
                      • Genome Organization of the SARS-CoV
                        • FGENES-M
                        • BESTORF
                        • PROTCOMP
                          • A proteomic study of the arabidopsis nuclear matrix
                          • Gene prediction in eukaryota
                            • BPROM
                            • SPLICEDB
                              • The evolving roles of alternative splicing
                              • SpliceDB database of canonical and non-canonical mammalian splice sites
                                • SCAN2
                                  • PromH promoters identification using orthologous genomic sequences
                                    • PDISORDER
                                    • SPL
                                    • NSITE
                                    • TSSP
                                    • PLANTPROM
                                      • Plant promoter prediction with confidence estimation
                                      • PlantProm a database of plant promoter sequences
                                        • PROMH
                                        • Other FGENESH
                                          • TAG Theoretical and Applied Genetics
                                            • MIPS analysis and annotation of proteins from whole genomes
                                              • Gene expression of a gene family in maize based on noncollinear haplotypes
                                              • TAG Theoretical and Applied Genetics
                                              • Genome annotation techniques new approaches and challenges
                                                • Various programs
                                                  • Characterizing the new transcription regulator protein p60TRP
                                                  • The Ensembl Core Software Libraries
                                                  • Lecture Notes in Computer Science
                                                  • Long-term Efficacy of Cholinesterase Inhibitors
Page 9: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)

and Mathematical Sciences University of Aarhus 5 Aros Applied Biotechnology Aarhus Denmark and 6 Eos BiotechnologyProtein Design Labs Fremont California Requests for reprints Torben F Oslashrntoft Molecular Diagnostic Laboratory Department of Clinical Biochemistry Aarhus University Hospital Skejby DK-8200 Aarhus N Denmark Phone 45-89495100 Fax 45-89496018 E-mail orntoftkiaudk array comprising 59619 probe sets representing 46000 unique sequences including known genes expressed sequence tag clusters and FGENESH-predicted exons BMC Evolutionary Biology 2005 51 doi1011861471-2148-5-1 Research article The WRKY transcription factor superfamily its origin in eukaryotes and expansion in plants Yuanji Zhang and Liangjiang Wang Address Plant Biology Division The Samuel Roberts Noble Foundation Ardmore OK 73402 USA Email Yuanji Zhang - yjzhangnobleorg Liangjiang Wang - Kevinlwangaolcom Corresponding author Despite minor differences in the gene structure prediction both gene prediction programs FGENESH and GENSCAN agree on the major features of the protein Nature Immunology 2005 v6 n3 The immunoglobulin heavy-chain locus in zebrafish identification and expression of a previously hellip N Danilova J Bussmann K Jekosch LA Steiner - - naturecom Nature Immunology Full text access provided to Googlebot Access by Web Services PLoS Biol 2005 June 3(6) e181 Published online 2005 May 24 doi 101371journalpbio0030181 RAG1 Core and V(D)J Recombination Signal Sequences Were Derived from Transib Transposons Vladimir V Kapitonov 1 and Jerzy Jurka 1

1Genetic Information Research Institute Mountain View California United States of America David Nemazee Academic EditorScripps Research Institute United States of America

Corresponding author Vladimir V Kapitonov vladimirgirinstorg Jerzy Jurka jurkagirinstorg Using FGENESH [33] we detected that the RAG1 corendashlike open reading frame (ORF) in the contig 29068 forms a terminal exon (positions 1154ndash2947) of an Genetics Published Articles Ahead of Print published on February 16 2005 as 101534genetics104036327 Identification and Characterization of Regions of the Rice Genome Associated with Broad- Spectrum Quantitative Disease Resistance Randall J Wisser Qi Sundagger Scot H Hulbertsect Stephen Kresovich and Rebecca J Nelsondaggerdagger 1 Department of Plant Breeding and Genetics Institute for Genomic Diversity Cornell University Ithaca New York 14853 daggerComputational Biology Service Unit Cornell Theory

Center Cornell University Ithaca New York 14853 sectDepartment of Plant Pathology Kansas State University Manhattan Kansas 66506 and daggerdaggerDepartment of Plant Pathology Cornell University Ithaca New York 14853 1Corresponding author Rebecca J Nelson Department of Plant Pathology Cornell University 321 Plant Science Ithaca NY 14853 Email rjn7cornelledu GENSCAN (B URGE and K ARLIN 1997) and FGENESH (S ALAMOV and S OLOVYEV 2001) to predict open reading frames Further searches against Plant Physiology January 2005 Vol 137 pp 176-189 Annotations and Functional Analyses of the Rice WRKY Gene Superfamily Reveal Positive and Negative Regulators of Abscisic Acid Signaling in Aleurone Cells1[w]

Zhen Xie Zhong-Lin Zhang Xiaolu Zou Jie Huang Paul Ruas Daniel Thompson and Qingxi J Shen

2 23

Department of Biological Sciences University of Nevada Las Vegas Nevada 891541 This work was supported by the US Department of Agriculture (grant no 02ndash35301ndash12066) by the National Institutes of Health (Biomedical Research Infrastructure Network seed grant no P20 RR16464) and by the University of Las Vegas Nevada (start-up funds to QJS) XZ was supported by a National Science Foundation Experimental Program to Stimulate Competitive

Research (EPSCoR) Integrative Approaches to Abiotic Stress (EPSndash0132556) graduate assistantship and Z-LZ by a National Science Foundation EPSCoR Advanced Computing in Environmental Sciences postdoctoral fellowship 2 These authors contributed equally to the paper 3 Present address Department of Plant and Microbial Biology University of California Berkeley CA 94720 and Plant Gene Expression Center US Department of Agriculture Albany CA 94710 [w] The online version of this article contains Web-only data First of all three genes (OsWRKY41 -43 and -44) were reannotated using FGENESH (wwwsoftberrycom) because the first introns of these genes were too small PLoS Biol 3(6) e181 (2005) RAG1 Core and V(D)J Recombination Signal Sequences Were Derived from Transib Transposons Vladimir V Kapitonov1 Jerzy Jurka1 1 Genetic Information Research Institute Mountain View California United States of America To whom correspondence should be addressed E-mail vladimirgirinstorg (VVK) E-mail jurkagirinstorg (JJ) Using FGENESH [33] we detected that the RAG1 corendashlike open reading frame (ORF) in the contig 29068 forms a terminal exon (positions 1154ndash2947) of an Plant and Cell Physiology 2005 46(1)3-13 doi101093pcppci503 From Mapping to Sequencing Post-sequencing and Beyond Takuji Sasaki Takashi Matsumoto Baltazar A Antonio and Yoshiaki Nagamura 1

National Institute of Agrobiological Sciences 2-1-2 Kannondai Tsukuba Ibaraki 305-8602 Japan 1 Corresponding author E-mail tsasakiniasaffrcgojp

The gene predictions by programs such as Genescan (Burge and Karlin 1997 ) FGENESH [see Appendix 1 (4)] and Genemark [see Appendix 1 (5)] BLAST (Altschul et Improving the nutritional value of Golden Rice through increased pro-vitamin A content JA Paine CA Shipton S Chaggar RM Howells MJ hellip - Nature Biotechnology 2005 - naturecom Arabidopsis thaliana psy and rice psy (AY024351) genes identified genomic sequences of similarity in which genes were predicted using FGENESH algorithm with Genetics Published Articles Ahead of Print published on January 16 2005 as 101534genetics104035543 THE GENETIC BASIS FOR INFLORESCENCE VARIATION BETWEEN FOXTAIL AND GREEN MILLET (POACEAE) Andrew N Doust Katrien M Devosdagger1 Mike D Gadberry Mike D Galedagger amp Elizabeth A Kellogg University of Missouri-St Louis Department of Biology One University Boulevard St Louis MO 63121 USA daggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH UK Current address University of Georgia-Athens Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building Athens GA 30602 USA 1 Each of these contigs was scanned using FgeneSH (S ALAMOV and S OLOVYEV 2000) and open reading frames (ORFs) were translated and PLoS Biol 3(1) e13 January 2005

Sorghum Genome Sequencing by Methylation Filtration Joseph A Bedell1 Muhammad A Budiman2 Andrew Nunberg1 Robert W Citek1 Dan Robbins1 Joshua Jones2 Elizabeth Flick2 Theresa Rohlfing3 Jason Fries3 Kourtney Bradford3 Jennifer McMenamy3 Michael Smith4 Heather Holeman4 Bruce A Roe5 Graham Wiley5 Ian F Korf6 Pablo D Rabinowicz7 Nathan Lakey8 W Richard McCombie9 Jeffrey A Jeddeloh4 Robert A Martienssen9 1 Bioinformatics Orion Genomics Saint Louis Missouri United States of America 2 Library Construction Orion Genomics Saint Louis Missouri United States of America 3 Sequencing Orion Genomics Saint Louis Missouri United States of America 4 Biomarkers Orion Genomics Saint Louis Missouri United States of America 5 Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma United States of America 6 Genome Center University of California Davis California United States of America 7 The Institute for Genomic Research Rockville Maryland United States of America 8 Business Orion Genomics Saint Louis Missouri United States of America 9 Cold Spring Harbor Laboratory Cold Spring Harbor New York United States of America additional parameters wordmask=seg lcmask M=1 N=ndash1 Q=3 R=3 kap E=1e-10 hspmax=0 To look for potentially novel genes we used Fgenesh (httpwww BMC Genomics 2005 611 doi1011861471-2164-6-11 FAM20 an evolutionarily conserved family of secreted proteins expressed in hematopoietic cells Demet Nalbant1 Hyewon Youn1 3 S Isil Nalbant1 Savitha Sharma1 Everardo Cobos2 3 Elmus G Beale1 Yang Du1 and Simon C Williams1 3 1Department of Cell Biology and Biochemistry Texas Tech University Health Sciences Center

Lubbock Texas 79430 USA 2Department of Internal Medicine Texas Tech University Health Sciences Center Lubbock Texas 79430 USA 3Southwest Cancer Center at University Medical Center Lubbock Texas 79430 USA These results were compared against genes assembled by two gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmit Plant Physiology July 2005 Vol 138 pp 1205-1215 Complex Organization and Evolution of the Tomato Pericentromeric Region at the FER Gene Locus1[w] Romain Guyot Xudong Cheng Yan Su Zhukuan Cheng Edith Schlagenhauf Beat Keller and Hong-Qing Ling

2 2

Institute of Plant Biology University of Zurich 8008 Zurich Switzerland (RG ES BK H-QL) and Institute of Genetics and Developmental Biology Chinese Academy of Sciences Chaoyang District Beijing 100101 China (XC YS ZC H-QL) Putative genes were determined by a combination of coding region prediction software (GENSCAN FGENESH and MZEF with Arabidopsis andor monocot matrix J Gen Virol 86 (2005) 973-983 DOI 101099vir080833-0 Cloning characterization and analysis by RNA interference of various genes of the Chelonus inanitus polydnavirus Marianne Bonvin Dorothee Marti Stefan Wyder Dejan Kojic Marc Annaheim and Beatrice Lanzrein Institute of Cell Biology University of Berne Baltzerstrasse 4 CH-3012 Bern Switzerland Correspondence Beatrice Lanzrein beatricelanzreinizbunibech 12g1forw (5-GAGTCCATGCCGAATGTCAC-3) and 12g1rev (5-CTTCTTGCACAGCGACGAAC-3) were set to amplify the middle region of 12g1 as predicted with FGENESH 10 and The Plant Cell 17343-360 (2005) Evolution of DNA Sequence Nonhomologies among Maize Inbreds Stephan Brunner Kevin Fengler Michele Morgante Scott Tingey and Antoni Rafalski a1 a b a a

a DuPont Crop Genetics Research Wilmington Delaware 19880-353 b Universitaacute degli Studi di Udine Dipartimento di Scienze Agrarie ed Ambientali 33100 Udine Italy 1 To whom correspondence should be addressed E-mail stephanbrunnercgrdupontcom fax 302-695-2726 PNAS | February 1 2005 | vol 102 | no 5 | 1566-1571 Published online before print January 24 2005 101073pnas0409421102 A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome

Mark Yandell Adina M Bailey Sima Misra ShengQiang Shu Colin Wiel Martha Evans-Holm Susan E Celniker and Gerald M Rubin para para Howard Hughes Medical Institute and Department of Molecular and Cell Biology University of California Life Sciences Addition Berkeley CA 94720-3200 and paraDepartment of Genome Sciences Lawrence Berkeley National Laboratory One Cyclotron Road Mailstop 64-121 Berkeley CA 94720 genes based on a microarray-based approach that involved hybridizing randomly primed cDNA against probes corresponding to a large set of FGENESH predictions Insect Molecular Biology Volume 14 Issue 2 Page 113 - 119 April 2005 doi101111j1365-2583200400536x Detection and analysis of alternative splicing in the silkworm by aligning expressed sequence tags with the genomic sequence X-F Zha Q-Y Xia P Zhao J Li J Duan Z-L Wang J-F Qian and Z-H Xiang Correspondence Dr Qing-You Xia The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China Tel +86 23 68250748 fax +86 23 68251128 e-mail xiaqyswaucqcn previously predicted silkworm genes in the genomic sequences by BGF a newly developed program based on GENSCAN (Burge amp Karlin 1997) and Fgenesh (Salamov amp Microbiology 151 (2005) 2199-2207 DOI 101099mic027962-0 Overproduction purification and characterization of FtmPT1 a brevianamide F prenyltransferase from Aspergillus fumigatus Alexander Grundmann and Shu-Ming Li Pharmazeutische Biologie Pharmazeutisches Institut Eberhard-Karls-Universitaumlt Tuumlbingen Auf der Morgenstelle 8 72076 Tuumlbingen Germany Correspondence Shu-Ming Li shumingliuni-tuebingende FGENESH (Softberry Inc httpwwwsoftberrycomberryphtml) and the DNASIS software package (version 21 Hitachi Software Engineering) were used for

FGENESH++ Am J Hum Genet 76652-662 2005 Position Effects Due to Chromosome Breakpoints that Map 900 Kb Upstream and 13 Mb

Downstream of SOX9 in Two Patients with Campomelic Dysplasia

Gopalrao V N Velagaleti12 Gabriel A Bien-Willner3 Jill K Northup1 Lillian H Lockhart2

Judy C Hawkins2 Syed M Jalal6 Marjorie Withers3 James R Lupski345 and

Pawel Stankiewicz3 Departments of 1Pathology and 2Pediatrics University of Texas Medical Branch Galveston

Departments of 3Molecular and Human Genetics and 4Pediatrics Baylor College of Medicine and

5Texas Childrens Hospital Houston and 6Department of Laboratory Medicine and Pathology Mayo Clinic Rochester MN hellipIn an effort to identify possible transcripts that may be responsible for the CD phenotype we

used several gene-prediction programs and identified seven hypothetical transcripts in the region

that spans 100 kb in either direction from the breakpoint on chromosome 17 Ecgenes H17C123061 and H17C123081 SGP genes Chr17_15381 and Ch17_15391 Fgenesh++ gene C17001650 and Genscan genes NT_01064144 and NT_01064145hellip Genome Research 15566-576 2005 ECgene Genome-based EST clustering and gene modeling for alternative splicing Namshin Kim Seokmin Shin and Sanghyuk Lee12 2 13 1 Division of Molecular Life Sciences Ewha Womans University Seoul 120-750 Korea 2 School of Chemistry Seoul National University Seoul 151-747 Korea hellipthe structure of full-length mRNA can be inferred by examining the flanking genomic region especially with the aid of ab initio gene predicting programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh++ (Salamov and Solovyev 2000 )hellip Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA hellip we built a three-way synteny map based on chains of Fgenesh++-predicted (Solovyev 2002 ) exons rather than whole genes hellip Genome Research 14539-548 2004 Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of

Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA hellipThe gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom )hellip Nucleic Acids Research 2003 Vol 31 No 1 207-211 copy 2003 Oxford University Press The PEDANT genome database Dmitrij Frishman Martin Mokrejs Denis Kosykh Gabi Kastenmuumlller Grigory Kolesov Igor Zubrzycki Christian Gruber Birgitta Geier Andreas Kaps Kaj Albermann Andreas Volz Christian Wagner Matthias Fellenberg Klaus Heumann and Hans-Werner Mewes

1 1 1 1 1

1 2 2 2 2

2 2 2 2 13

1 Institute for Bioinformatics GSF - National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 85764 Neueherberg Germany 2 Biomax Informatics AG Lochhamer Straszlige 11 82152 Martinsried Germany 3 Department of Genome-oriented Bioinformatics Wissenschaftszentrum Weihenstephan Technische Universitaumlt Muumlnchen 85350 Freising Germany To whom correspondence should be addressed Tel +49 89 31874201 Fax +49 89 31873585 Email dfrishmangsfde The mouse database contains 20 chromosome contigs with 37 793 genes predicted using the Fgenesh++ software (wwwsoftberrycom) Reprint from Daily Biotech Updates www genengnewscom Vol 22 No 17 October 1 2002 DrugDiscovery Tech NoteAn Enhanced Human-Genome Database Transforming Raw Human Sequence Data Into Useful Information Christine Schuumlller PhD and Andreas Fritz PhD The Softberry analysis results for which Biomax has the exclusive world-wide commercial license contain approximately 40000 genes which agrees well with predictions of the total number of human genes (according to the International Human Genome Sequencing Consortium or IHGSC) hellip For example 50 of the genes in the Biomax Human Genome Database are not found in the Ensembl database These genes (identified by FGENESH++ and Biomax and not found in Ensembl database) comprise the following 6 of genes classified as known genes 50 classified as having some similarity to known genes and 90 of the genes not having similarity to known genes

For human genome applications the FGENESH++ software was first used to map known human genes using sequences available from the Reference Sequence (RefSeq) Project at the Nation al Center for Biotechnology Information (NCBI Bethesda MD wwwncbinlm nihgovLocusLinkrefseqhtml) REFERENCES 1 Salamov AA and Solovyev VVAb initio gene finding in Drosophila genomic DNA Genome Res 10 391ndash7 (2000) Genome Research 14539-548 2004 ISSN 1088-9051 $500 Letter Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA The gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom ) Published online before print June 12 2003 101101gr529803 Genome Research 131765-1774 2003 ISSN 1088-9051 $500 Identification of Promoter Regions in the Human Genome by Using a Retroviral Plasmid Library-Based Functional Reporter Gene Assay Shirin Khambata-Ford15 Yueyi Liu2 Christopher Gleason1 Mark Dickson3 Russ B Altman2 Serafim Batzoglou4 and Richard M Myers136 1 Department of Genetics Stanford University School of Medicine Stanford California 94305 USA 2 Stanford Medical Informatics Stanford University School of Medicine Stanford California 94305 USA 3 Stanford Human Genome Center Stanford University School of Medicine Stanford California 94305 USA 4 Department of Computer Science Stanford University Stanford California 94305 USA hellipOf 858 sequences 9 of GFP+ low clones and 8 of GFP+ high clones aligned to the 2-kb

segment upstream of the transcription start site of a predicted gene in at least two of four data sets of predicted genes from Genscan Ensembl Softberry (Fgenesh++) and Acembly (category

B in Table 1) Cell Vol 110 521ndash529 August 23 2002 Copyright 2002 by Cell Press HIV-1 Integration in the Human Genome Favors Active Genes and Local Hotspots

Astrid RW Schrouml der1 Paul Shinn2 Huaming Chen2 Charles Berry3 Joseph R Ecker2 and Frederic Bushman14 1Infectious Disease Laboratory 2Genomic Analysis Laboratory The Salk Institute 10010 North Torrey Pines Road La Jolla California 92037 3Department of FamilyPreventive Medicine School of Medicine University of California San Diego San Diego California 92093 hellipAn integration target sequence was scored as a part of a transcrip-tion unit if it was (1) a member of the Refseq set of well-studied genes (httpwwwncbinlmnihgovLocusLinkrefseqhtml) or (2) if it was predicted to be a transcription unit by the ENSEMBLE (httpwwwensemblorg) or Fgenesh++ (httpwwwsoftberrycomHelpfgeneshplus2htm) programs and if that assignment was supported by mRNA or spliced EST sequence evidence Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation

FGENESH+ Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA Fgenesh+ gene prediction is conducted on sequences with protein homology Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation Annual Review of Genomics and Human Genetics Vol 3 293-310 (Volume publication date September 2002) (doi101146annurevgenom3030502101529) DATABASES AND TOOLS FOR BROWSING GENOMES Ewan Birney 1 Michele Clamp and 2 Tim Hubbard2 1European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom e-mail birneyebiacuk 2Wellcome Trust Sanger Institute Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom michelesangeracuk thsangeracuk

Another predicted gene track on the UCSC browser comes from Softberry ( http wwwsoftberrycom ) and uses a program Fgenesh+ which is based on HMMs and

FGENESB Appl Environ Microbiol 2004 April 70(4) 2332ndash2341 Oxygen-Controlled Bacterial Growth in the Sponge Suberites domuncula toward a Molecular Understanding of the Symbiotic Relationships between Sponge and Bacteriadagger

Werner E G Muumlller Vladislav A Grebenjuk Narsinh L Thakur Archana N Thakur Renato Batel Anatoli Krasko Isabel M Muumlller and Hans J Breter Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz D-55099 Mainz Germany Corresponding author Mailing address Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz Duesbergweg 6 55099 Mainz Germany Phone 6131-3925910 Fax 6131-3925243 E-mail wmuellermailuni-mainzde For genes and potential promoter prediction we used the FGENESB-PatternMarkov chain-based bacterial operon and gene prediction program from the SoftBerry Journal of Theoretical Biology 230 (2004) 133ndash144 Computational prediction of conserved operons and phylogenetic footprinting of transcription regulatory elements in the metal-reducing bacterial family Geobacteraceae Bin Yana Barbara A Metheґ b Derek R Lovleyc Julia Krushkala aDepartment of Preventive Medicine Center of Genomics and Bioinformatics University of Tennesee Health Science Center 66 N Pauline St Ste 633 Memphis TN 38163 USA bThe Institute for Genomic Research Rockville MD USA cDepartment of Microbiology Morrill Science Center IV North University of Massachusetts 639 North Pleasant Str Amherst MA 01003 USA the conserved nature of the operons 2 Operons in Geobacter sulfurreducens were predicted ab initio by the public version of program FGENESB (V Solovyev and V Molecular Microbiology Volume 52 Issue 6 Page 1579 -1596 June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Copyright copy 2003 The National Academy of Sciences Proc Natl Acad Sci U S A 2003 October 28 100(22) 12830ndash12835 doi 101073pnas2133554100 Published online 2003 October 17 Evolution Proteorhodopsin genes are distributed among divergent marine bacterial taxa Joseacute R de la TorredaggerDagger Lynne M Christiansondagger Oded Beacutejagravedaggersect Marcelino T Suzukidaggerpara David M Karl John Heidelberg and Edward F DeLongdaggerdaggerdagger

daggerMonterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 sectDepartment of Biology Technion-Israel Institute of Technology Haifa 32000 Israel

paraChesapeake Biological Laboratory University of Maryland Solomons MD 20688 Department of Oceanography University of Hawaii Manoa HI 96822 and Institute for Genomic Research Rockville MD 20850 Edited by Sallie W Chisholm Massachusetts Institute of Technology Cambridge MA and approved August 21 2003 (received for review 2003 June 10) DaggerPresent address Department of Civil and Environmental Engineering University of Washington Seattle WA 98195 daggerdagger To whom correspondence should be addressed E-mail delongmbariorg hellip Analysis of the potential genes and protein-coding regions was performed by using a combination of the BLAST (11) GLIMMER 202 (TIGR) (12 13) FGENESB (Softberry Mount Kisco NY) and ARTEMIS (Sanger Center Cambridge University UK) (14) software packages Environmental Microbiology September 2004 vol 6 no 9 pp 903-910(8) DOI 101111j1462-2920200400676x Different SAR86 subgroups harbour divergent proteorhodopsins Gazalah Sabehi1 Oded Beacutejagrave1 Marcelino T Suzuki2 Christina M Preston3 Edward F DeLong4

Affiliations 1 Department of Biology Technion-Israel Institute of Technology Haifa 32000 Israel 2 Chesapeake Biological Laboratory University of Maryland Center for Environmental Sciences Solomons MD 20688 USA 3 Monterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 USA 4 Massachusetts Institute of Technology Cambridge MA 02139 USA

program FGENESB (Softberry) and the annotation was subsequently refined and curated manually using ARTEMIS (Sanger Center) Fig

FGENESV Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song Qi Wei Qin Jin Qiu Can Hua Huang Fan Wang and Choy Leong Hew1 2 1 1 1 1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2

A total of 162 ORFs predicted by the FGENESV program (available through httpwwwsoftberrycom) supplemented with Vector NTI suite 71 are indicated doi101023BVIRU000002577148128f8 Virus Genes 28 (3) 239-246 April 2004 Article ID 5269250 Complete Nucleotide Sequence of a Strawberry Isolate of Beet Pseudoyellows Virus Ioannis E Tzanetakis Molecular and Cellular Biology Program Department of Botany and Plant Pathology Oregon State University Corvallis 97331 USA Robert R Martin Horticultural Crops Research Laboratory USDA-ARS Corvallis OR 97330 USA E-mail martinrrscienceoregonstateedu httpwwwncbinlmnih govgorfgorfhtml) and the gene finder in viruses at httpwwwsoftberrycom The amino acid comparisons Geno Prot amp Bioinfo Vol 1 No 3 August 2003 226-235 Genome Organization of the SARS-CoV Jing Xu1 Jianfei Hu21 Jing Wang21 Yujun Han1 Yongwu Hu13 Jie Wen1 Yan Li1 Jia Ji1 Jia Ye14 Zizhang Zhang5 Wei Wei4 Songgang Li12 Jun Wang1 Jian Wang14 Jun Yu14 and Huanming Yang14 1 Beijing Genomics Institute Chinese Academy of Sciences Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3Wenzhou Medical College Wenzhou 325003 China 4 James D Watson Institute of Genome Sciences Zhijiang Campus Zhejiang University and Hangzhou Genomics Institute Hangzhou 310008 China 5 College of Materials Science and Chemical Engineering Yuquan Campus Zhejiang University Hangzhou 310027 China These authors contributed equally to this work Corresponding authors E-mail junyugenomicsorgcn yanghmgenomicsorgcn hellipFGENESV a program for gene prediction provided by Softberry Inc (Mount Kisco USA) through a web-based interface has been specially modimacred and trained with parameters for virus (httpwwwsoftberrycomberryphtmltopic= gfindv) hellip

The hypothetical minus sense ORF iden-timacred by FGENESV (from 48 to 203 nt on the minus strand or 29523 to 29678 nt on the plus strand) may be fake but we should not absolutely deny the prob-ability of the existence of minus ORFs hellipFurthermore we employed FGENESV to explore the sequences of MHV (NC 001846 in NCBI) and AIBV (NC 001451 in NCBI) and compared the re-sults with their previous annotations respectively Rapport de stage de DEA Juin 2003 Analyse du geacutenome du virus de lrsquoarcheacutee Pyrococcus abyssi (PAV1) ROUAULT Karen Laboratoire de Microbiologie et Biotechnologie des Extrecircmophiles IFREMER- Centre de Brest et Equipe Microbiologie LEMAR ndash Institut Universitaire Europeacuteen de la Mer [14] FGENESV httpwwwsoftberrycomberry phtmltopic=gfindv Virus ( gt10 kb) Modegraveles de Markov Forme du geacutenome Code geacuteneacutetique [40]

FGENES-M BMC Bioinformatics 2005 6 25 Published online 2005 February 10 doi 1011861471-2105-6-25 Integrating alternative splicing detection into gene prediction Sylvain Foissac 1 and Thomas Schiex11Uniteacute de Biomeacutetrie et Intelligence Artificielle INRA 31326 Castanet Tolosan France

Corresponding author Sylvain Foissac foissactoulouseinrafr Thomas Schiex tschiextoulouseinrafrReceived July 27 2004 Accepted February 10 2005 This approach has been applied eg in HMMgene or in FGENES-M (unpub)hellip DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39 - 43 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom) DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39-43 DOI 1010801042517032000160189 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom)

BESTORF Eukaryotic Cell March 2005 p 526-535 Vol 4 No 3 1535-977805$0800+0 doi101128EC43526-5352005 Sex-Specific Homeodomain Proteins Sxi1 and Sxi2a Coordinately Regulate Sexual Development in Cryptococcus neoformans Christina M Hull1 Marie-Josee Boily and Joseph Heitman1 12 Department of Molecular Genetics and Microbiology1 the Howard Hughes Medical Institute Duke University Medical Center Durham North Carolina2 Received 14 January 2005 Accepted 14 January 2005 Corresponding author Mailing address 322 CARL Building Box 3546 Department of Molecular Genetics and Microbiology 100 Research Dr Duke University Medical Center Durham NC 27710 Phone (919) 684-2824 Fax (919) 684-5458 E-mail heitm001dukeeduPresent address Departments of Biomolecular Chemistry and Medical Microbiology amp

Immunology University of WisconsinmdashMadison Madison WI 53706 Sequence manipulations Splice predictions of candidate gene sequences for SXI2a were facilitated with a Softberry algorithm (wwwsoftberrycom) hellipWe utilized the BESTORF gene prediction algorithm from Softberry Inc to electronically produce predicted spliced cDNA products encoded by a 10-kb regionhellip

PROTCOMP Genes and Immunity 2005 v5 n4 - naturecom Immune response in silico(IRIS) immune-specific genes identified from a compendium of microarray hellip AR Abbas D Baldwin Y Ma W Ouyang A Gurney F hellip The Protcomp algorithm (Softberry Inc) predicts for the 1589 IRIS genes with ORFs that 24 of the encoded proteins are in the plasma membrane 13 are MPMI Vol 17 No 7 2004 pp 789ndash797 Publication no M-2004-0426-01R copy 2004 The American Phytopathological Society Lotus japonicus LjKUP Is Induced Late During Nodule Development and Encodes a Potassium Transporter of the Plasma Membrane Guilhem Desbrosses Claudia Kopka Thomas Ott and Michael K Udvardi Max Planck Institute of Molecular Plant Physiology Am Muumlhlenberg 1 14476 Golm Germany Submitted 3 November 2003 Accepted 13 February 2004 hellipBoth PSORT and Protcomp predicted a PM location for LjKUPhellip Planta DOI 101007s00425-003-1182-5 Issue Volume 218 Number 6 Date April 2004 Pages 965 - 975 Biochemical and immunological characterization of pea nuclear intermediate filament proteins Sonal S D Blumenthal1 Gregory B Clark1 and Stanley J Roux1

(1) School of Biological Sciences Section of Molecular Cell and Developmental Biology The University of Texas Austin TX 78712 USA Stanley J Roux Email srouxutsccutexasedu html) BCM Search Launcher (Protein structure prediction http searchlauncher bcmtmcedu) SoftBerry (Protein subcellular localization Comparative and Functional Genomics Volume 5 Issue 4 Pages 342 - 353Published Online 20 May 2004 Research Paper Investigation into the use of C- and N-terminal GFP fusion proteins for subcellular localization studies using reverse transfection microarrays Ella Palmer Tom Freeman

MRC Rosalind Franklin Centre for Genomics Research (formerly the HGMP-Resource Centre) Genome Campus Hinxton Cambridge CB10 1SB UK email Tom Freeman (tfreemanrfcgrmrcacuk) Correspondence to Tom Freeman RFCGR Hinxton CambridgeCB10 1SB UK

ProtComp version 4 (Softberry) combines results with proteins of known subcellular localization and assumed subcellular localization (based on theoret- ical Plant Physiol2004 134 286-295 RHM2 Is Involved in Mucilage Pectin Synthesis and Is Required for the Development Usadel et al Tentative subcellular localization prediction by TargetP (Emanuelsson et al 2000 ) or ProtComp (httpwwwsoftberrycom) a prediction software trained on Journal of Cellular BiochemistryVolume 90 Issue 2 Pages 361 - 378Published Online 3 Sep 2003 A proteomic study of the arabidopsis nuclear matrix Tomasz T Calikowski 1 3 Tea Meulia 2 Iris Meier 1

1Department of Plant Biology and Plant Biotechnology Center Ohio State University Columbus Ohio 43210 2Molecular and Cellular Imaging Center Ohio Agricultural and Research Development Center Ohio State University Columbus Ohio 43210 3Institute of Biochemistry and Biophysics Polish Academy of Sciences UL Pawinskiego 5A 02-106 Warszawa Poland email Iris Meier (meier56osuedu)Correspondence to Iris Meier Department of Plant Biology and Plant Biotechnology Center Ohio State University 244 Rightmire Hall 1060 Carmack Rd Columbus OH 43210 For prediction of subcellular localization ProtComp 4 (Softberry Inc Mount Kisco NY httpwwwsoftberrycomberryphtmltopicfrac14 proteinloc) PSORT v64 Cellular Molecular Life Sciences 2003 in press Automatic prediction of protein function Burkhard Rost 1 2 3 Jinfeng Liu 1 3 4 Rajesh Nair 1 5 Kazimierz O Wrzeszczynski 1 and Yanay Ofran 16

1 CUBIC Dept of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 2 Columbia University Center for Computational Biology and Bioinformatics (C2B2) Russ Berrie Pavilion 1150 St Nicholas Avenue New York NY 10032 USA

3 North East Structural Genomics Consortium (NESG) Department of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 4 Dept of Pharmacology Columbia Univ 630 West 168th Street New York NY 10032 USA 5 Dept of Physics Columbia Univ 538 West 120th Street New York NY 10027 USA 6 Dept of Medical Informatics Columbia Univ 630 West 168th Street New York NY 10032 USA Corresponding author cubiccubicbioccolumbiaedu URL httpcubicbioccolumbiaedu Tel +1-212-305-4018 fax +1-212-305-7932

genomelocalize ProtComp predict localization for plants httpwwwsoftberrycomberryphtmltopic=proteinloc Predotar predict Published online before print September 15 2003 101101gr1293003 Genome Research 132265-2270 2003 The Secreted Protein Discovery Initiative (SPDI) a Large-Scale Effort to Identify Novel Human Secreted and Transmembrane Proteins A Bioinformatics Assessment Hilary F Clark1 Austin L Gurney Evangeline Abaya Kevin Baker Daryl Baldwin Jennifer Brush Jian Chen Bernard Chow Clarissa Chui Craig Crowley Bridget Currell Bethanne Deuel Patrick Dowd Dan Eaton Jessica Foster Christopher Grimaldi Qimin Gu Philip E Hass Sherry Heldens Arthur Huang Hok Seon Kim Laura Klimowski Yisheng Jin Stephanie Johnson James Lee Lhney Lewis Dongzhou Liao Melanie Mark Edward Robbie Celina Sanchez Jill Schoenfeld Somasekar Seshagiri Laura Simmons Jennifer Singh Victoria Smith Jeremy Stinson Alicia Vagts Richard Vandlen Colin Watanabe David Wieand Kathryn Woods Ming-Hong Xie Daniel Yansura Sothy Yi Guoying Yu Jean Yuan Min Zhang Zemin Zhang Audrey Goddard William I Wood and Paul Godowski Departments of Bioinformatics Molecular Biology and Protein Chemistry Genentech Inc South San Francisco California 94080 USA 1 Corresponding author E-MAIL hclarkgenecomFAX (650) 225-5389 An automated computational strategy was utilized to query each protein translation with the Signal Sensor Sighmm Tmdetect (T Wu unpubl) hmmpfam (Eddy 1998 ) and Protcomp (Softberry Inc) algorithms hellipThe Protcomp algorithm predicts the subcellular localization of a protein on the basis of homology to well-annotated proteins a neural net and various protein motifs In this case the Protcomp subcellular localization prediction was used to categorize these genes as Other Secreted Other Transmembrane or Other Cytoplasmic or Nuclear Plant Physiol February 2002 Vol 128 pp 336-340 Gene prediction in eukaryota Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 genomelocalize ProtComp predict localization for plants httpwww softberrycomberryphtmltopic=proteinloc Predotar predict Proc Natl Acad Sci U S A 2001 April 24 98(9) 5341ndash5346 doi 101073pnas101534498 Published online 2001 April 17 Plant Biology The Cia5 gene controls formation of the carbon concentrating mechanism in Chlamydomonas reinhardtii Youbin Xiang Jun Zhang and Donald P Weeks

Department of Biochemistry and School of Biological Sciences University of Nebraska Lincoln NE 68588-0664

Edited by Bob B Buchanan University of California Berkeley CA and approved March 14 2001 (received for review 2000 November 8) To whom reprint requests should be addressed E-mail dweeks1unledu hellipComputer-assisted analysis of the CIA5 aa sequence (PROTCOMP version 4 httpwwwsoftberrycom) predicted a nuclear localization of the protein hellipFinally computer program predictions (eg PROTCOMP version 4 httpwwwsoftberrycom) for a nuclear localization of CIA5 and the clear-cut nuclear localization of CIA5 in onion epidermal cells (Fig 3) provide additional weight to the argument that CIA5 may be a transcription factor Dissertation zur Erlangung des akademischen Grades Dr rer nat der Fakultaumlt der Naturwissenschaften der Universitaumlt Ulm Untersuchungen zur Identifizierung von Faktoren und Mechanismen der mRNA 3 Prozessierung und Degradation in Chloroplasten houmlherer Pflanzen vorgelegt von Michael Walter aus Immenstadt i Allgaumlu Abteilung Molekulare Botanik Universitaumlt Ulm Ulm November 2001 Tag der Promotion 19 Feb 2002 Algorithmen zur Vorhersage der subzellulaumlren Lokalisation - PSORT httppsortnibbacjp8800formhtml (Nakai und Kanehisa 1992) - ChloroP httpwwwcbsdtudkservicesChloroP (Emanuelsson et al 2000) - TargetP httpwwwcbsdtudkservicesTargetP (Emanuelsson et al 2000) - Predotar httpwwwinrafrInternetProduitsPredotar - Softberry httpwwwsoftberrycom

BPROM Extremophiles Issue Volume 9 Number 2 Date April 2005 Pages 99 ndash 109 DOI 101007s00792-004-0425-0

The genome of BCJA1c a bacteriophage active against the alkaliphilic bacterium Bacillus clarkii

Andrew M Kropinski1 Melissa Hayward1 M Dorothy Agnew1 and Ken F Jarrell1

(1) Department of Microbiology and Immunology Queens University Kingston ON K7L 3N6 Canada

al 2002) Promoters were predicted using Softberryrsquos BPROM program at httpwwwsoftberry comberry phtmltopic=promoter

Journal of Bacteriology February 2005 p 1091-1104 Vol 187 No 3 0021-919305$0800+0 doi101128JB18731091-11042005

The Generalized Transducing Salmonella Bacteriophage ES18 Complete Genome Sequence and DNA Packaging Strategy

Sherwood R Casjens12 Eddie B Gilcrease1 Danella A Winn-Stapley1 Petra Schicklmaier3 Horst Schmieger3 Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24 Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah1 Department of Biological Sciences4 Pittsburgh Bacteriophage Institute University of Pittsburgh Pittsburgh Pennsylvania 2 Institut fuumlr Genetik und Mikrobiologie Universitaumlt Muumlnchen Munich Germany3 Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu Present address Biology Department MIT Cambridge MA 02139 Present address Biogen Idec GmbH D-85737 Ismaning Germany

Received 1 September 2004 Accepted 3 November 2004

The DNA sequence analysis software used was DNA Strider (24) GeneMark (5) Staden programs (78) BLAST (2) BPROM (httpwwwsoftberrycomberryphtmltopic

Infection and Immunity May 2005 p 2899-2909 Vol 73 No 5 Characterization of the Major Secreted Zinc Metalloprotease- Dependent GlycerophospholipidCholesterol Acyltransferase PlaC of Legionella pneumophila Sangeeta Banerji1 Mayte Bewersdorff1 Bjoumlrn Hermes1 Nicholas P Cianciotto2 and Antje Flieger1 Robert Koch-Institut Berlin Germany1 Department of Microbiology-Immunology Northwestern University Medical School Chicago Illinois2 Received 25 October 2004 Returned for modification 18 November 2004 Accepted 22 December 2004

Corresponding author Mailing address Robert Koch-Institut Research Group NG5 Pathogenesis of Legionella Infections Nordufer 20 D-13353 Berlin Germany Phone 49-30-4547-2522 Fax 49-30-4547-2328 E-mail fliegerarkide MB and BH contributed equally to this work

legion) (12) Nucleotide sequences were also analyzed for promoters using the web-based program BPROM (wwwsoftberrycom) Sequence Journal of Bacteriology April 2005 p 2458-2468 Vol 187 No 7 The Type III-Dependent Hrp Pilus Is Required for Productive Interaction of Xanthomonas campestris pv vesicatoria with Pepper Host Plants Ernst Weber1 Tuula Ojanen-Reuhs2 Elisabeth Huguet3 Gerd Hause4 Martin Romantschuk2 Timo K Korhonen2 Ulla Bonas13 and Ralf Koebnik1 Institute of Genetics1 Biozentrum Martin Luther University Halle Germany4 General Microbiology Faculty of Biosciences University of Helsinki Helsinki Finland2 Institut des Sciences Veacutegeacutetales CNRS Gif-sur-Yvette France3 Received 10 November 2004 Accepted 28 December 2004 Corresponding author Mailing address Martin-Luther-Universitaumlt Institut fuumlr Genetik Weinbergweg 10 D-06120 Halle (Saale) Germany Phone 49 345 5526293 Fax 49 345 5527151 E-mail koebnikgmxde Present address Purdue University Department of Food Sciences West Lafayette IN 47907 Present address Institut de Recherche sur la Biologie de lInsecte UMR CNRS 6035 Faculteacute

des Sciences F-37200 Tours France Present address University of Helsinki Department of Ecological and Environmental Sciences

FIN-15140 Lahti Finland

The promoter recognition program BPROM (Softberry Inc Mt Kisco NY) was used for prediction of bacterial sigma70 promoter motifs RESULTS

J Bacteriol 2004 September 186(17) 5945ndash5949 doi 101128JB186175945-59492004 Identification of Operators and Promoters That Control SXT Conjugative Transfer John W Beaber and Matthew K Waldor

Department of Microbiology Tufts University School of Medicine and Howard Hughes Medical Institute Boston Massachusetts Corresponding author Mailing address Tufts University School of Medicine 136 Harrison Ave Jaharis 425 Boston MA 02111 Phone (617) 636-2730 Fax (617) 636-2723 E-mail matthewwaldortuftsedu Received April 1 2004 Accepted May 24 2004 hellipComputer algorithms and 5prime random amplification of cDNA ends (RACE) were used to define the setR and s086 transcription start sites Software for the identification of bacterial promoters (httpwwwsoftberrycomberryphtmltopic=bpromampgroup=programsampsubgroup=gfindb) identified putative minus10 and minus35 elements for both PL and PR (Fig 2) (23 24)hellip JOURNAL OF BACTERIOLOGY Mar 2004 p 1818ndash1832 Vol 186 No 6

The pKO2 Linear Plasmid Prophage of Klebsiella oxytoca Sherwood R Casjens12 Eddie B Gilcrease1 Wai Mun Huang1 Kim L Bunny3

Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24

Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah 841321 Pittsburgh Bacteriophage Institute2 and Department of Biological Sciences4 University of Pittsburgh Pittsburgh Pennsylvania 15260 and Section of Microbiology University of California at Davis Davis California 956163

Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu hellipThe DNA sequence analysis software packages used were DNA Strider (27) GeneMark (8) the Staden programs (94) BLAST (3) BPROM httpwwwsoftberrycomberryphtmltopic_gfindb) and DNA Master (J Lawrence [httpcobamide2biopittedu])hellip

BMC Microbiology 2004 44 Analysis of the lambdoid prophage element e14 in the E coli K-12 genome Preeti Mehta1 Sherwood Casjens2 and Sankaran Krishnaswamy1 Address 1Bioinformatics Centre School of Biotechnology Madurai Kamaraj University Madurai-625021 India and 2University of Utah Medical School Department of Pathology 90 North 1900 East Salt Lake City UT 84132-2501 USA Email Preeti Mehta - mehta_p74yahoocom Sherwood Casjens - sherwoodcasjenspathutahedu Sankaran Krishnaswamy - krishnamrnatnnicin Corresponding author This article is available from httpwwwbiomedcentralcom1471-218044 Putative promoters predicted using BPROM available at the website http wwwsoftberrycom Scores are as given by BPROM Promoters Plant Molecular Biology 53 (6) 865-876 December 2003 Prokaryotic orthologues of mitochondrial alternative oxidase and plastid terminal oxidase Allison E McDonald Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Sasan Amirsadeghi Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Greg C Vanlerberghe Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada (e-mail gregvutscutorontoca The A variabilis PTOX sequence was analyzed in the upstream region of the start codon with Softberryrsquos BPROM software (httpwwwsoftberrycom)

SPLICEDB Plant Molecular Biology DOI 101007s11103-005-0271-1 Issue Volume 57 Number 3 Date February 2005 Pages 445 - 460 Evaluation of five ab initio gene prediction programs for the discovery of maize genes Hong Yao1 4 Ling Guo1 6 Yan Fu1 4 Lisa A Borsuk1 6 Tsui-Jung Wen2 David S Skibbe1 5 Xiangqin Cui1 4 9 Brian E Scheffler8 Jun Cao1 4 Scott J Emrich6 Daniel A Ashlock3 6 and Patrick S Schnable1 2 4 5 6 7

(1)Department of Genetics Development and Cell Biology Iowa State University Ames Iowa 50011-3650 (2) Department of Agronomy Iowa State University Ames Iowa 50011-3650 (3) Department of Mathematics Iowa State University Ames Iowa 50011-3650 (4) Inderdepartmental Graduate Programs in Genetics Iowa State University Ames Iowa 50011-3650 (5) Department of Molecular Cellular and Developmental Biology Iowa State University Ames Iowa 50011-3650 (6) Department of Electrical and Computer Engineering and Department of Bioinformatics and Computational Biology Iowa State University Ames Iowa 50011-3650 (7) Center for Plant Genomics Iowa State University Ames Iowa 50011-3650 (8) Mid South Area Genomics Facility USDA-ARS Stoneville MS 38776-0038 USA(9) Present address Department of Biostatistics Birmingham AL 35294 USA model FGENESH httpwwwsoftberrycom berryphtmltopic=fgeneshampgroup= programsampsubgroup=gfind Monocots Yes Yes Yes GHMM a GeneMark Finding short DNA motifs using permuted markov models X Zhao H Huang TP Speed The data are human donor sequences from SpliceDB [9] a recently developed database of known mammalian splice site sequences (httpwwwsoftberrycomspldb Current Opinion in Structural Biology 2004 14273ndash282 The evolving roles of alternative splicing Liana F Lareau1 Richard E Green1 Rajiv S Bhatnagar23 and Steven E Brenner12_ Departments of 1Molecular and Cell Biology and 2Plant and Microbial Biology University of California Berkeley California 94720 USA 3Department of Dermatology University of California San Francisco California 94143 USA _e-mail brennercompbioberkeleyedu [79] SpliceDB httpwwwsoftberrycomberryphtmltopicfrac14splicedb Database and composition statistics for mammalian splice sites inferred from ESTs [80] Yearbook of Medical Informatics Review Paper 2004 121-136 Curated databases and their role in clinical bioinformatics CC Englbrecht M Han MT Mader A Osanger KFX Mayer MIPS Institute for Bioinformatics Address of the authors Claudia C Englbrecht Michael Han

Michael T Mader Andreas Osanger Klaus F X Mayer MIPS Institute for Bioinformatics GSF - National Research Center for Environment and Health 85758 Neuherberg Germany E-mail kmayergsfdeCorresponding author hellipSpliceDB httpwwwsoftberrycomspldbSpliceDBhtmlCanonical and non-canonical mammalian splice sites [122] 122Burset M Seledtsov IA Solovyev VV SpliceDB database of canonical and non-canonical mammalian splice sites Nucleic Acids Res 200129255-9 Nucleic Acids Research 2001 Vol 29 No 1 255-259 SpliceDB database of canonical and non-canonical mammalian splice sites M Burset I A Seledtsov1 and V V Solovyev The Sanger Centre Hinxton Cambridge CB10 1SA UK and 1Softberry Inc 108 Corporate Park Drive Suite 120 White Plains NY 10604 USA To whom correspondence should be addressed at present address EOS Biotechnology 225A Gateway Boulevard South San Francisco CA 94080 USA Tel +1 650 246 2331 Fax +1 650 583 3881 Email solovyeveosbiotechcom Present address M Burset Institut Municipal drsquoInvestigacioacute Megravedica (IMIM) CDr Aiguader 80 08003 Barcelona Spain

SCAN2 African Journal of Biotechnology Vol 2 (12) pp 714-718 December 2003 Available online at httpwwwacademicjournalsorgAJB ISSN 1684ndash5315 copy 2003 Academic Journal Accepted 14 November 2003 Minireview Web-based bioinformatic resources for protein and nucleic acids sequence alignment Kamel A Abd-Elsalam Molecular Markers Lab Plant Pathology Research Institute Agricultural Research Center Orman 12619 Giza Egypt E-mail kaabdelsalammsncom 16-SCAN2 program for aligning two multimegabyte-size sequences httpwwwsoftberrycomberryphtmltopic=scanhampprg= SCAN2 derived Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK The full-length sequences of gene pairs have been aligned by the SCAN2 program (httpsoftberrycomberryphtmltopic=scanhampprg=SCAN2) which can align

PDISORDER BMC Bioinformatics 2005 622 doi1011861471-2105-6-22 Research article Open Access Proteins with two SUMO-like domains in chromatin-associated complexes The RENi (Rad60-Esc2-NIP45) family Maria Novatchkova1 Andreas Bachmair3 Birgit Eisenhaber2 and Frank Eisenhaber2 Address 1Gregor Mendel-Institut GMI Austrian Academy of Sciences Vienna Biocenter A-1030 Vienna Austria 2Research Institute of Molecular Pathology Dr Bohr-Gasse 7 A-1030 Vienna Austria and 3Max Planck Institute for Plant Breeding Research Carl-von-Linneacute-Weg 10 D-50829 Cologne Germany Email Maria Novatchkova - marianovatchkovagmioeawacat Andreas Bachmair - bachmairmpiz-koelnmpgde Birgit Eisenhaber - b_eisenimpunivieacat Frank Eisenhaber - FrankEisenhaberimpunivieacat Corresponding author hellipInitial analysis of its sequence complexity shows that the disordered N-terminal half of the protein is followed by a likely globular segment (predicted using Pdisorder by Softberry Inc)hellip

SPL The National Academy of Sciences Proc Natl Acad Sci U S A 2003 November 25 100(Suppl 2) 14537ndash14542 doi 101073pnas2335847100 Molecular evolution of the insect chemoreceptor gene superfamily in Drosophila melanogaster Hugh M Robertsondagger Coral G WarrDaggersect and John R Carlsonsect Department of Entomology University of Illinois 505 South Goodwin Avenue Urbana IL 61801 DaggerSchool of Biological Sciences Monash University Clayton VIC 3800 Australia and sectDepartment of Molecular Cellular and Developmental Biology Yale University New Haven CT 06520 The genes were reconstructed manually in the PAUP editor (23) by using the expected exonintron structures as guides and the SPL program (Softberry wwwsoftberrycomberryphtml) to locate predicted introns

NSITE GENETIC ANALYSES OF BOVINE CARD15 A PUTATIVE DISEASE RESISTANCE GENE A Dissertation by KRISTEN HAWKINS TAYLOR Submitted to Texas AampM University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 2004 Major Subject Genetics hellipShort motifs identified as being conserved between the three species in these intronic regions as well as in the 5rsquoUTR and 3rsquoUTR were then analyzed using the TFSCAN (httpzeonwelloxacukgit-bintfscan) and NSITE (available through SoftBerry httpwwwsoftberrycomberryphtmltopic=promoter) programs to identify putative regulatory motifs Motifs selected for analysis required homology consisting of 6 or more bases with no more than 2 substitutions among the 3 specieshellip hellipSequence that included the SNPs located within intronic regions and in the 3rsquo and 5rsquoUTRs were analyzed using NSITE (available through SoftBerry at httpwwwsoftberrycomberryphtmltopic=promoter) to identify putative regulatory motifshellip

TSSP Bioinformatics 2005 21(14)3074-3081 doi101093bioinformaticsbti490 Cis-regulatory element based targeted gene finding genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana Weixiong Zhang 12 Jianhua Ruan 1 Tuan-hua David Ho 3 Youngsook You 3 Taotao Yu 1 and Ralph S Quatrano 3 1Department of Computer Science and Engineering Washington University in Saint Louis Saint Louis MO 63130 USA 2Department of Genetics Washington University in Saint Louis Saint Louis MO 63130 USA 3Department of Biology Washington University in Saint Louis Saint Louis MO 63130 USA To whom correspondence should be addressed

sites (TSSs) To predict TSSs we combined an Athaliana cDNA database and a software TSSP (SoftBerry httpwwwsoftberrycom) As

PLANTPROM BMC Genomics 2005 6 25 Genome wide analysis of Arabidopsis core promoters Carlos Molina12 and Erich Grotewold 11Department of Plant Cellular and Molecular Biology and Plant Biotechnology Center The Ohio State University Columbus OH 43210 2Departamento de Informaacutetica Universidad Teacutecnica Federico Santa Mariacutea Valparaiacuteso Chile search for TATA elements is carried out on the 12749 [-500 -1] regions 6316 sequences (using the MEME NFM) or 8776 (using the expanded PlantProm NFM) are Nucleic Acids Research 2005 Vol 33 No 3 1069ndash1076 doi101093nargki247 Plant promoter prediction with confidence estimation I A Shahmuradov1 V V Solovyev12 and A J Gammerman1 1Royal Holloway University of London Egham Surrey TW20 0EX UK and 2Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA MATERIALS AND METHODS Training and testing sequences For training and testing procedures we used 301 promoters with annotated TSS from PlantProm DB (22) Nucleic Acids Research 2004 Vol 32 Database issue D368plusmnD372 DOI 101093nargkh017 AthaMap an online resource for in silico transcription factor binding sites in the Arabidopsis thaliana genome Nils Ole Steffens Claudia Galuschka Martin Schindler Lorenz BuEgravelow and Reinhard Hehl Institut fuEgrave r Genetik Technische UniversitaEgrave t Braunschweig Spielmannstraucirce 7 D-38106 Braunschweig Germany hellipShahmuradovIA GammermanAJ HancockJM BramleyPM and SolovyevVV (2003) PlantProm a database of plant promoter sequences Plant Physiology October 2004 Vol 136 pp 3023-3033 GENOME ANALYSIS Utility of Different Gene Enrichment Approaches Toward Identifying and Sequencing the Maize Gene Space1[w] Nathan Michael Springer Xiequn Xu and W Brad Barbazuk Center for Plant and Microbial Genomics Department of Plant Biology University of Minnesota St Paul Minnesota 55108 (NMS) and Donald Danforth Plant Sciences Center St Louis Missouri 63132 (XX WBB) Text] Shahmuradov IA Gammerman AJ Hancock JM Bramley PM Solovyev VV (2003) PlantProm a database of plant promoter sequences

Plant Physiology April 2004 Vol 134 pp 1ndash12 wwwplantphysiolorg Characterization of Three Functional High-Affinity Ammonium Transporters in Lotus japonicus with Differential Transcriptional Regulation and Spatial Expression1

Enrica DrsquoApuzzo2 Alessandra Rogato2 Ulrike Simon-Rosin Hicham El Alaoui3 Ani Barbulova Marco Betti Maria Dimou Panagiotis Katinakis Antonio Marquez Anne-Marie Marini Michael K Udvardi and Maurizio Chiurazzi Institute of Genetics and Biophysics Via Marconi 12 80125 Napoli Italy (ED AR HEA AB MC) Molecular Plant Nutrition Group Max Planck Institute of Molecular Plant Physiology Am Muhlenberg 1 14476 Golm Germany (US-R MKU) Universiteacute Libre de Bruxelles Institut de Biologie de Meacutedecine Moleacuteculaires 6041 Gosselies Belgium (A-MM) Agricultural University of Athens Department of Agricultural Biotechnology 11855 Athens Greece (MD PK) and Departamento de Bioquıacutemica Vegetal y Biologıacutea Molecular Facultad de Quimica 41080 Seville Spain (MB AM) hellipPlantProm a database of plant promoter sequences Nucleic Acids Res 31 114ndash117 Nucleic Acids Research 1 January 2003 vol 31 no 1 pp 114-117(4) PlantProm a database of plant promoter sequences Authors Bramley PM1 Solovyev VV2 Shahmuradov IA Gammerman AJ Hancock JM Affiliations Department of Computer Science Royal Holloway University of London Egham Surrey TW20 0EX UK 1 School of Biological Sciences Royal Holloway University of London UK 2 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 3 To whom correspondence should be addressed Email victorsoftberrycom Present address John M Hancock MRC Mammalian Genetics Unit Harwell Oxfordshire UK

PROMH Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK Received February 15 2003 Revised and Accepted March 21 2003

Other FGENESH Molecular Microbiology Volume 52 Issue 6 Page 1579 - June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Affiliations Departments of Medicine and Pathobiology University of Washington Harborview Medical Center Box 359779 325 Ninth Ave Seattle WA 98104 USA E-mail acenturuwashingtonedu Tel (+1) 206 341 5364 Fax (+1) 206 341 5363 Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Microbiology 150 (2004) 518-520 DOI 101099mic026871-0 IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated with recognizable overlapping genes Mark W Silby1 Paul B Rainey23 and Stuart B Levy14 1 Center for Adaptation Genetics and Drug Resistance Department of Molecular Biology and Microbiology Tufts University School of Medicine Boston MA 02111 USA 2 Department of Plant Sciences University of Oxford South Parks Road Oxford OX1 3RB UK 3 School of Biological Sciences University of Auckland Private Bag 92019 Auckland New Zealand 4 Department of Medicine Tufts University School of Medicine Boston MA 02111 USA Correspondence Stuart B Levy (stuartlevytuftsedu) hellipUsing SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of the iiv5 ORF respectivelyhellip Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 DOI 101128JVI782212576-125902004

Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song1 Qi Wei Qin2 Jin Qiu1 Can Hua Huang1 Fan Wang1 and Choy Leong Hew1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2 Corresponding author Mailing address Department of Biological Sciences National University of Singapore 10 Kent Ridge Crescent Singapore 119260 Singapore Phone 65-68742692 Fax 65-67795671 E-mail dbshewclnusedusg or dbsheadnusedusg Received 19 March 2004 Accepted 29 June 2004

hellipThe whole genome was also submitted to httpwwwsoftberrycom (Softberry Inc Mount Kisco NY) for identification of all potential ORFshellip hellipCoding capacity of the viral genomic DNA sequence Prediction of presumptive genes was carried out by using the viral gene prediction program under the website httpwwwsoftberrycom supplemented with Vector NTI suite 71hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1695-8 Issue Volume 109 Number 4 Date August 2004 Pages 681 - 689

Linkage disequilibrium and sequence diversity in a 500-kbp region around the adh1 locus in elite maize germplasm Mark Jung1 Ada Ching1 Dinakar Bhattramakki2 Maureen Dolan1 Scott Tingey1 Michele Morgante1 3 and Antoni Rafalski1

(1) DuPont Crop Genetics Experimental Station PO Box 80353 Wilmington DE 19880-0353 USA (2) Pioneer Hi-Bred International Inc 7300 NW 62nd Avenue PO Box 1004 Johnston IA 50131-1004 USA (3) Dipartimento di Produzione Vegetale e Tecnologie Agrarie Universita di Udine Via delle Scienze 208 33100 Udine ItalyReceived 26 January 2004 Accepted 2 April 2004 Published online 6 August 2004 Mark Jung Email marktjungcgrdupontcom 1) Gene locations were defined by several methods Annotations provided in Tikhonov et al (1999) were first used then FGENESH gene-finding software DNA Sequence - The Journal of Sequencing and Mapping Issue Volume 15 Number 4 August 2004 Pages 269 ndash 276 DOI 10108010425170412331279648 Isolation Characterization and Expression Analysis of a Leaf-specific Phosphoenolpyruvate Carboxylase Gene in Oryza sativa Chang-Fa Lin A1 Chun Wei A1 Li-Zhi Jiang A1 A2 Ke-Gui Li A1 Xiao-Yin Qian A1 Kotb Attia A1 Jin-Shui Yang A1 A1 State Key Laboratory of Genetic Engineering Institute of Genetics School of Life Sciences Fudan University Shanghai 200433 PRChina A2 National Key Laboratory for Soil Erosion and Dry Land Farming on Loess Plateau Northwest Sci-Tech University of Agriculture and Forest Shannxi 712100 PRChina tools of GeneMark (httpopal biologygatechedugeneMark) and Softberry (httpwwwsoftberrycom) For the isolation of putative Plant Molecular Biology DOI 101023BPLAN00000382568980957 Issue Volume 54 Number 4 Date March 2004 Pages 519 ndash 532

Genome-Wide Analysis of the GRAS Gene Family in Rice and Arabidopsis Chaoguang Tian1 Ping Wan1 Shouhong Sun1 Jiayang Li1 and Mingsheng Chen1

(1) Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road Chaoyang District Beijing 100101 ChinaMingsheng Chen Email mschengeneticsaccn

database FgeneSH (Salamov and 90 Solovyev 2000) was used for gene prediction pre- 207 dicted by FgeneSH (minor discrepancies exist due 208 Mycological Research (2004) 108 853-857 Cambridge University Press doi 101017S095375620400067X Published Online 13 авг 2004 Combining transcriptome data with genomic and cDNA sequence alignments to make confident functional assignments for Aspergillus nidulans genes Andrew H SIMS a1 Manda E GENT a1 Geoffrey D ROBSON a1 Nigel S DUNN-COLEMAN a2 and Stephen G OLIVER a1c1 a1 School of Biological Sciences University of Manchester The Michael Smith Building Oxford Road Manchester M13 9PT UK E-mail steveolivermanacuk a2 Genencor International Inc 925 Page Mill Road Palo Alto CA 94304 USA c1 Corresponding author Kingdom Page 2 Genewise FgeneSH FgeneSH+) consisting of 9541 putative open reading frames (ORFs) was released in June 2003 We TAG Theoretical and Applied Genetics DOI 101007s00122-004-1603-2 Issue Volume 109 Number 1 Date June 2004 Pages 129 ndash 139

Gene content and density in banana (Musa acuminata) as revealed by genomic sequencing of BAC clones R Aert1 2 L Saacutegi2 and G Volckaert1

(1) Laboratory of Gene Technology Katholieke Universiteit Leuven Kasteelpark Arenberg 21 3001 Leuven Belgium

Present address Laboratory of Tropical Crop Improvement Katholieke Universiteit Leuven (2) Kasteelpark Arenberg 13 3001 Leuven Belgium

R Aert Email RitaAertagrkuleuvenacbe

gscrikengojp) fgenesh version 11 (Salamov and Solovyev 2000 httpwwwsoftberry com) genemarkhmm version 22a (Lukashin and Borodovsky 1998 http Genome Research 142503-2509 2004 Resources EAnnot A genome annotation tool using experimental evidence Li Ding1 Aniko Sabo Nicolas Berkowicz Rekha R Meyer Yoram Shotland Mark R Johnson Kymberlie H Pepin Richard K Wilson and John Spieth Genome Sequencing Center Washington University School of Medicine St Louis Missouri 63110 USA Article and publication are at httpwwwgenomeorgcgidoi101101gr3152604 1 Corresponding author E-mail ldingwatsonwustledu fax (314) 286-1810 Some ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov and Solovyev 2000 ) are based on intrinsic characteristics of coding hellipSome ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov

and Solovyev 2000 ) are based on intrinsic characteristics of coding sequence (eg codon usage consensus splice sites etc) and require training on known genes from the organismhellip

hellipTo further evaluate the performance of EAnnot we compared EAnnot predictions with Ensembl Genscan and Fgenesh predictions using manual annotation as a standard While Genscan and Fgenesh are ab initio programs Ensembl takes into account experimental data a feature shared with EAnnot Ensembl predicted 1037 known genes with 1798 transcripts and 1457 EST genes with 2308 transcripts for chromosome 6 (build 31) while Fgenesh and Genscan predicted 6230 and 6225 genes respectively We evaluated the performance of each program with respect to splice sites transcripts and genes across all of chromosome 6hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1758-x Issue Volume 109 Number 7 Date November 2004 Pages 1434 - 1447

Full-genome analysis of resistance gene homologues in rice

B Monosi1 R J Wisser2 L Pennill1 and S H Hulbert1

(1) Department of Plant Pathology Kansas State University Manhattan KS 66506-5502 USA (2) Department of Plant Pathology Cornell University Ithaca NY 14853 USAReceived 18 February 2004 Accepted 16 June 2004 Published online 10 August 2004 S H Hulbert Email shulbrtksuedu DNA sequences were analyzed using the gene prediction programs GENSCAN (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (Salamov and arXivq-bioGN0402046 v1 27 Feb 2004 Sublinear growth of Information in DNA sequences Giulia Menconi Dipartimento di Matematica Applicata and CISSC Centro Interdisciplinare per lo Studio dei Sistemi Complessi Universit`a di Pisa Via Bonanno Pisano 25b 56126 PISA - Italy menconimaildmunipiit October 23 2003 hellipAs a result four putative genes G1 G2 G3 and G4 have been located by means of Hidden Markov Model-based program FGENESH2 that has been created for predicting multiple genes and their structure in genomic DNA sequences The analysis via FGENESH has been exploited with respect to known genes in Arabidopsis thaliana Their predicted position is illustrated in Figure 13 hellip2This program is available at the website wwwsoftberrycom to which we refer con-cerning the reliability and e_ciency of the algorithmhellip Current Opinion in Plant Biology 2004 7732ndash736 DOI 101016jpbi200409003 Consistent over-estimation of gene number in complex plant genomes Jeffrey L Bennetzen14 Craig Coleman27 Renyi Liu15 Jianxin Ma16 and Wusirika Ramakrishna38 1 Department of Genetics University of Georgia Athens Georgia 30602 USA 2 Department of Plant and Animal Sciences Brigham Young University Provo Utah 84602 USA 3 Department of Biological Sciences Michigan Tech University Houghton Michigan 49931 USA

4e-mail maizeugaedu 5e-mail lryugaedu 6e-mail jmaugaedu 7e-mail biotechnologybyuedu 8e-mail wusirikamtuedu hellipWe have found that the standard gene-discovery programs FGENESH GeneMark and GENSCAN annotate segments of most retrotransposons and many invertedrepeat transposable elements as genes Using FGENESH to annotate maize BAC clones for instance 70ndash100 of the predicted genes are actually from transposable elementshellip The Plant Cell 162795-2808 (2004) Spotted leaf11 a Negative Regulator of Plant Cell Death and Defense Encodes a U-BoxArmadillo Repeat Protein Endowed with E3 Ubiquitin Ligase Activity Li-Rong Zenga Shaohong Qua Alicia Bordeosb Chengwei Yangc Marietta Baraoidanb Hongyan Yanc Qi Xiec Baek Hie Nahmd Hei Leungb and Guo-Liang Wanga1 a Department of Plant Pathology Ohio State University Columbus Ohio 43210 b International Rice Research Institute Metro Manila Philippines c State Key Lab for Biocontrol Sun Yat-sen (Zhongshan) University Guangzhou China 510275 d Department of Biological Science Myongji University Kyonggido Korea 449728 1 To whom correspondence should be addressed E-mail wang620osuedu fax 614-292-4455 in spl11 Exons predicted in G3 by the programs GENSCAN and Fgenesh using different matrixes are displayed in dark gray (D) RFLP Source Human Genomics Volume 1 Number 2 January 2004 pp 146-149(4) Publisher Henry Stewart Publications The truth about mouse human worms and yeast Authors David R Nelson1 Daniel W Nebert2 1 Department of Molecular Sciences and The UT Center of Excellence in Genomics and Bioinformatics University of Tennessee Memphis Tennessee 38163 USA 2 Department of Environmental Health and Center for Environmental Genetics (CEG) University of Cincinnati Medical Center Cincinnati Ohio 45267-0056 USA unpublished data 2003 see also Ref [7]) FGENESH 21 TWINSCAN 22 and the Ensembl annotation pipeline 23 The output of the four Genome Biology 2004 5R73 doi101186gb-2004-5-10-r73 A comprehensive transcript index of the human genome generated using microarrays and computational approaches Eric E Schadt 1 Stephen W Edwards 1 Debraj GuhaThakurta1 Dan Holder2 Lisa Ying2 Vladimir Svetnik2 Amy Leonardson1 Kyle W Hart3 Archie Russell1 Guoya Li1 Guy Cavet1 John Castle1 Paul McDonagh4 Zhengyan Kan1 Ronghua Chen1 Andrew Kasarskis1 Mihai Margarint1 Ramon M Caceres1 Jason M Johnson1

Christopher D Armour1 Philip W Garrett-Engele1 Nicholas F Tsinoremas5 and Daniel D Shoemaker1

1Rosetta Inpharmatics LLC 12040 115th Avenue NE Kirkland WA 98034 USA 2Merck Research Laboratories W42-213 Sumneytown Pike POB 4 Westpoint PA 19846 USA 3Rally Scientific 41 Fayette Street Suite 1 Watertown MA 02472 USA 4Amgen Inc 1201 Amgen Court W Seattle WA 98119 USA 5The Scripps Research Institute Jupiter FL 33458 USA hellipGrailEXP 40 [47] GENSCAN 10 [48] FGENESH [49] and FGENESH+ [49]ab initio gene-prediction algorithms were run independently across the entire genome assembly to augment alignment-based gene identification methods GrailEXP 40 GENSCAN 10 and FGENESH version 1c were run with default parameters for human sequence GrailEXP used expressed sequence evidence from RefSeq UniGene and DoubleTwist HGI to refine gene predictions FGENESH+ was run with protein sequences from BLASTX with E-score lower than 10-5 When multiple protein alignments overlapped all overlapping protein sequences were clustered with BLASTClust [50] and the lowest E-score hit was used by FGENESH+hellip hellipAdditionally 3 sim4 and 3 or 5 FGENESH+ predicted exons that were short andor distant from internal predicted exons were removedhellip Genome Research 14988-995 2004 ENSEMBL Special GeneWise and Genomewise Ewan Birney13 Michele Clamp2 and Richard Durbin2 1 The European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK hellipThere has been a long history of successful ab initio programs which do not use any additional

evidence to predict genes on genomic DNA of which Genscan (Burge and Karlin 1997 ) and Fgenesh (Solovyev and Salamov 1997 ) are two of the most successful caseshellip hellipAnother class of evidence-based gene prediction programs are ones which use external evidence to influence the scoring of potential exons including SGP-2 (Parra et al 2003 ) Genie

(Kulp et al 1996 ) Genomescan (Yeh et al 2001 ) HMMGene (Krogh 2000 ) and Fgenesh++ (Solovyev and Salamov 1997 )hellip Published online before print February 5 2004 101073pnas0308430100 PNAS | February 17 2004 | vol 101 | no 7 | 1910-1915 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin Nam Joonyul Kim para Shinyoung Lee Gynheung An Hong Ma and Masatoshi Nei Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State

University University Park PA 16802 and National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Contributed by Masatoshi Nei December 22 2003

hellipBecause annotation of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program FGENESH (wwwsoftberrycom) from the genome sequences obtained from TIGR and the Rice Genome Database (China) (25) Functional amp Integrative Genomics DOI 101007s10142-004-0109-y Issue Volume 4 Number 2 Date May 2004 Pages 102 - 117

Sequence analysis of the long arm of rice chromosome 11 for ricendashwheat synteny

Nagendra K Singh1 Saurabh Raghuvanshi2 Subodh K Srivastava1 Anupama Gaur2 Ajit K Pal1 Vivek Dalal1 Archana Singh1 Irfan A Ghazi1 Ashutosh Bhargav1 Mahavir Yadav1 Anupam Dixit1 Kamlesh Batra1 Kishor Gaikwad1 Tilak R Sharma1 Amitabh Mohanty2 Arvind K Bharti2 Anita Kapur2 Vikrant Gupta2 Dibyendu Kumar2 Shubha Vij2 Ravi Vydianathan2 Parul Khurana2 Sulabha Sharma2 W Richard McCombie3 Joachim Messing4 Rod Wing5 Takuji Sasaki6 Paramjit Khurana2 Trilochan Mohapatra1 Jitendra P Khurana2 and Akhilesh K Tyagi2

1 Indian Initiative for Rice Genome Sequencing National Research Centre on Plant Biotechnology

Indian Agricultural Research Institute New Delhi 110012 India 2 Indian Initiative for Rice Genome Sequencing Department of Plant Molecular Biology University of

Delhi South Campus New Delhi 110021 India 3 Genome Research Centre Cold Spring Harbor Laboratory 1 Bungtown Road New York USA 4 The Plant Genome Initiative at RutgersndashWaksman Institute Rutgers University 190 Frelinghuysen

Road Piscataway NJ 08873 USA 5 Department of Plant Sciences 303 Forbes Building Arizona Genomics Institute The University of

Arizona Tucson AZ 85721 USA 6 Department of Genome Research National Institute of Agrobiological Sciences 1-2 Kannondai 2-

chome Tsukuba Ibaraki 305-8602 Japan Akhilesh K Tyagi Email akhileshgenomeindiaorg Wherever RiceGAAS data were not available the genes were predicted by FGENESH trained for monocot plant species (http wwwsoftberrycomberryphtml) TAG Theoretical and Applied Genetics DOI 101007s00122-004-1621-0 Issue Volume 109 Number 1 Date June 2004 Pages 10 - 22

Annotation of a 95-kb Populus deltoides genomic sequence reveals a disease resistance gene cluster and novel class I and class II transposable elements

M Lescot1 4 S Rombauts1 J Zhang1 S Aubourg1 5 C Matheacute1 6 S Jansson2 P Rouzeacute1 3 and W Boerjan1

1 Department of Plant Systems Biology Flanders Interuniversity Institute for Biotechnology Ghent University Technologiepark 927 9052 Gent Belgium

2 Department of Plant Physiology University of Umearing 901 87 Umearing Sweden 3 Laboratoire Associeacute de l Institut National de la Recherche Agronomique (France) Ghent

University 9052 Gent Belgium 4 Present address CIRAD-Biotrop TA4003 34398 Montpellier Cedex 5 France 5 Pre sent address Uniteacute de Recherche en Geacutenomique Veacutegeacutetale INRA 91057 Evry Cedex

France 6 Present address Laboratoire de Biologie Vasculaire Institut de Pharmacologie et Biologie

Structurale 205 route de Narbonne 31077 Toulouse Cedex France

P Rouzeacute Email pierrerouzepsbugentbe 1999 httpwwwtigrorgtdbglimmermglmr_formhtml) and FgenesH for dicots or monocots (Salamov and Solovyev 2000 httpwwwsoftberrycom) BIOINFORMATICS 2004 vol20 N9 p1416-1427 J Yuan B Bush A Elbrecht Y Liu T Zhang W Zhao hellip - suchasGRAIL(Lopezetal 1994 Roberts 1991 Uberbacher et al 1996) GENESCOPE (Murakami and Takagi 1998) fgenesh (Salamov and Solovyev 2000) GeneMark Molecular Plant Pathology Volume 5 Issue 6 Page 515 - November 2004 Pathogen profile Heading for disaster Fusarium graminearum on cereal crops RUBELLA S GOSWAMI AND H CORBY KISTLERCorrespondence E-mail HCKISTumnedu This pipeline uses a combination of the programs FGENESH and FGENESH+ (Salamov and Solovyev 2000) modified by Softberry ( http wwwsoftberrycom ) with Nucleic Acids Research 2004 Vol 32 Database issue D41-D44 MIPS analysis and annotation of proteins from whole genomes H W Mewes12 C Amid1 R Arnold1 D Frishman2 U Guumlldener1 G Mannhaupt2 M Muumlnsterkoumltter1 P Pagel1 N Strack2 V Stuumlmpflen1 J Warfsmann1 and A Ruepp1 1 Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1 D-85764 Neuherberg Germany and 2 Technische Universitaumlt Muumlnchen Chair of Genome Oriented Bioinformatics Center of Life and Food Science D-85350 Freising-Weihenstephan Germany To whom correspondence should be addressed at Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 D-85764 Neuherberg Germany Tel +49 89 3187 3580 Fax +49 89 3187 3585 Email wmewesgsfde The genome of 40 Mb encodes 10 000 proteins automatically predicted by the program FGENESH (httpsoftberry com) specifically trained for Neurospora Annual Review of Genomics and Human Genetics Vol 5 15-56 (Volume publication date September 2004) COMPARATIVE GENOMICS Webb Miller Kateryna D Makova Anton Nekrutenko and Ross C Hardison The Center for Comparative Genomics and Bioinformatics The Huck Institutes of Life Sciences and the Departments of Biology Computer Science and Engineering and Biochemistry and Molecular Biology Pennsylvania State University University Park Pennsylvania email webbbxpsuedu kdm16psuedu antonbxpsuedu rch8psuedu These algorithms include Genscan the most popular gene prediction tool (24) GenMark (117) Fgenesh (155) GeneID (144) and others (for an excellent overview DNA and Cell Biology May 2004 Vol 23 No 5 311-324 Harbinger Transposons and an Ancient HARBI1 Gene Derived from a Transposase

Vladimir V Kapitonov Genetic Information Research Institute Mountain View California Jerzy Jurka Genetic Information Research Institute Mountain View California We used FGENESH (Salamov and Solovyev 2000) and GeneScan (Burge and Karlin 1997) for the identification of exons and introns The d N d S ratio Proc Natl Acad Sci U S A 2004 February 17 101(7) 1910ndash1915 Published online 2004 February 5 doi 101073pnas0308430100 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin NamdaggerDagger Joonyul Kimsectpara Shinyoung Leesect Gynheung Ansect Hong Madagger and Masatoshi Neidagger

daggerInstitute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University University Park PA 16802 and sectNational Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Dagger To whom correspondence should be addressed E-mail jyn101psuedu paraPresent address Michigan State UniversityndashDepartment of Energy Plant Research Laboratory and Department of Biochemistry and Molecular Biology Michigan State University East Lansing MI 48824 of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program fgenesh (wwwsoftberrycom Nucleic Acids Research 2004 Vol 32 Database issue D377-D382 BGI-RIS an integrated information resource and comparative analysis workbench for rice genomics Wenming Zhao1 Jing Wang2 Ximiao He1 Xiaobing Huang1 Yongzhi Jiao1 Mingtao Dai1 Shulin Wei1 Jian Fu1 Ye Chen1 Xiaoyu Ren1 Yong Zhang12 Peixiang Ni1 Jianguo Zhang1 Songgang Li12 Jian Wang1 Gane Ka-Shu Wong13 Hongyu Zhao4 Jun Yu1 Huanming Yang1 and Jun Wang1 1 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3 University of Washington Genome Center Department of Medicine Seattle WA 98195 USA and 4 Yale University School of Medicine Department of Epidemiology and Public Health New Haven CT 06520-8034 USA To whom correspondence should be addressed Tel +86 10 80481662 Fax +86 10 80498676 Email wangjgenomicsorgcn Correspondence may also be addressed to Huanming Yang Tel +86 10 80494969 Fax +86 10 80491181 Email yanghmgenomicsorgcn The authors wish it to be known that in their opinion the first four authors should be regarded as joint First Authors The contig sequences were annotated for gene content by using automated processes that involve ab initio gene finders such as FgeneSH (httpwwwsoftberrycom Genome Research 141932-1937 2004 Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome

Jinsheng Lai1 Nrisingha Dey2 Cheol-Soo Kim35 Arvind K Bharti1 Stephen Rudd46 Klaus FX Mayer4 Brian A Larkins3 Philip Becraft2 and Joachim Messing17 1 Waksman Institute Rutgers The State University of New Jersey Piscataway New Jersey 08854 USA 2 Department of Genetics Development amp Cell Biology Iowa State University Ames Iowa 50011 USA 3 Department of Plant Science University of Arizona Tucson Arizona 85721 USA 4 Munich Information Center for Protein Sequences Institute for Bioinformatics GSF Research Center for Environment and Health Neuherberg Germany A total of 54397 putative genes could be predicted for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for Plant Molecular Biology DOI 101023BPLAN000002876821587dc Issue Volume 54 Number 1 Date January 2004 Pages 55 - 69

Dynamics of the evolution of orthologous and paralogous portions of a complex locus region in two genomes of allopolyploid wheat

Xiu-Ying Kong1 2 Yong Qiang Gu3 Frank M You4 Jorge Dubcovsky4 and Olin D Anderson3

1 Genetic Resources Conservation Program University of California Davis CA 95616 USA 2 Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081

China 3 US Dept of Agriculture Western Regional Research Center Agricultural Research Service 800

Buchanan Street Albany CA 94710 USA 4 Department of Agronomy and Range Sciences University of California Davis CA 95616 USA FGENESH (httpwwwsoftberrycomberryphtml) and GENES- CAN (httpgenemark miteduGENESCANhtml) were used for gene prediction Current Genetics DOI 101007s00294-003-0451-y Issue Volume 44 Number 6 Date January 2004 Pages 329 - 338

Chromosome rearrangements in isolates that escape from het-c heterokaryon incompatibility in Neurospora crassa

Qijun Xiang1 and N Louise Glass1

Department of Plant and Microbial Biology University of California Berkeley CA 94720-3102 USA Hypothetical proteins are predicted from FGENESH calls with overlapping Blastx hits (but not with trusted homology) while Predicted Molecular Genetics and Genomics DOI 101007s00438-004-0990-z Issue Volume 271 Number 4 Date May 2004 Pages 402 - 415

Genome-wide identification of NBS genes in japonica rice reveals significant expansion of divergent non-TIR NBS-LRR genes

T Zhou1 Y Wang1 J-Q Chen1 H Araki2 Z Jing1 K Jiang1 J Shen1 and D Tian1

1 State Key Laboratory of Pharmaceutical Biotechnology Department of Biology Nanjing University 210093 Nanjing China

2 Department of Ecology and Evolution University of Chicago Chicago IL 60637 USA to 5000ndash10000 bp from both ends of the hits and then the expanded nucleotide fragments were reannotated using the gene-finding programs FGENESH (http www Proc Natl Acad Sci U S A 2004 June 15 101(24) 9045ndash9050 Genetics

Genetic control of branching in foxtail millet Andrew N Doust Katrien M DevosDaggerdagger Michael D Gadberrysect Mike D GaleDagger and Elizabeth A KelloggDepartment of Biology University of Missouri 8001 Natural Bridge Road St Louis MO 63121 and DaggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH United Kingdom dagger To whom correspondence should be addressed E-mail adoustumsledu sectPresent address Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building University of Georgia Athens GA 30602 Communicated by Peter H Raven Missouri Botanical Garden St Louis MO April 23 2004 Each of these contigs was scanned by using fgenesh (28) and identified ORFs were translated and compared with ORFs from other contigs from the same QTL region Mol Biol Evol 21(9)1769-1780 2004 Merlin a New Superfamily of DNA Transposons Identified in Diverse Animal Genomes and Related to Bacterial IS1016 Insertion Sequences Ceacutedric Feschotte1 Departments of Plant Biology and Genetics The University of Georgia Athens Correspondence E-mail cedricplantbiougaedu coding sequences were assembled by removing introns predicted with more than 85 confidence by NetGene2 (httpwwwcbsdtudk) andor FGENESH (httpgenomic Genome Research 141924ndash1931 copy2004 Gene Loss and Movement in the Maize Genome Jinsheng Lai1 Jianxin Ma23 Zuzana Swigonˇovб1 Wusirika Ramakrishna24

Eric Linton15 Victor Llaca16 Bahattin Tanyolac17 Yong-Jin Park28 O-Young Jeong29

Jeffrey L Bennetzen23 and Joachim Messing110 1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854-8020 USA 2Department of Biological Sciences Purdue University West Lafayette Indiana 47907-1392 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602-7223 USA 4Department of Biological Sciences 740 DOW Michigan Tech University Houghton MI 49931 USA 5Plant Biology Labs Michigan State University East Lansing MI 48824 USA 6Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880-0353 USA 7Department of Bioengineering Ege University Izmir 35100 Turkey 8National Institute of Agricultural Biotechnology Suwon 441-707 Republic of Korea 9National Institute of Crop Science Suwon 441-857 Republic of Korea 10Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 The FGENESH program predicted four of which three (gene 1d in the maize orp1 region gene 5a 5b in the rice r1 region) would produce truncated proteins

Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Molecular Microbiology Volume 54 Issue 2 Page 407 - October 2004 doi101111j1365-2958200404310x Cryptococcus neoformans Kin1 protein kinase homologue identified through a Caenorhabditis elegans screen promotes virulence in mammals Eleftherios Mylonakis1 Alexander Idnurm2 Roberto Moreno1 Joseph El Khoury134 James B Rottman5 Frederick M Ausubel67 Joseph Heitman28910 and Stephen B Calderwood1111Division of Infectious Diseases Massachusetts General Hospital Boston MA 02114 USA 2Department of Molecular Genetics and Microbiology Duke University Medical Center Durham NC 27710 USA 3Center for Immunology and Inflammatory Diseases and 4Division of Rheumatology Allergy and Immunology Massachusetts General Hospital Boston MA 02114 USA 5Archemix Corporation Cambridge MA 02139 USA 6Department of Molecular Biology Massachusetts General Hospital Boston MA 02114 USA 7Department of Genetics Harvard Medical School Boston MA 02115 USA 8Division of Infectious Diseases 9Department of Medicine and 10Howard Hughes Medical Institute Duke University Medical Center Durham NC 27710 USA 11Department of Microbiology and Molecular Genetics Harvard Medical School Boston MA 02115 USA E-mail scalderwoodpartnersorg

Sequences were compared with the H99 genome database at Duke University and genes predicted in these regions by FGENESH software ( http wwwsoftberrycom TAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Original Paper Characterization of soybean genomic features by analysis of its expressed sequence tags Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

1 Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

2 Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China Jin-Song Zhang Email jszhanggeneticsaccnShou-Yi Chen Email sychengeneticsaccn six BAC-contig sequences of M truncatula were analyzed and the results based on the gene prediction program FGENSH (Arabidopsis matchFGENESH prediction) (http Current Proteomics Volume 1 Number 1 January 2004 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE11 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Chinese Science Bulletin 2004 Vol 49 No 4 355-362 The VER2 promoter contains repeated sequences and requires vernalization for its activity in winter wheat (Triticum aestivum L) XU Wenzhong

1 WANG Xin

1 FENG Qi

2 ZHANG Lei

2 LIU Yaoguang

3 HAN Bin

2 CHONG

Kang1 XU Zhihong

1 amp TAN Kehui

1

1 Research Center for Molecular Developmental Biology Key Lab of Photosynthesis and Environmental Molecular Physiology Institute of Botany Chinese Academy of Sciences (CAS) Beijing 100093 China 2 National Center for Gene Research CAS Shanghai 200233 China 3 Genetic Engineering Laboratory College of Life Sciences South China Agricultural University Guangzhou 510642 China Correspondence should be addressed to Chong Kang (e-mail mailtochongknsibcasaccn) Sequence analyses were finished using biological softwares on Internet such as FGENESH 10 (Prediction of potential genes in Plant (Dct) genomic DNA)

TAG Theoretical and Applied Genetics DOI 101007s00122-003-1457-z Issue Volume 108 Number 3 Date February 2004 Pages 392 - 400

Sequence variations of simple sequence repeats on chromosome-4 in two subspecies of the Asian cultivated rice Can Li1 Yu Zhang1 Kai Ying1 Xiaolei Liang1 and Bin Han1

(1) National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China To characterize the possible relationship between SSRs and genes predicted by using FGENESH we investigated the distribution of SSRs in the rice chromosome-4 Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene TAG Theoretical and Applied Genetics DOI 101007s00122-004-1591-2 Issue Volume 108 Number 8 Date May 2004 Pages 1449 - 1457

Positional cloning of the rice Rf-1 gene a restorer of BT-type cytoplasmic male sterility that encodes a mitochondria-targeting PPR protein

H Akagi1 A Nakamura2 Y Yokozeki-Misono2 A Inagaki2 4 H Takahashi1 K Mori1 and T Fujimura3

1 Laboratory of Plant Breeding and Genetics Department of Biological Production Faculty of Bioresource Sciences Akita Prefectural University Kaidoubata-Nishi 241-7 Shimoshinjyo-Nakano 010-0195 Akita Japan

2 Biochemical Technology Section Life Science Laboratory Performance Materials RampD Center Mitsui Chemicals Togo 1144 297-0017 Mobara Japan

3 Institute of Agricultural and Forest Engineering University of Tsukuba Tennoudai 1-1-1 Tsukuba 305-8572 Ibaraki Japan

4 Present address Department of Synthetic Chemistry and Biological Chemistry Graduate School of Engineering Kyoto University Yoshida-Honmachi Sakyo-ku 606-8501 Kyoto japan

H Akagi Email akagiakita-puacjp Software Develop- ment Tokyo) Genomic sequences were also analyzed using gene prediction programs genescan and fgenesh Table 1 DNA Genome Research 14942-950 2004

The Ensembl Automatic Gene Annotation System Val Curwen1 Eduardo Eyras1 T Daniel Andrews1 Laura Clarke1 Emmanuel Mongin2 Steven MJ Searle1 and Michele Clamp34 1 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 EMBL European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SD UK 3 The Broad Institute Cambridge Massachusetts 02141 USA Commonly we use Genscan for ab initio prediction in human mouse and rat but the system is equally applicable to other methods such as FgenesH (Solovyev et Gene 324 (2004) 105ndash115 Transcript abundance of rml1 encoding a putative GT1-like factor in rice is up-regulated by Magnaporthe grisea and down-regulated by light Rong Wang ab Guofan Hongab Bin Hana a

National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China b

Shanghai Institute of Biochemistry and Cell Biology Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 320 Yueyang Road Shanghai 200031 China 1A and 2A) The structure of rml1 _ a is as same as that predicted by FGENESH software The 3V UTR of rml1 _ a is confirmed with the length of 596 bp Genome Biology 2004 5R46 doi101186gb-2004-5-7-r46 Identification of conserved gene structures and carboxy-terminal motifs in the Myb gene family of Arabidopsis and Oryza sativa L ssp Indica Cizhong Jiang1 Xun Gu1 2 and Thomas Peterson1 1Department of Genetics Development and Cell Biology and Department of Agronomy Iowa State University Ames IA 50011 USA 2LHB Center for Bioinformatics and Biological Statistics Iowa State University Ames IA 50011 USA FGeneSH has been used successfully to predict genes in rice [9] and GenScan was used together with it to predict genes by taking rice genomic sequences as Molecular Microbiology 53 (5) 1307-1318 - September 2004 doi 101111 j1365-2958200404215x The sirodesmin biosynthetic gene cluster of the plant pathogenic fungus Leptosphaeria maculans Donald M Gardiner1 Anton J Cozijnsen1 Leanne M Wilson1 M Soledade C Pedras2 and Barbara J Howlett1

1School of Botany The University of Melbourne Victoria Australia 3010 2Department of Chemistry University of Saskatchewan 110 Science Place Saskatoon SK Canada S7N 5C9 E-mail dgardinerpgradunimelbeduau http wwwtigrorg Putative genes were predicted using Fgenesh software at http wwwsoftberrycom Fungal culture The wild type Journal of Molecular Evolution DOI 101007s00239-004-2666-z Issue Volume 59 Number 6 Date December 2004 Pages 761 - 770 Analysis of the Molecular Evolutionary History of the Ascorbate Peroxidase Gene Family Inferences from the Rice Genome Felipe Karam Teixeira1 Larissa Menezes-Benavente1 Rogeacuterio Margis1 2 and Maacutercia Margis-Pinheiro1

(1) Laboratoacuterio de Geneacutetica Molecular Vegetal Departamento de Geneacutetica UFRJ 21944-970 Rio de Janeiro Brasil(2) Departamento de Bioquiacutemica Instituto de Quiacutemica UFRJ 21944-970 Rio de Janeiro Brasil Genomic se- quences were also analyzed in the FGENESH gene structure pre- diction program (httpwwwsoftberrycom) (Solovyev 2001) and GeneMark program (http Incomplete gene structure prediction with almost 100 specificity SL Chin J Xiong T Ioerger SH Sze - 2004 - txspacetamuedu iv Fgenesh-C The proposed gene structure prediction algorithm by far has the best 25 12 Comparison against Est2Genome Sim4 Spidey Fgenesh-c Journal of Biotechnology 109 (2004) 217ndash226

Preparation of single rice chromosome for construction of a DNA library using a laser microbeam trap

Xiaohui Liu a1 Haowei Wang b1 Yinmei Li b Yesheng Tang a Yilei Liu a Xin u a Peixin Jia a Kai Ying a Qi Feng a Jianping Guan a Chaoqing Jin a Lei Zhang a Liren Lou b Zhuan Zhou c Bin Han aA National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200233 China B Department of Physics University of Science and Technology of China Hefei 230026 China C Institute of Neuroscience Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200031 China Corresponding author Fax +86-21-64825775 E-mail address bhanncgraccn (B Han) ers These sequences were further annotated using gene-prediction software FGENESH to give the pos- sible protein-coding region Science 303 1364-1367 Medicago truncatula DMI1 Required for Bacterial and Fungal Symbioses in Legumes Ane et al (2004) 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberry phtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Proc Natl Acad Sci U S A 2004 August 24 101(34) 12404ndash12410 Inaugural Articles Rapid recent growth and divergence of rice nuclear genomes Jianxin Ma and Jeffrey L Bennetzen

Department of Genetics University of Georgia Athens GA 30602 To whom correspondence should be addressed E-mail maizeugaedu Contributed by Jeffrey L Bennetzen May 25 2004 Almost all LTR-retrotransposons including solo LTRs identified in our studies were predicted as genes by the gene-finding program fgenesh (data not shown) The Plant Journal Volume 37 Issue 4 Page 517 -527 - February 2004 doi101046j1365-313X200301976x Xa26 a gene conferring resistance to Xanthomonas oryzae pv oryzae in rice encodes an LRR receptor kinase-like protein Xinli Sun Yinglong Cao Zhifen Yang Caiguo Xu Xianghua Li Shiping Wang and Qifa Zhang National Key Laboratory of Crop Genetic Improvement National Center of Crop Molecular Breeding Huazhong Agricultural University Wuhan 430070 China For correspondence (fax +86 27 87287092 e-mail swangmailhzaueducn) al 1997) Gene prediction programs used were genscan (Burge and Karlin 1997) and fgenesh (httpwwwsoftberrycom) Promoter Genome Research 141916ndash1923 copy2004 by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonˇovaґ16 Jinsheng Lai16 Jianxin Ma23 Wusirika Ramakrishna24

Victor Llaca15 Jeffrey L Bennetzen23 and Joachim Messing17

1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854 USA

2Department of Biological Sciences and Genetics Program West Lafayette Indiana 47907 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602 USA 4Department of Biological Sciences Michigan Tech University MI 49931 USA 5Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880 USA 6These authors contributed equally to this work 7Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 Page 1 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonovaacute 16 Jinsheng Lai 16 Jianxin Ma 23 Wusirika Ramakrishna 24 TAG Theoretical and Applied Genetics DOI 101007s00122-004-1667-z Issue Volume 109 Number 3 Date August 2004 Pages 515 - 522

Physical mapping and putative candidate gene identification of a quantitative trait locus Ctb1 for cold tolerance at the booting stage of rice K Saito1 Y Hayano-Saito1 W Maruyama-Funatsuki1 Y Sato1 and A Kato1

(1) National Agricultural Research Center for Hokkaido Region Hitsujigaoka 1 Toyohira Sapporo Hokkaido 062-8555 JapanK Saito Email kjsaitoaffrcgojp GENSCAN RICEHMM FGENESH MZEF ) a splice prediction program ( SPLICEPREDIC- TOR ) homology search analysis programs ( BLAST HMMER TAG Theoretical and Applied Genetics DOI 101007s00122-004-1697-6 Issue Volume 109 Number 4 Date August 2004 Pages 690 - 699 The anthracnose resistance locus Co-4 of common bean is located on chromosome 3 and contains putative disease resistance-related genes M Melotto1 4 M F Coelho1 A Pedrosa-Harand2 J D Kelly3 and L E A Camargo1

1 Departamento de Fitopatologia Laboratoacuterio de Geneacutetica Molecular ESALQ Universidade de Satildeo Paulo Piracicaba SP CP 9 13418-900 Brazil

2 Department of Cell Biology and Genetics Institute of Botany University of Vienna Rennweg 14 Vienna 1030 Austria

3 Department of Crop and Soil Sciences Michigan State University East Lansing MI 48824 USA 4 Present address MSU-DOE Plant Research Laboratory Michigan State University 206 Plant Biology

Building East Lansing MI 48824 USA M Melotto Email melottommsuedu and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (httpwww softberrycom)mdashusing Arabidopsis as the model or- ganism Journal of Genetics Vol 83 No 1 P 79-99 April 2004 Structural and functional analysis of rice genome Tyagi A K Khurana J P Khurana P Raghuvanshi S Gaur A Kapur A Gupta V Kumar D Ravi V Vij S Khurana P and Sharma S Department of Plant Molecular Biology University of Delhi South Campus Benito Juarez Road New Delhi 110 021 India It inte- grates results from several gene prediction software such as GENSCAN (Burge and Karlin 1997) FGENESH (Sala- mov and Solovyev 2000) RiceHMM (Sakata The Plant Cell 161220-1234 (2004) Comparative Analysis of the Receptor-Like Kinase Family in Arabidopsis and Rice Shin-Han Shiua Wojciech M Karlowskib Runsun Panad Yun-Huei Tzengac Klaus F X Mayerb and Wen-Hsiung Lia1 a Department of Ecology and Evolution University of Chicago Chicago Illinois 60637 b Munich Information Center for Protein SequencesInstitute of Bioinformatics GSF National Research Center for Environment and Health Neuherberg 85764 Germany c Department of Mathematics National Tsing Hua University Hsinchu Taiwan 300 d Institute of Information Science Academia Sinica Taiwan 115

1 To whom correspondence should be addressed E-mail whliuchicagoedu fax 773-702-9740 a permissive E value cutoff of 1 The rice genes from the indica subspecies was predicted using the whole genome shotgun assembly with FGENESH (Solovyev 2002 Genome Research 141474ndash1482 (2004) copy by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Incongruent Patterns of Local and Global Genome Size Evolution in Cotton Corrinne E Grover1 HyeRan Kim2 Rod A Wing2 Andrew H Paterson3 and Jonathan F Wendel14 1Department of Ecology Evolution and Organismal Biology Iowa State University Ames Iowa 50011 USA 2Arizona Genomics Institute University of Arizona Tucson Arizona 85721 USA 3Plant Genome Mapping Laboratory University of Georgia Athens Georgia 30602 USA hellip Potential genes were predicted by three independent programs FGENESH (httpwwwsoftberrycom) Plant Physiology May 2004 Vol 135 pp 459-470 Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four Triticeae Genomes Yong Qiang Gu Devin Coleman-Derr Xiuying Kong and Olin D Anderson United States Department of Agriculture-Agricultural Research Service Western Regional Research Center Albany California 94710 (YQG DC-D ODA) and Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081 China (XK) FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology October 2004 Vol 136 pp 3177-3190 Comparative Sequence Analysis of the Region Harboring the Hardness Locus in Barley and Its Colinear Region in Rice1

Katherine S Caldwell2 Peter Langridge and Wayne Powell Scottish Crop Research Institute Invergowrie Dundee DD2 5DA United Kingdom (KSC WP) and School of Agriculture and Wine (KSC PL) and Australian Centre for Plant Functional Genomics (PL) University of Adelaide Waite Campus Glen Osmond South Australia 5064 Australia jp Sakata et al 2002 ) which couples the integration of several programs for the prediction of open reading frames (GENSCAN RiceHMM FGENESH MZEF) with GENES amp DEVELOPMENT 18687-699 2004 pyramus and thisbe FGF genes that pattern the mesoderm of Drosophila embryos

Angelike Stathopoulos1 Bergin Tam1 Matthew Ronshaugen1 Manfred Frasch2 and Michael Levine13 1 Department of Molecular and Cell Biology Division of Genetics amp Development University of California Berkeley California 94720-3204 USA 2 Brookdale Department of Molecular Cell and Developmental Biology Mount Sinai School of Medicine New York New York 10029 USA hellip FGF protein sequences used in alignment and phylogenetic reconstruction were gathered from GenBank or inferred from genomic sequence using GENESCAN (Burge and Karlin 1997 ) and FGENESHhellip Genome Research 141888-1901 2004 Organization and Evolution of a Gene-Rich Region of the Mouse Genome A 127-Mb Region Deleted in the Del(13)Svea36H Mouse Ann-Marie Mallon14 Laurens Wilming24 Joseph Weekes1 James GR Gilbert2 Jennifer Ashurst2 Sandrine Peyrefitte2 Lucy Matthews2 Matthew Cadman1 Richard McKeone1 Chris A Sellick1 Ruth Arkell1 Marc RM Botcherby3 Mark A Strivens1 R Duncan Campbell3 Simon Gregory25 Paul Denny1 John M Hancock16 Jane Rogers2 and Steve DM Brown1 1 Medical Research Council Mammalian Genetics Unit Harwell Oxfordshire United Kingdom 2 Wellcome Trust Sanger Institute Hinxton Genome Campus United Kingdom 3 Medical Research Council Rosalind Franklin Centre for Genomics Research Hinxton Genome Campus United Kingdom hellip Ab initio gene structures were predicted using FGENESH (Salamov and Solovyev 2000 ) and GENSCANhellip Current Proteomics January 2004 vol 1 no 1 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE1 Affiliations 1 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom

sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene Nucleic Acids Research 2003 Vol 31 No 4 1148-1155 Characterization of Arabidopsis thaliana ortholog of the human breast cancer susceptibility gene 1 AtBRCA1 strongly induced by gamma rays S Lafarge and M-H Montaneacute CEA Cadarache DSV-DEVM Laboratoire de Radiobiologie Veacutegeacutetale Bat 185 F-13108 St Paul Lez Durance Cedex France To whom correspondence should be addressed Tel +33 4 42 25 35 56 Fax + 33 4 42 25 26 25 Email marie-helenemontaneceafr Received November 18 2002 Accepted December 5 2002 DDBJEMBLGenBank accession no AF515728 hellipGene structure prediction was done on software implemented on the Softberry web page (httpwwwsoftberrycom) analysis of protein domains using the SMARThellip hellipThe gene structure of At4g21070 was determined with three gene structure prediction software packages (Softberry GenScan Grail) hellip To resolve this ambiguity in intronndashexon prediction we postulated the presence of two genes given by Softberry prediction software and performed northern blotting and 5 RACE to characterize the structural organization of the At4g21070 locushellip Proc Natl Acad Sci U S A 2003 July 22 100(15) 9055ndash9060 doi 101073pnas1032999100 Plant Biology Gene expression of a gene family in maize based on noncollinear haplotypes Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway NJ 08854-8020 Communicated by Brian A Larkins University of Arizona Tucson AZ May 19 2003 (received for review 2002 April 10) To whom correspondence should be addressed E-mail messingmbclrutgersedu The FGENESH program (Softberry Mount Kisco NY) was used for gene prediction analysis BMC Genomics 2003 4 22

doi 1011861471-2164-4-22 Published online 2003 June 3 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams 12 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA 3Department of Biological Sciences Wichita State University Wichita Kansas USA

Corresponding author Sreedhar Oduru odurusreedharttuhscedu Janee L Campbell janeecampbellttuhscedu SriTulasi Karri phrskttuhscedu William J Hendry williamhendrywichitaedu Shafiq A Khan shafiqkhanttuhscedu Simon C Williams simonwilliamsttuhscedu

Two gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=H Insect Molecular Biology Volume 12 Issue 4 Page 319 - August 2003 doi101046j1365-2583200300415x Expression of an Aedes aegypti cation-chloride cotransporter and its Drosophila homologues V Filippov K Aimanova and S S GillAffiliations Department of Cell Biology and Neuroscience University of California Riverside USA Correspondence Sarjeet S Gill 5429 Boyce Hall Environmental Toxicology Graduate Program University of California Riverside CA 92521 USA Tel +1 909 787 4621 Fax +1 909 787 3087 E-mail Sarjeetgillucredu significant similarity to the Drosophila genes were used for gene structure prediction with the FGENESH program available on site http wwwsoftberrycom Developmental Biology 256 (2003) 276ndash289 tcl-2 encodes a novel protein that acts synergistically with Wnt signaling pathways in C elegans Xiaojun Zhaoa Hitoshi Sawab and Michael A Hermana a Program in Molecular Cellular and Developmental Biology Division of Biology Kansas State University Manhattan KS 66506 USA b Laboratory for Cell Fate Decision RIKEN Center for Developmental Biology 2-2-3 Minatojima-minamimachi Chuo-ku Kobe 650-0047 Japan Received for publication 10 September 2002 revised 25 November 2002 accepted 19 December 2002 hellipCbTCL-2 is conceptually translated from a gene predicted by the FGENSH (Salamov and Solovyev 2000 httpwwwsoftberrycom) using defaults for C elegans genomic sequences Proc Natl Acad Sci U S A 2003 May 27 100(11) 6569ndash6574 doi 101073pnas0732024100 Evolution Molecular paleontology of transposable elements in the Drosophila melanogaster genome Vladimir V Kapitonov and Jerzy Jurka

Genetic Information Research Institute 2081 Landings Drive Mountain View CA 94043 Communicated by Margaret G Kidwell University of Arizona Tucson AZ April 7 2003 (received for review 2002 December 23)

To whom correspondence may be addressed E-mail vladimirulamgirinstorg or jurkagirinstorg hellipWe used FGENESH (ref 18 wwwsoftberrycom) for identifying genes encoded by TEs Genetics and Molecular Biology ISSN 1415-4757 versioacuten impresa Genet Mol Biol v26 n4 Satildeo Paulo dic 2003 Iron homeostasis related genes in rice Jeferson GrossI II Ricardo Joseacute SteinII Arthur Germano Fett-NetoI II Janette Palma FettI II

IUniversidade Federal do Rio Grande do Sul Centro de Biotecnologia Porto Alegre RS Brazil IIUniversidade Federal do Rio Grande do Sul Departamento de Botacircnica Porto Alegre RS Brazil IIIBotanical Institute of Ludwig-Maximilians-Universitaumlt Muumlnchen Germany The prediction algorithms were GenScan (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) GenomeScan (Burge and Karlin 1997 httpgenesmitedugenomescanhtml) FGENESH (Salamov and Solovyev 2000 httpwwwsoftberrycomberryphtmltopic= gfind) GeneMarkhmm (Borodovsky and Lukashin unpublished httpopalbiologygatecheduGeneMarkeukhmmcgi) and GrailEXP (Xu and Uberbacher 1997 httpcompbioornlgovgrailexp) Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome J Lai N Dey CS Kim AK Bharti S Rudd KFX Mayer hellip for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for monocotyledonous genes (httpwwwsoftberrycom) Published 23 September 2003 BMC Plant Biology 2003 36 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya12 Address 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia and 2NSW Agricultural Genomics Centre Wagga Wagga Australia Email Qian-Hao Zhu - qianhaozhucsiroau Mohammad Shamsul Hoque - mohammadhoquecsiroau Elizabeth S Dennis - lizdenniscsiroau Narayana M Upadhyaya - narayanaupadhyayacsiroau Corresponding author hellip Analyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCANhellip

The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Australasian Plant Pathology Volume 32 Number 4 2003 pp 511-519 Small scale functional genomics of the blackleg fungus Leptosphaeria maculans analysis of a 38 kb region Alexander Idnurm Janet L Taylor M Soledade C Pedras and Barbara J Howlett vertebrate and Arabidopsis settings Burge and Karlin 1997) and FGENESH on Neurospora crassa and Schizosaccharomyces pombe settings (wwwsoftberrycom) as Barley Genetics Newsletter Volume 32 Hard-copy edition pages 34 - 37 MAPPING AND SEQUENCING OF THE BARLEY PUTATIVE HYPERSENSITIVE INDUCED REACTION GENES Nils Rostoks1 David Kudrna1 and Andris Kleinhofs12

1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 2 School of Molecular Biosciences Washington State University Pullman WA 99164 The full length coding sequence was reconstructed using a combination of FGENESH gene prediction program (httpwwwsoftberrycom) and alignment with cDNAs from the other barley HIR groups TAG Theoretical and Applied Genetics DOI 101007s00294-003-0391-6 Issue Volume 43 Number 5 Date August 2003 Pages 351 - 357 Characterisation of the mating-type locus of the plant pathogenic ascomycete Leptosphaeria maculans Anton J Cozijnsen A1 and Barbara J Howlett A1 A1 School of Botany The University of Melbourne 3010 Victoria Australia hellipGenes introns exons and transcription initiation sites were predicted by analysis with FGENESH (wwwsoftberrycom) on Neurospora crassa andhellip BMC Plant Biol 2003 3 6

doi 1011861471-2229-3-6 Published online 2003 September 23 Received May 23 2003 Accepted September 23 2003 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya 12 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia 2NSW Agricultural Genomics Centre Wagga Wagga Australia

Corresponding author Qian-Hao Zhu qianhaozhucsiroau Mohammad Shamsul Hoque mohammadhoquecsiroau Elizabeth S Dennis lizdenniscsiroau Narayana M Upadhyaya narayanaupadhyayacsiroau hellipAnalyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml identified a single-exon gene capable of encoding a protein with the DNA binding domain of the EREBPAP2 family of plant transcription factors [2636] 1515 bp downstream from the Ds insertion The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Alignment of EREBPAP2 domains was performed using programs of Genetics Computer Group Wisconsin software suit [11] Genetics Vol 164 655-664 June 2003 Copyright copy 2003 Map-Based Cloning of Leaf Rust Resistance Gene Lr21 From the Large and Polyploid Genome of Bread Wheat Li Huanga Steven A Brooksa Wanlong Lia John P Fellersb Harold N Tricka and Bikram S Gilla a Wheat Genetics Resource Center Department of Plant Pathology Kansas State University Manhattan Kansas 66506-5502 b USDA-ARS Plant Science and Entomology Unit Kansas State University Manhattan Kansas 66506-5502 Corresponding author Bikram S Gill 4024 Throckmorton Kansas State University Manhattan KS 66506-5502 bsgksuedu (E-mail hellipIn addition FGENSH 11 (httpwwwsoftberrycom) was used for gene prediction (with monocot genomic DNA parameters) Nucleic Acids Research 2003 Vol 31 No 1 229-233 The TIGR rice genome annotation resource annotating the rice genome and creating resources for plant biologists Qiaoping Yuan Shu Ouyang Jia Liu Bernard Suh Foo Cheung Razvan Sultana Dan Lee John Quackenbush and C Robin Buell The Institute for Genomic Research 9712 Medical Center Dr Rockville MD 20850 USA To whom correspondence should be addressed Tel +1 301 8383558 Fax +1 301 8380208 Email rbuelltigrorg Received August 14 2002 Revised and Accepted October 2 2002 hellipThe rice sequences were processed with multiple ab initio gene finders including FGENESH (httpwwwsoftberrycom)hellip hellip Working models were generated using the FGENESH output and putative identification for the gene was obtained from the most significant database match while models with no significant database match were labeled as hypothetical proteins

JXB Advance Access originally published online on June 18 2003 Journal of Experimental Botany Vol 54 No 389 pp 1995-1996 August 1 2003 Received 21 April 2003 Accepted 25 April 2003 OsSET1 a novel SET-domain-containing gene from rice Yun-Kuan Liang Ying Wang Yong Zhang Song-Gang Li Xiao-Chun Lu Hong Li Cheng Zou Zhi-Hong Xu and Shu-Nong Bai PKU-Yale Joint Research Center of Agricultural and Plant Molecular Biology National Key Laboratory of Protein Engineering and Plant Gene Engineering College of Life Sciences Peking University 5 Yiheyuan Road Beijing 100871 PR China To whom correspondence should be addressed Fax +86 10 6275 1526 E-mail shunongbpkueducn It localizes at chromosome three in rice genome at the contig 1300 (httpwwwsoftberrycomberryphtmltopic=gfindampprg=FGENESH GenBank accession number BMC Genomics 2003 422 Published 3 June 2003 Received 31 January 2003 Accepted 3 June 2003 This article is available from httpwwwbiomedcentralcom1471-2164422 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams12 Address 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA and 3Department of Biological Sciences Wichita State University Wichita Kansas USA Email Sreedhar Oduru - odurusreedharttuhscedu Janee L Campbell - janeecampbellttuhscedu SriTulasi Karri - phrskttuhscedu William J Hendry - williamhendrywichitaedu Shafiq A Khan - shafiqkhanttuhscedu Simon C Williams - simonwilliamsttuhscedu Corresponding author hellipTwo gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=HsapiensTAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Characterization of soybean genomic features by analysis of its expressed sequence tags

Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

Jin-Song Zhang Email jszhanggeneticsaccn

Shou-Yi Chen Email sychengeneticsaccn Phone +86-10-64886859 Fax +86-10-64873428

(1) Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

(2) Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China

prediction of these BAC-contig sequences was based on the gene-prediction program FGENSH (Arabidopsis matchFGENESH DDT Vol 7 No 11 (Suppl) 2002 S70-S76 wwwdrugdiscoverytodaycom Genome annotation techniques new approaches and challenges Alistair G Rust Emmanuel Mongin and Ewan Birney European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridge UK CB10 1SD tel +44 1223 494420 fax +44 1223 494468 e-mail birneyebiacuk Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Box 1 Useful human genome annotation and browser URLs Human genome browsers bull UCSC Human Genome Browser httpgenomecseucsceducgi-binhgGateway bull Softberry Genome Explorer httpwwwsoftberrycomberryphtmltopic=genomexp Ab initio gene prediction programs Ab initio gene predictors rely on the statistical qualities of exons rather than on homologies Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Celerarsquos pipeline) and GrailEXP [16] (ORNL) Proc Natl Acad Sci U S A 2002 August 20 99(17) 11423ndash11428 doi 101073pnas162276199 Neurobiology Identification of G protein-coupled receptors for Drosophila PRXamide peptides CCAP corazonin and AKH supports a theory of ligand-receptor coevolution Yoonseong Parkdagger Young-Joon Kim and Michael E AdamsdaggerDagger

Departments of Entomology and daggerCell Biology and Neuroscience 5429 Boyce Hall University of California Riverside CA 92521 Edited by Lutz Birnbaumer National Institutes of Health Research Triangle Park NC and approved June 14 2002 (received for review 2002 May 7) DaggerTo whom reprint requests should be addressed E-mail adamsmailucredu

This article has been corrected See Proc Natl Acad Sci U S A 2002 October 15 99(21) 13961b For each Drosophila GPCR prediction of gene structure was made in FGENESH (wwwsoftberrycom ref 21) by using about 20 kb of genomic sequence surrounding highly conserved regions particularly for 5 prime and 3 prime ends of ORFs Putative Drosophila GPCRs in the database were amplified by RT-PCR using primers based on gene predictions in the FGENESH gene finder (wwwsoftberrycom ref 21) 21 Salamov A A amp Solovyev V V (2000) Genome Res 10 516-522

prediction httpwwwsoftberrycomberry

Eukaryotic Cell October 2002 p 719-724 Vol 1 No 5 Isocitrate Lyase Is Essential for Pathogenicity of the Fungus Leptosphaeria maculans to Canola (Brassica napus) Alexander Idnurm and Barbara J Howlett School of Botany The University of Melbourne Melbourne Victoria 3010 Australia Received 17 June 2002 Accepted 29 July 2002 hellip The DNA sequence obtained was compared to those in the GenBank database by using BLAST (1) and genes were predicted by using FGENESH software (httpwwwsoftberrycom) and GENSCAN (wwwbionavigatorcom) Bio-Almanac GLE ToxExpress P Offering CLG Annotated sequence data The genes are identified with the FGENESH11 gene modeling software exclusively li- censed from Softberry Inc Automatic Analysis of 106 kb of contiguous DNA sequence from the D genome of wheat reveals high gene density hellip SA Brooks L Huang BS Gill JP Fellers trix In addition FGENESH 11 (httpwwwsoftberrycom) was used for CDS prediction with monocot genomic DNA parameters Both Molecular Genetics and Genomics DOI 101007s00438-002-0706-1 Issue Volume 267 Number 6 Date August 2002 Pages 713 - 720 Genome sequencing of a 239-kb region of rice chromosome 10L reveals a high frequency of gene duplication and a large chloroplast DNA insertion Q Yuan J Hill J Hsiao K Moffat S Ouyang Z Cheng J Jiang C Buell A1 The Institute for Genomic Research 9712 Medical Center Drive Rockville MD 20850 USA A2 Department of Horticulture University of Wisconsin Madison WI 53706 USA The sequences were analyzed with several gene prediction programs including FGENESH (httpwwwsoftberrycom) Genemarkhmm (rice matrix httpopalbiology

Genetics Vol 162 1389-1400 November 2002 Copyright copy 2002 Different Types and Rates of Genome Evolution Detected by Comparative Sequence Analysis of Orthologous Segments From Four Cereal Genomes Wusirika Ramakrishnaa Jorge Dubcovskyb Yong-Jin Park1a Carlos Bussob John Embertona Phillip SanMiguelc and Jeffrey L Bennetzena a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Department of Agronomy and Range Science University of California Davis California 95616 c Purdue University Genomics Core WSLR Purdue University West Lafayette Indiana 47907

Corresponding author Jeffrey L Bennetzen Hansen Bldg Purdue University West Lafayette IN 47907 maizebilbobiopurdueedu (E-mail)

FGENESH (httpwwwsoftberrycomnucleohtml) with the maize training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and

GeneMarkhmm (httpgenemarkbiologygatecheduGene Mark) Functional amp Integrative Genomics DOI 101007s10142-002-0055-5 Issue Volume 2 Numbers 1-2 Date May 2002 Pages 51 - 59 Genomic sequencing reveals gene content genomic organization and recombination relationships in barley Nils Rostoks Yong-Jin Park Wusirika Ramakrishna Jianxin Ma Arnis Druka Bryan A Shiloff Phillip J SanMiguel Zeyu Jiang Robert Brueggeman Devinder Sandhu Kulvinder Gill Jeffrey L Bennetzen Andris Kleinhofs A1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA A2 Department of Biological Sciences Purdue University West Lafayette IN 47907 USA A3 National Center for Genome Resources 2935 Rodeo Park Drive East Santa Fe NM 87505 USA A4 G302 Agronomy Hall Iowa State University Ames IA 50011-1010 USA A5 Department of Agronomy University of Nebraska Lincoln NE 68583 USA A6 School of Molecular Biosciences and Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA version 10 with maize parameters The FGENESH predictions were run at httpwwwsoftberrycom BAC genomic regions were defined Structural organization of the barley D-hordein locus in comparison with its orthologous regions of hellip YQ Gu OD Anderson CF Londeore X Kong RN hellip et al 1997) to search for additional genes In addition FGENESH (httpwwwsoftberrycomberryphtml) and GENESCAN (httpgenes

Published online before print June 20 2002 101073pnas142284999 PNAS | July 9 2002 | vol 99 | no 14 | 9328-9333 The barley stem rust-resistance gene Rpg1 is a novel disease-resistance gene with homology to receptor kinases R Brueggeman N Rostoks D Kudrna A Kilian F Han J Chensect A Druka B Steffensonpara and A Kleinhofs Department of Crop and Soil Sciences Washington State University Pullman WA 99164-6420 para Department of Plant Pathology 495 Borlaug Hall 1991 Upper Buford Circle St Paul MN 55108-6030 and School of Molecular Biosciences Washington State University Pullman WA 99164-4234 Communicated by Diter von Wettstein Washington State University Pullman WA May 13 2002 (received for review March 25 2002) The gene prediction programs GENSCAN (httpgenesmiteduGENSCANhtml) and FGENESH (httpwwwsoftberrycom) as well as NEURAL NETWORK PROMOTER

PREDICTION (httpwwwfruitflyorgseq_toolspromoterhtml) localized the putative transcription start site of the gene about 400 bp upstream of the translation start site Plant Physiol 2002 December 130(4) 1626ndash1635 doi 101104pp012179 Received July 30 2002 Accepted October 1 2002 Contiguous Genomic DNA Sequence Comprising the 19-kD Zein Gene Family from Maize1

Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway New Jersey 08854ndash8020 Corresponding author e-mail messingmbclrutgersedu fax 732ndash445ndash0072 Draft sequences generated from high-throughput DNA sequencing (phase II) were subjected to gene prediction programs with FGENESH (Softberry Inc Mount Kisco NY) The Plant Cell Vol 14 3213-3223 December 2002 Copyright copy 2002 Received July 22 2002 accepted September 26 2002 Structural Analysis of the Maize Rp1 Complex Reveals Numerous Sites and Unexpected Mechanisms of Local Rearrangement Wusirika Ramakrishnaa John Embertona Matthew Ogdena Phillip SanMiguelb and Jeffrey L Bennetzen1a a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 1 To whom correspondence should be addressed E-mail maizebilbobiopurdueedu fax 765-496-1496 FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCAN Plant Physiol 2002 December 130(4) 1728ndash1738

doi 101104pp014951 Comparative Sequence Analysis of the Sorghum Rph Region and the Maize Rp1 Resistance Gene Complex Wusirika Ramakrishna John Emberton Phillip SanMiguel Matthew Ogden Victor Llaca Joachim Messing and Jeffrey L Bennetzen

Department of Biological Sciences Purdue University West Lafayette Indiana 47907 (WR JE MO JLB) Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 (PSM) and Waksman Institute Rutgers University Piscataway New Jersey 08854 (VL JM) Corresponding author e-mail maizebilbobiopurdueedu fax 765ndash496ndash1496 Received September 19 2002 Accepted October 8 2002 hellipAnnotation and sequence analysis were performed as described earlier (Dubcovsky et al 2001 Song et al 2001 Ramakrishna et al 2002a ) FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and GeneMarkhmm (httpopalbiologygatecheduGeneMarkeukhmmcgi) Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four hellipYQ Gu D Coleman-Derr X Kong OD Anderson FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology 2004 - plantphysiolorg - plantphysiolorg - intlplantphysiolorg - ncbinlmnihgov - all 5 versions raquoA Genome-Wide Screen Identifies Genes Required for Centromeric Cohesion JJ Doyle J Denarie F Debelle JC Prome BB Amor hellip 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberryphtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Various programs Plant Molecular Biology Issue Volume 58 Number 3 Date June 2005 Pages 421 ndash 433 DOI 101007s11103-005-5702-5 OsPPR1 a pentatricopeptide repeat protein of rice is essential for the chloroplast biogenesis Kodiveri M Gothandam1 Eun-Sook Kim1 Hongjoo Cho1 and Yong-Yoon Chung1

(1) School of Life Sciences and Biotechnology Korea University Sungbuk-ku 136-701 Seoul Anam-Dong Korea nucleotide and amino acid sequences were analyzed by the Basic Local Alignment SearchTool (BLAST) and the Soft berry prog- rame (httpwwwsoftberrycom)

Plant Physiol February 2002 Vol 128 pp 336-340 wwwplantphysiolorgcgidoi101104pp010875 Received September 25 2001 returned for revision September 29 2001 accepted November 2 2001 Cellulose Synthase-Like Genes of Rice1

Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 The Rice Genome Research Program cDNA clones were of high quality all but one were viable and accurately annotated The one exception D22177 was chimeric containing OsCSLA2 at one end and a predicted DNA-binding protein at the other For all sequences the corresponding

proteins were deduced using gene prediction software from GeneMark (Atlanta httpopalbiologygatecheduGeneMark) and Softberry Inc (White Plains NY httpwwwsoftberrycom) and by manual alignment with the Arabidopsis Csl proteins and with each other IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated Silby et al Microbiology2004 150 518-520 MW Silby PB Rainey SB Levy Using SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of Plant Cell 2002 September 14(9) 2107ndash2119 Received March 25 2002 Accepted June 14 2002 Two Novel Fungal Virulence Genes Specifically Expressed in Appressoria of the Rice Blast Fungus

Chaoyang Xuea Gyungsoon Parka Woobong Choib Li Zhengc Ralph A Deanb and Jin-Rong Xua1

aDepartment of Botany and Plant Pathology Purdue University West Lafayette Indiana 47907 bDepartment of Plant Pathology North Carolina State University Raleigh North Carolina 27606 cSyngenta Agribusiness Biotechnology Research Inc Research Triangle Park North Carolina 27709 1To whom correspondence should be addressed E-mail xubtnypurdueedu fax 765-494-0363 Approximately 12- and 14-kb upstream sequences of GAS1 and GAS2 were sequenced and analyzed with several programs including TRES (wwwbioportalbicnusedusgtres) Expasy (wwwexpasyorg) and SoftBerry (wwwsoftberrycom) European Journal of Neuroscience January 2002 vol 15 no 1 pp 79-86(8) Characterizing CGI-94 (comparative gene identification-94) which is down-regulated in the hippocampus of early stage Alzheimers disease brain Heese K Nakayama T Hata R Masumura M Akatsu H Li F Nagai Y Yamamoto T Kosaka K Suemoto T Sawada T Additionally protein sequence analysis was performed using the following programs at ExPASy httpwwwexpasych softberry httpwwwsoftberrycomindex Journal of Cellular Biochemistry Volume 91 Issue 5 Pages 1030 - 1042 Published Online 25 Feb 2004 Copyright copy 2004 Wiley-Liss Inc A Wiley Company Received 4 September 2003 Accepted 21 November 2003 Characterizing the new transcription regulator protein p60TRP K Heese 1 T Yamada 1 H Akatsu 2 T Yamamoto 2 K Kosaka 2 Y Nagai 1 T Sawada 1

1BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan 2Choju Medical Institute Fukushimura Hospital 19-14 Aza-Yamanaka Noyori Toyohashi Aichi 441-8124 Japan email K Heese (heeseksilverocnnejp) Correspondence to K Heese BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan expasych) softberry httpwwwsoftberry comindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorg toolsaacomp Genome Research 14929-933 2004 ISSN 1088-9051 $500 The Ensembl Core Software Libraries Arne Stabenau1 Graham McVicker1 Craig Melsopp1 Glenn Proctor1 Michele Clamp2 and Ewan Birney13 1 EMBL European Bioinformatics Institute Wellcome Trust Genome Campus Hinxton CB10 1SD UK 2 The Broad Institute Cambridge Massachusetts 02141-2023 USA

2003 ) Finally there have been several commercial genome management products based on proprietary technology from Softberry Celera and Doubletwist Generation of T-DNA tagging lines with a bidirectional gene trap vector 2 and the establishment of hellip G An annotated in the public databases we undertook 274 annotation with the Softberry program (http 275 wwwsoftberrycomberryphtml) Functional clas- 276 Journal of Bacteriology January 2002 p 183-190 Vol 184 No 1 0021-919301$0400+0 DOI 101128JB1841183-1902002 Received 25 July 2001 Accepted 11 October 2001 Regulation of the acuF Gene Encoding Phosphoenolpyruvate Carboxykinase in the Filamentous Fungus Aspergillus nidulans Michael J Hynes Oliver W Draht and Meryl A Davis Department of Genetics University of Melbourne Parkville Victoria 3010 Australia The Protein Sequence Analysis program (httpwwwsoftberrycomproteinhtml) predicted a PEPCK (ATP) signature sequence between amino acids 275 and 290 Proteins Structure Function and GeneticsVolume 53 Issue S6 Pages 352 - 368 Supplement Fifth Meeting on the Critical Assessment of Techniques for Protein Structure Prediction Published Online 15 Oct 2003 Received 4 March 2003 Accepted 23 June 2003 Digital Object Identifier (DOI) 101002prot10543 Comp Modeling Assessment Assessment of homology-based predictions in CASP5 Anna Tramontano 1 Veronica Morea 2

1Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Rome Italy 2CNR Institute of Molecular Biology and Pathology University of Rome La Sapienza Rome Italy email Anna Tramontano (AnnaTramontanouniromalit) Correspondence to Anna Tramontano Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Ple Aldo Moro 5-00185 Rome Italy hellipPage 1 Assessment of Homology-Based Predictions in CASP5 Anna Tramontano 1 and Veronica Morea 2 1 Department of Biochemical Sciences American Journal of HematologyVolume 73 Issue 3 Pages 161 - 168 Published Online 20 Jun 2003 Received 23 October 2002 Accepted 15 April 2003 Digital Object Identifier (DOI) 101002ajh10358

Spectrum of thalassemia mutations and HbF levels in the heterozygous Moroccan population Wafaa Lemsaddek 1 Isabel Picanccedilo 2 Filomena Seuanes 2 Lahoucine Mahmal 3 Saacircd Benchekroun 3 Mohammed Khattab 4 Paulo Nogueira 5 Leonor Osoacuterio-Almeida 1

1Laboratoacuterio de Geneacutetica Molecular Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa Caparica Portugal 2Laboratoacuterio de Hematologia Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal 3Service Heacutemato-Oncologie Hocircpital 20 Ao t CHU Ibn Rochd Casablanca Morocco 4Service Heacutemato-Oncologie Peacutediatrique Hocircpital dEnfants CHU Rabat Morocco 5Observatoacuterio Nacional de Sauacutede Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal email Leonor Osoacuterio-Almeida (mlofctunlpt) Correspondence to Leonor Osoacuterio-Almeida Laboratoacuterio de Geneacutetica Molecular Secccedilatildeo Autoacutenoma de Biotecnologia Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa 2829-516 Caparica Portugal hellip Page 1 Spectrum of Thalassemia Mutations and HbF Levels in the Heterozygous Moroccan Population Wafaa Lemsaddek 1 Isabel Picanccedilo Lecture Notes in Computer Science Publisher Springer-Verlag Heidelberg ISSN 0302-9743 Subject Computer Science Volume 2812 2003 Title Algorithms in Bioinformatics Third International Workshop WABI 2003 Budapest Hungary September 15-20 2003 Proceedings Editors Gary Benson Roderic Page ISBN 3-540-20076-2 DOI 101007b13243 Chapter pp 124 - 138 Online Date December 2003 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron1 and Irena Rusu1

(1) IRIN Universiteacute de Nantes 2 Rue de la Houssiniegravere BP 92208 44322 Nantes Cedex 3 France Page 1 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron and Irena Rusu Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Received 23 October 2003 accepted 15 December 2003 Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes

H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Brain Aging Vol 2 No 3 2002 Page 9-22 Long-term Efficacy of Cholinesterase Inhibitors Serge Gauthier McGill Centre for Studies in Aging Quebec Canada Correspondence Dr Serge Gauthier FRCPC McGill Centre for Studies in Aging 6825 LaSalle Boulevard Verdun Quebec Canada H4H 1R3 Tel +1 514 766 2010 Fax +1 514 888 4050 Email sergegauthiermcgillca hellipAdditionally protein sequence analysis was performed using the following programs at the ExPASyndashwwwndashserver (httpwwwexpasych) softberry httpwwwsoftberrycomindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorgtoolsaacomp Plant Physiology December 2003 Vol 133 pp 2040ndash2047 Generation and Analysis of End Sequence Database for T-DNA Tagging Lines in Rice1 Suyoung An2 Sunhee Park2 Dong-Hoon Jeong Dong-Yeon Lee Hong-Gyu Kang Jung-Hwa Yu Junghe Hur Sung-Ryul Kim Young-Hea Kim Miok Lee Soonki Han Soo-Jin Kim Jungwon Yang Eunjoo Kim Soo Jin Wi Hoo Sun Chung Jong-Pil Hong Vitnary Choe Hak-Kyung Lee Jung-Hee Choi Jongmin Nam Seong-Ryong Kim Phun-Bum Park Ky Young Park Woo Taek Kim Sunghwa Choe Chin-Bum Lee and Gynheung An National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790ndash784 Korea (SA SP D-HJ D-YL H-GK J-HY JH S-RK Y-HK ML GA) Department of Life Science Sogang University Seoul 121ndash742 Korea (SH S-JK S-RK) Department of Genetic Engineering Suwon University Suwon 445ndash743 Korea (JY EK P-BP) Department of Biology Sunchon National University Sunchon 540ndash742 Korea (SJW KYP) Department of

Biology Yonsei University Seoul 120ndash749 Korea (HSC J-PH WTK) Department of Biology Seoul National University Seoul 151ndash747 Korea (VC SC) Department of Biology Dong-eui University Pusan 614ndash714 Korea (H-KL J-HC C-BL) and Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University 208 Mueller Laboratory University Park Pennsylvania 16802 (JN) hellipIf a particular sequence had not yet been annotated in the public database the sequence surrounding the insertion site was annotated using the Softberry program (httpwwwsoftberrycom) and the GeneMark program (httpopalbiologygatecheduGeneMark) Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie F Moehrlen ndash Page 1 Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie Frank Moumlhrlen Heidelberg 2002 Page 2 INAUGURAL-DISSERTATION zur heidiubuni-heidelbergde

  • FGENES
  • FGENESH
  • FGENESH++
    • Genome Research 15566-576 2005
    • ECgene Genome-based EST clustering and gene modeling for alternative splicing
    • Genome Research 14685-692 2004
    • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
    • Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes
    • Genome Research
      • Computing Center Academia Sinica Taipei 11529 Taiwan
        • FGENESH+
          • Genome Research 14685-692 2004
          • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
            • Computing Center Academia Sinica Taipei 11529 Taiwan
                • FGENESB
                  • Proteorhodopsin genes are distributed among divergent marine bacterial taxa
                  • Different SAR86 subgroups harbour divergent proteorhodopsins
                    • FGENESV
                      • Genome Organization of the SARS-CoV
                        • FGENES-M
                        • BESTORF
                        • PROTCOMP
                          • A proteomic study of the arabidopsis nuclear matrix
                          • Gene prediction in eukaryota
                            • BPROM
                            • SPLICEDB
                              • The evolving roles of alternative splicing
                              • SpliceDB database of canonical and non-canonical mammalian splice sites
                                • SCAN2
                                  • PromH promoters identification using orthologous genomic sequences
                                    • PDISORDER
                                    • SPL
                                    • NSITE
                                    • TSSP
                                    • PLANTPROM
                                      • Plant promoter prediction with confidence estimation
                                      • PlantProm a database of plant promoter sequences
                                        • PROMH
                                        • Other FGENESH
                                          • TAG Theoretical and Applied Genetics
                                            • MIPS analysis and annotation of proteins from whole genomes
                                              • Gene expression of a gene family in maize based on noncollinear haplotypes
                                              • TAG Theoretical and Applied Genetics
                                              • Genome annotation techniques new approaches and challenges
                                                • Various programs
                                                  • Characterizing the new transcription regulator protein p60TRP
                                                  • The Ensembl Core Software Libraries
                                                  • Lecture Notes in Computer Science
                                                  • Long-term Efficacy of Cholinesterase Inhibitors
Page 10: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)

Center Cornell University Ithaca New York 14853 sectDepartment of Plant Pathology Kansas State University Manhattan Kansas 66506 and daggerdaggerDepartment of Plant Pathology Cornell University Ithaca New York 14853 1Corresponding author Rebecca J Nelson Department of Plant Pathology Cornell University 321 Plant Science Ithaca NY 14853 Email rjn7cornelledu GENSCAN (B URGE and K ARLIN 1997) and FGENESH (S ALAMOV and S OLOVYEV 2001) to predict open reading frames Further searches against Plant Physiology January 2005 Vol 137 pp 176-189 Annotations and Functional Analyses of the Rice WRKY Gene Superfamily Reveal Positive and Negative Regulators of Abscisic Acid Signaling in Aleurone Cells1[w]

Zhen Xie Zhong-Lin Zhang Xiaolu Zou Jie Huang Paul Ruas Daniel Thompson and Qingxi J Shen

2 23

Department of Biological Sciences University of Nevada Las Vegas Nevada 891541 This work was supported by the US Department of Agriculture (grant no 02ndash35301ndash12066) by the National Institutes of Health (Biomedical Research Infrastructure Network seed grant no P20 RR16464) and by the University of Las Vegas Nevada (start-up funds to QJS) XZ was supported by a National Science Foundation Experimental Program to Stimulate Competitive

Research (EPSCoR) Integrative Approaches to Abiotic Stress (EPSndash0132556) graduate assistantship and Z-LZ by a National Science Foundation EPSCoR Advanced Computing in Environmental Sciences postdoctoral fellowship 2 These authors contributed equally to the paper 3 Present address Department of Plant and Microbial Biology University of California Berkeley CA 94720 and Plant Gene Expression Center US Department of Agriculture Albany CA 94710 [w] The online version of this article contains Web-only data First of all three genes (OsWRKY41 -43 and -44) were reannotated using FGENESH (wwwsoftberrycom) because the first introns of these genes were too small PLoS Biol 3(6) e181 (2005) RAG1 Core and V(D)J Recombination Signal Sequences Were Derived from Transib Transposons Vladimir V Kapitonov1 Jerzy Jurka1 1 Genetic Information Research Institute Mountain View California United States of America To whom correspondence should be addressed E-mail vladimirgirinstorg (VVK) E-mail jurkagirinstorg (JJ) Using FGENESH [33] we detected that the RAG1 corendashlike open reading frame (ORF) in the contig 29068 forms a terminal exon (positions 1154ndash2947) of an Plant and Cell Physiology 2005 46(1)3-13 doi101093pcppci503 From Mapping to Sequencing Post-sequencing and Beyond Takuji Sasaki Takashi Matsumoto Baltazar A Antonio and Yoshiaki Nagamura 1

National Institute of Agrobiological Sciences 2-1-2 Kannondai Tsukuba Ibaraki 305-8602 Japan 1 Corresponding author E-mail tsasakiniasaffrcgojp

The gene predictions by programs such as Genescan (Burge and Karlin 1997 ) FGENESH [see Appendix 1 (4)] and Genemark [see Appendix 1 (5)] BLAST (Altschul et Improving the nutritional value of Golden Rice through increased pro-vitamin A content JA Paine CA Shipton S Chaggar RM Howells MJ hellip - Nature Biotechnology 2005 - naturecom Arabidopsis thaliana psy and rice psy (AY024351) genes identified genomic sequences of similarity in which genes were predicted using FGENESH algorithm with Genetics Published Articles Ahead of Print published on January 16 2005 as 101534genetics104035543 THE GENETIC BASIS FOR INFLORESCENCE VARIATION BETWEEN FOXTAIL AND GREEN MILLET (POACEAE) Andrew N Doust Katrien M Devosdagger1 Mike D Gadberry Mike D Galedagger amp Elizabeth A Kellogg University of Missouri-St Louis Department of Biology One University Boulevard St Louis MO 63121 USA daggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH UK Current address University of Georgia-Athens Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building Athens GA 30602 USA 1 Each of these contigs was scanned using FgeneSH (S ALAMOV and S OLOVYEV 2000) and open reading frames (ORFs) were translated and PLoS Biol 3(1) e13 January 2005

Sorghum Genome Sequencing by Methylation Filtration Joseph A Bedell1 Muhammad A Budiman2 Andrew Nunberg1 Robert W Citek1 Dan Robbins1 Joshua Jones2 Elizabeth Flick2 Theresa Rohlfing3 Jason Fries3 Kourtney Bradford3 Jennifer McMenamy3 Michael Smith4 Heather Holeman4 Bruce A Roe5 Graham Wiley5 Ian F Korf6 Pablo D Rabinowicz7 Nathan Lakey8 W Richard McCombie9 Jeffrey A Jeddeloh4 Robert A Martienssen9 1 Bioinformatics Orion Genomics Saint Louis Missouri United States of America 2 Library Construction Orion Genomics Saint Louis Missouri United States of America 3 Sequencing Orion Genomics Saint Louis Missouri United States of America 4 Biomarkers Orion Genomics Saint Louis Missouri United States of America 5 Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma United States of America 6 Genome Center University of California Davis California United States of America 7 The Institute for Genomic Research Rockville Maryland United States of America 8 Business Orion Genomics Saint Louis Missouri United States of America 9 Cold Spring Harbor Laboratory Cold Spring Harbor New York United States of America additional parameters wordmask=seg lcmask M=1 N=ndash1 Q=3 R=3 kap E=1e-10 hspmax=0 To look for potentially novel genes we used Fgenesh (httpwww BMC Genomics 2005 611 doi1011861471-2164-6-11 FAM20 an evolutionarily conserved family of secreted proteins expressed in hematopoietic cells Demet Nalbant1 Hyewon Youn1 3 S Isil Nalbant1 Savitha Sharma1 Everardo Cobos2 3 Elmus G Beale1 Yang Du1 and Simon C Williams1 3 1Department of Cell Biology and Biochemistry Texas Tech University Health Sciences Center

Lubbock Texas 79430 USA 2Department of Internal Medicine Texas Tech University Health Sciences Center Lubbock Texas 79430 USA 3Southwest Cancer Center at University Medical Center Lubbock Texas 79430 USA These results were compared against genes assembled by two gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmit Plant Physiology July 2005 Vol 138 pp 1205-1215 Complex Organization and Evolution of the Tomato Pericentromeric Region at the FER Gene Locus1[w] Romain Guyot Xudong Cheng Yan Su Zhukuan Cheng Edith Schlagenhauf Beat Keller and Hong-Qing Ling

2 2

Institute of Plant Biology University of Zurich 8008 Zurich Switzerland (RG ES BK H-QL) and Institute of Genetics and Developmental Biology Chinese Academy of Sciences Chaoyang District Beijing 100101 China (XC YS ZC H-QL) Putative genes were determined by a combination of coding region prediction software (GENSCAN FGENESH and MZEF with Arabidopsis andor monocot matrix J Gen Virol 86 (2005) 973-983 DOI 101099vir080833-0 Cloning characterization and analysis by RNA interference of various genes of the Chelonus inanitus polydnavirus Marianne Bonvin Dorothee Marti Stefan Wyder Dejan Kojic Marc Annaheim and Beatrice Lanzrein Institute of Cell Biology University of Berne Baltzerstrasse 4 CH-3012 Bern Switzerland Correspondence Beatrice Lanzrein beatricelanzreinizbunibech 12g1forw (5-GAGTCCATGCCGAATGTCAC-3) and 12g1rev (5-CTTCTTGCACAGCGACGAAC-3) were set to amplify the middle region of 12g1 as predicted with FGENESH 10 and The Plant Cell 17343-360 (2005) Evolution of DNA Sequence Nonhomologies among Maize Inbreds Stephan Brunner Kevin Fengler Michele Morgante Scott Tingey and Antoni Rafalski a1 a b a a

a DuPont Crop Genetics Research Wilmington Delaware 19880-353 b Universitaacute degli Studi di Udine Dipartimento di Scienze Agrarie ed Ambientali 33100 Udine Italy 1 To whom correspondence should be addressed E-mail stephanbrunnercgrdupontcom fax 302-695-2726 PNAS | February 1 2005 | vol 102 | no 5 | 1566-1571 Published online before print January 24 2005 101073pnas0409421102 A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome

Mark Yandell Adina M Bailey Sima Misra ShengQiang Shu Colin Wiel Martha Evans-Holm Susan E Celniker and Gerald M Rubin para para Howard Hughes Medical Institute and Department of Molecular and Cell Biology University of California Life Sciences Addition Berkeley CA 94720-3200 and paraDepartment of Genome Sciences Lawrence Berkeley National Laboratory One Cyclotron Road Mailstop 64-121 Berkeley CA 94720 genes based on a microarray-based approach that involved hybridizing randomly primed cDNA against probes corresponding to a large set of FGENESH predictions Insect Molecular Biology Volume 14 Issue 2 Page 113 - 119 April 2005 doi101111j1365-2583200400536x Detection and analysis of alternative splicing in the silkworm by aligning expressed sequence tags with the genomic sequence X-F Zha Q-Y Xia P Zhao J Li J Duan Z-L Wang J-F Qian and Z-H Xiang Correspondence Dr Qing-You Xia The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China Tel +86 23 68250748 fax +86 23 68251128 e-mail xiaqyswaucqcn previously predicted silkworm genes in the genomic sequences by BGF a newly developed program based on GENSCAN (Burge amp Karlin 1997) and Fgenesh (Salamov amp Microbiology 151 (2005) 2199-2207 DOI 101099mic027962-0 Overproduction purification and characterization of FtmPT1 a brevianamide F prenyltransferase from Aspergillus fumigatus Alexander Grundmann and Shu-Ming Li Pharmazeutische Biologie Pharmazeutisches Institut Eberhard-Karls-Universitaumlt Tuumlbingen Auf der Morgenstelle 8 72076 Tuumlbingen Germany Correspondence Shu-Ming Li shumingliuni-tuebingende FGENESH (Softberry Inc httpwwwsoftberrycomberryphtml) and the DNASIS software package (version 21 Hitachi Software Engineering) were used for

FGENESH++ Am J Hum Genet 76652-662 2005 Position Effects Due to Chromosome Breakpoints that Map 900 Kb Upstream and 13 Mb

Downstream of SOX9 in Two Patients with Campomelic Dysplasia

Gopalrao V N Velagaleti12 Gabriel A Bien-Willner3 Jill K Northup1 Lillian H Lockhart2

Judy C Hawkins2 Syed M Jalal6 Marjorie Withers3 James R Lupski345 and

Pawel Stankiewicz3 Departments of 1Pathology and 2Pediatrics University of Texas Medical Branch Galveston

Departments of 3Molecular and Human Genetics and 4Pediatrics Baylor College of Medicine and

5Texas Childrens Hospital Houston and 6Department of Laboratory Medicine and Pathology Mayo Clinic Rochester MN hellipIn an effort to identify possible transcripts that may be responsible for the CD phenotype we

used several gene-prediction programs and identified seven hypothetical transcripts in the region

that spans 100 kb in either direction from the breakpoint on chromosome 17 Ecgenes H17C123061 and H17C123081 SGP genes Chr17_15381 and Ch17_15391 Fgenesh++ gene C17001650 and Genscan genes NT_01064144 and NT_01064145hellip Genome Research 15566-576 2005 ECgene Genome-based EST clustering and gene modeling for alternative splicing Namshin Kim Seokmin Shin and Sanghyuk Lee12 2 13 1 Division of Molecular Life Sciences Ewha Womans University Seoul 120-750 Korea 2 School of Chemistry Seoul National University Seoul 151-747 Korea hellipthe structure of full-length mRNA can be inferred by examining the flanking genomic region especially with the aid of ab initio gene predicting programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh++ (Salamov and Solovyev 2000 )hellip Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA hellip we built a three-way synteny map based on chains of Fgenesh++-predicted (Solovyev 2002 ) exons rather than whole genes hellip Genome Research 14539-548 2004 Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of

Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA hellipThe gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom )hellip Nucleic Acids Research 2003 Vol 31 No 1 207-211 copy 2003 Oxford University Press The PEDANT genome database Dmitrij Frishman Martin Mokrejs Denis Kosykh Gabi Kastenmuumlller Grigory Kolesov Igor Zubrzycki Christian Gruber Birgitta Geier Andreas Kaps Kaj Albermann Andreas Volz Christian Wagner Matthias Fellenberg Klaus Heumann and Hans-Werner Mewes

1 1 1 1 1

1 2 2 2 2

2 2 2 2 13

1 Institute for Bioinformatics GSF - National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 85764 Neueherberg Germany 2 Biomax Informatics AG Lochhamer Straszlige 11 82152 Martinsried Germany 3 Department of Genome-oriented Bioinformatics Wissenschaftszentrum Weihenstephan Technische Universitaumlt Muumlnchen 85350 Freising Germany To whom correspondence should be addressed Tel +49 89 31874201 Fax +49 89 31873585 Email dfrishmangsfde The mouse database contains 20 chromosome contigs with 37 793 genes predicted using the Fgenesh++ software (wwwsoftberrycom) Reprint from Daily Biotech Updates www genengnewscom Vol 22 No 17 October 1 2002 DrugDiscovery Tech NoteAn Enhanced Human-Genome Database Transforming Raw Human Sequence Data Into Useful Information Christine Schuumlller PhD and Andreas Fritz PhD The Softberry analysis results for which Biomax has the exclusive world-wide commercial license contain approximately 40000 genes which agrees well with predictions of the total number of human genes (according to the International Human Genome Sequencing Consortium or IHGSC) hellip For example 50 of the genes in the Biomax Human Genome Database are not found in the Ensembl database These genes (identified by FGENESH++ and Biomax and not found in Ensembl database) comprise the following 6 of genes classified as known genes 50 classified as having some similarity to known genes and 90 of the genes not having similarity to known genes

For human genome applications the FGENESH++ software was first used to map known human genes using sequences available from the Reference Sequence (RefSeq) Project at the Nation al Center for Biotechnology Information (NCBI Bethesda MD wwwncbinlm nihgovLocusLinkrefseqhtml) REFERENCES 1 Salamov AA and Solovyev VVAb initio gene finding in Drosophila genomic DNA Genome Res 10 391ndash7 (2000) Genome Research 14539-548 2004 ISSN 1088-9051 $500 Letter Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA The gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom ) Published online before print June 12 2003 101101gr529803 Genome Research 131765-1774 2003 ISSN 1088-9051 $500 Identification of Promoter Regions in the Human Genome by Using a Retroviral Plasmid Library-Based Functional Reporter Gene Assay Shirin Khambata-Ford15 Yueyi Liu2 Christopher Gleason1 Mark Dickson3 Russ B Altman2 Serafim Batzoglou4 and Richard M Myers136 1 Department of Genetics Stanford University School of Medicine Stanford California 94305 USA 2 Stanford Medical Informatics Stanford University School of Medicine Stanford California 94305 USA 3 Stanford Human Genome Center Stanford University School of Medicine Stanford California 94305 USA 4 Department of Computer Science Stanford University Stanford California 94305 USA hellipOf 858 sequences 9 of GFP+ low clones and 8 of GFP+ high clones aligned to the 2-kb

segment upstream of the transcription start site of a predicted gene in at least two of four data sets of predicted genes from Genscan Ensembl Softberry (Fgenesh++) and Acembly (category

B in Table 1) Cell Vol 110 521ndash529 August 23 2002 Copyright 2002 by Cell Press HIV-1 Integration in the Human Genome Favors Active Genes and Local Hotspots

Astrid RW Schrouml der1 Paul Shinn2 Huaming Chen2 Charles Berry3 Joseph R Ecker2 and Frederic Bushman14 1Infectious Disease Laboratory 2Genomic Analysis Laboratory The Salk Institute 10010 North Torrey Pines Road La Jolla California 92037 3Department of FamilyPreventive Medicine School of Medicine University of California San Diego San Diego California 92093 hellipAn integration target sequence was scored as a part of a transcrip-tion unit if it was (1) a member of the Refseq set of well-studied genes (httpwwwncbinlmnihgovLocusLinkrefseqhtml) or (2) if it was predicted to be a transcription unit by the ENSEMBLE (httpwwwensemblorg) or Fgenesh++ (httpwwwsoftberrycomHelpfgeneshplus2htm) programs and if that assignment was supported by mRNA or spliced EST sequence evidence Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation

FGENESH+ Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA Fgenesh+ gene prediction is conducted on sequences with protein homology Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation Annual Review of Genomics and Human Genetics Vol 3 293-310 (Volume publication date September 2002) (doi101146annurevgenom3030502101529) DATABASES AND TOOLS FOR BROWSING GENOMES Ewan Birney 1 Michele Clamp and 2 Tim Hubbard2 1European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom e-mail birneyebiacuk 2Wellcome Trust Sanger Institute Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom michelesangeracuk thsangeracuk

Another predicted gene track on the UCSC browser comes from Softberry ( http wwwsoftberrycom ) and uses a program Fgenesh+ which is based on HMMs and

FGENESB Appl Environ Microbiol 2004 April 70(4) 2332ndash2341 Oxygen-Controlled Bacterial Growth in the Sponge Suberites domuncula toward a Molecular Understanding of the Symbiotic Relationships between Sponge and Bacteriadagger

Werner E G Muumlller Vladislav A Grebenjuk Narsinh L Thakur Archana N Thakur Renato Batel Anatoli Krasko Isabel M Muumlller and Hans J Breter Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz D-55099 Mainz Germany Corresponding author Mailing address Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz Duesbergweg 6 55099 Mainz Germany Phone 6131-3925910 Fax 6131-3925243 E-mail wmuellermailuni-mainzde For genes and potential promoter prediction we used the FGENESB-PatternMarkov chain-based bacterial operon and gene prediction program from the SoftBerry Journal of Theoretical Biology 230 (2004) 133ndash144 Computational prediction of conserved operons and phylogenetic footprinting of transcription regulatory elements in the metal-reducing bacterial family Geobacteraceae Bin Yana Barbara A Metheґ b Derek R Lovleyc Julia Krushkala aDepartment of Preventive Medicine Center of Genomics and Bioinformatics University of Tennesee Health Science Center 66 N Pauline St Ste 633 Memphis TN 38163 USA bThe Institute for Genomic Research Rockville MD USA cDepartment of Microbiology Morrill Science Center IV North University of Massachusetts 639 North Pleasant Str Amherst MA 01003 USA the conserved nature of the operons 2 Operons in Geobacter sulfurreducens were predicted ab initio by the public version of program FGENESB (V Solovyev and V Molecular Microbiology Volume 52 Issue 6 Page 1579 -1596 June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Copyright copy 2003 The National Academy of Sciences Proc Natl Acad Sci U S A 2003 October 28 100(22) 12830ndash12835 doi 101073pnas2133554100 Published online 2003 October 17 Evolution Proteorhodopsin genes are distributed among divergent marine bacterial taxa Joseacute R de la TorredaggerDagger Lynne M Christiansondagger Oded Beacutejagravedaggersect Marcelino T Suzukidaggerpara David M Karl John Heidelberg and Edward F DeLongdaggerdaggerdagger

daggerMonterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 sectDepartment of Biology Technion-Israel Institute of Technology Haifa 32000 Israel

paraChesapeake Biological Laboratory University of Maryland Solomons MD 20688 Department of Oceanography University of Hawaii Manoa HI 96822 and Institute for Genomic Research Rockville MD 20850 Edited by Sallie W Chisholm Massachusetts Institute of Technology Cambridge MA and approved August 21 2003 (received for review 2003 June 10) DaggerPresent address Department of Civil and Environmental Engineering University of Washington Seattle WA 98195 daggerdagger To whom correspondence should be addressed E-mail delongmbariorg hellip Analysis of the potential genes and protein-coding regions was performed by using a combination of the BLAST (11) GLIMMER 202 (TIGR) (12 13) FGENESB (Softberry Mount Kisco NY) and ARTEMIS (Sanger Center Cambridge University UK) (14) software packages Environmental Microbiology September 2004 vol 6 no 9 pp 903-910(8) DOI 101111j1462-2920200400676x Different SAR86 subgroups harbour divergent proteorhodopsins Gazalah Sabehi1 Oded Beacutejagrave1 Marcelino T Suzuki2 Christina M Preston3 Edward F DeLong4

Affiliations 1 Department of Biology Technion-Israel Institute of Technology Haifa 32000 Israel 2 Chesapeake Biological Laboratory University of Maryland Center for Environmental Sciences Solomons MD 20688 USA 3 Monterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 USA 4 Massachusetts Institute of Technology Cambridge MA 02139 USA

program FGENESB (Softberry) and the annotation was subsequently refined and curated manually using ARTEMIS (Sanger Center) Fig

FGENESV Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song Qi Wei Qin Jin Qiu Can Hua Huang Fan Wang and Choy Leong Hew1 2 1 1 1 1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2

A total of 162 ORFs predicted by the FGENESV program (available through httpwwwsoftberrycom) supplemented with Vector NTI suite 71 are indicated doi101023BVIRU000002577148128f8 Virus Genes 28 (3) 239-246 April 2004 Article ID 5269250 Complete Nucleotide Sequence of a Strawberry Isolate of Beet Pseudoyellows Virus Ioannis E Tzanetakis Molecular and Cellular Biology Program Department of Botany and Plant Pathology Oregon State University Corvallis 97331 USA Robert R Martin Horticultural Crops Research Laboratory USDA-ARS Corvallis OR 97330 USA E-mail martinrrscienceoregonstateedu httpwwwncbinlmnih govgorfgorfhtml) and the gene finder in viruses at httpwwwsoftberrycom The amino acid comparisons Geno Prot amp Bioinfo Vol 1 No 3 August 2003 226-235 Genome Organization of the SARS-CoV Jing Xu1 Jianfei Hu21 Jing Wang21 Yujun Han1 Yongwu Hu13 Jie Wen1 Yan Li1 Jia Ji1 Jia Ye14 Zizhang Zhang5 Wei Wei4 Songgang Li12 Jun Wang1 Jian Wang14 Jun Yu14 and Huanming Yang14 1 Beijing Genomics Institute Chinese Academy of Sciences Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3Wenzhou Medical College Wenzhou 325003 China 4 James D Watson Institute of Genome Sciences Zhijiang Campus Zhejiang University and Hangzhou Genomics Institute Hangzhou 310008 China 5 College of Materials Science and Chemical Engineering Yuquan Campus Zhejiang University Hangzhou 310027 China These authors contributed equally to this work Corresponding authors E-mail junyugenomicsorgcn yanghmgenomicsorgcn hellipFGENESV a program for gene prediction provided by Softberry Inc (Mount Kisco USA) through a web-based interface has been specially modimacred and trained with parameters for virus (httpwwwsoftberrycomberryphtmltopic= gfindv) hellip

The hypothetical minus sense ORF iden-timacred by FGENESV (from 48 to 203 nt on the minus strand or 29523 to 29678 nt on the plus strand) may be fake but we should not absolutely deny the prob-ability of the existence of minus ORFs hellipFurthermore we employed FGENESV to explore the sequences of MHV (NC 001846 in NCBI) and AIBV (NC 001451 in NCBI) and compared the re-sults with their previous annotations respectively Rapport de stage de DEA Juin 2003 Analyse du geacutenome du virus de lrsquoarcheacutee Pyrococcus abyssi (PAV1) ROUAULT Karen Laboratoire de Microbiologie et Biotechnologie des Extrecircmophiles IFREMER- Centre de Brest et Equipe Microbiologie LEMAR ndash Institut Universitaire Europeacuteen de la Mer [14] FGENESV httpwwwsoftberrycomberry phtmltopic=gfindv Virus ( gt10 kb) Modegraveles de Markov Forme du geacutenome Code geacuteneacutetique [40]

FGENES-M BMC Bioinformatics 2005 6 25 Published online 2005 February 10 doi 1011861471-2105-6-25 Integrating alternative splicing detection into gene prediction Sylvain Foissac 1 and Thomas Schiex11Uniteacute de Biomeacutetrie et Intelligence Artificielle INRA 31326 Castanet Tolosan France

Corresponding author Sylvain Foissac foissactoulouseinrafr Thomas Schiex tschiextoulouseinrafrReceived July 27 2004 Accepted February 10 2005 This approach has been applied eg in HMMgene or in FGENES-M (unpub)hellip DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39 - 43 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom) DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39-43 DOI 1010801042517032000160189 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom)

BESTORF Eukaryotic Cell March 2005 p 526-535 Vol 4 No 3 1535-977805$0800+0 doi101128EC43526-5352005 Sex-Specific Homeodomain Proteins Sxi1 and Sxi2a Coordinately Regulate Sexual Development in Cryptococcus neoformans Christina M Hull1 Marie-Josee Boily and Joseph Heitman1 12 Department of Molecular Genetics and Microbiology1 the Howard Hughes Medical Institute Duke University Medical Center Durham North Carolina2 Received 14 January 2005 Accepted 14 January 2005 Corresponding author Mailing address 322 CARL Building Box 3546 Department of Molecular Genetics and Microbiology 100 Research Dr Duke University Medical Center Durham NC 27710 Phone (919) 684-2824 Fax (919) 684-5458 E-mail heitm001dukeeduPresent address Departments of Biomolecular Chemistry and Medical Microbiology amp

Immunology University of WisconsinmdashMadison Madison WI 53706 Sequence manipulations Splice predictions of candidate gene sequences for SXI2a were facilitated with a Softberry algorithm (wwwsoftberrycom) hellipWe utilized the BESTORF gene prediction algorithm from Softberry Inc to electronically produce predicted spliced cDNA products encoded by a 10-kb regionhellip

PROTCOMP Genes and Immunity 2005 v5 n4 - naturecom Immune response in silico(IRIS) immune-specific genes identified from a compendium of microarray hellip AR Abbas D Baldwin Y Ma W Ouyang A Gurney F hellip The Protcomp algorithm (Softberry Inc) predicts for the 1589 IRIS genes with ORFs that 24 of the encoded proteins are in the plasma membrane 13 are MPMI Vol 17 No 7 2004 pp 789ndash797 Publication no M-2004-0426-01R copy 2004 The American Phytopathological Society Lotus japonicus LjKUP Is Induced Late During Nodule Development and Encodes a Potassium Transporter of the Plasma Membrane Guilhem Desbrosses Claudia Kopka Thomas Ott and Michael K Udvardi Max Planck Institute of Molecular Plant Physiology Am Muumlhlenberg 1 14476 Golm Germany Submitted 3 November 2003 Accepted 13 February 2004 hellipBoth PSORT and Protcomp predicted a PM location for LjKUPhellip Planta DOI 101007s00425-003-1182-5 Issue Volume 218 Number 6 Date April 2004 Pages 965 - 975 Biochemical and immunological characterization of pea nuclear intermediate filament proteins Sonal S D Blumenthal1 Gregory B Clark1 and Stanley J Roux1

(1) School of Biological Sciences Section of Molecular Cell and Developmental Biology The University of Texas Austin TX 78712 USA Stanley J Roux Email srouxutsccutexasedu html) BCM Search Launcher (Protein structure prediction http searchlauncher bcmtmcedu) SoftBerry (Protein subcellular localization Comparative and Functional Genomics Volume 5 Issue 4 Pages 342 - 353Published Online 20 May 2004 Research Paper Investigation into the use of C- and N-terminal GFP fusion proteins for subcellular localization studies using reverse transfection microarrays Ella Palmer Tom Freeman

MRC Rosalind Franklin Centre for Genomics Research (formerly the HGMP-Resource Centre) Genome Campus Hinxton Cambridge CB10 1SB UK email Tom Freeman (tfreemanrfcgrmrcacuk) Correspondence to Tom Freeman RFCGR Hinxton CambridgeCB10 1SB UK

ProtComp version 4 (Softberry) combines results with proteins of known subcellular localization and assumed subcellular localization (based on theoret- ical Plant Physiol2004 134 286-295 RHM2 Is Involved in Mucilage Pectin Synthesis and Is Required for the Development Usadel et al Tentative subcellular localization prediction by TargetP (Emanuelsson et al 2000 ) or ProtComp (httpwwwsoftberrycom) a prediction software trained on Journal of Cellular BiochemistryVolume 90 Issue 2 Pages 361 - 378Published Online 3 Sep 2003 A proteomic study of the arabidopsis nuclear matrix Tomasz T Calikowski 1 3 Tea Meulia 2 Iris Meier 1

1Department of Plant Biology and Plant Biotechnology Center Ohio State University Columbus Ohio 43210 2Molecular and Cellular Imaging Center Ohio Agricultural and Research Development Center Ohio State University Columbus Ohio 43210 3Institute of Biochemistry and Biophysics Polish Academy of Sciences UL Pawinskiego 5A 02-106 Warszawa Poland email Iris Meier (meier56osuedu)Correspondence to Iris Meier Department of Plant Biology and Plant Biotechnology Center Ohio State University 244 Rightmire Hall 1060 Carmack Rd Columbus OH 43210 For prediction of subcellular localization ProtComp 4 (Softberry Inc Mount Kisco NY httpwwwsoftberrycomberryphtmltopicfrac14 proteinloc) PSORT v64 Cellular Molecular Life Sciences 2003 in press Automatic prediction of protein function Burkhard Rost 1 2 3 Jinfeng Liu 1 3 4 Rajesh Nair 1 5 Kazimierz O Wrzeszczynski 1 and Yanay Ofran 16

1 CUBIC Dept of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 2 Columbia University Center for Computational Biology and Bioinformatics (C2B2) Russ Berrie Pavilion 1150 St Nicholas Avenue New York NY 10032 USA

3 North East Structural Genomics Consortium (NESG) Department of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 4 Dept of Pharmacology Columbia Univ 630 West 168th Street New York NY 10032 USA 5 Dept of Physics Columbia Univ 538 West 120th Street New York NY 10027 USA 6 Dept of Medical Informatics Columbia Univ 630 West 168th Street New York NY 10032 USA Corresponding author cubiccubicbioccolumbiaedu URL httpcubicbioccolumbiaedu Tel +1-212-305-4018 fax +1-212-305-7932

genomelocalize ProtComp predict localization for plants httpwwwsoftberrycomberryphtmltopic=proteinloc Predotar predict Published online before print September 15 2003 101101gr1293003 Genome Research 132265-2270 2003 The Secreted Protein Discovery Initiative (SPDI) a Large-Scale Effort to Identify Novel Human Secreted and Transmembrane Proteins A Bioinformatics Assessment Hilary F Clark1 Austin L Gurney Evangeline Abaya Kevin Baker Daryl Baldwin Jennifer Brush Jian Chen Bernard Chow Clarissa Chui Craig Crowley Bridget Currell Bethanne Deuel Patrick Dowd Dan Eaton Jessica Foster Christopher Grimaldi Qimin Gu Philip E Hass Sherry Heldens Arthur Huang Hok Seon Kim Laura Klimowski Yisheng Jin Stephanie Johnson James Lee Lhney Lewis Dongzhou Liao Melanie Mark Edward Robbie Celina Sanchez Jill Schoenfeld Somasekar Seshagiri Laura Simmons Jennifer Singh Victoria Smith Jeremy Stinson Alicia Vagts Richard Vandlen Colin Watanabe David Wieand Kathryn Woods Ming-Hong Xie Daniel Yansura Sothy Yi Guoying Yu Jean Yuan Min Zhang Zemin Zhang Audrey Goddard William I Wood and Paul Godowski Departments of Bioinformatics Molecular Biology and Protein Chemistry Genentech Inc South San Francisco California 94080 USA 1 Corresponding author E-MAIL hclarkgenecomFAX (650) 225-5389 An automated computational strategy was utilized to query each protein translation with the Signal Sensor Sighmm Tmdetect (T Wu unpubl) hmmpfam (Eddy 1998 ) and Protcomp (Softberry Inc) algorithms hellipThe Protcomp algorithm predicts the subcellular localization of a protein on the basis of homology to well-annotated proteins a neural net and various protein motifs In this case the Protcomp subcellular localization prediction was used to categorize these genes as Other Secreted Other Transmembrane or Other Cytoplasmic or Nuclear Plant Physiol February 2002 Vol 128 pp 336-340 Gene prediction in eukaryota Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 genomelocalize ProtComp predict localization for plants httpwww softberrycomberryphtmltopic=proteinloc Predotar predict Proc Natl Acad Sci U S A 2001 April 24 98(9) 5341ndash5346 doi 101073pnas101534498 Published online 2001 April 17 Plant Biology The Cia5 gene controls formation of the carbon concentrating mechanism in Chlamydomonas reinhardtii Youbin Xiang Jun Zhang and Donald P Weeks

Department of Biochemistry and School of Biological Sciences University of Nebraska Lincoln NE 68588-0664

Edited by Bob B Buchanan University of California Berkeley CA and approved March 14 2001 (received for review 2000 November 8) To whom reprint requests should be addressed E-mail dweeks1unledu hellipComputer-assisted analysis of the CIA5 aa sequence (PROTCOMP version 4 httpwwwsoftberrycom) predicted a nuclear localization of the protein hellipFinally computer program predictions (eg PROTCOMP version 4 httpwwwsoftberrycom) for a nuclear localization of CIA5 and the clear-cut nuclear localization of CIA5 in onion epidermal cells (Fig 3) provide additional weight to the argument that CIA5 may be a transcription factor Dissertation zur Erlangung des akademischen Grades Dr rer nat der Fakultaumlt der Naturwissenschaften der Universitaumlt Ulm Untersuchungen zur Identifizierung von Faktoren und Mechanismen der mRNA 3 Prozessierung und Degradation in Chloroplasten houmlherer Pflanzen vorgelegt von Michael Walter aus Immenstadt i Allgaumlu Abteilung Molekulare Botanik Universitaumlt Ulm Ulm November 2001 Tag der Promotion 19 Feb 2002 Algorithmen zur Vorhersage der subzellulaumlren Lokalisation - PSORT httppsortnibbacjp8800formhtml (Nakai und Kanehisa 1992) - ChloroP httpwwwcbsdtudkservicesChloroP (Emanuelsson et al 2000) - TargetP httpwwwcbsdtudkservicesTargetP (Emanuelsson et al 2000) - Predotar httpwwwinrafrInternetProduitsPredotar - Softberry httpwwwsoftberrycom

BPROM Extremophiles Issue Volume 9 Number 2 Date April 2005 Pages 99 ndash 109 DOI 101007s00792-004-0425-0

The genome of BCJA1c a bacteriophage active against the alkaliphilic bacterium Bacillus clarkii

Andrew M Kropinski1 Melissa Hayward1 M Dorothy Agnew1 and Ken F Jarrell1

(1) Department of Microbiology and Immunology Queens University Kingston ON K7L 3N6 Canada

al 2002) Promoters were predicted using Softberryrsquos BPROM program at httpwwwsoftberry comberry phtmltopic=promoter

Journal of Bacteriology February 2005 p 1091-1104 Vol 187 No 3 0021-919305$0800+0 doi101128JB18731091-11042005

The Generalized Transducing Salmonella Bacteriophage ES18 Complete Genome Sequence and DNA Packaging Strategy

Sherwood R Casjens12 Eddie B Gilcrease1 Danella A Winn-Stapley1 Petra Schicklmaier3 Horst Schmieger3 Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24 Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah1 Department of Biological Sciences4 Pittsburgh Bacteriophage Institute University of Pittsburgh Pittsburgh Pennsylvania 2 Institut fuumlr Genetik und Mikrobiologie Universitaumlt Muumlnchen Munich Germany3 Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu Present address Biology Department MIT Cambridge MA 02139 Present address Biogen Idec GmbH D-85737 Ismaning Germany

Received 1 September 2004 Accepted 3 November 2004

The DNA sequence analysis software used was DNA Strider (24) GeneMark (5) Staden programs (78) BLAST (2) BPROM (httpwwwsoftberrycomberryphtmltopic

Infection and Immunity May 2005 p 2899-2909 Vol 73 No 5 Characterization of the Major Secreted Zinc Metalloprotease- Dependent GlycerophospholipidCholesterol Acyltransferase PlaC of Legionella pneumophila Sangeeta Banerji1 Mayte Bewersdorff1 Bjoumlrn Hermes1 Nicholas P Cianciotto2 and Antje Flieger1 Robert Koch-Institut Berlin Germany1 Department of Microbiology-Immunology Northwestern University Medical School Chicago Illinois2 Received 25 October 2004 Returned for modification 18 November 2004 Accepted 22 December 2004

Corresponding author Mailing address Robert Koch-Institut Research Group NG5 Pathogenesis of Legionella Infections Nordufer 20 D-13353 Berlin Germany Phone 49-30-4547-2522 Fax 49-30-4547-2328 E-mail fliegerarkide MB and BH contributed equally to this work

legion) (12) Nucleotide sequences were also analyzed for promoters using the web-based program BPROM (wwwsoftberrycom) Sequence Journal of Bacteriology April 2005 p 2458-2468 Vol 187 No 7 The Type III-Dependent Hrp Pilus Is Required for Productive Interaction of Xanthomonas campestris pv vesicatoria with Pepper Host Plants Ernst Weber1 Tuula Ojanen-Reuhs2 Elisabeth Huguet3 Gerd Hause4 Martin Romantschuk2 Timo K Korhonen2 Ulla Bonas13 and Ralf Koebnik1 Institute of Genetics1 Biozentrum Martin Luther University Halle Germany4 General Microbiology Faculty of Biosciences University of Helsinki Helsinki Finland2 Institut des Sciences Veacutegeacutetales CNRS Gif-sur-Yvette France3 Received 10 November 2004 Accepted 28 December 2004 Corresponding author Mailing address Martin-Luther-Universitaumlt Institut fuumlr Genetik Weinbergweg 10 D-06120 Halle (Saale) Germany Phone 49 345 5526293 Fax 49 345 5527151 E-mail koebnikgmxde Present address Purdue University Department of Food Sciences West Lafayette IN 47907 Present address Institut de Recherche sur la Biologie de lInsecte UMR CNRS 6035 Faculteacute

des Sciences F-37200 Tours France Present address University of Helsinki Department of Ecological and Environmental Sciences

FIN-15140 Lahti Finland

The promoter recognition program BPROM (Softberry Inc Mt Kisco NY) was used for prediction of bacterial sigma70 promoter motifs RESULTS

J Bacteriol 2004 September 186(17) 5945ndash5949 doi 101128JB186175945-59492004 Identification of Operators and Promoters That Control SXT Conjugative Transfer John W Beaber and Matthew K Waldor

Department of Microbiology Tufts University School of Medicine and Howard Hughes Medical Institute Boston Massachusetts Corresponding author Mailing address Tufts University School of Medicine 136 Harrison Ave Jaharis 425 Boston MA 02111 Phone (617) 636-2730 Fax (617) 636-2723 E-mail matthewwaldortuftsedu Received April 1 2004 Accepted May 24 2004 hellipComputer algorithms and 5prime random amplification of cDNA ends (RACE) were used to define the setR and s086 transcription start sites Software for the identification of bacterial promoters (httpwwwsoftberrycomberryphtmltopic=bpromampgroup=programsampsubgroup=gfindb) identified putative minus10 and minus35 elements for both PL and PR (Fig 2) (23 24)hellip JOURNAL OF BACTERIOLOGY Mar 2004 p 1818ndash1832 Vol 186 No 6

The pKO2 Linear Plasmid Prophage of Klebsiella oxytoca Sherwood R Casjens12 Eddie B Gilcrease1 Wai Mun Huang1 Kim L Bunny3

Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24

Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah 841321 Pittsburgh Bacteriophage Institute2 and Department of Biological Sciences4 University of Pittsburgh Pittsburgh Pennsylvania 15260 and Section of Microbiology University of California at Davis Davis California 956163

Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu hellipThe DNA sequence analysis software packages used were DNA Strider (27) GeneMark (8) the Staden programs (94) BLAST (3) BPROM httpwwwsoftberrycomberryphtmltopic_gfindb) and DNA Master (J Lawrence [httpcobamide2biopittedu])hellip

BMC Microbiology 2004 44 Analysis of the lambdoid prophage element e14 in the E coli K-12 genome Preeti Mehta1 Sherwood Casjens2 and Sankaran Krishnaswamy1 Address 1Bioinformatics Centre School of Biotechnology Madurai Kamaraj University Madurai-625021 India and 2University of Utah Medical School Department of Pathology 90 North 1900 East Salt Lake City UT 84132-2501 USA Email Preeti Mehta - mehta_p74yahoocom Sherwood Casjens - sherwoodcasjenspathutahedu Sankaran Krishnaswamy - krishnamrnatnnicin Corresponding author This article is available from httpwwwbiomedcentralcom1471-218044 Putative promoters predicted using BPROM available at the website http wwwsoftberrycom Scores are as given by BPROM Promoters Plant Molecular Biology 53 (6) 865-876 December 2003 Prokaryotic orthologues of mitochondrial alternative oxidase and plastid terminal oxidase Allison E McDonald Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Sasan Amirsadeghi Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Greg C Vanlerberghe Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada (e-mail gregvutscutorontoca The A variabilis PTOX sequence was analyzed in the upstream region of the start codon with Softberryrsquos BPROM software (httpwwwsoftberrycom)

SPLICEDB Plant Molecular Biology DOI 101007s11103-005-0271-1 Issue Volume 57 Number 3 Date February 2005 Pages 445 - 460 Evaluation of five ab initio gene prediction programs for the discovery of maize genes Hong Yao1 4 Ling Guo1 6 Yan Fu1 4 Lisa A Borsuk1 6 Tsui-Jung Wen2 David S Skibbe1 5 Xiangqin Cui1 4 9 Brian E Scheffler8 Jun Cao1 4 Scott J Emrich6 Daniel A Ashlock3 6 and Patrick S Schnable1 2 4 5 6 7

(1)Department of Genetics Development and Cell Biology Iowa State University Ames Iowa 50011-3650 (2) Department of Agronomy Iowa State University Ames Iowa 50011-3650 (3) Department of Mathematics Iowa State University Ames Iowa 50011-3650 (4) Inderdepartmental Graduate Programs in Genetics Iowa State University Ames Iowa 50011-3650 (5) Department of Molecular Cellular and Developmental Biology Iowa State University Ames Iowa 50011-3650 (6) Department of Electrical and Computer Engineering and Department of Bioinformatics and Computational Biology Iowa State University Ames Iowa 50011-3650 (7) Center for Plant Genomics Iowa State University Ames Iowa 50011-3650 (8) Mid South Area Genomics Facility USDA-ARS Stoneville MS 38776-0038 USA(9) Present address Department of Biostatistics Birmingham AL 35294 USA model FGENESH httpwwwsoftberrycom berryphtmltopic=fgeneshampgroup= programsampsubgroup=gfind Monocots Yes Yes Yes GHMM a GeneMark Finding short DNA motifs using permuted markov models X Zhao H Huang TP Speed The data are human donor sequences from SpliceDB [9] a recently developed database of known mammalian splice site sequences (httpwwwsoftberrycomspldb Current Opinion in Structural Biology 2004 14273ndash282 The evolving roles of alternative splicing Liana F Lareau1 Richard E Green1 Rajiv S Bhatnagar23 and Steven E Brenner12_ Departments of 1Molecular and Cell Biology and 2Plant and Microbial Biology University of California Berkeley California 94720 USA 3Department of Dermatology University of California San Francisco California 94143 USA _e-mail brennercompbioberkeleyedu [79] SpliceDB httpwwwsoftberrycomberryphtmltopicfrac14splicedb Database and composition statistics for mammalian splice sites inferred from ESTs [80] Yearbook of Medical Informatics Review Paper 2004 121-136 Curated databases and their role in clinical bioinformatics CC Englbrecht M Han MT Mader A Osanger KFX Mayer MIPS Institute for Bioinformatics Address of the authors Claudia C Englbrecht Michael Han

Michael T Mader Andreas Osanger Klaus F X Mayer MIPS Institute for Bioinformatics GSF - National Research Center for Environment and Health 85758 Neuherberg Germany E-mail kmayergsfdeCorresponding author hellipSpliceDB httpwwwsoftberrycomspldbSpliceDBhtmlCanonical and non-canonical mammalian splice sites [122] 122Burset M Seledtsov IA Solovyev VV SpliceDB database of canonical and non-canonical mammalian splice sites Nucleic Acids Res 200129255-9 Nucleic Acids Research 2001 Vol 29 No 1 255-259 SpliceDB database of canonical and non-canonical mammalian splice sites M Burset I A Seledtsov1 and V V Solovyev The Sanger Centre Hinxton Cambridge CB10 1SA UK and 1Softberry Inc 108 Corporate Park Drive Suite 120 White Plains NY 10604 USA To whom correspondence should be addressed at present address EOS Biotechnology 225A Gateway Boulevard South San Francisco CA 94080 USA Tel +1 650 246 2331 Fax +1 650 583 3881 Email solovyeveosbiotechcom Present address M Burset Institut Municipal drsquoInvestigacioacute Megravedica (IMIM) CDr Aiguader 80 08003 Barcelona Spain

SCAN2 African Journal of Biotechnology Vol 2 (12) pp 714-718 December 2003 Available online at httpwwwacademicjournalsorgAJB ISSN 1684ndash5315 copy 2003 Academic Journal Accepted 14 November 2003 Minireview Web-based bioinformatic resources for protein and nucleic acids sequence alignment Kamel A Abd-Elsalam Molecular Markers Lab Plant Pathology Research Institute Agricultural Research Center Orman 12619 Giza Egypt E-mail kaabdelsalammsncom 16-SCAN2 program for aligning two multimegabyte-size sequences httpwwwsoftberrycomberryphtmltopic=scanhampprg= SCAN2 derived Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK The full-length sequences of gene pairs have been aligned by the SCAN2 program (httpsoftberrycomberryphtmltopic=scanhampprg=SCAN2) which can align

PDISORDER BMC Bioinformatics 2005 622 doi1011861471-2105-6-22 Research article Open Access Proteins with two SUMO-like domains in chromatin-associated complexes The RENi (Rad60-Esc2-NIP45) family Maria Novatchkova1 Andreas Bachmair3 Birgit Eisenhaber2 and Frank Eisenhaber2 Address 1Gregor Mendel-Institut GMI Austrian Academy of Sciences Vienna Biocenter A-1030 Vienna Austria 2Research Institute of Molecular Pathology Dr Bohr-Gasse 7 A-1030 Vienna Austria and 3Max Planck Institute for Plant Breeding Research Carl-von-Linneacute-Weg 10 D-50829 Cologne Germany Email Maria Novatchkova - marianovatchkovagmioeawacat Andreas Bachmair - bachmairmpiz-koelnmpgde Birgit Eisenhaber - b_eisenimpunivieacat Frank Eisenhaber - FrankEisenhaberimpunivieacat Corresponding author hellipInitial analysis of its sequence complexity shows that the disordered N-terminal half of the protein is followed by a likely globular segment (predicted using Pdisorder by Softberry Inc)hellip

SPL The National Academy of Sciences Proc Natl Acad Sci U S A 2003 November 25 100(Suppl 2) 14537ndash14542 doi 101073pnas2335847100 Molecular evolution of the insect chemoreceptor gene superfamily in Drosophila melanogaster Hugh M Robertsondagger Coral G WarrDaggersect and John R Carlsonsect Department of Entomology University of Illinois 505 South Goodwin Avenue Urbana IL 61801 DaggerSchool of Biological Sciences Monash University Clayton VIC 3800 Australia and sectDepartment of Molecular Cellular and Developmental Biology Yale University New Haven CT 06520 The genes were reconstructed manually in the PAUP editor (23) by using the expected exonintron structures as guides and the SPL program (Softberry wwwsoftberrycomberryphtml) to locate predicted introns

NSITE GENETIC ANALYSES OF BOVINE CARD15 A PUTATIVE DISEASE RESISTANCE GENE A Dissertation by KRISTEN HAWKINS TAYLOR Submitted to Texas AampM University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 2004 Major Subject Genetics hellipShort motifs identified as being conserved between the three species in these intronic regions as well as in the 5rsquoUTR and 3rsquoUTR were then analyzed using the TFSCAN (httpzeonwelloxacukgit-bintfscan) and NSITE (available through SoftBerry httpwwwsoftberrycomberryphtmltopic=promoter) programs to identify putative regulatory motifs Motifs selected for analysis required homology consisting of 6 or more bases with no more than 2 substitutions among the 3 specieshellip hellipSequence that included the SNPs located within intronic regions and in the 3rsquo and 5rsquoUTRs were analyzed using NSITE (available through SoftBerry at httpwwwsoftberrycomberryphtmltopic=promoter) to identify putative regulatory motifshellip

TSSP Bioinformatics 2005 21(14)3074-3081 doi101093bioinformaticsbti490 Cis-regulatory element based targeted gene finding genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana Weixiong Zhang 12 Jianhua Ruan 1 Tuan-hua David Ho 3 Youngsook You 3 Taotao Yu 1 and Ralph S Quatrano 3 1Department of Computer Science and Engineering Washington University in Saint Louis Saint Louis MO 63130 USA 2Department of Genetics Washington University in Saint Louis Saint Louis MO 63130 USA 3Department of Biology Washington University in Saint Louis Saint Louis MO 63130 USA To whom correspondence should be addressed

sites (TSSs) To predict TSSs we combined an Athaliana cDNA database and a software TSSP (SoftBerry httpwwwsoftberrycom) As

PLANTPROM BMC Genomics 2005 6 25 Genome wide analysis of Arabidopsis core promoters Carlos Molina12 and Erich Grotewold 11Department of Plant Cellular and Molecular Biology and Plant Biotechnology Center The Ohio State University Columbus OH 43210 2Departamento de Informaacutetica Universidad Teacutecnica Federico Santa Mariacutea Valparaiacuteso Chile search for TATA elements is carried out on the 12749 [-500 -1] regions 6316 sequences (using the MEME NFM) or 8776 (using the expanded PlantProm NFM) are Nucleic Acids Research 2005 Vol 33 No 3 1069ndash1076 doi101093nargki247 Plant promoter prediction with confidence estimation I A Shahmuradov1 V V Solovyev12 and A J Gammerman1 1Royal Holloway University of London Egham Surrey TW20 0EX UK and 2Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA MATERIALS AND METHODS Training and testing sequences For training and testing procedures we used 301 promoters with annotated TSS from PlantProm DB (22) Nucleic Acids Research 2004 Vol 32 Database issue D368plusmnD372 DOI 101093nargkh017 AthaMap an online resource for in silico transcription factor binding sites in the Arabidopsis thaliana genome Nils Ole Steffens Claudia Galuschka Martin Schindler Lorenz BuEgravelow and Reinhard Hehl Institut fuEgrave r Genetik Technische UniversitaEgrave t Braunschweig Spielmannstraucirce 7 D-38106 Braunschweig Germany hellipShahmuradovIA GammermanAJ HancockJM BramleyPM and SolovyevVV (2003) PlantProm a database of plant promoter sequences Plant Physiology October 2004 Vol 136 pp 3023-3033 GENOME ANALYSIS Utility of Different Gene Enrichment Approaches Toward Identifying and Sequencing the Maize Gene Space1[w] Nathan Michael Springer Xiequn Xu and W Brad Barbazuk Center for Plant and Microbial Genomics Department of Plant Biology University of Minnesota St Paul Minnesota 55108 (NMS) and Donald Danforth Plant Sciences Center St Louis Missouri 63132 (XX WBB) Text] Shahmuradov IA Gammerman AJ Hancock JM Bramley PM Solovyev VV (2003) PlantProm a database of plant promoter sequences

Plant Physiology April 2004 Vol 134 pp 1ndash12 wwwplantphysiolorg Characterization of Three Functional High-Affinity Ammonium Transporters in Lotus japonicus with Differential Transcriptional Regulation and Spatial Expression1

Enrica DrsquoApuzzo2 Alessandra Rogato2 Ulrike Simon-Rosin Hicham El Alaoui3 Ani Barbulova Marco Betti Maria Dimou Panagiotis Katinakis Antonio Marquez Anne-Marie Marini Michael K Udvardi and Maurizio Chiurazzi Institute of Genetics and Biophysics Via Marconi 12 80125 Napoli Italy (ED AR HEA AB MC) Molecular Plant Nutrition Group Max Planck Institute of Molecular Plant Physiology Am Muhlenberg 1 14476 Golm Germany (US-R MKU) Universiteacute Libre de Bruxelles Institut de Biologie de Meacutedecine Moleacuteculaires 6041 Gosselies Belgium (A-MM) Agricultural University of Athens Department of Agricultural Biotechnology 11855 Athens Greece (MD PK) and Departamento de Bioquıacutemica Vegetal y Biologıacutea Molecular Facultad de Quimica 41080 Seville Spain (MB AM) hellipPlantProm a database of plant promoter sequences Nucleic Acids Res 31 114ndash117 Nucleic Acids Research 1 January 2003 vol 31 no 1 pp 114-117(4) PlantProm a database of plant promoter sequences Authors Bramley PM1 Solovyev VV2 Shahmuradov IA Gammerman AJ Hancock JM Affiliations Department of Computer Science Royal Holloway University of London Egham Surrey TW20 0EX UK 1 School of Biological Sciences Royal Holloway University of London UK 2 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 3 To whom correspondence should be addressed Email victorsoftberrycom Present address John M Hancock MRC Mammalian Genetics Unit Harwell Oxfordshire UK

PROMH Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK Received February 15 2003 Revised and Accepted March 21 2003

Other FGENESH Molecular Microbiology Volume 52 Issue 6 Page 1579 - June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Affiliations Departments of Medicine and Pathobiology University of Washington Harborview Medical Center Box 359779 325 Ninth Ave Seattle WA 98104 USA E-mail acenturuwashingtonedu Tel (+1) 206 341 5364 Fax (+1) 206 341 5363 Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Microbiology 150 (2004) 518-520 DOI 101099mic026871-0 IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated with recognizable overlapping genes Mark W Silby1 Paul B Rainey23 and Stuart B Levy14 1 Center for Adaptation Genetics and Drug Resistance Department of Molecular Biology and Microbiology Tufts University School of Medicine Boston MA 02111 USA 2 Department of Plant Sciences University of Oxford South Parks Road Oxford OX1 3RB UK 3 School of Biological Sciences University of Auckland Private Bag 92019 Auckland New Zealand 4 Department of Medicine Tufts University School of Medicine Boston MA 02111 USA Correspondence Stuart B Levy (stuartlevytuftsedu) hellipUsing SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of the iiv5 ORF respectivelyhellip Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 DOI 101128JVI782212576-125902004

Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song1 Qi Wei Qin2 Jin Qiu1 Can Hua Huang1 Fan Wang1 and Choy Leong Hew1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2 Corresponding author Mailing address Department of Biological Sciences National University of Singapore 10 Kent Ridge Crescent Singapore 119260 Singapore Phone 65-68742692 Fax 65-67795671 E-mail dbshewclnusedusg or dbsheadnusedusg Received 19 March 2004 Accepted 29 June 2004

hellipThe whole genome was also submitted to httpwwwsoftberrycom (Softberry Inc Mount Kisco NY) for identification of all potential ORFshellip hellipCoding capacity of the viral genomic DNA sequence Prediction of presumptive genes was carried out by using the viral gene prediction program under the website httpwwwsoftberrycom supplemented with Vector NTI suite 71hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1695-8 Issue Volume 109 Number 4 Date August 2004 Pages 681 - 689

Linkage disequilibrium and sequence diversity in a 500-kbp region around the adh1 locus in elite maize germplasm Mark Jung1 Ada Ching1 Dinakar Bhattramakki2 Maureen Dolan1 Scott Tingey1 Michele Morgante1 3 and Antoni Rafalski1

(1) DuPont Crop Genetics Experimental Station PO Box 80353 Wilmington DE 19880-0353 USA (2) Pioneer Hi-Bred International Inc 7300 NW 62nd Avenue PO Box 1004 Johnston IA 50131-1004 USA (3) Dipartimento di Produzione Vegetale e Tecnologie Agrarie Universita di Udine Via delle Scienze 208 33100 Udine ItalyReceived 26 January 2004 Accepted 2 April 2004 Published online 6 August 2004 Mark Jung Email marktjungcgrdupontcom 1) Gene locations were defined by several methods Annotations provided in Tikhonov et al (1999) were first used then FGENESH gene-finding software DNA Sequence - The Journal of Sequencing and Mapping Issue Volume 15 Number 4 August 2004 Pages 269 ndash 276 DOI 10108010425170412331279648 Isolation Characterization and Expression Analysis of a Leaf-specific Phosphoenolpyruvate Carboxylase Gene in Oryza sativa Chang-Fa Lin A1 Chun Wei A1 Li-Zhi Jiang A1 A2 Ke-Gui Li A1 Xiao-Yin Qian A1 Kotb Attia A1 Jin-Shui Yang A1 A1 State Key Laboratory of Genetic Engineering Institute of Genetics School of Life Sciences Fudan University Shanghai 200433 PRChina A2 National Key Laboratory for Soil Erosion and Dry Land Farming on Loess Plateau Northwest Sci-Tech University of Agriculture and Forest Shannxi 712100 PRChina tools of GeneMark (httpopal biologygatechedugeneMark) and Softberry (httpwwwsoftberrycom) For the isolation of putative Plant Molecular Biology DOI 101023BPLAN00000382568980957 Issue Volume 54 Number 4 Date March 2004 Pages 519 ndash 532

Genome-Wide Analysis of the GRAS Gene Family in Rice and Arabidopsis Chaoguang Tian1 Ping Wan1 Shouhong Sun1 Jiayang Li1 and Mingsheng Chen1

(1) Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road Chaoyang District Beijing 100101 ChinaMingsheng Chen Email mschengeneticsaccn

database FgeneSH (Salamov and 90 Solovyev 2000) was used for gene prediction pre- 207 dicted by FgeneSH (minor discrepancies exist due 208 Mycological Research (2004) 108 853-857 Cambridge University Press doi 101017S095375620400067X Published Online 13 авг 2004 Combining transcriptome data with genomic and cDNA sequence alignments to make confident functional assignments for Aspergillus nidulans genes Andrew H SIMS a1 Manda E GENT a1 Geoffrey D ROBSON a1 Nigel S DUNN-COLEMAN a2 and Stephen G OLIVER a1c1 a1 School of Biological Sciences University of Manchester The Michael Smith Building Oxford Road Manchester M13 9PT UK E-mail steveolivermanacuk a2 Genencor International Inc 925 Page Mill Road Palo Alto CA 94304 USA c1 Corresponding author Kingdom Page 2 Genewise FgeneSH FgeneSH+) consisting of 9541 putative open reading frames (ORFs) was released in June 2003 We TAG Theoretical and Applied Genetics DOI 101007s00122-004-1603-2 Issue Volume 109 Number 1 Date June 2004 Pages 129 ndash 139

Gene content and density in banana (Musa acuminata) as revealed by genomic sequencing of BAC clones R Aert1 2 L Saacutegi2 and G Volckaert1

(1) Laboratory of Gene Technology Katholieke Universiteit Leuven Kasteelpark Arenberg 21 3001 Leuven Belgium

Present address Laboratory of Tropical Crop Improvement Katholieke Universiteit Leuven (2) Kasteelpark Arenberg 13 3001 Leuven Belgium

R Aert Email RitaAertagrkuleuvenacbe

gscrikengojp) fgenesh version 11 (Salamov and Solovyev 2000 httpwwwsoftberry com) genemarkhmm version 22a (Lukashin and Borodovsky 1998 http Genome Research 142503-2509 2004 Resources EAnnot A genome annotation tool using experimental evidence Li Ding1 Aniko Sabo Nicolas Berkowicz Rekha R Meyer Yoram Shotland Mark R Johnson Kymberlie H Pepin Richard K Wilson and John Spieth Genome Sequencing Center Washington University School of Medicine St Louis Missouri 63110 USA Article and publication are at httpwwwgenomeorgcgidoi101101gr3152604 1 Corresponding author E-mail ldingwatsonwustledu fax (314) 286-1810 Some ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov and Solovyev 2000 ) are based on intrinsic characteristics of coding hellipSome ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov

and Solovyev 2000 ) are based on intrinsic characteristics of coding sequence (eg codon usage consensus splice sites etc) and require training on known genes from the organismhellip

hellipTo further evaluate the performance of EAnnot we compared EAnnot predictions with Ensembl Genscan and Fgenesh predictions using manual annotation as a standard While Genscan and Fgenesh are ab initio programs Ensembl takes into account experimental data a feature shared with EAnnot Ensembl predicted 1037 known genes with 1798 transcripts and 1457 EST genes with 2308 transcripts for chromosome 6 (build 31) while Fgenesh and Genscan predicted 6230 and 6225 genes respectively We evaluated the performance of each program with respect to splice sites transcripts and genes across all of chromosome 6hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1758-x Issue Volume 109 Number 7 Date November 2004 Pages 1434 - 1447

Full-genome analysis of resistance gene homologues in rice

B Monosi1 R J Wisser2 L Pennill1 and S H Hulbert1

(1) Department of Plant Pathology Kansas State University Manhattan KS 66506-5502 USA (2) Department of Plant Pathology Cornell University Ithaca NY 14853 USAReceived 18 February 2004 Accepted 16 June 2004 Published online 10 August 2004 S H Hulbert Email shulbrtksuedu DNA sequences were analyzed using the gene prediction programs GENSCAN (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (Salamov and arXivq-bioGN0402046 v1 27 Feb 2004 Sublinear growth of Information in DNA sequences Giulia Menconi Dipartimento di Matematica Applicata and CISSC Centro Interdisciplinare per lo Studio dei Sistemi Complessi Universit`a di Pisa Via Bonanno Pisano 25b 56126 PISA - Italy menconimaildmunipiit October 23 2003 hellipAs a result four putative genes G1 G2 G3 and G4 have been located by means of Hidden Markov Model-based program FGENESH2 that has been created for predicting multiple genes and their structure in genomic DNA sequences The analysis via FGENESH has been exploited with respect to known genes in Arabidopsis thaliana Their predicted position is illustrated in Figure 13 hellip2This program is available at the website wwwsoftberrycom to which we refer con-cerning the reliability and e_ciency of the algorithmhellip Current Opinion in Plant Biology 2004 7732ndash736 DOI 101016jpbi200409003 Consistent over-estimation of gene number in complex plant genomes Jeffrey L Bennetzen14 Craig Coleman27 Renyi Liu15 Jianxin Ma16 and Wusirika Ramakrishna38 1 Department of Genetics University of Georgia Athens Georgia 30602 USA 2 Department of Plant and Animal Sciences Brigham Young University Provo Utah 84602 USA 3 Department of Biological Sciences Michigan Tech University Houghton Michigan 49931 USA

4e-mail maizeugaedu 5e-mail lryugaedu 6e-mail jmaugaedu 7e-mail biotechnologybyuedu 8e-mail wusirikamtuedu hellipWe have found that the standard gene-discovery programs FGENESH GeneMark and GENSCAN annotate segments of most retrotransposons and many invertedrepeat transposable elements as genes Using FGENESH to annotate maize BAC clones for instance 70ndash100 of the predicted genes are actually from transposable elementshellip The Plant Cell 162795-2808 (2004) Spotted leaf11 a Negative Regulator of Plant Cell Death and Defense Encodes a U-BoxArmadillo Repeat Protein Endowed with E3 Ubiquitin Ligase Activity Li-Rong Zenga Shaohong Qua Alicia Bordeosb Chengwei Yangc Marietta Baraoidanb Hongyan Yanc Qi Xiec Baek Hie Nahmd Hei Leungb and Guo-Liang Wanga1 a Department of Plant Pathology Ohio State University Columbus Ohio 43210 b International Rice Research Institute Metro Manila Philippines c State Key Lab for Biocontrol Sun Yat-sen (Zhongshan) University Guangzhou China 510275 d Department of Biological Science Myongji University Kyonggido Korea 449728 1 To whom correspondence should be addressed E-mail wang620osuedu fax 614-292-4455 in spl11 Exons predicted in G3 by the programs GENSCAN and Fgenesh using different matrixes are displayed in dark gray (D) RFLP Source Human Genomics Volume 1 Number 2 January 2004 pp 146-149(4) Publisher Henry Stewart Publications The truth about mouse human worms and yeast Authors David R Nelson1 Daniel W Nebert2 1 Department of Molecular Sciences and The UT Center of Excellence in Genomics and Bioinformatics University of Tennessee Memphis Tennessee 38163 USA 2 Department of Environmental Health and Center for Environmental Genetics (CEG) University of Cincinnati Medical Center Cincinnati Ohio 45267-0056 USA unpublished data 2003 see also Ref [7]) FGENESH 21 TWINSCAN 22 and the Ensembl annotation pipeline 23 The output of the four Genome Biology 2004 5R73 doi101186gb-2004-5-10-r73 A comprehensive transcript index of the human genome generated using microarrays and computational approaches Eric E Schadt 1 Stephen W Edwards 1 Debraj GuhaThakurta1 Dan Holder2 Lisa Ying2 Vladimir Svetnik2 Amy Leonardson1 Kyle W Hart3 Archie Russell1 Guoya Li1 Guy Cavet1 John Castle1 Paul McDonagh4 Zhengyan Kan1 Ronghua Chen1 Andrew Kasarskis1 Mihai Margarint1 Ramon M Caceres1 Jason M Johnson1

Christopher D Armour1 Philip W Garrett-Engele1 Nicholas F Tsinoremas5 and Daniel D Shoemaker1

1Rosetta Inpharmatics LLC 12040 115th Avenue NE Kirkland WA 98034 USA 2Merck Research Laboratories W42-213 Sumneytown Pike POB 4 Westpoint PA 19846 USA 3Rally Scientific 41 Fayette Street Suite 1 Watertown MA 02472 USA 4Amgen Inc 1201 Amgen Court W Seattle WA 98119 USA 5The Scripps Research Institute Jupiter FL 33458 USA hellipGrailEXP 40 [47] GENSCAN 10 [48] FGENESH [49] and FGENESH+ [49]ab initio gene-prediction algorithms were run independently across the entire genome assembly to augment alignment-based gene identification methods GrailEXP 40 GENSCAN 10 and FGENESH version 1c were run with default parameters for human sequence GrailEXP used expressed sequence evidence from RefSeq UniGene and DoubleTwist HGI to refine gene predictions FGENESH+ was run with protein sequences from BLASTX with E-score lower than 10-5 When multiple protein alignments overlapped all overlapping protein sequences were clustered with BLASTClust [50] and the lowest E-score hit was used by FGENESH+hellip hellipAdditionally 3 sim4 and 3 or 5 FGENESH+ predicted exons that were short andor distant from internal predicted exons were removedhellip Genome Research 14988-995 2004 ENSEMBL Special GeneWise and Genomewise Ewan Birney13 Michele Clamp2 and Richard Durbin2 1 The European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK hellipThere has been a long history of successful ab initio programs which do not use any additional

evidence to predict genes on genomic DNA of which Genscan (Burge and Karlin 1997 ) and Fgenesh (Solovyev and Salamov 1997 ) are two of the most successful caseshellip hellipAnother class of evidence-based gene prediction programs are ones which use external evidence to influence the scoring of potential exons including SGP-2 (Parra et al 2003 ) Genie

(Kulp et al 1996 ) Genomescan (Yeh et al 2001 ) HMMGene (Krogh 2000 ) and Fgenesh++ (Solovyev and Salamov 1997 )hellip Published online before print February 5 2004 101073pnas0308430100 PNAS | February 17 2004 | vol 101 | no 7 | 1910-1915 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin Nam Joonyul Kim para Shinyoung Lee Gynheung An Hong Ma and Masatoshi Nei Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State

University University Park PA 16802 and National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Contributed by Masatoshi Nei December 22 2003

hellipBecause annotation of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program FGENESH (wwwsoftberrycom) from the genome sequences obtained from TIGR and the Rice Genome Database (China) (25) Functional amp Integrative Genomics DOI 101007s10142-004-0109-y Issue Volume 4 Number 2 Date May 2004 Pages 102 - 117

Sequence analysis of the long arm of rice chromosome 11 for ricendashwheat synteny

Nagendra K Singh1 Saurabh Raghuvanshi2 Subodh K Srivastava1 Anupama Gaur2 Ajit K Pal1 Vivek Dalal1 Archana Singh1 Irfan A Ghazi1 Ashutosh Bhargav1 Mahavir Yadav1 Anupam Dixit1 Kamlesh Batra1 Kishor Gaikwad1 Tilak R Sharma1 Amitabh Mohanty2 Arvind K Bharti2 Anita Kapur2 Vikrant Gupta2 Dibyendu Kumar2 Shubha Vij2 Ravi Vydianathan2 Parul Khurana2 Sulabha Sharma2 W Richard McCombie3 Joachim Messing4 Rod Wing5 Takuji Sasaki6 Paramjit Khurana2 Trilochan Mohapatra1 Jitendra P Khurana2 and Akhilesh K Tyagi2

1 Indian Initiative for Rice Genome Sequencing National Research Centre on Plant Biotechnology

Indian Agricultural Research Institute New Delhi 110012 India 2 Indian Initiative for Rice Genome Sequencing Department of Plant Molecular Biology University of

Delhi South Campus New Delhi 110021 India 3 Genome Research Centre Cold Spring Harbor Laboratory 1 Bungtown Road New York USA 4 The Plant Genome Initiative at RutgersndashWaksman Institute Rutgers University 190 Frelinghuysen

Road Piscataway NJ 08873 USA 5 Department of Plant Sciences 303 Forbes Building Arizona Genomics Institute The University of

Arizona Tucson AZ 85721 USA 6 Department of Genome Research National Institute of Agrobiological Sciences 1-2 Kannondai 2-

chome Tsukuba Ibaraki 305-8602 Japan Akhilesh K Tyagi Email akhileshgenomeindiaorg Wherever RiceGAAS data were not available the genes were predicted by FGENESH trained for monocot plant species (http wwwsoftberrycomberryphtml) TAG Theoretical and Applied Genetics DOI 101007s00122-004-1621-0 Issue Volume 109 Number 1 Date June 2004 Pages 10 - 22

Annotation of a 95-kb Populus deltoides genomic sequence reveals a disease resistance gene cluster and novel class I and class II transposable elements

M Lescot1 4 S Rombauts1 J Zhang1 S Aubourg1 5 C Matheacute1 6 S Jansson2 P Rouzeacute1 3 and W Boerjan1

1 Department of Plant Systems Biology Flanders Interuniversity Institute for Biotechnology Ghent University Technologiepark 927 9052 Gent Belgium

2 Department of Plant Physiology University of Umearing 901 87 Umearing Sweden 3 Laboratoire Associeacute de l Institut National de la Recherche Agronomique (France) Ghent

University 9052 Gent Belgium 4 Present address CIRAD-Biotrop TA4003 34398 Montpellier Cedex 5 France 5 Pre sent address Uniteacute de Recherche en Geacutenomique Veacutegeacutetale INRA 91057 Evry Cedex

France 6 Present address Laboratoire de Biologie Vasculaire Institut de Pharmacologie et Biologie

Structurale 205 route de Narbonne 31077 Toulouse Cedex France

P Rouzeacute Email pierrerouzepsbugentbe 1999 httpwwwtigrorgtdbglimmermglmr_formhtml) and FgenesH for dicots or monocots (Salamov and Solovyev 2000 httpwwwsoftberrycom) BIOINFORMATICS 2004 vol20 N9 p1416-1427 J Yuan B Bush A Elbrecht Y Liu T Zhang W Zhao hellip - suchasGRAIL(Lopezetal 1994 Roberts 1991 Uberbacher et al 1996) GENESCOPE (Murakami and Takagi 1998) fgenesh (Salamov and Solovyev 2000) GeneMark Molecular Plant Pathology Volume 5 Issue 6 Page 515 - November 2004 Pathogen profile Heading for disaster Fusarium graminearum on cereal crops RUBELLA S GOSWAMI AND H CORBY KISTLERCorrespondence E-mail HCKISTumnedu This pipeline uses a combination of the programs FGENESH and FGENESH+ (Salamov and Solovyev 2000) modified by Softberry ( http wwwsoftberrycom ) with Nucleic Acids Research 2004 Vol 32 Database issue D41-D44 MIPS analysis and annotation of proteins from whole genomes H W Mewes12 C Amid1 R Arnold1 D Frishman2 U Guumlldener1 G Mannhaupt2 M Muumlnsterkoumltter1 P Pagel1 N Strack2 V Stuumlmpflen1 J Warfsmann1 and A Ruepp1 1 Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1 D-85764 Neuherberg Germany and 2 Technische Universitaumlt Muumlnchen Chair of Genome Oriented Bioinformatics Center of Life and Food Science D-85350 Freising-Weihenstephan Germany To whom correspondence should be addressed at Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 D-85764 Neuherberg Germany Tel +49 89 3187 3580 Fax +49 89 3187 3585 Email wmewesgsfde The genome of 40 Mb encodes 10 000 proteins automatically predicted by the program FGENESH (httpsoftberry com) specifically trained for Neurospora Annual Review of Genomics and Human Genetics Vol 5 15-56 (Volume publication date September 2004) COMPARATIVE GENOMICS Webb Miller Kateryna D Makova Anton Nekrutenko and Ross C Hardison The Center for Comparative Genomics and Bioinformatics The Huck Institutes of Life Sciences and the Departments of Biology Computer Science and Engineering and Biochemistry and Molecular Biology Pennsylvania State University University Park Pennsylvania email webbbxpsuedu kdm16psuedu antonbxpsuedu rch8psuedu These algorithms include Genscan the most popular gene prediction tool (24) GenMark (117) Fgenesh (155) GeneID (144) and others (for an excellent overview DNA and Cell Biology May 2004 Vol 23 No 5 311-324 Harbinger Transposons and an Ancient HARBI1 Gene Derived from a Transposase

Vladimir V Kapitonov Genetic Information Research Institute Mountain View California Jerzy Jurka Genetic Information Research Institute Mountain View California We used FGENESH (Salamov and Solovyev 2000) and GeneScan (Burge and Karlin 1997) for the identification of exons and introns The d N d S ratio Proc Natl Acad Sci U S A 2004 February 17 101(7) 1910ndash1915 Published online 2004 February 5 doi 101073pnas0308430100 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin NamdaggerDagger Joonyul Kimsectpara Shinyoung Leesect Gynheung Ansect Hong Madagger and Masatoshi Neidagger

daggerInstitute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University University Park PA 16802 and sectNational Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Dagger To whom correspondence should be addressed E-mail jyn101psuedu paraPresent address Michigan State UniversityndashDepartment of Energy Plant Research Laboratory and Department of Biochemistry and Molecular Biology Michigan State University East Lansing MI 48824 of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program fgenesh (wwwsoftberrycom Nucleic Acids Research 2004 Vol 32 Database issue D377-D382 BGI-RIS an integrated information resource and comparative analysis workbench for rice genomics Wenming Zhao1 Jing Wang2 Ximiao He1 Xiaobing Huang1 Yongzhi Jiao1 Mingtao Dai1 Shulin Wei1 Jian Fu1 Ye Chen1 Xiaoyu Ren1 Yong Zhang12 Peixiang Ni1 Jianguo Zhang1 Songgang Li12 Jian Wang1 Gane Ka-Shu Wong13 Hongyu Zhao4 Jun Yu1 Huanming Yang1 and Jun Wang1 1 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3 University of Washington Genome Center Department of Medicine Seattle WA 98195 USA and 4 Yale University School of Medicine Department of Epidemiology and Public Health New Haven CT 06520-8034 USA To whom correspondence should be addressed Tel +86 10 80481662 Fax +86 10 80498676 Email wangjgenomicsorgcn Correspondence may also be addressed to Huanming Yang Tel +86 10 80494969 Fax +86 10 80491181 Email yanghmgenomicsorgcn The authors wish it to be known that in their opinion the first four authors should be regarded as joint First Authors The contig sequences were annotated for gene content by using automated processes that involve ab initio gene finders such as FgeneSH (httpwwwsoftberrycom Genome Research 141932-1937 2004 Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome

Jinsheng Lai1 Nrisingha Dey2 Cheol-Soo Kim35 Arvind K Bharti1 Stephen Rudd46 Klaus FX Mayer4 Brian A Larkins3 Philip Becraft2 and Joachim Messing17 1 Waksman Institute Rutgers The State University of New Jersey Piscataway New Jersey 08854 USA 2 Department of Genetics Development amp Cell Biology Iowa State University Ames Iowa 50011 USA 3 Department of Plant Science University of Arizona Tucson Arizona 85721 USA 4 Munich Information Center for Protein Sequences Institute for Bioinformatics GSF Research Center for Environment and Health Neuherberg Germany A total of 54397 putative genes could be predicted for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for Plant Molecular Biology DOI 101023BPLAN000002876821587dc Issue Volume 54 Number 1 Date January 2004 Pages 55 - 69

Dynamics of the evolution of orthologous and paralogous portions of a complex locus region in two genomes of allopolyploid wheat

Xiu-Ying Kong1 2 Yong Qiang Gu3 Frank M You4 Jorge Dubcovsky4 and Olin D Anderson3

1 Genetic Resources Conservation Program University of California Davis CA 95616 USA 2 Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081

China 3 US Dept of Agriculture Western Regional Research Center Agricultural Research Service 800

Buchanan Street Albany CA 94710 USA 4 Department of Agronomy and Range Sciences University of California Davis CA 95616 USA FGENESH (httpwwwsoftberrycomberryphtml) and GENES- CAN (httpgenemark miteduGENESCANhtml) were used for gene prediction Current Genetics DOI 101007s00294-003-0451-y Issue Volume 44 Number 6 Date January 2004 Pages 329 - 338

Chromosome rearrangements in isolates that escape from het-c heterokaryon incompatibility in Neurospora crassa

Qijun Xiang1 and N Louise Glass1

Department of Plant and Microbial Biology University of California Berkeley CA 94720-3102 USA Hypothetical proteins are predicted from FGENESH calls with overlapping Blastx hits (but not with trusted homology) while Predicted Molecular Genetics and Genomics DOI 101007s00438-004-0990-z Issue Volume 271 Number 4 Date May 2004 Pages 402 - 415

Genome-wide identification of NBS genes in japonica rice reveals significant expansion of divergent non-TIR NBS-LRR genes

T Zhou1 Y Wang1 J-Q Chen1 H Araki2 Z Jing1 K Jiang1 J Shen1 and D Tian1

1 State Key Laboratory of Pharmaceutical Biotechnology Department of Biology Nanjing University 210093 Nanjing China

2 Department of Ecology and Evolution University of Chicago Chicago IL 60637 USA to 5000ndash10000 bp from both ends of the hits and then the expanded nucleotide fragments were reannotated using the gene-finding programs FGENESH (http www Proc Natl Acad Sci U S A 2004 June 15 101(24) 9045ndash9050 Genetics

Genetic control of branching in foxtail millet Andrew N Doust Katrien M DevosDaggerdagger Michael D Gadberrysect Mike D GaleDagger and Elizabeth A KelloggDepartment of Biology University of Missouri 8001 Natural Bridge Road St Louis MO 63121 and DaggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH United Kingdom dagger To whom correspondence should be addressed E-mail adoustumsledu sectPresent address Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building University of Georgia Athens GA 30602 Communicated by Peter H Raven Missouri Botanical Garden St Louis MO April 23 2004 Each of these contigs was scanned by using fgenesh (28) and identified ORFs were translated and compared with ORFs from other contigs from the same QTL region Mol Biol Evol 21(9)1769-1780 2004 Merlin a New Superfamily of DNA Transposons Identified in Diverse Animal Genomes and Related to Bacterial IS1016 Insertion Sequences Ceacutedric Feschotte1 Departments of Plant Biology and Genetics The University of Georgia Athens Correspondence E-mail cedricplantbiougaedu coding sequences were assembled by removing introns predicted with more than 85 confidence by NetGene2 (httpwwwcbsdtudk) andor FGENESH (httpgenomic Genome Research 141924ndash1931 copy2004 Gene Loss and Movement in the Maize Genome Jinsheng Lai1 Jianxin Ma23 Zuzana Swigonˇovб1 Wusirika Ramakrishna24

Eric Linton15 Victor Llaca16 Bahattin Tanyolac17 Yong-Jin Park28 O-Young Jeong29

Jeffrey L Bennetzen23 and Joachim Messing110 1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854-8020 USA 2Department of Biological Sciences Purdue University West Lafayette Indiana 47907-1392 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602-7223 USA 4Department of Biological Sciences 740 DOW Michigan Tech University Houghton MI 49931 USA 5Plant Biology Labs Michigan State University East Lansing MI 48824 USA 6Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880-0353 USA 7Department of Bioengineering Ege University Izmir 35100 Turkey 8National Institute of Agricultural Biotechnology Suwon 441-707 Republic of Korea 9National Institute of Crop Science Suwon 441-857 Republic of Korea 10Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 The FGENESH program predicted four of which three (gene 1d in the maize orp1 region gene 5a 5b in the rice r1 region) would produce truncated proteins

Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Molecular Microbiology Volume 54 Issue 2 Page 407 - October 2004 doi101111j1365-2958200404310x Cryptococcus neoformans Kin1 protein kinase homologue identified through a Caenorhabditis elegans screen promotes virulence in mammals Eleftherios Mylonakis1 Alexander Idnurm2 Roberto Moreno1 Joseph El Khoury134 James B Rottman5 Frederick M Ausubel67 Joseph Heitman28910 and Stephen B Calderwood1111Division of Infectious Diseases Massachusetts General Hospital Boston MA 02114 USA 2Department of Molecular Genetics and Microbiology Duke University Medical Center Durham NC 27710 USA 3Center for Immunology and Inflammatory Diseases and 4Division of Rheumatology Allergy and Immunology Massachusetts General Hospital Boston MA 02114 USA 5Archemix Corporation Cambridge MA 02139 USA 6Department of Molecular Biology Massachusetts General Hospital Boston MA 02114 USA 7Department of Genetics Harvard Medical School Boston MA 02115 USA 8Division of Infectious Diseases 9Department of Medicine and 10Howard Hughes Medical Institute Duke University Medical Center Durham NC 27710 USA 11Department of Microbiology and Molecular Genetics Harvard Medical School Boston MA 02115 USA E-mail scalderwoodpartnersorg

Sequences were compared with the H99 genome database at Duke University and genes predicted in these regions by FGENESH software ( http wwwsoftberrycom TAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Original Paper Characterization of soybean genomic features by analysis of its expressed sequence tags Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

1 Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

2 Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China Jin-Song Zhang Email jszhanggeneticsaccnShou-Yi Chen Email sychengeneticsaccn six BAC-contig sequences of M truncatula were analyzed and the results based on the gene prediction program FGENSH (Arabidopsis matchFGENESH prediction) (http Current Proteomics Volume 1 Number 1 January 2004 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE11 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Chinese Science Bulletin 2004 Vol 49 No 4 355-362 The VER2 promoter contains repeated sequences and requires vernalization for its activity in winter wheat (Triticum aestivum L) XU Wenzhong

1 WANG Xin

1 FENG Qi

2 ZHANG Lei

2 LIU Yaoguang

3 HAN Bin

2 CHONG

Kang1 XU Zhihong

1 amp TAN Kehui

1

1 Research Center for Molecular Developmental Biology Key Lab of Photosynthesis and Environmental Molecular Physiology Institute of Botany Chinese Academy of Sciences (CAS) Beijing 100093 China 2 National Center for Gene Research CAS Shanghai 200233 China 3 Genetic Engineering Laboratory College of Life Sciences South China Agricultural University Guangzhou 510642 China Correspondence should be addressed to Chong Kang (e-mail mailtochongknsibcasaccn) Sequence analyses were finished using biological softwares on Internet such as FGENESH 10 (Prediction of potential genes in Plant (Dct) genomic DNA)

TAG Theoretical and Applied Genetics DOI 101007s00122-003-1457-z Issue Volume 108 Number 3 Date February 2004 Pages 392 - 400

Sequence variations of simple sequence repeats on chromosome-4 in two subspecies of the Asian cultivated rice Can Li1 Yu Zhang1 Kai Ying1 Xiaolei Liang1 and Bin Han1

(1) National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China To characterize the possible relationship between SSRs and genes predicted by using FGENESH we investigated the distribution of SSRs in the rice chromosome-4 Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene TAG Theoretical and Applied Genetics DOI 101007s00122-004-1591-2 Issue Volume 108 Number 8 Date May 2004 Pages 1449 - 1457

Positional cloning of the rice Rf-1 gene a restorer of BT-type cytoplasmic male sterility that encodes a mitochondria-targeting PPR protein

H Akagi1 A Nakamura2 Y Yokozeki-Misono2 A Inagaki2 4 H Takahashi1 K Mori1 and T Fujimura3

1 Laboratory of Plant Breeding and Genetics Department of Biological Production Faculty of Bioresource Sciences Akita Prefectural University Kaidoubata-Nishi 241-7 Shimoshinjyo-Nakano 010-0195 Akita Japan

2 Biochemical Technology Section Life Science Laboratory Performance Materials RampD Center Mitsui Chemicals Togo 1144 297-0017 Mobara Japan

3 Institute of Agricultural and Forest Engineering University of Tsukuba Tennoudai 1-1-1 Tsukuba 305-8572 Ibaraki Japan

4 Present address Department of Synthetic Chemistry and Biological Chemistry Graduate School of Engineering Kyoto University Yoshida-Honmachi Sakyo-ku 606-8501 Kyoto japan

H Akagi Email akagiakita-puacjp Software Develop- ment Tokyo) Genomic sequences were also analyzed using gene prediction programs genescan and fgenesh Table 1 DNA Genome Research 14942-950 2004

The Ensembl Automatic Gene Annotation System Val Curwen1 Eduardo Eyras1 T Daniel Andrews1 Laura Clarke1 Emmanuel Mongin2 Steven MJ Searle1 and Michele Clamp34 1 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 EMBL European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SD UK 3 The Broad Institute Cambridge Massachusetts 02141 USA Commonly we use Genscan for ab initio prediction in human mouse and rat but the system is equally applicable to other methods such as FgenesH (Solovyev et Gene 324 (2004) 105ndash115 Transcript abundance of rml1 encoding a putative GT1-like factor in rice is up-regulated by Magnaporthe grisea and down-regulated by light Rong Wang ab Guofan Hongab Bin Hana a

National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China b

Shanghai Institute of Biochemistry and Cell Biology Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 320 Yueyang Road Shanghai 200031 China 1A and 2A) The structure of rml1 _ a is as same as that predicted by FGENESH software The 3V UTR of rml1 _ a is confirmed with the length of 596 bp Genome Biology 2004 5R46 doi101186gb-2004-5-7-r46 Identification of conserved gene structures and carboxy-terminal motifs in the Myb gene family of Arabidopsis and Oryza sativa L ssp Indica Cizhong Jiang1 Xun Gu1 2 and Thomas Peterson1 1Department of Genetics Development and Cell Biology and Department of Agronomy Iowa State University Ames IA 50011 USA 2LHB Center for Bioinformatics and Biological Statistics Iowa State University Ames IA 50011 USA FGeneSH has been used successfully to predict genes in rice [9] and GenScan was used together with it to predict genes by taking rice genomic sequences as Molecular Microbiology 53 (5) 1307-1318 - September 2004 doi 101111 j1365-2958200404215x The sirodesmin biosynthetic gene cluster of the plant pathogenic fungus Leptosphaeria maculans Donald M Gardiner1 Anton J Cozijnsen1 Leanne M Wilson1 M Soledade C Pedras2 and Barbara J Howlett1

1School of Botany The University of Melbourne Victoria Australia 3010 2Department of Chemistry University of Saskatchewan 110 Science Place Saskatoon SK Canada S7N 5C9 E-mail dgardinerpgradunimelbeduau http wwwtigrorg Putative genes were predicted using Fgenesh software at http wwwsoftberrycom Fungal culture The wild type Journal of Molecular Evolution DOI 101007s00239-004-2666-z Issue Volume 59 Number 6 Date December 2004 Pages 761 - 770 Analysis of the Molecular Evolutionary History of the Ascorbate Peroxidase Gene Family Inferences from the Rice Genome Felipe Karam Teixeira1 Larissa Menezes-Benavente1 Rogeacuterio Margis1 2 and Maacutercia Margis-Pinheiro1

(1) Laboratoacuterio de Geneacutetica Molecular Vegetal Departamento de Geneacutetica UFRJ 21944-970 Rio de Janeiro Brasil(2) Departamento de Bioquiacutemica Instituto de Quiacutemica UFRJ 21944-970 Rio de Janeiro Brasil Genomic se- quences were also analyzed in the FGENESH gene structure pre- diction program (httpwwwsoftberrycom) (Solovyev 2001) and GeneMark program (http Incomplete gene structure prediction with almost 100 specificity SL Chin J Xiong T Ioerger SH Sze - 2004 - txspacetamuedu iv Fgenesh-C The proposed gene structure prediction algorithm by far has the best 25 12 Comparison against Est2Genome Sim4 Spidey Fgenesh-c Journal of Biotechnology 109 (2004) 217ndash226

Preparation of single rice chromosome for construction of a DNA library using a laser microbeam trap

Xiaohui Liu a1 Haowei Wang b1 Yinmei Li b Yesheng Tang a Yilei Liu a Xin u a Peixin Jia a Kai Ying a Qi Feng a Jianping Guan a Chaoqing Jin a Lei Zhang a Liren Lou b Zhuan Zhou c Bin Han aA National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200233 China B Department of Physics University of Science and Technology of China Hefei 230026 China C Institute of Neuroscience Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200031 China Corresponding author Fax +86-21-64825775 E-mail address bhanncgraccn (B Han) ers These sequences were further annotated using gene-prediction software FGENESH to give the pos- sible protein-coding region Science 303 1364-1367 Medicago truncatula DMI1 Required for Bacterial and Fungal Symbioses in Legumes Ane et al (2004) 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberry phtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Proc Natl Acad Sci U S A 2004 August 24 101(34) 12404ndash12410 Inaugural Articles Rapid recent growth and divergence of rice nuclear genomes Jianxin Ma and Jeffrey L Bennetzen

Department of Genetics University of Georgia Athens GA 30602 To whom correspondence should be addressed E-mail maizeugaedu Contributed by Jeffrey L Bennetzen May 25 2004 Almost all LTR-retrotransposons including solo LTRs identified in our studies were predicted as genes by the gene-finding program fgenesh (data not shown) The Plant Journal Volume 37 Issue 4 Page 517 -527 - February 2004 doi101046j1365-313X200301976x Xa26 a gene conferring resistance to Xanthomonas oryzae pv oryzae in rice encodes an LRR receptor kinase-like protein Xinli Sun Yinglong Cao Zhifen Yang Caiguo Xu Xianghua Li Shiping Wang and Qifa Zhang National Key Laboratory of Crop Genetic Improvement National Center of Crop Molecular Breeding Huazhong Agricultural University Wuhan 430070 China For correspondence (fax +86 27 87287092 e-mail swangmailhzaueducn) al 1997) Gene prediction programs used were genscan (Burge and Karlin 1997) and fgenesh (httpwwwsoftberrycom) Promoter Genome Research 141916ndash1923 copy2004 by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonˇovaґ16 Jinsheng Lai16 Jianxin Ma23 Wusirika Ramakrishna24

Victor Llaca15 Jeffrey L Bennetzen23 and Joachim Messing17

1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854 USA

2Department of Biological Sciences and Genetics Program West Lafayette Indiana 47907 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602 USA 4Department of Biological Sciences Michigan Tech University MI 49931 USA 5Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880 USA 6These authors contributed equally to this work 7Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 Page 1 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonovaacute 16 Jinsheng Lai 16 Jianxin Ma 23 Wusirika Ramakrishna 24 TAG Theoretical and Applied Genetics DOI 101007s00122-004-1667-z Issue Volume 109 Number 3 Date August 2004 Pages 515 - 522

Physical mapping and putative candidate gene identification of a quantitative trait locus Ctb1 for cold tolerance at the booting stage of rice K Saito1 Y Hayano-Saito1 W Maruyama-Funatsuki1 Y Sato1 and A Kato1

(1) National Agricultural Research Center for Hokkaido Region Hitsujigaoka 1 Toyohira Sapporo Hokkaido 062-8555 JapanK Saito Email kjsaitoaffrcgojp GENSCAN RICEHMM FGENESH MZEF ) a splice prediction program ( SPLICEPREDIC- TOR ) homology search analysis programs ( BLAST HMMER TAG Theoretical and Applied Genetics DOI 101007s00122-004-1697-6 Issue Volume 109 Number 4 Date August 2004 Pages 690 - 699 The anthracnose resistance locus Co-4 of common bean is located on chromosome 3 and contains putative disease resistance-related genes M Melotto1 4 M F Coelho1 A Pedrosa-Harand2 J D Kelly3 and L E A Camargo1

1 Departamento de Fitopatologia Laboratoacuterio de Geneacutetica Molecular ESALQ Universidade de Satildeo Paulo Piracicaba SP CP 9 13418-900 Brazil

2 Department of Cell Biology and Genetics Institute of Botany University of Vienna Rennweg 14 Vienna 1030 Austria

3 Department of Crop and Soil Sciences Michigan State University East Lansing MI 48824 USA 4 Present address MSU-DOE Plant Research Laboratory Michigan State University 206 Plant Biology

Building East Lansing MI 48824 USA M Melotto Email melottommsuedu and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (httpwww softberrycom)mdashusing Arabidopsis as the model or- ganism Journal of Genetics Vol 83 No 1 P 79-99 April 2004 Structural and functional analysis of rice genome Tyagi A K Khurana J P Khurana P Raghuvanshi S Gaur A Kapur A Gupta V Kumar D Ravi V Vij S Khurana P and Sharma S Department of Plant Molecular Biology University of Delhi South Campus Benito Juarez Road New Delhi 110 021 India It inte- grates results from several gene prediction software such as GENSCAN (Burge and Karlin 1997) FGENESH (Sala- mov and Solovyev 2000) RiceHMM (Sakata The Plant Cell 161220-1234 (2004) Comparative Analysis of the Receptor-Like Kinase Family in Arabidopsis and Rice Shin-Han Shiua Wojciech M Karlowskib Runsun Panad Yun-Huei Tzengac Klaus F X Mayerb and Wen-Hsiung Lia1 a Department of Ecology and Evolution University of Chicago Chicago Illinois 60637 b Munich Information Center for Protein SequencesInstitute of Bioinformatics GSF National Research Center for Environment and Health Neuherberg 85764 Germany c Department of Mathematics National Tsing Hua University Hsinchu Taiwan 300 d Institute of Information Science Academia Sinica Taiwan 115

1 To whom correspondence should be addressed E-mail whliuchicagoedu fax 773-702-9740 a permissive E value cutoff of 1 The rice genes from the indica subspecies was predicted using the whole genome shotgun assembly with FGENESH (Solovyev 2002 Genome Research 141474ndash1482 (2004) copy by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Incongruent Patterns of Local and Global Genome Size Evolution in Cotton Corrinne E Grover1 HyeRan Kim2 Rod A Wing2 Andrew H Paterson3 and Jonathan F Wendel14 1Department of Ecology Evolution and Organismal Biology Iowa State University Ames Iowa 50011 USA 2Arizona Genomics Institute University of Arizona Tucson Arizona 85721 USA 3Plant Genome Mapping Laboratory University of Georgia Athens Georgia 30602 USA hellip Potential genes were predicted by three independent programs FGENESH (httpwwwsoftberrycom) Plant Physiology May 2004 Vol 135 pp 459-470 Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four Triticeae Genomes Yong Qiang Gu Devin Coleman-Derr Xiuying Kong and Olin D Anderson United States Department of Agriculture-Agricultural Research Service Western Regional Research Center Albany California 94710 (YQG DC-D ODA) and Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081 China (XK) FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology October 2004 Vol 136 pp 3177-3190 Comparative Sequence Analysis of the Region Harboring the Hardness Locus in Barley and Its Colinear Region in Rice1

Katherine S Caldwell2 Peter Langridge and Wayne Powell Scottish Crop Research Institute Invergowrie Dundee DD2 5DA United Kingdom (KSC WP) and School of Agriculture and Wine (KSC PL) and Australian Centre for Plant Functional Genomics (PL) University of Adelaide Waite Campus Glen Osmond South Australia 5064 Australia jp Sakata et al 2002 ) which couples the integration of several programs for the prediction of open reading frames (GENSCAN RiceHMM FGENESH MZEF) with GENES amp DEVELOPMENT 18687-699 2004 pyramus and thisbe FGF genes that pattern the mesoderm of Drosophila embryos

Angelike Stathopoulos1 Bergin Tam1 Matthew Ronshaugen1 Manfred Frasch2 and Michael Levine13 1 Department of Molecular and Cell Biology Division of Genetics amp Development University of California Berkeley California 94720-3204 USA 2 Brookdale Department of Molecular Cell and Developmental Biology Mount Sinai School of Medicine New York New York 10029 USA hellip FGF protein sequences used in alignment and phylogenetic reconstruction were gathered from GenBank or inferred from genomic sequence using GENESCAN (Burge and Karlin 1997 ) and FGENESHhellip Genome Research 141888-1901 2004 Organization and Evolution of a Gene-Rich Region of the Mouse Genome A 127-Mb Region Deleted in the Del(13)Svea36H Mouse Ann-Marie Mallon14 Laurens Wilming24 Joseph Weekes1 James GR Gilbert2 Jennifer Ashurst2 Sandrine Peyrefitte2 Lucy Matthews2 Matthew Cadman1 Richard McKeone1 Chris A Sellick1 Ruth Arkell1 Marc RM Botcherby3 Mark A Strivens1 R Duncan Campbell3 Simon Gregory25 Paul Denny1 John M Hancock16 Jane Rogers2 and Steve DM Brown1 1 Medical Research Council Mammalian Genetics Unit Harwell Oxfordshire United Kingdom 2 Wellcome Trust Sanger Institute Hinxton Genome Campus United Kingdom 3 Medical Research Council Rosalind Franklin Centre for Genomics Research Hinxton Genome Campus United Kingdom hellip Ab initio gene structures were predicted using FGENESH (Salamov and Solovyev 2000 ) and GENSCANhellip Current Proteomics January 2004 vol 1 no 1 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE1 Affiliations 1 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom

sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene Nucleic Acids Research 2003 Vol 31 No 4 1148-1155 Characterization of Arabidopsis thaliana ortholog of the human breast cancer susceptibility gene 1 AtBRCA1 strongly induced by gamma rays S Lafarge and M-H Montaneacute CEA Cadarache DSV-DEVM Laboratoire de Radiobiologie Veacutegeacutetale Bat 185 F-13108 St Paul Lez Durance Cedex France To whom correspondence should be addressed Tel +33 4 42 25 35 56 Fax + 33 4 42 25 26 25 Email marie-helenemontaneceafr Received November 18 2002 Accepted December 5 2002 DDBJEMBLGenBank accession no AF515728 hellipGene structure prediction was done on software implemented on the Softberry web page (httpwwwsoftberrycom) analysis of protein domains using the SMARThellip hellipThe gene structure of At4g21070 was determined with three gene structure prediction software packages (Softberry GenScan Grail) hellip To resolve this ambiguity in intronndashexon prediction we postulated the presence of two genes given by Softberry prediction software and performed northern blotting and 5 RACE to characterize the structural organization of the At4g21070 locushellip Proc Natl Acad Sci U S A 2003 July 22 100(15) 9055ndash9060 doi 101073pnas1032999100 Plant Biology Gene expression of a gene family in maize based on noncollinear haplotypes Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway NJ 08854-8020 Communicated by Brian A Larkins University of Arizona Tucson AZ May 19 2003 (received for review 2002 April 10) To whom correspondence should be addressed E-mail messingmbclrutgersedu The FGENESH program (Softberry Mount Kisco NY) was used for gene prediction analysis BMC Genomics 2003 4 22

doi 1011861471-2164-4-22 Published online 2003 June 3 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams 12 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA 3Department of Biological Sciences Wichita State University Wichita Kansas USA

Corresponding author Sreedhar Oduru odurusreedharttuhscedu Janee L Campbell janeecampbellttuhscedu SriTulasi Karri phrskttuhscedu William J Hendry williamhendrywichitaedu Shafiq A Khan shafiqkhanttuhscedu Simon C Williams simonwilliamsttuhscedu

Two gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=H Insect Molecular Biology Volume 12 Issue 4 Page 319 - August 2003 doi101046j1365-2583200300415x Expression of an Aedes aegypti cation-chloride cotransporter and its Drosophila homologues V Filippov K Aimanova and S S GillAffiliations Department of Cell Biology and Neuroscience University of California Riverside USA Correspondence Sarjeet S Gill 5429 Boyce Hall Environmental Toxicology Graduate Program University of California Riverside CA 92521 USA Tel +1 909 787 4621 Fax +1 909 787 3087 E-mail Sarjeetgillucredu significant similarity to the Drosophila genes were used for gene structure prediction with the FGENESH program available on site http wwwsoftberrycom Developmental Biology 256 (2003) 276ndash289 tcl-2 encodes a novel protein that acts synergistically with Wnt signaling pathways in C elegans Xiaojun Zhaoa Hitoshi Sawab and Michael A Hermana a Program in Molecular Cellular and Developmental Biology Division of Biology Kansas State University Manhattan KS 66506 USA b Laboratory for Cell Fate Decision RIKEN Center for Developmental Biology 2-2-3 Minatojima-minamimachi Chuo-ku Kobe 650-0047 Japan Received for publication 10 September 2002 revised 25 November 2002 accepted 19 December 2002 hellipCbTCL-2 is conceptually translated from a gene predicted by the FGENSH (Salamov and Solovyev 2000 httpwwwsoftberrycom) using defaults for C elegans genomic sequences Proc Natl Acad Sci U S A 2003 May 27 100(11) 6569ndash6574 doi 101073pnas0732024100 Evolution Molecular paleontology of transposable elements in the Drosophila melanogaster genome Vladimir V Kapitonov and Jerzy Jurka

Genetic Information Research Institute 2081 Landings Drive Mountain View CA 94043 Communicated by Margaret G Kidwell University of Arizona Tucson AZ April 7 2003 (received for review 2002 December 23)

To whom correspondence may be addressed E-mail vladimirulamgirinstorg or jurkagirinstorg hellipWe used FGENESH (ref 18 wwwsoftberrycom) for identifying genes encoded by TEs Genetics and Molecular Biology ISSN 1415-4757 versioacuten impresa Genet Mol Biol v26 n4 Satildeo Paulo dic 2003 Iron homeostasis related genes in rice Jeferson GrossI II Ricardo Joseacute SteinII Arthur Germano Fett-NetoI II Janette Palma FettI II

IUniversidade Federal do Rio Grande do Sul Centro de Biotecnologia Porto Alegre RS Brazil IIUniversidade Federal do Rio Grande do Sul Departamento de Botacircnica Porto Alegre RS Brazil IIIBotanical Institute of Ludwig-Maximilians-Universitaumlt Muumlnchen Germany The prediction algorithms were GenScan (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) GenomeScan (Burge and Karlin 1997 httpgenesmitedugenomescanhtml) FGENESH (Salamov and Solovyev 2000 httpwwwsoftberrycomberryphtmltopic= gfind) GeneMarkhmm (Borodovsky and Lukashin unpublished httpopalbiologygatecheduGeneMarkeukhmmcgi) and GrailEXP (Xu and Uberbacher 1997 httpcompbioornlgovgrailexp) Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome J Lai N Dey CS Kim AK Bharti S Rudd KFX Mayer hellip for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for monocotyledonous genes (httpwwwsoftberrycom) Published 23 September 2003 BMC Plant Biology 2003 36 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya12 Address 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia and 2NSW Agricultural Genomics Centre Wagga Wagga Australia Email Qian-Hao Zhu - qianhaozhucsiroau Mohammad Shamsul Hoque - mohammadhoquecsiroau Elizabeth S Dennis - lizdenniscsiroau Narayana M Upadhyaya - narayanaupadhyayacsiroau Corresponding author hellip Analyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCANhellip

The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Australasian Plant Pathology Volume 32 Number 4 2003 pp 511-519 Small scale functional genomics of the blackleg fungus Leptosphaeria maculans analysis of a 38 kb region Alexander Idnurm Janet L Taylor M Soledade C Pedras and Barbara J Howlett vertebrate and Arabidopsis settings Burge and Karlin 1997) and FGENESH on Neurospora crassa and Schizosaccharomyces pombe settings (wwwsoftberrycom) as Barley Genetics Newsletter Volume 32 Hard-copy edition pages 34 - 37 MAPPING AND SEQUENCING OF THE BARLEY PUTATIVE HYPERSENSITIVE INDUCED REACTION GENES Nils Rostoks1 David Kudrna1 and Andris Kleinhofs12

1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 2 School of Molecular Biosciences Washington State University Pullman WA 99164 The full length coding sequence was reconstructed using a combination of FGENESH gene prediction program (httpwwwsoftberrycom) and alignment with cDNAs from the other barley HIR groups TAG Theoretical and Applied Genetics DOI 101007s00294-003-0391-6 Issue Volume 43 Number 5 Date August 2003 Pages 351 - 357 Characterisation of the mating-type locus of the plant pathogenic ascomycete Leptosphaeria maculans Anton J Cozijnsen A1 and Barbara J Howlett A1 A1 School of Botany The University of Melbourne 3010 Victoria Australia hellipGenes introns exons and transcription initiation sites were predicted by analysis with FGENESH (wwwsoftberrycom) on Neurospora crassa andhellip BMC Plant Biol 2003 3 6

doi 1011861471-2229-3-6 Published online 2003 September 23 Received May 23 2003 Accepted September 23 2003 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya 12 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia 2NSW Agricultural Genomics Centre Wagga Wagga Australia

Corresponding author Qian-Hao Zhu qianhaozhucsiroau Mohammad Shamsul Hoque mohammadhoquecsiroau Elizabeth S Dennis lizdenniscsiroau Narayana M Upadhyaya narayanaupadhyayacsiroau hellipAnalyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml identified a single-exon gene capable of encoding a protein with the DNA binding domain of the EREBPAP2 family of plant transcription factors [2636] 1515 bp downstream from the Ds insertion The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Alignment of EREBPAP2 domains was performed using programs of Genetics Computer Group Wisconsin software suit [11] Genetics Vol 164 655-664 June 2003 Copyright copy 2003 Map-Based Cloning of Leaf Rust Resistance Gene Lr21 From the Large and Polyploid Genome of Bread Wheat Li Huanga Steven A Brooksa Wanlong Lia John P Fellersb Harold N Tricka and Bikram S Gilla a Wheat Genetics Resource Center Department of Plant Pathology Kansas State University Manhattan Kansas 66506-5502 b USDA-ARS Plant Science and Entomology Unit Kansas State University Manhattan Kansas 66506-5502 Corresponding author Bikram S Gill 4024 Throckmorton Kansas State University Manhattan KS 66506-5502 bsgksuedu (E-mail hellipIn addition FGENSH 11 (httpwwwsoftberrycom) was used for gene prediction (with monocot genomic DNA parameters) Nucleic Acids Research 2003 Vol 31 No 1 229-233 The TIGR rice genome annotation resource annotating the rice genome and creating resources for plant biologists Qiaoping Yuan Shu Ouyang Jia Liu Bernard Suh Foo Cheung Razvan Sultana Dan Lee John Quackenbush and C Robin Buell The Institute for Genomic Research 9712 Medical Center Dr Rockville MD 20850 USA To whom correspondence should be addressed Tel +1 301 8383558 Fax +1 301 8380208 Email rbuelltigrorg Received August 14 2002 Revised and Accepted October 2 2002 hellipThe rice sequences were processed with multiple ab initio gene finders including FGENESH (httpwwwsoftberrycom)hellip hellip Working models were generated using the FGENESH output and putative identification for the gene was obtained from the most significant database match while models with no significant database match were labeled as hypothetical proteins

JXB Advance Access originally published online on June 18 2003 Journal of Experimental Botany Vol 54 No 389 pp 1995-1996 August 1 2003 Received 21 April 2003 Accepted 25 April 2003 OsSET1 a novel SET-domain-containing gene from rice Yun-Kuan Liang Ying Wang Yong Zhang Song-Gang Li Xiao-Chun Lu Hong Li Cheng Zou Zhi-Hong Xu and Shu-Nong Bai PKU-Yale Joint Research Center of Agricultural and Plant Molecular Biology National Key Laboratory of Protein Engineering and Plant Gene Engineering College of Life Sciences Peking University 5 Yiheyuan Road Beijing 100871 PR China To whom correspondence should be addressed Fax +86 10 6275 1526 E-mail shunongbpkueducn It localizes at chromosome three in rice genome at the contig 1300 (httpwwwsoftberrycomberryphtmltopic=gfindampprg=FGENESH GenBank accession number BMC Genomics 2003 422 Published 3 June 2003 Received 31 January 2003 Accepted 3 June 2003 This article is available from httpwwwbiomedcentralcom1471-2164422 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams12 Address 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA and 3Department of Biological Sciences Wichita State University Wichita Kansas USA Email Sreedhar Oduru - odurusreedharttuhscedu Janee L Campbell - janeecampbellttuhscedu SriTulasi Karri - phrskttuhscedu William J Hendry - williamhendrywichitaedu Shafiq A Khan - shafiqkhanttuhscedu Simon C Williams - simonwilliamsttuhscedu Corresponding author hellipTwo gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=HsapiensTAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Characterization of soybean genomic features by analysis of its expressed sequence tags

Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

Jin-Song Zhang Email jszhanggeneticsaccn

Shou-Yi Chen Email sychengeneticsaccn Phone +86-10-64886859 Fax +86-10-64873428

(1) Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

(2) Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China

prediction of these BAC-contig sequences was based on the gene-prediction program FGENSH (Arabidopsis matchFGENESH DDT Vol 7 No 11 (Suppl) 2002 S70-S76 wwwdrugdiscoverytodaycom Genome annotation techniques new approaches and challenges Alistair G Rust Emmanuel Mongin and Ewan Birney European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridge UK CB10 1SD tel +44 1223 494420 fax +44 1223 494468 e-mail birneyebiacuk Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Box 1 Useful human genome annotation and browser URLs Human genome browsers bull UCSC Human Genome Browser httpgenomecseucsceducgi-binhgGateway bull Softberry Genome Explorer httpwwwsoftberrycomberryphtmltopic=genomexp Ab initio gene prediction programs Ab initio gene predictors rely on the statistical qualities of exons rather than on homologies Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Celerarsquos pipeline) and GrailEXP [16] (ORNL) Proc Natl Acad Sci U S A 2002 August 20 99(17) 11423ndash11428 doi 101073pnas162276199 Neurobiology Identification of G protein-coupled receptors for Drosophila PRXamide peptides CCAP corazonin and AKH supports a theory of ligand-receptor coevolution Yoonseong Parkdagger Young-Joon Kim and Michael E AdamsdaggerDagger

Departments of Entomology and daggerCell Biology and Neuroscience 5429 Boyce Hall University of California Riverside CA 92521 Edited by Lutz Birnbaumer National Institutes of Health Research Triangle Park NC and approved June 14 2002 (received for review 2002 May 7) DaggerTo whom reprint requests should be addressed E-mail adamsmailucredu

This article has been corrected See Proc Natl Acad Sci U S A 2002 October 15 99(21) 13961b For each Drosophila GPCR prediction of gene structure was made in FGENESH (wwwsoftberrycom ref 21) by using about 20 kb of genomic sequence surrounding highly conserved regions particularly for 5 prime and 3 prime ends of ORFs Putative Drosophila GPCRs in the database were amplified by RT-PCR using primers based on gene predictions in the FGENESH gene finder (wwwsoftberrycom ref 21) 21 Salamov A A amp Solovyev V V (2000) Genome Res 10 516-522

prediction httpwwwsoftberrycomberry

Eukaryotic Cell October 2002 p 719-724 Vol 1 No 5 Isocitrate Lyase Is Essential for Pathogenicity of the Fungus Leptosphaeria maculans to Canola (Brassica napus) Alexander Idnurm and Barbara J Howlett School of Botany The University of Melbourne Melbourne Victoria 3010 Australia Received 17 June 2002 Accepted 29 July 2002 hellip The DNA sequence obtained was compared to those in the GenBank database by using BLAST (1) and genes were predicted by using FGENESH software (httpwwwsoftberrycom) and GENSCAN (wwwbionavigatorcom) Bio-Almanac GLE ToxExpress P Offering CLG Annotated sequence data The genes are identified with the FGENESH11 gene modeling software exclusively li- censed from Softberry Inc Automatic Analysis of 106 kb of contiguous DNA sequence from the D genome of wheat reveals high gene density hellip SA Brooks L Huang BS Gill JP Fellers trix In addition FGENESH 11 (httpwwwsoftberrycom) was used for CDS prediction with monocot genomic DNA parameters Both Molecular Genetics and Genomics DOI 101007s00438-002-0706-1 Issue Volume 267 Number 6 Date August 2002 Pages 713 - 720 Genome sequencing of a 239-kb region of rice chromosome 10L reveals a high frequency of gene duplication and a large chloroplast DNA insertion Q Yuan J Hill J Hsiao K Moffat S Ouyang Z Cheng J Jiang C Buell A1 The Institute for Genomic Research 9712 Medical Center Drive Rockville MD 20850 USA A2 Department of Horticulture University of Wisconsin Madison WI 53706 USA The sequences were analyzed with several gene prediction programs including FGENESH (httpwwwsoftberrycom) Genemarkhmm (rice matrix httpopalbiology

Genetics Vol 162 1389-1400 November 2002 Copyright copy 2002 Different Types and Rates of Genome Evolution Detected by Comparative Sequence Analysis of Orthologous Segments From Four Cereal Genomes Wusirika Ramakrishnaa Jorge Dubcovskyb Yong-Jin Park1a Carlos Bussob John Embertona Phillip SanMiguelc and Jeffrey L Bennetzena a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Department of Agronomy and Range Science University of California Davis California 95616 c Purdue University Genomics Core WSLR Purdue University West Lafayette Indiana 47907

Corresponding author Jeffrey L Bennetzen Hansen Bldg Purdue University West Lafayette IN 47907 maizebilbobiopurdueedu (E-mail)

FGENESH (httpwwwsoftberrycomnucleohtml) with the maize training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and

GeneMarkhmm (httpgenemarkbiologygatecheduGene Mark) Functional amp Integrative Genomics DOI 101007s10142-002-0055-5 Issue Volume 2 Numbers 1-2 Date May 2002 Pages 51 - 59 Genomic sequencing reveals gene content genomic organization and recombination relationships in barley Nils Rostoks Yong-Jin Park Wusirika Ramakrishna Jianxin Ma Arnis Druka Bryan A Shiloff Phillip J SanMiguel Zeyu Jiang Robert Brueggeman Devinder Sandhu Kulvinder Gill Jeffrey L Bennetzen Andris Kleinhofs A1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA A2 Department of Biological Sciences Purdue University West Lafayette IN 47907 USA A3 National Center for Genome Resources 2935 Rodeo Park Drive East Santa Fe NM 87505 USA A4 G302 Agronomy Hall Iowa State University Ames IA 50011-1010 USA A5 Department of Agronomy University of Nebraska Lincoln NE 68583 USA A6 School of Molecular Biosciences and Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA version 10 with maize parameters The FGENESH predictions were run at httpwwwsoftberrycom BAC genomic regions were defined Structural organization of the barley D-hordein locus in comparison with its orthologous regions of hellip YQ Gu OD Anderson CF Londeore X Kong RN hellip et al 1997) to search for additional genes In addition FGENESH (httpwwwsoftberrycomberryphtml) and GENESCAN (httpgenes

Published online before print June 20 2002 101073pnas142284999 PNAS | July 9 2002 | vol 99 | no 14 | 9328-9333 The barley stem rust-resistance gene Rpg1 is a novel disease-resistance gene with homology to receptor kinases R Brueggeman N Rostoks D Kudrna A Kilian F Han J Chensect A Druka B Steffensonpara and A Kleinhofs Department of Crop and Soil Sciences Washington State University Pullman WA 99164-6420 para Department of Plant Pathology 495 Borlaug Hall 1991 Upper Buford Circle St Paul MN 55108-6030 and School of Molecular Biosciences Washington State University Pullman WA 99164-4234 Communicated by Diter von Wettstein Washington State University Pullman WA May 13 2002 (received for review March 25 2002) The gene prediction programs GENSCAN (httpgenesmiteduGENSCANhtml) and FGENESH (httpwwwsoftberrycom) as well as NEURAL NETWORK PROMOTER

PREDICTION (httpwwwfruitflyorgseq_toolspromoterhtml) localized the putative transcription start site of the gene about 400 bp upstream of the translation start site Plant Physiol 2002 December 130(4) 1626ndash1635 doi 101104pp012179 Received July 30 2002 Accepted October 1 2002 Contiguous Genomic DNA Sequence Comprising the 19-kD Zein Gene Family from Maize1

Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway New Jersey 08854ndash8020 Corresponding author e-mail messingmbclrutgersedu fax 732ndash445ndash0072 Draft sequences generated from high-throughput DNA sequencing (phase II) were subjected to gene prediction programs with FGENESH (Softberry Inc Mount Kisco NY) The Plant Cell Vol 14 3213-3223 December 2002 Copyright copy 2002 Received July 22 2002 accepted September 26 2002 Structural Analysis of the Maize Rp1 Complex Reveals Numerous Sites and Unexpected Mechanisms of Local Rearrangement Wusirika Ramakrishnaa John Embertona Matthew Ogdena Phillip SanMiguelb and Jeffrey L Bennetzen1a a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 1 To whom correspondence should be addressed E-mail maizebilbobiopurdueedu fax 765-496-1496 FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCAN Plant Physiol 2002 December 130(4) 1728ndash1738

doi 101104pp014951 Comparative Sequence Analysis of the Sorghum Rph Region and the Maize Rp1 Resistance Gene Complex Wusirika Ramakrishna John Emberton Phillip SanMiguel Matthew Ogden Victor Llaca Joachim Messing and Jeffrey L Bennetzen

Department of Biological Sciences Purdue University West Lafayette Indiana 47907 (WR JE MO JLB) Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 (PSM) and Waksman Institute Rutgers University Piscataway New Jersey 08854 (VL JM) Corresponding author e-mail maizebilbobiopurdueedu fax 765ndash496ndash1496 Received September 19 2002 Accepted October 8 2002 hellipAnnotation and sequence analysis were performed as described earlier (Dubcovsky et al 2001 Song et al 2001 Ramakrishna et al 2002a ) FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and GeneMarkhmm (httpopalbiologygatecheduGeneMarkeukhmmcgi) Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four hellipYQ Gu D Coleman-Derr X Kong OD Anderson FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology 2004 - plantphysiolorg - plantphysiolorg - intlplantphysiolorg - ncbinlmnihgov - all 5 versions raquoA Genome-Wide Screen Identifies Genes Required for Centromeric Cohesion JJ Doyle J Denarie F Debelle JC Prome BB Amor hellip 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberryphtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Various programs Plant Molecular Biology Issue Volume 58 Number 3 Date June 2005 Pages 421 ndash 433 DOI 101007s11103-005-5702-5 OsPPR1 a pentatricopeptide repeat protein of rice is essential for the chloroplast biogenesis Kodiveri M Gothandam1 Eun-Sook Kim1 Hongjoo Cho1 and Yong-Yoon Chung1

(1) School of Life Sciences and Biotechnology Korea University Sungbuk-ku 136-701 Seoul Anam-Dong Korea nucleotide and amino acid sequences were analyzed by the Basic Local Alignment SearchTool (BLAST) and the Soft berry prog- rame (httpwwwsoftberrycom)

Plant Physiol February 2002 Vol 128 pp 336-340 wwwplantphysiolorgcgidoi101104pp010875 Received September 25 2001 returned for revision September 29 2001 accepted November 2 2001 Cellulose Synthase-Like Genes of Rice1

Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 The Rice Genome Research Program cDNA clones were of high quality all but one were viable and accurately annotated The one exception D22177 was chimeric containing OsCSLA2 at one end and a predicted DNA-binding protein at the other For all sequences the corresponding

proteins were deduced using gene prediction software from GeneMark (Atlanta httpopalbiologygatecheduGeneMark) and Softberry Inc (White Plains NY httpwwwsoftberrycom) and by manual alignment with the Arabidopsis Csl proteins and with each other IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated Silby et al Microbiology2004 150 518-520 MW Silby PB Rainey SB Levy Using SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of Plant Cell 2002 September 14(9) 2107ndash2119 Received March 25 2002 Accepted June 14 2002 Two Novel Fungal Virulence Genes Specifically Expressed in Appressoria of the Rice Blast Fungus

Chaoyang Xuea Gyungsoon Parka Woobong Choib Li Zhengc Ralph A Deanb and Jin-Rong Xua1

aDepartment of Botany and Plant Pathology Purdue University West Lafayette Indiana 47907 bDepartment of Plant Pathology North Carolina State University Raleigh North Carolina 27606 cSyngenta Agribusiness Biotechnology Research Inc Research Triangle Park North Carolina 27709 1To whom correspondence should be addressed E-mail xubtnypurdueedu fax 765-494-0363 Approximately 12- and 14-kb upstream sequences of GAS1 and GAS2 were sequenced and analyzed with several programs including TRES (wwwbioportalbicnusedusgtres) Expasy (wwwexpasyorg) and SoftBerry (wwwsoftberrycom) European Journal of Neuroscience January 2002 vol 15 no 1 pp 79-86(8) Characterizing CGI-94 (comparative gene identification-94) which is down-regulated in the hippocampus of early stage Alzheimers disease brain Heese K Nakayama T Hata R Masumura M Akatsu H Li F Nagai Y Yamamoto T Kosaka K Suemoto T Sawada T Additionally protein sequence analysis was performed using the following programs at ExPASy httpwwwexpasych softberry httpwwwsoftberrycomindex Journal of Cellular Biochemistry Volume 91 Issue 5 Pages 1030 - 1042 Published Online 25 Feb 2004 Copyright copy 2004 Wiley-Liss Inc A Wiley Company Received 4 September 2003 Accepted 21 November 2003 Characterizing the new transcription regulator protein p60TRP K Heese 1 T Yamada 1 H Akatsu 2 T Yamamoto 2 K Kosaka 2 Y Nagai 1 T Sawada 1

1BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan 2Choju Medical Institute Fukushimura Hospital 19-14 Aza-Yamanaka Noyori Toyohashi Aichi 441-8124 Japan email K Heese (heeseksilverocnnejp) Correspondence to K Heese BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan expasych) softberry httpwwwsoftberry comindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorg toolsaacomp Genome Research 14929-933 2004 ISSN 1088-9051 $500 The Ensembl Core Software Libraries Arne Stabenau1 Graham McVicker1 Craig Melsopp1 Glenn Proctor1 Michele Clamp2 and Ewan Birney13 1 EMBL European Bioinformatics Institute Wellcome Trust Genome Campus Hinxton CB10 1SD UK 2 The Broad Institute Cambridge Massachusetts 02141-2023 USA

2003 ) Finally there have been several commercial genome management products based on proprietary technology from Softberry Celera and Doubletwist Generation of T-DNA tagging lines with a bidirectional gene trap vector 2 and the establishment of hellip G An annotated in the public databases we undertook 274 annotation with the Softberry program (http 275 wwwsoftberrycomberryphtml) Functional clas- 276 Journal of Bacteriology January 2002 p 183-190 Vol 184 No 1 0021-919301$0400+0 DOI 101128JB1841183-1902002 Received 25 July 2001 Accepted 11 October 2001 Regulation of the acuF Gene Encoding Phosphoenolpyruvate Carboxykinase in the Filamentous Fungus Aspergillus nidulans Michael J Hynes Oliver W Draht and Meryl A Davis Department of Genetics University of Melbourne Parkville Victoria 3010 Australia The Protein Sequence Analysis program (httpwwwsoftberrycomproteinhtml) predicted a PEPCK (ATP) signature sequence between amino acids 275 and 290 Proteins Structure Function and GeneticsVolume 53 Issue S6 Pages 352 - 368 Supplement Fifth Meeting on the Critical Assessment of Techniques for Protein Structure Prediction Published Online 15 Oct 2003 Received 4 March 2003 Accepted 23 June 2003 Digital Object Identifier (DOI) 101002prot10543 Comp Modeling Assessment Assessment of homology-based predictions in CASP5 Anna Tramontano 1 Veronica Morea 2

1Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Rome Italy 2CNR Institute of Molecular Biology and Pathology University of Rome La Sapienza Rome Italy email Anna Tramontano (AnnaTramontanouniromalit) Correspondence to Anna Tramontano Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Ple Aldo Moro 5-00185 Rome Italy hellipPage 1 Assessment of Homology-Based Predictions in CASP5 Anna Tramontano 1 and Veronica Morea 2 1 Department of Biochemical Sciences American Journal of HematologyVolume 73 Issue 3 Pages 161 - 168 Published Online 20 Jun 2003 Received 23 October 2002 Accepted 15 April 2003 Digital Object Identifier (DOI) 101002ajh10358

Spectrum of thalassemia mutations and HbF levels in the heterozygous Moroccan population Wafaa Lemsaddek 1 Isabel Picanccedilo 2 Filomena Seuanes 2 Lahoucine Mahmal 3 Saacircd Benchekroun 3 Mohammed Khattab 4 Paulo Nogueira 5 Leonor Osoacuterio-Almeida 1

1Laboratoacuterio de Geneacutetica Molecular Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa Caparica Portugal 2Laboratoacuterio de Hematologia Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal 3Service Heacutemato-Oncologie Hocircpital 20 Ao t CHU Ibn Rochd Casablanca Morocco 4Service Heacutemato-Oncologie Peacutediatrique Hocircpital dEnfants CHU Rabat Morocco 5Observatoacuterio Nacional de Sauacutede Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal email Leonor Osoacuterio-Almeida (mlofctunlpt) Correspondence to Leonor Osoacuterio-Almeida Laboratoacuterio de Geneacutetica Molecular Secccedilatildeo Autoacutenoma de Biotecnologia Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa 2829-516 Caparica Portugal hellip Page 1 Spectrum of Thalassemia Mutations and HbF Levels in the Heterozygous Moroccan Population Wafaa Lemsaddek 1 Isabel Picanccedilo Lecture Notes in Computer Science Publisher Springer-Verlag Heidelberg ISSN 0302-9743 Subject Computer Science Volume 2812 2003 Title Algorithms in Bioinformatics Third International Workshop WABI 2003 Budapest Hungary September 15-20 2003 Proceedings Editors Gary Benson Roderic Page ISBN 3-540-20076-2 DOI 101007b13243 Chapter pp 124 - 138 Online Date December 2003 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron1 and Irena Rusu1

(1) IRIN Universiteacute de Nantes 2 Rue de la Houssiniegravere BP 92208 44322 Nantes Cedex 3 France Page 1 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron and Irena Rusu Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Received 23 October 2003 accepted 15 December 2003 Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes

H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Brain Aging Vol 2 No 3 2002 Page 9-22 Long-term Efficacy of Cholinesterase Inhibitors Serge Gauthier McGill Centre for Studies in Aging Quebec Canada Correspondence Dr Serge Gauthier FRCPC McGill Centre for Studies in Aging 6825 LaSalle Boulevard Verdun Quebec Canada H4H 1R3 Tel +1 514 766 2010 Fax +1 514 888 4050 Email sergegauthiermcgillca hellipAdditionally protein sequence analysis was performed using the following programs at the ExPASyndashwwwndashserver (httpwwwexpasych) softberry httpwwwsoftberrycomindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorgtoolsaacomp Plant Physiology December 2003 Vol 133 pp 2040ndash2047 Generation and Analysis of End Sequence Database for T-DNA Tagging Lines in Rice1 Suyoung An2 Sunhee Park2 Dong-Hoon Jeong Dong-Yeon Lee Hong-Gyu Kang Jung-Hwa Yu Junghe Hur Sung-Ryul Kim Young-Hea Kim Miok Lee Soonki Han Soo-Jin Kim Jungwon Yang Eunjoo Kim Soo Jin Wi Hoo Sun Chung Jong-Pil Hong Vitnary Choe Hak-Kyung Lee Jung-Hee Choi Jongmin Nam Seong-Ryong Kim Phun-Bum Park Ky Young Park Woo Taek Kim Sunghwa Choe Chin-Bum Lee and Gynheung An National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790ndash784 Korea (SA SP D-HJ D-YL H-GK J-HY JH S-RK Y-HK ML GA) Department of Life Science Sogang University Seoul 121ndash742 Korea (SH S-JK S-RK) Department of Genetic Engineering Suwon University Suwon 445ndash743 Korea (JY EK P-BP) Department of Biology Sunchon National University Sunchon 540ndash742 Korea (SJW KYP) Department of

Biology Yonsei University Seoul 120ndash749 Korea (HSC J-PH WTK) Department of Biology Seoul National University Seoul 151ndash747 Korea (VC SC) Department of Biology Dong-eui University Pusan 614ndash714 Korea (H-KL J-HC C-BL) and Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University 208 Mueller Laboratory University Park Pennsylvania 16802 (JN) hellipIf a particular sequence had not yet been annotated in the public database the sequence surrounding the insertion site was annotated using the Softberry program (httpwwwsoftberrycom) and the GeneMark program (httpopalbiologygatecheduGeneMark) Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie F Moehrlen ndash Page 1 Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie Frank Moumlhrlen Heidelberg 2002 Page 2 INAUGURAL-DISSERTATION zur heidiubuni-heidelbergde

  • FGENES
  • FGENESH
  • FGENESH++
    • Genome Research 15566-576 2005
    • ECgene Genome-based EST clustering and gene modeling for alternative splicing
    • Genome Research 14685-692 2004
    • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
    • Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes
    • Genome Research
      • Computing Center Academia Sinica Taipei 11529 Taiwan
        • FGENESH+
          • Genome Research 14685-692 2004
          • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
            • Computing Center Academia Sinica Taipei 11529 Taiwan
                • FGENESB
                  • Proteorhodopsin genes are distributed among divergent marine bacterial taxa
                  • Different SAR86 subgroups harbour divergent proteorhodopsins
                    • FGENESV
                      • Genome Organization of the SARS-CoV
                        • FGENES-M
                        • BESTORF
                        • PROTCOMP
                          • A proteomic study of the arabidopsis nuclear matrix
                          • Gene prediction in eukaryota
                            • BPROM
                            • SPLICEDB
                              • The evolving roles of alternative splicing
                              • SpliceDB database of canonical and non-canonical mammalian splice sites
                                • SCAN2
                                  • PromH promoters identification using orthologous genomic sequences
                                    • PDISORDER
                                    • SPL
                                    • NSITE
                                    • TSSP
                                    • PLANTPROM
                                      • Plant promoter prediction with confidence estimation
                                      • PlantProm a database of plant promoter sequences
                                        • PROMH
                                        • Other FGENESH
                                          • TAG Theoretical and Applied Genetics
                                            • MIPS analysis and annotation of proteins from whole genomes
                                              • Gene expression of a gene family in maize based on noncollinear haplotypes
                                              • TAG Theoretical and Applied Genetics
                                              • Genome annotation techniques new approaches and challenges
                                                • Various programs
                                                  • Characterizing the new transcription regulator protein p60TRP
                                                  • The Ensembl Core Software Libraries
                                                  • Lecture Notes in Computer Science
                                                  • Long-term Efficacy of Cholinesterase Inhibitors
Page 11: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)

The gene predictions by programs such as Genescan (Burge and Karlin 1997 ) FGENESH [see Appendix 1 (4)] and Genemark [see Appendix 1 (5)] BLAST (Altschul et Improving the nutritional value of Golden Rice through increased pro-vitamin A content JA Paine CA Shipton S Chaggar RM Howells MJ hellip - Nature Biotechnology 2005 - naturecom Arabidopsis thaliana psy and rice psy (AY024351) genes identified genomic sequences of similarity in which genes were predicted using FGENESH algorithm with Genetics Published Articles Ahead of Print published on January 16 2005 as 101534genetics104035543 THE GENETIC BASIS FOR INFLORESCENCE VARIATION BETWEEN FOXTAIL AND GREEN MILLET (POACEAE) Andrew N Doust Katrien M Devosdagger1 Mike D Gadberry Mike D Galedagger amp Elizabeth A Kellogg University of Missouri-St Louis Department of Biology One University Boulevard St Louis MO 63121 USA daggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH UK Current address University of Georgia-Athens Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building Athens GA 30602 USA 1 Each of these contigs was scanned using FgeneSH (S ALAMOV and S OLOVYEV 2000) and open reading frames (ORFs) were translated and PLoS Biol 3(1) e13 January 2005

Sorghum Genome Sequencing by Methylation Filtration Joseph A Bedell1 Muhammad A Budiman2 Andrew Nunberg1 Robert W Citek1 Dan Robbins1 Joshua Jones2 Elizabeth Flick2 Theresa Rohlfing3 Jason Fries3 Kourtney Bradford3 Jennifer McMenamy3 Michael Smith4 Heather Holeman4 Bruce A Roe5 Graham Wiley5 Ian F Korf6 Pablo D Rabinowicz7 Nathan Lakey8 W Richard McCombie9 Jeffrey A Jeddeloh4 Robert A Martienssen9 1 Bioinformatics Orion Genomics Saint Louis Missouri United States of America 2 Library Construction Orion Genomics Saint Louis Missouri United States of America 3 Sequencing Orion Genomics Saint Louis Missouri United States of America 4 Biomarkers Orion Genomics Saint Louis Missouri United States of America 5 Department of Chemistry and Biochemistry University of Oklahoma Norman Oklahoma United States of America 6 Genome Center University of California Davis California United States of America 7 The Institute for Genomic Research Rockville Maryland United States of America 8 Business Orion Genomics Saint Louis Missouri United States of America 9 Cold Spring Harbor Laboratory Cold Spring Harbor New York United States of America additional parameters wordmask=seg lcmask M=1 N=ndash1 Q=3 R=3 kap E=1e-10 hspmax=0 To look for potentially novel genes we used Fgenesh (httpwww BMC Genomics 2005 611 doi1011861471-2164-6-11 FAM20 an evolutionarily conserved family of secreted proteins expressed in hematopoietic cells Demet Nalbant1 Hyewon Youn1 3 S Isil Nalbant1 Savitha Sharma1 Everardo Cobos2 3 Elmus G Beale1 Yang Du1 and Simon C Williams1 3 1Department of Cell Biology and Biochemistry Texas Tech University Health Sciences Center

Lubbock Texas 79430 USA 2Department of Internal Medicine Texas Tech University Health Sciences Center Lubbock Texas 79430 USA 3Southwest Cancer Center at University Medical Center Lubbock Texas 79430 USA These results were compared against genes assembled by two gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmit Plant Physiology July 2005 Vol 138 pp 1205-1215 Complex Organization and Evolution of the Tomato Pericentromeric Region at the FER Gene Locus1[w] Romain Guyot Xudong Cheng Yan Su Zhukuan Cheng Edith Schlagenhauf Beat Keller and Hong-Qing Ling

2 2

Institute of Plant Biology University of Zurich 8008 Zurich Switzerland (RG ES BK H-QL) and Institute of Genetics and Developmental Biology Chinese Academy of Sciences Chaoyang District Beijing 100101 China (XC YS ZC H-QL) Putative genes were determined by a combination of coding region prediction software (GENSCAN FGENESH and MZEF with Arabidopsis andor monocot matrix J Gen Virol 86 (2005) 973-983 DOI 101099vir080833-0 Cloning characterization and analysis by RNA interference of various genes of the Chelonus inanitus polydnavirus Marianne Bonvin Dorothee Marti Stefan Wyder Dejan Kojic Marc Annaheim and Beatrice Lanzrein Institute of Cell Biology University of Berne Baltzerstrasse 4 CH-3012 Bern Switzerland Correspondence Beatrice Lanzrein beatricelanzreinizbunibech 12g1forw (5-GAGTCCATGCCGAATGTCAC-3) and 12g1rev (5-CTTCTTGCACAGCGACGAAC-3) were set to amplify the middle region of 12g1 as predicted with FGENESH 10 and The Plant Cell 17343-360 (2005) Evolution of DNA Sequence Nonhomologies among Maize Inbreds Stephan Brunner Kevin Fengler Michele Morgante Scott Tingey and Antoni Rafalski a1 a b a a

a DuPont Crop Genetics Research Wilmington Delaware 19880-353 b Universitaacute degli Studi di Udine Dipartimento di Scienze Agrarie ed Ambientali 33100 Udine Italy 1 To whom correspondence should be addressed E-mail stephanbrunnercgrdupontcom fax 302-695-2726 PNAS | February 1 2005 | vol 102 | no 5 | 1566-1571 Published online before print January 24 2005 101073pnas0409421102 A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome

Mark Yandell Adina M Bailey Sima Misra ShengQiang Shu Colin Wiel Martha Evans-Holm Susan E Celniker and Gerald M Rubin para para Howard Hughes Medical Institute and Department of Molecular and Cell Biology University of California Life Sciences Addition Berkeley CA 94720-3200 and paraDepartment of Genome Sciences Lawrence Berkeley National Laboratory One Cyclotron Road Mailstop 64-121 Berkeley CA 94720 genes based on a microarray-based approach that involved hybridizing randomly primed cDNA against probes corresponding to a large set of FGENESH predictions Insect Molecular Biology Volume 14 Issue 2 Page 113 - 119 April 2005 doi101111j1365-2583200400536x Detection and analysis of alternative splicing in the silkworm by aligning expressed sequence tags with the genomic sequence X-F Zha Q-Y Xia P Zhao J Li J Duan Z-L Wang J-F Qian and Z-H Xiang Correspondence Dr Qing-You Xia The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China Tel +86 23 68250748 fax +86 23 68251128 e-mail xiaqyswaucqcn previously predicted silkworm genes in the genomic sequences by BGF a newly developed program based on GENSCAN (Burge amp Karlin 1997) and Fgenesh (Salamov amp Microbiology 151 (2005) 2199-2207 DOI 101099mic027962-0 Overproduction purification and characterization of FtmPT1 a brevianamide F prenyltransferase from Aspergillus fumigatus Alexander Grundmann and Shu-Ming Li Pharmazeutische Biologie Pharmazeutisches Institut Eberhard-Karls-Universitaumlt Tuumlbingen Auf der Morgenstelle 8 72076 Tuumlbingen Germany Correspondence Shu-Ming Li shumingliuni-tuebingende FGENESH (Softberry Inc httpwwwsoftberrycomberryphtml) and the DNASIS software package (version 21 Hitachi Software Engineering) were used for

FGENESH++ Am J Hum Genet 76652-662 2005 Position Effects Due to Chromosome Breakpoints that Map 900 Kb Upstream and 13 Mb

Downstream of SOX9 in Two Patients with Campomelic Dysplasia

Gopalrao V N Velagaleti12 Gabriel A Bien-Willner3 Jill K Northup1 Lillian H Lockhart2

Judy C Hawkins2 Syed M Jalal6 Marjorie Withers3 James R Lupski345 and

Pawel Stankiewicz3 Departments of 1Pathology and 2Pediatrics University of Texas Medical Branch Galveston

Departments of 3Molecular and Human Genetics and 4Pediatrics Baylor College of Medicine and

5Texas Childrens Hospital Houston and 6Department of Laboratory Medicine and Pathology Mayo Clinic Rochester MN hellipIn an effort to identify possible transcripts that may be responsible for the CD phenotype we

used several gene-prediction programs and identified seven hypothetical transcripts in the region

that spans 100 kb in either direction from the breakpoint on chromosome 17 Ecgenes H17C123061 and H17C123081 SGP genes Chr17_15381 and Ch17_15391 Fgenesh++ gene C17001650 and Genscan genes NT_01064144 and NT_01064145hellip Genome Research 15566-576 2005 ECgene Genome-based EST clustering and gene modeling for alternative splicing Namshin Kim Seokmin Shin and Sanghyuk Lee12 2 13 1 Division of Molecular Life Sciences Ewha Womans University Seoul 120-750 Korea 2 School of Chemistry Seoul National University Seoul 151-747 Korea hellipthe structure of full-length mRNA can be inferred by examining the flanking genomic region especially with the aid of ab initio gene predicting programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh++ (Salamov and Solovyev 2000 )hellip Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA hellip we built a three-way synteny map based on chains of Fgenesh++-predicted (Solovyev 2002 ) exons rather than whole genes hellip Genome Research 14539-548 2004 Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of

Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA hellipThe gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom )hellip Nucleic Acids Research 2003 Vol 31 No 1 207-211 copy 2003 Oxford University Press The PEDANT genome database Dmitrij Frishman Martin Mokrejs Denis Kosykh Gabi Kastenmuumlller Grigory Kolesov Igor Zubrzycki Christian Gruber Birgitta Geier Andreas Kaps Kaj Albermann Andreas Volz Christian Wagner Matthias Fellenberg Klaus Heumann and Hans-Werner Mewes

1 1 1 1 1

1 2 2 2 2

2 2 2 2 13

1 Institute for Bioinformatics GSF - National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 85764 Neueherberg Germany 2 Biomax Informatics AG Lochhamer Straszlige 11 82152 Martinsried Germany 3 Department of Genome-oriented Bioinformatics Wissenschaftszentrum Weihenstephan Technische Universitaumlt Muumlnchen 85350 Freising Germany To whom correspondence should be addressed Tel +49 89 31874201 Fax +49 89 31873585 Email dfrishmangsfde The mouse database contains 20 chromosome contigs with 37 793 genes predicted using the Fgenesh++ software (wwwsoftberrycom) Reprint from Daily Biotech Updates www genengnewscom Vol 22 No 17 October 1 2002 DrugDiscovery Tech NoteAn Enhanced Human-Genome Database Transforming Raw Human Sequence Data Into Useful Information Christine Schuumlller PhD and Andreas Fritz PhD The Softberry analysis results for which Biomax has the exclusive world-wide commercial license contain approximately 40000 genes which agrees well with predictions of the total number of human genes (according to the International Human Genome Sequencing Consortium or IHGSC) hellip For example 50 of the genes in the Biomax Human Genome Database are not found in the Ensembl database These genes (identified by FGENESH++ and Biomax and not found in Ensembl database) comprise the following 6 of genes classified as known genes 50 classified as having some similarity to known genes and 90 of the genes not having similarity to known genes

For human genome applications the FGENESH++ software was first used to map known human genes using sequences available from the Reference Sequence (RefSeq) Project at the Nation al Center for Biotechnology Information (NCBI Bethesda MD wwwncbinlm nihgovLocusLinkrefseqhtml) REFERENCES 1 Salamov AA and Solovyev VVAb initio gene finding in Drosophila genomic DNA Genome Res 10 391ndash7 (2000) Genome Research 14539-548 2004 ISSN 1088-9051 $500 Letter Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA The gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom ) Published online before print June 12 2003 101101gr529803 Genome Research 131765-1774 2003 ISSN 1088-9051 $500 Identification of Promoter Regions in the Human Genome by Using a Retroviral Plasmid Library-Based Functional Reporter Gene Assay Shirin Khambata-Ford15 Yueyi Liu2 Christopher Gleason1 Mark Dickson3 Russ B Altman2 Serafim Batzoglou4 and Richard M Myers136 1 Department of Genetics Stanford University School of Medicine Stanford California 94305 USA 2 Stanford Medical Informatics Stanford University School of Medicine Stanford California 94305 USA 3 Stanford Human Genome Center Stanford University School of Medicine Stanford California 94305 USA 4 Department of Computer Science Stanford University Stanford California 94305 USA hellipOf 858 sequences 9 of GFP+ low clones and 8 of GFP+ high clones aligned to the 2-kb

segment upstream of the transcription start site of a predicted gene in at least two of four data sets of predicted genes from Genscan Ensembl Softberry (Fgenesh++) and Acembly (category

B in Table 1) Cell Vol 110 521ndash529 August 23 2002 Copyright 2002 by Cell Press HIV-1 Integration in the Human Genome Favors Active Genes and Local Hotspots

Astrid RW Schrouml der1 Paul Shinn2 Huaming Chen2 Charles Berry3 Joseph R Ecker2 and Frederic Bushman14 1Infectious Disease Laboratory 2Genomic Analysis Laboratory The Salk Institute 10010 North Torrey Pines Road La Jolla California 92037 3Department of FamilyPreventive Medicine School of Medicine University of California San Diego San Diego California 92093 hellipAn integration target sequence was scored as a part of a transcrip-tion unit if it was (1) a member of the Refseq set of well-studied genes (httpwwwncbinlmnihgovLocusLinkrefseqhtml) or (2) if it was predicted to be a transcription unit by the ENSEMBLE (httpwwwensemblorg) or Fgenesh++ (httpwwwsoftberrycomHelpfgeneshplus2htm) programs and if that assignment was supported by mRNA or spliced EST sequence evidence Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation

FGENESH+ Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA Fgenesh+ gene prediction is conducted on sequences with protein homology Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation Annual Review of Genomics and Human Genetics Vol 3 293-310 (Volume publication date September 2002) (doi101146annurevgenom3030502101529) DATABASES AND TOOLS FOR BROWSING GENOMES Ewan Birney 1 Michele Clamp and 2 Tim Hubbard2 1European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom e-mail birneyebiacuk 2Wellcome Trust Sanger Institute Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom michelesangeracuk thsangeracuk

Another predicted gene track on the UCSC browser comes from Softberry ( http wwwsoftberrycom ) and uses a program Fgenesh+ which is based on HMMs and

FGENESB Appl Environ Microbiol 2004 April 70(4) 2332ndash2341 Oxygen-Controlled Bacterial Growth in the Sponge Suberites domuncula toward a Molecular Understanding of the Symbiotic Relationships between Sponge and Bacteriadagger

Werner E G Muumlller Vladislav A Grebenjuk Narsinh L Thakur Archana N Thakur Renato Batel Anatoli Krasko Isabel M Muumlller and Hans J Breter Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz D-55099 Mainz Germany Corresponding author Mailing address Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz Duesbergweg 6 55099 Mainz Germany Phone 6131-3925910 Fax 6131-3925243 E-mail wmuellermailuni-mainzde For genes and potential promoter prediction we used the FGENESB-PatternMarkov chain-based bacterial operon and gene prediction program from the SoftBerry Journal of Theoretical Biology 230 (2004) 133ndash144 Computational prediction of conserved operons and phylogenetic footprinting of transcription regulatory elements in the metal-reducing bacterial family Geobacteraceae Bin Yana Barbara A Metheґ b Derek R Lovleyc Julia Krushkala aDepartment of Preventive Medicine Center of Genomics and Bioinformatics University of Tennesee Health Science Center 66 N Pauline St Ste 633 Memphis TN 38163 USA bThe Institute for Genomic Research Rockville MD USA cDepartment of Microbiology Morrill Science Center IV North University of Massachusetts 639 North Pleasant Str Amherst MA 01003 USA the conserved nature of the operons 2 Operons in Geobacter sulfurreducens were predicted ab initio by the public version of program FGENESB (V Solovyev and V Molecular Microbiology Volume 52 Issue 6 Page 1579 -1596 June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Copyright copy 2003 The National Academy of Sciences Proc Natl Acad Sci U S A 2003 October 28 100(22) 12830ndash12835 doi 101073pnas2133554100 Published online 2003 October 17 Evolution Proteorhodopsin genes are distributed among divergent marine bacterial taxa Joseacute R de la TorredaggerDagger Lynne M Christiansondagger Oded Beacutejagravedaggersect Marcelino T Suzukidaggerpara David M Karl John Heidelberg and Edward F DeLongdaggerdaggerdagger

daggerMonterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 sectDepartment of Biology Technion-Israel Institute of Technology Haifa 32000 Israel

paraChesapeake Biological Laboratory University of Maryland Solomons MD 20688 Department of Oceanography University of Hawaii Manoa HI 96822 and Institute for Genomic Research Rockville MD 20850 Edited by Sallie W Chisholm Massachusetts Institute of Technology Cambridge MA and approved August 21 2003 (received for review 2003 June 10) DaggerPresent address Department of Civil and Environmental Engineering University of Washington Seattle WA 98195 daggerdagger To whom correspondence should be addressed E-mail delongmbariorg hellip Analysis of the potential genes and protein-coding regions was performed by using a combination of the BLAST (11) GLIMMER 202 (TIGR) (12 13) FGENESB (Softberry Mount Kisco NY) and ARTEMIS (Sanger Center Cambridge University UK) (14) software packages Environmental Microbiology September 2004 vol 6 no 9 pp 903-910(8) DOI 101111j1462-2920200400676x Different SAR86 subgroups harbour divergent proteorhodopsins Gazalah Sabehi1 Oded Beacutejagrave1 Marcelino T Suzuki2 Christina M Preston3 Edward F DeLong4

Affiliations 1 Department of Biology Technion-Israel Institute of Technology Haifa 32000 Israel 2 Chesapeake Biological Laboratory University of Maryland Center for Environmental Sciences Solomons MD 20688 USA 3 Monterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 USA 4 Massachusetts Institute of Technology Cambridge MA 02139 USA

program FGENESB (Softberry) and the annotation was subsequently refined and curated manually using ARTEMIS (Sanger Center) Fig

FGENESV Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song Qi Wei Qin Jin Qiu Can Hua Huang Fan Wang and Choy Leong Hew1 2 1 1 1 1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2

A total of 162 ORFs predicted by the FGENESV program (available through httpwwwsoftberrycom) supplemented with Vector NTI suite 71 are indicated doi101023BVIRU000002577148128f8 Virus Genes 28 (3) 239-246 April 2004 Article ID 5269250 Complete Nucleotide Sequence of a Strawberry Isolate of Beet Pseudoyellows Virus Ioannis E Tzanetakis Molecular and Cellular Biology Program Department of Botany and Plant Pathology Oregon State University Corvallis 97331 USA Robert R Martin Horticultural Crops Research Laboratory USDA-ARS Corvallis OR 97330 USA E-mail martinrrscienceoregonstateedu httpwwwncbinlmnih govgorfgorfhtml) and the gene finder in viruses at httpwwwsoftberrycom The amino acid comparisons Geno Prot amp Bioinfo Vol 1 No 3 August 2003 226-235 Genome Organization of the SARS-CoV Jing Xu1 Jianfei Hu21 Jing Wang21 Yujun Han1 Yongwu Hu13 Jie Wen1 Yan Li1 Jia Ji1 Jia Ye14 Zizhang Zhang5 Wei Wei4 Songgang Li12 Jun Wang1 Jian Wang14 Jun Yu14 and Huanming Yang14 1 Beijing Genomics Institute Chinese Academy of Sciences Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3Wenzhou Medical College Wenzhou 325003 China 4 James D Watson Institute of Genome Sciences Zhijiang Campus Zhejiang University and Hangzhou Genomics Institute Hangzhou 310008 China 5 College of Materials Science and Chemical Engineering Yuquan Campus Zhejiang University Hangzhou 310027 China These authors contributed equally to this work Corresponding authors E-mail junyugenomicsorgcn yanghmgenomicsorgcn hellipFGENESV a program for gene prediction provided by Softberry Inc (Mount Kisco USA) through a web-based interface has been specially modimacred and trained with parameters for virus (httpwwwsoftberrycomberryphtmltopic= gfindv) hellip

The hypothetical minus sense ORF iden-timacred by FGENESV (from 48 to 203 nt on the minus strand or 29523 to 29678 nt on the plus strand) may be fake but we should not absolutely deny the prob-ability of the existence of minus ORFs hellipFurthermore we employed FGENESV to explore the sequences of MHV (NC 001846 in NCBI) and AIBV (NC 001451 in NCBI) and compared the re-sults with their previous annotations respectively Rapport de stage de DEA Juin 2003 Analyse du geacutenome du virus de lrsquoarcheacutee Pyrococcus abyssi (PAV1) ROUAULT Karen Laboratoire de Microbiologie et Biotechnologie des Extrecircmophiles IFREMER- Centre de Brest et Equipe Microbiologie LEMAR ndash Institut Universitaire Europeacuteen de la Mer [14] FGENESV httpwwwsoftberrycomberry phtmltopic=gfindv Virus ( gt10 kb) Modegraveles de Markov Forme du geacutenome Code geacuteneacutetique [40]

FGENES-M BMC Bioinformatics 2005 6 25 Published online 2005 February 10 doi 1011861471-2105-6-25 Integrating alternative splicing detection into gene prediction Sylvain Foissac 1 and Thomas Schiex11Uniteacute de Biomeacutetrie et Intelligence Artificielle INRA 31326 Castanet Tolosan France

Corresponding author Sylvain Foissac foissactoulouseinrafr Thomas Schiex tschiextoulouseinrafrReceived July 27 2004 Accepted February 10 2005 This approach has been applied eg in HMMgene or in FGENES-M (unpub)hellip DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39 - 43 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom) DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39-43 DOI 1010801042517032000160189 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom)

BESTORF Eukaryotic Cell March 2005 p 526-535 Vol 4 No 3 1535-977805$0800+0 doi101128EC43526-5352005 Sex-Specific Homeodomain Proteins Sxi1 and Sxi2a Coordinately Regulate Sexual Development in Cryptococcus neoformans Christina M Hull1 Marie-Josee Boily and Joseph Heitman1 12 Department of Molecular Genetics and Microbiology1 the Howard Hughes Medical Institute Duke University Medical Center Durham North Carolina2 Received 14 January 2005 Accepted 14 January 2005 Corresponding author Mailing address 322 CARL Building Box 3546 Department of Molecular Genetics and Microbiology 100 Research Dr Duke University Medical Center Durham NC 27710 Phone (919) 684-2824 Fax (919) 684-5458 E-mail heitm001dukeeduPresent address Departments of Biomolecular Chemistry and Medical Microbiology amp

Immunology University of WisconsinmdashMadison Madison WI 53706 Sequence manipulations Splice predictions of candidate gene sequences for SXI2a were facilitated with a Softberry algorithm (wwwsoftberrycom) hellipWe utilized the BESTORF gene prediction algorithm from Softberry Inc to electronically produce predicted spliced cDNA products encoded by a 10-kb regionhellip

PROTCOMP Genes and Immunity 2005 v5 n4 - naturecom Immune response in silico(IRIS) immune-specific genes identified from a compendium of microarray hellip AR Abbas D Baldwin Y Ma W Ouyang A Gurney F hellip The Protcomp algorithm (Softberry Inc) predicts for the 1589 IRIS genes with ORFs that 24 of the encoded proteins are in the plasma membrane 13 are MPMI Vol 17 No 7 2004 pp 789ndash797 Publication no M-2004-0426-01R copy 2004 The American Phytopathological Society Lotus japonicus LjKUP Is Induced Late During Nodule Development and Encodes a Potassium Transporter of the Plasma Membrane Guilhem Desbrosses Claudia Kopka Thomas Ott and Michael K Udvardi Max Planck Institute of Molecular Plant Physiology Am Muumlhlenberg 1 14476 Golm Germany Submitted 3 November 2003 Accepted 13 February 2004 hellipBoth PSORT and Protcomp predicted a PM location for LjKUPhellip Planta DOI 101007s00425-003-1182-5 Issue Volume 218 Number 6 Date April 2004 Pages 965 - 975 Biochemical and immunological characterization of pea nuclear intermediate filament proteins Sonal S D Blumenthal1 Gregory B Clark1 and Stanley J Roux1

(1) School of Biological Sciences Section of Molecular Cell and Developmental Biology The University of Texas Austin TX 78712 USA Stanley J Roux Email srouxutsccutexasedu html) BCM Search Launcher (Protein structure prediction http searchlauncher bcmtmcedu) SoftBerry (Protein subcellular localization Comparative and Functional Genomics Volume 5 Issue 4 Pages 342 - 353Published Online 20 May 2004 Research Paper Investigation into the use of C- and N-terminal GFP fusion proteins for subcellular localization studies using reverse transfection microarrays Ella Palmer Tom Freeman

MRC Rosalind Franklin Centre for Genomics Research (formerly the HGMP-Resource Centre) Genome Campus Hinxton Cambridge CB10 1SB UK email Tom Freeman (tfreemanrfcgrmrcacuk) Correspondence to Tom Freeman RFCGR Hinxton CambridgeCB10 1SB UK

ProtComp version 4 (Softberry) combines results with proteins of known subcellular localization and assumed subcellular localization (based on theoret- ical Plant Physiol2004 134 286-295 RHM2 Is Involved in Mucilage Pectin Synthesis and Is Required for the Development Usadel et al Tentative subcellular localization prediction by TargetP (Emanuelsson et al 2000 ) or ProtComp (httpwwwsoftberrycom) a prediction software trained on Journal of Cellular BiochemistryVolume 90 Issue 2 Pages 361 - 378Published Online 3 Sep 2003 A proteomic study of the arabidopsis nuclear matrix Tomasz T Calikowski 1 3 Tea Meulia 2 Iris Meier 1

1Department of Plant Biology and Plant Biotechnology Center Ohio State University Columbus Ohio 43210 2Molecular and Cellular Imaging Center Ohio Agricultural and Research Development Center Ohio State University Columbus Ohio 43210 3Institute of Biochemistry and Biophysics Polish Academy of Sciences UL Pawinskiego 5A 02-106 Warszawa Poland email Iris Meier (meier56osuedu)Correspondence to Iris Meier Department of Plant Biology and Plant Biotechnology Center Ohio State University 244 Rightmire Hall 1060 Carmack Rd Columbus OH 43210 For prediction of subcellular localization ProtComp 4 (Softberry Inc Mount Kisco NY httpwwwsoftberrycomberryphtmltopicfrac14 proteinloc) PSORT v64 Cellular Molecular Life Sciences 2003 in press Automatic prediction of protein function Burkhard Rost 1 2 3 Jinfeng Liu 1 3 4 Rajesh Nair 1 5 Kazimierz O Wrzeszczynski 1 and Yanay Ofran 16

1 CUBIC Dept of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 2 Columbia University Center for Computational Biology and Bioinformatics (C2B2) Russ Berrie Pavilion 1150 St Nicholas Avenue New York NY 10032 USA

3 North East Structural Genomics Consortium (NESG) Department of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 4 Dept of Pharmacology Columbia Univ 630 West 168th Street New York NY 10032 USA 5 Dept of Physics Columbia Univ 538 West 120th Street New York NY 10027 USA 6 Dept of Medical Informatics Columbia Univ 630 West 168th Street New York NY 10032 USA Corresponding author cubiccubicbioccolumbiaedu URL httpcubicbioccolumbiaedu Tel +1-212-305-4018 fax +1-212-305-7932

genomelocalize ProtComp predict localization for plants httpwwwsoftberrycomberryphtmltopic=proteinloc Predotar predict Published online before print September 15 2003 101101gr1293003 Genome Research 132265-2270 2003 The Secreted Protein Discovery Initiative (SPDI) a Large-Scale Effort to Identify Novel Human Secreted and Transmembrane Proteins A Bioinformatics Assessment Hilary F Clark1 Austin L Gurney Evangeline Abaya Kevin Baker Daryl Baldwin Jennifer Brush Jian Chen Bernard Chow Clarissa Chui Craig Crowley Bridget Currell Bethanne Deuel Patrick Dowd Dan Eaton Jessica Foster Christopher Grimaldi Qimin Gu Philip E Hass Sherry Heldens Arthur Huang Hok Seon Kim Laura Klimowski Yisheng Jin Stephanie Johnson James Lee Lhney Lewis Dongzhou Liao Melanie Mark Edward Robbie Celina Sanchez Jill Schoenfeld Somasekar Seshagiri Laura Simmons Jennifer Singh Victoria Smith Jeremy Stinson Alicia Vagts Richard Vandlen Colin Watanabe David Wieand Kathryn Woods Ming-Hong Xie Daniel Yansura Sothy Yi Guoying Yu Jean Yuan Min Zhang Zemin Zhang Audrey Goddard William I Wood and Paul Godowski Departments of Bioinformatics Molecular Biology and Protein Chemistry Genentech Inc South San Francisco California 94080 USA 1 Corresponding author E-MAIL hclarkgenecomFAX (650) 225-5389 An automated computational strategy was utilized to query each protein translation with the Signal Sensor Sighmm Tmdetect (T Wu unpubl) hmmpfam (Eddy 1998 ) and Protcomp (Softberry Inc) algorithms hellipThe Protcomp algorithm predicts the subcellular localization of a protein on the basis of homology to well-annotated proteins a neural net and various protein motifs In this case the Protcomp subcellular localization prediction was used to categorize these genes as Other Secreted Other Transmembrane or Other Cytoplasmic or Nuclear Plant Physiol February 2002 Vol 128 pp 336-340 Gene prediction in eukaryota Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 genomelocalize ProtComp predict localization for plants httpwww softberrycomberryphtmltopic=proteinloc Predotar predict Proc Natl Acad Sci U S A 2001 April 24 98(9) 5341ndash5346 doi 101073pnas101534498 Published online 2001 April 17 Plant Biology The Cia5 gene controls formation of the carbon concentrating mechanism in Chlamydomonas reinhardtii Youbin Xiang Jun Zhang and Donald P Weeks

Department of Biochemistry and School of Biological Sciences University of Nebraska Lincoln NE 68588-0664

Edited by Bob B Buchanan University of California Berkeley CA and approved March 14 2001 (received for review 2000 November 8) To whom reprint requests should be addressed E-mail dweeks1unledu hellipComputer-assisted analysis of the CIA5 aa sequence (PROTCOMP version 4 httpwwwsoftberrycom) predicted a nuclear localization of the protein hellipFinally computer program predictions (eg PROTCOMP version 4 httpwwwsoftberrycom) for a nuclear localization of CIA5 and the clear-cut nuclear localization of CIA5 in onion epidermal cells (Fig 3) provide additional weight to the argument that CIA5 may be a transcription factor Dissertation zur Erlangung des akademischen Grades Dr rer nat der Fakultaumlt der Naturwissenschaften der Universitaumlt Ulm Untersuchungen zur Identifizierung von Faktoren und Mechanismen der mRNA 3 Prozessierung und Degradation in Chloroplasten houmlherer Pflanzen vorgelegt von Michael Walter aus Immenstadt i Allgaumlu Abteilung Molekulare Botanik Universitaumlt Ulm Ulm November 2001 Tag der Promotion 19 Feb 2002 Algorithmen zur Vorhersage der subzellulaumlren Lokalisation - PSORT httppsortnibbacjp8800formhtml (Nakai und Kanehisa 1992) - ChloroP httpwwwcbsdtudkservicesChloroP (Emanuelsson et al 2000) - TargetP httpwwwcbsdtudkservicesTargetP (Emanuelsson et al 2000) - Predotar httpwwwinrafrInternetProduitsPredotar - Softberry httpwwwsoftberrycom

BPROM Extremophiles Issue Volume 9 Number 2 Date April 2005 Pages 99 ndash 109 DOI 101007s00792-004-0425-0

The genome of BCJA1c a bacteriophage active against the alkaliphilic bacterium Bacillus clarkii

Andrew M Kropinski1 Melissa Hayward1 M Dorothy Agnew1 and Ken F Jarrell1

(1) Department of Microbiology and Immunology Queens University Kingston ON K7L 3N6 Canada

al 2002) Promoters were predicted using Softberryrsquos BPROM program at httpwwwsoftberry comberry phtmltopic=promoter

Journal of Bacteriology February 2005 p 1091-1104 Vol 187 No 3 0021-919305$0800+0 doi101128JB18731091-11042005

The Generalized Transducing Salmonella Bacteriophage ES18 Complete Genome Sequence and DNA Packaging Strategy

Sherwood R Casjens12 Eddie B Gilcrease1 Danella A Winn-Stapley1 Petra Schicklmaier3 Horst Schmieger3 Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24 Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah1 Department of Biological Sciences4 Pittsburgh Bacteriophage Institute University of Pittsburgh Pittsburgh Pennsylvania 2 Institut fuumlr Genetik und Mikrobiologie Universitaumlt Muumlnchen Munich Germany3 Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu Present address Biology Department MIT Cambridge MA 02139 Present address Biogen Idec GmbH D-85737 Ismaning Germany

Received 1 September 2004 Accepted 3 November 2004

The DNA sequence analysis software used was DNA Strider (24) GeneMark (5) Staden programs (78) BLAST (2) BPROM (httpwwwsoftberrycomberryphtmltopic

Infection and Immunity May 2005 p 2899-2909 Vol 73 No 5 Characterization of the Major Secreted Zinc Metalloprotease- Dependent GlycerophospholipidCholesterol Acyltransferase PlaC of Legionella pneumophila Sangeeta Banerji1 Mayte Bewersdorff1 Bjoumlrn Hermes1 Nicholas P Cianciotto2 and Antje Flieger1 Robert Koch-Institut Berlin Germany1 Department of Microbiology-Immunology Northwestern University Medical School Chicago Illinois2 Received 25 October 2004 Returned for modification 18 November 2004 Accepted 22 December 2004

Corresponding author Mailing address Robert Koch-Institut Research Group NG5 Pathogenesis of Legionella Infections Nordufer 20 D-13353 Berlin Germany Phone 49-30-4547-2522 Fax 49-30-4547-2328 E-mail fliegerarkide MB and BH contributed equally to this work

legion) (12) Nucleotide sequences were also analyzed for promoters using the web-based program BPROM (wwwsoftberrycom) Sequence Journal of Bacteriology April 2005 p 2458-2468 Vol 187 No 7 The Type III-Dependent Hrp Pilus Is Required for Productive Interaction of Xanthomonas campestris pv vesicatoria with Pepper Host Plants Ernst Weber1 Tuula Ojanen-Reuhs2 Elisabeth Huguet3 Gerd Hause4 Martin Romantschuk2 Timo K Korhonen2 Ulla Bonas13 and Ralf Koebnik1 Institute of Genetics1 Biozentrum Martin Luther University Halle Germany4 General Microbiology Faculty of Biosciences University of Helsinki Helsinki Finland2 Institut des Sciences Veacutegeacutetales CNRS Gif-sur-Yvette France3 Received 10 November 2004 Accepted 28 December 2004 Corresponding author Mailing address Martin-Luther-Universitaumlt Institut fuumlr Genetik Weinbergweg 10 D-06120 Halle (Saale) Germany Phone 49 345 5526293 Fax 49 345 5527151 E-mail koebnikgmxde Present address Purdue University Department of Food Sciences West Lafayette IN 47907 Present address Institut de Recherche sur la Biologie de lInsecte UMR CNRS 6035 Faculteacute

des Sciences F-37200 Tours France Present address University of Helsinki Department of Ecological and Environmental Sciences

FIN-15140 Lahti Finland

The promoter recognition program BPROM (Softberry Inc Mt Kisco NY) was used for prediction of bacterial sigma70 promoter motifs RESULTS

J Bacteriol 2004 September 186(17) 5945ndash5949 doi 101128JB186175945-59492004 Identification of Operators and Promoters That Control SXT Conjugative Transfer John W Beaber and Matthew K Waldor

Department of Microbiology Tufts University School of Medicine and Howard Hughes Medical Institute Boston Massachusetts Corresponding author Mailing address Tufts University School of Medicine 136 Harrison Ave Jaharis 425 Boston MA 02111 Phone (617) 636-2730 Fax (617) 636-2723 E-mail matthewwaldortuftsedu Received April 1 2004 Accepted May 24 2004 hellipComputer algorithms and 5prime random amplification of cDNA ends (RACE) were used to define the setR and s086 transcription start sites Software for the identification of bacterial promoters (httpwwwsoftberrycomberryphtmltopic=bpromampgroup=programsampsubgroup=gfindb) identified putative minus10 and minus35 elements for both PL and PR (Fig 2) (23 24)hellip JOURNAL OF BACTERIOLOGY Mar 2004 p 1818ndash1832 Vol 186 No 6

The pKO2 Linear Plasmid Prophage of Klebsiella oxytoca Sherwood R Casjens12 Eddie B Gilcrease1 Wai Mun Huang1 Kim L Bunny3

Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24

Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah 841321 Pittsburgh Bacteriophage Institute2 and Department of Biological Sciences4 University of Pittsburgh Pittsburgh Pennsylvania 15260 and Section of Microbiology University of California at Davis Davis California 956163

Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu hellipThe DNA sequence analysis software packages used were DNA Strider (27) GeneMark (8) the Staden programs (94) BLAST (3) BPROM httpwwwsoftberrycomberryphtmltopic_gfindb) and DNA Master (J Lawrence [httpcobamide2biopittedu])hellip

BMC Microbiology 2004 44 Analysis of the lambdoid prophage element e14 in the E coli K-12 genome Preeti Mehta1 Sherwood Casjens2 and Sankaran Krishnaswamy1 Address 1Bioinformatics Centre School of Biotechnology Madurai Kamaraj University Madurai-625021 India and 2University of Utah Medical School Department of Pathology 90 North 1900 East Salt Lake City UT 84132-2501 USA Email Preeti Mehta - mehta_p74yahoocom Sherwood Casjens - sherwoodcasjenspathutahedu Sankaran Krishnaswamy - krishnamrnatnnicin Corresponding author This article is available from httpwwwbiomedcentralcom1471-218044 Putative promoters predicted using BPROM available at the website http wwwsoftberrycom Scores are as given by BPROM Promoters Plant Molecular Biology 53 (6) 865-876 December 2003 Prokaryotic orthologues of mitochondrial alternative oxidase and plastid terminal oxidase Allison E McDonald Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Sasan Amirsadeghi Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Greg C Vanlerberghe Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada (e-mail gregvutscutorontoca The A variabilis PTOX sequence was analyzed in the upstream region of the start codon with Softberryrsquos BPROM software (httpwwwsoftberrycom)

SPLICEDB Plant Molecular Biology DOI 101007s11103-005-0271-1 Issue Volume 57 Number 3 Date February 2005 Pages 445 - 460 Evaluation of five ab initio gene prediction programs for the discovery of maize genes Hong Yao1 4 Ling Guo1 6 Yan Fu1 4 Lisa A Borsuk1 6 Tsui-Jung Wen2 David S Skibbe1 5 Xiangqin Cui1 4 9 Brian E Scheffler8 Jun Cao1 4 Scott J Emrich6 Daniel A Ashlock3 6 and Patrick S Schnable1 2 4 5 6 7

(1)Department of Genetics Development and Cell Biology Iowa State University Ames Iowa 50011-3650 (2) Department of Agronomy Iowa State University Ames Iowa 50011-3650 (3) Department of Mathematics Iowa State University Ames Iowa 50011-3650 (4) Inderdepartmental Graduate Programs in Genetics Iowa State University Ames Iowa 50011-3650 (5) Department of Molecular Cellular and Developmental Biology Iowa State University Ames Iowa 50011-3650 (6) Department of Electrical and Computer Engineering and Department of Bioinformatics and Computational Biology Iowa State University Ames Iowa 50011-3650 (7) Center for Plant Genomics Iowa State University Ames Iowa 50011-3650 (8) Mid South Area Genomics Facility USDA-ARS Stoneville MS 38776-0038 USA(9) Present address Department of Biostatistics Birmingham AL 35294 USA model FGENESH httpwwwsoftberrycom berryphtmltopic=fgeneshampgroup= programsampsubgroup=gfind Monocots Yes Yes Yes GHMM a GeneMark Finding short DNA motifs using permuted markov models X Zhao H Huang TP Speed The data are human donor sequences from SpliceDB [9] a recently developed database of known mammalian splice site sequences (httpwwwsoftberrycomspldb Current Opinion in Structural Biology 2004 14273ndash282 The evolving roles of alternative splicing Liana F Lareau1 Richard E Green1 Rajiv S Bhatnagar23 and Steven E Brenner12_ Departments of 1Molecular and Cell Biology and 2Plant and Microbial Biology University of California Berkeley California 94720 USA 3Department of Dermatology University of California San Francisco California 94143 USA _e-mail brennercompbioberkeleyedu [79] SpliceDB httpwwwsoftberrycomberryphtmltopicfrac14splicedb Database and composition statistics for mammalian splice sites inferred from ESTs [80] Yearbook of Medical Informatics Review Paper 2004 121-136 Curated databases and their role in clinical bioinformatics CC Englbrecht M Han MT Mader A Osanger KFX Mayer MIPS Institute for Bioinformatics Address of the authors Claudia C Englbrecht Michael Han

Michael T Mader Andreas Osanger Klaus F X Mayer MIPS Institute for Bioinformatics GSF - National Research Center for Environment and Health 85758 Neuherberg Germany E-mail kmayergsfdeCorresponding author hellipSpliceDB httpwwwsoftberrycomspldbSpliceDBhtmlCanonical and non-canonical mammalian splice sites [122] 122Burset M Seledtsov IA Solovyev VV SpliceDB database of canonical and non-canonical mammalian splice sites Nucleic Acids Res 200129255-9 Nucleic Acids Research 2001 Vol 29 No 1 255-259 SpliceDB database of canonical and non-canonical mammalian splice sites M Burset I A Seledtsov1 and V V Solovyev The Sanger Centre Hinxton Cambridge CB10 1SA UK and 1Softberry Inc 108 Corporate Park Drive Suite 120 White Plains NY 10604 USA To whom correspondence should be addressed at present address EOS Biotechnology 225A Gateway Boulevard South San Francisco CA 94080 USA Tel +1 650 246 2331 Fax +1 650 583 3881 Email solovyeveosbiotechcom Present address M Burset Institut Municipal drsquoInvestigacioacute Megravedica (IMIM) CDr Aiguader 80 08003 Barcelona Spain

SCAN2 African Journal of Biotechnology Vol 2 (12) pp 714-718 December 2003 Available online at httpwwwacademicjournalsorgAJB ISSN 1684ndash5315 copy 2003 Academic Journal Accepted 14 November 2003 Minireview Web-based bioinformatic resources for protein and nucleic acids sequence alignment Kamel A Abd-Elsalam Molecular Markers Lab Plant Pathology Research Institute Agricultural Research Center Orman 12619 Giza Egypt E-mail kaabdelsalammsncom 16-SCAN2 program for aligning two multimegabyte-size sequences httpwwwsoftberrycomberryphtmltopic=scanhampprg= SCAN2 derived Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK The full-length sequences of gene pairs have been aligned by the SCAN2 program (httpsoftberrycomberryphtmltopic=scanhampprg=SCAN2) which can align

PDISORDER BMC Bioinformatics 2005 622 doi1011861471-2105-6-22 Research article Open Access Proteins with two SUMO-like domains in chromatin-associated complexes The RENi (Rad60-Esc2-NIP45) family Maria Novatchkova1 Andreas Bachmair3 Birgit Eisenhaber2 and Frank Eisenhaber2 Address 1Gregor Mendel-Institut GMI Austrian Academy of Sciences Vienna Biocenter A-1030 Vienna Austria 2Research Institute of Molecular Pathology Dr Bohr-Gasse 7 A-1030 Vienna Austria and 3Max Planck Institute for Plant Breeding Research Carl-von-Linneacute-Weg 10 D-50829 Cologne Germany Email Maria Novatchkova - marianovatchkovagmioeawacat Andreas Bachmair - bachmairmpiz-koelnmpgde Birgit Eisenhaber - b_eisenimpunivieacat Frank Eisenhaber - FrankEisenhaberimpunivieacat Corresponding author hellipInitial analysis of its sequence complexity shows that the disordered N-terminal half of the protein is followed by a likely globular segment (predicted using Pdisorder by Softberry Inc)hellip

SPL The National Academy of Sciences Proc Natl Acad Sci U S A 2003 November 25 100(Suppl 2) 14537ndash14542 doi 101073pnas2335847100 Molecular evolution of the insect chemoreceptor gene superfamily in Drosophila melanogaster Hugh M Robertsondagger Coral G WarrDaggersect and John R Carlsonsect Department of Entomology University of Illinois 505 South Goodwin Avenue Urbana IL 61801 DaggerSchool of Biological Sciences Monash University Clayton VIC 3800 Australia and sectDepartment of Molecular Cellular and Developmental Biology Yale University New Haven CT 06520 The genes were reconstructed manually in the PAUP editor (23) by using the expected exonintron structures as guides and the SPL program (Softberry wwwsoftberrycomberryphtml) to locate predicted introns

NSITE GENETIC ANALYSES OF BOVINE CARD15 A PUTATIVE DISEASE RESISTANCE GENE A Dissertation by KRISTEN HAWKINS TAYLOR Submitted to Texas AampM University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 2004 Major Subject Genetics hellipShort motifs identified as being conserved between the three species in these intronic regions as well as in the 5rsquoUTR and 3rsquoUTR were then analyzed using the TFSCAN (httpzeonwelloxacukgit-bintfscan) and NSITE (available through SoftBerry httpwwwsoftberrycomberryphtmltopic=promoter) programs to identify putative regulatory motifs Motifs selected for analysis required homology consisting of 6 or more bases with no more than 2 substitutions among the 3 specieshellip hellipSequence that included the SNPs located within intronic regions and in the 3rsquo and 5rsquoUTRs were analyzed using NSITE (available through SoftBerry at httpwwwsoftberrycomberryphtmltopic=promoter) to identify putative regulatory motifshellip

TSSP Bioinformatics 2005 21(14)3074-3081 doi101093bioinformaticsbti490 Cis-regulatory element based targeted gene finding genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana Weixiong Zhang 12 Jianhua Ruan 1 Tuan-hua David Ho 3 Youngsook You 3 Taotao Yu 1 and Ralph S Quatrano 3 1Department of Computer Science and Engineering Washington University in Saint Louis Saint Louis MO 63130 USA 2Department of Genetics Washington University in Saint Louis Saint Louis MO 63130 USA 3Department of Biology Washington University in Saint Louis Saint Louis MO 63130 USA To whom correspondence should be addressed

sites (TSSs) To predict TSSs we combined an Athaliana cDNA database and a software TSSP (SoftBerry httpwwwsoftberrycom) As

PLANTPROM BMC Genomics 2005 6 25 Genome wide analysis of Arabidopsis core promoters Carlos Molina12 and Erich Grotewold 11Department of Plant Cellular and Molecular Biology and Plant Biotechnology Center The Ohio State University Columbus OH 43210 2Departamento de Informaacutetica Universidad Teacutecnica Federico Santa Mariacutea Valparaiacuteso Chile search for TATA elements is carried out on the 12749 [-500 -1] regions 6316 sequences (using the MEME NFM) or 8776 (using the expanded PlantProm NFM) are Nucleic Acids Research 2005 Vol 33 No 3 1069ndash1076 doi101093nargki247 Plant promoter prediction with confidence estimation I A Shahmuradov1 V V Solovyev12 and A J Gammerman1 1Royal Holloway University of London Egham Surrey TW20 0EX UK and 2Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA MATERIALS AND METHODS Training and testing sequences For training and testing procedures we used 301 promoters with annotated TSS from PlantProm DB (22) Nucleic Acids Research 2004 Vol 32 Database issue D368plusmnD372 DOI 101093nargkh017 AthaMap an online resource for in silico transcription factor binding sites in the Arabidopsis thaliana genome Nils Ole Steffens Claudia Galuschka Martin Schindler Lorenz BuEgravelow and Reinhard Hehl Institut fuEgrave r Genetik Technische UniversitaEgrave t Braunschweig Spielmannstraucirce 7 D-38106 Braunschweig Germany hellipShahmuradovIA GammermanAJ HancockJM BramleyPM and SolovyevVV (2003) PlantProm a database of plant promoter sequences Plant Physiology October 2004 Vol 136 pp 3023-3033 GENOME ANALYSIS Utility of Different Gene Enrichment Approaches Toward Identifying and Sequencing the Maize Gene Space1[w] Nathan Michael Springer Xiequn Xu and W Brad Barbazuk Center for Plant and Microbial Genomics Department of Plant Biology University of Minnesota St Paul Minnesota 55108 (NMS) and Donald Danforth Plant Sciences Center St Louis Missouri 63132 (XX WBB) Text] Shahmuradov IA Gammerman AJ Hancock JM Bramley PM Solovyev VV (2003) PlantProm a database of plant promoter sequences

Plant Physiology April 2004 Vol 134 pp 1ndash12 wwwplantphysiolorg Characterization of Three Functional High-Affinity Ammonium Transporters in Lotus japonicus with Differential Transcriptional Regulation and Spatial Expression1

Enrica DrsquoApuzzo2 Alessandra Rogato2 Ulrike Simon-Rosin Hicham El Alaoui3 Ani Barbulova Marco Betti Maria Dimou Panagiotis Katinakis Antonio Marquez Anne-Marie Marini Michael K Udvardi and Maurizio Chiurazzi Institute of Genetics and Biophysics Via Marconi 12 80125 Napoli Italy (ED AR HEA AB MC) Molecular Plant Nutrition Group Max Planck Institute of Molecular Plant Physiology Am Muhlenberg 1 14476 Golm Germany (US-R MKU) Universiteacute Libre de Bruxelles Institut de Biologie de Meacutedecine Moleacuteculaires 6041 Gosselies Belgium (A-MM) Agricultural University of Athens Department of Agricultural Biotechnology 11855 Athens Greece (MD PK) and Departamento de Bioquıacutemica Vegetal y Biologıacutea Molecular Facultad de Quimica 41080 Seville Spain (MB AM) hellipPlantProm a database of plant promoter sequences Nucleic Acids Res 31 114ndash117 Nucleic Acids Research 1 January 2003 vol 31 no 1 pp 114-117(4) PlantProm a database of plant promoter sequences Authors Bramley PM1 Solovyev VV2 Shahmuradov IA Gammerman AJ Hancock JM Affiliations Department of Computer Science Royal Holloway University of London Egham Surrey TW20 0EX UK 1 School of Biological Sciences Royal Holloway University of London UK 2 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 3 To whom correspondence should be addressed Email victorsoftberrycom Present address John M Hancock MRC Mammalian Genetics Unit Harwell Oxfordshire UK

PROMH Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK Received February 15 2003 Revised and Accepted March 21 2003

Other FGENESH Molecular Microbiology Volume 52 Issue 6 Page 1579 - June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Affiliations Departments of Medicine and Pathobiology University of Washington Harborview Medical Center Box 359779 325 Ninth Ave Seattle WA 98104 USA E-mail acenturuwashingtonedu Tel (+1) 206 341 5364 Fax (+1) 206 341 5363 Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Microbiology 150 (2004) 518-520 DOI 101099mic026871-0 IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated with recognizable overlapping genes Mark W Silby1 Paul B Rainey23 and Stuart B Levy14 1 Center for Adaptation Genetics and Drug Resistance Department of Molecular Biology and Microbiology Tufts University School of Medicine Boston MA 02111 USA 2 Department of Plant Sciences University of Oxford South Parks Road Oxford OX1 3RB UK 3 School of Biological Sciences University of Auckland Private Bag 92019 Auckland New Zealand 4 Department of Medicine Tufts University School of Medicine Boston MA 02111 USA Correspondence Stuart B Levy (stuartlevytuftsedu) hellipUsing SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of the iiv5 ORF respectivelyhellip Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 DOI 101128JVI782212576-125902004

Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song1 Qi Wei Qin2 Jin Qiu1 Can Hua Huang1 Fan Wang1 and Choy Leong Hew1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2 Corresponding author Mailing address Department of Biological Sciences National University of Singapore 10 Kent Ridge Crescent Singapore 119260 Singapore Phone 65-68742692 Fax 65-67795671 E-mail dbshewclnusedusg or dbsheadnusedusg Received 19 March 2004 Accepted 29 June 2004

hellipThe whole genome was also submitted to httpwwwsoftberrycom (Softberry Inc Mount Kisco NY) for identification of all potential ORFshellip hellipCoding capacity of the viral genomic DNA sequence Prediction of presumptive genes was carried out by using the viral gene prediction program under the website httpwwwsoftberrycom supplemented with Vector NTI suite 71hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1695-8 Issue Volume 109 Number 4 Date August 2004 Pages 681 - 689

Linkage disequilibrium and sequence diversity in a 500-kbp region around the adh1 locus in elite maize germplasm Mark Jung1 Ada Ching1 Dinakar Bhattramakki2 Maureen Dolan1 Scott Tingey1 Michele Morgante1 3 and Antoni Rafalski1

(1) DuPont Crop Genetics Experimental Station PO Box 80353 Wilmington DE 19880-0353 USA (2) Pioneer Hi-Bred International Inc 7300 NW 62nd Avenue PO Box 1004 Johnston IA 50131-1004 USA (3) Dipartimento di Produzione Vegetale e Tecnologie Agrarie Universita di Udine Via delle Scienze 208 33100 Udine ItalyReceived 26 January 2004 Accepted 2 April 2004 Published online 6 August 2004 Mark Jung Email marktjungcgrdupontcom 1) Gene locations were defined by several methods Annotations provided in Tikhonov et al (1999) were first used then FGENESH gene-finding software DNA Sequence - The Journal of Sequencing and Mapping Issue Volume 15 Number 4 August 2004 Pages 269 ndash 276 DOI 10108010425170412331279648 Isolation Characterization and Expression Analysis of a Leaf-specific Phosphoenolpyruvate Carboxylase Gene in Oryza sativa Chang-Fa Lin A1 Chun Wei A1 Li-Zhi Jiang A1 A2 Ke-Gui Li A1 Xiao-Yin Qian A1 Kotb Attia A1 Jin-Shui Yang A1 A1 State Key Laboratory of Genetic Engineering Institute of Genetics School of Life Sciences Fudan University Shanghai 200433 PRChina A2 National Key Laboratory for Soil Erosion and Dry Land Farming on Loess Plateau Northwest Sci-Tech University of Agriculture and Forest Shannxi 712100 PRChina tools of GeneMark (httpopal biologygatechedugeneMark) and Softberry (httpwwwsoftberrycom) For the isolation of putative Plant Molecular Biology DOI 101023BPLAN00000382568980957 Issue Volume 54 Number 4 Date March 2004 Pages 519 ndash 532

Genome-Wide Analysis of the GRAS Gene Family in Rice and Arabidopsis Chaoguang Tian1 Ping Wan1 Shouhong Sun1 Jiayang Li1 and Mingsheng Chen1

(1) Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road Chaoyang District Beijing 100101 ChinaMingsheng Chen Email mschengeneticsaccn

database FgeneSH (Salamov and 90 Solovyev 2000) was used for gene prediction pre- 207 dicted by FgeneSH (minor discrepancies exist due 208 Mycological Research (2004) 108 853-857 Cambridge University Press doi 101017S095375620400067X Published Online 13 авг 2004 Combining transcriptome data with genomic and cDNA sequence alignments to make confident functional assignments for Aspergillus nidulans genes Andrew H SIMS a1 Manda E GENT a1 Geoffrey D ROBSON a1 Nigel S DUNN-COLEMAN a2 and Stephen G OLIVER a1c1 a1 School of Biological Sciences University of Manchester The Michael Smith Building Oxford Road Manchester M13 9PT UK E-mail steveolivermanacuk a2 Genencor International Inc 925 Page Mill Road Palo Alto CA 94304 USA c1 Corresponding author Kingdom Page 2 Genewise FgeneSH FgeneSH+) consisting of 9541 putative open reading frames (ORFs) was released in June 2003 We TAG Theoretical and Applied Genetics DOI 101007s00122-004-1603-2 Issue Volume 109 Number 1 Date June 2004 Pages 129 ndash 139

Gene content and density in banana (Musa acuminata) as revealed by genomic sequencing of BAC clones R Aert1 2 L Saacutegi2 and G Volckaert1

(1) Laboratory of Gene Technology Katholieke Universiteit Leuven Kasteelpark Arenberg 21 3001 Leuven Belgium

Present address Laboratory of Tropical Crop Improvement Katholieke Universiteit Leuven (2) Kasteelpark Arenberg 13 3001 Leuven Belgium

R Aert Email RitaAertagrkuleuvenacbe

gscrikengojp) fgenesh version 11 (Salamov and Solovyev 2000 httpwwwsoftberry com) genemarkhmm version 22a (Lukashin and Borodovsky 1998 http Genome Research 142503-2509 2004 Resources EAnnot A genome annotation tool using experimental evidence Li Ding1 Aniko Sabo Nicolas Berkowicz Rekha R Meyer Yoram Shotland Mark R Johnson Kymberlie H Pepin Richard K Wilson and John Spieth Genome Sequencing Center Washington University School of Medicine St Louis Missouri 63110 USA Article and publication are at httpwwwgenomeorgcgidoi101101gr3152604 1 Corresponding author E-mail ldingwatsonwustledu fax (314) 286-1810 Some ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov and Solovyev 2000 ) are based on intrinsic characteristics of coding hellipSome ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov

and Solovyev 2000 ) are based on intrinsic characteristics of coding sequence (eg codon usage consensus splice sites etc) and require training on known genes from the organismhellip

hellipTo further evaluate the performance of EAnnot we compared EAnnot predictions with Ensembl Genscan and Fgenesh predictions using manual annotation as a standard While Genscan and Fgenesh are ab initio programs Ensembl takes into account experimental data a feature shared with EAnnot Ensembl predicted 1037 known genes with 1798 transcripts and 1457 EST genes with 2308 transcripts for chromosome 6 (build 31) while Fgenesh and Genscan predicted 6230 and 6225 genes respectively We evaluated the performance of each program with respect to splice sites transcripts and genes across all of chromosome 6hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1758-x Issue Volume 109 Number 7 Date November 2004 Pages 1434 - 1447

Full-genome analysis of resistance gene homologues in rice

B Monosi1 R J Wisser2 L Pennill1 and S H Hulbert1

(1) Department of Plant Pathology Kansas State University Manhattan KS 66506-5502 USA (2) Department of Plant Pathology Cornell University Ithaca NY 14853 USAReceived 18 February 2004 Accepted 16 June 2004 Published online 10 August 2004 S H Hulbert Email shulbrtksuedu DNA sequences were analyzed using the gene prediction programs GENSCAN (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (Salamov and arXivq-bioGN0402046 v1 27 Feb 2004 Sublinear growth of Information in DNA sequences Giulia Menconi Dipartimento di Matematica Applicata and CISSC Centro Interdisciplinare per lo Studio dei Sistemi Complessi Universit`a di Pisa Via Bonanno Pisano 25b 56126 PISA - Italy menconimaildmunipiit October 23 2003 hellipAs a result four putative genes G1 G2 G3 and G4 have been located by means of Hidden Markov Model-based program FGENESH2 that has been created for predicting multiple genes and their structure in genomic DNA sequences The analysis via FGENESH has been exploited with respect to known genes in Arabidopsis thaliana Their predicted position is illustrated in Figure 13 hellip2This program is available at the website wwwsoftberrycom to which we refer con-cerning the reliability and e_ciency of the algorithmhellip Current Opinion in Plant Biology 2004 7732ndash736 DOI 101016jpbi200409003 Consistent over-estimation of gene number in complex plant genomes Jeffrey L Bennetzen14 Craig Coleman27 Renyi Liu15 Jianxin Ma16 and Wusirika Ramakrishna38 1 Department of Genetics University of Georgia Athens Georgia 30602 USA 2 Department of Plant and Animal Sciences Brigham Young University Provo Utah 84602 USA 3 Department of Biological Sciences Michigan Tech University Houghton Michigan 49931 USA

4e-mail maizeugaedu 5e-mail lryugaedu 6e-mail jmaugaedu 7e-mail biotechnologybyuedu 8e-mail wusirikamtuedu hellipWe have found that the standard gene-discovery programs FGENESH GeneMark and GENSCAN annotate segments of most retrotransposons and many invertedrepeat transposable elements as genes Using FGENESH to annotate maize BAC clones for instance 70ndash100 of the predicted genes are actually from transposable elementshellip The Plant Cell 162795-2808 (2004) Spotted leaf11 a Negative Regulator of Plant Cell Death and Defense Encodes a U-BoxArmadillo Repeat Protein Endowed with E3 Ubiquitin Ligase Activity Li-Rong Zenga Shaohong Qua Alicia Bordeosb Chengwei Yangc Marietta Baraoidanb Hongyan Yanc Qi Xiec Baek Hie Nahmd Hei Leungb and Guo-Liang Wanga1 a Department of Plant Pathology Ohio State University Columbus Ohio 43210 b International Rice Research Institute Metro Manila Philippines c State Key Lab for Biocontrol Sun Yat-sen (Zhongshan) University Guangzhou China 510275 d Department of Biological Science Myongji University Kyonggido Korea 449728 1 To whom correspondence should be addressed E-mail wang620osuedu fax 614-292-4455 in spl11 Exons predicted in G3 by the programs GENSCAN and Fgenesh using different matrixes are displayed in dark gray (D) RFLP Source Human Genomics Volume 1 Number 2 January 2004 pp 146-149(4) Publisher Henry Stewart Publications The truth about mouse human worms and yeast Authors David R Nelson1 Daniel W Nebert2 1 Department of Molecular Sciences and The UT Center of Excellence in Genomics and Bioinformatics University of Tennessee Memphis Tennessee 38163 USA 2 Department of Environmental Health and Center for Environmental Genetics (CEG) University of Cincinnati Medical Center Cincinnati Ohio 45267-0056 USA unpublished data 2003 see also Ref [7]) FGENESH 21 TWINSCAN 22 and the Ensembl annotation pipeline 23 The output of the four Genome Biology 2004 5R73 doi101186gb-2004-5-10-r73 A comprehensive transcript index of the human genome generated using microarrays and computational approaches Eric E Schadt 1 Stephen W Edwards 1 Debraj GuhaThakurta1 Dan Holder2 Lisa Ying2 Vladimir Svetnik2 Amy Leonardson1 Kyle W Hart3 Archie Russell1 Guoya Li1 Guy Cavet1 John Castle1 Paul McDonagh4 Zhengyan Kan1 Ronghua Chen1 Andrew Kasarskis1 Mihai Margarint1 Ramon M Caceres1 Jason M Johnson1

Christopher D Armour1 Philip W Garrett-Engele1 Nicholas F Tsinoremas5 and Daniel D Shoemaker1

1Rosetta Inpharmatics LLC 12040 115th Avenue NE Kirkland WA 98034 USA 2Merck Research Laboratories W42-213 Sumneytown Pike POB 4 Westpoint PA 19846 USA 3Rally Scientific 41 Fayette Street Suite 1 Watertown MA 02472 USA 4Amgen Inc 1201 Amgen Court W Seattle WA 98119 USA 5The Scripps Research Institute Jupiter FL 33458 USA hellipGrailEXP 40 [47] GENSCAN 10 [48] FGENESH [49] and FGENESH+ [49]ab initio gene-prediction algorithms were run independently across the entire genome assembly to augment alignment-based gene identification methods GrailEXP 40 GENSCAN 10 and FGENESH version 1c were run with default parameters for human sequence GrailEXP used expressed sequence evidence from RefSeq UniGene and DoubleTwist HGI to refine gene predictions FGENESH+ was run with protein sequences from BLASTX with E-score lower than 10-5 When multiple protein alignments overlapped all overlapping protein sequences were clustered with BLASTClust [50] and the lowest E-score hit was used by FGENESH+hellip hellipAdditionally 3 sim4 and 3 or 5 FGENESH+ predicted exons that were short andor distant from internal predicted exons were removedhellip Genome Research 14988-995 2004 ENSEMBL Special GeneWise and Genomewise Ewan Birney13 Michele Clamp2 and Richard Durbin2 1 The European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK hellipThere has been a long history of successful ab initio programs which do not use any additional

evidence to predict genes on genomic DNA of which Genscan (Burge and Karlin 1997 ) and Fgenesh (Solovyev and Salamov 1997 ) are two of the most successful caseshellip hellipAnother class of evidence-based gene prediction programs are ones which use external evidence to influence the scoring of potential exons including SGP-2 (Parra et al 2003 ) Genie

(Kulp et al 1996 ) Genomescan (Yeh et al 2001 ) HMMGene (Krogh 2000 ) and Fgenesh++ (Solovyev and Salamov 1997 )hellip Published online before print February 5 2004 101073pnas0308430100 PNAS | February 17 2004 | vol 101 | no 7 | 1910-1915 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin Nam Joonyul Kim para Shinyoung Lee Gynheung An Hong Ma and Masatoshi Nei Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State

University University Park PA 16802 and National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Contributed by Masatoshi Nei December 22 2003

hellipBecause annotation of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program FGENESH (wwwsoftberrycom) from the genome sequences obtained from TIGR and the Rice Genome Database (China) (25) Functional amp Integrative Genomics DOI 101007s10142-004-0109-y Issue Volume 4 Number 2 Date May 2004 Pages 102 - 117

Sequence analysis of the long arm of rice chromosome 11 for ricendashwheat synteny

Nagendra K Singh1 Saurabh Raghuvanshi2 Subodh K Srivastava1 Anupama Gaur2 Ajit K Pal1 Vivek Dalal1 Archana Singh1 Irfan A Ghazi1 Ashutosh Bhargav1 Mahavir Yadav1 Anupam Dixit1 Kamlesh Batra1 Kishor Gaikwad1 Tilak R Sharma1 Amitabh Mohanty2 Arvind K Bharti2 Anita Kapur2 Vikrant Gupta2 Dibyendu Kumar2 Shubha Vij2 Ravi Vydianathan2 Parul Khurana2 Sulabha Sharma2 W Richard McCombie3 Joachim Messing4 Rod Wing5 Takuji Sasaki6 Paramjit Khurana2 Trilochan Mohapatra1 Jitendra P Khurana2 and Akhilesh K Tyagi2

1 Indian Initiative for Rice Genome Sequencing National Research Centre on Plant Biotechnology

Indian Agricultural Research Institute New Delhi 110012 India 2 Indian Initiative for Rice Genome Sequencing Department of Plant Molecular Biology University of

Delhi South Campus New Delhi 110021 India 3 Genome Research Centre Cold Spring Harbor Laboratory 1 Bungtown Road New York USA 4 The Plant Genome Initiative at RutgersndashWaksman Institute Rutgers University 190 Frelinghuysen

Road Piscataway NJ 08873 USA 5 Department of Plant Sciences 303 Forbes Building Arizona Genomics Institute The University of

Arizona Tucson AZ 85721 USA 6 Department of Genome Research National Institute of Agrobiological Sciences 1-2 Kannondai 2-

chome Tsukuba Ibaraki 305-8602 Japan Akhilesh K Tyagi Email akhileshgenomeindiaorg Wherever RiceGAAS data were not available the genes were predicted by FGENESH trained for monocot plant species (http wwwsoftberrycomberryphtml) TAG Theoretical and Applied Genetics DOI 101007s00122-004-1621-0 Issue Volume 109 Number 1 Date June 2004 Pages 10 - 22

Annotation of a 95-kb Populus deltoides genomic sequence reveals a disease resistance gene cluster and novel class I and class II transposable elements

M Lescot1 4 S Rombauts1 J Zhang1 S Aubourg1 5 C Matheacute1 6 S Jansson2 P Rouzeacute1 3 and W Boerjan1

1 Department of Plant Systems Biology Flanders Interuniversity Institute for Biotechnology Ghent University Technologiepark 927 9052 Gent Belgium

2 Department of Plant Physiology University of Umearing 901 87 Umearing Sweden 3 Laboratoire Associeacute de l Institut National de la Recherche Agronomique (France) Ghent

University 9052 Gent Belgium 4 Present address CIRAD-Biotrop TA4003 34398 Montpellier Cedex 5 France 5 Pre sent address Uniteacute de Recherche en Geacutenomique Veacutegeacutetale INRA 91057 Evry Cedex

France 6 Present address Laboratoire de Biologie Vasculaire Institut de Pharmacologie et Biologie

Structurale 205 route de Narbonne 31077 Toulouse Cedex France

P Rouzeacute Email pierrerouzepsbugentbe 1999 httpwwwtigrorgtdbglimmermglmr_formhtml) and FgenesH for dicots or monocots (Salamov and Solovyev 2000 httpwwwsoftberrycom) BIOINFORMATICS 2004 vol20 N9 p1416-1427 J Yuan B Bush A Elbrecht Y Liu T Zhang W Zhao hellip - suchasGRAIL(Lopezetal 1994 Roberts 1991 Uberbacher et al 1996) GENESCOPE (Murakami and Takagi 1998) fgenesh (Salamov and Solovyev 2000) GeneMark Molecular Plant Pathology Volume 5 Issue 6 Page 515 - November 2004 Pathogen profile Heading for disaster Fusarium graminearum on cereal crops RUBELLA S GOSWAMI AND H CORBY KISTLERCorrespondence E-mail HCKISTumnedu This pipeline uses a combination of the programs FGENESH and FGENESH+ (Salamov and Solovyev 2000) modified by Softberry ( http wwwsoftberrycom ) with Nucleic Acids Research 2004 Vol 32 Database issue D41-D44 MIPS analysis and annotation of proteins from whole genomes H W Mewes12 C Amid1 R Arnold1 D Frishman2 U Guumlldener1 G Mannhaupt2 M Muumlnsterkoumltter1 P Pagel1 N Strack2 V Stuumlmpflen1 J Warfsmann1 and A Ruepp1 1 Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1 D-85764 Neuherberg Germany and 2 Technische Universitaumlt Muumlnchen Chair of Genome Oriented Bioinformatics Center of Life and Food Science D-85350 Freising-Weihenstephan Germany To whom correspondence should be addressed at Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 D-85764 Neuherberg Germany Tel +49 89 3187 3580 Fax +49 89 3187 3585 Email wmewesgsfde The genome of 40 Mb encodes 10 000 proteins automatically predicted by the program FGENESH (httpsoftberry com) specifically trained for Neurospora Annual Review of Genomics and Human Genetics Vol 5 15-56 (Volume publication date September 2004) COMPARATIVE GENOMICS Webb Miller Kateryna D Makova Anton Nekrutenko and Ross C Hardison The Center for Comparative Genomics and Bioinformatics The Huck Institutes of Life Sciences and the Departments of Biology Computer Science and Engineering and Biochemistry and Molecular Biology Pennsylvania State University University Park Pennsylvania email webbbxpsuedu kdm16psuedu antonbxpsuedu rch8psuedu These algorithms include Genscan the most popular gene prediction tool (24) GenMark (117) Fgenesh (155) GeneID (144) and others (for an excellent overview DNA and Cell Biology May 2004 Vol 23 No 5 311-324 Harbinger Transposons and an Ancient HARBI1 Gene Derived from a Transposase

Vladimir V Kapitonov Genetic Information Research Institute Mountain View California Jerzy Jurka Genetic Information Research Institute Mountain View California We used FGENESH (Salamov and Solovyev 2000) and GeneScan (Burge and Karlin 1997) for the identification of exons and introns The d N d S ratio Proc Natl Acad Sci U S A 2004 February 17 101(7) 1910ndash1915 Published online 2004 February 5 doi 101073pnas0308430100 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin NamdaggerDagger Joonyul Kimsectpara Shinyoung Leesect Gynheung Ansect Hong Madagger and Masatoshi Neidagger

daggerInstitute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University University Park PA 16802 and sectNational Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Dagger To whom correspondence should be addressed E-mail jyn101psuedu paraPresent address Michigan State UniversityndashDepartment of Energy Plant Research Laboratory and Department of Biochemistry and Molecular Biology Michigan State University East Lansing MI 48824 of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program fgenesh (wwwsoftberrycom Nucleic Acids Research 2004 Vol 32 Database issue D377-D382 BGI-RIS an integrated information resource and comparative analysis workbench for rice genomics Wenming Zhao1 Jing Wang2 Ximiao He1 Xiaobing Huang1 Yongzhi Jiao1 Mingtao Dai1 Shulin Wei1 Jian Fu1 Ye Chen1 Xiaoyu Ren1 Yong Zhang12 Peixiang Ni1 Jianguo Zhang1 Songgang Li12 Jian Wang1 Gane Ka-Shu Wong13 Hongyu Zhao4 Jun Yu1 Huanming Yang1 and Jun Wang1 1 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3 University of Washington Genome Center Department of Medicine Seattle WA 98195 USA and 4 Yale University School of Medicine Department of Epidemiology and Public Health New Haven CT 06520-8034 USA To whom correspondence should be addressed Tel +86 10 80481662 Fax +86 10 80498676 Email wangjgenomicsorgcn Correspondence may also be addressed to Huanming Yang Tel +86 10 80494969 Fax +86 10 80491181 Email yanghmgenomicsorgcn The authors wish it to be known that in their opinion the first four authors should be regarded as joint First Authors The contig sequences were annotated for gene content by using automated processes that involve ab initio gene finders such as FgeneSH (httpwwwsoftberrycom Genome Research 141932-1937 2004 Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome

Jinsheng Lai1 Nrisingha Dey2 Cheol-Soo Kim35 Arvind K Bharti1 Stephen Rudd46 Klaus FX Mayer4 Brian A Larkins3 Philip Becraft2 and Joachim Messing17 1 Waksman Institute Rutgers The State University of New Jersey Piscataway New Jersey 08854 USA 2 Department of Genetics Development amp Cell Biology Iowa State University Ames Iowa 50011 USA 3 Department of Plant Science University of Arizona Tucson Arizona 85721 USA 4 Munich Information Center for Protein Sequences Institute for Bioinformatics GSF Research Center for Environment and Health Neuherberg Germany A total of 54397 putative genes could be predicted for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for Plant Molecular Biology DOI 101023BPLAN000002876821587dc Issue Volume 54 Number 1 Date January 2004 Pages 55 - 69

Dynamics of the evolution of orthologous and paralogous portions of a complex locus region in two genomes of allopolyploid wheat

Xiu-Ying Kong1 2 Yong Qiang Gu3 Frank M You4 Jorge Dubcovsky4 and Olin D Anderson3

1 Genetic Resources Conservation Program University of California Davis CA 95616 USA 2 Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081

China 3 US Dept of Agriculture Western Regional Research Center Agricultural Research Service 800

Buchanan Street Albany CA 94710 USA 4 Department of Agronomy and Range Sciences University of California Davis CA 95616 USA FGENESH (httpwwwsoftberrycomberryphtml) and GENES- CAN (httpgenemark miteduGENESCANhtml) were used for gene prediction Current Genetics DOI 101007s00294-003-0451-y Issue Volume 44 Number 6 Date January 2004 Pages 329 - 338

Chromosome rearrangements in isolates that escape from het-c heterokaryon incompatibility in Neurospora crassa

Qijun Xiang1 and N Louise Glass1

Department of Plant and Microbial Biology University of California Berkeley CA 94720-3102 USA Hypothetical proteins are predicted from FGENESH calls with overlapping Blastx hits (but not with trusted homology) while Predicted Molecular Genetics and Genomics DOI 101007s00438-004-0990-z Issue Volume 271 Number 4 Date May 2004 Pages 402 - 415

Genome-wide identification of NBS genes in japonica rice reveals significant expansion of divergent non-TIR NBS-LRR genes

T Zhou1 Y Wang1 J-Q Chen1 H Araki2 Z Jing1 K Jiang1 J Shen1 and D Tian1

1 State Key Laboratory of Pharmaceutical Biotechnology Department of Biology Nanjing University 210093 Nanjing China

2 Department of Ecology and Evolution University of Chicago Chicago IL 60637 USA to 5000ndash10000 bp from both ends of the hits and then the expanded nucleotide fragments were reannotated using the gene-finding programs FGENESH (http www Proc Natl Acad Sci U S A 2004 June 15 101(24) 9045ndash9050 Genetics

Genetic control of branching in foxtail millet Andrew N Doust Katrien M DevosDaggerdagger Michael D Gadberrysect Mike D GaleDagger and Elizabeth A KelloggDepartment of Biology University of Missouri 8001 Natural Bridge Road St Louis MO 63121 and DaggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH United Kingdom dagger To whom correspondence should be addressed E-mail adoustumsledu sectPresent address Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building University of Georgia Athens GA 30602 Communicated by Peter H Raven Missouri Botanical Garden St Louis MO April 23 2004 Each of these contigs was scanned by using fgenesh (28) and identified ORFs were translated and compared with ORFs from other contigs from the same QTL region Mol Biol Evol 21(9)1769-1780 2004 Merlin a New Superfamily of DNA Transposons Identified in Diverse Animal Genomes and Related to Bacterial IS1016 Insertion Sequences Ceacutedric Feschotte1 Departments of Plant Biology and Genetics The University of Georgia Athens Correspondence E-mail cedricplantbiougaedu coding sequences were assembled by removing introns predicted with more than 85 confidence by NetGene2 (httpwwwcbsdtudk) andor FGENESH (httpgenomic Genome Research 141924ndash1931 copy2004 Gene Loss and Movement in the Maize Genome Jinsheng Lai1 Jianxin Ma23 Zuzana Swigonˇovб1 Wusirika Ramakrishna24

Eric Linton15 Victor Llaca16 Bahattin Tanyolac17 Yong-Jin Park28 O-Young Jeong29

Jeffrey L Bennetzen23 and Joachim Messing110 1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854-8020 USA 2Department of Biological Sciences Purdue University West Lafayette Indiana 47907-1392 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602-7223 USA 4Department of Biological Sciences 740 DOW Michigan Tech University Houghton MI 49931 USA 5Plant Biology Labs Michigan State University East Lansing MI 48824 USA 6Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880-0353 USA 7Department of Bioengineering Ege University Izmir 35100 Turkey 8National Institute of Agricultural Biotechnology Suwon 441-707 Republic of Korea 9National Institute of Crop Science Suwon 441-857 Republic of Korea 10Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 The FGENESH program predicted four of which three (gene 1d in the maize orp1 region gene 5a 5b in the rice r1 region) would produce truncated proteins

Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Molecular Microbiology Volume 54 Issue 2 Page 407 - October 2004 doi101111j1365-2958200404310x Cryptococcus neoformans Kin1 protein kinase homologue identified through a Caenorhabditis elegans screen promotes virulence in mammals Eleftherios Mylonakis1 Alexander Idnurm2 Roberto Moreno1 Joseph El Khoury134 James B Rottman5 Frederick M Ausubel67 Joseph Heitman28910 and Stephen B Calderwood1111Division of Infectious Diseases Massachusetts General Hospital Boston MA 02114 USA 2Department of Molecular Genetics and Microbiology Duke University Medical Center Durham NC 27710 USA 3Center for Immunology and Inflammatory Diseases and 4Division of Rheumatology Allergy and Immunology Massachusetts General Hospital Boston MA 02114 USA 5Archemix Corporation Cambridge MA 02139 USA 6Department of Molecular Biology Massachusetts General Hospital Boston MA 02114 USA 7Department of Genetics Harvard Medical School Boston MA 02115 USA 8Division of Infectious Diseases 9Department of Medicine and 10Howard Hughes Medical Institute Duke University Medical Center Durham NC 27710 USA 11Department of Microbiology and Molecular Genetics Harvard Medical School Boston MA 02115 USA E-mail scalderwoodpartnersorg

Sequences were compared with the H99 genome database at Duke University and genes predicted in these regions by FGENESH software ( http wwwsoftberrycom TAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Original Paper Characterization of soybean genomic features by analysis of its expressed sequence tags Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

1 Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

2 Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China Jin-Song Zhang Email jszhanggeneticsaccnShou-Yi Chen Email sychengeneticsaccn six BAC-contig sequences of M truncatula were analyzed and the results based on the gene prediction program FGENSH (Arabidopsis matchFGENESH prediction) (http Current Proteomics Volume 1 Number 1 January 2004 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE11 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Chinese Science Bulletin 2004 Vol 49 No 4 355-362 The VER2 promoter contains repeated sequences and requires vernalization for its activity in winter wheat (Triticum aestivum L) XU Wenzhong

1 WANG Xin

1 FENG Qi

2 ZHANG Lei

2 LIU Yaoguang

3 HAN Bin

2 CHONG

Kang1 XU Zhihong

1 amp TAN Kehui

1

1 Research Center for Molecular Developmental Biology Key Lab of Photosynthesis and Environmental Molecular Physiology Institute of Botany Chinese Academy of Sciences (CAS) Beijing 100093 China 2 National Center for Gene Research CAS Shanghai 200233 China 3 Genetic Engineering Laboratory College of Life Sciences South China Agricultural University Guangzhou 510642 China Correspondence should be addressed to Chong Kang (e-mail mailtochongknsibcasaccn) Sequence analyses were finished using biological softwares on Internet such as FGENESH 10 (Prediction of potential genes in Plant (Dct) genomic DNA)

TAG Theoretical and Applied Genetics DOI 101007s00122-003-1457-z Issue Volume 108 Number 3 Date February 2004 Pages 392 - 400

Sequence variations of simple sequence repeats on chromosome-4 in two subspecies of the Asian cultivated rice Can Li1 Yu Zhang1 Kai Ying1 Xiaolei Liang1 and Bin Han1

(1) National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China To characterize the possible relationship between SSRs and genes predicted by using FGENESH we investigated the distribution of SSRs in the rice chromosome-4 Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene TAG Theoretical and Applied Genetics DOI 101007s00122-004-1591-2 Issue Volume 108 Number 8 Date May 2004 Pages 1449 - 1457

Positional cloning of the rice Rf-1 gene a restorer of BT-type cytoplasmic male sterility that encodes a mitochondria-targeting PPR protein

H Akagi1 A Nakamura2 Y Yokozeki-Misono2 A Inagaki2 4 H Takahashi1 K Mori1 and T Fujimura3

1 Laboratory of Plant Breeding and Genetics Department of Biological Production Faculty of Bioresource Sciences Akita Prefectural University Kaidoubata-Nishi 241-7 Shimoshinjyo-Nakano 010-0195 Akita Japan

2 Biochemical Technology Section Life Science Laboratory Performance Materials RampD Center Mitsui Chemicals Togo 1144 297-0017 Mobara Japan

3 Institute of Agricultural and Forest Engineering University of Tsukuba Tennoudai 1-1-1 Tsukuba 305-8572 Ibaraki Japan

4 Present address Department of Synthetic Chemistry and Biological Chemistry Graduate School of Engineering Kyoto University Yoshida-Honmachi Sakyo-ku 606-8501 Kyoto japan

H Akagi Email akagiakita-puacjp Software Develop- ment Tokyo) Genomic sequences were also analyzed using gene prediction programs genescan and fgenesh Table 1 DNA Genome Research 14942-950 2004

The Ensembl Automatic Gene Annotation System Val Curwen1 Eduardo Eyras1 T Daniel Andrews1 Laura Clarke1 Emmanuel Mongin2 Steven MJ Searle1 and Michele Clamp34 1 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 EMBL European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SD UK 3 The Broad Institute Cambridge Massachusetts 02141 USA Commonly we use Genscan for ab initio prediction in human mouse and rat but the system is equally applicable to other methods such as FgenesH (Solovyev et Gene 324 (2004) 105ndash115 Transcript abundance of rml1 encoding a putative GT1-like factor in rice is up-regulated by Magnaporthe grisea and down-regulated by light Rong Wang ab Guofan Hongab Bin Hana a

National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China b

Shanghai Institute of Biochemistry and Cell Biology Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 320 Yueyang Road Shanghai 200031 China 1A and 2A) The structure of rml1 _ a is as same as that predicted by FGENESH software The 3V UTR of rml1 _ a is confirmed with the length of 596 bp Genome Biology 2004 5R46 doi101186gb-2004-5-7-r46 Identification of conserved gene structures and carboxy-terminal motifs in the Myb gene family of Arabidopsis and Oryza sativa L ssp Indica Cizhong Jiang1 Xun Gu1 2 and Thomas Peterson1 1Department of Genetics Development and Cell Biology and Department of Agronomy Iowa State University Ames IA 50011 USA 2LHB Center for Bioinformatics and Biological Statistics Iowa State University Ames IA 50011 USA FGeneSH has been used successfully to predict genes in rice [9] and GenScan was used together with it to predict genes by taking rice genomic sequences as Molecular Microbiology 53 (5) 1307-1318 - September 2004 doi 101111 j1365-2958200404215x The sirodesmin biosynthetic gene cluster of the plant pathogenic fungus Leptosphaeria maculans Donald M Gardiner1 Anton J Cozijnsen1 Leanne M Wilson1 M Soledade C Pedras2 and Barbara J Howlett1

1School of Botany The University of Melbourne Victoria Australia 3010 2Department of Chemistry University of Saskatchewan 110 Science Place Saskatoon SK Canada S7N 5C9 E-mail dgardinerpgradunimelbeduau http wwwtigrorg Putative genes were predicted using Fgenesh software at http wwwsoftberrycom Fungal culture The wild type Journal of Molecular Evolution DOI 101007s00239-004-2666-z Issue Volume 59 Number 6 Date December 2004 Pages 761 - 770 Analysis of the Molecular Evolutionary History of the Ascorbate Peroxidase Gene Family Inferences from the Rice Genome Felipe Karam Teixeira1 Larissa Menezes-Benavente1 Rogeacuterio Margis1 2 and Maacutercia Margis-Pinheiro1

(1) Laboratoacuterio de Geneacutetica Molecular Vegetal Departamento de Geneacutetica UFRJ 21944-970 Rio de Janeiro Brasil(2) Departamento de Bioquiacutemica Instituto de Quiacutemica UFRJ 21944-970 Rio de Janeiro Brasil Genomic se- quences were also analyzed in the FGENESH gene structure pre- diction program (httpwwwsoftberrycom) (Solovyev 2001) and GeneMark program (http Incomplete gene structure prediction with almost 100 specificity SL Chin J Xiong T Ioerger SH Sze - 2004 - txspacetamuedu iv Fgenesh-C The proposed gene structure prediction algorithm by far has the best 25 12 Comparison against Est2Genome Sim4 Spidey Fgenesh-c Journal of Biotechnology 109 (2004) 217ndash226

Preparation of single rice chromosome for construction of a DNA library using a laser microbeam trap

Xiaohui Liu a1 Haowei Wang b1 Yinmei Li b Yesheng Tang a Yilei Liu a Xin u a Peixin Jia a Kai Ying a Qi Feng a Jianping Guan a Chaoqing Jin a Lei Zhang a Liren Lou b Zhuan Zhou c Bin Han aA National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200233 China B Department of Physics University of Science and Technology of China Hefei 230026 China C Institute of Neuroscience Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200031 China Corresponding author Fax +86-21-64825775 E-mail address bhanncgraccn (B Han) ers These sequences were further annotated using gene-prediction software FGENESH to give the pos- sible protein-coding region Science 303 1364-1367 Medicago truncatula DMI1 Required for Bacterial and Fungal Symbioses in Legumes Ane et al (2004) 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberry phtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Proc Natl Acad Sci U S A 2004 August 24 101(34) 12404ndash12410 Inaugural Articles Rapid recent growth and divergence of rice nuclear genomes Jianxin Ma and Jeffrey L Bennetzen

Department of Genetics University of Georgia Athens GA 30602 To whom correspondence should be addressed E-mail maizeugaedu Contributed by Jeffrey L Bennetzen May 25 2004 Almost all LTR-retrotransposons including solo LTRs identified in our studies were predicted as genes by the gene-finding program fgenesh (data not shown) The Plant Journal Volume 37 Issue 4 Page 517 -527 - February 2004 doi101046j1365-313X200301976x Xa26 a gene conferring resistance to Xanthomonas oryzae pv oryzae in rice encodes an LRR receptor kinase-like protein Xinli Sun Yinglong Cao Zhifen Yang Caiguo Xu Xianghua Li Shiping Wang and Qifa Zhang National Key Laboratory of Crop Genetic Improvement National Center of Crop Molecular Breeding Huazhong Agricultural University Wuhan 430070 China For correspondence (fax +86 27 87287092 e-mail swangmailhzaueducn) al 1997) Gene prediction programs used were genscan (Burge and Karlin 1997) and fgenesh (httpwwwsoftberrycom) Promoter Genome Research 141916ndash1923 copy2004 by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonˇovaґ16 Jinsheng Lai16 Jianxin Ma23 Wusirika Ramakrishna24

Victor Llaca15 Jeffrey L Bennetzen23 and Joachim Messing17

1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854 USA

2Department of Biological Sciences and Genetics Program West Lafayette Indiana 47907 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602 USA 4Department of Biological Sciences Michigan Tech University MI 49931 USA 5Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880 USA 6These authors contributed equally to this work 7Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 Page 1 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonovaacute 16 Jinsheng Lai 16 Jianxin Ma 23 Wusirika Ramakrishna 24 TAG Theoretical and Applied Genetics DOI 101007s00122-004-1667-z Issue Volume 109 Number 3 Date August 2004 Pages 515 - 522

Physical mapping and putative candidate gene identification of a quantitative trait locus Ctb1 for cold tolerance at the booting stage of rice K Saito1 Y Hayano-Saito1 W Maruyama-Funatsuki1 Y Sato1 and A Kato1

(1) National Agricultural Research Center for Hokkaido Region Hitsujigaoka 1 Toyohira Sapporo Hokkaido 062-8555 JapanK Saito Email kjsaitoaffrcgojp GENSCAN RICEHMM FGENESH MZEF ) a splice prediction program ( SPLICEPREDIC- TOR ) homology search analysis programs ( BLAST HMMER TAG Theoretical and Applied Genetics DOI 101007s00122-004-1697-6 Issue Volume 109 Number 4 Date August 2004 Pages 690 - 699 The anthracnose resistance locus Co-4 of common bean is located on chromosome 3 and contains putative disease resistance-related genes M Melotto1 4 M F Coelho1 A Pedrosa-Harand2 J D Kelly3 and L E A Camargo1

1 Departamento de Fitopatologia Laboratoacuterio de Geneacutetica Molecular ESALQ Universidade de Satildeo Paulo Piracicaba SP CP 9 13418-900 Brazil

2 Department of Cell Biology and Genetics Institute of Botany University of Vienna Rennweg 14 Vienna 1030 Austria

3 Department of Crop and Soil Sciences Michigan State University East Lansing MI 48824 USA 4 Present address MSU-DOE Plant Research Laboratory Michigan State University 206 Plant Biology

Building East Lansing MI 48824 USA M Melotto Email melottommsuedu and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (httpwww softberrycom)mdashusing Arabidopsis as the model or- ganism Journal of Genetics Vol 83 No 1 P 79-99 April 2004 Structural and functional analysis of rice genome Tyagi A K Khurana J P Khurana P Raghuvanshi S Gaur A Kapur A Gupta V Kumar D Ravi V Vij S Khurana P and Sharma S Department of Plant Molecular Biology University of Delhi South Campus Benito Juarez Road New Delhi 110 021 India It inte- grates results from several gene prediction software such as GENSCAN (Burge and Karlin 1997) FGENESH (Sala- mov and Solovyev 2000) RiceHMM (Sakata The Plant Cell 161220-1234 (2004) Comparative Analysis of the Receptor-Like Kinase Family in Arabidopsis and Rice Shin-Han Shiua Wojciech M Karlowskib Runsun Panad Yun-Huei Tzengac Klaus F X Mayerb and Wen-Hsiung Lia1 a Department of Ecology and Evolution University of Chicago Chicago Illinois 60637 b Munich Information Center for Protein SequencesInstitute of Bioinformatics GSF National Research Center for Environment and Health Neuherberg 85764 Germany c Department of Mathematics National Tsing Hua University Hsinchu Taiwan 300 d Institute of Information Science Academia Sinica Taiwan 115

1 To whom correspondence should be addressed E-mail whliuchicagoedu fax 773-702-9740 a permissive E value cutoff of 1 The rice genes from the indica subspecies was predicted using the whole genome shotgun assembly with FGENESH (Solovyev 2002 Genome Research 141474ndash1482 (2004) copy by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Incongruent Patterns of Local and Global Genome Size Evolution in Cotton Corrinne E Grover1 HyeRan Kim2 Rod A Wing2 Andrew H Paterson3 and Jonathan F Wendel14 1Department of Ecology Evolution and Organismal Biology Iowa State University Ames Iowa 50011 USA 2Arizona Genomics Institute University of Arizona Tucson Arizona 85721 USA 3Plant Genome Mapping Laboratory University of Georgia Athens Georgia 30602 USA hellip Potential genes were predicted by three independent programs FGENESH (httpwwwsoftberrycom) Plant Physiology May 2004 Vol 135 pp 459-470 Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four Triticeae Genomes Yong Qiang Gu Devin Coleman-Derr Xiuying Kong and Olin D Anderson United States Department of Agriculture-Agricultural Research Service Western Regional Research Center Albany California 94710 (YQG DC-D ODA) and Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081 China (XK) FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology October 2004 Vol 136 pp 3177-3190 Comparative Sequence Analysis of the Region Harboring the Hardness Locus in Barley and Its Colinear Region in Rice1

Katherine S Caldwell2 Peter Langridge and Wayne Powell Scottish Crop Research Institute Invergowrie Dundee DD2 5DA United Kingdom (KSC WP) and School of Agriculture and Wine (KSC PL) and Australian Centre for Plant Functional Genomics (PL) University of Adelaide Waite Campus Glen Osmond South Australia 5064 Australia jp Sakata et al 2002 ) which couples the integration of several programs for the prediction of open reading frames (GENSCAN RiceHMM FGENESH MZEF) with GENES amp DEVELOPMENT 18687-699 2004 pyramus and thisbe FGF genes that pattern the mesoderm of Drosophila embryos

Angelike Stathopoulos1 Bergin Tam1 Matthew Ronshaugen1 Manfred Frasch2 and Michael Levine13 1 Department of Molecular and Cell Biology Division of Genetics amp Development University of California Berkeley California 94720-3204 USA 2 Brookdale Department of Molecular Cell and Developmental Biology Mount Sinai School of Medicine New York New York 10029 USA hellip FGF protein sequences used in alignment and phylogenetic reconstruction were gathered from GenBank or inferred from genomic sequence using GENESCAN (Burge and Karlin 1997 ) and FGENESHhellip Genome Research 141888-1901 2004 Organization and Evolution of a Gene-Rich Region of the Mouse Genome A 127-Mb Region Deleted in the Del(13)Svea36H Mouse Ann-Marie Mallon14 Laurens Wilming24 Joseph Weekes1 James GR Gilbert2 Jennifer Ashurst2 Sandrine Peyrefitte2 Lucy Matthews2 Matthew Cadman1 Richard McKeone1 Chris A Sellick1 Ruth Arkell1 Marc RM Botcherby3 Mark A Strivens1 R Duncan Campbell3 Simon Gregory25 Paul Denny1 John M Hancock16 Jane Rogers2 and Steve DM Brown1 1 Medical Research Council Mammalian Genetics Unit Harwell Oxfordshire United Kingdom 2 Wellcome Trust Sanger Institute Hinxton Genome Campus United Kingdom 3 Medical Research Council Rosalind Franklin Centre for Genomics Research Hinxton Genome Campus United Kingdom hellip Ab initio gene structures were predicted using FGENESH (Salamov and Solovyev 2000 ) and GENSCANhellip Current Proteomics January 2004 vol 1 no 1 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE1 Affiliations 1 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom

sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene Nucleic Acids Research 2003 Vol 31 No 4 1148-1155 Characterization of Arabidopsis thaliana ortholog of the human breast cancer susceptibility gene 1 AtBRCA1 strongly induced by gamma rays S Lafarge and M-H Montaneacute CEA Cadarache DSV-DEVM Laboratoire de Radiobiologie Veacutegeacutetale Bat 185 F-13108 St Paul Lez Durance Cedex France To whom correspondence should be addressed Tel +33 4 42 25 35 56 Fax + 33 4 42 25 26 25 Email marie-helenemontaneceafr Received November 18 2002 Accepted December 5 2002 DDBJEMBLGenBank accession no AF515728 hellipGene structure prediction was done on software implemented on the Softberry web page (httpwwwsoftberrycom) analysis of protein domains using the SMARThellip hellipThe gene structure of At4g21070 was determined with three gene structure prediction software packages (Softberry GenScan Grail) hellip To resolve this ambiguity in intronndashexon prediction we postulated the presence of two genes given by Softberry prediction software and performed northern blotting and 5 RACE to characterize the structural organization of the At4g21070 locushellip Proc Natl Acad Sci U S A 2003 July 22 100(15) 9055ndash9060 doi 101073pnas1032999100 Plant Biology Gene expression of a gene family in maize based on noncollinear haplotypes Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway NJ 08854-8020 Communicated by Brian A Larkins University of Arizona Tucson AZ May 19 2003 (received for review 2002 April 10) To whom correspondence should be addressed E-mail messingmbclrutgersedu The FGENESH program (Softberry Mount Kisco NY) was used for gene prediction analysis BMC Genomics 2003 4 22

doi 1011861471-2164-4-22 Published online 2003 June 3 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams 12 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA 3Department of Biological Sciences Wichita State University Wichita Kansas USA

Corresponding author Sreedhar Oduru odurusreedharttuhscedu Janee L Campbell janeecampbellttuhscedu SriTulasi Karri phrskttuhscedu William J Hendry williamhendrywichitaedu Shafiq A Khan shafiqkhanttuhscedu Simon C Williams simonwilliamsttuhscedu

Two gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=H Insect Molecular Biology Volume 12 Issue 4 Page 319 - August 2003 doi101046j1365-2583200300415x Expression of an Aedes aegypti cation-chloride cotransporter and its Drosophila homologues V Filippov K Aimanova and S S GillAffiliations Department of Cell Biology and Neuroscience University of California Riverside USA Correspondence Sarjeet S Gill 5429 Boyce Hall Environmental Toxicology Graduate Program University of California Riverside CA 92521 USA Tel +1 909 787 4621 Fax +1 909 787 3087 E-mail Sarjeetgillucredu significant similarity to the Drosophila genes were used for gene structure prediction with the FGENESH program available on site http wwwsoftberrycom Developmental Biology 256 (2003) 276ndash289 tcl-2 encodes a novel protein that acts synergistically with Wnt signaling pathways in C elegans Xiaojun Zhaoa Hitoshi Sawab and Michael A Hermana a Program in Molecular Cellular and Developmental Biology Division of Biology Kansas State University Manhattan KS 66506 USA b Laboratory for Cell Fate Decision RIKEN Center for Developmental Biology 2-2-3 Minatojima-minamimachi Chuo-ku Kobe 650-0047 Japan Received for publication 10 September 2002 revised 25 November 2002 accepted 19 December 2002 hellipCbTCL-2 is conceptually translated from a gene predicted by the FGENSH (Salamov and Solovyev 2000 httpwwwsoftberrycom) using defaults for C elegans genomic sequences Proc Natl Acad Sci U S A 2003 May 27 100(11) 6569ndash6574 doi 101073pnas0732024100 Evolution Molecular paleontology of transposable elements in the Drosophila melanogaster genome Vladimir V Kapitonov and Jerzy Jurka

Genetic Information Research Institute 2081 Landings Drive Mountain View CA 94043 Communicated by Margaret G Kidwell University of Arizona Tucson AZ April 7 2003 (received for review 2002 December 23)

To whom correspondence may be addressed E-mail vladimirulamgirinstorg or jurkagirinstorg hellipWe used FGENESH (ref 18 wwwsoftberrycom) for identifying genes encoded by TEs Genetics and Molecular Biology ISSN 1415-4757 versioacuten impresa Genet Mol Biol v26 n4 Satildeo Paulo dic 2003 Iron homeostasis related genes in rice Jeferson GrossI II Ricardo Joseacute SteinII Arthur Germano Fett-NetoI II Janette Palma FettI II

IUniversidade Federal do Rio Grande do Sul Centro de Biotecnologia Porto Alegre RS Brazil IIUniversidade Federal do Rio Grande do Sul Departamento de Botacircnica Porto Alegre RS Brazil IIIBotanical Institute of Ludwig-Maximilians-Universitaumlt Muumlnchen Germany The prediction algorithms were GenScan (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) GenomeScan (Burge and Karlin 1997 httpgenesmitedugenomescanhtml) FGENESH (Salamov and Solovyev 2000 httpwwwsoftberrycomberryphtmltopic= gfind) GeneMarkhmm (Borodovsky and Lukashin unpublished httpopalbiologygatecheduGeneMarkeukhmmcgi) and GrailEXP (Xu and Uberbacher 1997 httpcompbioornlgovgrailexp) Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome J Lai N Dey CS Kim AK Bharti S Rudd KFX Mayer hellip for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for monocotyledonous genes (httpwwwsoftberrycom) Published 23 September 2003 BMC Plant Biology 2003 36 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya12 Address 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia and 2NSW Agricultural Genomics Centre Wagga Wagga Australia Email Qian-Hao Zhu - qianhaozhucsiroau Mohammad Shamsul Hoque - mohammadhoquecsiroau Elizabeth S Dennis - lizdenniscsiroau Narayana M Upadhyaya - narayanaupadhyayacsiroau Corresponding author hellip Analyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCANhellip

The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Australasian Plant Pathology Volume 32 Number 4 2003 pp 511-519 Small scale functional genomics of the blackleg fungus Leptosphaeria maculans analysis of a 38 kb region Alexander Idnurm Janet L Taylor M Soledade C Pedras and Barbara J Howlett vertebrate and Arabidopsis settings Burge and Karlin 1997) and FGENESH on Neurospora crassa and Schizosaccharomyces pombe settings (wwwsoftberrycom) as Barley Genetics Newsletter Volume 32 Hard-copy edition pages 34 - 37 MAPPING AND SEQUENCING OF THE BARLEY PUTATIVE HYPERSENSITIVE INDUCED REACTION GENES Nils Rostoks1 David Kudrna1 and Andris Kleinhofs12

1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 2 School of Molecular Biosciences Washington State University Pullman WA 99164 The full length coding sequence was reconstructed using a combination of FGENESH gene prediction program (httpwwwsoftberrycom) and alignment with cDNAs from the other barley HIR groups TAG Theoretical and Applied Genetics DOI 101007s00294-003-0391-6 Issue Volume 43 Number 5 Date August 2003 Pages 351 - 357 Characterisation of the mating-type locus of the plant pathogenic ascomycete Leptosphaeria maculans Anton J Cozijnsen A1 and Barbara J Howlett A1 A1 School of Botany The University of Melbourne 3010 Victoria Australia hellipGenes introns exons and transcription initiation sites were predicted by analysis with FGENESH (wwwsoftberrycom) on Neurospora crassa andhellip BMC Plant Biol 2003 3 6

doi 1011861471-2229-3-6 Published online 2003 September 23 Received May 23 2003 Accepted September 23 2003 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya 12 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia 2NSW Agricultural Genomics Centre Wagga Wagga Australia

Corresponding author Qian-Hao Zhu qianhaozhucsiroau Mohammad Shamsul Hoque mohammadhoquecsiroau Elizabeth S Dennis lizdenniscsiroau Narayana M Upadhyaya narayanaupadhyayacsiroau hellipAnalyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml identified a single-exon gene capable of encoding a protein with the DNA binding domain of the EREBPAP2 family of plant transcription factors [2636] 1515 bp downstream from the Ds insertion The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Alignment of EREBPAP2 domains was performed using programs of Genetics Computer Group Wisconsin software suit [11] Genetics Vol 164 655-664 June 2003 Copyright copy 2003 Map-Based Cloning of Leaf Rust Resistance Gene Lr21 From the Large and Polyploid Genome of Bread Wheat Li Huanga Steven A Brooksa Wanlong Lia John P Fellersb Harold N Tricka and Bikram S Gilla a Wheat Genetics Resource Center Department of Plant Pathology Kansas State University Manhattan Kansas 66506-5502 b USDA-ARS Plant Science and Entomology Unit Kansas State University Manhattan Kansas 66506-5502 Corresponding author Bikram S Gill 4024 Throckmorton Kansas State University Manhattan KS 66506-5502 bsgksuedu (E-mail hellipIn addition FGENSH 11 (httpwwwsoftberrycom) was used for gene prediction (with monocot genomic DNA parameters) Nucleic Acids Research 2003 Vol 31 No 1 229-233 The TIGR rice genome annotation resource annotating the rice genome and creating resources for plant biologists Qiaoping Yuan Shu Ouyang Jia Liu Bernard Suh Foo Cheung Razvan Sultana Dan Lee John Quackenbush and C Robin Buell The Institute for Genomic Research 9712 Medical Center Dr Rockville MD 20850 USA To whom correspondence should be addressed Tel +1 301 8383558 Fax +1 301 8380208 Email rbuelltigrorg Received August 14 2002 Revised and Accepted October 2 2002 hellipThe rice sequences were processed with multiple ab initio gene finders including FGENESH (httpwwwsoftberrycom)hellip hellip Working models were generated using the FGENESH output and putative identification for the gene was obtained from the most significant database match while models with no significant database match were labeled as hypothetical proteins

JXB Advance Access originally published online on June 18 2003 Journal of Experimental Botany Vol 54 No 389 pp 1995-1996 August 1 2003 Received 21 April 2003 Accepted 25 April 2003 OsSET1 a novel SET-domain-containing gene from rice Yun-Kuan Liang Ying Wang Yong Zhang Song-Gang Li Xiao-Chun Lu Hong Li Cheng Zou Zhi-Hong Xu and Shu-Nong Bai PKU-Yale Joint Research Center of Agricultural and Plant Molecular Biology National Key Laboratory of Protein Engineering and Plant Gene Engineering College of Life Sciences Peking University 5 Yiheyuan Road Beijing 100871 PR China To whom correspondence should be addressed Fax +86 10 6275 1526 E-mail shunongbpkueducn It localizes at chromosome three in rice genome at the contig 1300 (httpwwwsoftberrycomberryphtmltopic=gfindampprg=FGENESH GenBank accession number BMC Genomics 2003 422 Published 3 June 2003 Received 31 January 2003 Accepted 3 June 2003 This article is available from httpwwwbiomedcentralcom1471-2164422 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams12 Address 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA and 3Department of Biological Sciences Wichita State University Wichita Kansas USA Email Sreedhar Oduru - odurusreedharttuhscedu Janee L Campbell - janeecampbellttuhscedu SriTulasi Karri - phrskttuhscedu William J Hendry - williamhendrywichitaedu Shafiq A Khan - shafiqkhanttuhscedu Simon C Williams - simonwilliamsttuhscedu Corresponding author hellipTwo gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=HsapiensTAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Characterization of soybean genomic features by analysis of its expressed sequence tags

Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

Jin-Song Zhang Email jszhanggeneticsaccn

Shou-Yi Chen Email sychengeneticsaccn Phone +86-10-64886859 Fax +86-10-64873428

(1) Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

(2) Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China

prediction of these BAC-contig sequences was based on the gene-prediction program FGENSH (Arabidopsis matchFGENESH DDT Vol 7 No 11 (Suppl) 2002 S70-S76 wwwdrugdiscoverytodaycom Genome annotation techniques new approaches and challenges Alistair G Rust Emmanuel Mongin and Ewan Birney European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridge UK CB10 1SD tel +44 1223 494420 fax +44 1223 494468 e-mail birneyebiacuk Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Box 1 Useful human genome annotation and browser URLs Human genome browsers bull UCSC Human Genome Browser httpgenomecseucsceducgi-binhgGateway bull Softberry Genome Explorer httpwwwsoftberrycomberryphtmltopic=genomexp Ab initio gene prediction programs Ab initio gene predictors rely on the statistical qualities of exons rather than on homologies Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Celerarsquos pipeline) and GrailEXP [16] (ORNL) Proc Natl Acad Sci U S A 2002 August 20 99(17) 11423ndash11428 doi 101073pnas162276199 Neurobiology Identification of G protein-coupled receptors for Drosophila PRXamide peptides CCAP corazonin and AKH supports a theory of ligand-receptor coevolution Yoonseong Parkdagger Young-Joon Kim and Michael E AdamsdaggerDagger

Departments of Entomology and daggerCell Biology and Neuroscience 5429 Boyce Hall University of California Riverside CA 92521 Edited by Lutz Birnbaumer National Institutes of Health Research Triangle Park NC and approved June 14 2002 (received for review 2002 May 7) DaggerTo whom reprint requests should be addressed E-mail adamsmailucredu

This article has been corrected See Proc Natl Acad Sci U S A 2002 October 15 99(21) 13961b For each Drosophila GPCR prediction of gene structure was made in FGENESH (wwwsoftberrycom ref 21) by using about 20 kb of genomic sequence surrounding highly conserved regions particularly for 5 prime and 3 prime ends of ORFs Putative Drosophila GPCRs in the database were amplified by RT-PCR using primers based on gene predictions in the FGENESH gene finder (wwwsoftberrycom ref 21) 21 Salamov A A amp Solovyev V V (2000) Genome Res 10 516-522

prediction httpwwwsoftberrycomberry

Eukaryotic Cell October 2002 p 719-724 Vol 1 No 5 Isocitrate Lyase Is Essential for Pathogenicity of the Fungus Leptosphaeria maculans to Canola (Brassica napus) Alexander Idnurm and Barbara J Howlett School of Botany The University of Melbourne Melbourne Victoria 3010 Australia Received 17 June 2002 Accepted 29 July 2002 hellip The DNA sequence obtained was compared to those in the GenBank database by using BLAST (1) and genes were predicted by using FGENESH software (httpwwwsoftberrycom) and GENSCAN (wwwbionavigatorcom) Bio-Almanac GLE ToxExpress P Offering CLG Annotated sequence data The genes are identified with the FGENESH11 gene modeling software exclusively li- censed from Softberry Inc Automatic Analysis of 106 kb of contiguous DNA sequence from the D genome of wheat reveals high gene density hellip SA Brooks L Huang BS Gill JP Fellers trix In addition FGENESH 11 (httpwwwsoftberrycom) was used for CDS prediction with monocot genomic DNA parameters Both Molecular Genetics and Genomics DOI 101007s00438-002-0706-1 Issue Volume 267 Number 6 Date August 2002 Pages 713 - 720 Genome sequencing of a 239-kb region of rice chromosome 10L reveals a high frequency of gene duplication and a large chloroplast DNA insertion Q Yuan J Hill J Hsiao K Moffat S Ouyang Z Cheng J Jiang C Buell A1 The Institute for Genomic Research 9712 Medical Center Drive Rockville MD 20850 USA A2 Department of Horticulture University of Wisconsin Madison WI 53706 USA The sequences were analyzed with several gene prediction programs including FGENESH (httpwwwsoftberrycom) Genemarkhmm (rice matrix httpopalbiology

Genetics Vol 162 1389-1400 November 2002 Copyright copy 2002 Different Types and Rates of Genome Evolution Detected by Comparative Sequence Analysis of Orthologous Segments From Four Cereal Genomes Wusirika Ramakrishnaa Jorge Dubcovskyb Yong-Jin Park1a Carlos Bussob John Embertona Phillip SanMiguelc and Jeffrey L Bennetzena a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Department of Agronomy and Range Science University of California Davis California 95616 c Purdue University Genomics Core WSLR Purdue University West Lafayette Indiana 47907

Corresponding author Jeffrey L Bennetzen Hansen Bldg Purdue University West Lafayette IN 47907 maizebilbobiopurdueedu (E-mail)

FGENESH (httpwwwsoftberrycomnucleohtml) with the maize training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and

GeneMarkhmm (httpgenemarkbiologygatecheduGene Mark) Functional amp Integrative Genomics DOI 101007s10142-002-0055-5 Issue Volume 2 Numbers 1-2 Date May 2002 Pages 51 - 59 Genomic sequencing reveals gene content genomic organization and recombination relationships in barley Nils Rostoks Yong-Jin Park Wusirika Ramakrishna Jianxin Ma Arnis Druka Bryan A Shiloff Phillip J SanMiguel Zeyu Jiang Robert Brueggeman Devinder Sandhu Kulvinder Gill Jeffrey L Bennetzen Andris Kleinhofs A1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA A2 Department of Biological Sciences Purdue University West Lafayette IN 47907 USA A3 National Center for Genome Resources 2935 Rodeo Park Drive East Santa Fe NM 87505 USA A4 G302 Agronomy Hall Iowa State University Ames IA 50011-1010 USA A5 Department of Agronomy University of Nebraska Lincoln NE 68583 USA A6 School of Molecular Biosciences and Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA version 10 with maize parameters The FGENESH predictions were run at httpwwwsoftberrycom BAC genomic regions were defined Structural organization of the barley D-hordein locus in comparison with its orthologous regions of hellip YQ Gu OD Anderson CF Londeore X Kong RN hellip et al 1997) to search for additional genes In addition FGENESH (httpwwwsoftberrycomberryphtml) and GENESCAN (httpgenes

Published online before print June 20 2002 101073pnas142284999 PNAS | July 9 2002 | vol 99 | no 14 | 9328-9333 The barley stem rust-resistance gene Rpg1 is a novel disease-resistance gene with homology to receptor kinases R Brueggeman N Rostoks D Kudrna A Kilian F Han J Chensect A Druka B Steffensonpara and A Kleinhofs Department of Crop and Soil Sciences Washington State University Pullman WA 99164-6420 para Department of Plant Pathology 495 Borlaug Hall 1991 Upper Buford Circle St Paul MN 55108-6030 and School of Molecular Biosciences Washington State University Pullman WA 99164-4234 Communicated by Diter von Wettstein Washington State University Pullman WA May 13 2002 (received for review March 25 2002) The gene prediction programs GENSCAN (httpgenesmiteduGENSCANhtml) and FGENESH (httpwwwsoftberrycom) as well as NEURAL NETWORK PROMOTER

PREDICTION (httpwwwfruitflyorgseq_toolspromoterhtml) localized the putative transcription start site of the gene about 400 bp upstream of the translation start site Plant Physiol 2002 December 130(4) 1626ndash1635 doi 101104pp012179 Received July 30 2002 Accepted October 1 2002 Contiguous Genomic DNA Sequence Comprising the 19-kD Zein Gene Family from Maize1

Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway New Jersey 08854ndash8020 Corresponding author e-mail messingmbclrutgersedu fax 732ndash445ndash0072 Draft sequences generated from high-throughput DNA sequencing (phase II) were subjected to gene prediction programs with FGENESH (Softberry Inc Mount Kisco NY) The Plant Cell Vol 14 3213-3223 December 2002 Copyright copy 2002 Received July 22 2002 accepted September 26 2002 Structural Analysis of the Maize Rp1 Complex Reveals Numerous Sites and Unexpected Mechanisms of Local Rearrangement Wusirika Ramakrishnaa John Embertona Matthew Ogdena Phillip SanMiguelb and Jeffrey L Bennetzen1a a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 1 To whom correspondence should be addressed E-mail maizebilbobiopurdueedu fax 765-496-1496 FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCAN Plant Physiol 2002 December 130(4) 1728ndash1738

doi 101104pp014951 Comparative Sequence Analysis of the Sorghum Rph Region and the Maize Rp1 Resistance Gene Complex Wusirika Ramakrishna John Emberton Phillip SanMiguel Matthew Ogden Victor Llaca Joachim Messing and Jeffrey L Bennetzen

Department of Biological Sciences Purdue University West Lafayette Indiana 47907 (WR JE MO JLB) Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 (PSM) and Waksman Institute Rutgers University Piscataway New Jersey 08854 (VL JM) Corresponding author e-mail maizebilbobiopurdueedu fax 765ndash496ndash1496 Received September 19 2002 Accepted October 8 2002 hellipAnnotation and sequence analysis were performed as described earlier (Dubcovsky et al 2001 Song et al 2001 Ramakrishna et al 2002a ) FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and GeneMarkhmm (httpopalbiologygatecheduGeneMarkeukhmmcgi) Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four hellipYQ Gu D Coleman-Derr X Kong OD Anderson FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology 2004 - plantphysiolorg - plantphysiolorg - intlplantphysiolorg - ncbinlmnihgov - all 5 versions raquoA Genome-Wide Screen Identifies Genes Required for Centromeric Cohesion JJ Doyle J Denarie F Debelle JC Prome BB Amor hellip 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberryphtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Various programs Plant Molecular Biology Issue Volume 58 Number 3 Date June 2005 Pages 421 ndash 433 DOI 101007s11103-005-5702-5 OsPPR1 a pentatricopeptide repeat protein of rice is essential for the chloroplast biogenesis Kodiveri M Gothandam1 Eun-Sook Kim1 Hongjoo Cho1 and Yong-Yoon Chung1

(1) School of Life Sciences and Biotechnology Korea University Sungbuk-ku 136-701 Seoul Anam-Dong Korea nucleotide and amino acid sequences were analyzed by the Basic Local Alignment SearchTool (BLAST) and the Soft berry prog- rame (httpwwwsoftberrycom)

Plant Physiol February 2002 Vol 128 pp 336-340 wwwplantphysiolorgcgidoi101104pp010875 Received September 25 2001 returned for revision September 29 2001 accepted November 2 2001 Cellulose Synthase-Like Genes of Rice1

Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 The Rice Genome Research Program cDNA clones were of high quality all but one were viable and accurately annotated The one exception D22177 was chimeric containing OsCSLA2 at one end and a predicted DNA-binding protein at the other For all sequences the corresponding

proteins were deduced using gene prediction software from GeneMark (Atlanta httpopalbiologygatecheduGeneMark) and Softberry Inc (White Plains NY httpwwwsoftberrycom) and by manual alignment with the Arabidopsis Csl proteins and with each other IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated Silby et al Microbiology2004 150 518-520 MW Silby PB Rainey SB Levy Using SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of Plant Cell 2002 September 14(9) 2107ndash2119 Received March 25 2002 Accepted June 14 2002 Two Novel Fungal Virulence Genes Specifically Expressed in Appressoria of the Rice Blast Fungus

Chaoyang Xuea Gyungsoon Parka Woobong Choib Li Zhengc Ralph A Deanb and Jin-Rong Xua1

aDepartment of Botany and Plant Pathology Purdue University West Lafayette Indiana 47907 bDepartment of Plant Pathology North Carolina State University Raleigh North Carolina 27606 cSyngenta Agribusiness Biotechnology Research Inc Research Triangle Park North Carolina 27709 1To whom correspondence should be addressed E-mail xubtnypurdueedu fax 765-494-0363 Approximately 12- and 14-kb upstream sequences of GAS1 and GAS2 were sequenced and analyzed with several programs including TRES (wwwbioportalbicnusedusgtres) Expasy (wwwexpasyorg) and SoftBerry (wwwsoftberrycom) European Journal of Neuroscience January 2002 vol 15 no 1 pp 79-86(8) Characterizing CGI-94 (comparative gene identification-94) which is down-regulated in the hippocampus of early stage Alzheimers disease brain Heese K Nakayama T Hata R Masumura M Akatsu H Li F Nagai Y Yamamoto T Kosaka K Suemoto T Sawada T Additionally protein sequence analysis was performed using the following programs at ExPASy httpwwwexpasych softberry httpwwwsoftberrycomindex Journal of Cellular Biochemistry Volume 91 Issue 5 Pages 1030 - 1042 Published Online 25 Feb 2004 Copyright copy 2004 Wiley-Liss Inc A Wiley Company Received 4 September 2003 Accepted 21 November 2003 Characterizing the new transcription regulator protein p60TRP K Heese 1 T Yamada 1 H Akatsu 2 T Yamamoto 2 K Kosaka 2 Y Nagai 1 T Sawada 1

1BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan 2Choju Medical Institute Fukushimura Hospital 19-14 Aza-Yamanaka Noyori Toyohashi Aichi 441-8124 Japan email K Heese (heeseksilverocnnejp) Correspondence to K Heese BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan expasych) softberry httpwwwsoftberry comindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorg toolsaacomp Genome Research 14929-933 2004 ISSN 1088-9051 $500 The Ensembl Core Software Libraries Arne Stabenau1 Graham McVicker1 Craig Melsopp1 Glenn Proctor1 Michele Clamp2 and Ewan Birney13 1 EMBL European Bioinformatics Institute Wellcome Trust Genome Campus Hinxton CB10 1SD UK 2 The Broad Institute Cambridge Massachusetts 02141-2023 USA

2003 ) Finally there have been several commercial genome management products based on proprietary technology from Softberry Celera and Doubletwist Generation of T-DNA tagging lines with a bidirectional gene trap vector 2 and the establishment of hellip G An annotated in the public databases we undertook 274 annotation with the Softberry program (http 275 wwwsoftberrycomberryphtml) Functional clas- 276 Journal of Bacteriology January 2002 p 183-190 Vol 184 No 1 0021-919301$0400+0 DOI 101128JB1841183-1902002 Received 25 July 2001 Accepted 11 October 2001 Regulation of the acuF Gene Encoding Phosphoenolpyruvate Carboxykinase in the Filamentous Fungus Aspergillus nidulans Michael J Hynes Oliver W Draht and Meryl A Davis Department of Genetics University of Melbourne Parkville Victoria 3010 Australia The Protein Sequence Analysis program (httpwwwsoftberrycomproteinhtml) predicted a PEPCK (ATP) signature sequence between amino acids 275 and 290 Proteins Structure Function and GeneticsVolume 53 Issue S6 Pages 352 - 368 Supplement Fifth Meeting on the Critical Assessment of Techniques for Protein Structure Prediction Published Online 15 Oct 2003 Received 4 March 2003 Accepted 23 June 2003 Digital Object Identifier (DOI) 101002prot10543 Comp Modeling Assessment Assessment of homology-based predictions in CASP5 Anna Tramontano 1 Veronica Morea 2

1Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Rome Italy 2CNR Institute of Molecular Biology and Pathology University of Rome La Sapienza Rome Italy email Anna Tramontano (AnnaTramontanouniromalit) Correspondence to Anna Tramontano Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Ple Aldo Moro 5-00185 Rome Italy hellipPage 1 Assessment of Homology-Based Predictions in CASP5 Anna Tramontano 1 and Veronica Morea 2 1 Department of Biochemical Sciences American Journal of HematologyVolume 73 Issue 3 Pages 161 - 168 Published Online 20 Jun 2003 Received 23 October 2002 Accepted 15 April 2003 Digital Object Identifier (DOI) 101002ajh10358

Spectrum of thalassemia mutations and HbF levels in the heterozygous Moroccan population Wafaa Lemsaddek 1 Isabel Picanccedilo 2 Filomena Seuanes 2 Lahoucine Mahmal 3 Saacircd Benchekroun 3 Mohammed Khattab 4 Paulo Nogueira 5 Leonor Osoacuterio-Almeida 1

1Laboratoacuterio de Geneacutetica Molecular Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa Caparica Portugal 2Laboratoacuterio de Hematologia Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal 3Service Heacutemato-Oncologie Hocircpital 20 Ao t CHU Ibn Rochd Casablanca Morocco 4Service Heacutemato-Oncologie Peacutediatrique Hocircpital dEnfants CHU Rabat Morocco 5Observatoacuterio Nacional de Sauacutede Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal email Leonor Osoacuterio-Almeida (mlofctunlpt) Correspondence to Leonor Osoacuterio-Almeida Laboratoacuterio de Geneacutetica Molecular Secccedilatildeo Autoacutenoma de Biotecnologia Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa 2829-516 Caparica Portugal hellip Page 1 Spectrum of Thalassemia Mutations and HbF Levels in the Heterozygous Moroccan Population Wafaa Lemsaddek 1 Isabel Picanccedilo Lecture Notes in Computer Science Publisher Springer-Verlag Heidelberg ISSN 0302-9743 Subject Computer Science Volume 2812 2003 Title Algorithms in Bioinformatics Third International Workshop WABI 2003 Budapest Hungary September 15-20 2003 Proceedings Editors Gary Benson Roderic Page ISBN 3-540-20076-2 DOI 101007b13243 Chapter pp 124 - 138 Online Date December 2003 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron1 and Irena Rusu1

(1) IRIN Universiteacute de Nantes 2 Rue de la Houssiniegravere BP 92208 44322 Nantes Cedex 3 France Page 1 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron and Irena Rusu Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Received 23 October 2003 accepted 15 December 2003 Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes

H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Brain Aging Vol 2 No 3 2002 Page 9-22 Long-term Efficacy of Cholinesterase Inhibitors Serge Gauthier McGill Centre for Studies in Aging Quebec Canada Correspondence Dr Serge Gauthier FRCPC McGill Centre for Studies in Aging 6825 LaSalle Boulevard Verdun Quebec Canada H4H 1R3 Tel +1 514 766 2010 Fax +1 514 888 4050 Email sergegauthiermcgillca hellipAdditionally protein sequence analysis was performed using the following programs at the ExPASyndashwwwndashserver (httpwwwexpasych) softberry httpwwwsoftberrycomindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorgtoolsaacomp Plant Physiology December 2003 Vol 133 pp 2040ndash2047 Generation and Analysis of End Sequence Database for T-DNA Tagging Lines in Rice1 Suyoung An2 Sunhee Park2 Dong-Hoon Jeong Dong-Yeon Lee Hong-Gyu Kang Jung-Hwa Yu Junghe Hur Sung-Ryul Kim Young-Hea Kim Miok Lee Soonki Han Soo-Jin Kim Jungwon Yang Eunjoo Kim Soo Jin Wi Hoo Sun Chung Jong-Pil Hong Vitnary Choe Hak-Kyung Lee Jung-Hee Choi Jongmin Nam Seong-Ryong Kim Phun-Bum Park Ky Young Park Woo Taek Kim Sunghwa Choe Chin-Bum Lee and Gynheung An National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790ndash784 Korea (SA SP D-HJ D-YL H-GK J-HY JH S-RK Y-HK ML GA) Department of Life Science Sogang University Seoul 121ndash742 Korea (SH S-JK S-RK) Department of Genetic Engineering Suwon University Suwon 445ndash743 Korea (JY EK P-BP) Department of Biology Sunchon National University Sunchon 540ndash742 Korea (SJW KYP) Department of

Biology Yonsei University Seoul 120ndash749 Korea (HSC J-PH WTK) Department of Biology Seoul National University Seoul 151ndash747 Korea (VC SC) Department of Biology Dong-eui University Pusan 614ndash714 Korea (H-KL J-HC C-BL) and Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University 208 Mueller Laboratory University Park Pennsylvania 16802 (JN) hellipIf a particular sequence had not yet been annotated in the public database the sequence surrounding the insertion site was annotated using the Softberry program (httpwwwsoftberrycom) and the GeneMark program (httpopalbiologygatecheduGeneMark) Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie F Moehrlen ndash Page 1 Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie Frank Moumlhrlen Heidelberg 2002 Page 2 INAUGURAL-DISSERTATION zur heidiubuni-heidelbergde

  • FGENES
  • FGENESH
  • FGENESH++
    • Genome Research 15566-576 2005
    • ECgene Genome-based EST clustering and gene modeling for alternative splicing
    • Genome Research 14685-692 2004
    • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
    • Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes
    • Genome Research
      • Computing Center Academia Sinica Taipei 11529 Taiwan
        • FGENESH+
          • Genome Research 14685-692 2004
          • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
            • Computing Center Academia Sinica Taipei 11529 Taiwan
                • FGENESB
                  • Proteorhodopsin genes are distributed among divergent marine bacterial taxa
                  • Different SAR86 subgroups harbour divergent proteorhodopsins
                    • FGENESV
                      • Genome Organization of the SARS-CoV
                        • FGENES-M
                        • BESTORF
                        • PROTCOMP
                          • A proteomic study of the arabidopsis nuclear matrix
                          • Gene prediction in eukaryota
                            • BPROM
                            • SPLICEDB
                              • The evolving roles of alternative splicing
                              • SpliceDB database of canonical and non-canonical mammalian splice sites
                                • SCAN2
                                  • PromH promoters identification using orthologous genomic sequences
                                    • PDISORDER
                                    • SPL
                                    • NSITE
                                    • TSSP
                                    • PLANTPROM
                                      • Plant promoter prediction with confidence estimation
                                      • PlantProm a database of plant promoter sequences
                                        • PROMH
                                        • Other FGENESH
                                          • TAG Theoretical and Applied Genetics
                                            • MIPS analysis and annotation of proteins from whole genomes
                                              • Gene expression of a gene family in maize based on noncollinear haplotypes
                                              • TAG Theoretical and Applied Genetics
                                              • Genome annotation techniques new approaches and challenges
                                                • Various programs
                                                  • Characterizing the new transcription regulator protein p60TRP
                                                  • The Ensembl Core Software Libraries
                                                  • Lecture Notes in Computer Science
                                                  • Long-term Efficacy of Cholinesterase Inhibitors
Page 12: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)

Lubbock Texas 79430 USA 2Department of Internal Medicine Texas Tech University Health Sciences Center Lubbock Texas 79430 USA 3Southwest Cancer Center at University Medical Center Lubbock Texas 79430 USA These results were compared against genes assembled by two gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmit Plant Physiology July 2005 Vol 138 pp 1205-1215 Complex Organization and Evolution of the Tomato Pericentromeric Region at the FER Gene Locus1[w] Romain Guyot Xudong Cheng Yan Su Zhukuan Cheng Edith Schlagenhauf Beat Keller and Hong-Qing Ling

2 2

Institute of Plant Biology University of Zurich 8008 Zurich Switzerland (RG ES BK H-QL) and Institute of Genetics and Developmental Biology Chinese Academy of Sciences Chaoyang District Beijing 100101 China (XC YS ZC H-QL) Putative genes were determined by a combination of coding region prediction software (GENSCAN FGENESH and MZEF with Arabidopsis andor monocot matrix J Gen Virol 86 (2005) 973-983 DOI 101099vir080833-0 Cloning characterization and analysis by RNA interference of various genes of the Chelonus inanitus polydnavirus Marianne Bonvin Dorothee Marti Stefan Wyder Dejan Kojic Marc Annaheim and Beatrice Lanzrein Institute of Cell Biology University of Berne Baltzerstrasse 4 CH-3012 Bern Switzerland Correspondence Beatrice Lanzrein beatricelanzreinizbunibech 12g1forw (5-GAGTCCATGCCGAATGTCAC-3) and 12g1rev (5-CTTCTTGCACAGCGACGAAC-3) were set to amplify the middle region of 12g1 as predicted with FGENESH 10 and The Plant Cell 17343-360 (2005) Evolution of DNA Sequence Nonhomologies among Maize Inbreds Stephan Brunner Kevin Fengler Michele Morgante Scott Tingey and Antoni Rafalski a1 a b a a

a DuPont Crop Genetics Research Wilmington Delaware 19880-353 b Universitaacute degli Studi di Udine Dipartimento di Scienze Agrarie ed Ambientali 33100 Udine Italy 1 To whom correspondence should be addressed E-mail stephanbrunnercgrdupontcom fax 302-695-2726 PNAS | February 1 2005 | vol 102 | no 5 | 1566-1571 Published online before print January 24 2005 101073pnas0409421102 A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome

Mark Yandell Adina M Bailey Sima Misra ShengQiang Shu Colin Wiel Martha Evans-Holm Susan E Celniker and Gerald M Rubin para para Howard Hughes Medical Institute and Department of Molecular and Cell Biology University of California Life Sciences Addition Berkeley CA 94720-3200 and paraDepartment of Genome Sciences Lawrence Berkeley National Laboratory One Cyclotron Road Mailstop 64-121 Berkeley CA 94720 genes based on a microarray-based approach that involved hybridizing randomly primed cDNA against probes corresponding to a large set of FGENESH predictions Insect Molecular Biology Volume 14 Issue 2 Page 113 - 119 April 2005 doi101111j1365-2583200400536x Detection and analysis of alternative splicing in the silkworm by aligning expressed sequence tags with the genomic sequence X-F Zha Q-Y Xia P Zhao J Li J Duan Z-L Wang J-F Qian and Z-H Xiang Correspondence Dr Qing-You Xia The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China Tel +86 23 68250748 fax +86 23 68251128 e-mail xiaqyswaucqcn previously predicted silkworm genes in the genomic sequences by BGF a newly developed program based on GENSCAN (Burge amp Karlin 1997) and Fgenesh (Salamov amp Microbiology 151 (2005) 2199-2207 DOI 101099mic027962-0 Overproduction purification and characterization of FtmPT1 a brevianamide F prenyltransferase from Aspergillus fumigatus Alexander Grundmann and Shu-Ming Li Pharmazeutische Biologie Pharmazeutisches Institut Eberhard-Karls-Universitaumlt Tuumlbingen Auf der Morgenstelle 8 72076 Tuumlbingen Germany Correspondence Shu-Ming Li shumingliuni-tuebingende FGENESH (Softberry Inc httpwwwsoftberrycomberryphtml) and the DNASIS software package (version 21 Hitachi Software Engineering) were used for

FGENESH++ Am J Hum Genet 76652-662 2005 Position Effects Due to Chromosome Breakpoints that Map 900 Kb Upstream and 13 Mb

Downstream of SOX9 in Two Patients with Campomelic Dysplasia

Gopalrao V N Velagaleti12 Gabriel A Bien-Willner3 Jill K Northup1 Lillian H Lockhart2

Judy C Hawkins2 Syed M Jalal6 Marjorie Withers3 James R Lupski345 and

Pawel Stankiewicz3 Departments of 1Pathology and 2Pediatrics University of Texas Medical Branch Galveston

Departments of 3Molecular and Human Genetics and 4Pediatrics Baylor College of Medicine and

5Texas Childrens Hospital Houston and 6Department of Laboratory Medicine and Pathology Mayo Clinic Rochester MN hellipIn an effort to identify possible transcripts that may be responsible for the CD phenotype we

used several gene-prediction programs and identified seven hypothetical transcripts in the region

that spans 100 kb in either direction from the breakpoint on chromosome 17 Ecgenes H17C123061 and H17C123081 SGP genes Chr17_15381 and Ch17_15391 Fgenesh++ gene C17001650 and Genscan genes NT_01064144 and NT_01064145hellip Genome Research 15566-576 2005 ECgene Genome-based EST clustering and gene modeling for alternative splicing Namshin Kim Seokmin Shin and Sanghyuk Lee12 2 13 1 Division of Molecular Life Sciences Ewha Womans University Seoul 120-750 Korea 2 School of Chemistry Seoul National University Seoul 151-747 Korea hellipthe structure of full-length mRNA can be inferred by examining the flanking genomic region especially with the aid of ab initio gene predicting programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh++ (Salamov and Solovyev 2000 )hellip Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA hellip we built a three-way synteny map based on chains of Fgenesh++-predicted (Solovyev 2002 ) exons rather than whole genes hellip Genome Research 14539-548 2004 Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of

Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA hellipThe gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom )hellip Nucleic Acids Research 2003 Vol 31 No 1 207-211 copy 2003 Oxford University Press The PEDANT genome database Dmitrij Frishman Martin Mokrejs Denis Kosykh Gabi Kastenmuumlller Grigory Kolesov Igor Zubrzycki Christian Gruber Birgitta Geier Andreas Kaps Kaj Albermann Andreas Volz Christian Wagner Matthias Fellenberg Klaus Heumann and Hans-Werner Mewes

1 1 1 1 1

1 2 2 2 2

2 2 2 2 13

1 Institute for Bioinformatics GSF - National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 85764 Neueherberg Germany 2 Biomax Informatics AG Lochhamer Straszlige 11 82152 Martinsried Germany 3 Department of Genome-oriented Bioinformatics Wissenschaftszentrum Weihenstephan Technische Universitaumlt Muumlnchen 85350 Freising Germany To whom correspondence should be addressed Tel +49 89 31874201 Fax +49 89 31873585 Email dfrishmangsfde The mouse database contains 20 chromosome contigs with 37 793 genes predicted using the Fgenesh++ software (wwwsoftberrycom) Reprint from Daily Biotech Updates www genengnewscom Vol 22 No 17 October 1 2002 DrugDiscovery Tech NoteAn Enhanced Human-Genome Database Transforming Raw Human Sequence Data Into Useful Information Christine Schuumlller PhD and Andreas Fritz PhD The Softberry analysis results for which Biomax has the exclusive world-wide commercial license contain approximately 40000 genes which agrees well with predictions of the total number of human genes (according to the International Human Genome Sequencing Consortium or IHGSC) hellip For example 50 of the genes in the Biomax Human Genome Database are not found in the Ensembl database These genes (identified by FGENESH++ and Biomax and not found in Ensembl database) comprise the following 6 of genes classified as known genes 50 classified as having some similarity to known genes and 90 of the genes not having similarity to known genes

For human genome applications the FGENESH++ software was first used to map known human genes using sequences available from the Reference Sequence (RefSeq) Project at the Nation al Center for Biotechnology Information (NCBI Bethesda MD wwwncbinlm nihgovLocusLinkrefseqhtml) REFERENCES 1 Salamov AA and Solovyev VVAb initio gene finding in Drosophila genomic DNA Genome Res 10 391ndash7 (2000) Genome Research 14539-548 2004 ISSN 1088-9051 $500 Letter Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA The gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom ) Published online before print June 12 2003 101101gr529803 Genome Research 131765-1774 2003 ISSN 1088-9051 $500 Identification of Promoter Regions in the Human Genome by Using a Retroviral Plasmid Library-Based Functional Reporter Gene Assay Shirin Khambata-Ford15 Yueyi Liu2 Christopher Gleason1 Mark Dickson3 Russ B Altman2 Serafim Batzoglou4 and Richard M Myers136 1 Department of Genetics Stanford University School of Medicine Stanford California 94305 USA 2 Stanford Medical Informatics Stanford University School of Medicine Stanford California 94305 USA 3 Stanford Human Genome Center Stanford University School of Medicine Stanford California 94305 USA 4 Department of Computer Science Stanford University Stanford California 94305 USA hellipOf 858 sequences 9 of GFP+ low clones and 8 of GFP+ high clones aligned to the 2-kb

segment upstream of the transcription start site of a predicted gene in at least two of four data sets of predicted genes from Genscan Ensembl Softberry (Fgenesh++) and Acembly (category

B in Table 1) Cell Vol 110 521ndash529 August 23 2002 Copyright 2002 by Cell Press HIV-1 Integration in the Human Genome Favors Active Genes and Local Hotspots

Astrid RW Schrouml der1 Paul Shinn2 Huaming Chen2 Charles Berry3 Joseph R Ecker2 and Frederic Bushman14 1Infectious Disease Laboratory 2Genomic Analysis Laboratory The Salk Institute 10010 North Torrey Pines Road La Jolla California 92037 3Department of FamilyPreventive Medicine School of Medicine University of California San Diego San Diego California 92093 hellipAn integration target sequence was scored as a part of a transcrip-tion unit if it was (1) a member of the Refseq set of well-studied genes (httpwwwncbinlmnihgovLocusLinkrefseqhtml) or (2) if it was predicted to be a transcription unit by the ENSEMBLE (httpwwwensemblorg) or Fgenesh++ (httpwwwsoftberrycomHelpfgeneshplus2htm) programs and if that assignment was supported by mRNA or spliced EST sequence evidence Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation

FGENESH+ Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA Fgenesh+ gene prediction is conducted on sequences with protein homology Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation Annual Review of Genomics and Human Genetics Vol 3 293-310 (Volume publication date September 2002) (doi101146annurevgenom3030502101529) DATABASES AND TOOLS FOR BROWSING GENOMES Ewan Birney 1 Michele Clamp and 2 Tim Hubbard2 1European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom e-mail birneyebiacuk 2Wellcome Trust Sanger Institute Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom michelesangeracuk thsangeracuk

Another predicted gene track on the UCSC browser comes from Softberry ( http wwwsoftberrycom ) and uses a program Fgenesh+ which is based on HMMs and

FGENESB Appl Environ Microbiol 2004 April 70(4) 2332ndash2341 Oxygen-Controlled Bacterial Growth in the Sponge Suberites domuncula toward a Molecular Understanding of the Symbiotic Relationships between Sponge and Bacteriadagger

Werner E G Muumlller Vladislav A Grebenjuk Narsinh L Thakur Archana N Thakur Renato Batel Anatoli Krasko Isabel M Muumlller and Hans J Breter Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz D-55099 Mainz Germany Corresponding author Mailing address Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz Duesbergweg 6 55099 Mainz Germany Phone 6131-3925910 Fax 6131-3925243 E-mail wmuellermailuni-mainzde For genes and potential promoter prediction we used the FGENESB-PatternMarkov chain-based bacterial operon and gene prediction program from the SoftBerry Journal of Theoretical Biology 230 (2004) 133ndash144 Computational prediction of conserved operons and phylogenetic footprinting of transcription regulatory elements in the metal-reducing bacterial family Geobacteraceae Bin Yana Barbara A Metheґ b Derek R Lovleyc Julia Krushkala aDepartment of Preventive Medicine Center of Genomics and Bioinformatics University of Tennesee Health Science Center 66 N Pauline St Ste 633 Memphis TN 38163 USA bThe Institute for Genomic Research Rockville MD USA cDepartment of Microbiology Morrill Science Center IV North University of Massachusetts 639 North Pleasant Str Amherst MA 01003 USA the conserved nature of the operons 2 Operons in Geobacter sulfurreducens were predicted ab initio by the public version of program FGENESB (V Solovyev and V Molecular Microbiology Volume 52 Issue 6 Page 1579 -1596 June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Copyright copy 2003 The National Academy of Sciences Proc Natl Acad Sci U S A 2003 October 28 100(22) 12830ndash12835 doi 101073pnas2133554100 Published online 2003 October 17 Evolution Proteorhodopsin genes are distributed among divergent marine bacterial taxa Joseacute R de la TorredaggerDagger Lynne M Christiansondagger Oded Beacutejagravedaggersect Marcelino T Suzukidaggerpara David M Karl John Heidelberg and Edward F DeLongdaggerdaggerdagger

daggerMonterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 sectDepartment of Biology Technion-Israel Institute of Technology Haifa 32000 Israel

paraChesapeake Biological Laboratory University of Maryland Solomons MD 20688 Department of Oceanography University of Hawaii Manoa HI 96822 and Institute for Genomic Research Rockville MD 20850 Edited by Sallie W Chisholm Massachusetts Institute of Technology Cambridge MA and approved August 21 2003 (received for review 2003 June 10) DaggerPresent address Department of Civil and Environmental Engineering University of Washington Seattle WA 98195 daggerdagger To whom correspondence should be addressed E-mail delongmbariorg hellip Analysis of the potential genes and protein-coding regions was performed by using a combination of the BLAST (11) GLIMMER 202 (TIGR) (12 13) FGENESB (Softberry Mount Kisco NY) and ARTEMIS (Sanger Center Cambridge University UK) (14) software packages Environmental Microbiology September 2004 vol 6 no 9 pp 903-910(8) DOI 101111j1462-2920200400676x Different SAR86 subgroups harbour divergent proteorhodopsins Gazalah Sabehi1 Oded Beacutejagrave1 Marcelino T Suzuki2 Christina M Preston3 Edward F DeLong4

Affiliations 1 Department of Biology Technion-Israel Institute of Technology Haifa 32000 Israel 2 Chesapeake Biological Laboratory University of Maryland Center for Environmental Sciences Solomons MD 20688 USA 3 Monterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 USA 4 Massachusetts Institute of Technology Cambridge MA 02139 USA

program FGENESB (Softberry) and the annotation was subsequently refined and curated manually using ARTEMIS (Sanger Center) Fig

FGENESV Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song Qi Wei Qin Jin Qiu Can Hua Huang Fan Wang and Choy Leong Hew1 2 1 1 1 1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2

A total of 162 ORFs predicted by the FGENESV program (available through httpwwwsoftberrycom) supplemented with Vector NTI suite 71 are indicated doi101023BVIRU000002577148128f8 Virus Genes 28 (3) 239-246 April 2004 Article ID 5269250 Complete Nucleotide Sequence of a Strawberry Isolate of Beet Pseudoyellows Virus Ioannis E Tzanetakis Molecular and Cellular Biology Program Department of Botany and Plant Pathology Oregon State University Corvallis 97331 USA Robert R Martin Horticultural Crops Research Laboratory USDA-ARS Corvallis OR 97330 USA E-mail martinrrscienceoregonstateedu httpwwwncbinlmnih govgorfgorfhtml) and the gene finder in viruses at httpwwwsoftberrycom The amino acid comparisons Geno Prot amp Bioinfo Vol 1 No 3 August 2003 226-235 Genome Organization of the SARS-CoV Jing Xu1 Jianfei Hu21 Jing Wang21 Yujun Han1 Yongwu Hu13 Jie Wen1 Yan Li1 Jia Ji1 Jia Ye14 Zizhang Zhang5 Wei Wei4 Songgang Li12 Jun Wang1 Jian Wang14 Jun Yu14 and Huanming Yang14 1 Beijing Genomics Institute Chinese Academy of Sciences Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3Wenzhou Medical College Wenzhou 325003 China 4 James D Watson Institute of Genome Sciences Zhijiang Campus Zhejiang University and Hangzhou Genomics Institute Hangzhou 310008 China 5 College of Materials Science and Chemical Engineering Yuquan Campus Zhejiang University Hangzhou 310027 China These authors contributed equally to this work Corresponding authors E-mail junyugenomicsorgcn yanghmgenomicsorgcn hellipFGENESV a program for gene prediction provided by Softberry Inc (Mount Kisco USA) through a web-based interface has been specially modimacred and trained with parameters for virus (httpwwwsoftberrycomberryphtmltopic= gfindv) hellip

The hypothetical minus sense ORF iden-timacred by FGENESV (from 48 to 203 nt on the minus strand or 29523 to 29678 nt on the plus strand) may be fake but we should not absolutely deny the prob-ability of the existence of minus ORFs hellipFurthermore we employed FGENESV to explore the sequences of MHV (NC 001846 in NCBI) and AIBV (NC 001451 in NCBI) and compared the re-sults with their previous annotations respectively Rapport de stage de DEA Juin 2003 Analyse du geacutenome du virus de lrsquoarcheacutee Pyrococcus abyssi (PAV1) ROUAULT Karen Laboratoire de Microbiologie et Biotechnologie des Extrecircmophiles IFREMER- Centre de Brest et Equipe Microbiologie LEMAR ndash Institut Universitaire Europeacuteen de la Mer [14] FGENESV httpwwwsoftberrycomberry phtmltopic=gfindv Virus ( gt10 kb) Modegraveles de Markov Forme du geacutenome Code geacuteneacutetique [40]

FGENES-M BMC Bioinformatics 2005 6 25 Published online 2005 February 10 doi 1011861471-2105-6-25 Integrating alternative splicing detection into gene prediction Sylvain Foissac 1 and Thomas Schiex11Uniteacute de Biomeacutetrie et Intelligence Artificielle INRA 31326 Castanet Tolosan France

Corresponding author Sylvain Foissac foissactoulouseinrafr Thomas Schiex tschiextoulouseinrafrReceived July 27 2004 Accepted February 10 2005 This approach has been applied eg in HMMgene or in FGENES-M (unpub)hellip DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39 - 43 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom) DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39-43 DOI 1010801042517032000160189 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom)

BESTORF Eukaryotic Cell March 2005 p 526-535 Vol 4 No 3 1535-977805$0800+0 doi101128EC43526-5352005 Sex-Specific Homeodomain Proteins Sxi1 and Sxi2a Coordinately Regulate Sexual Development in Cryptococcus neoformans Christina M Hull1 Marie-Josee Boily and Joseph Heitman1 12 Department of Molecular Genetics and Microbiology1 the Howard Hughes Medical Institute Duke University Medical Center Durham North Carolina2 Received 14 January 2005 Accepted 14 January 2005 Corresponding author Mailing address 322 CARL Building Box 3546 Department of Molecular Genetics and Microbiology 100 Research Dr Duke University Medical Center Durham NC 27710 Phone (919) 684-2824 Fax (919) 684-5458 E-mail heitm001dukeeduPresent address Departments of Biomolecular Chemistry and Medical Microbiology amp

Immunology University of WisconsinmdashMadison Madison WI 53706 Sequence manipulations Splice predictions of candidate gene sequences for SXI2a were facilitated with a Softberry algorithm (wwwsoftberrycom) hellipWe utilized the BESTORF gene prediction algorithm from Softberry Inc to electronically produce predicted spliced cDNA products encoded by a 10-kb regionhellip

PROTCOMP Genes and Immunity 2005 v5 n4 - naturecom Immune response in silico(IRIS) immune-specific genes identified from a compendium of microarray hellip AR Abbas D Baldwin Y Ma W Ouyang A Gurney F hellip The Protcomp algorithm (Softberry Inc) predicts for the 1589 IRIS genes with ORFs that 24 of the encoded proteins are in the plasma membrane 13 are MPMI Vol 17 No 7 2004 pp 789ndash797 Publication no M-2004-0426-01R copy 2004 The American Phytopathological Society Lotus japonicus LjKUP Is Induced Late During Nodule Development and Encodes a Potassium Transporter of the Plasma Membrane Guilhem Desbrosses Claudia Kopka Thomas Ott and Michael K Udvardi Max Planck Institute of Molecular Plant Physiology Am Muumlhlenberg 1 14476 Golm Germany Submitted 3 November 2003 Accepted 13 February 2004 hellipBoth PSORT and Protcomp predicted a PM location for LjKUPhellip Planta DOI 101007s00425-003-1182-5 Issue Volume 218 Number 6 Date April 2004 Pages 965 - 975 Biochemical and immunological characterization of pea nuclear intermediate filament proteins Sonal S D Blumenthal1 Gregory B Clark1 and Stanley J Roux1

(1) School of Biological Sciences Section of Molecular Cell and Developmental Biology The University of Texas Austin TX 78712 USA Stanley J Roux Email srouxutsccutexasedu html) BCM Search Launcher (Protein structure prediction http searchlauncher bcmtmcedu) SoftBerry (Protein subcellular localization Comparative and Functional Genomics Volume 5 Issue 4 Pages 342 - 353Published Online 20 May 2004 Research Paper Investigation into the use of C- and N-terminal GFP fusion proteins for subcellular localization studies using reverse transfection microarrays Ella Palmer Tom Freeman

MRC Rosalind Franklin Centre for Genomics Research (formerly the HGMP-Resource Centre) Genome Campus Hinxton Cambridge CB10 1SB UK email Tom Freeman (tfreemanrfcgrmrcacuk) Correspondence to Tom Freeman RFCGR Hinxton CambridgeCB10 1SB UK

ProtComp version 4 (Softberry) combines results with proteins of known subcellular localization and assumed subcellular localization (based on theoret- ical Plant Physiol2004 134 286-295 RHM2 Is Involved in Mucilage Pectin Synthesis and Is Required for the Development Usadel et al Tentative subcellular localization prediction by TargetP (Emanuelsson et al 2000 ) or ProtComp (httpwwwsoftberrycom) a prediction software trained on Journal of Cellular BiochemistryVolume 90 Issue 2 Pages 361 - 378Published Online 3 Sep 2003 A proteomic study of the arabidopsis nuclear matrix Tomasz T Calikowski 1 3 Tea Meulia 2 Iris Meier 1

1Department of Plant Biology and Plant Biotechnology Center Ohio State University Columbus Ohio 43210 2Molecular and Cellular Imaging Center Ohio Agricultural and Research Development Center Ohio State University Columbus Ohio 43210 3Institute of Biochemistry and Biophysics Polish Academy of Sciences UL Pawinskiego 5A 02-106 Warszawa Poland email Iris Meier (meier56osuedu)Correspondence to Iris Meier Department of Plant Biology and Plant Biotechnology Center Ohio State University 244 Rightmire Hall 1060 Carmack Rd Columbus OH 43210 For prediction of subcellular localization ProtComp 4 (Softberry Inc Mount Kisco NY httpwwwsoftberrycomberryphtmltopicfrac14 proteinloc) PSORT v64 Cellular Molecular Life Sciences 2003 in press Automatic prediction of protein function Burkhard Rost 1 2 3 Jinfeng Liu 1 3 4 Rajesh Nair 1 5 Kazimierz O Wrzeszczynski 1 and Yanay Ofran 16

1 CUBIC Dept of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 2 Columbia University Center for Computational Biology and Bioinformatics (C2B2) Russ Berrie Pavilion 1150 St Nicholas Avenue New York NY 10032 USA

3 North East Structural Genomics Consortium (NESG) Department of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 4 Dept of Pharmacology Columbia Univ 630 West 168th Street New York NY 10032 USA 5 Dept of Physics Columbia Univ 538 West 120th Street New York NY 10027 USA 6 Dept of Medical Informatics Columbia Univ 630 West 168th Street New York NY 10032 USA Corresponding author cubiccubicbioccolumbiaedu URL httpcubicbioccolumbiaedu Tel +1-212-305-4018 fax +1-212-305-7932

genomelocalize ProtComp predict localization for plants httpwwwsoftberrycomberryphtmltopic=proteinloc Predotar predict Published online before print September 15 2003 101101gr1293003 Genome Research 132265-2270 2003 The Secreted Protein Discovery Initiative (SPDI) a Large-Scale Effort to Identify Novel Human Secreted and Transmembrane Proteins A Bioinformatics Assessment Hilary F Clark1 Austin L Gurney Evangeline Abaya Kevin Baker Daryl Baldwin Jennifer Brush Jian Chen Bernard Chow Clarissa Chui Craig Crowley Bridget Currell Bethanne Deuel Patrick Dowd Dan Eaton Jessica Foster Christopher Grimaldi Qimin Gu Philip E Hass Sherry Heldens Arthur Huang Hok Seon Kim Laura Klimowski Yisheng Jin Stephanie Johnson James Lee Lhney Lewis Dongzhou Liao Melanie Mark Edward Robbie Celina Sanchez Jill Schoenfeld Somasekar Seshagiri Laura Simmons Jennifer Singh Victoria Smith Jeremy Stinson Alicia Vagts Richard Vandlen Colin Watanabe David Wieand Kathryn Woods Ming-Hong Xie Daniel Yansura Sothy Yi Guoying Yu Jean Yuan Min Zhang Zemin Zhang Audrey Goddard William I Wood and Paul Godowski Departments of Bioinformatics Molecular Biology and Protein Chemistry Genentech Inc South San Francisco California 94080 USA 1 Corresponding author E-MAIL hclarkgenecomFAX (650) 225-5389 An automated computational strategy was utilized to query each protein translation with the Signal Sensor Sighmm Tmdetect (T Wu unpubl) hmmpfam (Eddy 1998 ) and Protcomp (Softberry Inc) algorithms hellipThe Protcomp algorithm predicts the subcellular localization of a protein on the basis of homology to well-annotated proteins a neural net and various protein motifs In this case the Protcomp subcellular localization prediction was used to categorize these genes as Other Secreted Other Transmembrane or Other Cytoplasmic or Nuclear Plant Physiol February 2002 Vol 128 pp 336-340 Gene prediction in eukaryota Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 genomelocalize ProtComp predict localization for plants httpwww softberrycomberryphtmltopic=proteinloc Predotar predict Proc Natl Acad Sci U S A 2001 April 24 98(9) 5341ndash5346 doi 101073pnas101534498 Published online 2001 April 17 Plant Biology The Cia5 gene controls formation of the carbon concentrating mechanism in Chlamydomonas reinhardtii Youbin Xiang Jun Zhang and Donald P Weeks

Department of Biochemistry and School of Biological Sciences University of Nebraska Lincoln NE 68588-0664

Edited by Bob B Buchanan University of California Berkeley CA and approved March 14 2001 (received for review 2000 November 8) To whom reprint requests should be addressed E-mail dweeks1unledu hellipComputer-assisted analysis of the CIA5 aa sequence (PROTCOMP version 4 httpwwwsoftberrycom) predicted a nuclear localization of the protein hellipFinally computer program predictions (eg PROTCOMP version 4 httpwwwsoftberrycom) for a nuclear localization of CIA5 and the clear-cut nuclear localization of CIA5 in onion epidermal cells (Fig 3) provide additional weight to the argument that CIA5 may be a transcription factor Dissertation zur Erlangung des akademischen Grades Dr rer nat der Fakultaumlt der Naturwissenschaften der Universitaumlt Ulm Untersuchungen zur Identifizierung von Faktoren und Mechanismen der mRNA 3 Prozessierung und Degradation in Chloroplasten houmlherer Pflanzen vorgelegt von Michael Walter aus Immenstadt i Allgaumlu Abteilung Molekulare Botanik Universitaumlt Ulm Ulm November 2001 Tag der Promotion 19 Feb 2002 Algorithmen zur Vorhersage der subzellulaumlren Lokalisation - PSORT httppsortnibbacjp8800formhtml (Nakai und Kanehisa 1992) - ChloroP httpwwwcbsdtudkservicesChloroP (Emanuelsson et al 2000) - TargetP httpwwwcbsdtudkservicesTargetP (Emanuelsson et al 2000) - Predotar httpwwwinrafrInternetProduitsPredotar - Softberry httpwwwsoftberrycom

BPROM Extremophiles Issue Volume 9 Number 2 Date April 2005 Pages 99 ndash 109 DOI 101007s00792-004-0425-0

The genome of BCJA1c a bacteriophage active against the alkaliphilic bacterium Bacillus clarkii

Andrew M Kropinski1 Melissa Hayward1 M Dorothy Agnew1 and Ken F Jarrell1

(1) Department of Microbiology and Immunology Queens University Kingston ON K7L 3N6 Canada

al 2002) Promoters were predicted using Softberryrsquos BPROM program at httpwwwsoftberry comberry phtmltopic=promoter

Journal of Bacteriology February 2005 p 1091-1104 Vol 187 No 3 0021-919305$0800+0 doi101128JB18731091-11042005

The Generalized Transducing Salmonella Bacteriophage ES18 Complete Genome Sequence and DNA Packaging Strategy

Sherwood R Casjens12 Eddie B Gilcrease1 Danella A Winn-Stapley1 Petra Schicklmaier3 Horst Schmieger3 Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24 Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah1 Department of Biological Sciences4 Pittsburgh Bacteriophage Institute University of Pittsburgh Pittsburgh Pennsylvania 2 Institut fuumlr Genetik und Mikrobiologie Universitaumlt Muumlnchen Munich Germany3 Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu Present address Biology Department MIT Cambridge MA 02139 Present address Biogen Idec GmbH D-85737 Ismaning Germany

Received 1 September 2004 Accepted 3 November 2004

The DNA sequence analysis software used was DNA Strider (24) GeneMark (5) Staden programs (78) BLAST (2) BPROM (httpwwwsoftberrycomberryphtmltopic

Infection and Immunity May 2005 p 2899-2909 Vol 73 No 5 Characterization of the Major Secreted Zinc Metalloprotease- Dependent GlycerophospholipidCholesterol Acyltransferase PlaC of Legionella pneumophila Sangeeta Banerji1 Mayte Bewersdorff1 Bjoumlrn Hermes1 Nicholas P Cianciotto2 and Antje Flieger1 Robert Koch-Institut Berlin Germany1 Department of Microbiology-Immunology Northwestern University Medical School Chicago Illinois2 Received 25 October 2004 Returned for modification 18 November 2004 Accepted 22 December 2004

Corresponding author Mailing address Robert Koch-Institut Research Group NG5 Pathogenesis of Legionella Infections Nordufer 20 D-13353 Berlin Germany Phone 49-30-4547-2522 Fax 49-30-4547-2328 E-mail fliegerarkide MB and BH contributed equally to this work

legion) (12) Nucleotide sequences were also analyzed for promoters using the web-based program BPROM (wwwsoftberrycom) Sequence Journal of Bacteriology April 2005 p 2458-2468 Vol 187 No 7 The Type III-Dependent Hrp Pilus Is Required for Productive Interaction of Xanthomonas campestris pv vesicatoria with Pepper Host Plants Ernst Weber1 Tuula Ojanen-Reuhs2 Elisabeth Huguet3 Gerd Hause4 Martin Romantschuk2 Timo K Korhonen2 Ulla Bonas13 and Ralf Koebnik1 Institute of Genetics1 Biozentrum Martin Luther University Halle Germany4 General Microbiology Faculty of Biosciences University of Helsinki Helsinki Finland2 Institut des Sciences Veacutegeacutetales CNRS Gif-sur-Yvette France3 Received 10 November 2004 Accepted 28 December 2004 Corresponding author Mailing address Martin-Luther-Universitaumlt Institut fuumlr Genetik Weinbergweg 10 D-06120 Halle (Saale) Germany Phone 49 345 5526293 Fax 49 345 5527151 E-mail koebnikgmxde Present address Purdue University Department of Food Sciences West Lafayette IN 47907 Present address Institut de Recherche sur la Biologie de lInsecte UMR CNRS 6035 Faculteacute

des Sciences F-37200 Tours France Present address University of Helsinki Department of Ecological and Environmental Sciences

FIN-15140 Lahti Finland

The promoter recognition program BPROM (Softberry Inc Mt Kisco NY) was used for prediction of bacterial sigma70 promoter motifs RESULTS

J Bacteriol 2004 September 186(17) 5945ndash5949 doi 101128JB186175945-59492004 Identification of Operators and Promoters That Control SXT Conjugative Transfer John W Beaber and Matthew K Waldor

Department of Microbiology Tufts University School of Medicine and Howard Hughes Medical Institute Boston Massachusetts Corresponding author Mailing address Tufts University School of Medicine 136 Harrison Ave Jaharis 425 Boston MA 02111 Phone (617) 636-2730 Fax (617) 636-2723 E-mail matthewwaldortuftsedu Received April 1 2004 Accepted May 24 2004 hellipComputer algorithms and 5prime random amplification of cDNA ends (RACE) were used to define the setR and s086 transcription start sites Software for the identification of bacterial promoters (httpwwwsoftberrycomberryphtmltopic=bpromampgroup=programsampsubgroup=gfindb) identified putative minus10 and minus35 elements for both PL and PR (Fig 2) (23 24)hellip JOURNAL OF BACTERIOLOGY Mar 2004 p 1818ndash1832 Vol 186 No 6

The pKO2 Linear Plasmid Prophage of Klebsiella oxytoca Sherwood R Casjens12 Eddie B Gilcrease1 Wai Mun Huang1 Kim L Bunny3

Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24

Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah 841321 Pittsburgh Bacteriophage Institute2 and Department of Biological Sciences4 University of Pittsburgh Pittsburgh Pennsylvania 15260 and Section of Microbiology University of California at Davis Davis California 956163

Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu hellipThe DNA sequence analysis software packages used were DNA Strider (27) GeneMark (8) the Staden programs (94) BLAST (3) BPROM httpwwwsoftberrycomberryphtmltopic_gfindb) and DNA Master (J Lawrence [httpcobamide2biopittedu])hellip

BMC Microbiology 2004 44 Analysis of the lambdoid prophage element e14 in the E coli K-12 genome Preeti Mehta1 Sherwood Casjens2 and Sankaran Krishnaswamy1 Address 1Bioinformatics Centre School of Biotechnology Madurai Kamaraj University Madurai-625021 India and 2University of Utah Medical School Department of Pathology 90 North 1900 East Salt Lake City UT 84132-2501 USA Email Preeti Mehta - mehta_p74yahoocom Sherwood Casjens - sherwoodcasjenspathutahedu Sankaran Krishnaswamy - krishnamrnatnnicin Corresponding author This article is available from httpwwwbiomedcentralcom1471-218044 Putative promoters predicted using BPROM available at the website http wwwsoftberrycom Scores are as given by BPROM Promoters Plant Molecular Biology 53 (6) 865-876 December 2003 Prokaryotic orthologues of mitochondrial alternative oxidase and plastid terminal oxidase Allison E McDonald Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Sasan Amirsadeghi Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Greg C Vanlerberghe Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada (e-mail gregvutscutorontoca The A variabilis PTOX sequence was analyzed in the upstream region of the start codon with Softberryrsquos BPROM software (httpwwwsoftberrycom)

SPLICEDB Plant Molecular Biology DOI 101007s11103-005-0271-1 Issue Volume 57 Number 3 Date February 2005 Pages 445 - 460 Evaluation of five ab initio gene prediction programs for the discovery of maize genes Hong Yao1 4 Ling Guo1 6 Yan Fu1 4 Lisa A Borsuk1 6 Tsui-Jung Wen2 David S Skibbe1 5 Xiangqin Cui1 4 9 Brian E Scheffler8 Jun Cao1 4 Scott J Emrich6 Daniel A Ashlock3 6 and Patrick S Schnable1 2 4 5 6 7

(1)Department of Genetics Development and Cell Biology Iowa State University Ames Iowa 50011-3650 (2) Department of Agronomy Iowa State University Ames Iowa 50011-3650 (3) Department of Mathematics Iowa State University Ames Iowa 50011-3650 (4) Inderdepartmental Graduate Programs in Genetics Iowa State University Ames Iowa 50011-3650 (5) Department of Molecular Cellular and Developmental Biology Iowa State University Ames Iowa 50011-3650 (6) Department of Electrical and Computer Engineering and Department of Bioinformatics and Computational Biology Iowa State University Ames Iowa 50011-3650 (7) Center for Plant Genomics Iowa State University Ames Iowa 50011-3650 (8) Mid South Area Genomics Facility USDA-ARS Stoneville MS 38776-0038 USA(9) Present address Department of Biostatistics Birmingham AL 35294 USA model FGENESH httpwwwsoftberrycom berryphtmltopic=fgeneshampgroup= programsampsubgroup=gfind Monocots Yes Yes Yes GHMM a GeneMark Finding short DNA motifs using permuted markov models X Zhao H Huang TP Speed The data are human donor sequences from SpliceDB [9] a recently developed database of known mammalian splice site sequences (httpwwwsoftberrycomspldb Current Opinion in Structural Biology 2004 14273ndash282 The evolving roles of alternative splicing Liana F Lareau1 Richard E Green1 Rajiv S Bhatnagar23 and Steven E Brenner12_ Departments of 1Molecular and Cell Biology and 2Plant and Microbial Biology University of California Berkeley California 94720 USA 3Department of Dermatology University of California San Francisco California 94143 USA _e-mail brennercompbioberkeleyedu [79] SpliceDB httpwwwsoftberrycomberryphtmltopicfrac14splicedb Database and composition statistics for mammalian splice sites inferred from ESTs [80] Yearbook of Medical Informatics Review Paper 2004 121-136 Curated databases and their role in clinical bioinformatics CC Englbrecht M Han MT Mader A Osanger KFX Mayer MIPS Institute for Bioinformatics Address of the authors Claudia C Englbrecht Michael Han

Michael T Mader Andreas Osanger Klaus F X Mayer MIPS Institute for Bioinformatics GSF - National Research Center for Environment and Health 85758 Neuherberg Germany E-mail kmayergsfdeCorresponding author hellipSpliceDB httpwwwsoftberrycomspldbSpliceDBhtmlCanonical and non-canonical mammalian splice sites [122] 122Burset M Seledtsov IA Solovyev VV SpliceDB database of canonical and non-canonical mammalian splice sites Nucleic Acids Res 200129255-9 Nucleic Acids Research 2001 Vol 29 No 1 255-259 SpliceDB database of canonical and non-canonical mammalian splice sites M Burset I A Seledtsov1 and V V Solovyev The Sanger Centre Hinxton Cambridge CB10 1SA UK and 1Softberry Inc 108 Corporate Park Drive Suite 120 White Plains NY 10604 USA To whom correspondence should be addressed at present address EOS Biotechnology 225A Gateway Boulevard South San Francisco CA 94080 USA Tel +1 650 246 2331 Fax +1 650 583 3881 Email solovyeveosbiotechcom Present address M Burset Institut Municipal drsquoInvestigacioacute Megravedica (IMIM) CDr Aiguader 80 08003 Barcelona Spain

SCAN2 African Journal of Biotechnology Vol 2 (12) pp 714-718 December 2003 Available online at httpwwwacademicjournalsorgAJB ISSN 1684ndash5315 copy 2003 Academic Journal Accepted 14 November 2003 Minireview Web-based bioinformatic resources for protein and nucleic acids sequence alignment Kamel A Abd-Elsalam Molecular Markers Lab Plant Pathology Research Institute Agricultural Research Center Orman 12619 Giza Egypt E-mail kaabdelsalammsncom 16-SCAN2 program for aligning two multimegabyte-size sequences httpwwwsoftberrycomberryphtmltopic=scanhampprg= SCAN2 derived Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK The full-length sequences of gene pairs have been aligned by the SCAN2 program (httpsoftberrycomberryphtmltopic=scanhampprg=SCAN2) which can align

PDISORDER BMC Bioinformatics 2005 622 doi1011861471-2105-6-22 Research article Open Access Proteins with two SUMO-like domains in chromatin-associated complexes The RENi (Rad60-Esc2-NIP45) family Maria Novatchkova1 Andreas Bachmair3 Birgit Eisenhaber2 and Frank Eisenhaber2 Address 1Gregor Mendel-Institut GMI Austrian Academy of Sciences Vienna Biocenter A-1030 Vienna Austria 2Research Institute of Molecular Pathology Dr Bohr-Gasse 7 A-1030 Vienna Austria and 3Max Planck Institute for Plant Breeding Research Carl-von-Linneacute-Weg 10 D-50829 Cologne Germany Email Maria Novatchkova - marianovatchkovagmioeawacat Andreas Bachmair - bachmairmpiz-koelnmpgde Birgit Eisenhaber - b_eisenimpunivieacat Frank Eisenhaber - FrankEisenhaberimpunivieacat Corresponding author hellipInitial analysis of its sequence complexity shows that the disordered N-terminal half of the protein is followed by a likely globular segment (predicted using Pdisorder by Softberry Inc)hellip

SPL The National Academy of Sciences Proc Natl Acad Sci U S A 2003 November 25 100(Suppl 2) 14537ndash14542 doi 101073pnas2335847100 Molecular evolution of the insect chemoreceptor gene superfamily in Drosophila melanogaster Hugh M Robertsondagger Coral G WarrDaggersect and John R Carlsonsect Department of Entomology University of Illinois 505 South Goodwin Avenue Urbana IL 61801 DaggerSchool of Biological Sciences Monash University Clayton VIC 3800 Australia and sectDepartment of Molecular Cellular and Developmental Biology Yale University New Haven CT 06520 The genes were reconstructed manually in the PAUP editor (23) by using the expected exonintron structures as guides and the SPL program (Softberry wwwsoftberrycomberryphtml) to locate predicted introns

NSITE GENETIC ANALYSES OF BOVINE CARD15 A PUTATIVE DISEASE RESISTANCE GENE A Dissertation by KRISTEN HAWKINS TAYLOR Submitted to Texas AampM University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 2004 Major Subject Genetics hellipShort motifs identified as being conserved between the three species in these intronic regions as well as in the 5rsquoUTR and 3rsquoUTR were then analyzed using the TFSCAN (httpzeonwelloxacukgit-bintfscan) and NSITE (available through SoftBerry httpwwwsoftberrycomberryphtmltopic=promoter) programs to identify putative regulatory motifs Motifs selected for analysis required homology consisting of 6 or more bases with no more than 2 substitutions among the 3 specieshellip hellipSequence that included the SNPs located within intronic regions and in the 3rsquo and 5rsquoUTRs were analyzed using NSITE (available through SoftBerry at httpwwwsoftberrycomberryphtmltopic=promoter) to identify putative regulatory motifshellip

TSSP Bioinformatics 2005 21(14)3074-3081 doi101093bioinformaticsbti490 Cis-regulatory element based targeted gene finding genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana Weixiong Zhang 12 Jianhua Ruan 1 Tuan-hua David Ho 3 Youngsook You 3 Taotao Yu 1 and Ralph S Quatrano 3 1Department of Computer Science and Engineering Washington University in Saint Louis Saint Louis MO 63130 USA 2Department of Genetics Washington University in Saint Louis Saint Louis MO 63130 USA 3Department of Biology Washington University in Saint Louis Saint Louis MO 63130 USA To whom correspondence should be addressed

sites (TSSs) To predict TSSs we combined an Athaliana cDNA database and a software TSSP (SoftBerry httpwwwsoftberrycom) As

PLANTPROM BMC Genomics 2005 6 25 Genome wide analysis of Arabidopsis core promoters Carlos Molina12 and Erich Grotewold 11Department of Plant Cellular and Molecular Biology and Plant Biotechnology Center The Ohio State University Columbus OH 43210 2Departamento de Informaacutetica Universidad Teacutecnica Federico Santa Mariacutea Valparaiacuteso Chile search for TATA elements is carried out on the 12749 [-500 -1] regions 6316 sequences (using the MEME NFM) or 8776 (using the expanded PlantProm NFM) are Nucleic Acids Research 2005 Vol 33 No 3 1069ndash1076 doi101093nargki247 Plant promoter prediction with confidence estimation I A Shahmuradov1 V V Solovyev12 and A J Gammerman1 1Royal Holloway University of London Egham Surrey TW20 0EX UK and 2Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA MATERIALS AND METHODS Training and testing sequences For training and testing procedures we used 301 promoters with annotated TSS from PlantProm DB (22) Nucleic Acids Research 2004 Vol 32 Database issue D368plusmnD372 DOI 101093nargkh017 AthaMap an online resource for in silico transcription factor binding sites in the Arabidopsis thaliana genome Nils Ole Steffens Claudia Galuschka Martin Schindler Lorenz BuEgravelow and Reinhard Hehl Institut fuEgrave r Genetik Technische UniversitaEgrave t Braunschweig Spielmannstraucirce 7 D-38106 Braunschweig Germany hellipShahmuradovIA GammermanAJ HancockJM BramleyPM and SolovyevVV (2003) PlantProm a database of plant promoter sequences Plant Physiology October 2004 Vol 136 pp 3023-3033 GENOME ANALYSIS Utility of Different Gene Enrichment Approaches Toward Identifying and Sequencing the Maize Gene Space1[w] Nathan Michael Springer Xiequn Xu and W Brad Barbazuk Center for Plant and Microbial Genomics Department of Plant Biology University of Minnesota St Paul Minnesota 55108 (NMS) and Donald Danforth Plant Sciences Center St Louis Missouri 63132 (XX WBB) Text] Shahmuradov IA Gammerman AJ Hancock JM Bramley PM Solovyev VV (2003) PlantProm a database of plant promoter sequences

Plant Physiology April 2004 Vol 134 pp 1ndash12 wwwplantphysiolorg Characterization of Three Functional High-Affinity Ammonium Transporters in Lotus japonicus with Differential Transcriptional Regulation and Spatial Expression1

Enrica DrsquoApuzzo2 Alessandra Rogato2 Ulrike Simon-Rosin Hicham El Alaoui3 Ani Barbulova Marco Betti Maria Dimou Panagiotis Katinakis Antonio Marquez Anne-Marie Marini Michael K Udvardi and Maurizio Chiurazzi Institute of Genetics and Biophysics Via Marconi 12 80125 Napoli Italy (ED AR HEA AB MC) Molecular Plant Nutrition Group Max Planck Institute of Molecular Plant Physiology Am Muhlenberg 1 14476 Golm Germany (US-R MKU) Universiteacute Libre de Bruxelles Institut de Biologie de Meacutedecine Moleacuteculaires 6041 Gosselies Belgium (A-MM) Agricultural University of Athens Department of Agricultural Biotechnology 11855 Athens Greece (MD PK) and Departamento de Bioquıacutemica Vegetal y Biologıacutea Molecular Facultad de Quimica 41080 Seville Spain (MB AM) hellipPlantProm a database of plant promoter sequences Nucleic Acids Res 31 114ndash117 Nucleic Acids Research 1 January 2003 vol 31 no 1 pp 114-117(4) PlantProm a database of plant promoter sequences Authors Bramley PM1 Solovyev VV2 Shahmuradov IA Gammerman AJ Hancock JM Affiliations Department of Computer Science Royal Holloway University of London Egham Surrey TW20 0EX UK 1 School of Biological Sciences Royal Holloway University of London UK 2 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 3 To whom correspondence should be addressed Email victorsoftberrycom Present address John M Hancock MRC Mammalian Genetics Unit Harwell Oxfordshire UK

PROMH Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK Received February 15 2003 Revised and Accepted March 21 2003

Other FGENESH Molecular Microbiology Volume 52 Issue 6 Page 1579 - June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Affiliations Departments of Medicine and Pathobiology University of Washington Harborview Medical Center Box 359779 325 Ninth Ave Seattle WA 98104 USA E-mail acenturuwashingtonedu Tel (+1) 206 341 5364 Fax (+1) 206 341 5363 Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Microbiology 150 (2004) 518-520 DOI 101099mic026871-0 IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated with recognizable overlapping genes Mark W Silby1 Paul B Rainey23 and Stuart B Levy14 1 Center for Adaptation Genetics and Drug Resistance Department of Molecular Biology and Microbiology Tufts University School of Medicine Boston MA 02111 USA 2 Department of Plant Sciences University of Oxford South Parks Road Oxford OX1 3RB UK 3 School of Biological Sciences University of Auckland Private Bag 92019 Auckland New Zealand 4 Department of Medicine Tufts University School of Medicine Boston MA 02111 USA Correspondence Stuart B Levy (stuartlevytuftsedu) hellipUsing SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of the iiv5 ORF respectivelyhellip Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 DOI 101128JVI782212576-125902004

Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song1 Qi Wei Qin2 Jin Qiu1 Can Hua Huang1 Fan Wang1 and Choy Leong Hew1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2 Corresponding author Mailing address Department of Biological Sciences National University of Singapore 10 Kent Ridge Crescent Singapore 119260 Singapore Phone 65-68742692 Fax 65-67795671 E-mail dbshewclnusedusg or dbsheadnusedusg Received 19 March 2004 Accepted 29 June 2004

hellipThe whole genome was also submitted to httpwwwsoftberrycom (Softberry Inc Mount Kisco NY) for identification of all potential ORFshellip hellipCoding capacity of the viral genomic DNA sequence Prediction of presumptive genes was carried out by using the viral gene prediction program under the website httpwwwsoftberrycom supplemented with Vector NTI suite 71hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1695-8 Issue Volume 109 Number 4 Date August 2004 Pages 681 - 689

Linkage disequilibrium and sequence diversity in a 500-kbp region around the adh1 locus in elite maize germplasm Mark Jung1 Ada Ching1 Dinakar Bhattramakki2 Maureen Dolan1 Scott Tingey1 Michele Morgante1 3 and Antoni Rafalski1

(1) DuPont Crop Genetics Experimental Station PO Box 80353 Wilmington DE 19880-0353 USA (2) Pioneer Hi-Bred International Inc 7300 NW 62nd Avenue PO Box 1004 Johnston IA 50131-1004 USA (3) Dipartimento di Produzione Vegetale e Tecnologie Agrarie Universita di Udine Via delle Scienze 208 33100 Udine ItalyReceived 26 January 2004 Accepted 2 April 2004 Published online 6 August 2004 Mark Jung Email marktjungcgrdupontcom 1) Gene locations were defined by several methods Annotations provided in Tikhonov et al (1999) were first used then FGENESH gene-finding software DNA Sequence - The Journal of Sequencing and Mapping Issue Volume 15 Number 4 August 2004 Pages 269 ndash 276 DOI 10108010425170412331279648 Isolation Characterization and Expression Analysis of a Leaf-specific Phosphoenolpyruvate Carboxylase Gene in Oryza sativa Chang-Fa Lin A1 Chun Wei A1 Li-Zhi Jiang A1 A2 Ke-Gui Li A1 Xiao-Yin Qian A1 Kotb Attia A1 Jin-Shui Yang A1 A1 State Key Laboratory of Genetic Engineering Institute of Genetics School of Life Sciences Fudan University Shanghai 200433 PRChina A2 National Key Laboratory for Soil Erosion and Dry Land Farming on Loess Plateau Northwest Sci-Tech University of Agriculture and Forest Shannxi 712100 PRChina tools of GeneMark (httpopal biologygatechedugeneMark) and Softberry (httpwwwsoftberrycom) For the isolation of putative Plant Molecular Biology DOI 101023BPLAN00000382568980957 Issue Volume 54 Number 4 Date March 2004 Pages 519 ndash 532

Genome-Wide Analysis of the GRAS Gene Family in Rice and Arabidopsis Chaoguang Tian1 Ping Wan1 Shouhong Sun1 Jiayang Li1 and Mingsheng Chen1

(1) Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road Chaoyang District Beijing 100101 ChinaMingsheng Chen Email mschengeneticsaccn

database FgeneSH (Salamov and 90 Solovyev 2000) was used for gene prediction pre- 207 dicted by FgeneSH (minor discrepancies exist due 208 Mycological Research (2004) 108 853-857 Cambridge University Press doi 101017S095375620400067X Published Online 13 авг 2004 Combining transcriptome data with genomic and cDNA sequence alignments to make confident functional assignments for Aspergillus nidulans genes Andrew H SIMS a1 Manda E GENT a1 Geoffrey D ROBSON a1 Nigel S DUNN-COLEMAN a2 and Stephen G OLIVER a1c1 a1 School of Biological Sciences University of Manchester The Michael Smith Building Oxford Road Manchester M13 9PT UK E-mail steveolivermanacuk a2 Genencor International Inc 925 Page Mill Road Palo Alto CA 94304 USA c1 Corresponding author Kingdom Page 2 Genewise FgeneSH FgeneSH+) consisting of 9541 putative open reading frames (ORFs) was released in June 2003 We TAG Theoretical and Applied Genetics DOI 101007s00122-004-1603-2 Issue Volume 109 Number 1 Date June 2004 Pages 129 ndash 139

Gene content and density in banana (Musa acuminata) as revealed by genomic sequencing of BAC clones R Aert1 2 L Saacutegi2 and G Volckaert1

(1) Laboratory of Gene Technology Katholieke Universiteit Leuven Kasteelpark Arenberg 21 3001 Leuven Belgium

Present address Laboratory of Tropical Crop Improvement Katholieke Universiteit Leuven (2) Kasteelpark Arenberg 13 3001 Leuven Belgium

R Aert Email RitaAertagrkuleuvenacbe

gscrikengojp) fgenesh version 11 (Salamov and Solovyev 2000 httpwwwsoftberry com) genemarkhmm version 22a (Lukashin and Borodovsky 1998 http Genome Research 142503-2509 2004 Resources EAnnot A genome annotation tool using experimental evidence Li Ding1 Aniko Sabo Nicolas Berkowicz Rekha R Meyer Yoram Shotland Mark R Johnson Kymberlie H Pepin Richard K Wilson and John Spieth Genome Sequencing Center Washington University School of Medicine St Louis Missouri 63110 USA Article and publication are at httpwwwgenomeorgcgidoi101101gr3152604 1 Corresponding author E-mail ldingwatsonwustledu fax (314) 286-1810 Some ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov and Solovyev 2000 ) are based on intrinsic characteristics of coding hellipSome ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov

and Solovyev 2000 ) are based on intrinsic characteristics of coding sequence (eg codon usage consensus splice sites etc) and require training on known genes from the organismhellip

hellipTo further evaluate the performance of EAnnot we compared EAnnot predictions with Ensembl Genscan and Fgenesh predictions using manual annotation as a standard While Genscan and Fgenesh are ab initio programs Ensembl takes into account experimental data a feature shared with EAnnot Ensembl predicted 1037 known genes with 1798 transcripts and 1457 EST genes with 2308 transcripts for chromosome 6 (build 31) while Fgenesh and Genscan predicted 6230 and 6225 genes respectively We evaluated the performance of each program with respect to splice sites transcripts and genes across all of chromosome 6hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1758-x Issue Volume 109 Number 7 Date November 2004 Pages 1434 - 1447

Full-genome analysis of resistance gene homologues in rice

B Monosi1 R J Wisser2 L Pennill1 and S H Hulbert1

(1) Department of Plant Pathology Kansas State University Manhattan KS 66506-5502 USA (2) Department of Plant Pathology Cornell University Ithaca NY 14853 USAReceived 18 February 2004 Accepted 16 June 2004 Published online 10 August 2004 S H Hulbert Email shulbrtksuedu DNA sequences were analyzed using the gene prediction programs GENSCAN (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (Salamov and arXivq-bioGN0402046 v1 27 Feb 2004 Sublinear growth of Information in DNA sequences Giulia Menconi Dipartimento di Matematica Applicata and CISSC Centro Interdisciplinare per lo Studio dei Sistemi Complessi Universit`a di Pisa Via Bonanno Pisano 25b 56126 PISA - Italy menconimaildmunipiit October 23 2003 hellipAs a result four putative genes G1 G2 G3 and G4 have been located by means of Hidden Markov Model-based program FGENESH2 that has been created for predicting multiple genes and their structure in genomic DNA sequences The analysis via FGENESH has been exploited with respect to known genes in Arabidopsis thaliana Their predicted position is illustrated in Figure 13 hellip2This program is available at the website wwwsoftberrycom to which we refer con-cerning the reliability and e_ciency of the algorithmhellip Current Opinion in Plant Biology 2004 7732ndash736 DOI 101016jpbi200409003 Consistent over-estimation of gene number in complex plant genomes Jeffrey L Bennetzen14 Craig Coleman27 Renyi Liu15 Jianxin Ma16 and Wusirika Ramakrishna38 1 Department of Genetics University of Georgia Athens Georgia 30602 USA 2 Department of Plant and Animal Sciences Brigham Young University Provo Utah 84602 USA 3 Department of Biological Sciences Michigan Tech University Houghton Michigan 49931 USA

4e-mail maizeugaedu 5e-mail lryugaedu 6e-mail jmaugaedu 7e-mail biotechnologybyuedu 8e-mail wusirikamtuedu hellipWe have found that the standard gene-discovery programs FGENESH GeneMark and GENSCAN annotate segments of most retrotransposons and many invertedrepeat transposable elements as genes Using FGENESH to annotate maize BAC clones for instance 70ndash100 of the predicted genes are actually from transposable elementshellip The Plant Cell 162795-2808 (2004) Spotted leaf11 a Negative Regulator of Plant Cell Death and Defense Encodes a U-BoxArmadillo Repeat Protein Endowed with E3 Ubiquitin Ligase Activity Li-Rong Zenga Shaohong Qua Alicia Bordeosb Chengwei Yangc Marietta Baraoidanb Hongyan Yanc Qi Xiec Baek Hie Nahmd Hei Leungb and Guo-Liang Wanga1 a Department of Plant Pathology Ohio State University Columbus Ohio 43210 b International Rice Research Institute Metro Manila Philippines c State Key Lab for Biocontrol Sun Yat-sen (Zhongshan) University Guangzhou China 510275 d Department of Biological Science Myongji University Kyonggido Korea 449728 1 To whom correspondence should be addressed E-mail wang620osuedu fax 614-292-4455 in spl11 Exons predicted in G3 by the programs GENSCAN and Fgenesh using different matrixes are displayed in dark gray (D) RFLP Source Human Genomics Volume 1 Number 2 January 2004 pp 146-149(4) Publisher Henry Stewart Publications The truth about mouse human worms and yeast Authors David R Nelson1 Daniel W Nebert2 1 Department of Molecular Sciences and The UT Center of Excellence in Genomics and Bioinformatics University of Tennessee Memphis Tennessee 38163 USA 2 Department of Environmental Health and Center for Environmental Genetics (CEG) University of Cincinnati Medical Center Cincinnati Ohio 45267-0056 USA unpublished data 2003 see also Ref [7]) FGENESH 21 TWINSCAN 22 and the Ensembl annotation pipeline 23 The output of the four Genome Biology 2004 5R73 doi101186gb-2004-5-10-r73 A comprehensive transcript index of the human genome generated using microarrays and computational approaches Eric E Schadt 1 Stephen W Edwards 1 Debraj GuhaThakurta1 Dan Holder2 Lisa Ying2 Vladimir Svetnik2 Amy Leonardson1 Kyle W Hart3 Archie Russell1 Guoya Li1 Guy Cavet1 John Castle1 Paul McDonagh4 Zhengyan Kan1 Ronghua Chen1 Andrew Kasarskis1 Mihai Margarint1 Ramon M Caceres1 Jason M Johnson1

Christopher D Armour1 Philip W Garrett-Engele1 Nicholas F Tsinoremas5 and Daniel D Shoemaker1

1Rosetta Inpharmatics LLC 12040 115th Avenue NE Kirkland WA 98034 USA 2Merck Research Laboratories W42-213 Sumneytown Pike POB 4 Westpoint PA 19846 USA 3Rally Scientific 41 Fayette Street Suite 1 Watertown MA 02472 USA 4Amgen Inc 1201 Amgen Court W Seattle WA 98119 USA 5The Scripps Research Institute Jupiter FL 33458 USA hellipGrailEXP 40 [47] GENSCAN 10 [48] FGENESH [49] and FGENESH+ [49]ab initio gene-prediction algorithms were run independently across the entire genome assembly to augment alignment-based gene identification methods GrailEXP 40 GENSCAN 10 and FGENESH version 1c were run with default parameters for human sequence GrailEXP used expressed sequence evidence from RefSeq UniGene and DoubleTwist HGI to refine gene predictions FGENESH+ was run with protein sequences from BLASTX with E-score lower than 10-5 When multiple protein alignments overlapped all overlapping protein sequences were clustered with BLASTClust [50] and the lowest E-score hit was used by FGENESH+hellip hellipAdditionally 3 sim4 and 3 or 5 FGENESH+ predicted exons that were short andor distant from internal predicted exons were removedhellip Genome Research 14988-995 2004 ENSEMBL Special GeneWise and Genomewise Ewan Birney13 Michele Clamp2 and Richard Durbin2 1 The European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK hellipThere has been a long history of successful ab initio programs which do not use any additional

evidence to predict genes on genomic DNA of which Genscan (Burge and Karlin 1997 ) and Fgenesh (Solovyev and Salamov 1997 ) are two of the most successful caseshellip hellipAnother class of evidence-based gene prediction programs are ones which use external evidence to influence the scoring of potential exons including SGP-2 (Parra et al 2003 ) Genie

(Kulp et al 1996 ) Genomescan (Yeh et al 2001 ) HMMGene (Krogh 2000 ) and Fgenesh++ (Solovyev and Salamov 1997 )hellip Published online before print February 5 2004 101073pnas0308430100 PNAS | February 17 2004 | vol 101 | no 7 | 1910-1915 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin Nam Joonyul Kim para Shinyoung Lee Gynheung An Hong Ma and Masatoshi Nei Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State

University University Park PA 16802 and National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Contributed by Masatoshi Nei December 22 2003

hellipBecause annotation of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program FGENESH (wwwsoftberrycom) from the genome sequences obtained from TIGR and the Rice Genome Database (China) (25) Functional amp Integrative Genomics DOI 101007s10142-004-0109-y Issue Volume 4 Number 2 Date May 2004 Pages 102 - 117

Sequence analysis of the long arm of rice chromosome 11 for ricendashwheat synteny

Nagendra K Singh1 Saurabh Raghuvanshi2 Subodh K Srivastava1 Anupama Gaur2 Ajit K Pal1 Vivek Dalal1 Archana Singh1 Irfan A Ghazi1 Ashutosh Bhargav1 Mahavir Yadav1 Anupam Dixit1 Kamlesh Batra1 Kishor Gaikwad1 Tilak R Sharma1 Amitabh Mohanty2 Arvind K Bharti2 Anita Kapur2 Vikrant Gupta2 Dibyendu Kumar2 Shubha Vij2 Ravi Vydianathan2 Parul Khurana2 Sulabha Sharma2 W Richard McCombie3 Joachim Messing4 Rod Wing5 Takuji Sasaki6 Paramjit Khurana2 Trilochan Mohapatra1 Jitendra P Khurana2 and Akhilesh K Tyagi2

1 Indian Initiative for Rice Genome Sequencing National Research Centre on Plant Biotechnology

Indian Agricultural Research Institute New Delhi 110012 India 2 Indian Initiative for Rice Genome Sequencing Department of Plant Molecular Biology University of

Delhi South Campus New Delhi 110021 India 3 Genome Research Centre Cold Spring Harbor Laboratory 1 Bungtown Road New York USA 4 The Plant Genome Initiative at RutgersndashWaksman Institute Rutgers University 190 Frelinghuysen

Road Piscataway NJ 08873 USA 5 Department of Plant Sciences 303 Forbes Building Arizona Genomics Institute The University of

Arizona Tucson AZ 85721 USA 6 Department of Genome Research National Institute of Agrobiological Sciences 1-2 Kannondai 2-

chome Tsukuba Ibaraki 305-8602 Japan Akhilesh K Tyagi Email akhileshgenomeindiaorg Wherever RiceGAAS data were not available the genes were predicted by FGENESH trained for monocot plant species (http wwwsoftberrycomberryphtml) TAG Theoretical and Applied Genetics DOI 101007s00122-004-1621-0 Issue Volume 109 Number 1 Date June 2004 Pages 10 - 22

Annotation of a 95-kb Populus deltoides genomic sequence reveals a disease resistance gene cluster and novel class I and class II transposable elements

M Lescot1 4 S Rombauts1 J Zhang1 S Aubourg1 5 C Matheacute1 6 S Jansson2 P Rouzeacute1 3 and W Boerjan1

1 Department of Plant Systems Biology Flanders Interuniversity Institute for Biotechnology Ghent University Technologiepark 927 9052 Gent Belgium

2 Department of Plant Physiology University of Umearing 901 87 Umearing Sweden 3 Laboratoire Associeacute de l Institut National de la Recherche Agronomique (France) Ghent

University 9052 Gent Belgium 4 Present address CIRAD-Biotrop TA4003 34398 Montpellier Cedex 5 France 5 Pre sent address Uniteacute de Recherche en Geacutenomique Veacutegeacutetale INRA 91057 Evry Cedex

France 6 Present address Laboratoire de Biologie Vasculaire Institut de Pharmacologie et Biologie

Structurale 205 route de Narbonne 31077 Toulouse Cedex France

P Rouzeacute Email pierrerouzepsbugentbe 1999 httpwwwtigrorgtdbglimmermglmr_formhtml) and FgenesH for dicots or monocots (Salamov and Solovyev 2000 httpwwwsoftberrycom) BIOINFORMATICS 2004 vol20 N9 p1416-1427 J Yuan B Bush A Elbrecht Y Liu T Zhang W Zhao hellip - suchasGRAIL(Lopezetal 1994 Roberts 1991 Uberbacher et al 1996) GENESCOPE (Murakami and Takagi 1998) fgenesh (Salamov and Solovyev 2000) GeneMark Molecular Plant Pathology Volume 5 Issue 6 Page 515 - November 2004 Pathogen profile Heading for disaster Fusarium graminearum on cereal crops RUBELLA S GOSWAMI AND H CORBY KISTLERCorrespondence E-mail HCKISTumnedu This pipeline uses a combination of the programs FGENESH and FGENESH+ (Salamov and Solovyev 2000) modified by Softberry ( http wwwsoftberrycom ) with Nucleic Acids Research 2004 Vol 32 Database issue D41-D44 MIPS analysis and annotation of proteins from whole genomes H W Mewes12 C Amid1 R Arnold1 D Frishman2 U Guumlldener1 G Mannhaupt2 M Muumlnsterkoumltter1 P Pagel1 N Strack2 V Stuumlmpflen1 J Warfsmann1 and A Ruepp1 1 Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1 D-85764 Neuherberg Germany and 2 Technische Universitaumlt Muumlnchen Chair of Genome Oriented Bioinformatics Center of Life and Food Science D-85350 Freising-Weihenstephan Germany To whom correspondence should be addressed at Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 D-85764 Neuherberg Germany Tel +49 89 3187 3580 Fax +49 89 3187 3585 Email wmewesgsfde The genome of 40 Mb encodes 10 000 proteins automatically predicted by the program FGENESH (httpsoftberry com) specifically trained for Neurospora Annual Review of Genomics and Human Genetics Vol 5 15-56 (Volume publication date September 2004) COMPARATIVE GENOMICS Webb Miller Kateryna D Makova Anton Nekrutenko and Ross C Hardison The Center for Comparative Genomics and Bioinformatics The Huck Institutes of Life Sciences and the Departments of Biology Computer Science and Engineering and Biochemistry and Molecular Biology Pennsylvania State University University Park Pennsylvania email webbbxpsuedu kdm16psuedu antonbxpsuedu rch8psuedu These algorithms include Genscan the most popular gene prediction tool (24) GenMark (117) Fgenesh (155) GeneID (144) and others (for an excellent overview DNA and Cell Biology May 2004 Vol 23 No 5 311-324 Harbinger Transposons and an Ancient HARBI1 Gene Derived from a Transposase

Vladimir V Kapitonov Genetic Information Research Institute Mountain View California Jerzy Jurka Genetic Information Research Institute Mountain View California We used FGENESH (Salamov and Solovyev 2000) and GeneScan (Burge and Karlin 1997) for the identification of exons and introns The d N d S ratio Proc Natl Acad Sci U S A 2004 February 17 101(7) 1910ndash1915 Published online 2004 February 5 doi 101073pnas0308430100 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin NamdaggerDagger Joonyul Kimsectpara Shinyoung Leesect Gynheung Ansect Hong Madagger and Masatoshi Neidagger

daggerInstitute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University University Park PA 16802 and sectNational Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Dagger To whom correspondence should be addressed E-mail jyn101psuedu paraPresent address Michigan State UniversityndashDepartment of Energy Plant Research Laboratory and Department of Biochemistry and Molecular Biology Michigan State University East Lansing MI 48824 of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program fgenesh (wwwsoftberrycom Nucleic Acids Research 2004 Vol 32 Database issue D377-D382 BGI-RIS an integrated information resource and comparative analysis workbench for rice genomics Wenming Zhao1 Jing Wang2 Ximiao He1 Xiaobing Huang1 Yongzhi Jiao1 Mingtao Dai1 Shulin Wei1 Jian Fu1 Ye Chen1 Xiaoyu Ren1 Yong Zhang12 Peixiang Ni1 Jianguo Zhang1 Songgang Li12 Jian Wang1 Gane Ka-Shu Wong13 Hongyu Zhao4 Jun Yu1 Huanming Yang1 and Jun Wang1 1 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3 University of Washington Genome Center Department of Medicine Seattle WA 98195 USA and 4 Yale University School of Medicine Department of Epidemiology and Public Health New Haven CT 06520-8034 USA To whom correspondence should be addressed Tel +86 10 80481662 Fax +86 10 80498676 Email wangjgenomicsorgcn Correspondence may also be addressed to Huanming Yang Tel +86 10 80494969 Fax +86 10 80491181 Email yanghmgenomicsorgcn The authors wish it to be known that in their opinion the first four authors should be regarded as joint First Authors The contig sequences were annotated for gene content by using automated processes that involve ab initio gene finders such as FgeneSH (httpwwwsoftberrycom Genome Research 141932-1937 2004 Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome

Jinsheng Lai1 Nrisingha Dey2 Cheol-Soo Kim35 Arvind K Bharti1 Stephen Rudd46 Klaus FX Mayer4 Brian A Larkins3 Philip Becraft2 and Joachim Messing17 1 Waksman Institute Rutgers The State University of New Jersey Piscataway New Jersey 08854 USA 2 Department of Genetics Development amp Cell Biology Iowa State University Ames Iowa 50011 USA 3 Department of Plant Science University of Arizona Tucson Arizona 85721 USA 4 Munich Information Center for Protein Sequences Institute for Bioinformatics GSF Research Center for Environment and Health Neuherberg Germany A total of 54397 putative genes could be predicted for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for Plant Molecular Biology DOI 101023BPLAN000002876821587dc Issue Volume 54 Number 1 Date January 2004 Pages 55 - 69

Dynamics of the evolution of orthologous and paralogous portions of a complex locus region in two genomes of allopolyploid wheat

Xiu-Ying Kong1 2 Yong Qiang Gu3 Frank M You4 Jorge Dubcovsky4 and Olin D Anderson3

1 Genetic Resources Conservation Program University of California Davis CA 95616 USA 2 Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081

China 3 US Dept of Agriculture Western Regional Research Center Agricultural Research Service 800

Buchanan Street Albany CA 94710 USA 4 Department of Agronomy and Range Sciences University of California Davis CA 95616 USA FGENESH (httpwwwsoftberrycomberryphtml) and GENES- CAN (httpgenemark miteduGENESCANhtml) were used for gene prediction Current Genetics DOI 101007s00294-003-0451-y Issue Volume 44 Number 6 Date January 2004 Pages 329 - 338

Chromosome rearrangements in isolates that escape from het-c heterokaryon incompatibility in Neurospora crassa

Qijun Xiang1 and N Louise Glass1

Department of Plant and Microbial Biology University of California Berkeley CA 94720-3102 USA Hypothetical proteins are predicted from FGENESH calls with overlapping Blastx hits (but not with trusted homology) while Predicted Molecular Genetics and Genomics DOI 101007s00438-004-0990-z Issue Volume 271 Number 4 Date May 2004 Pages 402 - 415

Genome-wide identification of NBS genes in japonica rice reveals significant expansion of divergent non-TIR NBS-LRR genes

T Zhou1 Y Wang1 J-Q Chen1 H Araki2 Z Jing1 K Jiang1 J Shen1 and D Tian1

1 State Key Laboratory of Pharmaceutical Biotechnology Department of Biology Nanjing University 210093 Nanjing China

2 Department of Ecology and Evolution University of Chicago Chicago IL 60637 USA to 5000ndash10000 bp from both ends of the hits and then the expanded nucleotide fragments were reannotated using the gene-finding programs FGENESH (http www Proc Natl Acad Sci U S A 2004 June 15 101(24) 9045ndash9050 Genetics

Genetic control of branching in foxtail millet Andrew N Doust Katrien M DevosDaggerdagger Michael D Gadberrysect Mike D GaleDagger and Elizabeth A KelloggDepartment of Biology University of Missouri 8001 Natural Bridge Road St Louis MO 63121 and DaggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH United Kingdom dagger To whom correspondence should be addressed E-mail adoustumsledu sectPresent address Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building University of Georgia Athens GA 30602 Communicated by Peter H Raven Missouri Botanical Garden St Louis MO April 23 2004 Each of these contigs was scanned by using fgenesh (28) and identified ORFs were translated and compared with ORFs from other contigs from the same QTL region Mol Biol Evol 21(9)1769-1780 2004 Merlin a New Superfamily of DNA Transposons Identified in Diverse Animal Genomes and Related to Bacterial IS1016 Insertion Sequences Ceacutedric Feschotte1 Departments of Plant Biology and Genetics The University of Georgia Athens Correspondence E-mail cedricplantbiougaedu coding sequences were assembled by removing introns predicted with more than 85 confidence by NetGene2 (httpwwwcbsdtudk) andor FGENESH (httpgenomic Genome Research 141924ndash1931 copy2004 Gene Loss and Movement in the Maize Genome Jinsheng Lai1 Jianxin Ma23 Zuzana Swigonˇovб1 Wusirika Ramakrishna24

Eric Linton15 Victor Llaca16 Bahattin Tanyolac17 Yong-Jin Park28 O-Young Jeong29

Jeffrey L Bennetzen23 and Joachim Messing110 1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854-8020 USA 2Department of Biological Sciences Purdue University West Lafayette Indiana 47907-1392 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602-7223 USA 4Department of Biological Sciences 740 DOW Michigan Tech University Houghton MI 49931 USA 5Plant Biology Labs Michigan State University East Lansing MI 48824 USA 6Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880-0353 USA 7Department of Bioengineering Ege University Izmir 35100 Turkey 8National Institute of Agricultural Biotechnology Suwon 441-707 Republic of Korea 9National Institute of Crop Science Suwon 441-857 Republic of Korea 10Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 The FGENESH program predicted four of which three (gene 1d in the maize orp1 region gene 5a 5b in the rice r1 region) would produce truncated proteins

Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Molecular Microbiology Volume 54 Issue 2 Page 407 - October 2004 doi101111j1365-2958200404310x Cryptococcus neoformans Kin1 protein kinase homologue identified through a Caenorhabditis elegans screen promotes virulence in mammals Eleftherios Mylonakis1 Alexander Idnurm2 Roberto Moreno1 Joseph El Khoury134 James B Rottman5 Frederick M Ausubel67 Joseph Heitman28910 and Stephen B Calderwood1111Division of Infectious Diseases Massachusetts General Hospital Boston MA 02114 USA 2Department of Molecular Genetics and Microbiology Duke University Medical Center Durham NC 27710 USA 3Center for Immunology and Inflammatory Diseases and 4Division of Rheumatology Allergy and Immunology Massachusetts General Hospital Boston MA 02114 USA 5Archemix Corporation Cambridge MA 02139 USA 6Department of Molecular Biology Massachusetts General Hospital Boston MA 02114 USA 7Department of Genetics Harvard Medical School Boston MA 02115 USA 8Division of Infectious Diseases 9Department of Medicine and 10Howard Hughes Medical Institute Duke University Medical Center Durham NC 27710 USA 11Department of Microbiology and Molecular Genetics Harvard Medical School Boston MA 02115 USA E-mail scalderwoodpartnersorg

Sequences were compared with the H99 genome database at Duke University and genes predicted in these regions by FGENESH software ( http wwwsoftberrycom TAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Original Paper Characterization of soybean genomic features by analysis of its expressed sequence tags Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

1 Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

2 Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China Jin-Song Zhang Email jszhanggeneticsaccnShou-Yi Chen Email sychengeneticsaccn six BAC-contig sequences of M truncatula were analyzed and the results based on the gene prediction program FGENSH (Arabidopsis matchFGENESH prediction) (http Current Proteomics Volume 1 Number 1 January 2004 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE11 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Chinese Science Bulletin 2004 Vol 49 No 4 355-362 The VER2 promoter contains repeated sequences and requires vernalization for its activity in winter wheat (Triticum aestivum L) XU Wenzhong

1 WANG Xin

1 FENG Qi

2 ZHANG Lei

2 LIU Yaoguang

3 HAN Bin

2 CHONG

Kang1 XU Zhihong

1 amp TAN Kehui

1

1 Research Center for Molecular Developmental Biology Key Lab of Photosynthesis and Environmental Molecular Physiology Institute of Botany Chinese Academy of Sciences (CAS) Beijing 100093 China 2 National Center for Gene Research CAS Shanghai 200233 China 3 Genetic Engineering Laboratory College of Life Sciences South China Agricultural University Guangzhou 510642 China Correspondence should be addressed to Chong Kang (e-mail mailtochongknsibcasaccn) Sequence analyses were finished using biological softwares on Internet such as FGENESH 10 (Prediction of potential genes in Plant (Dct) genomic DNA)

TAG Theoretical and Applied Genetics DOI 101007s00122-003-1457-z Issue Volume 108 Number 3 Date February 2004 Pages 392 - 400

Sequence variations of simple sequence repeats on chromosome-4 in two subspecies of the Asian cultivated rice Can Li1 Yu Zhang1 Kai Ying1 Xiaolei Liang1 and Bin Han1

(1) National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China To characterize the possible relationship between SSRs and genes predicted by using FGENESH we investigated the distribution of SSRs in the rice chromosome-4 Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene TAG Theoretical and Applied Genetics DOI 101007s00122-004-1591-2 Issue Volume 108 Number 8 Date May 2004 Pages 1449 - 1457

Positional cloning of the rice Rf-1 gene a restorer of BT-type cytoplasmic male sterility that encodes a mitochondria-targeting PPR protein

H Akagi1 A Nakamura2 Y Yokozeki-Misono2 A Inagaki2 4 H Takahashi1 K Mori1 and T Fujimura3

1 Laboratory of Plant Breeding and Genetics Department of Biological Production Faculty of Bioresource Sciences Akita Prefectural University Kaidoubata-Nishi 241-7 Shimoshinjyo-Nakano 010-0195 Akita Japan

2 Biochemical Technology Section Life Science Laboratory Performance Materials RampD Center Mitsui Chemicals Togo 1144 297-0017 Mobara Japan

3 Institute of Agricultural and Forest Engineering University of Tsukuba Tennoudai 1-1-1 Tsukuba 305-8572 Ibaraki Japan

4 Present address Department of Synthetic Chemistry and Biological Chemistry Graduate School of Engineering Kyoto University Yoshida-Honmachi Sakyo-ku 606-8501 Kyoto japan

H Akagi Email akagiakita-puacjp Software Develop- ment Tokyo) Genomic sequences were also analyzed using gene prediction programs genescan and fgenesh Table 1 DNA Genome Research 14942-950 2004

The Ensembl Automatic Gene Annotation System Val Curwen1 Eduardo Eyras1 T Daniel Andrews1 Laura Clarke1 Emmanuel Mongin2 Steven MJ Searle1 and Michele Clamp34 1 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 EMBL European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SD UK 3 The Broad Institute Cambridge Massachusetts 02141 USA Commonly we use Genscan for ab initio prediction in human mouse and rat but the system is equally applicable to other methods such as FgenesH (Solovyev et Gene 324 (2004) 105ndash115 Transcript abundance of rml1 encoding a putative GT1-like factor in rice is up-regulated by Magnaporthe grisea and down-regulated by light Rong Wang ab Guofan Hongab Bin Hana a

National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China b

Shanghai Institute of Biochemistry and Cell Biology Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 320 Yueyang Road Shanghai 200031 China 1A and 2A) The structure of rml1 _ a is as same as that predicted by FGENESH software The 3V UTR of rml1 _ a is confirmed with the length of 596 bp Genome Biology 2004 5R46 doi101186gb-2004-5-7-r46 Identification of conserved gene structures and carboxy-terminal motifs in the Myb gene family of Arabidopsis and Oryza sativa L ssp Indica Cizhong Jiang1 Xun Gu1 2 and Thomas Peterson1 1Department of Genetics Development and Cell Biology and Department of Agronomy Iowa State University Ames IA 50011 USA 2LHB Center for Bioinformatics and Biological Statistics Iowa State University Ames IA 50011 USA FGeneSH has been used successfully to predict genes in rice [9] and GenScan was used together with it to predict genes by taking rice genomic sequences as Molecular Microbiology 53 (5) 1307-1318 - September 2004 doi 101111 j1365-2958200404215x The sirodesmin biosynthetic gene cluster of the plant pathogenic fungus Leptosphaeria maculans Donald M Gardiner1 Anton J Cozijnsen1 Leanne M Wilson1 M Soledade C Pedras2 and Barbara J Howlett1

1School of Botany The University of Melbourne Victoria Australia 3010 2Department of Chemistry University of Saskatchewan 110 Science Place Saskatoon SK Canada S7N 5C9 E-mail dgardinerpgradunimelbeduau http wwwtigrorg Putative genes were predicted using Fgenesh software at http wwwsoftberrycom Fungal culture The wild type Journal of Molecular Evolution DOI 101007s00239-004-2666-z Issue Volume 59 Number 6 Date December 2004 Pages 761 - 770 Analysis of the Molecular Evolutionary History of the Ascorbate Peroxidase Gene Family Inferences from the Rice Genome Felipe Karam Teixeira1 Larissa Menezes-Benavente1 Rogeacuterio Margis1 2 and Maacutercia Margis-Pinheiro1

(1) Laboratoacuterio de Geneacutetica Molecular Vegetal Departamento de Geneacutetica UFRJ 21944-970 Rio de Janeiro Brasil(2) Departamento de Bioquiacutemica Instituto de Quiacutemica UFRJ 21944-970 Rio de Janeiro Brasil Genomic se- quences were also analyzed in the FGENESH gene structure pre- diction program (httpwwwsoftberrycom) (Solovyev 2001) and GeneMark program (http Incomplete gene structure prediction with almost 100 specificity SL Chin J Xiong T Ioerger SH Sze - 2004 - txspacetamuedu iv Fgenesh-C The proposed gene structure prediction algorithm by far has the best 25 12 Comparison against Est2Genome Sim4 Spidey Fgenesh-c Journal of Biotechnology 109 (2004) 217ndash226

Preparation of single rice chromosome for construction of a DNA library using a laser microbeam trap

Xiaohui Liu a1 Haowei Wang b1 Yinmei Li b Yesheng Tang a Yilei Liu a Xin u a Peixin Jia a Kai Ying a Qi Feng a Jianping Guan a Chaoqing Jin a Lei Zhang a Liren Lou b Zhuan Zhou c Bin Han aA National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200233 China B Department of Physics University of Science and Technology of China Hefei 230026 China C Institute of Neuroscience Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200031 China Corresponding author Fax +86-21-64825775 E-mail address bhanncgraccn (B Han) ers These sequences were further annotated using gene-prediction software FGENESH to give the pos- sible protein-coding region Science 303 1364-1367 Medicago truncatula DMI1 Required for Bacterial and Fungal Symbioses in Legumes Ane et al (2004) 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberry phtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Proc Natl Acad Sci U S A 2004 August 24 101(34) 12404ndash12410 Inaugural Articles Rapid recent growth and divergence of rice nuclear genomes Jianxin Ma and Jeffrey L Bennetzen

Department of Genetics University of Georgia Athens GA 30602 To whom correspondence should be addressed E-mail maizeugaedu Contributed by Jeffrey L Bennetzen May 25 2004 Almost all LTR-retrotransposons including solo LTRs identified in our studies were predicted as genes by the gene-finding program fgenesh (data not shown) The Plant Journal Volume 37 Issue 4 Page 517 -527 - February 2004 doi101046j1365-313X200301976x Xa26 a gene conferring resistance to Xanthomonas oryzae pv oryzae in rice encodes an LRR receptor kinase-like protein Xinli Sun Yinglong Cao Zhifen Yang Caiguo Xu Xianghua Li Shiping Wang and Qifa Zhang National Key Laboratory of Crop Genetic Improvement National Center of Crop Molecular Breeding Huazhong Agricultural University Wuhan 430070 China For correspondence (fax +86 27 87287092 e-mail swangmailhzaueducn) al 1997) Gene prediction programs used were genscan (Burge and Karlin 1997) and fgenesh (httpwwwsoftberrycom) Promoter Genome Research 141916ndash1923 copy2004 by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonˇovaґ16 Jinsheng Lai16 Jianxin Ma23 Wusirika Ramakrishna24

Victor Llaca15 Jeffrey L Bennetzen23 and Joachim Messing17

1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854 USA

2Department of Biological Sciences and Genetics Program West Lafayette Indiana 47907 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602 USA 4Department of Biological Sciences Michigan Tech University MI 49931 USA 5Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880 USA 6These authors contributed equally to this work 7Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 Page 1 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonovaacute 16 Jinsheng Lai 16 Jianxin Ma 23 Wusirika Ramakrishna 24 TAG Theoretical and Applied Genetics DOI 101007s00122-004-1667-z Issue Volume 109 Number 3 Date August 2004 Pages 515 - 522

Physical mapping and putative candidate gene identification of a quantitative trait locus Ctb1 for cold tolerance at the booting stage of rice K Saito1 Y Hayano-Saito1 W Maruyama-Funatsuki1 Y Sato1 and A Kato1

(1) National Agricultural Research Center for Hokkaido Region Hitsujigaoka 1 Toyohira Sapporo Hokkaido 062-8555 JapanK Saito Email kjsaitoaffrcgojp GENSCAN RICEHMM FGENESH MZEF ) a splice prediction program ( SPLICEPREDIC- TOR ) homology search analysis programs ( BLAST HMMER TAG Theoretical and Applied Genetics DOI 101007s00122-004-1697-6 Issue Volume 109 Number 4 Date August 2004 Pages 690 - 699 The anthracnose resistance locus Co-4 of common bean is located on chromosome 3 and contains putative disease resistance-related genes M Melotto1 4 M F Coelho1 A Pedrosa-Harand2 J D Kelly3 and L E A Camargo1

1 Departamento de Fitopatologia Laboratoacuterio de Geneacutetica Molecular ESALQ Universidade de Satildeo Paulo Piracicaba SP CP 9 13418-900 Brazil

2 Department of Cell Biology and Genetics Institute of Botany University of Vienna Rennweg 14 Vienna 1030 Austria

3 Department of Crop and Soil Sciences Michigan State University East Lansing MI 48824 USA 4 Present address MSU-DOE Plant Research Laboratory Michigan State University 206 Plant Biology

Building East Lansing MI 48824 USA M Melotto Email melottommsuedu and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (httpwww softberrycom)mdashusing Arabidopsis as the model or- ganism Journal of Genetics Vol 83 No 1 P 79-99 April 2004 Structural and functional analysis of rice genome Tyagi A K Khurana J P Khurana P Raghuvanshi S Gaur A Kapur A Gupta V Kumar D Ravi V Vij S Khurana P and Sharma S Department of Plant Molecular Biology University of Delhi South Campus Benito Juarez Road New Delhi 110 021 India It inte- grates results from several gene prediction software such as GENSCAN (Burge and Karlin 1997) FGENESH (Sala- mov and Solovyev 2000) RiceHMM (Sakata The Plant Cell 161220-1234 (2004) Comparative Analysis of the Receptor-Like Kinase Family in Arabidopsis and Rice Shin-Han Shiua Wojciech M Karlowskib Runsun Panad Yun-Huei Tzengac Klaus F X Mayerb and Wen-Hsiung Lia1 a Department of Ecology and Evolution University of Chicago Chicago Illinois 60637 b Munich Information Center for Protein SequencesInstitute of Bioinformatics GSF National Research Center for Environment and Health Neuherberg 85764 Germany c Department of Mathematics National Tsing Hua University Hsinchu Taiwan 300 d Institute of Information Science Academia Sinica Taiwan 115

1 To whom correspondence should be addressed E-mail whliuchicagoedu fax 773-702-9740 a permissive E value cutoff of 1 The rice genes from the indica subspecies was predicted using the whole genome shotgun assembly with FGENESH (Solovyev 2002 Genome Research 141474ndash1482 (2004) copy by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Incongruent Patterns of Local and Global Genome Size Evolution in Cotton Corrinne E Grover1 HyeRan Kim2 Rod A Wing2 Andrew H Paterson3 and Jonathan F Wendel14 1Department of Ecology Evolution and Organismal Biology Iowa State University Ames Iowa 50011 USA 2Arizona Genomics Institute University of Arizona Tucson Arizona 85721 USA 3Plant Genome Mapping Laboratory University of Georgia Athens Georgia 30602 USA hellip Potential genes were predicted by three independent programs FGENESH (httpwwwsoftberrycom) Plant Physiology May 2004 Vol 135 pp 459-470 Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four Triticeae Genomes Yong Qiang Gu Devin Coleman-Derr Xiuying Kong and Olin D Anderson United States Department of Agriculture-Agricultural Research Service Western Regional Research Center Albany California 94710 (YQG DC-D ODA) and Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081 China (XK) FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology October 2004 Vol 136 pp 3177-3190 Comparative Sequence Analysis of the Region Harboring the Hardness Locus in Barley and Its Colinear Region in Rice1

Katherine S Caldwell2 Peter Langridge and Wayne Powell Scottish Crop Research Institute Invergowrie Dundee DD2 5DA United Kingdom (KSC WP) and School of Agriculture and Wine (KSC PL) and Australian Centre for Plant Functional Genomics (PL) University of Adelaide Waite Campus Glen Osmond South Australia 5064 Australia jp Sakata et al 2002 ) which couples the integration of several programs for the prediction of open reading frames (GENSCAN RiceHMM FGENESH MZEF) with GENES amp DEVELOPMENT 18687-699 2004 pyramus and thisbe FGF genes that pattern the mesoderm of Drosophila embryos

Angelike Stathopoulos1 Bergin Tam1 Matthew Ronshaugen1 Manfred Frasch2 and Michael Levine13 1 Department of Molecular and Cell Biology Division of Genetics amp Development University of California Berkeley California 94720-3204 USA 2 Brookdale Department of Molecular Cell and Developmental Biology Mount Sinai School of Medicine New York New York 10029 USA hellip FGF protein sequences used in alignment and phylogenetic reconstruction were gathered from GenBank or inferred from genomic sequence using GENESCAN (Burge and Karlin 1997 ) and FGENESHhellip Genome Research 141888-1901 2004 Organization and Evolution of a Gene-Rich Region of the Mouse Genome A 127-Mb Region Deleted in the Del(13)Svea36H Mouse Ann-Marie Mallon14 Laurens Wilming24 Joseph Weekes1 James GR Gilbert2 Jennifer Ashurst2 Sandrine Peyrefitte2 Lucy Matthews2 Matthew Cadman1 Richard McKeone1 Chris A Sellick1 Ruth Arkell1 Marc RM Botcherby3 Mark A Strivens1 R Duncan Campbell3 Simon Gregory25 Paul Denny1 John M Hancock16 Jane Rogers2 and Steve DM Brown1 1 Medical Research Council Mammalian Genetics Unit Harwell Oxfordshire United Kingdom 2 Wellcome Trust Sanger Institute Hinxton Genome Campus United Kingdom 3 Medical Research Council Rosalind Franklin Centre for Genomics Research Hinxton Genome Campus United Kingdom hellip Ab initio gene structures were predicted using FGENESH (Salamov and Solovyev 2000 ) and GENSCANhellip Current Proteomics January 2004 vol 1 no 1 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE1 Affiliations 1 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom

sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene Nucleic Acids Research 2003 Vol 31 No 4 1148-1155 Characterization of Arabidopsis thaliana ortholog of the human breast cancer susceptibility gene 1 AtBRCA1 strongly induced by gamma rays S Lafarge and M-H Montaneacute CEA Cadarache DSV-DEVM Laboratoire de Radiobiologie Veacutegeacutetale Bat 185 F-13108 St Paul Lez Durance Cedex France To whom correspondence should be addressed Tel +33 4 42 25 35 56 Fax + 33 4 42 25 26 25 Email marie-helenemontaneceafr Received November 18 2002 Accepted December 5 2002 DDBJEMBLGenBank accession no AF515728 hellipGene structure prediction was done on software implemented on the Softberry web page (httpwwwsoftberrycom) analysis of protein domains using the SMARThellip hellipThe gene structure of At4g21070 was determined with three gene structure prediction software packages (Softberry GenScan Grail) hellip To resolve this ambiguity in intronndashexon prediction we postulated the presence of two genes given by Softberry prediction software and performed northern blotting and 5 RACE to characterize the structural organization of the At4g21070 locushellip Proc Natl Acad Sci U S A 2003 July 22 100(15) 9055ndash9060 doi 101073pnas1032999100 Plant Biology Gene expression of a gene family in maize based on noncollinear haplotypes Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway NJ 08854-8020 Communicated by Brian A Larkins University of Arizona Tucson AZ May 19 2003 (received for review 2002 April 10) To whom correspondence should be addressed E-mail messingmbclrutgersedu The FGENESH program (Softberry Mount Kisco NY) was used for gene prediction analysis BMC Genomics 2003 4 22

doi 1011861471-2164-4-22 Published online 2003 June 3 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams 12 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA 3Department of Biological Sciences Wichita State University Wichita Kansas USA

Corresponding author Sreedhar Oduru odurusreedharttuhscedu Janee L Campbell janeecampbellttuhscedu SriTulasi Karri phrskttuhscedu William J Hendry williamhendrywichitaedu Shafiq A Khan shafiqkhanttuhscedu Simon C Williams simonwilliamsttuhscedu

Two gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=H Insect Molecular Biology Volume 12 Issue 4 Page 319 - August 2003 doi101046j1365-2583200300415x Expression of an Aedes aegypti cation-chloride cotransporter and its Drosophila homologues V Filippov K Aimanova and S S GillAffiliations Department of Cell Biology and Neuroscience University of California Riverside USA Correspondence Sarjeet S Gill 5429 Boyce Hall Environmental Toxicology Graduate Program University of California Riverside CA 92521 USA Tel +1 909 787 4621 Fax +1 909 787 3087 E-mail Sarjeetgillucredu significant similarity to the Drosophila genes were used for gene structure prediction with the FGENESH program available on site http wwwsoftberrycom Developmental Biology 256 (2003) 276ndash289 tcl-2 encodes a novel protein that acts synergistically with Wnt signaling pathways in C elegans Xiaojun Zhaoa Hitoshi Sawab and Michael A Hermana a Program in Molecular Cellular and Developmental Biology Division of Biology Kansas State University Manhattan KS 66506 USA b Laboratory for Cell Fate Decision RIKEN Center for Developmental Biology 2-2-3 Minatojima-minamimachi Chuo-ku Kobe 650-0047 Japan Received for publication 10 September 2002 revised 25 November 2002 accepted 19 December 2002 hellipCbTCL-2 is conceptually translated from a gene predicted by the FGENSH (Salamov and Solovyev 2000 httpwwwsoftberrycom) using defaults for C elegans genomic sequences Proc Natl Acad Sci U S A 2003 May 27 100(11) 6569ndash6574 doi 101073pnas0732024100 Evolution Molecular paleontology of transposable elements in the Drosophila melanogaster genome Vladimir V Kapitonov and Jerzy Jurka

Genetic Information Research Institute 2081 Landings Drive Mountain View CA 94043 Communicated by Margaret G Kidwell University of Arizona Tucson AZ April 7 2003 (received for review 2002 December 23)

To whom correspondence may be addressed E-mail vladimirulamgirinstorg or jurkagirinstorg hellipWe used FGENESH (ref 18 wwwsoftberrycom) for identifying genes encoded by TEs Genetics and Molecular Biology ISSN 1415-4757 versioacuten impresa Genet Mol Biol v26 n4 Satildeo Paulo dic 2003 Iron homeostasis related genes in rice Jeferson GrossI II Ricardo Joseacute SteinII Arthur Germano Fett-NetoI II Janette Palma FettI II

IUniversidade Federal do Rio Grande do Sul Centro de Biotecnologia Porto Alegre RS Brazil IIUniversidade Federal do Rio Grande do Sul Departamento de Botacircnica Porto Alegre RS Brazil IIIBotanical Institute of Ludwig-Maximilians-Universitaumlt Muumlnchen Germany The prediction algorithms were GenScan (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) GenomeScan (Burge and Karlin 1997 httpgenesmitedugenomescanhtml) FGENESH (Salamov and Solovyev 2000 httpwwwsoftberrycomberryphtmltopic= gfind) GeneMarkhmm (Borodovsky and Lukashin unpublished httpopalbiologygatecheduGeneMarkeukhmmcgi) and GrailEXP (Xu and Uberbacher 1997 httpcompbioornlgovgrailexp) Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome J Lai N Dey CS Kim AK Bharti S Rudd KFX Mayer hellip for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for monocotyledonous genes (httpwwwsoftberrycom) Published 23 September 2003 BMC Plant Biology 2003 36 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya12 Address 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia and 2NSW Agricultural Genomics Centre Wagga Wagga Australia Email Qian-Hao Zhu - qianhaozhucsiroau Mohammad Shamsul Hoque - mohammadhoquecsiroau Elizabeth S Dennis - lizdenniscsiroau Narayana M Upadhyaya - narayanaupadhyayacsiroau Corresponding author hellip Analyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCANhellip

The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Australasian Plant Pathology Volume 32 Number 4 2003 pp 511-519 Small scale functional genomics of the blackleg fungus Leptosphaeria maculans analysis of a 38 kb region Alexander Idnurm Janet L Taylor M Soledade C Pedras and Barbara J Howlett vertebrate and Arabidopsis settings Burge and Karlin 1997) and FGENESH on Neurospora crassa and Schizosaccharomyces pombe settings (wwwsoftberrycom) as Barley Genetics Newsletter Volume 32 Hard-copy edition pages 34 - 37 MAPPING AND SEQUENCING OF THE BARLEY PUTATIVE HYPERSENSITIVE INDUCED REACTION GENES Nils Rostoks1 David Kudrna1 and Andris Kleinhofs12

1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 2 School of Molecular Biosciences Washington State University Pullman WA 99164 The full length coding sequence was reconstructed using a combination of FGENESH gene prediction program (httpwwwsoftberrycom) and alignment with cDNAs from the other barley HIR groups TAG Theoretical and Applied Genetics DOI 101007s00294-003-0391-6 Issue Volume 43 Number 5 Date August 2003 Pages 351 - 357 Characterisation of the mating-type locus of the plant pathogenic ascomycete Leptosphaeria maculans Anton J Cozijnsen A1 and Barbara J Howlett A1 A1 School of Botany The University of Melbourne 3010 Victoria Australia hellipGenes introns exons and transcription initiation sites were predicted by analysis with FGENESH (wwwsoftberrycom) on Neurospora crassa andhellip BMC Plant Biol 2003 3 6

doi 1011861471-2229-3-6 Published online 2003 September 23 Received May 23 2003 Accepted September 23 2003 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya 12 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia 2NSW Agricultural Genomics Centre Wagga Wagga Australia

Corresponding author Qian-Hao Zhu qianhaozhucsiroau Mohammad Shamsul Hoque mohammadhoquecsiroau Elizabeth S Dennis lizdenniscsiroau Narayana M Upadhyaya narayanaupadhyayacsiroau hellipAnalyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml identified a single-exon gene capable of encoding a protein with the DNA binding domain of the EREBPAP2 family of plant transcription factors [2636] 1515 bp downstream from the Ds insertion The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Alignment of EREBPAP2 domains was performed using programs of Genetics Computer Group Wisconsin software suit [11] Genetics Vol 164 655-664 June 2003 Copyright copy 2003 Map-Based Cloning of Leaf Rust Resistance Gene Lr21 From the Large and Polyploid Genome of Bread Wheat Li Huanga Steven A Brooksa Wanlong Lia John P Fellersb Harold N Tricka and Bikram S Gilla a Wheat Genetics Resource Center Department of Plant Pathology Kansas State University Manhattan Kansas 66506-5502 b USDA-ARS Plant Science and Entomology Unit Kansas State University Manhattan Kansas 66506-5502 Corresponding author Bikram S Gill 4024 Throckmorton Kansas State University Manhattan KS 66506-5502 bsgksuedu (E-mail hellipIn addition FGENSH 11 (httpwwwsoftberrycom) was used for gene prediction (with monocot genomic DNA parameters) Nucleic Acids Research 2003 Vol 31 No 1 229-233 The TIGR rice genome annotation resource annotating the rice genome and creating resources for plant biologists Qiaoping Yuan Shu Ouyang Jia Liu Bernard Suh Foo Cheung Razvan Sultana Dan Lee John Quackenbush and C Robin Buell The Institute for Genomic Research 9712 Medical Center Dr Rockville MD 20850 USA To whom correspondence should be addressed Tel +1 301 8383558 Fax +1 301 8380208 Email rbuelltigrorg Received August 14 2002 Revised and Accepted October 2 2002 hellipThe rice sequences were processed with multiple ab initio gene finders including FGENESH (httpwwwsoftberrycom)hellip hellip Working models were generated using the FGENESH output and putative identification for the gene was obtained from the most significant database match while models with no significant database match were labeled as hypothetical proteins

JXB Advance Access originally published online on June 18 2003 Journal of Experimental Botany Vol 54 No 389 pp 1995-1996 August 1 2003 Received 21 April 2003 Accepted 25 April 2003 OsSET1 a novel SET-domain-containing gene from rice Yun-Kuan Liang Ying Wang Yong Zhang Song-Gang Li Xiao-Chun Lu Hong Li Cheng Zou Zhi-Hong Xu and Shu-Nong Bai PKU-Yale Joint Research Center of Agricultural and Plant Molecular Biology National Key Laboratory of Protein Engineering and Plant Gene Engineering College of Life Sciences Peking University 5 Yiheyuan Road Beijing 100871 PR China To whom correspondence should be addressed Fax +86 10 6275 1526 E-mail shunongbpkueducn It localizes at chromosome three in rice genome at the contig 1300 (httpwwwsoftberrycomberryphtmltopic=gfindampprg=FGENESH GenBank accession number BMC Genomics 2003 422 Published 3 June 2003 Received 31 January 2003 Accepted 3 June 2003 This article is available from httpwwwbiomedcentralcom1471-2164422 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams12 Address 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA and 3Department of Biological Sciences Wichita State University Wichita Kansas USA Email Sreedhar Oduru - odurusreedharttuhscedu Janee L Campbell - janeecampbellttuhscedu SriTulasi Karri - phrskttuhscedu William J Hendry - williamhendrywichitaedu Shafiq A Khan - shafiqkhanttuhscedu Simon C Williams - simonwilliamsttuhscedu Corresponding author hellipTwo gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=HsapiensTAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Characterization of soybean genomic features by analysis of its expressed sequence tags

Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

Jin-Song Zhang Email jszhanggeneticsaccn

Shou-Yi Chen Email sychengeneticsaccn Phone +86-10-64886859 Fax +86-10-64873428

(1) Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

(2) Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China

prediction of these BAC-contig sequences was based on the gene-prediction program FGENSH (Arabidopsis matchFGENESH DDT Vol 7 No 11 (Suppl) 2002 S70-S76 wwwdrugdiscoverytodaycom Genome annotation techniques new approaches and challenges Alistair G Rust Emmanuel Mongin and Ewan Birney European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridge UK CB10 1SD tel +44 1223 494420 fax +44 1223 494468 e-mail birneyebiacuk Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Box 1 Useful human genome annotation and browser URLs Human genome browsers bull UCSC Human Genome Browser httpgenomecseucsceducgi-binhgGateway bull Softberry Genome Explorer httpwwwsoftberrycomberryphtmltopic=genomexp Ab initio gene prediction programs Ab initio gene predictors rely on the statistical qualities of exons rather than on homologies Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Celerarsquos pipeline) and GrailEXP [16] (ORNL) Proc Natl Acad Sci U S A 2002 August 20 99(17) 11423ndash11428 doi 101073pnas162276199 Neurobiology Identification of G protein-coupled receptors for Drosophila PRXamide peptides CCAP corazonin and AKH supports a theory of ligand-receptor coevolution Yoonseong Parkdagger Young-Joon Kim and Michael E AdamsdaggerDagger

Departments of Entomology and daggerCell Biology and Neuroscience 5429 Boyce Hall University of California Riverside CA 92521 Edited by Lutz Birnbaumer National Institutes of Health Research Triangle Park NC and approved June 14 2002 (received for review 2002 May 7) DaggerTo whom reprint requests should be addressed E-mail adamsmailucredu

This article has been corrected See Proc Natl Acad Sci U S A 2002 October 15 99(21) 13961b For each Drosophila GPCR prediction of gene structure was made in FGENESH (wwwsoftberrycom ref 21) by using about 20 kb of genomic sequence surrounding highly conserved regions particularly for 5 prime and 3 prime ends of ORFs Putative Drosophila GPCRs in the database were amplified by RT-PCR using primers based on gene predictions in the FGENESH gene finder (wwwsoftberrycom ref 21) 21 Salamov A A amp Solovyev V V (2000) Genome Res 10 516-522

prediction httpwwwsoftberrycomberry

Eukaryotic Cell October 2002 p 719-724 Vol 1 No 5 Isocitrate Lyase Is Essential for Pathogenicity of the Fungus Leptosphaeria maculans to Canola (Brassica napus) Alexander Idnurm and Barbara J Howlett School of Botany The University of Melbourne Melbourne Victoria 3010 Australia Received 17 June 2002 Accepted 29 July 2002 hellip The DNA sequence obtained was compared to those in the GenBank database by using BLAST (1) and genes were predicted by using FGENESH software (httpwwwsoftberrycom) and GENSCAN (wwwbionavigatorcom) Bio-Almanac GLE ToxExpress P Offering CLG Annotated sequence data The genes are identified with the FGENESH11 gene modeling software exclusively li- censed from Softberry Inc Automatic Analysis of 106 kb of contiguous DNA sequence from the D genome of wheat reveals high gene density hellip SA Brooks L Huang BS Gill JP Fellers trix In addition FGENESH 11 (httpwwwsoftberrycom) was used for CDS prediction with monocot genomic DNA parameters Both Molecular Genetics and Genomics DOI 101007s00438-002-0706-1 Issue Volume 267 Number 6 Date August 2002 Pages 713 - 720 Genome sequencing of a 239-kb region of rice chromosome 10L reveals a high frequency of gene duplication and a large chloroplast DNA insertion Q Yuan J Hill J Hsiao K Moffat S Ouyang Z Cheng J Jiang C Buell A1 The Institute for Genomic Research 9712 Medical Center Drive Rockville MD 20850 USA A2 Department of Horticulture University of Wisconsin Madison WI 53706 USA The sequences were analyzed with several gene prediction programs including FGENESH (httpwwwsoftberrycom) Genemarkhmm (rice matrix httpopalbiology

Genetics Vol 162 1389-1400 November 2002 Copyright copy 2002 Different Types and Rates of Genome Evolution Detected by Comparative Sequence Analysis of Orthologous Segments From Four Cereal Genomes Wusirika Ramakrishnaa Jorge Dubcovskyb Yong-Jin Park1a Carlos Bussob John Embertona Phillip SanMiguelc and Jeffrey L Bennetzena a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Department of Agronomy and Range Science University of California Davis California 95616 c Purdue University Genomics Core WSLR Purdue University West Lafayette Indiana 47907

Corresponding author Jeffrey L Bennetzen Hansen Bldg Purdue University West Lafayette IN 47907 maizebilbobiopurdueedu (E-mail)

FGENESH (httpwwwsoftberrycomnucleohtml) with the maize training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and

GeneMarkhmm (httpgenemarkbiologygatecheduGene Mark) Functional amp Integrative Genomics DOI 101007s10142-002-0055-5 Issue Volume 2 Numbers 1-2 Date May 2002 Pages 51 - 59 Genomic sequencing reveals gene content genomic organization and recombination relationships in barley Nils Rostoks Yong-Jin Park Wusirika Ramakrishna Jianxin Ma Arnis Druka Bryan A Shiloff Phillip J SanMiguel Zeyu Jiang Robert Brueggeman Devinder Sandhu Kulvinder Gill Jeffrey L Bennetzen Andris Kleinhofs A1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA A2 Department of Biological Sciences Purdue University West Lafayette IN 47907 USA A3 National Center for Genome Resources 2935 Rodeo Park Drive East Santa Fe NM 87505 USA A4 G302 Agronomy Hall Iowa State University Ames IA 50011-1010 USA A5 Department of Agronomy University of Nebraska Lincoln NE 68583 USA A6 School of Molecular Biosciences and Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA version 10 with maize parameters The FGENESH predictions were run at httpwwwsoftberrycom BAC genomic regions were defined Structural organization of the barley D-hordein locus in comparison with its orthologous regions of hellip YQ Gu OD Anderson CF Londeore X Kong RN hellip et al 1997) to search for additional genes In addition FGENESH (httpwwwsoftberrycomberryphtml) and GENESCAN (httpgenes

Published online before print June 20 2002 101073pnas142284999 PNAS | July 9 2002 | vol 99 | no 14 | 9328-9333 The barley stem rust-resistance gene Rpg1 is a novel disease-resistance gene with homology to receptor kinases R Brueggeman N Rostoks D Kudrna A Kilian F Han J Chensect A Druka B Steffensonpara and A Kleinhofs Department of Crop and Soil Sciences Washington State University Pullman WA 99164-6420 para Department of Plant Pathology 495 Borlaug Hall 1991 Upper Buford Circle St Paul MN 55108-6030 and School of Molecular Biosciences Washington State University Pullman WA 99164-4234 Communicated by Diter von Wettstein Washington State University Pullman WA May 13 2002 (received for review March 25 2002) The gene prediction programs GENSCAN (httpgenesmiteduGENSCANhtml) and FGENESH (httpwwwsoftberrycom) as well as NEURAL NETWORK PROMOTER

PREDICTION (httpwwwfruitflyorgseq_toolspromoterhtml) localized the putative transcription start site of the gene about 400 bp upstream of the translation start site Plant Physiol 2002 December 130(4) 1626ndash1635 doi 101104pp012179 Received July 30 2002 Accepted October 1 2002 Contiguous Genomic DNA Sequence Comprising the 19-kD Zein Gene Family from Maize1

Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway New Jersey 08854ndash8020 Corresponding author e-mail messingmbclrutgersedu fax 732ndash445ndash0072 Draft sequences generated from high-throughput DNA sequencing (phase II) were subjected to gene prediction programs with FGENESH (Softberry Inc Mount Kisco NY) The Plant Cell Vol 14 3213-3223 December 2002 Copyright copy 2002 Received July 22 2002 accepted September 26 2002 Structural Analysis of the Maize Rp1 Complex Reveals Numerous Sites and Unexpected Mechanisms of Local Rearrangement Wusirika Ramakrishnaa John Embertona Matthew Ogdena Phillip SanMiguelb and Jeffrey L Bennetzen1a a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 1 To whom correspondence should be addressed E-mail maizebilbobiopurdueedu fax 765-496-1496 FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCAN Plant Physiol 2002 December 130(4) 1728ndash1738

doi 101104pp014951 Comparative Sequence Analysis of the Sorghum Rph Region and the Maize Rp1 Resistance Gene Complex Wusirika Ramakrishna John Emberton Phillip SanMiguel Matthew Ogden Victor Llaca Joachim Messing and Jeffrey L Bennetzen

Department of Biological Sciences Purdue University West Lafayette Indiana 47907 (WR JE MO JLB) Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 (PSM) and Waksman Institute Rutgers University Piscataway New Jersey 08854 (VL JM) Corresponding author e-mail maizebilbobiopurdueedu fax 765ndash496ndash1496 Received September 19 2002 Accepted October 8 2002 hellipAnnotation and sequence analysis were performed as described earlier (Dubcovsky et al 2001 Song et al 2001 Ramakrishna et al 2002a ) FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and GeneMarkhmm (httpopalbiologygatecheduGeneMarkeukhmmcgi) Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four hellipYQ Gu D Coleman-Derr X Kong OD Anderson FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology 2004 - plantphysiolorg - plantphysiolorg - intlplantphysiolorg - ncbinlmnihgov - all 5 versions raquoA Genome-Wide Screen Identifies Genes Required for Centromeric Cohesion JJ Doyle J Denarie F Debelle JC Prome BB Amor hellip 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberryphtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Various programs Plant Molecular Biology Issue Volume 58 Number 3 Date June 2005 Pages 421 ndash 433 DOI 101007s11103-005-5702-5 OsPPR1 a pentatricopeptide repeat protein of rice is essential for the chloroplast biogenesis Kodiveri M Gothandam1 Eun-Sook Kim1 Hongjoo Cho1 and Yong-Yoon Chung1

(1) School of Life Sciences and Biotechnology Korea University Sungbuk-ku 136-701 Seoul Anam-Dong Korea nucleotide and amino acid sequences were analyzed by the Basic Local Alignment SearchTool (BLAST) and the Soft berry prog- rame (httpwwwsoftberrycom)

Plant Physiol February 2002 Vol 128 pp 336-340 wwwplantphysiolorgcgidoi101104pp010875 Received September 25 2001 returned for revision September 29 2001 accepted November 2 2001 Cellulose Synthase-Like Genes of Rice1

Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 The Rice Genome Research Program cDNA clones were of high quality all but one were viable and accurately annotated The one exception D22177 was chimeric containing OsCSLA2 at one end and a predicted DNA-binding protein at the other For all sequences the corresponding

proteins were deduced using gene prediction software from GeneMark (Atlanta httpopalbiologygatecheduGeneMark) and Softberry Inc (White Plains NY httpwwwsoftberrycom) and by manual alignment with the Arabidopsis Csl proteins and with each other IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated Silby et al Microbiology2004 150 518-520 MW Silby PB Rainey SB Levy Using SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of Plant Cell 2002 September 14(9) 2107ndash2119 Received March 25 2002 Accepted June 14 2002 Two Novel Fungal Virulence Genes Specifically Expressed in Appressoria of the Rice Blast Fungus

Chaoyang Xuea Gyungsoon Parka Woobong Choib Li Zhengc Ralph A Deanb and Jin-Rong Xua1

aDepartment of Botany and Plant Pathology Purdue University West Lafayette Indiana 47907 bDepartment of Plant Pathology North Carolina State University Raleigh North Carolina 27606 cSyngenta Agribusiness Biotechnology Research Inc Research Triangle Park North Carolina 27709 1To whom correspondence should be addressed E-mail xubtnypurdueedu fax 765-494-0363 Approximately 12- and 14-kb upstream sequences of GAS1 and GAS2 were sequenced and analyzed with several programs including TRES (wwwbioportalbicnusedusgtres) Expasy (wwwexpasyorg) and SoftBerry (wwwsoftberrycom) European Journal of Neuroscience January 2002 vol 15 no 1 pp 79-86(8) Characterizing CGI-94 (comparative gene identification-94) which is down-regulated in the hippocampus of early stage Alzheimers disease brain Heese K Nakayama T Hata R Masumura M Akatsu H Li F Nagai Y Yamamoto T Kosaka K Suemoto T Sawada T Additionally protein sequence analysis was performed using the following programs at ExPASy httpwwwexpasych softberry httpwwwsoftberrycomindex Journal of Cellular Biochemistry Volume 91 Issue 5 Pages 1030 - 1042 Published Online 25 Feb 2004 Copyright copy 2004 Wiley-Liss Inc A Wiley Company Received 4 September 2003 Accepted 21 November 2003 Characterizing the new transcription regulator protein p60TRP K Heese 1 T Yamada 1 H Akatsu 2 T Yamamoto 2 K Kosaka 2 Y Nagai 1 T Sawada 1

1BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan 2Choju Medical Institute Fukushimura Hospital 19-14 Aza-Yamanaka Noyori Toyohashi Aichi 441-8124 Japan email K Heese (heeseksilverocnnejp) Correspondence to K Heese BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan expasych) softberry httpwwwsoftberry comindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorg toolsaacomp Genome Research 14929-933 2004 ISSN 1088-9051 $500 The Ensembl Core Software Libraries Arne Stabenau1 Graham McVicker1 Craig Melsopp1 Glenn Proctor1 Michele Clamp2 and Ewan Birney13 1 EMBL European Bioinformatics Institute Wellcome Trust Genome Campus Hinxton CB10 1SD UK 2 The Broad Institute Cambridge Massachusetts 02141-2023 USA

2003 ) Finally there have been several commercial genome management products based on proprietary technology from Softberry Celera and Doubletwist Generation of T-DNA tagging lines with a bidirectional gene trap vector 2 and the establishment of hellip G An annotated in the public databases we undertook 274 annotation with the Softberry program (http 275 wwwsoftberrycomberryphtml) Functional clas- 276 Journal of Bacteriology January 2002 p 183-190 Vol 184 No 1 0021-919301$0400+0 DOI 101128JB1841183-1902002 Received 25 July 2001 Accepted 11 October 2001 Regulation of the acuF Gene Encoding Phosphoenolpyruvate Carboxykinase in the Filamentous Fungus Aspergillus nidulans Michael J Hynes Oliver W Draht and Meryl A Davis Department of Genetics University of Melbourne Parkville Victoria 3010 Australia The Protein Sequence Analysis program (httpwwwsoftberrycomproteinhtml) predicted a PEPCK (ATP) signature sequence between amino acids 275 and 290 Proteins Structure Function and GeneticsVolume 53 Issue S6 Pages 352 - 368 Supplement Fifth Meeting on the Critical Assessment of Techniques for Protein Structure Prediction Published Online 15 Oct 2003 Received 4 March 2003 Accepted 23 June 2003 Digital Object Identifier (DOI) 101002prot10543 Comp Modeling Assessment Assessment of homology-based predictions in CASP5 Anna Tramontano 1 Veronica Morea 2

1Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Rome Italy 2CNR Institute of Molecular Biology and Pathology University of Rome La Sapienza Rome Italy email Anna Tramontano (AnnaTramontanouniromalit) Correspondence to Anna Tramontano Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Ple Aldo Moro 5-00185 Rome Italy hellipPage 1 Assessment of Homology-Based Predictions in CASP5 Anna Tramontano 1 and Veronica Morea 2 1 Department of Biochemical Sciences American Journal of HematologyVolume 73 Issue 3 Pages 161 - 168 Published Online 20 Jun 2003 Received 23 October 2002 Accepted 15 April 2003 Digital Object Identifier (DOI) 101002ajh10358

Spectrum of thalassemia mutations and HbF levels in the heterozygous Moroccan population Wafaa Lemsaddek 1 Isabel Picanccedilo 2 Filomena Seuanes 2 Lahoucine Mahmal 3 Saacircd Benchekroun 3 Mohammed Khattab 4 Paulo Nogueira 5 Leonor Osoacuterio-Almeida 1

1Laboratoacuterio de Geneacutetica Molecular Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa Caparica Portugal 2Laboratoacuterio de Hematologia Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal 3Service Heacutemato-Oncologie Hocircpital 20 Ao t CHU Ibn Rochd Casablanca Morocco 4Service Heacutemato-Oncologie Peacutediatrique Hocircpital dEnfants CHU Rabat Morocco 5Observatoacuterio Nacional de Sauacutede Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal email Leonor Osoacuterio-Almeida (mlofctunlpt) Correspondence to Leonor Osoacuterio-Almeida Laboratoacuterio de Geneacutetica Molecular Secccedilatildeo Autoacutenoma de Biotecnologia Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa 2829-516 Caparica Portugal hellip Page 1 Spectrum of Thalassemia Mutations and HbF Levels in the Heterozygous Moroccan Population Wafaa Lemsaddek 1 Isabel Picanccedilo Lecture Notes in Computer Science Publisher Springer-Verlag Heidelberg ISSN 0302-9743 Subject Computer Science Volume 2812 2003 Title Algorithms in Bioinformatics Third International Workshop WABI 2003 Budapest Hungary September 15-20 2003 Proceedings Editors Gary Benson Roderic Page ISBN 3-540-20076-2 DOI 101007b13243 Chapter pp 124 - 138 Online Date December 2003 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron1 and Irena Rusu1

(1) IRIN Universiteacute de Nantes 2 Rue de la Houssiniegravere BP 92208 44322 Nantes Cedex 3 France Page 1 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron and Irena Rusu Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Received 23 October 2003 accepted 15 December 2003 Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes

H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Brain Aging Vol 2 No 3 2002 Page 9-22 Long-term Efficacy of Cholinesterase Inhibitors Serge Gauthier McGill Centre for Studies in Aging Quebec Canada Correspondence Dr Serge Gauthier FRCPC McGill Centre for Studies in Aging 6825 LaSalle Boulevard Verdun Quebec Canada H4H 1R3 Tel +1 514 766 2010 Fax +1 514 888 4050 Email sergegauthiermcgillca hellipAdditionally protein sequence analysis was performed using the following programs at the ExPASyndashwwwndashserver (httpwwwexpasych) softberry httpwwwsoftberrycomindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorgtoolsaacomp Plant Physiology December 2003 Vol 133 pp 2040ndash2047 Generation and Analysis of End Sequence Database for T-DNA Tagging Lines in Rice1 Suyoung An2 Sunhee Park2 Dong-Hoon Jeong Dong-Yeon Lee Hong-Gyu Kang Jung-Hwa Yu Junghe Hur Sung-Ryul Kim Young-Hea Kim Miok Lee Soonki Han Soo-Jin Kim Jungwon Yang Eunjoo Kim Soo Jin Wi Hoo Sun Chung Jong-Pil Hong Vitnary Choe Hak-Kyung Lee Jung-Hee Choi Jongmin Nam Seong-Ryong Kim Phun-Bum Park Ky Young Park Woo Taek Kim Sunghwa Choe Chin-Bum Lee and Gynheung An National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790ndash784 Korea (SA SP D-HJ D-YL H-GK J-HY JH S-RK Y-HK ML GA) Department of Life Science Sogang University Seoul 121ndash742 Korea (SH S-JK S-RK) Department of Genetic Engineering Suwon University Suwon 445ndash743 Korea (JY EK P-BP) Department of Biology Sunchon National University Sunchon 540ndash742 Korea (SJW KYP) Department of

Biology Yonsei University Seoul 120ndash749 Korea (HSC J-PH WTK) Department of Biology Seoul National University Seoul 151ndash747 Korea (VC SC) Department of Biology Dong-eui University Pusan 614ndash714 Korea (H-KL J-HC C-BL) and Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University 208 Mueller Laboratory University Park Pennsylvania 16802 (JN) hellipIf a particular sequence had not yet been annotated in the public database the sequence surrounding the insertion site was annotated using the Softberry program (httpwwwsoftberrycom) and the GeneMark program (httpopalbiologygatecheduGeneMark) Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie F Moehrlen ndash Page 1 Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie Frank Moumlhrlen Heidelberg 2002 Page 2 INAUGURAL-DISSERTATION zur heidiubuni-heidelbergde

  • FGENES
  • FGENESH
  • FGENESH++
    • Genome Research 15566-576 2005
    • ECgene Genome-based EST clustering and gene modeling for alternative splicing
    • Genome Research 14685-692 2004
    • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
    • Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes
    • Genome Research
      • Computing Center Academia Sinica Taipei 11529 Taiwan
        • FGENESH+
          • Genome Research 14685-692 2004
          • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
            • Computing Center Academia Sinica Taipei 11529 Taiwan
                • FGENESB
                  • Proteorhodopsin genes are distributed among divergent marine bacterial taxa
                  • Different SAR86 subgroups harbour divergent proteorhodopsins
                    • FGENESV
                      • Genome Organization of the SARS-CoV
                        • FGENES-M
                        • BESTORF
                        • PROTCOMP
                          • A proteomic study of the arabidopsis nuclear matrix
                          • Gene prediction in eukaryota
                            • BPROM
                            • SPLICEDB
                              • The evolving roles of alternative splicing
                              • SpliceDB database of canonical and non-canonical mammalian splice sites
                                • SCAN2
                                  • PromH promoters identification using orthologous genomic sequences
                                    • PDISORDER
                                    • SPL
                                    • NSITE
                                    • TSSP
                                    • PLANTPROM
                                      • Plant promoter prediction with confidence estimation
                                      • PlantProm a database of plant promoter sequences
                                        • PROMH
                                        • Other FGENESH
                                          • TAG Theoretical and Applied Genetics
                                            • MIPS analysis and annotation of proteins from whole genomes
                                              • Gene expression of a gene family in maize based on noncollinear haplotypes
                                              • TAG Theoretical and Applied Genetics
                                              • Genome annotation techniques new approaches and challenges
                                                • Various programs
                                                  • Characterizing the new transcription regulator protein p60TRP
                                                  • The Ensembl Core Software Libraries
                                                  • Lecture Notes in Computer Science
                                                  • Long-term Efficacy of Cholinesterase Inhibitors
Page 13: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)

Mark Yandell Adina M Bailey Sima Misra ShengQiang Shu Colin Wiel Martha Evans-Holm Susan E Celniker and Gerald M Rubin para para Howard Hughes Medical Institute and Department of Molecular and Cell Biology University of California Life Sciences Addition Berkeley CA 94720-3200 and paraDepartment of Genome Sciences Lawrence Berkeley National Laboratory One Cyclotron Road Mailstop 64-121 Berkeley CA 94720 genes based on a microarray-based approach that involved hybridizing randomly primed cDNA against probes corresponding to a large set of FGENESH predictions Insect Molecular Biology Volume 14 Issue 2 Page 113 - 119 April 2005 doi101111j1365-2583200400536x Detection and analysis of alternative splicing in the silkworm by aligning expressed sequence tags with the genomic sequence X-F Zha Q-Y Xia P Zhao J Li J Duan Z-L Wang J-F Qian and Z-H Xiang Correspondence Dr Qing-You Xia The Key Sericultural Laboratory of Agricultural Ministry Southwest Agricultural University Chongqing 400716 China Tel +86 23 68250748 fax +86 23 68251128 e-mail xiaqyswaucqcn previously predicted silkworm genes in the genomic sequences by BGF a newly developed program based on GENSCAN (Burge amp Karlin 1997) and Fgenesh (Salamov amp Microbiology 151 (2005) 2199-2207 DOI 101099mic027962-0 Overproduction purification and characterization of FtmPT1 a brevianamide F prenyltransferase from Aspergillus fumigatus Alexander Grundmann and Shu-Ming Li Pharmazeutische Biologie Pharmazeutisches Institut Eberhard-Karls-Universitaumlt Tuumlbingen Auf der Morgenstelle 8 72076 Tuumlbingen Germany Correspondence Shu-Ming Li shumingliuni-tuebingende FGENESH (Softberry Inc httpwwwsoftberrycomberryphtml) and the DNASIS software package (version 21 Hitachi Software Engineering) were used for

FGENESH++ Am J Hum Genet 76652-662 2005 Position Effects Due to Chromosome Breakpoints that Map 900 Kb Upstream and 13 Mb

Downstream of SOX9 in Two Patients with Campomelic Dysplasia

Gopalrao V N Velagaleti12 Gabriel A Bien-Willner3 Jill K Northup1 Lillian H Lockhart2

Judy C Hawkins2 Syed M Jalal6 Marjorie Withers3 James R Lupski345 and

Pawel Stankiewicz3 Departments of 1Pathology and 2Pediatrics University of Texas Medical Branch Galveston

Departments of 3Molecular and Human Genetics and 4Pediatrics Baylor College of Medicine and

5Texas Childrens Hospital Houston and 6Department of Laboratory Medicine and Pathology Mayo Clinic Rochester MN hellipIn an effort to identify possible transcripts that may be responsible for the CD phenotype we

used several gene-prediction programs and identified seven hypothetical transcripts in the region

that spans 100 kb in either direction from the breakpoint on chromosome 17 Ecgenes H17C123061 and H17C123081 SGP genes Chr17_15381 and Ch17_15391 Fgenesh++ gene C17001650 and Genscan genes NT_01064144 and NT_01064145hellip Genome Research 15566-576 2005 ECgene Genome-based EST clustering and gene modeling for alternative splicing Namshin Kim Seokmin Shin and Sanghyuk Lee12 2 13 1 Division of Molecular Life Sciences Ewha Womans University Seoul 120-750 Korea 2 School of Chemistry Seoul National University Seoul 151-747 Korea hellipthe structure of full-length mRNA can be inferred by examining the flanking genomic region especially with the aid of ab initio gene predicting programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh++ (Salamov and Solovyev 2000 )hellip Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA hellip we built a three-way synteny map based on chains of Fgenesh++-predicted (Solovyev 2002 ) exons rather than whole genes hellip Genome Research 14539-548 2004 Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of

Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA hellipThe gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom )hellip Nucleic Acids Research 2003 Vol 31 No 1 207-211 copy 2003 Oxford University Press The PEDANT genome database Dmitrij Frishman Martin Mokrejs Denis Kosykh Gabi Kastenmuumlller Grigory Kolesov Igor Zubrzycki Christian Gruber Birgitta Geier Andreas Kaps Kaj Albermann Andreas Volz Christian Wagner Matthias Fellenberg Klaus Heumann and Hans-Werner Mewes

1 1 1 1 1

1 2 2 2 2

2 2 2 2 13

1 Institute for Bioinformatics GSF - National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 85764 Neueherberg Germany 2 Biomax Informatics AG Lochhamer Straszlige 11 82152 Martinsried Germany 3 Department of Genome-oriented Bioinformatics Wissenschaftszentrum Weihenstephan Technische Universitaumlt Muumlnchen 85350 Freising Germany To whom correspondence should be addressed Tel +49 89 31874201 Fax +49 89 31873585 Email dfrishmangsfde The mouse database contains 20 chromosome contigs with 37 793 genes predicted using the Fgenesh++ software (wwwsoftberrycom) Reprint from Daily Biotech Updates www genengnewscom Vol 22 No 17 October 1 2002 DrugDiscovery Tech NoteAn Enhanced Human-Genome Database Transforming Raw Human Sequence Data Into Useful Information Christine Schuumlller PhD and Andreas Fritz PhD The Softberry analysis results for which Biomax has the exclusive world-wide commercial license contain approximately 40000 genes which agrees well with predictions of the total number of human genes (according to the International Human Genome Sequencing Consortium or IHGSC) hellip For example 50 of the genes in the Biomax Human Genome Database are not found in the Ensembl database These genes (identified by FGENESH++ and Biomax and not found in Ensembl database) comprise the following 6 of genes classified as known genes 50 classified as having some similarity to known genes and 90 of the genes not having similarity to known genes

For human genome applications the FGENESH++ software was first used to map known human genes using sequences available from the Reference Sequence (RefSeq) Project at the Nation al Center for Biotechnology Information (NCBI Bethesda MD wwwncbinlm nihgovLocusLinkrefseqhtml) REFERENCES 1 Salamov AA and Solovyev VVAb initio gene finding in Drosophila genomic DNA Genome Res 10 391ndash7 (2000) Genome Research 14539-548 2004 ISSN 1088-9051 $500 Letter Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes Gregory M Cooper Michael Brudno Eric A Stone Inna Dubchak Serafim Batzoglou and Arend Sidow

1 2 3 5 2

146 1 Department of Genetics Stanford University Stanford California 94305 USA 2 Department of Computer Science Stanford University Stanford California 94305 USA 3 Department of Statistics Stanford University Stanford California 94305 USA 4 Department of Pathology Stanford University Stanford California 94305 USA 5 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA The gene annotations used to classify the constrained elements contain nearly 40000 genes including RefSeq genes and gene predictions they are based on annotations for the human mouse and rat genomes made by Fgenesh++ software developed by Softberry Inc (Solovyev 2002 httpwwwsoftberrycom ) Published online before print June 12 2003 101101gr529803 Genome Research 131765-1774 2003 ISSN 1088-9051 $500 Identification of Promoter Regions in the Human Genome by Using a Retroviral Plasmid Library-Based Functional Reporter Gene Assay Shirin Khambata-Ford15 Yueyi Liu2 Christopher Gleason1 Mark Dickson3 Russ B Altman2 Serafim Batzoglou4 and Richard M Myers136 1 Department of Genetics Stanford University School of Medicine Stanford California 94305 USA 2 Stanford Medical Informatics Stanford University School of Medicine Stanford California 94305 USA 3 Stanford Human Genome Center Stanford University School of Medicine Stanford California 94305 USA 4 Department of Computer Science Stanford University Stanford California 94305 USA hellipOf 858 sequences 9 of GFP+ low clones and 8 of GFP+ high clones aligned to the 2-kb

segment upstream of the transcription start site of a predicted gene in at least two of four data sets of predicted genes from Genscan Ensembl Softberry (Fgenesh++) and Acembly (category

B in Table 1) Cell Vol 110 521ndash529 August 23 2002 Copyright 2002 by Cell Press HIV-1 Integration in the Human Genome Favors Active Genes and Local Hotspots

Astrid RW Schrouml der1 Paul Shinn2 Huaming Chen2 Charles Berry3 Joseph R Ecker2 and Frederic Bushman14 1Infectious Disease Laboratory 2Genomic Analysis Laboratory The Salk Institute 10010 North Torrey Pines Road La Jolla California 92037 3Department of FamilyPreventive Medicine School of Medicine University of California San Diego San Diego California 92093 hellipAn integration target sequence was scored as a part of a transcrip-tion unit if it was (1) a member of the Refseq set of well-studied genes (httpwwwncbinlmnihgovLocusLinkrefseqhtml) or (2) if it was predicted to be a transcription unit by the ENSEMBLE (httpwwwensemblorg) or Fgenesh++ (httpwwwsoftberrycomHelpfgeneshplus2htm) programs and if that assignment was supported by mRNA or spliced EST sequence evidence Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation

FGENESH+ Genome Research 14685-692 2004 Automated Whole-Genome Multiple Alignment of Rat Mouse and Human Michael Brudno Alexander Poliakov Asaf Salamov Gregory M Cooper Arend Sidow Edward M Rubin Victor Solovyev Serafim Batzoglou

1 2 34 5 56

23 34 17 and Inna Dubchak237 1 Department of Computer Science Stanford University Stanford California 94305 USA 2 Genomics Division Lawrence Berkeley National Laboratory Berkeley California 94720 USA 3 US Department of Energy Joint Genome Institute Walnut Creek California 94598 USA 4 Softberry Inc Mount Kisco New York 10549 USA 5 Department of Genetics Stanford University Stanford California 94305-5324 USA 6 Department of Pathology Stanford University Stanford California 94305-5324 USA Fgenesh+ gene prediction is conducted on sequences with protein homology Genome Research 13313ndash322 copy2003 Received March 27 2002 accepted in revised form December 3 2002 Article and publication are at httpwwwgenomeorgcgidoi101101gr313703 A Complexity Reduction Algorithm for Analysis and Annotation of Large Genomic Sequences Trees-Juen Chuang1 Wen-Chang Lin1 Hurng-Chun Lee2 Chi-Wei Wang2 Keh-Lin Hsiao2 Zi-Hao Wang2 Danny Shieh2 Simon C Lin2 and Lan-Yang Chrsquoang13

1Bioinformatics Research Center Institute of Biomedical Sciences Academia Sinica Taipei 11529 Taiwan 2Academia Sinica Computing Center Academia Sinica Taipei 11529 Taiwan 3Corresponding author E-MAIL lychangibmssinicaedutw FAX886-2-27858594 hellipNumerous ab initio prediction programs have been used extensively in genome annotation including FGENESH (Solovyev et al 1995 Salamov and Solovyev 2000) hellipSuccessful implementation of this method includes AAT (Huang et al 1997) FGENESH+ and FGENESH++ (Salamov and Solovyev 2000)hellip Among these programs FGENESH+ (and FGENESH++) GenomeScan GeneWise and Procrustes are combined tools of sequence homology and ab initio annotation Annual Review of Genomics and Human Genetics Vol 3 293-310 (Volume publication date September 2002) (doi101146annurevgenom3030502101529) DATABASES AND TOOLS FOR BROWSING GENOMES Ewan Birney 1 Michele Clamp and 2 Tim Hubbard2 1European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom e-mail birneyebiacuk 2Wellcome Trust Sanger Institute Wellcome Trust Genome Campus Hinxton Cambridgeshire CB10 1SA United Kingdom michelesangeracuk thsangeracuk

Another predicted gene track on the UCSC browser comes from Softberry ( http wwwsoftberrycom ) and uses a program Fgenesh+ which is based on HMMs and

FGENESB Appl Environ Microbiol 2004 April 70(4) 2332ndash2341 Oxygen-Controlled Bacterial Growth in the Sponge Suberites domuncula toward a Molecular Understanding of the Symbiotic Relationships between Sponge and Bacteriadagger

Werner E G Muumlller Vladislav A Grebenjuk Narsinh L Thakur Archana N Thakur Renato Batel Anatoli Krasko Isabel M Muumlller and Hans J Breter Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz D-55099 Mainz Germany Corresponding author Mailing address Institut fuumlr Physiologische Chemie Abteilung Angewandte Molekularbiologie Universitaumlt Mainz Duesbergweg 6 55099 Mainz Germany Phone 6131-3925910 Fax 6131-3925243 E-mail wmuellermailuni-mainzde For genes and potential promoter prediction we used the FGENESB-PatternMarkov chain-based bacterial operon and gene prediction program from the SoftBerry Journal of Theoretical Biology 230 (2004) 133ndash144 Computational prediction of conserved operons and phylogenetic footprinting of transcription regulatory elements in the metal-reducing bacterial family Geobacteraceae Bin Yana Barbara A Metheґ b Derek R Lovleyc Julia Krushkala aDepartment of Preventive Medicine Center of Genomics and Bioinformatics University of Tennesee Health Science Center 66 N Pauline St Ste 633 Memphis TN 38163 USA bThe Institute for Genomic Research Rockville MD USA cDepartment of Microbiology Morrill Science Center IV North University of Massachusetts 639 North Pleasant Str Amherst MA 01003 USA the conserved nature of the operons 2 Operons in Geobacter sulfurreducens were predicted ab initio by the public version of program FGENESB (V Solovyev and V Molecular Microbiology Volume 52 Issue 6 Page 1579 -1596 June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Copyright copy 2003 The National Academy of Sciences Proc Natl Acad Sci U S A 2003 October 28 100(22) 12830ndash12835 doi 101073pnas2133554100 Published online 2003 October 17 Evolution Proteorhodopsin genes are distributed among divergent marine bacterial taxa Joseacute R de la TorredaggerDagger Lynne M Christiansondagger Oded Beacutejagravedaggersect Marcelino T Suzukidaggerpara David M Karl John Heidelberg and Edward F DeLongdaggerdaggerdagger

daggerMonterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 sectDepartment of Biology Technion-Israel Institute of Technology Haifa 32000 Israel

paraChesapeake Biological Laboratory University of Maryland Solomons MD 20688 Department of Oceanography University of Hawaii Manoa HI 96822 and Institute for Genomic Research Rockville MD 20850 Edited by Sallie W Chisholm Massachusetts Institute of Technology Cambridge MA and approved August 21 2003 (received for review 2003 June 10) DaggerPresent address Department of Civil and Environmental Engineering University of Washington Seattle WA 98195 daggerdagger To whom correspondence should be addressed E-mail delongmbariorg hellip Analysis of the potential genes and protein-coding regions was performed by using a combination of the BLAST (11) GLIMMER 202 (TIGR) (12 13) FGENESB (Softberry Mount Kisco NY) and ARTEMIS (Sanger Center Cambridge University UK) (14) software packages Environmental Microbiology September 2004 vol 6 no 9 pp 903-910(8) DOI 101111j1462-2920200400676x Different SAR86 subgroups harbour divergent proteorhodopsins Gazalah Sabehi1 Oded Beacutejagrave1 Marcelino T Suzuki2 Christina M Preston3 Edward F DeLong4

Affiliations 1 Department of Biology Technion-Israel Institute of Technology Haifa 32000 Israel 2 Chesapeake Biological Laboratory University of Maryland Center for Environmental Sciences Solomons MD 20688 USA 3 Monterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing CA 95039 USA 4 Massachusetts Institute of Technology Cambridge MA 02139 USA

program FGENESB (Softberry) and the annotation was subsequently refined and curated manually using ARTEMIS (Sanger Center) Fig

FGENESV Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song Qi Wei Qin Jin Qiu Can Hua Huang Fan Wang and Choy Leong Hew1 2 1 1 1 1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2

A total of 162 ORFs predicted by the FGENESV program (available through httpwwwsoftberrycom) supplemented with Vector NTI suite 71 are indicated doi101023BVIRU000002577148128f8 Virus Genes 28 (3) 239-246 April 2004 Article ID 5269250 Complete Nucleotide Sequence of a Strawberry Isolate of Beet Pseudoyellows Virus Ioannis E Tzanetakis Molecular and Cellular Biology Program Department of Botany and Plant Pathology Oregon State University Corvallis 97331 USA Robert R Martin Horticultural Crops Research Laboratory USDA-ARS Corvallis OR 97330 USA E-mail martinrrscienceoregonstateedu httpwwwncbinlmnih govgorfgorfhtml) and the gene finder in viruses at httpwwwsoftberrycom The amino acid comparisons Geno Prot amp Bioinfo Vol 1 No 3 August 2003 226-235 Genome Organization of the SARS-CoV Jing Xu1 Jianfei Hu21 Jing Wang21 Yujun Han1 Yongwu Hu13 Jie Wen1 Yan Li1 Jia Ji1 Jia Ye14 Zizhang Zhang5 Wei Wei4 Songgang Li12 Jun Wang1 Jian Wang14 Jun Yu14 and Huanming Yang14 1 Beijing Genomics Institute Chinese Academy of Sciences Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3Wenzhou Medical College Wenzhou 325003 China 4 James D Watson Institute of Genome Sciences Zhijiang Campus Zhejiang University and Hangzhou Genomics Institute Hangzhou 310008 China 5 College of Materials Science and Chemical Engineering Yuquan Campus Zhejiang University Hangzhou 310027 China These authors contributed equally to this work Corresponding authors E-mail junyugenomicsorgcn yanghmgenomicsorgcn hellipFGENESV a program for gene prediction provided by Softberry Inc (Mount Kisco USA) through a web-based interface has been specially modimacred and trained with parameters for virus (httpwwwsoftberrycomberryphtmltopic= gfindv) hellip

The hypothetical minus sense ORF iden-timacred by FGENESV (from 48 to 203 nt on the minus strand or 29523 to 29678 nt on the plus strand) may be fake but we should not absolutely deny the prob-ability of the existence of minus ORFs hellipFurthermore we employed FGENESV to explore the sequences of MHV (NC 001846 in NCBI) and AIBV (NC 001451 in NCBI) and compared the re-sults with their previous annotations respectively Rapport de stage de DEA Juin 2003 Analyse du geacutenome du virus de lrsquoarcheacutee Pyrococcus abyssi (PAV1) ROUAULT Karen Laboratoire de Microbiologie et Biotechnologie des Extrecircmophiles IFREMER- Centre de Brest et Equipe Microbiologie LEMAR ndash Institut Universitaire Europeacuteen de la Mer [14] FGENESV httpwwwsoftberrycomberry phtmltopic=gfindv Virus ( gt10 kb) Modegraveles de Markov Forme du geacutenome Code geacuteneacutetique [40]

FGENES-M BMC Bioinformatics 2005 6 25 Published online 2005 February 10 doi 1011861471-2105-6-25 Integrating alternative splicing detection into gene prediction Sylvain Foissac 1 and Thomas Schiex11Uniteacute de Biomeacutetrie et Intelligence Artificielle INRA 31326 Castanet Tolosan France

Corresponding author Sylvain Foissac foissactoulouseinrafr Thomas Schiex tschiextoulouseinrafrReceived July 27 2004 Accepted February 10 2005 This approach has been applied eg in HMMgene or in FGENES-M (unpub)hellip DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39 - 43 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom) DNA Sequence - The Journal of Sequencing and Mapping Volume 15 Number 1 February 2004 39-43 DOI 1010801042517032000160189 Isolation and Characterization of Two cDNAs Encoding Translation Initiation Factor 1A from Rice(Oryza sativa L) Ji Huang Jianfei Wang Shengping Qiu Hongsheng Zhang State Key Laboratory of Crop Genetics and Germplasm Enhancement Rice Research Institute Nanjing Agricultural University 210095 Nanjing Peoples Republic of China Then the ORF without introns was assembled based on the sequence of each scaffold using the program FGENES-M (httpwww softberrycom)

BESTORF Eukaryotic Cell March 2005 p 526-535 Vol 4 No 3 1535-977805$0800+0 doi101128EC43526-5352005 Sex-Specific Homeodomain Proteins Sxi1 and Sxi2a Coordinately Regulate Sexual Development in Cryptococcus neoformans Christina M Hull1 Marie-Josee Boily and Joseph Heitman1 12 Department of Molecular Genetics and Microbiology1 the Howard Hughes Medical Institute Duke University Medical Center Durham North Carolina2 Received 14 January 2005 Accepted 14 January 2005 Corresponding author Mailing address 322 CARL Building Box 3546 Department of Molecular Genetics and Microbiology 100 Research Dr Duke University Medical Center Durham NC 27710 Phone (919) 684-2824 Fax (919) 684-5458 E-mail heitm001dukeeduPresent address Departments of Biomolecular Chemistry and Medical Microbiology amp

Immunology University of WisconsinmdashMadison Madison WI 53706 Sequence manipulations Splice predictions of candidate gene sequences for SXI2a were facilitated with a Softberry algorithm (wwwsoftberrycom) hellipWe utilized the BESTORF gene prediction algorithm from Softberry Inc to electronically produce predicted spliced cDNA products encoded by a 10-kb regionhellip

PROTCOMP Genes and Immunity 2005 v5 n4 - naturecom Immune response in silico(IRIS) immune-specific genes identified from a compendium of microarray hellip AR Abbas D Baldwin Y Ma W Ouyang A Gurney F hellip The Protcomp algorithm (Softberry Inc) predicts for the 1589 IRIS genes with ORFs that 24 of the encoded proteins are in the plasma membrane 13 are MPMI Vol 17 No 7 2004 pp 789ndash797 Publication no M-2004-0426-01R copy 2004 The American Phytopathological Society Lotus japonicus LjKUP Is Induced Late During Nodule Development and Encodes a Potassium Transporter of the Plasma Membrane Guilhem Desbrosses Claudia Kopka Thomas Ott and Michael K Udvardi Max Planck Institute of Molecular Plant Physiology Am Muumlhlenberg 1 14476 Golm Germany Submitted 3 November 2003 Accepted 13 February 2004 hellipBoth PSORT and Protcomp predicted a PM location for LjKUPhellip Planta DOI 101007s00425-003-1182-5 Issue Volume 218 Number 6 Date April 2004 Pages 965 - 975 Biochemical and immunological characterization of pea nuclear intermediate filament proteins Sonal S D Blumenthal1 Gregory B Clark1 and Stanley J Roux1

(1) School of Biological Sciences Section of Molecular Cell and Developmental Biology The University of Texas Austin TX 78712 USA Stanley J Roux Email srouxutsccutexasedu html) BCM Search Launcher (Protein structure prediction http searchlauncher bcmtmcedu) SoftBerry (Protein subcellular localization Comparative and Functional Genomics Volume 5 Issue 4 Pages 342 - 353Published Online 20 May 2004 Research Paper Investigation into the use of C- and N-terminal GFP fusion proteins for subcellular localization studies using reverse transfection microarrays Ella Palmer Tom Freeman

MRC Rosalind Franklin Centre for Genomics Research (formerly the HGMP-Resource Centre) Genome Campus Hinxton Cambridge CB10 1SB UK email Tom Freeman (tfreemanrfcgrmrcacuk) Correspondence to Tom Freeman RFCGR Hinxton CambridgeCB10 1SB UK

ProtComp version 4 (Softberry) combines results with proteins of known subcellular localization and assumed subcellular localization (based on theoret- ical Plant Physiol2004 134 286-295 RHM2 Is Involved in Mucilage Pectin Synthesis and Is Required for the Development Usadel et al Tentative subcellular localization prediction by TargetP (Emanuelsson et al 2000 ) or ProtComp (httpwwwsoftberrycom) a prediction software trained on Journal of Cellular BiochemistryVolume 90 Issue 2 Pages 361 - 378Published Online 3 Sep 2003 A proteomic study of the arabidopsis nuclear matrix Tomasz T Calikowski 1 3 Tea Meulia 2 Iris Meier 1

1Department of Plant Biology and Plant Biotechnology Center Ohio State University Columbus Ohio 43210 2Molecular and Cellular Imaging Center Ohio Agricultural and Research Development Center Ohio State University Columbus Ohio 43210 3Institute of Biochemistry and Biophysics Polish Academy of Sciences UL Pawinskiego 5A 02-106 Warszawa Poland email Iris Meier (meier56osuedu)Correspondence to Iris Meier Department of Plant Biology and Plant Biotechnology Center Ohio State University 244 Rightmire Hall 1060 Carmack Rd Columbus OH 43210 For prediction of subcellular localization ProtComp 4 (Softberry Inc Mount Kisco NY httpwwwsoftberrycomberryphtmltopicfrac14 proteinloc) PSORT v64 Cellular Molecular Life Sciences 2003 in press Automatic prediction of protein function Burkhard Rost 1 2 3 Jinfeng Liu 1 3 4 Rajesh Nair 1 5 Kazimierz O Wrzeszczynski 1 and Yanay Ofran 16

1 CUBIC Dept of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 2 Columbia University Center for Computational Biology and Bioinformatics (C2B2) Russ Berrie Pavilion 1150 St Nicholas Avenue New York NY 10032 USA

3 North East Structural Genomics Consortium (NESG) Department of Biochemistry and Molecular Biophysics Columbia University 650 West 168th Street BB217 New York NY 10032 USA 4 Dept of Pharmacology Columbia Univ 630 West 168th Street New York NY 10032 USA 5 Dept of Physics Columbia Univ 538 West 120th Street New York NY 10027 USA 6 Dept of Medical Informatics Columbia Univ 630 West 168th Street New York NY 10032 USA Corresponding author cubiccubicbioccolumbiaedu URL httpcubicbioccolumbiaedu Tel +1-212-305-4018 fax +1-212-305-7932

genomelocalize ProtComp predict localization for plants httpwwwsoftberrycomberryphtmltopic=proteinloc Predotar predict Published online before print September 15 2003 101101gr1293003 Genome Research 132265-2270 2003 The Secreted Protein Discovery Initiative (SPDI) a Large-Scale Effort to Identify Novel Human Secreted and Transmembrane Proteins A Bioinformatics Assessment Hilary F Clark1 Austin L Gurney Evangeline Abaya Kevin Baker Daryl Baldwin Jennifer Brush Jian Chen Bernard Chow Clarissa Chui Craig Crowley Bridget Currell Bethanne Deuel Patrick Dowd Dan Eaton Jessica Foster Christopher Grimaldi Qimin Gu Philip E Hass Sherry Heldens Arthur Huang Hok Seon Kim Laura Klimowski Yisheng Jin Stephanie Johnson James Lee Lhney Lewis Dongzhou Liao Melanie Mark Edward Robbie Celina Sanchez Jill Schoenfeld Somasekar Seshagiri Laura Simmons Jennifer Singh Victoria Smith Jeremy Stinson Alicia Vagts Richard Vandlen Colin Watanabe David Wieand Kathryn Woods Ming-Hong Xie Daniel Yansura Sothy Yi Guoying Yu Jean Yuan Min Zhang Zemin Zhang Audrey Goddard William I Wood and Paul Godowski Departments of Bioinformatics Molecular Biology and Protein Chemistry Genentech Inc South San Francisco California 94080 USA 1 Corresponding author E-MAIL hclarkgenecomFAX (650) 225-5389 An automated computational strategy was utilized to query each protein translation with the Signal Sensor Sighmm Tmdetect (T Wu unpubl) hmmpfam (Eddy 1998 ) and Protcomp (Softberry Inc) algorithms hellipThe Protcomp algorithm predicts the subcellular localization of a protein on the basis of homology to well-annotated proteins a neural net and various protein motifs In this case the Protcomp subcellular localization prediction was used to categorize these genes as Other Secreted Other Transmembrane or Other Cytoplasmic or Nuclear Plant Physiol February 2002 Vol 128 pp 336-340 Gene prediction in eukaryota Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 genomelocalize ProtComp predict localization for plants httpwww softberrycomberryphtmltopic=proteinloc Predotar predict Proc Natl Acad Sci U S A 2001 April 24 98(9) 5341ndash5346 doi 101073pnas101534498 Published online 2001 April 17 Plant Biology The Cia5 gene controls formation of the carbon concentrating mechanism in Chlamydomonas reinhardtii Youbin Xiang Jun Zhang and Donald P Weeks

Department of Biochemistry and School of Biological Sciences University of Nebraska Lincoln NE 68588-0664

Edited by Bob B Buchanan University of California Berkeley CA and approved March 14 2001 (received for review 2000 November 8) To whom reprint requests should be addressed E-mail dweeks1unledu hellipComputer-assisted analysis of the CIA5 aa sequence (PROTCOMP version 4 httpwwwsoftberrycom) predicted a nuclear localization of the protein hellipFinally computer program predictions (eg PROTCOMP version 4 httpwwwsoftberrycom) for a nuclear localization of CIA5 and the clear-cut nuclear localization of CIA5 in onion epidermal cells (Fig 3) provide additional weight to the argument that CIA5 may be a transcription factor Dissertation zur Erlangung des akademischen Grades Dr rer nat der Fakultaumlt der Naturwissenschaften der Universitaumlt Ulm Untersuchungen zur Identifizierung von Faktoren und Mechanismen der mRNA 3 Prozessierung und Degradation in Chloroplasten houmlherer Pflanzen vorgelegt von Michael Walter aus Immenstadt i Allgaumlu Abteilung Molekulare Botanik Universitaumlt Ulm Ulm November 2001 Tag der Promotion 19 Feb 2002 Algorithmen zur Vorhersage der subzellulaumlren Lokalisation - PSORT httppsortnibbacjp8800formhtml (Nakai und Kanehisa 1992) - ChloroP httpwwwcbsdtudkservicesChloroP (Emanuelsson et al 2000) - TargetP httpwwwcbsdtudkservicesTargetP (Emanuelsson et al 2000) - Predotar httpwwwinrafrInternetProduitsPredotar - Softberry httpwwwsoftberrycom

BPROM Extremophiles Issue Volume 9 Number 2 Date April 2005 Pages 99 ndash 109 DOI 101007s00792-004-0425-0

The genome of BCJA1c a bacteriophage active against the alkaliphilic bacterium Bacillus clarkii

Andrew M Kropinski1 Melissa Hayward1 M Dorothy Agnew1 and Ken F Jarrell1

(1) Department of Microbiology and Immunology Queens University Kingston ON K7L 3N6 Canada

al 2002) Promoters were predicted using Softberryrsquos BPROM program at httpwwwsoftberry comberry phtmltopic=promoter

Journal of Bacteriology February 2005 p 1091-1104 Vol 187 No 3 0021-919305$0800+0 doi101128JB18731091-11042005

The Generalized Transducing Salmonella Bacteriophage ES18 Complete Genome Sequence and DNA Packaging Strategy

Sherwood R Casjens12 Eddie B Gilcrease1 Danella A Winn-Stapley1 Petra Schicklmaier3 Horst Schmieger3 Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24 Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah1 Department of Biological Sciences4 Pittsburgh Bacteriophage Institute University of Pittsburgh Pittsburgh Pennsylvania 2 Institut fuumlr Genetik und Mikrobiologie Universitaumlt Muumlnchen Munich Germany3 Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu Present address Biology Department MIT Cambridge MA 02139 Present address Biogen Idec GmbH D-85737 Ismaning Germany

Received 1 September 2004 Accepted 3 November 2004

The DNA sequence analysis software used was DNA Strider (24) GeneMark (5) Staden programs (78) BLAST (2) BPROM (httpwwwsoftberrycomberryphtmltopic

Infection and Immunity May 2005 p 2899-2909 Vol 73 No 5 Characterization of the Major Secreted Zinc Metalloprotease- Dependent GlycerophospholipidCholesterol Acyltransferase PlaC of Legionella pneumophila Sangeeta Banerji1 Mayte Bewersdorff1 Bjoumlrn Hermes1 Nicholas P Cianciotto2 and Antje Flieger1 Robert Koch-Institut Berlin Germany1 Department of Microbiology-Immunology Northwestern University Medical School Chicago Illinois2 Received 25 October 2004 Returned for modification 18 November 2004 Accepted 22 December 2004

Corresponding author Mailing address Robert Koch-Institut Research Group NG5 Pathogenesis of Legionella Infections Nordufer 20 D-13353 Berlin Germany Phone 49-30-4547-2522 Fax 49-30-4547-2328 E-mail fliegerarkide MB and BH contributed equally to this work

legion) (12) Nucleotide sequences were also analyzed for promoters using the web-based program BPROM (wwwsoftberrycom) Sequence Journal of Bacteriology April 2005 p 2458-2468 Vol 187 No 7 The Type III-Dependent Hrp Pilus Is Required for Productive Interaction of Xanthomonas campestris pv vesicatoria with Pepper Host Plants Ernst Weber1 Tuula Ojanen-Reuhs2 Elisabeth Huguet3 Gerd Hause4 Martin Romantschuk2 Timo K Korhonen2 Ulla Bonas13 and Ralf Koebnik1 Institute of Genetics1 Biozentrum Martin Luther University Halle Germany4 General Microbiology Faculty of Biosciences University of Helsinki Helsinki Finland2 Institut des Sciences Veacutegeacutetales CNRS Gif-sur-Yvette France3 Received 10 November 2004 Accepted 28 December 2004 Corresponding author Mailing address Martin-Luther-Universitaumlt Institut fuumlr Genetik Weinbergweg 10 D-06120 Halle (Saale) Germany Phone 49 345 5526293 Fax 49 345 5527151 E-mail koebnikgmxde Present address Purdue University Department of Food Sciences West Lafayette IN 47907 Present address Institut de Recherche sur la Biologie de lInsecte UMR CNRS 6035 Faculteacute

des Sciences F-37200 Tours France Present address University of Helsinki Department of Ecological and Environmental Sciences

FIN-15140 Lahti Finland

The promoter recognition program BPROM (Softberry Inc Mt Kisco NY) was used for prediction of bacterial sigma70 promoter motifs RESULTS

J Bacteriol 2004 September 186(17) 5945ndash5949 doi 101128JB186175945-59492004 Identification of Operators and Promoters That Control SXT Conjugative Transfer John W Beaber and Matthew K Waldor

Department of Microbiology Tufts University School of Medicine and Howard Hughes Medical Institute Boston Massachusetts Corresponding author Mailing address Tufts University School of Medicine 136 Harrison Ave Jaharis 425 Boston MA 02111 Phone (617) 636-2730 Fax (617) 636-2723 E-mail matthewwaldortuftsedu Received April 1 2004 Accepted May 24 2004 hellipComputer algorithms and 5prime random amplification of cDNA ends (RACE) were used to define the setR and s086 transcription start sites Software for the identification of bacterial promoters (httpwwwsoftberrycomberryphtmltopic=bpromampgroup=programsampsubgroup=gfindb) identified putative minus10 and minus35 elements for both PL and PR (Fig 2) (23 24)hellip JOURNAL OF BACTERIOLOGY Mar 2004 p 1818ndash1832 Vol 186 No 6

The pKO2 Linear Plasmid Prophage of Klebsiella oxytoca Sherwood R Casjens12 Eddie B Gilcrease1 Wai Mun Huang1 Kim L Bunny3

Marisa L Pedulla24 Michael E Ford24 Jennifer M Houtz24

Graham F Hatfull24 and Roger W Hendrix24 Department of Pathology University of Utah Medical School Salt Lake City Utah 841321 Pittsburgh Bacteriophage Institute2 and Department of Biological Sciences4 University of Pittsburgh Pittsburgh Pennsylvania 15260 and Section of Microbiology University of California at Davis Davis California 956163

Corresponding author Mailing address Department of Pathology University of Utah Medical School Salt Lake City UT 84132 Phone (801) 581-5980 Fax (801) 581-3607 E-mail sherwoodcasjenspathutahedu hellipThe DNA sequence analysis software packages used were DNA Strider (27) GeneMark (8) the Staden programs (94) BLAST (3) BPROM httpwwwsoftberrycomberryphtmltopic_gfindb) and DNA Master (J Lawrence [httpcobamide2biopittedu])hellip

BMC Microbiology 2004 44 Analysis of the lambdoid prophage element e14 in the E coli K-12 genome Preeti Mehta1 Sherwood Casjens2 and Sankaran Krishnaswamy1 Address 1Bioinformatics Centre School of Biotechnology Madurai Kamaraj University Madurai-625021 India and 2University of Utah Medical School Department of Pathology 90 North 1900 East Salt Lake City UT 84132-2501 USA Email Preeti Mehta - mehta_p74yahoocom Sherwood Casjens - sherwoodcasjenspathutahedu Sankaran Krishnaswamy - krishnamrnatnnicin Corresponding author This article is available from httpwwwbiomedcentralcom1471-218044 Putative promoters predicted using BPROM available at the website http wwwsoftberrycom Scores are as given by BPROM Promoters Plant Molecular Biology 53 (6) 865-876 December 2003 Prokaryotic orthologues of mitochondrial alternative oxidase and plastid terminal oxidase Allison E McDonald Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Sasan Amirsadeghi Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada Greg C Vanlerberghe Department of Life Sciences and Department of Botany University of Toronto at Scarborough 1265 Military Trail Scarborough Ontario M1C 1A4 Canada (e-mail gregvutscutorontoca The A variabilis PTOX sequence was analyzed in the upstream region of the start codon with Softberryrsquos BPROM software (httpwwwsoftberrycom)

SPLICEDB Plant Molecular Biology DOI 101007s11103-005-0271-1 Issue Volume 57 Number 3 Date February 2005 Pages 445 - 460 Evaluation of five ab initio gene prediction programs for the discovery of maize genes Hong Yao1 4 Ling Guo1 6 Yan Fu1 4 Lisa A Borsuk1 6 Tsui-Jung Wen2 David S Skibbe1 5 Xiangqin Cui1 4 9 Brian E Scheffler8 Jun Cao1 4 Scott J Emrich6 Daniel A Ashlock3 6 and Patrick S Schnable1 2 4 5 6 7

(1)Department of Genetics Development and Cell Biology Iowa State University Ames Iowa 50011-3650 (2) Department of Agronomy Iowa State University Ames Iowa 50011-3650 (3) Department of Mathematics Iowa State University Ames Iowa 50011-3650 (4) Inderdepartmental Graduate Programs in Genetics Iowa State University Ames Iowa 50011-3650 (5) Department of Molecular Cellular and Developmental Biology Iowa State University Ames Iowa 50011-3650 (6) Department of Electrical and Computer Engineering and Department of Bioinformatics and Computational Biology Iowa State University Ames Iowa 50011-3650 (7) Center for Plant Genomics Iowa State University Ames Iowa 50011-3650 (8) Mid South Area Genomics Facility USDA-ARS Stoneville MS 38776-0038 USA(9) Present address Department of Biostatistics Birmingham AL 35294 USA model FGENESH httpwwwsoftberrycom berryphtmltopic=fgeneshampgroup= programsampsubgroup=gfind Monocots Yes Yes Yes GHMM a GeneMark Finding short DNA motifs using permuted markov models X Zhao H Huang TP Speed The data are human donor sequences from SpliceDB [9] a recently developed database of known mammalian splice site sequences (httpwwwsoftberrycomspldb Current Opinion in Structural Biology 2004 14273ndash282 The evolving roles of alternative splicing Liana F Lareau1 Richard E Green1 Rajiv S Bhatnagar23 and Steven E Brenner12_ Departments of 1Molecular and Cell Biology and 2Plant and Microbial Biology University of California Berkeley California 94720 USA 3Department of Dermatology University of California San Francisco California 94143 USA _e-mail brennercompbioberkeleyedu [79] SpliceDB httpwwwsoftberrycomberryphtmltopicfrac14splicedb Database and composition statistics for mammalian splice sites inferred from ESTs [80] Yearbook of Medical Informatics Review Paper 2004 121-136 Curated databases and their role in clinical bioinformatics CC Englbrecht M Han MT Mader A Osanger KFX Mayer MIPS Institute for Bioinformatics Address of the authors Claudia C Englbrecht Michael Han

Michael T Mader Andreas Osanger Klaus F X Mayer MIPS Institute for Bioinformatics GSF - National Research Center for Environment and Health 85758 Neuherberg Germany E-mail kmayergsfdeCorresponding author hellipSpliceDB httpwwwsoftberrycomspldbSpliceDBhtmlCanonical and non-canonical mammalian splice sites [122] 122Burset M Seledtsov IA Solovyev VV SpliceDB database of canonical and non-canonical mammalian splice sites Nucleic Acids Res 200129255-9 Nucleic Acids Research 2001 Vol 29 No 1 255-259 SpliceDB database of canonical and non-canonical mammalian splice sites M Burset I A Seledtsov1 and V V Solovyev The Sanger Centre Hinxton Cambridge CB10 1SA UK and 1Softberry Inc 108 Corporate Park Drive Suite 120 White Plains NY 10604 USA To whom correspondence should be addressed at present address EOS Biotechnology 225A Gateway Boulevard South San Francisco CA 94080 USA Tel +1 650 246 2331 Fax +1 650 583 3881 Email solovyeveosbiotechcom Present address M Burset Institut Municipal drsquoInvestigacioacute Megravedica (IMIM) CDr Aiguader 80 08003 Barcelona Spain

SCAN2 African Journal of Biotechnology Vol 2 (12) pp 714-718 December 2003 Available online at httpwwwacademicjournalsorgAJB ISSN 1684ndash5315 copy 2003 Academic Journal Accepted 14 November 2003 Minireview Web-based bioinformatic resources for protein and nucleic acids sequence alignment Kamel A Abd-Elsalam Molecular Markers Lab Plant Pathology Research Institute Agricultural Research Center Orman 12619 Giza Egypt E-mail kaabdelsalammsncom 16-SCAN2 program for aligning two multimegabyte-size sequences httpwwwsoftberrycomberryphtmltopic=scanhampprg= SCAN2 derived Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK The full-length sequences of gene pairs have been aligned by the SCAN2 program (httpsoftberrycomberryphtmltopic=scanhampprg=SCAN2) which can align

PDISORDER BMC Bioinformatics 2005 622 doi1011861471-2105-6-22 Research article Open Access Proteins with two SUMO-like domains in chromatin-associated complexes The RENi (Rad60-Esc2-NIP45) family Maria Novatchkova1 Andreas Bachmair3 Birgit Eisenhaber2 and Frank Eisenhaber2 Address 1Gregor Mendel-Institut GMI Austrian Academy of Sciences Vienna Biocenter A-1030 Vienna Austria 2Research Institute of Molecular Pathology Dr Bohr-Gasse 7 A-1030 Vienna Austria and 3Max Planck Institute for Plant Breeding Research Carl-von-Linneacute-Weg 10 D-50829 Cologne Germany Email Maria Novatchkova - marianovatchkovagmioeawacat Andreas Bachmair - bachmairmpiz-koelnmpgde Birgit Eisenhaber - b_eisenimpunivieacat Frank Eisenhaber - FrankEisenhaberimpunivieacat Corresponding author hellipInitial analysis of its sequence complexity shows that the disordered N-terminal half of the protein is followed by a likely globular segment (predicted using Pdisorder by Softberry Inc)hellip

SPL The National Academy of Sciences Proc Natl Acad Sci U S A 2003 November 25 100(Suppl 2) 14537ndash14542 doi 101073pnas2335847100 Molecular evolution of the insect chemoreceptor gene superfamily in Drosophila melanogaster Hugh M Robertsondagger Coral G WarrDaggersect and John R Carlsonsect Department of Entomology University of Illinois 505 South Goodwin Avenue Urbana IL 61801 DaggerSchool of Biological Sciences Monash University Clayton VIC 3800 Australia and sectDepartment of Molecular Cellular and Developmental Biology Yale University New Haven CT 06520 The genes were reconstructed manually in the PAUP editor (23) by using the expected exonintron structures as guides and the SPL program (Softberry wwwsoftberrycomberryphtml) to locate predicted introns

NSITE GENETIC ANALYSES OF BOVINE CARD15 A PUTATIVE DISEASE RESISTANCE GENE A Dissertation by KRISTEN HAWKINS TAYLOR Submitted to Texas AampM University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 2004 Major Subject Genetics hellipShort motifs identified as being conserved between the three species in these intronic regions as well as in the 5rsquoUTR and 3rsquoUTR were then analyzed using the TFSCAN (httpzeonwelloxacukgit-bintfscan) and NSITE (available through SoftBerry httpwwwsoftberrycomberryphtmltopic=promoter) programs to identify putative regulatory motifs Motifs selected for analysis required homology consisting of 6 or more bases with no more than 2 substitutions among the 3 specieshellip hellipSequence that included the SNPs located within intronic regions and in the 3rsquo and 5rsquoUTRs were analyzed using NSITE (available through SoftBerry at httpwwwsoftberrycomberryphtmltopic=promoter) to identify putative regulatory motifshellip

TSSP Bioinformatics 2005 21(14)3074-3081 doi101093bioinformaticsbti490 Cis-regulatory element based targeted gene finding genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana Weixiong Zhang 12 Jianhua Ruan 1 Tuan-hua David Ho 3 Youngsook You 3 Taotao Yu 1 and Ralph S Quatrano 3 1Department of Computer Science and Engineering Washington University in Saint Louis Saint Louis MO 63130 USA 2Department of Genetics Washington University in Saint Louis Saint Louis MO 63130 USA 3Department of Biology Washington University in Saint Louis Saint Louis MO 63130 USA To whom correspondence should be addressed

sites (TSSs) To predict TSSs we combined an Athaliana cDNA database and a software TSSP (SoftBerry httpwwwsoftberrycom) As

PLANTPROM BMC Genomics 2005 6 25 Genome wide analysis of Arabidopsis core promoters Carlos Molina12 and Erich Grotewold 11Department of Plant Cellular and Molecular Biology and Plant Biotechnology Center The Ohio State University Columbus OH 43210 2Departamento de Informaacutetica Universidad Teacutecnica Federico Santa Mariacutea Valparaiacuteso Chile search for TATA elements is carried out on the 12749 [-500 -1] regions 6316 sequences (using the MEME NFM) or 8776 (using the expanded PlantProm NFM) are Nucleic Acids Research 2005 Vol 33 No 3 1069ndash1076 doi101093nargki247 Plant promoter prediction with confidence estimation I A Shahmuradov1 V V Solovyev12 and A J Gammerman1 1Royal Holloway University of London Egham Surrey TW20 0EX UK and 2Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA MATERIALS AND METHODS Training and testing sequences For training and testing procedures we used 301 promoters with annotated TSS from PlantProm DB (22) Nucleic Acids Research 2004 Vol 32 Database issue D368plusmnD372 DOI 101093nargkh017 AthaMap an online resource for in silico transcription factor binding sites in the Arabidopsis thaliana genome Nils Ole Steffens Claudia Galuschka Martin Schindler Lorenz BuEgravelow and Reinhard Hehl Institut fuEgrave r Genetik Technische UniversitaEgrave t Braunschweig Spielmannstraucirce 7 D-38106 Braunschweig Germany hellipShahmuradovIA GammermanAJ HancockJM BramleyPM and SolovyevVV (2003) PlantProm a database of plant promoter sequences Plant Physiology October 2004 Vol 136 pp 3023-3033 GENOME ANALYSIS Utility of Different Gene Enrichment Approaches Toward Identifying and Sequencing the Maize Gene Space1[w] Nathan Michael Springer Xiequn Xu and W Brad Barbazuk Center for Plant and Microbial Genomics Department of Plant Biology University of Minnesota St Paul Minnesota 55108 (NMS) and Donald Danforth Plant Sciences Center St Louis Missouri 63132 (XX WBB) Text] Shahmuradov IA Gammerman AJ Hancock JM Bramley PM Solovyev VV (2003) PlantProm a database of plant promoter sequences

Plant Physiology April 2004 Vol 134 pp 1ndash12 wwwplantphysiolorg Characterization of Three Functional High-Affinity Ammonium Transporters in Lotus japonicus with Differential Transcriptional Regulation and Spatial Expression1

Enrica DrsquoApuzzo2 Alessandra Rogato2 Ulrike Simon-Rosin Hicham El Alaoui3 Ani Barbulova Marco Betti Maria Dimou Panagiotis Katinakis Antonio Marquez Anne-Marie Marini Michael K Udvardi and Maurizio Chiurazzi Institute of Genetics and Biophysics Via Marconi 12 80125 Napoli Italy (ED AR HEA AB MC) Molecular Plant Nutrition Group Max Planck Institute of Molecular Plant Physiology Am Muhlenberg 1 14476 Golm Germany (US-R MKU) Universiteacute Libre de Bruxelles Institut de Biologie de Meacutedecine Moleacuteculaires 6041 Gosselies Belgium (A-MM) Agricultural University of Athens Department of Agricultural Biotechnology 11855 Athens Greece (MD PK) and Departamento de Bioquıacutemica Vegetal y Biologıacutea Molecular Facultad de Quimica 41080 Seville Spain (MB AM) hellipPlantProm a database of plant promoter sequences Nucleic Acids Res 31 114ndash117 Nucleic Acids Research 1 January 2003 vol 31 no 1 pp 114-117(4) PlantProm a database of plant promoter sequences Authors Bramley PM1 Solovyev VV2 Shahmuradov IA Gammerman AJ Hancock JM Affiliations Department of Computer Science Royal Holloway University of London Egham Surrey TW20 0EX UK 1 School of Biological Sciences Royal Holloway University of London UK 2 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 3 To whom correspondence should be addressed Email victorsoftberrycom Present address John M Hancock MRC Mammalian Genetics Unit Harwell Oxfordshire UK

PROMH Nucleic Acids Research 2003 Vol 31 No 13 3540-3545 PromH promoters identification using orthologous genomic sequences V V Solovyev and I A Shahmuradov1 Softberry Inc 116 Radio Circle Suite 400 Mount Kisco NY 10549 USA 1 Institute of Botany Azerbaijan National Academy of Sciences 370073 Baku Azerbaijan To whom correspondence should be addressed Tel +1 914 242 3592 Fax +1 914 242 3593 Email victorsoftberrycom Present address I A Shahmuradov Royal Holloway University of London Egham Surrey TW20 0EX UK Received February 15 2003 Revised and Accepted March 21 2003

Other FGENESH Molecular Microbiology Volume 52 Issue 6 Page 1579 - June 2004 doi101111j1365-2958200404086x Gene conversion a mechanism for generation of heterogeneity in the tprK gene of Treponema pallidum during infection Arturo Centurion-Lara Rebecca E LaFond Karin Hevner Charmie Godornes Barbara J Molini Wesley C Van Voorhis and Sheila A Lukehart Affiliations Departments of Medicine and Pathobiology University of Washington Harborview Medical Center Box 359779 325 Ninth Ave Seattle WA 98104 USA E-mail acenturuwashingtonedu Tel (+1) 206 341 5364 Fax (+1) 206 341 5363 Using the fgenesb program which identifies putative operons and genes in microbial genomes (Softberry http wwwsoftberrycom berryphtml ) the tprK ORF Microbiology 150 (2004) 518-520 DOI 101099mic026871-0 IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated with recognizable overlapping genes Mark W Silby1 Paul B Rainey23 and Stuart B Levy14 1 Center for Adaptation Genetics and Drug Resistance Department of Molecular Biology and Microbiology Tufts University School of Medicine Boston MA 02111 USA 2 Department of Plant Sciences University of Oxford South Parks Road Oxford OX1 3RB UK 3 School of Biological Sciences University of Auckland Private Bag 92019 Auckland New Zealand 4 Department of Medicine Tufts University School of Medicine Boston MA 02111 USA Correspondence Stuart B Levy (stuartlevytuftsedu) hellipUsing SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of the iiv5 ORF respectivelyhellip Journal of Virology November 2004 p 12576-12590 Vol 78 No 22 DOI 101128JVI782212576-125902004

Functional Genomics Analysis of Singapore Grouper Iridovirus Complete Sequence Determination and Proteomic Analysis Wen Jun Song1 Qi Wei Qin2 Jin Qiu1 Can Hua Huang1 Fan Wang1 and Choy Leong Hew1 Department of Biological Sciences1 Tropical Marine Science Institute National University of Singapore Singapore2 Corresponding author Mailing address Department of Biological Sciences National University of Singapore 10 Kent Ridge Crescent Singapore 119260 Singapore Phone 65-68742692 Fax 65-67795671 E-mail dbshewclnusedusg or dbsheadnusedusg Received 19 March 2004 Accepted 29 June 2004

hellipThe whole genome was also submitted to httpwwwsoftberrycom (Softberry Inc Mount Kisco NY) for identification of all potential ORFshellip hellipCoding capacity of the viral genomic DNA sequence Prediction of presumptive genes was carried out by using the viral gene prediction program under the website httpwwwsoftberrycom supplemented with Vector NTI suite 71hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1695-8 Issue Volume 109 Number 4 Date August 2004 Pages 681 - 689

Linkage disequilibrium and sequence diversity in a 500-kbp region around the adh1 locus in elite maize germplasm Mark Jung1 Ada Ching1 Dinakar Bhattramakki2 Maureen Dolan1 Scott Tingey1 Michele Morgante1 3 and Antoni Rafalski1

(1) DuPont Crop Genetics Experimental Station PO Box 80353 Wilmington DE 19880-0353 USA (2) Pioneer Hi-Bred International Inc 7300 NW 62nd Avenue PO Box 1004 Johnston IA 50131-1004 USA (3) Dipartimento di Produzione Vegetale e Tecnologie Agrarie Universita di Udine Via delle Scienze 208 33100 Udine ItalyReceived 26 January 2004 Accepted 2 April 2004 Published online 6 August 2004 Mark Jung Email marktjungcgrdupontcom 1) Gene locations were defined by several methods Annotations provided in Tikhonov et al (1999) were first used then FGENESH gene-finding software DNA Sequence - The Journal of Sequencing and Mapping Issue Volume 15 Number 4 August 2004 Pages 269 ndash 276 DOI 10108010425170412331279648 Isolation Characterization and Expression Analysis of a Leaf-specific Phosphoenolpyruvate Carboxylase Gene in Oryza sativa Chang-Fa Lin A1 Chun Wei A1 Li-Zhi Jiang A1 A2 Ke-Gui Li A1 Xiao-Yin Qian A1 Kotb Attia A1 Jin-Shui Yang A1 A1 State Key Laboratory of Genetic Engineering Institute of Genetics School of Life Sciences Fudan University Shanghai 200433 PRChina A2 National Key Laboratory for Soil Erosion and Dry Land Farming on Loess Plateau Northwest Sci-Tech University of Agriculture and Forest Shannxi 712100 PRChina tools of GeneMark (httpopal biologygatechedugeneMark) and Softberry (httpwwwsoftberrycom) For the isolation of putative Plant Molecular Biology DOI 101023BPLAN00000382568980957 Issue Volume 54 Number 4 Date March 2004 Pages 519 ndash 532

Genome-Wide Analysis of the GRAS Gene Family in Rice and Arabidopsis Chaoguang Tian1 Ping Wan1 Shouhong Sun1 Jiayang Li1 and Mingsheng Chen1

(1) Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road Chaoyang District Beijing 100101 ChinaMingsheng Chen Email mschengeneticsaccn

database FgeneSH (Salamov and 90 Solovyev 2000) was used for gene prediction pre- 207 dicted by FgeneSH (minor discrepancies exist due 208 Mycological Research (2004) 108 853-857 Cambridge University Press doi 101017S095375620400067X Published Online 13 авг 2004 Combining transcriptome data with genomic and cDNA sequence alignments to make confident functional assignments for Aspergillus nidulans genes Andrew H SIMS a1 Manda E GENT a1 Geoffrey D ROBSON a1 Nigel S DUNN-COLEMAN a2 and Stephen G OLIVER a1c1 a1 School of Biological Sciences University of Manchester The Michael Smith Building Oxford Road Manchester M13 9PT UK E-mail steveolivermanacuk a2 Genencor International Inc 925 Page Mill Road Palo Alto CA 94304 USA c1 Corresponding author Kingdom Page 2 Genewise FgeneSH FgeneSH+) consisting of 9541 putative open reading frames (ORFs) was released in June 2003 We TAG Theoretical and Applied Genetics DOI 101007s00122-004-1603-2 Issue Volume 109 Number 1 Date June 2004 Pages 129 ndash 139

Gene content and density in banana (Musa acuminata) as revealed by genomic sequencing of BAC clones R Aert1 2 L Saacutegi2 and G Volckaert1

(1) Laboratory of Gene Technology Katholieke Universiteit Leuven Kasteelpark Arenberg 21 3001 Leuven Belgium

Present address Laboratory of Tropical Crop Improvement Katholieke Universiteit Leuven (2) Kasteelpark Arenberg 13 3001 Leuven Belgium

R Aert Email RitaAertagrkuleuvenacbe

gscrikengojp) fgenesh version 11 (Salamov and Solovyev 2000 httpwwwsoftberry com) genemarkhmm version 22a (Lukashin and Borodovsky 1998 http Genome Research 142503-2509 2004 Resources EAnnot A genome annotation tool using experimental evidence Li Ding1 Aniko Sabo Nicolas Berkowicz Rekha R Meyer Yoram Shotland Mark R Johnson Kymberlie H Pepin Richard K Wilson and John Spieth Genome Sequencing Center Washington University School of Medicine St Louis Missouri 63110 USA Article and publication are at httpwwwgenomeorgcgidoi101101gr3152604 1 Corresponding author E-mail ldingwatsonwustledu fax (314) 286-1810 Some ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov and Solovyev 2000 ) are based on intrinsic characteristics of coding hellipSome ab initio programs such as Genscan (Burge and Karlin 1997 ) and Fgenesh (Salamov

and Solovyev 2000 ) are based on intrinsic characteristics of coding sequence (eg codon usage consensus splice sites etc) and require training on known genes from the organismhellip

hellipTo further evaluate the performance of EAnnot we compared EAnnot predictions with Ensembl Genscan and Fgenesh predictions using manual annotation as a standard While Genscan and Fgenesh are ab initio programs Ensembl takes into account experimental data a feature shared with EAnnot Ensembl predicted 1037 known genes with 1798 transcripts and 1457 EST genes with 2308 transcripts for chromosome 6 (build 31) while Fgenesh and Genscan predicted 6230 and 6225 genes respectively We evaluated the performance of each program with respect to splice sites transcripts and genes across all of chromosome 6hellip TAG Theoretical and Applied Genetics DOI 101007s00122-004-1758-x Issue Volume 109 Number 7 Date November 2004 Pages 1434 - 1447

Full-genome analysis of resistance gene homologues in rice

B Monosi1 R J Wisser2 L Pennill1 and S H Hulbert1

(1) Department of Plant Pathology Kansas State University Manhattan KS 66506-5502 USA (2) Department of Plant Pathology Cornell University Ithaca NY 14853 USAReceived 18 February 2004 Accepted 16 June 2004 Published online 10 August 2004 S H Hulbert Email shulbrtksuedu DNA sequences were analyzed using the gene prediction programs GENSCAN (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (Salamov and arXivq-bioGN0402046 v1 27 Feb 2004 Sublinear growth of Information in DNA sequences Giulia Menconi Dipartimento di Matematica Applicata and CISSC Centro Interdisciplinare per lo Studio dei Sistemi Complessi Universit`a di Pisa Via Bonanno Pisano 25b 56126 PISA - Italy menconimaildmunipiit October 23 2003 hellipAs a result four putative genes G1 G2 G3 and G4 have been located by means of Hidden Markov Model-based program FGENESH2 that has been created for predicting multiple genes and their structure in genomic DNA sequences The analysis via FGENESH has been exploited with respect to known genes in Arabidopsis thaliana Their predicted position is illustrated in Figure 13 hellip2This program is available at the website wwwsoftberrycom to which we refer con-cerning the reliability and e_ciency of the algorithmhellip Current Opinion in Plant Biology 2004 7732ndash736 DOI 101016jpbi200409003 Consistent over-estimation of gene number in complex plant genomes Jeffrey L Bennetzen14 Craig Coleman27 Renyi Liu15 Jianxin Ma16 and Wusirika Ramakrishna38 1 Department of Genetics University of Georgia Athens Georgia 30602 USA 2 Department of Plant and Animal Sciences Brigham Young University Provo Utah 84602 USA 3 Department of Biological Sciences Michigan Tech University Houghton Michigan 49931 USA

4e-mail maizeugaedu 5e-mail lryugaedu 6e-mail jmaugaedu 7e-mail biotechnologybyuedu 8e-mail wusirikamtuedu hellipWe have found that the standard gene-discovery programs FGENESH GeneMark and GENSCAN annotate segments of most retrotransposons and many invertedrepeat transposable elements as genes Using FGENESH to annotate maize BAC clones for instance 70ndash100 of the predicted genes are actually from transposable elementshellip The Plant Cell 162795-2808 (2004) Spotted leaf11 a Negative Regulator of Plant Cell Death and Defense Encodes a U-BoxArmadillo Repeat Protein Endowed with E3 Ubiquitin Ligase Activity Li-Rong Zenga Shaohong Qua Alicia Bordeosb Chengwei Yangc Marietta Baraoidanb Hongyan Yanc Qi Xiec Baek Hie Nahmd Hei Leungb and Guo-Liang Wanga1 a Department of Plant Pathology Ohio State University Columbus Ohio 43210 b International Rice Research Institute Metro Manila Philippines c State Key Lab for Biocontrol Sun Yat-sen (Zhongshan) University Guangzhou China 510275 d Department of Biological Science Myongji University Kyonggido Korea 449728 1 To whom correspondence should be addressed E-mail wang620osuedu fax 614-292-4455 in spl11 Exons predicted in G3 by the programs GENSCAN and Fgenesh using different matrixes are displayed in dark gray (D) RFLP Source Human Genomics Volume 1 Number 2 January 2004 pp 146-149(4) Publisher Henry Stewart Publications The truth about mouse human worms and yeast Authors David R Nelson1 Daniel W Nebert2 1 Department of Molecular Sciences and The UT Center of Excellence in Genomics and Bioinformatics University of Tennessee Memphis Tennessee 38163 USA 2 Department of Environmental Health and Center for Environmental Genetics (CEG) University of Cincinnati Medical Center Cincinnati Ohio 45267-0056 USA unpublished data 2003 see also Ref [7]) FGENESH 21 TWINSCAN 22 and the Ensembl annotation pipeline 23 The output of the four Genome Biology 2004 5R73 doi101186gb-2004-5-10-r73 A comprehensive transcript index of the human genome generated using microarrays and computational approaches Eric E Schadt 1 Stephen W Edwards 1 Debraj GuhaThakurta1 Dan Holder2 Lisa Ying2 Vladimir Svetnik2 Amy Leonardson1 Kyle W Hart3 Archie Russell1 Guoya Li1 Guy Cavet1 John Castle1 Paul McDonagh4 Zhengyan Kan1 Ronghua Chen1 Andrew Kasarskis1 Mihai Margarint1 Ramon M Caceres1 Jason M Johnson1

Christopher D Armour1 Philip W Garrett-Engele1 Nicholas F Tsinoremas5 and Daniel D Shoemaker1

1Rosetta Inpharmatics LLC 12040 115th Avenue NE Kirkland WA 98034 USA 2Merck Research Laboratories W42-213 Sumneytown Pike POB 4 Westpoint PA 19846 USA 3Rally Scientific 41 Fayette Street Suite 1 Watertown MA 02472 USA 4Amgen Inc 1201 Amgen Court W Seattle WA 98119 USA 5The Scripps Research Institute Jupiter FL 33458 USA hellipGrailEXP 40 [47] GENSCAN 10 [48] FGENESH [49] and FGENESH+ [49]ab initio gene-prediction algorithms were run independently across the entire genome assembly to augment alignment-based gene identification methods GrailEXP 40 GENSCAN 10 and FGENESH version 1c were run with default parameters for human sequence GrailEXP used expressed sequence evidence from RefSeq UniGene and DoubleTwist HGI to refine gene predictions FGENESH+ was run with protein sequences from BLASTX with E-score lower than 10-5 When multiple protein alignments overlapped all overlapping protein sequences were clustered with BLASTClust [50] and the lowest E-score hit was used by FGENESH+hellip hellipAdditionally 3 sim4 and 3 or 5 FGENESH+ predicted exons that were short andor distant from internal predicted exons were removedhellip Genome Research 14988-995 2004 ENSEMBL Special GeneWise and Genomewise Ewan Birney13 Michele Clamp2 and Richard Durbin2 1 The European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK hellipThere has been a long history of successful ab initio programs which do not use any additional

evidence to predict genes on genomic DNA of which Genscan (Burge and Karlin 1997 ) and Fgenesh (Solovyev and Salamov 1997 ) are two of the most successful caseshellip hellipAnother class of evidence-based gene prediction programs are ones which use external evidence to influence the scoring of potential exons including SGP-2 (Parra et al 2003 ) Genie

(Kulp et al 1996 ) Genomescan (Yeh et al 2001 ) HMMGene (Krogh 2000 ) and Fgenesh++ (Solovyev and Salamov 1997 )hellip Published online before print February 5 2004 101073pnas0308430100 PNAS | February 17 2004 | vol 101 | no 7 | 1910-1915 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin Nam Joonyul Kim para Shinyoung Lee Gynheung An Hong Ma and Masatoshi Nei Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State

University University Park PA 16802 and National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Contributed by Masatoshi Nei December 22 2003

hellipBecause annotation of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program FGENESH (wwwsoftberrycom) from the genome sequences obtained from TIGR and the Rice Genome Database (China) (25) Functional amp Integrative Genomics DOI 101007s10142-004-0109-y Issue Volume 4 Number 2 Date May 2004 Pages 102 - 117

Sequence analysis of the long arm of rice chromosome 11 for ricendashwheat synteny

Nagendra K Singh1 Saurabh Raghuvanshi2 Subodh K Srivastava1 Anupama Gaur2 Ajit K Pal1 Vivek Dalal1 Archana Singh1 Irfan A Ghazi1 Ashutosh Bhargav1 Mahavir Yadav1 Anupam Dixit1 Kamlesh Batra1 Kishor Gaikwad1 Tilak R Sharma1 Amitabh Mohanty2 Arvind K Bharti2 Anita Kapur2 Vikrant Gupta2 Dibyendu Kumar2 Shubha Vij2 Ravi Vydianathan2 Parul Khurana2 Sulabha Sharma2 W Richard McCombie3 Joachim Messing4 Rod Wing5 Takuji Sasaki6 Paramjit Khurana2 Trilochan Mohapatra1 Jitendra P Khurana2 and Akhilesh K Tyagi2

1 Indian Initiative for Rice Genome Sequencing National Research Centre on Plant Biotechnology

Indian Agricultural Research Institute New Delhi 110012 India 2 Indian Initiative for Rice Genome Sequencing Department of Plant Molecular Biology University of

Delhi South Campus New Delhi 110021 India 3 Genome Research Centre Cold Spring Harbor Laboratory 1 Bungtown Road New York USA 4 The Plant Genome Initiative at RutgersndashWaksman Institute Rutgers University 190 Frelinghuysen

Road Piscataway NJ 08873 USA 5 Department of Plant Sciences 303 Forbes Building Arizona Genomics Institute The University of

Arizona Tucson AZ 85721 USA 6 Department of Genome Research National Institute of Agrobiological Sciences 1-2 Kannondai 2-

chome Tsukuba Ibaraki 305-8602 Japan Akhilesh K Tyagi Email akhileshgenomeindiaorg Wherever RiceGAAS data were not available the genes were predicted by FGENESH trained for monocot plant species (http wwwsoftberrycomberryphtml) TAG Theoretical and Applied Genetics DOI 101007s00122-004-1621-0 Issue Volume 109 Number 1 Date June 2004 Pages 10 - 22

Annotation of a 95-kb Populus deltoides genomic sequence reveals a disease resistance gene cluster and novel class I and class II transposable elements

M Lescot1 4 S Rombauts1 J Zhang1 S Aubourg1 5 C Matheacute1 6 S Jansson2 P Rouzeacute1 3 and W Boerjan1

1 Department of Plant Systems Biology Flanders Interuniversity Institute for Biotechnology Ghent University Technologiepark 927 9052 Gent Belgium

2 Department of Plant Physiology University of Umearing 901 87 Umearing Sweden 3 Laboratoire Associeacute de l Institut National de la Recherche Agronomique (France) Ghent

University 9052 Gent Belgium 4 Present address CIRAD-Biotrop TA4003 34398 Montpellier Cedex 5 France 5 Pre sent address Uniteacute de Recherche en Geacutenomique Veacutegeacutetale INRA 91057 Evry Cedex

France 6 Present address Laboratoire de Biologie Vasculaire Institut de Pharmacologie et Biologie

Structurale 205 route de Narbonne 31077 Toulouse Cedex France

P Rouzeacute Email pierrerouzepsbugentbe 1999 httpwwwtigrorgtdbglimmermglmr_formhtml) and FgenesH for dicots or monocots (Salamov and Solovyev 2000 httpwwwsoftberrycom) BIOINFORMATICS 2004 vol20 N9 p1416-1427 J Yuan B Bush A Elbrecht Y Liu T Zhang W Zhao hellip - suchasGRAIL(Lopezetal 1994 Roberts 1991 Uberbacher et al 1996) GENESCOPE (Murakami and Takagi 1998) fgenesh (Salamov and Solovyev 2000) GeneMark Molecular Plant Pathology Volume 5 Issue 6 Page 515 - November 2004 Pathogen profile Heading for disaster Fusarium graminearum on cereal crops RUBELLA S GOSWAMI AND H CORBY KISTLERCorrespondence E-mail HCKISTumnedu This pipeline uses a combination of the programs FGENESH and FGENESH+ (Salamov and Solovyev 2000) modified by Softberry ( http wwwsoftberrycom ) with Nucleic Acids Research 2004 Vol 32 Database issue D41-D44 MIPS analysis and annotation of proteins from whole genomes H W Mewes12 C Amid1 R Arnold1 D Frishman2 U Guumlldener1 G Mannhaupt2 M Muumlnsterkoumltter1 P Pagel1 N Strack2 V Stuumlmpflen1 J Warfsmann1 and A Ruepp1 1 Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaedter Landstrasse 1 D-85764 Neuherberg Germany and 2 Technische Universitaumlt Muumlnchen Chair of Genome Oriented Bioinformatics Center of Life and Food Science D-85350 Freising-Weihenstephan Germany To whom correspondence should be addressed at Institute for Bioinformatics (MIPS) GSF National Research Center for Environment and Health Ingolstaumldter Landstraszlige 1 D-85764 Neuherberg Germany Tel +49 89 3187 3580 Fax +49 89 3187 3585 Email wmewesgsfde The genome of 40 Mb encodes 10 000 proteins automatically predicted by the program FGENESH (httpsoftberry com) specifically trained for Neurospora Annual Review of Genomics and Human Genetics Vol 5 15-56 (Volume publication date September 2004) COMPARATIVE GENOMICS Webb Miller Kateryna D Makova Anton Nekrutenko and Ross C Hardison The Center for Comparative Genomics and Bioinformatics The Huck Institutes of Life Sciences and the Departments of Biology Computer Science and Engineering and Biochemistry and Molecular Biology Pennsylvania State University University Park Pennsylvania email webbbxpsuedu kdm16psuedu antonbxpsuedu rch8psuedu These algorithms include Genscan the most popular gene prediction tool (24) GenMark (117) Fgenesh (155) GeneID (144) and others (for an excellent overview DNA and Cell Biology May 2004 Vol 23 No 5 311-324 Harbinger Transposons and an Ancient HARBI1 Gene Derived from a Transposase

Vladimir V Kapitonov Genetic Information Research Institute Mountain View California Jerzy Jurka Genetic Information Research Institute Mountain View California We used FGENESH (Salamov and Solovyev 2000) and GeneScan (Burge and Karlin 1997) for the identification of exons and introns The d N d S ratio Proc Natl Acad Sci U S A 2004 February 17 101(7) 1910ndash1915 Published online 2004 February 5 doi 101073pnas0308430100 Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms Jongmin NamdaggerDagger Joonyul Kimsectpara Shinyoung Leesect Gynheung Ansect Hong Madagger and Masatoshi Neidagger

daggerInstitute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University University Park PA 16802 and sectNational Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790-784 Korea Dagger To whom correspondence should be addressed E-mail jyn101psuedu paraPresent address Michigan State UniversityndashDepartment of Energy Plant Research Laboratory and Department of Biochemistry and Molecular Biology Michigan State University East Lansing MI 48824 of rice gene was still in progress at the time of this study we ourselves conducted gene annotation by using the computer program fgenesh (wwwsoftberrycom Nucleic Acids Research 2004 Vol 32 Database issue D377-D382 BGI-RIS an integrated information resource and comparative analysis workbench for rice genomics Wenming Zhao1 Jing Wang2 Ximiao He1 Xiaobing Huang1 Yongzhi Jiao1 Mingtao Dai1 Shulin Wei1 Jian Fu1 Ye Chen1 Xiaoyu Ren1 Yong Zhang12 Peixiang Ni1 Jianguo Zhang1 Songgang Li12 Jian Wang1 Gane Ka-Shu Wong13 Hongyu Zhao4 Jun Yu1 Huanming Yang1 and Jun Wang1 1 Beijing Genomics Institute (BGI) Chinese Academy of Sciences (CAS) Beijing Airport Industrial Zone-B6 Beijing 101300 China 2 College of Life Sciences Peking University Beijing 100871 China 3 University of Washington Genome Center Department of Medicine Seattle WA 98195 USA and 4 Yale University School of Medicine Department of Epidemiology and Public Health New Haven CT 06520-8034 USA To whom correspondence should be addressed Tel +86 10 80481662 Fax +86 10 80498676 Email wangjgenomicsorgcn Correspondence may also be addressed to Huanming Yang Tel +86 10 80494969 Fax +86 10 80491181 Email yanghmgenomicsorgcn The authors wish it to be known that in their opinion the first four authors should be regarded as joint First Authors The contig sequences were annotated for gene content by using automated processes that involve ab initio gene finders such as FgeneSH (httpwwwsoftberrycom Genome Research 141932-1937 2004 Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome

Jinsheng Lai1 Nrisingha Dey2 Cheol-Soo Kim35 Arvind K Bharti1 Stephen Rudd46 Klaus FX Mayer4 Brian A Larkins3 Philip Becraft2 and Joachim Messing17 1 Waksman Institute Rutgers The State University of New Jersey Piscataway New Jersey 08854 USA 2 Department of Genetics Development amp Cell Biology Iowa State University Ames Iowa 50011 USA 3 Department of Plant Science University of Arizona Tucson Arizona 85721 USA 4 Munich Information Center for Protein Sequences Institute for Bioinformatics GSF Research Center for Environment and Health Neuherberg Germany A total of 54397 putative genes could be predicted for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for Plant Molecular Biology DOI 101023BPLAN000002876821587dc Issue Volume 54 Number 1 Date January 2004 Pages 55 - 69

Dynamics of the evolution of orthologous and paralogous portions of a complex locus region in two genomes of allopolyploid wheat

Xiu-Ying Kong1 2 Yong Qiang Gu3 Frank M You4 Jorge Dubcovsky4 and Olin D Anderson3

1 Genetic Resources Conservation Program University of California Davis CA 95616 USA 2 Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081

China 3 US Dept of Agriculture Western Regional Research Center Agricultural Research Service 800

Buchanan Street Albany CA 94710 USA 4 Department of Agronomy and Range Sciences University of California Davis CA 95616 USA FGENESH (httpwwwsoftberrycomberryphtml) and GENES- CAN (httpgenemark miteduGENESCANhtml) were used for gene prediction Current Genetics DOI 101007s00294-003-0451-y Issue Volume 44 Number 6 Date January 2004 Pages 329 - 338

Chromosome rearrangements in isolates that escape from het-c heterokaryon incompatibility in Neurospora crassa

Qijun Xiang1 and N Louise Glass1

Department of Plant and Microbial Biology University of California Berkeley CA 94720-3102 USA Hypothetical proteins are predicted from FGENESH calls with overlapping Blastx hits (but not with trusted homology) while Predicted Molecular Genetics and Genomics DOI 101007s00438-004-0990-z Issue Volume 271 Number 4 Date May 2004 Pages 402 - 415

Genome-wide identification of NBS genes in japonica rice reveals significant expansion of divergent non-TIR NBS-LRR genes

T Zhou1 Y Wang1 J-Q Chen1 H Araki2 Z Jing1 K Jiang1 J Shen1 and D Tian1

1 State Key Laboratory of Pharmaceutical Biotechnology Department of Biology Nanjing University 210093 Nanjing China

2 Department of Ecology and Evolution University of Chicago Chicago IL 60637 USA to 5000ndash10000 bp from both ends of the hits and then the expanded nucleotide fragments were reannotated using the gene-finding programs FGENESH (http www Proc Natl Acad Sci U S A 2004 June 15 101(24) 9045ndash9050 Genetics

Genetic control of branching in foxtail millet Andrew N Doust Katrien M DevosDaggerdagger Michael D Gadberrysect Mike D GaleDagger and Elizabeth A KelloggDepartment of Biology University of Missouri 8001 Natural Bridge Road St Louis MO 63121 and DaggerJohn Innes Centre Norwich Research Park Colney Norwich NR4 7UH United Kingdom dagger To whom correspondence should be addressed E-mail adoustumsledu sectPresent address Departments of Crop and Soil Sciences and Plant Biology Miller Plant Sciences Building University of Georgia Athens GA 30602 Communicated by Peter H Raven Missouri Botanical Garden St Louis MO April 23 2004 Each of these contigs was scanned by using fgenesh (28) and identified ORFs were translated and compared with ORFs from other contigs from the same QTL region Mol Biol Evol 21(9)1769-1780 2004 Merlin a New Superfamily of DNA Transposons Identified in Diverse Animal Genomes and Related to Bacterial IS1016 Insertion Sequences Ceacutedric Feschotte1 Departments of Plant Biology and Genetics The University of Georgia Athens Correspondence E-mail cedricplantbiougaedu coding sequences were assembled by removing introns predicted with more than 85 confidence by NetGene2 (httpwwwcbsdtudk) andor FGENESH (httpgenomic Genome Research 141924ndash1931 copy2004 Gene Loss and Movement in the Maize Genome Jinsheng Lai1 Jianxin Ma23 Zuzana Swigonˇovб1 Wusirika Ramakrishna24

Eric Linton15 Victor Llaca16 Bahattin Tanyolac17 Yong-Jin Park28 O-Young Jeong29

Jeffrey L Bennetzen23 and Joachim Messing110 1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854-8020 USA 2Department of Biological Sciences Purdue University West Lafayette Indiana 47907-1392 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602-7223 USA 4Department of Biological Sciences 740 DOW Michigan Tech University Houghton MI 49931 USA 5Plant Biology Labs Michigan State University East Lansing MI 48824 USA 6Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880-0353 USA 7Department of Bioengineering Ege University Izmir 35100 Turkey 8National Institute of Agricultural Biotechnology Suwon 441-707 Republic of Korea 9National Institute of Crop Science Suwon 441-857 Republic of Korea 10Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 The FGENESH program predicted four of which three (gene 1d in the maize orp1 region gene 5a 5b in the rice r1 region) would produce truncated proteins

Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Molecular Microbiology Volume 54 Issue 2 Page 407 - October 2004 doi101111j1365-2958200404310x Cryptococcus neoformans Kin1 protein kinase homologue identified through a Caenorhabditis elegans screen promotes virulence in mammals Eleftherios Mylonakis1 Alexander Idnurm2 Roberto Moreno1 Joseph El Khoury134 James B Rottman5 Frederick M Ausubel67 Joseph Heitman28910 and Stephen B Calderwood1111Division of Infectious Diseases Massachusetts General Hospital Boston MA 02114 USA 2Department of Molecular Genetics and Microbiology Duke University Medical Center Durham NC 27710 USA 3Center for Immunology and Inflammatory Diseases and 4Division of Rheumatology Allergy and Immunology Massachusetts General Hospital Boston MA 02114 USA 5Archemix Corporation Cambridge MA 02139 USA 6Department of Molecular Biology Massachusetts General Hospital Boston MA 02114 USA 7Department of Genetics Harvard Medical School Boston MA 02115 USA 8Division of Infectious Diseases 9Department of Medicine and 10Howard Hughes Medical Institute Duke University Medical Center Durham NC 27710 USA 11Department of Microbiology and Molecular Genetics Harvard Medical School Boston MA 02115 USA E-mail scalderwoodpartnersorg

Sequences were compared with the H99 genome database at Duke University and genes predicted in these regions by FGENESH software ( http wwwsoftberrycom TAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Original Paper Characterization of soybean genomic features by analysis of its expressed sequence tags Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

1 Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

2 Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China Jin-Song Zhang Email jszhanggeneticsaccnShou-Yi Chen Email sychengeneticsaccn six BAC-contig sequences of M truncatula were analyzed and the results based on the gene prediction program FGENSH (Arabidopsis matchFGENESH prediction) (http Current Proteomics Volume 1 Number 1 January 2004 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE11 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Chinese Science Bulletin 2004 Vol 49 No 4 355-362 The VER2 promoter contains repeated sequences and requires vernalization for its activity in winter wheat (Triticum aestivum L) XU Wenzhong

1 WANG Xin

1 FENG Qi

2 ZHANG Lei

2 LIU Yaoguang

3 HAN Bin

2 CHONG

Kang1 XU Zhihong

1 amp TAN Kehui

1

1 Research Center for Molecular Developmental Biology Key Lab of Photosynthesis and Environmental Molecular Physiology Institute of Botany Chinese Academy of Sciences (CAS) Beijing 100093 China 2 National Center for Gene Research CAS Shanghai 200233 China 3 Genetic Engineering Laboratory College of Life Sciences South China Agricultural University Guangzhou 510642 China Correspondence should be addressed to Chong Kang (e-mail mailtochongknsibcasaccn) Sequence analyses were finished using biological softwares on Internet such as FGENESH 10 (Prediction of potential genes in Plant (Dct) genomic DNA)

TAG Theoretical and Applied Genetics DOI 101007s00122-003-1457-z Issue Volume 108 Number 3 Date February 2004 Pages 392 - 400

Sequence variations of simple sequence repeats on chromosome-4 in two subspecies of the Asian cultivated rice Can Li1 Yu Zhang1 Kai Ying1 Xiaolei Liang1 and Bin Han1

(1) National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China To characterize the possible relationship between SSRs and genes predicted by using FGENESH we investigated the distribution of SSRs in the rice chromosome-4 Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene TAG Theoretical and Applied Genetics DOI 101007s00122-004-1591-2 Issue Volume 108 Number 8 Date May 2004 Pages 1449 - 1457

Positional cloning of the rice Rf-1 gene a restorer of BT-type cytoplasmic male sterility that encodes a mitochondria-targeting PPR protein

H Akagi1 A Nakamura2 Y Yokozeki-Misono2 A Inagaki2 4 H Takahashi1 K Mori1 and T Fujimura3

1 Laboratory of Plant Breeding and Genetics Department of Biological Production Faculty of Bioresource Sciences Akita Prefectural University Kaidoubata-Nishi 241-7 Shimoshinjyo-Nakano 010-0195 Akita Japan

2 Biochemical Technology Section Life Science Laboratory Performance Materials RampD Center Mitsui Chemicals Togo 1144 297-0017 Mobara Japan

3 Institute of Agricultural and Forest Engineering University of Tsukuba Tennoudai 1-1-1 Tsukuba 305-8572 Ibaraki Japan

4 Present address Department of Synthetic Chemistry and Biological Chemistry Graduate School of Engineering Kyoto University Yoshida-Honmachi Sakyo-ku 606-8501 Kyoto japan

H Akagi Email akagiakita-puacjp Software Develop- ment Tokyo) Genomic sequences were also analyzed using gene prediction programs genescan and fgenesh Table 1 DNA Genome Research 14942-950 2004

The Ensembl Automatic Gene Annotation System Val Curwen1 Eduardo Eyras1 T Daniel Andrews1 Laura Clarke1 Emmanuel Mongin2 Steven MJ Searle1 and Michele Clamp34 1 The Wellcome Trust Sanger Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SA UK 2 EMBL European Bioinformatics Institute The Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SD UK 3 The Broad Institute Cambridge Massachusetts 02141 USA Commonly we use Genscan for ab initio prediction in human mouse and rat but the system is equally applicable to other methods such as FgenesH (Solovyev et Gene 324 (2004) 105ndash115 Transcript abundance of rml1 encoding a putative GT1-like factor in rice is up-regulated by Magnaporthe grisea and down-regulated by light Rong Wang ab Guofan Hongab Bin Hana a

National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 500 Caobao Road Shanghai 200233 China b

Shanghai Institute of Biochemistry and Cell Biology Shanghai Institutes for Biological Sciences Chinese Academy of Sciences 320 Yueyang Road Shanghai 200031 China 1A and 2A) The structure of rml1 _ a is as same as that predicted by FGENESH software The 3V UTR of rml1 _ a is confirmed with the length of 596 bp Genome Biology 2004 5R46 doi101186gb-2004-5-7-r46 Identification of conserved gene structures and carboxy-terminal motifs in the Myb gene family of Arabidopsis and Oryza sativa L ssp Indica Cizhong Jiang1 Xun Gu1 2 and Thomas Peterson1 1Department of Genetics Development and Cell Biology and Department of Agronomy Iowa State University Ames IA 50011 USA 2LHB Center for Bioinformatics and Biological Statistics Iowa State University Ames IA 50011 USA FGeneSH has been used successfully to predict genes in rice [9] and GenScan was used together with it to predict genes by taking rice genomic sequences as Molecular Microbiology 53 (5) 1307-1318 - September 2004 doi 101111 j1365-2958200404215x The sirodesmin biosynthetic gene cluster of the plant pathogenic fungus Leptosphaeria maculans Donald M Gardiner1 Anton J Cozijnsen1 Leanne M Wilson1 M Soledade C Pedras2 and Barbara J Howlett1

1School of Botany The University of Melbourne Victoria Australia 3010 2Department of Chemistry University of Saskatchewan 110 Science Place Saskatoon SK Canada S7N 5C9 E-mail dgardinerpgradunimelbeduau http wwwtigrorg Putative genes were predicted using Fgenesh software at http wwwsoftberrycom Fungal culture The wild type Journal of Molecular Evolution DOI 101007s00239-004-2666-z Issue Volume 59 Number 6 Date December 2004 Pages 761 - 770 Analysis of the Molecular Evolutionary History of the Ascorbate Peroxidase Gene Family Inferences from the Rice Genome Felipe Karam Teixeira1 Larissa Menezes-Benavente1 Rogeacuterio Margis1 2 and Maacutercia Margis-Pinheiro1

(1) Laboratoacuterio de Geneacutetica Molecular Vegetal Departamento de Geneacutetica UFRJ 21944-970 Rio de Janeiro Brasil(2) Departamento de Bioquiacutemica Instituto de Quiacutemica UFRJ 21944-970 Rio de Janeiro Brasil Genomic se- quences were also analyzed in the FGENESH gene structure pre- diction program (httpwwwsoftberrycom) (Solovyev 2001) and GeneMark program (http Incomplete gene structure prediction with almost 100 specificity SL Chin J Xiong T Ioerger SH Sze - 2004 - txspacetamuedu iv Fgenesh-C The proposed gene structure prediction algorithm by far has the best 25 12 Comparison against Est2Genome Sim4 Spidey Fgenesh-c Journal of Biotechnology 109 (2004) 217ndash226

Preparation of single rice chromosome for construction of a DNA library using a laser microbeam trap

Xiaohui Liu a1 Haowei Wang b1 Yinmei Li b Yesheng Tang a Yilei Liu a Xin u a Peixin Jia a Kai Ying a Qi Feng a Jianping Guan a Chaoqing Jin a Lei Zhang a Liren Lou b Zhuan Zhou c Bin Han aA National Center for Gene Research Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200233 China B Department of Physics University of Science and Technology of China Hefei 230026 China C Institute of Neuroscience Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai 200031 China Corresponding author Fax +86-21-64825775 E-mail address bhanncgraccn (B Han) ers These sequences were further annotated using gene-prediction software FGENESH to give the pos- sible protein-coding region Science 303 1364-1367 Medicago truncatula DMI1 Required for Bacterial and Fungal Symbioses in Legumes Ane et al (2004) 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberry phtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Proc Natl Acad Sci U S A 2004 August 24 101(34) 12404ndash12410 Inaugural Articles Rapid recent growth and divergence of rice nuclear genomes Jianxin Ma and Jeffrey L Bennetzen

Department of Genetics University of Georgia Athens GA 30602 To whom correspondence should be addressed E-mail maizeugaedu Contributed by Jeffrey L Bennetzen May 25 2004 Almost all LTR-retrotransposons including solo LTRs identified in our studies were predicted as genes by the gene-finding program fgenesh (data not shown) The Plant Journal Volume 37 Issue 4 Page 517 -527 - February 2004 doi101046j1365-313X200301976x Xa26 a gene conferring resistance to Xanthomonas oryzae pv oryzae in rice encodes an LRR receptor kinase-like protein Xinli Sun Yinglong Cao Zhifen Yang Caiguo Xu Xianghua Li Shiping Wang and Qifa Zhang National Key Laboratory of Crop Genetic Improvement National Center of Crop Molecular Breeding Huazhong Agricultural University Wuhan 430070 China For correspondence (fax +86 27 87287092 e-mail swangmailhzaueducn) al 1997) Gene prediction programs used were genscan (Burge and Karlin 1997) and fgenesh (httpwwwsoftberrycom) Promoter Genome Research 141916ndash1923 copy2004 by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonˇovaґ16 Jinsheng Lai16 Jianxin Ma23 Wusirika Ramakrishna24

Victor Llaca15 Jeffrey L Bennetzen23 and Joachim Messing17

1Waksman Institute of Microbiology Rutgers University Piscataway New Jersey 08854 USA

2Department of Biological Sciences and Genetics Program West Lafayette Indiana 47907 USA Present addresses 3Department of Genetics University of Georgia Athens GA 30602 USA 4Department of Biological Sciences Michigan Tech University MI 49931 USA 5Analytical and Genomic Technologies Crop Genetics RampD DuPont Agriculture amp Nutrition Wilmington DE 19880 USA 6These authors contributed equally to this work 7Corresponding author E-MAIL messingwaksmanrutgersedu FAX (732) 445-0072 Page 1 Close Split of Sorghum and Maize Genome Progenitors Zuzana Swigonovaacute 16 Jinsheng Lai 16 Jianxin Ma 23 Wusirika Ramakrishna 24 TAG Theoretical and Applied Genetics DOI 101007s00122-004-1667-z Issue Volume 109 Number 3 Date August 2004 Pages 515 - 522

Physical mapping and putative candidate gene identification of a quantitative trait locus Ctb1 for cold tolerance at the booting stage of rice K Saito1 Y Hayano-Saito1 W Maruyama-Funatsuki1 Y Sato1 and A Kato1

(1) National Agricultural Research Center for Hokkaido Region Hitsujigaoka 1 Toyohira Sapporo Hokkaido 062-8555 JapanK Saito Email kjsaitoaffrcgojp GENSCAN RICEHMM FGENESH MZEF ) a splice prediction program ( SPLICEPREDIC- TOR ) homology search analysis programs ( BLAST HMMER TAG Theoretical and Applied Genetics DOI 101007s00122-004-1697-6 Issue Volume 109 Number 4 Date August 2004 Pages 690 - 699 The anthracnose resistance locus Co-4 of common bean is located on chromosome 3 and contains putative disease resistance-related genes M Melotto1 4 M F Coelho1 A Pedrosa-Harand2 J D Kelly3 and L E A Camargo1

1 Departamento de Fitopatologia Laboratoacuterio de Geneacutetica Molecular ESALQ Universidade de Satildeo Paulo Piracicaba SP CP 9 13418-900 Brazil

2 Department of Cell Biology and Genetics Institute of Botany University of Vienna Rennweg 14 Vienna 1030 Austria

3 Department of Crop and Soil Sciences Michigan State University East Lansing MI 48824 USA 4 Present address MSU-DOE Plant Research Laboratory Michigan State University 206 Plant Biology

Building East Lansing MI 48824 USA M Melotto Email melottommsuedu and Karlin 1997 httpgenesmiteduGENSCANhtml) and FGENESH (httpwww softberrycom)mdashusing Arabidopsis as the model or- ganism Journal of Genetics Vol 83 No 1 P 79-99 April 2004 Structural and functional analysis of rice genome Tyagi A K Khurana J P Khurana P Raghuvanshi S Gaur A Kapur A Gupta V Kumar D Ravi V Vij S Khurana P and Sharma S Department of Plant Molecular Biology University of Delhi South Campus Benito Juarez Road New Delhi 110 021 India It inte- grates results from several gene prediction software such as GENSCAN (Burge and Karlin 1997) FGENESH (Sala- mov and Solovyev 2000) RiceHMM (Sakata The Plant Cell 161220-1234 (2004) Comparative Analysis of the Receptor-Like Kinase Family in Arabidopsis and Rice Shin-Han Shiua Wojciech M Karlowskib Runsun Panad Yun-Huei Tzengac Klaus F X Mayerb and Wen-Hsiung Lia1 a Department of Ecology and Evolution University of Chicago Chicago Illinois 60637 b Munich Information Center for Protein SequencesInstitute of Bioinformatics GSF National Research Center for Environment and Health Neuherberg 85764 Germany c Department of Mathematics National Tsing Hua University Hsinchu Taiwan 300 d Institute of Information Science Academia Sinica Taiwan 115

1 To whom correspondence should be addressed E-mail whliuchicagoedu fax 773-702-9740 a permissive E value cutoff of 1 The rice genes from the indica subspecies was predicted using the whole genome shotgun assembly with FGENESH (Solovyev 2002 Genome Research 141474ndash1482 (2004) copy by Cold Spring Harbor Laboratory Press ISSN 1088-905104 Incongruent Patterns of Local and Global Genome Size Evolution in Cotton Corrinne E Grover1 HyeRan Kim2 Rod A Wing2 Andrew H Paterson3 and Jonathan F Wendel14 1Department of Ecology Evolution and Organismal Biology Iowa State University Ames Iowa 50011 USA 2Arizona Genomics Institute University of Arizona Tucson Arizona 85721 USA 3Plant Genome Mapping Laboratory University of Georgia Athens Georgia 30602 USA hellip Potential genes were predicted by three independent programs FGENESH (httpwwwsoftberrycom) Plant Physiology May 2004 Vol 135 pp 459-470 Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four Triticeae Genomes Yong Qiang Gu Devin Coleman-Derr Xiuying Kong and Olin D Anderson United States Department of Agriculture-Agricultural Research Service Western Regional Research Center Albany California 94710 (YQG DC-D ODA) and Institute of Crop Germplasm Resources Chinese Academy of Agricultural Sciences Beijing 100081 China (XK) FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology October 2004 Vol 136 pp 3177-3190 Comparative Sequence Analysis of the Region Harboring the Hardness Locus in Barley and Its Colinear Region in Rice1

Katherine S Caldwell2 Peter Langridge and Wayne Powell Scottish Crop Research Institute Invergowrie Dundee DD2 5DA United Kingdom (KSC WP) and School of Agriculture and Wine (KSC PL) and Australian Centre for Plant Functional Genomics (PL) University of Adelaide Waite Campus Glen Osmond South Australia 5064 Australia jp Sakata et al 2002 ) which couples the integration of several programs for the prediction of open reading frames (GENSCAN RiceHMM FGENESH MZEF) with GENES amp DEVELOPMENT 18687-699 2004 pyramus and thisbe FGF genes that pattern the mesoderm of Drosophila embryos

Angelike Stathopoulos1 Bergin Tam1 Matthew Ronshaugen1 Manfred Frasch2 and Michael Levine13 1 Department of Molecular and Cell Biology Division of Genetics amp Development University of California Berkeley California 94720-3204 USA 2 Brookdale Department of Molecular Cell and Developmental Biology Mount Sinai School of Medicine New York New York 10029 USA hellip FGF protein sequences used in alignment and phylogenetic reconstruction were gathered from GenBank or inferred from genomic sequence using GENESCAN (Burge and Karlin 1997 ) and FGENESHhellip Genome Research 141888-1901 2004 Organization and Evolution of a Gene-Rich Region of the Mouse Genome A 127-Mb Region Deleted in the Del(13)Svea36H Mouse Ann-Marie Mallon14 Laurens Wilming24 Joseph Weekes1 James GR Gilbert2 Jennifer Ashurst2 Sandrine Peyrefitte2 Lucy Matthews2 Matthew Cadman1 Richard McKeone1 Chris A Sellick1 Ruth Arkell1 Marc RM Botcherby3 Mark A Strivens1 R Duncan Campbell3 Simon Gregory25 Paul Denny1 John M Hancock16 Jane Rogers2 and Steve DM Brown1 1 Medical Research Council Mammalian Genetics Unit Harwell Oxfordshire United Kingdom 2 Wellcome Trust Sanger Institute Hinxton Genome Campus United Kingdom 3 Medical Research Council Rosalind Franklin Centre for Genomics Research Hinxton Genome Campus United Kingdom hellip Ab initio gene structures were predicted using FGENESH (Salamov and Solovyev 2000 ) and GENSCANhellip Current Proteomics January 2004 vol 1 no 1 pp 41-48(8) Annotation of the Human Genome by High-Throughput Sequence Analysis of Naturally Occurring Proteins Authors McGowan SJ1 Terrett J1 Brown CG1 Adam PJ1 Aldridge L1 Allen JC1 Amess B1 Andrews KA1 Barnes M1 Barnwell DE1 Affiliations 1 Oxford GlycoSciences plc The Forum 86 Milton Park Abingdon OX14 4RY UK The polymorphic lsquohypothetical transcriptomersquo was cons- tructed from transcripts predicted by FGENES FGENESH (Softberry Inc Mount Kisco NY USA) GENSCAN Proc Natl Acad Sci U S A 2004 June 29 101(26) 9885ndash9890 Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep Kelly Palaisa Michele MorgantedaggerDagger Scott Tingeydaggersect and Antoni RafalskidaggersectparadaggerDuPont Crop Genetics Molecular Genetics Group 1 Innovation Way Newark DE 19711 Department of Plant and Soil Sciences and Delaware Biotechnology Institute University of Delaware Newark DE 19716 and DaggerDipartimento di Produzione Vegetale e Tecnologie Agrarie Universitaacute degli Studi di Udine Via della Scienze 208 33100 Udine Italy para To whom correspondence should be addressed E-mail j-antonirafalskicgrdupontcom

sectPresent address DuPont Crop Genetics Research DuPont Experimental Station Building E353 Wilmington DE 19880 The Y1 gene comprises a small uninterrupted gene island consisting of the Y1 an MLO homolog lying immediately downstream of Y1 and an fgenesh-predicted gene Nucleic Acids Research 2003 Vol 31 No 4 1148-1155 Characterization of Arabidopsis thaliana ortholog of the human breast cancer susceptibility gene 1 AtBRCA1 strongly induced by gamma rays S Lafarge and M-H Montaneacute CEA Cadarache DSV-DEVM Laboratoire de Radiobiologie Veacutegeacutetale Bat 185 F-13108 St Paul Lez Durance Cedex France To whom correspondence should be addressed Tel +33 4 42 25 35 56 Fax + 33 4 42 25 26 25 Email marie-helenemontaneceafr Received November 18 2002 Accepted December 5 2002 DDBJEMBLGenBank accession no AF515728 hellipGene structure prediction was done on software implemented on the Softberry web page (httpwwwsoftberrycom) analysis of protein domains using the SMARThellip hellipThe gene structure of At4g21070 was determined with three gene structure prediction software packages (Softberry GenScan Grail) hellip To resolve this ambiguity in intronndashexon prediction we postulated the presence of two genes given by Softberry prediction software and performed northern blotting and 5 RACE to characterize the structural organization of the At4g21070 locushellip Proc Natl Acad Sci U S A 2003 July 22 100(15) 9055ndash9060 doi 101073pnas1032999100 Plant Biology Gene expression of a gene family in maize based on noncollinear haplotypes Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway NJ 08854-8020 Communicated by Brian A Larkins University of Arizona Tucson AZ May 19 2003 (received for review 2002 April 10) To whom correspondence should be addressed E-mail messingmbclrutgersedu The FGENESH program (Softberry Mount Kisco NY) was used for gene prediction analysis BMC Genomics 2003 4 22

doi 1011861471-2164-4-22 Published online 2003 June 3 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams 12 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA 3Department of Biological Sciences Wichita State University Wichita Kansas USA

Corresponding author Sreedhar Oduru odurusreedharttuhscedu Janee L Campbell janeecampbellttuhscedu SriTulasi Karri phrskttuhscedu William J Hendry williamhendrywichitaedu Shafiq A Khan shafiqkhanttuhscedu Simon C Williams simonwilliamsttuhscedu

Two gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=H Insect Molecular Biology Volume 12 Issue 4 Page 319 - August 2003 doi101046j1365-2583200300415x Expression of an Aedes aegypti cation-chloride cotransporter and its Drosophila homologues V Filippov K Aimanova and S S GillAffiliations Department of Cell Biology and Neuroscience University of California Riverside USA Correspondence Sarjeet S Gill 5429 Boyce Hall Environmental Toxicology Graduate Program University of California Riverside CA 92521 USA Tel +1 909 787 4621 Fax +1 909 787 3087 E-mail Sarjeetgillucredu significant similarity to the Drosophila genes were used for gene structure prediction with the FGENESH program available on site http wwwsoftberrycom Developmental Biology 256 (2003) 276ndash289 tcl-2 encodes a novel protein that acts synergistically with Wnt signaling pathways in C elegans Xiaojun Zhaoa Hitoshi Sawab and Michael A Hermana a Program in Molecular Cellular and Developmental Biology Division of Biology Kansas State University Manhattan KS 66506 USA b Laboratory for Cell Fate Decision RIKEN Center for Developmental Biology 2-2-3 Minatojima-minamimachi Chuo-ku Kobe 650-0047 Japan Received for publication 10 September 2002 revised 25 November 2002 accepted 19 December 2002 hellipCbTCL-2 is conceptually translated from a gene predicted by the FGENSH (Salamov and Solovyev 2000 httpwwwsoftberrycom) using defaults for C elegans genomic sequences Proc Natl Acad Sci U S A 2003 May 27 100(11) 6569ndash6574 doi 101073pnas0732024100 Evolution Molecular paleontology of transposable elements in the Drosophila melanogaster genome Vladimir V Kapitonov and Jerzy Jurka

Genetic Information Research Institute 2081 Landings Drive Mountain View CA 94043 Communicated by Margaret G Kidwell University of Arizona Tucson AZ April 7 2003 (received for review 2002 December 23)

To whom correspondence may be addressed E-mail vladimirulamgirinstorg or jurkagirinstorg hellipWe used FGENESH (ref 18 wwwsoftberrycom) for identifying genes encoded by TEs Genetics and Molecular Biology ISSN 1415-4757 versioacuten impresa Genet Mol Biol v26 n4 Satildeo Paulo dic 2003 Iron homeostasis related genes in rice Jeferson GrossI II Ricardo Joseacute SteinII Arthur Germano Fett-NetoI II Janette Palma FettI II

IUniversidade Federal do Rio Grande do Sul Centro de Biotecnologia Porto Alegre RS Brazil IIUniversidade Federal do Rio Grande do Sul Departamento de Botacircnica Porto Alegre RS Brazil IIIBotanical Institute of Ludwig-Maximilians-Universitaumlt Muumlnchen Germany The prediction algorithms were GenScan (Burge and Karlin 1997 httpgenesmiteduGENSCANhtml) GenomeScan (Burge and Karlin 1997 httpgenesmitedugenomescanhtml) FGENESH (Salamov and Solovyev 2000 httpwwwsoftberrycomberryphtmltopic= gfind) GeneMarkhmm (Borodovsky and Lukashin unpublished httpopalbiologygatecheduGeneMarkeukhmmcgi) and GrailEXP (Xu and Uberbacher 1997 httpcompbioornlgovgrailexp) Characterization of the Maize Endosperm Transcriptome and Its Comparison to the Rice Genome J Lai N Dey CS Kim AK Bharti S Rudd KFX Mayer hellip for the rice genome from this data set (Table 3) using the FGENESH program with the default setting for monocotyledonous genes (httpwwwsoftberrycom) Published 23 September 2003 BMC Plant Biology 2003 36 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya12 Address 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia and 2NSW Agricultural Genomics Centre Wagga Wagga Australia Email Qian-Hao Zhu - qianhaozhucsiroau Mohammad Shamsul Hoque - mohammadhoquecsiroau Elizabeth S Dennis - lizdenniscsiroau Narayana M Upadhyaya - narayanaupadhyayacsiroau Corresponding author hellip Analyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCANhellip

The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Australasian Plant Pathology Volume 32 Number 4 2003 pp 511-519 Small scale functional genomics of the blackleg fungus Leptosphaeria maculans analysis of a 38 kb region Alexander Idnurm Janet L Taylor M Soledade C Pedras and Barbara J Howlett vertebrate and Arabidopsis settings Burge and Karlin 1997) and FGENESH on Neurospora crassa and Schizosaccharomyces pombe settings (wwwsoftberrycom) as Barley Genetics Newsletter Volume 32 Hard-copy edition pages 34 - 37 MAPPING AND SEQUENCING OF THE BARLEY PUTATIVE HYPERSENSITIVE INDUCED REACTION GENES Nils Rostoks1 David Kudrna1 and Andris Kleinhofs12

1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 2 School of Molecular Biosciences Washington State University Pullman WA 99164 The full length coding sequence was reconstructed using a combination of FGENESH gene prediction program (httpwwwsoftberrycom) and alignment with cDNAs from the other barley HIR groups TAG Theoretical and Applied Genetics DOI 101007s00294-003-0391-6 Issue Volume 43 Number 5 Date August 2003 Pages 351 - 357 Characterisation of the mating-type locus of the plant pathogenic ascomycete Leptosphaeria maculans Anton J Cozijnsen A1 and Barbara J Howlett A1 A1 School of Botany The University of Melbourne 3010 Victoria Australia hellipGenes introns exons and transcription initiation sites were predicted by analysis with FGENESH (wwwsoftberrycom) on Neurospora crassa andhellip BMC Plant Biol 2003 3 6

doi 1011861471-2229-3-6 Published online 2003 September 23 Received May 23 2003 Accepted September 23 2003 Ds tagging of BRANCHED FLORETLESS 1 (BFL1) that mediates the transition from spikelet to floret meristem in rice (Oryza sativa L) Qian-Hao Zhu12 Mohammad Shamsul Hoque12 Elizabeth S Dennis12 and Narayana M Upadhyaya 12 1CSIRO Plant Industry GPO Box 1600 Canberra ACT 2601 Australia 2NSW Agricultural Genomics Centre Wagga Wagga Australia

Corresponding author Qian-Hao Zhu qianhaozhucsiroau Mohammad Shamsul Hoque mohammadhoquecsiroau Elizabeth S Dennis lizdenniscsiroau Narayana M Upadhyaya narayanaupadhyayacsiroau hellipAnalyses of 10-kb sequence of the ~30-kb contig 239 from the China Rice Genome database harboring the BFL1 locus by gene prediction programs FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml identified a single-exon gene capable of encoding a protein with the DNA binding domain of the EREBPAP2 family of plant transcription factors [2636] 1515 bp downstream from the Ds insertion The ORF was identified by using FGENESH httpwwwsoftberrycomberryphtml and GENSCAN httpgenesmiteduGENSCANhtml Alignment of EREBPAP2 domains was performed using programs of Genetics Computer Group Wisconsin software suit [11] Genetics Vol 164 655-664 June 2003 Copyright copy 2003 Map-Based Cloning of Leaf Rust Resistance Gene Lr21 From the Large and Polyploid Genome of Bread Wheat Li Huanga Steven A Brooksa Wanlong Lia John P Fellersb Harold N Tricka and Bikram S Gilla a Wheat Genetics Resource Center Department of Plant Pathology Kansas State University Manhattan Kansas 66506-5502 b USDA-ARS Plant Science and Entomology Unit Kansas State University Manhattan Kansas 66506-5502 Corresponding author Bikram S Gill 4024 Throckmorton Kansas State University Manhattan KS 66506-5502 bsgksuedu (E-mail hellipIn addition FGENSH 11 (httpwwwsoftberrycom) was used for gene prediction (with monocot genomic DNA parameters) Nucleic Acids Research 2003 Vol 31 No 1 229-233 The TIGR rice genome annotation resource annotating the rice genome and creating resources for plant biologists Qiaoping Yuan Shu Ouyang Jia Liu Bernard Suh Foo Cheung Razvan Sultana Dan Lee John Quackenbush and C Robin Buell The Institute for Genomic Research 9712 Medical Center Dr Rockville MD 20850 USA To whom correspondence should be addressed Tel +1 301 8383558 Fax +1 301 8380208 Email rbuelltigrorg Received August 14 2002 Revised and Accepted October 2 2002 hellipThe rice sequences were processed with multiple ab initio gene finders including FGENESH (httpwwwsoftberrycom)hellip hellip Working models were generated using the FGENESH output and putative identification for the gene was obtained from the most significant database match while models with no significant database match were labeled as hypothetical proteins

JXB Advance Access originally published online on June 18 2003 Journal of Experimental Botany Vol 54 No 389 pp 1995-1996 August 1 2003 Received 21 April 2003 Accepted 25 April 2003 OsSET1 a novel SET-domain-containing gene from rice Yun-Kuan Liang Ying Wang Yong Zhang Song-Gang Li Xiao-Chun Lu Hong Li Cheng Zou Zhi-Hong Xu and Shu-Nong Bai PKU-Yale Joint Research Center of Agricultural and Plant Molecular Biology National Key Laboratory of Protein Engineering and Plant Gene Engineering College of Life Sciences Peking University 5 Yiheyuan Road Beijing 100871 PR China To whom correspondence should be addressed Fax +86 10 6275 1526 E-mail shunongbpkueducn It localizes at chromosome three in rice genome at the contig 1300 (httpwwwsoftberrycomberryphtmltopic=gfindampprg=FGENESH GenBank accession number BMC Genomics 2003 422 Published 3 June 2003 Received 31 January 2003 Accepted 3 June 2003 This article is available from httpwwwbiomedcentralcom1471-2164422 Gene discovery in the hamster a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs Sreedhar Oduru1 Janee L Campbell1 SriTulasi Karri1 William J Hendry3 Shafiq A Khan1 and Simon C Williams12 Address 1Department of Cell Biology amp Biochemistry Texas Tech University Health Sciences Center Lubbock Texas USA 2Southwest Cancer Center at UMC Lubbock Texas USA and 3Department of Biological Sciences Wichita State University Wichita Kansas USA Email Sreedhar Oduru - odurusreedharttuhscedu Janee L Campbell - janeecampbellttuhscedu SriTulasi Karri - phrskttuhscedu William J Hendry - williamhendrywichitaedu Shafiq A Khan - shafiqkhanttuhscedu Simon C Williams - simonwilliamsttuhscedu Corresponding author hellipTwo gene prediction programs were used FGENESH httpwwwsoftberrycomberryphtml and GENEMARK httpopalbiologygatecheduGeneMarkeukhmmcgiorg=HsapiensTAG Theoretical and Applied Genetics DOI 101007s00122-003-1499-2 Issue Volume 108 Number 5 Date March 2004 Pages 903 - 913 Characterization of soybean genomic features by analysis of its expressed sequence tags

Ai-Guo Tian1 Jun Wang2 Peng Cui2 Yu-Jun Han2 Hao Xu2 Li-Juan Cong2 Xian-Gang Huang2 Xiao-Ling Wang2 Yong-Zhi Jiao2 Bang-Jun Wang1 Yong-Jun Wang1 Jin-Song Zhang1 and Shou-Yi Chen1

Jin-Song Zhang Email jszhanggeneticsaccn

Shou-Yi Chen Email sychengeneticsaccn Phone +86-10-64886859 Fax +86-10-64873428

(1) Plant Biotechnology Laboratory Institute of Genetics and Developmental Biology Chinese Academy of Sciences Datun Road 100101 Beijing China

(2) Beijing Genomics Institute Chinese Academy of Sciences 101300 Beijing China

prediction of these BAC-contig sequences was based on the gene-prediction program FGENSH (Arabidopsis matchFGENESH DDT Vol 7 No 11 (Suppl) 2002 S70-S76 wwwdrugdiscoverytodaycom Genome annotation techniques new approaches and challenges Alistair G Rust Emmanuel Mongin and Ewan Birney European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus Hinxton Cambridge UK CB10 1SD tel +44 1223 494420 fax +44 1223 494468 e-mail birneyebiacuk Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Box 1 Useful human genome annotation and browser URLs Human genome browsers bull UCSC Human Genome Browser httpgenomecseucsceducgi-binhgGateway bull Softberry Genome Explorer httpwwwsoftberrycomberryphtmltopic=genomexp Ab initio gene prediction programs Ab initio gene predictors rely on the statistical qualities of exons rather than on homologies Examples of such prediction programs include Genscan [14] (used by Celera ORNL and Ensembl) fgenesh [15] (used in the Softberry and UCSC browsers as well as Celerarsquos pipeline) and GrailEXP [16] (ORNL) Proc Natl Acad Sci U S A 2002 August 20 99(17) 11423ndash11428 doi 101073pnas162276199 Neurobiology Identification of G protein-coupled receptors for Drosophila PRXamide peptides CCAP corazonin and AKH supports a theory of ligand-receptor coevolution Yoonseong Parkdagger Young-Joon Kim and Michael E AdamsdaggerDagger

Departments of Entomology and daggerCell Biology and Neuroscience 5429 Boyce Hall University of California Riverside CA 92521 Edited by Lutz Birnbaumer National Institutes of Health Research Triangle Park NC and approved June 14 2002 (received for review 2002 May 7) DaggerTo whom reprint requests should be addressed E-mail adamsmailucredu

This article has been corrected See Proc Natl Acad Sci U S A 2002 October 15 99(21) 13961b For each Drosophila GPCR prediction of gene structure was made in FGENESH (wwwsoftberrycom ref 21) by using about 20 kb of genomic sequence surrounding highly conserved regions particularly for 5 prime and 3 prime ends of ORFs Putative Drosophila GPCRs in the database were amplified by RT-PCR using primers based on gene predictions in the FGENESH gene finder (wwwsoftberrycom ref 21) 21 Salamov A A amp Solovyev V V (2000) Genome Res 10 516-522

prediction httpwwwsoftberrycomberry

Eukaryotic Cell October 2002 p 719-724 Vol 1 No 5 Isocitrate Lyase Is Essential for Pathogenicity of the Fungus Leptosphaeria maculans to Canola (Brassica napus) Alexander Idnurm and Barbara J Howlett School of Botany The University of Melbourne Melbourne Victoria 3010 Australia Received 17 June 2002 Accepted 29 July 2002 hellip The DNA sequence obtained was compared to those in the GenBank database by using BLAST (1) and genes were predicted by using FGENESH software (httpwwwsoftberrycom) and GENSCAN (wwwbionavigatorcom) Bio-Almanac GLE ToxExpress P Offering CLG Annotated sequence data The genes are identified with the FGENESH11 gene modeling software exclusively li- censed from Softberry Inc Automatic Analysis of 106 kb of contiguous DNA sequence from the D genome of wheat reveals high gene density hellip SA Brooks L Huang BS Gill JP Fellers trix In addition FGENESH 11 (httpwwwsoftberrycom) was used for CDS prediction with monocot genomic DNA parameters Both Molecular Genetics and Genomics DOI 101007s00438-002-0706-1 Issue Volume 267 Number 6 Date August 2002 Pages 713 - 720 Genome sequencing of a 239-kb region of rice chromosome 10L reveals a high frequency of gene duplication and a large chloroplast DNA insertion Q Yuan J Hill J Hsiao K Moffat S Ouyang Z Cheng J Jiang C Buell A1 The Institute for Genomic Research 9712 Medical Center Drive Rockville MD 20850 USA A2 Department of Horticulture University of Wisconsin Madison WI 53706 USA The sequences were analyzed with several gene prediction programs including FGENESH (httpwwwsoftberrycom) Genemarkhmm (rice matrix httpopalbiology

Genetics Vol 162 1389-1400 November 2002 Copyright copy 2002 Different Types and Rates of Genome Evolution Detected by Comparative Sequence Analysis of Orthologous Segments From Four Cereal Genomes Wusirika Ramakrishnaa Jorge Dubcovskyb Yong-Jin Park1a Carlos Bussob John Embertona Phillip SanMiguelc and Jeffrey L Bennetzena a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Department of Agronomy and Range Science University of California Davis California 95616 c Purdue University Genomics Core WSLR Purdue University West Lafayette Indiana 47907

Corresponding author Jeffrey L Bennetzen Hansen Bldg Purdue University West Lafayette IN 47907 maizebilbobiopurdueedu (E-mail)

FGENESH (httpwwwsoftberrycomnucleohtml) with the maize training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and

GeneMarkhmm (httpgenemarkbiologygatecheduGene Mark) Functional amp Integrative Genomics DOI 101007s10142-002-0055-5 Issue Volume 2 Numbers 1-2 Date May 2002 Pages 51 - 59 Genomic sequencing reveals gene content genomic organization and recombination relationships in barley Nils Rostoks Yong-Jin Park Wusirika Ramakrishna Jianxin Ma Arnis Druka Bryan A Shiloff Phillip J SanMiguel Zeyu Jiang Robert Brueggeman Devinder Sandhu Kulvinder Gill Jeffrey L Bennetzen Andris Kleinhofs A1 Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA A2 Department of Biological Sciences Purdue University West Lafayette IN 47907 USA A3 National Center for Genome Resources 2935 Rodeo Park Drive East Santa Fe NM 87505 USA A4 G302 Agronomy Hall Iowa State University Ames IA 50011-1010 USA A5 Department of Agronomy University of Nebraska Lincoln NE 68583 USA A6 School of Molecular Biosciences and Department of Crop and Soil Sciences Washington State University Pullman WA 99164 USA version 10 with maize parameters The FGENESH predictions were run at httpwwwsoftberrycom BAC genomic regions were defined Structural organization of the barley D-hordein locus in comparison with its orthologous regions of hellip YQ Gu OD Anderson CF Londeore X Kong RN hellip et al 1997) to search for additional genes In addition FGENESH (httpwwwsoftberrycomberryphtml) and GENESCAN (httpgenes

Published online before print June 20 2002 101073pnas142284999 PNAS | July 9 2002 | vol 99 | no 14 | 9328-9333 The barley stem rust-resistance gene Rpg1 is a novel disease-resistance gene with homology to receptor kinases R Brueggeman N Rostoks D Kudrna A Kilian F Han J Chensect A Druka B Steffensonpara and A Kleinhofs Department of Crop and Soil Sciences Washington State University Pullman WA 99164-6420 para Department of Plant Pathology 495 Borlaug Hall 1991 Upper Buford Circle St Paul MN 55108-6030 and School of Molecular Biosciences Washington State University Pullman WA 99164-4234 Communicated by Diter von Wettstein Washington State University Pullman WA May 13 2002 (received for review March 25 2002) The gene prediction programs GENSCAN (httpgenesmiteduGENSCANhtml) and FGENESH (httpwwwsoftberrycom) as well as NEURAL NETWORK PROMOTER

PREDICTION (httpwwwfruitflyorgseq_toolspromoterhtml) localized the putative transcription start site of the gene about 400 bp upstream of the translation start site Plant Physiol 2002 December 130(4) 1626ndash1635 doi 101104pp012179 Received July 30 2002 Accepted October 1 2002 Contiguous Genomic DNA Sequence Comprising the 19-kD Zein Gene Family from Maize1

Rentao Song and Joachim Messing

Waksman Institute Rutgers The State University of New Jersey 190 Frelinghuysen Road Piscataway New Jersey 08854ndash8020 Corresponding author e-mail messingmbclrutgersedu fax 732ndash445ndash0072 Draft sequences generated from high-throughput DNA sequencing (phase II) were subjected to gene prediction programs with FGENESH (Softberry Inc Mount Kisco NY) The Plant Cell Vol 14 3213-3223 December 2002 Copyright copy 2002 Received July 22 2002 accepted September 26 2002 Structural Analysis of the Maize Rp1 Complex Reveals Numerous Sites and Unexpected Mechanisms of Local Rearrangement Wusirika Ramakrishnaa John Embertona Matthew Ogdena Phillip SanMiguelb and Jeffrey L Bennetzen1a a Department of Biological Sciences Purdue University West Lafayette Indiana 47907 b Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 1 To whom correspondence should be addressed E-mail maizebilbobiopurdueedu fax 765-496-1496 FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCAN Plant Physiol 2002 December 130(4) 1728ndash1738

doi 101104pp014951 Comparative Sequence Analysis of the Sorghum Rph Region and the Maize Rp1 Resistance Gene Complex Wusirika Ramakrishna John Emberton Phillip SanMiguel Matthew Ogden Victor Llaca Joachim Messing and Jeffrey L Bennetzen

Department of Biological Sciences Purdue University West Lafayette Indiana 47907 (WR JE MO JLB) Purdue University Genomics Core Purdue University West Lafayette Indiana 47907 (PSM) and Waksman Institute Rutgers University Piscataway New Jersey 08854 (VL JM) Corresponding author e-mail maizebilbobiopurdueedu fax 765ndash496ndash1496 Received September 19 2002 Accepted October 8 2002 hellipAnnotation and sequence analysis were performed as described earlier (Dubcovsky et al 2001 Song et al 2001 Ramakrishna et al 2002a ) FGENESH (httpwwwsoftberrycomberryphtml) with the monocot training set was used for gene prediction in addition to GENSCAN (httpgenesmiteduGENSCANhtml) and GeneMarkhmm (httpopalbiologygatecheduGeneMarkeukhmmcgi) Rapid Genome Evolution Revealed by Comparative Sequence Analysis of Orthologous Regions from Four hellipYQ Gu D Coleman-Derr X Kong OD Anderson FGENESH (httpwwwsoftberrycomnucleohtml) and GENESCAN (httpgenemark miteduGENESCANhtm) were used for gene prediction Plant Physiology 2004 - plantphysiolorg - plantphysiolorg - intlplantphysiolorg - ncbinlmnihgov - all 5 versions raquoA Genome-Wide Screen Identifies Genes Required for Centromeric Cohesion JJ Doyle J Denarie F Debelle JC Prome BB Amor hellip 17 HK Choi et al Genetics in press 18 FGENESH see wwwsoftberrycomberryphtml 19 E Bornberg-Bauer E Rivals M Vingron Nucleic Acids Res

Various programs Plant Molecular Biology Issue Volume 58 Number 3 Date June 2005 Pages 421 ndash 433 DOI 101007s11103-005-5702-5 OsPPR1 a pentatricopeptide repeat protein of rice is essential for the chloroplast biogenesis Kodiveri M Gothandam1 Eun-Sook Kim1 Hongjoo Cho1 and Yong-Yoon Chung1

(1) School of Life Sciences and Biotechnology Korea University Sungbuk-ku 136-701 Seoul Anam-Dong Korea nucleotide and amino acid sequences were analyzed by the Basic Local Alignment SearchTool (BLAST) and the Soft berry prog- rame (httpwwwsoftberrycom)

Plant Physiol February 2002 Vol 128 pp 336-340 wwwplantphysiolorgcgidoi101104pp010875 Received September 25 2001 returned for revision September 29 2001 accepted November 2 2001 Cellulose Synthase-Like Genes of Rice1

Samuel P Hazen John S Scott-Craig and Jonathan D Walton Department of Energy-Plant Research Laboratory Michigan State University East Lansing Michigan 48824 Corresponding author e-mail waltonmsuedu fax 517-353-9168 The Rice Genome Research Program cDNA clones were of high quality all but one were viable and accurately annotated The one exception D22177 was chimeric containing OsCSLA2 at one end and a predicted DNA-binding protein at the other For all sequences the corresponding

proteins were deduced using gene prediction software from GeneMark (Atlanta httpopalbiologygatecheduGeneMark) and Softberry Inc (White Plains NY httpwwwsoftberrycom) and by manual alignment with the Arabidopsis Csl proteins and with each other IVET experiments in Pseudomonas fluorescens reveal cryptic promoters at loci associated Silby et al Microbiology2004 150 518-520 MW Silby PB Rainey SB Levy Using SoftBerry software (httpwwwsoftberrycomberryphtml) -35 and -10 boxes and a transcriptional start site were predicted 84 60 and 44 bp upstream of Plant Cell 2002 September 14(9) 2107ndash2119 Received March 25 2002 Accepted June 14 2002 Two Novel Fungal Virulence Genes Specifically Expressed in Appressoria of the Rice Blast Fungus

Chaoyang Xuea Gyungsoon Parka Woobong Choib Li Zhengc Ralph A Deanb and Jin-Rong Xua1

aDepartment of Botany and Plant Pathology Purdue University West Lafayette Indiana 47907 bDepartment of Plant Pathology North Carolina State University Raleigh North Carolina 27606 cSyngenta Agribusiness Biotechnology Research Inc Research Triangle Park North Carolina 27709 1To whom correspondence should be addressed E-mail xubtnypurdueedu fax 765-494-0363 Approximately 12- and 14-kb upstream sequences of GAS1 and GAS2 were sequenced and analyzed with several programs including TRES (wwwbioportalbicnusedusgtres) Expasy (wwwexpasyorg) and SoftBerry (wwwsoftberrycom) European Journal of Neuroscience January 2002 vol 15 no 1 pp 79-86(8) Characterizing CGI-94 (comparative gene identification-94) which is down-regulated in the hippocampus of early stage Alzheimers disease brain Heese K Nakayama T Hata R Masumura M Akatsu H Li F Nagai Y Yamamoto T Kosaka K Suemoto T Sawada T Additionally protein sequence analysis was performed using the following programs at ExPASy httpwwwexpasych softberry httpwwwsoftberrycomindex Journal of Cellular Biochemistry Volume 91 Issue 5 Pages 1030 - 1042 Published Online 25 Feb 2004 Copyright copy 2004 Wiley-Liss Inc A Wiley Company Received 4 September 2003 Accepted 21 November 2003 Characterizing the new transcription regulator protein p60TRP K Heese 1 T Yamada 1 H Akatsu 2 T Yamamoto 2 K Kosaka 2 Y Nagai 1 T Sawada 1

1BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan 2Choju Medical Institute Fukushimura Hospital 19-14 Aza-Yamanaka Noyori Toyohashi Aichi 441-8124 Japan email K Heese (heeseksilverocnnejp) Correspondence to K Heese BF Research Institute co National Cardiovascular Center 5-7-1 Fujishirodai Suita Osaka 565-0873 Japan expasych) softberry httpwwwsoftberry comindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorg toolsaacomp Genome Research 14929-933 2004 ISSN 1088-9051 $500 The Ensembl Core Software Libraries Arne Stabenau1 Graham McVicker1 Craig Melsopp1 Glenn Proctor1 Michele Clamp2 and Ewan Birney13 1 EMBL European Bioinformatics Institute Wellcome Trust Genome Campus Hinxton CB10 1SD UK 2 The Broad Institute Cambridge Massachusetts 02141-2023 USA

2003 ) Finally there have been several commercial genome management products based on proprietary technology from Softberry Celera and Doubletwist Generation of T-DNA tagging lines with a bidirectional gene trap vector 2 and the establishment of hellip G An annotated in the public databases we undertook 274 annotation with the Softberry program (http 275 wwwsoftberrycomberryphtml) Functional clas- 276 Journal of Bacteriology January 2002 p 183-190 Vol 184 No 1 0021-919301$0400+0 DOI 101128JB1841183-1902002 Received 25 July 2001 Accepted 11 October 2001 Regulation of the acuF Gene Encoding Phosphoenolpyruvate Carboxykinase in the Filamentous Fungus Aspergillus nidulans Michael J Hynes Oliver W Draht and Meryl A Davis Department of Genetics University of Melbourne Parkville Victoria 3010 Australia The Protein Sequence Analysis program (httpwwwsoftberrycomproteinhtml) predicted a PEPCK (ATP) signature sequence between amino acids 275 and 290 Proteins Structure Function and GeneticsVolume 53 Issue S6 Pages 352 - 368 Supplement Fifth Meeting on the Critical Assessment of Techniques for Protein Structure Prediction Published Online 15 Oct 2003 Received 4 March 2003 Accepted 23 June 2003 Digital Object Identifier (DOI) 101002prot10543 Comp Modeling Assessment Assessment of homology-based predictions in CASP5 Anna Tramontano 1 Veronica Morea 2

1Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Rome Italy 2CNR Institute of Molecular Biology and Pathology University of Rome La Sapienza Rome Italy email Anna Tramontano (AnnaTramontanouniromalit) Correspondence to Anna Tramontano Department of Biochemical Sciences A Rossi Fanelli University of Rome La Sapienza Ple Aldo Moro 5-00185 Rome Italy hellipPage 1 Assessment of Homology-Based Predictions in CASP5 Anna Tramontano 1 and Veronica Morea 2 1 Department of Biochemical Sciences American Journal of HematologyVolume 73 Issue 3 Pages 161 - 168 Published Online 20 Jun 2003 Received 23 October 2002 Accepted 15 April 2003 Digital Object Identifier (DOI) 101002ajh10358

Spectrum of thalassemia mutations and HbF levels in the heterozygous Moroccan population Wafaa Lemsaddek 1 Isabel Picanccedilo 2 Filomena Seuanes 2 Lahoucine Mahmal 3 Saacircd Benchekroun 3 Mohammed Khattab 4 Paulo Nogueira 5 Leonor Osoacuterio-Almeida 1

1Laboratoacuterio de Geneacutetica Molecular Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa Caparica Portugal 2Laboratoacuterio de Hematologia Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal 3Service Heacutemato-Oncologie Hocircpital 20 Ao t CHU Ibn Rochd Casablanca Morocco 4Service Heacutemato-Oncologie Peacutediatrique Hocircpital dEnfants CHU Rabat Morocco 5Observatoacuterio Nacional de Sauacutede Instituto Nacional de Sauacutede Dr Ricardo Jorge Lisboa Portugal email Leonor Osoacuterio-Almeida (mlofctunlpt) Correspondence to Leonor Osoacuterio-Almeida Laboratoacuterio de Geneacutetica Molecular Secccedilatildeo Autoacutenoma de Biotecnologia Faculdade de Ciecircncias e Tecnologia Universidade Nova de Lisboa 2829-516 Caparica Portugal hellip Page 1 Spectrum of Thalassemia Mutations and HbF Levels in the Heterozygous Moroccan Population Wafaa Lemsaddek 1 Isabel Picanccedilo Lecture Notes in Computer Science Publisher Springer-Verlag Heidelberg ISSN 0302-9743 Subject Computer Science Volume 2812 2003 Title Algorithms in Bioinformatics Third International Workshop WABI 2003 Budapest Hungary September 15-20 2003 Proceedings Editors Gary Benson Roderic Page ISBN 3-540-20076-2 DOI 101007b13243 Chapter pp 124 - 138 Online Date December 2003 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron1 and Irena Rusu1

(1) IRIN Universiteacute de Nantes 2 Rue de la Houssiniegravere BP 92208 44322 Nantes Cedex 3 France Page 1 Pattern Discovery Allowing Wild-Cards Substitution Matrices and Multiple Score Functions Alban Mancheron and Irena Rusu Internal Medicine Journal Volume 34 Issue 3 Page 79 -90 - March 2004 doi101111j1444-0903200400581x Received 23 October 2003 accepted 15 December 2003 Systematic genome-wide approach to positional candidate cloning for identification of novel human disease genes

H Kiyosawa 1 T Kawashima 2 D Silva 3 N Petrovsky 4 Y Hasegawa 25 K Sakai 6 and Y Hayashizaki 257

Affiliations 1Technology and Development team for Mammalian Cellular Dynamics Bioresource Center RIKEN Tsukuba Institute Tsukuba Ibaraki 2Laboratory for Genome Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute and 3John Curtin School of Medical Research Australian National University and 4Medical Informatics Centre University of Canberra Canberra Australian Capital Territory Australia5Division of Genomic Information Resource Exploration Science of Biological Supramolecular Systems Yokahama City University Graduate School of Integrated Science Tsurumi-ku Yokohama Kanagawa 6Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Mishima Shizuoka and 7Genome Science Laboratory RIKEN Wako Saitama Japan Correspondence Yoshihide Hayashizaki Laboratory for Exploration Research Group RIKEN Genomic Sciences Center RIKEN Yokohama Institute Suehiro-cho 1-7-22 Tsurumi-ku Yokohama Kanagawa 230-0045 Japan Email rgscerggscrikengojp The data described in the present paper can be accessed on the web at httpgenomegscrikengojpmpcc hellipPage 1 Internal Medicine Journal 2004 34 79ndash90 O RIGINAL A RTICLE Systematic genome-wide approach to positional candidate cloning Brain Aging Vol 2 No 3 2002 Page 9-22 Long-term Efficacy of Cholinesterase Inhibitors Serge Gauthier McGill Centre for Studies in Aging Quebec Canada Correspondence Dr Serge Gauthier FRCPC McGill Centre for Studies in Aging 6825 LaSalle Boulevard Verdun Quebec Canada H4H 1R3 Tel +1 514 766 2010 Fax +1 514 888 4050 Email sergegauthiermcgillca hellipAdditionally protein sequence analysis was performed using the following programs at the ExPASyndashwwwndashserver (httpwwwexpasych) softberry httpwwwsoftberrycomindexhtml and Amino Acid Composition Search (AACompIdent) httpkrexpasyorgtoolsaacomp Plant Physiology December 2003 Vol 133 pp 2040ndash2047 Generation and Analysis of End Sequence Database for T-DNA Tagging Lines in Rice1 Suyoung An2 Sunhee Park2 Dong-Hoon Jeong Dong-Yeon Lee Hong-Gyu Kang Jung-Hwa Yu Junghe Hur Sung-Ryul Kim Young-Hea Kim Miok Lee Soonki Han Soo-Jin Kim Jungwon Yang Eunjoo Kim Soo Jin Wi Hoo Sun Chung Jong-Pil Hong Vitnary Choe Hak-Kyung Lee Jung-Hee Choi Jongmin Nam Seong-Ryong Kim Phun-Bum Park Ky Young Park Woo Taek Kim Sunghwa Choe Chin-Bum Lee and Gynheung An National Research Laboratory of Plant Functional Genomics Division of Molecular and Life Sciences Pohang University of Science and Technology Pohang 790ndash784 Korea (SA SP D-HJ D-YL H-GK J-HY JH S-RK Y-HK ML GA) Department of Life Science Sogang University Seoul 121ndash742 Korea (SH S-JK S-RK) Department of Genetic Engineering Suwon University Suwon 445ndash743 Korea (JY EK P-BP) Department of Biology Sunchon National University Sunchon 540ndash742 Korea (SJW KYP) Department of

Biology Yonsei University Seoul 120ndash749 Korea (HSC J-PH WTK) Department of Biology Seoul National University Seoul 151ndash747 Korea (VC SC) Department of Biology Dong-eui University Pusan 614ndash714 Korea (H-KL J-HC C-BL) and Institute of Molecular Evolutionary Genetics and Department of Biology Pennsylvania State University 208 Mueller Laboratory University Park Pennsylvania 16802 (JN) hellipIf a particular sequence had not yet been annotated in the public database the sequence surrounding the insertion site was annotated using the Softberry program (httpwwwsoftberrycom) and the GeneMark program (httpopalbiologygatecheduGeneMark) Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie F Moehrlen ndash Page 1 Analyse der Struktur Funktion und Evolution der Astacin-Protein-Familie Frank Moumlhrlen Heidelberg 2002 Page 2 INAUGURAL-DISSERTATION zur heidiubuni-heidelbergde

  • FGENES
  • FGENESH
  • FGENESH++
    • Genome Research 15566-576 2005
    • ECgene Genome-based EST clustering and gene modeling for alternative splicing
    • Genome Research 14685-692 2004
    • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
    • Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes
    • Genome Research
      • Computing Center Academia Sinica Taipei 11529 Taiwan
        • FGENESH+
          • Genome Research 14685-692 2004
          • Automated Whole-Genome Multiple Alignment of Rat Mouse and Human
            • Computing Center Academia Sinica Taipei 11529 Taiwan
                • FGENESB
                  • Proteorhodopsin genes are distributed among divergent marine bacterial taxa
                  • Different SAR86 subgroups harbour divergent proteorhodopsins
                    • FGENESV
                      • Genome Organization of the SARS-CoV
                        • FGENES-M
                        • BESTORF
                        • PROTCOMP
                          • A proteomic study of the arabidopsis nuclear matrix
                          • Gene prediction in eukaryota
                            • BPROM
                            • SPLICEDB
                              • The evolving roles of alternative splicing
                              • SpliceDB database of canonical and non-canonical mammalian splice sites
                                • SCAN2
                                  • PromH promoters identification using orthologous genomic sequences
                                    • PDISORDER
                                    • SPL
                                    • NSITE
                                    • TSSP
                                    • PLANTPROM
                                      • Plant promoter prediction with confidence estimation
                                      • PlantProm a database of plant promoter sequences
                                        • PROMH
                                        • Other FGENESH
                                          • TAG Theoretical and Applied Genetics
                                            • MIPS analysis and annotation of proteins from whole genomes
                                              • Gene expression of a gene family in maize based on noncollinear haplotypes
                                              • TAG Theoretical and Applied Genetics
                                              • Genome annotation techniques new approaches and challenges
                                                • Various programs
                                                  • Characterizing the new transcription regulator protein p60TRP
                                                  • The Ensembl Core Software Libraries
                                                  • Lecture Notes in Computer Science
                                                  • Long-term Efficacy of Cholinesterase Inhibitors
Page 14: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 15: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 16: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 17: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 18: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 19: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 20: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 21: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 22: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 23: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 24: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 25: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 26: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 27: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 28: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 29: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 30: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 31: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 32: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 33: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 34: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 35: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 36: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 37: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 38: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 39: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 40: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 41: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 42: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 43: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 44: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 45: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 46: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 47: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 48: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 49: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 50: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 51: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 52: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 53: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 54: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 55: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 56: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 57: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 58: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 59: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 60: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 61: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 62: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 63: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 64: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 65: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 66: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 67: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 68: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 69: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 70: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 71: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 72: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 73: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 74: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 75: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 76: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 77: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 78: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)
Page 79: FGENES - Softberry · 2005. 12. 7. · (Salamov and Solovyev, 2000) on the Baylor College of Medicine Genefinder ... Current Proteomics, January 2004, vol. 1, no. 1, pp. 41-48(8)