Publications of MA. Andrade

Iliopoulos I, Tsoka S, Andrade MA, Enright AJ, Carroll M, Poullet P, Promponas V, Liakopoulos T, Palaios G, Pasquier C, Hamodrakas S, Tamames J, Yagnik AT, Tramontano A, Devos DP, Blaschke C, Valencia A, Brett D, Martin D, Leroy C, Rigoutsos I, Sander C, Ouzounis CA

Bioinformatics.

2003 Apr 12; 19(6): 717-26. PubMed: 12691983.

Abstract + PDF

30.

The way we write.

Netzel R, Perez-Iratxeta C, Bork P, Andrade MA

EMBO Rep.

2003 May; 4(5): 446-51. PubMed: 12728240.

PDF

29.

A protocol for the update of references to scientific literature in biological databases.

Perez-Iratxeta C, Astola N, Ciccarelli FD, Shah PK, Bork P, Andrade MA

Appl Bioinformatics.

2003; 2(3): 189-91. PubMed: 15130808.

Abstract + PDF

28.

Update on XplorMed: A web server for exploring scientific literature.

Perez-Iratxeta C, Pérez AJ, Bork P, Andrade MA

Nucleic Acids Res.

2003 Jul 1; 31(13): 3866-8. PubMed: 12824439.

Abstract + PDF

27.

Information extraction from full text scientific articles: where are the keywords?

Shah PK, Perez-Iratxeta C, Bork P, Andrade MA

BMC Bioinformatics.

2003 May 29; 4: 20. Epub 2003 May 29; PubMed: 12775220.

Abstract + PDF

2002 6 publication(s).

26.

NEAT: a domain duplicated in genes near the components of a putative Fe3+ siderophore transporter from Gram-positive pathogenic bacteria.

Andrade MA, Ciccarelli FD, Perez-Iratxeta C, Bork P

Genome Biol.

2002 Aug 15; 3(9): RESEARCH0047. Epub 2002 Aug 15; PubMed: 12225586.

Abstract + PDF

25.

A self-organizing model for the development of orientation selectivity and ocular dominance patterns in the visual nervous system.

Muro EM, Andrade MA, Morán F

Neural Network World.

2002; 12: 319-332. PID: 336.

24.

Worldwide scientific publishing activity.

Perez-Iratxeta C, Andrade MA

Science.

2002 Jul 26; 297(5581): 519. PubMed: 12143877.

PDF

23.

Association of genes to genetically inherited diseases using data mining.

Perez-Iratxeta C, Bork P, Andrade MA

Nat Genet.

2002 Jul; 31(3): 316-9. Epub 2002 May 13; PubMed: 12006977.

Abstract + PDF

22.

Exploring MEDLINE abstracts with XplorMed.

Perez-Iratxeta C, Bork P, Andrade MA

Drugs Today (Barc).

2002 Jun; 38(6): 381-9. PubMed: 12532176.

Abstract

21.

Computing fuzzy associations for the analysis of biological literature.

Perez-Iratxeta C, Keer HS, Bork P, Andrade MA

Biotechniques.

2002 Jun; 32(6): 1380-2, 1384-5. PubMed: 12074170.

Abstract + PDF

2001 6 publication(s).

20.

A combination of the F-box motif and kelch repeats defines a large Arabidopsis family of F-box proteins.

Andrade MA, González-Guzmán M, Serrano R, Rodríguez PL

Plant Mol Biol.

2001 Jul; 46(5): 603-14. PubMed: 11516153.

Abstract

19.

Simulation of plasticity in the adult visual cortex.

Andrade MA, Muro EM, Morán F

Biol Cybern.

2001 Jun; 84(6): 445-51. PubMed: 11417056.

Abstract

18.

Protein repeats: structures, functions, and evolution.

Andrade MA, Perez-Iratxeta C, Ponting CP

J Struct Biol.

2001; 134(2-3): 117-31. PubMed: 11551174.

Abstract + PDF

17.

Comparison of ARM and HEAT protein repeats.

Andrade MA, Petosa C, O'Donoghue SI, Müller CW, Bork P

J Mol Biol.

2001 May 25; 309(1): 1-18. PubMed: 11491282.

Abstract + PDF

16.

Genome sequences and great expectations.

Iliopoulos I, Tsoka S, Andrade MA, Janssen P, Audit B, Tramontano A, Valencia A, Leroy C, Sander C, Ouzounis CA

Genome Biol.

2001; 2(1): INTERACTIONS0001. PubMed: 11178275.

Abstract + PDF

15.

XplorMed: a tool for exploring MEDLINE abstracts.

Perez-Iratxeta C, Bork P, Andrade MA

Trends Biochem Sci.

2001 Sep; 26(9): 573-5. PubMed: 11551795.

Abstract + PDF

2000 6 publication(s).

14.

Automated extraction of information in molecular biology.

Andrade MA, Bork P

FEBS Lett.

2000 Jun 30; 476(1-2): 12-7. PubMed: 10878241.

Abstract + PDF

13.

Homology-based method for identification of protein repeats using statistical significance estimates.

Andrade MA, Ponting CP, Gibson TJ, Bork P

J Mol Biol.

2000 May 5; 298(3): 521-37. PubMed: 10772867.

Abstract + PDF

12.

Re-annotating the Mycoplasma pneumoniae genome sequence: adding value, function and reading frames.

Dandekar T, Huynen MA, Regula JT, Ueberle B, Zimmermann CU, Andrade MA, Doerks T, Sánchez-Pulido L, Snel B, Suyama M, Yuan YP, Herrmann R, Bork P

Nucleic Acids Res.

2000 Sep 1; 28(17): 3278-88. PubMed: 10954595.

Abstract + PDF

11.

The GeneQuiz web server: protein functional analysis through the Web.

Hoersch S, Leroy C, Brown NP, Andrade MA, Sander C

Trends Biochem Sci.

2000 Jan; 25(1): 33-5. PubMed: 10637611.

10.

Evolution of domain families.

Ponting CP, Schultz J, Copley RR, Andrade MA, Bork P

Adv Protein Chem.

2000; 54: 185-244. PubMed: 10829229.

PDF

9.

NAIL-Network Analysis Interface for Linking HMMER results.

Sánchez-Pulido L, Yuan YP, Andrade MA, Bork P

Bioinformatics.

2000 Jul; 16(7): 656-7. PubMed: 11038338.

Abstract + PDF

1999 4 publication(s).

8.

Position-specific annotation of protein function based on multiple homologs.

Andrade MA

Proc Int Conf Intell Syst Mol Biol.

1999; 7: 28-33. PubMed: 10786283.

Abstract

7.

Automated genome sequence analysis and annotation.

Andrade MA, Brown NP, Leroy C, Hoersch S, de Daruvar A, Reich C, Franchini A, Tamames J, Valencia A, Ouzounis C, Sander C

Bioinformatics.

1999 May; 15(5): 391-412. PubMed: 10366660.

Abstract + PDF

MOTIVATION: Large-scale genome projects generate a rapidly increasing number of sequences, most of them biochemically uncharacterized. Research in bioinformatics contributes to the development of methods for the computational characterization of these sequences. However, the installation and application of these methods require experience and are time consuming. RESULTS: We present here an automatic system for preliminary functional annotation of protein sequences that has been applied to the analysis of sets of sequences from complete genomes, both to refine overall performance and to make new discoveries comparable to those made by human experts. The GeneQuiz system includes a Web-based browser that allows examination of the evidence leading to an automatic annotation and offers additional information, views of the results, and links to biological databases that complement the automatic analysis. System structure and operating principles concerning the use of multiple sequence databases, underlying sequence analysis tools, lexical analyses of database annotations and decision criteria for functional assignments are detailed. The system makes automatic quality assessments of results based on prior experience with the underlying sequence analysis tools; overall error rates in functional assignment are estimated at 2.5-5% for cases annotated with highest reliability ('clear' cases). Sources of over-interpretation of results are discussed with proposals for improvement. A conservative definition for reporting 'new findings' that takes account of database maturity is presented along with examples of possible kinds of discoveries (new function, family and superfamily) made by the system. System performance in relation to sequence database coverage, database dynamics and database search methods is analysed, demonstrating the inherent advantages of an integrated automatic approach using multiple databases and search methods applied in an objective and repeatable manner. AVAILABILITY: The GeneQuiz system is publicly available for analysis of protein sequences through a Web server at http://www.sander.ebi.ac. uk/gqsrv/submit

PDF

6.

Functional classes in the three domains of life.

Andrade MA, Ouzounis C, Sander C, Tamames J, Valencia A

J Mol Evol.

1999 Nov; 49(5): 551-7. PubMed: 10552036.

Abstract + PDF

5.

Automatic extraction of biological information from scientific text: protein-protein interactions.

Blaschke C, Andrade MA, Ouzounis C, Valencia A

Proc Int Conf Intell Syst Mol Biol.

1999; : 60-7. PubMed: 10786287.

Abstract

1998 2 publication(s).

4.

Automatic extraction of keywords from scientific text: application to the knowledge domain of protein families.

Andrade MA, Valencia A

Bioinformatics.

1998; 14(7): 600-7. PubMed: 9730925.

Abstract

MOTIVATION: Annotation of the biological function of different protein sequences is a time-consuming process currently performed by human experts. Genome analysis tools encounter great difficulty in performing this task. Database curators, developers of genome analysis tools and biologists in general could benefit from access to tools able to suggest functional annotations and facilitate access to functional information. APPROACH: We present here the first prototype of a system for the automatic annotation of protein function. The system is triggered by collections of s related to a given protein, and it is able to extract biological information directly from scientific literature, i.e. MEDLINE abstracts. Relevant keywords are selected by their relative accumulation in comparison with a domain-specific background distribution. Simultaneously, the most representative sentences and MEDLINE abstracts are selected and presented to the end-user. Evolutionary information is considered as a predominant characteristic in the domain of protein function. Our system consequently extracts domain-specific information from the analysis of a set of protein families. RESULTS: The system has been tested with different protein families, of which three examples are discussed in detail here: 'ataxia-telangiectasia associated protein', 'ran GTPase' and 'carbonic anhydrase'. We found generally good correlation between the amount of information provided to the system and the quality of the annotations. Finally, the current limitations and future developments of the system are discussed. AVAILABILITY: The current system can be considered as a prototype system. As such, it can be accessed as a server at http://columba.ebi.ac. uk:8765/andrade/abx. The system accepts text related to the protein or proteins to be evaluated (optimally, the result of a MEDLINE search by keyword) and the results are returned in the form of Web pages for keywords, sentences and s. SUPPLEMENTARY INFORMATION: Web pages containing full information on the examples mentioned in the text are available at: http://www.cnb.uam.es/ approximately cnbprot/keywords/ CONTACT: valencia@cnb.uam.es

3.

Computational space reduction and parallelization of a new clustering approach for large groups of sequences.

Trelles O, Andrade MA, Valencia A, Zapata EL, Carazo JM

Bioinformatics.

1998 Jun; 14(5): 439-51. PubMed: 9682057.

Abstract

1995 2 publication(s).

2.

HEAT repeats in the Huntington's disease protein.

Andrade MA, Bork P

Nat Genet.

1995 Oct; 11(2): 115-6. PubMed: 7550332.