Pfam

What labs and/or how many are using Pfam?

20 C2B2/MAGNet labs

50 Cancer-center and non-cancercenter labs

Who is the main “database authority” for Pfam?

Richard Friedman (friedman<at>cancercenter.columbia.edu),
Pavel Morozov (pm259@columbia.edu)

Michael Honig (mhonig@c2b2.columbia.edu) - relational version
What kind of database is it? (flat-file, relational, XML, etc)

Flat-file and relational (2 versions)
Size of database?

Statistical models of 9,000 protein families
Backup procedures? How often?

200

Monthly

Web interface, command line (GCG package) and x-Windows (seqlab)

What is the primary purpose of the database & types of data stored?
- Pfam is a large collection of multiple sequence alignments and hidden Markov models covering many common protein domains and families.
- For each family in Pfam you can:
May I access the database and if so, what is the login info?
- http://cancercenter.columbia.edu; for account => Contact: Janie Weiss (janie<at>cancercenter.columbia.edu)
- adgate.cu-genome.org; for account => Contact: Hans-Erik Aronson (hga1<at>columbia.edu)