The absolute nature of evolutionary events that had led to creation of ancient (>450) paralogy regions in the vertebrate genome is extremely difficult to track through inter-genomic and intra-genomic map comparison approaches because such ancient events experienced multiple chromosomal breakages and rearrangement events that led to the alteration of karyotype and disruption of gene order on chromosomes. A more convincing way to determine the mechanism of origin of vertebrate ancient paralogons is phylogenetic analysis of multigene families (Abbasi, 2010; Abbasi and Grzeschik, 2007; Asrar et al., 2013; Hughes, 1998; Hughes et al., 2001a).
In order to perform a thorough phylogenetic analysis of genes from 80 gene families with threefold or fourfold representations on human FGFR bearing chromosomes, the chromosomal locations were obtained from Ensembl genome browser (Hubbard et al., 2002). Robust scanning identified that 17 of these families have members on each of the human FGFR bearing chromosomes while 63 have their members on at least three of those chromosomes. BLASTP (Altschul et al., 1990) of the Ensembl genome browser, using bidirectional best hit strategy, was employed to attain closest putative orthologous sequences of the human proteins in other species (Hubbard et al., 2002). For those organisms for which sequences were not available at Ensembl (Paul et al., 2014), the BLASTP (Altschul et al., 1990) search was carried out against the protein database available at the National Center for Biotechnology Information (Johnson et al., 2008) and the Joint Genome Institute (Nordberg et al., 2014).
Amino acid sequences were aligned by using CLUSTAL W (Thompson et al., 1994) under default parameters. Phylogenetic trees were constructed by using Neighbor-Joining (NJ) method (Russo et al., 1996; Saitou and Nei, 1987) with p-distance as amino-acid substitution model, implemented in MEGA version 5 (Kumar et al., 2008; Tamura et al., 2011). Complete deletion option was selected to eliminate any site which can introduce a gap in the sequences. Tree topologies were supported by bootstrap (Felsenstein, 1985) with 1000 replicates. To systematically check and validate trees with different reconstruction method; Maximum Likelihood with Whelan and Goldman (WAG) model (Whelan and Goldman, 2001), using MEGA 5 program (Tamura et al., 2011) was implemented. Results from both the methods are presented in this database.
Figure 1(a): Neighbor Joining (N.J) Tree of FGFR Family Figure 1(b): Maximum Likelihood (M.L) Tree of FGFR Family
Useful references: