Practical Protein Sequence Alignment With Algebraic Dynamic Programming
Lyle Kopnicky, PacSoft Research Group
Tim Sheard, Adviser
Bioinformatics
GTTAGCGTGAATCTGTACTGAG
• DNA, RNA and proteins are strings
• Strings contain information
• Some problems
    • Determine relatedness of strands of DNA
    • Figure out how RNA folds on itself
    • Identify proteins in a sample
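To make "relatedness of strands" concrete, here is a minimal sketch of one common way to score how similar two sequence strings are: a global alignment score computed with the Needleman-Wunsch recurrence. It is an illustration only, not the method presented in this talk. The function name `align_score` and the scoring values (match +1, mismatch -1, gap -1) are assumptions chosen for the example; real tools use substitution matrices and more elaborate gap penalties.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical scoring scheme, chosen only for illustration. */
enum { MATCH = 1, MISMATCH = -1, GAP = -1 };

static int max3(int a, int b, int c) {
    int m = a > b ? a : b;
    return m > c ? m : c;
}

/* Global alignment score of strings a and b (Needleman-Wunsch recurrence).
   Only two rows of the dynamic programming table are kept in memory.
   Aborts if memory cannot be allocated. */
int align_score(const char *a, const char *b) {
    assert(a != NULL && b != NULL);
    size_t n = strlen(a), m = strlen(b);
    int *prev = malloc((m + 1) * sizeof *prev);
    int *curr = malloc((m + 1) * sizeof *curr);
    if (prev == NULL || curr == NULL) {
        free(prev);
        free(curr);
        abort();
    }

    /* Row 0: aligning the empty prefix of a against j characters of b
       costs j gaps. */
    for (size_t j = 0; j <= m; j++) {
        prev[j] = (int)j * GAP;
    }

    for (size_t i = 1; i <= n; i++) {
        /* Column 0: i characters of a against the empty prefix of b. */
        curr[0] = (int)i * GAP;
        for (size_t j = 1; j <= m; j++) {
            int sub = (a[i - 1] == b[j - 1]) ? MATCH : MISMATCH;
            curr[j] = max3(prev[j - 1] + sub,  /* align a[i-1] with b[j-1] */
                           prev[j] + GAP,      /* gap in b */
                           curr[j - 1] + GAP); /* gap in a */
        }
        /* The row just filled becomes the previous row. */
        int *tmp = prev;
        prev = curr;
        curr = tmp;
    }

    int score = prev[m];
    free(prev);
    free(curr);
    return score;
}
```

Under this scoring, two identical strings score their own length, and each mismatch or gap lowers the score, so a higher score suggests more closely related strands.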
Tools for bioinformatics
• Written in a general-purpose programming language such as C
• Designed to solve a narrow range of problems
• When problem doesn’t fit tool:
    • Tweak data to fit tool: awkward, inefficient, may not fully solve problem
    • Write new tools: time consuming, error-prone, require maintenance
The disconnect

#ifndef SS
  strncpy(pgm_name, "gsw", MAX_FN);
#else
  strncpy(pgm_name, "ssw", MAX_FN);
#endif
  standard_pam("BL 50", ppst);
  ppst->nsq = naa;
  ppst->nsqx = naax;
  for (i=0; i<=ppst->nsqx; i++) {
    ppst->sq[i]=aa[i];      /* sq = aa */
    ppst->hsq[i]=haa[i];    /* hsq = haa */
    ppst->sqx[i]=aax[i];    /* sq = aax */
    ppst->hsqx[i]=haax[i];  /* hsq = haax */
  }
  ppst->sq[ppst->nsqx+1] = ppst->sqx[ppst->nsqx+1] = '