CS444: BIOINFORMATICS (Assignment 1)
Q1: What is the complement to the DNA sequence given below?
5’-ACCAAACAAAGTTGGGTAAGGATAGATCAATCAATGATCATATTCTAGTACACTTAGGATTCAAGATCCT
ATTATCAGGGACAAGAGCAGGATTAGGGATATCCGAGATGGCCACACTTTTGAGGAGCTTAGCATTGTTC
AAAAGAAACAAGGACAAACCACCCATTACATCAGGATCCGGTGGAGCCATCAGAGGAATCAAACACATTA
TTATAGTACCAATTCCTGGAGATTCCTCAATTACCACTCGATCCAGACTACTGGACCGGTTGGTCAGGTT
AATTGGAAACCCGGATGTGAGCGGGCCCAAACTAACAGGGGCACTAATAGGTATATTATCCTTATTTGTG
GAGTCTCCAGGTCAATTGATTCAGAGGATCACCGATGACCCTGACGTTAGCATCAGGCTGTTAGAGGTTG
TTCAGAGTGACCAGTCACAATCTGGCCTTACCTTCGCATCAAGAGGTACCAACATGGAGGATGAGGCGGA
CCAATACTTTTCACATGATGATCCAAGCAGTAGTGATCAATCCAGGTCCGGATGGTTCGAGAACAAGGAA
ATCTCAGATATTGAAGTGCAAGACCCTGAGGGATTCAACATGATTCTGGGTACCATTCTAGCCCAGATCT
GGGTCTTGCTCGCAAAGGCGGTTACGGCCCCAGACACGGCAGCTGATTCGGAGCTAAGAAGGTGGATAAA
GTACACCCAACAAAGAAGGGTAGTTGGTGAATTTAGATTGGAGAGAAAATGGTTGGATGTGGTGAGGAAC
AGGATTGCCGAGGACCTCTCTTTACGCCGATTCATGGTGGCTCTAATCCTGGATATCAAGAGGACACCCG
GGAACAAACCTAGGATTGCTGAAATGATATGTGACATTGATACATATATCGTAGAGGCAGGATTAGCCAG
TTTTATCCTGACTATTAAGTTTGGGATAGAAACTATGTATCCTGCTCTTGGACTGCATGAATTTGCTGGT
GAGTTATCCACACTTGAGTCCTTGATGAATCTTTACCAGCAAATGGGAGAAACTGCACCCTACATGGTAA-3’
Q2: What is the mRNA sequence of the given DNA sequence in Q1?
Q3: What is the protein sequence formed from the mRNA sequence of Q2?
Q4: What will be the mRNA encoded sequence if all the “AT”s are mutated into “TA”s in Q1 DNA sequence?
Q5: What will be the protein sequence of the new mRNA sequence formed after Q4?
Q1: Complement sequence:
3'-TGGTTTGTTTCAACCCATTCCTATCTAGTTAGTTACTAGTATAAGATCATGTGAATCCTAAGTTCTAGGATAATAGTCCCTGTTCTCGTCCTAATCCCTATAGGCTCTACCGGTGTGAAAACTCCTCGAATCGTAACAAGTTTTCTTTGTTCCTGTTTGGTGGGTAATGTAGTCCTAGGCCACCTCGGTAGTCTCCTTAGTTTGTGTAATAATATCATGGTTAAGGACCTCTAAGGAGTTAATGGTGAGCTAGGTCTGATGACCTGGCCAACCAGTCCAATTAACCTTTGGGCCTACACTCGCCCGGGTTTGATTGTCCCCGTGATTATCCATATAATAGGAATAAACACCTCAGAGGTCCAGTTAACTAAGTCTCCTAGTGGCTACTGGGACTGCAATCGTAGTCCGACAATCTCCAACAAGTCTCACTGGTCAGTGTTAGACCGGAATGGAAGCGTAGTTCTCCATGGTTGTACCTCCTACTCCGCCTGGTTATGAAAAGTGTACTACTAGGTTCGTCATCACTAGTTAGGTCCAGGCCTACCAAGCTCTTGTTCCTTTAGAGTCTATAACTTCACGTTCTGGGACTCCCTAAGTTGTACTAAGACCCATGGTAAGATCGGGTCTAGACCCAGAACGAGCGTTTCCGCCAATGCCGGGGTCTGTGCCGTCGACTAAGCCTCGATTCTTCCACCTATTTCATGTGGGTTGTTTCTTCCCATCAACCACTTAAATCTAACCTCTCTTTTACCAACCTACACCACTCCTTGTCCTAACGGCTCCTGGAGAGAAATGCGGCTAAGTACCACCGAGATTAGGACCTATAGTTCTCCTGTGGGCCCTTGTTTGGATCCTAACGACTTTACTATACACTGTAACTATGTATATAGCATCTCCGTCCTAATCGGTCAAAATAGGACTGATAATTCAAACCCTATCTTTGATACATAGGACGAGAACCTGACGTACTTAAACGACCACTCAATAGGTGTGAACTCAGGAACTACTTAGAAATGGTCGTTTACCCTCTTTGACGTGGGATGTACCATT-5'
Q2: mRNA sequence:
5'-ACCAAACAAAGUUGGGUAAGGAUAGAUCAAUCAAUGAUCAUAUUCUAGUACACUUAGGAUUCAAGAUCCUAUUAUCAGGGACAAGAGCAGGAUUAGGGAUAUCCGAGAUGGCCACACUUUUGAGGAGCUUAGCAUUGUUCAAAAGAAACAAGGACAAACCACCCAUUACAUCAGGAUCCGGUGGAGCCAUCAGAGGAAUCAAACACAUUAUUAUAGUACCAAUUCCUGGAGAUUCCUCAAUUACCACUCGAUCCAGACUACUGGACCGGUUGGUCAGGUUAAUUGGAAACCCGGAUGUGAGCGGGCCCAAACUAACAGGGGCACUAAUAGGUAUAUUAUCCUUAUUUGUGGAGUCUCCAGGUCAAUUGAUUCAGAGGAUCACCGAUGACCCUGACGUUAGCAUCAGGCUGUUAGAGGUUGUUCAGAGUGACCAGUCACAAUCUGGCCUUACCUUCGCAUCAAGAGGUACCAACAUGGAGGAUGAGGCGGACCAAUACUUUUCACAUGAUGAUCCAAGCAGUAGUGAUCAAUCCAGGUCCGGAUGGUUCGAGAACAAGGAAAUCUCAGAUAUUGAAGUGCAAGACCCUGAGGGAUUCAACAUGAUUCUGGGUACCAUUCUAGCCCAGAUCUGGGUCUUGCUCGCAAAGGCGGUUACGGCCCCAGACACGGCAGCUGAUUCGGAGCUAAGAAGGUGGAUAAAGUACACCCAACAAAGAAGGGUAGUUGGUGAAUUUAGAUUGGAGAGAAAAUGGUUGGAUGUGGUGAGGAACAGGAUUGCCGAGGACCUCUCUUUACGCCGAUUCAUGGUGGCUCUAAUCCUGGAUAUCAAGAGGACACCCGGGAACAAACCUAGGAUUGCUGAAAUGAUAUGUGACAUUGAUACAUAUAUCGUAGAGGCAGGAUUAGCCAGUUUUAUCCUGACUAUUAAGUUUGGGAUAGAAACUAUGUAUCCUGCUCUUGGACUGCAUGAAUUUGCUGGUGAGUUAUCCACACUUGAGUCCUUGAUGAAUCUUUACCAGCAAAUGGGAGAAACUGCACCCUACAUGGUAA-3'
Q3: Protein sequence:
MATLLRSLALFKRNKDKPPITSGSGGAIRGIKHIIIVPIPGDSSITTRSRLLDRLVRLIGNPDVSGPKLTGALIGILSLFVESPGQLIQRITDDPDVSIRLLEVVQSDQSQSGLTFASRGTNMEDEADQYFSHDDPSSSDQSRSGWFENKEISDIEVQDPEGFNMILGTILAQIWVLLAKAVTAPDTAADSELRRWIKYTQQRRVVGEFRLERKWLDVVRNRIAEDLSLRRFMVALILDIKRTPGNKPRIAEMICDIDTYIVEAGLASFILTIKFGIETMYPALGLHEFAGELSTLESLMNLYQQMGETAPYMV
Q4: Mutated mRNA:
5'-UGGUUUGUUUCAACCCUAUCCUUACUAGUUAGUUACUAGUUAAAGUACUAGUGAUACCUAAGUUCUAGGUAAUAAGUCCCUGUUCUCGUCCUAUACCCUUAAGGCUCUACCGGUGUGAAAACUCCUCGAUACGUAACAAGUUUUCUUUGUUCCUGUUUGGUGGGUAUAGUAGUCCUAGGCCACCUCGGUAGUCUCCUUAGUUUGUGUAUAAUAUACUAGGUUAAGGACCUCUAAGGAGUUAUAGGUGAGCUAGGUCUGUAGACCUGGCCAACCAGUCCAUAUAACCUUUGGGCCUACACUCGCCCGGGUUUGUAUGUCCCCGUGUAUUACCUAUAAUAAGGAUAAAACACCUCAGAGGUCCAGUUAACUAAGUCUCCUAGUGGCUACUGGGACUGCAUACGUAGUCCGACAUACUCCAACAAGUCUCACUGGUCAGUGUUAGACCGGAUAGGAAGCGUAGUUCUCCUAGGUUGUACCUCCUACUCCGCCUGGUUUAGAAAAGUGUACUACUAGGUUCGUCUACACUAGUUAGGUCCAGGCCUACCAAGCUCUUGUUCCUUUAGAGUCUUAAACUUCACGUUCUGGGACUCCCUAAGUUGUACUAAGACCCUAGGUAAGUACGGGUCUAGACCCAGAACGAGCGUUUCCGCCAUAGCCGGGGUCUGUGCCGUCGACUAAGCCUCGUAUCUUCCACCUUAUUCUAGUGGGUUGUUUCUUCCCUACAACCACUUAAUACUAACCUCUCUUUUACCAACCUACACCACUCCUUGUCCUAACGGCUCCUGGAGAGAAUAGCGGCUAAGUACCACCGAGUAUAGGACCUUAAGUUCUCCUGUGGGCCCUUGUUUGGUACCUAACGACUUUACUUAACACUGUAACUUAGUUAUAAGCUACUCCGUCCUAUACGGUCAAAUAAGGACUGUAAUAUCAAACCCUUACUUUGUAACUAAGGACGAGAACCUGACGUACUUAAACGACCACUCAUAAGGUGUGAACUCAGGAACUACUUAGAAUAGGUCGUUUACCCUCUUUGACGUGGGUAGUACCUAU-3'
Q5: Mutated protein sequence:
MSPCITYNKDKTPQRSS
Comments
Leave a comment