Identifying representative sequences of protein families using submodular optimization

Identifying representative sequences for groups of functionally similar proteins and enzymes poses significant computational …