Comments on Bioinformatics: Loopless programming (for calculating methylation types)

2010-02-01T23:41:31.970-08:00

This comment has been removed by a blog administrator.

The main benefit is that my function generalizes t...

2010-01-25T05:02:40.712-08:00

The main benefit is that my function generalizes to ndarray i.e. the input could represent an alignment i.e. be a rank-2 numpy array. The other benefit is that it works on numpy arrays as the "find" method is broken in char arrays.

In [1]: "AGCG".find('GC')
Out[1]: 1
In [2]: arr = np.array('AGCG', dtype='c').view(np.char.chararray)
In [3]: arr.find('GC')
Out[3]: array([-1, -1, -1, -1])

char array does not do the right thing because it takes each element from arr e.g. 'A' and calls it's method e.g. 'find' withe the given argument e.g. 'GC', which fails "G".find("GC") returns -1

ah, i see. is there a benefit over the pure pytho...

2010-01-24T13:13:27.146-08:00

ah, i see. is there a benefit over the
pure python version
?
i guess yours could more easily handle multiple ktup's with numpy broadcasting.

It allows to find occurrences of words in a sequen...

2010-01-24T12:20:04.008-08:00

It allows to find occurrences of words in a sequence e.g. all start codons in a sequence of letters
>>> sequence = np.array('ATGCGCGTAGCTATGAGAGCATCGAT', dtype='c')
>>> find(sequence, 'ATG')
(array([ 0, 12]),)

The first ATG starts at index 0 the second at index 12. The output from "find" can be used to index an array.

>>> sequence[find(sequence, 'ATG')]
array(['A', 'A'],
dtype='|S1')

@diffusing thoughts . cool. could you show an exam...

2010-01-24T11:04:08.808-08:00

@diffusing thoughts . cool. could you show an example usage? i'm not quite sure i follow.

I like your approach. In a similar spirit this lit...

2010-01-23T19:46:53.280-08:00

I like your approach. In a similar spirit this little function finds the first index of every k-tuple in a numpy array. http://python.pastebin.com/f46f7dae7