Return k-word frequencies for sequences in SequenceCollection.
Parameters: | k : int
overlapping : bool, optional
|
---|---|
Returns: | list
|
See also
Examples
>>> from skbio import SequenceCollection, DNA
>>> sequences = [DNA('A', id="seq1"),
... DNA('AT', id="seq2"),
... DNA('TTTT', id="seq3")]
>>> s1 = SequenceCollection(sequences)
>>> for freqs in s1.k_word_frequencies(1):
... print(freqs)
defaultdict(<type 'float'>, {'A': 1.0})
defaultdict(<type 'float'>, {'A': 0.5, 'T': 0.5})
defaultdict(<type 'float'>, {'T': 1.0})
>>> for freqs in s1.k_word_frequencies(2):
... print(freqs)
defaultdict(<type 'float'>, {})
defaultdict(<type 'float'>, {'AT': 1.0})
defaultdict(<type 'float'>, {'TT': 1.0})