在python中计算一组相同字符

我有一个这样的输入文件：

CCCCCCCCCCCCCBBBCCBBBBBBBBBBBCCCCCCCCCCCCCCCCBCCC

我想数一下每组有多少个"B"。因此输出将是：

B: 3, 11, 1

我尝试了两个不同的脚本，但都给出了B=15的总数。

这是我的一个尝试：

import collections
with open('input.txt') as infile:  
counts = collections.Counter(B.strip() for B in infile)  
for line, count in counts.most_common():  
print line, count

这是itertools.groupby的一个很好的应用，它将把类似值的输入分组到一个子分类器中。

>>> import itertools
>>> text="CCCCCCCCCCCCCBBBCCBBBBBBBBBBBCCCCCCCCCCCCCCCCBCCC"
>>> b_counts = []
>>> for letter, repeats in itertools.groupby(text):
...     if letter == "B":
...             b_counts.append(len(list(repeats)))
... 
>>> b_counts
[3, 11, 1]

试试这个：

with open('input.txt') as infile:  
counts = [i.count('B') for i in infile]
>>>print(counts)

[3, 11, 1]

看起来很简单。

def countBGroups(S):
groups = []
c = 0
for s in S:
if s == "B":
c += 1
else:
if c != 0:
groups.append(c)

c = 0

if c != 0:
groups.append(c)

return groups
with open("input.txt") as f:
print(countBGroups(f.read()))

相关内容

最新更新

热门标签：