基于公共元组元素组合元组列表

考虑两个元组列表：

data1 = [([X1], 'a'), ([X2], 'b'), ([X3], 'c')]
data2 = [([Y1], 'a'), ([Y2], 'b'), ([Y3], 'c')]

哪里len(data1) == len(data2)

每个元组包含两个元素：

一些字符串的列表（即[X1]）
data1和data2的常见元素：字符串'a'、'b'等。

我想将它们合并为以下内容：

[('a', [X1], [Y1]), ('b', [X2], [Y2]),...]

有谁知道我该怎么做？

您可以使用zip函数和列表推导：

[(s1,l1,l2) for (l1,s1),(l2,s2) in zip(data1,data2)]

如果data列表中所有元素的顺序相同，则@Kasramvd的解决方案是好的。如果不是，则不会考虑这一点。

一个

解决方案，利用一个defaultdict：

from collections import defaultdict
d = defaultdict(list)  # values are initialized to empty list
data1 = [("s1", 'a'), ("s2", 'c'), ("s3", 'b')]
data2 = [("s1", 'c'), ("s2", 'b'), ("s3", 'a')]
for value, common in data1 + data2:
    d[common].append(value)

为了获得它的列表，只需将其包装在list()调用中：

res = list(d.items())
print(res)
# Prints: [('b', ['s3', 's2']), ('a', ['s1', 's3']), ('c', ['s2', 's1'])]

我们可以在单个理解表达式中做到这一点，使用 reduce 函数

from functools import reduce
from operator import add
[tuple([x]+reduce(add,([y[0]] for y in data1+data2 if y[1]==x))) for x in set(y[1] for y in data1+data2)]

如果列表很大，以至于data1+data2会造成严重的时间或内存损失，则最好预先计算它

combdata = data1+data2
[tuple([x]+reduce(add,[y[0]] for y in combdata if y[1]==x))) for x in set(y[1] for y in combdata)]

此解决方案不依赖于两个列表中出现的所有"键"，也不依赖于顺序相同。

如果退货订单很重要，我们甚至可以这样做

sorted([tuple([x]+reduce(add,([y[0]] for y in data1+data2 if y[1]==x))) for x in set(y[1] for y in data1+data2)],key = lambda x,y=[x[0] for x in data1+data2]: y.index(x[1]))

以确保顺序与

原始列表中的顺序相同。同样，预计算data1+data2给出了

sorted([tuple([x]+reduce(add,([y[0]] for y in combdata if y[1]==x))) for x in set(y[1] for y in combdata)],key = lambda x,y=[x[0] for x in combdata]: y.index(x[1]))

相关内容

最新更新

热门标签：