Python里使用[]创建一个列表。容器类型的数据进行运算和操作，生成新的列表最搞笑的办法——列表生成式。

列表生成式，优雅、简洁，下面盘点在工作中主要使用案例和场景。

案例一：数据在运算

实现对每个元素的乘方操作后，利用列表生成式返回一个新的列表。

>>> a = range(0, 11)
# 利用列表生成式创建列表
>>> b = [ x**2 for x in a ]
>>> b
[0, 1, 4, 9, 16, 25, 36, 49, 64, 81, 100]

将数值型的元素列表，转换为字符串类型的列表。

>>> a = range(0, 10)
>>> b = [ str(i) for i in a ]
>>> b
['0', '1', '2', '3', '4', '5', '6', '7', '8', '9']

案例二：一串随机数

生成10个0到1的随机浮点数，保留小数点后两位

>>> from random import random
>>> a = [ round(random(), 2) for _ in range(10) ]
>>> a
[0.84, 0.76, 0.42, 0.26, 0.51, 0.4, 0.78, 0.3, 0.48, 0.58]

生成10个0到10的满足均匀分布的浮点数，保留小数点后两位

>>> from random import uniform
>>> a = [ round(uniform(0, 10), 2) for _ in range(10) ]
>>> a
[4.43, 5.12, 1.03, 4.67, 0.45, 6.39, 6.71, 8.41, 0.12, 8.13]

案例三：if和嵌套for

对一个列表里面的数据筛选，只计算[0, 11)中偶数的平方

>>> a = range(11)
>>> c = [ x**2 for x in a if x%2==0 ]
>>> c
[0, 4, 16, 36, 64, 100]

列表生成式中嵌套for，一行代码生成99乘法表的45个元素

>>> a = [ i*j for i in range(10) for j in range(1, i+1) ]
>>> a
[1, 2, 4, 3, 6, 9, 4, 8, 12, 16, 5, 10, 15, 20, 25, 6, 12, 18, 24, 30, 36, 7, 14, 21, 28, 35, 42, 49, 8, 16, 24, 32, 40, 48, 56, 64, 9, 18, 27, 36, 45, 54, 63, 72, 81]
>>> len(a)
45

案例四：zip和列表

将两个列表组合成一个新的列表

>>> a = range(5)
>>> b = ['a', 'b', 'd', 'e']
>>> c = [str(y) + str(x) for x,y in zip(a,b)]
>>> c
['a0', 'b1', 'd2', 'e3']

案例五：打印键值对

>>> a = { 'a': 1, 'b': 2, 'c': 3 }
>>> b = [ k + '=' + v for k, v in a.items() ]
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "<stdin>", line 1, in <listcomp>
TypeError: can only concatenate str (not "int") to str
>>> b = [ k + '=' + str(v) for k, v in a.items() ]
>>> b
['a=1', 'b=2', 'c=3']

注意：Python属于强类型语言，注意数据的类型

案例六：文件列表

查询目录下所有文件

>>> import os
>>> a = [ d for d in os.listdir('.')]
>>> a
['scaffolds', 'db.json', 'source', 'node_modules', '_config.butterfly.yml', 'yarn.lock', 'public', '.gitignore', 'package-lock.json', 'package.json', '_config.yml', '.github', '_config.landscape.yml', '.deploy_git', 'themes']

只查找出文件夹（目录）

1
2
3

>>> dirs = [ d for d in os.listdir('.') if os.path.isdir(d) ]
>>> dirs
['scaffolds', 'source', 'node_modules', 'public', '.github', '.deploy_git', 'themes']

只查找出文件

1
2
3

>>> files = [ d for d in os.listdir('.') if os.path.isfile(d) ]
>>> files
['db.json', '_config.butterfly.yml', 'yarn.lock', '.gitignore', 'package-lock.json', 'package.json', '_config.yml', '_config.landscape.yml']

案例七：转为小写

1
2
3

>>> a = [ 'Hello', 'World', '2022python' ]
>>> [ w.lower() for w in a ]
['hello', 'world', '2022python']

注意：Python的列表中的元素是可以不同数据类型的，如果按以上方法写是会报错的，例如：

>>> a = [ 'Hello', 'World', 2022, 'Python' ]
>>> [ w.lower() for w in a ]
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "<stdin>", line 1, in <listcomp>
AttributeError: 'int' object has no attribute 'lower'

上面操作出现int对象没有方法lower的问题，因此需要将元素转化为str后再操作：

1 2	>>> [ str(w).lower() for w in a ] ['hello', 'world', '2022', 'python']

更友好的做法，使用 isinstance，判断元素是否为 str 类型，如果是，再调用 lower 做转化：

1 2	>>> [ w.lower() for w in a if isinstance(w, str) ] ['hello', 'world', 'python']

案例八：保留唯一值

>>> def filter_non_unique(lst):
...     return [ item for item in lst if lst.count(item) == 1 ]
...
>>> filter_non_unique([ 1, 2, 2, 3, 4, 4, 5])
[1, 3, 5]

案例九：筛选分组

>>> def bifurcate(lst, filter):
...     return [
...         [ x for i, x in enumerate(lst) if filter[i] == True],
...         [ x for i, x in enumerate(lst) if filter[i] == False]
...     ]
...
>>> bifurcate(['beep', 'boop', 'foo', 'bar'], [True, True, False, True])
[['beep', 'boop', 'bar'], ['foo']]

案例十：函数分组

>>> def bifurcate_by(lst, fn):
...     return [
...         [ x for x in lst if fn(x) ],
...         [ x for x in lst if not fn(x) ]
...     ]
...
>>> bifurcate_by(['Python3', 'up', 'users', 'people'], lambda x: x[0] == 'u')
[['up', 'users'], ['Python3', 'people']]

案例十一：差集

>>> def difference(a, b):
...     _a, _b = set(a), set(b)
...     return [ item for item in _a if item not in _b ]
...
>>> difference([1,1,2,3,3], [1,2,4])
[3]

案例十二：函数差集

列表a、b中元素经过fn映射后，返回在a不在b中的元素。

>>> def difference_by(a, b, fn):
...     _b = set(map(fn, b))
...     return [ item for item in a if fn(item) not in _b ]
...

列表元素为单个元素

1
2
3

>>> from math import floor
>>> difference_by([2.1, 1.2], [2.3, 3.4], floor)
[1.2]

列表元素为字典

1 2	>>> difference_by([{ 'x': 2}, { 'x': 1 }], [{ 'x': 1}], lambda v: v['x']) [{'x': 2}]

Jean's Blog

Python列表生成式高效使用的12个案例