用 Python 优化日常任务的 19 个妙招-51CTO.COM

1. 快速批量重命名文件

有时候我们需要一次性更改大量文件的名字，比如整理照片或文档时。Python可以轻松搞定这个任务。

import os

# 设置文件夹路径
folder_path = 'C:/Users/YourName/Documents/'

# 获取文件夹中所有文件名
files = os.listdir(folder_path)

# 为每个文件重命名
for index, file in enumerate(files):
    # 获取文件扩展名
    extension = os.path.splitext(file)[1]
    # 生成新文件名
    new_name = f"file_{index}{extension}"
    # 构建完整路径
    old_file_path = os.path.join(folder_path, file)
    new_file_path = os.path.join(folder_path, new_name)
    # 重命名文件
    os.rename(old_file_path, new_file_path)1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.

说明：这段代码会将指定文件夹中的所有文件按顺序重新命名为file_0.jpg, file_1.pdf等格式。

2. 自动下载网页上的图片

假设你在浏览一个网页，发现上面有很多漂亮的图片想要保存下来。手动一张张保存太麻烦了，不如用Python来自动下载。

import requests
from bs4 import BeautifulSoup
import os

url = 'https://example.com'
response = requests.get(url)
soup = BeautifulSoup(response.text, 'html.parser')
images = soup.find_all('img')

folder_path = 'C:/Users/YourName/Pictures/'

if not os.path.exists(folder_path):
    os.makedirs(folder_path)

for image in images:
    src = image.get('src')
    if src.startswith('http'):
        img_data = requests.get(src).content
        filename = os.path.join(folder_path, src.split('/')[-1])
        with open(filename, 'wb') as f:
            f.write(img_data)1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.

说明：这段脚本会访问指定URL，解析HTML文档找到所有图片链接，并将其下载到本地目录中。

3. 使用正则表达式提取信息

正则表达式是处理文本的强大工具。比如，你可以用它从邮件地址列表中提取所有的邮箱地址。

import re

text = '''
Hello,
My email is example@email.com and my friend's email is another@example.org.
Please contact us at your convenience.
'''

emails = re.findall(r'\b[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Z|a-z]{2,}\b', text)
print(emails)  # 输出: ['example@email.com', 'another@example.org']1.
2.
3.
4.
5.
6.
7.
8.
9.
10.

说明：这里使用了正则表达式匹配常见的电子邮件格式。

4. 自动生成Excel报告

工作中经常需要制作报表，如果能自动生成Excel表格，那该多好啊！

import pandas as pd

data = {
    'Name': ['Alice', 'Bob', 'Charlie'],
    'Age': [25, 30, 35],
    'City': ['New York', 'Los Angeles', 'Chicago']
}

df = pd.DataFrame(data)

# 创建Excel writer对象
writer = pd.ExcelWriter('report.xlsx', engine='xlsxwriter')

# 将DataFrame写入Excel
df.to_excel(writer, sheet_name='Sheet1', index=False)

# 保存Excel文件
writer.save()1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.

说明：这段代码创建了一个包含三列数据的DataFrame，并将其导出为名为report.xlsx的Excel文件。

5. 通过发送电子邮件提醒自己

有些重要的事情容易忘记？让Python帮你发送邮件提醒吧！

import smtplib
from email.mime.text import MIMEText
from email.mime.multipart import MIMEMultipart

sender_email = "your_email@example.com"
receiver_email = "receiver_email@example.com"
password = input("Type your password and press enter:")

message = MIMEMultipart("alternative")
message["Subject"] = "Python Email Reminder"
message["From"] = sender_email
message["To"] = receiver_email

# Create the plain-text and HTML version of your message
text = """\
Hi,
How are you?
Real Python has many great tutorials:
www.realpython.com"""
html = """\
<html>
  <body>
    <p>Hi,<br>
       How are you?<br>
       <a href="http://www.realpython.com">Real Python</a> 
       has many great tutorials.
    </p>
  </body>
</html>
"""

# Turn these into plain/html MIMEText objects
part1 = MIMEText(text, "plain")
part2 = MIMEText(html, "html")

# Add HTML/plain-text parts to MIMEMultipart message
# The email client will try to render the last part first
message.attach(part1)
message.attach(part2)

# Create secure connection with server and send email
context = ssl.create_default_context()
with smtplib.SMTP_SSL("smtp.gmail.com", 465, context=context) as server:
    server.login(sender_email, password)
    server.sendmail(
        sender_email, receiver_email, message.as_string()
    )1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.
22.
23.
24.
25.
26.
27.
28.
29.
30.
31.
32.
33.
34.
35.
36.
37.
38.
39.
40.
41.
42.
43.
44.
45.
46.
47.

说明：此脚本允许用户通过输入密码的方式安全地发送带有HTML格式内容的电子邮件。

6. 批量压缩文件

有时候我们需要将多个文件压缩成一个归档文件，以方便传输或存储。Python可以轻松完成这一任务。

import zipfile
import os

def zip_files(zip_filename, files):
    with zipfile.ZipFile(zip_filename, 'w', zipfile.ZIP_DEFLATED) as zipf:
        for file in files:
            zipf.write(file)

# 指定要压缩的文件列表
files_to_zip = ['file1.txt', 'file2.txt', 'file3.txt']

# 指定压缩后的文件名
zip_filename = 'archive.zip'

# 压缩文件
zip_files(zip_filename, files_to_zip)1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.

说明：这段代码将file1.txt, file2.txt, file3.txt这三个文件压缩成一个名为archive.zip的归档文件。

7. 图像处理与识别

图像处理是一个非常实用的功能，特别是在社交媒体和摄影中。我们可以使用Python的Pillow库来处理图像。

from PIL import Image

# 打开图像文件
image = Image.open('input.jpg')

# 调整图像大小
resized_image = image.resize((800, 600))

# 保存修改后的图像
resized_image.save('output.jpg')

# 旋转图像
rotated_image = image.rotate(90)
rotated_image.save('rotated_output.jpg')

# 添加水印
watermark = Image.open('watermark.png')
watermark = watermark.resize((100, 100))
position = (image.width - watermark.width, image.height - watermark.height)
image.paste(watermark, position, mask=watermark)
image.save('watermarked_output.jpg')1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.

说明：这段代码展示了如何调整图像大小、旋转图像以及添加水印。

8. 数据清洗与预处理

在进行数据分析之前，通常需要对数据进行清洗和预处理。Python的pandas库提供了强大的数据处理功能。

import pandas as pd

# 读取CSV文件
data = pd.read_csv('data.csv')

# 查看前几行数据
print(data.head())

# 删除缺失值
data.dropna(inplace=True)

# 删除重复行
data.drop_duplicates(inplace=True)

# 修改列名
data.rename(columns={'old_column_name': 'new_column_name'}, inplace=True)

# 保存处理后的数据
data.to_csv('cleaned_data.csv', index=False)1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.

说明：这段代码展示了如何读取CSV文件、删除缺失值、删除重复行以及修改列名。

9. 网页爬虫

从网页上抓取数据是一项常见任务。Python的requests和BeautifulSoup库可以帮助我们轻松实现这一目标。

import requests
from bs4 import BeautifulSoup

url = 'https://example.com'
response = requests.get(url)

# 解析HTML
soup = BeautifulSoup(response.text, 'html.parser')

# 提取标题
title = soup.title.string
print(f'Title: {title}')

# 提取所有段落
paragraphs = soup.find_all('p')
for p in paragraphs:
    print(p.get_text())

# 提取所有链接
links = soup.find_all('a')
for link in links:
    print(link.get('href'))1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.
22.

说明：这段代码展示了如何获取网页标题、提取所有段落内容以及获取所有链接。

10. 文本处理与分析

文本处理在自然语言处理中非常重要。Python的nltk库提供了丰富的文本处理功能。

import nltk
from nltk.tokenize import word_tokenize, sent_tokenize
from nltk.corpus import stopwords

# 下载停用词
nltk.download('punkt')
nltk.download('stopwords')

text = """
Python is a high-level programming language designed to be easy to read and simple to implement. It is widely used for web development as well as data analysis.
"""

# 分词
words = word_tokenize(text)
print(words)

# 句子分割
sentences = sent_tokenize(text)
print(sentences)

# 移除停用词
stop_words = set(stopwords.words('english'))
filtered_words = [word for word in words if word.lower() not in stop_words]
print(filtered_words)

# 词频统计
from collections import Counter
word_counts = Counter(filtered_words)
print(word_counts.most_common())1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.
22.
23.
24.
25.
26.
27.
28.
29.

说明：这段代码展示了如何分词、句子分割、移除停用词以及统计词频。

11. Excel数据处理

Excel文件在日常工作中非常常见。Python的pandas库可以轻松处理Excel文件。

import pandas as pd

# 读取Excel文件
data = pd.read_excel('data.xlsx')

# 查看前几行数据
print(data.head())

# 删除缺失值
data.dropna(inplace=True)

# 计算总和
total = data['Amount'].sum()
print(f'Total: {total}')

# 保存处理后的数据
data.to_excel('processed_data.xlsx', index=False)1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.

说明：这段代码展示了如何读取Excel文件、删除缺失值、计算总和以及保存处理后的数据。

12. 日志记录与调试

在编写程序时，日志记录是非常重要的。Python的logging模块可以帮助我们记录程序运行过程中的信息。

import logging

# 配置日志记录
logging.basicConfig(filename='app.log', level=logging.INFO)

# 记录日志
logging.info('This is an info message')
logging.warning('This is a warning message')
logging.error('This is an error message')1.
2.
3.
4.
5.
6.
7.
8.
9.

说明：这段代码展示了如何配置日志记录并记录不同级别的日志信息。

13. 数据可视化

数据可视化可以让数据分析更加直观。Python的matplotlib和seaborn库提供了丰富的绘图功能。

import matplotlib.pyplot as plt
import seaborn as sns
import pandas as pd

# 读取数据
data = pd.read_csv('data.csv')

# 绘制柱状图
plt.figure(figsize=(10, 6))
sns.barplot(x='Category', y='Value', data=data)
plt.title('Bar Plot')
plt.xlabel('Category')
plt.ylabel('Value')
plt.show()

# 绘制折线图
plt.figure(figsize=(10, 6))
sns.lineplot(x='Date', y='Sales', data=data)
plt.title('Line Plot')
plt.xlabel('Date')
plt.ylabel('Sales')
plt.show()

# 绘制散点图
plt.figure(figsize=(10, 6))
sns.scatterplot(x='X', y='Y', data=data)
plt.title('Scatter Plot')
plt.xlabel('X')
plt.ylabel('Y')
plt.show()1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.
22.
23.
24.
25.
26.
27.
28.
29.
30.

说明：这段代码展示了如何绘制柱状图、折线图和散点图。

14. PDF操作

PDF文件在很多场合都非常常用。Python的PyPDF2库可以用来处理PDF文件。

import PyPDF2

# 读取PDF文件
pdf_file = open('input.pdf', 'rb')
reader = PyPDF2.PdfReader(pdf_file)

# 获取页面数量
num_pages = len(reader.pages)
print(f'Number of pages: {num_pages}')

# 提取第一页内容
page = reader.pages[0]
text = page.extract_text()
print(f'First page content:\n{text}')

# 合并PDF文件
pdf_writer = PyPDF2.PdfWriter()
for page_num in range(num_pages):
    page = reader.pages[page_num]
    pdf_writer.add_page(page)

# 写入新文件
output_pdf = open('output.pdf', 'wb')
pdf_writer.write(output_pdf)

# 关闭文件
pdf_file.close()
output_pdf.close()1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.
22.
23.
24.
25.
26.
27.
28.

说明：这段代码展示了如何读取PDF文件、提取页面内容以及合并PDF文件。

15. 数据加密与解密

数据安全性非常重要。Python的cryptography库可以用来加密和解密数据。

from cryptography.fernet import Fernet

# 生成密钥
key = Fernet.generate_key()
cipher_suite = Fernet(key)

# 加密数据
plaintext = b"This is a secret message."
ciphertext = cipher_suite.encrypt(plaintext)
print(f'Ciphertext: {ciphertext}')

# 解密数据
decrypted_text = cipher_suite.decrypt(ciphertext)
print(f'Decrypted text: {decrypted_text.decode()}')1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.

说明：这段代码展示了如何生成密钥、加密数据以及解密数据。

16. 文件监控与通知

监控文件夹的变化并及时通知是非常有用的。Python的watchdog库可以帮助我们实现这一功能。

from watchdog.observers import Observer
from watchdog.events import FileSystemEventHandler

class MyHandler(FileSystemEventHandler):
    def on_modified(self, event):
        print(f'Event type: {event.event_type}  path : {event.src_path}')
        print('File modified!')

# 设置观察器
observer = Observer()
observer.schedule(MyHandler(), path='C:/Users/YourName/Documents/', recursive=True)
observer.start()

try:
    while True:
        pass
except KeyboardInterrupt:
    observer.stop()

observer.join()1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.

说明：这段代码展示了如何设置文件夹监控并在文件发生变化时打印通知。

17. 自动化办公任务

自动化办公任务可以大大提高工作效率。Python可以用来自动化Excel、Word等办公软件的操作。

import win32com.client

# 创建Excel应用程序对象
excel = win32com.client.Dispatch("Excel.Application")
excel.Visible = True

# 新建一个工作簿
workbook = excel.Workbooks.Add()

# 在单元格中写入数据
worksheet = workbook.Worksheets(1)
worksheet.Cells(1, 1).Value = "Hello, World!"

# 保存工作簿
workbook.SaveAs("C:/Users/YourName/Documents/automated_workbook.xlsx")

# 关闭工作簿
workbook.Close(SaveChanges=True)

# 关闭Excel应用程序
excel.Application.Quit()1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.

说明：这段代码展示了如何创建Excel应用程序对象、新建工作簿、写入数据并保存。

18. 自动化邮件发送

自动化邮件发送可以在特定时间发送邮件，提高工作效率。

import smtplib
from email.mime.text import MIMEText
from email.mime.multipart import MIMEMultipart

sender_email = "your_email@example.com"
receiver_email = "receiver_email@example.com"
password = input("Type your password and press enter:")

message = MIMEMultipart("alternative")
message["Subject"] = "Automated Email"
message["From"] = sender_email
message["To"] = receiver_email

# 创建纯文本和HTML版本的消息
text = """\
Hi,
This is an automated email from Python.
"""

html = """\
<html>
  <body>
    <p>Hi,<br>
       This is an automated email from Python.
    </p>
  </body>
</html>
"""

# 将纯文本和HTML消息转换为MIMEText对象
part1 = MIMEText(text, "plain")
part2 = MIMEText(html, "html")

# 将纯文本和HTML消息附加到MIMEMultipart消息
message.attach(part1)
message.attach(part2)

# 创建安全连接并发送邮件
context = ssl.create_default_context()
with smtplib.SMTP_SSL("smtp.gmail.com", 465, context=context) as server:
    server.login(sender_email, password)
    server.sendmail(sender_email, receiver_email, message.as_string())1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.
22.
23.
24.
25.
26.
27.
28.
29.
30.
31.
32.
33.
34.
35.
36.
37.
38.
39.
40.
41.
42.

说明：这段代码展示了如何创建纯文本和HTML版本的消息并发送邮件。

19. 实战案例：自动化数据分析流程

假设你是一名数据分析师，每天需要处理大量的销售数据。我们可以使用Python自动化整个数据分析流程。

读取数据
数据清洗
数据处理
数据可视化
发送报告

import smtplib
from email.mime.text import MIMEText
from email.mime.multipart import MIMEMultipart

sender_email = "your_email@example.com"
receiver_email = "receiver_email@example.com"
password = input("Type your password and press enter:")

message = MIMEMultipart("alternative")
message["Subject"] = "Automated Email"
message["From"] = sender_email
message["To"] = receiver_email

# 创建纯文本和HTML版本的消息
text = """\
Hi,
This is an automated email from Python.
"""

html = """\
<html>
  <body>
    <p>Hi,<br>
       This is an automated email from Python.
    </p>
  </body>
</html>
"""

# 将纯文本和HTML消息转换为MIMEText对象
part1 = MIMEText(text, "plain")
part2 = MIMEText(html, "html")

# 将纯文本和HTML消息附加到MIMEMultipart消息
message.attach(part1)
message.attach(part2)

# 创建安全连接并发送邮件
context = ssl.create_default_context()
with smtplib.SMTP_SSL("smtp.gmail.com", 465, context=context) as server:
    server.login(sender_email, password)
    server.sendmail(sender_email, receiver_email, message.as_string())1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.
22.
23.
24.
25.
26.
27.
28.
29.
30.
31.
32.
33.
34.
35.
36.
37.
38.
39.
40.
41.
42.

说明：这段代码展示了如何读取销售数据、进行数据清洗、数据处理、数据可视化并将结果发送给指定的邮箱。