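A crawler can handle pagination in several common ways. The basic steps and examples are as follows:
1. URL-based pagination
Many sites implement pagination through URL query parameters, for example: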
```
http://example.com/page1.html
http://example.com/page2.html?pn=2
http://example.com/page3.html?pn=3
```
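In this case, you can parse the current page's URL and change the page-number query parameter to reach the next page. Note that in the URLs above the first page has no `pn` parameter, so a missing parameter should be treated as page 1.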
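The query-parameter rewrite described above can be sketched with the standard library's `urllib.parse`, which avoids manual string splitting and keeps any other query parameters intact. The function name `increment_page_param` and the default parameter name `pn` are assumptions for illustration, as is the choice to treat a URL without the parameter as page 1.

```python
from urllib.parse import parse_qs, urlencode, urlsplit, urlunsplit


def increment_page_param(url, param="pn"):
    """Return url with its page-number query parameter increased by one."""
    parts = urlsplit(url)
    query = parse_qs(parts.query)
    # Assumption: a URL without the parameter is the first page.
    current = int(query.get(param, ["1"])[0])
    query[param] = [str(current + 1)]
    # doseq=True writes each list value back as a separate key=value pair.
    return urlunsplit(parts._replace(query=urlencode(query, doseq=True)))


print(increment_page_param("http://example.com/page2.html?pn=2"))
```

This only rewrites the query string. If a site also changes the path for each page, as the sample URLs above appear to, the path would need its own handling.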
Python example using `requests`:
```python
import requests
from urllib.parse import urlencode


def get_page(url):
    # Fetch the page and return its HTML as text.
    response = requests.get(url)
    return response.text


def get_next_page(url):
    # Assume pn is the page-number parameter; a URL without it is page 1.
    current_page_number = int(url.split('?pn=')[-1]) if '?pn=' in url else 1
    next_page_number = current_page_number + 1
    next_page_url = f"{url.split('?')[0]