首页技术 urllib和urllib3(urllib2和urllib3的区别)

urllib和urllib3(urllib2和urllib3的区别)

编程之家2026-06-051160次浏览

大家好，urllib和urllib3相信很多的网友都不是很明白，包括urllib2和urllib3的区别也是一样，不过没有关系，接下来就来为大家分享关于urllib和urllib3和urllib2和urllib3的区别的一些知识点，大家可以关注收藏，免得下次来找不到哦，下面我们开始吧！

urllib和urllib3(urllib2和urllib3的区别)

python的httplib,urllib和urllib2的区别及用

宗述

首先来看一下他们的区别

urllib和urllib2

urllib和urllib2都是接受URL请求的相关模块，但是urllib2可以接受一个Request类的实例来设置URL请求的headers，urllib仅可以接受URL。

这意味着，你不可以伪装你的User Agent字符串等。

urllib提供urlencode方法用来GET查询字符串的产生，而urllib2没有。这是为何urllib常和urllib2一起使用的原因。

urllib和urllib3(urllib2和urllib3的区别)

目前的大部分http请求都是通过urllib2来访问的

httplib

httplib实现了HTTP和HTTPS的客户端协议，一般不直接使用，在更高层的封装模块中（urllib,urllib2）使用了它的http实现。

urllib简单用法

urllib.urlopen(url[, data[, proxies]]):

[python] view plain copy

urllib和urllib3(urllib2和urllib3的区别)

google= urllib.urlopen('')

print'http header:/n', google.info()

print'http status:', google.getcode()

print'url:', google.geturl()

for line in google:#就像在操作本地文件

print line,

google.close()

详细使用方法见

urllib学习

urllib2简单用法

最简单的形式

import urllib2

response=urllib2.urlopen(')

html=response.read()

实际步骤：

1、urllib2.Request()的功能是构造一个请求信息，返回的req就是一个构造好的请求

2、urllib2.urlopen()的功能是发送刚刚构造好的请求req，并返回一个文件类的对象response，包括了所有的返回信息。

3、通过response.read()可以读取到response里面的html，通过response.info()可以读到一些额外的信息。

如下：

#!/usr/bin/env python

import urllib2

req= urllib2.Request("")

response= urllib2.urlopen(req)

html= response.read()

print html

有时你会碰到，程序也对，但是服务器拒绝你的访问。这是为什么呢?问题出在请求中的头信息(header)。有的服务端有洁癖，不喜欢程序来触摸它。这个时候你需要将你的程序伪装成浏览器来发出请求。请求的方式就包含在header中。

常见的情形：

import urllib

import urllib2

url='

user_agent='Mozilla/4.0(compatible; MSIE 5.5; Windows NT)'#将user_agent写入头信息

values={'name':'who','password':'123456'}

headers={'User-Agent': user_agent}

data= urllib.urlencode(values)

req= urllib2.Request(url, data, headers)

response= urllib2.urlopen(req)

the_page= response.read()

values是post数据

GET方法

例如百度：

，这样我们需要将{‘wd’:’xxx’}这个字典进行urlencode

#coding:utf-8

import urllib

import urllib2

url=''

values={'wd':'D_in'}

data= urllib.urlencode(values)

print data

url2= url+'?'+data

response= urllib2.urlopen(url2)

the_page= response.read()

print the_page

POST方法

import urllib

import urllib2

url=''

user_agent='Mozilla/4.0(compatible; MSIE 5.5; Windows NT)'//将user_agent写入头信息

values={'name':'who','password':'123456'}//post数据

headers={'User-Agent': user_agent}

data= urllib.urlencode(values)//对post数据进行url编码

req= urllib2.Request(url, data, headers)

response= urllib2.urlopen(req)

the_page= response.read()

urllib2带cookie的使用

#coding:utf-8

import urllib2,urllib

import cookielib

url= r''

#创建一个cj的cookie的容器

cj= cookielib.CookieJar()

opener= urllib2.build_opener(urllib2.HTTPCookieProcessor(cj))

#将要POST出去的数据进行编码

data= urllib.urlencode({"email":email,"password":pass})

r= opener.open(url,data)

print cj

httplib简单用法

简单示例

#!/usr/bin/env python

#-*- coding: utf-8-*-

import httplib

import urllib

def sendhttp():

data= urllib.urlencode({'@number': 12524,'@type':'issue','@action':'show'})

headers={"Content-type":"application/x-www-form-urlencoded",

"Accept":"text/plain"}

conn= httplib.HTTPConnection('bugs.python.org')

conn.request('POST','/', data, headers)

httpres= conn.getresponse()

print httpres.status

print httpres.reason

print httpres.read()

if __name__=='__main__':

sendhttp()

具体用法见

httplib模块

python 3.x中urllib库和urilib2库合并成了urllib库。其中、

首先你导入模块由

import urllib

import urllib2

变成了

import urllib.request

然后是urllib2中的方法使用变成了如下

urllib2.urlopen()变成了urllib.request.urlopen()

urllib2.Request()变成了urllib.request.Request()

urllib2.URLError变成了urllib.error.URLError

而当你想使用urllib带数据的post请求时，

在python2中

urllib.urlencode(data)

而在python3中就变成了

urllib.parse.urlencode(data)

脚本使用举例：

python 2中

import urllib

import urllib2

import json

from config import settings

def url_request(self, action, url,**extra_data): abs_url=""%(settings.configs['Server'],

settings.configs["ServerPort"],

url)

if action in('get','GET'):

print(abs_url, extra_data)

try:

req= urllib2.Request(abs_url)

req_data= urllib2.urlopen(req, timeout=settings.configs['RequestTimeout'])

callback= req_data.read()

# print"-->server response:",callback

return callback

except urllib2.URLError as e:

exit("\033[31;1m%s\033[0m"% e)

elif action in('post','POST'):

# print(abs_url,extra_data['params'])

try:

data_encode= urllib.urlencode(extra_data['params'])

req= urllib2.Request(url=abs_url, data=data_encode)

res_data= urllib2.urlopen(req, timeout=settings.configs['RequestTimeout'])

callback= res_data.read()

callback= json.loads(callback)

print("\033[31;1m[%s]:[%s]\033[0m response:\n%s"%(action, abs_url, callback))

return callback

except Exception as e:

print('---exec', e)

exit("\033[31;1m%s\033[0m"% e)

python3.x中

import urllib.request

import json

from config import settings

def url_request(self, action, url,**extra_data):

abs_url='(settings.configs['ServerIp'], settings.configs['ServerPort'], url)

if action in('get','Get'):# get请求

print(action, extra_data)try:

req= urllib.request.Request(abs_url)

req_data= urllib.request.urlopen(req, timeout=settings.configs['RequestTimeout'])

callback= req_data.read()

return callback

except urllib.error.URLError as e:

exit("\033[31;1m%s\033[0m"% e)

elif action in('post','POST'):# post数据到服务器端

try:

data_encode= urllib.parse.urlencode(extra_data['params'])

req= urllib.request.Request(url=abs_url, data=data_encode)

req_data= urllib.request.urlopen(req, timeout=settings.configs['RequestTimeout'])

callback= req_data.read()

callback= json.loads(callback.decode())

return callback

except urllib.request.URLError as e:

print('---exec', e)

exit("\033[31;1m%s\033[0m"% e)

settings配置如下：

configs={

'HostID': 2,

"Server":"localhost",

"ServerPort": 8000,

"urls":{

'get_configs': ['api/client/config','get'],#acquire all the services will be monitored

'service_report': ['api/client/service/report/','post'],

},

'RequestTimeout': 30,

'ConfigUpdateInterval': 300,# 5 mins as default

}

python2.7 怎样集成 urllib2

python最恶心的地方就在于它的版本和配置了，特别是安装第三方包的时候经常会出现莫名其妙的错误，又不懂。

所以只能不断的切来切去的。

今天学习python爬虫，其中Python2.7使用了urllib和urllib2，python3的urllib结合了py2.7的两部分。但是电脑不知为什么又安装不了py3的urllib，好烦。出现下面的错误。

python2.7和python3主要是模块的位置变化地方较多。

其中python2.7的urllib和urllib2的区别一下：

urllib2可以接受一个Request类的实例来设置URL请求的headers，urllib仅可以接受URL。这意味着，你不可以通过urllib模块伪装你的User Agent字符串等（伪装浏览器）。

urllib提供urlencode方法用来GET查询字符串的产生，而urllib2没有。这是为何urllib常和urllib2一起使用的原因。

urllib2模块比较优势的地方是urlliburllib2.urlopen可以接受Request对象作为参数，从而可以控制HTTP Request的header部。

但是urllib.urlretrieve函数以及urllib.quote等一系列quote和unquote功能没有被加入urllib2中，因此有时也需要urllib的辅助。

OK，本文到此结束，希望对大家有所帮助。

单片机汇编语言汇编语言是高级语言吗写html的软件(编辑html的软件有哪些)