大漠驼铃

置身浩瀚的沙漠，方向最为重要，希望此blog能向大漠驼铃一样，给我方向和指引。
Java,Php,Shell,Python,服务器运维,大数据，SEO, 网站开发、运维,云服务技术支持，IM服务供应商, FreeSwitch搭建，技术支持等. 技术讨论QQ群：428622099

随笔 - 238, 文章 - 3, 评论 - 117, 引用 - 0

数据加载中……

Python Urllib2

http://docs.python.org/library/urllib.html

设置超时时间

import socket

import urllib2

socket.setdefaulttimeout(seconds)

open = urllib2.urlopen("http://www.shuoqian.net")

过socket.setdefaulttimeout() 设置了全局默认超时时间，从而给urllibe2.urlopen()也设置了默认的超时时间

抓取图片(这个例子里的Request找不到，我一般只有urllib2)
soup=urlllib2.open(url)

# Let's create a function that downloads a file, and saves it locally.

# This function accepts a file name, a read/write mode(binary or text),

# and the base url.

def stealStuff(file_name,file_mode,base_url):

from urllib2 import Request, urlopen, URLError, HTTPError

#create the url and the request

url = base_url + file_name

req = Request(url)

# Open the url

try:

f = urlopen(req)

print "downloading " + url

# Open our local file for writing

local_file = open(file_name, "w" + file_mode)

#Write to our local file

local_file.write(f.read())

local_file.close()

#handle errors

except HTTPError, e:

print "HTTP Error:",e.code , url

except URLError, e:

print "URL Error:",e.reason , url

# Set the range of images to 1-50.It says 51 because the

# range function never gets to the endpoint.

image_range = range(1,51)

# Iterate over image range

for index in image_range:

base_url = 'http://www.techniqal.com/'

#create file name based on known pattern

file_name = str(index) + ".jpg"

# Now download the image. If these were text files,

# or other ascii types, just pass an empty string

# for the second param ala stealStuff(file_name,'',base_url)

stealStuff(file_name,"b",base_url)

posted on 2011-03-04 16:02 草原上的骆驼阅读(1194) 评论(0) 编辑收藏所属分类: Python

新用户注册刷新评论列表


只有注册用户登录后才能发表评论。




网站导航: 博客园 IT新闻 Chat2DB C++博客博问管理
相关文章: python 判断 null Python 连接 Mysql python的time和date处理 Python IO Python Mysql python 中文乱码 Python 正则表达式 Python Beautiful Soup Python UUID Python Urllib2

大漠驼铃

Python Urllib2

导航

公告

常用链接

留言簿(11)

随笔分类(214)

随笔档案(239)

文章分类(1)

文章档案(1)

相册

作品

搜索

积分与排名

最新评论

阅读排行榜

评论排行榜