http Archives - Page 6 of 10

How to download a file over HTTP?

August 22, 2022 by Magenaut

I have a small utility that I use to download an MP3 file from a website on a schedule and then builds/updates a podcast XML file which I’ve added to iTunes.

What is the fastest way to send 100,000 HTTP requests in Python?

August 21, 2022 by Magenaut

I am opening a file which has 100,000 URL’s. I need to send an HTTP request to each URL and print the status code. I am using Python 2.6, and so far looked at the many confusing ways Python implements threading/concurrency. I have even looked at the python concurrence library, but cannot figure out how to write this program correctly. Has anyone come across a similar problem? I guess generally I need to know how to perform thousands of tasks in Python as fast as possible – I suppose that means ‘concurrently’.

How do you send a HEAD HTTP request in Python 2?

August 20, 2022 by Magenaut

What I’m trying to do here is get the headers of a given URL so I can determine the MIME type. I want to be able to see if http://somedomain/foo/ will return an HTML document or a JPEG image for example. Thus, I need to figure out how to send a HEAD request so that I can read the MIME type without having to download the content. Does anyone know of an easy way of doing this?

urllib2.HTTPError: HTTP Error 403: Forbidden

August 20, 2022 by Magenaut

I am trying to automate download of historic stock data using python. The URL I am trying to open responds with a CSV file, but I am unable to open using urllib2. I have tried changing user agent as specified in few questions earlier, I even tried to accept response cookies, with no luck. Can you please help.

How to use Python to login to a webpage and retrieve cookies for later usage?

August 20, 2022 by Magenaut

I want to download and parse webpage using python, but to access it I need a couple of cookies set. Therefore I need to login over https to the webpage first. The login moment involves sending two POST params (username, password) to /login.php. During the login request I want to retrieve the cookies from the response header and store them so I can use them in the request to download the webpage /data.php.

Python requests – print entire http request (raw)?

August 19, 2022 by Magenaut

While using the requests module, is there any way to print the raw HTTP request?

What is the quickest way to HTTP GET in Python?

August 19, 2022 by Magenaut

What is the quickest way to HTTP GET in Python if I know the content will be a string? I am searching the documentation for a quick one-liner like:

Problem HTTP error 403 in Python 3 Web Scraping

August 19, 2022 by Magenaut

I was trying to scrape a website for practice, but I kept on getting the HTTP Error 403 (does it think I’m a bot)?

Using an HTTP PROXY – Python

August 19, 2022 by Magenaut

I familiar with the fact that I should set the HTTP_RPOXY environment variable to the proxy address. Generally urllib works fine, the problem is dealing with urllib2. >>> urllib2.urlopen("http://www.google.com").read() returns urllib2.URLError: <urlopen error [Errno 10061] No connection could be made because the target machine actively refused it> or urllib2.URLError: <urlopen error [Errno 11004] getaddrinfo failed> … Read more

Python urllib2, basic HTTP authentication, and tr.im

August 18, 2022 by Magenaut

I’m playing around, trying to write some code to use the tr.im
APIs to shorten a URL.