Magenaut

screen-scraping

Scraping ajax pages using python

August 16, 2022 by Magenaut

I’ve already seen this question about scraping ajax, but Python isn’t mentioned there. I considered using Scrapy; I believe they have some docs on the subject, but as you can see, the website is down, so I don’t know what to do. I want to do the following:

Categories: Python, Q&A. Tags: ajax, python, scrapy, screen-scraping, web-scraping.

Download image file from the HTML page source using python?

August 16, 2022 by Magenaut

I am writing a scraper that downloads all the image files from an HTML page and saves them to a specific folder. All the images are part of the HTML page.
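
One possible shape for such a scraper, as a minimal sketch using only the standard library: collect every `<img src>` value, resolve it against the page URL, and write each file into the target folder. The names `ImgSrcParser`, `image_urls`, and `download_images` are hypothetical and not taken from the original question; the sketch assumes UTF-8-decodable pages and ignores images referenced from CSS or loaded by scripts.

```python
from html.parser import HTMLParser
from pathlib import Path
from urllib.parse import urljoin, urlparse
from urllib.request import urlopen


class ImgSrcParser(HTMLParser):
    """Collect the src attribute of every <img> tag (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.sources.append(src)


def image_urls(html, base_url):
    """Return absolute image URLs found in html, resolved against base_url."""
    parser = ImgSrcParser()
    parser.feed(html)
    return [urljoin(base_url, src) for src in parser.sources]


def download_images(page_url, folder):
    """Fetch page_url and save each referenced image into folder."""
    target = Path(folder)
    target.mkdir(parents=True, exist_ok=True)
    with urlopen(page_url) as response:
        html = response.read().decode("utf-8", errors="replace")
    for url in image_urls(html, page_url):
        # Fall back to a fixed name when the URL path has no file name.
        name = Path(urlparse(url).path).name or "image"
        with urlopen(url) as img:
            (target / name).write_bytes(img.read())
```

Calling `download_images(page_url, "images")` would perform the network requests; `image_urls` can be tried on a literal HTML string without any network access. Two images with the same file name would overwrite each other in this sketch.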

Categories: Python, Q&A. Tags: python, screen-scraping.

web scraping dynamic content with python

August 14, 2022 by Magenaut

I’d like to use Python to scrape the contents of the “Were you looking for these authors:” box on web pages like this one: http://academic.research.microsoft.com/Search?query=lander

Categories: Python, Q&A. Tags: python, screen-scraping, web-scraping.

Beautiful Soup cannot find a CSS class if the object has other classes, too

August 13, 2022 by Magenaut

If a page has <div class="class1"> and <p class="class1">, then soup.findAll(True, 'class1') will find them both.
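
The underlying issue in the title is that an HTML class attribute is a whitespace-separated list of tokens, so a tag such as `<p class="class1 class2">` only matches `class1` when the comparison is made per token rather than against the whole attribute string. The sketch below illustrates that idea with the standard library's `html.parser`; it is not Beautiful Soup's implementation, and `ClassFinder` and `find_by_class` are hypothetical names.

```python
from html.parser import HTMLParser


class ClassFinder(HTMLParser):
    """Record every tag whose class attribute contains the wanted token."""

    def __init__(self, wanted):
        super().__init__()
        self.wanted = wanted
        self.matches = []

    def handle_starttag(self, tag, attrs):
        class_value = dict(attrs).get("class") or ""
        # Split on whitespace so "class1 class2" matches "class1",
        # while "class12" does not.
        if self.wanted in class_value.split():
            self.matches.append((tag, class_value))


def find_by_class(html, wanted):
    """Return (tag, class attribute) pairs for tags carrying the class."""
    finder = ClassFinder(wanted)
    finder.feed(html)
    return finder.matches
```

In Beautiful Soup 4, `find_all(class_="class1")` and `select(".class1")` are documented to match individual classes in the same per-token way, so upgrading may be the simpler route if the older `findAll` behaviour is the obstacle.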

Categories: Python, Q&A. Tags: beautifulsoup, python, screen-scraping.

Python regular expression for HTML parsing (BeautifulSoup)

August 13, 2022 by Magenaut

I want to grab the value of a hidden input field in HTML.
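
A regular expression is one way to do this, but an HTML parser tends to be less fragile when attribute order or quoting varies. As a minimal sketch with the standard library (not the Beautiful Soup or regex approach the title mentions), the hypothetical `hidden_fields` helper below collects the name and value of every hidden input; the field names in the usage note are made up for illustration.

```python
from html.parser import HTMLParser


class HiddenInputParser(HTMLParser):
    """Collect name/value pairs of <input type="hidden"> elements."""

    def __init__(self):
        super().__init__()
        self.fields = {}

    def handle_starttag(self, tag, attrs):
        if tag != "input":
            return
        attr = dict(attrs)
        # The type value is compared case-insensitively; attribute
        # names are already lowercased by HTMLParser.
        is_hidden = (attr.get("type") or "").lower() == "hidden"
        if is_hidden and attr.get("name"):
            self.fields[attr["name"]] = attr.get("value") or ""


def hidden_fields(html):
    """Return a dict mapping hidden input names to their values."""
    parser = HiddenInputParser()
    parser.feed(html)
    return parser.fields
```

For a form containing a hidden input named `token`, `hidden_fields(html)["token"]` would give its value. Inputs without a `name` attribute are skipped in this sketch.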

Categories: Python, Q&A. Tags: python, regex, screen-scraping.

unable to call firefox from selenium in python on AWS machine

August 12, 2022 by Magenaut

I am trying to use Selenium from Python to scrape some dynamic pages that rely on JavaScript. However, I cannot launch Firefox after following the Selenium instructions on the PyPI page (http://pypi.python.org/pypi/selenium). I installed Firefox on an AWS Ubuntu 12.04 machine. The error message I got is:

Categories: Python, Q&A. Tags: amazon-web-services, python, screen-scraping, selenium, web-scraping.

scrape websites with infinite scrolling

August 11, 2022 by Magenaut

I have written many scrapers, but I am not really sure how to handle infinite scrolling. These days most websites, such as Facebook and Pinterest, use infinite scrolling.
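
One common approach, which the excerpt itself does not describe, rests on the assumption that the page fills itself by requesting further "pages" of items in the background as the user scrolls. If such a paginated request can be identified (for example in the browser's network tab), a scraper can call it directly until it returns nothing. The sketch below shows only that loop; `fetch_page` is a hypothetical callable supplied by the caller, and `max_pages` is an assumed safety limit.

```python
from typing import Callable, Iterator, List


def scrape_all(
    fetch_page: Callable[[int], List[dict]],
    max_pages: int = 1000,
) -> Iterator[dict]:
    """Yield items page by page until a page comes back empty.

    fetch_page(n) is assumed to return the list of items for page n
    (1-based) and an empty list once the content is exhausted.
    """
    for page in range(1, max_pages + 1):
        items = fetch_page(page)
        if not items:
            break
        yield from items
```

With a real site, `fetch_page` would wrap whatever HTTP request the page issues while scrolling; sites that use cursors or tokens instead of page numbers would need a slightly different loop.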

Categories: Python, Q&A. Tags: python, scraper, screen-scraping.