Job description
I am looking for a programmer to help me build a system that scrapes content from selected websites, processes it using AI (ChatGPT), and publishes it automatically to a portal built on WordPress. The entire process should be as automated as possible, with minimal manual involvement.
The scope of the project includes:
Creating a scraper to pull content from other websites
I need a dedicated scraper that will automatically pull content from selected websites (e.g. articles, news, tutorials, journalism). The scraper should:
Use popular scraping tools such as BeautifulSoup, Requests or Scrapy (depending on the size of the project).
Handle changes to the source pages (e.g. changes in HTML structure, CSS, JavaScript) and deal with potential blockers (e.g. CAPTCHA, User-Agent filtering).
Save downloaded data in JSON format or in a lightweight database (e.g. SQLite).
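To illustrate the kind of result I have in mind for this step, here is a minimal sketch, not a required implementation. To keep it dependency-free it uses Python's standard library (urllib and html.parser) in place of Requests and BeautifulSoup. The function names, the User-Agent string, the "articles" table layout, and the rule "use the first h1, otherwise fall back to title" are all assumptions made for the example.

```python
import json
import sqlite3
import urllib.request
from html.parser import HTMLParser


class ArticleParser(HTMLParser):
    """Collects the text of <h1>, <title>, and <p> elements."""

    def __init__(self):
        super().__init__()
        self._current = None   # tag whose text is currently being collected
        self._buffer = []
        self.h1 = None
        self.title = None
        self.paragraphs = []

    def handle_starttag(self, tag, attrs):
        if tag in ("h1", "title", "p") and self._current is None:
            self._current = tag
            self._buffer = []

    def handle_data(self, data):
        if self._current is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag != self._current:
            return  # closing tag of a nested element such as <a> or <b>
        text = " ".join("".join(self._buffer).split())  # collapse whitespace
        if tag == "h1" and self.h1 is None:
            self.h1 = text
        elif tag == "title" and self.title is None:
            self.title = text
        elif tag == "p" and text:
            self.paragraphs.append(text)
        self._current = None


def fetch_html(url, user_agent="example-scraper/0.1"):
    """Download a page, sending an explicit (hypothetical) User-Agent."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    with urllib.request.urlopen(request, timeout=10) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def extract_article(html, url):
    """Return a dict with url, title, and body extracted from the HTML."""
    parser = ArticleParser()
    parser.feed(html)
    parser.close()
    return {
        "url": url,
        # Fall back to <title> when the page has no usable <h1>.
        "title": parser.h1 or parser.title or "",
        "body": "\n\n".join(parser.paragraphs),
    }


def save_article(conn, article):
    """Store the article in SQLite; re-scraping the same URL overwrites it."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS articles "
        "(url TEXT PRIMARY KEY, title TEXT, body TEXT)"
    )
    conn.execute(
        "INSERT OR REPLACE INTO articles (url, title, body) "
        "VALUES (:url, :title, :body)",
        article,
    )
    conn.commit()


if __name__ == "__main__":
    # Offline demo on an inline page; a real run would call fetch_html(url).
    sample = (
        "<html><head><title>Fallback title</title></head>"
        "<body><p>First <a href='#'>paragraph</a>.</p></body></html>"
    )
    article = extract_article(sample, "https://example.com/post")
    connection = sqlite3.connect(":memory:")
    save_article(connection, article)
    print(json.dumps(article, indent=2))
```

The same dictionary can be written out as JSON or stored in SQLite, which covers both storage options listed above. CAPTCHA handling and JavaScript-rendered pages are outside what this sketch covers.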
AI processing of downloaded data (ChatGPT)