Automation of data collection and uploading the products to the online clothes shop
Web Scraping Case Study in Ecommerce: find out how a retailer has automated the process of adding products to their online store from the supplier’s website.

Customer challenges
The customer sells goods from suppliers located in multiple countries through their online store. One of their primary suppliers has a product catalog with up to a million SKUs. Unfortunately, this supplier does not have its own API for exchanging data with their customers (online stores). Working with such vast volumes of data manually is impossible, and it was of utmost importance for our customer to obtain accurate information regarding the goods, their prices, and product availability.
The main details of the project
- Prices and discounts for products change frequently on the supplier’s end
- The availability and sizes of the goods from the supplier change regularly.
- It is crucial to receive the most recent data promptly.
- The supplier’s website displays different products and prices based on the country from which the user accesses the site.
- The supplier’s website has protections in place to prevent a scraper from frequently accessing pages from the same IP address.
- Depending on the brand’s country of origin, the supplier offers different clothing and footwear with various sizing charts.
Solution
Our company developed a web scraper that automatically gathers data on a daily basis. This scraper is also capable of transforming the information about prices and sizing charts into a format suitable for the online store.
Project stages
-
1
Customer requirements
We gathered customer requirements and reviewed the supplier's website.
-
2
Scraper development
We developed a scraper that collects all the necessary data.
-
3
Data transformation
We developed an additional program which transforms sizes of the clothes and shoes.
-
4
Setting up data transfer
We integrated with the customer's data warehouse, so that the customer receives the data directly to their system.
-
5
Testing and launch
System stability testing and after that we made a commercial launch of the program.
Implementation
To solve this problem, a number of technical solutions were applied, namely:
- We developed a scraper that works with a minimum of calls to the supplier’s site, this allows us not to load the site each time and at the same time get all the necessary information.
- Since prices and availability of products change frequently, the customer needs to update the data as frequently as on the supplier’s side. We have configured our scraping infrastructure solution in such a way that we receive data about the entire catalogue and spend a minimum of time on this task. The data collection time is calculated so that the customer has fresh data by the time he updates his catalogue.
- Since the supplier’s website is a multilingual store with visitors from different countries, the prices and availability of goods vary depending on the location of the visitor. We have set up data scraping from the customer’s desired geographic location.
- To ensure that the site does not block data collection, and we could meet the deadline in which to provide updated prices, we have configured scraper work so that it does not load the supplier’s site. Also we used an IP server rotator, it helped us avoid blocking the scraper.
- One of the problems faced by the customer was the availability of different sizes of clothes and shoes. If we would leave everything as it is, users would have to find the right product on their own using the comparison tables. It would be inconvenient and could lead to errors and as a result to product returns and refunds. It was necessary to bring the size grids of hundreds of thousands of goods to the sizes of clothes and shoes to one universal format that would be convenient to all the users. To solve this problem, we developed a program that converts the dimensions and size grids into understandable format to all the users.
- The collected data is transmitted to the customer in their data storage system in the required format.

Project results
- A flexible solution has been developed that can be reconfigured to meet changing requirements.
- The entire process of uploading and updating products to the online store is fully automated.
- The customer regularly receives up-to-date information about prices and availability of goods from the supplier.
- Pricing Craft provides full technical support for the existing solution and we make sure on a daily basis that everything works correctly.
How to increase the profit of the online store?
Pricing Craft is a service for monitoring competitors’ prices.
We help to increase profits by optimising pricing.
