Commit Graph

57 Commits

Author SHA1 Message Date
korenlazar
7b63eab7bd leftover from last commit 2022-10-04 11:42:57 +03:00
korenlazar
ceff48dbd9 Fixed the bug with cerberus_web_client.py by working with Selenium. To login each chain working with it must have a username for login with Selenium. in this mechanism, a path to a gz file is returned instead of url
Added the option to output a prices json file in main.py under --prices-with-promos, where the prices are updated by the latest promotions (under the 'final_price' key, where 'price' represents the price before promotions).

Fixed small bug of BinaWebCleint by checking that filename does not contain 'null'.

Changed Hierarchy of chains such that it includes the webclients.

Added the date to the output filenames to start storing the data over time.

Black formatting (according to pip 8 guidelines).

Changed the chains_dict in main to a constant one.
2022-10-04 11:42:36 +03:00
korenLazar
b5db721a3d Merge pull request #6 from korenLazar/test-scraping
Test scraping
2021-08-18 12:26:23 +03:00
KorenLazar
90cab0a2e1 Minor changes 2021-08-18 11:32:04 +03:00
KorenLazar
87b6fbe2b0 Changed ClubID enum class to include a string field used for printing, and define ClubID.OTHER as a default value for the class to handle invalid inputs. 2021-08-18 11:30:31 +03:00
KorenLazar
322995ba15 Added TODO for ordering the argparse 2021-08-18 11:16:25 +03:00
KorenLazar
294dee8cc2 Added test for searching different files' urls. Specifically, asserting the searching non-full files does not yield urls of full files. 2021-08-17 13:08:39 +03:00
KorenLazar
cffdd84086 Added specific searching for the download url of non-full promotions and prices files. Changed return value of get_download_url accordingly. 2021-08-17 13:06:42 +03:00
KorenLazar
3770352d04 Added new requirements to requirements.txt 2021-08-17 09:35:20 +03:00
KorenLazar
63fec1490c Added new requirements to requirements.txt 2021-08-17 09:18:45 +03:00
KorenLazar
c1281cb312 Added a test for scraping the promotions and exporting them to xlsx files. 2021-08-16 23:09:10 +03:00
KorenLazar
1a88ed6e01 minor changes 2021-08-16 23:08:04 +03:00
KorenLazar
9b0ab013c9 Added requirements to requirements.txt 2021-08-16 23:07:32 +03:00
KorenLazar
1a6707341d Logical fixes in promotions scraping and calculation. 2021-08-16 23:07:07 +03:00
KorenLazar
844a106c57 Added tqdm 2021-08-16 23:05:16 +03:00
KorenLazar
c793057623 Documentation and minor changes 2021-08-16 14:06:54 +03:00
KorenLazar
13991aaa40 Documentation and minor changes 2021-08-16 14:05:22 +03:00
KorenLazar
b3d410306d Removed filtering by PRODUCTS_TO_IGNORE 2021-08-16 14:04:46 +03:00
korenLazar
62089dd538 Merge pull request #5 from korenLazar/export-promotions-to-xlsx-table
Export promotions to xlsx table
2021-08-16 12:51:48 +03:00
KorenLazar
03ff6d5281 Changed create_items_dict function to included non-full prices file in the items dictionary.
Changed log_products_prices to work with an items dictionary and a __repr__ function of the Item class.
2021-08-16 12:44:32 +03:00
KorenLazar
e09b2da4a1 removed get_all_deals function 2021-08-16 12:43:01 +03:00
KorenLazar
58bb04f1dd Added get_all_promos_tags function and included the non-full promotions file in the promotions collection. 2021-08-16 12:42:38 +03:00
KorenLazar
ebb1e912b9 Change INFO logging format 2021-08-16 12:40:06 +03:00
KorenLazar
98dcc1c33d Add price_by_measure member to Item object 2021-08-16 12:39:28 +03:00
korenLazar
8a726ff605 Merge pull request #4 from korenLazar/export-promotions-to-xlsx-table
finished implementing exporting promotion to xlsx table and automatic…
2021-06-17 10:36:20 +03:00
KorenLazar
27b45a4999 finished implementing exporting promotion to xlsx table and automatically opening the xlsx file 2021-06-01 21:00:40 +03:00
KorenLazar
ec505dba67 minor rephrasing in documentation 2021-05-18 14:34:11 +03:00
korenLazar
e740b122ff Merge pull request #1 from 1kamma/master
this will be better for the windows and unix-bases
2021-04-17 18:34:11 +03:00
d4ba19bf41 remove_unneeded 2021-04-17 12:07:08 +03:00
2a4b6562b7 change encoding 2021-04-17 12:06:25 +03:00
KorenLazar
9f5464317d Has added tests for the promotion functions for Shufersal and CoOp. Also added minor design changes in promotion.py and item.py 2021-03-08 14:13:30 +02:00
KorenLazar
c86fc7c1ab Moved to writing solely to CSV. Added some columns and drastly improved the logics behind price after promotion column. 2021-02-25 20:54:44 +02:00
KorenLazar
8aa33cbcda added columns to csv: price after promotion, discount in percentage and promotion type (regular/club/credit card). 2021-02-23 08:27:00 +02:00
KorenLazar
850d3963fe has added binaproject clients 2021-02-07 10:46:54 +02:00
KorenLazar
9983d07c2b replaced the member '_class_name' by the 'class.__name__' 2021-02-07 08:18:22 +02:00
KorenLazar
67bff9fa76 minor changes 2021-02-06 22:53:18 +02:00
KorenLazar
18f3fa32b9 has added many chains 2021-02-06 21:42:31 +02:00
KorenLazar
5aa4cd734d changed chains' members to be 'immutable static' 2021-02-06 15:57:05 +02:00
KorenLazar
3a57edf5af has added RamiLevi to the chains collection 2021-02-06 14:41:04 +02:00
KorenLazar
d7e5b709f8 has added a CSV format promotions file when running --promos. Item.py was added for moduling. 2021-01-28 16:25:38 +02:00
KorenLazar
47c0d04ce4 Updated README.md with latest changes and directory names to be more meaningful 2021-01-28 14:13:34 +02:00
KorenLazar
6bcbdee486 added sorting by promotiong length (shorter first) 2021-01-09 16:35:55 +02:00
KorenLazar
ee35e3436c Has added Zol Vebengadol. Fixed small bug in Co-Op. 2020-12-26 17:18:21 +02:00
KorenLazar
be47a5ad81 has added Co-Op chain. Extendability to other chain has significantly improved as well. 2020-12-17 22:48:10 +02:00
KorenLazar
4c0eba1771 added --find_promos_by_name option 2020-12-16 16:09:43 +02:00
KorenLazar
db6bfb3632 changed create_bs mechanism and moved to simply printing to stdout in --find_store_id and --price 2020-12-15 11:28:17 +02:00
KorenLazar
975447dcd7 splitted to multiple files 2020-12-13 13:33:07 +02:00
KorenLazar
c16f0bada9 Merge branch 'master' of https://github.com/korenLazar/shufersal-scraping 2020-12-07 13:50:03 +02:00
KorenLazar
795d7050a5 has added gitignore file 2020-12-07 13:49:05 +02:00
KorenLazar
dfca61c793 has fixed has_expired validation, added start_date and has_started validation, changed promotions sorting to max(update_date,start_date), 2020-12-07 13:48:16 +02:00