korenlazar
b1737839ce
Fixed bug with Shufersal Scraping by changing xml files category back to normal Enum.
2022-10-04 12:09:42 +03:00
korenlazar
7b63eab7bd
leftover from last commit
2022-10-04 11:42:57 +03:00
korenlazar
ceff48dbd9
Fixed the bug in cerberus_web_client.py by switching to Selenium. Each chain that uses it must have a username to log in through Selenium. With this mechanism, a path to a gz file is returned instead of a URL.
Added an option in main.py, --prices-with-promos, to output a prices JSON file in which prices reflect the latest promotions (the 'final_price' key holds the price after promotions, while 'price' is the price before promotions).
Fixed a small bug in BinaWebClient by checking that the filename does not contain 'null'.
Changed Hierarchy of chains such that it includes the webclients.
Added the date to the output filenames to start storing the data over time.
Black formatting (according to PEP 8 guidelines).
Changed the chains_dict in main to a constant one.
2022-10-04 11:42:36 +03:00
korenLazar
b5db721a3d
Merge pull request #6 from korenLazar/test-scraping
Test scraping
2021-08-18 12:26:23 +03:00
KorenLazar
90cab0a2e1
Minor changes
2021-08-18 11:32:04 +03:00
KorenLazar
87b6fbe2b0
Changed ClubID enum class to include a string field used for printing, and define ClubID.OTHER as a default value for the class to handle invalid inputs.
2021-08-18 11:30:31 +03:00
KorenLazar
322995ba15
Added TODO for ordering the argparse
2021-08-18 11:16:25 +03:00
KorenLazar
294dee8cc2
Added a test for searching different files' URLs. Specifically, asserting that searching for non-full files does not yield URLs of full files.
2021-08-17 13:08:39 +03:00
KorenLazar
cffdd84086
Added specific searching for the download url of non-full promotions and prices files. Changed return value of get_download_url accordingly.
2021-08-17 13:06:42 +03:00
KorenLazar
3770352d04
Added new requirements to requirements.txt
2021-08-17 09:35:20 +03:00
KorenLazar
63fec1490c
Added new requirements to requirements.txt
2021-08-17 09:18:45 +03:00
KorenLazar
c1281cb312
Added a test for scraping the promotions and exporting them to xlsx files.
2021-08-16 23:09:10 +03:00
KorenLazar
1a88ed6e01
minor changes
2021-08-16 23:08:04 +03:00
KorenLazar
9b0ab013c9
Added requirements to requirements.txt
2021-08-16 23:07:32 +03:00
KorenLazar
1a6707341d
Logical fixes in promotions scraping and calculation.
2021-08-16 23:07:07 +03:00
KorenLazar
844a106c57
Added tqdm
2021-08-16 23:05:16 +03:00
KorenLazar
c793057623
Documentation and minor changes
2021-08-16 14:06:54 +03:00
KorenLazar
13991aaa40
Documentation and minor changes
2021-08-16 14:05:22 +03:00
KorenLazar
b3d410306d
Removed filtering by PRODUCTS_TO_IGNORE
2021-08-16 14:04:46 +03:00
korenLazar
62089dd538
Merge pull request #5 from korenLazar/export-promotions-to-xlsx-table
Export promotions to xlsx table
2021-08-16 12:51:48 +03:00
KorenLazar
03ff6d5281
Changed create_items_dict function to include the non-full prices file in the items dictionary.
Changed log_products_prices to work with an items dictionary and a __repr__ function of the Item class.
2021-08-16 12:44:32 +03:00
KorenLazar
e09b2da4a1
removed get_all_deals function
2021-08-16 12:43:01 +03:00
KorenLazar
58bb04f1dd
Added get_all_promos_tags function and included the non-full promotions file in the promotions collection.
2021-08-16 12:42:38 +03:00
KorenLazar
ebb1e912b9
Change INFO logging format
2021-08-16 12:40:06 +03:00
KorenLazar
98dcc1c33d
Add price_by_measure member to Item object
2021-08-16 12:39:28 +03:00
korenLazar
8a726ff605
Merge pull request #4 from korenLazar/export-promotions-to-xlsx-table
finished implementing exporting promotion to xlsx table and automatic…
2021-06-17 10:36:20 +03:00
KorenLazar
27b45a4999
finished implementing exporting promotion to xlsx table and automatically opening the xlsx file
2021-06-01 21:00:40 +03:00
KorenLazar
ec505dba67
minor rephrasing in documentation
2021-05-18 14:34:11 +03:00
korenLazar
e740b122ff
Merge pull request #1 from 1kamma/master
This will be better for Windows and Unix-based systems
2021-04-17 18:34:11 +03:00
d4ba19bf41
remove_unneeded
2021-04-17 12:07:08 +03:00
2a4b6562b7
change encoding
2021-04-17 12:06:25 +03:00
KorenLazar
9f5464317d
Added tests for the promotion functions for Shufersal and CoOp. Also made minor design changes in promotion.py and item.py.
2021-03-08 14:13:30 +02:00
KorenLazar
c86fc7c1ab
Moved to writing solely to CSV. Added some columns and drastically improved the logic behind the price-after-promotion column.
2021-02-25 20:54:44 +02:00
KorenLazar
8aa33cbcda
added columns to csv: price after promotion, discount in percentage and promotion type (regular/club/credit card).
2021-02-23 08:27:00 +02:00
KorenLazar
850d3963fe
has added binaproject clients
2021-02-07 10:46:54 +02:00
KorenLazar
9983d07c2b
replaced the member '_class_name' by the 'class.__name__'
2021-02-07 08:18:22 +02:00
KorenLazar
67bff9fa76
minor changes
2021-02-06 22:53:18 +02:00
KorenLazar
18f3fa32b9
has added many chains
2021-02-06 21:42:31 +02:00
KorenLazar
5aa4cd734d
changed chains' members to be 'immutable static'
2021-02-06 15:57:05 +02:00
KorenLazar
3a57edf5af
has added RamiLevi to the chains collection
2021-02-06 14:41:04 +02:00
KorenLazar
d7e5b709f8
Added a CSV-format promotions file when running --promos. Item.py was added for modularity.
2021-01-28 16:25:38 +02:00
KorenLazar
47c0d04ce4
Updated README.md with latest changes and directory names to be more meaningful
2021-01-28 14:13:34 +02:00
KorenLazar
6bcbdee486
added sorting by promotion length (shorter first)
2021-01-09 16:35:55 +02:00
KorenLazar
ee35e3436c
Has added Zol Vebengadol. Fixed small bug in Co-Op.
2020-12-26 17:18:21 +02:00
KorenLazar
be47a5ad81
Added the Co-Op chain. Extensibility to other chains has significantly improved as well.
2020-12-17 22:48:10 +02:00
KorenLazar
4c0eba1771
added --find_promos_by_name option
2020-12-16 16:09:43 +02:00
KorenLazar
db6bfb3632
changed create_bs mechanism and moved to simply printing to stdout in --find_store_id and --price
2020-12-15 11:28:17 +02:00
KorenLazar
975447dcd7
split into multiple files
2020-12-13 13:33:07 +02:00
KorenLazar
c16f0bada9
Merge branch 'master' of https://github.com/korenLazar/shufersal-scraping
2020-12-07 13:50:03 +02:00
KorenLazar
795d7050a5
has added gitignore file
2020-12-07 13:49:05 +02:00