prabhaavp
|
525e359c6b
|
- Html -> TSV
|
2026-03-12 12:14:31 -04:00 |
|
prabhaavp
|
1614d85270
|
- Fixed Bug: Certain characters can't be used for folder names. Need to fix it so those characters are removed. There is now a sanitize_slug function used
|
2026-03-10 14:45:45 -04:00 |
|
prabhaavp
|
cfbddf2a24
|
- Updates to make it name the folder the name of the wikipedia slug. Fix needed: Certain characters can't be used for folder names. Need to fix it so those characters are removed.
|
2026-03-10 14:15:33 -04:00 |
|
prabhaavp
|
36af063777
|
- Delete the folders if we skipped a movie due to not being found
|
2026-03-10 13:17:21 -04:00 |
|
prabhaavp
|
0ac1234afa
|
- Fix directories
|
2026-03-10 13:10:25 -04:00 |
|
prabhaavp
|
401e7e5497
|
- Extract info needed from ZIM file
|
2026-02-12 20:07:09 -05:00 |
|
IshaAtteri
|
0cc571727b
|
wikipedia movie scraping using api code
|
2026-02-11 17:51:38 -05:00 |
|
prabhaavp
|
2d2ee64c0e
|
- Added venv instruction + requirements.txt
- Added data folder structure with .gitkeep
- Added .gitignore
- Added load.py to load IMDB dataset and preview with D-Tale
|
2026-02-03 22:21:41 -05:00 |
|