Vadella, Anna
|
d4777b5e72
|
Updated page.tsx
Updated home page, still using preprocessed_data.xlsx because using updated_datav2.xlsx basically crashes the frontend rn lol :p
|
2026-04-01 13:02:29 -04:00 |
|
IshaAtteri
|
90a551c048
|
Merge branch 'main' of https://github.com/IshaAtteri/datamining_881
changes to embeddings for plot
|
2026-03-26 14:05:55 -04:00 |
|
IshaAtteri
|
ee358acf64
|
text embeddings for plot
|
2026-03-26 14:05:02 -04:00 |
|
Vadella, Anna
|
3a912bf09e
|
Frontend changes
I still need to change the path of the excel in convert_to_JSON.js to be the updated data xlsx lmao
|
2026-03-26 12:35:58 -04:00 |
|
prabhaavp
|
24e0d2cc21
|
Ground Truth Spreadsheets and Code used to move all pictures into a folder
|
2026-03-26 01:18:43 -04:00 |
|
IshaAtteri
|
496761ca78
|
director and cast preprocessing
|
2026-03-25 18:21:16 -04:00 |
|
IshaAtteri
|
233fa3df17
|
preprocessing changes
|
2026-03-25 18:14:03 -04:00 |
|
IshaAtteri
|
41eaba161b
|
structural changes
|
2026-03-19 12:49:10 -04:00 |
|
IshaAtteri
|
c5d1ff3ab4
|
some changes
|
2026-03-19 12:45:32 -04:00 |
|
IshaAtteri
|
db645f3bbe
|
changes
|
2026-03-19 12:32:56 -04:00 |
|
prabhaavp
|
492160c3a3
|
Revisions to Zim parsing, netflix parsing, and updates to html scraping to include synopsis
|
2026-03-19 01:56:14 -04:00 |
|
prabhaavp
|
279fe399ed
|
Minor Cleanup of files. Moved to unused folder.
|
2026-03-17 01:24:09 -04:00 |
|
prabhaavp
|
2638de1191
|
The code to extract zim into a spreadsheet.
|
2026-03-12 14:19:40 -04:00 |
|
IshaAtteri
|
a435592f75
|
Merge branch 'main' of https://github.com/IshaAtteri/datamining_881 into isha
|
2026-03-12 12:41:15 -04:00 |
|
IshaAtteri
|
437492e623
|
small changes
|
2026-03-12 12:16:51 -04:00 |
|
prabhaavp
|
525e359c6b
|
- Html -> TSV
|
2026-03-12 12:14:31 -04:00 |
|
IshaAtteri
|
a1beba6730
|
beatifulsoup extract code
|
2026-03-12 12:11:37 -04:00 |
|
prabhaavp
|
1614d85270
|
- Fixed Bug: Certain characters can't be used for folder names. Need to fix it so those characters are removed. There is now a sanitize_slug function used
|
2026-03-10 14:45:45 -04:00 |
|
prabhaavp
|
cfbddf2a24
|
- Updates to make it name the folder the name of the wikipedia slug. Fix needed: Certain characters can't be used for folder names. Need to fix it so those characters are removed.
|
2026-03-10 14:15:33 -04:00 |
|
IshaAtteri
|
8fa2cdba3c
|
preprocessing script
|
2026-03-10 14:14:59 -04:00 |
|
prabhaavp
|
36af063777
|
- Delete the folders if we skipped a movie due to not being found
|
2026-03-10 13:17:21 -04:00 |
|
prabhaavp
|
0ac1234afa
|
- Fix directories
|
2026-03-10 13:10:25 -04:00 |
|
prabhaavp
|
401e7e5497
|
- Extract info needed from ZIM file
|
2026-02-12 20:07:09 -05:00 |
|
IshaAtteri
|
0cc571727b
|
wikipedia movie scraping using api code
|
2026-02-11 17:51:38 -05:00 |
|
prabhaavp
|
2d2ee64c0e
|
- Added venv instruction + requirements.txt
- Added data folder structure with .gitkeep
- Added .gitignore
- Added load.py to load IMDB dataset and preview with D-Tale
|
2026-02-03 22:21:41 -05:00 |
|