150 Commits

Author SHA1 Message Date
21e325a968 refactor: improve data fetching in Stats page 2026-01-29 19:31:51 +00:00
5332af57e8 refactor: remove unnecessary WordCloud abstraction 2026-01-29 15:02:24 +00:00
64c3422395 refactor: move pandas processing out of Stat Obj
Also improves error handling and logging
2026-01-29 14:52:43 +00:00
0a45bd6855 docs: update README & requirements
Remove old reqs no longer needed
2026-01-28 20:04:30 +00:00
dc2fb39fb7 fix: incorrect attr name for word cloud 2026-01-28 19:54:38 +00:00
506ef3bd46 refactor: rename endpoint to events_per_day 2026-01-28 19:48:37 +00:00
85bcdb336b chore: remove unused dependencies 2026-01-28 19:42:38 +00:00
119032d524 refactor: move stat generation into separate class
Stats are pre-computed as well, improving performance
2026-01-28 19:41:38 +00:00
855546677f fix: heatmap colouring based on max value 2026-01-28 19:00:05 +00:00
1466b05bde feat: hourly heatmap of new posts in Stats Page 2026-01-27 20:26:09 +00:00
2a255fb983 refactor: rename main stat page to StatPage 2026-01-27 19:52:11 +00:00
dd8070dd51 chore: add nivo/heatmaps dependency 2026-01-27 19:51:25 +00:00
71b66252a6 feat: heatmap endpoint in Flask 2026-01-27 19:44:36 +00:00
Dylan De Faoite
92c5d3c2cd Update README.md 2026-01-27 19:00:19 +00:00
dda1c1fee8 feat: word cloud on stat page 2026-01-27 18:37:42 +00:00
ecc1a62d24 feat: word frequency endpoint with stop word removal 2026-01-27 18:18:32 +00:00
dcdeafd7a9 update NPM libraries 2026-01-27 17:41:38 +00:00
3fefe4f51e feat: graph showing posts per day in React 2026-01-27 17:41:30 +00:00
1234ffde85 add stat page and separate home page and upload 2026-01-27 17:36:24 +00:00
2fcb4eabae add react router 2026-01-27 17:32:08 +00:00
5c1e65b606 remove unused package.json 2026-01-27 17:29:53 +00:00
200645a4e0 update requirements.txt 2026-01-27 15:06:56 +00:00
8394673b3c feat: posts_per_day and comments_per_day endpoints in Flask 2026-01-27 13:20:51 +00:00
2482c1da1c add error message in React Page 2026-01-27 13:20:07 +00:00
d78c58a70c fix comment datetime to be parsed to timestamp in YoutubeAPI 2026-01-27 12:37:16 +00:00
e72d336de4 remove /data subdir
Dataset is now created in the pwd. Avoids issues if the folder didn't exist
2026-01-27 12:36:01 +00:00
2401875a19 combine posts and comments endpoint 2026-01-27 12:14:33 +00:00
ffba2d78c2 add two inputs for posts and comments 2026-01-27 12:14:22 +00:00
322b69825c add package lock 2026-01-27 12:00:06 +00:00
82bd9a7a9b add upload post endpoint in flask app 2026-01-27 11:59:01 +00:00
524a5d2619 Add react app 2026-01-27 11:58:08 +00:00
ff2b08fc2d update gitignore 2026-01-27 11:50:56 +00:00
7d94494fe2 youtube connector returns posts and comments in a flat manner 2026-01-24 20:19:15 +00:00
d96845d48b implement pagination to search subreddit method & remove timeframe attr
In addition, it now searches new posts instead of top
2026-01-22 17:10:16 +00:00
8f504b7d4d updated reddit api to flatten comments and posts into separate data structures 2026-01-22 17:05:32 +00:00
79cdb7babf remove unused top subreddit posts method 2026-01-22 15:55:33 +00:00
152264bda9 separate comment and post data structures
This allows for a flat data structure, beneficial to data analysis
2026-01-22 15:53:47 +00:00
3c4aad77ef update number of fetched comments and videos from youtube 2026-01-22 15:29:55 +00:00
501dec9dd5 convert YouTube published_at to timestamp 2026-01-22 15:02:55 +00:00
096a415f3b fix datetime from boards.ie not being parsed properly 2026-01-22 14:49:01 +00:00
a34252deda Add response code 500 error handling in reddit api 2026-01-19 22:55:28 +00:00
245ab19183 Add error handling for YouTube comments fetching 2026-01-19 22:54:48 +00:00
2243558e56 update gitignore 2026-01-19 20:57:26 +00:00
09a7c6fc9f remove debug print 2026-01-19 20:53:56 +00:00
187401c5eb Implement YouTube API integration for video and comment fetching 2026-01-19 20:50:17 +00:00
2b0aed0f74 add .env to gitignore 2026-01-19 20:33:25 +00:00
85388ef6aa Add comment limit to _parse_comments method in BoardsAPI
Some boards.ie threads have thousands of comments, which are slow to fetch with pagination
2026-01-19 20:23:11 +00:00
9c66ec8b82 Save to jsonl file after every fetch
Reduces errors and lost data
2026-01-19 20:22:47 +00:00
e9cf51731d Add comment parsing functionality to BoardsAPI
Pagination required due to multiple pages of comments on boards.
2026-01-19 18:24:44 +00:00
415b1ca87e update README 2026-01-17 22:16:56 +00:00