Heshan Sanjuka hexsyro

Hi, I'm Heshan 👋

I build web scraping, automation systems, and data products — from scrapers that handle dynamic sites at scale to full production platforms that serve structured data via REST APIs.

🚀 What I Do

High-performance web scraping (dynamic + static sites)
Multi-source data aggregation pipelines
Dataset cleaning, enrichment & normalization
Secure REST API development
Automated data collection systems

📂 Projects

📰 PulseAggregator

Production news aggregation platform that indexes 6,900+ articles per run from 250+ live sources including BBC, Reuters, The Guardian, TechCrunch, and more.

Built with a 4-tier RSS fallback chain, Playwright-powered scraper for paywalled and dynamic sources, and an hourly APScheduler pipeline. Features full-text search, keyword alerts, weekly digest emails, and a REST API.

FastAPI · PostgreSQL · Playwright · Next.js · APScheduler · Resend · Railway · Supabase

🔐 SocialIntel

Production-grade OSINT dataset marketplace aggregating, enriching, and distributing structured social media data across Reddit, YouTube, GitHub, and Medium.

Features a scalable scraping architecture, automated enrichment pipelines, and secure subscription-based API delivery. Datasets cover AI training, financial sentiment, brand monitoring, and market intelligence.

FastAPI · PostgreSQL · Next.js · Paddle · JWT · AWS S3

📜 GoodQuote Scraper

Multi-page Python scraper extracting quotes, authors, and tags from Goodreads into structured datasets.

Python · BeautifulSoup · CSV

🛠 Tech Stack

Scraping & Automation

Playwright (Python & Node)
Selenium
BeautifulSoup
Requests / HTTPX
Asyncio

Data Processing

Pandas · NumPy
CSV / JSON / JSONL / Parquet exports
lxml (XML/RSS repair)

Backend

FastAPI
PostgreSQL (asyncpg)
JWT Authentication
APScheduler
Resend (transactional email)

Frontend

Next.js 15 (App Router)
Tailwind CSS
TypeScript

Infrastructure

Railway (API hosting)
Vercel (frontend)
Supabase (managed PostgreSQL)
Docker

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Heshan Sanjuka hexsyro

Block or report hexsyro