AWAN LABs
SMD_Scraper
SMD_Scraper
A specialized digital hub that merges social media streams with a searchable archive of Arabic digital newspapers. It harvests posts from X, Facebook, Instagram, and leading news sites, cleans and tags the content with NLP models, then files every item into topic-based databases — from economy to rights, gender, environment, sports, and more.
A specialized digital hub that merges social media streams with a searchable archive of Arabic digital newspapers. It harvests posts from X, Facebook, Instagram, and leading news sites, cleans and tags the content with NLP models, then files every item into topic-based databases — from economy to rights, gender, environment, sports, and more.
{,}
{,}
How it works
How it works
Open Datasets
Open Datasets
You can find below demo datasets with real data
You can find below demo datasets with real data
Order a customized dataset
Order a Custom
Dataset
Are you working on a report or investigation?
Check the SMD you’d like to have and order it using this form
Are you working on a report or investigation?
Check the SMD you’d like to have and order it using this form
Contact us
info@awanservices.net
Contact us
info@awanservices.net
Data intake
API feeds + web crawlers pull new posts and articles 24/7
Analytics dashboards
One click reveals timelines, engagement spikes, network graphs, and heat maps so reporters can spot patterns fast
Curation & tagging
Duplicate removal, Arabic text normalization, entity and topic labeling
Smart search
A multilingual engine (MSA + dialects) lets journalists query by keywords, entities, or time ranges-ranking the most newsworthy results first
Export
Findings download as CSV/Excel/JSON, or entire search sessions can be saved and shared
Why it matters
Why it matters
Turns raw digital noise into verifiable evidence for data-driven stories.
Turns raw digital noise into verifiable evidence for data-driven stories.
Cuts research time from days to minutes, freeing reporters for deep investigative work.
Cuts research time from days to minutes, freeing reporters for deep
investigative work.
Uncovers hidden trends—coordinated disinformation, hate speech waves, lobbying campaigns—that traditional sources miss.
Uncovers hidden trends, coordinated disinformation, hate speech waves, lobbying campaigns that traditional sources miss.
Builds a collaborative knowledge base for newsrooms across the Arab world, strengthening accountability and transparency.
Builds a collaborative knowledge base for newsrooms across the Arab world, strengthening accountability and transparency.
1
1
2
2
3
3
4
4
Discover More


