Pycon MY 2024: Using Python on SPRM Offenders Data
Panel, Presentation

published Aug 27, 2024, last modified Apr 10, 2026

Presentation at Pycon MY 2024 on the experience of scraping the SPRM (MACC) website's database of offenders, with a preliminary analysis of the data.

 

SPRM (the Malaysian Anti-Corruption Commission) lists offenders on its website. The database is for public use, particularly to help others conduct background checks. Each entry includes the offender's image and other details: personal information, a summary of the offense, the penalty, and employer information.
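As a rough illustration of how one entry with the fields listed above could be represented in machine-readable form, here is a minimal sketch. The class name, field names, and sample values are hypothetical placeholders, not the actual schema of the SPRM site or of the published dataset.

```python
import json
from dataclasses import dataclass, asdict
from typing import Optional


@dataclass
class OffenderRecord:
    # Hypothetical field names that mirror the categories described above;
    # they are not the real column names used by the SPRM website.
    name: str
    offense_summary: str
    penalty: str
    employer: Optional[str] = None
    image_url: Optional[str] = None


def to_json_lines(records):
    """Serialize records as JSON Lines: one JSON object per line."""
    return "\n".join(json.dumps(asdict(r), ensure_ascii=False) for r in records)


# Placeholder values for illustration only, not real data.
sample = [
    OffenderRecord(
        name="EXAMPLE NAME",
        offense_summary="example summary",
        penalty="example penalty",
    )
]
print(to_json_lines(sample))
```

A flat, line-per-record format like this is easy to load into other tools, which is one reason it is a common choice for scraped datasets.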

Using a crawler written in Python, we converted the data into a machine-readable format that will be made publicly available. This dataset alone opens up many use cases: contributing to the OpenSanctions database, matching names against other persons-of-interest databases such as CIDB and ICIJ, and using the images for facial recognition in security checks or other applications.
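The name-matching use case could be sketched roughly as follows, using only the Python standard library. This is an assumed approach for illustration, not the method used in the presentation: the function names, the 0.85 threshold, and the sample names are all made up, and real matching against CIDB or ICIJ data would likely need more careful normalization.

```python
import re
from difflib import SequenceMatcher


def normalize_name(name: str) -> str:
    """Uppercase, replace punctuation with spaces, and collapse whitespace."""
    cleaned = re.sub(r"[^\w\s]", " ", name.upper())
    return " ".join(cleaned.split())


def best_match(name, candidates, threshold=0.85):
    """Return (candidate, score) for the closest candidate, or None.

    The threshold is an arbitrary illustrative value, not a tuned one.
    """
    target = normalize_name(name)
    best, best_score = None, 0.0
    for candidate in candidates:
        score = SequenceMatcher(None, target, normalize_name(candidate)).ratio()
        if score > best_score:
            best, best_score = candidate, score
    if best is not None and best_score >= threshold:
        return best, best_score
    return None


# Made-up names for illustration only.
other_database = ["ALI BIN ABU", "SITI BINTI AHMAD"]
print(best_match("Ali  bin Abu.", other_database))
print(best_match("Completely Different", other_database))
```

Fuzzy matching on names alone produces false positives, so any match found this way would be a lead to verify against other fields rather than a confirmed identity.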

Attachments Type
Pycon MY 2024: Using Python on SPRM Offenders Data Link
Pycon MY 2024_ Using Python on SPRM Offenders Data.pdf File
Contributors
Khairil Yusof

Partners

Related Project
Politikus
Rights
Creative Commons Attribution license (reuse allowed)