PyCon MY 2024: Using Python on SPRM Offenders Data
Presentation at PyCon MY 2024 on our experience scraping the SPRM (MACC) website's database of offenders, with a preliminary analysis of the data.
SPRM (the Malaysian Anti-Corruption Commission, MACC) lists offenders on its website. The database is for public use, particularly to assist others in conducting background checks. The data includes the offenders' images and other details: personal information, a summary of the offense, the penalty, and employer information.
Using a crawler written in Python, we converted the data into a machine-readable format that will be made publicly available. From this dataset alone, there are many use cases to consider: contributing to the Open Sanctions Database, matching names against other persons-of-interest databases such as CIDB and ICIJ, and using the images for facial recognition in security checks or other applications.
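As an illustration of the name-matching use case, here is a minimal sketch that uses only the Python standard library. It is not the method used in the talk. The function names, the list of tokens to ignore, the 0.85 threshold, and the sample names are all assumptions made for this example, and the names are fictional.

```python
import unicodedata
from difflib import SequenceMatcher

# Patronymic markers and honorifics dropped before comparison.
# Illustrative only; a real list would need to be much more complete.
NOISE_TOKENS = {"bin", "binti", "bt", "b", "a/l", "a/p", "dato", "datuk", "haji", "hj"}


def normalize_name(name: str) -> str:
    """Lowercase, strip accents and basic punctuation, drop noise tokens."""
    ascii_name = (
        unicodedata.normalize("NFKD", name)
        .encode("ascii", "ignore")
        .decode("ascii")
    )
    cleaned = ascii_name.lower().replace(".", " ").replace(",", " ").replace("'", "")
    return " ".join(t for t in cleaned.split() if t not in NOISE_TOKENS)


def name_similarity(a: str, b: str) -> float:
    """Similarity in [0, 1] between two names after normalization."""
    return SequenceMatcher(None, normalize_name(a), normalize_name(b)).ratio()


def find_matches(offenders, watchlist, threshold=0.85):
    """Return (offender, watchlist_name, score) for pairs at or above threshold."""
    matches = []
    for offender in offenders:
        for candidate in watchlist:
            score = name_similarity(offender, candidate)
            if score >= threshold:
                matches.append((offender, candidate, round(score, 3)))
    # Strongest candidates first, so a reviewer sees them at the top.
    return sorted(matches, key=lambda m: m[2], reverse=True)


if __name__ == "__main__":
    # Fictional sample names, not taken from any real dataset.
    offenders = ["Ahmad bin Ali", "Tan Mei Ling"]
    watchlist = ["AHMAD B. ALI", "Lim Wei Jie"]
    for row in find_matches(offenders, watchlist):
        print(row)
```

A match on names alone is only a lead for manual review. Different people often share a name, so a real pipeline would likely also compare other fields, such as the employer, before treating two records as the same person.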
| Attachments | Type |
|---|---|
| Pycon MY 2024: Using Python on SPRM Offenders Data | Link |
| Pycon MY 2024_ Using Python on SPRM Offenders Data.pdf | File |
