I'm always excited to take on new projects and collaborate with innovative minds.
Deployed and configured DSpace, an open-source digital repository system, for the Central Library of Oran1 University. Automated metadata extraction using Python to streamline the digital archiving process.
Description:
This project involved the installation and configuration of DSpace, an open-source software used for building digital libraries and archives. The goal was to create a digital repository system for the Central Library of Oran1 University, enabling the collection, storage, and distribution of digital documents, such as academic theses. The deployment included preparing a Linux Debian server, configuring Nginx, PostgreSQL, and setting up SSL/TLS for secure access.
A significant challenge was the high volume of theses needing digitization. To address this, I developed a Python script using the PyPDF2 library to automatically extract metadata (thesis title, author, committee members) from the first page of PDF files and convert this information into a CSV format. This CSV file could then be easily imported into DSpace, reducing the manual workload for library staff and accelerating the archiving process. The solution reduced an 8-month task to just one week, significantly improving efficiency.
Key Features:
Technologies Used:
Images:
Your email address will not be published. Required fields are marked *