I'm always excited to take on new projects and collaborate with innovative minds.

Social Links

Project

DSpace Deployment and Automation

Deployed and configured DSpace, an open-source digital repository system, for the Central Library of Oran1 University. Automated metadata extraction using Python to streamline the digital archiving process.

Client

Oran 1 University - Ahmed Ben Bella.

Start Date

Nov 10, 2022
DSpace Deployment and Automation

Description: 

This project involved the installation and configuration of DSpace, an open-source software used for building digital libraries and archives. The goal was to create a digital repository system for the Central Library of Oran1 University, enabling the collection, storage, and distribution of digital documents, such as academic theses. The deployment included preparing a Linux Debian server, configuring Nginx, PostgreSQL, and setting up SSL/TLS for secure access.

A significant challenge was the high volume of theses needing digitization. To address this, I developed a Python script using the PyPDF2 library to automatically extract metadata (thesis title, author, committee members) from the first page of PDF files and convert this information into a CSV format. This CSV file could then be easily imported into DSpace, reducing the manual workload for library staff and accelerating the archiving process. The solution reduced an 8-month task to just one week, significantly improving efficiency.

Key Features:

  • Server Configuration: Prepared and configured a Linux Debian server with Nginx, PostgreSQL, and SSL/TLS.
  • Digital Repository Setup: Installed and customized DSpace for digital archiving and library services.
  • Automated Metadata Extraction: Developed a Python script to extract metadata from PDF files and generate a CSV for bulk import.
  • Access Control: Implemented user management and permissions using Access Control Lists (ACL).
  • Efficient Archiving: Automated bulk data processing, reducing workload and saving significant time.

Technologies Used:

  • Linux Debian
  • Nginx
  • PostgreSQL
  • DSpace
  • Python (PyPDF2)
  • SMTP Server
  • SSL/TLS

Images:

Screenshot 2024-11-10 120817
 

Share

Leave a comment

Your email address will not be published. Required fields are marked *