Big Data Management Systems
Course Description
The emergence of sophisticated web applications, the proliferation of social networks, and the massive deployment of sensor networks and other data-producing applications have led to an exponential growth of data volumes, unforeseen just a few years ago. At the same time, the incorporation of a variety of data formats (structured, semi-structured, and unstructured) into mainstream data analysis, together with the velocity of modern applications, is revising the fundamental aspects of data management. The era of big data has mandated a new generation of data management systems: not necessarily relational, focused on fault tolerance and availability, involving cloud-based computation, distributed in nature, and exploiting large main memories. The goal of this course is to delineate the challenges in managing big data, present the various systems that have emerged for this purpose, and provide representative implementations and applications. The following systems will be presented: MapReduce/Hadoop, Redis, MongoDB, Neo4j, and Azure Stream Analytics.
Learning Outcomes
The use of data in making correct, valid, and timely decisions has become a "must-have" success factor for most modern businesses and organizations. At the same time, in recent years, new technologies and applications – the spread of social networks, the widespread use of smartphones, the installation of sensors, etc. – have dramatically changed the volume and form of data: we now handle petabytes and exabytes of data in text, audio, video, and image formats. The need to manage and exploit this data has led to a new generation of systems, models, and programming tools – still in their infancy – such as MapReduce, Hadoop and its ecosystem, and NoSQL stores: technologies that allow large-scale, fault-tolerant parallel processing of data. The purpose of this course is to present the basic principles of these systems and how they operate.
Upon completion of the course, students should be able to:
- understand the concept of a "data analysis pipeline", the phases of such a pipeline, and the implementation requirements of each phase,
- use HDFS to store and retrieve data, develop MapReduce jobs to answer specific queries, and gain a first exposure to the Hadoop ecosystem (see the word-count sketch after this list),
- use a key-value store such as Redis from a programming language such as Python or Java to support applications that require one (see the Redis sketch below),
- use a document store such as MongoDB and write queries over JSON documents (see the MongoDB sketch below),
- use a graph database such as Neo4j and write queries in a graph query language such as Cypher (see the Neo4j sketch below),
- define simple continuous queries over data streams using an extended SQL query language and a stream engine such as Azure Stream Analytics (see the windowed-query sketch below).
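To ground the MapReduce outcome, the following is a minimal word-count sketch for Hadoop Streaming, which lets the mapper and reducer be plain Python scripts reading stdin and writing stdout. The file name and sample paths are illustrative, not course material.

```python
#!/usr/bin/env python3
# wordcount.py -- Hadoop Streaming word count.
# Run as "wordcount.py map" for the mapper, "wordcount.py reduce" for the reducer.
import sys

def mapper():
    # Emit one tab-separated (word, 1) pair per word read from stdin.
    for line in sys.stdin:
        for word in line.split():
            print(f"{word}\t1")

def reducer():
    # Hadoop sorts mapper output by key, so equal words arrive contiguously.
    current_word, current_count = None, 0
    for line in sys.stdin:
        word, count = line.rstrip("\n").split("\t")
        if word != current_word:
            if current_word is not None:
                print(f"{current_word}\t{current_count}")
            current_word, current_count = word, 0
        current_count += int(count)
    if current_word is not None:
        print(f"{current_word}\t{current_count}")

if __name__ == "__main__":
    mapper() if sys.argv[1:] == ["map"] else reducer()
```

Data moves in and out of HDFS with `hdfs dfs -put` / `hdfs dfs -get`, and the job is submitted with something like `hadoop jar hadoop-streaming.jar -files wordcount.py -mapper "wordcount.py map" -reducer "wordcount.py reduce" -input /data/in -output /data/out` (the jar location and paths vary by installation).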
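For the key-value outcome, here is a minimal sketch using the redis-py client; the connection settings and key names are assumptions for illustration.

```python
import redis

# Connect to a local Redis server; decode_responses returns str instead of bytes.
r = redis.Redis(host="localhost", port=6379, decode_responses=True)

# Plain string key/value.
r.set("user:42:name", "Alice")
print(r.get("user:42:name"))        # -> Alice

# Hash: one key holding several fields, a common modeling pattern.
r.hset("user:42", mapping={"name": "Alice", "city": "Athens"})
print(r.hgetall("user:42"))         # -> {'name': 'Alice', 'city': 'Athens'}

# Atomic counter, e.g. for page views.
print(r.incr("user:42:visits"))     # -> 1 on first call
```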
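For the document-store outcome, a minimal pymongo sketch follows; the database, collection, and field names are illustrative.

```python
from pymongo import MongoClient

client = MongoClient("mongodb://localhost:27017")
posts = client["blog"]["posts"]     # database "blog", collection "posts"

# Insert a JSON-like document (a Python dict).
posts.insert_one({"author": "alice", "tags": ["big-data", "nosql"], "likes": 10})

# Query: posts by alice with at least 5 likes, projecting two fields.
for doc in posts.find({"author": "alice", "likes": {"$gte": 5}},
                      {"_id": 0, "author": 1, "likes": 1}):
    print(doc)
```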
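For the graph-database outcome, here is a minimal sketch with the official neo4j Python driver, passing Cypher as strings; the URI, credentials, and the tiny movie graph are assumptions.

```python
from neo4j import GraphDatabase

driver = GraphDatabase.driver("bolt://localhost:7687", auth=("neo4j", "password"))

with driver.session() as session:
    # Idempotently create two nodes and an ACTED_IN relationship between them.
    session.run(
        "MERGE (p:Person {name: $name}) "
        "MERGE (m:Movie {title: $title}) "
        "MERGE (p)-[:ACTED_IN]->(m)",
        name="Keanu Reeves", title="The Matrix",
    )
    # Cypher query: which person acted in which movie?
    result = session.run(
        "MATCH (p:Person)-[:ACTED_IN]->(m:Movie) "
        "RETURN p.name AS actor, m.title AS movie"
    )
    for record in result:
        print(record["actor"], "->", record["movie"])

driver.close()
```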
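Finally, for the streaming outcome, below is a sketch of a windowed continuous query in the SQL-like language used by engines such as Azure Stream Analytics. It is held in a Python string purely for presentation; in practice the query text is defined in the job itself (e.g., in the Azure portal), and the input/output aliases and the EventTime field are assumptions.

```python
# Count events per device over 10-second tumbling windows.
TUMBLING_WINDOW_QUERY = """
SELECT DeviceId, COUNT(*) AS EventCount
INTO [output]
FROM [input] TIMESTAMP BY EventTime
GROUP BY DeviceId, TumblingWindow(second, 10)
"""
```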