CS - Research

Big Data Lab Establishment

A Big Data lab has been established in Lahore campus. The use of Big Data is becoming a crucial way for leading companies to outperform their peers. In most industries, established competitors and new entrants alike, leverage data-driven strategies to innovate, compete, and capture value. Big Data will help to create new growth opportunities, and also entirely new categories of companies, such as those that aggregate and analyze industry data.

Students of Dr. Usman Awais and Dr. Zareen Alamgir have successfully setup this lab and cluster. In guidance of Dr. Usman Awais an OpenStack Cloud infrastructure is also established. The deployed OpenStack cloud provides Platform as a Service (PaaS) infrastructure.The PaaS enables the students to analyze many different distributed computing technologies, including Data and Compute Clusters and Grids.The initial set up will be used by master thesis students, and students working on their Final Year Projects (FYP). It will allow the students to have a practical exposure to cloud related technologies. A personalized cloud dashboard can also be made accessible over the internet, in future.

Students can access this Hadoop master cluster from other Labs. Now "Big Data" students can access the Hadoop cluster for their assignments, without going to the Big Data Lab.

Following are the students who made considerable efforts to realize the plan of setting up these labs.

Muhammad Naqeeb : L15-5024
Amir Shahzad: L15-5025
Muhammad Hassan: L14-5017

Current and Past Projects

· Fuzzy clustering of mixed mode data in Apache SPARK

· Exploiting review helpfulness rating on scalable recommendation systems using Apache Spark

· SP-CURE, a clustering algorithm developed on Apache Spark to cluster gigantic datasets

· Personalized User Tag recommender

· for Social Media Photos using Spark

· Tag Recommender for Spark, a system for recommending user tags for huge datasets.

· Social Event based recommendation system in distributed environment

· Distributed Typicality-based Recommendation System for Apache Spark

· Hybrid Recommendation System using Apache Spark Framework

· Keyword based personalized Hotel Recommender on Map Reduce.

· Opinion Mining on Big Data using MapReduce Framework.

· Deep Packet analysis in Software Defined Networks

· Distributed algorithms for Machine Learning (kubernetes operators are being used)

· FRSG to develop a Kubernetes operator

· Distributed Simulation using Docker ( Kubernetes will be included)

Recent Publications

Zareen Alamgir and Hassan Jamil, “Personalized recommender systems for big data using distributed Spark framework”, World Conference on Technology, Innovation and Entrepreneurship (WOCTINE), Istanbul University, Istanbul, Turkey, June 2019.
Zareen Alamgir, “Generating recommendations for customers using bipartite graph”, World Conference on Technology, Innovation and Entrepreneurship (WOCTINE), Istanbul University, Istanbul, Turkey, June 2019.
Zareen Alamgir, Saira Karim and Syed Husnine, “Linear algorithm for generating c-isolated bicliques”, International Journal of Computer Mathematics, Vol 94, Issue 8, pp 1574 – 1590, 2017.
Noshaba Nasir, Kashif Zafar and Zareen Alamgir, “Sentiment Analysis of Social Media using MapReduce”, Women in Data Science, WinDS, Houston, Texas, USA, 2017.

FAST School of Computing Research Groups

Big Data Lab Establishment

IMPORTANT LINKS

DEPARTMENTS