Dec 04, 2024  
2023-2024 Undergraduate Catalog 
    
2023-2024 Undergraduate Catalog [ARCHIVED CATALOG]

CSE 487LEC - Data Intensive Computing


Data-intensive computing deals with storage models, application architectures, middleware, and programming models and tools for large-scale data analytics. In particular we study approaches that address challenges in managing and utilizing ultra-scale data and the methods for transforming voluminous data sets (big data) into discoveries and intelligence for human understanding and decision making. Topics include: storage requirements of big data, organization of big data repositories such as Google File System (GFS) semantic organization of data, data-intensive programming models such as MapReduce, fault-tolerance, privacy, security and performance, services-based cloud computing middleware, intelligence discovery methods, and scalable analytics and visualization. This course has three majors goals: (i) understand data-intensive computing, (ii) study, design and develop solutions using data-intensive computing models such as MapReduce and (iii) focus on methods for scalability using the cloud computing infrastructures such as Google App Engine (GAE), Amazon Elastic Compute Cloud (EC2), and Windows Azure. On completion of this course students will be able to analyze, design, and implement effective solutions for data-intensive applications with very large scale data sets.

Credits: 3

Grading
Graded (GRD)

Typically Offered:
Fall, Spring

Requisites:
Pre-Requisite: CSE 250  and Computer Science, Computer Engineering, or Bioinformatics majors only. Departmental senior standing recommended. Students must complete a mandatory advisement session with their faculty advisor.