| |
May 04, 2026
|
|
|
|
|
2024-2025 Undergraduate Catalog [ARCHIVED CATALOG]
|
CSE 477LEC - Processing of Strings and Sequences This course is intended for students interested in learning efficient techniques for processing and analyzing large text collections, such as large-scale system logs, massive text corpora or databases of DNA sequences. The main focus is on classic algorithms and data structures for strings and sequences, including pattern matching, pairwise comparison, indexing and searching, as well as probabilistic methods, like fingerprinting and hashing. The theoretical component is complemented by practical considerations regarding efficient implementations of the discussed algorithms, and their applications in the real-world systems. The example applications include tools like UNIX grep, frameworks for plagiarism detection, as well as tools driving computational biology (e.g., BLAST, DNA assemblers, etc.). The course has also a programming component, in which students implement in their language of choice small but fully functional text processing applications.
Credits: 3
Grading Graded (GRD)
Typically Offered: Spring
Requisites: Prerequisite: CSE 250
|
|