General files or links of interest
| File(s) | Description | Last modified |
|---|---|---|
| C introduction | Local UW C Tutorial | 2015 |
| MMDS site | Mining of Massive Datasets web site | Constantly |
| Web archive | Way back web searching | Constantly |
| survey paper | Wooyoung Kim, Parallel Clustering Algorithms: Survey | 2009 |
| hash-table.zip | A fully functional hash table system with an example of its usage | 2011 |
| J48.java | Decision tree algorithm and code | 1999? |
The lecture links below will be filled in during the semester. The talks will be given in roughly the order shown in the table.
| Topic | Link | Description |
|---|---|---|
| DBDDAS concepts | dbddas+examples | Lectures about Big Data, dynamic data apps, and real examples |
| Problems | sentences | Distance k computing on sentence datasets |
| | common problems | Interesting common problems in my overall courses |
| Basics | hashtables | Hash tables and hash functions |
| Big Data and Data mining concepts | mapreduce | MapReduce and friends, including workflow systems |
| | pig-oink | Apache Pig and Sandia Oink |
| | clustering | General clustering algorithms for both Euclidean and non-Euclidean datasets |
| | machine-learning | An introduction to machine learning, aka supervised clustering algorithms |
| | dim-reduction | Dimension reduction algorithms |
| | find | Finding similar items |
| | mining-data-streams | Data mining of streams |