CSE012/CS059 – Data Mining
Spring 2017
|
|
Homework
Free
pass policy: To deal with overlapping
deadlines between courses, each one of you has 4 “free passes” when handing
in an assignment. That is, you have 4 days that you can use for extending the
deadline whenever there is a problem. A free pass is used (if you want) when
the assignment is submitted after the deadline. If more than 24 hours have
passed then a second free pass is used. If you do not have a free pass (or if
you do not want to use one) then the late assignment policy is applied.
Late
assignment policy: The first day of delay removes
10% of the maximum possible grade, the second day 20%, the third 40%, and the
fourth 80%. In the fifth day you lose 100% of the assignment.
Turn-in: You can turn-in the assignment using the
command: turnin assignmentΧ@ple059 <your files>. Give self-explanatory
names to your files, and write your name and AM in the files. The last
turn-in is the one that will be graded and if it is late the late assignment
policy is applied.
Reports: In some assignments you will be asked to write a
short report about your code, or about the results you obtain. For the code,
you need to shortly describe how the code is structured, and how one can run
the code. For the results, you need to look at what the code produces and
write your observations: How well did you do with respect to what you set out
to do? Did you find something interesting? Are there cases to which you
should draw the reader’s attention? This is a very important part of the
assignment. You assignment will be marked based on the report as well.
September Assignment
You can download September Assignment here. The deadline for the
assignment is on September 24, at the end of the day. Turn in the code in the
folder assignment-sept. You can turn in the rest of the assignment electronically,
or on paper. There will be an oral exam on the week following the deadline.
Email me the day and time that would be most convenient for you.
Assignment 4 You can download Assignment 4 here. The deadline for the
assignment is on June 21, at the end of the day. Turn in the code in the
folder assignment4. You can turn in the rest of the assignment electronically,
or on paper. For the second question you will need to submit a solution to
the Kaggle competition for the class (here is the link to the
competition), which has deadline on June 25. Create a Kaggle account with the
department email, so as to have access to the competition. The link to the
competition may not be accessible until the competition is reviewed by the
moderators. There will be an oral examination the week following the
deadline. Email me if you want to do the exam earlier.
Assignment 3 You can download Assignment 3 here. The deadline for the
assignment is on May 21, at the end of the day. Turn in the code in the
folder assignment3. You can turn in the rest of the assignment
electronically, or on paper.
Material for the assignment: The file clinton_trump_tweets.txt, and clinton_trump_user_classes.txt
Assignment 2 You can download Assignment 2 here. The deadline for the
assignment is on May 2, before class. Turn in the code in the folder
assignment2. You can turn in the rest of the assignment electronically, or on
paper.
Material for the assignment: The file stringHash.py
Assignment 1 You can download Assignment 1 here. The deadline for the
assignment is on March 31, 11:59 pm. Turn in the code in the folder
assignment1. You can turn in the rest of the assignment electronically, or on
paper. There will be an examination for the assignment in the week of April 3rd.
Material for the assignment:
· The file data.csv for
Question 2
· The file twitter_dataset.txt
for Question 3
|