
 INFS4203/INFS7203 Data Mining

Announcement

2008/11/17 I have marked all the exam papers. Here is a summary of the results for this course:
 
                     Assignment 1  Assignment 2  Assignment 3  Assignment 4  Mid Term  Final      Total
Mean                 3.85/5        4.29/5        4.35/5        4.94/5        62.06/90  77.59/100  77.78/100
Standard Deviation   0.52          0.47          1.28          0.24          13.65     6.93       5.23

All of you passed this course. Congratulations.

2008/11/06 Assignments 3 and 4 have been marked. The solutions are posted.
2008/10/31 I have noticed that some of you may be very busy. If you cannot come to the university, you may submit Assignment 4 to me via email. I will reply to confirm that I have received it. Thanks.
2008/10/30 Some announcements about the assignment:
  1. There are two issues in the assignment (but I think they will not affect your results):

    (a) For Question 1: The "10%" should be ignored. It was a reference for me when I was designing the assignment, and I forgot to erase it afterward.
    (b) For Question 2: The first statement should be "Given the following sequences", not "Given the following 3-sequences".

    Thank you to the students who pointed out these two issues to me.
     

  2. I have uploaded some hints for Assignment 4. Please click HERE for details. If you have already submitted the assignment, you may re-submit it to me via email or in hardcopy.
     
  3. I have just discovered that the version of Assignment 4 which I posted on the web is different from the one on my computer: Question 2 on the web has some serious problems. That is why quite a number of students asked me about Question 2. Since the due date is tomorrow, I have decided to cancel Question 2 of Assignment 4. Sorry for the confusion. I have uploaded a new, corrected version of Assignment 4. You are welcome to do Question 2, and I will mark it if you have done it, but no marks will be deducted if you have not.
2008/10/23 Note that there is no tutorial tomorrow, as we have already finished the course. Also, as I said in the last lecture, I am on a research trip and will not be in Brisbane until next Wednesday. If you have any questions, please feel free to email me. I will read my email regularly.
2008/10/20 Assignment 4 has been released.

IMPORTANT: As discussed in the lecture, for the final examination, all of the required equations will be provided EXCEPT the following (because they are very useful, simple and common in data mining):
1. How to compute Support and Confidence in Association Rule Mining.
2. How to compute the tf-idf weighting scheme and normalization in Text Mining.
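
For revision, the two computations listed above might be sketched as follows. This is only a minimal illustration under stated assumptions, not the exact formulation from the lecture notes: tf is assumed to be the raw term count, idf is assumed to be log(N / df), and normalization is assumed to be L2 (unit length). The function names and the toy data are made up for this example. Please check the lecture notes for the definitions used in this course.

```python
import math
from collections import Counter


def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= set(t))
    return hits / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Confidence of the rule antecedent -> consequent.

    Computed as support(antecedent and consequent) / support(antecedent).
    Assumes the antecedent occurs in at least one transaction.
    """
    a = set(antecedent)
    both = a | set(consequent)
    return support(transactions, both) / support(transactions, a)


def tf_idf(docs):
    """tf-idf vector for each tokenised document, L2-normalised.

    Assumed variant: tf = raw count, idf = log(N / df).
    """
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        weights = {t: tf[t] * math.log(n / df[t]) for t in tf}
        norm = math.sqrt(sum(w * w for w in weights.values()))
        vectors.append({t: (w / norm if norm else 0.0)
                        for t, w in weights.items()})
    return vectors


# Hypothetical toy data, for illustration only.
transactions = [
    ["bread", "milk"],
    ["bread", "butter"],
    ["bread", "milk", "butter"],
    ["milk"],
]
docs = [
    ["data", "mining", "data"],
    ["text", "mining"],
    ["data", "text"],
]

print(support(transactions, ["bread", "milk"]))
print(confidence(transactions, ["bread"], ["milk"]))
print(tf_idf(docs)[0])
```

Here the rule bread -> milk has support 2/4 and confidence (2/4) / (3/4) = 2/3, and each normalised tf-idf vector has unit length.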

2008/10/16 I have finished marking Assignment 02. You may come to my office to pick it up, or ask a classmate to pick it up for you. The highest mark is 5 and the lowest mark is 3.5. The mean is 4.29, and the standard deviation is 0.47.
2008/10/09 IMPORTANT: I have been told that I have to represent my research group at an inter-faculty meeting on 10/10 (tomorrow), because my research group leader is on leave. According to the agenda, it is a whole-day meeting (9:00 - 4:00). As a result, this week's tutorial (10/10) has to be cancelled. Sorry for the late announcement.
2008/09/24 Since we discussed the mid-semester exam on Monday, we have not taught any new topics this week. Therefore, this week's tutorial is cancelled. Thank you for your attention.
2008/09/22 Assignment 2 has been released.

There are some errors in the Tutorial 2 Q2 Excel file. Thanks to the student who pointed this out to me. However, the final answer/conclusion is not affected.

2008/09/17 A few announcements: (1) There will be no tutorial tomorrow (19/09). Let's take a break after your mid-sem exam. Sorry for the late announcement. (2) We will discuss the mid-sem exam next Monday. (3) I have finished marking all the mid-sem exam papers. You may come and collect your own on or after next Monday.
2008/09/12 I have finished marking Assignment 01. You may come to my office to pick it up, or ask a classmate to pick it up for you. The highest mark is 5 and the lowest mark is 2.5. The mean is 3.85, and the standard deviation is 0.52.
2008/09/08 The solution for Assignment 01 is posted!

The scope of the mid-semester examination is classification and clustering. The format of the questions is similar to that of your assignments and tutorials (i.e. a few short questions).

It is a closed-book, closed-note examination. You are allowed to bring a calculator and an unmarked dictionary.

2008/09/05 Note that the mid-semester examination is scheduled for 15/09 (Monday) from 10:00am to 11:30am.
2008/09/04 As discussed in class, I have corrected some typos in Lecture Note 03 (Classification II). Note that on p. 67 (Advanced Topic), I have tried to make the equation clearer by changing the definition of "c" to "Number of different values for attribute X" instead of using the term "class" (as in the old note), as I am afraid some of you may wrongly interpret "class" as "class label".
2008/09/04 Amendment to Solution 02 Q2: P(Year=1|CA) should be 2/4 = 0.5. I will update the solution after I get back home and scan it.
2008/08/27 Assignment 01 is released. The deadline is 5th Sep 2008. NO LATE SUBMISSION IS ALLOWED (UNLESS YOU HAVE STRONG JUSTIFICATION).
2008/08/11 NOTE: I will be away at a conference, so there will be no class until the 25th (i.e. NO class on the 15th (tutorial), 18th (lecture) and 22nd (tutorial)!!!)
2008/08/06 We have class next Monday!!! (Next Monday is NOT a holiday for the St. Lucia Campus. Sorry for the confusion!!!)
2008/07/21 Tutorials (Friday lessons) start from Week 3.

 

Course Information:

  • Lecture Hour: 10:00am - 11:50am Monday 35-213
  • Tutorial Hour: 2:00pm - 2:50pm Friday 78-224

 

Instructor

  • Name: Dr. Gabriel Fung
    Address: Rm. 638, Building 78
    Email: g.fung@uq.edu.au

 

Lecture Note

 

Tutorial Note

 

Assignments