Saturday, May 31, 2008

A taste of teaching

Last week, it was our last class with Prof. Foster Provot for this semester. This is a PhD level seminar discussing all kinds of topics related to data mining and machine learning. As the only three registered PhD students in Stern, Xiaohan, Mihaela and I were "pushed" to give (bi-)weekly presentations and lead discussions for every paper on those topics.

Oh, god! That was hard! I couldn't understand this. When I sit down in the class and listen to the professors, they are all talking and smiling, making all kinds of jokes, writing gracefully and drawing nice pictures on the board. They are teaching as if doing something really really really easy. However, when I stood in front of the class, no matter how hard I had prepared, I felt nervous, awkward, and then suddenly forgot what I should say. My tones got wondered and my voice became frozen. My confidence was quickly fading out... In fact, I was pretty confident in my presentation skills because I already had some conference/workshop presentation experiences before. I always felt proud of my cool behavior in front of a group of people. But now the truth was that it did not work here! Teaching in class is totally different from giving a short 20-minute talk, at all! For this, I really admire Foster! He is such a sharp person and a great professor. He can always notice the key point in our thoughts and help us sort it out right away. Often times, his questions are actually helpful and informative "hints", which inspire us to think what we have neglected and then better organize our thoughts.

Prof. Anindya Ghose once told me that when you talk to people, you should try to make your point as clear as you can at the first time. Do not wait for people to find themselves confused and then ask you. I believe this is important, but it is not easy to achieve. Sometimes, when we explain something, we have a tendency to either describe it too much that makes the redundancy, or speak too little that leads to the ambiguity. (It seems that the distribution for the intensity of our explanatory words is "bimodal", either too high or too low.) I like Prof. Panos Ipeirotis's teaching, because his way is highly logic. You feel like you are led into a room, and then get to explore by yourself with encouragements time by time. He does not show the whole picture at one time, but leave to us ourselves to find it out. That is coolest part. You never know how big the picture is! Just like an adventure game!

I sometimes was imaging myself in the future, can I do this well when I become a real professor? Will my students enjoy my teaching too? Yea, I believe so! That is my goal and just keep going:-)

Tuesday, May 20, 2008

Data Mining Blogs: The Big List(ZZ from Sandro Saitta)

Sandro Saitta has a full list about the data mining blogs. Just something very nice that can be introduced here:)

  • Abbott Analytics: both industry and research oriented posts covering any topic related to data mining (Will Dwinnell and Dean Abbott)

  • Crime Analysis and Data Mining: everything is in the title (Shyam Varan Nath)

  • Data Miners Blog: data analysis and visualization from an industry point of view (Data Miners Team)

  • Data Mining, Analytics and Artificial Intelligence: this blog gives news about data mining and AI very frequently (Alberto Roldan)

  • Data Mining et al.: A new blog about data mining with details on particular applications in this field (Georg Russ)

  • Data Mining Lab: the blog of the data mining laboratory at Brigham Young University, mainly about social communities and meta-learning (Data Mining Lab)

  • Data Mining: Text Mining, Visualization and Social Media: a focus on data visualization and the blogosphere (Matthew Hurst)

  • Data Mining in MATLAB: posts related to the use and possibilities of Matlab for data mining related problems (Will Dwinnell)

  • DataSciences Analytics: discuss statistics and predictions among other interesting topics (John Aitchison)

  • Data Strategy: This new blog (started in June) discuss data strategy in general. Data acquisition, visualization and data mining are examples of topics (Chuck Lam)

  • Data Wrangling: comprehensive posts on technology and news related to data mining and machine learning. Also a lot of very useful resources (Pete Skomoroch)

  • Diamond Information and Analytics: analytics and its applications in marketing and operations (Amaresh Tripathy)

  • Foraging in the Data Forest: although not updated recently, this blog has interesting posts about data visualization and statistics (Donald Farmer)

  • Intelligent Machines: news related to data mining, machine learning and artificial intelligence (Damien François)

  • Jamie's Junk: a blog that focus on data mining using Microsoft SQL Server (Jamie Mac)

  • Juice Analytics: data analytics with an emphasis on data visualization and corresponding tools (Juice Team)

  • Machine Learning, etc: Theory behind machine learning and news related to this field (Yaroslav Bulatov)

  • Machine Learning (Theory): a strong emphasis on theoretical aspects of machine learning (John Langford)

  • Machine Learning Thoughts: philosophical and theoretical discussions about machine learning in general (Olivier Bousquet)

  • Math Stats and Data Mining: data mining with a point of view from statistics (Rachel Graham)

  • MineThatData: data mining from the marketing point of view (Kevin Hillstrom)

  • Oracle Data Mining and Analytics: A blog focusing on the use of Oracle for data mining. It covers news, code and applications related to Oracle (Marcos M. Campos)

  • Shane's Blog: a personal view on data mining with posts on different applications and news (Shane Butler)

  • Smart (Enough) Systems: data mining and analytics (among others) for decision management (James Taylor)

  • Undirected Grad: a machine learning blog from a PhD student at Cambridge (Jurgen Van Gael)

  • Yet Another Machine Learning Blog: more machine learning oriented but contains a lot of useful information (Pierre Dangauthier)

Sunday, May 18, 2008