Python, web log data mining for frequent patterns

生来就可爱ヽ(ⅴ<●) 提交于 2019-12-21 05:16:09

问题


I need to develop a tool for web log data mining.

Having many sequences of urls, requested in a particular user session (retrieved from web-application logs), I need to figure out the patterns of usage and groups (clusters) of users of the website.

I am new to Data Mining, and now examining Google a lot. Found some useful info, i.e. querying Frequent Pattern Mining in Web Log Data seems to point to almost exactly similar studies.

So my questions are:

  1. Are there any python-based tools that do what I need or at least smth similar?
  2. Can Orange toolkit be of any help?
  3. Can reading the book Programming Collective Intelligence be of any help?
  4. What to Google for, what to read, which relatively simple algorithms to use best?

I am very limited in time (to around a week), so any help would be extremely precious. What I need is to point me into the right direction and the advice of how to accomplish the task in the shortest time.

Thanks in advance!


回答1:


1&2: Orange has a frequent pattern mining module. It also supports clustering.

3.I have just check the content of the book. There is not a chapter for frequent pattern mining. Anyway, it is generally a good book for beginners in data mining. You will find it very useful to help you define your problem precisely.

4.You need to understand the input and output of clustering, frequent pattern mining/association rule mining. So google these algorithms, or find a good data mining text book to read.




回答2:


The Pattern module might be what you are looking for. http://www.clips.ua.ac.be/pages/pattern



来源:https://stackoverflow.com/questions/2925555/python-web-log-data-mining-for-frequent-patterns

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!