jagomart
digital resources
picture1_Structure Ppt 80047 | Lecture12 Clustering


 201x       Filetype PPT       File size 1.27 MB       Source: web.stanford.edu


File: Structure Ppt 80047 | Lecture12 Clustering
introduction to information retrieval introduction to information retrieval today s topic clustering document clustering motivations document representations success criteria clustering algorithms partitional hierarchical ch 16 introduction to information retrieval introduction ...

icon picture PPT Filetype Power Point PPT | Posted on 07 Sep 2022 | 3 years ago
Partial capture of text on file.
  Introduction to Information Retrieval                                                          
  Introduction to Information Retrieval                                                          
       Today’s Topic: Clustering
        Document clustering
              Motivations
              Document representations
              Success criteria
        Clustering algorithms
              Partitional
              Hierarchical
                                                                                                Ch. 16
  Introduction to Information Retrieval                                                          
  Introduction to Information Retrieval                                                          
       What is clustering?
        Clustering: the process of grouping a set of objects 
            into classes of similar objects
              Documents within a cluster should be similar.
              Documents from different clusters should be 
                dissimilar.
        The commonest form of unsupervised learning
                    Unsupervised learning = learning from raw data, as 
                     opposed to supervised data where a classification of 
                     examples is given
              A common and important task that finds many 
                applications in IR and other places
                                                                                                Ch. 16
  Introduction to Information Retrieval                                                          
  Introduction to Information Retrieval                                                          
       A data set with clear cluster structure
                                                                                How would 
                                                                                   you design 
                                                                                   an algorithm 
                                                                                   for finding 
                                                                                   the three 
                                                                                   clusters in 
                                                                                   this case?
                                                                                                Sec. 16.1
  Introduction to Information Retrieval                                                          
  Introduction to Information Retrieval                                                          
       Applications of clustering in IR
        Whole corpus analysis/navigation
              Better user interface: search without typing
        For improving recall in search applications
              Better search results (like pseudo RF)
        For better navigation of search results
              Effective “user recall” will be higher
        For speeding up vector space retrieval
              Cluster-based retrieval gives faster search
  Introduction to Information Retrieval                                                          
  Introduction to Information Retrieval                                                          
       Yahoo! Hierarchy isn’t clustering but is the kind 
       of output you want from clustering
    www.yahoo.com/Science
                                                                                      … (30)
          agriculture               biology           physics              CS                 space
                    ...                      ...                 ...                 ...             ...
    dairy                      botany       cell                       AI       courses
              crops                                                                           craft
                  agronomy                         magnetism               HCI                   missions
     forestry                         evolution              relativity
The words contained in this file might help you see if this file matches what you are looking for:

...Introduction to information retrieval today s topic clustering document motivations representations success criteria algorithms partitional hierarchical ch what is the process of grouping a set objects into classes similar documents within cluster should be from different clusters dissimilar commonest form unsupervised learning raw data as opposed supervised where classification examples given common and important task that finds many applications in ir other places with clear structure how would you design an algorithm for finding three this case sec whole corpus analysis navigation better user interface search without typing improving recall results like pseudo rf effective will higher speeding up vector space based gives faster yahoo hierarchy isn t but kind output want www com science agriculture biology physics cs dairy botany cell ai courses crops craft agronomy magnetism hci missions forestry evolution relativity...

no reviews yet
Please Login to review.