enow.com Web Search

Search results

  1. Results from the WOW.Com Content Network
  2. Bibliographic coupling - Wikipedia

    en.wikipedia.org/wiki/Bibliographic_coupling

    Bibliographic coupling, like co-citation, is a similarity measure that uses citation analysis to establish a similarity relationship between documents. Bibliographic coupling occurs when two works reference a common third work in their bibliographies. It is an indication that a probability exists that the two works treat a related subject ...

  3. Content similarity detection - Wikipedia

    en.wikipedia.org/wiki/Content_similarity_detection

    A study was conducted to test the effectiveness of similarity detection software in a higher education setting. One part of the study assigned one group of students to write a paper. These students were first educated about plagiarism and informed that their work was to be run through a content similarity detection system.

  4. Turnitin - Wikipedia

    en.wikipedia.org/wiki/Turnitin

    This prevents one student from using another student's paper, by identifying matching text between papers. In addition to student papers, the database contains a copy of the publicly accessible Internet, with the company using a web crawler to continually add content to Turnitin's archive. It also contains commercial and/or copyrighted pages ...

  5. Substantial similarity - Wikipedia

    en.wikipedia.org/wiki/Substantial_similarity

    A showing that features of the two works are not similar does not bar a finding of substantial similarity, if such similarity as does exist clears the de minimis threshold. [3] The substantial similarity standard is used for all kinds of copyrighted subject matter: books, photographs, plays, music, software, etc. It may also cross media, as in ...

  6. Co-citation - Wikipedia

    en.wikipedia.org/wiki/Co-citation

    Co-citation is the frequency with which two documents are cited together by other documents. [1] If at least one other document cites two documents in common, these documents are said to be co-cited. The more co-citations two documents receive, the higher their co-citation strength, and the more likely they are semantically related. [1]

  7. File:Similarity analysis and clustering of modules.pdf

    en.wikipedia.org/wiki/File:Similarity_analysis...

    Testing various clustering algorithms and analyzing their results to find a suitable match for our task (determining which modules are similar and possible candidates to be merged). Also contains a brief literature review of code similarity detection. List of possible candidates for improvement of clustering using better algorithms.

  8. Gestalt pattern matching - Wikipedia

    en.wikipedia.org/wiki/Gestalt_Pattern_Matching

    The similarity of two strings and is determined by this formula: twice the number of matching characters divided by the total number of characters of both strings. The matching characters are defined as some longest common substring [3] plus recursively the number of matching characters in the non-matching regions on both sides of the longest common substring: [2] [4]

  9. SimRank - Wikipedia

    en.wikipedia.org/wiki/SimRank

    SimRank is a general approach that exploits the object-to-object relationships found in many domains of interest. On the Web, for example, two pages are related if there are hyperlinks between them. A similar approach can be applied to scientific papers and their citations, or to any other document corpus with cross-reference information. In ...