This is the "Introduction" page of the "Text and Data Mining" guide.
Alternate Page for Screenreader Users
Skip to Page Navigation
Skip to Page Content
Last Updated: Feb 12, 2017 URL: Print Guide RSS Updates

Introduction Print Page

What is text and data mining?

Data mining is the process of finding patterns or relationships between large set of structured data using computational analysis.

Text mining is similar to data mining but, instead of analysing structured data sets, qualitative or “unstructured” natural language text is mined to identify patterns and trends.

Text and data mining are often classified as 'non-consumptive' research as, although the content is analysed, the research does not actually involve reading or viewing of the individual texts.

For more information see:


Introductory video


  • Australian National Data Service
    The Australian National Data Service (ANDS) is making Australia’s research data assets more valuable for researchers, research institutions and the nation.
    24 August 2016: They are currently running a "23 Things" training project on research data. This training project is nearing an end, but will remain available.
  • Liber Europe
    LIBER, its library members and the researchers they support, are actively advocating for a more flexible copyright system that will allow Text and Data Mining to be used to its full potential.
  • Research at Google
    Research at Google tackles the most challenging problems in Computer Science and related fields. A big challenge is in developing metrics, designing experimental methodologies, and modeling the space to create parsimonious representations that capture the fundamentals of the problem. Data mining lies at the heart of many of these questions, and the research done at Google is at the forefront of the field.
  • Institute of Analytics Professionals of Australia
    The Institute of Analytics Professionals of Australia (IAPA) is the professional organisation for the analytics industry in Australia, incorporating business analytics and data mining across multiple disciplines and sectors.
  • Wellcome Trust
    The Wellcome Trust is advocating changes in copyright legislation that will allow researchers to use data and text mining tools to interrogate and extract value from the ever-expanding volume of research literature and data.


  • petermr's blog
    A blog from scientist Peter Murray-Rust, an advocate for open science. He leads the ContentMine project, a Shuttleworth Foundation initiative demonstrating the usefulness of content mining and providing tools and services for this purpose.
  • Data Mining Research
    Data Mining Research covers both research and applications in data mining. Among others, posts discuss research issues, recent applications, important events, interviews with leading actors, current trends, book reviews, etc.
  • KDnuggets
    KDnuggets™ is a leading site on Business Analytics, Big Data, Data Mining, and Data Science, and is managed by Gregory Piatetsky-Shapiro, a leading expert in the field.

Loading  Loading...