Page 152 - Computer - 8
P. 152

DATA MINING

          Data mining is the process used by companies to turn raw data into
          useful information. By using software to look for patterns in large
          batches of data, businesses can learn more about their customers
          to develop more effective marketing strategies, increase sales, and
          decrease costs. Data mining depends on effective data collection,
          warehousing, and computer processing.

          How Data Mining Works?
          Data mining involves exploring and analysing large blocks of information to glean meaningful patterns
          and trends. It can be used in a variety of ways, such as database marketing, credit risk management,
          fraud detection, spam email filtering, or even to discern the sentiment or opinion of users.
          The data mining process breaks down into five steps:
            1.  Organisations collect data and load it into their data warehouses.
            2.  They store and manage the data either on in-house servers or in the cloud.

            3.  Business analysts, management teams, and information technology professionals access the data
                and determine how they want to organise it.
            4.  Application software sorts the data based on the user’s results.
            5.  The end-user presents the data in an easy-to-share format, such as a graph or table.

          The Data Mining Process

          To be effective, data analysts generally follow a certain flow of tasks along the data mining process. The
          data mining process is usually broken into the following steps.
          Step 1: Define the Problem

          Before any data is touched, extracted, cleaned, or analysed, it is important to understand the underlying
          entity and the project at hand. What are the goals the company is trying to achieve by mining data?
          What  is  their  current  business  situation?  Before  looking  at  any  data,  the  mining  process  starts  by
          understanding what will define success at the end of the process.

          Step 2: Identify Required Data
          Once the business problem has been clearly defined, it’s time to start thinking about data. This includes
          what sources are available, how it will be secured and stored, how information will be gathered, and
          what the final outcome or analysis may look like. This step also critically thinks about what limits there
          are to data, storage, security, and collection, and assesses how these constraints will impact the data
          mining process.

















              150
   147   148   149   150   151   152   153   154   155   156   157