Data categorization

DATA CATEGORIZATION

Data categories are groupings of data with common characteristics or features. Data classification is the process of organizing data into categories for its most effective and efficient use. A well-planned data classification system makes essential data easy to find and retrieve. This can be of particular importance for risk management, legal discovery, and compliance.

Data categories are groupings of data with common characteristics or features. Data classification is the process of organizing data into categories for its most effective and efficient use. A well-planned data classification system makes essential data easy to find and retrieve. This can be of particular importance for risk management, legal discovery, and compliance.

1. How to start process of data categorization:

First step is to evaluate and divide the various applications and data into their respective category as follows:

1. Relational or Tabular data (around 15% of non audio/video data)
  1. Generally describes proprietary data which can be accessible only through application or application programming interfaces (API)
  2. Applications that produce structured data are usually database applications.
  3. This type of data usually brings complex procedures of data evaluation and migration between the storage tiers.
  4. To ensure adequate quality standards, the classification process has to be monitored by subject matter experts.
2. Semi-structured or Poly-structured data (all other non audio/video data that does not confirm to a system or platform defined Relational or Tabular form).
  1. Generally describes data files that have a dynamic or non-relational semantic structure (e.g. documents , XML ,JSON ,Device or System Log output , Sensor Output).
  2. Relatively simple process of data classification is criteria assignment.
  3. Simple process of data migration between assigned segments of predefined storage tiers.

2. Types of data classification:

1. Geographical: i.e. according to area (supposing the rice production of a state or country etc.)
2. Chronological: i.e. according to time (sale of last 3 months)
3. Qualitative: i.e. according to distinct categories. (E.g.: population on the basis of poor and rich)
4. Quantitative: i.e. according to magnitude(a) discrete and b)continuous.


3. Benefits of data classification:

Benefits of effective implementation of appropriate data classification can significantly improve ILM process and save data centre storage resources. If implemented systemically it can generate improvements in data centre performance and utilization. Data classification can also reduce costs and administration overhead. "Good enough" data classification can produce these results:

1. Data compliance and easier risk management. Data are located where expected on predefined storage tier and "point in time"

2. Simplification of data encryption because all data need not be encrypted. This saves valuable processor cycles and all related consecutiveness.

3. Data indexing to improve user access times

4. Data protection is redefined where RTO (Recovery Time Objective) is improved.

Python Programming ↓ 👆
Java Programming ↓ 👆
JAVA EasyExamNotes.com covered following topics in these notes.
JAVA Programs
Principles of Programming Languages ↓ 👆
Principles of Programming Languages
EasyExamNotes.com covered following topics in these notes.

Practicals:
Previous years solved papers:
A list of Video lectures References:
  1. Sebesta,”Concept of programming Language”, Pearson Edu 
  2. Louden, “Programming Languages: Principles & Practices” , Cengage Learning 
  3. Tucker, “Programming Languages: Principles and paradigms “, Tata McGraw –Hill. 
  4. E Horowitz, "Programming Languages", 2nd Edition, Addison Wesley

    Computer Organization and Architecture ↓ 👆

    Computer Organization and Architecture 

    EasyExamNotes.com covered following topics in these notes.

    1. Structure of desktop computers
    2. Logic gates
    3. Register organization
    4. Bus structure
    5. Addressing modes
    6. Register transfer language
    7. Direct mapping numericals
    8. Register in Assembly Language Programming
    9. Arrays in Assembly Language Programming

    References:

    1. William stalling ,“Computer Architecture and Organization” PHI
    2. Morris Mano , “Computer System Organization ”PHI

    Computer Network ↓ 👆
    Computer Network

    EasyExamNotes.com covered following topics in these notes.
    1. Data Link Layer
    2. Framing
    3. Byte count framing method
    4. Flag bytes with byte stuffing framing method
    5. Flag bits with bit stuffing framing method
    6. Physical layer coding violations framing method
    7. Error control in data link layer
    8. Stop and Wait scheme
    9. Sliding Window Protocol
    10. One bit sliding window protocol
    11. A protocol Using Go-Back-N
    12. Selective repeat protocol
    13. Application layer
    References:
    1. Andrew S. Tanenbaum, David J. Wetherall, “Computer Networks” Pearson Education.
    2. Douglas E Comer, “Internetworking with TCP/IP Principles, Protocols, And Architecture",Pearson Education
    3. KavehPahlavan, Prashant Krishnamurthy, “Networking Fundamentals”, Wiley Publication.
    4. Ying-Dar Lin, Ren-Hung Hwang, Fred Baker, “Computer Networks: An Open Source Approach”, McGraw Hill.