What Are the Traits of an RDD?

The traits of an RDD (Resilient Distributed Dataset) are :

  1. In Memory : Data is held in memory, as much as fits and for as long as it is needed.
  2. Immutable : Data is read-only; a transformation never changes an RDD, it produces a new RDD.
  3. Lazily Evaluated : Data is computed only when an action is performed; until then an RDD is just a definition without data.
  4. Typed : The records in an RDD have a type, such as Int or String (for example RDD[Int], RDD[String]).
  5. Parallel : Data is processed in parallel across the nodes of the cluster.
  6. Partitioned : Data in an RDD is split into partitions, which are distributed across the nodes in the cluster.
  7. Cached : Data can be persisted in RAM or on disk so that it does not have to be recomputed.
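
Several of the traits above (immutable, lazily evaluated, typed, parallel, partitioned, cached) can be illustrated with a small toy model. The sketch below is plain Python and does not use Spark: the `MiniRDD` class and all of its methods are hypothetical names made up for this illustration, threads stand in for cluster nodes, and the "cache" lives only in local memory. It is meant to show the ideas, not how Spark implements them.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Generic, Iterable, List, TypeVar

T = TypeVar("T")
U = TypeVar("U")


class MiniRDD(Generic[T]):
    """Toy single-machine stand-in for an RDD (hypothetical, not Spark's API)."""

    def __init__(self, partitions, steps=()):
        # Partitioned + immutable: data is stored as read-only tuples.
        self._partitions = tuple(tuple(p) for p in partitions)
        # Lazy: transformations are only recorded here, not executed.
        self._steps = tuple(steps)
        self._keep = False     # set by cache()
        self._cached = None    # filled by the first action after cache()

    @classmethod
    def parallelize(cls, data: Iterable[T], num_partitions: int) -> "MiniRDD[T]":
        # Assumes num_partitions >= 1; splits data into contiguous chunks.
        items = list(data)
        size = -(-len(items) // num_partitions)  # ceiling division
        return cls([items[i * size:(i + 1) * size] for i in range(num_partitions)])

    def num_partitions(self) -> int:
        return len(self._partitions)

    def map(self, fn: Callable[[T], U]) -> "MiniRDD[U]":
        # Transformation: returns a NEW MiniRDD and leaves self untouched.
        return MiniRDD(self._partitions, self._steps + (fn,))

    def cache(self) -> "MiniRDD[T]":
        # Only marks the result to be kept; nothing is computed yet.
        self._keep = True
        return self

    def _run_partition(self, part) -> list:
        out = list(part)
        for fn in self._steps:
            out = [fn(x) for x in out]
        return out

    def collect(self) -> List[T]:
        # Action: this is the point where work is actually performed.
        if self._cached is not None:
            return [x for part in self._cached for x in part]
        # Parallel: one task per partition (threads stand in for nodes).
        with ThreadPoolExecutor() as pool:
            computed = list(pool.map(self._run_partition, self._partitions))
        if self._keep:
            self._cached = computed
        return [x for part in computed for x in part]


numbers = MiniRDD.parallelize(range(1, 7), num_partitions=3)
doubled = numbers.map(lambda x: x * 2).cache()  # still no computation
print(doubled.collect())  # [2, 4, 6, 8, 10, 12]
print(numbers.collect())  # [1, 2, 3, 4, 5, 6]  (the original is unchanged)
```

In this sketch, `map` plays the role of a transformation and `collect` the role of an action: the function passed to `map` runs only when `collect` is called, and a second `collect` on a cached `MiniRDD` reuses the stored result instead of recomputing it.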