Schema on Write VS Schema on Read

Schema on Write (RDBMS) Schema on Read (Hadoop)
Schema must be created before any data can be loaded Data is simply copied to the file store, no transformation is needed.
An explicit load operation has to take place which transforms data to DB internal structure A SerDe (Serializer/Deserializer) is applied during read time to extract the required columns (Late Binding)
New columns must be added explicitly before new data for such columns can be loaded into the database New data can start flowing anytime and will appear retroactively once the SerDe is updated to parse it.
Pros

Read is fast
Standards / Goverance

Pros

Load is fast
Flexibility / Agility

Naveen P.N

12+ years of experience in IT with vast experience in executing complex projects using Java, Micro Services , Big Data and Cloud Platforms. I found NPN Training Pvt Ltd a India based startup to provide high quality training for IT professionals. I have trained more than 3000+ IT professionals and helped them to succeed in their career in different technologies. I am very passionate about Technology and Training. I have spent 12 years at Siemens, Yahoo, Amazon and Cisco, developing and managing technology.