CAP Theorem in Big Data


Explain CAP

The CAP theorem, also known as Brewer’s theorem, concerns three properties of a distributed system: Consistency, Availability, and Partition Tolerance.

Consistency: 

Consistency means that all nodes see the same data at the same time: a read returns the value of the most recent write, so every node serves the same information. A system is consistent if each transaction starts with the system in a consistent state and finishes with it in a consistent state. The system can (and does) pass through an inconsistent state during a transaction, but the whole transaction is rolled back if an error occurs at any point. For example, suppose two different records (“Bulbasaur” and “Pikachu”) are written at different timestamps; a consistent read must return “Pikachu”, the most recent write. The trade-off is that nodes need time to update, so they will not be available to serve requests as often.
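To make this concrete, here is a minimal, illustrative sketch of last-write-wins conflict resolution (not any particular database’s implementation; the class and values simply mirror the example above). Each replica stores the record together with its write timestamp, so whichever order the writes arrive in, a consistent read returns “Pikachu”, the most recent write.

```python
import time

class Replica:
    """A single node that stores one record together with its write timestamp."""
    def __init__(self):
        self.value = None
        self.timestamp = 0.0

    def write(self, value, timestamp=None):
        ts = timestamp if timestamp is not None else time.time()
        # Accept the write only if it is at least as new as what we already hold.
        if ts >= self.timestamp:
            self.value, self.timestamp = value, ts

    def read(self):
        return self.value


# Three replicas receive the writes in different orders, but each write carries
# its original timestamp, so every replica converges on "Pikachu", the most
# recent write -- which is exactly what a consistent read must return.
replicas = [Replica() for _ in range(3)]
t1, t2 = 1.0, 2.0
replicas[0].write("Bulbasaur", t1); replicas[0].write("Pikachu", t2)
replicas[1].write("Pikachu", t2);   replicas[1].write("Bulbasaur", t1)
replicas[2].write("Pikachu", t2)

print([r.read() for r in replicas])   # ['Pikachu', 'Pikachu', 'Pikachu']
```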


Availability:

Availability means that every request receives a response, whether it succeeds or fails. Achieving availability in a distributed system requires that the system remains operational at all times: every client gets a response regardless of the state of any individual node. This property is straightforward to observe: either you can submit read/write commands, or you cannot. Availability-focused databases are therefore expected to be accessible online at all times. In contrast to the previous example, we cannot tell whether “Pikachu” or “Bulbasaur” was written first, so a read could return either value. Consequently, high availability is difficult to guarantee while analysing streaming data at high frequency.


Partition Tolerance: 

Partition tolerance means that the system continues to operate despite any number of messages being delayed or dropped by the network between nodes. A partition-tolerant system can sustain any amount of network failure that does not bring down the entire network. Data records are replicated across enough combinations of nodes and networks to keep the system up through intermittent outages. In modern distributed systems, partition tolerance is a necessity, not a choice, so the real trade-off is between Consistency and Availability.




Distributed Database Systems 

In a NoSQL-style distributed database system, multiple computers, or nodes, work together to give the client the impression of a single database. The data is stored across these nodes, each of which runs an instance of the database server, and the nodes communicate with one another. When a client writes to the database, the data is written to an appropriate node in the distributed system; the client may not know where the data ends up.

Similarly, when a client wants to retrieve data, it connects to the nearest node in the system, which fetches the data on its behalf without the client being aware of it. In this way, the client interacts with the system as if it were a single database: the nodes retrieve the data the client is looking for from the relevant node, or store the data the client provides.

The advantages of a distributed system are clear. As traffic from clients increases, we can scale the database simply by adding more nodes to the system. Because these nodes are commodity hardware, this is relatively cheaper than adding more resources to each node individually: horizontal scaling is cheaper than vertical scaling. Horizontal scaling also makes replicating data cheaper and simpler, which means the system can handle more client traffic by distributing it appropriately among the replicated nodes.
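To illustrate how a client can read and write without knowing which node holds the data, here is a hedged toy sketch (the class and method names are invented for this example): the store hashes each key to pick the owning node, and the client only ever calls put() and get() on the cluster as a whole.

```python
import hashlib

class DistributedStore:
    """Toy distributed key-value store: the client talks to the cluster,
    not to an individual node; routing is hidden behind put() and get()."""
    def __init__(self, num_nodes=3):
        # Each dict stands in for the storage of one node in the cluster.
        self.nodes = [dict() for _ in range(num_nodes)]

    def _node_for(self, key):
        # Hash the key to decide which node owns it.
        digest = hashlib.md5(key.encode()).hexdigest()
        return int(digest, 16) % len(self.nodes)

    def put(self, key, value):
        self.nodes[self._node_for(key)][key] = value

    def get(self, key):
        return self.nodes[self._node_for(key)].get(key)


store = DistributedStore(num_nodes=3)
store.put("user:42", {"name": "Alice"})
print(store.get("user:42"))   # the caller never sees which node held the record
```

In a real system the simple modulo routing above would typically be replaced by consistent hashing, so that adding a node while scaling horizontally remaps only a fraction of the keys rather than most of them.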



What is the CAP Theorem?

The CAP theorem states that a distributed database system has to make a tradeoff between Consistency and Availability when a Partition occurs.

A distributed database system is bound to experience partitions in the real world, due to network failures or other causes, so partition tolerance is a property we cannot avoid when building such a system. A distributed system therefore chooses to give up either Consistency or Availability, but never Partition tolerance. For instance, if a partition occurs between two nodes, it is impossible to provide both consistent data on both nodes and availability of the complete data, so we must settle on either Consistency or Availability. A NoSQL distributed database is accordingly described as either AP or CP. CA databases are generally monolithic databases that run on a single node and offer no distribution; consequently, they need no partition tolerance.

Where can the CAP theorem be used as an example?

The CAP theorem can indeed serve as an illustrative example within the realm of distributed database systems. When setting up a distributed database framework, it is inevitable to encounter partitions due to network failures or other unforeseen circumstances. Hence, partition tolerance becomes a necessary property that cannot be avoided in such a system. In this context, the CAP theorem comes into play. It states that a distributed framework must make a trade-off between either consistency or availability, as it is not possible to achieve both simultaneously when a partition occurs between two nodes. For instance, during a partition, it becomes challenging to maintain consistent data on both nodes while ensuring complete data availability. As a consequence, in such scenarios, we are left with the choice of prioritizing either consistency or availability.

To better understand this, it is essential to consider the different types of distributed databases. NoSQL distributed databases can be characterized as either AP or CP. AP databases prioritize availability and partition tolerance over strict consistency. On the other hand, CP databases prioritize consistency and partition tolerance at the expense of availability. These distinctions become crucial when deciding the appropriate database type for specific use cases.

CAP Theorem NoSQL Database Types

NoSQL (non-relational) databases are well suited to distributed network applications. Because NoSQL databases are horizontally scalable and distributed by design, they can quickly scale across a growing network of interconnected nodes. They are classified according to which two CAP attributes they support:

CP database: A CP database delivers consistency and partition tolerance at the expense of availability. When a partition occurs between any two nodes, the system has to shut down the non-consistent node (make it unavailable) until the partition is resolved.

AP database: An AP database delivers availability and partition tolerance at the expense of consistency. When a partition occurs, all nodes remain available, but those on the wrong side of the partition may return an older version of the data than others.

CA database: A CA database delivers consistency and availability across all nodes. It cannot do this if there is a partition between any two nodes in the system, however, and therefore cannot provide partition tolerance.
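The following toy sketch (purely illustrative, not modelled on any specific product) contrasts the CP and AP choices during a partition: the CP store refuses to answer from a node that has been cut off rather than risk serving stale data, while the AP store keeps answering from every node, even though the partitioned node may return an older value.

```python
class Node:
    def __init__(self, name):
        self.name = name
        self.value = None
        self.reachable = True   # False means this node is cut off by a partition


class CPStore:
    """Consistency + partition tolerance: a node that cannot be kept consistent
    is effectively taken offline, so reads from it fail instead of going stale."""
    def read(self, node):
        if not node.reachable:
            raise RuntimeError(f"{node.name} is unavailable until the partition heals")
        return node.value


class APStore:
    """Availability + partition tolerance: every node keeps answering, but a
    partitioned node may return an older version of the data."""
    def __init__(self, nodes):
        self.nodes = nodes

    def write(self, value):
        for n in self.nodes:
            if n.reachable:          # the write cannot reach partitioned nodes
                n.value = value

    def read(self, node):
        return node.value            # always answers, possibly with stale data


a, b = Node("A"), Node("B")
ap = APStore([a, b])
ap.write("v1")
b.reachable = False                  # a partition cuts node B off
ap.write("v2")
print(ap.read(a), ap.read(b))        # v2 v1  -> available, but B is stale

try:
    CPStore().read(b)                # the CP choice: refuse rather than serve stale data
except RuntimeError as err:
    print(err)
```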

Spaces defined by CAP

CD Space: The engines in this space concentrate on availability and consistency; data distribution does not dominate. This is where relational databases sit, although some graph-oriented NoSQL engines can also be found here.

ND Space: This space contains no database engines; it is the empty set. It would contradict the CAP theorem, because even with the latest technology no system can achieve all three properties of the theorem at once.

DT Space: Here, partition tolerance and consistency are favoured, sacrificing a certain degree of availability. When facing a network partition, these databases may be unable to respond to particular kinds of queries.

CT Space: Here the engines favour availability and partition tolerance. That does not mean they provide no consistency; rather, consistency is relative and cannot be guaranteed across nodes.


Conclusion

Distributed systems let us achieve a level of computing power and availability that was simply not attainable before. They offer better performance, lower latency, and close to 100% uptime across servers that span the entire globe. They run on commodity hardware that is readily available and configurable at moderate cost. Distributed systems are, however, inherently more complex than their single-node counterparts. Understanding the complexity introduced by distribution, making the appropriate CAP trade-offs, and choosing the right tool for the task are essential when scaling horizontally.

 





About Big Data Tools

Big data tooling is open-source software in which a Java framework is used to store, transfer, and process data. This kind of big data software offers huge storage capacity for any kind of data, enormous processing power, and a mechanism to handle a virtually limitless number of tasks or operations. The term big data itself is used to describe very large volumes of complex data. Big data can be divided into three types: structured, semi-structured, and unstructured data. One more point to remember: because big data grows exponentially, it is impossible to process and access it using traditional methods. Traditional methods are built around the relational database system, which handles only certain structured data formats, and this can cause the data processing to fail.

Here are a few important features and use cases of big data:

1. Big data helps in managing traffic on streets and also supports stream processing.

2. Supports content management and email archiving.

3. Big data helps process rat brain signals using computing clusters.

4. Provides fraud detection and prevention.

5. Helps manage content, posts, images, and videos on many social media platforms.

6. Analyzes customer data in real time to improve business performance.

7. Facebook, a Fortune 500 company, ingests more than 500 terabytes of data every day, largely in unstructured format.

8. The main reason to use big data is to gain full insight into business data and to improve sales and marketing strategies.


Introduction to ETL Tools in Big Data:

ETL stands for Extract, Transform, and Load. It is a simple process for moving your data from one source into one or more warehouses, and it is considered a crucial step in big data analysis. ETL tools in big data applications help users perform these three fundamental steps: with an ETL tool, users can move their data from a source to a destination. The main functions of the ETL process include data migration, coordinating the data flow, and processing large or complex volumes of data; a minimal sketch of the three steps follows the list below. The following are basic aspects to consider for each ETL tool:

1. Overview

2. Pricing

3. Use case
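As mentioned above, here is a minimal, illustrative sketch of the three ETL steps in Python; the file name, column names, and table name are hypothetical and simply stand in for whatever source and warehouse you use. Data is extracted from a CSV file, transformed (cleaned and typed), and loaded into a SQLite table.

```python
import csv
import sqlite3

def extract(path):
    """Extract: read raw rows from a source CSV file."""
    with open(path, newline="") as f:
        return list(csv.DictReader(f))

def transform(rows):
    """Transform: clean values and derive the columns the warehouse expects."""
    cleaned = []
    for row in rows:
        cleaned.append({
            "name": row["name"].strip().title(),
            "amount_usd": round(float(row["amount"]), 2),
        })
    return cleaned

def load(rows, db_path="warehouse.db"):
    """Load: write the transformed rows into the target warehouse table."""
    con = sqlite3.connect(db_path)
    con.execute("CREATE TABLE IF NOT EXISTS sales (name TEXT, amount_usd REAL)")
    con.executemany("INSERT INTO sales VALUES (:name, :amount_usd)", rows)
    con.commit()
    con.close()

# Typical usage, assuming a sales.csv file with 'name' and 'amount' columns exists:
# load(transform(extract("sales.csv")))
```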


Best Big Data ETL Tools:

In this section, we explain the topmost ETL tools used in big data. These tools help remove the issues involved in finding and building an appropriate data flow.

Let us go through them one by one:

1. Hevo (no-code data pipeline) tool:

Hevo is known as a no-code data pipeline. It supports pre-built integrations across 100+ data sources. Hevo is a fully managed solution for migrating your data and also automates the data flow. It is built on a fault-tolerant architecture that ensures your data stays secure and consistent, and it offers an efficient, fully automated solution for managing your data in real time.

The features of the Hevo big data tool are:

1. Hevo is a fully managed tool that offers high-level data transformation.

2. Offers real-time data migration and effective schema management.

3. Supports live monitoring and 24/7 live support.

2. Talend (Talend Open Studio for Data Integration) tool:

Talend is a popular big data and cloud integration tool, built on the Eclipse graphical development environment. Talend supports both cloud-based and on-premise databases and is also offered as software as a service (SaaS). It provides a smooth workflow and is easy to adapt to your business.

3. Informatica big data tool:

Informatica is an on-premise big data ETL tool. It supports data integration with traditional databases and enables users to deliver data on demand, with real-time delivery and data capture support. This tool is best suited to large-scale business organizations.

The following are the key features of the Informatica tool:

1. Advanced level data transformation

2. Dynamic partitioning

3. Data masking.

4. IBM InfoSphere Information Server:

IBM InfoSphere Information Server works in a similar way to the Informatica tool. It is an enterprise product aimed at large business organizations. InfoSphere also has a cloud version hosted on IBM Cloud, and it works well with mainframe devices. It supports data integration with various cloud data stores such as AWS S3 and Google Cloud Storage. Parallel data processing is one of the prominent features of IBM InfoSphere Information Server.

5. Pentaho data integration tool:

Pentaho is an open-source big data ETL tool, also known as Kettle. It mainly focuses on batch-level ETL and on-premise use cases, and it is designed around hybrid and multi-cloud architectures. Its main functions include data migration, loading large volumes of data, and data cleansing. It also provides a drag-and-drop interface and has a gentle learning curve. For ad-hoc analysis, Pentaho is considered better than Talend because it expresses ETL procedures in markup languages such as XML.




6. Clover DX big data tool:

Clover DX is a fully Java-based ETL tool for rapid automation and data integration. It supports data transformations across multiple data sources and integrates with email, JSON, and XML data sources. Clover DX offers job scheduling and data monitoring, and it can be set up in a distributed environment for high scalability and availability. If you are looking for a big data ETL tool with real-time data analysis, Clover DX is a strong choice. It also lets users deploy data workloads in the cloud or on premise.

7. Oracle Data Integrator big data tool:

Oracle Data Integrator is a popular tool developed by Oracle. It combines the features of a proprietary engine with an ETL big data tool. It is fast and requires minimal maintenance. With this tool, users can build load plans that draw from one or more data sources. Oracle Data Integrator is also able to detect faulty data and recycle it before it reaches the destination. Examples of platforms it works with include IBM DB2 and Exadata.

The important features include:

1. Business intelligence

2. Data migration operations

3. Big data integration

4. Application integration.

If you want your big data to be deployed on a cloud management service, then Oracle Data Integrator is a good choice. It supports data deployment using bulk loads, cloud and web services, and batch and real-time services.

8. StreamSets big data ETL tool:

StreamSets is a DataOps ETL tool. It supports monitoring and connects a variety of data sources and destinations for data integration. StreamSets is a cloud-optimized, real-time big data ETL tool, and many business enterprises use it to consolidate data sources for analysis. It also supports data protection in line with data security regulations such as GDPR and HIPAA.

9. Matillion tool:

The Matillion ETL tool is built specifically for Amazon Redshift, Google BigQuery, Azure Synapse, and Snowflake. It sits between raw data and business intelligence tools and handles the compute-intensive work of loading data from your on-premise environment. It is highly scalable because it is built to take advantage of the data warehouse’s own capabilities. Matillion also helps automate data flows and provides a drag-and-drop, browser-based user interface that eases ETL tasks.



Conclusion:

In this big data ETL tools blog, we have discussed popular tools that are designed around different requirements and factors. With its help, you can choose an ETL tool according to your business needs. For example, if you want to work with an open-source big data ETL tool, you can choose Clover DX or Talend; if you want to work with managed pipelines, you can choose the Hevo ETL tool. According to a Gartner report, almost 65% of large companies use big data software to manage enormous amounts of data, so studying this blog may help you master big data software.


