The quiet revolution: the internet of data structures with IPFS

Everything of value involves data. Where data lives and how it’s accessed is about to change. The internet of data structures (IoDS) is transforming the web from linking data by location to linking data with hashes. Is your organization prepared?

The internet of data structures (IoDS) is emerging as one of the most significant advancements in data within the last decade.

HTTP (the hypertext transfer protocol) is the foundation communication the World Wide Web. Hypertext is structured text that enables us to access content throughout the web, using logical links (hyperlinks) between nodes containing data. But what if HTTP was no longer needed? What if there was a better way to communicate and connect data. IPFS (interplanetary file system) is a globally distributed storage system. Content is addressable and shared through a peer-to-peer hypermedia distribution protocol.

URLs are out; hashes are in.

The movement from HTTP to IPFS

Previously, I described the IPFS storage model and the benefits for healthcare. Today, we’ll step a layer deeper into how the structure is designed and I’ll offer an introduction to the IPFS stack.

Where does IPFS fit into our existing infrastructure? How do you communicate the value of this technology and the application of business transformation? Those questions are exactly what we’ll be tackling.

HTTP uses hyperlinks that translate into locations to connect discrete objects and data sets. IPFS is like HTTP, but instead of using locations provided by a group of servers, IPFS uses a peer-to-peer network to share context using hash values or hashes. In IPFS, content is addressable using hashes, the hashed value of the content.

IPFS is a Merkle addressed transport protocol for distributed data structures. The IPFS stack breaks down into three general buckets, each offering particular value.

  1. Using the data: applications (the IPFS stack)
  2. Defining the data: naming, Merkle-DAG (IPNS, IPLD)
  3. Moving the data: exchanges, routing, network (Libp2p)

These three primary buckets further divide into five broad categories that compose the infrastructure stack.

  1. Applications: EtherpadVLCGitEthereumWhisper
  2. Naming: DNSIPNSEthNamesNamecoin or IPLD
  3. Exchange: BitTorrentBitswapFTPHTTP
  4. RoutingGossipChordKad DHTmDNSDelegatedI2PTOR
  5. NetworkCJDNSUDTuTPWebRTCQUICTCPWebSocketsI2PTOR

Accessing files on IPFS

It’s easier to understand IPFS if we frame it next to concepts we’re already familiar with, like DNS.

The below HTTP example shows a typical website URL for a company logo and the host name translated into an IP address using DNS. Next, the IPFS example offers a comparative example using the IPNS (interplanetary naming system) and IPFS working together. IPNS allows the storage of a reference to an IPFS hash under the namespace of your peerID (the hash of your public key). This IPFS hash references an addressable object in IPFS, using a hash value that points to a hash object linked to another hash object until your destination is found.

IPFS also achieves immutability by separating key management from file system security. The filenames contain public keys making them self-certifying pathnames. Public key hashes, resolve pointers that are signed with a private key to access content.

HTTP

  • http://peterbnichol.com/linktohash/logo.jpeg (domain name service)
  • http://10.11.12.13/linktohash/logo.jpeg (IP address)

IPFS

  • /ipns/ReE45fRer5LR3/linktohash/logo.jpeg (InterPlanetary name service – optional)
  • /ipfs/ReE78kGrd5KJ2/linktohash/logo.jpeg (hash address)

The process of linking by objects is similar to how inodes operate, except using hash values. An inode is a data structure on a file system that stores all the information about a file except its name and its actual data.

Posting content with IPFS

IPFS offers a unique approach to addressing and moving content within a network. If other peers were uninterested in your content, then the standard paid backup solutions (AWS, Azure, Swam) could be leveraged. Also, unlike other peer-to-peer distributed networks, IPFS only downloads explicitly required data. IPFS does not pull full copies of data.

Publishing content to IPFS is similar to publishing content through a private blockchain. It’s also possible to distribute content on IPFS and then remove yourself as a host who serves that content (remove the need for infrastructure?). Here is an example of posting data in an IPFS world:

  1. Create content
  2. Generate key names
  3. Sign content
  4. Distribute to peer-to-peer network
  5. Register key name and point to hash of public key

In theory, this process removes the need for locally owned and managed infrastructure. In practice, standard paid backup services may be required such as those listed above.

What will be impacted?

Any products, services or interactions that leverage storage or save data have the potential to be affected. This foundational technology layer will affect where data is stored (traditional databases to IPFS) and how data is accessed (URLs to hashes). Every platform that requires linked and encrypted communications has the potential to benefit from IPFS. Dapps and mobile applications will quietly shift to internet of data structures as scale and interoperability become increasingly critical.

Distributed denial-of-service (DDoS) attacks would be harder to execute on platforms running IPFS. HTTP routes static traffic to a central server, this decreases the attack surface, making targeted attacks more efficient. However, IPFS is a distributed storage system. By using IPFS and distributing the attack surface across peers, this makes conducting DDoS attacks significantly more difficult because content can be accessed through the distributed storage network.

Innovative leaders are learning about IPFS: what it is, where it impacts the organization and how it can be used to create a strategy to leverage this foundational technology.

Now available on Amazon! Click here!

Previous articleFrom medical records to Merkle trees with IPFS
Next articleRecharge your knowledge of the modern data warehouse
Peter is a technology executive with over 20 years of experience, dedicated to driving innovation, digital transformation, leadership, and data in business. He helps organizations connect strategy to execution to maximize company performance. He has been recognized for Digital Innovation by CIO 100, MIT Sloan, Computerworld, and the Project Management Institute. As Managing Director at OROCA Innovations, Peter leads the CXO advisory services practice, driving digital strategies. Peter was honored as an MIT Sloan CIO Leadership Award Finalist in 2015 and is a regular contributor to CIO.com on innovation. Peter has led businesses through complex changes, including the adoption of data-first approaches for portfolio management, lean six sigma for operational excellence, departmental transformations, process improvements, maximizing team performance, designing new IT operating models, digitizing platforms, leading large-scale mission-critical technology deployments, product management, agile methodologies, and building high-performance teams. As Chief Information Officer, Peter was responsible for Connecticut’s Health Insurance Exchange’s (HIX) industry-leading digital platform transforming consumerism and retail-oriented services for the health insurance industry. Peter championed the Connecticut marketplace digital implementation with a transformational cloud-based SaaS platform and mobile application recognized as a 2014 PMI Project of the Year Award finalist, CIO 100, and awards for best digital services, API, and platform. He also received a lifetime achievement award for leadership and digital transformation, honored as a 2016 Computerworld Premier 100 IT Leader. Peter is the author of Learning Intelligence: Expand Thinking. Absorb Alternative. Unlock Possibilities (2017), which Marshall Goldsmith, author of the New York Times No. 1 bestseller Triggers, calls "a must-read for any leader wanting to compete in the innovation-powered landscape of today." Peter also authored The Power of Blockchain for Healthcare: How Blockchain Will Ignite The Future of Healthcare (2017), the first book to explore the vast opportunities for blockchain to transform the patient experience. Peter has a B.S. in C.I.S from Bentley University and an MBA from Quinnipiac University, where he graduated Summa Cum Laude. He earned his PMP® in 2001 and is a certified Six Sigma Master Black Belt, Masters in Business Relationship Management (MBRM) and Certified Scrum Master. As a Commercial Rated Aviation Pilot and Master Scuba Diver, Peter understands first hand, how to anticipate change and lead boldly.