1 of 54

IPFS Primer

Introduction

The IPFS Primer

Getting Help

Tutorials

The primer contains tutorials about

Concepts

Cryptographic Hashes and Content Addressability
Authenticated Graphs
Turning Files into Trees
Turning any Data into Trees
Publishing hashes on the DHT
Getting data from the Peer to Peer Network
Immutability: "Changes" as additions to the tree
CRDTs
Pubsub
Authenticated Streams (with pubsub)

Format

Note for Contributors

To build the HTML, PDF, epub and mobi versions of the book with one command, run ./build-book.sh

Contributors

Contributors to those original docs included

@whyrusleeping
@jbenet
@lgierth
@lynnandtonic
@wraithgar
@adambrault
@donothesitate
@djdv

Tutorial: Install and Initialize IPFS

These Lessons are tested with go-ipfs version 0.5.0. Please update this file on github to reflect any other versions that have been tested.

Prerequisites

You should have some familiarity with the commandline

Learning Objectives

These Lessons will teach you how to

Install IPFS
Initialize an IPFS repository
Locate where IPFS stores the contents of your local IPFS repository

Key Concepts

IPFS Repositories

Lessons

Next Steps

Lesson: Download and Install IPFS

Goals

After doing this Lesson you will be able to

Download IPFS and install it on your operating system
Display which version of IPFS you're using
Get a list of commands the ipfs binary supports

Steps

Step 1: Download the Prebuilt IPFS Package

Step 2: Unzip the Prebuilt Package

This will create a directory called go-ipfs.

The file named ipfs is your executable ipfs binary.

Step 3: Install the IPFS Binary on your executable path

To install the binary, all you need to do is put the ipfs binary file somewhere on your executable PATH.

If you're on Mac OSX or Linux, you can use the provided install script by running

Read the output from running this. If it complains about being unable to write the file, you need to deal with permissions (see the note above about permissions)

Step 4: Display the IPFS version

When you're troubleshooting, it's important to know which version of ipfs you're using. To find out the current version, run

Step 5: Display the IPFS help page and list of commands

If you need help remembering how to use any ipfs commands, run

This should display information beginning with

For a complete list of commands that the ipfs executable supports, run

Next Steps

Lesson: Initialize your IPFS Repository

Goals

After doing this Lesson you will be able to

Initialize a local ipfs repository
Locate where IPFS stores the contents of your local IPFS repository
Open the IPFS Configuration file

Steps

Step 1: Initialize the Repository

Use the ipfs init command to initialize the repository. This will generate a local ipfs repository for the current user account on your machine. It also generates a cryptographic keypair that allows your ipfs node to cryptographically sign the content and messages that you create.

$ ipfs init
initializing ipfs node at /Users/jbenet/.go-ipfs
generating 2048-bit RSA keypair...done
peer identity: Qmcpo2iLBikrdf1d6QU6vXuNb6P7hwrbNPW9kLAH8eG67z
to get started, enter:

  ipfs cat /ipfs/QmYwAPJzv5CZsnA625s3Xf2nemtYgPpHdWEz79ojWnPbdG/readme

Note: If you have already initialized ipfs on your machine, you will get an error message like:


initializing ipfs node at /Users/sally/.ipfs
Error: ipfs configuration file already exists!
Reinitializing would overwrite your keys.

This is ok. It means you've already done this step. You can safely proceed to Step 2.

Step 2: Use IPFS to explore the post-install documentation

If you installed a different version of ipfs, you may have gotten a slightly different path to use here. Either path will work for this tutorial. The path you got from the ipfs init command will give you documentation that's accurate for the version of ipfs you're using.

When you ran ipfs init, it provided a hint for how you can get started. It said:

to get started, enter:

  ipfs cat /ipfs/QmYwAPJzv5CZsnA625s3Xf2nemtYgPpHdWEz79ojWnPbdG/readme

This ipfs cat command tells ipfs to read the content matching the path you provided. If the content isn't available locally, ipfs will attempt to find it on the peer-to-peer network.

In order to run the following command, the ipfs daemon must be running. In order to run the ipfs daemon, type ipfs daemon &. This will start the ipfs daemon and place it into the background of your current console.

$ ipfs daemon &

Run the ipfs cat command with the path you got from the init message:

$ ipfs cat /ipfs/QmYwAPJzv5CZsnA625s3Xf2nemtYgPpHdWEz79ojWnPbdG/readme

You should see something like this:

Hello and Welcome to IPFS!

██╗██████╗ ███████╗███████╗
██║██╔══██╗██╔════╝██╔════╝
██║██████╔╝█████╗  ███████╗
██║██╔═══╝ ██╔══╝  ╚════██║
██║██║     ██║     ███████║
╚═╝╚═╝     ╚═╝     ╚══════╝

If you're seeing this, you have successfully installed
IPFS and are now interfacing with the ipfs merkledag!

 -------------------------------------------------------
| Warning:                                              |
|   This is alpha software. use at your own discretion! |
|   Much is missing or lacking polish. There are bugs.  |
|   Not yet secure. Read the security notes for more.   |
 -------------------------------------------------------

Check out some of the other files in this directory:

  ./about
  ./help
  ./quick-start     <-- usage examples
  ./readme          <-- this file
  ./security-notes

You can explore other objects in there. For example, check out security-notes:

ipfs cat /ipfs/QmYwAPJzv5CZsnA625s3Xf2nemtYgPpHdWEz79ojWnPbdG/security-notes

Step 3: Locate where IPFS Stores the Repository Contents on your Machine

ipfs stores its local object repository in ~/.ipfs

$ ls ~/.ipfs

The contents of that directory look like this:

blocks        config        datastore    version

All of the contents of your IPFS repository are stored within this directory. For example, the readme file from above is stored in here, along with the other files it links to. You can run a grep to find out the exact location.

Step 4: Open the IPFS Configuration file

The configuration for your ipfs repository is in a json file that's usually stored at ~/.ipfs/config. To view the current config, run:

$ ipfs config show

One of the useful details in this config file is at Datastore.Path. This tells you where the ipfs repository's contents are being stored. As we saw in Step 3, this is usually ~/.ipfs

Next Steps

Tutorial: Files on IPFS

These Lessons are tested with go-ipfs versions: 0.5.0, 0.9.0

Please update this file on github to reflect any other versions that have been tested.

Prerequisites

You should have some familiarity with the command line.

Learning Objectives

These Lessons will teach you how to

Add files to your local IPFS node
Read files out of your local IPFS node
List the files in your IPFS node
Tell IPFS to hold onto files by pinning them

Key Concepts

Distinction between IPFS and your regular Filesystem
Identifying files by their Hashes
IPFS Garbage Collection
Pinning files on an IPFS Node

Lessons

Next Steps

Lesson: Add Content to IPFS and Retrieve It

Goals

After doing this Lesson you will be able to

Add a file's content to IPFS
Read content out of IPFS using its hash
Explain the relationship between IPFS hashes and the content you've added

Steps

Step 1: Create a file that you will add to IPFS

You can add any type of content to IPFS. For this lesson we will put some text content into a `.txt` file, but you can do this same process with any content or any file.

It would be a good idea to make a new directory for this example. Navigate to somewhere you are comfortable putting a new folder (such as ~/Desktop), and then create a new directory and go into it. Here is an example command:

$ cd ~/Desktop
$ mkdir ipfs-tutorial
$ cd ipfs-tutorial

Now, create a file called mytextfile.txt and put the text "version 1 of my text" in it. One easy way to do this on the command line is with this command:

$ echo "version 1 of my text" > mytextfile.txt

You can read the file's contents using the cat command:

$ cat mytextfile.txt
version 1 of my text

Step 2: Add the File to IPFS

$ ipfs add mytextfile.txt
added QmZtmD2qt6fJot32nabSP3CUjicnypEBz7bHVDhPQt9aAy mytextfile.txt

Save the hash QmZtmD2qt... that ipfs returned. This is the content's cryptographic hash. If the file's content changes, the hash will change, but if the file's content remains the same, the hash will always be the same.

Step 3: Read the content out of IPFS

Just like the regular cat command lets you read the contents of a file, the ipfs cat command lets you read the contents of a file that has been added to ipfs.

Use the ipfs cat command to read the content by passing it the content's cryptographic hash -- this is the hash that ipfs returned when you ran ipfs add mytextfile.txt.

$ ipfs cat QmZtmD2qt6fJot32nabSP3CUjicnypEBz7bHVDhPQt9aAy
version 1 of my text

Notice that this returned the content of the file, not the text file itself. That's because QmZtmD2qt... is the hash of the content, not the file itself. We'll test that in the next step.

Step 4: Confirm that the hash points to the content, not the file

When you used ipfs cat to read the file's contents it returned the content of the file, not the text file itself. That's because the hash QmZtmD2qt... is the hash of the content. You can test that by adding the text content directly to IPFS without ever putting it in a file.

$ echo "version 1 of my text" | ipfs add
added QmZtmD2qt6fJot32nabSP3CUjicnypEBz7bHVDhPQt9aAy QmZtmD2qt6fJot32nabSP3CUjicnypEBz7bHVDhPQt9aAy

The hash should be exactly the same as the hash you got when you added mytextfile.txt. If you want to triple-check, you can run each of these commands as many times as you want. The hash should always be the same.

$ ipfs add mytextfile.txt
added QmZtmD2qt6fJot32nabSP3CUjicnypEBz7bHVDhPQt9aAy mytextfile.txt
$ echo "version 1 of my text" | ipfs add
added QmZtmD2qt6fJot32nabSP3CUjicnypEBz7bHVDhPQt9aAy QmZtmD2qt6fJot32nabSP3CUjicnypEBz7bHVDhPQt9aAy
$ cat mytextfile.txt | ipfs add
added QmZtmD2qt6fJot32nabSP3CUjicnypEBz7bHVDhPQt9aAy QmZtmD2qt6fJot32nabSP3CUjicnypEBz7bHVDhPQt9aAy

As long as the content remains the same, you will always get the same hash. As far as IPFS is concerned, it is the same content.

Step 5: Change the content and get a different hash

Now change the text content to "version 2 of my text" and add it to ipfs. You will get a different hash.

As you confirmed in the previous step, you can add the new text directly to IPFS or you can modify mytextfile.txt and add it to IPFS. You will get the same hash either way.

$ echo "version 2 of my text" | ipfs add
added QmTudJSaoKxtbEnTddJ9vh8hbN84ZLVvD5pNpUaSbxwGoa QmTudJSaoKxtbEnTddJ9vh8hbN84ZLVvD5pNpUaSbxwGoa

Step 5: Pipe content from IPFS into a File

You can read this content (any version) out of ipfs and write it into a file. For example, you can toggle the contents of mytextfile.txt from "version 1" to "version 2" and back as many times as you want:

$ ipfs cat QmTudJSaoKxtbEnTddJ9vh8hbN84ZLVvD5pNpUaSbxwGoa > mytextfile.txt
$ cat mytextfile.txt
version 2 of my text
$ ipfs cat QmZtmD2qt6fJot32nabSP3CUjicnypEBz7bHVDhPQt9aAy > mytextfile.txt
$ cat mytextfile.txt
version 1 of my text

You can also write the content from ipfs into a completely new file.

$ ipfs cat QmZtmD2qt6fJot32nabSP3CUjicnypEBz7bHVDhPQt9aAy > anothertextfile.txt
$ cat anothertextfile.txt
version 1 of my text

Explanation

IPFS tracks content based on its cryptographic hash. This hash uniquely identifies exactly that content. As long as the content stays the same, the hash stays the same. If the content changes at all you will get a different hash.

If you have two different files that contain identical content, IPFS will track that content with one hash. The filenames are different, but the content is the same, so the hash of the content will be identical.

This leads to the question: how does IPFS track file names? That's the topic of the next lesson.

Next Lesson: Add Filenames and Directory Info to IPFS

Lesson: Wrap Filenames and Directory Info around Content

Goals

After doing this Lesson you will be able to

Add a file to IPFS, including its filename, permissions, etc.
Add directories to IPFS
Explain how IPFS represents two files that have identical content
Read content out of IPFS using the hash of a directory that contains the file

Steps

Step 1: Create the file you're going to add

You may already have this file from the previous lesson. If you do, make sure the content of the file matches. Otherwise the hashes you get won't match the examples in this lesson.

Create a file called mytextfile.txt and put the text "version 1 of my text" in it. Here is an easy way to do this on the command line:

$ echo "version 1 of my text" > mytextfile.txt

Step 2: Add the file to IPFS

$ ipfs add -w mytextfile.txt
added QmZtmD2qt6fJot32nabSP3CUjicnypEBz7bHVDhPQt9aAy mytextfile.txt
added QmPvaEQFVvuiaYzkSVUp23iHTQeEUpDaJnP8U7C3PqE57w

In the previous lesson, when you ran ipfs add mytextfile.txt without the -w flag, ipfs only returned one hash. This time it returned two hashes. The first hash QmZtmD2... is the same as before — it's the hash of the content inside the file. The second hash QmPvaEQF... is the hash of the directory and filename information that ipfs "wrapped" around our content.

In the next steps, you will use ipfs commands to see what that directory and filename information looks like and how you can use it.

Step 3: List the directory information

The -w flag tells ipfs to include the directory and filename information along with the content — it "wraps the file in a directory". For more info about this, run ipfs add --help and read the description there.

To list this directory and filename information, use ipfs ls. You will use the -v flag to include header information. To learn more about this command, run ipfs ls --help

$ ipfs ls -v QmPvaEQFVvuiaYzkSVUp23iHTQeEUpDaJnP8U7C3PqE57w
Hash                                           Size Name
QmZtmD2qt6fJot32nabSP3CUjicnypEBz7bHVDhPQt9aAy 29   mytextfile.txt

This command ipfs ls QmPvaEQFVvuiaYzkSVUp23iHTQeEUpDaJnP8U7C3PqE57w translates to "list the files referenced by the directory whose hash is QmPvaEQFVvuiaYzkSVUp23iHTQeEUpDaJnP8U7C3PqE57w".

The response shows that the directory contains one file — "mytextfile.txt" — and the hash of that file's content is QmZtmD2q...

Note that you had to use ipfs ls instead of ipfs cat to read this info because it's a directory. If you try to read the directory using ipfs cat you will get an error:


$ ipfs cat QmPvaEQFVvuiaYzkSVUp23iHTQeEUpDaJnP8U7C3PqE57w
Error: this dag node is a directory

Step 4: Read the File's contents using the parent directory's hash

You can use the directory's hash to read the file's content like this:

$ ipfs cat QmPvaEQFVvuiaYzkSVUp23iHTQeEUpDaJnP8U7C3PqE57w/mytextfile.txt
version 1 of my text

This command translates to "return the content that's referred to as mytextfile.txt within the directory whose hash is QmPvaEQFVvuiaYzkSVUp23iHTQeEUpDaJnP8U7C3PqE57w"

Bonus Steps

Some things to try:

Create a directory with multiple files. Tell ipfs to recursively add the directory and all of its files.
Create two different files with the same content. Add them both to ipfs with ipfs add -w and confirm that ipfs is re-using the hash of that content when it builds the directory and filename information.

Explanation

When you add a file to your ipfs repository, ipfs calculates the cryptographic hash of the file's contents and returns that hash to you. You can then use the hash to reference the file's contents and read them back out of the ipfs repository.

In order to keep track of information like filenames and paths, ipfs lets you "wrap" directory and filename information around the file contents you've added. The directory and filename information has its own hashes. This makes it possible to retrieve content from the ipfs repository using "ipfs paths" that are a combination of hashes, filenames and directory names.

Next Steps

Lesson: Pinning - Tell IPFS to Keep a File

Goals

This lesson covers the topic of "pinning" files in your IPFS repository and removing files with the ipfs garbage collector. Pinning is a very important concept in IPFS. Pinning is the mechanism that allows you to tell IPFS to always keep a given object local.

After doing this Lesson you will be able to

Tell IPFS to hold onto specific files in your local IPFS repository
Tell IPFS to clean up unwanted files from your local IPS repository

Steps

Step 1: Create the file you're going to add and pin

Create a file called foo.txt and put the text "ipfs rocks" in it. Here is an easy way to do this on the command line:

Step 2: Add the file to IPFS

Step 3: List objects pinned to local storage

The first object listed above is the foo.txt file. Objects added through ipfs add are pinned recursively by default.

There are three types of pins in the ipfs world:

a) direct pins, which pin just a single block, and no others in relation to it;

b) recursive pins, which pin a given block and all of its children;

c) indirect pins, which are the result of a given block's parent being pinned recursively.

Step 4: Unpin an object

You can unpin foo.txt like this:

Ok, now verify that it no longer exists:

Wait, it still appears to be there! Ok, you must run the garbage collector and then verify again:

IPFS has a fairly aggressive caching mechanism that will keep an object local for a short time after you perform any ipfs operation on it, but these objects may get garbage collected fairly regularly.

A pinned object cannot be garbage collected, if you don't believe me try this:

Explanation

IPFS nodes treat the data they store like a cache, meaning that there is no guarantee that the data will continue to be stored. Pinning a CID (hash) tells an IPFS node that the data is important and mustn’t be thrown away. You should pin any content you consider important, to ensure that content is retained long-term. Since data important to someone else may not be important to you, pinning lets you have control over the disk space and data retention you need.

Next Steps

Tutorial: Going Online - Joining the Distributed Web

Prerequisites

To do the lessons in this tutorial you must:

Be familiar with using the command line

Learning Objectives

These Lessons will teach you how to

Connect your local IPFS node to the IPFS network
Find/examine Peers on the IPFS network
Retrieve content from a Peer on the IPFS network

Key Concepts

Connecting and interacting with the IPFS network

Lessons

Lesson: Connect your node to the IPFS network

This lesson shows how to connect the IPFS node on your local computer to the IPFS network, or “the swarm”. Everything that you have done so far has been done locally. Now it gets a lot more interesting!

Prerequisites

To do the steps in this lesson you must:

Be familiar with using the command line

Goals

After doing this Lesson you will be able to

Start the IPFS daemon to connect your local node to the IPFS network

Steps

Step 1: Start the IPFS daemon

Start the IPFS daemon by running

$ ipfs daemon

You will see output from the daemon like the following:

Initializing daemon...
go-ipfs version: 0.5.0-dev-17e886e29
Repo version: 7
System version: amd64/linux
Golang version: go1.13.5
Swarm listening on /ip4/12.2.0.36/tcp/4001
Swarm listening on /ip4/127.0.0.1/tcp/4001
Swarm listening on /ip6/::1/tcp/4001
Swarm listening on /p2p-circuit
Swarm announcing /ip4/12.2.0.36/tcp/4001
Swarm announcing /ip4/127.0.0.1/tcp/4001
Swarm announcing /ip6/::1/tcp/4001
API server listening on /ip4/127.0.0.1/tcp/5001
WebUI: http://127.0.0.1:5001/webui
Gateway (readonly) server listening on /ip4/127.0.0.1/tcp/8080
Daemon is ready

Step 2: Examine your ipfs node id info

Let's look at the details of your connections made by the daemon with ipfs id. Open up another command line and run:

$ ipfs id
{
    "ID": "QmRX....xQTp",
    "PublicKey": "CAAS....AAE=",
    "Addresses": [
        "/ip4/127.0.0.1/tcp/4001/ipfs/QmRX....xQTp",
        "/ip4/12.2.0.36/tcp/4001/ipfs/QmRX....xQTp",
        "/ip6/::1/tcp/4001/ipfs/QmRX....xQTp",
        "/ip6/2802:285:8360:da70::9191/tcp/4001/ipfs/QmRX....xQTp",
        "/ip6/2802:285:8360:da70:5146:9a0a:e910:19c3/tcp/4001/ipfs/QmRX....xQTp",
        "/ip6/2802:285:8360:da70:ccb4:bd10:baa3:d022/tcp/4001/ipfs/QmRX....xQTp",
        "/ip4/83.24.208.218/tcp/26521/ipfs/QmRX....xQTp"
    ],
    "AgentVersion": "/go-ipfs/0.5.0-dev/17e886e29",
    "ProtocolVersion": "ipfs/0.1.0"
}

Note: The hashes above have been shortened for readability.

The "ID" field is your Peer ID, used to uniquely identify your node on the IPFS network. The "PublicKey" field goes along with your Peer ID, used under-the-hood by IPFS for public key cryptography. The "Addresses" shown are an array of IP addresses used for IPFS network traffic. Addresses using TCP port 4001 are known as "swarm addresses" that your local daemon will listen on for connections from other IPFS peers.

Step 3: Shutdown the daemon

You may shut down the daemon by typing Ctrl-C in the command line that you started with:

...
Daemon is ready
^C
Received interrupt signal, shutting down...
(Hit ctrl-c again to force-shutdown the daemon.)

Note: You may run the IPFS daemon as a background process using the command ipfs daemon &. If you want to stop the background process just type fg (foreground) to bring that process to the foreground and stop it with Ctrl-C.

$ ipfs daemon &
pid 8469
$ Initializing daemon...
go-ipfs version: 0.5.0-dev-17e886e29
Repo version: 7
System version: amd64/linux
Golang version: go1.13.5
Swarm listening on /ip4/10.0.0.35/tcp/4001
...
Gateway (readonly) server listening on /ip4/127.0.0.1/tcp/8080
Daemon is ready
fg
ipfs daemon
^C
Received interrupt signal, shutting down...
(Hit ctrl-c again to force-shutdown the daemon.)

Explanation

You run the IPFS daemon in order to have your local IPFS node become part of the IPFS network and listen to other IPFS peers.

Next Lesson: Find Peers on the Network

Lesson: Find Peers on the Network

This lesson shows how to find and examine the peers you connect to on the IPFS network. You will use the ipfs swarm and ipfs id tools for this purpose. The swarm is the component that opens, listens for, and maintains connections to other IPFS peers. You can also examine connected peers and the network using the Web UI.

Prerequisites

To do the steps in this lesson you must:

Be running the ipfs daemon

Goals

After doing this Lesson you will be able to

Find and examine peers on the IPFS network

Steps

Step 1: Start the IPFS daemon

Start the IPFS daemon by running

$ ipfs daemon

Step 2: Find peers that we are connected to

You can use the command ipfs swarm peers to examine for connected peers:

$ ipfs swarm peers
/ip4/147.75.69.143/tcp/4001/ipfs/QmNn....GAJN
/ip4/147.75.83.83/tcp/4001/ipfs/QmbL....75Nb
/ip4/147.75.85.167/tcp/4001/ipfs/QmXA....qhfW
/ip6/2604:1380:0:c100::1/tcp/4001/ipfs/QmQC....uLTa
/ip6/2604:1380:3000:1f00::1/tcp/4001/ipfs/QmcZ....3dwt
/ip6/2604:1380:40b0:c00::3/tcp/4001/ipfs/QmYA....yYdN

Step 3: Examine a connected peer

You will use the ipfs id <hash> command to examine a connected peer:

$ ipfs id Qmf1...mx36
{
    "ID": "Qmf1....mx36",
    "PublicKey": "CAAS....AAE=",
    "Addresses": [
        "/ip4/127.0.0.1/tcp/4001",
        "/ip6/::1/tcp/4001",
        "/ip4/134.215.4.214/tcp/4001"
    ],
    "AgentVersion": "go-ipfs/0.4.21/8ca278f45",
    "ProtocolVersion": "ipfs/0.1.0"
}

Note: The "ID" field shown above is the Peer's ID, and this was also the hash that was shown when you ran ipfs swarm peers. Peers are identified on the network directly by their Peer ID.

Step 4: Examine using the Web UI

The IPFS daemon also serves up a modern Web UI that you are able to open in a browser. Did you notice when you started the daemon that there was the following?

Open the link above in your browser. You will see the Web UI displayed with sections on Status, Files, Explore, Peers, and Settings. Click on the Peers section and you will see a world map indicating the location of connected peers. Scroll down the page to see information on each of the peers, their country/city location, network latency, Peer ID, etc. Spend some time looking at the other different sections of the Web UI.

Explanation

Once you have connected to the IPFS network by running the daemon, other IPFS nodes (peers) will begin to connect and communicate with your node. Using the commands ipfs swarm and ipfs id allows you to examine the connected nodes. The Web UI also shows in-depth information about peers.

Next Lesson: Retrieve content from a Peer

Proceed to the next lesson to learn how to Retrieve content from a Peer

Lesson: Retrieve content from a Peer

Prerequisites

To do the steps in this lesson you must:

Be familiar with using the command line

Goals

After doing this Lesson you will be able to

Access any content through your local IPFS node using its command line interface

Steps

Step 1: Start the IPFS daemon

Start the IPFS daemon by running

If the daemon is not running, your IPFS node won't be able to retrieve content from other nodes on the network.

Step 2: Read the content on the command line

Explanation

You can use a local IPFS node to read content from the worldwide IPFS network. One way to do this is through the command line using commands like ipfs cat and ipfs ls. When you pass the content-addressed (hash) identifiers of the content you want into these commands, your IPFS node will check to see if it has a local copy of the content you're requesting. If your node has a local copy, it will return that content to you immediately. If your node does not have a local copy, it will attempt to find a peer on the IPFS network that does have the content. As long as at least one peer has the content you want, your IPFS node will be able to find that peer, retrieve the content from the peer, and return that content to you.

This is the essential function of an IPFS node. It uses content-addressed (hash) identifiers to find content on the peer to peer network. It also provides that content to other peers who want it.

Next Steps

Tutorial: Interacting with the Classical (HTTP) Web

Prerequisites

To do the lessons in this tutorial you must:

Be familiar with using the command line

Learning Objectives

These Lessons will teach you how to

Use your browser to retrieve content through different IPFS gateways

Key Concepts

Flexibly downloading content from the IPFS network

Lessons

Lesson: Use an HTTP browser to retrieve files from local IPFS gateway

Prerequisites

To do the steps in this lesson you must:

Goals

After doing this Lesson you will be able to

Access any content through your local IPFS node's HTTP gateway

Steps

Step 1: Start the IPFS daemon

Start the IPFS daemon by running

$ ipfs daemon

If the daemon is not running, your IPFS node won't be able to retrieve content from other nodes on the network. It also won't start the HTTP gateway that you're going to use in Step 2.

Step 2: Read request content through your IPFS node's HTTP gateway

You must tell the gateway whether you're requesting content with an IPFS hash or an IPNS hash. If you're using the hash of a specific snapshot of content -- for example a file that someone added to IPFS, use a path that starts with /ipfs/. If you're using an IPNS hash to get the latest version of some content that gets updated over time, for example a website that gets fresh content every day, use a path that starts with /ipns/.

Explanation

You can use a local IPFS node to read content from the worldwide IPFS network. The two ways of interacting with your local node are 1) through the command line and 2) through the HTTP gateway. You can use either of those interfaces to pass IPFS the content-addressed (hash) identifiers of the content you want. The IPFS node will use those identifiers to find that content on the network and retrieve it for you.

Next Steps

Lesson: Access IPFS content through any IPFS gateway

Goals

This lesson covers using any IPFS gateway to access IPFS content. It's a condensed review of the Lesson on Using an HTTP browser to retrieve files from a local IPFS gateway

After doing this Lesson you will be able to

Use the HTTP address of any IPFS gateway to access IPFS content

Steps

Step 1: Get the address of a gateway

For these examples we will use the gateway at http://dweb.link

Step 2: Build the Path to your Content

As described in the Lesson on Using an HTTP browser to retrieve files from local IPFS gateway, you must tell the gateway whether you're requesting content with an IPFS hash or an IPNS hash. If you're using the hash of a specific snapshot of content -- for example a file that someone added to IPFS, use the path /ipfs/<your-ipfs-hash>. If you're using an IPNS hash to get the latest version of some content that gets updated over time, for example a website that gets fresh content every day, use the path /ipns/<your-ipns-hash>

Step 3: Request the content from the gateway

Combine the gateway's address (ie. http://dweb.link) with the path to your content (ie. /ipfs/<your-ipfs-hash>). Use that to request the content.

Explanation

With the above examples, we are using an HTTP connection over the internet to someone (http://dweb.link) providing a gateway onto the IPFS network. In this way you can access information in the IPFS network at large, and you do not need to run your own IPFS gateway.

TODO

Restricting the content that your gateway will serve
Security concerns -- the gateway can see all the things that an HTTP server can see.

Next Steps

Tutorial: The Myriad ways to Access and Distribute IPFS Content

These Lessons are tested with go-ipfs version 0.5.0. Please update this file on github to reflect any other versions that have been tested.

All of the lessons use the same content: a snapshot of the Turkish version of Wikipedia.

Learning Objectives

These Lessons will teach you how to

Define content addressing and compare it with location-addressing
Use IPFS content hashes to access the same content in many ways with the same link
Access content through the public IPFS gateways at ipfs.io
Access content through any IPFS node's http gateway
Access content using the IPFS browser extension
Access IPFS content through Tor
Use a sneakernet to move and redistribute IPFS content
Explain the implications of being able to access IPFS content through so many different paths

Lessons

Review the lesson on Retrieving content from a peer
Review these lessons from the Tutorial on Interacting with the Classical (HTTP) Web
- Review: Lesson: Using an HTTP browser to retrieve files from local IPFS gateway
- Review: Lesson: Using the public IPFS gateways at ipfs.io

Next Steps

Review these lessons from the Tutorial on Interacting with the Classical (HTTP) Web

Review: Access IPFS content through any IPFS gateway

Lesson: Access IPFS content through Tor gateways (experimental)

Goals

This lesson covers accessing IPFS content through Tor gateways.

After doing this Lesson you will be able to

Use the Tor browser and a public IPFS gateway on the Tor network to access IPFS content

Steps

Step 1: Download the Tor browser

Step 2: Request the content you want from the IPFS-Tor gateway

ipfs4uvgthshqonk.onion is a volunteer-run IPFS Gateway on the Tor network. You will use this gateway to request IPFS content. (Warning: The IPFS project does not run this gateway. We cannot guarantee stability or security.) There are probably many other IPFS gateways on the Tor network. You can use any of them in this way -- simply replace ipfs4uvgthshqonk.onion with the name of the gateway you're trying to access.

Explanation

This approach relies on the IPFS gateway at ipfs4uvgthshqonk.onion to retrieve content from the IPFS network for you. The difference with this gateway, as opposed to the gateways at ipfs.io, is that it's listening for requests directly over Tor protocol. This allows you to access the gateway anonymously.

Next Steps

Lesson: Run IPFS over Tor transport (experimental)

IPFS has an experimental feature that allows an IPFS node to interact with other IPFS nodes over the Tor transport protocol. The goal of this feature is to allow IPFS nodes to anonymously communicate with each other. This feature is experimental! Until we have tested this feature and removed the "experimental" designation, you should assume that information about your node might leak.

Prerequisites

To do the steps in this lesson you must:

Be familiar with using the command line

Goals

After doing this Lesson you will be able to

Configure an IPFS node to use the Tor transport
Request content through that node

Steps

Step 1: Configure IPFS to use the Tor transport

Step 2: Start the IPFS daemon

Start the IPFS daemon

$ ipfs daemon

Step 3: Request the content you want from your local IPFS node's gateway

Explanation

This feature is experimental! Until we have tested this feature and removed the "experimental" designation, you should assume that the explanation here is aspirational and provisional. We are describing what should be true but we have not yet tested and confirmed that the approach works without leaking information.

When you configure an IPFS node to use the Tor transport, the node will pipe all of its peer-to-peer communications through the Tor onion network. This means that when you request content from your local node, whether through its http gateway at localhost:8080 or through the command line, the node will access the IPFS network over the tor transport protocol. When it connects with peer nodes on the IPFS network, the peers will not know which node they are talking to nor where it is.

Next Steps

Lesson: Access IPFS content through a browser extension

This is a placeholder. There are currently four web browser extensions that help your retrieve content from IPFS. Each works in slightly different ways. We are in the process of consolidating that code and making it more secure before we encourage people to rely on it.

When the IPFS browser extension is complete, we will publish it on the app stores for all of the browsers that support it. When you download the extension, it will automatically recognize IPFS links and will use the IPFS peer-to-peer network to retrieve the content for you -- no HTTP gateway needed, nothing else to install on your computer, no need to use the command line. You will only have to install the browser extension and the whole IPFS network will become available to you.

2017-04-30 snapshot: dweb:/ipfs/Qme2sLfe9ZMdiuWsEtajWMDzx6B7VbjzpSC2VWhtB6GoB1/wiki/Anasayfa.html
latest (IPNS): dweb:/ipns/QmQP99yW82xNKPxXLroxj1rMYMGF6Grwjj2o4svsdmGh7S/wiki/Anasayfa.html
latest (DNS): dweb:/ipns/wikipedia-on-ipfs.io

Next Steps

Lesson: Explore the types of software that use hash trees to track data (to come)

Disclaimer: Dynamic content on IPFS is a Work in Progress (to come)

Lesson: Add data to the DAG (locally) (to come)

Lesson: Tell peers about your Changes (to come)

Lesson: Use hashes to get someone's changes from IPFS (to come)

Lesson: Use a pub/sub strategy to pass around messages about changes (to come)

Lesson: Resolve conflicts with a merge strategy (CRDTs) (to come)

Privacy and Access Controls on the Distributed Web (to come)

Reader Privacy & Writer Privacy (to come)

Private Networks (to come)

Encrypting Content (to come)

More dynamic encryption: capabilities-based encryption (to come)

Comparing with the classic HTTP web (feudal security, etc) (to come)

Keeping Data Alive: Durable Data on the Permanent Web (to come)

IPFS Cluster (to come)

Filecoin (to come)

Distributed Computation (to come)

The Power of Content-addressing

Goals

This lesson introduces the concept of content addressing and explores the powerful implications of using this approach.

After doing this Lesson you will be able to

Define content addressing and compare it with location-addressing
Explain the implications of being able to access IPFS content through so many different paths

Explanation

The Problem: Identifying Content by its Location

When you use an http:// or https:// link to point to a webpage, image, spreadsheet, dataset, tweet, etc, you're identifying content by its location. The link is an identifier that points to a particular location on the web, which corresponds to a particular server, or set of servers, somewhere on the web. Whoever controls that location controls the content. That's how HTTP works. It's location-addressed. Even if a thousand people have downloaded copies of a file, meaning that the content exists in a thousand locations, HTTP points to a single location. This location-addressed approach forces us all to pretend that the data are in only one location. Whoever controls that location decides what content to return when people use that link. They also decide whether to return any content at all.

If I identify the book by its content, saying "Check out the book called Why Information Grows by César Hidalgo. The ISBN is 0465048994.", you will be able to get any copy of the book from any source and know that you're reading the information I recommended. You might even say "Oh. I already read it." or "My roommate has it in the other room. I'll borrow it from him.", saving yourself the cost or effort of getting another copy.

By contrast, if I used location-addressing to identify the book, I would have to point to a location, saying something like "Go to the news stand at Market & 15th in Philadelphia and ask for the thing 16 inches from the south end of the third shelf on the east wall" Those instructions are confusing and awkward, but that is how http links work. They identify content by its location and they rely on the 'host' at that location to provide the content to visitors. There are lots of things that could go wrong with this approach. It also puts a lot of power and responsibility on the shoulders of whoever controls the location you're pointing to - in this case the news stand.

Let's consider the responsibilities of whoever controls the location we've pointed to. If the people running the news stand want my directions (aka. my "link") to remain valid, allowing people to access the book, they have to:

Always be open, 24/7, in case someone wants to read the book.
Provide the book to everyone who seeks the book, whether it's one person or hundreds of thousands of people.
Protect the integrity of the book by preventing anyone from tampering with it.
Never remove the book from its shelf - if they get rid of it, or even move it, my link is broken and nobody will be able to use my instructions to find the book.

Along with those responsibilities come a great amount of power. The proprietors of the news stand control the location that my directions point to, so they can choose to:

Dictate who is allowed to see the book.
Move the book without telling anyone.
Destroy the book.
Charge people money to access the book or force them to watch ads when they walk in the door.
Collect data about everyone who accesses my book, using that information however they want.
Replace the book with something else -- They might not even put a book there, since my instructions are just describing a location, a malicious actor could replace the book with something dangerous, turning the location into a trap!

Location-addressing has worked on the web for 25 years, but it's starting to get painful and It's about to get much worse. As long as we continue to rely on it, the web will continue to be unstable, insecure, and prone to manipulation or exploitation.

The Solution: Identify Information by its Fingerprint, not its Location

When we identify content in this way, using the content's cryptographic hash instead of its location to identify it, this is called content-addressing. The cryptographic hash for a piece of content never changes, which means content addressing guarantees that the links will always return the same content, regardless of where I retrieve the content from, regardless of who added the content to the network, and regardless of when the content was added. That's the essential power of using a content-addressed protocol like IPFS instead of using a location-addressed protocol like HTTP.

The Implications of Content Addressing

It lets us store data together.

This decentralized, content-addressed approach radically increases the durability of data. It ensures that data will not become endangered as long as anyone is still relying on it because anyone can hold a valid copy of the data they care about. If you hold a copy of a dataset on any of your devices, or if you pay someone to host it on an IPFS node for you, you become part of the network of stewards who protect that dataset from being lost. You won't have to worry about whether someone is going to turn off the servers where your data are hosted because you are one of the hosts. You and your peers hold the data among yourselves and are able to share the data directly with each other without relying on centralized points of failure.

It increases the integrity of data.

Decentralization also increases the integrity of data because links are content-addressed. This means we can validate data by checking the data's fingerprints against the links. That kind of validation is impossible with location-addressed links. This is especially powerful on the large scale, where millions of websites and datasets reference each other billions of times. With location-addressed links, all of those connections are brittle. With content-addressed links, the connections become resilient and reliable.

Links can come back to life.

As soon as any node has the content, everyone's links start working. Even if someone destroys all the copies on the network, it only takes one node adding the content in order to restore availability. A cryptographic hash permanently points to the content it was derived from, so IPFS links permanently point to their content. Even if the content becomes unavailable for a period, the links will work as soon as anyone starts providing the content again.

Harder to attack, easier to recover.

Even if the original publisher is taken down, the content can be served by anyone who has it. As long as at least one node on the network has a copy of the content, everyone will be able to get it. This means the responsibility for serving content can change over time without changing the way people link to the content and without any doubt that the content you're reading is exactly the content that was originally published.

The content you download is cryptographically verified to ensure that it hasn’t been tampered with.

IPFS can work in partitioned networks - you don’t need a stable connection to the rest of the web in order to access content through IPFS. As long as your node can connect to at least one node with the content you want, it works!

If one IPFS gateway gets blocked, you can use another one. IPFS gateways are all capable of serving the same content, so you’re not stuck relying on one point of failure.

Lightening the load: With IPFS, people viewing the content are also helping distribute the content (unless they opt out) and anyone can choose to pin a copy of some content on their node in order to help with access and preservation.

You can read anonymously. As with HTTP, IPFS can work over Tor and other anonymity systems

IPFS does not rely on DNS. If someone blocks your access to DNS or spoofs DNS in your network, it will not prevent IPFS nodes from resolving content over the peer-to-peer network. Even if you're using the DNSlink feature of IPFS, you just need to find a gateway that does have access to DNS. As long as the gateway you're relying on has access to DNS it will be able to resolve your DNSlink addresses.

IPFS does not rely on the Certificate Authority System, so bad or corrupt Certificate Authorities do not impact it.

IPFS nodes work hard to find each other on the network and to reconnect with each other after connections get cut.

(experimental) You can even form private IPFS networks to share information only with computers you've chosen to connect with.