A URL is a global address of documents and protocols to retrieve resource on a computer network. URLs occur most frequently in reference to web pages (HTTP) but can also be used for database access using JDBC, email (mailto), file transfer (FTP), and many other applications. The full form of URL is Uniform Resource Locator.
In this URL Vs. URI tutorial, you will learn:
- What is the URL?
- What is URI?
- Ven Diagram of URIs and URL
- Syntax of URL
- Syntax of URI
- Confusion about URN
- Why URL?
- Why URI?
- URL Vs. URI
A URI is a string containing characters that identify a physical or logical resource. URI follows syntax rules to ensure uniformity. Moreover, it also maintains extensibility via a hierarchical naming scheme. The full form of URI is Uniform Resource Identifier.
As mention in the above figure, there are two types of URI:
- URL: URL specifies a location on the computer network and technique for retrieving it.
- URN: Uniform Resource Name (URN) is an internet resource that specifies URN scheme.
- URL is a subset of URI that specifies where a resource exists and the mechanism for retrieving it, while URI is a superset of URL that identifies a resource
- The main aim of URL is to get the location or address of a resource whereas the main aim of URI is to find a resource.
- URL is used to locate only web pages, on the other hand, URI in used in HTML, XML and other files.
- URL contains components such as protocol, domain, path, hash, query string, etc. while, URI contains components like scheme, authority, path, query, etc.
- Example of URL is : https://google.com while example of URI is :urn:isbn:0-486-27557-4.
As mention in the above diagram, “your name” can be a URI because it identifies you. It cannot be URL since it does not assist any person to find your home location.
On the other hand, “your home location” can be URI as well as URL. The reason is both identify you and gives a home location for you.
Here is a Syntax of URL:
We can divide the above URL into the following parts:
- Protocol: It is the first part of the URL. Here, the protocol name is Hypertext Transfer Protocol (HTTP).
- http://www.domainname.com/: It is your domain name. It is also known as server id or the host.
- /folder-name/: It indicates that the website page referenced in “filed” in a given folder on the webserver.
- web-page-file-name.htm: It is actually a web page file name. The “.htm” is an extension for the HTML file, which shows that it is a static web page. File names can have different extensions or it is depend on how you have set up a web server. There could be no extension at all, and the URL could end with a slash line (/).
This example URL has a folder but no extension
This example URL has no folder
This example URL has no extension
Here is a syntax of URI:
URI = scheme:[//authority]path[?query][#fragment]
The URI includes the following parts:
- Scheme component: It is a non-empty component followed by a colon (:). Scheme contains a sequence of characters starting with a letter and followed by any combination of digits, letters, period (.), hyphen (-), or plus (+).Examples of well-known schemes include HTTP, HTTPS, mailto, file, FTP, etc. URI schemes must be registered with the Internet Assigned Numbers Authority (IANA).
- Authority component: It is an optional field and is preceded by //. It consists of
- Optional userinfo subcomponent that might consist of a username and password (optional).
- A host subcomponent containing either an IP address or a registered name.
- An optional port subcomponent that is followed by a colon (:)
- Path: A path contains a sequence of segments that are separated by a slash.
- Query component: It is optional and preceded by a question mark (?). Query component contains a query string of non-hierarchical data.
- Fragment component: It is an optional field and preceded by a hash (#). Fragment component includes a fragment identifier giving direction to a secondary resource.
Example of URI
No protocol mentioned
Domain not mentioned
ldap://[2001:db8::7]/c=GB?objectClass?one mailto:email@example.com tel:+1-816-555-1212 telnet://192.0.2.16:80/
There is a confusion about URN that, if you implement protocols like https, ftp, etc, then it is called a URL, even though it is a URI.
The problem with such a debate is that appropriate RFC are extremely dense and sometimes even contradictory. For example, RFC 3986 says, URI can be either a name, locator or both.
Here are the important reasons of using URL:
- The Information written in the URL gives you the ability to switch from one web page to another with just one mouse click.
- URL tells you how to access a particular resource.
- Whenever you type a URL into your browser or click any hypertext link, your web browser sends a request to a web server to download one or more files.
- Every URL is unique and identifies one particular file.
- A website URL or domain is one of the most crucial parts of your website. By using simple words or string that usually end with a .org, .com, or .net, you would be able to get traffic to your website.
Here are the important reasons of using URI:
- A Uniform Resource Identifier is essential to the semantic web because it prevents ambiguity.
- A URI search the name as well as the location of a resource or file, which is in a uniform format.
- It has a string of characters for the specific filename and path.
- URI provides a method for resources to be accessed by other systems over the World Wide Web or across a network. It is used by web browsers and P2P (Peer to Peer) file-sharing software to find and download files.
- URI allows new file types to be defined without affecting old files you have.
- You can assign a single resource to associate with multiple representations.
Here are the main differences between URL and URI:
|URL stands for Uniform Resource Locator.||URI stands for Uniform Resource Identifier.|
|URL is a subset of URI that specifies where a resource is exists and the mechanism for retrieving it.||A URI is a superset of URL that identifies a resource either by URL or URN (Uniform Resource Name) or both.|
|The main aim is to get the location or address of a resource||The main aim of URI is to find a resource and differentiate it from other resources using either name or location.|
|URL is used to locate only web pages||Used in HTML, XML and other files XSLT (Extensible Stylesheet Language Transformations) and more.|
|The scheme must be a protocol like HTTP, FTP, HTTPS, etc.||In URI, the scheme may be anything like a protocol, specification, name, etc.|
|Protocol information is given in the URL.||There is no protocol information given in URI.|
|Example of URL: https://google.com||Example of URI:
|It contains components such as protocol, domain, path, hash, query string, etc.||It contains components like scheme, authority, path, query, fragment component, etc.|
|All URLs can be URIs||Not all URIs are URLs since a URI can be a name instead of a locator.|