Identity and access management

Identity and access management is how Vault governs access to APIs and data. This feature has two components:

Access control – to control user access to the APIs.
Policy management – to control user access to data.

Users and roles

A user represents the identity of a caller that requires access to an API operation or data in Vault. Each user has one role, and gains access to Vault using a bearer token that is an API key, JWT token from an identity provider (IdP) such as Auth0, Azure AD or any IdP supporting JWT. Each role is a collection of access control capabilities and data access policies.

Vault has a built-in Admin user, with a special role called Admin. This role has the special "CapSystem" capability, which provides access to all API operations. By default, the Admin user cannot access data in production. However, you can control whether the Admin user can access data using the PVAULT_SERVICE_ADMIN_MAY_READ_DATA environment variable.

note

If you're using the hosted version of Vault, contact us if you wish to provide the Admin user with the ability to read data.

Access control

Access control governs access to API operations. API operations are classified into scopes. For example, the collections scope represents all the REST API operations with the /api/pvlt/1.0/ctl/collections prefix in their path. Capabilities then define access to methods within the scope. For example, the "CapCollectionsReader" capability provides access to all the GET operations in the "collections" scope. Each role can have one or more of these capabilities.

Capability evaluation and data access

When a user makes a REST API call or uses a CLI command, Vault determines whether the user's role includes the capability to perform the requested action. For a REST API call, for example, if the user doesn't have the required capability, the call is denied and a 403 error is returned.

Policy management

Policy management determines whether a user is allowed to complete an operation based on the data that the operation manipulates and how the data is manipulated. To do this, policy management uses the collection of policies associated with the user's role. A policy is a rule that contributes to the decision about whether a user can execute the requested operation. Policies control access to data and therefore only apply to methods whose URL begins with /api/pvlt/1.0/data. In particular, they do not apply to methods that manage collections, properties and data types.

This information defines each policy:

Type, either allow or deny.
Operations, a list of actions on data, including:
- "read"
- "write"
- "delete"
- "search"
- "tokenize"
- "detokenize"
- "encrypt"
- "decrypt"
- "hash"
- "stats"
Resources, a list of resource identifiers that describe data. See Resource identifiers for more information.
Reasons, a list of reasons for accessing data.

Operations, Resources, and Reasons must contain at least one value. The value of "*" denotes all possible values.

Resource identifiers

Each resource identifier in the Resources list describes a set of data using a pattern for matching the data. There are five types of resource identifiers.

Resource Identifier Type	Resource identifier	Examples
Active Property	`{collection-name}/properties/{property-name-or-transformation-binding}`	`employees/properties/email` matches all values of the `email` property of all active objects, in all collections whose name begins with `employees`. `employees/properties/email.mask` matches the value of the `email.mask` transformation binding of all active objects, in all collections whose name begins with `employees`.
Archived Property	`{collection-name}/archived/properties/{property-name-or-transformation-binding}`	`/archived/properties/address*` matches all values of all properties whose name contains `address`, of all archived objects, in all collections.
Active Tokens	`{collection-name}/tokens`	`customers/tokens` matches all active tokens of all objects in the `customers` collection.
Archived Tokens	`{collection-name}/archived/tokens`	`customers/archived/tokens` matches all archived tokens of all objects in the `customers` collection.
Type	`{collection-name}/types/{type-name-or-transformation-binding}`	`credit_/types/CC` matches all values, of all properties, of all objects (both active and archived), in all collections whose name begins with `credit_` such that the data type of the property begins with `CC` (i.e. `CC_HOLDER_NAME`,`CC_NUMBER`,`CC_EXPIRATION_STRING`,`CC_CVV`) `credit_*/types/ssn.mask` matches all `mask` transformations of all values, of all properties, of all objects (both active and archived), in all collections for which the collection name begins with `credit_` such that the data type of the property matches SSN.

Note

The{collection-name}, {property-name-or-transformation-binding}, and {type-name} strings can contain alphanumeric ASCII characters (a-z, A-Z, and 0-9), an underscore ('_'), and any number of non-consecutive '*' characters.
The '*' character matches any number of consecutive characters.
The matching of all curly-braced segments in a resource identifier is case-sensitive.

See IAM configuration file reference for more details on defining policies.

Policy evaluation and data access

When a user performs an operation in Vault, through the CLI or an API, the policy engine evaluates the operation against each policy in the user's role. Each policy then votes on whether to allow the user access. The policy's vote can be for, against, or an abstention. The policy engine allows the operation only when there is at least one allow vote and no deny votes.

To see how this works, assume that a user attempts to update all the properties of an object in a collection called employees using the Update object REST API operation. Also, assume that the employees collection has four properties, first_name, last_name, phone_number, and ssn.

If a user has a role that includes only this policy:

[policies.WriteAll]
policy_type = "allow"
operations = ["write"]
reasons = ["*"]
resources = ["*"]

When the user attempts to update all the details for an object, the policy votes to allow access. This is because the policy permits the user to write data for any property and any reason. With no votes denying access, the policy engine allows the user to perform the operation.

If the role were to include this second policy:

[policies.WriteAll]
policy_type = "deny"
operations = ["tokenize"]
reasons = ["*"]
resources = ["*/phone_number"]

Now, when the user attempts to update all of an object's details, two policies vote. This policy abstains from voting because the requested operation isn't trying to tokenize a property. With one vote for and one abstention, the policy engine allows the user to perform the update.

However, if the role includes this third policy:

[policies.DenyWriteSSN]
policy_type = "deny"
operations = ["write"]
reasons = ["*"]
resources = ["employees/ssn"]

When the user attempts to update all of an object's details, this policy votes against allowing access. This is because the policy denies the user the ability to update the ssn property of the employees collection. Consequently, although there is still one vote for (and one abstention), the vote against allowing access means that the policy engine denies the request to update the object's details. For the REST API, this results in the call not executing, and a 403 error is returned.

Identity and access management

Users and roles​

Access control​

Capability evaluation and data access​

Policy management​

Resource identifiers​

Policy evaluation and data access​

On this page