The metadata harvesting process

Collibra uses Power BI REST APIs to harvest Power BI metadata.

For the lineage harvester to access metadata in Power BI workspaces, the required configuration must be in place in Microsoft Azure.

Note There are some limitations to the metadata harvesting process. Ensure that you understand these limitations before you start the harvesting process.

Tip The content in this topic differs according to the authentication method.

Overview of the metadata harvesting process with username / password authentication

Step  Description
1     The lineage harvester uses the username, password and application ID to access the Power BI APIs. These APIs retrieve basic Power BI metadata, for example, metadata about the Power BI tenant or server and about reports.
2     The lineage harvester uses Power BI API calls to retrieve more specific metadata, for example, Power BI columns and lineage.

Important The Power BI application in Microsoft Azure must be granted administrator rights, such as Office 365 Global Administrator or Power BI Service Administrator. Delegated permissions are supported.

Note The lineage harvester accesses the metadata of all Power BI workspaces. If you don't use filtering, all workspaces are ingested in Collibra. We recommend that you use filtering and domain mapping to structure your Power BI assets in Collibra.
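
As an illustration of step 1 with username/password authentication, the following minimal sketch shows how a client could request an access token from the Microsoft identity platform using the resource owner password credentials grant. It is an assumption about the kind of request involved, not a description of the lineage harvester's internals. The function name build_password_token_request and all values passed to it are hypothetical placeholders. The sketch only builds the request and does not send it.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Scope that covers the Power BI REST APIs.
POWER_BI_SCOPE = "https://analysis.windows.net/powerbi/api/.default"


def build_password_token_request(tenant_id, application_id, username, password):
    """Build (but do not send) a token request for username/password authentication.

    Hypothetical helper for illustration only.
    """
    url = f"https://login.microsoftonline.com/{tenant_id}/oauth2/v2.0/token"
    form = {
        "grant_type": "password",      # resource owner password credentials grant
        "client_id": application_id,   # application ID of the Azure application
        "username": username,
        "password": password,
        "scope": POWER_BI_SCOPE,
    }
    return Request(url, data=urlencode(form).encode("utf-8"), method="POST")


# Placeholder values; replace them with the values of your own environment.
request = build_password_token_request(
    "my-tenant-id", "my-application-id", "harvester@example.com", "example-password"
)
```

Passing the request to urllib.request.urlopen would send it. On success, the JSON response contains an access_token value that is sent as a bearer token on subsequent Power BI API calls.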

Overview of the metadata harvesting process with service principal authentication

Step  Description
1     The lineage harvester uses the application ID and the client secret key of the Azure Active Directory application to access the Power BI APIs. These APIs retrieve basic Power BI metadata, for example, metadata about the Power BI tenant or server and about reports.
2     The lineage harvester uses Power BI API calls to retrieve more specific metadata, for example, Power BI columns and lineage.

Note The lineage harvester accesses the metadata of all Power BI workspaces. If you don't use filtering, all workspaces are ingested in Collibra. We recommend that you use filtering and domain mapping to structure your Power BI assets in Collibra.
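
To make the service principal flow more concrete, the following minimal sketch builds two requests: a client credentials token request that uses the application ID and client secret, and an authorized call to the Power BI admin endpoint that lists workspaces. Which endpoints the lineage harvester actually calls is not stated in this topic, so the choice of endpoint is an assumption. The function names and all values are hypothetical placeholders, and nothing is sent over the network.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Scope that covers the Power BI REST APIs.
POWER_BI_SCOPE = "https://analysis.windows.net/powerbi/api/.default"
POWER_BI_API_ROOT = "https://api.powerbi.com/v1.0/myorg"


def build_service_principal_token_request(tenant_id, application_id, client_secret):
    """Build (but do not send) a client credentials token request.

    Hypothetical helper for illustration only.
    """
    url = f"https://login.microsoftonline.com/{tenant_id}/oauth2/v2.0/token"
    form = {
        "grant_type": "client_credentials",
        "client_id": application_id,     # application ID of the Azure AD application
        "client_secret": client_secret,  # client secret key of the same application
        "scope": POWER_BI_SCOPE,
    }
    return Request(url, data=urlencode(form).encode("utf-8"), method="POST")


def build_workspace_list_request(access_token, top=100):
    """Build (but do not send) an admin API call that lists workspaces.

    The endpoint choice is an assumption made for this sketch.
    """
    url = f"{POWER_BI_API_ROOT}/admin/groups?$top={top}"
    headers = {"Authorization": f"Bearer {access_token}"}
    return Request(url, headers=headers, method="GET")


# Placeholder values; replace them with the values of your own environment.
token_request = build_service_principal_token_request(
    "my-tenant-id", "my-application-id", "my-client-secret"
)
workspace_request = build_workspace_list_request("token-from-the-response")
```

The second request corresponds to the point made in the note above: an admin-level call returns every workspace in the tenant, which is why filtering is recommended before ingestion.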