Lumada Data Catalog provides a REST API to access metadata held in the catalog. The same API allows applications to insert metadata such as property values, tags, tag associations, and lineage relationships. The API provides access to the same operations available from the Data Catalog browser application.
The API uses JSON objects as request and response payloads. Each HTTP call returns a general pass/fail status; calls that fail at the Data Catalog server return a failure response message with an error code and a more detailed message.
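
The pass/fail handling described above could be wrapped in a small helper. The following is a minimal sketch, not the product's client library: the key names `errorCode` and `message` are assumptions about the failure payload, and `CatalogApiError` and `parse_response` are hypothetical names introduced for illustration.

```python
import json


class CatalogApiError(Exception):
    """Raised when the Data Catalog server reports a failed call."""

    def __init__(self, http_status, error_code, detail):
        super().__init__(f"HTTP {http_status}, code {error_code}: {detail}")
        self.http_status = http_status
        self.error_code = error_code
        self.detail = detail


def parse_response(http_status, body):
    """Return the decoded JSON payload, or raise CatalogApiError on failure."""
    try:
        payload = json.loads(body) if body else None
    except json.JSONDecodeError:
        payload = None  # body is not JSON; fall back to the raw text below

    if 200 <= http_status < 300:
        return payload

    # ASSUMPTION: "errorCode" and "message" are illustrative key names,
    # not confirmed names from the Data Catalog failure payload.
    if isinstance(payload, dict):
        code = payload.get("errorCode")
        detail = payload.get("message", body)
    else:
        code, detail = None, body
    raise CatalogApiError(http_status, code, detail)
```

Keeping the HTTP status and the server-side error code as separate attributes lets a caller distinguish transport-level failures from failures reported by the Data Catalog server itself.
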
The API accepts the same user credentials as the Data Catalog browser application. Before sending API calls, an application sends an authentication request. The server responds to a successful request with a session cookie, which the application then sends in the header of subsequent API calls. The cookie is valid for the length of the session.
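
The authenticate-then-call flow above can be sketched with the Python standard library. This is an illustration under stated assumptions: `BASE_URL` and `LOGIN_PATH` are placeholders (the real login path is not given in this overview), and the `username`/`password` key names in the login body are assumed rather than confirmed.

```python
import json
import urllib.request
from http.cookiejar import CookieJar

# ASSUMPTIONS: placeholder host/prefix and a hypothetical login path.
BASE_URL = "http://localhost:8082/api/v2"
LOGIN_PATH = "/login"


def make_session():
    """Create an opener that stores and resends cookies automatically."""
    jar = CookieJar()
    opener = urllib.request.build_opener(urllib.request.HTTPCookieProcessor(jar))
    return opener, jar


def build_login_request(username, password):
    """Build the authentication request with a JSON credentials body."""
    # ASSUMPTION: the key names "username" and "password" are illustrative.
    body = json.dumps({"username": username, "password": password}).encode("utf-8")
    return urllib.request.Request(
        BASE_URL + LOGIN_PATH,
        data=body,
        headers={"Content-Type": "application/json", "Accept": "application/json"},
        method="POST",
    )


def login(opener, username, password):
    """Send the login request; the opener's jar keeps the session cookie."""
    with opener.open(build_login_request(username, password)) as resp:
        return json.loads(resp.read().decode("utf-8"))


def get_json(opener, path):
    """Call an endpoint; the stored session cookie is attached by the opener."""
    req = urllib.request.Request(BASE_URL + path, headers={"Accept": "application/json"})
    with opener.open(req) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

A caller would run `opener, jar = make_session()`, then `login(opener, user, password)` once, and reuse the same `opener` for later calls such as `get_json(opener, "/datasource")`, so the session cookie travels with each request until the session ends.
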
name | path | methods | description |
---|---|---|---|
Agent | | | Endpoints under /agent provide access to Agent objects. Agents are the processing engine components that can run on multiple remote clusters. |
Audit Events | | | Endpoints under /auditevent provide access to audit events, which record Waterline Data system and user activities. You can filter the returned events by characteristics such as user, date, or type of activity. |
Authentication | | | The Authentication API accepts the same user credentials as the Waterline Data browser application. These API calls create and destroy the session cookie that validates other API calls. |
Browse: Field-level | | | Endpoints under /browse/field manage the facets available when users view a list of fields for a given resource. |
Browse: Resource-level | | | Endpoints under /browse allow you to list and manage metadata for data resources based on their shared location. Compare with the /search functionality, which lists data resources that share metadata characteristics. Browse functionality depends on how data is stored and where it is located on a file system or database. |
ComponentNetworkStatus | | | |
Configuration | | | Endpoints under /config provide access to the configuration options that control job and web server behavior. |
Custom Header | | | The Custom Header API returns the customized header and footer details. |
Data Objects | | | |
Data Preview | | | Endpoints under /datapreview provide access to previews of data resource content, subject to the authorization of the current user. |
Data Resources | | | Endpoints under /dataresource provide access to DataResource objects in the Waterline Data repository. Waterline Data defines a data resource as an HDFS file or folder, a set of files organized in folders (known as a Waterline Data collection), a Hive database or table, or a relational database or table. Each DataResource object is uniquely identified by a "key" value, assigned on creation. |
Data Sets | | | Endpoints under /dataset provide access to Virtual Dataset objects. A Waterline Virtual Data Set lets users group resources that have the same schema but span different folders in the data lake into a single virtual unit for easier management. Virtual Data Sets are user-defined virtual collections whose members share a matching schema but may have different path specifications or hierarchies with respect to a data source. |
Data Sources | | | Endpoints under /datasource provide access to DataSource objects. A Waterline Data data source is the location of an HDFS directory, a Hive instance or specific database, a cloud storage location, or a relational database instance or specific database. A data source hosts data resources in the form of folders, files, databases, and tables. |
DataOps | | | Endpoints under /dataops provide the following details: the number of top-level resources, including collections and data sets; the total number of fields in the data lake; all tag associations, at all levels; all significant objects hosted by WDC; a count of all items curated or authored by the community; all users seen on WDC; and the total number of searches performed on WDC. The details include catalogmetrics, metadataobjectmetrics, usagemetrics, and communitymetrics. |
Discussions | | | |
Entity Specifications | | | An entity specification defines properties for Waterline Data objects such as tags, users, roles, and data resources. Properties of entities are represented by PropertySpec objects. An EntitySpec object associates a set of properties with a specific entity type. Endpoints under /entityspec allow you to create, query, update, and delete properties associated with entities, including setting the access level for properties. |
External Sources | | | External sources are sources of metadata to be imported into the Waterline Data catalog. Endpoints under /externalsource provide a way to identify and connect to these outside applications. For example, the extensions created to import metadata from applications such as Apache Atlas or Cloudera Navigator use external sources to persist connection information for the application instance. |
Favorites | | | Endpoints under /favorites provide a way to manage user bookmarks on tags and resources. |
Homepage | | | |
JobManagement | | | |
JobManagement | | | |
JobManagement | | | |
JobManagement | | | |
Lineage | | | Waterline Data defines a LineageRelations object that records and presents the lineage relationships between data resources. Endpoints under /lineage provide access to these objects. |
Metadata Rest Server Config | | | Endpoints under /metadata-server synchronize the Metadata Rest Server configurations to the app-server. |
Metadata Server | | | Endpoints under /metadata-server provide access to Metadata Rest Server details objects. The Metadata Rest Server provides the endpoints that agents use to perform CRUD operations on the repository. |
Notification | | | |
Overlap tables | | | Endpoints under /overlap/table allow you to pass table information and receive a preliminary overlap report computed from the table schema. |
QueryController | | | |
Reviews | | | |
Roles | | | Waterline Data uses roles to indicate which data resources from the catalog are available to each user. User roles also determine which metadata management actions each user can perform inside Waterline Data. Endpoints under /role allow you to create new roles and modify existing roles. To associate roles with users, use a PUT request to /user/{key}. |
Rules | | | Endpoints under /rule allow you to list and manage rules. |
SSLController | | | |
Saved Search | | | Endpoints under /savedsearch allow you to list and manage saved searches. |
Search | | | Endpoints under /search allow you to collect data resources that share metadata characteristics. Compare with the browse functionality, which returns a set of data resources based on the resources' location. Searchable characteristics include facets and keywords. |
SessionController | | | |
Tag Associations | | | Endpoints under /tagassociation provide access to TagAssociation objects in the Waterline Data repository. A tag association object defines the relationship of a tag to a specific data resource (folder, file, collection, or table) or to a specific field in a data resource. |
Tag Associations Features | | | |
Tag Domains | | | Waterline Data tags are maintained in a tag glossary that is organized into tag domains. Endpoints under /tagdomain provide access to TagDomain objects. |
Tags | | | Endpoints under /tag provide access to Tag objects in the Waterline Data repository. Tags are labels associated with data resources (folders, files, collections, or tables) or with specific fields in data resources. |
Tokens | | | Endpoints under /token provide access to Token objects. Tokens authenticate inter-component communication in the distributed architecture. |
User Profiles | | | Endpoints under /user allow administrators to manage user profiles, represented by the User object. The caller must have a role that includes an Administrator access level to create, update, or delete user profiles. |
Virtual Folders | | | Endpoints under /virtualfolder provide access to VirtualFolder objects. Virtual folders allow users to group the resources belonging to a data source into smaller units for easier management. In addition, virtual folders allow a data resource to belong to multiple folders, which lets customers create folders with overlapping sets of data resources. |
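
The Roles entry in the table above says that roles are associated with users through a PUT request to /user/{key}. The sketch below only builds such a request; it is an assumption-laden illustration, not the documented payload. The base URL is a placeholder, the `{"roles": [...]}` body shape is hypothetical (the real endpoint may expect a complete User object), and `build_assign_roles_request` is a name introduced here.

```python
import json
import urllib.parse
import urllib.request


def build_assign_roles_request(base_url, user_key, role_names):
    """Build a PUT request to /user/{key} carrying the roles to associate."""
    # Percent-encode the key so characters such as spaces or slashes
    # cannot alter the URL path.
    safe_key = urllib.parse.quote(str(user_key), safe="")
    # ASSUMPTION: the body shape {"roles": [...]} is illustrative only.
    body = json.dumps({"roles": list(role_names)}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/user/{safe_key}",
        data=body,
        headers={"Content-Type": "application/json", "Accept": "application/json"},
        method="PUT",
    )
```

The returned request would be sent through an opener that already holds the session cookie, and the caller needs a role with an Administrator access level, as noted in the User Profiles entry.
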
type | description |
---|---|
AbstractField | |
AccessLevel | |
Agent | |
AgentConfigurationParams | |
AllValuesRequest | |
AllValuesTO | |
Asset | |
AssociationCount | |
AtomicField | |
AuditEvent | |
AuditEventListResponse | |
AuditFilterCriteria | |
BrowseRequest | |
BrowseResult | |
CatalogMetricsTO | |
CatalogMetricsUnit | |
Category | Enumeration that describes groupings for configuration properties. These categories are for organization and are not functional. |
CommunityMetrics | |
ComponentNetworkStatus | |
ConfigProperty | Type that describes the configuration settings used to control Waterline Data behaviors. Properties include controls such as whether to profile files that failed previously. The information provided for each property includes the property name, the label used in the UI, a description of what the property controls, and attributes for who and how the property can be viewed or changed. |
ContributionsCount | |
CreateAgentResponse | |
Credential | |
Credentials | Type that describes the username and password for the current user's authenticated session. Credential details are passed to the system but are never returned. |
CurrentUserInformation | |
CustomHeaderProperties | Type that defines the various properties in the custom header properties configuration file. |
CustomProperty | |
DataObjectRequest | |
DataPartition | |
DataPreview | |
DataResource | |
DataResourceFlexTO | |
DataResourceLite | |
DataSource | |
DataSourceDependencies | |
DataType | |
DatasetMember | |
DatasetRequest | |
DiscussionCommentTO | |
DiscussionItem | |
DiscussionItemsMetrics | |
DiscussionTopic | |
Edge | |
Entity | |
EntitySpecification | |
EventType | |
ExecutionMetrics | |
Experts | |
ExpertsType | |
ExtendedMetrics | |
ExternalSource | |
FacetResult | |
FacetSelection | |
Favorite | |
FieldNode | |
IterableOfAgent | |
IterableOfExternalSource | |
IterableOfTableComparisionTO | |
IteratorOfString | |
JobDetailsTO | |
JobExecutionResultTO | |
JobExecutionTO | |
JobSequenceTO | |
JoinConditionTO | |
KeyMetrics | |
LineageHopCountTO | |
LineageRelationGraph | |
LineageRelationPath | |
LineageRelations | |
LineageSearchPathTO | |
LoginResponse | Type that provides a response to a login attempt. Successful attempts return a SUCCESS flag and message and echo the username. Failed attempts return a FAILURE flag and a message that describes the reason for failing. |
LoginResult | Enumeration of the possible outcomes of a login attempt, including "SUCCESS" and "FAILURE". |
LogoutResponse | |
Member | |
MetadataObjectMetrics | |
MetadataServer | |
MetadataServerConnectionParams | |
Metrics | |
Node | |
Node | |
NodeKeyAndLevelWrapper | |
NodeType | |
Notification | |
NotificationPaginatedResponse | |
NotificationRequestTO | |
NotificationType | |
ObjectField | |
OpDetails | |
OperationExecution | |
OperationNode | |
OrderbySpec | |
PaginatedResponse | |
PaginatedResponse2OfDataResourceFlexTO | |
PaginatedResponseOfDataResource | |
PaginatedResponseOfDataResourceLite | |
PaginatedResponseOfDiscussionCommentTO | |
PaginatedResponseOfDiscussionTopic | |
PaginatedResponseOfJobDetailsTO | |
PaginatedResponseOfPairOfStringAndDataResource | |
PaginatedResponseOfPairOfStringAndString | |
PaginatedResponseOfSavedSearch | |
PaginatedResponseOfVirtualFolder | |
PagingCriteria | |
PairOfStringAndDataResource | |
PairOfStringAndString | |
Participation | |
ParticipationLevel | |
PathElement | |
PathInfo | |
PathSpecification | |
PatternTO | |
PatternTopBottomTO | |
PatternValue | |
PatternValue | |
PatternValue | |
PreFilterTO | |
Principal | |
Properties | |
Property | |
PropertySpecification | |
PropertyType | |
QueryRequest | |
QueryResult | |
RegexTester | |
RelatedResourcesMap | |
ReportedField | |
ResourceField | |
ResourceNode | |
ReviewsAggregate | |
Role | |
Rule | |
RuleAction | |
RuleScope | |
SSLDetailsTO | |
SavedSearch | |
SearchCriteria | |
SearchResult | |
SearchType | |
Semantic | |
Semantic | |
SequenceTO | |
SessionData | |
SolrConnectionParams | |
Source | |
SparkJob | |
State | |
State | |
StatusDetail | |
SystemNode | |
SystemTO | |
TableComparisionTO | |
Tag | |
TagAssociation | |
TagAssociationContainer | |
TagAssociationFeature | |
TagAssociationRequest | |
TagAssociationView | |
TagDetail | |
TagDetails | |
TagDomain | |
TagDomainContainer | |
TagDomainCountByResource | |
TagDomainCountTO | |
TagRequest | |
TagState | |
Token | |
UpdatedUserReview | |
UsageMetrics | |
User | |
UserInfo | Type that provides information about the current user. |
UserReview | |
UserTO | |
VirtualFolder | |
VirtualFolderDependencies | |
Visibility | |
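
The LoginResponse and LoginResult entries in the table above describe a flag, a message, and an echoed username. A client might model them as follows. This is a sketch only: the JSON key names `loginResult`, `message`, and `username` are assumptions, and the enumeration lists just the two outcomes named in the table, which may not be exhaustive.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class LoginResult(Enum):
    # Only the two outcomes named in the type table are listed here.
    SUCCESS = "SUCCESS"
    FAILURE = "FAILURE"


@dataclass(frozen=True)
class LoginResponse:
    result: LoginResult
    message: str
    username: Optional[str] = None

    @property
    def ok(self):
        """True when the login attempt succeeded."""
        return self.result is LoginResult.SUCCESS


def login_response_from_json(payload):
    """Convert a decoded JSON object into a LoginResponse."""
    # ASSUMPTION: "loginResult", "message", and "username" are
    # illustrative key names, not confirmed by the type table.
    return LoginResponse(
        result=LoginResult(payload["loginResult"]),
        message=payload.get("message", ""),
        username=payload.get("username"),
    )
```

Because `LoginResult(...)` raises `ValueError` for an unrecognized flag, any outcome beyond the two listed here surfaces immediately instead of being silently treated as a failure.
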