LOC Features Overview
- To understand features in LOC and their hierarchy.
- To understand logic types in LOC.
- To understand what are triggers, tasks and executions.
- To understand what are (data) events.
Terminologies: Features, Assets and Resources
In LOC, we'll refer various functionalities with the following terminologies:
Type | Description |
---|---|
Features | Accessible functionalities or asset types that can be created in LOC |
Assets | Created instance of features that owned by users |
Resources | Allocated hardware resources (CPU cores and memory) for LOC runtime |
The available resources for LOC runtime - which executes the data processes - is defined in your license based on your purchased options. The rest of LOC requires additional infrastructure resources to run its services, which has nothing to do with the runtime resources.
Features that can be created as assets:
- Projects and scenarios
- Logic
- Data processes
- Triggers
- Agent configurations
- Tags
Features that cannot be created as assets:
- Agents
- Task context and payload
- SDK types
A special case of asset is the workspace (unit), which is defined by your LOC license along with available hardware resources.
Project, Scenario and Data Process Hierarchy
In LOC, the hierarchy of projects, scenarios, data processes and logic are arranged as follows:
Term | Description |
---|---|
Unit | represents the topmost LOC development workspace for all projects. |
Project | represents a business logic which may contain several scenarios. |
Scenario | represents a use case in a business logic and may contain several data processes. |
Data process | represents a data pipeline that have several logic linked to it. |
Logic | represents a code module which contains scripts to perform data-processing tasks. |
A project owner owns all scenarios and data processes under it (but not the linked logic).
LOC asset naming rules
The name or any LOC assets cannot be blank, cannot exceed 128 characters and can only contain the following characters:
- spaces
- dot:
.
- dash:
-
- underscore:
_
- sharp:
#
- tilde:
~
- single quote:
'
- parentheses:
()
- square bracket:
[]
Spaces at both ends will be trimmed. All empty string is not allowed.
Whereas descriptions or assets have no such restriction and can be blank.
By default the entrypoint file path (the filename of source logic) is the same as the logic name.
Data Process and Logic
Logic Types
A data process always has at least one generic logic and only one aggreggaror logic, which are the two main logic types in LOC.
- Generic logic is the general purpose logic for performing any desired extracting, transformation or loading actions.
- Aggregator logic is responsible for returning finalised results from an executed data process. It is the last logic to be executed in a data process. Ideally, generic logic should pass their results to the aggregator logic via internal session storage variables.
Logic are designed to be reusable, so they are created and deployed independent of data processes. A data process is in fact a manifest of some metadata and a series of linked logic's IDs.
Currently LOC supports two languages for developing logic, both available in Studio and CLI:
- JavaScript (
ES6/ES2015
and above) - TypeScript (version
3.7.0
and above)
You can read the following tutorials to learn more about how to create and use logic:
Logic Modularisation
The same logic can be shared between more thah one data processes:
LOC thus llows you to create common, reusable logic modules to reduce duplicated code, logic coupling as well as development time. Of course, how and what to modularise your code really depends on use cases of your business and Data.
Cloud Logic vs. Native Logic
Depending on where the logic code is developed and deployed, both generic and aggregator logic stored in LOC Core can have two types - cloud and native logic:
These two types of logic essentially work in the same way with a few difference:
Logic type | Deployed from | Source code editable on the cloud | Support 3rd-party libraries | Suitable For |
---|---|---|---|---|
Cloud logic | Studio | Yes | No | Demostraion or proof of concept |
Native logic | CLI | No | Possible | General development |
Source code - or "entry file" - of both cloud and native logic can be viewed in Studio. However only cloud logic entry files are editable.
The only difference is that cloud logic would store both source code and compiled code directly in LOC, while native logic only uploads the compiled code (with the source code files or source logic remains on the developer's local machine).
Despite not being editable in Studio, native logic have the advantages of combining with local developmenting, testing and source control tools so that you can design and test-run logic before pushing them to production. The deployed assets can then be managed easily via Studio's graphic interface.
For demostration purposes we will still mostly use cloud logic in our tutorials. See CLI Handbook to learn more about CLI.
For maintaining data security, LOC's logic runtime does not allow third party packages to make external connections. You have to use SDK's (agents) to access external data sources. Other than that, you can add local code libraries and import them into different logic.
Agents
Each logic can access a range of internal and external data source functionalities, which are integrated into the LOC runtime. These "agents" allow logic to share data with other logic or access HTTP endpoints, file servers and databases, making LOC data processes a truly powerful data integration tool between a sea of data silos and legacy systems.
During an execution, the session storage agent can access the session storage and the event agent can access the event store.
You can find full detailed interface documentation at SDK Reference.
Agent configurations
For four of the agents that can access external data sources, they require special setups - agent configurations - for defining the data sources like web servers and databases.
You can learn how to set and use agent configurations in the following tutorials:
Tags
Tags are metadata that can be added to data processes and logic. They are useful for improve the readability and searchability in both Studio and CLI.
Triggers, Tasks and Executions
Triggers are features for invoking one or more data processes.
Trigger type | Trigger message source | Can has payload | Return result |
---|---|---|---|
API route (HTTP endpoint) | HTTP request | Yes | Synchronous/asynchronous |
Message queue | MQ client | Yes | Asynchronous |
Schedule | LOC scheduler | No | None (execution history only) |
A data process linked to a trigger asset is referred as a trigger task. A trigger can have multiple trigger tasks.
When a trigger asset is invoked, the LOC runtime will start an execution. Each data processes run in the execution is called a task. All tasks would be run simultaneously and would receive the same payload from the trigger if there is one.
Both execution and task would generate their unique IDs, and the execution result/logs would be stored as execution history.
Execution Result and Task Result
After the execution, a trigger would return a execution result, which is the combined output of all task results. A result in JSON format would look like this:
{
// execution metadata
"_status": 200,
"_metadata": {
"executionId": "...",
"triggerType": "ApiRoute",
"triggerId": "...",
"creationTimestamp": "...",
"completionTimestamp": "...",
"status": "success"
},
"data": {
// task result(s)
"task_result_1": {
// ...
},
"task_result_2": {
// ...
}
}
}
An execution result contains
- execution metadata (auto-generated by LOC):
- HTTP status code and execution status
- Trigger ID and type
- Execution timestamps
- At least one task results returned from each task (executed data process), which are fully customisable by user's logic
By default a trigger - synchronous API route or message queue - returns both execution metadata and task result(s). You can have an API route to return task results only by turning the encapsulation off.
If possible, the task result would be sent directly back to the trigger. The execution response (combined with task results from each data process) will also be stored in the execution histories, which can be queried later if the trigger is asynchronous.
You can learn how to create an API route with the following tutorial:
LOC Studio and CLI only supports managing API routes.
MQ and schedule were once available to manage via CLI in v0.7.0
, however since the new CLI is currently being ported from TypeScript to Rust, these functionalities will be available again in later releases.
Events
Events or data events are metadata emitted by users with Event Agent to the LOC event store, which is built on Elasticsearch.
Ideally, an event represents a business logic event and the metadata can be shared across different data processes, or to generate data lineage (which represents data flow between data processes).
You can read the following tutorials to learn more about using events:
In future releases events can even serve as triggers as well (invoke data processes with data events).
Other Active Metadata
Other than events, LOC offers some different types of automated logs:
- Build History (Building/compiling status logs of all cloud logic)
- Audit Logs (creation, execution and deletion of data processes) - currently not available in Studio
v1.4.x