Hi Chris. I'm mostly thinking of direct manipulation, consisting on clicking and tagging structured data. Prototype tools like
and
Lapis can infer the structure of a document and create patterns to scrap data collections that way.
This kind of tools usually create an API endpoint to be accessed programatically, which is not accesible to end users. I would pipe such extracted data through a cleanup tool like
or
, which convert a sequence of direct manipulation transformations into a persistent sequence of commands so it can be reapplied to new datasets.
As you see, these workflows already exist in several tools with different degrees of maturity. However, they are separate silos, making it difficult to transfer information among different applications; and they expose information either through an API endpoint or as a whole file.
My novel idea is to build an always-present capacity at the GUI level (either the windowing environment or mobile) which, like copy-paste, allows you to capture any information that the user can see and insert it at any other place that accepts editable content, but using these structured data transformation pipelines rather than raw text; and setting up some interactive widgets at the enpoint so that the user can shape the way data is injected into it. Again there are some "build your own workflow" commercial tools trying to create similar connectivity, but I don't see them working at this granularity I want to achieve of allowing a single interaction over a multiple selection of data.