Download data

You can download your records on both the settings and databrowser page by clicking on "Download records". This will create a records-oriented JSON file.

Let's take a look at an export to better understand the schema:

[
  {
    "headline": "$200 Laptops Break a Business Model",
    "running_id": "1479",
    "__clickbait__MANUAL": null,
    "__clickbait__WEAK_SUPERVISION": "yes",
    "__clickbait__WEAK_SUPERVISION__confidence": "0.4685",
    "headline__entities__MANUAL": ["O", "O", "O", "O", "O", "O", "O"],
    "headline__entities__WEAK_SUPERVISION": [
      "MONEY",
      "MONEY",
      "O",
      "O",
      "O",
      "O",
      "O"
    ],
    "headline__entities__WEAK_SUPERVISION__confidence": [
      0.83, 0.83, 0.0, 0.0, 0.0, 0.0, 0.0
    ]
  }
]

headline and running_id were given in the initial data import. Labeling tasks are indicated with a double underscore __, and for each labeling task you have a triplet:

  • MANUAL: the manually set labeled for this task.
  • WEAK_SUPERVISION: the weakly supervised labeled for this task.
  • WEAK_SUPERVISION_CONFIDENCE: the confidence for the weakly supervised label.

Extraction tasks contain the labels per token. The attributes for each task are given at the beginning - if the attribute is full record, the triplets begin with a double underscore __.