tagging_across_spans

If morphs or words are broken across lines, columns, or pages in your diplomatic transcription, the tree-tagger will not respect the spans in the norm layer, so its tags will not line up with your norm cells. You can deal with this in two ways:

1. Account for it in your data layer before you tag (as shown in this video). Make a copy of the norm layer in a new sheet. In this new sheet:

  1. Unmerge all spans
  2. Add a new column with a serial ID (1, 2, 3…): select the content of the new column, then go to Edit > Fill > Series in the menu bar
  3. Sort by norm to get all the blanks together
  4. Sort the non-blanks by ID
  5. Tag the sorted non-blank cells
  6. Paste the tags back in
  7. Sort everything back by ID again
  8. Auto-stretch spans down
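
The steps above can also be sketched in code, for readers who prefer to script the procedure rather than do it by hand in a spreadsheet. This is a minimal sketch under stated assumptions, not the method shown in the video: the norm column is assumed to be a list of strings in which an empty string marks a continuation row of an unmerged span, a stretched span is represented by repeating the value from the row above, and `tag_tokens` is a hypothetical stand-in for whatever tagger you run (it takes a list of tokens and returns one tag per token).

```python
def tag_across_spans(norm_cells, tag_tokens):
    """Tag an unmerged norm column without losing span alignment.

    norm_cells: list of str; "" marks a continuation row of a span (assumption).
    tag_tokens: hypothetical callable, list of tokens -> list of tags.
    Returns a list of (norm, pos) pairs, one per original row.
    """
    # Step 2: add a serial ID so the original order can be restored.
    rows = [{"id": i, "norm": cell, "pos": ""}
            for i, cell in enumerate(norm_cells, start=1)]

    # Steps 3-4: group the blanks together, keep non-blanks in ID order.
    rows.sort(key=lambda r: (r["norm"] == "", r["id"]))
    non_blank = [r for r in rows if r["norm"] != ""]

    # Step 5: tag only the non-blank tokens.
    tags = tag_tokens([r["norm"] for r in non_blank])
    if len(tags) != len(non_blank):
        raise ValueError("tagger returned a different number of tags than tokens")

    # Step 6: paste the tags back next to their tokens.
    for row, tag in zip(non_blank, tags):
        row["pos"] = tag

    # Step 7: sort everything back by ID.
    rows.sort(key=lambda r: r["id"])

    # Step 8: stretch spans down by copying the row above into blank rows.
    for prev, cur in zip(rows, rows[1:]):
        if cur["norm"] == "":
            cur["norm"] = prev["norm"]
            cur["pos"] = prev["pos"]

    return [(r["norm"], r["pos"]) for r in rows]


# Usage with a toy lookup standing in for the real tagger (illustration only).
toy_tags = {"the": "DET", "cat": "NOUN", "sat": "VERB"}
toy_tagger = lambda tokens: [toy_tags[t] for t in tokens]
tagged = tag_across_spans(["the", "cat", "", "sat"], toy_tagger)
```

Here "cat" occupies two rows (for example, a word broken across a line), so its tag ends up on both rows and "sat" keeps its own tag.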

2. Manually correct after tagging.

You will need to search your ORIGINAL norm layer for spans and make sure they are aligned properly with the tagger output. You can look for them manually in a variety of ways. One example of manual correction is shown in this video:

  1. Select the ORIGINAL norm column (not the one you just pasted in; to be safe, you might rename the new one ignore:norm or something similar).
  2. Click the “unmerge cells” button to unmerge the spans.
  3. Using the Find function, find the next empty cell. (If the norm layer is selected, it will only find empty cells in that column.)
  4. In the norm column, select the empty cell and the cell above it; merge the two cells.
  5. Make sure the pos is aligned with the proper norm cell. You may need to select the relevant cells in ignore:norm and pos, then use Insert > Cell to insert a cell above each of them.
  6. Merge the cell in pos with the blank cell to correspond with the span in norm. (You do not have to merge the cell in ignore:norm; you will delete this layer soon.)
  7. Select the ORIGINAL norm column, and repeat until you are done.
  8. Select ignore:norm and delete the column.
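
If you would rather locate the next problem row programmatically than with the Find function, the check in step 5 can be sketched as follows. This is only an illustration under assumptions: both columns are plain lists of strings, an empty string is an unmerged span continuation, and the pasted-in ignore:norm column contains the same token strings as the original norm column. The function name is hypothetical.

```python
def first_misaligned_row(norm_cells, pasted_norm_cells):
    """Return the 1-based row where the pasted copy stops matching, or None.

    norm_cells: the ORIGINAL norm column, "" for span continuation rows.
    pasted_norm_cells: the ignore:norm column pasted back from the tagger.
    """
    rows = zip(norm_cells, pasted_norm_cells)
    for row_number, (original, pasted) in enumerate(rows, start=1):
        if original == "":
            # A continuation row should have no token of its own beside it.
            if pasted != "":
                return row_number
        elif original != pasted:
            return row_number
    return None


# Before correction: the tagger output is compact, so it drifts at the span.
before = first_misaligned_row(["the", "cat", "", "sat"],
                              ["the", "cat", "sat", ""])
# After inserting a cell at that row, the columns line up again.
after = first_misaligned_row(["the", "cat", "", "sat"],
                             ["the", "cat", "", "sat"])
```

The reported row is where you would insert a cell in ignore:norm and pos, after which the check can be run again to find the next span.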
tagging_across_spans.txt · Last modified: 2018/09/11 10:02 (external edit)