- Learn about new and experimental features that have been introduced to a subset of maps in the David Rumsey Map Collection.
- Discover what’s new, how to make the most of the new available tools, and how new computational methods enabled this new way search maps.
- Find out how to get involved.
- Quick Start
- Search Text on Maps
- View Search Results
- Become a Contributor
In addition to a traditional search of Catalog Data (e.g. titles, authors, dates), the Luna Viewer now allows you to search the text content of many maps. We call this ‘Text on Maps”.
Only maps that have been geo-referenced can be searched by the text they contain.
Using the Advanced Search, you can perform combined searches of Catalog Data and Text on Maps. In the Advanced Search, you also can specify which version of the Text on Maps you want to search. Read more about these options here.
Not only can you view the text you searched for, you can also view all other text on this map and correct the underlying data!
When you arrive from the search page, you will see only your search term highlighted. To see or hide all Text on Maps for this map, click on the icon you see below in the top left corner of the map image.
To edit a particular selection, click on the highlighted text. Now, you will see the annotation pop up. To learn more about how to annotate (correct, confirm, etc.) text on maps, read this guidance.
To edit an incorrect bounding polygon around text, click on the highlighted text. In the pop up, click ‘edit’. You will see the vertices of the polygon appear: you can move these to improve the way that the polygon surrounds the text, or you can delete a polygon that has been created in error. Read this guidance for more details.
Please note, you must be logged in to edit the data.
If you want to try this new feature, type a word or phrase that interests you in the search field in the top left of your browser window. Then, from the drop down menu, select “Text on Maps”.
The other options (“Catalog Data”, “Catalog Data & Text in documents”, and Advanced Search) are reviewed in more detail below.
- The initial prompt for searching “Text on Maps” accepts 1-word queries. If you want to search for multiple words, please see the section on this below.
- Searches are not case sensitive, nor can they accept regex.
- Although we are working on improving the performance of mapKurator for all languages, it is currently not possible to search Text on Maps for words in non-latin alphabets.
- Your search results will be displayed in a random order that will change each time. So, if you run the same search more than once you will see the same results, but displayed in a different order.
- If you want to refine the results of your query with filters based on Catalog Data, you need to use the Advanced Search feature.
When you select “Text on Maps”, you will search for occurrences of the word on the entire dataset. (Read more here to learn about how this data was created.) For example, if you type “Paris” you will see how many times that word is printed on any of the ~57,000 maps in the collection that we have processed. (This includes text within and outside of the neatline, e.g. it includes map titles and other descriptive information.) The searchable datasets represents content on David Rumsey collection maps that have been digitized and georeferenced up to 2022.
Note: Georeferencing means establishing points on the scanned image that point to locations on the earth. These control points allow content on the digitized map to be geolocated. You can try georeferencing out yourself.
mapKurator output is saved only as individual words: there is no prediction of phrases, for example, for places names that contain more than one word (e.g. “South Ponte Vedra Beach”)
However, multi-word search is possible because of the way the data has been indexed. Briefly, multi-word searches will be successful when adjacent words are within a 2-character length from the two points of the bounding polygon that are the furthest away from each other. This is based based on the size of the characters in the least common word in the search, e.g. “Ponte” below.
Example of multi-word search.
Advanced searching allows you to combine queries of Text on Maps with Catalog Data, i.e. you can leverage the power of the search by text, but also filter the results by their metadata. You can access the advanced search options by clicking on the dropdown menu in the search field.
Clicking on the “Advanced Search” option, you will access a dedicated interface where you can refine your queries in the collection, using the different fields. You can choose one or more criteria in your search.
Find all these words: Click on the drop down menu of the first white box and select one of the data fields from the David Rumsey Map Collection catalog, for example “country”. In the corresponding box on the right, you can type the value that you want to match, for example “United States”. As a result, you will get all maps that have both the words “United” and “States” as a value for “country”.
Find any of these words: Click on the drop down menu of the first white box and select one of the data fields from the David Rumsey Map Collection catalog, for example “country”. In the corresponding box on the right, you can type the value that you want to match, for example “United States”. As a result, you will get all maps that have the words “United” OR the word “States” as a value for “country”. (this will include, for example, maps featuring the United Republic of Congo).
Find this exact wording: Click on the drop down menu of the first white box and select one of the data fields from the David Rumsey Map Collection catalog, for example “country”. In the corresponding box on the right, you can type the value that you want to match, for example “United States of America”. As a result, you will get ONLY the maps that exactly match your query and have “United States of America” as a value (this will NOT include maps that have only “United States” as a value).
Date range: Click on the drop down menu to select the data field you want to search. The options are either “date” (the date the map was printed) or “pub date” (the publication date of the item in which a map appears, for example an atlas). In the corresponding fields on the right, enter two dates that represent the beginning and end of your date range. For example “1830” and “1850”. As a result, you will see all the maps that have a date (or pub date depending on your query) that falls in that range. You can combine this filter with the previous one.
Words in text on maps: click on the drop down menu to choose what type of text on maps you want to search.
The options are:
- “Mapkurator output”: This is the raw output of the machine learning pipeline (mapKurator) applied to the maps, in other words it is the corpus of mapKurator’s predicted transcription of text within its predicted bounding polygon. No subsequent edits or processes have been applied.
- “MapKurator output (post-processed)”: In this dataset, the output from mapKurator has undergone another processing step. Based on the predicted transcription and the geographic coordinates associated with it, another machine learning module attempts to match the predicted text against a vocabulary of feature names in Open Street Map. Content in this field is always CAPITALIZED. This process may help reduce errors, but can also introduce new ones, and it could make some less known features less visible.
- “User annotations”: Updates to mapKurator predictions created by users like you, through edits and/or validation of automatic transcription generated by mapKurator. Currently, this dataset is very small but it will grow with time. The index of annotations is updated daily, so new annotations may not be immediately searchable.
- “All Text on Maps”: includes all of the above.
Then, in the field on the right, write the word you want to search in any of the selected options for Text on Maps.
Please note: In the regular, non advanced search, the “Text on Maps” searches the “MapKurator output (post-processed)” data by default.
Please also note: once you refine the search in this way, it’s not possible to further organize the results. To complete searches with more complex sorting and filtering, please use the “Catalog Data” search without also including the “Text on Maps” option.
- You can refine the search to produce results only for maps that were published between 1700-1800 and where the sheet contains the raw mapKurator output for “Paris”. In the search box in the browser this is expressed as: pub_date=1700…1800 AND ocrText=”Paris” LIMIT:RUMSEY~8~1.
Pub_daterepresents the date range and
ocrTextrepresents the raw mapKurator output.
- A variation on this search limits by publication date, but searches the post-processed mapKurator transcriptions instead of the raw mapKurator output: pub_date=1500…1700 AND postOcrText=”France” LIMIT:RUMSEY~8~1
By default, you will be viewing the results of your Text on Maps search in “masonry view”, i.e., a collage of all the map text that matches your search.
If you hover the pointer over any result’s “brick”, you will see a small preview of the entire map and a yellow pin signaling the position of that particular annotation in relation to the map.
The masonry view is a quick way to compare the variety of ways that a word or words appears on maps from many cultures and centuries.
It’s also a handy way to visualize errors in the automatic text detection and recognition parts of the method creating this data. To learn more about this, you can read more here.
You can also select the “tile view”, to directly see all thumbnails of the maps that match your search. This view is more like the traditional view you see when searching via Catalog Data (e.g. the title).
In both the masonry or tile views, if you click on any of the map thumbnails, you will be taken to a larger view of the map. Here you can also see all the machine-generated bounding boxes around the labels and their transcriptions by clicking the lines icon in the toolbar (see annotating Text on Maps).
When you look at the machine-generated annotations (“MapKurator output” or “MapKurator output (post-processed)”), you may notice errors.
Maps, and the historical ones in particular, are a very challenging input data source for text detection and recognition (or, “text spotting”, as used by mapKurator), and the quality of the results will vary depending on the color(s) of the background, the fonts, the printing technique, the language, the conservation status, and so on.
We invite all the users of the David Rumsey Map Collection to team up with the machine and improve or confirm the annotations.
If you spot a mistake, please consider contributing a better transcription, and/or a more accurate bounding box.
You can fix a text label’s transcription so that it matches what appears on the sheet.
Please note: The goal is to exactly replicate what is on the map.
This includes what might be perceived as “errors” given changes to place names or other factors. Here are some guidelines to help you:
- If there is a typo on the map, or a place name has changed (based on your knowledge), or you want to write the place name in a different language (transliteration), do not make changes that do not reflect what is on the map.
- Similarly, do not expand abbreviations.
- Please include punctuation or spaces as relevant.
If you are interested in learning more about best practices in annotating text on maps, the annotation guidelines developed by Machines Reading Maps can be found here.
Editing is very easy!
- Click on the annotation you want to improve. You can view previous transcriptions by unfolding the “n more transcriptions” part of the box.
- Click on “Edit” to add a transcription that will be saved under your user name.
- The bounding polygon and the text field will now be editable and you can change the text transcription as needed.
- It is possible to cancel your changes or to save them by clicking on the relevant buttons.
- Afterwards, your transcription will be immediately visible (they will be searchable after 1 day).
When you select an annotation, several points along the polygon become active, and you’ll be allowed to move them around, changing the shape of the polygon. However, you are not currently able to draw new polygons around text.
If the polygon incorrectly surrounds a word, you can modify the polygon around that text.
Bounding polygons may be incorrect when:
- They do not include all the characters of one word
- Two polygons overlap/duplicate the transcription of a single word
- Fail to capture a text label at all
Annotations are stored and will appear to the public online immediately, however changes will be searchable only after a delay of 1 day. These changes are logged alongside the existing data: nothing is removed from the underlying data.
Your changes will be identified by your user name.
Once the text and the bounding polygon have been confirmed by 1 user, a green check will appear next to the transcription. This operates as a guide to future users so that they can focus elsewhere.