What should I take note of with regard to character encoding?
RDF data is provided in UTF-8 decomposed character encoding, which is also known as Normalization Form Decomposed (NFD). Here diacritics are for example treated as separate characters (Unicode segment “Combining Diacritics”), and this may have to be taken into account when processing data (e.g. indexing). Depending on the application context, it may be advisable to convert the data into the normal form NFC before processing.
Short-URL:
https://www.dnb.de/metadataservice
Contact:
metadatendienste@dnb.de