Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

Commit 655b729

Browse filesBrowse files
committed
Editorial fixes (typos, etc.)
1 parent 3150b92 commit 655b729
Copy full SHA for 655b729

File tree

1 file changed

+12
-14
lines changed
Filter options

1 file changed

+12
-14
lines changed

‎bp.html

Copy file name to clipboardExpand all lines: bp.html
+12-14
Original file line numberDiff line numberDiff line change
@@ -650,7 +650,7 @@ <h2>Introduction</h2>
650650

651651
<p>Depending on circumstances, sensitive information about individuals might include full name, home address, email address, national identification number, IP address, vehicle registration plate number, driver's license number, face, fingerprints, or handwriting, credit card numbers, digital identity, date of birth, birthplace, genetic information, telephone number, login name, screen name, nickname, health records etc. Although it is likely to be safe to share some of that information openly, and even more within a controlled environment, publishers should bear in mind that combining data from multiple sources may allow inadvertent identification of individuals.</p>
652652

653-
<p>A general Best Practice for publishing Data on the Web is to use standards. Different types of organizations specify standards that are specific to the publishing of datasets related to particular domains &amp; applications, involving communities of users interested in that data. These standards define a common way of communicating information among the users of these communities. For example, there are two standards that can be used to publish transport timetables: the General Transit Feed Specification [[GTFS]] and the Service Interface for Real Time Information [[SIRI]]. These specify, in a mixed way, standardized terms, standardized data formats and standardized data access. Another general Best Practice is to use Unicode for handling character and string data. Unicode improves multilingual text processing and makes easier software localization easier. The Best Practices set out in this document serve a general purpose of publishing and using Data on the Web and are domain &amp; application independent. They can be extended or complemented by other Best Practices documents or standards that cover more specialized contexts.</p>
653+
<p>A general Best Practice for publishing Data on the Web is to use standards. Different types of organizations specify standards that are specific to the publishing of datasets related to particular domains &amp; applications, involving communities of users interested in that data. These standards define a common way of communicating information among the users of these communities. For example, there are two standards that can be used to publish transport timetables: the General Transit Feed Specification [[GTFS]] and the Service Interface for Real Time Information [[SIRI]]. These specify, in a mixed way, standardized terms, standardized data formats and standardized data access. Another general Best Practice is to use Unicode for handling character and string data. Unicode improves multilingual text processing and makes software localization easier. The Best Practices set out in this document serve a general purpose of publishing and using Data on the Web and are domain &amp; application independent. They can be extended or complemented by other Best Practices documents or standards that cover more specialized contexts.</p>
654654
<!-- <p>Taking that into account, this document sets out a series of Best Practices that will help publishers and consumers face the new challenges and opportunities posed by data on the Web. They intend to serve a general purpose of publishing and using Data on the Web, but they may be specialized according to specific domains, such as Spatial Data on the Web Best Practices [[SDW-BP]].</p>-->
655655
<p>Best Practices cover different aspects related to data publishing and
656656
consumption, like data formats, data access, data identifiers and
@@ -1213,7 +1213,7 @@ <h4 class="subhead">Why</h4>
12131213
</section>
12141214
<section class="description">
12151215
<h4 class="subhead">Intended Outcome</h4>
1216-
<p>Humans will be able to understand data license information describing possible restrictions placed on the use of a given distribution and software agents to automatically detect the data license of a distribution.</p>
1216+
<p>Humans will be able to understand data license information describing possible restrictions placed on the use of a given distribution, and software agents will be able to automatically detect the data license of a distribution.</p>
12171217
</section>
12181218
<section class="how">
12191219
<h4 class="subhead">Possible Approach to Implementation</h4>
@@ -1253,7 +1253,7 @@ <h5 class="subhead">Human-readable</h5>
12531253
<section class="test">
12541254
<h4 class="subhead">How to Test</h4>
12551255
<p>Check if the metadata for the dataset itself includes the data license information in a human-readable format.</p>
1256-
<p>Check if a user agent can automatically detect /discover the data license of the dataset.</p>
1256+
<p>Check if a user agent can automatically detect or discover the data license of the dataset.</p>
12571257
</section>
12581258
<section class="ucr">
12591259
<h4 class="subhead">Evidence</h4>
@@ -2106,7 +2106,7 @@ <h4 class="subhead">Why</h4>
21062106
becomes useful when it has been processed and transformed into
21072107
information. Note that there is an important distinction between formats that can be read and edited by humans using a computer and formats that are machine-readable. The latter term implies that the data is readily extracted, transformed and processed by a computer. </p>
21082108
<p>Using non-standard data formats is costly and inefficient, and
2109-
the data may lose meaning as it is transformed. On the other hand,
2109+
the data may lose meaning as it is transformed. By contrast,
21102110
standardized data formats enable interoperability as well as
21112111
future uses, such as remixing or visualization, many of which
21122112
cannot be anticipated when the data is first published. It is also important to note that most machine-readable standardized formats are also locale-neutral.</p>
@@ -2117,7 +2117,7 @@ <h4 class="subhead">Intended Outcome</h4>
21172117
</section>
21182118
<section class="how">
21192119
<h4 class="subhead">Possible Approach to Implementation</h4>
2120-
<p>Make data available in a machine-readable standardized data format that is easily parseable including but not limited to CSV, XML, HDF5, JSON and RDF serialization syntaxes like RDF/XML, JSON-LD, Turtle.</p>
2120+
<p>Make data available in a machine-readable standardized data format that is easily parseable including but not limited to CSV, XML, HDF5, JSON and RDF serialization syntaxes like RDF/XML, JSON-LD, or Turtle.</p>
21212121
<aside class="example">
21222122

21232123
<p>John knows that tabular data is commonly used on the Web and he decides to use CSV as the data format for one of the distributions of the bus stops dataset. To facilitate data processing, he uses the <a href = https://www.w3.org/TR/2015/REC-tabular-data-model-20151217/> Model for Tabular Data and Metadata on the Web</a> for publishing the CSV distribution (<code>stops-2015-05-05.csv</code>). The example below presents a fragment of the CSV distribution which complies with the structural metadata defined in <a href="#StructuralMetadata">Example 4</a>.</p>
@@ -2360,7 +2360,7 @@ <h4 class="subhead">Possible Approach to Implementation</h4>
23602360
provide lists of codes, terminologies and Linked Data vocabularies that can be used by everyone.
23612361
A key point is to make sure the dataset, or its documentation, provides enough (human- and machine-readable) context
23622362
so that data consumers can retrieve and exploit the standardized meaning of the values. In the context of the Web, using unambiguous, Web-based identifiers (URIs) for standardized vocabulary resources
2363-
is an efficient way to do this, noting that the same URI may have multilingual labels attached for greater cross-border interoperability. The European Union's multilingual thesaurus, <a href="http://eurovoc.europa.eu/">Eurovoc</a> provides a prime example.</p>
2363+
is an efficient way to do this, noting that the same URI may have multilingual labels attached for greater cross-border interoperability. The European Union's multilingual thesaurus, <a href="http://eurovoc.europa.eu/">Eurovoc</a>, provides a prime example.</p>
23642364
<aside class="example"><ol>
23652365
<li>The DCAT vocabulary expresses metadata concerning datasets [[VOCAB-DCAT]] and
23662366
re-uses elements from several pre-existing vocabularies: Dublin Core, FOAF, SKOS and vCard.
@@ -2622,10 +2622,7 @@ <h4 class="subhead">Why</h4>
26222622
human-readable and machine-readable data, using RDFa for example.
26232623
However, as the Architecture of the Web [[WEBARCH]] and DCAT [[VOCAB-DCAT]] make clear,
26242624
a resource, such as a dataset, can have many representations. The same data might be available
2625-
as JSON, XML, RDF, CSV and HTML. These multiple representations can be made available via and API but should be made available
2626-
from <em>the same</em> URL using <a href="/DesignIssues/Conneg">content negotiation</a> to return the appropriate representation (what
2627-
DCAT calls a distribution). Specific URIs can be used to identify individual representations of the data directly, by-passing
2628-
content negotiation.</p>
2625+
as JSON, XML, RDF, CSV and HTML. These multiple representations can be made available via an <abbr title="Application Programming Interface">API</abbr>, but should be made available from <em>the same</em> URL using <a href="/DesignIssues/Conneg">content negotiation</a> to return the appropriate representation (what DCAT calls a distribution). Specific URIs can be used to identify individual representations of the data directly, by-passing content negotiation.</p>
26292626
</section>
26302627
<section class="outcome">
26312628
<h4 class="subhead">Intended Outcome</h4>
@@ -3149,7 +3146,7 @@ <h4 class="subhead">Possible Approach to Implementation</h4>
31493146
gtfs:longName "Airport - Bullfrog" ;
31503147
gtfs:agency &lt;http://data.mycity.example.com/transport-agency/DTA&gt; ;
31513148
gtfs:routeType ex:three ;
3152-
ex:usualVehicleType dbpedia:Roumaster ;
3149+
ex:usualVehicleType dbpedia:Routemaster ;
31533150
foaf:isPrimaryTopicOf ex:Airport_Bullfrog
31543151
.
31553152

@@ -3194,7 +3191,7 @@ <h4 class="subhead">Possible Approach to Implementation</h4>
31943191
gtfs:longName "Airport - Bullfrog" ;
31953192
gtfs:agency &lt;http://data.mycity.example.com/transport-agency/DTA&gt; ;
31963193
gtfs:routeType ex:three ;
3197-
ex:usualVehicleType dbpedia:Roumaster ;
3194+
ex:usualVehicleType dbpedia:Routemaster ;
31983195
foaf:isPrimaryTopicOf ex:Airport_Bullfrog
31993196
.
32003197

@@ -3736,7 +3733,8 @@ <h2>Glossary</h2>
37363733
</dd>
37373734
<dt><dfn id="locale_parameter">Locale</dfn></dt>
37383735
<dd>
3739-
<p>A collection of international preferences, generally related to a language and geographic region that a (certain category) of users require. These are usually identified by a shorthand identifier or token, such as a language tag, that is passed from the environment to various processes to get culturally affected behavior</p> <p>From <a href="https://www.w3.org/TR/ltli/#locale">Language Tags and Locale Identifiers for the World Wide Web</a> [[LTLI]].</p>
3736+
<p>A collection of international preferences, generally related to a language and geographic region that a (certain category) of users require. These are usually identified by a shorthand identifier or token, such as a language tag, that is passed from the environment to various processes to get culturally affected behavior.</p>
3737+
<p>From <a href="https://www.w3.org/TR/ltli/#locale">Language Tags and Locale Identifiers for the World Wide Web</a> [[LTLI]].</p>
37403738
</dd>
37413739
<dt><dfn id="machine_readable">Machine-readable data</dfn></dt>
37423740
<dd>
@@ -3775,7 +3773,7 @@ <h2>Glossary</h2>
37753773
</dd>
37763774
<dt><dfn id="vocabulary">Vocabulary</dfn></dt>
37773775
<dd>
3778-
<p>Vocabulary is A collection of "terms" for a particular purpose.
3776+
<p>A vocabulary is a collection of "terms" for a particular purpose.
37793777
Vocabularies can range from simple such as the widely used RDF
37803778
Schema [[RDF-SCHEMA]], FOAF [[FOAF]] and Dublin
37813779
Core [[DCTERMS]] to complex vocabularies with

0 commit comments

Comments
0 (0)
Morty Proxy This is a proxified and sanitized view of the page, visit original site.