User:David Mason/P2PU week 1 assignment: Difference between revisions

From canbudget Wiki
Jump to navigation Jump to search
No edit summary
No edit summary
 
(25 intermediate revisions by one other user not shown)
Line 4: Line 4:
</div>
</div>


For my [http://p2pu.org/ Peer to Peer University]'s [http://p2pu.org/general/open-journalism-open-web Open Journalism & the Open Web] course [http://p2pu.org/node/5644/document/9468 week one, Assignment one], I decided to investigate the numbers behind the G20/G8 2010 combined summits in Canada. These summits, particularly the G20, were controversial because of their [[2010/G20 and G8 Budget/Summit costs were unreasonable|high costs]], [[2010/G20 and G8 Budget/Budget numbers are transparently being released|lack of transparency]], [[2010/G20/Summit was disruptive|disruptive impact on the city]], and [[2010/G20/Handling by authorities was a problem|heavy-handed and potentially manipulative behaviour by government and police authorities]].
For the [http://p2pu.org/ Peer to Peer University]'s [http://p2pu.org/general/open-journalism-open-web Open Journalism & the Open Web] course [http://p2pu.org/node/5644/document/9468 week one, Assignment one], I decided to investigate the numbers behind the G20/G8 2010 combined summits in Canada. These summits, particularly the G20, were controversial because of their [[2010/G20 and G8 Budget/Summit costs were unreasonable|high costs]], [[2010/G20 and G8 Budget/Budget numbers are transparently being released|lack of transparency]], [[2010/G20/Summit was disruptive|disruptive impact on the city]], and [[2010/G20/Handling by authorities was a problem|heavy-handed and potentially manipulative behaviour by government and police authorities]].


In this case, after freedom of information requests, the government released part of the budget, in fax form. The fax became available online, and I posted a message to the [http://canbudget.zooid.org/frame/?l=http%3A//groups.google.com/group/visiblegovernment-discuss/browse_thread/thread/54a5551a854e13e5 Visible Government mailing list] and [http://www.facebook.com/nostriluu?v=wall&story_fbid=156947327666270 Facebook], asking for people interested in transcribing this data into reusable spreadsheet data, and suggesting the creation of a semantic wiki to make the data reusable and examinable through addition of [[#Structured features|structure]] and [[#Discourse features|discourse features]], explained below.
In this case, after [[2010/G20 and G8 Budget/Budget numbers are transparently being released|access to information requests]] initiated by [[Dan McTeague, MP]], the government released part of the budget, in fax form. The fax became available online, and a message was posted to the [http://canbudget.zooid.org/frame/?l=http%3A//groups.google.com/group/visiblegovernment-discuss/browse_thread/thread/54a5551a854e13e5 Visible Government mailing list] and [http://www.facebook.com/nostriluu?v=wall&story_fbid=156947327666270 Facebook], asking for people interested in transcribing this data into reusable spreadsheet data, and suggesting the creation of a semantic wiki to make the data reusable and examinable through addition of [[#Structured features|structure]] and [[#Discourse features|discourse features]], explained below.


A number of individuals helped [https://spreadsheets.google.com/ccc?key=tZVtPjZYfnOcWglkGpGMTgQ&authkey=CMia450N#gid=1 transcribe the data], which was imported, and discourse features added, along with a number of examples.
A number of individuals helped [https://spreadsheets.google.com/ccc?key=tZVtPjZYfnOcWglkGpGMTgQ&authkey=CMia450N#gid=1 transcribe the data], which was imported, and discourse features added, along with a number of examples.


The result is highly re-usable, organized information. Because the software is based on the Free Software [http://www.mediawiki.org Mediawiki], the same software used by Wikipedia, it is free to use, can store many entries, and a culture of access is included. All content is malleable and subjective, so having the insights of Wikipedia ([http://en.wikipedia.org/wiki/Wikipedia:Neutral_point_of_view neutral point of view], managing content) and the [http://www.semanticweb.org Semantic Web] (anyone can say anything about anything) helps work through issues.
The result is highly re-usable, organized information. Because the software is based on the Free Software [http://www.mediawiki.org Mediawiki], the same software used by Wikipedia, it is free to use, can store many entries, and a culture of access is included. All content is malleable and subjective, so having the insights of Wikipedia ([http://en.wikipedia.org/wiki/Wikipedia:Neutral_point_of_view neutral point of view], managing content as a group) and the [http://www.semanticweb.org Semantic Web] (anyone can say anything about anything) helps work through issues.


All site editing is done via the Web, so anyone can view any page's contents (via the edit or view source tab), [[Special:RecentChanges|site editing history]], page edit history (via the history tab), and [[Special:Version|components of the site]], as well as progressively learn how to use and contribute to the site (knowledge which will transfer to similar sites, and which I'd assert is digital literacy past filling in forms). Because the site uses [http://creativecommons.org/licenses/by-sa/3.0/ cc-by-sa] terms of use, anyone can take the content and use it for their own purposes, as long as they attribute the site and make modified works available under the same terms. All the [[:Category:Site components|site components]] can be exported and re-used easily on any site using [[Special:Export]].
All site editing is done via the Web, so anyone can view any page's contents (via the edit or view source tab), [[Special:RecentChanges|site editing history]], page edit history (via the history tab), and [[Special:Version|components of the site]], as well as progressively learn how to use and contribute to the site (knowledge which will transfer to similar sites, and which I'd assert is digital literacy past filling in forms). Because the site uses [http://creativecommons.org/licenses/by-sa/3.0/ cc-by-sa] terms of use, anyone can take the content and use it for their own purposes, as long as they attribute the site and make modified works available under the same terms. All the [[:Category:Site components|site components]] can be exported and re-used easily on any site using [[Special:Export]].
Line 16: Line 16:
== Structured features ==
== Structured features ==


While [http://www.semantic-mediawiki.org Semantic Mediawiki] may seem to focus on the Semantic Web, in this case it is used to make the site easier to use, while making the content re-usable across the site. Therefore queries can be '''dynamically''' used in different ways;
While [http://www.semantic-mediawiki.org Semantic Mediawiki] may seem to focus on the Semantic Web, in this case it is used to make the site easier to use, while making the content re-usable across the site. Therefore queries can be '''dynamically''' used in different ways; here is a queried list of current Debatable items.
 
{{ #ask: [[Position::Debatable]]
{{ #ask: [[Position::Debatable]]
|?Description
|?Description
Line 26: Line 25:
|link=none
|link=none
}}
}}
----
<div style="float: right; width: 48%">
{{ #ask: [[GPS location::+]]
|?Description
|?GPS location
|mainlabel=-
| format=map
}}
</div>


{{ #ask: [[Canadian dollar cost 2010::>2000000]]
{{ #ask: [[Canadian dollar cost 2010::>2000000]]
|?Canadian dollar cost 2010
|?Description
|?Description
|?Canadian dollar cost 2010
|width=50%
|sort=Canadian dollar cost 2010
|sort=Canadian dollar cost 2010
|mainlabel=-
|order=desc
|width=100%
|format=jqplotseries
|format=jqplotbar
|limit=12
|charttitle=Cost items over $2M
|link=all
|bardirection=horizontal
|datalabels=value
|charttype=donut
|group=property
|chartlegend=nw
}}
}}
----


{{ #ask: [[Start date::+]]  
{{ #ask: [[Start date::+]]  
Line 48: Line 63:
}}
}}


See the [[procurement exhibit]] for a particular tool to narrow down cost items by procurement process, and [[Graph of procurement process and topic|a graph of procurement processes and topics]].
----
 
A compressed overview of current discourse ([[Discourse graph overview|full size]]):
 
{{ #ask: [[Debatable::+]] OR [[Supports::+]] OR [[Refutes::+]] OR [[Mixed::+]]
|?Debatable
|?Supports
|?Refutes
|?Mixed
|format=graph
|graphlabel=Yes
|graphsize=10,10
|graphlink=Yes
|graphcolor=Yes
}}
 
 
See the [[procurement exhibit]] for a particular tool to narrow down cost items by procurement process, and [[Graph of procurement process and topic|a graph of procurement processes and topics]]. [[Special:Ask|Ad-hoc queries]] are also supported.


Since the underlying facilities are in an integrated system, as different visualizations emerge they can also be supported. Ideally the set of visualizations will be consistent, so those accessing the site can learn how to use them as interesting looking and useful tools.
Since the underlying facilities are in an integrated system, as different visualizations emerge they can also be supported. Ideally the set of visualizations will be consistent, so those accessing the site can learn how to use them as interesting looking and useful tools.
Line 54: Line 86:
=== Types of content ===
=== Types of content ===


The types of content stored in the site are [[Template:Subject|subject]] (a very general collection of cost items and discourse elements), [[Template:Responsible person|responsible person]], [[Template:Responsible unit|responsible unit]], and [[Template:Supplier|supplier]]. An integrated issue system will be [[Issue system|explained separately]]. [[:Category:Procurement Process|procurement process]] is also broken out, providing the opportunity to precisely describe these processes using [http://km.aifb.kit.edu/projects/process/index.php/Review_process process graphs].
The types of content stored in the site are [[Template:Subject|subject]] (a very general collection of cost items and discourse elements), [[Template:Responsible person|responsible person]], [[Template:Responsible unit|responsible unit]], and [[Template:Supplier|supplier]]. An integrated issue system will be [[Issue system|explained separately]]. [[:Category:Procurement Process|Procurement process]] is also separate content, providing the opportunity to precisely describe these processes using [http://km.aifb.kit.edu/projects/process/index.php/Review_process process graphs].


== Discourse features ==
== Discourse features ==
Line 68: Line 100:
Since I'm developing this type of site for several projects, this week I'm investigating a "pipeline" that includes annotation using the [http://gate.ac.uk GATE] software. This could allow importing many Web based pages. While computer software will never have the real understanding of a human, entities such as people, places and dates can be recognized, and  [http://gate.ac.uk/sentiment/ sentiments] can be read with limited accuracy, therefore ideally people will correct this information to make it more than statistically useful.
Since I'm developing this type of site for several projects, this week I'm investigating a "pipeline" that includes annotation using the [http://gate.ac.uk GATE] software. This could allow importing many Web based pages. While computer software will never have the real understanding of a human, entities such as people, places and dates can be recognized, and  [http://gate.ac.uk/sentiment/ sentiments] can be read with limited accuracy, therefore ideally people will correct this information to make it more than statistically useful.


The process behind developing this site was interesting to me, and has invoked interest that may spark an ongoing project that can expand to include the structure of governments and connections between government figures and business. The semantic facilities may become more important,  allowing the site to interact in a reusable web of information. Hopefully this type of participatory site will become more common, allowing people to constantly learn, organize and re-use information past typical unstructured content today; computer-developed content that is often no better than a typewritten page or fax.
The process behind developing this site was interesting to me, and has invoked interest that may spark an ongoing project that can expand to include the structure of governments and connections between government figures and business. The semantic facilities may become more important,  allowing the site to interact in a reusable web of information. Hopefully this type of participatory site will become more common, allowing people to constantly learn, organize and re-use information past typical unstructured content today; computer-developed content that is often no better than a typewritten page or [http://canbudget.zooid.org/mediawiki/images/1/1b/38025479-Breakdown-of-G20-security-costs.pdf fax].
 
Obviously, while part of a movement, this will take time, and I hope to support and contribute to different groups using these kinds of systems.


If you're interested in participating, please add your name [[Site developers|here]].
If you're interested in participating, please add your name [[Site developers|here]].
== Discourse ==
{{Subject
|Description=Overview of discourse system for journalism
|Project=Integrated semantic discourse systems
|Topic=Integrated semantic discourse systems
|Start date=Oct 3, 2010
|Source=David Mason
|Position=Supports
}}
[[Category:Open Journalism & the Open Web]]

Latest revision as of 13:23, 15 April 2024

For the Peer to Peer University's Open Journalism & the Open Web course week one, Assignment one, I decided to investigate the numbers behind the G20/G8 2010 combined summits in Canada. These summits, particularly the G20, were controversial because of their high costs, lack of transparency, disruptive impact on the city, and heavy-handed and potentially manipulative behaviour by government and police authorities.

In this case, after access to information requests initiated by Dan McTeague, MP, the government released part of the budget, in fax form. The fax became available online, and a message was posted to the Visible Government mailing list and Facebook, asking for people interested in transcribing this data into reusable spreadsheet data, and suggesting the creation of a semantic wiki to make the data reusable and examinable through addition of structure and discourse features, explained below.

A number of individuals helped transcribe the data, which was imported, and discourse features added, along with a number of examples.

The result is highly re-usable, organized information. Because the software is based on the Free Software Mediawiki, the same software used by Wikipedia, it is free to use, can store many entries, and a culture of access is included. All content is malleable and subjective, so having the insights of Wikipedia (neutral point of view, managing content as a group) and the Semantic Web (anyone can say anything about anything) helps work through issues.

All site editing is done via the Web, so anyone can view any page's contents (via the edit or view source tab), site editing history, page edit history (via the history tab), and components of the site, as well as progressively learn how to use and contribute to the site (knowledge which will transfer to similar sites, and which I'd assert is digital literacy past filling in forms). Because the site uses cc-by-sa terms of use, anyone can take the content and use it for their own purposes, as long as they attribute the site and make modified works available under the same terms. All the site components can be exported and re-used easily on any site using Special:Export.

Structured features

While Semantic Mediawiki may seem to focus on the Semantic Web, in this case it is used to make the site easier to use, while making the content re-usable across the site. Therefore queries can be dynamically used in different ways; here is a queried list of current Debatable items.


Loading map...


A compressed overview of current discourse (full size):


See the procurement exhibit for a particular tool to narrow down cost items by procurement process, and a graph of procurement processes and topics. Ad-hoc queries are also supported.

Since the underlying facilities are in an integrated system, as different visualizations emerge they can also be supported. Ideally the set of visualizations will be consistent, so those accessing the site can learn how to use them as interesting looking and useful tools.

Types of content

The types of content stored in the site are subject (a very general collection of cost items and discourse elements), responsible person, responsible unit, and supplier. An integrated issue system will be explained separately. Procurement process is also separate content, providing the opportunity to precisely describe these processes using process graphs.

Discourse features

This site includes discourse features, which allow assigning a position to subjects. For example, an initial cost item can have another item added with a Debatable position. That item can then have items added with Supports or Refutes positions (you may not see a lot of Position content today, it can be easily added using forms, a good example of a similar populated discourse system is the inspiring Discourse DB).

Use in journalism

This type of system could be used by journalists as an in-house or public database. As related stories are developed, content will be constantly added. Interactive tools can be used for investigation and final display. As part of the Web, journalists can use their authority to work with authorities and "members of the public" for a truly inclusive culture, creating stories out of data and creating comprehensive world views based on facts and positions.

Going forward

Since I'm developing this type of site for several projects, this week I'm investigating a "pipeline" that includes annotation using the GATE software. This could allow importing many Web based pages. While computer software will never have the real understanding of a human, entities such as people, places and dates can be recognized, and sentiments can be read with limited accuracy, therefore ideally people will correct this information to make it more than statistically useful.

The process behind developing this site was interesting to me, and has invoked interest that may spark an ongoing project that can expand to include the structure of governments and connections between government figures and business. The semantic facilities may become more important, allowing the site to interact in a reusable web of information. Hopefully this type of participatory site will become more common, allowing people to constantly learn, organize and re-use information past typical unstructured content today; computer-developed content that is often no better than a typewritten page or fax.

Obviously, while part of a movement, this will take time, and I hope to support and contribute to different groups using these kinds of systems.

If you're interested in participating, please add your name here.

Discourse

Overview of discourse system for journalism

Source David Mason
Dates Oct 3, 2010
Project Integrated semantic discourse systems
Topic Integrated semantic discourse systems
Position Supports



Add subject »



Debates