<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Misspelled nemesis club &#187; standards</title>
	<atom:link href="http://moreati.org.uk/blog/category/standards/feed/" rel="self" type="application/rss+xml" />
	<link>http://moreati.org.uk/blog</link>
	<description>A blog about life, technology &#38; databases</description>
	<lastBuildDate>Sat, 10 Jul 2010 16:57:04 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>The Shapefile 2.0 manifesto</title>
		<link>http://moreati.org.uk/blog/2009/03/01/shapefile-20-manifesto/</link>
		<comments>http://moreati.org.uk/blog/2009/03/01/shapefile-20-manifesto/#comments</comments>
		<pubDate>Sun, 01 Mar 2009 13:13:44 +0000</pubDate>
		<dc:creator>Alex Willmer</dc:creator>
				<category><![CDATA[arcgis]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[rants]]></category>
		<category><![CDATA[standards]]></category>

		<guid isPermaLink="false">http://moreati.org.uk/blog/?p=54</guid>
		<description><![CDATA[Geographic Information Systems (GIS) are by their nature data driven. The data comes in a wide variety of raster and vector formats. Rasters hold raw, continuous data recorded striaght from the real world. An example is Satellite/aerial imagery, this is a commonly held in an open format with broad support, such as GeoTIFF or GeoJPEG.]]></description>
			<content:encoded><![CDATA[<p>Geographic Information Systems (GIS) are by their nature data driven. The data comes in a wide variety of  raster and vector formats. Rasters  hold raw, continuous data recorded striaght from the real world. An example is Satellite/aerial imagery, this is a commonly held in an open format with broad support, such as <a href="http://en.wikipedia.org/wiki/GeoTIFF">GeoTIFF</a> or <a href="http://en.wikipedia.org/wiki/JPEG">GeoJPEG</a>.</p>
<p><a href="http://en.wikipedia.org/wiki/GIS_file_formats#Vector_formats">Vector formats</a> hold refined, discrete data, which has been manually traced or otherwise derived other data sources. Examples include  building outlines, contours, road routes, pipe networks land land parcels and locations. Vector data is usually traced or derived, at great expense from raster data, to encode business information &#8211; as a result it&#8217;s usually highly valuable.</p>
<p>Unfortunately, there are many GIS  vector file formats,  and most are proprietary. They can only be used to their full in their native software. Three of the biggest are AutoCAD <a href="http://en.wikipedia.org/wiki/AutoCAD_DXF">DXF</a>, MapInfo <a href="http://en.wikipedia.org/wiki/MapInfo_TAB_format">TAB</a> and ArcGIS <a href="http://en.wikipedia.org/w/index.php?title=Personal_Geodatabase">Personal Geodatabase</a>. One vector format is unique &#8211;  both an open standard, and in wide use: Shapefile</p>
<p><a href="http://en.wikipedia.org/wiki/Shapefile">Shapefile</a> is publicly documented  in <a href="http://www.esri.com/library/whitepapers/pdfs/shapefile.pdf">ESRI Shapefile Technical Description</a> by <a href="http://www.esri.com">ESRI Inc.</a>, it&#8217;s creator. Any GIS software worth it&#8217;s salt can read and write to the format, so it&#8217;s become the least common denominator. It is <em>the</em> format for storing and exchanging vector data between teams, departments, businesses and government. In my opinion this makes Shapefile the best thing ever to happen to GIS, without it the GIS market would be a fraction of it&#8217;s current size.<span id="more-54"></span></p>
<p>Despite it&#8217;s popularity, Shapefile does have some serious limitations, mainly due to it&#8217;s DBF heritage:</p>
<ul>
<li>A shapefile is limited to <span style="text-decoration: line-through;">2</span> 4 GB or <span style="text-decoration: line-through;">65535</span> 4 billion/len(record) records.<br />
Where len(record) is greater of either the average feature length in bytes, or the length of a DBF record.</li>
<li>Records are limited to <span style="text-decoration: line-through;">1000</span> 65536 bytes or <span style="text-decoration: line-through;">32</span> between 257 &amp; 2038 fields.</li>
<li>Field names are limited to <span style="text-decoration: line-through;">8</span> 10 characters, character fields can hold up to 254 bytes.</li>
<li>Unicode is <span style="text-decoration: line-through;">not supported</span> not widely supported.</li>
</ul>
<p>Currently the only real alternative, for data exchange, is <a href="http://en.wikipedia.org/wiki/Geography_Markup_Language">Geography Markup Language (GML)</a> as defined by the <a href="http://www.opengeospatial.org/">Open Geospatial Consortium (OGC)</a>. An XML dialect, GML has none of the limitations of Shapefile this is why Ordnance Survey use GML to supply <a href="http://www.ordnancesurvey.co.uk/oswebsite/products/osmastermap/">MasterMap</a>, a highly detailed vector map of Great Britain. Support for GML in software is growing, but  it&#8217;s unsuitable as a storage format.</p>
<p>Viewing and editing vector data requires support for random access by attribute and by spatial extent. As an XML dialect GML cannot do this, to find one record, the entire file must be parsed from beginning to end. GML is almost always converted to another format, or loaded into a spatial database before it is used.</p>
<p>A spatial database is a database with data types and functions able to handle geospatial data. For the major databases there is <a href="http://www.oracle.com/technology/products/spatial/index.html">Oracle Spatial</a>, <a href="http://postgis.refractions.net/">PostgreSQL PostGIS</a>, <a href="http://www.microsoft.com/sqlserver/2008/en/us/spatial-data.aspx">SQL Server Spatial</a>, <a href="http://dev.mysql.com/doc/refman/5.1/en/spatial-extensions.html">MySQL Spatial</a> and <a href="http://www-01.ibm.com/software/data/spatial/">DB2 Spatial Extender</a>. All are based on <a href="http://en.wikipedia.org/wiki/Simple_Features">Simple Features for SQL</a> an open standard, meaning spatial data can be queried and updated with SQL like any other data type.</p>
<p>I believe that a portable, standalone spatial database, would make a very good successor to Shapefile.  Such a format would drive the GIS market forward, increasing usage of GIS by making it easier to share edit, publish and share GIS data. A portable spatial database would negate the need for the import, view, edit, export cycle that GML imposes.</p>
<p>At the moment I see 3 contenders for the crown:</p>
<ol>
<li><a href="http://www.esri.com/software/arcgis/geodatabase/about/file-gdbs.html">File Geodatabase</a> is a format from ESRI, it is natively supported by ArcGIS. ESRI proclaim it &#8220;Allow[s] users to easily exchange geodatabases.&#8221; That is true only if both users are running ESRI&#8217;s ArcGIS software. File Geodatabase is a proprietary format, despite promises by ESRI when it was launched.</li>
<li><a href="http://fdo.osgeo.org/fdosdf/index.html">Spatial Data Format</a> (SDF) is a format from Autodesk, it is native support . Support is included as part of their Feature Data Objects library, released as Open Source. SDF is based on the popular SQLite embedded database engine.</li>
<li><a href="http://www.gaia-gis.it/spatialite/">Spatialite</a> is another format based on SQLite, by an Alessandro Furieri. Spatialite is in it&#8217;s infancy still, it&#8217;s first release was 11 months ago.</li>
</ol>
<p>Unfortunately none of these looks like it will become a clear winner any time soon. Each is supported by only one application currently. If ESRI releases the specification for File Geodatabase, I expect it will quickly gain widespread support due to their position as market leader. As open source applications such as <a href="http://lists.osgeo.org/pipermail/qgis-developer/2009-January/005791.html">QGIS  gain Spatialite support</a>, it could slowly achieve dominance in a grass roots fashion. SDF seems to be going nowhere.</p>
<p>So ESRI, please publish the details of File Geodatabase. At it&#8217;s launch, during the 2006 ESRI User Conference, you promised that File Geodatabase would be an interoperable format. You promised to release a software library, so we  could read and write them without ArcGIS. Neither has happened. So File Geodatabase is just another closed format, another pretender to the throne that&#8217;s achieved only 1% of it&#8217;s true potential.</p>
<p>Publish File Geodatabase, or we&#8217;ll take the Shapefile crown by force.</p>
<p>Update 27 Mar 2009: Corrected Shapefile limits, based on <a href="http://www.clicketyclick.dk/databases/xbase/format/dbf.html">Xbase file structure</a> rather than <a href="http://www.clicketyclick.dk/databases/xbase/format/dbase_spec.html">dBASE software specifications</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://moreati.org.uk/blog/2009/03/01/shapefile-20-manifesto/feed/</wfw:commentRss>
		<slash:comments>16</slash:comments>
		</item>
		<item>
		<title>Free/Open Source Government: part 1</title>
		<link>http://moreati.org.uk/blog/2008/04/08/freeopen-source-government-part-1/</link>
		<comments>http://moreati.org.uk/blog/2008/04/08/freeopen-source-government-part-1/#comments</comments>
		<pubDate>Tue, 08 Apr 2008 23:09:51 +0000</pubDate>
		<dc:creator>Alex Willmer</dc:creator>
				<category><![CDATA[open source]]></category>
		<category><![CDATA[politics]]></category>
		<category><![CDATA[standards]]></category>

		<guid isPermaLink="false">http://www.moreati.org.uk/blog/2008/04/08/freeopen-source-government-part-1/</guid>
		<description><![CDATA[Several events for me  in the last fortnight converged almost perfectly on a common theme On Thursday 28 March The Register reported from an anonymous source, that the British Standards Institute (BSI) would reverse their vote on the proposed DIS 29500 standard from &#8216;No &#8211; with comments&#8217; to Yes. In response John Pugh MP, Liberal]]></description>
			<content:encoded><![CDATA[<p>Several events for me  in the last fortnight converged almost perfectly on a common theme</p>
<ol>
<li>On Thursday 28 March The Register reported from an anonymous source, that the <a href="http://www.channelregister.co.uk/2008/03/26/bsi_vote_yes_ooxml/">British Standards Institute (BSI) would reverse their vote</a> on the proposed DIS 29500 standard from &#8216;No &#8211; with comments&#8217; to Yes. In response John Pugh MP, Liberal Democrat member wrote <a href="http://www.computerworlduk.com/community/blogs/index.cfm?entryid=654">a letter to the BSI Director urging BSI not to vote yes</a>.</li>
<li>On Tuesday 1 April Pieter Hintjens, former FFII president gave a talk on <a href="http://spring2008.ukuug.org/talk_abstracts.html#51">&#8216;Software Patents and Open Standards&#8217;</a> at the UKUUG Spring 08 conference.</li>
<li>On Wednesday ISO announced that <a href="http://www.iso.org/iso/pressrelease.htm?refid=Ref1123">Microsoft OOXML/Ecma 376 is to be approved as DIS 29500</a>.</li>
<li>On Thursday David Cameron MP, Conservative leader gave a speech on &#8217;<a href="http://www.nesta.org.uk/informing/webcasts/david_cameron/index.aspx">Innovation and its role in public policy</a>&#8216; to NEST. He said a Tory government would open UK government data and &#8220;We also want to see how open source methods can help overcome the massive problems in government IT programs.”</li>
<li>On Thursday evening <a href="http://www.bbc.co.uk/radio4/science/thematerialworld.shtml">Material World</a> broadcast <a href="http://www.bbc.co.uk/radio4/science/thematerialworld_20080403.shtml">&#8216;Redefining the Kilogram&#8217;</a> on efforts towards a better international standard of mass and weight.</li>
</ol>
<p>The theme is how the Free/Open Source software movement might aid the political establishment.<span id="more-21"></span></p>
<h3>Civil Service</h3>
<p>I&#8217;ve worked as a civil servant for nearly 5 years; first as a system adminstrator, then a DBA. I&#8217;ve seen and provided input to the lower levels of IT decision making &amp; procurement within an executive agency. From afar I&#8217;ve followed media coverage of projects such as the NHS National Program for IT (NPfIT) and the former Child Support Agency (CSA) payments system.</p>
<p>In that time I&#8217;ve learned that <a href="http://en.wikipedia.org/wiki/Yes_Minister">Yes Minister</a> is comic genius  &amp; still highly applicable, to a civil service that is very conservative (with a small c) about novel ideas. Large government IT projects typically deliver late, over spend or under perform. Microsoft and other large providers of closed source software are so deeply entrenched in central government, that by default they&#8217;re seen as the only choice, if any choice is recognised</p>
<h3>Open Source</h3>
<p>The civil service has made moves toward F/OSS, in 2004 the <a href="http://www.ogc.gov.uk/">Office of Government Commerce (OGC)</a> released a <a href="http://www.govtalk.gov.uk/policydocs/policydocs_document.asp?docnum=905">policy document</a> that stated:</p>
<blockquote><p>&#8220;UK Government will consider OSS solutions alongside proprietary ones in IT procurements. Contracts will be awarded on a value for money basis.&#8221;</p></blockquote>
<p>I&#8217;ve never seen a case where this has been put into practice.</p>
<p>With the recent conservative announcement, all parties have made noises to some degree about use of F/OSS. It would appear it is entering the consciousness of the political establishment. However, more pertinant than the issue of Free/Open Source software is the issue of how much choice the civil service &amp; government has in their software purchase. Currently the choice is close to zero, because every team, agency, MP and department have enourmous silos of Microsoft Office documents.</p>
<p>Word, Excel, Powerpoint &amp; Outlook are the defacto industry standards. They&#8217;re proprietary, meaning that a single company controls them. Until this changes we are all at Microsoft&#8217;s mercy.</p>
<h3>Open Standards</h3>
<p>The <a href="http://www.iso.org/iso/home.htm">International Organisation for Standards (ISO)</a> made a delayed <a href="http://www.iso.org/iso/pressrelease.htm?refid=Ref1123">announcement</a> on Wednesday that baring any appeals, Microsoft Office OpenXML (OOXML) will become DIS 29500. It will join the existing <a href="http://opendocument.xml.org/">OpenDocument Format (ODF)/DIS 26300</a> as an international standard for document storage and interchange.</p>
<p>MS OOXML passed amid widespread process &amp; voting irregularities amongst the national standards bodies. Accusations of improper behaviour have been made against against ISO, Microsoft &amp; ODF supporters. As part of the Microsoft anti trust investigation, the EU has asked questions of Microsoft and the national bodies regarding voting on OOXML.</p>
<p>This relates to the kilogram, currently defined by a lump of metal in Paris, because that most basic standard is being improved in the open following rigorous scientific debate &amp; evaluation. Listen to last Thursday&#8217;s material world of the low down, but I&#8217;m confident the new kilogram standard will be trusted by all.</p>
<p>Like the kilogram, the standard for our office documents is central to our society. Billions in trade and public expenditure are dependant on the software we choose and the standards to which it conforms. If we can&#8217;t that trust the standard is sound then we can&#8217;t have the freedom to choose our software. Our software will continue to control us, instead of putting us in control.</p>
]]></content:encoded>
			<wfw:commentRss>http://moreati.org.uk/blog/2008/04/08/freeopen-source-government-part-1/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
