My site will have genealogical records. Old birh, death, marriage records transcribed from various sources. My goal is to have them display in various ways, the most important being a sortable table and I need advanced search capabilities as in the ability to search one or more fields/columns. The records are fairly small and simple. Surname, Given name, Date, Place etc. 10-20 fields. They are getting transcribed into excel so the universal output from there would be csv. There's a new standard being adopted for genealogical data and it's xml so at some point I'll be converting csv to xml. The standard is for a complete family tree but individuals/person is a part of that. Here's a sample of what a record would look like.
<person id="BBB-BBBB">
<source description="#EEE-EEEE"/>
<gender type="http://gedcomx.org/Male"/>
<name id="789">
<nameForm>
<fullText>George Washington</fullText>
<part type="http://gedcomx.org/Given" value="George"/>
<part type="http://gedcomx.org/Surname" value="Washington"/>
</nameForm>
</name>
<fact type="http://gedcomx.org/Birth" id="123">
<date>
<original>February 22, 1732</original>
<formal>+1732-02-22</formal>
</date>
<place description="#888">
<original>pope's creek, westmoreland, virginia, united states</original>
</place>
</fact>
<fact type="http://gedcomx.org/Death" id="456">
<date>
<original>December 14, 1799</original>
<formal>+1799-12-14T22:00:00</formal>
</date>
<place description="#999">
<original>mount vernon, fairfax county, virginia, united states</original>
</place>
</fact>
</person>
So my question at last is: Would it be best to have that xml go into a database(MySQL) or could I have the xml docs on the server and work with them as they are? The issues are display in sortable tables on the site and be searchable by field. There will be tens or hundreds of thousands of these eventually. They can/would be categorized either by State or County. There's 3143 counties in the U.S. I would think it would be nice to leave them as xml since it is or will be the native format. It seems like advsearch, being based on solr would be capable of dealing with xml. I'm not a coder but I'm learning and I do realize it will take quite a bit no matter how I go. Serializing, indexing, schemas etc. Just looking for some ideas as to what direction to take so I don't spend months heading down the wrong path.
Thanks, John
PS, the above record is George Washington of course and there would be one for Martha and then one for relation which links them together and more relations for parent/child. It is possible that someday I would be using the relation records and at that point the whole thing becomes relational and that's what the entire spec/standard is for.