<?xml version="1.0" encoding="iso-8859-1"?>

<rdf:RDF
  xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
  xmlns:dc="http://purl.org/dc/elements/1.1/"
  xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
  xmlns:admin="http://webns.net/mvcb/"
  xmlns:cc="http://web.resource.org/cc/"
  xmlns="http://purl.org/rss/1.0/">

<channel rdf:about="http://cs.usfca.edu/~rstevens/archiveproject/">
<title>Chronica Project: Internet Archive Temporal Search</title>
<link>http://cs.usfca.edu/~rstevens/archiveproject/</link>
<description>Updates about Chronica development.</description>
<dc:language>en-us</dc:language>
<dc:creator></dc:creator>
<dc:date>2004-12-03T09:47:13-08:00</dc:date>
<admin:generatorAgent rdf:resource="http://www.movabletype.org/?v=2.64" />


<items>
<rdf:Seq><rdf:li rdf:resource="http://cs.usfca.edu/~rstevens/archiveproject/archives/000532.html" />
<rdf:li rdf:resource="http://cs.usfca.edu/~rstevens/archiveproject/archives/000529.html" />
<rdf:li rdf:resource="http://cs.usfca.edu/~rstevens/archiveproject/archives/000521.html" />
<rdf:li rdf:resource="http://cs.usfca.edu/~rstevens/archiveproject/archives/000505.html" />
<rdf:li rdf:resource="http://cs.usfca.edu/~rstevens/archiveproject/archives/000502.html" />
<rdf:li rdf:resource="http://cs.usfca.edu/~rstevens/archiveproject/archives/000501.html" />
<rdf:li rdf:resource="http://cs.usfca.edu/~rstevens/archiveproject/archives/000500.html" />
<rdf:li rdf:resource="http://cs.usfca.edu/~rstevens/archiveproject/archives/000499.html" />
<rdf:li rdf:resource="http://cs.usfca.edu/~rstevens/archiveproject/archives/000498.html" />
<rdf:li rdf:resource="http://cs.usfca.edu/~rstevens/archiveproject/archives/000492.html" />
<rdf:li rdf:resource="http://cs.usfca.edu/~rstevens/archiveproject/archives/000490.html" />
<rdf:li rdf:resource="http://cs.usfca.edu/~rstevens/archiveproject/archives/000487.html" />
<rdf:li rdf:resource="http://cs.usfca.edu/~rstevens/archiveproject/archives/000486.html" />
<rdf:li rdf:resource="http://cs.usfca.edu/~rstevens/archiveproject/archives/000485.html" />
<rdf:li rdf:resource="http://cs.usfca.edu/~rstevens/archiveproject/archives/000480.html" />
</rdf:Seq>
</items>

</channel>

<item rdf:about="http://cs.usfca.edu/~rstevens/archiveproject/archives/000532.html">
<title>Release 4.</title>
<link>http://cs.usfca.edu/~rstevens/archiveproject/archives/000532.html</link>
<description>We have finalized the contents and status of Chronica for the final release....</description>
<dc:subject></dc:subject>
<dc:creator>rstevens</dc:creator>
<dc:date>2004-12-03T09:47:13-08:00</dc:date>
</item>
<item rdf:about="http://cs.usfca.edu/~rstevens/archiveproject/archives/000529.html">
<title>And the latest is.... a GUI.</title>
<link>http://cs.usfca.edu/~rstevens/archiveproject/archives/000529.html</link>
<description>For those dependent on their mouse, we now have a GUI for setting the index parameters and beginning the indexing run....</description>
<dc:subject></dc:subject>
<dc:creator>rstevens</dc:creator>
<dc:date>2004-11-28T17:33:19-08:00</dc:date>
</item>
<item rdf:about="http://cs.usfca.edu/~rstevens/archiveproject/archives/000521.html">
<title>Mini wayback is a go</title>
<link>http://cs.usfca.edu/~rstevens/archiveproject/archives/000521.html</link>
<description>After some annoying debug runs, I&apos;ve gotten a portable version of the wayback machine going, for locally stored arc files. Yay. For now I&apos;m borrowing archive&apos;s javascript, which, while covering most stuff, misses a lot of other off-site references...I may...</description>
<dc:subject></dc:subject>
<dc:creator>defendio</dc:creator>
<dc:date>2004-11-05T23:48:54-08:00</dc:date>
</item>
<item rdf:about="http://cs.usfca.edu/~rstevens/archiveproject/archives/000505.html">
<title>Documentation, first go.</title>
<link>http://cs.usfca.edu/~rstevens/archiveproject/archives/000505.html</link>
<description>Here is the API documentation for Chronica so far; some comments still need to be updated. Will be updated....</description>
<dc:subject></dc:subject>
<dc:creator>rstevens</dc:creator>
<dc:date>2004-10-20T16:51:08-08:00</dc:date>
</item>
<item rdf:about="http://cs.usfca.edu/~rstevens/archiveproject/archives/000502.html">
<title>Behold..</title>
<link>http://cs.usfca.edu/~rstevens/archiveproject/archives/000502.html</link>
<description>Time-plot search: The time-plot search is working. I set the images up using this GPL chart package I found on SourceForge, and it&apos;ll graph any query or combination of queries that you give it (see the &quot;need help?&quot; button...</description>
<dc:subject></dc:subject>
<dc:creator>defendio</dc:creator>
<dc:date>2004-10-19T03:57:29-08:00</dc:date>
</item>
<item rdf:about="http://cs.usfca.edu/~rstevens/archiveproject/archives/000501.html">
<title>Chronica index update</title>
<link>http://cs.usfca.edu/~rstevens/archiveproject/archives/000501.html</link>
<description>The current searchable index contains 2,194,824 documents. That&apos;s the combination of the UK docs from the IA, as well as the local crawls done here at USF. The total indexing took 57 hours to do 1,473 arc files totaling 104 GB...</description>
<dc:subject></dc:subject>
<dc:creator>cfrasche</dc:creator>
<dc:date>2004-10-18T18:05:26-08:00</dc:date>
</item>
<item rdf:about="http://cs.usfca.edu/~rstevens/archiveproject/archives/000500.html">
<title>Added XML config reader</title>
<link>http://cs.usfca.edu/~rstevens/archiveproject/archives/000500.html</link>
<description>I added the XML config reader to the repository and added support to ARCRecordReader......</description>
<dc:subject></dc:subject>
<dc:creator>rstevens</dc:creator>
<dc:date>2004-10-16T00:45:03-08:00</dc:date>
</item>
<item rdf:about="http://cs.usfca.edu/~rstevens/archiveproject/archives/000499.html">
<title>Now (almost) a real search engine</title>
<link>http://cs.usfca.edu/~rstevens/archiveproject/archives/000499.html</link>
<description>Since we&apos;re now able to index real data that&apos;s already in the wayback machine, I set it up to link straight to it...and it works. As long as the page is in the wayback machine, the links will work. Feel...</description>
<dc:subject></dc:subject>
<dc:creator>defendio</dc:creator>
<dc:date>2004-10-15T12:09:08-08:00</dc:date>
</item>
<item rdf:about="http://cs.usfca.edu/~rstevens/archiveproject/archives/000498.html">
<title>Inwardlinks code</title>
<link>http://cs.usfca.edu/~rstevens/archiveproject/archives/000498.html</link>
<description>Well, I can&apos;t find the CD with the backup of my inward links code... crap. Anyway, it&apos;s not as bad as it sounds because I was probably going to rewrite the whole thing anyway. The code had a lot...</description>
<dc:subject></dc:subject>
<dc:creator>jendo</dc:creator>
<dc:date>2004-10-12T00:29:39-08:00</dc:date>
</item>
<item rdf:about="http://cs.usfca.edu/~rstevens/archiveproject/archives/000492.html">
<title>An e-mail regarding the chronica source</title>
<link>http://cs.usfca.edu/~rstevens/archiveproject/archives/000492.html</link>
<description>From an email I sent earlier today: Fellas, http://chronica.cs.usfca.edu/chronica/src/ that address will point you to the src that the web UI is currently running off of... it&apos;s essentially our &apos;stable&apos; build. There are some JSP pages in another directory that...</description>
<dc:subject></dc:subject>
<dc:creator>cfrasche</dc:creator>
<dc:date>2004-10-06T14:43:32-08:00</dc:date>
</item>
<item rdf:about="http://cs.usfca.edu/~rstevens/archiveproject/archives/000490.html">
<title>Some interesting stats on parsing...</title>
<link>http://cs.usfca.edu/~rstevens/archiveproject/archives/000490.html</link>
<description>I ran one of our larger arc files with 215 PDF and 1287 HTML-formatted docs... I timed each parse and calculated the average parse time for each format. PDF: 3.89 seconds; HTML: 0.019 seconds...</description>
<dc:subject></dc:subject>
<dc:creator>cfrasche</dc:creator>
<dc:date>2004-10-02T18:47:26-08:00</dc:date>
</item>
<item rdf:about="http://cs.usfca.edu/~rstevens/archiveproject/archives/000487.html">
<title>First Test of Large Index a Success!</title>
<link>http://cs.usfca.edu/~rstevens/archiveproject/archives/000487.html</link>
<description>So I finally had enough iterations through indexing the large set of arc files we&apos;ve gathered, and cleaning up bugs and errors in the code/files, that I was comfortable letting the index finish and trying a search on it....</description>
<dc:subject></dc:subject>
<dc:creator>cfrasche</dc:creator>
<dc:date>2004-10-01T01:42:37-08:00</dc:date>
</item>
<item rdf:about="http://cs.usfca.edu/~rstevens/archiveproject/archives/000486.html">
<title>Deliverable 1 9.30.04</title>
<link>http://cs.usfca.edu/~rstevens/archiveproject/archives/000486.html</link>
<description>Release 0.1 of Chronica is now active.......</description>
<dc:subject></dc:subject>
<dc:creator>rstevens</dc:creator>
<dc:date>2004-09-30T14:35:43-08:00</dc:date>
</item>
<item rdf:about="http://cs.usfca.edu/~rstevens/archiveproject/archives/000485.html">
<title>PDF parser up and running.</title>
<link>http://cs.usfca.edu/~rstevens/archiveproject/archives/000485.html</link>
<description>After taking the time to clean up the summaries and titles of the HTML parser output, I moved on to adding more functionality. PDFBox was the solution to PDF files. So the current index built off a small crawl from...</description>
<dc:subject></dc:subject>
<dc:creator>cfrasche</dc:creator>
<dc:date>2004-09-30T01:50:23-08:00</dc:date>
</item>
<item rdf:about="http://cs.usfca.edu/~rstevens/archiveproject/archives/000480.html">
<title>Fixed the text reader...</title>
<link>http://cs.usfca.edu/~rstevens/archiveproject/archives/000480.html</link>
<description>Yeah, so check your email, I fixed the weird character bug in the ARR.extractText().... My bad, I was collecting the read data wrong.... I think we are good to go for Thursday.......</description>
<dc:subject></dc:subject>
<dc:creator>rstevens</dc:creator>
<dc:date>2004-09-28T00:25:20-08:00</dc:date>
</item>


</rdf:RDF>