Ever feel like you’re missing the obvious?


I’ve had this lingering technology idea floating around in my head recently. I don’t think it’s at all inspired. I know someone has thought of this long before me. But I honestly can’t say I’ve ever seen it in function or practice. So anyone who wants to point out my ignorance of this technology and where it exists please feel free.

The Internet uses Meta tags to easily make information searchable. Well, I want to do this with files. I would like to assign categories or keywords to files to make them more searchable. I’m not necessarily looking to store the files in a database; a pointer to the file would work fine (storage is an option though). I’m looking to build a searchable library; my own kind of knowledge base.

I have no idea why I’m being vague. I had the idea of using something like this to help Pastor Bob keep track of all his study and sermons. He manuscripts all of them and they stay stored in a Word file (typically never to be used again after the Sunday he teaches it). I figured I could save him a lot of re-research if I gave him some way of categorizing and searching what he’s done before. Also, we collect all kinds of stories, devotionals, rants, etc. from the web. It would be great to database them and aid in his teaching prep.

The programmer in me knows I’m looking for something that deals with Metadata. That’s a fun little concept that I haven’t had to use since college. Man, I miss programming. I was always one of ‘those’ people that really enjoyed it. I was the only one.

There is no limit to how much metadata you might want to assign to a document. I guess that’s the real challenge of projects like this.

I kind of hinted to Bob I was thinking about this and his eyes widened and his face lit up. Now, I’m committed. I should have kept my mouth shut. :)

I’ve look up various content management systems, but all the ones I found were designed for web use. I’ve casually looked at some document management systems (which seem like a really logical name for it), however the few I found look way more complicated and advanced beyond what I really need or want. Plus, ‘document management system’ typically describes a stored, scanned version of a document. And I don’t want or need that much complexity.

It wouldn’t at all be hard to write a basic database to do this, but I know someone, somewhere has created a really great something that already does this. Again, I’m not fooled; I know this isn’t a revolutionary idea.

So please help the ignorant. What am I looking for?

Information and Links

Join the fray by commenting, tracking what others have to say, or linking to it from your blog.


Other Posts
life on the river
So I should pay more attention

Write a Comment

Take a moment to comment and tell us what you think. Some basic HTML is allowed for formatting.

Reader Comments

It’s been along time since I have used it so I don’t know if it still has it or not but the full version of Adobe Acrobat use to have a catalog feature built in. When you added new files to a folder, you could run the catalog program and it would index the new files and then you could search for keywords.

dj

just use the google desktop search :-)

I’m using Ubuntu and have a thing called Beagle that indexes all my documents. It’s probably similar to the Google desktop search that Jason mentioned. The Mac users have Spotlight which is similar. You almost don’t need meta-data for such purposes since the entire document is indexed.

However, in OpenOffice (and presumably MS Office) you can assign “properties” to a document. In OOo, “properties” is in the file menu. In the description tab you can add title, subject, keywords and comments. Beagle successfully finds documents while searching for items placed in this meta-data area.

I’m also using Alfresco for some of this, but beagle seems to be good enough for simple stuff like this.