Anyone know of an API (php preferable but I\'d be interested in any language) for creating wiki-like data storage?
How about any resources on rolling your own plaintext
It sounds like you are essentially just looking for version control. If that is the case, you may want to look into a diff algorithm.
Here is the Wikipedia Diff page.
I did a quick php diff google search, but nothing really stood out as a decent example, since I only have basic PHP knowledge.
So, basically this is a "how do I version text information in my DB".
Well, the simplest way is simply copying the data.
Simply, create a "version" table that holds "old versions" of the data, and link it back to your main table.
create table docs {
id integer primary key not null,
version integer not null,
create_date date,
change_date date,
create_user_id integer not null references users(id),
change_user_id integer references users(id),
text_data text
}
create table versions {
id integer primary key not null,
doc_id integer not null references docs(id),
version integer,
change_date date,
change_user integer not null references users(id),
text_data text
}
Whenever you update your original document, you copy the old text value in to this table, copy the user and change date and bump the version.
select version, change_date, change_user, text_data
into l_version, l_change_data, l_change_user, l_text_data
from docs where id = l_doc_id;
insert into versions values (newid, l_doc_id, l_version,
l_change_date, l_change_user, l_text_data);
update docs set version = version + 1, change_date = now,
change_user = cur_user, text_data = l_new_text where id = l_doc_id;
You could even do this in a trigger if your DB supports those.
Faults with this method are that its a full copy of the data (so if you have a large document, the version stay large). You can mitigate that by using something like diff(1) and patch(1).
For example:
diff version2.txt version1.txt > difffile
Then you can store that difffile as "version 1".
In order to recover version 1 from version 2, you grab the version 2 data, run patch on it using the diff file data, and that gives you v1.
If you want to go from v3 to v1, you need to do this twice (once to get v2, and then again to get v1).
This lowers your storage burden, but increases your processing (obviously), so you'll have to judge how you want to do this.
Will's huge answer is right on, but can be summed up, I think: you need to store the versions, and then you need to store the metadata (who what when of the data).
But your question was about resources on Wiki-like versioning. I have none (well, one: Will's answer above). However, about the storage of Wikis, I have one. Check out the comparison matrix from DokuWiki. I know. You're thinking "what do I care what brand of DB different Wikis use?" Because DokuWiki uses plain text files. You can open them and they are indeed plain. So that's one approach, and they've got some interesting arguments as to why DBMS are not the best way to go. They don't even hold much metadata: most of the stuff is done through the flat files themselves.
The point of the DokuWiki for you is that maybe it's a relatively simple problem (depending on how well you want to solve it :)
Here's a list of all 12 wikis on WikiMatrix that are written in PHP and do their storage using text files. Perhaps one of them will have a storage method you can adapt into the database:
http://www.wikimatrix.org/search.php?sid=1760