I work with AI tools often and I find myself constantly swapping between file types to navigate different sites and use cases. I built anydoc — I imagined a universal document format drive layer that would allow me to essentially have a file that is many different file types at once. CSV for organizing, markdown for feeding into LLMs, HTML for fast sharing... so here it is and I am working on making it better all of the time. Just shipped an update that makes the most relevant format types available faster on a document upload @boshjerns on X
RileyJames 8 hours ago [-]
It’s an interesting idea.
I’ve definitely felt the pain of file formats in some unexpected ways recently.
Like airdropping a photo from iPhone only to discover a .HEIC file, which nothing will accept.
I’ve previously used “what ever turns up first on google”, but I now won’t for anything of significance (privacy)
I’ve recently discovered Automator (on Mac) and the quick actions menu. Which can achieve a lot of image and pdf related conversions, but takes some setup (not a mass market solution)
I like the idea of this product. But I think the challenge will be:
- reaching the user at the moment they have this problem
- making your solution frictionless to solve their immediate problem, while also bootstrapping to solve it next time around (without them forgetting it exists)
If you can nail that experience for a single use case, I think this will be a winner.
DontchaKnowit 6 hours ago [-]
I think the real problem is getting it to actually work....
unsnap_biceps 8 hours ago [-]
Have you considered writing this as a FUSE system rather than a web service?
jeroenhd 5 hours ago [-]
You might not want to use https://anydocai.com/result/<incremental number> for URLs like that. Anyone can enumerate the ~300 files from the home page and look at what others have uploaded.
That said, the website doesn't seem to work anymore. It just errors out.
pavel_lishin 4 hours ago [-]
I wonder if they ran out of credits.
gus_massa 4 days ago [-]
I think calling it a "drive" will confuse a lot of people. I expected a device driver, and not technical people have no idea of drives, they just have documents and photos.
I tried with a text file
X Y
1 2
3 4
and for some reason the convert version has 1 2 3 4 in the same row.
gus_massa 2 days ago [-]
Too late to edit: Google drive is call "drive", so I guess it's more usual than I noticed.
alentred 8 hours ago [-]
Hm, I don't know, I am OK with "drive". Google Drive, Microsoft OneDrive, iCloud Drive.
Y_Y 6 hours ago [-]
I thought this was something like a FUSE driver that would on-the-fly generate any file you tried to read, with some consistency. Like if you open stories/zombie-party.txt it will have some generative network make it, and cache it. If you later ask for stories/zombie-party.odt it can just do a conversion.
I vibe-coded a demo of such a thing, with the idea of making game assets like textures/outdoor/wall.jpg etc. You can do it easily enough, but you need to be patient, and not particularly discerning.
raphman 5 hours ago [-]
FWIW, I wrote a small paper on this general topic a few years ago, collecting earlier work and own ideas.
"Files as Directories: Some Thoughts on Accessing Structured Data within Files"
Thanks! Sorry - I didn't realize that the paper in the ACM DL is not open-access.
quesera 5 hours ago [-]
I played with this idea for media servers.
I want iTunes and Audiobookshelf and beets and Jellyfin, etc to all work on the same filesystem and media archive.
There are challenges.
TomMasz 6 hours ago [-]
I got "No video with supported format and MIME type found." in the How It Works section.
troyvit 5 hours ago [-]
I think you can get past that if you download the video, then upload it back up to anydoc and ask it to translate it to Markdown.
edit: /s
sigmaisaletter 7 hours ago [-]
It's a fancy file format conversion utility.
Am I missing something?
dsr_ 5 hours ago [-]
Yes: it's a fancy file format conversion service that adds errors so your QA people have more work.
voidUpdate 7 hours ago [-]
So I can have my exe convert into a shapefile and an mp3?
voidUpdate 7 hours ago [-]
Well I tried to convert an exe into a pptx and it outputted a file that looked like an attempt at html, saying that the conversion wasn't feasible due to its nature and size
lloeki 6 hours ago [-]
You have it the wrong way around? Usually you are handed a pptx by customers and your job is to turn that into an exe.
voidUpdate 6 hours ago [-]
I mean the website cays it can convert anything to anything ("every file exists as all file types all of the time") so it should be able to do exe to pptx
thebeardisred 6 hours ago [-]
You missed the joke
jeroenhd 5 hours ago [-]
Based on important research like https://www.youtube.com/watch?v=uNjxe8ShM-8, it should definitely be possible to generate a .pptx that will run Windows inside of an emulator inside PowerPoint slides. That HTML file is lying to you!
ramoz 7 hours ago [-]
AI as a use case doesn’t make sense to me.
You’re using AI to create a transpilation of whatever modality. It’s a wasted step if the purpose is to feed back into AI.
ramses0 6 hours ago [-]
Back in the day there were a bunch of `x2y` programs[1], like html2pdf, xls2csv, rst2odt, jpg2png, png2jpg, etc...
You could imagine something like `any2zip`, or `any2tgz` or `iso2mp4` or something.
It seems like there could/should be some sort of virtual filesystem where you could say "cat inventory.xls.csv", or "wine.exe excel.exe inventory.csv.xls" (please bear with me on these examples). Effectively "$BLOB.format.format", where "." becomes a sort of "convert to this $TYPE".
...if you requested `README.md.pdf`, maybe it could intuit the intermediate `md2html2pdf` (HTML) portion?
I really wish local linux filesystems (for end-users) would at least match Apple's capabilities. eg: `$RECENT`, spotlight, auto-OCR. We've really regressed since the era of `locate`, but I'd _LOVE_ some sort of modern equivalent.
Imagine: `inotify`, `auditd`, just anything that can avoid full-disk scans during "normal end user" daily operation... wired up to `llm-summary $FILE >> sqlite.db ; `llava-describe $IMAGE >> sqlite.db ; etc...`
For bonus points, catch anything missed with some sort of full daily/weekly backup operation. We're on the cusp of a much more intimate "partnership" with the compute boxes underneath our desks, but so much is getting sucked into the void of "the network is the computer".
> Back in the day there were a bunch of `x2y` programs[1], like html2pdf, xls2csv, rst2odt, jpg2png, png2jpg, etc...
They're still around. A problem is loss of information on each conversion. For example, wav->mp3 loses info. Converting back (mp3->wav) won't get you the exact .wav you started with. Similar thing with file types supporting different resolution graphics, vector vs. bitmap, metadata being stripped, features in format A not supported in format B, etc.
Another problem is the explosion of M:N file format combinations. A possible fix would be a universal (?) in-between format, functioning as a container for [portions of a file] + whatever metadata was extracted from original. That way you can at least do conversions along the lines of video container formats, where container type is changed but video inside does not get decompressed/re-encoded. Or simular operations like extracting/shuffling pages in a pdf document.
All in all this is not an easy problem & therefore unlikely to be solved anytime soon.
globular-toast 8 hours ago [-]
I like the idea if it was deterministic. So if there are standard ways to convert to/from document types, like Pandoc, being able to write to any one of them and have it update the rest would be interesting.
I hate it if it's built with "AI". Can't imagine a use case for this apart from just shit you don't care about. Why would I be hoarding data I don't care about?
nkrisc 8 hours ago [-]
So it turns one file into many? Or is it actually one file that is simultaneously a valid HTML document and PNG?
IAmBroom 8 hours ago [-]
According to what I read, the latter.
c0wb0yc0d3r 7 hours ago [-]
And you’d be mislead. The video shows the original file is converted to different formats, depending on the user’s selection. The video shows jpeg to html (using AI to perform OCR?).
Pandoc but extra AI steps.
lawlessone 5 hours ago [-]
Wouldn't this make every file a lot bigger?
SPBS 7 hours ago [-]
the page is really laggy on Edge, kills any interest in wanting to explore more (strangely, it's much snappier on Chrome)
emadda 7 hours ago [-]
Could have called it quantumdoc
_wire_ 5 hours ago [-]
Imagine no file types
♪ It's easy if you try
No hell below us
Above us only sky
Imagine all the people
Visualize whirled peas
Ah ah ah oooo!
You may say I'm a dreamer...
jy14898 6 hours ago [-]
Now make it an HTTP API where content negotiation always succeeds
dankobgd 8 hours ago [-]
[flagged]
rpgraham84 7 hours ago [-]
[flagged]
dgan 7 hours ago [-]
i ve read the title 5 times, and can't make sense of it. Is this even valid English ?
Akronymus 6 hours ago [-]
>Imagine a drive[,] where every file exists[,] as all file types[,] all of the time
Basically treating one file type as if it were any arbitrary other file type
quesera 5 hours ago [-]
Punctuated like that, I can't help reading it in the movie trailer guy[0] voice.
I’ve definitely felt the pain of file formats in some unexpected ways recently.
Like airdropping a photo from iPhone only to discover a .HEIC file, which nothing will accept.
I’ve previously used “what ever turns up first on google”, but I now won’t for anything of significance (privacy)
I’ve recently discovered Automator (on Mac) and the quick actions menu. Which can achieve a lot of image and pdf related conversions, but takes some setup (not a mass market solution)
I like the idea of this product. But I think the challenge will be: - reaching the user at the moment they have this problem
- making your solution frictionless to solve their immediate problem, while also bootstrapping to solve it next time around (without them forgetting it exists)
If you can nail that experience for a single use case, I think this will be a winner.
That said, the website doesn't seem to work anymore. It just errors out.
I tried with a text file
and for some reason the convert version has 1 2 3 4 in the same row.I vibe-coded a demo of such a thing, with the idea of making game assets like textures/outdoor/wall.jpg etc. You can do it easily enough, but you need to be patient, and not particularly discerning.
"Files as Directories: Some Thoughts on Accessing Structured Data within Files"
https://dl.acm.org/doi/pdf/10.1145/3191697.3214323
Is this paper freely available somewhere?
Link found here: https://scholar.google.com/scholar?cluster=14832107127874645...
I want iTunes and Audiobookshelf and beets and Jellyfin, etc to all work on the same filesystem and media archive.
There are challenges.
edit: /s
Am I missing something?
You’re using AI to create a transpilation of whatever modality. It’s a wasted step if the purpose is to feed back into AI.
You could imagine something like `any2zip`, or `any2tgz` or `iso2mp4` or something.
It seems like there could/should be some sort of virtual filesystem where you could say "cat inventory.xls.csv", or "wine.exe excel.exe inventory.csv.xls" (please bear with me on these examples). Effectively "$BLOB.format.format", where "." becomes a sort of "convert to this $TYPE".
Imagine being able to say:
...if you requested `README.md.pdf`, maybe it could intuit the intermediate `md2html2pdf` (HTML) portion?I really wish local linux filesystems (for end-users) would at least match Apple's capabilities. eg: `$RECENT`, spotlight, auto-OCR. We've really regressed since the era of `locate`, but I'd _LOVE_ some sort of modern equivalent.
Imagine: `inotify`, `auditd`, just anything that can avoid full-disk scans during "normal end user" daily operation... wired up to `llm-summary $FILE >> sqlite.db ; `llava-describe $IMAGE >> sqlite.db ; etc...`
For bonus points, catch anything missed with some sort of full daily/weekly backup operation. We're on the cusp of a much more intimate "partnership" with the compute boxes underneath our desks, but so much is getting sucked into the void of "the network is the computer".
[1]: compgen -c | grep 2 | grep -v '2$' | grep -v '\.2' | grep -v '2\.'
[2]: https://en.wikipedia.org/wiki/Locate_(Unix)
They're still around. A problem is loss of information on each conversion. For example, wav->mp3 loses info. Converting back (mp3->wav) won't get you the exact .wav you started with. Similar thing with file types supporting different resolution graphics, vector vs. bitmap, metadata being stripped, features in format A not supported in format B, etc.
Another problem is the explosion of M:N file format combinations. A possible fix would be a universal (?) in-between format, functioning as a container for [portions of a file] + whatever metadata was extracted from original. That way you can at least do conversions along the lines of video container formats, where container type is changed but video inside does not get decompressed/re-encoded. Or simular operations like extracting/shuffling pages in a pdf document.
All in all this is not an easy problem & therefore unlikely to be solved anytime soon.
I hate it if it's built with "AI". Can't imagine a use case for this apart from just shit you don't care about. Why would I be hoarding data I don't care about?
Pandoc but extra AI steps.
♪ It's easy if you try No hell below us Above us only sky Imagine all the people Visualize whirled peas Ah ah ah oooo!
You may say I'm a dreamer...
Basically treating one file type as if it were any arbitrary other file type
[0] https://en.wikipedia.org/wiki/Don_LaFontaine .. wow, dead for 17 years!