"Technology reveals the active relation of man to nature" - Karl Marx
(i have already made this in other chans, i need all the help i can get)

so i'm planning to create an archive for threads and websites: thread and web writings that are important enough, have quality, and/or can be used to counter western media and history narratives.

the archive i want to create for the threads is different from things like the internet archive, because i want to actually save every file that is uploaded. regular archives do not save every file, and many files uploaded in a thread end up missing from them.

if i can, i want to make a website for this, but i have no experience with creating websites or coding. i also have a special-ed mental condition that makes me unable to learn coding like normal people, so it's hard.

my main plan is to use httrack with every file format on the lists from wikipedia and other websites, then copy that list into the httrack file format selection thing

i want help from everyone here, so if you can, please send something

also adhd brained. if anyone can, please help me tidy the "list of file formats" list on wikipedia.

delete whatever description/writing is near each entry, so it becomes a plain list like the pic i posted.

add +*. to the front of each entry.
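
The tidy-up described above (drop the description, add +*. in front) can be sketched as a small shell function. This is only a sketch under assumptions: `to_filters` is a made-up name, it expects one entry per line with the extension as the first word, and lowercasing everything is my own choice for removing duplicates. It cannot tell a format name from a real extension, so the output still needs a read-through.

```shell
# to_filters: read a rough extension list on stdin, print one "+*.ext" rule per line.
# Assumes one entry per line, with the extension as the first word on the line.
to_filters() {
    tr -d '\r' |                      # drop Windows line endings
    awk '{ print $1 }' |              # keep the first word, discard the description
    sed 's/^\.//' |                   # remove a leading dot if there is one
    tr '[:upper:]' '[:lower:]' |      # assumption: treat PDF and pdf as the same
    grep -E '^[a-z0-9_+-]+$' |        # keep only things shaped like an extension
    LC_ALL=C sort -u |                # sort and remove duplicates
    sed 's/^/+*./'                    # add the +*. prefix
}
```

Usage would be something like `to_filters < wiki_list.txt > filters.txt`, where both file names are placeholders.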



so something like archived.moe but for /leftypol/?


i don't use moe, but yes, like that. and you can have all the files that get posted


i like my internet content ephemeral


note: the main plan was to save the webpages offline on my computer. making a website was the second


if anyone knows another chan/IB/forum that can help, please post


Have you tried lainchan? They're more tech-centered.


I used to know people who were obsessed with this imageboard archival shit years and years ago
Literally all of this unpaid effort just to archive the worst slop on the internet even against the wishes of the userbase creating that content
They even tried to acquire special magnetic tape drives that would last longer
It was hilariously strange in retrospect, although at the time I was just interested in their knowledge on sysadmin'ing



there are only two IBs like this, with content like this and quality like this. it should be archived


you can see what files are allowed from the codebase, take meds and go to work
archivebox is a thing btw, idk if it exactly suits your needs but it might be interesting to you
anon why tf do you not just keep some form of personal knowledge organization, and when you see something cool you copy it down? Also stuff you read from anons on a forum should be taken with a grain of salt - many posts could be stripped down to some interesting research avenues, which could be what you actually save/write down

Keeping a [bunch of text] whole post only makes sense if you really appreciate the writing as a quote, for its special value in conveying something elegantly for example. Saving a [dialogue] thread makes pretty much no sense ever, even though it might in some rare cases have entertainment or educational value… it's so bulky, and for what?


>quality like this
what are you talking about there are hundreds of low quality imageboards.


just fucking take screenshots of noteworthy things


i know, but i want more, and screenshots have limitations



i dont want my posts archived forever by some bot


i know


uyghur they're already being archived and harvested by the glowies that run this shitsite


thats nice dear


guys, i keep getting derailed by adhd and procrastination, how do i cope and finish this? i want to at least be done with the note file format thing


already tried the p3 thing in the gui httrack, it does not work


leftypol supported file types:
JPEG Files
BMP Files
GIF Files
PNG Files
MP3 Files
MP4 Files (Supports thumbnail)
WEBM Files (Supports thumbnail)
PDF Files (Supports thumbnail)
EPUB Files
DJVU Files (Supports thumbnail)
Text Files (Supports thumbnail)
ZIP Files
GZ Files
BZ2 Files

are these actually the only file formats that can be uploaded? or are there others that can be uploaded but are not on the list?




wd comrade


only one thing that can help you

a m p h e t a m i n e




there is stuff that does not have a dot (".") and also stuff that is capitalized. in the same place there is stuff that has dots and is not capitalized. can anyone here explain and help?


currently adding dots


some look like file extensions, some look like MIME names.


the wikipedia page i use (in case it gets updated, which makes it hard to make sure the list is perfect)


just in case:
checkpoint: clear text
lines of text (description lines included) = 1773
last line: Pseudo-pipelines, Pseudo-pipeline


almost done.
i'm starting to think i have ocd and adhd at the same time


if anyone is still not sure, you can use winmerge or some other tool to compare the wiki txt file with this one (v4)
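
For a command-line version of that comparison, here is a rough sketch that prints every line of a reference list that is missing from another list. `missing_from` is a made-up name, and it assumes both files hold one entry per line, written the same way (same dots, same capitalization, same line endings).

```shell
# missing_from REFERENCE MINE
# Print each line of REFERENCE that does not appear as a whole line in MINE.
missing_from() {
    # -F: plain strings, not patterns   -x: match whole lines only
    # -v: invert, so only lines NOT found in MINE are printed
    grep -Fxv -f "$2" "$1"
}
```

Usage would be something like `missing_from wiki.txt v4.txt`, with both file names as placeholders. No output means nothing from the reference list is missing.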


oh god


i want to make a thread about websites to archive. which board should i put it in?


copypaste edition.
there are some that i missed:
NIfTI, z10-z99, cursor [edit].

it's been hard to do simple things like this. i do not think having adhd is usually this hard. i may be the worst variant of the adhd person group. AND I CANNOT DO ANYTHING ABOUT IT!!!


also this is the final fix (i think). later maybe i will post the previous version with the fix. post if there is something i missed


i forgot that you actually have to put it all on one horizontal line for it to work in httrack. anyone know a program or script that can help? anyone wanna help?
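
One way to get the horizontal line is a short shell function that joins every line of a file into one. This is a sketch, assuming (as the post says) that httrack wants the rules on a single line, and assuming a space is an acceptable separator. `join_lines` is a made-up name.

```shell
# join_lines: read a vertical list on stdin, print it as one space-separated line.
join_lines() {
    tr -d '\r' |                    # drop Windows line endings
    grep -v '^[[:space:]]*$' |      # skip blank lines
    paste -sd ' ' -                 # -s joins all lines, -d ' ' puts a space between them
}
```

Usage would be something like `join_lines < filters.txt > filters_one_line.txt`, where both file names are placeholders.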


maybe just archiving IB threads using the faq supported file types is fine…..


from kate gang(???) leftychan.org




I may be an idiot, but… WHAT THE FUCK IS THIS THREAD ABOUT??

OP hasn't archived shit, it's just text files with filetype lists. He spent a month putting a dot in front of extensions he took from Wikipedia.
>from kate gang(???) leftychan.org
it's filetypes, one of which is .apk, I didn't know leftychan.org hosted APKs.


the filetype list txt files are for the archiving tool,
to copy paste into httrack.
that's why i want a list of every kind of file, taken straight out of wikipedia.

this thread is the creation of my autismretardationspecialeducationadhdbrain.
that's why the thread looks derailed.


but anon, I have autismretardationadhdbrain and that's why I hate things that are complicated. Now that I understand what you want to do, I can tell you you could have used a single wget line.

wget -mpckE --user-agent="" -e robots=off --wait 1 www.foo.com

Explanation: https://dheinemann.com/posts/2022-02-05-archiving-a-website-with-wget

Then with { } and the && operator (or a shell script) you can download all kinds of websites, e.g.
wget -mpckE --user-agent="" -e robots=off --wait 1 "$@"  # "$@" = every site you pass in
echo "done"

save as archive.sh, chmod +x it and then in terminal:
$ ./archive.sh www.leftypol.{org,net}

wget will automatically convert all pages to .html, make the links relative, and you will have an offline mirror of the website.
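
In case the short flags look cryptic, here is the same command spelled out with wget's long option names and wrapped in a function. `mirror_site` and the `WGET` override are names made up for this sketch. The options themselves are just the long forms of -m, -p, -c, -k and -E from the one-liner above.

```shell
# mirror_site SITE...
# Same options as "wget -mpckE", written out in full.
# WGET is a hypothetical override: run with WGET=echo to preview the command
# without downloading anything.
#   --mirror            recursive download with timestamping (-m)
#   --page-requisites   also fetch images, CSS etc. that pages need (-p)
#   --continue          resume partially downloaded files (-c)
#   --convert-links     rewrite links so they work offline (-k)
#   --adjust-extension  save HTML pages with an .html suffix (-E)
mirror_site() {
    "${WGET:-wget}" \
        --mirror \
        --page-requisites \
        --continue \
        --convert-links \
        --adjust-extension \
        --user-agent="" \
        -e robots=off \
        --wait 1 \
        "$@"
}
```

Running `WGET=echo mirror_site www.foo.com` prints the full command instead of starting a download, which is a cheap way to check it before a long mirror run.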


woah. i will use this


based wget guru
it blew my mind how good it was for downloading multiple issues of periodicals from libgen

