
Hello /adv/, I require assistance. I need to download this entire

Thread replies: 15
Thread images: 2

File: estudios.png (102KB, 1903x951px)
Hello /adv/, I require assistance.

I need to download this entire site. I was offered 2 months of flash facts for the USMLE First Aid Step 1.

There are around 10,505 facts that summarize the entire book, and they are very useful for studying.

How can I download the entire site without the painful process of taking screenshots of everything?

In return, I'll upload the flashcards somewhere if anyone is interested.
>>
Use your browser's inspect tool to see if each of those boxes has an id attribute (id="something"). If they do, you can write a script in your language of choice to pull the source code of each page and cut out only the content in those elements.

This might be way over your head.
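
For anyone curious what such a script might look like, here is a rough Python sketch using only the standard library. It assumes each flashcard box has an id starting with a common prefix; the prefix "card-" and the names `IdTextExtractor`, `extract_by_id`, and `fetch` are made up for illustration. A site behind a login would also need your session cookies, which this sketch does not handle.

```python
import urllib.request
from html.parser import HTMLParser

# Tags that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class IdTextExtractor(HTMLParser):
    """Collect the text of every element whose id starts with a prefix."""

    def __init__(self, id_prefix):
        super().__init__()
        self.id_prefix = id_prefix
        self.results = {}      # element id -> collected text
        self._current = None   # id of the element being captured, if any
        self._depth = 0        # nesting depth inside the captured element

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self._current is not None:
            self._depth += 1
            return
        element_id = dict(attrs).get("id") or ""
        if element_id.startswith(self.id_prefix):
            self._current = element_id
            self._depth = 1
            self.results[element_id] = ""

    def handle_endtag(self, tag):
        if self._current is None or tag in VOID_TAGS:
            return
        self._depth -= 1
        if self._depth == 0:
            # Collapse runs of whitespace before storing the final text.
            text = self.results[self._current]
            self.results[self._current] = " ".join(text.split())
            self._current = None

    def handle_data(self, data):
        if self._current is not None:
            self.results[self._current] += data


def extract_by_id(html, id_prefix):
    """Return {element id: text} for elements whose id starts with id_prefix."""
    parser = IdTextExtractor(id_prefix)
    parser.feed(html)
    parser.close()
    return parser.results


def fetch(url):
    """Download one page and return its source as text (no login handling)."""
    with urllib.request.urlopen(url) as response:
        return response.read().decode("utf-8", errors="replace")
```

You would call `extract_by_id(fetch(page_url), "card-")` once per page and save the resulting dictionary, swapping in whatever id prefix the inspect tool actually shows.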
>>
File: 1456079079333.png (23KB, 750x750px)
>>16939883
I'm only a med student, anon. I suck at this stuff.
>>
>>16939883
>>
>>16939855
>>
>>16939883
or just use wget, princess. don't hurt your fingers "programming".
>>
>>16940274
pls teach me?
>>
>>16940854
>>
>>16939855
You can use wget as some anon said, although that might be awkward for flashcards depending on site layout.
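
As a rough idea of what the wget route involves, the sketch below just assembles a recursive mirror command from Python. The URL, the cookies file name, and the helper names `build_wget_command` and `run_mirror` are placeholders, and it assumes wget is installed. If the site loads its cards with JavaScript, wget will only save the empty page shell, which is the kind of awkwardness meant above.

```python
import shlex
import subprocess


def build_wget_command(start_url, cookies_file=None, wait_seconds=1):
    """Return the argv list for a polite recursive wget mirror of start_url."""
    command = [
        "wget",
        "--mirror",            # recursive download with timestamping
        "--convert-links",     # rewrite links so the copy browses offline
        "--page-requisites",   # also fetch images, CSS and scripts
        "--adjust-extension",  # save HTML pages with an .html suffix
        "--no-parent",         # never climb above the starting directory
        f"--wait={wait_seconds}",  # pause between requests
    ]
    if cookies_file:
        # Reuse a logged-in session exported in Netscape cookies.txt format.
        command.append(f"--load-cookies={cookies_file}")
    command.append(start_url)
    return command


def run_mirror(command):
    """Actually start the download; raises if wget exits with an error."""
    subprocess.run(command, check=True)


if __name__ == "__main__":
    # Placeholder URL and cookies file; only prints the command, runs nothing.
    cmd = build_wget_command("https://flashcards.example/deck/",
                             cookies_file="cookies.txt")
    print(shlex.join(cmd))
```

Printing the command first lets you check it before calling `run_mirror(cmd)`.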

Using a website crawler is another option. I've had good success with Scrapy before, dumping whole sites' public user info to SQL, so I imagine it'd be ideal for flash cards. But it requires you to know the site layout and be at least semi-proficient in Python.
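
To give an idea of the "dump to SQL" half, here is a small sketch that uses Python's built-in sqlite3 module as a stand-in, not Scrapy itself. The table layout (card_id, front, back) and the function names are assumptions; a real crawler would feed it whatever fields the site's cards actually have.

```python
import sqlite3


def open_card_db(path=":memory:"):
    """Open (or create) the database and make sure the cards table exists."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS cards (
               card_id TEXT PRIMARY KEY,
               front   TEXT NOT NULL,
               back    TEXT NOT NULL
           )"""
    )
    return conn


def save_cards(conn, cards):
    """Insert or update (card_id, front, back) tuples; return the row count."""
    with conn:  # commits on success, rolls back on error
        conn.executemany(
            "INSERT OR REPLACE INTO cards (card_id, front, back) "
            "VALUES (?, ?, ?)",
            cards,
        )
    return conn.execute("SELECT COUNT(*) FROM cards").fetchone()[0]
```

Keying on the card id means the crawl can be rerun after an interruption without creating duplicate rows.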

The website might also have anti-scraping protection since it's their whole business model, but in my experience most porn sites don't, so I can't see why a flash card website would.

Basically, since you have no real computer experience (have you used GNU/Linux before?), you're not going to get this done alone. My little sister and brother are in premed right now, so this might be of some interest to them, and I think I'll have a little free time 2 weeks from now. Mail me at [email protected] and I'll keep your contact information somewhere in case I'm bored, but no promises.
>>
>>16939855
Easy mode: httrack
>>
>>16939855
bump
>>
>>16941310
bump
>>
>>16942686
bump
>>
>>16942781
>>
>>16942781


This is a 4chan archive - all of the content originated from that site.
This means that RandomArchive shows their content, archived.