How to Download New Articles From a List of Websites? Recommendations? - web

I've ran into several dead ends trying to come up with a result from a Google search. Essentially, I have a list of say 20 websites, all research institutes that occasionally update their websites/blogs with their latest findings.
I'm trying to either A - find a software that can check for new articles, then send me the title and link to the article, or B - write a script that check for new articles, then send me the title and link.
Any suggestions or software recommendations?

You should first see if any of the sites have an RSS feed. That is fairly common, and will do the work for you.
I've built similar things. If the articles have a published date, you could keep a file or database of with the new

The easiest way to save a web page is to download it to your computer. In Chrome, open the three-dot menu and select More Tools > Save page as. For Firefox, open the hamburger menu and choose Save Page As. On Safari, go to File > Save as or File > Export as PDF, and in Microsoft Edge, open the three-dot menu and choose More tools > Save page as.

Related

Links i harmon.ie offers a download, not the actual SharePoint document

When I attach a link in an email using harmon.ie, I get an url that includes the file name. However when the recipient clicks it, it starts to download a copy instead of opening the actual SharePoint document.
The link is typically on this format:
[company].sharepoint.com/sites/[subsite]/[library]/[filename.docx]
The same link retrieved directly from SharePoint would look something like this:
[company].sharepoint.com/:w:/s/sites/[subsite]/EZ31C3wRKuVPnOOB7vNGMsMBiI9eYAjGTuiaEODu_c3wpw
That link will typically open the document in SharePoint as expected.
It did not use to be like this. The links from harmon.ie used to open documents instead of downloading copies. I am afraid I cannot say when we discovered this, but I believe it was a couple of months ago. We are using the free plan, and are not qualified for support from harmon.ie. We were pointed in the direction of this forum, and I would be most grateful if someone could help us.
Thanks
Eigil
Harmon.ie creates plain valid links to Sharepoint document and they are opened by default browser (like any link you will add inside an email)
The behavior is only controlled by the browser not by harmon.ie (you can check it out by adding the url in the browser)
The only solution to control the behavior is to use the feature to generate OWA link so that it opens in Office online.
Indeed, we have feature allowing to create links that will open in OWA (hence in the browser)
Please read carefully register key: GenerateOWALinks as detailed at https://harmon.ie/provision-harmonie-outlook-all-users
Note also that if the recipient of the email has harmon.ie installed, he can right click on the link, select Open in Harmon.ie. The document will appear in harmon.ie sidebar (at the Sharepoint location), double clicking on it with open the document in MS Word.
Thanks,
----- Jean

The new extension is not visible in Store

I noticed that from December 1 published extensions are not visible on the page https://chrome.google.com/webstore/ and google after such indexing is visible? Could it be some sort of algorithm update?
Or is it a temporary error?
You may want to try Updating a web app's Chrome Web Store listing with these steps:
Open the web app script in the script editor.
Click Publish > Register in Chrome Web Store. Click Edit store item. This will bring up the Edit Item form you filled out during publication.
Make the changes you want to the description, graphic assets or other settings.
Click Publish changes.
Visit the Chrome Web Store developer dashboard to see the publication status of your web app. Note that the publication process can take up to an hour to finish. Once the "Status" column says "Published", your web app will be available in the store.
In addition to that, you may also check the possible reasons why your item may not be showing in search.
If any of the given links doesn't help, I suggest that you submit your issue.

Sharepoint - Download editable file from url in chrome

We have been stuck on using Internet Explorer in my company for one single reason: when you click on a sharepoint link to a Document in IE, it opens the file as editable. So when you save it, it is automatically saved on the server.
On the other hand, with Chrome or other browsers, it downloads a copy of this file, so the user might loose changes because he thinks the file is saved on the server whereas really it is not...
Anyone thinks of a solution for that?
Thanks !
There is a similar resolved question to yours right here:
How to open SharePoint files in Chrome/Firefox
give it a shot!

Editing and emailing a page from a website

Is it possible to download a page from a website, edit parts of that webpage, save it and then email it to someone so they can see the changes?
I have been using "save as" "web page, complete" to download the page. I can edit it locally but whenever I email this file either the stylesheet or the images do not appear correctly when they open it!
Is there a better way??
It's quite easy to edit a web page using tools like edit.lize.it, but the best way of saving the result is to take a screenshot, like this.
Link: edit.lize.it

Can't open PDF files in SharePoint 2010 with Internet Explorer

So we couldn't open .pdf in the browser in our SP2010 site. I set the setting to permissive browser file handling in central admin. I then found out that there's a bug that if a site is created from a custom template the pdf files uploaded to that site will still prompt for either Save or Cancel. I ran a hotfix on the server
http://support.microsoft.com/kb/2459108
Consider the following scenario:
You set Browser File Handling to Permissive for a web application in the General settings page in SharePoint 2010 Central Administration.
You create a document library, and then upload an html document.
You open the html document in the browser.
Note You are not prompted to download the html document and it is rendered in the browser.
You select to include the content when you save the SharePoint site as a template.
You use the template to create a new SharePoint site in the same web application.
In this scenario, the Browser File Handling list setting for the document library in the new site is set to Strict. Additionally, when you open the html document, you are prompted to download the file.
Now when I click on a pdf with firefox I can open it directly but with internet explorer (8 and 9, default settings) I still can't do it, what's the solution here?
Edit: Maybe it always worked in firefox, anyway, when I create a new library it works as expected. How can I run this setting on all libraries?
There's a different, more subtle, but simpler root cause of this problem.
After much web searching and many hours with MSFT support, as hard as this may be to believe, it turns out that the root cause of my "SharePoint won't open PDF documents" problem was actually an Adobe extension/add-on. The symptom was an Adobe error msg "failed to open" after clicking the PDF list item in a document library. The culprit, an Adobe extension/add-on: "Adobe Acrobat SharePoint OpenDocuments Component".
I do not know how this got installed. What I do (finally) know is that this component actually does the exact opposite of what its name implies, i.e., it apparently prevents PDF documents from opening up when clicked in a SharePoint 2010 document library.
After various failed attempts to solve this problem (including changing "Browser File Handler" settings on the web app server from "Strict" to "Permissive" and other fixes suggested below and elsewhere on various blogs and web sites), nothing fixed the problem until we disabled this Adobe extension/add-on. Then, problem solved.
Note that you may not see this component in the "Tools > Manage Add-Ons" list until after attempting to open a PDF document from the library: apparently the add-on isn't activated (won't appear in that list) until an 'open' attempt is made. SO - if at first you don't see the component listed, try to open a PDF file and check the list again. If this component appears, disable it, and your problem is likely to go away.
Baffling, at best; or worse, actually nefarious on Adobe's part ...?
I'd still like to know how to get the PDF to open in a separate browser tab in IE vs. displacing the active tab. If anyone can help with that, please let me know! No custom coding solutions, PLEASE!
There is a better way to handle "Browser File Handle" issue. Take a look at my blog here: http://www.pdfsharepoint.com/sharepoint-2010-and-pdf-integration-series-part-1/
Solution #2 addresses Pdf extension without exposing entire Web Application to "Permissive" browsing. Setting "Browsing File Handle" to "permissive" opens too many vulnerabilities with other file extensions.
Thanks,
Dmitry
I have the same problem - originally installed Office Web apps, then turned that off, turned on the open in client application, then changed the setting on each doc library to open in browser .. Still have a problem with PDFs though.
If somebody includes a link to them in an announcement, then that person can open, other not. But only in IE - in FF there is no problme
Just change the Browser File Handling for the Web Application from the central admin as:
Central Administration > Application Management > Manage Web Applications
go to your Web Application example "http://sharepoint:80, just select it
from the top ribbon click "General Settings"
go down to "Browser File Handling" and change it to "Permissive"
If am not clear go to http://www.pdfsharepoint.com/sharepoint-2010-and-pdf-integration-series-part-1/
try this:
Make sure you're the site collection admin. Go into the site (not the central admin) and then go to site settings then go to site collection features. In there you will find the setting for " Open Documents in Client Applications by Default " it will probably be deactivated. Active it and you're good to go. users will then open attachments in their windows assigned applications, not the sharepoint web apps.
Also, try going into adobe reader and in the settings there is an option to open with the browser. check or uncheck it based on what you want it to do.
Encryption and SharePoint don't play well together
Right click My Documents or source folder
Select Properties > Advanced (button)
Uncheck "Encrypt contents to secure data"
This should solve many SharePoint problems you might have, including files not opening properly.
Appreciate this is an old post but still very relevant today. I spent a while trying to get this to work - just thought I'd share my findings.
This is specific to Adobe Acrobat. If you use a different PDF viewer, such as SumatraPDF the issue does not occur.
1. To prevent the 'Open, Save, Save As' dialog box in Internet Explorer:
This is specific to the versions of Acrobat. Set the following key/value:
Key: HKLM\SOFTWARE\Policies\Adobe\Acrobat Reader\*acrobat_version_number*\FeatureLockDown\cSharePoint
Value Name: bDisableSharePointFeatures
Value Type: REG_DWORD
Value: 0x1 (hex)
e.g.
For Acrobat X:
HKLM\SOFTWARE\Policies\Adobe\Acrobat Reader\11.0\FeatureLockDown\cSharePoint
2. To disable PDFs opening in the browser
This is specific to the versions of Acrobat. Set the following key/value:
Key: HKCU\Software\Adobe\Acrobat Reader\acrobat_version_number\Originals
Value Name: bBrowserIntegration
Value Type: REG_DWORD
Value: 0x0 (hex)
e.g.
For Acrobat X:
HKCU\Software\Adobe\Acrobat Reader\11.0\Originals
Thanks,
References:
Adobe Acrobat - Lockable Settings
Adobe Acrobat - General Application Settings

Resources