How can I use Excel's Power Query From Web option to extract news headlines from various sites, and keep the data in real-time?
I am familiar with how this tool can extract data from tables on websites like Wikipedia, but I've run into a few issues. Here is what I need to happen, if possible with Excel, or any other software:

That was accomplished with a simple Copy & Paste to Match Destination Format, but that won't keep it dynamic. When I use the query editor, I immediately notice that it won't work the way I'd like it to unless the tables are actual HTML tables. It is possible to drill down until I find text, but I have no viable method to understand where I'm going.

There is no option to select regions containing the data I'd like to have in a table unless they fit the aforementioned format. Also, some pages that would normally allow it have an IE compatibility issue that I'm not sure how to circumvent. If I could use Chrome or Edge instead, that would be helpful.
microsoft-excel html power-query dynamic
asked yesterday by Starnes Student, edited yesterday by Alex M
1 Answer
This tool doesn't work well on most modern commercial sites, because their pages are incredibly complex and dynamic. For example, they will try to detect your location and browser and serve you varying content. Most other tools also struggle on those pages.
You will get better results by extracting from RSS feeds rather than trying to parse the entire home page. RSS is a stable, machine-readable standard that news sites use to publish stories/items. For example, Reuters has many feeds available from:
https://www.reuters.com/tools/rss
Use the URL for the one you want and Power Query will quickly give you a nested document. Click on the Table cells to drill down, e.g. to content and then to item.
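Since the question also allows other software, here is a minimal Python sketch of the same drill-down idea, using only the standard library. It is not part of the Power Query workflow above, just an illustration of why an RSS feed is easier to parse than a home page. The feed text, the `SAMPLE_RSS` name, and the `extract_headlines` helper are all hypothetical stand-ins; a real feed would be downloaded from whichever RSS URL you pick, and it may carry extra or differently named fields.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample feed, standing in for the XML a real RSS URL would return.
# In RSS 2.0 the stories are <item> elements nested under <channel>.
SAMPLE_RSS = """<rss version="2.0">
  <channel>
    <title>Example News</title>
    <item>
      <title>First headline</title>
      <link>https://example.com/first</link>
      <pubDate>Mon, 08 Apr 2019 10:00:00 GMT</pubDate>
    </item>
    <item>
      <title>Second headline</title>
      <link>https://example.com/second</link>
      <pubDate>Mon, 08 Apr 2019 11:30:00 GMT</pubDate>
    </item>
  </channel>
</rss>
"""


def extract_headlines(rss_text):
    """Return one dict per <item>, with its title, link and publication date."""
    root = ET.fromstring(rss_text)
    rows = []
    # Walk every <item> in the document, wherever it is nested.
    for item in root.iter("item"):
        rows.append({
            "title": item.findtext("title", default=""),
            "link": item.findtext("link", default=""),
            "published": item.findtext("pubDate", default=""),
        })
    return rows


if __name__ == "__main__":
    for row in extract_headlines(SAMPLE_RSS):
        print(row["published"], "|", row["title"], "|", row["link"])
```

Each story becomes one row with the same fields, which is roughly the flat table you end up with after drilling down to the items in Power Query.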
answered 14 hours ago by Mike Honey